How large a training set is needed to develop a classifier for microarray data?
Dobbin, Kevin K; Zhao, Yingdong; Simon, Richard M
2008-01-01
A common goal of gene expression microarray studies is the development of a classifier that can be used to divide patients into groups with different prognoses, or with different expected responses to a therapy. These types of classifiers are developed on a training set, which is the set of samples used to train a classifier. The question of how many samples are needed in the training set to produce a good classifier from high-dimensional microarray data is challenging. We present a model-based approach to determining the sample size required to adequately train a classifier. It is shown that sample size can be determined from three quantities: standardized fold change, class prevalence, and number of genes or features on the arrays. Numerous examples and important experimental design issues are discussed. The method is adapted to address ex post facto determination of whether the size of a training set used to develop a classifier was adequate. An interactive web site for performing the sample size calculations is provided. We showed that sample size calculations for classifier development from high-dimensional microarray data are feasible, discussed numerous important considerations, and presented examples.
Cook, Michael A; Chan, Chi-Kin; Jorgensen, Paul; Ketela, Troy; So, Daniel; Tyers, Mike; Ho, Chi-Yip
2008-02-06
Molecular barcode arrays provide a powerful means to analyze cellular phenotypes in parallel through detection of short (20-60 base) unique sequence tags, or "barcodes", associated with each strain or clone in a collection. However, costs of current methods for microarray construction, whether by in situ oligonucleotide synthesis or ex situ coupling of modified oligonucleotides to the slide surface are often prohibitive to large-scale analyses. Here we demonstrate that unmodified 20mer oligonucleotide probes printed on conventional surfaces show comparable hybridization signals to covalently linked 5'-amino-modified probes. As a test case, we undertook systematic cell size analysis of the budding yeast Saccharomyces cerevisiae genome-wide deletion collection by size separation of the deletion pool followed by determination of strain abundance in size fractions by barcode arrays. We demonstrate that the properties of a 13K unique feature spotted 20 mer oligonucleotide barcode microarray compare favorably with an analogous covalently-linked oligonucleotide array. Further, cell size profiles obtained with the size selection/barcode array approach recapitulate previous cell size measurements of individual deletion strains. Finally, through atomic force microscopy (AFM), we characterize the mechanism of hybridization to unmodified barcode probes on the slide surface. These studies push the lower limit of probe size in genome-scale unmodified oligonucleotide microarray construction and demonstrate a versatile, cost-effective and reliable method for molecular barcode analysis.
Cook, Michael A.; Chan, Chi-Kin; Jorgensen, Paul; Ketela, Troy; So, Daniel; Tyers, Mike; Ho, Chi-Yip
2008-01-01
Background Molecular barcode arrays provide a powerful means to analyze cellular phenotypes in parallel through detection of short (20–60 base) unique sequence tags, or “barcodes”, associated with each strain or clone in a collection. However, costs of current methods for microarray construction, whether by in situ oligonucleotide synthesis or ex situ coupling of modified oligonucleotides to the slide surface are often prohibitive to large-scale analyses. Methodology/Principal Findings Here we demonstrate that unmodified 20mer oligonucleotide probes printed on conventional surfaces show comparable hybridization signals to covalently linked 5′-amino-modified probes. As a test case, we undertook systematic cell size analysis of the budding yeast Saccharomyces cerevisiae genome-wide deletion collection by size separation of the deletion pool followed by determination of strain abundance in size fractions by barcode arrays. We demonstrate that the properties of a 13K unique feature spotted 20 mer oligonucleotide barcode microarray compare favorably with an analogous covalently-linked oligonucleotide array. Further, cell size profiles obtained with the size selection/barcode array approach recapitulate previous cell size measurements of individual deletion strains. Finally, through atomic force microscopy (AFM), we characterize the mechanism of hybridization to unmodified barcode probes on the slide surface. Conclusions/Significance These studies push the lower limit of probe size in genome-scale unmodified oligonucleotide microarray construction and demonstrate a versatile, cost-effective and reliable method for molecular barcode analysis. PMID:18253494
High-density, microsphere-based fiber optic DNA microarrays.
Epstein, Jason R; Leung, Amy P K; Lee, Kyong Hoon; Walt, David R
2003-05-01
A high-density fiber optic DNA microarray has been developed consisting of oligonucleotide-functionalized, 3.1-microm-diameter microspheres randomly distributed on the etched face of an imaging fiber bundle. The fiber bundles are comprised of 6000-50000 fused optical fibers and each fiber terminates with an etched well. The microwell array is capable of housing complementary-sized microspheres, each containing thousands of copies of a unique oligonucleotide probe sequence. The array fabrication process results in random microsphere placement. Determining the position of microspheres in the random array requires an optical encoding scheme. This array platform provides many advantages over other array formats. The microsphere-stock suspension concentration added to the etched fiber can be controlled to provide inherent sensor redundancy. Examining identical microspheres has a beneficial effect on the signal-to-noise ratio. As other sequences of interest are discovered, new microsphere sensing elements can be added to existing microsphere pools and new arrays can be fabricated incorporating the new sequences without altering the existing detection capabilities. These microarrays contain the smallest feature sizes (3 microm) of any DNA array, allowing interrogation of extremely small sample volumes. Reducing the feature size results in higher local target molecule concentrations, creating rapid and highly sensitive assays. The microsphere array platform is also flexible in its applications; research has included DNA-protein interaction profiles, microbial strain differentiation, and non-labeled target interrogation with molecular beacons. Fiber optic microsphere-based DNA microarrays have a simple fabrication protocol enabling their expansion into other applications, such as single cell-based assays.
Flow-pattern Guided Fabrication of High-density Barcode Antibody Microarray
Ramirez, Lisa S.; Wang, Jun
2016-01-01
Antibody microarray as a well-developed technology is currently challenged by a few other established or emerging high-throughput technologies. In this report, we renovate the antibody microarray technology by using a novel approach for manufacturing and by introducing new features. The fabrication of our high-density antibody microarray is accomplished through perpendicularly oriented flow-patterning of single stranded DNAs and subsequent conversion mediated by DNA-antibody conjugates. This protocol outlines the critical steps in flow-patterning DNA, producing and purifying DNA-antibody conjugates, and assessing the quality of the fabricated microarray. The uniformity and sensitivity are comparable with conventional microarrays, while our microarray fabrication does not require the assistance of an array printer and can be performed in most research laboratories. The other major advantage is that the size of our microarray units is 10 times smaller than that of printed arrays, offering the unique capability of analyzing functional proteins from single cells when interfacing with generic microchip designs. This barcode technology can be widely employed in biomarker detection, cell signaling studies, tissue engineering, and a variety of clinical applications. PMID:26780370
Fuzzy support vector machine for microarray imbalanced data classification
NASA Astrophysics Data System (ADS)
Ladayya, Faroh; Purnami, Santi Wulan; Irhamah
2017-11-01
DNA microarrays are data containing gene expression with small sample sizes and high number of features. Furthermore, imbalanced classes is a common problem in microarray data. This occurs when a dataset is dominated by a class which have significantly more instances than the other minority classes. Therefore, it is needed a classification method that solve the problem of high dimensional and imbalanced data. Support Vector Machine (SVM) is one of the classification methods that is capable of handling large or small samples, nonlinear, high dimensional, over learning and local minimum issues. SVM has been widely applied to DNA microarray data classification and it has been shown that SVM provides the best performance among other machine learning methods. However, imbalanced data will be a problem because SVM treats all samples in the same importance thus the results is bias for minority class. To overcome the imbalanced data, Fuzzy SVM (FSVM) is proposed. This method apply a fuzzy membership to each input point and reformulate the SVM such that different input points provide different contributions to the classifier. The minority classes have large fuzzy membership so FSVM can pay more attention to the samples with larger fuzzy membership. Given DNA microarray data is a high dimensional data with a very large number of features, it is necessary to do feature selection first using Fast Correlation based Filter (FCBF). In this study will be analyzed by SVM, FSVM and both methods by applying FCBF and get the classification performance of them. Based on the overall results, FSVM on selected features has the best classification performance compared to SVM.
Ooi, Chia Huey; Chetty, Madhu; Teng, Shyh Wei
2006-06-23
Due to the large number of genes in a typical microarray dataset, feature selection looks set to play an important role in reducing noise and computational cost in gene expression-based tissue classification while improving accuracy at the same time. Surprisingly, this does not appear to be the case for all multiclass microarray datasets. The reason is that many feature selection techniques applied on microarray datasets are either rank-based and hence do not take into account correlations between genes, or are wrapper-based, which require high computational cost, and often yield difficult-to-reproduce results. In studies where correlations between genes are considered, attempts to establish the merit of the proposed techniques are hampered by evaluation procedures which are less than meticulous, resulting in overly optimistic estimates of accuracy. We present two realistically evaluated correlation-based feature selection techniques which incorporate, in addition to the two existing criteria involved in forming a predictor set (relevance and redundancy), a third criterion called the degree of differential prioritization (DDP). DDP functions as a parameter to strike the balance between relevance and redundancy, providing our techniques with the novel ability to differentially prioritize the optimization of relevance against redundancy (and vice versa). This ability proves useful in producing optimal classification accuracy while using reasonably small predictor set sizes for nine well-known multiclass microarray datasets. For multiclass microarray datasets, especially the GCM and NCI60 datasets, DDP enables our filter-based techniques to produce accuracies better than those reported in previous studies which employed similarly realistic evaluation procedures.
Temperature-controlled microintaglio printing for high-resolution micropatterning of RNA molecules.
Kobayashi, Ryo; Biyani, Manish; Ueno, Shingo; Kumal, Subhashini Raj; Kuramochi, Hiromi; Ichiki, Takanori
2015-05-15
We have developed an advanced microintaglio printing method for fabricating fine and high-density micropatterns and applied it to the microarraying of RNA molecules. The microintaglio printing of RNA reported here is based on the hybridization of RNA with immobilized complementary DNA probes. The hybridization was controlled by switching the RNA conformation via the temperature, and an RNA microarray with a diameter of 1.5 µm and a density of 40,000 spots/mm(2) with high contrast was successfully fabricated. Specifically, no size effects were observed in the uniformity of patterned signals over a range of microarray feature sizes spanning one order of magnitude. Additionally, we have developed a microintaglio printing method for transcribed RNA microarrays on demand using DNA-immobilized magnetic beads. The beads were arrayed on wells fabricated on a printing mold and the wells were filled with in vitro transcription reagent and sealed with a DNA-immobilized glass substrate. Subsequently, RNA was in situ synthesized using the bead-immobilized DNA as a template and printed onto the substrate via hybridization. Since the microintaglio printing of RNA using DNA-immobilized beads enables the fabrication of a microarray of spots composed of multiple RNA sequences, it will be possible to screen or analyze RNA functions using an RNA microarray fabricated by temperature-controlled microintaglio printing (TC-µIP). Copyright © 2014 Elsevier B.V. All rights reserved.
Wong, Gerard; Leckie, Christopher; Kowalczyk, Adam
2012-01-15
Feature selection is a key concept in machine learning for microarray datasets, where features represented by probesets are typically several orders of magnitude larger than the available sample size. Computational tractability is a key challenge for feature selection algorithms in handling very high-dimensional datasets beyond a hundred thousand features, such as in datasets produced on single nucleotide polymorphism microarrays. In this article, we present a novel feature set reduction approach that enables scalable feature selection on datasets with hundreds of thousands of features and beyond. Our approach enables more efficient handling of higher resolution datasets to achieve better disease subtype classification of samples for potentially more accurate diagnosis and prognosis, which allows clinicians to make more informed decisions in regards to patient treatment options. We applied our feature set reduction approach to several publicly available cancer single nucleotide polymorphism (SNP) array datasets and evaluated its performance in terms of its multiclass predictive classification accuracy over different cancer subtypes, its speedup in execution as well as its scalability with respect to sample size and array resolution. Feature Set Reduction (FSR) was able to reduce the dimensions of an SNP array dataset by more than two orders of magnitude while achieving at least equal, and in most cases superior predictive classification performance over that achieved on features selected by existing feature selection methods alone. An examination of the biological relevance of frequently selected features from FSR-reduced feature sets revealed strong enrichment in association with cancer. FSR was implemented in MATLAB R2010b and is available at http://ww2.cs.mu.oz.au/~gwong/FSR.
Multi-task feature selection in microarray data by binary integer programming.
Lan, Liang; Vucetic, Slobodan
2013-12-20
A major challenge in microarray classification is that the number of features is typically orders of magnitude larger than the number of examples. In this paper, we propose a novel feature filter algorithm to select the feature subset with maximal discriminative power and minimal redundancy by solving a quadratic objective function with binary integer constraints. To improve the computational efficiency, the binary integer constraints are relaxed and a low-rank approximation to the quadratic term is applied. The proposed feature selection algorithm was extended to solve multi-task microarray classification problems. We compared the single-task version of the proposed feature selection algorithm with 9 existing feature selection methods on 4 benchmark microarray data sets. The empirical results show that the proposed method achieved the most accurate predictions overall. We also evaluated the multi-task version of the proposed algorithm on 8 multi-task microarray datasets. The multi-task feature selection algorithm resulted in significantly higher accuracy than when using the single-task feature selection methods.
A study of metaheuristic algorithms for high dimensional feature selection on microarray data
NASA Astrophysics Data System (ADS)
Dankolo, Muhammad Nasiru; Radzi, Nor Haizan Mohamed; Sallehuddin, Roselina; Mustaffa, Noorfa Haszlinna
2017-11-01
Microarray systems enable experts to examine gene profile at molecular level using machine learning algorithms. It increases the potentials of classification and diagnosis of many diseases at gene expression level. Though, numerous difficulties may affect the efficiency of machine learning algorithms which includes vast number of genes features comprised in the original data. Many of these features may be unrelated to the intended analysis. Therefore, feature selection is necessary to be performed in the data pre-processing. Many feature selection algorithms are developed and applied on microarray which including the metaheuristic optimization algorithms. This paper discusses the application of the metaheuristics algorithms for feature selection in microarray dataset. This study reveals that, the algorithms have yield an interesting result with limited resources thereby saving computational expenses of machine learning algorithms.
Predicting features of breast cancer with gene expression patterns.
Lu, Xuesong; Lu, Xin; Wang, Zhigang C; Iglehart, J Dirk; Zhang, Xuegong; Richardson, Andrea L
2008-03-01
Data from gene expression arrays hold an enormous amount of biological information. We sought to determine if global gene expression in primary breast cancers contained information about biologic, histologic, and anatomic features of the disease in individual patients. Microarray data from the tumors of 129 patients were analyzed for the ability to predict biomarkers [estrogen receptor (ER) and HER2], histologic features [grade and lymphatic-vascular invasion (LVI)], and stage parameters (tumor size and lymph node metastasis). Multiple statistical predictors were used and the prediction accuracy was determined by cross-validation error rate; multidimensional scaling (MDS) allowed visualization of the predicted states under study. Models built from gene expression data accurately predict ER and HER2 status, and divide tumor grade into high-grade and low-grade clusters; intermediate-grade tumors are not a unique group. In contrast, gene expression data is inaccurate at predicting tumor size, lymph node status or LVI. The best model for prediction of nodal status included tumor size, LVI status and pathologically defined tumor subtype (based on combinations of ER, HER2, and grade); the addition of microarray-based prediction to this model failed to improve the prediction accuracy. Global gene expression supports a binary division of ER, HER2, and grade, clearly separating tumors into two categories; intermediate values for these bio-indicators do not define intermediate tumor subsets. Results are consistent with a model of regional metastasis that depends on inherent biologic differences in metastatic propensity between breast cancer subtypes, upon which time and chance then operate.
Cloud-scale genomic signals processing classification analysis for gene expression microarray data.
Harvey, Benjamin; Soo-Yeon Ji
2014-01-01
As microarray data available to scientists continues to increase in size and complexity, it has become overwhelmingly important to find multiple ways to bring inference though analysis of DNA/mRNA sequence data that is useful to scientists. Though there have been many attempts to elucidate the issue of bringing forth biological inference by means of wavelet preprocessing and classification, there has not been a research effort that focuses on a cloud-scale classification analysis of microarray data using Wavelet thresholding in a Cloud environment to identify significantly expressed features. This paper proposes a novel methodology that uses Wavelet based Denoising to initialize a threshold for determination of significantly expressed genes for classification. Additionally, this research was implemented and encompassed within cloud-based distributed processing environment. The utilization of Cloud computing and Wavelet thresholding was used for the classification 14 tumor classes from the Global Cancer Map (GCM). The results proved to be more accurate than using a predefined p-value for differential expression classification. This novel methodology analyzed Wavelet based threshold features of gene expression in a Cloud environment, furthermore classifying the expression of samples by analyzing gene patterns, which inform us of biological processes. Moreover, enabling researchers to face the present and forthcoming challenges that may arise in the analysis of data in functional genomics of large microarray datasets.
Single molecule fluorescence microscopy for ultra-sensitive RNA expression profiling
NASA Astrophysics Data System (ADS)
Hesse, Jan; Jacak, Jaroslaw; Regl, Gerhard; Eichberger, Thomas; Aberger, Fritz; Schlapak, Robert; Howorka, Stefan; Muresan, Leila; Frischauf, Anna-Maria; Schütz, Gerhard J.
2007-02-01
We developed a microarray analysis platform for ultra-sensitive RNA expression profiling of minute samples. It utilizes a novel scanning system for single molecule fluorescence detection on cm2 size samples in combination with specialized biochips, optimized for low autofluorescence and weak unspecific adsorption. 20 μg total RNA was extracted from 10 6 cells of a human keratinocyte cell line (HaCaT) and reversely transcribed in the presence of Alexa647-aha-dUTP. 1% of the resulting labeled cDNA was used for complex hybridization to a custom-made oligonucleotide microarray representing a set of 125 different genes. For low abundant genes, individual cDNA molecules hybridized to the microarray spots could be resolved. Single cDNA molecules hybridized to the chip surface appeared as diffraction limited features in the fluorescence images. The à trous wavelet method was utilized for localization and counting of the separated cDNA signals. Subsequently, the degree of labeling of the localized cDNA molecules was determined by brightness analysis for the different genes. Variations by factors up to 6 were found, which in conventional microarray analysis would result in a misrepresentation of the relative abundance of mRNAs.
A novel feature extraction approach for microarray data based on multi-algorithm fusion
Jiang, Zhu; Xu, Rong
2015-01-01
Feature extraction is one of the most important and effective method to reduce dimension in data mining, with emerging of high dimensional data such as microarray gene expression data. Feature extraction for gene selection, mainly serves two purposes. One is to identify certain disease-related genes. The other is to find a compact set of discriminative genes to build a pattern classifier with reduced complexity and improved generalization capabilities. Depending on the purpose of gene selection, two types of feature extraction algorithms including ranking-based feature extraction and set-based feature extraction are employed in microarray gene expression data analysis. In ranking-based feature extraction, features are evaluated on an individual basis, without considering inter-relationship between features in general, while set-based feature extraction evaluates features based on their role in a feature set by taking into account dependency between features. Just as learning methods, feature extraction has a problem in its generalization ability, which is robustness. However, the issue of robustness is often overlooked in feature extraction. In order to improve the accuracy and robustness of feature extraction for microarray data, a novel approach based on multi-algorithm fusion is proposed. By fusing different types of feature extraction algorithms to select the feature from the samples set, the proposed approach is able to improve feature extraction performance. The new approach is tested against gene expression dataset including Colon cancer data, CNS data, DLBCL data, and Leukemia data. The testing results show that the performance of this algorithm is better than existing solutions. PMID:25780277
A novel feature extraction approach for microarray data based on multi-algorithm fusion.
Jiang, Zhu; Xu, Rong
2015-01-01
Feature extraction is one of the most important and effective method to reduce dimension in data mining, with emerging of high dimensional data such as microarray gene expression data. Feature extraction for gene selection, mainly serves two purposes. One is to identify certain disease-related genes. The other is to find a compact set of discriminative genes to build a pattern classifier with reduced complexity and improved generalization capabilities. Depending on the purpose of gene selection, two types of feature extraction algorithms including ranking-based feature extraction and set-based feature extraction are employed in microarray gene expression data analysis. In ranking-based feature extraction, features are evaluated on an individual basis, without considering inter-relationship between features in general, while set-based feature extraction evaluates features based on their role in a feature set by taking into account dependency between features. Just as learning methods, feature extraction has a problem in its generalization ability, which is robustness. However, the issue of robustness is often overlooked in feature extraction. In order to improve the accuracy and robustness of feature extraction for microarray data, a novel approach based on multi-algorithm fusion is proposed. By fusing different types of feature extraction algorithms to select the feature from the samples set, the proposed approach is able to improve feature extraction performance. The new approach is tested against gene expression dataset including Colon cancer data, CNS data, DLBCL data, and Leukemia data. The testing results show that the performance of this algorithm is better than existing solutions.
Kumar, Mukesh; Rath, Nitish Kumar; Rath, Santanu Kumar
2016-04-01
Microarray-based gene expression profiling has emerged as an efficient technique for classification, prognosis, diagnosis, and treatment of cancer. Frequent changes in the behavior of this disease generates an enormous volume of data. Microarray data satisfies both the veracity and velocity properties of big data, as it keeps changing with time. Therefore, the analysis of microarray datasets in a small amount of time is essential. They often contain a large amount of expression, but only a fraction of it comprises genes that are significantly expressed. The precise identification of genes of interest that are responsible for causing cancer are imperative in microarray data analysis. Most existing schemes employ a two-phase process such as feature selection/extraction followed by classification. In this paper, various statistical methods (tests) based on MapReduce are proposed for selecting relevant features. After feature selection, a MapReduce-based K-nearest neighbor (mrKNN) classifier is also employed to classify microarray data. These algorithms are successfully implemented in a Hadoop framework. A comparative analysis is done on these MapReduce-based models using microarray datasets of various dimensions. From the obtained results, it is observed that these models consume much less execution time than conventional models in processing big data. Copyright © 2016 Elsevier Inc. All rights reserved.
A Review of Feature Extraction Software for Microarray Gene Expression Data
Tan, Ching Siang; Ting, Wai Soon; Mohamad, Mohd Saberi; Chan, Weng Howe; Deris, Safaai; Ali Shah, Zuraini
2014-01-01
When gene expression data are too large to be processed, they are transformed into a reduced representation set of genes. Transforming large-scale gene expression data into a set of genes is called feature extraction. If the genes extracted are carefully chosen, this gene set can extract the relevant information from the large-scale gene expression data, allowing further analysis by using this reduced representation instead of the full size data. In this paper, we review numerous software applications that can be used for feature extraction. The software reviewed is mainly for Principal Component Analysis (PCA), Independent Component Analysis (ICA), Partial Least Squares (PLS), and Local Linear Embedding (LLE). A summary and sources of the software are provided in the last section for each feature extraction method. PMID:25250315
2009-01-01
Background Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for the observed discrepancies are the measurement error associated with each feature and the choice of preprocessing method. Microarray data are known to be subject to technical variation and the confidence intervals around individual point estimates of expression levels can be wide. Furthermore, the estimated expression values also vary depending on the selected preprocessing scheme. In microarray breast cancer classification studies, however, these two forms of feature variability are almost always ignored and hence their exact role is unclear. Results We have performed a comprehensive sensitivity analysis of microarray breast cancer classification under the two types of feature variability mentioned above. We used data from six state of the art preprocessing methods, using a compendium consisting of eight diferent datasets, involving 1131 hybridizations, containing data from both one and two-color array technology. For a wide range of classifiers, we performed a joint study on performance, concordance and stability. In the stability analysis we explicitly tested classifiers for their noise tolerance by using perturbed expression profiles that are based on uncertainty information directly related to the preprocessing methods. Our results indicate that signature composition is strongly influenced by feature variability, even if the array platform and the stratification of patient samples are identical. In addition, we show that there is often a high level of discordance between individual class assignments for signatures constructed on data coming from different preprocessing schemes, even if the actual signature composition is identical. Conclusion Feature variability can have a strong impact on breast cancer signature composition, as well as the classification of individual patient samples. We therefore strongly recommend that feature variability is considered in analyzing data from microarray breast cancer expression profiling experiments. PMID:19941644
Classification of Microarray Data Using Kernel Fuzzy Inference System
Kumar Rath, Santanu
2014-01-01
The DNA microarray classification technique has gained more popularity in both research and practice. In real data analysis, such as microarray data, the dataset contains a huge number of insignificant and irrelevant features that tend to lose useful information. Classes with high relevance and feature sets with high significance are generally referred for the selected features, which determine the samples classification into their respective classes. In this paper, kernel fuzzy inference system (K-FIS) algorithm is applied to classify the microarray data (leukemia) using t-test as a feature selection method. Kernel functions are used to map original data points into a higher-dimensional (possibly infinite-dimensional) feature space defined by a (usually nonlinear) function ϕ through a mathematical process called the kernel trick. This paper also presents a comparative study for classification using K-FIS along with support vector machine (SVM) for different set of features (genes). Performance parameters available in the literature such as precision, recall, specificity, F-measure, ROC curve, and accuracy are considered to analyze the efficiency of the classification model. From the proposed approach, it is apparent that K-FIS model obtains similar results when compared with SVM model. This is an indication that the proposed approach relies on kernel function. PMID:27433543
Cheng, Ningtao; Wu, Leihong; Cheng, Yiyu
2013-01-01
The promise of microarray technology in providing prediction classifiers for cancer outcome estimation has been confirmed by a number of demonstrable successes. However, the reliability of prediction results relies heavily on the accuracy of statistical parameters involved in classifiers. It cannot be reliably estimated with only a small number of training samples. Therefore, it is of vital importance to determine the minimum number of training samples and to ensure the clinical value of microarrays in cancer outcome prediction. We evaluated the impact of training sample size on model performance extensively based on 3 large-scale cancer microarray datasets provided by the second phase of MicroArray Quality Control project (MAQC-II). An SSNR-based (scale of signal-to-noise ratio) protocol was proposed in this study for minimum training sample size determination. External validation results based on another 3 cancer datasets confirmed that the SSNR-based approach could not only determine the minimum number of training samples efficiently, but also provide a valuable strategy for estimating the underlying performance of classifiers in advance. Once translated into clinical routine applications, the SSNR-based protocol would provide great convenience in microarray-based cancer outcome prediction in improving classifier reliability. PMID:23861920
Ou, Hong-Yu; He, Xinyi; Harrison, Ewan M.; Kulasekara, Bridget R.; Thani, Ali Bin; Kadioglu, Aras; Lory, Stephen; Hinton, Jay C. D.; Barer, Michael R.; Rajakumar, Kumar
2007-01-01
MobilomeFINDER (http://mml.sjtu.edu.cn/MobilomeFINDER) is an interactive online tool that facilitates bacterial genomic island or ‘mobile genome’ (mobilome) discovery; it integrates the ArrayOme and tRNAcc software packages. ArrayOme utilizes a microarray-derived comparative genomic hybridization input data set to generate ‘inferred contigs’ produced by merging adjacent genes classified as ‘present’. Collectively these ‘fragments’ represent a hypothetical ‘microarray-visualized genome (MVG)’. ArrayOme permits recognition of discordances between physical genome and MVG sizes, thereby enabling identification of strains rich in microarray-elusive novel genes. Individual tRNAcc tools facilitate automated identification of genomic islands by comparative analysis of the contents and contexts of tRNA sites and other integration hotspots in closely related sequenced genomes. Accessory tools facilitate design of hotspot-flanking primers for in silico and/or wet-science-based interrogation of cognate loci in unsequenced strains and analysis of islands for features suggestive of foreign origins; island-specific and genome-contextual features are tabulated and represented in schematic and graphical forms. To date we have used MobilomeFINDER to analyse several Enterobacteriaceae, Pseudomonas aeruginosa and Streptococcus suis genomes. MobilomeFINDER enables high-throughput island identification and characterization through increased exploitation of emerging sequence data and PCR-based profiling of unsequenced test strains; subsequent targeted yeast recombination-based capture permits full-length sequencing and detailed functional studies of novel genomic islands. PMID:17537813
Larger core size has superior technical and analytical accuracy in bladder tissue microarray.
Eskaros, Adel Rh; Egloff, Shanna A Arnold; Boyd, Kelli L; Richardson, Joyce E; Hyndman, M Eric; Zijlstra, Andries
2017-03-01
The construction of tissue microarrays (TMAs) with cores from a large number of paraffin-embedded tissues (donors) into a single paraffin block (recipient) is an effective method of analyzing samples from many patient specimens simultaneously. For the TMA to be successful, the cores within it must capture the correct histologic areas from the donor blocks (technical accuracy) and maintain concordance with the tissue of origin (analytical accuracy). This can be particularly challenging for tissues with small histological features such as small islands of carcinoma in situ (CIS), thin layers of normal urothelial lining of the bladder, or cancers that exhibit intratumor heterogeneity. In an effort to create a comprehensive TMA of a bladder cancer patient cohort that accurately represents the tumor heterogeneity and captures the small features of normal and CIS, we determined how core size (0.6 vs 1.0 mm) impacted the technical and analytical accuracy of the TMA. The larger 1.0 mm core exhibited better technical accuracy for all tissue types at 80.9% (normal), 94.2% (tumor), and 71.4% (CIS) compared with 58.6%, 85.9%, and 63.8% for 0.6 mm cores. Although the 1.0 mm core provided better tissue capture, increasing the number of replicates from two to three allowed with the 0.6 mm core compensated for this reduced technical accuracy. However, quantitative image analysis of proliferation using both Ki67+ immunofluorescence counts and manual mitotic counts demonstrated that the 1.0 mm core size also exhibited significantly greater analytical accuracy (P=0.004 and 0.035, respectively, r 2 =0.979 and 0.669, respectively). Ultimately, our findings demonstrate that capturing two or more 1.0 mm cores for TMA construction provides superior technical and analytical accuracy over the smaller 0.6 mm cores, especially for tissues harboring small histological features or substantial heterogeneity.
Skip to main content DNA Microarray Technology Enter Search Term(s): Español Research Funding An Overview Bioinformatics Current Grants Education and Training Funding Extramural Research News Features Funding Divisions Funding ...
Dendrimeric coating of glass slides for sensitive DNA microarrays analysis
Le Berre, Véronique; Trévisiol, Emmanuelle; Dagkessamanskaia, Adilia; Sokol, Serguei; Caminade, Anne-Marie; Majoral, Jean Pierre; Meunier, Bernard; François, Jean
2003-01-01
Successful use and reliability of microarray technology is highly dependent on several factors, including surface chemistry parameters and accessibility of cDNA targets to the DNA probes fixed onto the surface. Here, we show that functionalisation of glass slides with homemade dendrimers allow production of more sensitive and reliable DNA microarrays. The dendrimers are nanometric structures of size-controlled diameter with aldehyde function at their periphery. Covalent attachment of these spherical reactive chemical structures on amino-silanised glass slides generates a reactive ∼100 Å layer onto which amino-modified DNA probes are covalently bound. This new grafting chemistry leads to the formation of uniform and homogenous spots. More over, probe concentration before spotting could be reduced from 0.2 to 0.02 mg/ml with PCR products and from 20 to 5 µM with 70mer oligonucleotides without affecting signal intensities after hybridisation with Cy3- and Cy5-labelled targets. More interestingly, while the binding capacity of captured probes on dendrimer-activated glass surface (named dendrislides) is roughly similar to other functionalised glass slides from commercial sources, detection sensitivity was 2-fold higher than with other available DNA microarrays. This detection limit was estimated to 0.1 pM of cDNA targets. Altogether, these features make dendrimer-activated slides ideal for manufacturing cost-effective DNA arrays applicable for gene expression and detection of mutations. PMID:12907740
Evaluation of artificial time series microarray data for dynamic gene regulatory network inference.
Xenitidis, P; Seimenis, I; Kakolyris, S; Adamopoulos, A
2017-08-07
High-throughput technology like microarrays is widely used in the inference of gene regulatory networks (GRNs). We focused on time series data since we are interested in the dynamics of GRNs and the identification of dynamic networks. We evaluated the amount of information that exists in artificial time series microarray data and the ability of an inference process to produce accurate models based on them. We used dynamic artificial gene regulatory networks in order to create artificial microarray data. Key features that characterize microarray data such as the time separation of directly triggered genes, the percentage of directly triggered genes and the triggering function type were altered in order to reveal the limits that are imposed by the nature of microarray data on the inference process. We examined the effect of various factors on the inference performance such as the network size, the presence of noise in microarray data, and the network sparseness. We used a system theory approach and examined the relationship between the pole placement of the inferred system and the inference performance. We examined the relationship between the inference performance in the time domain and the true system parameter identification. Simulation results indicated that time separation and the percentage of directly triggered genes are crucial factors. Also, network sparseness, the triggering function type and noise in input data affect the inference performance. When two factors were simultaneously varied, it was found that variation of one parameter significantly affects the dynamic response of the other. Crucial factors were also examined using a real GRN and acquired results confirmed simulation findings with artificial data. Different initial conditions were also used as an alternative triggering approach. Relevant results confirmed that the number of datasets constitutes the most significant parameter with regard to the inference performance. Copyright © 2017 Elsevier Ltd. All rights reserved.
Split-plot microarray experiments: issues of design, power and sample size.
Tsai, Pi-Wen; Lee, Mei-Ling Ting
2005-01-01
This article focuses on microarray experiments with two or more factors in which treatment combinations of the factors corresponding to the samples paired together onto arrays are not completely random. A main effect of one (or more) factor(s) is confounded with arrays (the experimental blocks). This is called a split-plot microarray experiment. We utilise an analysis of variance (ANOVA) model to assess differentially expressed genes for between-array and within-array comparisons that are generic under a split-plot microarray experiment. Instead of standard t- or F-test statistics that rely on mean square errors of the ANOVA model, we use a robust method, referred to as 'a pooled percentile estimator', to identify genes that are differentially expressed across different treatment conditions. We illustrate the design and analysis of split-plot microarray experiments based on a case application described by Jin et al. A brief discussion of power and sample size for split-plot microarray experiments is also presented.
ERIC Educational Resources Information Center
Plomin, Robert; Schalkwyk, Leonard C.
2007-01-01
Microarrays are revolutionizing genetics by making it possible to genotype hundreds of thousands of DNA markers and to assess the expression (RNA transcripts) of all of the genes in the genome. Microarrays are slides the size of a postage stamp that contain millions of DNA sequences to which single-stranded DNA or RNA can hybridize. This…
Yılmaz Isıkhan, Selen; Karabulut, Erdem; Alpar, Celal Reha
2016-01-01
Background/Aim . Evaluating the success of dose prediction based on genetic or clinical data has substantially advanced recently. The aim of this study is to predict various clinical dose values from DNA gene expression datasets using data mining techniques. Materials and Methods . Eleven real gene expression datasets containing dose values were included. First, important genes for dose prediction were selected using iterative sure independence screening. Then, the performances of regression trees (RTs), support vector regression (SVR), RT bagging, SVR bagging, and RT boosting were examined. Results . The results demonstrated that a regression-based feature selection method substantially reduced the number of irrelevant genes from raw datasets. Overall, the best prediction performance in nine of 11 datasets was achieved using SVR; the second most accurate performance was provided using a gradient-boosting machine (GBM). Conclusion . Analysis of various dose values based on microarray gene expression data identified common genes found in our study and the referenced studies. According to our findings, SVR and GBM can be good predictors of dose-gene datasets. Another result of the study was to identify the sample size of n = 25 as a cutoff point for RT bagging to outperform a single RT.
A biomimetic algorithm for the improved detection of microarray features
NASA Astrophysics Data System (ADS)
Nicolau, Dan V., Jr.; Nicolau, Dan V.; Maini, Philip K.
2007-02-01
One the major difficulties of microarray technology relate to the processing of large and - importantly - error-loaded images of the dots on the chip surface. Whatever the source of these errors, those obtained in the first stage of data acquisition - segmentation - are passed down to the subsequent processes, with deleterious results. As it has been demonstrated recently that biological systems have evolved algorithms that are mathematically efficient, this contribution attempts to test an algorithm that mimics a bacterial-"patented" algorithm for the search of available space and nutrients to find, "zero-in" and eventually delimitate the features existent on the microarray surface.
Robust gene selection methods using weighting schemes for microarray data analysis.
Kang, Suyeon; Song, Jongwoo
2017-09-02
A common task in microarray data analysis is to identify informative genes that are differentially expressed between two different states. Owing to the high-dimensional nature of microarray data, identification of significant genes has been essential in analyzing the data. However, the performances of many gene selection techniques are highly dependent on the experimental conditions, such as the presence of measurement error or a limited number of sample replicates. We have proposed new filter-based gene selection techniques, by applying a simple modification to significance analysis of microarrays (SAM). To prove the effectiveness of the proposed method, we considered a series of synthetic datasets with different noise levels and sample sizes along with two real datasets. The following findings were made. First, our proposed methods outperform conventional methods for all simulation set-ups. In particular, our methods are much better when the given data are noisy and sample size is small. They showed relatively robust performance regardless of noise level and sample size, whereas the performance of SAM became significantly worse as the noise level became high or sample size decreased. When sufficient sample replicates were available, SAM and our methods showed similar performance. Finally, our proposed methods are competitive with traditional methods in classification tasks for microarrays. The results of simulation study and real data analysis have demonstrated that our proposed methods are effective for detecting significant genes and classification tasks, especially when the given data are noisy or have few sample replicates. By employing weighting schemes, we can obtain robust and reliable results for microarray data analysis.
Image microarrays (IMA): Digital pathology's missing tool
Hipp, Jason; Cheng, Jerome; Pantanowitz, Liron; Hewitt, Stephen; Yagi, Yukako; Monaco, James; Madabhushi, Anant; Rodriguez-canales, Jaime; Hanson, Jeffrey; Roy-Chowdhuri, Sinchita; Filie, Armando C.; Feldman, Michael D.; Tomaszewski, John E.; Shih, Natalie NC.; Brodsky, Victor; Giaccone, Giuseppe; Emmert-Buck, Michael R.; Balis, Ulysses J.
2011-01-01
Introduction: The increasing availability of whole slide imaging (WSI) data sets (digital slides) from glass slides offers new opportunities for the development of computer-aided diagnostic (CAD) algorithms. With the all-digital pathology workflow that these data sets will enable in the near future, literally millions of digital slides will be generated and stored. Consequently, the field in general and pathologists, specifically, will need tools to help extract actionable information from this new and vast collective repository. Methods: To address this limitation, we designed and implemented a tool (dCORE) to enable the systematic capture of image tiles with constrained size and resolution that contain desired histopathologic features. Results: In this communication, we describe a user-friendly tool that will enable pathologists to mine digital slides archives to create image microarrays (IMAs). IMAs are to digital slides as tissue microarrays (TMAs) are to cell blocks. Thus, a single digital slide could be transformed into an array of hundreds to thousands of high quality digital images, with each containing key diagnostic morphologies and appropriate controls. Current manual digital image cut-and-paste methods that allow for the creation of a grid of images (such as an IMA) of matching resolutions are tedious. Conclusion: The ability to create IMAs representing hundreds to thousands of vetted morphologic features has numerous applications in education, proficiency testing, consensus case review, and research. Lastly, in a manner analogous to the way conventional TMA technology has significantly accelerated in situ studies of tissue specimens use of IMAs has similar potential to significantly accelerate CAD algorithm development. PMID:22200030
NASA Astrophysics Data System (ADS)
Bychkov, Dmitrii; Turkki, Riku; Haglund, Caj; Linder, Nina; Lundin, Johan
2016-03-01
Recent advances in computer vision enable increasingly accurate automated pattern classification. In the current study we evaluate whether a convolutional neural network (CNN) can be trained to predict disease outcome in patients with colorectal cancer based on images of tumor tissue microarray samples. We compare the prognostic accuracy of CNN features extracted from the whole, unsegmented tissue microarray spot image, with that of CNN features extracted from the epithelial and non-epithelial compartments, respectively. The prognostic accuracy of visually assessed histologic grade is used as a reference. The image data set consists of digitized hematoxylin-eosin (H and E) stained tissue microarray samples obtained from 180 patients with colorectal cancer. The patient samples represent a variety of histological grades, have data available on a series of clinicopathological variables including long-term outcome and ground truth annotations performed by experts. The CNN features extracted from images of the epithelial tissue compartment significantly predicted outcome (hazard ratio (HR) 2.08; CI95% 1.04-4.16; area under the curve (AUC) 0.66) in a test set of 60 patients, as compared to the CNN features extracted from unsegmented images (HR 1.67; CI95% 0.84-3.31, AUC 0.57) and visually assessed histologic grade (HR 1.96; CI95% 0.99-3.88, AUC 0.61). As a conclusion, a deep-learning classifier can be trained to predict outcome of colorectal cancer based on images of H and E stained tissue microarray samples and the CNN features extracted from the epithelial compartment only resulted in a prognostic discrimination comparable to that of visually determined histologic grade.
Yang, Mingxing; Li, Xiumin; Li, Zhibin; Ou, Zhimin; Liu, Ming; Liu, Suhuan; Li, Xuejun; Yang, Shuyu
2013-01-01
DNA microarray analysis is characterized by obtaining a large number of gene variables from a small number of observations. Cluster analysis is widely used to analyze DNA microarray data to make classification and diagnosis of disease. Because there are so many irrelevant and insignificant genes in a dataset, a feature selection approach must be employed in data analysis. The performance of cluster analysis of this high-throughput data depends on whether the feature selection approach chooses the most relevant genes associated with disease classes. Here we proposed a new method using multiple Orthogonal Partial Least Squares-Discriminant Analysis (mOPLS-DA) models and S-plots to select the most relevant genes to conduct three-class disease classification and prediction. We tested our method using Golub's leukemia microarray data. For three classes with subtypes, we proposed hierarchical orthogonal partial least squares-discriminant analysis (OPLS-DA) models and S-plots to select features for two main classes and their subtypes. For three classes in parallel, we employed three OPLS-DA models and S-plots to choose marker genes for each class. The power of feature selection to classify and predict three-class disease was evaluated using cluster analysis. Further, the general performance of our method was tested using four public datasets and compared with those of four other feature selection methods. The results revealed that our method effectively selected the most relevant features for disease classification and prediction, and its performance was better than that of the other methods.
Derivation of an artificial gene to improve classification accuracy upon gene selection.
Seo, Minseok; Oh, Sejong
2012-02-01
Classification analysis has been developed continuously since 1936. This research field has advanced as a result of development of classifiers such as KNN, ANN, and SVM, as well as through data preprocessing areas. Feature (gene) selection is required for very high dimensional data such as microarray before classification work. The goal of feature selection is to choose a subset of informative features that reduces processing time and provides higher classification accuracy. In this study, we devised a method of artificial gene making (AGM) for microarray data to improve classification accuracy. Our artificial gene was derived from a whole microarray dataset, and combined with a result of gene selection for classification analysis. We experimentally confirmed a clear improvement of classification accuracy after inserting artificial gene. Our artificial gene worked well for popular feature (gene) selection algorithms and classifiers. The proposed approach can be applied to any type of high dimensional dataset. Copyright © 2011 Elsevier Ltd. All rights reserved.
SoFoCles: feature filtering for microarray classification based on gene ontology.
Papachristoudis, Georgios; Diplaris, Sotiris; Mitkas, Pericles A
2010-02-01
Marker gene selection has been an important research topic in the classification analysis of gene expression data. Current methods try to reduce the "curse of dimensionality" by using statistical intra-feature set calculations, or classifiers that are based on the given dataset. In this paper, we present SoFoCles, an interactive tool that enables semantic feature filtering in microarray classification problems with the use of external, well-defined knowledge retrieved from the Gene Ontology. The notion of semantic similarity is used to derive genes that are involved in the same biological path during the microarray experiment, by enriching a feature set that has been initially produced with legacy methods. Among its other functionalities, SoFoCles offers a large repository of semantic similarity methods that are used in order to derive feature sets and marker genes. The structure and functionality of the tool are discussed in detail, as well as its ability to improve classification accuracy. Through experimental evaluation, SoFoCles is shown to outperform other classification schemes in terms of classification accuracy in two real datasets using different semantic similarity computation approaches.
Compact cancer biomarkers discovery using a swarm intelligence feature selection algorithm.
Martinez, Emmanuel; Alvarez, Mario Moises; Trevino, Victor
2010-08-01
Biomarker discovery is a typical application from functional genomics. Due to the large number of genes studied simultaneously in microarray data, feature selection is a key step. Swarm intelligence has emerged as a solution for the feature selection problem. However, swarm intelligence settings for feature selection fail to select small features subsets. We have proposed a swarm intelligence feature selection algorithm based on the initialization and update of only a subset of particles in the swarm. In this study, we tested our algorithm in 11 microarray datasets for brain, leukemia, lung, prostate, and others. We show that the proposed swarm intelligence algorithm successfully increase the classification accuracy and decrease the number of selected features compared to other swarm intelligence methods. Copyright © 2010 Elsevier Ltd. All rights reserved.
Direct on-chip DNA synthesis using electrochemically modified gold electrodes as solid support
NASA Astrophysics Data System (ADS)
Levrie, Karen; Jans, Karolien; Schepers, Guy; Vos, Rita; Van Dorpe, Pol; Lagae, Liesbet; Van Hoof, Chris; Van Aerschot, Arthur; Stakenborg, Tim
2018-04-01
DNA microarrays have propelled important advancements in the field of genomic research by enabling the monitoring of thousands of genes in parallel. The throughput can be increased even further by scaling down the microarray feature size. In this respect, microelectronics-based DNA arrays are promising as they can leverage semiconductor processing techniques with lithographic resolutions. We propose a method that enables the use of metal electrodes for de novo DNA synthesis without the need for an insulating support. By electrochemically functionalizing gold electrodes, these electrodes can act as solid support for phosphoramidite-based synthesis. The proposed method relies on the electrochemical reduction of diazonium salts, enabling site-specific incorporation of hydroxyl groups onto the metal electrodes. An automated DNA synthesizer was used to couple phosphoramidite moieties directly onto the OH-modified electrodes to obtain the desired oligonucleotide sequence. Characterization was done via cyclic voltammetry and fluorescence microscopy. Our results present a valuable proof-of-concept for the integration of solid-phase DNA synthesis with microelectronics.
Robust Learning of High-dimensional Biological Networks with Bayesian Networks
NASA Astrophysics Data System (ADS)
Nägele, Andreas; Dejori, Mathäus; Stetter, Martin
Structure learning of Bayesian networks applied to gene expression data has become a potentially useful method to estimate interactions between genes. However, the NP-hardness of Bayesian network structure learning renders the reconstruction of the full genetic network with thousands of genes unfeasible. Consequently, the maximal network size is usually restricted dramatically to a small set of genes (corresponding with variables in the Bayesian network). Although this feature reduction step makes structure learning computationally tractable, on the downside, the learned structure might be adversely affected due to the introduction of missing genes. Additionally, gene expression data are usually very sparse with respect to the number of samples, i.e., the number of genes is much greater than the number of different observations. Given these problems, learning robust network features from microarray data is a challenging task. This chapter presents several approaches tackling the robustness issue in order to obtain a more reliable estimation of learned network features.
Microfabricated Fountain Pens for High-Density DNA Arrays
Reese, Matthew O.; van Dam, R. Michae; Scherer, Axel; Quake, Stephen R.
2003-01-01
We used photolithographic microfabrication techniques to create very small stainless steel fountain pens that were installed in place of conventional pens on a microarray spotter. Because of the small feature size produced by the microfabricated pens, we were able to print arrays with up to 25,000 spots/cm2, significantly higher than can be achieved by other deposition methods. This feature density is sufficiently large that a standard microscope slide can contain multiple replicates of every gene in a complex organism such as a mouse or human. We tested carryover during array printing with dye solution, labeled DNA, and hybridized DNA, and we found it to be indistinguishable from background. Hybridization also showed good sequence specificity to printed oligonucleotides. In addition to improved slide capacity, the microfabrication process offers the possibility of low-cost mass-produced pens and the flexibility to include novel pen features that cannot be machined with conventional techniques. PMID:12975313
Application of machine learning on brain cancer multiclass classification
NASA Astrophysics Data System (ADS)
Panca, V.; Rustam, Z.
2017-07-01
Classification of brain cancer is a problem of multiclass classification. One approach to solve this problem is by first transforming it into several binary problems. The microarray gene expression dataset has the two main characteristics of medical data: extremely many features (genes) and only a few number of samples. The application of machine learning on microarray gene expression dataset mainly consists of two steps: feature selection and classification. In this paper, the features are selected using a method based on support vector machine recursive feature elimination (SVM-RFE) principle which is improved to solve multiclass classification, called multiple multiclass SVM-RFE. Instead of using only the selected features on a single classifier, this method combines the result of multiple classifiers. The features are divided into subsets and SVM-RFE is used on each subset. Then, the selected features on each subset are put on separate classifiers. This method enhances the feature selection ability of each single SVM-RFE. Twin support vector machine (TWSVM) is used as the method of the classifier to reduce computational complexity. While ordinary SVM finds single optimum hyperplane, the main objective Twin SVM is to find two non-parallel optimum hyperplanes. The experiment on the brain cancer microarray gene expression dataset shows this method could classify 71,4% of the overall test data correctly, using 100 and 1000 genes selected from multiple multiclass SVM-RFE feature selection method. Furthermore, the per class results show that this method could classify data of normal and MD class with 100% accuracy.
Where statistics and molecular microarray experiments biology meet.
Kelmansky, Diana M
2013-01-01
This review chapter presents a statistical point of view to microarray experiments with the purpose of understanding the apparent contradictions that often appear in relation to their results. We give a brief introduction of molecular biology for nonspecialists. We describe microarray experiments from their construction and the biological principles the experiments rely on, to data acquisition and analysis. The role of epidemiological approaches and sample size considerations are also discussed.
Vafaee Sharbaf, Fatemeh; Mosafer, Sara; Moattar, Mohammad Hossein
2016-06-01
This paper proposes an approach for gene selection in microarray data. The proposed approach consists of a primary filter approach using Fisher criterion which reduces the initial genes and hence the search space and time complexity. Then, a wrapper approach which is based on cellular learning automata (CLA) optimized with ant colony method (ACO) is used to find the set of features which improve the classification accuracy. CLA is applied due to its capability to learn and model complicated relationships. The selected features from the last phase are evaluated using ROC curve and the most effective while smallest feature subset is determined. The classifiers which are evaluated in the proposed framework are K-nearest neighbor; support vector machine and naïve Bayes. The proposed approach is evaluated on 4 microarray datasets. The evaluations confirm that the proposed approach can find the smallest subset of genes while approaching the maximum accuracy. Copyright © 2016 Elsevier Inc. All rights reserved.
A mixture model-based approach to the clustering of microarray expression data.
McLachlan, G J; Bean, R W; Peel, D
2002-03-01
This paper introduces the software EMMIX-GENE that has been developed for the specific purpose of a model-based approach to the clustering of microarray expression data, in particular, of tissue samples on a very large number of genes. The latter is a nonstandard problem in parametric cluster analysis because the dimension of the feature space (the number of genes) is typically much greater than the number of tissues. A feasible approach is provided by first selecting a subset of the genes relevant for the clustering of the tissue samples by fitting mixtures of t distributions to rank the genes in order of increasing size of the likelihood ratio statistic for the test of one versus two components in the mixture model. The imposition of a threshold on the likelihood ratio statistic used in conjunction with a threshold on the size of a cluster allows the selection of a relevant set of genes. However, even this reduced set of genes will usually be too large for a normal mixture model to be fitted directly to the tissues, and so the use of mixtures of factor analyzers is exploited to reduce effectively the dimension of the feature space of genes. The usefulness of the EMMIX-GENE approach for the clustering of tissue samples is demonstrated on two well-known data sets on colon and leukaemia tissues. For both data sets, relevant subsets of the genes are able to be selected that reveal interesting clusterings of the tissues that are either consistent with the external classification of the tissues or with background and biological knowledge of these sets. EMMIX-GENE is available at http://www.maths.uq.edu.au/~gjm/emmix-gene/
Kračun, Stjepan Krešimir; Fangel, Jonatan Ulrik; Rydahl, Maja Gro; Pedersen, Henriette Lodberg; Vidal-Melgosa, Silvia; Willats, William George Tycho
2017-01-01
Cell walls are an important feature of plant cells and a major component of the plant glycome. They have both structural and physiological functions and are critical for plant growth and development. The diversity and complexity of these structures demand advanced high-throughput techniques to answer questions about their structure, functions and roles in both fundamental and applied scientific fields. Microarray technology provides both the high-throughput and the feasibility aspects required to meet that demand. In this chapter, some of the most recent microarray-based techniques relating to plant cell walls are described together with an overview of related contemporary techniques applied to carbohydrate microarrays and their general potential in glycoscience. A detailed experimental procedure for high-throughput mapping of plant cell wall glycans using the comprehensive microarray polymer profiling (CoMPP) technique is included in the chapter and provides a good example of both the robust and high-throughput nature of microarrays as well as their applicability to plant glycomics.
Two-Dimensional VO2 Mesoporous Microarrays for High-Performance Supercapacitor
NASA Astrophysics Data System (ADS)
Fan, Yuqi; Ouyang, Delong; Li, Bao-Wen; Dang, Feng; Ren, Zongming
2018-05-01
Two-dimensional (2D) mesoporous VO2 microarrays have been prepared using an organic-inorganic liquid interface. The units of microarrays consist of needle-like VO2 particles with a mesoporous structure, in which crack-like pores with a pore size of about 2 nm and depth of 20-100 nm are distributed on the particle surface. The liquid interface acts as a template for the formation of the 2D microarrays, as identified from the kinetic observation. Due to the mesoporous structure of the units and high conductivity of the microarray, such 2D VO2 microarrays exhibit a high specific capacitance of 265 F/g at 1 A/g and excellent rate capability (182 F/g at 10 A/g) and cycling stability, suggesting the effect of unique microstructure for improving the electrochemical performance.
Stekel, Dov J.; Sarti, Donatella; Trevino, Victor; Zhang, Lihong; Salmon, Mike; Buckley, Chris D.; Stevens, Mark; Pallen, Mark J.; Penn, Charles; Falciani, Francesco
2005-01-01
A key step in the analysis of microarray data is the selection of genes that are differentially expressed. Ideally, such experiments should be properly replicated in order to infer both technical and biological variability, and the data should be subjected to rigorous hypothesis tests to identify the differentially expressed genes. However, in microarray experiments involving the analysis of very large numbers of biological samples, replication is not always practical. Therefore, there is a need for a method to select differentially expressed genes in a rational way from insufficiently replicated data. In this paper, we describe a simple method that uses bootstrapping to generate an error model from a replicated pilot study that can be used to identify differentially expressed genes in subsequent large-scale studies on the same platform, but in which there may be no replicated arrays. The method builds a stratified error model that includes array-to-array variability, feature-to-feature variability and the dependence of error on signal intensity. We apply this model to the characterization of the host response in a model of bacterial infection of human intestinal epithelial cells. We demonstrate the effectiveness of error model based microarray experiments and propose this as a general strategy for a microarray-based screening of large collections of biological samples. PMID:15800204
Analytical Protein Microarrays: Advancements Towards Clinical Applications
Sauer, Ursula
2017-01-01
Protein microarrays represent a powerful technology with the potential to serve as tools for the detection of a broad range of analytes in numerous applications such as diagnostics, drug development, food safety, and environmental monitoring. Key features of analytical protein microarrays include high throughput and relatively low costs due to minimal reagent consumption, multiplexing, fast kinetics and hence measurements, and the possibility of functional integration. So far, especially fundamental studies in molecular and cell biology have been conducted using protein microarrays, while the potential for clinical, notably point-of-care applications is not yet fully utilized. The question arises what features have to be implemented and what improvements have to be made in order to fully exploit the technology. In the past we have identified various obstacles that have to be overcome in order to promote protein microarray technology in the diagnostic field. Issues that need significant improvement to make the technology more attractive for the diagnostic market are for instance: too low sensitivity and deficiency in reproducibility, inadequate analysis time, lack of high-quality antibodies and validated reagents, lack of automation and portable instruments, and cost of instruments necessary for chip production and read-out. The scope of the paper at hand is to review approaches to solve these problems. PMID:28146048
Bennet, Jaison; Ganaprakasam, Chilambuchelvan Arul; Arputharaj, Kannan
2014-01-01
Cancer classification by doctors and radiologists was based on morphological and clinical features and had limited diagnostic ability in olden days. The recent arrival of DNA microarray technology has led to the concurrent monitoring of thousands of gene expressions in a single chip which stimulates the progress in cancer classification. In this paper, we have proposed a hybrid approach for microarray data classification based on nearest neighbor (KNN), naive Bayes, and support vector machine (SVM). Feature selection prior to classification plays a vital role and a feature selection technique which combines discrete wavelet transform (DWT) and moving window technique (MWT) is used. The performance of the proposed method is compared with the conventional classifiers like support vector machine, nearest neighbor, and naive Bayes. Experiments have been conducted on both real and benchmark datasets and the results indicate that the ensemble approach produces higher classification accuracy than conventional classifiers. This paper serves as an automated system for the classification of cancer and can be applied by doctors in real cases which serve as a boon to the medical community. This work further reduces the misclassification of cancers which is highly not allowed in cancer detection.
GeneXplorer: an interactive web application for microarray data visualization and analysis.
Rees, Christian A; Demeter, Janos; Matese, John C; Botstein, David; Sherlock, Gavin
2004-10-01
When publishing large-scale microarray datasets, it is of great value to create supplemental websites where either the full data, or selected subsets corresponding to figures within the paper, can be browsed. We set out to create a CGI application containing many of the features of some of the existing standalone software for the visualization of clustered microarray data. We present GeneXplorer, a web application for interactive microarray data visualization and analysis in a web environment. GeneXplorer allows users to browse a microarray dataset in an intuitive fashion. It provides simple access to microarray data over the Internet and uses only HTML and JavaScript to display graphic and annotation information. It provides radar and zoom views of the data, allows display of the nearest neighbors to a gene expression vector based on their Pearson correlations and provides the ability to search gene annotation fields. The software is released under the permissive MIT Open Source license, and the complete documentation and the entire source code are freely available for download from CPAN http://search.cpan.org/dist/Microarray-GeneXplorer/.
Gregori, Josep; Villarreal, Laura; Sánchez, Alex; Baselga, José; Villanueva, Josep
2013-12-16
The microarray community has shown that the low reproducibility observed in gene expression-based biomarker discovery studies is partially due to relying solely on p-values to get the lists of differentially expressed genes. Their conclusions recommended complementing the p-value cutoff with the use of effect-size criteria. The aim of this work was to evaluate the influence of such an effect-size filter on spectral counting-based comparative proteomic analysis. The results proved that the filter increased the number of true positives and decreased the number of false positives and the false discovery rate of the dataset. These results were confirmed by simulation experiments where the effect size filter was used to evaluate systematically variable fractions of differentially expressed proteins. Our results suggest that relaxing the p-value cut-off followed by a post-test filter based on effect size and signal level thresholds can increase the reproducibility of statistical results obtained in comparative proteomic analysis. Based on our work, we recommend using a filter consisting of a minimum absolute log2 fold change of 0.8 and a minimum signal of 2-4 SpC on the most abundant condition for the general practice of comparative proteomics. The implementation of feature filtering approaches could improve proteomic biomarker discovery initiatives by increasing the reproducibility of the results obtained among independent laboratories and MS platforms. Quality control analysis of microarray-based gene expression studies pointed out that the low reproducibility observed in the lists of differentially expressed genes could be partially attributed to the fact that these lists are generated relying solely on p-values. Our study has established that the implementation of an effect size post-test filter improves the statistical results of spectral count-based quantitative proteomics. The results proved that the filter increased the number of true positives whereas decreased the false positives and the false discovery rate of the datasets. The results presented here prove that a post-test filter applying a reasonable effect size and signal level thresholds helps to increase the reproducibility of statistical results in comparative proteomic analysis. Furthermore, the implementation of feature filtering approaches could improve proteomic biomarker discovery initiatives by increasing the reproducibility of results obtained among independent laboratories and MS platforms. This article is part of a Special Issue entitled: Standardization and Quality Control in Proteomics. Copyright © 2013 Elsevier B.V. All rights reserved.
Caryoscope: An Open Source Java application for viewing microarray data in a genomic context
Awad, Ihab AB; Rees, Christian A; Hernandez-Boussard, Tina; Ball, Catherine A; Sherlock, Gavin
2004-01-01
Background Microarray-based comparative genome hybridization experiments generate data that can be mapped onto the genome. These data are interpreted more easily when represented graphically in a genomic context. Results We have developed Caryoscope, which is an open source Java application for visualizing microarray data from array comparative genome hybridization experiments in a genomic context. Caryoscope can read General Feature Format files (GFF files), as well as comma- and tab-delimited files, that define the genomic positions of the microarray reporters for which data are obtained. The microarray data can be browsed using an interactive, zoomable interface, which helps users identify regions of chromosomal deletion or amplification. The graphical representation of the data can be exported in a number of graphic formats, including publication-quality formats such as PostScript. Conclusion Caryoscope is a useful tool that can aid in the visualization, exploration and interpretation of microarray data in a genomic context. PMID:15488149
Fuzzy support vector machine: an efficient rule-based classification technique for microarrays.
Hajiloo, Mohsen; Rabiee, Hamid R; Anooshahpour, Mahdi
2013-01-01
The abundance of gene expression microarray data has led to the development of machine learning algorithms applicable for tackling disease diagnosis, disease prognosis, and treatment selection problems. However, these algorithms often produce classifiers with weaknesses in terms of accuracy, robustness, and interpretability. This paper introduces fuzzy support vector machine which is a learning algorithm based on combination of fuzzy classifiers and kernel machines for microarray classification. Experimental results on public leukemia, prostate, and colon cancer datasets show that fuzzy support vector machine applied in combination with filter or wrapper feature selection methods develops a robust model with higher accuracy than the conventional microarray classification models such as support vector machine, artificial neural network, decision trees, k nearest neighbors, and diagonal linear discriminant analysis. Furthermore, the interpretable rule-base inferred from fuzzy support vector machine helps extracting biological knowledge from microarray data. Fuzzy support vector machine as a new classification model with high generalization power, robustness, and good interpretability seems to be a promising tool for gene expression microarray classification.
Giancarlo, R; Scaturro, D; Utro, F
2015-02-01
The prediction of the number of clusters in a dataset, in particular microarrays, is a fundamental task in biological data analysis, usually performed via validation measures. Unfortunately, it has received very little attention and in fact there is a growing need for software tools/libraries dedicated to it. Here we present ValWorkBench, a software library consisting of eleven well known validation measures, together with novel heuristic approximations for some of them. The main objective of this paper is to provide the interested researcher with the full software documentation of an open source cluster validation platform having the main features of being easily extendible in a homogeneous way and of offering software components that can be readily re-used. Consequently, the focus of the presentation is on the architecture of the library, since it provides an essential map that can be used to access the full software documentation, which is available at the supplementary material website [1]. The mentioned main features of ValWorkBench are also discussed and exemplified, with emphasis on software abstraction design and re-usability. A comparison with existing cluster validation software libraries, mainly in terms of the mentioned features, is also offered. It suggests that ValWorkBench is a much needed contribution to the microarray software development/algorithm engineering community. For completeness, it is important to mention that previous accurate algorithmic experimental analysis of the relative merits of each of the implemented measures [19,23,25], carried out specifically on microarray data, gives useful insights on the effectiveness of ValWorkBench for cluster validation to researchers in the microarray community interested in its use for the mentioned task. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Dielectrophoretic manipulation and separation of microparticles using microarray dot electrodes.
Yafouz, Bashar; Kadri, Nahrizul Adib; Ibrahim, Fatimah
2014-04-03
This paper introduces a dielectrophoretic system for the manipulation and separation of microparticles. The system is composed of five layers and utilizes microarray dot electrodes. We validated our system by conducting size-dependent manipulation and separation experiments on 1, 5 and 15 μm polystyrene particles. Our findings confirm the capability of the proposed device to rapidly and efficiently manipulate and separate microparticles of various dimensions, utilizing positive and negative dielectrophoresis (DEP) effects. Larger size particles were repelled and concentrated in the center of the dot by negative DEP, while the smaller sizes were attracted and collected by the edge of the dot by positive DEP.
Wang, Yun; Huang, Fangzhou
2018-01-01
The selection of feature genes with high recognition ability from the gene expression profiles has gained great significance in biology. However, most of the existing methods have a high time complexity and poor classification performance. Motivated by this, an effective feature selection method, called supervised locally linear embedding and Spearman's rank correlation coefficient (SLLE-SC2), is proposed which is based on the concept of locally linear embedding and correlation coefficient algorithms. Supervised locally linear embedding takes into account class label information and improves the classification performance. Furthermore, Spearman's rank correlation coefficient is used to remove the coexpression genes. The experiment results obtained on four public tumor microarray datasets illustrate that our method is valid and feasible. PMID:29666661
Xu, Jiucheng; Mu, Huiyu; Wang, Yun; Huang, Fangzhou
2018-01-01
The selection of feature genes with high recognition ability from the gene expression profiles has gained great significance in biology. However, most of the existing methods have a high time complexity and poor classification performance. Motivated by this, an effective feature selection method, called supervised locally linear embedding and Spearman's rank correlation coefficient (SLLE-SC 2 ), is proposed which is based on the concept of locally linear embedding and correlation coefficient algorithms. Supervised locally linear embedding takes into account class label information and improves the classification performance. Furthermore, Spearman's rank correlation coefficient is used to remove the coexpression genes. The experiment results obtained on four public tumor microarray datasets illustrate that our method is valid and feasible.
Draghici, Sorin; Tarca, Adi L; Yu, Longfei; Ethier, Stephen; Romero, Roberto
2008-03-01
The BioArray Software Environment (BASE) is a very popular MIAME-compliant, web-based microarray data repository. However in BASE, like in most other microarray data repositories, the experiment annotation and raw data uploading can be very timeconsuming, especially for large microarray experiments. We developed KUTE (Karmanos Universal daTabase for microarray Experiments), as a plug-in for BASE 2.0 that addresses these issues. KUTE provides an automatic experiment annotation feature and a completely redesigned data work-flow that dramatically reduce the human-computer interaction time. For instance, in BASE 2.0 a typical Affymetrix experiment involving 100 arrays required 4 h 30 min of user interaction time forexperiment annotation, and 45 min for data upload/download. In contrast, for the same experiment, KUTE required only 28 min of user interaction time for experiment annotation, and 3.3 min for data upload/download. http://vortex.cs.wayne.edu/kute/index.html.
Microarrays (DNA Chips) for the Classroom Laboratory
ERIC Educational Resources Information Center
Barnard, Betsy; Sussman, Michael; BonDurant, Sandra Splinter; Nienhuis, James; Krysan, Patrick
2006-01-01
We have developed and optimized the necessary laboratory materials to make DNA microarray technology accessible to all high school students at a fraction of both cost and data size. The primary component is a DNA chip/array that students "print" by hand and then analyze using research tools that have been adapted for classroom use. The…
Tabu search and binary particle swarm optimization for feature selection using microarray data.
Chuang, Li-Yeh; Yang, Cheng-Huei; Yang, Cheng-Hong
2009-12-01
Gene expression profiles have great potential as a medical diagnosis tool because they represent the state of a cell at the molecular level. In the classification of cancer type research, available training datasets generally have a fairly small sample size compared to the number of genes involved. This fact poses an unprecedented challenge to some classification methodologies due to training data limitations. Therefore, a good selection method for genes relevant for sample classification is needed to improve the predictive accuracy, and to avoid incomprehensibility due to the large number of genes investigated. In this article, we propose to combine tabu search (TS) and binary particle swarm optimization (BPSO) for feature selection. BPSO acts as a local optimizer each time the TS has been run for a single generation. The K-nearest neighbor method with leave-one-out cross-validation and support vector machine with one-versus-rest serve as evaluators of the TS and BPSO. The proposed method is applied and compared to the 11 classification problems taken from the literature. Experimental results show that our method simplifies features effectively and either obtains higher classification accuracy or uses fewer features compared to other feature selection methods.
Development and application of a microarray meter tool to optimize microarray experiments
Rouse, Richard JD; Field, Katrine; Lapira, Jennifer; Lee, Allen; Wick, Ivan; Eckhardt, Colleen; Bhasker, C Ramana; Soverchia, Laura; Hardiman, Gary
2008-01-01
Background Successful microarray experimentation requires a complex interplay between the slide chemistry, the printing pins, the nucleic acid probes and targets, and the hybridization milieu. Optimization of these parameters and a careful evaluation of emerging slide chemistries are a prerequisite to any large scale array fabrication effort. We have developed a 'microarray meter' tool which assesses the inherent variations associated with microarray measurement prior to embarking on large scale projects. Findings The microarray meter consists of nucleic acid targets (reference and dynamic range control) and probe components. Different plate designs containing identical probe material were formulated to accommodate different robotic and pin designs. We examined the variability in probe quality and quantity (as judged by the amount of DNA printed and remaining post-hybridization) using three robots equipped with capillary printing pins. Discussion The generation of microarray data with minimal variation requires consistent quality control of the (DNA microarray) manufacturing and experimental processes. Spot reproducibility is a measure primarily of the variations associated with printing. The microarray meter assesses array quality by measuring the DNA content for every feature. It provides a post-hybridization analysis of array quality by scoring probe performance using three metrics, a) a measure of variability in the signal intensities, b) a measure of the signal dynamic range and c) a measure of variability of the spot morphologies. PMID:18710498
A perspective on microarrays: current applications, pitfalls, and potential uses
Jaluria, Pratik; Konstantopoulos, Konstantinos; Betenbaugh, Michael; Shiloach, Joseph
2007-01-01
With advances in robotics, computational capabilities, and the fabrication of high quality glass slides coinciding with increased genomic information being available on public databases, microarray technology is increasingly being used in laboratories around the world. In fact, fields as varied as: toxicology, evolutionary biology, drug development and production, disease characterization, diagnostics development, cellular physiology and stress responses, and forensics have benefiting from its use. However, for many researchers not familiar with microarrays, current articles and reviews often address neither the fundamental principles behind the technology nor the proper designing of experiments. Although, microarray technology is relatively simple, conceptually, its practice does require careful planning and detailed understanding of the limitations inherently present. Without these considerations, it can be exceedingly difficult to ascertain valuable information from microarray data. Therefore, this text aims to outline key features in microarray technology, paying particular attention to current applications as outlined in recent publications, experimental design, statistical methods, and potential uses. Furthermore, this review is not meant to be comprehensive, but rather substantive; highlighting important concepts and detailing steps necessary to conduct and interpret microarray experiments. Collectively, the information included in this text will highlight the versatility of microarray technology and provide a glimpse of what the future may hold. PMID:17254338
"Print-n-Shrink" technology for the rapid production of microfluidic chips and protein microarrays.
Sollier, Kevin; Mandon, Céline A; Heyries, Kevin A; Blum, Loïc J; Marquette, Christophe A
2009-12-21
An innovative method for the production of microfluidic chips integrating protein spots is described. The technology, called "Print-n-Shrink", is based on the screen-printing of a microfluidic design (using a dielectric ink) onto Polyshrink polystyrene sheets. The initial print which has a minimum size of 15 microm (height) x 230 microm (width) is thermally treated (30 seconds, 163 degrees C) to shrink and generate features of 85 microm (height) x 100 microm (width). Concomitantly, proteins such as monoclonal antibodies or cellular adhesion proteins are spotted onto the Polyshrink sheets and shrunk together with the microfluidic design, creating a complete biochip integrating both complex microfluidic designs and protein spots for bioanalytical applications.
Development of a DNA microarray for species identification of quarantine aphids.
Lee, Won Sun; Choi, Hwalran; Kang, Jinseok; Kim, Ji-Hoon; Lee, Si Hyeock; Lee, Seunghwan; Hwang, Seung Yong
2013-12-01
Aphid pests are being brought into Korea as a result of increased crop trading. Aphids exist on growth areas of plants, and thus plant growth is seriously affected by aphid pests. However, aphids are very small and have several sexual morphs and life stages, so it is difficult to identify species on the basis of morphological features. This problem was approached using DNA microarray technology. DNA targets of the cytochrome c oxidase subunit I gene were generated with a fluorescent dye-labelled primer and were hybridised onto a DNA microarray consisting of specific probes. After analysing the signal intensity of the specific probes, the unique patterns from the DNA microarray, consisting of 47 species-specific probes, were obtained to identify 23 aphid species. To confirm the accuracy of the developed DNA microarray, ten individual blind samples were used in blind trials, and the identifications were completely consistent with the sequencing data of all individual blind samples. A microarray has been developed to distinguish aphid species. DNA microarray technology provides a rapid, easy, cost-effective and accurate method for identifying aphid species for pest control management. © 2013 Society of Chemical Industry.
2011-01-01
Background Technological advances are progressively increasing the application of genomics to a wider array of economically and ecologically important species. High-density maps enriched for transcribed genes facilitate the discovery of connections between genes and phenotypes. We report the construction of a high-density linkage map of expressed genes for the heterozygous genome of Eucalyptus using Single Feature Polymorphism (SFP) markers. Results SFP discovery and mapping was achieved using pseudo-testcross screening and selective mapping to simultaneously optimize linkage mapping and microarray costs. SFP genotyping was carried out by hybridizing complementary RNA prepared from 4.5 year-old trees xylem to an SFP array containing 103,000 25-mer oligonucleotide probes representing 20,726 unigenes derived from a modest size expressed sequence tags collection. An SFP-mapping microarray with 43,777 selected candidate SFP probes representing 15,698 genes was subsequently designed and used to genotype SFPs in a larger subset of the segregating population drawn by selective mapping. A total of 1,845 genes were mapped, with 884 of them ordered with high likelihood support on a framework map anchored to 180 microsatellites with average density of 1.2 cM. Using more probes per unigene increased by two-fold the likelihood of detecting segregating SFPs eventually resulting in more genes mapped. In silico validation showed that 87% of the SFPs map to the expected location on the 4.5X draft sequence of the Eucalyptus grandis genome. Conclusions The Eucalyptus 1,845 gene map is the most highly enriched map for transcriptional information for any forest tree species to date. It represents a major improvement on the number of genes previously positioned on Eucalyptus maps and provides an initial glimpse at the gene space for this global tree genome. A general protocol is proposed to build high-density transcript linkage maps in less characterized plant species by SFP genotyping with a concurrent objective of reducing microarray costs. HIgh-density gene-rich maps represent a powerful resource to assist gene discovery endeavors when used in combination with QTL and association mapping and should be especially valuable to assist the assembly of reference genome sequences soon to come for several plant and animal species. PMID:21492453
Transcriptional profiling of the parr–smolt transformation in Atlantic salmon
Robertson, Laura S.; McCormick, Stephen D.
2012-01-01
The parr–smolt transformation in Atlantic salmon (Salmo salar) is a complex developmental process that culminates in the ability to migrate to and live in seawater. We used GRASP 16K cDNA microarrays to identify genes that are differentially expressed in the liver, gill, hypothalamus, pituitary, and olfactory rosettes of smolts compared to parr. Smolts had higher levels of gill Na+/K+-ATPase activity, plasma cortisol and plasma thyroid hormones relative to parr. Across all five tissues, stringent microarray analyses identified 48 features that were differentially expressed in smolts compared to parr. Using a less stringent method we found 477 features that were differentially expressed at least 1.2-fold in smolts, including 172 features in the gill. Smolts had higher mRNA levels of genes involved in transcription, protein biosynthesis and folding, electron transport, oxygen transport, and sensory perception and lower mRNA levels for genes involved in proteolysis. Quantitative RT-PCR was used to confirm differential expression in select genes identified by microarray analyses and to quantify expression of other genes known to be involved in smolting. This study expands our understanding of the molecular processes that underlie smolting in Atlantic salmon and identifies genes for further investigation.
Jiang, Wenyu; Simon, Richard
2007-12-20
This paper first provides a critical review on some existing methods for estimating the prediction error in classifying microarray data where the number of genes greatly exceeds the number of specimens. Special attention is given to the bootstrap-related methods. When the sample size n is small, we find that all the reviewed methods suffer from either substantial bias or variability. We introduce a repeated leave-one-out bootstrap (RLOOB) method that predicts for each specimen in the sample using bootstrap learning sets of size ln. We then propose an adjusted bootstrap (ABS) method that fits a learning curve to the RLOOB estimates calculated with different bootstrap learning set sizes. The ABS method is robust across the situations we investigate and provides a slightly conservative estimate for the prediction error. Even with small samples, it does not suffer from large upward bias as the leave-one-out bootstrap and the 0.632+ bootstrap, and it does not suffer from large variability as the leave-one-out cross-validation in microarray applications. Copyright (c) 2007 John Wiley & Sons, Ltd.
Microintaglio Printing for Soft Lithography-Based in Situ Microarrays
Biyani, Manish; Ichiki, Takanori
2015-01-01
Advances in lithographic approaches to fabricating bio-microarrays have been extensively explored over the last two decades. However, the need for pattern flexibility, a high density, a high resolution, affordability and on-demand fabrication is promoting the development of unconventional routes for microarray fabrication. This review highlights the development and uses of a new molecular lithography approach, called “microintaglio printing technology”, for large-scale bio-microarray fabrication using a microreactor array (µRA)-based chip consisting of uniformly-arranged, femtoliter-size µRA molds. In this method, a single-molecule-amplified DNA microarray pattern is self-assembled onto a µRA mold and subsequently converted into a messenger RNA or protein microarray pattern by simultaneously producing and transferring (immobilizing) a messenger RNA or a protein from a µRA mold to a glass surface. Microintaglio printing allows the self-assembly and patterning of in situ-synthesized biomolecules into high-density (kilo-giga-density), ordered arrays on a chip surface with µm-order precision. This holistic aim, which is difficult to achieve using conventional printing and microarray approaches, is expected to revolutionize and reshape proteomics. This review is not written comprehensively, but rather substantively, highlighting the versatility of microintaglio printing for developing a prerequisite platform for microarray technology for the postgenomic era. PMID:27600226
On-Chip, Amplification-Free Quantification of Nucleic Acid for Point-of-Care Diagnosis
NASA Astrophysics Data System (ADS)
Yen, Tony Minghung
This dissertation demonstrates three physical device concepts to overcome limitations in point-of-care quantification of nucleic acids. Enabling sensitive, high throughput nucleic acid quantification on a chip, outside of hospital and centralized laboratory setting, is crucial for improving pathogen detection and cancer diagnosis and prognosis. Among existing platforms, microarray have the advantages of being amplification free, low instrument cost, and high throughput, but are generally less sensitive compared to sequencing and PCR assays. To bridge this performance gap, this dissertation presents theoretical and experimental progress to develop a platform nucleic acid quantification technology that is drastically more sensitive than current microarrays while compatible with microarray architecture. The first device concept explores on-chip nucleic acid enrichment by natural evaporation of nucleic acid solution droplet. Using a micro-patterned super-hydrophobic black silicon array device, evaporative enrichment is coupled with nano-liter droplet self-assembly workflow to produce a 50 aM concentration sensitivity, 6 orders of dynamic range, and rapid hybridization time at under 5 minutes. The second device concept focuses on improving target copy number sensitivity, instead of concentration sensitivity. A comprehensive microarray physical model taking into account of molecular transport, electrostatic intermolecular interactions, and reaction kinetics is considered to guide device optimization. Device pattern size and target copy number are optimized based on model prediction to achieve maximal hybridization efficiency. At a 100-mum pattern size, a quantum leap in detection limit of 570 copies is achieved using black silicon array device with self-assembled pico-liter droplet workflow. Despite its merits, evaporative enrichment on black silicon device suffers from coffee-ring effect at 100-mum pattern size, and thus not compatible with clinical patient samples. The third device concept utilizes an integrated optomechanical laser system and a Cytop microarray device to reverse coffee-ring effect during evaporative enrichment at 100-mum pattern size. This method, named "laser-induced differential evaporation" is expected to enable 570 copies detection limit for clinical samples in near future. While the work is ongoing as of the writing of this dissertation, a clear research plan is in place to implement this method on microarray platform toward clinical sample testing for disease applications and future commercialization.
Design of 240,000 orthogonal 25mer DNA barcode probes.
Xu, Qikai; Schlabach, Michael R; Hannon, Gregory J; Elledge, Stephen J
2009-02-17
DNA barcodes linked to genetic features greatly facilitate screening these features in pooled formats using microarray hybridization, and new tools are needed to design large sets of barcodes to allow construction of large barcoded mammalian libraries such as shRNA libraries. Here we report a framework for designing large sets of orthogonal barcode probes. We demonstrate the utility of this framework by designing 240,000 barcode probes and testing their performance by hybridization. From the test hybridizations, we also discovered new probe design rules that significantly reduce cross-hybridization after their introduction into the framework of the algorithm. These rules should improve the performance of DNA microarray probe designs for many applications.
Design of 240,000 orthogonal 25mer DNA barcode probes
Xu, Qikai; Schlabach, Michael R.; Hannon, Gregory J.; Elledge, Stephen J.
2009-01-01
DNA barcodes linked to genetic features greatly facilitate screening these features in pooled formats using microarray hybridization, and new tools are needed to design large sets of barcodes to allow construction of large barcoded mammalian libraries such as shRNA libraries. Here we report a framework for designing large sets of orthogonal barcode probes. We demonstrate the utility of this framework by designing 240,000 barcode probes and testing their performance by hybridization. From the test hybridizations, we also discovered new probe design rules that significantly reduce cross-hybridization after their introduction into the framework of the algorithm. These rules should improve the performance of DNA microarray probe designs for many applications. PMID:19171886
PAX3 gene deletion detected by microarray analysis in a girl with hearing loss.
Drozniewska, Malgorzata; Haus, Olga
2014-01-01
Deletions of the PAX3 gene have been rarely reported in the literature. Mutations of this gene are a common cause of Waardenburg syndrome type 1 and 3. We report a 16 year old female presenting hearing loss and normal intellectual development, without major features of Waardenburg syndrome type 1, and without family history of the syndrome. Her phenotype, however, overlaps with features of craniofacial-deafness-hand syndrome. Microarray analysis showed ~862 kb de novo deletion at 2q36.1 including PAX3. The above findings suggest that the rearrangement found in our patient appeared de novo and with high probability is a cause of her phenotype.
A novel approach for dimension reduction of microarray.
Aziz, Rabia; Verma, C K; Srivastava, Namita
2017-12-01
This paper proposes a new hybrid search technique for feature (gene) selection (FS) using Independent component analysis (ICA) and Artificial Bee Colony (ABC) called ICA+ABC, to select informative genes based on a Naïve Bayes (NB) algorithm. An important trait of this technique is the optimization of ICA feature vector using ABC. ICA+ABC is a hybrid search algorithm that combines the benefits of extraction approach, to reduce the size of data and wrapper approach, to optimize the reduced feature vectors. This hybrid search technique is facilitated by evaluating the performance of ICA+ABC on six standard gene expression datasets of classification. Extensive experiments were conducted to compare the performance of ICA+ABC with the results obtained from recently published Minimum Redundancy Maximum Relevance (mRMR) +ABC algorithm for NB classifier. Also to check the performance that how ICA+ABC works as feature selection with NB classifier, compared the combination of ICA with popular filter techniques and with other similar bio inspired algorithm such as Genetic Algorithm (GA) and Particle Swarm Optimization (PSO). The result shows that ICA+ABC has a significant ability to generate small subsets of genes from the ICA feature vector, that significantly improve the classification accuracy of NB classifier compared to other previously suggested methods. Copyright © 2017 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Wisesty, Untari N.; Warastri, Riris S.; Puspitasari, Shinta Y.
2018-03-01
Cancer is one of the major causes of mordibility and mortality problems in the worldwide. Therefore, the need of a system that can analyze and identify a person suffering from a cancer by using microarray data derived from the patient’s Deoxyribonucleic Acid (DNA). But on microarray data has thousands of attributes, thus making the challenges in data processing. This is often referred to as the curse of dimensionality. Therefore, in this study built a system capable of detecting a patient whether contracted cancer or not. The algorithm used is Genetic Algorithm as feature selection and Momentum Backpropagation Neural Network as a classification method, with data used from the Kent Ridge Bio-medical Dataset. Based on system testing that has been done, the system can detect Leukemia and Colon Tumor with best accuracy equal to 98.33% for colon tumor data and 100% for leukimia data. Genetic Algorithm as feature selection algorithm can improve system accuracy, which is from 64.52% to 98.33% for colon tumor data and 65.28% to 100% for leukemia data, and the use of momentum parameters can accelerate the convergence of the system in the training process of Neural Network.
DNA microarrays: a powerful genomic tool for biomedical and clinical research
Trevino, Victor; Falciani, Francesco; Barrera-Saldaña, Hugo A.
2007-01-01
Among the many benefits of the Human Genome Project are new and powerful tools such as the genome-wide hybridization devices referred as microarrays. Initially designed to measure gene transcriptional levels, microarray technologies are now used for comparing other genome features among individuals and their tissues and cells. Results provide valuable information on disease subcategories, disease prognosis, and treatment outcome. Likewise, reveal differences in genetic makeup, regulatory mechanisms and subtle variations are approaching the era of personalized medicine. To understand this powerful tool, its versatility and how it is dramatically changing the molecular approach to biomedical and clinical research, this review describes the technology, its applications, a didactic step-by-step review of a typical microarray protocol, and a real experiment. Finally, it calls the attention of the medical community to integrate multidisciplinary teams, to take advantage of this technology and its expanding applications that in a slide reveals our genetic inheritance and destiny. PMID:17660860
SVS: data and knowledge integration in computational biology.
Zycinski, Grzegorz; Barla, Annalisa; Verri, Alessandro
2011-01-01
In this paper we present a framework for structured variable selection (SVS). The main concept of the proposed schema is to take a step towards the integration of two different aspects of data mining: database and machine learning perspective. The framework is flexible enough to use not only microarray data, but other high-throughput data of choice (e.g. from mass spectrometry, microarray, next generation sequencing). Moreover, the feature selection phase incorporates prior biological knowledge in a modular way from various repositories and is ready to host different statistical learning techniques. We present a proof of concept of SVS, illustrating some implementation details and describing current results on high-throughput microarray data.
Zhou, Ming; Kort, Eric; Hoekstra, Philip; Westphal, Michael; Magi-Galluzzi, Cristina; Sercia, Linda; Lane, Brian; Rini, Brian; Bukowski, Ronald; Teh, Bin T
2009-01-01
Adult cystic nephroma (CN) and mixed epithelial and stromal tumor of the kidney (MEST) are considered as separate entities in the 2004 World Health Organization classification of renal neoplasms. Recent studies suggested that the two share clinicopathologic features and may represent the same disease process of varying morphology. However, definitive genetic evidence is lacking. We examined their relationship using gene expression profiling and histologic analysis. Gene expression profiles of 3 CN and 3 MEST were analyzed using HGU133 Plus 2.0 microarrays (Affymetrix) and were compared with each other and also with 48 other renal tumors and 13 normal kidneys. Histologic examination of 26 CN and 13 MEST focused on the cystic septal thickness, cyst-to-stroma ratio, stromal cellularity and composition, types of epithelial cells lining cysts and glands, and estrogen and progesterone receptors expression. Patients' age, sex distribution, and tumor size were similar between the two. They also shared many histologic features, including lining epithelium of cysts and glands, stromal cellularity and composition. Unsupervised clustering of mRNA expression profiles demonstrated that they had very similar expression profiles that were distinct from other renal tumors. By microarray analysis, progesterone receptor expression was significantly higher in CN and MEST relative to both normal and other renal tumors, while estrogen receptor expression was not. By immunohistochemistry, expression of both receptors was similar between CN and MEST. This study provides the most convincing molecular evidence that CN and MEST represent different parts of the morphologic spectrum of the same disease.
Tao, Qi-Fei; Yuan, Sheng-Xian; Yang, Fu; Yang, Sen; Yang, Yuan; Yuan, Ji-Hang; Wang, Zhen-Guang; Xu, Qing-Guo; Lin, Kong-Ying; Cai, Jie; Yu, Jian; Huang, Wei-Long; Teng, Xiao-Lei; Zhou, Chuan-Chuan; Wang, Fang; Sun, Shu-Han; Zhou, Wei-Ping
2015-09-17
Downregulation of Aldolase B (ALDOB) has been reported in hepatocellular carcinoma. However, its clinical significance and its role in pathogenesis of HCC remain largely unknown. We analyzed the expression of ALDOB and its clinical features in a large cohort of 313 HCC patients using tissue microarray and immunohistochemistry. Moreover, the function of stably overexpressed ALDOB in HCC cells was explored in vitro and in vivo. Gene expression microarray analysis was performed on ALDOB-overexpressing SMMC7721 cells to elucidate its mechanism of action. ALDOB downregulation in HCC was significantly correlated with aggressive characteristics including absence of encapsulation, increased tumor size (>5 cm) and early recurrence. ALDOB downregulation was indicative of a shorter recurrence-free survival (RFS) and overall survival (OS) for all HCC patients and early-stage HCC patients (BCLC 0-A and TNM I stage patients). Multiple analyses revealed that ALDOB downregulation was an independent risk factor of RFS and OS. Stable expression of ALDOB in HCC cell lines reduced cell migration in vitro and inhibited lung metastasis, intrahepatic metastasis, and reduced circulating tumor cells in vivo. Mechanistically, we found that cells stably expressing ALDOB show elevated Ten-Eleven Translocation 1 (TET1) expression. Moreover, ALDOB expressing cells have higher levels of methylglyoxal than do control cells, which can upregulate TET1 expression. The downregulation of ALDOB could indicate a poor prognosis for HCC patients, and therefore, ALDOB might be considered a prognostic biomarker for HCC, especially at the early stage. In addition, ALDOB inhibits the invasive features of cell lines partly through TET1 expression.
Steger, Doris; Berry, David; Haider, Susanne; Horn, Matthias; Wagner, Michael; Stocker, Roman; Loy, Alexander
2011-01-01
The hybridization of nucleic acid targets with surface-immobilized probes is a widely used assay for the parallel detection of multiple targets in medical and biological research. Despite its widespread application, DNA microarray technology still suffers from several biases and lack of reproducibility, stemming in part from an incomplete understanding of the processes governing surface hybridization. In particular, non-random spatial variations within individual microarray hybridizations are often observed, but the mechanisms underpinning this positional bias remain incompletely explained. This study identifies and rationalizes a systematic spatial bias in the intensity of surface hybridization, characterized by markedly increased signal intensity of spots located at the boundaries of the spotted areas of the microarray slide. Combining observations from a simplified single-probe block array format with predictions from a mathematical model, the mechanism responsible for this bias is found to be a position-dependent variation in lateral diffusion of target molecules. Numerical simulations reveal a strong influence of microarray well geometry on the spatial bias. Reciprocal adjustment of the size of the microarray hybridization chamber to the area of surface-bound probes is a simple and effective measure to minimize or eliminate the diffusion-based bias, resulting in increased uniformity and accuracy of quantitative DNA microarray hybridization.
Haider, Susanne; Horn, Matthias; Wagner, Michael; Stocker, Roman; Loy, Alexander
2011-01-01
Background The hybridization of nucleic acid targets with surface-immobilized probes is a widely used assay for the parallel detection of multiple targets in medical and biological research. Despite its widespread application, DNA microarray technology still suffers from several biases and lack of reproducibility, stemming in part from an incomplete understanding of the processes governing surface hybridization. In particular, non-random spatial variations within individual microarray hybridizations are often observed, but the mechanisms underpinning this positional bias remain incompletely explained. Methodology/Principal Findings This study identifies and rationalizes a systematic spatial bias in the intensity of surface hybridization, characterized by markedly increased signal intensity of spots located at the boundaries of the spotted areas of the microarray slide. Combining observations from a simplified single-probe block array format with predictions from a mathematical model, the mechanism responsible for this bias is found to be a position-dependent variation in lateral diffusion of target molecules. Numerical simulations reveal a strong influence of microarray well geometry on the spatial bias. Conclusions Reciprocal adjustment of the size of the microarray hybridization chamber to the area of surface-bound probes is a simple and effective measure to minimize or eliminate the diffusion-based bias, resulting in increased uniformity and accuracy of quantitative DNA microarray hybridization. PMID:21858215
2012-01-01
Background Because of the large volume of data and the intrinsic variation of data intensity observed in microarray experiments, different statistical methods have been used to systematically extract biological information and to quantify the associated uncertainty. The simplest method to identify differentially expressed genes is to evaluate the ratio of average intensities in two different conditions and consider all genes that differ by more than an arbitrary cut-off value to be differentially expressed. This filtering approach is not a statistical test and there is no associated value that can indicate the level of confidence in the designation of genes as differentially expressed or not differentially expressed. At the same time the fold change by itself provide valuable information and it is important to find unambiguous ways of using this information in expression data treatment. Results A new method of finding differentially expressed genes, called distributional fold change (DFC) test is introduced. The method is based on an analysis of the intensity distribution of all microarray probe sets mapped to a three dimensional feature space composed of average expression level, average difference of gene expression and total variance. The proposed method allows one to rank each feature based on the signal-to-noise ratio and to ascertain for each feature the confidence level and power for being differentially expressed. The performance of the new method was evaluated using the total and partial area under receiver operating curves and tested on 11 data sets from Gene Omnibus Database with independently verified differentially expressed genes and compared with the t-test and shrinkage t-test. Overall the DFC test performed the best – on average it had higher sensitivity and partial AUC and its elevation was most prominent in the low range of differentially expressed features, typical for formalin-fixed paraffin-embedded sample sets. Conclusions The distributional fold change test is an effective method for finding and ranking differentially expressed probesets on microarrays. The application of this test is advantageous to data sets using formalin-fixed paraffin-embedded samples or other systems where degradation effects diminish the applicability of correlation adjusted methods to the whole feature set. PMID:23122055
Improved microarray methods for profiling the yeast knockout strain collection
Yuan, Daniel S.; Pan, Xuewen; Ooi, Siew Loon; Peyser, Brian D.; Spencer, Forrest A.; Irizarry, Rafael A.; Boeke, Jef D.
2005-01-01
A remarkable feature of the Yeast Knockout strain collection is the presence of two unique 20mer TAG sequences in almost every strain. In principle, the relative abundances of strains in a complex mixture can be profiled swiftly and quantitatively by amplifying these sequences and hybridizing them to microarrays, but TAG microarrays have not been widely used. Here, we introduce a TAG microarray design with sophisticated controls and describe a robust method for hybridizing high concentrations of dye-labeled TAGs in single-stranded form. We also highlight the importance of avoiding PCR contamination and provide procedures for detection and eradication. Validation experiments using these methods yielded false positive (FP) and false negative (FN) rates for individual TAG detection of 3–6% and 15–18%, respectively. Analysis demonstrated that cross-hybridization was the chief source of FPs, while TAG amplification defects were the main cause of FNs. The materials, protocols, data and associated software described here comprise a suite of experimental resources that should facilitate the use of TAG microarrays for a wide variety of genetic screens. PMID:15994458
Hybrid genetic algorithm-neural network: feature extraction for unpreprocessed microarray data.
Tong, Dong Ling; Schierz, Amanda C
2011-09-01
Suitable techniques for microarray analysis have been widely researched, particularly for the study of marker genes expressed to a specific type of cancer. Most of the machine learning methods that have been applied to significant gene selection focus on the classification ability rather than the selection ability of the method. These methods also require the microarray data to be preprocessed before analysis takes place. The objective of this study is to develop a hybrid genetic algorithm-neural network (GANN) model that emphasises feature selection and can operate on unpreprocessed microarray data. The GANN is a hybrid model where the fitness value of the genetic algorithm (GA) is based upon the number of samples correctly labelled by a standard feedforward artificial neural network (ANN). The model is evaluated by using two benchmark microarray datasets with different array platforms and differing number of classes (a 2-class oligonucleotide microarray data for acute leukaemia and a 4-class complementary DNA (cDNA) microarray dataset for SRBCTs (small round blue cell tumours)). The underlying concept of the GANN algorithm is to select highly informative genes by co-evolving both the GA fitness function and the ANN weights at the same time. The novel GANN selected approximately 50% of the same genes as the original studies. This may indicate that these common genes are more biologically significant than other genes in the datasets. The remaining 50% of the significant genes identified were used to build predictive models and for both datasets, the models based on the set of genes extracted by the GANN method produced more accurate results. The results also suggest that the GANN method not only can detect genes that are exclusively associated with a single cancer type but can also explore the genes that are differentially expressed in multiple cancer types. The results show that the GANN model has successfully extracted statistically significant genes from the unpreprocessed microarray data as well as extracting known biologically significant genes. We also show that assessing the biological significance of genes based on classification accuracy may be misleading and though the GANN's set of extra genes prove to be more statistically significant than those selected by other methods, a biological assessment of these genes is highly recommended to confirm their functionality. Copyright © 2011 Elsevier B.V. All rights reserved.
Nie, Zhi; Yang, Tao; Liu, Yashu; Li, Qingyang; Narayan, Vaibhav A; Wittenberg, Gayle; Ye, Jieping
2015-01-01
Recent studies have revealed that melancholic depression, one major subtype of depression, is closely associated with the concentration of some metabolites and biological functions of certain genes and pathways. Meanwhile, recent advances in biotechnologies have allowed us to collect a large amount of genomic data, e.g., metabolites and microarray gene expression. With such a huge amount of information available, one approach that can give us new insights into the understanding of the fundamental biology underlying melancholic depression is to build disease status prediction models using classification or regression methods. However, the existence of strong empirical correlations, e.g., those exhibited by genes sharing the same biological pathway in microarray profiles, tremendously limits the performance of these methods. Furthermore, the occurrence of missing values which are ubiquitous in biomedical applications further complicates the problem. In this paper, we hypothesize that the problem of missing values might in some way benefit from the correlation between the variables and propose a method to learn a compressed set of representative features through an adapted version of sparse coding which is capable of identifying correlated variables and addressing the issue of missing values simultaneously. An efficient algorithm is also developed to solve the proposed formulation. We apply the proposed method on metabolic and microarray profiles collected from a group of subjects consisting of both patients with melancholic depression and healthy controls. Results show that the proposed method can not only produce meaningful clusters of variables but also generate a set of representative features that achieve superior classification performance over those generated by traditional clustering and data imputation techniques. In particular, on both datasets, we found that in comparison with the competing algorithms, the representative features learned by the proposed method give rise to significantly improved sensitivity scores, suggesting that the learned features allow prediction with high accuracy of disease status in those who are diagnosed with melancholic depression. To our best knowledge, this is the first work that applies sparse coding to deal with high feature correlations and missing values, which are common challenges in many biomedical applications. The proposed method can be readily adapted to other biomedical applications involving incomplete and high-dimensional data.
Comparison of RNA-seq and microarray-based models for clinical endpoint prediction.
Zhang, Wenqian; Yu, Ying; Hertwig, Falk; Thierry-Mieg, Jean; Zhang, Wenwei; Thierry-Mieg, Danielle; Wang, Jian; Furlanello, Cesare; Devanarayan, Viswanath; Cheng, Jie; Deng, Youping; Hero, Barbara; Hong, Huixiao; Jia, Meiwen; Li, Li; Lin, Simon M; Nikolsky, Yuri; Oberthuer, André; Qing, Tao; Su, Zhenqiang; Volland, Ruth; Wang, Charles; Wang, May D; Ai, Junmei; Albanese, Davide; Asgharzadeh, Shahab; Avigad, Smadar; Bao, Wenjun; Bessarabova, Marina; Brilliant, Murray H; Brors, Benedikt; Chierici, Marco; Chu, Tzu-Ming; Zhang, Jibin; Grundy, Richard G; He, Min Max; Hebbring, Scott; Kaufman, Howard L; Lababidi, Samir; Lancashire, Lee J; Li, Yan; Lu, Xin X; Luo, Heng; Ma, Xiwen; Ning, Baitang; Noguera, Rosa; Peifer, Martin; Phan, John H; Roels, Frederik; Rosswog, Carolina; Shao, Susan; Shen, Jie; Theissen, Jessica; Tonini, Gian Paolo; Vandesompele, Jo; Wu, Po-Yen; Xiao, Wenzhong; Xu, Joshua; Xu, Weihong; Xuan, Jiekun; Yang, Yong; Ye, Zhan; Dong, Zirui; Zhang, Ke K; Yin, Ye; Zhao, Chen; Zheng, Yuanting; Wolfinger, Russell D; Shi, Tieliu; Malkas, Linda H; Berthold, Frank; Wang, Jun; Tong, Weida; Shi, Leming; Peng, Zhiyu; Fischer, Matthias
2015-06-25
Gene expression profiling is being widely applied in cancer research to identify biomarkers for clinical endpoint prediction. Since RNA-seq provides a powerful tool for transcriptome-based applications beyond the limitations of microarrays, we sought to systematically evaluate the performance of RNA-seq-based and microarray-based classifiers in this MAQC-III/SEQC study for clinical endpoint prediction using neuroblastoma as a model. We generate gene expression profiles from 498 primary neuroblastomas using both RNA-seq and 44 k microarrays. Characterization of the neuroblastoma transcriptome by RNA-seq reveals that more than 48,000 genes and 200,000 transcripts are being expressed in this malignancy. We also find that RNA-seq provides much more detailed information on specific transcript expression patterns in clinico-genetic neuroblastoma subgroups than microarrays. To systematically compare the power of RNA-seq and microarray-based models in predicting clinical endpoints, we divide the cohort randomly into training and validation sets and develop 360 predictive models on six clinical endpoints of varying predictability. Evaluation of factors potentially affecting model performances reveals that prediction accuracies are most strongly influenced by the nature of the clinical endpoint, whereas technological platforms (RNA-seq vs. microarrays), RNA-seq data analysis pipelines, and feature levels (gene vs. transcript vs. exon-junction level) do not significantly affect performances of the models. We demonstrate that RNA-seq outperforms microarrays in determining the transcriptomic characteristics of cancer, while RNA-seq and microarray-based models perform similarly in clinical endpoint prediction. Our findings may be valuable to guide future studies on the development of gene expression-based predictive models and their implementation in clinical practice.
Severgnini, Marco; Bicciato, Silvio; Mangano, Eleonora; Scarlatti, Francesca; Mezzelani, Alessandra; Mattioli, Michela; Ghidoni, Riccardo; Peano, Clelia; Bonnal, Raoul; Viti, Federica; Milanesi, Luciano; De Bellis, Gianluca; Battaglia, Cristina
2006-06-01
Meta-analysis of microarray data is increasingly important, considering both the availability of multiple platforms using disparate technologies and the accumulation in public repositories of data sets from different laboratories. We addressed the issue of comparing gene expression profiles from two microarray platforms by devising a standardized investigative strategy. We tested this procedure by studying MDA-MB-231 cells, which undergo apoptosis on treatment with resveratrol. Gene expression profiles were obtained using high-density, short-oligonucleotide, single-color microarray platforms: GeneChip (Affymetrix) and CodeLink (Amersham). Interplatform analyses were carried out on 8414 common transcripts represented on both platforms, as identified by LocusLink ID, representing 70.8% and 88.6% of annotated GeneChip and CodeLink features, respectively. We identified 105 differentially expressed genes (DEGs) on CodeLink and 42 DEGs on GeneChip. Among them, only 9 DEGs were commonly identified by both platforms. Multiple analyses (BLAST alignment of probes with target sequences, gene ontology, literature mining, and quantitative real-time PCR) permitted us to investigate the factors contributing to the generation of platform-dependent results in single-color microarray experiments. An effective approach to cross-platform comparison involves microarrays of similar technologies, samples prepared by identical methods, and a standardized battery of bioinformatic and statistical analyses.
Identification of significant features by the Global Mean Rank test.
Klammer, Martin; Dybowski, J Nikolaj; Hoffmann, Daniel; Schaab, Christoph
2014-01-01
With the introduction of omics-technologies such as transcriptomics and proteomics, numerous methods for the reliable identification of significantly regulated features (genes, proteins, etc.) have been developed. Experimental practice requires these tests to successfully deal with conditions such as small numbers of replicates, missing values, non-normally distributed expression levels, and non-identical distributions of features. With the MeanRank test we aimed at developing a test that performs robustly under these conditions, while favorably scaling with the number of replicates. The test proposed here is a global one-sample location test, which is based on the mean ranks across replicates, and internally estimates and controls the false discovery rate. Furthermore, missing data is accounted for without the need of imputation. In extensive simulations comparing MeanRank to other frequently used methods, we found that it performs well with small and large numbers of replicates, feature dependent variance between replicates, and variable regulation across features on simulation data and a recent two-color microarray spike-in dataset. The tests were then used to identify significant changes in the phosphoproteomes of cancer cells induced by the kinase inhibitors erlotinib and 3-MB-PP1 in two independently published mass spectrometry-based studies. MeanRank outperformed the other global rank-based methods applied in this study. Compared to the popular Significance Analysis of Microarrays and Linear Models for Microarray methods, MeanRank performed similar or better. Furthermore, MeanRank exhibits more consistent behavior regarding the degree of regulation and is robust against the choice of preprocessing methods. MeanRank does not require any imputation of missing values, is easy to understand, and yields results that are easy to interpret. The software implementing the algorithm is freely available for academic and commercial use.
The analysis of image feature robustness using cometcloud
Qi, Xin; Kim, Hyunjoo; Xing, Fuyong; Parashar, Manish; Foran, David J.; Yang, Lin
2012-01-01
The robustness of image features is a very important consideration in quantitative image analysis. The objective of this paper is to investigate the robustness of a range of image texture features using hematoxylin stained breast tissue microarray slides which are assessed while simulating different imaging challenges including out of focus, changes in magnification and variations in illumination, noise, compression, distortion, and rotation. We employed five texture analysis methods and tested them while introducing all of the challenges listed above. The texture features that were evaluated include co-occurrence matrix, center-symmetric auto-correlation, texture feature coding method, local binary pattern, and texton. Due to the independence of each transformation and texture descriptor, a network structured combination was proposed and deployed on the Rutgers private cloud. The experiments utilized 20 randomly selected tissue microarray cores. All the combinations of the image transformations and deformations are calculated, and the whole feature extraction procedure was completed in 70 minutes using a cloud equipped with 20 nodes. Center-symmetric auto-correlation outperforms all the other four texture descriptors but also requires the longest computational time. It is roughly 10 times slower than local binary pattern and texton. From a speed perspective, both the local binary pattern and texton features provided excellent performance for classification and content-based image retrieval. PMID:23248759
High-Density Droplet Microarray of Individually Addressable Electrochemical Cells.
Zhang, Huijie; Oellers, Tobias; Feng, Wenqian; Abdulazim, Tarik; Saw, En Ning; Ludwig, Alfred; Levkin, Pavel A; Plumeré, Nicolas
2017-06-06
Microarray technology has shown great potential for various types of high-throughput screening applications. The main read-out methods of most microarray platforms, however, are based on optical techniques, limiting the scope of potential applications of such powerful screening technology. Electrochemical methods possess numerous complementary advantages over optical detection methods, including its label-free nature, capability of quantitative monitoring of various reporter molecules, and the ability to not only detect but also address compositions of individual compartments. However, application of electrochemical methods for the purpose of high-throughput screening remains very limited. In this work, we develop a high-density individually addressable electrochemical droplet microarray (eDMA). The eDMA allows for the detection of redox-active reporter molecules irrespective of their electrochemical reversibility in individual nanoliter-sized droplets. Orthogonal band microelectrodes are arranged to form at their intersections an array of three-electrode systems for precise control of the applied potential, which enables direct read-out of the current related to analyte detection. The band microelectrode array is covered with a layer of permeable porous polymethacrylate functionalized with a highly hydrophobic-hydrophilic pattern, forming spatially separated nanoliter-sized droplets on top of each electrochemical cell. Electrochemical characterization of single droplets demonstrates that the underlying electrode system is accessible to redox-active molecules through the hydrophilic polymeric pattern and that the nonwettable hydrophobic boundaries can spatially separate neighboring cells effectively. The eDMA technology opens the possibility to combine the high-throughput biochemical or living cell screenings using the droplet microarray platform with the sequential electrochemical read-out of individual droplets.
Yu, Hualong; Hong, Shufang; Yang, Xibei; Ni, Jun; Dan, Yuanyuan; Qin, Bin
2013-01-01
DNA microarray technology can measure the activities of tens of thousands of genes simultaneously, which provides an efficient way to diagnose cancer at the molecular level. Although this strategy has attracted significant research attention, most studies neglect an important problem, namely, that most DNA microarray datasets are skewed, which causes traditional learning algorithms to produce inaccurate results. Some studies have considered this problem, yet they merely focus on binary-class problem. In this paper, we dealt with multiclass imbalanced classification problem, as encountered in cancer DNA microarray, by using ensemble learning. We utilized one-against-all coding strategy to transform multiclass to multiple binary classes, each of them carrying out feature subspace, which is an evolving version of random subspace that generates multiple diverse training subsets. Next, we introduced one of two different correction technologies, namely, decision threshold adjustment or random undersampling, into each training subset to alleviate the damage of class imbalance. Specifically, support vector machine was used as base classifier, and a novel voting rule called counter voting was presented for making a final decision. Experimental results on eight skewed multiclass cancer microarray datasets indicate that unlike many traditional classification approaches, our methods are insensitive to class imbalance.
Automatic Identification and Quantification of Extra-Well Fluorescence in Microarray Images.
Rivera, Robert; Wang, Jie; Yu, Xiaobo; Demirkan, Gokhan; Hopper, Marika; Bian, Xiaofang; Tahsin, Tasnia; Magee, D Mitchell; Qiu, Ji; LaBaer, Joshua; Wallstrom, Garrick
2017-11-03
In recent studies involving NAPPA microarrays, extra-well fluorescence is used as a key measure for identifying disease biomarkers because there is evidence to support that it is better correlated with strong antibody responses than statistical analysis involving intraspot intensity. Because this feature is not well quantified by traditional image analysis software, identification and quantification of extra-well fluorescence is performed manually, which is both time-consuming and highly susceptible to variation between raters. A system that could automate this task efficiently and effectively would greatly improve the process of data acquisition in microarray studies, thereby accelerating the discovery of disease biomarkers. In this study, we experimented with different machine learning methods, as well as novel heuristics, for identifying spots exhibiting extra-well fluorescence (rings) in microarray images and assigning each ring a grade of 1-5 based on its intensity and morphology. The sensitivity of our final system for identifying rings was found to be 72% at 99% specificity and 98% at 92% specificity. Our system performs this task significantly faster than a human, while maintaining high performance, and therefore represents a valuable tool for microarray image analysis.
Battaglia, Agatino; Doccini, Viola; Bernardini, Laura; Novelli, Antonio; Loddo, Sara; Capalbo, Anna; Filippi, Tiziana; Carey, John C
2013-11-01
Submicroscopic chromosomal rearrangements are the most common identifiable causes of intellectual disability and autism spectrum disorders associated with dysmorphic features. Chromosomal microarray (CMA) can detect copy number variants <1 Mb and identifies size and presence of known genes. The aim of this study was to demonstrate the usefulness of CMA, as a first-tier tool in detecting the etiology of unexplained intellectual disability/autism spectrum disorders (ID/ASDs) associated with dysmorphic features in a large cohort of pediatric patients. We studied 349 individuals; 223 males, 126 females, aged 5 months-19 years. Blood samples were analyzed with CMA at a resolution ranging from 1 Mb to 40 Kb. The imbalance was confirmed by FISH or qPCR. We considered copy number variants (CNVs) causative if the variant was responsible for a known syndrome, encompassed gene/s of known function, occurred de novo or, if inherited, the parent was variably affected, and/or the involved gene/s had been reported in association with ID/ASDs in dedicated databases. 91 CNVs were detected in 77 (22.06%) patients: 5 (6.49%) of those presenting with borderline cognitive impairment, 54 (70.13%) with a variable degree of DD/ID, and 18/77 (23.38%) with ID of variable degree and ASDs. 16/77 (20.8%) patients had two different rearrangements. Deletions exceeded duplications (58 versus 33); 45.05% (41/91) of the detected CNVs were de novo, 45.05% (41/91) inherited, and 9.9% (9/91) unknown. The CNVs caused the phenotype in 57/77 (74%) patients; 12/57 (21.05%) had ASDs/ID, and 45/57 (78.95%) had DD/ID. Our study provides further evidence of the high diagnostic yield of CMA for genetic testing in children with unexplained ID/ASDs who had dysmorphic features. We confirm the value of CMA as the first-tier tool in the assessment of those conditions in the pediatric setting. Copyright © 2013 European Paediatric Neurology Society. Published by Elsevier Ltd. All rights reserved.
BRIC-17 Mapping Spaceflight-Induced Hypoxic Signaling and Response in Plants
NASA Technical Reports Server (NTRS)
Gilroy, Simon; Choi, Won-Gyu; Swanson, Sarah
2012-01-01
Goals of this work are: (1) Define global changes in gene expression patterns in Arabidopsis plants grown in microgravity using whole genome microarrays (2) Compare to mutants resistant to low oxygen challenge using whole genome microarrays Also measuring root and shoot size Outcomes from this research are: (1) Provide fundamental information on plant responses to the stresses inherent in spaceflight (2) Potential for informing on genetic strategies to engineer plants for optimal growth in space
Shimizu, Kenji; Wakui, Keiko; Kosho, Tomoki; Okamoto, Nobuhiko; Mizuno, Seiji; Itomi, Kazuya; Hattori, Shigeto; Nishio, Kimio; Samura, Osamu; Kobayashi, Yoshiyuki; Kako, Yuko; Arai, Takashi; Tsutomu, Oh-ishi; Kawame, Hiroshi; Narumi, Yoko; Ohashi, Hirofumi; Fukushima, Yoshimitsu
2014-03-01
Wolf-Hirschhorn syndrome (WHS) is a contiguous gene deletion syndrome of the distal 4p chromosome, characterized by craniofacial features, growth impairment, intellectual disability, and seizures. Although genotype-phenotype correlation studies have previously been published, several important issues remain to be elucidated including seizure severity. We present detailed clinical and molecular-cytogenetic findings from a microarray and fluorescence in situ hybridization (FISH)-based genotype-phenotype analysis of 22 Japanese WHS patients, the first large non-Western series. 4p deletions were terminal in 20 patients and interstitial in two, with deletion sizes ranging from 2.06 to 29.42 Mb. The new Wolf-Hirschhorn syndrome critical region (WHSCR2) was deleted in all cases, and duplication of other chromosomal regions occurred in four. Complex mosaicism was identified in two cases: two different 4p terminal deletions; a simple 4p terminal deletion and an unbalanced translocation with the same 4p breakpoint. Seizures began in infancy in 33% (2/6) of cases with small (<6 Mb) deletions and in 86% (12/14) of cases with larger deletions (>6 Mb). Status epilepticus occurred in 17% (1/6) with small deletions and in 87% (13/15) with larger deletions. Renal hypoplasia or dysplasia and structural ocular anomalies were more prevalent in those with larger deletions. A new susceptible region for seizure occurrence is suggested between 0.76 and 1.3 Mb from 4 pter, encompassing CTBP1 and CPLX1, and distal to the previously-supposed candidate gene LETM1. The usefulness of bromide therapy for seizures and additional clinical features including hypercholesterolemia are also described. © 2013 Wiley Periodicals, Inc.
Vidyasagar, Mathukumalli
2015-01-01
This article reviews several techniques from machine learning that can be used to study the problem of identifying a small number of features, from among tens of thousands of measured features, that can accurately predict a drug response. Prediction problems are divided into two categories: sparse classification and sparse regression. In classification, the clinical parameter to be predicted is binary, whereas in regression, the parameter is a real number. Well-known methods for both classes of problems are briefly discussed. These include the SVM (support vector machine) for classification and various algorithms such as ridge regression, LASSO (least absolute shrinkage and selection operator), and EN (elastic net) for regression. In addition, several well-established methods that do not directly fall into machine learning theory are also reviewed, including neural networks, PAM (pattern analysis for microarrays), SAM (significance analysis for microarrays), GSEA (gene set enrichment analysis), and k-means clustering. Several references indicative of the application of these methods to cancer biology are discussed.
The Cross-Entropy Based Multi-Filter Ensemble Method for Gene Selection.
Sun, Yingqiang; Lu, Chengbo; Li, Xiaobo
2018-05-17
The gene expression profile has the characteristics of a high dimension, low sample, and continuous type, and it is a great challenge to use gene expression profile data for the classification of tumor samples. This paper proposes a cross-entropy based multi-filter ensemble (CEMFE) method for microarray data classification. Firstly, multiple filters are used to select the microarray data in order to obtain a plurality of the pre-selected feature subsets with a different classification ability. The top N genes with the highest rank of each subset are integrated so as to form a new data set. Secondly, the cross-entropy algorithm is used to remove the redundant data in the data set. Finally, the wrapper method, which is based on forward feature selection, is used to select the best feature subset. The experimental results show that the proposed method is more efficient than other gene selection methods and that it can achieve a higher classification accuracy under fewer characteristic genes.
Recursive feature selection with significant variables of support vectors.
Tsai, Chen-An; Huang, Chien-Hsun; Chang, Ching-Wei; Chen, Chun-Houh
2012-01-01
The development of DNA microarray makes researchers screen thousands of genes simultaneously and it also helps determine high- and low-expression level genes in normal and disease tissues. Selecting relevant genes for cancer classification is an important issue. Most of the gene selection methods use univariate ranking criteria and arbitrarily choose a threshold to choose genes. However, the parameter setting may not be compatible to the selected classification algorithms. In this paper, we propose a new gene selection method (SVM-t) based on the use of t-statistics embedded in support vector machine. We compared the performance to two similar SVM-based methods: SVM recursive feature elimination (SVMRFE) and recursive support vector machine (RSVM). The three methods were compared based on extensive simulation experiments and analyses of two published microarray datasets. In the simulation experiments, we found that the proposed method is more robust in selecting informative genes than SVMRFE and RSVM and capable to attain good classification performance when the variations of informative and noninformative genes are different. In the analysis of two microarray datasets, the proposed method yields better performance in identifying fewer genes with good prediction accuracy, compared to SVMRFE and RSVM.
Ballif, Blake C.; Theisen, Aaron; Rosenfeld, Jill A.; Traylor, Ryan N.; Gastier-Foster, Julie; Thrush, Devon Lamb; Astbury, Caroline; Bartholomew, Dennis; McBride, Kim L.; Pyatt, Robert E.; Shane, Kate; Smith, Wendy E.; Banks, Valerie; Gallentine, William B.; Brock, Pamela; Rudd, M. Katharine; Adam, Margaret P.; Keene, Julia A.; Phillips, John A.; Pfotenhauer, Jean P.; Gowans, Gordon C.; Stankiewicz, Pawel; Bejjani, Bassem A.; Shaffer, Lisa G.
2010-01-01
Segmental duplications, which comprise ∼5%–10% of the human genome, are known to mediate medically relevant deletions, duplications, and inversions through nonallelic homologous recombination (NAHR) and have been suggested to be hot spots in chromosome evolution and human genomic instability. We report seven individuals with microdeletions at 17q23.1q23.2, identified by microarray-based comparative genomic hybridization (aCGH). Six of the seven deletions are ∼2.2 Mb in size and flanked by large segmental duplications of >98% sequence identity and in the same orientation. One of the deletions is ∼2.8 Mb in size and is flanked on the distal side by a segmental duplication, whereas the proximal breakpoint falls between segmental duplications. These characteristics suggest that NAHR mediated six out of seven of these rearrangements. These individuals have common features, including mild to moderate developmental delay (particularly speech delay), microcephaly, postnatal growth retardation, heart defects, and hand, foot, and limb abnormalities. Although all individuals had at least mild dysmorphic facial features, there was no characteristic constellation of features that would elicit clinical suspicion of a specific disorder. The identification of common clinical features suggests that microdeletions at 17q23.1q23.2 constitute a novel syndrome. Furthermore, the inclusion in the minimal deletion region of TBX2 and TBX4, transcription factors belonging to a family of genes implicated in a variety of developmental pathways including those of heart and limb, suggests that these genes may play an important role in the phenotype of this emerging syndrome. PMID:20206336
MICROARRAY ANALYSIS OF PM-INDUCED GENEEXPRESSION IN HUMAN BRONCHIAL EPITHELIAL CELLS
Ambient air particles (PM) are generally classified into 3 sizes; coarse (2.5, 10m), fine (0.1, 2.5m), and ultrafine (<0.lpm). Each particle size is evolved from different sources and transformation processes (e.g., combustion vs. mechanical abrasion, and atmospheric conversion ...
Shahidi Bonjar, Mohammad Rashid; Shahidi Bonjar, Leyla
2015-01-01
The hypothesis proposed here would provide near to optimum homeostasis for patients with chronic kidney disease (CKD) without the need for hemodialysis. This strategy has not been described previously in the scientific literature. It involves a targeted therapy that may prevent progression of the disease and help to improve the well-being of CKD patients. It proposes a nanotechnological device, ie, a microarray-oriented homeostasis provider (MOHP), to improve homeostasis in CKD patients. MOHP would be an auxiliary kidney aid, and would improve the filtration functions that impaired kidneys cannot perform by their own. MOHP is composed of two main computer-oriented components, ie, a quantitative microarray detector (QMD) and a homeostasis-oriented microarray column (HOMC). QMD detects and HOMC selectively removes defined quantities of uremic wastes, toxins and any other metabolites which is programmed for. The QMD and HOMC would accomplish this with the help of a peristaltic blood pump that would circulate blood aseptically in an extracorporeal closed circuit. During the passage of blood through the QMD, this microarray detector would quantitatively monitor all of the blood compounds that accumulate in the blood of a patient with impaired glomerular filtration, including small-sized, middle-sized and large-sized molecules. The electronic information collected by QMD would be electronically transmitted to the HOMC, which would adjust the molecules to the concentrations they are electronically programmed for and/or receive from QMD. This process of monitoring and removal of waste continues until the programmed homeostasis criteria are reached. Like a conventional kidney machine, MOHP can be used in hospitals and homes under the supervision of a trained technician. The main advantages of this treatment would include improved homeostasis, a reduced likelihood of side effects and of the morbidity resulting from CKD, slower progression of kidney impairment, prevention of end-stage renal failure, a decreased need for hemodialysis therapy, avoidance of dialysis-related side effects later on in the patient's life, improved quality of life and increased life expectancy.
Spectral Analysis of Breast Cancer on Tissue Microarrays: Seeing Beyond Morphology
2005-04-01
Harvey N., Szymanski J.J., Bloch J.J., Mitchell M. investigation of image feature extraction by a genetic algorithm. Proc. SPIE 1999;3812:24-31. 11...automated feature extraction using multiple data sources. Proc. SPIE 2003;5099:190-200. 15 4 Spectral-Spatial Analysis of Urine Cytology Angeletti et al...Appendix Contents: 1. Harvey, N.R., Levenson, R.M., Rimm, D.L. (2003) Investigation of Automated Feature Extraction Techniques for Applications in
Zhang, Fan; Briones, Andrea; Soloviev, Mikhail
2016-01-01
This chapter describes the principles of selection of antigenic peptides for the development of anti-peptide antibodies for use in microarray-based multiplex affinity assays and also with mass-spectrometry detection. The methods described here are mostly applicable to small to medium scale arrays. Although the same principles of peptide selection would be suitable for larger scale arrays (with 100+ features) the actual informatics software and printing methods may well be different. Because of the sheer number of proteins/peptides to be processed and analyzed dedicated software capable of processing all the proteins and an enterprise level array robotics may be necessary for larger scale efforts. This report aims to provide practical advice to those who develop or use arrays with up to ~100 different peptide or protein features.
Assessment of gene order computing methods for Alzheimer's disease
2013-01-01
Background Computational genomics of Alzheimer disease (AD), the most common form of senile dementia, is a nascent field in AD research. The field includes AD gene clustering by computing gene order which generates higher quality gene clustering patterns than most other clustering methods. However, there are few available gene order computing methods such as Genetic Algorithm (GA) and Ant Colony Optimization (ACO). Further, their performance in gene order computation using AD microarray data is not known. We thus set forth to evaluate the performances of current gene order computing methods with different distance formulas, and to identify additional features associated with gene order computation. Methods Using different distance formulas- Pearson distance and Euclidean distance, the squared Euclidean distance, and other conditions, gene orders were calculated by ACO and GA (including standard GA and improved GA) methods, respectively. The qualities of the gene orders were compared, and new features from the calculated gene orders were identified. Results Compared to the GA methods tested in this study, ACO fits the AD microarray data the best when calculating gene order. In addition, the following features were revealed: different distance formulas generated a different quality of gene order, and the commonly used Pearson distance was not the best distance formula when used with both GA and ACO methods for AD microarray data. Conclusion Compared with Pearson distance and Euclidean distance, the squared Euclidean distance generated the best quality gene order computed by GA and ACO methods. PMID:23369541
Boswell, Mikki G; Wells, Melissa C; Kirk, Lyndsey M; Ju, Zhenlin; Zhang, Ziping; Booth, Rachell E; Walter, Ronald B
2009-03-01
Gene expression profiling using DNA microarray technology is a useful tool for assessing gene transcript level responses after an organism is exposed to environmental stress. Herein, we detail results from studies using an 8 k medaka (Oryzias latipes) microarray to assess modulated gene expression patterns upon hypoxia exposure of the live-bearing aquaria fish, Xiphophorus maculatus. To assess the reproducibility and reliability of using the medaka array in cross-genus hybridization, a two-factor ANOVA analysis of gene expression was employed. The data show the tissue source of the RNA used for array hybridization contributed more to the observed response of modulated gene targets than did the species source of the RNA. In addition, hierarchical clustering via heat map analyses of groupings of tissues and species (Xiphophorus and medaka) suggests that hypoxia induced similar responses in the same tissues from these two diverse aquatic model organisms. Our Xiphophorus results indicate 206 brain, 37 liver, and 925 gill gene targets exhibit hypoxia induced expression changes. Analysis of the Xiphophorus data to determine those features exhibiting a significant (p<0.05)+/-3 fold change produced only two gene targets within brain tissue and 80 features within gill tissue. Of these 82 characterized features, 39 were identified via homology searching (cut-off E-value of 1 x 10(-5)) and placed into one or more biological process gene ontology groups. Among these 39 genes, metabolic energy changes and manipulation was the most affected biological pathway (13 genes).
Striano, Pasquale; Coppola, Antonietta; Paravidino, Roberta; Malacarne, Michela; Gimelli, Stefania; Robbiano, Angela; Traverso, Monica; Pezzella, Marianna; Belcastro, Vincenzo; Bianchi, Amedeo; Elia, Maurizio; Falace, Antonio; Gazzerro, Elisabetta; Ferlazzo, Edoardo; Freri, Elena; Galasso, Roberta; Gobbi, Giuseppe; Molinatto, Cristina; Cavani, Simona; Zuffardi, Orsetta; Striano, Salvatore; Ferrero, Giovanni Battista; Silengo, Margherita; Cavaliere, Maria Luigia; Benelli, Matteo; Magi, Alberto; Piccione, Maria; Dagna Bricarelli, Franca; Coviello, Domenico A; Fichera, Marco; Minetti, Carlo; Zara, Federico
2012-03-01
To perform an extensive search for genomic rearrangements by microarray-based comparative genomic hybridization in patients with epilepsy. Prospective cohort study. Epilepsy centers in Italy. Two hundred seventy-nine patients with unexplained epilepsy, 265 individuals with nonsyndromic mental retardation but no epilepsy, and 246 healthy control subjects were screened by microarray-based comparative genomic hybridization. Identification of copy number variations (CNVs) and gene enrichment. Rare CNVs occurred in 26 patients (9.3%) and 16 healthy control subjects (6.5%) (P = .26). The CNVs identified in patients were larger (P = .03) and showed higher gene content (P = .02) than those in control subjects. The CNVs larger than 1 megabase (P = .002) and including more than 10 genes (P = .005) occurred more frequently in patients than in control subjects. Nine patients (34.6%) among those harboring rare CNVs showed rearrangements associated with emerging microdeletion or microduplication syndromes. Mental retardation and neuropsychiatric features were associated with rare CNVs (P = .004), whereas epilepsy type was not. The CNV rate in patients with epilepsy and mental retardation or neuropsychiatric features is not different from that observed in patients with mental retardation only. Moreover, significant enrichment of genes involved in ion transport was observed within CNVs identified in patients with epilepsy. Patients with epilepsy show a significantly increased burden of large, rare, gene-rich CNVs, particularly when associated with mental retardation and neuropsychiatric features. The limited overlap between CNVs observed in the epilepsy group and those observed in the group with mental retardation only as well as the involvement of specific (ion channel) genes indicate a specific association between the identified CNVs and epilepsy. Screening for CNVs should be performed for diagnostic purposes preferentially in patients with epilepsy and mental retardation or neuropsychiatric features.
Metadata management and semantics in microarray repositories.
Kocabaş, F; Can, T; Baykal, N
2011-12-01
The number of microarray and other high-throughput experiments on primary repositories keeps increasing as do the size and complexity of the results in response to biomedical investigations. Initiatives have been started on standardization of content, object model, exchange format and ontology. However, there are backlogs and inability to exchange data between microarray repositories, which indicate that there is a great need for a standard format and data management. We have introduced a metadata framework that includes a metadata card and semantic nets that make experimental results visible, understandable and usable. These are encoded in syntax encoding schemes and represented in RDF (Resource Description Frame-word), can be integrated with other metadata cards and semantic nets, and can be exchanged, shared and queried. We demonstrated the performance and potential benefits through a case study on a selected microarray repository. We concluded that the backlogs can be reduced and that exchange of information and asking of knowledge discovery questions can become possible with the use of this metadata framework.
Yamamura, Shohei; Yamada, Eriko; Kimura, Fukiko; Miyajima, Kumiko; Shigeto, Hajime
2017-10-21
A new single-cell microarray chip was designed and developed to separate and analyze single adherent and non-adherent cancer cells. The single-cell microarray chip is made of polystyrene with over 60,000 microchambers of 10 different size patterns (31-40 µm upper diameter, 11-20 µm lower diameter). A drop of suspension of adherent carcinoma (NCI-H1650) and non-adherent leukocyte (CCRF-CEM) cells was placed onto the chip, and single-cell occupancy of NCI-H1650 and CCRF-CEM was determined to be 79% and 84%, respectively. This was achieved by controlling the chip design and surface treatment. Analysis of protein expression in single NCI-H1650 and CCRF-CEM cells was performed on the single-cell microarray chip by multi-antibody staining. Additionally, with this system, we retrieved positive single cells from the microchambers by a micromanipulator. Thus, this system demonstrates the potential for easy and accurate separation and analysis of various types of single cells.
A meta-data based method for DNA microarray imputation.
Jörnsten, Rebecka; Ouyang, Ming; Wang, Hui-Yu
2007-03-29
DNA microarray experiments are conducted in logical sets, such as time course profiling after a treatment is applied to the samples, or comparisons of the samples under two or more conditions. Due to cost and design constraints of spotted cDNA microarray experiments, each logical set commonly includes only a small number of replicates per condition. Despite the vast improvement of the microarray technology in recent years, missing values are prevalent. Intuitively, imputation of missing values is best done using many replicates within the same logical set. In practice, there are few replicates and thus reliable imputation within logical sets is difficult. However, it is in the case of few replicates that the presence of missing values, and how they are imputed, can have the most profound impact on the outcome of downstream analyses (e.g. significance analysis and clustering). This study explores the feasibility of imputation across logical sets, using the vast amount of publicly available microarray data to improve imputation reliability in the small sample size setting. We download all cDNA microarray data of Saccharomyces cerevisiae, Arabidopsis thaliana, and Caenorhabditis elegans from the Stanford Microarray Database. Through cross-validation and simulation, we find that, for all three species, our proposed imputation using data from public databases is far superior to imputation within a logical set, sometimes to an astonishing degree. Furthermore, the imputation root mean square error for significant genes is generally a lot less than that of non-significant ones. Since downstream analysis of significant genes, such as clustering and network analysis, can be very sensitive to small perturbations of estimated gene effects, it is highly recommended that researchers apply reliable data imputation prior to further analysis. Our method can also be applied to cDNA microarray experiments from other species, provided good reference data are available.
Hancock, David; Wilson, Michael; Velarde, Giles; Morrison, Norman; Hayes, Andrew; Hulme, Helen; Wood, A Joseph; Nashar, Karim; Kell, Douglas B; Brass, Andy
2005-11-03
maxdLoad2 is a relational database schema and Java application for microarray experimental annotation and storage. It is compliant with all standards for microarray meta-data capture; including the specification of what data should be recorded, extensive use of standard ontologies and support for data exchange formats. The output from maxdLoad2 is of a form acceptable for submission to the ArrayExpress microarray repository at the European Bioinformatics Institute. maxdBrowse is a PHP web-application that makes contents of maxdLoad2 databases accessible via web-browser, the command-line and web-service environments. It thus acts as both a dissemination and data-mining tool. maxdLoad2 presents an easy-to-use interface to an underlying relational database and provides a full complement of facilities for browsing, searching and editing. There is a tree-based visualization of data connectivity and the ability to explore the links between any pair of data elements, irrespective of how many intermediate links lie between them. Its principle novel features are: the flexibility of the meta-data that can be captured, the tools provided for importing data from spreadsheets and other tabular representations, the tools provided for the automatic creation of structured documents, the ability to browse and access the data via web and web-services interfaces. Within maxdLoad2 it is very straightforward to customise the meta-data that is being captured or change the definitions of the meta-data. These meta-data definitions are stored within the database itself allowing client software to connect properly to a modified database without having to be specially configured. The meta-data definitions (configuration file) can also be centralized allowing changes made in response to revisions of standards or terminologies to be propagated to clients without user intervention.maxdBrowse is hosted on a web-server and presents multiple interfaces to the contents of maxd databases. maxdBrowse emulates many of the browse and search features available in the maxdLoad2 application via a web-browser. This allows users who are not familiar with maxdLoad2 to browse and export microarray data from the database for their own analysis. The same browse and search features are also available via command-line and SOAP server interfaces. This both enables scripting of data export for use embedded in data repositories and analysis environments, and allows access to the maxd databases via web-service architectures. maxdLoad2 http://www.bioinf.man.ac.uk/microarray/maxd/ and maxdBrowse http://dbk.ch.umist.ac.uk/maxdBrowse are portable and compatible with all common operating systems and major database servers. They provide a powerful, flexible package for annotation of microarray experiments and a convenient dissemination environment. They are available for download and open sourced under the Artistic License.
Fully automated analysis of multi-resolution four-channel micro-array genotyping data
NASA Astrophysics Data System (ADS)
Abbaspour, Mohsen; Abugharbieh, Rafeef; Podder, Mohua; Tebbutt, Scott J.
2006-03-01
We present a fully-automated and robust microarray image analysis system for handling multi-resolution images (down to 3-micron with sizes up to 80 MBs per channel). The system is developed to provide rapid and accurate data extraction for our recently developed microarray analysis and quality control tool (SNP Chart). Currently available commercial microarray image analysis applications are inefficient, due to the considerable user interaction typically required. Four-channel DNA microarray technology is a robust and accurate tool for determining genotypes of multiple genetic markers in individuals. It plays an important role in the state of the art trend where traditional medical treatments are to be replaced by personalized genetic medicine, i.e. individualized therapy based on the patient's genetic heritage. However, fast, robust, and precise image processing tools are required for the prospective practical use of microarray-based genetic testing for predicting disease susceptibilities and drug effects in clinical practice, which require a turn-around timeline compatible with clinical decision-making. In this paper we have developed a fully-automated image analysis platform for the rapid investigation of hundreds of genetic variations across multiple genes. Validation tests indicate very high accuracy levels for genotyping results. Our method achieves a significant reduction in analysis time, from several hours to just a few minutes, and is completely automated requiring no manual interaction or guidance.
Wimmer, Isabella; Tröscher, Anna R; Brunner, Florian; Rubino, Stephen J; Bien, Christian G; Weiner, Howard L; Lassmann, Hans; Bauer, Jan
2018-04-20
Formalin-fixed paraffin-embedded (FFPE) tissues are valuable resources commonly used in pathology. However, formalin fixation modifies nucleic acids challenging the isolation of high-quality RNA for genetic profiling. Here, we assessed feasibility and reliability of microarray studies analysing transcriptome data from fresh, fresh-frozen (FF) and FFPE tissues. We show that reproducible microarray data can be generated from only 2 ng FFPE-derived RNA. For RNA quality assessment, fragment size distribution (DV200) and qPCR proved most suitable. During RNA isolation, extending tissue lysis time to 10 hours reduced high-molecular-weight species, while additional incubation at 70 °C markedly increased RNA yields. Since FF- and FFPE-derived microarrays constitute different data entities, we used indirect measures to investigate gene signal variation and relative gene expression. Whole-genome analyses revealed high concordance rates, while reviewing on single-genes basis showed higher data variation in FFPE than FF arrays. Using an experimental model, gene set enrichment analysis (GSEA) of FFPE-derived microarrays and fresh tissue-derived RNA-Seq datasets yielded similarly affected pathways confirming the applicability of FFPE tissue in global gene expression analysis. Our study provides a workflow comprising RNA isolation, quality assessment and microarray profiling using minimal RNA input, thus enabling hypothesis-generating pathway analyses from limited amounts of precious, pathologically significant FFPE tissues.
Bălăcescu, Loredana; Bălăcescu, O; Crişan, N; Fetica, B; Petruţ, B; Bungărdean, Cătălina; Rus, Meda; Tudoran, Oana; Meurice, G; Irimie, Al; Dragoş, N; Berindan-Neagoe, Ioana
2011-01-01
Prostate cancer represents the first leading cause of cancer among western male population, with different clinical behavior ranging from indolent to metastatic disease. Although many molecules and deregulated pathways are known, the molecular mechanisms involved in the development of prostate cancer are not fully understood. The aim of this study was to explore the molecular variation underlying the prostate cancer, based on microarray analysis and bioinformatics approaches. Normal and prostate cancer tissues were collected by macrodissection from prostatectomy pieces. All prostate cancer specimens used in our study were Gleason score 7. Gene expression microarray (Agilent Technologies) was used for Whole Human Genome evaluation. The bioinformatics and functional analysis were based on Limma and Ingenuity software. The microarray analysis identified 1119 differentially expressed genes between prostate cancer and normal prostate, which were up- or down-regulated at least 2-fold. P-values were adjusted for multiple testing using Benjamini-Hochberg method with a false discovery rate of 0.01. These genes were analyzed with Ingenuity Pathway Analysis software and were established 23 genetic networks. Our microarray results provide new information regarding the molecular networks in prostate cancer stratified as Gleason 7. These data highlighted gene expression profiles for better understanding of prostate cancer progression.
Protein microarray analysis reveals BAFF-binding autoantibodies in systemic lupus erythematosus
Price, Jordan V.; Haddon, David J.; Kemmer, Dodge; Delepine, Guillaume; Mandelbaum, Gil; Jarrell, Justin A.; Gupta, Rohit; Balboni, Imelda; Chakravarty, Eliza F.; Sokolove, Jeremy; Shum, Anthony K.; Anderson, Mark S.; Cheng, Mickie H.; Robinson, William H.; Browne, Sarah K.; Holland, Steven M.; Baechler, Emily C.; Utz, Paul J.
2013-01-01
Autoantibodies against cytokines, chemokines, and growth factors inhibit normal immunity and are implicated in inflammatory autoimmune disease and diseases of immune deficiency. In an effort to evaluate serum from autoimmune and immunodeficient patients for Abs against cytokines, chemokines, and growth factors in a high-throughput and unbiased manner, we constructed a multiplex protein microarray for detection of serum factor–binding Abs and used the microarray to detect autoantibody targets in SLE. We designed a nitrocellulose-surface microarray containing human cytokines, chemokines, and other circulating proteins and demonstrated that the array permitted specific detection of serum factor–binding probes. We used the arrays to detect previously described autoantibodies against cytokines in samples from individuals with autoimmune polyendocrine syndrome type 1 and chronic mycobacterial infection. Serum profiling from individuals with SLE revealed that among several targets, elevated IgG autoantibody reactivity to B cell–activating factor (BAFF) was associated with SLE compared with control samples. BAFF reactivity correlated with the severity of disease-associated features, including IFN-α–driven SLE pathology. Our results showed that serum factor protein microarrays facilitate detection of autoantibody reactivity to serum factors in human samples and that BAFF-reactive autoantibodies may be associated with an elevated inflammatory disease state within the spectrum of SLE. PMID:24270423
An efficient pseudomedian filter for tiling microrrays.
Royce, Thomas E; Carriero, Nicholas J; Gerstein, Mark B
2007-06-07
Tiling microarrays are becoming an essential technology in the functional genomics toolbox. They have been applied to the tasks of novel transcript identification, elucidation of transcription factor binding sites, detection of methylated DNA and several other applications in several model organisms. These experiments are being conducted at increasingly finer resolutions as the microarray technology enjoys increasingly greater feature densities. The increased densities naturally lead to increased data analysis requirements. Specifically, the most widely employed algorithm for tiling array analysis involves smoothing observed signals by computing pseudomedians within sliding windows, a O(n2logn) calculation in each window. This poor time complexity is an issue for tiling array analysis and could prove to be a real bottleneck as tiling microarray experiments become grander in scope and finer in resolution. We therefore implemented Monahan's HLQEST algorithm that reduces the runtime complexity for computing the pseudomedian of n numbers to O(nlogn) from O(n2logn). For a representative tiling microarray dataset, this modification reduced the smoothing procedure's runtime by nearly 90%. We then leveraged the fact that elements within sliding windows remain largely unchanged in overlapping windows (as one slides across genomic space) to further reduce computation by an additional 43%. This was achieved by the application of skip lists to maintaining a sorted list of values from window to window. This sorted list could be maintained with simple O(log n) inserts and deletes. We illustrate the favorable scaling properties of our algorithms with both time complexity analysis and benchmarking on synthetic datasets. Tiling microarray analyses that rely upon a sliding window pseudomedian calculation can require many hours of computation. We have eased this requirement significantly by implementing efficient algorithms that scale well with genomic feature density. This result not only speeds the current standard analyses, but also makes possible ones where many iterations of the filter may be required, such as might be required in a bootstrap or parameter estimation setting. Source code and executables are available at http://tiling.gersteinlab.org/pseudomedian/.
An efficient pseudomedian filter for tiling microrrays
Royce, Thomas E; Carriero, Nicholas J; Gerstein, Mark B
2007-01-01
Background Tiling microarrays are becoming an essential technology in the functional genomics toolbox. They have been applied to the tasks of novel transcript identification, elucidation of transcription factor binding sites, detection of methylated DNA and several other applications in several model organisms. These experiments are being conducted at increasingly finer resolutions as the microarray technology enjoys increasingly greater feature densities. The increased densities naturally lead to increased data analysis requirements. Specifically, the most widely employed algorithm for tiling array analysis involves smoothing observed signals by computing pseudomedians within sliding windows, a O(n2logn) calculation in each window. This poor time complexity is an issue for tiling array analysis and could prove to be a real bottleneck as tiling microarray experiments become grander in scope and finer in resolution. Results We therefore implemented Monahan's HLQEST algorithm that reduces the runtime complexity for computing the pseudomedian of n numbers to O(nlogn) from O(n2logn). For a representative tiling microarray dataset, this modification reduced the smoothing procedure's runtime by nearly 90%. We then leveraged the fact that elements within sliding windows remain largely unchanged in overlapping windows (as one slides across genomic space) to further reduce computation by an additional 43%. This was achieved by the application of skip lists to maintaining a sorted list of values from window to window. This sorted list could be maintained with simple O(log n) inserts and deletes. We illustrate the favorable scaling properties of our algorithms with both time complexity analysis and benchmarking on synthetic datasets. Conclusion Tiling microarray analyses that rely upon a sliding window pseudomedian calculation can require many hours of computation. We have eased this requirement significantly by implementing efficient algorithms that scale well with genomic feature density. This result not only speeds the current standard analyses, but also makes possible ones where many iterations of the filter may be required, such as might be required in a bootstrap or parameter estimation setting. Source code and executables are available at . PMID:17555595
Assawamakin, Anunchai; Prueksaaroon, Supakit; Kulawonganunchai, Supasak; Shaw, Philip James; Varavithya, Vara; Ruangrajitpakorn, Taneth; Tongsima, Sissades
2013-01-01
Identification of suitable biomarkers for accurate prediction of phenotypic outcomes is a goal for personalized medicine. However, current machine learning approaches are either too complex or perform poorly. Here, a novel two-step machine-learning framework is presented to address this need. First, a Naïve Bayes estimator is used to rank features from which the top-ranked will most likely contain the most informative features for prediction of the underlying biological classes. The top-ranked features are then used in a Hidden Naïve Bayes classifier to construct a classification prediction model from these filtered attributes. In order to obtain the minimum set of the most informative biomarkers, the bottom-ranked features are successively removed from the Naïve Bayes-filtered feature list one at a time, and the classification accuracy of the Hidden Naïve Bayes classifier is checked for each pruned feature set. The performance of the proposed two-step Bayes classification framework was tested on different types of -omics datasets including gene expression microarray, single nucleotide polymorphism microarray (SNParray), and surface-enhanced laser desorption/ionization time-of-flight (SELDI-TOF) proteomic data. The proposed two-step Bayes classification framework was equal to and, in some cases, outperformed other classification methods in terms of prediction accuracy, minimum number of classification markers, and computational time.
Mutual information estimation reveals global associations between stimuli and biological processes
Suzuki, Taiji; Sugiyama, Masashi; Kanamori, Takafumi; Sese, Jun
2009-01-01
Background Although microarray gene expression analysis has become popular, it remains difficult to interpret the biological changes caused by stimuli or variation of conditions. Clustering of genes and associating each group with biological functions are often used methods. However, such methods only detect partial changes within cell processes. Herein, we propose a method for discovering global changes within a cell by associating observed conditions of gene expression with gene functions. Results To elucidate the association, we introduce a novel feature selection method called Least-Squares Mutual Information (LSMI), which computes mutual information without density estimaion, and therefore LSMI can detect nonlinear associations within a cell. We demonstrate the effectiveness of LSMI through comparison with existing methods. The results of the application to yeast microarray datasets reveal that non-natural stimuli affect various biological processes, whereas others are no significant relation to specific cell processes. Furthermore, we discover that biological processes can be categorized into four types according to the responses of various stimuli: DNA/RNA metabolism, gene expression, protein metabolism, and protein localization. Conclusion We proposed a novel feature selection method called LSMI, and applied LSMI to mining the association between conditions of yeast and biological processes through microarray datasets. In fact, LSMI allows us to elucidate the global organization of cellular process control. PMID:19208155
HOXB9 Expression Correlates with Histological Grade and Prognosis in LSCC
2017-01-01
The purpose of this study was to investigate the HOX gene expression profile in laryngeal squamous cell carcinoma (LSCC) and assess whether some genes are associated with the clinicopathological features and prognosis in LSCC patients. The HOX gene levels were tested by microarray and validated by qRT-PCR in paired cancerous and adjacent noncancerous LSCC tissue samples. The microarray testing data of 39 HOX genes revealed 15 HOX genes that were at least 2-fold upregulated and 2 that were downregulated. After qRT-PCR evaluation, the three most upregulated genes (HOXB9, HOXB13, and HOXD13) were selected for tissue microarray (TMA) analysis. The correlations between the HOXB9, HOXB13, and HOXD13 expression levels and both clinicopathological features and prognosis were analyzed. Three HOX gene expression levels were markedly increased in LSCC tissues compared with adjacent noncancerous tissues (P < 0.001). HOXB9 was found to correlate with histological grade (P < 0.01) and prognosis (P < 0.01) in LSCC. In conclusion, this study revealed that HOXB9, HOXB13, and HOXD13 were upregulated and may play important roles in LSCC. Moreover, HOXB9 may serve as a novel marker of poor prognosis and a potential therapeutic target in LSCC patients. PMID:28808656
Correcting for batch effects in case-control microbiome studies
Gibbons, Sean M.; Duvallet, Claire
2018-01-01
High-throughput data generation platforms, like mass-spectrometry, microarrays, and second-generation sequencing are susceptible to batch effects due to run-to-run variation in reagents, equipment, protocols, or personnel. Currently, batch correction methods are not commonly applied to microbiome sequencing datasets. In this paper, we compare different batch-correction methods applied to microbiome case-control studies. We introduce a model-free normalization procedure where features (i.e. bacterial taxa) in case samples are converted to percentiles of the equivalent features in control samples within a study prior to pooling data across studies. We look at how this percentile-normalization method compares to traditional meta-analysis methods for combining independent p-values and to limma and ComBat, widely used batch-correction models developed for RNA microarray data. Overall, we show that percentile-normalization is a simple, non-parametric approach for correcting batch effects and improving sensitivity in case-control meta-analyses. PMID:29684016
Wang, Huibin; Zhang, Yiming; Yuan, Xun; Chen, Yi; Yan, Mingdi
2010-01-01
A universal photochemical method has been established for the immobilization of intact carbohydrates and their analogues, and for the fabrication of carbohydrate microarrays. The method features the use of perfluorophenyl azide (PFPA)-modified substrates and the photochemical reaction of surface azido groups with printed carbohydrates. Various aldoses, ketoses, non-reducing sugars such as alditols and their derivatives can be directly arrayed on the PFPA-modified chips. The lectin-recognition ability of arrayed mannose, glucose and their oligo- and polysaccharides were confirmed using surface plasmon resonance imaging and laser-induced fluorescence imaging. PMID:21138274
Wang, Huibin; Zhang, Yiming; Yuan, Xun; Chen, Yi; Yan, Mingdi
2011-01-19
A universal photochemical method has been established for the immobilization of intact carbohydrates and their analogues, and for the fabrication of carbohydrate microarrays. The method features the use of perfluorophenyl azide (PFPA)-modified substrates and the photochemical reaction of surface azido groups with printed carbohydrates. Various aldoses, ketoses, nonreducing sugars such as alditols, and their derivatives can be directly arrayed on the PFPA-modified chips. The lectin-recognition ability of arrayed mannose, glucose, and their oligo- and polysaccharides were confirmed using surface plasmon resonance imaging and laser-induced fluorescence imaging.
Multiclass classification of microarray data samples with a reduced number of genes
2011-01-01
Background Multiclass classification of microarray data samples with a reduced number of genes is a rich and challenging problem in Bioinformatics research. The problem gets harder as the number of classes is increased. In addition, the performance of most classifiers is tightly linked to the effectiveness of mandatory gene selection methods. Critical to gene selection is the availability of estimates about the maximum number of genes that can be handled by any classification algorithm. Lack of such estimates may lead to either computationally demanding explorations of a search space with thousands of dimensions or classification models based on gene sets of unrestricted size. In the former case, unbiased but possibly overfitted classification models may arise. In the latter case, biased classification models unable to support statistically significant findings may be obtained. Results A novel bound on the maximum number of genes that can be handled by binary classifiers in binary mediated multiclass classification algorithms of microarray data samples is presented. The bound suggests that high-dimensional binary output domains might favor the existence of accurate and sparse binary mediated multiclass classifiers for microarray data samples. Conclusions A comprehensive experimental work shows that the bound is indeed useful to induce accurate and sparse multiclass classifiers for microarray data samples. PMID:21342522
te Beest, Dennis; de Bruin, Erwin; Imholz, Sandra; Wallinga, Jacco; Teunis, Peter; Koopmans, Marion; van Boven, Michiel
2014-01-01
Reliable discrimination of recent influenza A infection from previous exposure using hemagglutination inhibition (HI) or virus neutralization tests is currently not feasible. This is due to low sensitivity of the tests and the interference of antibody responses generated by previous infections. Here we investigate the diagnostic characteristics of a newly developed antibody (HA1) protein microarray using data from cross-sectional serological studies carried out before and after the pandemic of 2009. The data are analysed by mixture models, providing a probabilistic classification of sera (susceptible, prior-exposed, recently infected). Estimated sensitivity and specificity for identifying A/2009 infections are low using HI (66% and 51%), and high when using A/2009 microarray data alone or together with A/1918 microarray data (96% and 95%). As a heuristic, a high A/2009 to A/1918 antibody ratio (>1.05) is indicative of recent infection, while a low ratio is indicative of a pre-existing response, even if the A/2009 titer is high. We conclude that highly sensitive and specific classification of individual sera is possible using the protein microarray, thereby enabling precise estimation of age-specific infection attack rates in the population even if sample sizes are small. PMID:25405997
NASA Astrophysics Data System (ADS)
Dan, X.; Yang, J. J.
2016-07-01
Self-assembled films with needle-like microarrays were fabricated using a mixture of cobalt and fluorocarbon resin under a magnetic field. The various influences of magnetic powder content, viscosity and size distribution on the structure of the self-assembled films were investigated. The self-assembled film morphologies were characterized by stereomicroscope and scanning electron microscopy. Experimental results indicate that an increase in magnetic powder content results in greater unit height and diameter, and that a reduction in viscosity results in increasing array density and decreasing unit width. Additionally, particles with narrow size distribution were able to attain more regular microarray structures. The structural alterations were closely related to numerous effects such as van der Waals forces, dipole-dipole interactions, and external-dipole interactions. The self-assembled film demonstrated magnetic anisotropy, as identified by vibrating sample magnetometry (VSM).
Erickson, A; Fisher, M; Furukawa-Stoffer, T; Ambagala, A; Hodko, D; Pasick, J; King, D P; Nfon, C; Ortega Polo, R; Lung, O
2018-04-01
Microarray technology can be useful for pathogen detection as it allows simultaneous interrogation of the presence or absence of a large number of genetic signatures. However, most microarray assays are labour-intensive and time-consuming to perform. This study describes the development and initial evaluation of a multiplex reverse transcription (RT)-PCR and novel accompanying automated electronic microarray assay for simultaneous detection and differentiation of seven important viruses that affect swine (foot-and-mouth disease virus [FMDV], swine vesicular disease virus [SVDV], vesicular exanthema of swine virus [VESV], African swine fever virus [ASFV], classical swine fever virus [CSFV], porcine respiratory and reproductive syndrome virus [PRRSV] and porcine circovirus type 2 [PCV2]). The novel electronic microarray assay utilizes a single, user-friendly instrument that integrates and automates capture probe printing, hybridization, washing and reporting on a disposable electronic microarray cartridge with 400 features. This assay accurately detected and identified a total of 68 isolates of the seven targeted virus species including 23 samples of FMDV, representing all seven serotypes, and 10 CSFV strains, representing all three genotypes. The assay successfully detected viruses in clinical samples from the field, experimentally infected animals (as early as 1 day post-infection (dpi) for FMDV and SVDV, 4 dpi for ASFV, 5 dpi for CSFV), as well as in biological material that were spiked with target viruses. The limit of detection was 10 copies/μl for ASFV, PCV2 and PRRSV, 100 copies/μl for SVDV, CSFV, VESV and 1,000 copies/μl for FMDV. The electronic microarray component had reduced analytical sensitivity for several of the target viruses when compared with the multiplex RT-PCR. The integration of capture probe printing allows custom onsite array printing as needed, while electrophoretically driven hybridization generates results faster than conventional microarrays that rely on passive hybridization. With further refinement, this novel, rapid, highly automated microarray technology has potential applications in multipathogen surveillance of livestock diseases. © 2017 Her Majesty the Queen in Right of Canada • Transboundary and Emerging Diseases.
NASA Astrophysics Data System (ADS)
Wang, Chenglin; Wang, Mengye; Xie, Kunpeng; Wu, Qi; Sun, Lan; Lin, Zhiqun; Lin, Changjian
2011-07-01
Microarrays of N-doped flower-like TiO2 composed of well-defined multilayer nanoflakes were synthesized at room temperature by electrochemical anodization of Ti in NH4F aqueous solution. The TiO2 flowers were of good anatase crystallinity. The effects of anodizing time, applied voltage and NH4F concentration on the flower-like morphology were systematically examined. It was found that the morphologies of the anodized Ti were related to the anodizing time and NH4F concentration. The size and density of the TiO2 flowers could be tuned by changing the applied voltage. The obtained N-doped flower-like TiO2 microarrays exhibited intense absorption in wavelengths ranging from 320 to 800 nm. Under both UV and visible light irradiation, the photocatalytic activity of the N-doped flower-like TiO2 microarrays in the oxidation of methyl orange showed a significant increase compared with that of commercial P25 TiO2 film.
Wang, Chenglin; Wang, Mengye; Xie, Kunpeng; Wu, Qi; Sun, Lan; Lin, Zhiqun; Lin, Changjian
2011-07-29
Microarrays of N-doped flower-like TiO(2) composed of well-defined multilayer nanoflakes were synthesized at room temperature by electrochemical anodization of Ti in NH(4)F aqueous solution. The TiO(2) flowers were of good anatase crystallinity. The effects of anodizing time, applied voltage and NH(4)F concentration on the flower-like morphology were systematically examined. It was found that the morphologies of the anodized Ti were related to the anodizing time and NH(4)F concentration. The size and density of the TiO(2) flowers could be tuned by changing the applied voltage. The obtained N-doped flower-like TiO(2) microarrays exhibited intense absorption in wavelengths ranging from 320 to 800 nm. Under both UV and visible light irradiation, the photocatalytic activity of the N-doped flower-like TiO(2) microarrays in the oxidation of methyl orange showed a significant increase compared with that of commercial P25 TiO(2) film.
Haitsma, Jack J.; Furmli, Suleiman; Masoom, Hussain; Liu, Mingyao; Imai, Yumiko; Slutsky, Arthur S.; Beyene, Joseph; Greenwood, Celia M. T.; dos Santos, Claudia
2012-01-01
Objectives To perform a meta-analysis of gene expression microarray data from animal studies of lung injury, and to identify an injury-specific gene expression signature capable of predicting the development of lung injury in humans. Methods We performed a microarray meta-analysis using 77 microarray chips across six platforms, two species and different animal lung injury models exposed to lung injury with or/and without mechanical ventilation. Individual gene chips were classified and grouped based on the strategy used to induce lung injury. Effect size (change in gene expression) was calculated between non-injurious and injurious conditions comparing two main strategies to pool chips: (1) one-hit and (2) two-hit lung injury models. A random effects model was used to integrate individual effect sizes calculated from each experiment. Classification models were built using the gene expression signatures generated by the meta-analysis to predict the development of lung injury in human lung transplant recipients. Results Two injury-specific lists of differentially expressed genes generated from our meta-analysis of lung injury models were validated using external data sets and prospective data from animal models of ventilator-induced lung injury (VILI). Pathway analysis of gene sets revealed that both new and previously implicated VILI-related pathways are enriched with differentially regulated genes. Classification model based on gene expression signatures identified in animal models of lung injury predicted development of primary graft failure (PGF) in lung transplant recipients with larger than 80% accuracy based upon injury profiles from transplant donors. We also found that better classifier performance can be achieved by using meta-analysis to identify differentially-expressed genes than using single study-based differential analysis. Conclusion Taken together, our data suggests that microarray analysis of gene expression data allows for the detection of “injury" gene predictors that can classify lung injury samples and identify patients at risk for clinically relevant lung injury complications. PMID:23071521
EDGE3: A web-based solution for management and analysis of Agilent two color microarray experiments
Vollrath, Aaron L; Smith, Adam A; Craven, Mark; Bradfield, Christopher A
2009-01-01
Background The ability to generate transcriptional data on the scale of entire genomes has been a boon both in the improvement of biological understanding and in the amount of data generated. The latter, the amount of data generated, has implications when it comes to effective storage, analysis and sharing of these data. A number of software tools have been developed to store, analyze, and share microarray data. However, a majority of these tools do not offer all of these features nor do they specifically target the commonly used two color Agilent DNA microarray platform. Thus, the motivating factor for the development of EDGE3 was to incorporate the storage, analysis and sharing of microarray data in a manner that would provide a means for research groups to collaborate on Agilent-based microarray experiments without a large investment in software-related expenditures or extensive training of end-users. Results EDGE3 has been developed with two major functions in mind. The first function is to provide a workflow process for the generation of microarray data by a research laboratory or a microarray facility. The second is to store, analyze, and share microarray data in a manner that doesn't require complicated software. To satisfy the first function, EDGE3 has been developed as a means to establish a well defined experimental workflow and information system for microarray generation. To satisfy the second function, the software application utilized as the user interface of EDGE3 is a web browser. Within the web browser, a user is able to access the entire functionality, including, but not limited to, the ability to perform a number of bioinformatics based analyses, collaborate between research groups through a user-based security model, and access to the raw data files and quality control files generated by the software used to extract the signals from an array image. Conclusion Here, we present EDGE3, an open-source, web-based application that allows for the storage, analysis, and controlled sharing of transcription-based microarray data generated on the Agilent DNA platform. In addition, EDGE3 provides a means for managing RNA samples and arrays during the hybridization process. EDGE3 is freely available for download at . PMID:19732451
Vollrath, Aaron L; Smith, Adam A; Craven, Mark; Bradfield, Christopher A
2009-09-04
The ability to generate transcriptional data on the scale of entire genomes has been a boon both in the improvement of biological understanding and in the amount of data generated. The latter, the amount of data generated, has implications when it comes to effective storage, analysis and sharing of these data. A number of software tools have been developed to store, analyze, and share microarray data. However, a majority of these tools do not offer all of these features nor do they specifically target the commonly used two color Agilent DNA microarray platform. Thus, the motivating factor for the development of EDGE(3) was to incorporate the storage, analysis and sharing of microarray data in a manner that would provide a means for research groups to collaborate on Agilent-based microarray experiments without a large investment in software-related expenditures or extensive training of end-users. EDGE(3) has been developed with two major functions in mind. The first function is to provide a workflow process for the generation of microarray data by a research laboratory or a microarray facility. The second is to store, analyze, and share microarray data in a manner that doesn't require complicated software. To satisfy the first function, EDGE3 has been developed as a means to establish a well defined experimental workflow and information system for microarray generation. To satisfy the second function, the software application utilized as the user interface of EDGE(3) is a web browser. Within the web browser, a user is able to access the entire functionality, including, but not limited to, the ability to perform a number of bioinformatics based analyses, collaborate between research groups through a user-based security model, and access to the raw data files and quality control files generated by the software used to extract the signals from an array image. Here, we present EDGE(3), an open-source, web-based application that allows for the storage, analysis, and controlled sharing of transcription-based microarray data generated on the Agilent DNA platform. In addition, EDGE(3) provides a means for managing RNA samples and arrays during the hybridization process. EDGE(3) is freely available for download at http://edge.oncology.wisc.edu/.
Autonomous system for Web-based microarray image analysis.
Bozinov, Daniel
2003-12-01
Software-based feature extraction from DNA microarray images still requires human intervention on various levels. Manual adjustment of grid and metagrid parameters, precise alignment of superimposed grid templates and gene spots, or simply identification of large-scale artifacts have to be performed beforehand to reliably analyze DNA signals and correctly quantify their expression values. Ideally, a Web-based system with input solely confined to a single microarray image and a data table as output containing measurements for all gene spots would directly transform raw image data into abstracted gene expression tables. Sophisticated algorithms with advanced procedures for iterative correction function can overcome imminent challenges in image processing. Herein is introduced an integrated software system with a Java-based interface on the client side that allows for decentralized access and furthermore enables the scientist to instantly employ the most updated software version at any given time. This software tool is extended from PixClust as used in Extractiff incorporated with Java Web Start deployment technology. Ultimately, this setup is destined for high-throughput pipelines in genome-wide medical diagnostics labs or microarray core facilities aimed at providing fully automated service to its users.
Schröder, Christoph; Jacob, Anette; Tonack, Sarah; Radon, Tomasz P.; Sill, Martin; Zucknick, Manuela; Rüffer, Sven; Costello, Eithne; Neoptolemos, John P.; Crnogorac-Jurcevic, Tatjana; Bauer, Andrea; Fellenberg, Kurt; Hoheisel, Jörg D.
2010-01-01
Antibody microarrays have the potential to enable comprehensive proteomic analysis of small amounts of sample material. Here, protocols are presented for the production, quality assessment, and reproducible application of antibody microarrays in a two-color mode with an array of 1,800 features, representing 810 antibodies that were directed at 741 cancer-related proteins. In addition to measures of array quality, we implemented indicators for the accuracy and significance of dual-color detection. Dual-color measurements outperform a single-color approach concerning assay reproducibility and discriminative power. In the analysis of serum samples, depletion of high-abundance proteins did not improve technical assay quality. On the contrary, depletion introduced a strong bias in protein representation. In an initial study, we demonstrated the applicability of the protocols to proteins derived from urine samples. We identified differences between urine samples from pancreatic cancer patients and healthy subjects and between sexes. This study demonstrates that biomedically relevant data can be produced. As demonstrated by the thorough quality analysis, the dual-color antibody array approach proved to be competitive with other proteomic techniques and comparable in performance to transcriptional microarray analyses. PMID:20164060
Multiclassifier information fusion methods for microarray pattern recognition
NASA Astrophysics Data System (ADS)
Braun, Jerome J.; Glina, Yan; Judson, Nicholas; Herzig-Marx, Rachel
2004-04-01
This paper addresses automatic recognition of microarray patterns, a capability that could have a major significance for medical diagnostics, enabling development of diagnostic tools for automatic discrimination of specific diseases. The paper presents multiclassifier information fusion methods for microarray pattern recognition. The input space partitioning approach based on fitness measures that constitute an a-priori gauging of classification efficacy for each subspace is investigated. Methods for generation of fitness measures, generation of input subspaces and their use in the multiclassifier fusion architecture are presented. In particular, two-level quantification of fitness that accounts for the quality of each subspace as well as the quality of individual neighborhoods within the subspace is described. Individual-subspace classifiers are Support Vector Machine based. The decision fusion stage fuses the information from mulitple SVMs along with the multi-level fitness information. Final decision fusion stage techniques, including weighted fusion as well as Dempster-Shafer theory based fusion are investigated. It should be noted that while the above methods are discussed in the context of microarray pattern recognition, they are applicable to a broader range of discrimination problems, in particular to problems involving a large number of information sources irreducible to a low-dimensional feature space.
Quantum-enhanced feature selection with forward selection and backward elimination
NASA Astrophysics Data System (ADS)
He, Zhimin; Li, Lvzhou; Huang, Zhiming; Situ, Haozhen
2018-07-01
Feature selection is a well-known preprocessing technique in machine learning, which can remove irrelevant features to improve the generalization capability of a classifier and reduce training and inference time. However, feature selection is time-consuming, particularly for the applications those have thousands of features, such as image retrieval, text mining and microarray data analysis. It is crucial to accelerate the feature selection process. We propose a quantum version of wrapper-based feature selection, which converts a classical feature selection to its quantum counterpart. It is valuable for machine learning on quantum computer. In this paper, we focus on two popular kinds of feature selection methods, i.e., wrapper-based forward selection and backward elimination. The proposed feature selection algorithm can quadratically accelerate the classical one.
Rapid and Facile Microwave-Assisted Surface Chemistry for Functionalized Microarray Slides
Lee, Jeong Heon; Hyun, Hoon; Cross, Conor J.; Henary, Maged; Nasr, Khaled A.; Oketokoun, Rafiou; Choi, Hak Soo; Frangioni, John V.
2011-01-01
We describe a rapid and facile method for surface functionalization and ligand patterning of glass slides based on microwave-assisted synthesis and a microarraying robot. Our optimized reaction enables surface modification 42-times faster than conventional techniques and includes a carboxylated self-assembled monolayer, polyethylene glycol linkers of varying length, and stable amide bonds to small molecule, peptide, or protein ligands to be screened for binding to living cells. We also describe customized slide racks that permit functionalization of 100 slides at a time to produce a cost-efficient, highly reproducible batch process. Ligand spots can be positioned on the glass slides precisely using a microarraying robot, and spot size adjusted for any desired application. Using this system, we demonstrate live cell binding to a variety of ligands and optimize PEG linker length. Taken together, the technology we describe should enable high-throughput screening of disease-specific ligands that bind to living cells. PMID:23467787
Design of an ultrasonic micro-array for near field sensing during retinal microsurgery.
Clarke, Clyde; Etienne-Cummings, Ralph
2006-01-01
A method for obtaining the optimal and specific sensor parameters for a tool-tip mountable ultrasonic transducer micro-array is presented. The ultrasonic transducer array sensor parameters, such as frequency of operation, element size, inter-element spacing, number of elements and transducer geometry are obtained using a quadratic programming method to obtain a maximum directivity while being constrained to a total array size of 4 mm2 and the required resolution for retinal imaging. The technique is used to design a uniformly spaced NxN transducer array that is capable of resolving structures in the retina that are as small as 2 microm from a distance of 100 microm. The resultant 37x37 array of 16 microm transducers with 26 microm spacing will be realized as a Capacitive Micromachined Ultrasonic Transducer (CMUT) array and used for imaging and robotic guidance during retinal microsurgery.
Even-Desrumeaux, Klervi; Baty, Daniel; Chames, Patrick
2010-01-01
Antibodies microarrays are among the novel class of rapidly emerging proteomic technologies that will allow us to efficiently perform specific diagnosis and proteome analysis. Recombinant antibody fragments are especially suited for this approach but their stability is often a limiting factor. Camelids produce functional antibodies devoid of light chains (HCAbs) of which the single N-terminal domain is fully capable of antigen binding. When produced as an independent domain, these so-called single domain antibody fragments (sdAbs) have several advantages for biotechnological applications thanks to their unique properties of size (15 kDa), stability, solubility, and expression yield. These features should allow sdAbs to outperform other antibody formats in a number of applications, notably as capture molecule for antibody arrays. In this study, we have produced antibody microarrays using direct and oriented immobilization of sdAbs produced in crude bacterial lysates to generate proof-of-principle of a high-throughput compatible array design. Several sdAb immobilization strategies have been explored. Immobilization of in vivo biotinylated sdAbs by direct spotting of bacterial lysate on streptavidin and sandwich detection was developed to achieve high sensitivity and specificity, whereas immobilization of “multi-tagged” sdAbs via anti-tag antibodies and direct labeled sample detection strategy was optimized for the design of high-density antibody arrays for high-throughput proteomics and identification of potential biomarkers. PMID:20859568
Genome-wide analyses of LINE–LINE-mediated nonallelic homologous recombination
Startek, Michał; Szafranski, Przemyslaw; Gambin, Tomasz; Campbell, Ian M.; Hixson, Patricia; Shaw, Chad A.; Stankiewicz, Paweł; Gambin, Anna
2015-01-01
Nonallelic homologous recombination (NAHR), occurring between low-copy repeats (LCRs) >10 kb in size and sharing >97% DNA sequence identity, is responsible for the majority of recurrent genomic rearrangements in the human genome. Recent studies have shown that transposable elements (TEs) can also mediate recurrent deletions and translocations, indicating the features of substrates that mediate NAHR may be significantly less stringent than previously believed. Using >4 kb length and >95% sequence identity criteria, we analyzed of the genome-wide distribution of long interspersed element (LINE) retrotransposon and their potential to mediate NAHR. We identified 17 005 directly oriented LINE pairs located <10 Mbp from each other as potential NAHR substrates, placing 82.8% of the human genome at risk of LINE–LINE-mediated instability. Cross-referencing these regions with CNVs in the Baylor College of Medicine clinical chromosomal microarray database of 36 285 patients, we identified 516 CNVs potentially mediated by LINEs. Using long-range PCR of five different genomic regions in a total of 44 patients, we confirmed that the CNV breakpoints in each patient map within the LINE elements. To additionally assess the scale of LINE–LINE/NAHR phenomenon in the human genome, we tested DNA samples from six healthy individuals on a custom aCGH microarray targeting LINE elements predicted to mediate CNVs and identified 25 LINE–LINE rearrangements. Our data indicate that LINE–LINE-mediated NAHR is widespread and under-recognized, and is an important mechanism of structural rearrangement contributing to human genomic variability. PMID:25613453
Comparative study of classification algorithms for immunosignaturing data
2012-01-01
Background High-throughput technologies such as DNA, RNA, protein, antibody and peptide microarrays are often used to examine differences across drug treatments, diseases, transgenic animals, and others. Typically one trains a classification system by gathering large amounts of probe-level data, selecting informative features, and classifies test samples using a small number of features. As new microarrays are invented, classification systems that worked well for other array types may not be ideal. Expression microarrays, arguably one of the most prevalent array types, have been used for years to help develop classification algorithms. Many biological assumptions are built into classifiers that were designed for these types of data. One of the more problematic is the assumption of independence, both at the probe level and again at the biological level. Probes for RNA transcripts are designed to bind single transcripts. At the biological level, many genes have dependencies across transcriptional pathways where co-regulation of transcriptional units may make many genes appear as being completely dependent. Thus, algorithms that perform well for gene expression data may not be suitable when other technologies with different binding characteristics exist. The immunosignaturing microarray is based on complex mixtures of antibodies binding to arrays of random sequence peptides. It relies on many-to-many binding of antibodies to the random sequence peptides. Each peptide can bind multiple antibodies and each antibody can bind multiple peptides. This technology has been shown to be highly reproducible and appears promising for diagnosing a variety of disease states. However, it is not clear what is the optimal classification algorithm for analyzing this new type of data. Results We characterized several classification algorithms to analyze immunosignaturing data. We selected several datasets that range from easy to difficult to classify, from simple monoclonal binding to complex binding patterns in asthma patients. We then classified the biological samples using 17 different classification algorithms. Using a wide variety of assessment criteria, we found ‘Naïve Bayes’ far more useful than other widely used methods due to its simplicity, robustness, speed and accuracy. Conclusions ‘Naïve Bayes’ algorithm appears to accommodate the complex patterns hidden within multilayered immunosignaturing microarray data due to its fundamental mathematical properties. PMID:22720696
Implementation of spectral clustering on microarray data of carcinoma using k-means algorithm
NASA Astrophysics Data System (ADS)
Frisca, Bustamam, Alhadi; Siswantining, Titin
2017-03-01
Clustering is one of data analysis methods that aims to classify data which have similar characteristics in the same group. Spectral clustering is one of the most popular modern clustering algorithms. As an effective clustering technique, spectral clustering method emerged from the concepts of spectral graph theory. Spectral clustering method needs partitioning algorithm. There are some partitioning methods including PAM, SOM, Fuzzy c-means, and k-means. Based on the research that has been done by Capital and Choudhury in 2013, when using Euclidian distance k-means algorithm provide better accuracy than PAM algorithm. So in this paper we use k-means as our partition algorithm. The major advantage of spectral clustering is in reducing data dimension, especially in this case to reduce the dimension of large microarray dataset. Microarray data is a small-sized chip made of a glass plate containing thousands and even tens of thousands kinds of genes in the DNA fragments derived from doubling cDNA. Application of microarray data is widely used to detect cancer, for the example is carcinoma, in which cancer cells express the abnormalities in his genes. The purpose of this research is to classify the data that have high similarity in the same group and the data that have low similarity in the others. In this research, Carcinoma microarray data using 7457 genes. The result of partitioning using k-means algorithm is two clusters.
Harvey, Benjamin Simeon; Ji, Soo-Yeon
2017-01-01
As microarray data available to scientists continues to increase in size and complexity, it has become overwhelmingly important to find multiple ways to bring forth oncological inference to the bioinformatics community through the analysis of large-scale cancer genomic (LSCG) DNA and mRNA microarray data that is useful to scientists. Though there have been many attempts to elucidate the issue of bringing forth biological interpretation by means of wavelet preprocessing and classification, there has not been a research effort that focuses on a cloud-scale distributed parallel (CSDP) separable 1-D wavelet decomposition technique for denoising through differential expression thresholding and classification of LSCG microarray data. This research presents a novel methodology that utilizes a CSDP separable 1-D method for wavelet-based transformation in order to initialize a threshold which will retain significantly expressed genes through the denoising process for robust classification of cancer patients. Additionally, the overall study was implemented and encompassed within CSDP environment. The utilization of cloud computing and wavelet-based thresholding for denoising was used for the classification of samples within the Global Cancer Map, Cancer Cell Line Encyclopedia, and The Cancer Genome Atlas. The results proved that separable 1-D parallel distributed wavelet denoising in the cloud and differential expression thresholding increased the computational performance and enabled the generation of higher quality LSCG microarray datasets, which led to more accurate classification results.
Discovery and mapping of single feature polymorphisms in wheat using Affymetrix arrays
Bernardo, Amy N; Bradbury, Peter J; Ma, Hongxiang; Hu, Shengwa; Bowden, Robert L; Buckler, Edward S; Bai, Guihua
2009-01-01
Background Wheat (Triticum aestivum L.) is a staple food crop worldwide. The wheat genome has not yet been sequenced due to its huge genome size (~17,000 Mb) and high levels of repetitive sequences; the whole genome sequence may not be expected in the near future. Available linkage maps have low marker density due to limitation in available markers; therefore new technologies that detect genome-wide polymorphisms are still needed to discover a large number of new markers for construction of high-resolution maps. A high-resolution map is a critical tool for gene isolation, molecular breeding and genomic research. Single feature polymorphism (SFP) is a new microarray-based type of marker that is detected by hybridization of DNA or cRNA to oligonucleotide probes. This study was conducted to explore the feasibility of using the Affymetrix GeneChip to discover and map SFPs in the large hexaploid wheat genome. Results Six wheat varieties of diverse origins (Ning 7840, Clark, Jagger, Encruzilhada, Chinese Spring, and Opata 85) were analyzed for significant probe by variety interactions and 396 probe sets with SFPs were identified. A subset of 164 unigenes was sequenced and 54% showed polymorphism within probes. Microarray analysis of 71 recombinant inbred lines from the cross Ning 7840/Clark identified 955 SFPs and 877 of them were mapped together with 269 simple sequence repeat markers. The SFPs were randomly distributed within a chromosome but were unevenly distributed among different genomes. The B genome had the most SFPs, and the D genome had the least. Map positions of a selected set of SFPs were validated by mapping single nucleotide polymorphism using SNaPshot and comparing with expressed sequence tags mapping data. Conclusion The Affymetrix array is a cost-effective platform for SFP discovery and SFP mapping in wheat. The new high-density map constructed in this study will be a useful tool for genetic and genomic research in wheat. PMID:19480702
Drost, Derek R; Novaes, Evandro; Boaventura-Novaes, Carolina; Benedict, Catherine I; Brown, Ryan S; Yin, Tongming; Tuskan, Gerald A; Kirst, Matias
2009-06-01
Microarrays have demonstrated significant power for genome-wide analyses of gene expression, and recently have also revolutionized the genetic analysis of segregating populations by genotyping thousands of loci in a single assay. Although microarray-based genotyping approaches have been successfully applied in yeast and several inbred plant species, their power has not been proven in an outcrossing species with extensive genetic diversity. Here we have developed methods for high-throughput microarray-based genotyping in such species using a pseudo-backcross progeny of 154 individuals of Populus trichocarpa and P. deltoides analyzed with long-oligonucleotide in situ-synthesized microarray probes. Our analysis resulted in high-confidence genotypes for 719 single-feature polymorphism (SFP) and 1014 gene expression marker (GEM) candidates. Using these genotypes and an established microsatellite (SSR) framework map, we produced a high-density genetic map comprising over 600 SFPs, GEMs and SSRs. The abundance of gene-based markers allowed us to localize over 35 million base pairs of previously unplaced whole-genome shotgun (WGS) scaffold sequence to putative locations in the genome of P. trichocarpa. A high proportion of sampled scaffolds could be verified for their placement with independently mapped SSRs, demonstrating the previously un-utilized power that high-density genotyping can provide in the context of map-based WGS sequence reassembly. Our results provide a substantial contribution to the continued improvement of the Populus genome assembly, while demonstrating the feasibility of microarray-based genotyping in a highly heterozygous population. The strategies presented are applicable to genetic mapping efforts in all plant species with similarly high levels of genetic diversity.
Shaw, Joseph R; Colbourne, John K; Davey, Jennifer C; Glaholt, Stephen P; Hampton, Thomas H; Chen, Celia Y; Folt, Carol L; Hamilton, Joshua W
2007-12-21
Genomic research tools such as microarrays are proving to be important resources to study the complex regulation of genes that respond to environmental perturbations. A first generation cDNA microarray was developed for the environmental indicator species Daphnia pulex, to identify genes whose regulation is modulated following exposure to the metal stressor cadmium. Our experiments revealed interesting changes in gene transcription that suggest their biological roles and their potentially toxicological features in responding to this important environmental contaminant. Our microarray identified genes reported in the literature to be regulated in response to cadmium exposure, suggested functional attributes for genes that share no sequence similarity to proteins in the public databases, and pointed to genes that are likely members of expanded gene families in the Daphnia genome. Genes identified on the microarray also were associated with cadmium induced phenotypes and population-level outcomes that we experimentally determined. A subset of genes regulated in response to cadmium exposure was independently validated using quantitative-realtime (Q-RT)-PCR. These microarray studies led to the discovery of three genes coding for the metal detoxication protein metallothionein (MT). The gene structures and predicted translated sequences of D. pulex MTs clearly place them in this gene family. Yet, they share little homology with previously characterized MTs. The genomic information obtained from this study represents an important first step in characterizing microarray patterns that may be diagnostic to specific environmental contaminants and give insights into their toxicological mechanisms, while also providing a practical tool for evolutionary, ecological, and toxicological functional gene discovery studies. Advances in Daphnia genomics will enable the further development of this species as a model organism for the environmental sciences.
Shaw, Joseph R; Colbourne, John K; Davey, Jennifer C; Glaholt, Stephen P; Hampton, Thomas H; Chen, Celia Y; Folt, Carol L; Hamilton, Joshua W
2007-01-01
Background Genomic research tools such as microarrays are proving to be important resources to study the complex regulation of genes that respond to environmental perturbations. A first generation cDNA microarray was developed for the environmental indicator species Daphnia pulex, to identify genes whose regulation is modulated following exposure to the metal stressor cadmium. Our experiments revealed interesting changes in gene transcription that suggest their biological roles and their potentially toxicological features in responding to this important environmental contaminant. Results Our microarray identified genes reported in the literature to be regulated in response to cadmium exposure, suggested functional attributes for genes that share no sequence similarity to proteins in the public databases, and pointed to genes that are likely members of expanded gene families in the Daphnia genome. Genes identified on the microarray also were associated with cadmium induced phenotypes and population-level outcomes that we experimentally determined. A subset of genes regulated in response to cadmium exposure was independently validated using quantitative-realtime (Q-RT)-PCR. These microarray studies led to the discovery of three genes coding for the metal detoxication protein metallothionein (MT). The gene structures and predicted translated sequences of D. pulex MTs clearly place them in this gene family. Yet, they share little homology with previously characterized MTs. Conclusion The genomic information obtained from this study represents an important first step in characterizing microarray patterns that may be diagnostic to specific environmental contaminants and give insights into their toxicological mechanisms, while also providing a practical tool for evolutionary, ecological, and toxicological functional gene discovery studies. Advances in Daphnia genomics will enable the further development of this species as a model organism for the environmental sciences. PMID:18154678
An efficient ensemble learning method for gene microarray classification.
Osareh, Alireza; Shadgar, Bita
2013-01-01
The gene microarray analysis and classification have demonstrated an effective way for the effective diagnosis of diseases and cancers. However, it has been also revealed that the basic classification techniques have intrinsic drawbacks in achieving accurate gene classification and cancer diagnosis. On the other hand, classifier ensembles have received increasing attention in various applications. Here, we address the gene classification issue using RotBoost ensemble methodology. This method is a combination of Rotation Forest and AdaBoost techniques which in turn preserve both desirable features of an ensemble architecture, that is, accuracy and diversity. To select a concise subset of informative genes, 5 different feature selection algorithms are considered. To assess the efficiency of the RotBoost, other nonensemble/ensemble techniques including Decision Trees, Support Vector Machines, Rotation Forest, AdaBoost, and Bagging are also deployed. Experimental results have revealed that the combination of the fast correlation-based feature selection method with ICA-based RotBoost ensemble is highly effective for gene classification. In fact, the proposed method can create ensemble classifiers which outperform not only the classifiers produced by the conventional machine learning but also the classifiers generated by two widely used conventional ensemble learning methods, that is, Bagging and AdaBoost.
NCBI GEO: archive for high-throughput functional genomic data.
Barrett, Tanya; Troup, Dennis B; Wilhite, Stephen E; Ledoux, Pierre; Rudnev, Dmitry; Evangelista, Carlos; Kim, Irene F; Soboleva, Alexandra; Tomashevsky, Maxim; Marshall, Kimberly A; Phillippy, Katherine H; Sherman, Patti M; Muertter, Rolf N; Edgar, Ron
2009-01-01
The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) is the largest public repository for high-throughput gene expression data. Additionally, GEO hosts other categories of high-throughput functional genomic data, including those that examine genome copy number variations, chromatin structure, methylation status and transcription factor binding. These data are generated by the research community using high-throughput technologies like microarrays and, more recently, next-generation sequencing. The database has a flexible infrastructure that can capture fully annotated raw and processed data, enabling compliance with major community-derived scientific reporting standards such as 'Minimum Information About a Microarray Experiment' (MIAME). In addition to serving as a centralized data storage hub, GEO offers many tools and features that allow users to effectively explore, analyze and download expression data from both gene-centric and experiment-centric perspectives. This article summarizes the GEO repository structure, content and operating procedures, as well as recently introduced data mining features. GEO is freely accessible at http://www.ncbi.nlm.nih.gov/geo/.
Workflows for microarray data processing in the Kepler environment.
Stropp, Thomas; McPhillips, Timothy; Ludäscher, Bertram; Bieda, Mark
2012-05-17
Microarray data analysis has been the subject of extensive and ongoing pipeline development due to its complexity, the availability of several options at each analysis step, and the development of new analysis demands, including integration with new data sources. Bioinformatics pipelines are usually custom built for different applications, making them typically difficult to modify, extend and repurpose. Scientific workflow systems are intended to address these issues by providing general-purpose frameworks in which to develop and execute such pipelines. The Kepler workflow environment is a well-established system under continual development that is employed in several areas of scientific research. Kepler provides a flexible graphical interface, featuring clear display of parameter values, for design and modification of workflows. It has capabilities for developing novel computational components in the R, Python, and Java programming languages, all of which are widely used for bioinformatics algorithm development, along with capabilities for invoking external applications and using web services. We developed a series of fully functional bioinformatics pipelines addressing common tasks in microarray processing in the Kepler workflow environment. These pipelines consist of a set of tools for GFF file processing of NimbleGen chromatin immunoprecipitation on microarray (ChIP-chip) datasets and more comprehensive workflows for Affymetrix gene expression microarray bioinformatics and basic primer design for PCR experiments, which are often used to validate microarray results. Although functional in themselves, these workflows can be easily customized, extended, or repurposed to match the needs of specific projects and are designed to be a toolkit and starting point for specific applications. These workflows illustrate a workflow programming paradigm focusing on local resources (programs and data) and therefore are close to traditional shell scripting or R/BioConductor scripting approaches to pipeline design. Finally, we suggest that microarray data processing task workflows may provide a basis for future example-based comparison of different workflow systems. We provide a set of tools and complete workflows for microarray data analysis in the Kepler environment, which has the advantages of offering graphical, clear display of conceptual steps and parameters and the ability to easily integrate other resources such as remote data and web services.
Jain, Ruchi; Dey, Bappaditya; Tyagi, Anil K
2012-10-02
The Guinea pig (Cavia porcellus) is one of the most extensively used animal models to study infectious diseases. However, despite its tremendous contribution towards understanding the establishment, progression and control of a number of diseases in general and tuberculosis in particular, the lack of fully annotated guinea pig genome sequence as well as appropriate molecular reagents has severely hampered detailed genetic and immunological analysis in this animal model. By employing the cross-species hybridization technique, we have developed an oligonucleotide microarray with 44,000 features assembled from different mammalian species, which to the best of our knowledge is the first attempt to employ microarray to study the global gene expression profile in guinea pigs. To validate and demonstrate the merit of this microarray, we have studied, as an example, the expression profile of guinea pig lungs during the advanced phase of M. tuberculosis infection. A significant upregulation of 1344 genes and a marked down regulation of 1856 genes in the lungs identified a disease signature of pulmonary tuberculosis infection. We report the development of first comprehensive microarray for studying the global gene expression profile in guinea pigs and validation of its usefulness with tuberculosis as a case study. An important gap in the area of infectious diseases has been addressed and a valuable molecular tool is provided to optimally harness the potential of guinea pig model to develop better vaccines and therapies against human diseases.
Exploiting fluorescence for multiplex immunoassays on protein microarrays
NASA Astrophysics Data System (ADS)
Herbáth, Melinda; Papp, Krisztián; Balogh, Andrea; Matkó, János; Prechl, József
2014-09-01
Protein microarray technology is becoming the method of choice for identifying protein interaction partners, detecting specific proteins, carbohydrates and lipids, or for characterizing protein interactions and serum antibodies in a massively parallel manner. Availability of the well-established instrumentation of DNA arrays and development of new fluorescent detection instruments promoted the spread of this technique. Fluorescent detection has the advantage of high sensitivity, specificity, simplicity and wide dynamic range required by most measurements. Fluorescence through specifically designed probes and an increasing variety of detection modes offers an excellent tool for such microarray platforms. Measuring for example the level of antibodies, their isotypes and/or antigen specificity simultaneously can offer more complex and comprehensive information about the investigated biological phenomenon, especially if we take into consideration that hundreds of samples can be measured in a single assay. Not only body fluids, but also cell lysates, extracted cellular components, and intact living cells can be analyzed on protein arrays for monitoring functional responses to printed samples on the surface. As a rapidly evolving area, protein microarray technology offers a great bulk of information and new depth of knowledge. These are the features that endow protein arrays with wide applicability and robust sample analyzing capability. On the whole, protein arrays are emerging new tools not just in proteomics, but glycomics, lipidomics, and are also important for immunological research. In this review we attempt to summarize the technical aspects of planar fluorescent microarray technology along with the description of its main immunological applications.
Focused Screening of ECM-Selective Adhesion Peptides on Cellulose-Bound Peptide Microarrays.
Kanie, Kei; Kondo, Yuto; Owaki, Junki; Ikeda, Yurika; Narita, Yuji; Kato, Ryuji; Honda, Hiroyuki
2016-11-19
The coating of surfaces with bio-functional proteins is a promising strategy for the creation of highly biocompatible medical implants. Bio-functional proteins from the extracellular matrix (ECM) provide effective surface functions for controlling cellular behavior. We have previously screened bio-functional tripeptides for feasibility of mass production with the aim of identifying those that are medically useful, such as cell-selective peptides. In this work, we focused on the screening of tripeptides that selectively accumulate collagen type IV (Col IV), an ECM protein that accelerates the re-endothelialization of medical implants. A SPOT peptide microarray was selected for screening owing to its unique cellulose membrane platform, which can mimic fibrous scaffolds used in regenerative medicine. However, since the library size on the SPOT microarray was limited, physicochemical clustering was used to provide broader variation than that of random peptide selection. Using the custom focused microarray of 500 selected peptides, we assayed the relative binding rates of tripeptides to Col IV, collagen type I (Col I), and albumin. We discovered a cluster of Col IV-selective adhesion peptides that exhibit bio-safety with endothelial cells. The results from this study can be used to improve the screening of regeneration-enhancing peptides.
Fabrication of high quality cDNA microarray using a small amount of cDNA.
Park, Chan Hee; Jeong, Ha Jin; Jung, Jae Jun; Lee, Gui Yeon; Kim, Sang-Chul; Kim, Tae Soo; Yang, Sang Hwa; Chung, Hyun Cheol; Rha, Sun Young
2004-05-01
DNA microarray technology has become an essential part of biological research. It enables the genome-scale analysis of gene expression in various types of model systems. Manufacturing high quality cDNA microarrays of microdeposition type depends on some key factors including a printing device, spotting pins, glass slides, spotting solution, and humidity during spotting. UsingEthe Microgrid II TAS model printing device, this study defined the optimal conditions for producing high density, high quality cDNA microarrays with the least amount of cDNA product. It was observed that aminosilane-modified slides were superior to other types of surface modified-slides. A humidity of 30+/-3% in a closed environment and the overnight drying of the spotted slides gave the best conditions for arraying. In addition, the cDNA dissolved in 30% DMSO gave the optimal conditions for spotting compared to the 1X ArrayIt, 3X SSC and 50% DMSO. Lastly, cDNA in the concentration range of 100-300 ng/ micro l was determined to be best for arraying and post-processing. Currently, the printing system in this study yields reproducible 9000 spots with a spot size 150 mm diameter, and a 200 nm spot spacing.
Focused Screening of ECM-Selective Adhesion Peptides on Cellulose-Bound Peptide Microarrays
Kanie, Kei; Kondo, Yuto; Owaki, Junki; Ikeda, Yurika; Narita, Yuji; Kato, Ryuji; Honda, Hiroyuki
2016-01-01
The coating of surfaces with bio-functional proteins is a promising strategy for the creation of highly biocompatible medical implants. Bio-functional proteins from the extracellular matrix (ECM) provide effective surface functions for controlling cellular behavior. We have previously screened bio-functional tripeptides for feasibility of mass production with the aim of identifying those that are medically useful, such as cell-selective peptides. In this work, we focused on the screening of tripeptides that selectively accumulate collagen type IV (Col IV), an ECM protein that accelerates the re-endothelialization of medical implants. A SPOT peptide microarray was selected for screening owing to its unique cellulose membrane platform, which can mimic fibrous scaffolds used in regenerative medicine. However, since the library size on the SPOT microarray was limited, physicochemical clustering was used to provide broader variation than that of random peptide selection. Using the custom focused microarray of 500 selected peptides, we assayed the relative binding rates of tripeptides to Col IV, collagen type I (Col I), and albumin. We discovered a cluster of Col IV-selective adhesion peptides that exhibit bio-safety with endothelial cells. The results from this study can be used to improve the screening of regeneration-enhancing peptides. PMID:28952593
Novianti, Putri W; Roes, Kit C B; Eijkemans, Marinus J C
2014-01-01
Classification methods used in microarray studies for gene expression are diverse in the way they deal with the underlying complexity of the data, as well as in the technique used to build the classification model. The MAQC II study on cancer classification problems has found that performance was affected by factors such as the classification algorithm, cross validation method, number of genes, and gene selection method. In this paper, we study the hypothesis that the disease under study significantly determines which method is optimal, and that additionally sample size, class imbalance, type of medical question (diagnostic, prognostic or treatment response), and microarray platform are potentially influential. A systematic literature review was used to extract the information from 48 published articles on non-cancer microarray classification studies. The impact of the various factors on the reported classification accuracy was analyzed through random-intercept logistic regression. The type of medical question and method of cross validation dominated the explained variation in accuracy among studies, followed by disease category and microarray platform. In total, 42% of the between study variation was explained by all the study specific and problem specific factors that we studied together.
Estimating differential expression from multiple indicators
Ilmjärv, Sten; Hundahl, Christian Ansgar; Reimets, Riin; Niitsoo, Margus; Kolde, Raivo; Vilo, Jaak; Vasar, Eero; Luuk, Hendrik
2014-01-01
Regardless of the advent of high-throughput sequencing, microarrays remain central in current biomedical research. Conventional microarray analysis pipelines apply data reduction before the estimation of differential expression, which is likely to render the estimates susceptible to noise from signal summarization and reduce statistical power. We present a probe-level framework, which capitalizes on the high number of concurrent measurements to provide more robust differential expression estimates. The framework naturally extends to various experimental designs and target categories (e.g. transcripts, genes, genomic regions) as well as small sample sizes. Benchmarking in relation to popular microarray and RNA-sequencing data-analysis pipelines indicated high and stable performance on the Microarray Quality Control dataset and in a cell-culture model of hypoxia. Experimental-data-exhibiting long-range epigenetic silencing of gene expression was used to demonstrate the efficacy of detecting differential expression of genomic regions, a level of analysis not embraced by conventional workflows. Finally, we designed and conducted an experiment to identify hypothermia-responsive genes in terms of monotonic time-response. As a novel insight, hypothermia-dependent up-regulation of multiple genes of two major antioxidant pathways was identified and verified by quantitative real-time PCR. PMID:24586062
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.
Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broadmore » panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.« less
Genetic programming based ensemble system for microarray data classification.
Liu, Kun-Hong; Tong, Muchenxuan; Xie, Shu-Tong; Yee Ng, Vincent To
2015-01-01
Recently, more and more machine learning techniques have been applied to microarray data analysis. The aim of this study is to propose a genetic programming (GP) based new ensemble system (named GPES), which can be used to effectively classify different types of cancers. Decision trees are deployed as base classifiers in this ensemble framework with three operators: Min, Max, and Average. Each individual of the GP is an ensemble system, and they become more and more accurate in the evolutionary process. The feature selection technique and balanced subsampling technique are applied to increase the diversity in each ensemble system. The final ensemble committee is selected by a forward search algorithm, which is shown to be capable of fitting data automatically. The performance of GPES is evaluated using five binary class and six multiclass microarray datasets, and results show that the algorithm can achieve better results in most cases compared with some other ensemble systems. By using elaborate base classifiers or applying other sampling techniques, the performance of GPES may be further improved.
Genetic Programming Based Ensemble System for Microarray Data Classification
Liu, Kun-Hong; Tong, Muchenxuan; Xie, Shu-Tong; Yee Ng, Vincent To
2015-01-01
Recently, more and more machine learning techniques have been applied to microarray data analysis. The aim of this study is to propose a genetic programming (GP) based new ensemble system (named GPES), which can be used to effectively classify different types of cancers. Decision trees are deployed as base classifiers in this ensemble framework with three operators: Min, Max, and Average. Each individual of the GP is an ensemble system, and they become more and more accurate in the evolutionary process. The feature selection technique and balanced subsampling technique are applied to increase the diversity in each ensemble system. The final ensemble committee is selected by a forward search algorithm, which is shown to be capable of fitting data automatically. The performance of GPES is evaluated using five binary class and six multiclass microarray datasets, and results show that the algorithm can achieve better results in most cases compared with some other ensemble systems. By using elaborate base classifiers or applying other sampling techniques, the performance of GPES may be further improved. PMID:25810748
Papazisi, Leka; Ratnayake, Shashikala; Remortel, Brian G.; Bock, Geoffrey R.; Liang, Wei; Saeed, Alexander I.; Liu, Jia; Fleischmann, Robert D.; Kilian, Mogens; Peterson, Scott N.
2010-01-01
Here we report the use of a multi-genome DNA microarray to elucidate the genomic events associated with the emergence of the clonal variants of H. influenzae biogroup aegyptius causing Brazilian Purpuric Fever (BPF), an important pediatric disease with a high mortality rate. We performed directed genome sequencing of strain HK1212 unique loci to construct a species DNA microarray. Comparative genome hybridization using this microarray enabled us to determine and compare gene complements, and infer reliable phylogenomic relationships among members of the species. The higher genomic variability observed in the genomes of BPF-related strains (clones) and their close relatives may be characterized by significant gene flux related to a subset of functional role categories. We found that the acquisition of a large number of virulence determinants featuring numerous cell membrane proteins coupled to the loss of genes involved in transport, central biosynthetic pathways and in particular, energy production pathways to be characteristics of the BPF genomic variants. PMID:20654709
Stankiewicz, Paweł; Kulkarni, Shashikant; Dharmadhikari, Avinash V.; Sampath, Srirangan; Bhatt, Samarth S.; Shaikh, Tamim H.; Xia, Zhilian; Pursley, Amber N.; Cooper, M. Lance; Shinawi, Marwan; Paciorkowski, Alex R.; Grange, Dorothy K.; Noetzel, Michael J.; Saunders, Scott; Simons, Paul; Summar, Marshall; Lee, Brendan; Scaglia, Fernando; Fellmann, Florence; Martinet, Danielle; Beckmann, Jacques S.; Asamoah, Alexander; Platky, Kathryn; Sparks, Susan; Martin, Ann S.; Madan-Khetarpal, Suneeta; Hoover, Jacqueline; Medne, Livija; Bonnemann, Carsten G.; Moeschler, John B.; Vallee, Stephanie E.; Parikh, Sumit; Irwin, Polly; Dalzell, Victoria P.; Smith, Wendy E.; Banks, Valerie C.; Flannery, David B.; Lovell, Carolyn M.; Bellus, Gary A.; Golden-Grant, Kathryn; Gorski, Jerome L.; Kussmann, Jennifer L.; McGregor, Tracy L.; Hamid, Rizwan; Pfotenhauer, Jean; Ballif, Blake C.; Shaw, Chad A.; Kang, Sung-Hae L.; Bacino, Carlos A.; Patel, Ankita; Rosenfeld, Jill A.; Cheung, Sau Wai; Shaffer, Lisa G.
2013-01-01
We report 24 unrelated individuals with deletions and 17 additional cases with duplications at 10q11.21q21.1 identified by chromosomal microarray analysis. The rearrangements range in size from 0.3 to 12 Mb. Nineteen of the deletions and eight duplications are flanked by large, directly oriented segmental duplications of >98% sequence identity, suggesting that nonallelic homologous recombination (NAHR) caused these genomic rearrangements. Nine individuals with deletions and five with duplications have additional copy number changes. Detailed clinical evaluation of 20 patients with deletions revealed variable clinical features, with developmental delay (DD) and/or intellectual disability (ID) as the only features common to a majority of individuals. We suggest that some of the other features present in more than one patient with deletion, including hypotonia, sleep apnea, chronic constipation, gastroesophageal and vesicoureteral refluxes, epilepsy, ataxia, dysphagia, nystagmus, and ptosis may result from deletion of the CHAT gene, encoding choline acetyltransferase, and the SLC18A3 gene, mapping in the first intron of CHAT and encoding vesicular acetylcholine transporter. The phenotypic diversity and presence of the deletion in apparently normal carrier parents suggest that subjects carrying 10q11.21q11.23 deletions may exhibit variable phenotypic expressivity and incomplete penetrance influenced by additional genetic and nongenetic modifiers. PMID:21948486
Asoglu, Mehmet Resit; Higgs, Amanda; Esin, Sertac; Kaplan, Julie; Turan, Sifa
2018-06-01
PIK3CA-related overgrowth spectrum, caused by mosaic mutations in the PIK3CA gene, is associated with regional or generalized asymmetric overgrowth of the body or a body part in addition to other clinical findings. Three-dimensional ultrasonography (3-D US) has the capability to display structural abnormalities in soft tissues or other organs, thereby facilitating identification of segmental overgrowth lesions. We present a case suspected of having a segmental overgrowth disorder based on 3-D US, whose chromosomal microarray result was abnormal, but apparently was not the cause of the majority of the fetus's clinical features. © 2017 Wiley Periodicals, Inc.
eQTL Mapping Using RNA-seq Data
Hu, Yijuan
2012-01-01
As RNA-seq is replacing gene expression microarrays to assess genome-wide transcription abundance, gene expression Quantitative Trait Locus (eQTL) studies using RNA-seq have emerged. RNA-seq delivers two novel features that are important for eQTL studies. First, it provides information on allele-specific expression (ASE), which is not available from gene expression microarrays. Second, it generates unprecedentedly rich data to study RNA-isoform expression. In this paper, we review current methods for eQTL mapping using ASE and discuss some future directions. We also review existing works that use RNA-seq data to study RNA-isoform expression and we discuss the gaps between these works and isoform-specific eQTL mapping. PMID:23667399
Single-feature polymorphism discovery in the barley transcriptome
Rostoks, Nils; Borevitz, Justin O; Hedley, Peter E; Russell, Joanne; Mudie, Sharon; Morris, Jenny; Cardle, Linda; Marshall, David F; Waugh, Robbie
2005-01-01
A probe-level model for analysis of GeneChip gene-expression data is presented which identified more than 10,000 single-feature polymorphisms (SFP) between two barley genotypes. The method has good sensitivity, as 67% of known single-nucleotide polymorphisms (SNP) were called as SFPs. This method is applicable to all oligonucleotide microarray data, accounts for SNP effects in gene-expression data and represents an efficient and versatile approach for highly parallel marker identification in large genomes. PMID:15960806
Multiplexed protein profiling on microarrays by rolling-circle amplification
Schweitzer, Barry; Roberts, Scott; Grimwade, Brian; Shao, Weiping; Wang, Minjuan; Fu, Qin; Shu, Quiping; Laroche, Isabelle; Zhou, Zhimin; Tchernev, Velizar T.; Christiansen, Jason; Velleca, Mark; Kingsmore, Stephen F.
2010-01-01
Fluorescent-sandwich immunoassays on microarrays hold appeal for proteomics studies, because equipment and antibodies are readily available, and assays are simple, scalable, and reproducible. The achievement of adequate sensitivity and specificity, however, requires a general method of immunoassay amplification. We describe coupling of isothermal rolling-circle amplification (RCA) to universal antibodies for this purpose. A total of 75 cytokines were measured simultaneously on glass arrays with signal amplification by RCA with high specificity, femtomolar sensitivity, 3 log quantitative range, and economy of sample consumption. A 51-feature RCA cytokine glass array was used to measure secretion from human dendritic cells (DCs) induced by lipopolysaccharide (LPS) or tumor necrosis factor-α (TNF-α). As expected, LPS induced rapid secretion of inflammatory cytokines such as macrophage inflammatory protein (MIP)-1β, interleukin (IL)-8, and interferon-inducible protein (IP)-10. We found that eotaxin-2 and I-309 were induced by LPS; in addition, macrophage-derived chemokine (MDC), thymus and activation-regulated chemokine (TARC), soluble interleukin 6 receptor (sIL-6R), and soluble tumor necrosis factor receptor I (sTNF-RI) were induced by TNF-α treatment. Because microarrays can accommodat ~1,000 sandwich immunoassays of this type, a relatively small number of RCA microarrays seem to offer a tractable approach for proteomic surveys. PMID:11923841
Bruno, D L; Ganesamoorthy, D; Schoumans, J; Bankier, A; Coman, D; Delatycki, M; Gardner, R J M; Hunter, M; James, P A; Kannu, P; McGillivray, G; Pachter, N; Peters, H; Rieubland, C; Savarirayan, R; Scheffer, I E; Sheffield, L; Tan, T; White, S M; Yeung, A; Bowman, Z; Ngo, C; Choy, K W; Cacheux, V; Wong, L; Amor, D J; Slater, H R
2009-02-01
Microarray genome analysis is realising its promise for improving detection of genetic abnormalities in individuals with mental retardation and congenital abnormality. Copy number variations (CNVs) are now readily detectable using a variety of platforms and a major challenge is the distinction of pathogenic from ubiquitous, benign polymorphic CNVs. The aim of this study was to investigate replacement of time consuming, locus specific testing for specific microdeletion and microduplication syndromes with microarray analysis, which theoretically should detect all known syndromes with CNV aetiologies as well as new ones. Genome wide copy number analysis was performed on 117 patients using Affymetrix 250K microarrays. 434 CNVs (195 losses and 239 gains) were found, including 18 pathogenic CNVs and 9 identified as "potentially pathogenic". Almost all pathogenic CNVs were larger than 500 kb, significantly larger than the median size of all CNVs detected. Segmental regions of loss of heterozygosity larger than 5 Mb were found in 5 patients. Genome microarray analysis has improved diagnostic success in this group of patients. Several examples of recently discovered "new syndromes" were found suggesting they are more common than previously suspected and collectively are likely to be a major cause of mental retardation. The findings have several implications for clinical practice. The study revealed the potential to make genetic diagnoses that were not evident in the clinical presentation, with implications for pretest counselling and the consent process. The importance of contributing novel CNVs to high quality databases for genotype-phenotype analysis and review of guidelines for selection of individuals for microarray analysis is emphasised.
Petersen, Jesper; Poulsen, Lena; Birgens, Henrik; Dufva, Martin
2009-01-01
The development of DNA microarray assays is hampered by two important aspects: processing of the microarrays is done under a single stringency condition, and characteristics such as melting temperature are difficult to predict for immobilized probes. A technical solution to these limitations is to use a thermal gradient and information from melting curves, for instance to score genotypes. However, application of temperature gradients normally requires complicated equipment, and the size of the arrays that can be investigated is restricted due to heat dissipation. Here we present a simple microfluidic device that creates a gradient comprising zones of defined ionic strength over a glass slide, in which each zone corresponds to a subarray. Using this device, we demonstrated that ionic strength gradients function in a similar fashion as corresponding thermal gradients in assay development. More specifically, we noted that (i) the two stringency modulators generated melting curves that could be compared, (ii) both led to increased assay robustness, and (iii) both were associated with difficulties in genotyping the same mutation. These findings demonstrate that ionic strength stringency buffers can be used instead of thermal gradients. Given the flexibility of design of ionic gradients, these can be created over all types of arrays, and encompass an attractive alternative to temperature gradients, avoiding curtailment of the size or spacing of subarrays on slides associated with temperature gradients. PMID:19277213
Petersen, Jesper; Poulsen, Lena; Birgens, Henrik; Dufva, Martin
2009-01-01
The development of DNA microarray assays is hampered by two important aspects: processing of the microarrays is done under a single stringency condition, and characteristics such as melting temperature are difficult to predict for immobilized probes. A technical solution to these limitations is to use a thermal gradient and information from melting curves, for instance to score genotypes. However, application of temperature gradients normally requires complicated equipment, and the size of the arrays that can be investigated is restricted due to heat dissipation. Here we present a simple microfluidic device that creates a gradient comprising zones of defined ionic strength over a glass slide, in which each zone corresponds to a subarray. Using this device, we demonstrated that ionic strength gradients function in a similar fashion as corresponding thermal gradients in assay development. More specifically, we noted that (i) the two stringency modulators generated melting curves that could be compared, (ii) both led to increased assay robustness, and (iii) both were associated with difficulties in genotyping the same mutation. These findings demonstrate that ionic strength stringency buffers can be used instead of thermal gradients. Given the flexibility of design of ionic gradients, these can be created over all types of arrays, and encompass an attractive alternative to temperature gradients, avoiding curtailment of the size or spacing of subarrays on slides associated with temperature gradients.
2013-01-01
Background The synthesis of information across microarray studies has been performed by combining statistical results of individual studies (as in a mosaic), or by combining data from multiple studies into a large pool to be analyzed as a single data set (as in a melting pot of data). Specific issues relating to data heterogeneity across microarray studies, such as differences within and between labs or differences among experimental conditions, could lead to equivocal results in a melting pot approach. Results We applied statistical theory to determine the specific effect of different means and heteroskedasticity across 19 groups of microarray data on the sign and magnitude of gene-to-gene Pearson correlation coefficients obtained from the pool of 19 groups. We quantified the biases of the pooled coefficients and compared them to the biases of correlations estimated by an effect-size model. Mean differences across the 19 groups were the main factor determining the magnitude and sign of the pooled coefficients, which showed largest values of bias as they approached ±1. Only heteroskedasticity across the pool of 19 groups resulted in less efficient estimations of correlations than did a classical meta-analysis approach of combining correlation coefficients. These results were corroborated by simulation studies involving either mean differences or heteroskedasticity across a pool of N > 2 groups. Conclusions The combination of statistical results is best suited for synthesizing the correlation between expression profiles of a gene pair across several microarray studies. PMID:23822712
Mining featured biomarkers associated with prostatic carcinoma based on bioinformatics.
Piao, Guanying; Wu, Jiarui
2013-11-01
To analyze the differentially expressed genes and identify featured biomarkers from prostatic carcinoma. The software "Significance Analysis of Microarray" (SAM) was used to identify the differentially coexpressed genes (DCGs). The DCGs existed in two datasets were analyzed by GO (Gene Ontology) functional annotation. A total of 389 DCGs were obtained. By GO analysis, we found these DCGs were closely related with the acinus development, TGF-β receptor and signal transduction pathways. Furthermore, five featured biomarkers were discovered by interaction analysis. These important signal pathways and oncogenes may provide potential therapeutic targets for prostatic carcinoma.
On hydrophilicity improvement of the porous anodic alumina film by hybrid nano/micro structuring
NASA Astrophysics Data System (ADS)
Wang, Weichao; Zhao, Wei; Wang, Kaige; Wang, Lei; Wang, Xuewen; Wang, Shuang; Zhang, Chen; Bai, Jintao
2017-09-01
In both, laboratory and industry, tremendous attention is paid to discover an effective technique to produce uniform, controllable and (super) hydrophilic surfaces over large areas that are useful in a wide range of applications. In this investigation, by combing porous anodic alumina (PAA) film with nano-structures and microarray of aluminum, the hydrophilicity of hybrid nano-micro structure has been significantly improved. It is found some factors can affect the hydrophilicity of film, such as the size and aspect ratio of microarray, the thickness of nano-PAA film etc. Comparing with pure nano-PAA films and microarray, the hybrid nano-micro structure can provide uniform surface with significantly better hydrophilicity. The improvement can be up to 84%. Also, this technique exhibits good stability and repeatability for industrial production. By optimizing the thickness of nano-PAA film and aspect ratio of micro-structures, super-hydrophilicity can be reached. This study has obvious prospect in the fields of chemical industry, biomedical engineering and lab-on-a-chip applications.
Patel, Isha R.; Gangiredla, Jayanthi; Lacher, David W.; Mammel, Mark K.; Jackson, Scott A.; Lampel, Keith A.
2016-01-01
ABSTRACT Most Escherichia coli strains are nonpathogenic. However, for clinical diagnosis and food safety analysis, current identification methods for pathogenic E. coli either are time-consuming and/or provide limited information. Here, we utilized a custom DNA microarray with informative genetic features extracted from 368 sequence sets for rapid and high-throughput pathogen identification. The FDA Escherichia coli Identification (FDA-ECID) platform contains three sets of molecularly informative features that together stratify strain identification and relatedness. First, 53 known flagellin alleles, 103 alleles of wzx and wzy, and 5 alleles of wzm provide molecular serotyping utility. Second, 41,932 probe sets representing the pan-genome of E. coli provide strain-level gene content information. Third, approximately 125,000 single nucleotide polymorphisms (SNPs) of available whole-genome sequences (WGS) were distilled to 9,984 SNPs capable of recapitulating the E. coli phylogeny. We analyzed 103 diverse E. coli strains with available WGS data, including those associated with past foodborne illnesses, to determine robustness and accuracy. The array was able to accurately identify the molecular O and H serotypes, potentially correcting serological failures and providing better resolution for H-nontypeable/nonmotile phenotypes. In addition, molecular risk assessment was possible with key virulence marker identifications. Epidemiologically, each strain had a unique comparative genomic fingerprint that was extended to an additional 507 food and clinical isolates. Finally, a 99.7% phylogenetic concordance was established between microarray analysis and WGS using SNP-level data for advanced genome typing. Our study demonstrates FDA-ECID as a powerful tool for epidemiology and molecular risk assessment with the capacity to profile the global landscape and diversity of E. coli. IMPORTANCE This study describes a robust, state-of-the-art platform developed from available whole-genome sequences of E. coli and Shigella spp. by distilling useful signatures for epidemiology and molecular risk assessment into one assay. The FDA-ECID microarray contains features that enable comprehensive molecular serotyping and virulence profiling along with genome-scale genotyping and SNP analysis. Hence, it is a molecular toolbox that stratifies strain identification and pathogenic potential in the contexts of epidemiology and phylogeny. We applied this tool to strains from food, environmental, and clinical sources, resulting in significantly greater phylogenetic and strain-specific resolution than previously reported for available typing methods. PMID:27037122
The expression of Toll-like receptors 2, 4, 5, 7 and 9 in Merkel cell carcinoma.
Jouhi, Lauri; Koljonen, Virve; Böhling, Tom; Haglund, Caj; Hagström, Jaana
2015-04-01
We sought to clarify whether the expression of toll-like receptors (TLR) in Merkel cell carcinoma (MCC) is linked to tumor and patient characteristics, especially the presence of Merkel cell polyoma virus (MCV). The study comprised of 128 patients with data on Merkel cell polyomavirus (MCV) status and clinical features were included in the study. Immunohistochemistry for TLR expression was performed on tissue microarray (TMA) slides. TLR 2, 4, 5, 7 and 9 expression was noted in most of the tumor specimens. Decreased expression of TLR 9 correlated strongly with MCV positivity. Cytoplasmic TLR 2 expression correlated with small tumor size, while nuclear TLR 2 and TLR 5 expressions with larger tumors. Increased nuclear TLR 4 expression and decreased TLR 7 expression were associated with older age. TLR 2, 4, 5, 7 and 9 appear to reflect certain clinicopathological variables and prognostic markers of MCC tumors. Copyright© 2015 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.
Adema, Coen M; Hanington, Patrick C.; Lun, Cheng-Man; Rosenberg, George H.; Aragon, Anthony D; Stout, Barbara A; Richard, Mara L. Lennard; Gross, Paul S.; Loker, Eric S
2009-01-01
A 70-mer oligonucleotide-based microarray (1152 features) that emphasizes stress and immune responses factors was constructed to study transcriptomic responses of the snail Biomphalaria glabrata to different immune challenges. In addition to sequences with relevant putative ID and Gene Ontology (GO) annotation, the array features non-immune factors and unknown B. glabrata ESTs for functional gene discovery. The transcription profiles of B. glabrata (3 biological replicates, each a pool of 5 snails) were recorded at 12 hours post wounding, exposure to Gram negative or Gram positive bacteria (Escherichia coli and Micrococcus luteus, respectively), or infection with compatible trematode parasites (S. mansoni or E. paraensei, 20 miracidia/snail), relative to controls, using universal reference RNA. The data were subjected to Significance Analysis for Microarrays (SAM), with a false positive rate (FPR) ≤10%. Wounding yielded a modest differential expression profile (27 up/21 down) with affected features mostly dissimilar from other treatments. Partially overlapping, yet distinct expression profiles were recorded from snails challenged with E. coli (83 up/20 down) or M. luteus (120 up/42 down), mostly showing up-regulation of defense and stress-related features. Significantly altered expression of selected immune features indicates that B. glabrata detects and responds differently to compatible trematodes. Echinostoma paraensei infection was associated mostly with down regulation of many (immune-) transcripts (42 up/68 down), whereas S. mansoni exposure yielded a preponderance of up-regulated features (140 up/23 down), with only few known immune genes affected. These observations may reflect the divergent strategies developed by trematodes during their evolution as specialized pathogens of snails to negate host defense responses. Clearly, the immune defenses of B. glabrata distinguish and respond differently to various immune challenges. PMID:19962194
Saini, Harsh; Lal, Sunil Pranit; Naidu, Vimal Vikash; Pickering, Vincel Wince; Singh, Gurmeet; Tsunoda, Tatsuhiko; Sharma, Alok
2016-12-05
High dimensional feature space generally degrades classification in several applications. In this paper, we propose a strategy called gene masking, in which non-contributing dimensions are heuristically removed from the data to improve classification accuracy. Gene masking is implemented via a binary encoded genetic algorithm that can be integrated seamlessly with classifiers during the training phase of classification to perform feature selection. It can also be used to discriminate between features that contribute most to the classification, thereby, allowing researchers to isolate features that may have special significance. This technique was applied on publicly available datasets whereby it substantially reduced the number of features used for classification while maintaining high accuracies. The proposed technique can be extremely useful in feature selection as it heuristically removes non-contributing features to improve the performance of classifiers.
Hidden discriminative features extraction for supervised high-order time series modeling.
Nguyen, Ngoc Anh Thi; Yang, Hyung-Jeong; Kim, Sunhee
2016-11-01
In this paper, an orthogonal Tucker-decomposition-based extraction of high-order discriminative subspaces from a tensor-based time series data structure is presented, named as Tensor Discriminative Feature Extraction (TDFE). TDFE relies on the employment of category information for the maximization of the between-class scatter and the minimization of the within-class scatter to extract optimal hidden discriminative feature subspaces that are simultaneously spanned by every modality for supervised tensor modeling. In this context, the proposed tensor-decomposition method provides the following benefits: i) reduces dimensionality while robustly mining the underlying discriminative features, ii) results in effective interpretable features that lead to an improved classification and visualization, and iii) reduces the processing time during the training stage and the filtering of the projection by solving the generalized eigenvalue issue at each alternation step. Two real third-order tensor-structures of time series datasets (an epilepsy electroencephalogram (EEG) that is modeled as channel×frequency bin×time frame and a microarray data that is modeled as gene×sample×time) were used for the evaluation of the TDFE. The experiment results corroborate the advantages of the proposed method with averages of 98.26% and 89.63% for the classification accuracies of the epilepsy dataset and the microarray dataset, respectively. These performance averages represent an improvement on those of the matrix-based algorithms and recent tensor-based, discriminant-decomposition approaches; this is especially the case considering the small number of samples that are used in practice. Copyright © 2016 Elsevier Ltd. All rights reserved.
ArrayWiki: an enabling technology for sharing public microarray data repositories and meta-analyses
Stokes, Todd H; Torrance, JT; Li, Henry; Wang, May D
2008-01-01
Background A survey of microarray databases reveals that most of the repository contents and data models are heterogeneous (i.e., data obtained from different chip manufacturers), and that the repositories provide only basic biological keywords linking to PubMed. As a result, it is difficult to find datasets using research context or analysis parameters information beyond a few keywords. For example, to reduce the "curse-of-dimension" problem in microarray analysis, the number of samples is often increased by merging array data from different datasets. Knowing chip data parameters such as pre-processing steps (e.g., normalization, artefact removal, etc), and knowing any previous biological validation of the dataset is essential due to the heterogeneity of the data. However, most of the microarray repositories do not have meta-data information in the first place, and do not have a a mechanism to add or insert this information. Thus, there is a critical need to create "intelligent" microarray repositories that (1) enable update of meta-data with the raw array data, and (2) provide standardized archiving protocols to minimize bias from the raw data sources. Results To address the problems discussed, we have developed a community maintained system called ArrayWiki that unites disparate meta-data of microarray meta-experiments from multiple primary sources with four key features. First, ArrayWiki provides a user-friendly knowledge management interface in addition to a programmable interface using standards developed by Wikipedia. Second, ArrayWiki includes automated quality control processes (caCORRECT) and novel visualization methods (BioPNG, Gel Plots), which provide extra information about data quality unavailable in other microarray repositories. Third, it provides a user-curation capability through the familiar Wiki interface. Fourth, ArrayWiki provides users with simple text-based searches across all experiment meta-data, and exposes data to search engine crawlers (Semantic Agents) such as Google to further enhance data discovery. Conclusions Microarray data and meta information in ArrayWiki are distributed and visualized using a novel and compact data storage format, BioPNG. Also, they are open to the research community for curation, modification, and contribution. By making a small investment of time to learn the syntax and structure common to all sites running MediaWiki software, domain scientists and practioners can all contribute to make better use of microarray technologies in research and medical practices. ArrayWiki is available at . PMID:18541053
Chang, Vivian Y; Quintero-Rivera, Fabiola; Baldwin, Erin E; Woo, Kathy; Martinez-Agosto, Julian A; Fu, Cecilia; Gomperts, Brigitte N
2011-03-01
Duplication 22q11.2 syndrome is the result of a microduplication of the same chromosomal region that is deleted in DiGeorge and Velocardiofacial syndromes. We describe a patient with dysmorphic features who was diagnosed with pre-B acute lymphoblastic leukemia, and developed cystinuria and pancreatitis during treatment. Duplication 22q11.2 has not been previously described in association with hematologic abnormalities. Chromosomal microarray technology was used to diagnose duplication 22q11.2 syndrome. In this era of advanced genomics, this technology has become an important method for helping to determine the molecular basis of diseases, best treatments and ultimately patient outcomes. Copyright © 2010 Wiley-Liss, Inc.
Evolutionary Approach for Relative Gene Expression Algorithms
Czajkowski, Marcin
2014-01-01
A Relative Expression Analysis (RXA) uses ordering relationships in a small collection of genes and is successfully applied to classiffication using microarray data. As checking all possible subsets of genes is computationally infeasible, the RXA algorithms require feature selection and multiple restrictive assumptions. Our main contribution is a specialized evolutionary algorithm (EA) for top-scoring pairs called EvoTSP which allows finding more advanced gene relations. We managed to unify the major variants of relative expression algorithms through EA and introduce weights to the top-scoring pairs. Experimental validation of EvoTSP on public available microarray datasets showed that the proposed solution significantly outperforms in terms of accuracy other relative expression algorithms and allows exploring much larger solution space. PMID:24790574
NASA Astrophysics Data System (ADS)
Liu, Robin H.; Lodes, Mike; Fuji, H. Sho; Danley, David; McShea, Andrew
Microarray assays typically involve multistage sample processing and fluidic handling, which are generally labor-intensive and time-consuming. Automation of these processes would improve robustness, reduce run-to-run and operator-to-operator variation, and reduce costs. In this chapter, a fully integrated and self-contained microfluidic biochip device that has been developed to automate the fluidic handling steps for microarray-based gene expression or genotyping analysis is presented. The device consists of a semiconductor-based CustomArray® chip with 12,000 features and a microfluidic cartridge. The CustomArray was manufactured using a semiconductor-based in situ synthesis technology. The micro-fluidic cartridge consists of microfluidic pumps, mixers, valves, fluid channels, and reagent storage chambers. Microarray hybridization and subsequent fluidic handling and reactions (including a number of washing and labeling steps) were performed in this fully automated and miniature device before fluorescent image scanning of the microarray chip. Electrochemical micropumps were integrated in the cartridge to provide pumping of liquid solutions. A micromixing technique based on gas bubbling generated by electrochemical micropumps was developed. Low-cost check valves were implemented in the cartridge to prevent cross-talk of the stored reagents. Gene expression study of the human leukemia cell line (K562) and genotyping detection and sequencing of influenza A subtypes have been demonstrated using this integrated biochip platform. For gene expression assays, the microfluidic CustomArray device detected sample RNAs with a concentration as low as 0.375 pM. Detection was quantitative over more than three orders of magnitude. Experiment also showed that chip-to-chip variability was low indicating that the integrated microfluidic devices eliminate manual fluidic handling steps that can be a significant source of variability in genomic analysis. The genotyping results showed that the device identified influenza A hemagglutinin and neuraminidase subtypes and sequenced portions of both genes, demonstrating the potential of integrated microfluidic and microarray technology for multiple virus detection. The device provides a cost-effective solution to eliminate labor-intensive and time-consuming fluidic handling steps and allows microarray-based DNA analysis in a rapid and automated fashion.
Wang, Jing-Jing; Sun, Xi-Cai; Hu, Li; Liu, Zhuo-Fu; Yu, Hua-Peng; Li, Han; Wang, Shu-Yi; Wang, De-Hui
2013-12-01
The purpose of this study was to examine endoglin (CD105) expression on microvessel endothelial cells (ECs) in juvenile nasopharyngeal angiofibroma (JNA) and its relationship with recurrence. Immunohistochemistry was performed to detect CD105 expression in a tissue microarray from 70 patients with JNA. Correlation between CD105 expression on microvessel ECs and clinicopathological features, as well as tumor recurrence, were analyzed. Immunohistochemistry revealed CD105 expression on ECs but not in stroma of patients with JNA. Chi-square analysis indicated CD105-based microvessel density (MVD) was correlated with JNA recurrence (p = .013). Univariate and multivariate analyses determined that MVD was a significant predictor of time to recurrence (p = .009). The CD105-based MVD was better for predicting disease recurrence (AUROC: 0.673; p = .036) than other clinicopathological features. MVD is a useful predictor for poor prognosis of patients with JNA after curative resection. Angiogenesis, which may play an important role in the occurrence and development of JNA, is therefore a potential therapeutic target for JNA. Copyright © 2013 Wiley Periodicals, Inc., A Wiley Company.
A Granular Self-Organizing Map for Clustering and Gene Selection in Microarray Data.
Ray, Shubhra Sankar; Ganivada, Avatharam; Pal, Sankar K
2016-09-01
A new granular self-organizing map (GSOM) is developed by integrating the concept of a fuzzy rough set with the SOM. While training the GSOM, the weights of a winning neuron and the neighborhood neurons are updated through a modified learning procedure. The neighborhood is newly defined using the fuzzy rough sets. The clusters (granules) evolved by the GSOM are presented to a decision table as its decision classes. Based on the decision table, a method of gene selection is developed. The effectiveness of the GSOM is shown in both clustering samples and developing an unsupervised fuzzy rough feature selection (UFRFS) method for gene selection in microarray data. While the superior results of the GSOM, as compared with the related clustering methods, are provided in terms of β -index, DB-index, Dunn-index, and fuzzy rough entropy, the genes selected by the UFRFS are not only better in terms of classification accuracy and a feature evaluation index, but also statistically more significant than the related unsupervised methods. The C-codes of the GSOM and UFRFS are available online at http://avatharamg.webs.com/software-code.
Allen Brain Atlas: an integrated spatio-temporal portal for exploring the central nervous system
Sunkin, Susan M.; Ng, Lydia; Lau, Chris; Dolbeare, Tim; Gilbert, Terri L.; Thompson, Carol L.; Hawrylycz, Michael; Dang, Chinh
2013-01-01
The Allen Brain Atlas (http://www.brain-map.org) provides a unique online public resource integrating extensive gene expression data, connectivity data and neuroanatomical information with powerful search and viewing tools for the adult and developing brain in mouse, human and non-human primate. Here, we review the resources available at the Allen Brain Atlas, describing each product and data type [such as in situ hybridization (ISH) and supporting histology, microarray, RNA sequencing, reference atlases, projection mapping and magnetic resonance imaging]. In addition, standardized and unique features in the web applications are described that enable users to search and mine the various data sets. Features include both simple and sophisticated methods for gene searches, colorimetric and fluorescent ISH image viewers, graphical displays of ISH, microarray and RNA sequencing data, Brain Explorer software for 3D navigation of anatomy and gene expression, and an interactive reference atlas viewer. In addition, cross data set searches enable users to query multiple Allen Brain Atlas data sets simultaneously. All of the Allen Brain Atlas resources can be accessed through the Allen Brain Atlas data portal. PMID:23193282
Characterization of genetic variability of Venezuelan equine encephalitis viruses
Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.; ...
2016-04-07
Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broadmore » panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.« less
Papazisi, Leka; Ratnayake, Shashikala; Remortel, Brian G; Bock, Geoffrey R; Liang, Wei; Saeed, Alexander I; Liu, Jia; Fleischmann, Robert D; Kilian, Mogens; Peterson, Scott N
2010-11-01
Here we report the use of a multi-genome DNA microarray to elucidate the genomic events associated with the emergence of the clonal variants of Haemophilus influenzae biogroup aegyptius causing Brazilian Purpuric Fever (BPF), an important pediatric disease with a high mortality rate. We performed directed genome sequencing of strain HK1212 unique loci to construct a species DNA microarray. Comparative genome hybridization using this microarray enabled us to determine and compare gene complements, and infer reliable phylogenomic relationships among members of the species. The higher genomic variability observed in the genomes of BPF-related strains (clones) and their close relatives may be characterized by significant gene flux related to a subset of functional role categories. We found that the acquisition of a large number of virulence determinants featuring numerous cell membrane proteins coupled to the loss of genes involved in transport, central biosynthetic pathways and in particular, energy production pathways to be characteristics of the BPF genomic variants. Copyright © 2010 Elsevier Inc. All rights reserved.
Removing technical variability in RNA-seq data using conditional quantile normalization.
Hansen, Kasper D; Irizarry, Rafael A; Wu, Zhijin
2012-04-01
The ability to measure gene expression on a genome-wide scale is one of the most promising accomplishments in molecular biology. Microarrays, the technology that first permitted this, were riddled with problems due to unwanted sources of variability. Many of these problems are now mitigated, after a decade's worth of statistical methodology development. The recently developed RNA sequencing (RNA-seq) technology has generated much excitement in part due to claims of reduced variability in comparison to microarrays. However, we show that RNA-seq data demonstrate unwanted and obscuring variability similar to what was first observed in microarrays. In particular, we find guanine-cytosine content (GC-content) has a strong sample-specific effect on gene expression measurements that, if left uncorrected, leads to false positives in downstream results. We also report on commonly observed data distortions that demonstrate the need for data normalization. Here, we describe a statistical methodology that improves precision by 42% without loss of accuracy. Our resulting conditional quantile normalization algorithm combines robust generalized regression to remove systematic bias introduced by deterministic features such as GC-content and quantile normalization to correct for global distortions.
Liu, Shu; Wang, Zhiqing; Wei, Sisi; Liang, Jinqun; Chen, Nuan; OuYang, Haimei; Zeng, Weihong; Chen, Liying; Xie, Xunjie; Jiang, Jianhui
2018-04-14
Ring chromosome 6, r(6), is an extremely rare cytogenetic abnormality with clinical heterogeneity which arises typically de novo. The phenotypes of r(6) can be highly variable, ranging from almost normal to severe malformations and neurological defects. Up to now, only 33 cases have been reported in the literature. In this 10-year follow-up study, we report a case presenting distinctive facial features, severe developmental delay, and gray matter heterotopia with r(6) and terminal deletions of 6p25.3 (115426-384174, 268 kb) and 6q26-27 (168697778-170732033, 2.03 Mb) encompassing 2 and 15 candidate genes, respectively, which were detected using G-banding karyotyping, FISH, and chromosomal microarray analysis. We also analyzed the available information on the clinical features of the reported r(6) cases in order to provide more valuable information on genotype-phenotype correlations. To the best of our knowledge, this is the first report of gray matter heterotopia manifested in a patient with r(6) in China, and the deletions of 6p and 6q in our case are the smallest with the precise size of euchromatic material loss currently known. © 2018 S. Karger AG, Basel.
Adaptive wettability-enhanced surfaces ordered on molded etched substrates using shrink film
NASA Astrophysics Data System (ADS)
Jayadev, Shreshta; Pegan, Jonathan; Dyer, David; McLane, Jolie; Lim, Jessica; Khine, Michelle
2013-01-01
Superhydrophobic surfaces in nature exhibit desirable properties including self-cleaning, bacterial resistance, and flight efficiency. However, creating such intricate multi-scale features with conventional fabrication approaches is difficult, expensive, and not scalable. By patterning photoresist on pre-stressed shrink-wrap film, which contracts by 95% in surface area when heated, such features over large areas can be obtained easily. Photoresist serves as a dry etch mask to create complex and high-aspect ratio microstructures in the film. Using a double-shrink process, we introduce adaptive wettability-enhanced surfaces ordered on molded etched (AWESOME) substrates. We first create a mask out of the children’s toy ‘Shrinky-Dinks’ by printing dots using a laserjet printer. Heating this thermoplastic sheet causes the printed dots to shrink to a fraction of their original size. We then lithographically transfer the inverse pattern onto photoresist-coated shrink-wrap polyolefin film. The film is then plasma etched. After shrinking, the film serves as a high-aspect ratio mold for polydimethylsiloxane, creating a superhydrophobic surface with water contact angles >150° and sliding angles <10°. We pattern a microarray of ‘sticky’ spots with a dramatically different sliding angle compared to that of the superhydrophobic region, enabling microtiter-plate type assays without the need for a well plate.
Das, Sayan; Bhat, Prasanna R; Sudhakar, Chinta; Ehlers, Jeffrey D; Wanamaker, Steve; Roberts, Philip A; Cui, Xinping; Close, Timothy J
2008-02-28
Cowpea (Vigna unguiculata L. Walp) is an important food and fodder legume of the semiarid tropics and subtropics worldwide, especially in sub-Saharan Africa. High density genetic linkage maps are needed for marker assisted breeding but are not available for cowpea. A single feature polymorphism (SFP) is a microarray-based marker which can be used for high throughput genotyping and high density mapping. Here we report detection and validation of SFPs in cowpea using a readily available soybean (Glycine max) genome array. Robustified projection pursuit (RPP) was used for statistical analysis using RNA as a surrogate for DNA. Using a 15% outlying score cut-off, 1058 potential SFPs were enumerated between two parents of a recombinant inbred line (RIL) population segregating for several important traits including drought tolerance, Fusarium and brown blotch resistance, grain size and photoperiod sensitivity. Sequencing of 25 putative polymorphism-containing amplicons yielded a SFP probe set validation rate of 68%. We conclude that the Affymetrix soybean genome array is a satisfactory platform for identification of some 1000's of SFPs for cowpea. This study provides an example of extension of genomic resources from a well supported species to an orphan crop. Presumably, other legume systems are similarly tractable to SFP marker development using existing legume array resources.
Leung, Yuk Yee; Chang, Chun Qi; Hung, Yeung Sam
2012-01-01
Using hybrid approach for gene selection and classification is common as results obtained are generally better than performing the two tasks independently. Yet, for some microarray datasets, both classification accuracy and stability of gene sets obtained still have rooms for improvement. This may be due to the presence of samples with wrong class labels (i.e. outliers). Outlier detection algorithms proposed so far are either not suitable for microarray data, or only solve the outlier detection problem on their own. We tackle the outlier detection problem based on a previously proposed Multiple-Filter-Multiple-Wrapper (MFMW) model, which was demonstrated to yield promising results when compared to other hybrid approaches (Leung and Hung, 2010). To incorporate outlier detection and overcome limitations of the existing MFMW model, three new features are introduced in our proposed MFMW-outlier approach: 1) an unbiased external Leave-One-Out Cross-Validation framework is developed to replace internal cross-validation in the previous MFMW model; 2) wrongly labeled samples are identified within the MFMW-outlier model; and 3) a stable set of genes is selected using an L1-norm SVM that removes any redundant genes present. Six binary-class microarray datasets were tested. Comparing with outlier detection studies on the same datasets, MFMW-outlier could detect all the outliers found in the original paper (for which the data was provided for analysis), and the genes selected after outlier removal were proven to have biological relevance. We also compared MFMW-outlier with PRAPIV (Zhang et al., 2006) based on same synthetic datasets. MFMW-outlier gave better average precision and recall values on three different settings. Lastly, artificially flipped microarray datasets were created by removing our detected outliers and flipping some of the remaining samples' labels. Almost all the 'wrong' (artificially flipped) samples were detected, suggesting that MFMW-outlier was sufficiently powerful to detect outliers in high-dimensional microarray datasets.
Alshamlan, Hala; Badr, Ghada; Alohali, Yousef
2015-01-01
An artificial bee colony (ABC) is a relatively recent swarm intelligence optimization approach. In this paper, we propose the first attempt at applying ABC algorithm in analyzing a microarray gene expression profile. In addition, we propose an innovative feature selection algorithm, minimum redundancy maximum relevance (mRMR), and combine it with an ABC algorithm, mRMR-ABC, to select informative genes from microarray profile. The new approach is based on a support vector machine (SVM) algorithm to measure the classification accuracy for selected genes. We evaluate the performance of the proposed mRMR-ABC algorithm by conducting extensive experiments on six binary and multiclass gene expression microarray datasets. Furthermore, we compare our proposed mRMR-ABC algorithm with previously known techniques. We reimplemented two of these techniques for the sake of a fair comparison using the same parameters. These two techniques are mRMR when combined with a genetic algorithm (mRMR-GA) and mRMR when combined with a particle swarm optimization algorithm (mRMR-PSO). The experimental results prove that the proposed mRMR-ABC algorithm achieves accurate classification performance using small number of predictive genes when tested using both datasets and compared to previously suggested methods. This shows that mRMR-ABC is a promising approach for solving gene selection and cancer classification problems. PMID:25961028
Alshamlan, Hala; Badr, Ghada; Alohali, Yousef
2015-01-01
An artificial bee colony (ABC) is a relatively recent swarm intelligence optimization approach. In this paper, we propose the first attempt at applying ABC algorithm in analyzing a microarray gene expression profile. In addition, we propose an innovative feature selection algorithm, minimum redundancy maximum relevance (mRMR), and combine it with an ABC algorithm, mRMR-ABC, to select informative genes from microarray profile. The new approach is based on a support vector machine (SVM) algorithm to measure the classification accuracy for selected genes. We evaluate the performance of the proposed mRMR-ABC algorithm by conducting extensive experiments on six binary and multiclass gene expression microarray datasets. Furthermore, we compare our proposed mRMR-ABC algorithm with previously known techniques. We reimplemented two of these techniques for the sake of a fair comparison using the same parameters. These two techniques are mRMR when combined with a genetic algorithm (mRMR-GA) and mRMR when combined with a particle swarm optimization algorithm (mRMR-PSO). The experimental results prove that the proposed mRMR-ABC algorithm achieves accurate classification performance using small number of predictive genes when tested using both datasets and compared to previously suggested methods. This shows that mRMR-ABC is a promising approach for solving gene selection and cancer classification problems.
An integrated method for cancer classification and rule extraction from microarray data
Huang, Liang-Tsung
2009-01-01
Different microarray techniques recently have been successfully used to investigate useful information for cancer diagnosis at the gene expression level due to their ability to measure thousands of gene expression levels in a massively parallel way. One important issue is to improve classification performance of microarray data. However, it would be ideal that influential genes and even interpretable rules can be explored at the same time to offer biological insight. Introducing the concepts of system design in software engineering, this paper has presented an integrated and effective method (named X-AI) for accurate cancer classification and the acquisition of knowledge from DNA microarray data. This method included a feature selector to systematically extract the relative important genes so as to reduce the dimension and retain as much as possible of the class discriminatory information. Next, diagonal quadratic discriminant analysis (DQDA) was combined to classify tumors, and generalized rule induction (GRI) was integrated to establish association rules which can give an understanding of the relationships between cancer classes and related genes. Two non-redundant datasets of acute leukemia were used to validate the proposed X-AI, showing significantly high accuracy for discriminating different classes. On the other hand, I have presented the abilities of X-AI to extract relevant genes, as well as to develop interpretable rules. Further, a web server has been established for cancer classification and it is freely available at . PMID:19272192
Ma, Y; Dai, X; Hong, T; Munk, G B; Libera, M
2016-12-19
Despite their many advantages and successes, molecular beacon (MB) hybridization probes have not been extensively used in microarray formats because of the complicating probe-substrate interactions that increase the background intensity. We have previously shown that tethering to surface-patterned microgels is an effective means for localizing MB probes to specific surface locations in a microarray format while simultaneously maintaining them in as water-like an environment as possible and minimizing probe-surface interactions. Here we extend this approach to include both real-time detection together with integrated NASBA amplification. We fabricate small (∼250 μm × 250 μm) simplex, duplex, and five-plex assays with microarray spots of controllable size (∼20 μm diameter), position, and shape to detect bacteria and fungi in a bloodstream-infection model. The targets, primers, and microgel-tethered probes can be combined in a single isothermal reaction chamber with no post-amplification labelling. We extract total RNA from clinical blood samples and differentiate between Gram-positive and Gram-negative bloodstream infection in a duplex assay to detect RNA- amplicons. The sensitivity based on our current protocols in a simplex assay to detect specific ribosomal RNA sequences within total RNA extracted from S. aureus and E. coli cultures corresponds to tens of bacteria per ml. We furthermore show that the platform can detect RNA- amplicons from synthetic target DNA with 1 fM sensitivity in sample volumes that contain about 12 000 DNA molecules. These experiments demonstrate an alternative approach that can enable rapid and real-time microarray-based molecular diagnostics.
Gene set analysis approaches for RNA-seq data: performance evaluation and application guideline
Rahmatallah, Yasir; Emmert-Streib, Frank
2016-01-01
Transcriptome sequencing (RNA-seq) is gradually replacing microarrays for high-throughput studies of gene expression. The main challenge of analyzing microarray data is not in finding differentially expressed genes, but in gaining insights into the biological processes underlying phenotypic differences. To interpret experimental results from microarrays, gene set analysis (GSA) has become the method of choice, in particular because it incorporates pre-existing biological knowledge (in a form of functionally related gene sets) into the analysis. Here we provide a brief review of several statistically different GSA approaches (competitive and self-contained) that can be adapted from microarrays practice as well as those specifically designed for RNA-seq. We evaluate their performance (in terms of Type I error rate, power, robustness to the sample size and heterogeneity, as well as the sensitivity to different types of selection biases) on simulated and real RNA-seq data. Not surprisingly, the performance of various GSA approaches depends only on the statistical hypothesis they test and does not depend on whether the test was developed for microarrays or RNA-seq data. Interestingly, we found that competitive methods have lower power as well as robustness to the samples heterogeneity than self-contained methods, leading to poor results reproducibility. We also found that the power of unsupervised competitive methods depends on the balance between up- and down-regulated genes in tested gene sets. These properties of competitive methods have been overlooked before. Our evaluation provides a concise guideline for selecting GSA approaches, best performing under particular experimental settings in the context of RNA-seq. PMID:26342128
Chen, Kai; Cai, Yuanxia; Zhang, Min; Wu, Zhixiang; Wu, Yeming
2018-05-24
Neuroblastoma is one of the most common pediatric solid tumors. Although the 5-year overall survival rate has increased over the past few decades, high-risk patients still have a poor prognosis due to a lack of biomonitoring therapy. This study was performed to investigate the role of Galectin-1 in neuroblastoma biomonitoring therapy. A tissue microarray containing 37 neuroblastoma tissue samples was used to evaluate the correlation between Galectin-1 expression and clinical features. Blood samples were examined to better understand whether serum Galectin-1 (sGalectin-1) could be used for biomonitoring therapy. Kaplan-Meier analysis and ROC analysis was conducted to distinguish the outcome associated with high or low expression of Galectin-1 in patients with neuroblastoma. Increased Galectin-1 expression was found in neuroblastoma and it was further demonstrated that elevated tissue Galectin-1 expression was related to INSS stage, histology, bone marrow metastasis, and poor survival. sGalectin-1 levels were higher in newly diagnosed patients with neuroblastoma than healthy subjects. Patients with elevated sGalectin-1 through treatment cycles correlated with the poor chemo-responses and tended to have worse outcomes, such as metastasis or stable tumor size, whereas gradually decreasing sGalectin-1 levels correlated with no observed progression in clinical symptoms. Tissue and serum Galectin-1 levels were associated with adverse clinical features in patients with neuroblastoma, and sGalectin-1 could be a potential biomarker for monitoring therapy. © 2018 Wiley Periodicals, Inc.
Sansović, Ivona; Ivankov, Ana-Maria; Bobinec, Adriana; Kero, Mijana; Barišić, Ingeborg
2017-06-14
To determine the diagnostic yield and criteria that could help to classify and interpret the copy number variations (CNVs) detected by chromosomal microarray (CMA) technique in patients with congenital and developmental abnormalities including dysmorphia, developmental delay (DD) or intellectual disability (ID), autism spectrum disorders (ASD) and congenital anomalies (CA). CMA analysis was performed in 337 patients with DD/ID with or without dysmorphism, ASD, and/or CA. In 30 of 337 patients, chromosomal imbalances had previously been detected by classical cytogenetic and molecular cytogenetic methods. In 73 of 337 patients, clinically relevant variants were detected and better characterized. Most of them were >1 Mb. Variants of unknown clinical significance (VOUS) were discovered in 35 patients. The most common VOUS size category was <300 kb (40.5%). Deletions and de novo imbalances were more frequent in pathogenic CNV than in VOUS category. CMA had a high diagnostic yield of 43/307, excluding patients previously detected by other methods. CMA was valuable in establishing the diagnosis in a high proportion of patients. Criteria for classification and interpretation of CNVs include CNV size and type, mode of inheritance, and genotype-phenotype correlation. Agilent ISCA v2 Human Genome 8x60 K oligonucleotide microarray format proved to be reasonable resolution for clinical use, particularly in the regions that are recommended by the International Standard Cytogenomic Array (ISCA) Consortium and associated with well-established syndromes.
A comprehensive simulation study on classification of RNA-Seq data.
Zararsız, Gökmen; Goksuluk, Dincer; Korkmaz, Selcuk; Eldem, Vahap; Zararsiz, Gozde Erturk; Duru, Izzet Parug; Ozturk, Ahmet
2017-01-01
RNA sequencing (RNA-Seq) is a powerful technique for the gene-expression profiling of organisms that uses the capabilities of next-generation sequencing technologies. Developing gene-expression-based classification algorithms is an emerging powerful method for diagnosis, disease classification and monitoring at molecular level, as well as providing potential markers of diseases. Most of the statistical methods proposed for the classification of gene-expression data are either based on a continuous scale (eg. microarray data) or require a normal distribution assumption. Hence, these methods cannot be directly applied to RNA-Seq data since they violate both data structure and distributional assumptions. However, it is possible to apply these algorithms with appropriate modifications to RNA-Seq data. One way is to develop count-based classifiers, such as Poisson linear discriminant analysis and negative binomial linear discriminant analysis. Another way is to bring the data closer to microarrays and apply microarray-based classifiers. In this study, we compared several classifiers including PLDA with and without power transformation, NBLDA, single SVM, bagging SVM (bagSVM), classification and regression trees (CART), and random forests (RF). We also examined the effect of several parameters such as overdispersion, sample size, number of genes, number of classes, differential-expression rate, and the transformation method on model performances. A comprehensive simulation study is conducted and the results are compared with the results of two miRNA and two mRNA experimental datasets. The results revealed that increasing the sample size, differential-expression rate and decreasing the dispersion parameter and number of groups lead to an increase in classification accuracy. Similar with differential-expression studies, the classification of RNA-Seq data requires careful attention when handling data overdispersion. We conclude that, as a count-based classifier, the power transformed PLDA and, as a microarray-based classifier, vst or rlog transformed RF and SVM classifiers may be a good choice for classification. An R/BIOCONDUCTOR package, MLSeq, is freely available at https://www.bioconductor.org/packages/release/bioc/html/MLSeq.html.
Hartmann, Luise; Stephenson, Christine F; Verkamp, Stephanie R; Johnson, Krystal R; Burnworth, Bettina; Hammock, Kelle; Brodersen, Lisa Eidenschink; de Baca, Monica E; Wells, Denise A; Loken, Michael R; Zehentner, Barbara K
2014-12-01
Array comparative genomic hybridization (aCGH) has become a powerful tool for analyzing hematopoietic neoplasms and identifying genome-wide copy number changes in a single assay. aCGH also has superior resolution compared with fluorescence in situ hybridization (FISH) or conventional cytogenetics. Integration of single nucleotide polymorphism (SNP) probes with microarray analysis allows additional identification of acquired uniparental disomy, a copy neutral aberration with known potential to contribute to tumor pathogenesis. However, a limitation of microarray analysis has been the inability to detect clonal heterogeneity in a sample. This study comprised 16 samples (acute myeloid leukemia, myelodysplastic syndrome, chronic lymphocytic leukemia, plasma cell neoplasm) with complex cytogenetic features and evidence of clonal evolution. We used an integrated manual peak reassignment approach combining analysis of aCGH and SNP microarray data for characterization of subclonal abnormalities. We compared array findings with results obtained from conventional cytogenetic and FISH studies. Clonal heterogeneity was detected in 13 of 16 samples by microarray on the basis of log2 values. Use of the manual peak reassignment analysis approach improved resolution of the sample's clonal composition and genetic heterogeneity in 10 of 13 (77%) patients. Moreover, in 3 patients, clonal disease progression was revealed by array analysis that was not evident by cytogenetic or FISH studies. Genetic abnormalities originating from separate clonal subpopulations can be identified and further characterized by combining aCGH and SNP hybridization results from 1 integrated microarray chip by use of the manual peak reassignment technique. Its clinical utility in comparison to conventional cytogenetic or FISH studies is demonstrated. © 2014 American Association for Clinical Chemistry.
Glycome Diagnosis of Human Induced Pluripotent Stem Cells Using Lectin Microarray*
Tateno, Hiroaki; Toyota, Masashi; Saito, Shigeru; Onuma, Yasuko; Ito, Yuzuru; Hiemori, Keiko; Fukumura, Mihoko; Matsushima, Asako; Nakanishi, Mio; Ohnuma, Kiyoshi; Akutsu, Hidenori; Umezawa, Akihiro; Horimoto, Katsuhisa; Hirabayashi, Jun; Asashima, Makoto
2011-01-01
Induced pluripotent stem cells (iPSCs) can now be produced from various somatic cell (SC) lines by ectopic expression of the four transcription factors. Although the procedure has been demonstrated to induce global change in gene and microRNA expressions and even epigenetic modification, it remains largely unknown how this transcription factor-induced reprogramming affects the total glycan repertoire expressed on the cells. Here we performed a comprehensive glycan analysis using 114 types of human iPSCs generated from five different SCs and compared their glycomes with those of human embryonic stem cells (ESCs; nine cell types) using a high density lectin microarray. In unsupervised cluster analysis of the results obtained by lectin microarray, both undifferentiated iPSCs and ESCs were clustered as one large group. However, they were clearly separated from the group of differentiated SCs, whereas all of the four SCs had apparently distinct glycome profiles from one another, demonstrating that SCs with originally distinct glycan profiles have acquired those similar to ESCs upon induction of pluripotency. Thirty-eight lectins discriminating between SCs and iPSCs/ESCs were statistically selected, and characteristic features of the pluripotent state were then obtained at the level of the cellular glycome. The expression profiles of relevant glycosyltransferase genes agreed well with the results obtained by lectin microarray. Among the 38 lectins, rBC2LCN was found to detect only undifferentiated iPSCs/ESCs and not differentiated SCs. Hence, the high density lectin microarray has proved to be valid for not only comprehensive analysis of glycans but also diagnosis of stem cells under the concept of the cellular glycome. PMID:21471226
Purcell, Maureen K.; Marjara, Inderjit Singh; Batts, William; Kurath, Gael; Hansen, John D.
2010-01-01
There are three main genetic lineages or genogroups of Infectious hematopoietic necrosis virus (IHNV) in N. America. Strains representing the M genogroup are more virulent in rainbow trout relative to the U genogroup. In this study, we used microarray analysis to evaluate potential mechanisms responsible for host-specific virulence in rainbow trout that were given intraperitoneal injections of buffer or a representative M or U type virus strain. Reverse transcriptase quantitative PCR (RT-qPCR) was used to assess viral load and gene expression of select immune genes. Viral load was significantly higher in trout infected with the M virus starting at 24 h post-infection (p.i.) and continuing until 72 h p.i. Microarray analysis of the 48 h time point revealed 153 up-regulated and 248 down-regulated features in response to M virus infection but only 62 up-regulated and 49 down-regulated features following U virus infection. Translation and transcription features were among the most frequent down-regulated features in response to M virus infection and may be associated with the host cell shutoff phenomenon. A greater host cell shutoff response by the M virus may facilitate subversion of the host cell transcriptional machinery and enhance viral replication, suggesting the M virus may be better optimized to manipulate the rainbow trout transcriptional and translational machinery. Anti-viral associated features were the most commonly up-regulated features. A common set of features were up-regulated in both the M and U infection groups, but were induced to a higher magnitude in the M infection group. Gene expression of the anti-viral genes Mx-1 and Vig-1 was correlated but not entirely dependent on viral load in the anterior kidney. Slower replication of the U virus may allow the host more time to induce protective anti-viral immune mechanisms.
Characterization of Retinal Vascular and Neural Damage in a Novel Model of Diabetic Retinopathy.
Weerasekera, Lakshini Y; Balmer, Lois A; Ram, Ramesh; Morahan, Grant
2015-06-01
Diabetic retinopathy (DR) is a major cause of blindness globally. Investigating the underlying mechanisms of DR would be aided by a suitable mouse model that developed key features seen in the human disease, and did so without carrying genetic modifications. This study was undertaken to produce such a model. Our panel of Collaborative Cross strains was screened for DR-like features after induction of diabetes by intravenous injection with alloxan or streptozotocin. Both flat-mounted whole-retina and histologic sections were studied for the presence of retinal lesions. Progression of DR was also studied by histologic examination of the retinal vascular and neural structure at various time points after diabetes onset. In addition, microarray investigations were conducted on retinas from control and diabetic mice. Features of DR such as degenerated pericytes, acellular capillaries, minor vascular proliferation, gliosis of Müller cells, and loss of ganglion cells were noted as early as day 7 in some mice. These lesions became more evident with time. After 21 days of diabetes, severe vascular proliferation, microaneurysms, preretinal damage, increased Müller cell gliosis, and damage to the outer retina were all obvious. Microarray studies found significant differential expression of multiple genes known to be involved in DR. The FOT_FB strain provides a useful model to investigate the pathogenesis of DR and to develop treatments for this vision-threatening disease.
The first Korean patient with Potocki-Shaffer syndrome: a rare cause of multiple exostoses.
Sohn, Young Bae; Yim, Shin-Young; Cho, Eun-Hae; Kim, Ok-Hwa
2015-02-01
Potocki-Shaffer syndrome (PSS, OMIM #601224) is a rare contiguous gene deletion syndrome caused by haploinsufficiency of genes located on the 11p11.2p12. Affected individuals have a number of characteristic features including multiple exostoses, biparietal foramina, abnormalities of genitourinary system, hypotonia, developmental delay, and intellectual disability. We report here on the first Korean case of an 8-yr-old boy with PSS diagnosed by high resolution microarray. Initial evaluation was done at age 6 months because of a history of developmental delay, hypotonia, and dysmorphic face. Coronal craniosynostosis and enlarged parietal foramina were found on skull radiographs. At age 6 yr, he had severe global developmental delay. Multiple exostoses of long bones were detected during a radiological check-up. Based on the clinical and radiological features, PSS was highly suspected. Subsequently, chromosomal microarray analysis identified an 8.6 Mb deletion at 11p11.2 [arr 11p12p11.2 (Chr11:39,204,770-47,791,278)×1]. The patient continued rehabilitation therapy for profound developmental delay. The progression of multiple exostosis has being monitored. This case confirms and extends data on the genetic basis of PSS. In clinical and radiologic aspect, a patient with multiple exostoses accompanying with syndromic features, including craniofacial abnormalities and mental retardation, the diagnosis of PSS should be considered.
Kernel-imbedded Gaussian processes for disease classification using microarray gene expression data
Zhao, Xin; Cheung, Leo Wang-Kit
2007-01-01
Background Designing appropriate machine learning methods for identifying genes that have a significant discriminating power for disease outcomes has become more and more important for our understanding of diseases at genomic level. Although many machine learning methods have been developed and applied to the area of microarray gene expression data analysis, the majority of them are based on linear models, which however are not necessarily appropriate for the underlying connection between the target disease and its associated explanatory genes. Linear model based methods usually also bring in false positive significant features more easily. Furthermore, linear model based algorithms often involve calculating the inverse of a matrix that is possibly singular when the number of potentially important genes is relatively large. This leads to problems of numerical instability. To overcome these limitations, a few non-linear methods have recently been introduced to the area. Many of the existing non-linear methods have a couple of critical problems, the model selection problem and the model parameter tuning problem, that remain unsolved or even untouched. In general, a unified framework that allows model parameters of both linear and non-linear models to be easily tuned is always preferred in real-world applications. Kernel-induced learning methods form a class of approaches that show promising potentials to achieve this goal. Results A hierarchical statistical model named kernel-imbedded Gaussian process (KIGP) is developed under a unified Bayesian framework for binary disease classification problems using microarray gene expression data. In particular, based on a probit regression setting, an adaptive algorithm with a cascading structure is designed to find the appropriate kernel, to discover the potentially significant genes, and to make the optimal class prediction accordingly. A Gibbs sampler is built as the core of the algorithm to make Bayesian inferences. Simulation studies showed that, even without any knowledge of the underlying generative model, the KIGP performed very close to the theoretical Bayesian bound not only in the case with a linear Bayesian classifier but also in the case with a very non-linear Bayesian classifier. This sheds light on its broader usability to microarray data analysis problems, especially to those that linear methods work awkwardly. The KIGP was also applied to four published microarray datasets, and the results showed that the KIGP performed better than or at least as well as any of the referred state-of-the-art methods did in all of these cases. Conclusion Mathematically built on the kernel-induced feature space concept under a Bayesian framework, the KIGP method presented in this paper provides a unified machine learning approach to explore both the linear and the possibly non-linear underlying relationship between the target features of a given binary disease classification problem and the related explanatory gene expression data. More importantly, it incorporates the model parameter tuning into the framework. The model selection problem is addressed in the form of selecting a proper kernel type. The KIGP method also gives Bayesian probabilistic predictions for disease classification. These properties and features are beneficial to most real-world applications. The algorithm is naturally robust in numerical computation. The simulation studies and the published data studies demonstrated that the proposed KIGP performs satisfactorily and consistently. PMID:17328811
Manthiram, Kalpana; Correa, Hernan; Boyd, Kelli; Roland, Joseph; Edwards, Kathryn
2018-05-01
The objective of this study is to compare the histology and immune cell composition of tonsils from patients with periodic fever, aphthous stomatitis, pharyngitis, and cervical adenitis (PFAPA) syndrome to those from patients with obstructive sleep apnea (OSA). Patients with PFAPA and age-matched controls with OSA who had undergone tonsillectomy at Vanderbilt Children's Hospital were recruited. After informed consent, archival paraffin-embedded, formalin-fixed tonsil tissues were obtained. Sizes of major histologic regions were measured. Cores of germinal centers, crypts, and squamous epithelium were assembled on a tissue microarray for immunohistochemical staining and digital image analysis. Features of tonsils from PFAPA and OSA patients were compared with Wilcoxon signed-rank test. Samples from 16 cases with PFAPA and 16 controls with OSA were evaluated. Tonsils from PFAPA cases had significantly smaller germinal centers (0.18 vs. 0.47 mm 2 , p = 0.001) and wider squamous epithelia (176 vs. 138 μm, p = 0.008) than those of OSA patients. The percentages of B and T lymphocytes and myeloid cells were comparable in germinal centers, crypts, and squamous epithelia from PFAPA and OSA patients. Longer time from the last febrile episode in PFAPA cases was associated with larger germinal center area (Spearman's rho = 0.61, p = 0.02). We found differences in the sizes of germinal centers and squamous epithelia in tonsils of patients with PFAPA and OSA, but the cellular compositions within these areas were comparable. Our results suggest that tonsils from patients with PFAPA change histologically over time with enlarging germinal centers following a febrile episode. Additional studies are needed to understand the pathogenesis of PFAPA.
Xu, Joshua; Gong, Binsheng; Wu, Leihong; Thakkar, Shraddha; Hong, Huixiao; Tong, Weida
2016-03-15
Studies on gene expression in response to therapy have led to the discovery of pharmacogenomics biomarkers and advances in precision medicine. Whole transcriptome sequencing (RNA-seq) is an emerging tool for profiling gene expression and has received wide adoption in the biomedical research community. However, its value in regulatory decision making requires rigorous assessment and consensus between various stakeholders, including the research community, regulatory agencies, and industry. The FDA-led SEquencing Quality Control (SEQC) consortium has made considerable progress in this direction, and is the subject of this review. Specifically, three RNA-seq platforms (Illumina HiSeq, Life Technologies SOLiD, and Roche 454) were extensively evaluated at multiple sites to assess cross-site and cross-platform reproducibility. The results demonstrated that relative gene expression measurements were consistently comparable across labs and platforms, but not so for the measurement of absolute expression levels. As part of the quality evaluation several studies were included to evaluate the utility of RNA-seq in clinical settings and safety assessment. The neuroblastoma study profiled tumor samples from 498 pediatric neuroblastoma patients by both microarray and RNA-seq. RNA-seq offers more utilities than microarray in determining the transcriptomic characteristics of cancer. However, RNA-seq and microarray-based models were comparable in clinical endpoint prediction, even when including additional features unique to RNA-seq beyond gene expression. The toxicogenomics study compared microarray and RNA-seq profiles of the liver samples from rats exposed to 27 different chemicals representing multiple toxicity modes of action. Cross-platform concordance was dependent on chemical treatment and transcript abundance. Though both RNA-seq and microarray are suitable for developing gene expression based predictive models with comparable prediction performance, RNA-seq offers advantages over microarray in profiling genes with low expression. The rat BodyMap study provided a comprehensive rat transcriptomic body map by performing RNA-Seq on 320 samples from 11 organs in either sex of juvenile, adolescent, adult and aged Fischer 344 rats. Lastly, the transferability study demonstrated that signature genes of predictive models are reciprocally transferable between microarray and RNA-seq data for model development using a comprehensive approach with two large clinical data sets. This result suggests continued usefulness of legacy microarray data in the coming RNA-seq era. In conclusion, the SEQC project enhances our understanding of RNA-seq and provides valuable guidelines for RNA-seq based clinical application and safety evaluation to advance precision medicine.
Li, Huiyan; Leulmi, Rym Feriel; Juncker, David
2011-02-07
Antibody microarrays are a powerful tool for rapid, multiplexed profiling of proteins. 3D microarray substrates have been developed to improve binding capacity, assay sensitivity, and mass transport, however, they often rely on photopolymers which are difficult to manufacture and have a small pore size that limits mass transport and demands long incubation time. Here, we present a novel 3D antibody microarray format based on the entrapment of antibody-coated microbeads within alginate droplets that were spotted onto a glass slide using an inkjet. Owing to the low concentration of alginate used, the gels were highly porous to proteins, and together with the 3D architecture helped enhance mass transport during the assays. The spotting parameters were optimized for the attachment of the alginate to the substrate. Beads with 0.2 µm, 0.5 µm and 1 µm diameter were tested and 1 µm beads were selected based on their superior retention within the hydrogel. The beads were found to be distributed within the entire volume of the gel droplet using confocal microscopy. The assay time and the concentration of beads in the gels were investigated for maximal binding signal using one-step immunoassays. As a proof of concept, six proteins including cytokines (TNFα, IL-8 and MIP/CCL4), breast cancer biomarkers (CEA and HER2) and one cancer-related protein (ENG) were profiled in multiplex using sandwich assays down to pg mL(-1) concentrations with 1 h incubation without agitation in both buffer solutions and 10% serum. These results illustrate the potential of beads-in-gel microarrays for highly sensitive and multiplexed protein analysis.
NASA Astrophysics Data System (ADS)
Coccini, Teresa; Fabbri, Marco; Roda, Elisa; Grazia Sacco, Maria; Manzo, Luigi; Gribaldo, Laura
2011-07-01
Silica nanoparticles (NPs) incorporating cadmium (Cd) have been developed for a range of potential application including drug delivery devices. Occupational Cd inhalation has been associated with emphysema, pulmonary fibrosis and lung tumours. Mechanistically, Cd can induce oxidative stress and mediate cell-signalling pathways that are involved in inflammation.This in vivo study aimed at investigating pulmonary molecular effects of NPs doped with Cd (NP-Cd, 1 mg/animal) compared to soluble CdCl2 (400 μg/animal), in Sprague Dawley rats treated intra-tracheally, 7 and 30 days after administration. NPs of silica containing Cd salt were prepared starting from commercial nano-size silica powder (HiSil™ T700 Degussa) with average pore size of 20 nm and surface area of 240 m2/g. Toxicogenomic analysis was performed by the DNA microarray technology (using Agilent Whole Rat Genome Microarray 4×44K) to evaluate changes in gene expression of the entire genome. These findings indicate that the whole genome analysis may represent a valuable approach to assess the whole spectrum of biological responses to cadmium containing nanomaterials.
2011-01-01
Background Phytohormones organize plant development and environmental adaptation through cell-to-cell signal transduction, and their action involves transcriptional activation. Recent international efforts to establish and maintain public databases of Arabidopsis microarray data have enabled the utilization of this data in the analysis of various phytohormone responses, providing genome-wide identification of promoters targeted by phytohormones. Results We utilized such microarray data for prediction of cis-regulatory elements with an octamer-based approach. Our test prediction of a drought-responsive RD29A promoter with the aid of microarray data for response to drought, ABA and overexpression of DREB1A, a key regulator of cold and drought response, provided reasonable results that fit with the experimentally identified regulatory elements. With this succession, we expanded the prediction to various phytohormone responses, including those for abscisic acid, auxin, cytokinin, ethylene, brassinosteroid, jasmonic acid, and salicylic acid, as well as for hydrogen peroxide, drought and DREB1A overexpression. Totally 622 promoters that are activated by phytohormones were subjected to the prediction. In addition, we have assigned putative functions to 53 octamers of the Regulatory Element Group (REG) that have been extracted as position-dependent cis-regulatory elements with the aid of their feature of preferential appearance in the promoter region. Conclusions Our prediction of Arabidopsis cis-regulatory elements for phytohormone responses provides guidance for experimental analysis of promoters to reveal the basis of the transcriptional network of phytohormone responses. PMID:21349196
LS Bound based gene selection for DNA microarray data.
Zhou, Xin; Mao, K Z
2005-04-15
One problem with discriminant analysis of DNA microarray data is that each sample is represented by quite a large number of genes, and many of them are irrelevant, insignificant or redundant to the discriminant problem at hand. Methods for selecting important genes are, therefore, of much significance in microarray data analysis. In the present study, a new criterion, called LS Bound measure, is proposed to address the gene selection problem. The LS Bound measure is derived from leave-one-out procedure of LS-SVMs (least squares support vector machines), and as the upper bound for leave-one-out classification results it reflects to some extent the generalization performance of gene subsets. We applied this LS Bound measure for gene selection on two benchmark microarray datasets: colon cancer and leukemia. We also compared the LS Bound measure with other evaluation criteria, including the well-known Fisher's ratio and Mahalanobis class separability measure, and other published gene selection algorithms, including Weighting factor and SVM Recursive Feature Elimination. The strength of the LS Bound measure is that it provides gene subsets leading to more accurate classification results than the filter method while its computational complexity is at the level of the filter method. A companion website can be accessed at http://www.ntu.edu.sg/home5/pg02776030/lsbound/. The website contains: (1) the source code of the gene selection algorithm; (2) the complete set of tables and figures regarding the experimental study; (3) proof of the inequality (9). ekzmao@ntu.edu.sg.
Yang, Liu; Wang, Zhihua; Deng, Yuliang; Li, Yan; Wei, Wei; Shi, Qihui
2016-11-15
Circulating tumor cells (CTCs) shed from tumor sites and represent the molecular characteristics of the tumor. Besides genetic and transcriptional characterization, it is important to profile a panel of proteins with single-cell precision for resolving CTCs' phenotype, organ-of-origin, and drug targets. We describe a new technology that enables profiling multiple protein markers of extraordinarily rare tumor cells at the single-cell level. This technology integrates a microchip consisting of 15000 60 pL-sized microwells and a novel beads-on-barcode antibody microarray (BOBarray). The BOBarray allows for multiplexed protein detection by assigning two independent identifiers (bead size and fluorescent color) of the beads to each protein. Four bead sizes (1.75, 3, 4.5, and 6 μm) and three colors (blue, green, and yellow) are utilized to encode up to 12 different proteins. The miniaturized BOBarray can fit an array of 60 pL-sized microwells that isolate single cells for cell lysis and the subsequent detection of protein markers. An enclosed 60 pL-sized microchamber defines a high concentration of proteins released from lysed single cells, leading to single-cell resolution of protein detection. The protein markers assayed in this study include organ-specific markers and drug targets that help to characterize the organ-of-origin and drug targets of isolated rare tumor cells from blood samples. This new approach enables handling a very small number of cells and achieves single-cell, multiplexed protein detection without loss of rare but clinically important tumor cells.
Xu, Huihui; Xiao, Bing; Ji, Xing; Hu, Qin; Chen, Yingwei; Qiu, Wenjuan
2014-07-01
Tetrasomy for the distal chromosome 15q is rare, and only 22 patients (including 6 cases without detailed information) have been described to date in the literature. Here we report on another patient with nonmosaic tetrasomy 15q25.2-qter resulted from an inverted duplication of distal chromosome 15. This patient presents with features of development delay, arachnodactyly, joint contractures and typical facial dysmorphism including frontal bossing, short palpebral fissures, long philtrum, low-set ears, high-arched palate and retrognathia. Unlike most of the related patients, abdominal ultrasound test and brain MRI showed normal. Karyotyping analysis revealed a supernumerary marker chromosome presented in all metaphase cells examined. Parental karyotyping analysis was normal, indicating a de novo chromosome aberration of the patient. SNP microarray analysis found a two copy gain of 17.7 Mb from the distal long arm of chromosome 15 (15q25.2-qter). Further FISH analysis using SureFISH 15q26.3 IGF1R probe proved an inverted duplication of distal long arm of chromosome 15. The segmental duplications which lie in the hotspots of 15q24-26 might increase the susceptibility of chromosome rearrangement. Compared with the George-Abraham' study [2012], ADAMTSL3 might be more related to the cardiac disorders in tetrasomy 15q patients. Considering all patients reported in the literature, different mosaic degrees and segmental sizes don't correlate to the severity of phenotypes. A clear delineation on tetrasomy for distal chromosome 15q could still be investigated. Copyright © 2014 Elsevier Masson SAS. All rights reserved.
Bi, Weimin; Cheung, Sau-Wai; Breman, Amy M; Bacino, Carlos A
2016-10-01
Deletions in the 4p16.3 region cause Wolf-Hirschhorn syndrome, a well known contiguous microdeletion syndrome with the critical region for common phenotype mapped in WHSCR2. Recently, duplications in 4p16.3 were reported in three patients with developmental delay and dysmorphic features. Through chromosomal microarray analysis, we identified 156 patients with a deletion (n = 109) or duplication (n = 47) in 4p16.3 out of approximately 60,000 patients analyzed by Baylor Miraca Genetics Laboratories. Seventy-five of the postnatally detected deletions encompassed the entire critical region, 32 (43%) of which were associated with other chromosome rearrangements, including six patients (8%) that had a duplication adjacent to the terminal deletion. Our data indicate that Wolf-Hirschhorn syndrome deletions with an adjacent duplication occur at a higher frequency than previously appreciated. Pure deletions (n = 14) or duplications (n = 15) without other copy number changes distal to or inside the WHSCR2 were identified for mapping of critical regions. Our data suggest that deletion of the segment from 0.6 to 0.9 Mb from the terminus of 4p causes a seizure phenotype and duplications of a region distal to the previously defined smallest region of overlap for 4p16.3 microduplication syndrome are associated with neurodevelopmental problems. We detected seven Wolf-Hirschhorn syndrome deletions and one 4p16.3 duplication prenatally; all of the seven are either >8 Mb in size and/or associated with large duplications. In conclusion, our study provides deeper insight into the molecular mechanisms, the critical regions and effective prenatal diagnosis for 4p16.3 deletions/ duplications. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Sharrow, Allison C; Perkins, Brandy; Collector, Michael I; Yu, Wayne; Simons, Brian W; Jones, Richard J
2016-08-01
The cancer stem cell (CSC) paradigm hypothesizes that successful clinical eradication of CSCs may lead to durable remission for patients with ovarian cancer. Despite mounting evidence in support of ovarian CSCs, their phenotype and clinical relevance remain unclear. We and others have found high aldehyde dehydrogenase 1 (ALDH(high)) expression in a variety of normal and malignant stem cells, and sought to better characterize ALDH(high) cells in ovarian cancer. We compared ALDH(high) to ALDH(low) cells in two ovarian cancer models representing distinct subtypes: FNAR-C1 cells, derived from a spontaneous rat endometrioid carcinoma, and the human SKOV3 cell line (described as both serous and clear cell subtypes). We assessed these populations for stem cell features then analyzed expression by microarray and qPCR. ALDH(high) cells displayed CSC properties, including: smaller size, quiescence, regenerating the phenotypic diversity of the cell lines in vitro, lack of contact inhibition, nonadherent growth, multi-drug resistance, and in vivo tumorigenicity. Microarray and qPCR analysis of the expression of markers reported by others to enrich for ovarian CSCs revealed that ALDH(high) cells of both models showed downregulation of CD24, but inconsistent expression of CD44, KIT and CD133. However, the following druggable targets were consistently expressed in the ALDH(high) cells from both models: mTOR signaling, her-2/neu, CD47 and FGF18/FGFR3. Based on functional characterization, ALDH(high) ovarian cancer cells represent an ovarian CSC population. Differential gene expression identified druggable targets that have the potential for therapeutic efficacy against ovarian CSCs from multiple subtypes. Copyright © 2016 Elsevier Inc. All rights reserved.
High-density fiber optic biosensor arrays
NASA Astrophysics Data System (ADS)
Epstein, Jason R.; Walt, David R.
2002-02-01
Novel approaches are required to coordinate the immense amounts of information derived from diverse genomes. This concept has influenced the expanded role of high-throughput DNA detection and analysis in the biological sciences. A high-density fiber optic DNA biosensor was developed consisting of oligonucleotide-functionalized, 3.1 mm diameter microspheres deposited into the etched wells on the distal face of a 500 micrometers imaging fiber bundle. Imaging fiber bundles containing thousands of optical fibers, each associated with a unique oligonucleotide probe sequence, were the foundation for an optically connected, individually addressable DNA detection platform. Different oligonucleotide-functionalized microspheres were combined in a stock solution, and randomly dispersed into the etched wells. Microsphere positions were registered from optical dyes incorporated onto the microspheres. The distribution process provided an inherent redundancy that increases the signal-to-noise ratio as the square root of the number of sensors examined. The representative amount of each probe-type in the array was dependent on their initial stock solution concentration, and as other sequences of interest arise, new microsphere elements can be added to arrays without altering the existing detection capabilities. The oligonucleotide probe sequences hybridize to fluorescently-labeled, complementary DNA target solutions. Fiber optic DNA microarray research has included DNA-protein interaction profiles, microbial strain differentiation, non-labeled target interrogation with molecular beacons, and single cell-based assays. This biosensor array is proficient in DNA detection linked to specific disease states, single nucleotide polymorphism (SNP's) discrimination, and gene expression analysis. This array platform permits multiple detection formats, provides smaller feature sizes, and enables sensor design flexibility. High-density fiber optic microarray biosensors provide a fast, reversible format with the detection limit of a few hundred molecules.
Microarray analysis of miRNA expression profiles following whole body irradiation in a mouse model.
Aryankalayil, Molykutty J; Chopra, Sunita; Makinde, Adeola; Eke, Iris; Levin, Joel; Shankavaram, Uma; MacMillan, Laurel; Vanpouille-Box, Claire; Demaria, Sandra; Coleman, C Norman
2018-06-19
Accidental exposure to life-threatening radiation in a nuclear event is a major concern; there is an enormous need for identifying biomarkers for radiation biodosimetry to triage populations and treat critically exposed individuals. To identify dose-differentiating miRNA signatures from whole blood samples of whole body irradiated mice. Mice were whole body irradiated with X-rays (2 Gy-15 Gy); blood was collected at various time-points post-exposure; total RNA was isolated; miRNA microarrays were performed; miRNAs differentially expressed in irradiated vs. unirradiated controls were identified; feature extraction and classification models were applied to predict dose-differentiating miRNA signature. We observed a time and dose responsive alteration in the expression levels of miRNAs. Maximum number of miRNAs were altered at 24-h and 48-h time-points post-irradiation. A 23-miRNA signature was identified using feature selection algorithms and classifier models. An inverse correlation in the expression level changes of miR-17 members, and their targets were observed in whole body irradiated mice and non-human primates. Whole blood-based miRNA expression signatures might be used for predicting radiation exposures in a mass casualty nuclear incident.
Peterson, Jess F; Basel, Donald G; Bick, David P; Chirempes, Brett; Lorier, Rachel B; Zemlicka, Nykula; Grignon, John W; Weik, LuAnn; Kappes, Ulrike
2018-03-01
We report a 19-year-old female patient with a history of short stature, primary ovarian insufficiency, sensorineural hearing loss, sacral teratoma, neurogenic bladder, and intellectual disability with underlying mosaicism for der(X)t(X;3)(q13.2;q25.33), a ring X chromosome, and monosomy X. Derivative X chromosomes from unbalanced X-autosomal translocations are preferentially silenced by the XIST gene (Xq13.2) located within the X-inactivation center. The unbalanced X-autosomal translocation in our case resulted in loss of the XIST gene thus precluding the inactivation of the derivative X chromosome. As a result, clinical features of functional disomy Xp, Turner's syndrome, and duplication 3q syndrome were observed. Importantly, indications of the derivative X chromosome were revealed by microarray analysis following an initial diagnosis of Turner's syndrome made by conventional cytogenetic studies approximately 18 months earlier. This case demonstrates the importance of utilizing microarray analysis as a first-line test in patients with clinical features beyond the scope of a well-defined genetic syndrome.
Feature Selection with Conjunctions of Decision Stumps and Learning from Microarray Data.
Shah, M; Marchand, M; Corbeil, J
2012-01-01
One of the objectives of designing feature selection learning algorithms is to obtain classifiers that depend on a small number of attributes and have verifiable future performance guarantees. There are few, if any, approaches that successfully address the two goals simultaneously. To the best of our knowledge, such algorithms that give theoretical bounds on the future performance have not been proposed so far in the context of the classification of gene expression data. In this work, we investigate the premise of learning a conjunction (or disjunction) of decision stumps in Occam's Razor, Sample Compression, and PAC-Bayes learning settings for identifying a small subset of attributes that can be used to perform reliable classification tasks. We apply the proposed approaches for gene identification from DNA microarray data and compare our results to those of the well-known successful approaches proposed for the task. We show that our algorithm not only finds hypotheses with a much smaller number of genes while giving competitive classification accuracy but also having tight risk guarantees on future performance, unlike other approaches. The proposed approaches are general and extensible in terms of both designing novel algorithms and application to other domains.
Genetic biomarkers for brain hemisphere differentiation in Parkinson's Disease
NASA Astrophysics Data System (ADS)
Hourani, Mou'ath; Mendes, Alexandre; Berretta, Regina; Moscato, Pablo
2007-11-01
This work presents a study on the genetic profile of the left and right hemispheres of the brain of a mouse model of Parkinson's disease (PD). The goal is to characterize, in a genetic basis, PD as a disease that affects these two brain regions in different ways. Using the same whole-genome microarray expression data introduced by Brown et al. (2002) [1], we could find significant differences in the expression of some key genes, well-known to be involved in the mechanisms of dopamine production control and PD. The problem of selecting such genes was modeled as the MIN (α,β)—FEATURE SET problem [2]; a similar approach to that employed previously to find biomarkers for different types of cancer using gene expression microarray data [3]. The Feature Selection method produced a series of genetic signatures for PD, with distinct expression profiles in the Parkinson's model and control mice experiments. In addition, a close examination of the genes composing those signatures shows that many of them belong to genetic pathways or have ontology annotations considered to be involved in the onset and development of PD. Such elements could provide new clues on which mechanisms are implicated in hemisphere differentiation in PD.
Electrosonic ejector microarray for drug and gene delivery.
Zarnitsyn, Vladimir G; Meacham, J Mark; Varady, Mark J; Hao, Chunhai; Degertekin, F Levent; Fedorov, Andrei G
2008-04-01
We report on development and experimental characterization of a novel cell manipulation device-the electrosonic ejector microarray-which establishes a pathway for drug and/or gene delivery with control of biophysical action on the length scale of an individual cell. The device comprises a piezoelectric transducer for ultrasound wave generation, a reservoir for storing the sample mixture and a set of acoustic horn structures that form a nozzle array for focused application of mechanical energy. The nozzles are micromachined in silicon or plastic using simple and economical batch fabrication processes. When the device is driven at a particular resonant frequency of the acoustic horn structures, the sample mixture of cells and desired transfection agents/molecules suspended in culture medium is ejected from orifices located at the nozzle tips. During sample ejection, focused mechanical forces (pressure and shear) are generated on a microsecond time scale (dictated by nozzle size/geometry and ejection velocity) resulting in identical "active" microenvironments for each ejected cell. This process enables a number of cellular bioeffects, from uptake of small molecules and gene delivery/transfection to cell lysis. Specifically, we demonstrate successful calcein uptake and transfection of DNA plasmid encoding green fluorescent protein (GFP) into human malignant glioma cells (cell line LN443) using electrosonic microarrays with 36, 45 and 50 mum diameter nozzle orifices and operating at ultrasound frequencies between 0.91 and 0.98 MHz. Our results suggest that efficacy and the extent of bioeffects are mainly controlled by nozzle orifice size and the localized intensity of the applied acoustic field.
Aberrant expression of long noncoding RNAs in cumulus cells isolated from PCOS patients.
Huang, Xin; Hao, Cuifang; Bao, Hongchu; Wang, Meimei; Dai, Huangguan
2016-01-01
To describe the long noncoding RNA (lncRNA) profiles in cumulus cells isolated from polycystic ovary syndrome (PCOS) patients by employing a microarray and in-depth bioinformatics analysis. This information will help us understand the occurrence and development of PCOS. In this study, we used a microarray to describe lncRNA profiles in cumulus cells isolated from ten patients (five PCOS and five normal women). Several differentially expressed lncRNAs were chosen to validate the microarray results by quantitative RT-PCR (qRT-PCR). Then, the differentially expressed lncRNAs were classified into three subgroups (HOX loci lncRNA, enhancer-like lncRNA, and lincRNA) to deduce their potential features. Furthermore, a lncRNA/mRNA co-expression network was constructed by using the Cytoscape software (V2.8.3, http://www.cytoscape.org/ ). We observed that 623 lncRNAs and 260 messenger RNAs (mRNAs) were significantly up- or down-regulated (≥2-fold change), and these differences could be used to discriminate cumulus cells of PCOS from those of normal patients. Five differentially expressed lncRNAs (XLOC_011402, ENST00000454271, ENST00000433673, ENST00000450294, and ENST00000432431) were selected to validate the microarray results using quantitative RT-PCR (qRT-PCR). The qRT-PCR results were consistent with the microarray data. Further analysis indicated that many differentially expressed lncRNAs were transcribed from chromosome 2 and may act as enhancers to regulate their neighboring protein-coding genes. Forty-three lncRNAs and 29 mRNAs were used to construct the coding-non-coding gene co-expression network. Most pairs positively correlated, and one mRNA correlated with one or more lncRNAs. Our study is the first to determine genome-wide lncRNA expression patterns in cumulus cells isolated from PCOS patients by microarray. The results show that clusters of lncRNAs were aberrantly expressed in cumulus cells of PCOS patients compared with those of normal women, which revealed that lncRNAs differentially expressed in PCOS and normal women may contribute to the occurrence of PCOS and affect oocyte development.
Photo- and electropatterning of hydrogel-encapsulated living cell arrays.
Albrecht, Dirk R; Tsang, Valerie Liu; Sah, Robert L; Bhatia, Sangeeta N
2005-01-01
Living cells have the potential to serve as sensors, naturally integrating the response to stimuli to generate predictions about cell fate (e.g., differentiation, migration, proliferation, apoptosis). Miniaturized arrays of living cells further offer the capability to interrogate many cells in parallel and thereby enable high-throughput and/or combinatorial assays. However, the interface between living cells and synthetic chip platforms is a critical one wherein the cellular phenotype must be preserved to generate useful signals. While some cell types retain tissue-specific features on a flat (2-D) surface, it has become increasingly apparent that a 3-D physical environment will be required for others. In this paper, we present two independent methods for creating living cell arrays that are encapsulated within a poly(ethylene glycol)-based hydrogel to create a local 3-D microenvironment. First, 'photopatterning' selectively crosslinks hydrogel microstructures containing living cells with approximately 100 microm feature size. Second, 'electropatterning' utilizes dielectrophoretic forces to position cells within a prepolymer solution prior to crosslinking, forming cell patterns with micron resolution. We further combine these methods to obtain hierarchical control of cell positioning over length scales ranging from microns to centimeters. This level of microenvironmental control should enable the fabrication of next-generation cellular microarrays in which robust 3-D cultures of cells are presented with appropriate physical and chemical cues and, consequently, report on cellular responses that resemble in vivo behavior.
Thomas, Minta; De Brabanter, Kris; De Moor, Bart
2014-05-10
DNA microarrays are potentially powerful technology for improving diagnostic classification, treatment selection, and prognostic assessment. The use of this technology to predict cancer outcome has a history of almost a decade. Disease class predictors can be designed for known disease cases and provide diagnostic confirmation or clarify abnormal cases. The main input to this class predictors are high dimensional data with many variables and few observations. Dimensionality reduction of these features set significantly speeds up the prediction task. Feature selection and feature transformation methods are well known preprocessing steps in the field of bioinformatics. Several prediction tools are available based on these techniques. Studies show that a well tuned Kernel PCA (KPCA) is an efficient preprocessing step for dimensionality reduction, but the available bandwidth selection method for KPCA was computationally expensive. In this paper, we propose a new data-driven bandwidth selection criterion for KPCA, which is related to least squares cross-validation for kernel density estimation. We propose a new prediction model with a well tuned KPCA and Least Squares Support Vector Machine (LS-SVM). We estimate the accuracy of the newly proposed model based on 9 case studies. Then, we compare its performances (in terms of test set Area Under the ROC Curve (AUC) and computational time) with other well known techniques such as whole data set + LS-SVM, PCA + LS-SVM, t-test + LS-SVM, Prediction Analysis of Microarrays (PAM) and Least Absolute Shrinkage and Selection Operator (Lasso). Finally, we assess the performance of the proposed strategy with an existing KPCA parameter tuning algorithm by means of two additional case studies. We propose, evaluate, and compare several mathematical/statistical techniques, which apply feature transformation/selection for subsequent classification, and consider its application in medical diagnostics. Both feature selection and feature transformation perform well on classification tasks. Due to the dynamic selection property of feature selection, it is hard to define significant features for the classifier, which predicts classes of future samples. Moreover, the proposed strategy enjoys a distinctive advantage with its relatively lesser time complexity.
Jones, Kevin; Weiss, Shelly K; Minassian, Berge
2016-07-01
Patients presenting with infantile spasms, dysmorphic features, and periventricular nodular heterotopia may benefit from genetic copy number variation microarray, or whole-exome sequencing to identify candidate genes. This will allow personalized diagnosis and prognostication and the eventual understanding of single and combined gene functions in brain health and disease.
Non-Cell-Adhesive Substrates for Printing of Arrayed Biomaterials
Appel, Eric A.; Larson, Benjamin L.; Luly, Kathryn M.; Kim, Jinseong D.
2015-01-01
Cellular microarrays have become extremely useful in expediting the investigation of large libraries of (bio)materials for both in vitro and in vivo biomedical applications. We have developed an exceedingly simple strategy for the fabrication of non-cell-adhesive substrates supporting the immobilization of diverse (bio)material features, including both monomeric and polymeric adhesion molecules (e.g. RGD and polylysine), hydrogels, and polymers. PMID:25430948
Caroline M. Press; Niklaus J. Grunwald
2008-01-01
The release of the draft genome sequence of P. ramorum strain Pr102, enabled the construction of an oligonucleotide microarray of the entire genome of Pr102. The array contains 344,680 features (oligos) that represent the transcriptome of Pr102. P. ramorum RNA was extracted from mycelium and sporangia and used to compare gene...
Role of Macrophage-Induced Inflammation in Mesothelioma
2010-07-01
in human mesothelioma tumors and correlate immune cell infiltration with histopathologic subtype (months 1-6). Using tumor tissue microarrays of... histopathologic subtype (months 1-6). • Acquired 71 fixed and paraffin-embedded mesothelioma tumor samples • Prepared mesothelioma tumor tissue...Biol., 2008. 84: p. 1-8. 5. Dave, S.S., et al., Prediction of survival in follicular lymphoma based on molecular features of tumor-infiltrating
2012-01-01
Background High-resolution genetic maps are needed in many crops to help characterize the genetic diversity that determines agriculturally important traits. Hybridization to microarrays to detect single feature polymorphisms is a powerful technique for marker discovery and genotyping because of its highly parallel nature. However, microarrays designed for gene expression analysis rarely provide sufficient gene coverage for optimal detection of nucleotide polymorphisms, which limits utility in species with low rates of polymorphism such as lettuce (Lactuca sativa). Results We developed a 6.5 million feature Affymetrix GeneChip® for efficient polymorphism discovery and genotyping, as well as for analysis of gene expression in lettuce. Probes on the microarray were designed from 26,809 unigenes from cultivated lettuce and an additional 8,819 unigenes from four related species (L. serriola, L. saligna, L. virosa and L. perennis). Where possible, probes were tiled with a 2 bp stagger, alternating on each DNA strand; providing an average of 187 probes covering approximately 600 bp for each of over 35,000 unigenes; resulting in up to 13 fold redundancy in coverage per nucleotide. We developed protocols for hybridization of genomic DNA to the GeneChip® and refined custom algorithms that utilized coverage from multiple, high quality probes to detect single position polymorphisms in 2 bp sliding windows across each unigene. This allowed us to detect greater than 18,000 polymorphisms between the parental lines of our core mapping population, as well as numerous polymorphisms between cultivated lettuce and wild species in the lettuce genepool. Using marker data from our diversity panel comprised of 52 accessions from the five species listed above, we were able to separate accessions by species using both phylogenetic and principal component analyses. Additionally, we estimated the diversity between different types of cultivated lettuce and distinguished morphological types. Conclusion By hybridizing genomic DNA to a custom oligonucleotide array designed for maximum gene coverage, we were able to identify polymorphisms using two approaches for pair-wise comparisons, as well as a highly parallel method that compared all 52 genotypes simultaneously. PMID:22583801
Stoffel, Kevin; van Leeuwen, Hans; Kozik, Alexander; Caldwell, David; Ashrafi, Hamid; Cui, Xinping; Tan, Xiaoping; Hill, Theresa; Reyes-Chin-Wo, Sebastian; Truco, Maria-Jose; Michelmore, Richard W; Van Deynze, Allen
2012-05-14
High-resolution genetic maps are needed in many crops to help characterize the genetic diversity that determines agriculturally important traits. Hybridization to microarrays to detect single feature polymorphisms is a powerful technique for marker discovery and genotyping because of its highly parallel nature. However, microarrays designed for gene expression analysis rarely provide sufficient gene coverage for optimal detection of nucleotide polymorphisms, which limits utility in species with low rates of polymorphism such as lettuce (Lactuca sativa). We developed a 6.5 million feature Affymetrix GeneChip® for efficient polymorphism discovery and genotyping, as well as for analysis of gene expression in lettuce. Probes on the microarray were designed from 26,809 unigenes from cultivated lettuce and an additional 8,819 unigenes from four related species (L. serriola, L. saligna, L. virosa and L. perennis). Where possible, probes were tiled with a 2 bp stagger, alternating on each DNA strand; providing an average of 187 probes covering approximately 600 bp for each of over 35,000 unigenes; resulting in up to 13 fold redundancy in coverage per nucleotide. We developed protocols for hybridization of genomic DNA to the GeneChip® and refined custom algorithms that utilized coverage from multiple, high quality probes to detect single position polymorphisms in 2 bp sliding windows across each unigene. This allowed us to detect greater than 18,000 polymorphisms between the parental lines of our core mapping population, as well as numerous polymorphisms between cultivated lettuce and wild species in the lettuce genepool. Using marker data from our diversity panel comprised of 52 accessions from the five species listed above, we were able to separate accessions by species using both phylogenetic and principal component analyses. Additionally, we estimated the diversity between different types of cultivated lettuce and distinguished morphological types. By hybridizing genomic DNA to a custom oligonucleotide array designed for maximum gene coverage, we were able to identify polymorphisms using two approaches for pair-wise comparisons, as well as a highly parallel method that compared all 52 genotypes simultaneously.
Configurable pattern-based evolutionary biclustering of gene expression data
2013-01-01
Background Biclustering algorithms for microarray data aim at discovering functionally related gene sets under different subsets of experimental conditions. Due to the problem complexity and the characteristics of microarray datasets, heuristic searches are usually used instead of exhaustive algorithms. Also, the comparison among different techniques is still a challenge. The obtained results vary in relevant features such as the number of genes or conditions, which makes it difficult to carry out a fair comparison. Moreover, existing approaches do not allow the user to specify any preferences on these properties. Results Here, we present the first biclustering algorithm in which it is possible to particularize several biclusters features in terms of different objectives. This can be done by tuning the specified features in the algorithm or also by incorporating new objectives into the search. Furthermore, our approach bases the bicluster evaluation in the use of expression patterns, being able to recognize both shifting and scaling patterns either simultaneously or not. Evolutionary computation has been chosen as the search strategy, naming thus our proposal Evo-Bexpa (Evolutionary Biclustering based in Expression Patterns). Conclusions We have conducted experiments on both synthetic and real datasets demonstrating Evo-Bexpa abilities to obtain meaningful biclusters. Synthetic experiments have been designed in order to compare Evo-Bexpa performance with other approaches when looking for perfect patterns. Experiments with four different real datasets also confirm the proper performing of our algorithm, whose results have been biologically validated through Gene Ontology. PMID:23433178
Microarrays for the evaluation of cell-biomaterial surface interactions
NASA Astrophysics Data System (ADS)
Thissen, H.; Johnson, G.; McFarland, G.; Verbiest, B. C. H.; Gengenbach, T.; Voelcker, N. H.
2007-01-01
The evaluation of cell-material surface interactions is important for the design of novel biomaterials which are used in a variety of biomedical applications. While traditional in vitro test methods have routinely used samples of relatively large size, microarrays representing different biomaterials offer many advantages, including high throughput and reduced sample handling. Here, we describe the simultaneous cell-based testing of matrices of polymeric biomaterials, arrayed on glass slides with a low cell-attachment background coating. Arrays were constructed using a microarray robot at 6 fold redundancy with solid pins having a diameter of 375 μm. Printed solutions contained at least one monomer, an initiator and a bifunctional crosslinker. After subsequent UV polymerisation, the arrays were washed and characterised by X-ray photoelectron spectroscopy. Cell culture experiments were carried out over 24 hours using HeLa cells. After labelling with CellTracker ® Green for the final hour of incubation and subsequent fixation, the arrays were scanned. In addition, individual spots were also viewed by fluorescence microscopy. The evaluation of cell-surface interactions in high-throughput assays as demonstrated here is a key enabling technology for the effective development of future biomaterials.
NASA Astrophysics Data System (ADS)
Sims, Wesley Daniel
This dissertation aims to add to the scientific knowledge of physics in the field of optics by investigating the feasibility to develop a novel technique for immobilization of dye- labeled biomolecules on a polymer substrate. The development of this platform could potentially be used for bio sensing of biohazards in food and the environment. The process is facilitated by excitation of a dye-label attached to the biomolecule of interest with visible light of 488 nm wavelength. Biomolecules from an aqueous medium can be attached at any desired spot on the substrate simply by exposing the area to light. The area of the focused laser beam can control the spot-size of immobilized biomolecules. The technique is used to fabricate microarrays of immobilized antibodies (immunomicroarray) having spot-size of the order of 1 micron. This is a significant improvement over the typical commercial microarrays with spot-size in 10-100 micron range. The immobilization technique is characterized by attaching phospholipids, which have been shown to be useful as platforms for bio sensing applications. It can further be developed by attaching common proteins like Avidin as well as other antibodies against toxins and pathogens known for potential bio-terrorism through food and water systems. Absorption of laser-excited dye labeled biomolecules within the polymer appears to be the mechanism for attachment technique.
Roberts, Jennifer L; Hovanes, Karine; Dasouki, Majed; Manzardo, Ann M; Butler, Merlin G
2014-02-01
Chromosomal microarray analysis is now commonly used in clinical practice to identify copy number variants (CNVs) in the human genome. We report our experience with the use of the 105 K and 180K oligonucleotide microarrays in 215 consecutive patients referred with either autism or autism spectrum disorders (ASD) or developmental delay/learning disability for genetic services at the University of Kansas Medical Center during the past 4 years (2009-2012). Of the 215 patients [140 males and 75 females (male/female ratio=1.87); 65 with ASD and 150 with learning disability], abnormal microarray results were seen in 45 individuals (21%) with a total of 49 CNVs. Of these findings, 32 represented a known diagnostic CNV contributing to the clinical presentation and 17 represented non-diagnostic CNVs (variants of unknown significance). Thirteen patients with ASD had a total of 14 CNVs, 6 CNVs recognized as diagnostic and 8 as non-diagnostic. The most common chromosome involved in the ASD group was chromosome 15. For those with a learning disability, 32 patients had a total of 35 CNVs. Twenty-six of the 35 CNVs were classified as a known diagnostic CNV, usually a deletion (n=20). Nine CNVs were classified as an unknown non-diagnostic CNV, usually a duplication (n=8). For the learning disability subgroup, chromosomes 2 and 22 were most involved. Thirteen out of 65 patients (20%) with ASD had a CNV compared with 32 out of 150 patients (21%) with a learning disability. The frequency of chromosomal microarray abnormalities compared by subject group or gender was not statistically different. A higher percentage of individuals with a learning disability had clinical findings of seizures, dysmorphic features and microcephaly, but not statistically significant. While both groups contained more males than females, a significantly higher percentage of males were present in the ASD group. © 2013 Elsevier B.V. All rights reserved.
Design and evaluation of Actichip, a thematic microarray for the study of the actin cytoskeleton
Muller, Jean; Mehlen, André; Vetter, Guillaume; Yatskou, Mikalai; Muller, Arnaud; Chalmel, Frédéric; Poch, Olivier; Friederich, Evelyne; Vallar, Laurent
2007-01-01
Background The actin cytoskeleton plays a crucial role in supporting and regulating numerous cellular processes. Mutations or alterations in the expression levels affecting the actin cytoskeleton system or related regulatory mechanisms are often associated with complex diseases such as cancer. Understanding how qualitative or quantitative changes in expression of the set of actin cytoskeleton genes are integrated to control actin dynamics and organisation is currently a challenge and should provide insights in identifying potential targets for drug discovery. Here we report the development of a dedicated microarray, the Actichip, containing 60-mer oligonucleotide probes for 327 genes selected for transcriptome analysis of the human actin cytoskeleton. Results Genomic data and sequence analysis features were retrieved from GenBank and stored in an integrative database called Actinome. From these data, probes were designed using a home-made program (CADO4MI) allowing sequence refinement and improved probe specificity by combining the complementary information recovered from the UniGene and RefSeq databases. Actichip performance was analysed by hybridisation with RNAs extracted from epithelial MCF-7 cells and human skeletal muscle. Using thoroughly standardised procedures, we obtained microarray images with excellent quality resulting in high data reproducibility. Actichip displayed a large dynamic range extending over three logs with a limit of sensitivity between one and ten copies of transcript per cell. The array allowed accurate detection of small changes in gene expression and reliable classification of samples based on the expression profiles of tissue-specific genes. When compared to two other oligonucleotide microarray platforms, Actichip showed similar sensitivity and concordant expression ratios. Moreover, Actichip was able to discriminate the highly similar actin isoforms whereas the two other platforms did not. Conclusion Our data demonstrate that Actichip is a powerful alternative to commercial high density microarrays for cytoskeleton gene profiling in normal or pathological samples. Actichip is available upon request. PMID:17727702
Detecting novel genes with sparse arrays
Haiminen, Niina; Smit, Bart; Rautio, Jari; Vitikainen, Marika; Wiebe, Marilyn; Martinez, Diego; Chee, Christine; Kunkel, Joe; Sanchez, Charles; Nelson, Mary Anne; Pakula, Tiina; Saloheimo, Markku; Penttilä, Merja; Kivioja, Teemu
2014-01-01
Species-specific genes play an important role in defining the phenotype of an organism. However, current gene prediction methods can only efficiently find genes that share features such as sequence similarity or general sequence characteristics with previously known genes. Novel sequencing methods and tiling arrays can be used to find genes without prior information and they have demonstrated that novel genes can still be found from extensively studied model organisms. Unfortunately, these methods are expensive and thus are not easily applicable, e.g., to finding genes that are expressed only in very specific conditions. We demonstrate a method for finding novel genes with sparse arrays, applying it on the 33.9 Mb genome of the filamentous fungus Trichoderma reesei. Our computational method does not require normalisations between arrays and it takes into account the multiple-testing problem typical for analysis of microarray data. In contrast to tiling arrays, that use overlapping probes, only one 25mer microarray oligonucleotide probe was used for every 100 b. Thus, only relatively little space on a microarray slide was required to cover the intergenic regions of a genome. The analysis was done as a by-product of a conventional microarray experiment with no additional costs. We found at least 23 good candidates for novel transcripts that could code for proteins and all of which were expressed at high levels. Candidate genes were found to neighbour ire1 and cre1 and many other regulatory genes. Our simple, low-cost method can easily be applied to finding novel species-specific genes without prior knowledge of their sequence properties. PMID:20691772
2013-01-01
Background Analysis of global gene expression by DNA microarrays is widely used in experimental molecular biology. However, the complexity of such high-dimensional data sets makes it difficult to fully understand the underlying biological features present in the data. The aim of this study is to introduce a method for DNA microarray analysis that provides an intuitive interpretation of data through dimension reduction and pattern recognition. We present the first “Archetypal Analysis” of global gene expression. The analysis is based on microarray data from five integrated studies of Pseudomonas aeruginosa isolated from the airways of cystic fibrosis patients. Results Our analysis clustered samples into distinct groups with comprehensible characteristics since the archetypes representing the individual groups are closely related to samples present in the data set. Significant changes in gene expression between different groups identified adaptive changes of the bacteria residing in the cystic fibrosis lung. The analysis suggests a similar gene expression pattern between isolates with a high mutation rate (hypermutators) despite accumulation of different mutations for these isolates. This suggests positive selection in the cystic fibrosis lung environment, and changes in gene expression for these isolates are therefore most likely related to adaptation of the bacteria. Conclusions Archetypal analysis succeeded in identifying adaptive changes of P. aeruginosa. The combination of clustering and matrix factorization made it possible to reveal minor similarities among different groups of data, which other analytical methods failed to identify. We suggest that this analysis could be used to supplement current methods used to analyze DNA microarray data. PMID:24059747
Alshamlan, Hala M; Badr, Ghada H; Alohali, Yousef A
2015-06-01
Naturally inspired evolutionary algorithms prove effectiveness when used for solving feature selection and classification problems. Artificial Bee Colony (ABC) is a relatively new swarm intelligence method. In this paper, we propose a new hybrid gene selection method, namely Genetic Bee Colony (GBC) algorithm. The proposed algorithm combines the used of a Genetic Algorithm (GA) along with Artificial Bee Colony (ABC) algorithm. The goal is to integrate the advantages of both algorithms. The proposed algorithm is applied to a microarray gene expression profile in order to select the most predictive and informative genes for cancer classification. In order to test the accuracy performance of the proposed algorithm, extensive experiments were conducted. Three binary microarray datasets are use, which include: colon, leukemia, and lung. In addition, another three multi-class microarray datasets are used, which are: SRBCT, lymphoma, and leukemia. Results of the GBC algorithm are compared with our recently proposed technique: mRMR when combined with the Artificial Bee Colony algorithm (mRMR-ABC). We also compared the combination of mRMR with GA (mRMR-GA) and Particle Swarm Optimization (mRMR-PSO) algorithms. In addition, we compared the GBC algorithm with other related algorithms that have been recently published in the literature, using all benchmark datasets. The GBC algorithm shows superior performance as it achieved the highest classification accuracy along with the lowest average number of selected genes. This proves that the GBC algorithm is a promising approach for solving the gene selection problem in both binary and multi-class cancer classification. Copyright © 2015 Elsevier Ltd. All rights reserved.
Combined rule extraction and feature elimination in supervised classification.
Liu, Sheng; Patel, Ronak Y; Daga, Pankaj R; Liu, Haining; Fu, Gang; Doerksen, Robert J; Chen, Yixin; Wilkins, Dawn E
2012-09-01
There are a vast number of biology related research problems involving a combination of multiple sources of data to achieve a better understanding of the underlying problems. It is important to select and interpret the most important information from these sources. Thus it will be beneficial to have a good algorithm to simultaneously extract rules and select features for better interpretation of the predictive model. We propose an efficient algorithm, Combined Rule Extraction and Feature Elimination (CRF), based on 1-norm regularized random forests. CRF simultaneously extracts a small number of rules generated by random forests and selects important features. We applied CRF to several drug activity prediction and microarray data sets. CRF is capable of producing performance comparable with state-of-the-art prediction algorithms using a small number of decision rules. Some of the decision rules are biologically significant.
2011-01-01
Background Global transcriptional analysis of loblolly pine (Pinus taeda L.) is challenging due to limited molecular tools. PtGen2, a 26,496 feature cDNA microarray, was fabricated and used to assess drought-induced gene expression in loblolly pine propagule roots. Statistical analysis of differential expression and weighted gene correlation network analysis were used to identify drought-responsive genes and further characterize the molecular basis of drought tolerance in loblolly pine. Results Microarrays were used to interrogate root cDNA populations obtained from 12 genotype × treatment combinations (four genotypes, three watering regimes). Comparison of drought-stressed roots with roots from the control treatment identified 2445 genes displaying at least a 1.5-fold expression difference (false discovery rate = 0.01). Genes commonly associated with drought response in pine and other plant species, as well as a number of abiotic and biotic stress-related genes, were up-regulated in drought-stressed roots. Only 76 genes were identified as differentially expressed in drought-recovered roots, indicating that the transcript population can return to the pre-drought state within 48 hours. Gene correlation analysis predicts a scale-free network topology and identifies eleven co-expression modules that ranged in size from 34 to 938 members. Network topological parameters identified a number of central nodes (hubs) including those with significant homology (E-values ≤ 2 × 10-30) to 9-cis-epoxycarotenoid dioxygenase, zeatin O-glucosyltransferase, and ABA-responsive protein. Identified hubs also include genes that have been associated previously with osmotic stress, phytohormones, enzymes that detoxify reactive oxygen species, and several genes of unknown function. Conclusion PtGen2 was used to evaluate transcriptome responses in loblolly pine and was leveraged to identify 2445 differentially expressed genes responding to severe drought stress in roots. Many of the genes identified are known to be up-regulated in response to osmotic stress in pine and other plant species and encode proteins involved in both signal transduction and stress tolerance. Gene expression levels returned to control values within a 48-hour recovery period in all but 76 transcripts. Correlation network analysis indicates a scale-free network topology for the pine root transcriptome and identifies central nodes that may serve as drivers of drought-responsive transcriptome dynamics in the roots of loblolly pine. PMID:21609476
Lipid Microarray Biosensor for Biotoxin Detection.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Singh, Anup K.; Throckmorton, Daniel J.; Moran-Mirabal, Jose C.
2006-05-01
We present the use of micron-sized lipid domains, patterned onto planar substrates and within microfluidic channels, to assay the binding of bacterial toxins via total internal reflection fluorescence microscopy (TIRFM). The lipid domains were patterned using a polymer lift-off technique and consisted of ganglioside-populated DSPC:cholesterol supported lipid bilayers (SLBs). Lipid patterns were formed on the substrates by vesicle fusion followed by polymer lift-off, which revealed micron-sized SLBs containing either ganglioside GT1b or GM1. The ganglioside-populated SLB arrays were then exposed to either Cholera toxin subunit B (CTB) or Tetanus toxin fragment C (TTC). Binding was assayed on planar substrates bymore » TIRFM down to 1 nM concentration for CTB and 100 nM for TTC. Apparent binding constants extracted from three different models applied to the binding curves suggest that binding of a protein to a lipid-based receptor is strongly affected by the lipid composition of the SLB and by the substrate on which the bilayer is formed. Patterning of SLBs inside microfluidic channels also allowed the preparation of lipid domains with different compositions on a single device. Arrays within microfluidic channels were used to achieve segregation and selective binding from a binary mixture of the toxin fragments in one device. The binding and segregation within the microfluidic channels was assayed with epifluorescence as proof of concept. We propose that the method used for patterning the lipid microarrays on planar substrates and within microfluidic channels can be easily adapted to proteins or nucleic acids and can be used for biosensor applications and cell stimulation assays under different flow conditions. KEYWORDS. Microarray, ganglioside, polymer lift-off, cholera toxin, tetanus toxin, TIRFM, binding constant.4« less
Breast Reference Set Application: Karen Anderson-ASU (2014) — EDRN Public Portal
In order to increase the predictive value of tumor-specific antibodies for use as immunodiagnostics, our EDRN BDL has developed a novel protein microarray technology, termed Nucleic Acid Protein Programmable Array (NAPPA), which circumvents many of the limitations of traditional protein microarrays. NAPPA arrays are generated by printing full-length cDNAs encoding the target proteins at each feature of the array. The proteins are then transcribed and translated by a cell-free system and immobilized in situ using epitope tags fused to the proteins. Sera are added, and bound IgG is detected by standard secondary reagents. Using a sequential screening strategy to select AAb from 4,988 candidate tumor antigens, we have identified 28 potential AAb biomarkers for the early detection of breast cancer, and here we propose to evaluate these biomarkers using the EDRN Breast Cancer Reference Set.
Hybrid feature selection algorithm using symmetrical uncertainty and a harmony search algorithm
NASA Astrophysics Data System (ADS)
Salameh Shreem, Salam; Abdullah, Salwani; Nazri, Mohd Zakree Ahmad
2016-04-01
Microarray technology can be used as an efficient diagnostic system to recognise diseases such as tumours or to discriminate between different types of cancers in normal tissues. This technology has received increasing attention from the bioinformatics community because of its potential in designing powerful decision-making tools for cancer diagnosis. However, the presence of thousands or tens of thousands of genes affects the predictive accuracy of this technology from the perspective of classification. Thus, a key issue in microarray data is identifying or selecting the smallest possible set of genes from the input data that can achieve good predictive accuracy for classification. In this work, we propose a two-stage selection algorithm for gene selection problems in microarray data-sets called the symmetrical uncertainty filter and harmony search algorithm wrapper (SU-HSA). Experimental results show that the SU-HSA is better than HSA in isolation for all data-sets in terms of the accuracy and achieves a lower number of genes on 6 out of 10 instances. Furthermore, the comparison with state-of-the-art methods shows that our proposed approach is able to obtain 5 (out of 10) new best results in terms of the number of selected genes and competitive results in terms of the classification accuracy.
Chromosome r(10)(p15.3q26.12) in a newborn child: case report.
Gunnarsson, Cecilia; Graffmann, Barbara; Jonasson, Jon
2009-12-07
Ring chromosome 10 is a rare cytogenetic finding. Of the less than 10 reported cases we have found in the literature, none was characterized using high-resolution microarray analysis. Ring chromosomes are frequently unstable due to sister chromatid exchanges and mitotic failures. When mosaicism is present, the interpretation of genotype-phenotype correlations becomes extremely difficult. We report on a newborn girl with growth retardation, microcephaly, congenital heart defects, dysmorphic features and psychomotor retardation. Karyotyping revealed a non-mosaic apparently stable ring chromosome 10 replacing one of the normal homologues in all analyzed metaphases. High-resolution oligonucleotide microarray analysis showed a de novo approximately 12.5 Mb terminal deletion 10q26.12 -> qter and a corresponding 285 kb terminal deletion of 10pter -> p15.3. This case demonstrates that an increased nuchal translucency thickness detected by early ultrasonography should preferably lead to not only QF-PCR for the diagnosis of Down syndrome but also karyotyping. In the future, microarray analysis, which needs further evaluation, might become the method of choice. The clinical phenotype of our patient was in agreement with that of patients with a terminal 10q deletion. For the purpose of genotype-phenotype analysis, there seems to be no need for a "ring syndrome" concept.
Kim, Hyo Jeong; Park, Chang Il; Lim, Jae Woo; Lee, Gyung Min; Cho, Eunhae; Kim, Hyon J
2018-05-01
The present study aimed to investigate chromosomal microarray (CMA) and clinical data in patients with unexplained developmental delay/intellectual disability (DD/ID) accompanying dysmorphism, congenital anomalies, or epilepsy. We also aimed to evaluate phenotypic clues in patients with pathogenic copy number variants (CNVs). We collected clinical and CMA data from patients at Konyang University Hospital between September 2013 and October 2014. We included patients who had taken the CMA test to evaluate the etiology of unexplained DD/ID. All of the 50 patients identified had DD/ID. Thirty-nine patients had dysmorphism, 19 patients suffered from epilepsy, and 12 patients had congenital anomalies. Twenty-nine of the 50 patients (58%) showed abnormal results. Eighteen (36%) were considered to have pathogenic CNVs. Dysmorphism (p=0.028) was significantly higher in patients with pathogenic CNVs than in those with normal CMA. Two or more clinical features were presented by 61.9% (13/21) of the patients with normal CMA and by 83.3% (15/18) of the patients with pathogenic CMA. Dysmorphism can be a phenotypic clue to pathogenic CNVs. Furthermore, pathogenic CNV might be more frequently found if patients have two or more clinical features in addition to DD/ID. © Copyright: Yonsei University College of Medicine 2018.
Mitsios, Nick; Saka, Mohamad; Krupinski, Jerzy; Pennucci, Roberta; Sanfeliu, Coral; Wang, Qiuyu; Rubio, Francisco; Gaffney, John; Kumar, Pat; Kumar, Shant; Sullivan, Matthew; Slevin, Mark
2007-01-01
Background Altered gene expression is an important feature of ischemic cerebral injury and affects proteins of many functional classes. We have used microarrays to investigate the changes in gene expression at various times after middle cerebral artery occlusion in human and rat brain. Results Our results demonstrated a significant difference in the number of genes affected and the time-course of expression between the two cases. The total number of deregulated genes in the rat was 335 versus 126 in the human, while, of 393 overlapping genes between the two array sets, 184 were changed only in the rat and 36 in the human with a total of 41 genes deregulated in both cases. Interestingly, the mean fold changes were much higher in the human. The expression of novel genes, including p21-activated kinase 1 (PAK1), matrix metalloproteinase 11 (MMP11) and integrase interactor 1, was further analyzed by RT-PCR, Western blotting and immunohistochemistry. Strong neuronal staining was seen for PAK1 and MMP11. Conclusion Our findings confirmed previous studies reporting that gene expression screening can detect known and unknown transcriptional features of stroke and highlight the importance of research using human brain tissue in the search for novel therapeutic agents. PMID:17997827
Friedrich, Torben; Rahmann, Sven; Weigel, Wilfried; Rabsch, Wolfgang; Fruth, Angelika; Ron, Eliora; Gunzer, Florian; Dandekar, Thomas; Hacker, Jörg; Müller, Tobias; Dobrindt, Ulrich
2010-10-21
The Enterobacteriaceae comprise a large number of clinically relevant species with several individual subspecies. Overlapping virulence-associated gene pools and the high overall genome plasticity often interferes with correct enterobacterial strain typing and risk assessment. Array technology offers a fast, reproducible and standardisable means for bacterial typing and thus provides many advantages for bacterial diagnostics, risk assessment and surveillance. The development of highly discriminative broad-range microbial diagnostic microarrays remains a challenge, because of marked genome plasticity of many bacterial pathogens. We developed a DNA microarray for strain typing and detection of major antimicrobial resistance genes of clinically relevant enterobacteria. For this purpose, we applied a global genome-wide probe selection strategy on 32 available complete enterobacterial genomes combined with a regression model for pathogen classification. The discriminative power of the probe set was further tested in silico on 15 additional complete enterobacterial genome sequences. DNA microarrays based on the selected probes were used to type 92 clinical enterobacterial isolates. Phenotypic tests confirmed the array-based typing results and corroborate that the selected probes allowed correct typing and prediction of major antibiotic resistances of clinically relevant Enterobacteriaceae, including the subspecies level, e.g. the reliable distinction of different E. coli pathotypes. Our results demonstrate that the global probe selection approach based on longest common factor statistics as well as the design of a DNA microarray with a restricted set of discriminative probes enables robust discrimination of different enterobacterial variants and represents a proof of concept that can be adopted for diagnostics of a wide range of microbial pathogens. Our approach circumvents misclassifications arising from the application of virulence markers, which are highly affected by horizontal gene transfer. Moreover, a broad range of pathogens have been covered by an efficient probe set size enabling the design of high-throughput diagnostics.
Hess, Jonathan L.; Tylee, Daniel S.; Barve, Rahul; de Jong, Simone; Ophoff, Roel A.; Kumarasinghe, Nishantha; Tooney, Paul; Schall, Ulrich; Gardiner, Erin; Beveridge, Natalie Jane; Scott, Rodney J.; Yasawardene, Surangi; Perera, Antionette; Mendis, Jayan; Carr, Vaughan; Kelly, Brian; Cairns, Murray; Tsuang, Ming T.; Glatt, Stephen J.
2016-01-01
The application of microarray technology in schizophrenia research was heralded as paradigm-shifting, as it allowed for high-throughput assessment of cell and tissue function. This technology was widely adopted, initially in studies of postmortem brain tissue, and later in studies of peripheral blood. The collective body of schizophrenia microarray literature contains apparent inconsistencies between studies, with failures to replicate top hits, in part due to small sample sizes, cohort-specific effects, differences in array types, and other confounders. In an attempt to summarize existing studies of schizophrenia cases and non-related comparison subjects, we performed two mega-analyses of a combined set of microarray data from postmortem prefrontal cortices (n = 315) and from ex-vivo blood tissues (n = 578). We adjusted regression models per gene to remove non-significant covariates, providing best-estimates of transcripts dysregulated in schizophrenia. We also examined dysregulation of functionally related gene sets and gene co-expression modules, and assessed enrichment of cell types and genetic risk factors. The identities of the most significantly dysregulated genes were largely distinct for each tissue, but the findings indicated common emergent biological functions (e.g. immunity) and regulatory factors (e.g., predicted targets of transcription factors and miRNA species across tissues). Our network-based analyses converged upon similar patterns of heightened innate immune gene expression in both brain and blood in schizophrenia. We also constructed generalizable machine-learning classifiers using the blood-based microarray data. Our study provides an informative atlas for future pathophysiologic and biomarker studies of schizophrenia. PMID:27450777
Hess, Jonathan L; Tylee, Daniel S; Barve, Rahul; de Jong, Simone; Ophoff, Roel A; Kumarasinghe, Nishantha; Tooney, Paul; Schall, Ulrich; Gardiner, Erin; Beveridge, Natalie Jane; Scott, Rodney J; Yasawardene, Surangi; Perera, Antionette; Mendis, Jayan; Carr, Vaughan; Kelly, Brian; Cairns, Murray; Tsuang, Ming T; Glatt, Stephen J
2016-10-01
The application of microarray technology in schizophrenia research was heralded as paradigm-shifting, as it allowed for high-throughput assessment of cell and tissue function. This technology was widely adopted, initially in studies of postmortem brain tissue, and later in studies of peripheral blood. The collective body of schizophrenia microarray literature contains apparent inconsistencies between studies, with failures to replicate top hits, in part due to small sample sizes, cohort-specific effects, differences in array types, and other confounders. In an attempt to summarize existing studies of schizophrenia cases and non-related comparison subjects, we performed two mega-analyses of a combined set of microarray data from postmortem prefrontal cortices (n=315) and from ex-vivo blood tissues (n=578). We adjusted regression models per gene to remove non-significant covariates, providing best-estimates of transcripts dysregulated in schizophrenia. We also examined dysregulation of functionally related gene sets and gene co-expression modules, and assessed enrichment of cell types and genetic risk factors. The identities of the most significantly dysregulated genes were largely distinct for each tissue, but the findings indicated common emergent biological functions (e.g. immunity) and regulatory factors (e.g., predicted targets of transcription factors and miRNA species across tissues). Our network-based analyses converged upon similar patterns of heightened innate immune gene expression in both brain and blood in schizophrenia. We also constructed generalizable machine-learning classifiers using the blood-based microarray data. Our study provides an informative atlas for future pathophysiologic and biomarker studies of schizophrenia. Published by Elsevier B.V.
CrossQuery: a web tool for easy associative querying of transcriptome data.
Wagner, Toni U; Fischer, Andreas; Thoma, Eva C; Schartl, Manfred
2011-01-01
Enormous amounts of data are being generated by modern methods such as transcriptome or exome sequencing and microarray profiling. Primary analyses such as quality control, normalization, statistics and mapping are highly complex and need to be performed by specialists. Thereafter, results are handed back to biomedical researchers, who are then confronted with complicated data lists. For rather simple tasks like data filtering, sorting and cross-association there is a need for new tools which can be used by non-specialists. Here, we describe CrossQuery, a web tool that enables straight forward, simple syntax queries to be executed on transcriptome sequencing and microarray datasets. We provide deep-sequencing data sets of stem cell lines derived from the model fish Medaka and microarray data of human endothelial cells. In the example datasets provided, mRNA expression levels, gene, transcript and sample identification numbers, GO-terms and gene descriptions can be freely correlated, filtered and sorted. Queries can be saved for later reuse and results can be exported to standard formats that allow copy-and-paste to all widespread data visualization tools such as Microsoft Excel. CrossQuery enables researchers to quickly and freely work with transcriptome and microarray data sets requiring only minimal computer skills. Furthermore, CrossQuery allows growing association of multiple datasets as long as at least one common point of correlated information, such as transcript identification numbers or GO-terms, is shared between samples. For advanced users, the object-oriented plug-in and event-driven code design of both server-side and client-side scripts allow easy addition of new features, data sources and data types.
Cross-Platform Toxicogenomics for the Prediction of Non-Genotoxic Hepatocarcinogenesis in Rat
Metzger, Ute; Templin, Markus F.; Plummer, Simon; Ellinger-Ziegelbauer, Heidrun; Zell, Andreas
2014-01-01
In the area of omics profiling in toxicology, i.e. toxicogenomics, characteristic molecular profiles have previously been incorporated into prediction models for early assessment of a carcinogenic potential and mechanism-based classification of compounds. Traditionally, the biomarker signatures used for model construction were derived from individual high-throughput techniques, such as microarrays designed for monitoring global mRNA expression. In this study, we built predictive models by integrating omics data across complementary microarray platforms and introduced new concepts for modeling of pathway alterations and molecular interactions between multiple biological layers. We trained and evaluated diverse machine learning-based models, differing in the incorporated features and learning algorithms on a cross-omics dataset encompassing mRNA, miRNA, and protein expression profiles obtained from rat liver samples treated with a heterogeneous set of substances. Most of these compounds could be unambiguously classified as genotoxic carcinogens, non-genotoxic carcinogens, or non-hepatocarcinogens based on evidence from published studies. Since mixed characteristics were reported for the compounds Cyproterone acetate, Thioacetamide, and Wy-14643, we reclassified these compounds as either genotoxic or non-genotoxic carcinogens based on their molecular profiles. Evaluating our toxicogenomics models in a repeated external cross-validation procedure, we demonstrated that the prediction accuracy of our models could be increased by joining the biomarker signatures across multiple biological layers and by adding complex features derived from cross-platform integration of the omics data. Furthermore, we found that adding these features resulted in a better separation of the compound classes and a more confident reclassification of the three undefined compounds as non-genotoxic carcinogens. PMID:24830643
Probe classification of on-off type DNA microarray images with a nonlinear matching measure
NASA Astrophysics Data System (ADS)
Ryu, Munho; Kim, Jong Dae; Min, Byoung Goo; Kim, Jongwon; Kim, Y. Y.
2006-01-01
We propose a nonlinear matching measure, called counting measure, as a signal detection measure that is defined as the number of on pixels in the spot area. It is applied to classify probes for an on-off type DNA microarray, where each probe spot is classified as hybridized or not. The counting measure also incorporates the maximum response search method, where the expected signal is obtained by taking the maximum among the measured responses of the various positions and sizes of the spot template. The counting measure was compared to existing signal detection measures such as the normalized covariance and the median for 2390 patient samples tested on the human papillomavirus (HPV) DNA chip. The counting measure performed the best regardless of whether or not the maximum response search method was used. The experimental results showed that the counting measure combined with the positional search was the most preferable.
Methods for monitoring multiple gene expression
Berka, Randy; Bachkirova, Elena; Rey, Michael
2013-10-01
The present invention relates to methods for monitoring differential expression of a plurality of genes in a first filamentous fungal cell relative to expression of the same genes in one or more second filamentous fungal cells using microarrays containing Trichoderma reesei ESTs or SSH clones, or a combination thereof. The present invention also relates to computer readable media and substrates containing such array features for monitoring expression of a plurality of genes in filamentous fungal cells.
Methods for monitoring multiple gene expression
Berka, Randy [Davis, CA; Bachkirova, Elena [Davis, CA; Rey, Michael [Davis, CA
2012-05-01
The present invention relates to methods for monitoring differential expression of a plurality of genes in a first filamentous fungal cell relative to expression of the same genes in one or more second filamentous fungal cells using microarrays containing Trichoderma reesei ESTs or SSH clones, or a combination thereof. The present invention also relates to computer readable media and substrates containing such array features for monitoring expression of a plurality of genes in filamentous fungal cells.
Methods for monitoring multiple gene expression
Berka, Randy [Davis, CA; Bachkirova, Elena [Davis, CA; Rey, Michael [Davis, CA
2008-06-01
The present invention relates to methods for monitoring differential expression of a plurality of genes in a first filamentous fungal cell relative to expression of the same genes in one or more second filamentous fungal cells using microarrays containing Trichoderma reesei ESTs or SSH clones, or a combination thereof. The present invention also relates to computer readable media and substrates containing such array features for monitoring expression of a plurality of genes in filamentous fungal cells.
Importing MAGE-ML format microarray data into BioConductor.
Durinck, Steffen; Allemeersch, Joke; Carey, Vincent J; Moreau, Yves; De Moor, Bart
2004-12-12
The microarray gene expression markup language (MAGE-ML) is a widely used XML (eXtensible Markup Language) standard for describing and exchanging information about microarray experiments. It can describe microarray designs, microarray experiment designs, gene expression data and data analysis results. We describe RMAGEML, a new Bioconductor package that provides a link between cDNA microarray data stored in MAGE-ML format and the Bioconductor framework for preprocessing, visualization and analysis of microarray experiments. http://www.bioconductor.org. Open Source.
NASA Astrophysics Data System (ADS)
Parro, V.; Rivas, L. A.; Rodríguez-Manfredi, J. A.; Blanco, Y.; de Diego-Castilla, G.; Cruz-Gil, P.; Moreno-Paz, M.; García-Villadangos, M.; Compostizo, C.; Herrero, P. L.
2009-04-01
Immunosensors have been extensively used since many years for environmental monitoring. Different technological platforms allow new biosensor designs and implementations. We have reported (Rivas et al., 2008) a shotgun approach for antibody production for biomarker detection in astrobiology and environmental monitoring, the production of 150 new polyclonal antibodies against microbial strains and environmental extracts, and the construction and validation of an antibody microarray (LDCHIP200, for "Life Detector Chip") containing 200 different antibodies. We have successfully used the LDCHIP200 for the detection of biological polymers in extreme environments in different parts of the world (e.g., a deep South African mine, Antarctica's Dry valleys, Yellowstone, Iceland, and Rio Tinto). Clustering analysis associated similar immunopatterns to samples from apparently very different environments, indicating that they indeed share similar universal biomarkers. A redundancy in the number of antibodies against different target biomarkers apart of revealing the presence of certain biomolecules, it renders a sample-specific immuno-profile, an "immnuno-fingerprint", which may constitute by itself an indirect biosignature. We will present a case study of immunoprofiling different iron-sulfur as well as phylosilicates rich samples along the Rio Tinto river banks. Based on protein microarray technology, we designed and built the concept instrument called SOLID (for "Signs Of LIfe Detector"; Parro et al., 2005; 2008a, b; http://cab.inta.es/solid) for automatic in situ analysis of soil samples and molecular biomarkers detection. A field prototype, SOLID2, was successfully tested for the analysis of grinded core samples during the 2005 "MARTE" campaign of a Mars drilling simulation experiment by a sandwich microarray immunoassay (Parro et al., 2008b). We will show the new version of the instrument (SOLID3) which is able to perform both sandwich and competitive immunoassays. SOLID3 consists of two separate functional units: a Sample Preparation Unit (SPU), for ten different extractions by ultrasonication, and a Sample Analysis Unit (SAU), for fluorescent immunoassays. The SAU consists of ten different flow cells each of one allocate one antibody microarray (up to 2000 spots), and is equipped with an unique designed optical package for fluorescent detection. We demonstrate the performance of SOLID3 for the detection of a broad range of molecular size compounds, from the amino acid size, peptides, proteins, to whole cells and spores, with sensitivities at the ppb level. References Parro, V., et al., 2005. Planetary and Space Science 53: 729-737. Parro, V., et al., 2008a. Space Science Reviews 135: 293-311 Parro, V., et al., 2008b. Astrobiology 8:987-99 Rivas, L. A., et al., 2008. Analytical Chemistry 80: 7970-7979
Dai, Yilin; Guo, Ling; Li, Meng; Chen, Yi-Bu
2012-06-08
Microarray data analysis presents a significant challenge to researchers who are unable to use the powerful Bioconductor and its numerous tools due to their lack of knowledge of R language. Among the few existing software programs that offer a graphic user interface to Bioconductor packages, none have implemented a comprehensive strategy to address the accuracy and reliability issue of microarray data analysis due to the well known probe design problems associated with many widely used microarray chips. There is also a lack of tools that would expedite the functional analysis of microarray results. We present Microarray Я US, an R-based graphical user interface that implements over a dozen popular Bioconductor packages to offer researchers a streamlined workflow for routine differential microarray expression data analysis without the need to learn R language. In order to enable a more accurate analysis and interpretation of microarray data, we incorporated the latest custom probe re-definition and re-annotation for Affymetrix and Illumina chips. A versatile microarray results output utility tool was also implemented for easy and fast generation of input files for over 20 of the most widely used functional analysis software programs. Coupled with a well-designed user interface, Microarray Я US leverages cutting edge Bioconductor packages for researchers with no knowledge in R language. It also enables a more reliable and accurate microarray data analysis and expedites downstream functional analysis of microarray results.
Testing an aflatoxin B1 gene signature in rat archival tissues.
Merrick, B Alex; Auerbach, Scott S; Stockton, Patricia S; Foley, Julie F; Malarkey, David E; Sills, Robert C; Irwin, Richard D; Tice, Raymond R
2012-05-21
Archival tissues from laboratory studies represent a unique opportunity to explore the relationship between genomic changes and agent-induced disease. In this study, we evaluated the applicability of qPCR for detecting genomic changes in formalin-fixed, paraffin-embedded (FFPE) tissues by determining if a subset of 14 genes from a 90-gene signature derived from microarray data and associated with eventual tumor development could be detected in archival liver, kidney, and lung of rats exposed to aflatoxin B1 (AFB1) for 90 days in feed at 1 ppm. These tissues originated from the same rats used in the microarray study. The 14 genes evaluated were Adam8, Cdh13, Ddit4l, Mybl2, Akr7a3, Akr7a2, Fhit, Wwox, Abcb1b, Abcc3, Cxcl1, Gsta5, Grin2c, and the C8orf46 homologue. The qPCR FFPE liver results were compared to the original liver microarray data and to qPCR results using RNA from fresh frozen liver. Archival liver paraffin blocks yielded 30 to 50 μg of degraded RNA that ranged in size from 0.1 to 4 kB. qPCR results from FFPE and fresh frozen liver samples were positively correlated (p ≤ 0.05) by regression analysis and showed good agreement in direction and proportion of change with microarray data for 11 of 14 genes. All 14 transcripts could be amplified from FFPE kidney RNA except the glutamate receptor gene Grin2c; however, only Abcb1b was significantly upregulated from control. Abundant constitutive transcripts, S18 and β-actin, could be amplified from lung FFPE samples, but the narrow RNA size range (25-500 bp length) prevented consistent detection of target transcripts. Overall, a discrete gene signature derived from prior transcript profiling and representing cell cycle progression, DNA damage response, and xenosensor and detoxication pathways was successfully applied to archival liver and kidney by qPCR and indicated that gene expression changes in response to subchronic AFB1 exposure occurred predominantly in the liver, the primary target for AFB1-induced tumors. We conclude that an evaluation of gene signatures in archival tissues can be an important toxicological tool for evaluating critical molecular events associated with chemical exposures.
2011-01-01
Background Sporadic amyotrophic lateral sclerosis (sALS) is a motor neuron disease with poorly understood etiology. Results of gene expression profiling studies of whole blood from ALS patients have not been validated and are difficult to relate to ALS pathogenesis because gene expression profiles depend on the relative abundance of the different cell types present in whole blood. We conducted microarray analyses using Agilent Human Whole Genome 4 × 44k Arrays on a more homogeneous cell population, namely purified peripheral blood lymphocytes (PBLs), from ALS patients and healthy controls to identify molecular signatures possibly relevant to ALS pathogenesis. Methods Differentially expressed genes were determined by LIMMA (Linear Models for MicroArray) and SAM (Significance Analysis of Microarrays) analyses. The SAFE (Significance Analysis of Function and Expression) procedure was used to identify molecular pathway perturbations. Proteasome inhibition assays were conducted on cultured peripheral blood mononuclear cells (PBMCs) from ALS patients to confirm alteration of the Ubiquitin/Proteasome System (UPS). Results For the first time, using SAFE in a global gene ontology analysis (gene set size 5-100), we show significant perturbation of the KEGG (Kyoto Encyclopedia of Genes and Genomes) ALS pathway of motor neuron degeneration in PBLs from ALS patients. This was the only KEGG disease pathway significantly upregulated among 25, and contributing genes, including SOD1, represented 54% of the encoded proteins or protein complexes of the KEGG ALS pathway. Further SAFE analysis, including gene set sizes >100, showed that only neurodegenerative diseases (4 out of 34 disease pathways) including ALS were significantly upregulated. Changes in UBR2 expression correlated inversely with time since onset of disease and directly with ALSFRS-R, implying that UBR2 was increased early in the course of ALS. Cultured PBMCs from ALS patients accumulated more ubiquitinated proteins than PBMCs from healthy controls in a serum-dependent manner confirming changes in this pathway. Conclusions Our study indicates that PBLs from sALS patients are strong responders to systemic signals or local signals acquired by cell trafficking, representing changes in gene expression similar to those present in brain and spinal cord of sALS patients. PBLs may provide a useful means to study ALS pathogenesis. PMID:22027401
Oligo Design: a computer program for development of probes for oligonucleotide microarrays.
Herold, Keith E; Rasooly, Avraham
2003-12-01
Oligonucleotide microarrays have demonstrated potential for the analysis of gene expression, genotyping, and mutational analysis. Our work focuses primarily on the detection and identification of bacteria based on known short sequences of DNA. Oligo Design, the software described here, automates several design aspects that enable the improved selection of oligonucleotides for use with microarrays for these applications. Two major features of the program are: (i) a tiling algorithm for the design of short overlapping temperature-matched oligonucleotides of variable length, which are useful for the analysis of single nucleotide polymorphisms and (ii) a set of tools for the analysis of multiple alignments of gene families and related short DNA sequences, which allow for the identification of conserved DNA sequences for PCR primer selection and variable DNA sequences for the selection of unique probes for identification. Note that the program does not address the full genome perspective but, instead, is focused on the genetic analysis of short segments of DNA. The program is Internet-enabled and includes a built-in browser and the automated ability to download sequences from GenBank by specifying the GI number. The program also includes several utilities, including audio recital of a DNA sequence (useful for verifying sequences against a written document), a random sequence generator that provides insight into the relationship between melting temperature and GC content, and a PCR calculator.
Sun, Xicai; Guo, Limin; Wang, Jingjing; Wang, Huan; Liu, Zhuofu; Liu, Juan; Yu, Huapeng; Hu, Li; Li, Han; Wang, Dehui
2014-08-01
Although JNA is a benign neoplasm histopathologically, it has a propensity for locally destructive growth and remains a higher postoperative recurrence rate. The aim of this study was to analyze the expression and localization of MMP-9 in JNA using tissue microarray to elucidate its correlation with clinicopathological features and recurrence. The expression of MMP-9 was assessed by immunohistochemistry in a tissue microarray from 70 patients with JNA and 10 control subjects. Correlation between the levels of MMP-9 expression and clinicopathologic variables, as well as tumor recurrence, were analyzed. MMP-9 was detected in perivascular and extravascular less differentiated cells and stromal cells of patients with JNA but not in the matured vascular endothelial cells of these patients. The presence of MMP-9 expression in JNA was correlated with patient's age (p=0.001). Spearman correlation analysis suggested that high expression of MMP-9 in JNA had negative correlation with patient's age (r=-0.412, p<0.001). The recurrence rate in JNA patients with high MMP-9 expression was significantly higher than those with low MMP-9 expression (p=0.002). In multivariate and ROC curve analysis, MMP-9 was a good prognostic factor for tumor recurrence of JNA. Higher MMP-9 expression is a poor prognostic factor for patients with JNA who have been surgically treated. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Inamura, Kentaro; Togashi, Yuki; Ninomiya, Hironori; Shimoji, Takashi; Noda, Tetsuo; Ishikawa, Yuichi
2008-01-01
Previously, using microarray and real-time RT-PCR analysis, we established that HOXB2 is an adverse prognostic indicator for Stage I lung adenocarcinomas. HOXB2 is one of the homeobox master development-controlling genes regulating morphogenesis and cell differentiation. The molecular functions of HOXB2 were analyzed with a small interfering RNA (siRNA) approach in HOP-62 human non-small cell lung cancer (NSCLC) cells featuring high HOXB2 expression. Matrigel invasion assays and microarray gene expression analysis were compared between the HOXB2-siRNA cells and the control cells. The Matrigel invasion assays showed attenuation of HOXB2 expression by siRNA to result in a significant decrease of invasiveness compared to the control cells (p = 0.0013, paired t-test). On microarray gene expression analysis, up-regulation of many metastasis-related genes and others correlating with HOXB2 expression was observed in the control case. With attenuation of HOXB2 expression, downregulation was noted for laminins alpha 4 and 5, involved in enriched signaling, and for Mac-2BP (Mac-2 binding protein) and integrin beta 4 amongst the genes having an enriched glycoprotein ontology. HOXB2 promotes invasion of lung cancer cells through the regulation of metastasis-related genes.
Abbey, Darren; Hickman, Meleah; Gresham, David; Berman, Judith
2011-01-01
Phenotypic diversity can arise rapidly through loss of heterozygosity (LOH) or by the acquisition of copy number variations (CNV) spanning whole chromosomes or shorter contiguous chromosome segments. In Candida albicans, a heterozygous diploid yeast pathogen with no known meiotic cycle, homozygosis and aneuploidy alter clinical characteristics, including drug resistance. Here, we developed a high-resolution microarray that simultaneously detects ∼39,000 single nucleotide polymorphism (SNP) alleles and ∼20,000 copy number variation loci across the C. albicans genome. An important feature of the array analysis is a computational pipeline that determines SNP allele ratios based upon chromosome copy number. Using the array and analysis tools, we constructed a haplotype map (hapmap) of strain SC5314 to assign SNP alleles to specific homologs, and we used it to follow the acquisition of loss of heterozygosity (LOH) and copy number changes in a series of derived laboratory strains. This high-resolution SNP/CGH microarray and the associated hapmap facilitated the phasing of alleles in lab strains and revealed detrimental genome changes that arose frequently during molecular manipulations of laboratory strains. Furthermore, it provided a useful tool for rapid, high-resolution, and cost-effective characterization of changes in allele diversity as well as changes in chromosome copy number in new C. albicans isolates. PMID:22384363
Detection of copy number variations in epilepsy using exome data.
Tsuchida, N; Nakashima, M; Kato, M; Heyman, E; Inui, T; Haginoya, K; Watanabe, S; Chiyonobu, T; Morimoto, M; Ohta, M; Kumakura, A; Kubota, M; Kumagai, Y; Hamano, S-I; Lourenco, C M; Yahaya, N A; Ch'ng, G-S; Ngu, L-H; Fattal-Valevski, A; Weisz Hubshman, M; Orenstein, N; Marom, D; Cohen, L; Goldberg-Stern, H; Uchiyama, Y; Imagawa, E; Mizuguchi, T; Takata, A; Miyake, N; Nakajima, H; Saitsu, H; Miyatake, S; Matsumoto, N
2018-03-01
Epilepsies are common neurological disorders and genetic factors contribute to their pathogenesis. Copy number variations (CNVs) are increasingly recognized as an important etiology of many human diseases including epilepsy. Whole-exome sequencing (WES) is becoming a standard tool for detecting pathogenic mutations and has recently been applied to detecting CNVs. Here, we analyzed 294 families with epilepsy using WES, and focused on 168 families with no causative single nucleotide variants in known epilepsy-associated genes to further validate CNVs using 2 different CNV detection tools using WES data. We confirmed 18 pathogenic CNVs, and 2 deletions and 2 duplications at chr15q11.2 of clinically unknown significance. Of note, we were able to identify small CNVs less than 10 kb in size, which might be difficult to detect by conventional microarray. We revealed 2 cases with pathogenic CNVs that one of the 2 CNV detection tools failed to find, suggesting that using different CNV tools is recommended to increase diagnostic yield. Considering a relatively high discovery rate of CNVs (18 out of 168 families, 10.7%) and successful detection of CNV with <10 kb in size, CNV detection by WES may be able to surrogate, or at least complement, conventional microarray analysis. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Lymph microvascularization as a prognostic indicator in neuroblastoma.
Tadeo, Irene; Gamero-Sandemetrio, Esther; Berbegall, Ana P; Gironella, Marta; Ritort, Félix; Cañete, Adela; Bueno, Gloria; Navarro, Samuel; Noguera, Rosa
2018-05-25
Neuroblastoma is the most common extra-cranial solid pediatric cancer and causes approximately 15% of all childhood deaths from cancer. Although lymphatic vasculature is a prerequisite for the maintenance of tissue fluid balance and immunity in the body, little is known about the relationship between lymphatic vascularization and prognosis in neuroblastoma. We used our previously-published custom-designed tool to close open-outline vessels and measure the density, size and shape of all lymphatic vessels and microvascular segments in 332 primary neuroblastoma contained in tissue microarrays. The results were correlated with clinical and biological features of known prognostic value and with risk of progression to establish histological lymphatic vascular patterns associated with unfavorable histology. A high proportion of irregular intermediate lymphatic capillaries and irregular small collector vessels were present in tumors from patients with metastatic stage, undifferentiating neuroblasts and/or classified in the high risk. In addition, a higher lymphatic microvascularization density was found to be predictive of overall survival. Our findings show the crucial role of lymphatic vascularization in metastatic development and maintenance of tumor tissue homeostasis. These patterns may therefore help to indicate more accurate pre-treatment risk stratification and could provide candidate targets for novel therapies.
Identifying pathogenic processes by integrating microarray data with prior knowledge
2014-01-01
Background It is of great importance to identify molecular processes and pathways that are involved in disease etiology. Although there has been an extensive use of various high-throughput methods for this task, pathogenic pathways are still not completely understood. Often the set of genes or proteins identified as altered in genome-wide screens show a poor overlap with canonical disease pathways. These findings are difficult to interpret, yet crucial in order to improve the understanding of the molecular processes underlying the disease progression. We present a novel method for identifying groups of connected molecules from a set of differentially expressed genes. These groups represent functional modules sharing common cellular function and involve signaling and regulatory events. Specifically, our method makes use of Bayesian statistics to identify groups of co-regulated genes based on the microarray data, where external information about molecular interactions and connections are used as priors in the group assignments. Markov chain Monte Carlo sampling is used to search for the most reliable grouping. Results Simulation results showed that the method improved the ability of identifying correct groups compared to traditional clustering, especially for small sample sizes. Applied to a microarray heart failure dataset the method found one large cluster with several genes important for the structure of the extracellular matrix and a smaller group with many genes involved in carbohydrate metabolism. The method was also applied to a microarray dataset on melanoma cancer patients with or without metastasis, where the main cluster was dominated by genes related to keratinocyte differentiation. Conclusion Our method found clusters overlapping with known pathogenic processes, but also pointed to new connections extending beyond the classical pathways. PMID:24758699
Baumann, Antoine; Devaux, Yvan; Audibert, Gérard; Zhang, Lu; Bracard, Serge; Colnat-Coulbois, Sophie; Klein, Olivier; Zannad, Faiez; Charpentier, Claire; Longrois, Dan; Mertes, Paul-Michel
2013-01-01
Delayed cerebral ischemia (DCI) is a potentially devastating complication after intracranial aneurysm rupture and its mechanisms remain poorly elucidated. Early identification of the patients prone to developing DCI after rupture may represent a major breakthrough in its prevention and treatment. The single gene approach of DCI has demonstrated interest in humans. We hypothesized that whole genome expression profile of blood cells may be useful for better comprehension and prediction of aneurysmal DCI. Over a 35-month period, 218 patients with aneurysm rupture were included in this study. DCI was defined as the occurrence of a new delayed neurological deficit occurring within 2 weeks after aneurysm rupture with evidence of ischemia either on perfusion-diffusion MRI, CT angiography or CT perfusion imaging, or with cerebral angiography. DCI patients were matched against controls based on 4 out of 5 criteria (age, sex, Fisher grade, aneurysm location and smoking status). Genome-wide expression analysis of blood cells obtained at admission was performed by microarrays. Transcriptomic analysis was performed using long oligonucleotide microarrays representing 25,000 genes. Quantitative PCR: 1 µg of total RNA extracted was reverse-transcribed, and the resulting cDNA was diluted 10-fold before performing quantitative PCR. Microarray data were first analyzed by 'Significance Analysis of Microarrays' software which includes the Benjamini correction for multiple testing. In a second step, microarray data fold change was compared using a two-tailed, paired t test. Analysis of receiver-operating characteristic (ROC) curves and the area under the ROC curves were used for prediction analysis. Logistic regression models were used to investigate the additive value of multiple biomarkers. A total of 16 patients demonstrated DCI. Significance Analysis of Microarrays software failed to retrieve significant genes, most probably because of the heterogeneity of the patients included in the microarray experiments and the small size of the DCI population sample. Standard two-tailed paired t test and C-statistic revealed significant associations between gene expression and the occurrence of DCI: in particular, the expression of neuroregulin 1 was 1.6-fold upregulated in patients with DCI (p = 0.01) and predicted DCI with an area under the ROC curve of 0.96. Logistic regression analyses revealed a significant association between neuroregulin 1 and DCI (odds ratio 1.46, 95% confidence interval 1.02-2.09, p = 0.02). This pilot study suggests that blood cells may be a reservoir of prognostic biomarkers of DCI in patients with intracranial aneurysm rupture. Despite an evident lack of power, this study elicited neuroregulin 1, a vasoreactivity-, inflammation- and angiogenesis-related gene, as a possible candidate predictor of DCI. Larger cohort studies are needed but genome-wide microarray-based studies are promising research tools for the understanding of DCI after intracranial aneurysm rupture. © 2013 S. Karger AG, Basel.
Huerta, Mario; Munyi, Marc; Expósito, David; Querol, Enric; Cedano, Juan
2014-06-15
The microarrays performed by scientific teams grow exponentially. These microarray data could be useful for researchers around the world, but unfortunately they are underused. To fully exploit these data, it is necessary (i) to extract these data from a repository of the high-throughput gene expression data like Gene Expression Omnibus (GEO) and (ii) to make the data from different microarrays comparable with tools easy to use for scientists. We have developed these two solutions in our server, implementing a database of microarray marker genes (Marker Genes Data Base). This database contains the marker genes of all GEO microarray datasets and it is updated monthly with the new microarrays from GEO. Thus, researchers can see whether the marker genes of their microarray are marker genes in other microarrays in the database, expanding the analysis of their microarray to the rest of the public microarrays. This solution helps not only to corroborate the conclusions regarding a researcher's microarray but also to identify the phenotype of different subsets of individuals under investigation, to frame the results with microarray experiments from other species, pathologies or tissues, to search for drugs that promote the transition between the studied phenotypes, to detect undesirable side effects of the treatment applied, etc. Thus, the researcher can quickly add relevant information to his/her studies from all of the previous analyses performed in other studies as long as they have been deposited in public repositories. Marker-gene database tool: http://ibb.uab.es/mgdb © The Author 2014. Published by Oxford University Press.
2008 Microarray Research Group (MARG Survey): Sensing the State of Microarray Technology
Over the past several years, the field of microarrays has grown and evolved drastically. In its continued efforts to track this evolution and transformation, the ABRF-MARG has once again conducted a survey of international microarray facilities and individual microarray users. Th...
Bian, Zhong-Rui; Yin, Juan; Sun, Wen; Lin, Dian-Jie
2017-04-01
Diagnose of active tuberculosis (TB) is challenging and treatment response is also difficult to efficiently monitor. The aim of this study was to use an integrated analysis of microarray and network-based method to the samples from publically available datasets to obtain a diagnostic module set and pathways in active TB. Towards this goal, background protein-protein interactions (PPI) network was generated based on global PPI information and gene expression data, following by identification of differential expression network (DEN) from the background PPI network. Then, ego genes were extracted according to the degree features in DEN. Next, module collection was conducted by ego gene expansion based on EgoNet algorithm. After that, differential expression of modules between active TB and controls was evaluated using random permutation test. Finally, biological significance of differential modules was detected by pathways enrichment analysis based on Reactome database, and Fisher's exact test was implemented to extract differential pathways for active TB. Totally, 47 ego genes and 47 candidate modules were identified from the DEN. By setting the cutoff-criteria of gene size >5 and classification accuracy ≥0.9, 7 ego modules (Module 4, Module 7, Module 9, Module 19, Module 25, Module 38 and Module 43) were extracted, and all of them had the statistical significance between active TB and controls. Then, Fisher's exact test was conducted to capture differential pathways for active TB. Interestingly, genes in Module 4, Module 25, Module 38, and Module 43 were enriched in the same pathway, formation of a pool of free 40S subunits. Significant pathway for Module 7 and Module 9 was eukaryotic translation termination, and for Module 19 was nonsense mediated decay enhanced by the exon junction complex (EJC). Accordingly, differential modules and pathways might be potential biomarkers for treating active TB, and provide valuable clues for better understanding of molecular mechanism of active TB. Copyright © 2017 Elsevier Ltd. All rights reserved.
Chen, Dawei; Zhang, Chenyue; Lin, Jiamao; Song, Xinyu; Wang, Haiyong
2018-01-01
The aim of this study was to analyze the diagnostic and prognostic values of the circular RNA (circRNA) hsa_circ_0128298 in hepatocellular carcinoma (HCC). The global circRNA expression was measured using circRNA microarray using three pairs of cancer and noncancerous tissues from HCC patients. The microarray analysis revealed that two circRNAs were differentially expressed in the three pairs of cancerous and noncancerous tissues. The higher levels of two representative circRNAs, such as hsa_circ_0128298 and hsa_circ_0091582, were further confirmed by real-time polymerase chain reaction. In addition, the association between the expression level of hsa_circ_0128298 and the clinicopathological features of patients with HCC was further analyzed. The clinical diagnosis value was confirmed by receiver operating characteristic (ROC) curve analysis. Independent prognostic factors of patient outcome were identified using the Cox regression model. The survival data were analyzed by the Kaplan-Meier method, and the differences were evaluated using log-rank tests. Two-sided P -values <0.05 were considered statistically significant. The expression levels of hsa_circ_0128298 in HCC were significantly higher than those of paratumorous tissues ( P <0.001). Additionally, hsa_circ_0128298 was a diagnostic factor, with the area under the ROC curve of 0.668 (95% CI =0.503-0.794, P <0.001). The sensitivity and specificity values were 0.716 and 0.815, respectively. The AFP and hsa_circ_0128298 expression levels were independent prognostic factors. The overall survival of patients with low hsa_circ_0128298 expression was significantly higher than that of patients with high hsa_circ_0128298 expression. hsa_circ_0128298 may promote proliferation and metastasis and potentially represents a novel diagnostic and prognostic biomarker for HCC patients. However, studies with larger sample size are needed to confirm our conclusion.
Transcriptome remodeling associated with chronological aging in the dinoflagellate, Karenia brevis.
Johnson, Jillian G; Morey, Jeanine S; Neely, Marion G; Ryan, James C; Van Dolah, Frances M
2012-03-01
The toxic dinoflagellate, Karenia brevis, forms dense blooms in the Gulf of Mexico that persist for many months in coastal waters, where they can cause extensive marine animal mortalities and human health impacts. The mechanisms that enable cell survival in high density, low growth blooms, and the mechanisms leading to often rapid bloom demise are not well understood. To gain an understanding of processes that underlie chronological aging in this dinoflagellate, a microarray study was carried out to identify changes in the global transcriptome that accompany the entry and maintenance of stationary phase up to the onset of cell death. The transcriptome of K. brevis was assayed using a custom 10,263 feature oligonucleotide microarray from mid-logarithmic growth to the onset of culture demise. A total of 2958 (29%) features were differentially expressed, with the mid-stationary phase timepoint demonstrating peak changes in expression. Gene ontology enrichment analyses identified a significant shift in transcripts involved in energy acquisition, ribosome biogenesis, gene expression, stress adaptation, calcium signaling, and putative brevetoxin biosynthesis. The extensive remodeling of the transcriptome observed in the transition into a quiescent non-dividing phase appears to be indicative of a global shift in the metabolic and signaling requirements and provides the basis from which to understand the process of chronological aging in a dinoflagellate. Published by Elsevier B.V.
THE ABRF-MARG MICROARRAY SURVEY 2004: TAKING THE PULSE OF THE MICROARRAY FIELD
Over the past several years, the field of microarrays has grown and evolved drastically. In its continued efforts to track this evolution, the ABRF-MARG has once again conducted a survey of international microarray facilities and individual microarray users. The goal of the surve...
Contributions to Statistical Problems Related to Microarray Data
ERIC Educational Resources Information Center
Hong, Feng
2009-01-01
Microarray is a high throughput technology to measure the gene expression. Analysis of microarray data brings many interesting and challenging problems. This thesis consists three studies related to microarray data. First, we propose a Bayesian model for microarray data and use Bayes Factors to identify differentially expressed genes. Second, we…
NASA Astrophysics Data System (ADS)
Bogdanov, Valery L.; Boyce-Jacino, Michael
1999-05-01
Confined arrays of biochemical probes deposited on a solid support surface (analytical microarray or 'chip') provide an opportunity to analysis multiple reactions simultaneously. Microarrays are increasingly used in genetics, medicine and environment scanning as research and analytical instruments. A power of microarray technology comes from its parallelism which grows with array miniaturization, minimization of reagent volume per reaction site and reaction multiplexing. An optical detector of microarray signals should combine high sensitivity, spatial and spectral resolution. Additionally, low-cost and a high processing rate are needed to transfer microarray technology into biomedical practice. We designed an imager that provides confocal and complete spectrum detection of entire fluorescently-labeled microarray in parallel. Imager uses microlens array, non-slit spectral decomposer, and high- sensitive detector (cooled CCD). Two imaging channels provide a simultaneous detection of localization, integrated and spectral intensities for each reaction site in microarray. A dimensional matching between microarray and imager's optics eliminates all in moving parts in instrumentation, enabling highly informative, fast and low-cost microarray detection. We report theory of confocal hyperspectral imaging with microlenses array and experimental data for implementation of developed imager to detect fluorescently labeled microarray with a density approximately 103 sites per cm2.
NCBI GEO: mining millions of expression profiles--database and tools.
Barrett, Tanya; Suzek, Tugba O; Troup, Dennis B; Wilhite, Stephen E; Ngau, Wing-Chi; Ledoux, Pierre; Rudnev, Dmitry; Lash, Alex E; Fujibuchi, Wataru; Edgar, Ron
2005-01-01
The Gene Expression Omnibus (GEO) at the National Center for Biotechnology Information (NCBI) is the largest fully public repository for high-throughput molecular abundance data, primarily gene expression data. The database has a flexible and open design that allows the submission, storage and retrieval of many data types. These data include microarray-based experiments measuring the abundance of mRNA, genomic DNA and protein molecules, as well as non-array-based technologies such as serial analysis of gene expression (SAGE) and mass spectrometry proteomic technology. GEO currently holds over 30,000 submissions representing approximately half a billion individual molecular abundance measurements, for over 100 organisms. Here, we describe recent database developments that facilitate effective mining and visualization of these data. Features are provided to examine data from both experiment- and gene-centric perspectives using user-friendly Web-based interfaces accessible to those without computational or microarray-related analytical expertise. The GEO database is publicly accessible through the World Wide Web at http://www.ncbi.nlm.nih.gov/geo.
Flexible hemispheric microarrays of highly pressure-sensitive sensors based on breath figure method.
Wang, Zhihui; Zhang, Ling; Liu, Jin; Jiang, Hao; Li, Chunzhong
2018-05-30
Recently, flexible pressure sensors featuring high sensitivity, broad sensing range and real-time detection have aroused great attention owing to their crucial role in the development of artificial intelligent devices and healthcare systems. Herein, highly sensitive pressure sensors based on hemisphere-microarray flexible substrates are fabricated via inversely templating honeycomb structures deriving from a facile and static breath figure process. The interlocked and subtle microstructures greatly improve the sensing characteristics and compressibility of the as-prepared pressure sensor, endowing it a sensitivity as high as 196 kPa-1 and a wide pressure sensing range (0-100 kPa), as well as other superior performance, including a lower detection limit of 0.5 Pa, fast response time (<26 ms) and high reversibility (>10 000 cycles). Based on the outstanding sensing performance, the potential capability of our pressure sensor in capturing physiological information and recognizing speech signals has been demonstrated, indicating promising application in wearable and intelligent electronics.
Gene regulatory network identification from the yeast cell cycle based on a neuro-fuzzy system.
Wang, B H; Lim, J W; Lim, J S
2016-08-30
Many studies exist for reconstructing gene regulatory networks (GRNs). In this paper, we propose a method based on an advanced neuro-fuzzy system, for gene regulatory network reconstruction from microarray time-series data. This approach uses a neural network with a weighted fuzzy function to model the relationships between genes. Fuzzy rules, which determine the regulators of genes, are very simplified through this method. Additionally, a regulator selection procedure is proposed, which extracts the exact dynamic relationship between genes, using the information obtained from the weighted fuzzy function. Time-series related features are extracted from the original data to employ the characteristics of temporal data that are useful for accurate GRN reconstruction. The microarray dataset of the yeast cell cycle was used for our study. We measured the mean squared prediction error for the efficiency of the proposed approach and evaluated the accuracy in terms of precision, sensitivity, and F-score. The proposed method outperformed the other existing approaches.
Linear model for fast background subtraction in oligonucleotide microarrays.
Kroll, K Myriam; Barkema, Gerard T; Carlon, Enrico
2009-11-16
One important preprocessing step in the analysis of microarray data is background subtraction. In high-density oligonucleotide arrays this is recognized as a crucial step for the global performance of the data analysis from raw intensities to expression values. We propose here an algorithm for background estimation based on a model in which the cost function is quadratic in a set of fitting parameters such that minimization can be performed through linear algebra. The model incorporates two effects: 1) Correlated intensities between neighboring features in the chip and 2) sequence-dependent affinities for non-specific hybridization fitted by an extended nearest-neighbor model. The algorithm has been tested on 360 GeneChips from publicly available data of recent expression experiments. The algorithm is fast and accurate. Strong correlations between the fitted values for different experiments as well as between the free-energy parameters and their counterparts in aqueous solution indicate that the model captures a significant part of the underlying physical chemistry.
Mixture models for detecting differentially expressed genes in microarrays.
Jones, Liat Ben-Tovim; Bean, Richard; McLachlan, Geoffrey J; Zhu, Justin Xi
2006-10-01
An important and common problem in microarray experiments is the detection of genes that are differentially expressed in a given number of classes. As this problem concerns the selection of significant genes from a large pool of candidate genes, it needs to be carried out within the framework of multiple hypothesis testing. In this paper, we focus on the use of mixture models to handle the multiplicity issue. With this approach, a measure of the local FDR (false discovery rate) is provided for each gene. An attractive feature of the mixture model approach is that it provides a framework for the estimation of the prior probability that a gene is not differentially expressed, and this probability can subsequently be used in forming a decision rule. The rule can also be formed to take the false negative rate into account. We apply this approach to a well-known publicly available data set on breast cancer, and discuss our findings with reference to other approaches.
Pounds, Stan; Cao, Xueyuan; Cheng, Cheng; Yang, Jun; Campana, Dario; Evans, William E.; Pui, Ching-Hon; Relling, Mary V.
2010-01-01
Powerful methods for integrated analysis of multiple biological data sets are needed to maximize interpretation capacity and acquire meaningful knowledge. We recently developed Projection Onto the Most Interesting Statistical Evidence (PROMISE). PROMISE is a statistical procedure that incorporates prior knowledge about the biological relationships among endpoint variables into an integrated analysis of microarray gene expression data with multiple biological and clinical endpoints. Here, PROMISE is adapted to the integrated analysis of pharmacologic, clinical, and genome-wide genotype data that incorporating knowledge about the biological relationships among pharmacologic and clinical response data. An efficient permutation-testing algorithm is introduced so that statistical calculations are computationally feasible in this higher-dimension setting. The new method is applied to a pediatric leukemia data set. The results clearly indicate that PROMISE is a powerful statistical tool for identifying genomic features that exhibit a biologically meaningful pattern of association with multiple endpoint variables. PMID:21516175
Chemiluminescence microarrays in analytical chemistry: a critical review.
Seidel, Michael; Niessner, Reinhard
2014-09-01
Multi-analyte immunoassays on microarrays and on multiplex DNA microarrays have been described for quantitative analysis of small organic molecules (e.g., antibiotics, drugs of abuse, small molecule toxins), proteins (e.g., antibodies or protein toxins), and microorganisms, viruses, and eukaryotic cells. In analytical chemistry, multi-analyte detection by use of analytical microarrays has become an innovative research topic because of the possibility of generating several sets of quantitative data for different analyte classes in a short time. Chemiluminescence (CL) microarrays are powerful tools for rapid multiplex analysis of complex matrices. A wide range of applications for CL microarrays is described in the literature dealing with analytical microarrays. The motivation for this review is to summarize the current state of CL-based analytical microarrays. Combining analysis of different compound classes on CL microarrays reduces analysis time, cost of reagents, and use of laboratory space. Applications are discussed, with examples from food safety, water safety, environmental monitoring, diagnostics, forensics, toxicology, and biosecurity. The potential and limitations of research on multiplex analysis by use of CL microarrays are discussed in this review.
Analysis of High-Throughput ELISA Microarray Data
DOE Office of Scientific and Technical Information (OSTI.GOV)
White, Amanda M.; Daly, Don S.; Zangar, Richard C.
Our research group develops analytical methods and software for the high-throughput analysis of quantitative enzyme-linked immunosorbent assay (ELISA) microarrays. ELISA microarrays differ from DNA microarrays in several fundamental aspects and most algorithms for analysis of DNA microarray data are not applicable to ELISA microarrays. In this review, we provide an overview of the steps involved in ELISA microarray data analysis and how the statistically sound algorithms we have developed provide an integrated software suite to address the needs of each data-processing step. The algorithms discussed are available in a set of open-source software tools (http://www.pnl.gov/statistics/ProMAT).
Foda, Abd AlRahman Mohammad; El-Hawary, Amira Kamal; Hamed, Hazem
2016-10-01
CRC is a heterogeneous disease in terms of morphology, invasive behavior, metastatic capacity, and clinical outcome. Recently, many so-called mesothelial markers, including calretinin, D2-40, WT1, thrombomodulin, mesothelin, and others, have been certified. The aim of this study was to assess the immunohistochemical expression of calretinin and other mesothelial markers (D2-40 and mesothelin) in colorectal mucinous adenocarcinoma (MA) and non mucinous adenocarcinoma (NMA) specimens and relation to clinicopathological features and prognosis using manual tissue microarray technique. We studied tumor tissue specimens from 150 patients with colorectal MA and NMA who underwent radical surgery from January 2007 to January 2012. High-density manual tissue microarrays were constructed using a modified mechanical pencil tip technique, and paraffin sections were submitted for immunohistochemistry using Calretinin, D2-40 and mesothelin expressions. We found that NMA showed significantly more calretinin and D2-40 expression than MA In contrast, no statistically significant difference between NMA and MA was detected in mesothelin expression. There were no statistically significant relations between any of the clinicopathological or histological parameters and any of the three markers. In a univariate analysis, neither calretinin nor D2-40 expressions showed any significant relations to DFS or OS. However, mesothelin luminal expression was significantly associated with worse DFS. Multivariate Cox regression analysis proved that luminal mesothelin expression was an independent negative prognostic factor in NMA. In conclusion, Calretinin, D2-40 and mesothelin are aberrantly expressed in a proportion of CRC cases with more expression in NMA than MA. Aberrant expression of these mesothelial markers was not associated with clinicopathological or histological features of CRCs. Only mesothelin expression appears to be a strong predictor of adverse prognosis.
2013-01-01
Background As high-throughput genomic technologies become accurate and affordable, an increasing number of data sets have been accumulated in the public domain and genomic information integration and meta-analysis have become routine in biomedical research. In this paper, we focus on microarray meta-analysis, where multiple microarray studies with relevant biological hypotheses are combined in order to improve candidate marker detection. Many methods have been developed and applied in the literature, but their performance and properties have only been minimally investigated. There is currently no clear conclusion or guideline as to the proper choice of a meta-analysis method given an application; the decision essentially requires both statistical and biological considerations. Results We performed 12 microarray meta-analysis methods for combining multiple simulated expression profiles, and such methods can be categorized for different hypothesis setting purposes: (1) HS A : DE genes with non-zero effect sizes in all studies, (2) HS B : DE genes with non-zero effect sizes in one or more studies and (3) HS r : DE gene with non-zero effect in "majority" of studies. We then performed a comprehensive comparative analysis through six large-scale real applications using four quantitative statistical evaluation criteria: detection capability, biological association, stability and robustness. We elucidated hypothesis settings behind the methods and further apply multi-dimensional scaling (MDS) and an entropy measure to characterize the meta-analysis methods and data structure, respectively. Conclusions The aggregated results from the simulation study categorized the 12 methods into three hypothesis settings (HS A , HS B , and HS r ). Evaluation in real data and results from MDS and entropy analyses provided an insightful and practical guideline to the choice of the most suitable method in a given application. All source files for simulation and real data are available on the author’s publication website. PMID:24359104
Chang, Lun-Ching; Lin, Hui-Min; Sibille, Etienne; Tseng, George C
2013-12-21
As high-throughput genomic technologies become accurate and affordable, an increasing number of data sets have been accumulated in the public domain and genomic information integration and meta-analysis have become routine in biomedical research. In this paper, we focus on microarray meta-analysis, where multiple microarray studies with relevant biological hypotheses are combined in order to improve candidate marker detection. Many methods have been developed and applied in the literature, but their performance and properties have only been minimally investigated. There is currently no clear conclusion or guideline as to the proper choice of a meta-analysis method given an application; the decision essentially requires both statistical and biological considerations. We performed 12 microarray meta-analysis methods for combining multiple simulated expression profiles, and such methods can be categorized for different hypothesis setting purposes: (1) HS(A): DE genes with non-zero effect sizes in all studies, (2) HS(B): DE genes with non-zero effect sizes in one or more studies and (3) HS(r): DE gene with non-zero effect in "majority" of studies. We then performed a comprehensive comparative analysis through six large-scale real applications using four quantitative statistical evaluation criteria: detection capability, biological association, stability and robustness. We elucidated hypothesis settings behind the methods and further apply multi-dimensional scaling (MDS) and an entropy measure to characterize the meta-analysis methods and data structure, respectively. The aggregated results from the simulation study categorized the 12 methods into three hypothesis settings (HS(A), HS(B), and HS(r)). Evaluation in real data and results from MDS and entropy analyses provided an insightful and practical guideline to the choice of the most suitable method in a given application. All source files for simulation and real data are available on the author's publication website.
South, Sarah T; Lortz, Amanda; Hensel, Charles H; Sdano, Mallory R; Vanzo, Rena J; Martin, Megan M; Peiffer, Andreas; Lambert, Christophe G; Calhoun, Amy; Carey, John C; Battaglia, Agatino
2016-01-01
Background Wolf–Hirschhorn syndrome (WHS) is a contiguous gene deletion syndrome involving variable size deletions of the 4p16.3 region. Seizures are frequently, but not always, associated with WHS. We hypothesised that the size and location of the deleted region may correlate with seizure presentation. Methods Using chromosomal microarray analysis, we finely mapped the breakpoints of copy number variants (CNVs) in 48 individuals with WHS. Seizure phenotype data were collected through parent-reported answers to a comprehensive questionnaire and supplemented with available medical records. Results We observed a significant correlation between the presence of an interstitial 4p deletion and lack of a seizure phenotype (Fisher's exact test p=3.59e-6). In our cohort, there were five individuals with interstitial deletions with a distal breakpoint at least 751 kbp proximal to the 4p terminus. Four of these individuals have never had an observable seizure, and the fifth individual had a single febrile seizure at the age of 1.5 years. All other individuals in our cohort whose deletions encompass the terminal 751 kbp region report having seizures typical of WHS. Additional examples from the literature corroborate these observations and further refine the candidate seizure susceptibility region to a region 197 kbp in size, starting 368 kbp from the terminus of chromosome 4. Conclusions We identify a small terminal region of chromosome 4p that represents a seizure susceptibility region. Deletion of this region in the context of WHS is sufficient for seizure occurrence. PMID:26747863
NASA Technical Reports Server (NTRS)
Weitzel, A. J.; Wyatt, S. E.; Parsons-Wingerter, P.
2016-01-01
Venation patterning in leaves is a major determinant of photosynthesis efficiency because of its dependency on vascular transport of photo-assimilates, water, and minerals. Arabidopsis thaliana grown in microgravity show delayed growth and leaf maturation. Gene expression data from the roots, hypocotyl, and leaves of A. thaliana grown during spaceflight vs. ground control analyzed by Affymetrix microarray are available through NASA's GeneLab (GLDS-7). We analyzed the data for differential expression of genes in leaves resulting from the effects of spaceflight on vascular patterning. Two genes were found by preliminary analysis to be up-regulated during spaceflight that may be related to vascular formation. The genes are responsible for coding an ARGOS (Auxin-Regulated Gene Involved in Organ Size)-like protein (potentially affecting cell elongation in the leaves), and an F-box/kelch-repeat protein (possibly contributing to protoxylem specification). Further analysis that will focus on raw data quality assessment and a moderated t-test may further confirm up-regulation of the two genes and/or identify other gene candidates. Plants defective in these genes will then be assessed for phenotype by the mapping and quantification of leaf vascular patterning by NASA's VESsel GENeration (VESGEN) software to model specific vascular differences of plants grown in spaceflight.
da Silva, Roberta Peres; Heiss, Christian; Black, Ian; ...
2015-09-21
Extracellular vesicles (EVs) mediate non-conventional transport of molecules across the fungal cell wall. We aimed at describing the carbohydrate composition and surface carbohydrate epitopes of EVs isolated from the pathogenic fungi Paracoccidioides brasiliensis and P. lutzii using standard procedures. Total EV carbohydrates were ethanol-precipitated from preparations depleted of lipids and proteins, then analyzed by chemical degradation, gas chromatography-mass spectrometry, nuclear magnetic resonance and size-exclusion chromatography. EV glycosyl residues of Glc, Man, and Gal comprised most probably two major components: a high molecular mass 4,6-α-glucan and a galactofuranosylmannan, possibly an oligomer, bearing a 2-α-Manp main chain linked to β-Galf (1,3) andmore » α-Manp (1,6) end units. The results also suggested the presence of small amounts of a (1→6)- Manp polymer, (1→3)-glucan and (1→6)-glucan. Glycan microarrays allowed identification of EV surface lectin(s), while plant lectin microarray profiling revealed terminal Man and GlcNAc residues exposed at the EVs surface. Mammalian lectin microarray profiling showed that DC-SIGN receptors recognized surface carbohydrate in Paracoccidioides EVs. Our results suggest that oligosaccharides, cytoplasmic storage, and cell wall polysaccharides can be exported in fungal EVs, which also expose surface PAMPs and lectins. As a result, the role of these newly identified components in the interaction with the host remains to be unraveled.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
da Silva, Roberta Peres; Heiss, Christian; Black, Ian
Extracellular vesicles (EVs) mediate non-conventional transport of molecules across the fungal cell wall. We aimed at describing the carbohydrate composition and surface carbohydrate epitopes of EVs isolated from the pathogenic fungi Paracoccidioides brasiliensis and P. lutzii using standard procedures. Total EV carbohydrates were ethanol-precipitated from preparations depleted of lipids and proteins, then analyzed by chemical degradation, gas chromatography-mass spectrometry, nuclear magnetic resonance and size-exclusion chromatography. EV glycosyl residues of Glc, Man, and Gal comprised most probably two major components: a high molecular mass 4,6-α-glucan and a galactofuranosylmannan, possibly an oligomer, bearing a 2-α-Manp main chain linked to β-Galf (1,3) andmore » α-Manp (1,6) end units. The results also suggested the presence of small amounts of a (1→6)- Manp polymer, (1→3)-glucan and (1→6)-glucan. Glycan microarrays allowed identification of EV surface lectin(s), while plant lectin microarray profiling revealed terminal Man and GlcNAc residues exposed at the EVs surface. Mammalian lectin microarray profiling showed that DC-SIGN receptors recognized surface carbohydrate in Paracoccidioides EVs. Our results suggest that oligosaccharides, cytoplasmic storage, and cell wall polysaccharides can be exported in fungal EVs, which also expose surface PAMPs and lectins. As a result, the role of these newly identified components in the interaction with the host remains to be unraveled.« less
Intra-Platform Repeatability and Inter-Platform Comparability of MicroRNA Microarray Technology
Sato, Fumiaki; Tsuchiya, Soken; Terasawa, Kazuya; Tsujimoto, Gozoh
2009-01-01
Over the last decade, DNA microarray technology has provided a great contribution to the life sciences. The MicroArray Quality Control (MAQC) project demonstrated the way to analyze the expression microarray. Recently, microarray technology has been utilized to analyze a comprehensive microRNA expression profiling. Currently, several platforms of microRNA microarray chips are commercially available. Thus, we compared repeatability and comparability of five different microRNA microarray platforms (Agilent, Ambion, Exiqon, Invitrogen and Toray) using 309 microRNAs probes, and the Taqman microRNA system using 142 microRNA probes. This study demonstrated that microRNA microarray has high intra-platform repeatability and comparability to quantitative RT-PCR of microRNA. Among the five platforms, Agilent and Toray array showed relatively better performances than the others. However, the current lineup of commercially available microRNA microarray systems fails to show good inter-platform concordance, probably because of lack of an adequate normalization method and severe divergence in stringency of detection call criteria between different platforms. This study provided the basic information about the performance and the problems specific to the current microRNA microarray systems. PMID:19436744
Living Cell Microarrays: An Overview of Concepts
Jonczyk, Rebecca; Kurth, Tracy; Lavrentieva, Antonina; Walter, Johanna-Gabriela; Scheper, Thomas; Stahl, Frank
2016-01-01
Living cell microarrays are a highly efficient cellular screening system. Due to the low number of cells required per spot, cell microarrays enable the use of primary and stem cells and provide resolution close to the single-cell level. Apart from a variety of conventional static designs, microfluidic microarray systems have also been established. An alternative format is a microarray consisting of three-dimensional cell constructs ranging from cell spheroids to cells encapsulated in hydrogel. These systems provide an in vivo-like microenvironment and are preferably used for the investigation of cellular physiology, cytotoxicity, and drug screening. Thus, many different high-tech microarray platforms are currently available. Disadvantages of many systems include their high cost, the requirement of specialized equipment for their manufacture, and the poor comparability of results between different platforms. In this article, we provide an overview of static, microfluidic, and 3D cell microarrays. In addition, we describe a simple method for the printing of living cell microarrays on modified microscope glass slides using standard DNA microarray equipment available in most laboratories. Applications in research and diagnostics are discussed, e.g., the selective and sensitive detection of biomarkers. Finally, we highlight current limitations and the future prospects of living cell microarrays. PMID:27600077
ELISA-BASE: An Integrated Bioinformatics Tool for Analyzing and Tracking ELISA Microarray Data
DOE Office of Scientific and Technical Information (OSTI.GOV)
White, Amanda M.; Collett, James L.; Seurynck-Servoss, Shannon L.
ELISA-BASE is an open-source database for capturing, organizing and analyzing protein enzyme-linked immunosorbent assay (ELISA) microarray data. ELISA-BASE is an extension of the BioArray Soft-ware Environment (BASE) database system, which was developed for DNA microarrays. In order to make BASE suitable for protein microarray experiments, we developed several plugins for importing and analyzing quantitative ELISA microarray data. Most notably, our Protein Microarray Analysis Tool (ProMAT) for processing quantita-tive ELISA data is now available as a plugin to the database.
AVC: Selecting discriminative features on basis of AUC by maximizing variable complementarity.
Sun, Lei; Wang, Jun; Wei, Jinmao
2017-03-14
The Receiver Operator Characteristic (ROC) curve is well-known in evaluating classification performance in biomedical field. Owing to its superiority in dealing with imbalanced and cost-sensitive data, the ROC curve has been exploited as a popular metric to evaluate and find out disease-related genes (features). The existing ROC-based feature selection approaches are simple and effective in evaluating individual features. However, these approaches may fail to find real target feature subset due to their lack of effective means to reduce the redundancy between features, which is essential in machine learning. In this paper, we propose to assess feature complementarity by a trick of measuring the distances between the misclassified instances and their nearest misses on the dimensions of pairwise features. If a misclassified instance and its nearest miss on one feature dimension are far apart on another feature dimension, the two features are regarded as complementary to each other. Subsequently, we propose a novel filter feature selection approach on the basis of the ROC analysis. The new approach employs an efficient heuristic search strategy to select optimal features with highest complementarities. The experimental results on a broad range of microarray data sets validate that the classifiers built on the feature subset selected by our approach can get the minimal balanced error rate with a small amount of significant features. Compared with other ROC-based feature selection approaches, our new approach can select fewer features and effectively improve the classification performance.
Thermodynamically optimal whole-genome tiling microarray design and validation.
Cho, Hyejin; Chou, Hui-Hsien
2016-06-13
Microarray is an efficient apparatus to interrogate the whole transcriptome of species. Microarray can be designed according to annotated gene sets, but the resulted microarrays cannot be used to identify novel transcripts and this design method is not applicable to unannotated species. Alternatively, a whole-genome tiling microarray can be designed using only genomic sequences without gene annotations, and it can be used to detect novel RNA transcripts as well as known genes. The difficulty with tiling microarray design lies in the tradeoff between probe-specificity and coverage of the genome. Sequence comparison methods based on BLAST or similar software are commonly employed in microarray design, but they cannot precisely determine the subtle thermodynamic competition between probe targets and partially matched probe nontargets during hybridizations. Using the whole-genome thermodynamic analysis software PICKY to design tiling microarrays, we can achieve maximum whole-genome coverage allowable under the thermodynamic constraints of each target genome. The resulted tiling microarrays are thermodynamically optimal in the sense that all selected probes share the same melting temperature separation range between their targets and closest nontargets, and no additional probes can be added without violating the specificity of the microarray to the target genome. This new design method was used to create two whole-genome tiling microarrays for Escherichia coli MG1655 and Agrobacterium tumefaciens C58 and the experiment results validated the design.
Gunavathi, Chellamuthu; Premalatha, Kandasamy
2014-01-01
Feature selection in cancer classification is a central area of research in the field of bioinformatics and used to select the informative genes from thousands of genes of the microarray. The genes are ranked based on T-statistics, signal-to-noise ratio (SNR), and F-test values. The swarm intelligence (SI) technique finds the informative genes from the top-m ranked genes. These selected genes are used for classification. In this paper the shuffled frog leaping with Lévy flight (SFLLF) is proposed for feature selection. In SFLLF, the Lévy flight is included to avoid premature convergence of shuffled frog leaping (SFL) algorithm. The SI techniques such as particle swarm optimization (PSO), cuckoo search (CS), SFL, and SFLLF are used for feature selection which identifies informative genes for classification. The k-nearest neighbour (k-NN) technique is used to classify the samples. The proposed work is applied on 10 different benchmark datasets and examined with SI techniques. The experimental results show that the results obtained from k-NN classifier through SFLLF feature selection method outperform PSO, CS, and SFL.
Database construction for PromoterCAD: synthetic promoter design for mammals and plants.
Nishikata, Koro; Cox, Robert Sidney; Shimoyama, Sayoko; Yoshida, Yuko; Matsui, Minami; Makita, Yuko; Toyoda, Tetsuro
2014-03-21
Synthetic promoters can control a gene's timing, location, and expression level. The PromoterCAD web server ( http://promotercad.org ) allows the design of synthetic promoters to control plant gene expression, by novel arrangement of cis-regulatory elements. Recently, we have expanded PromoterCAD's scope with additional plant and animal data: (1) PLACE (Plant Cis-acting Regulatory DNA Elements), including various sized sequence motifs; (2) PEDB (Mammalian Promoter/Enhancer Database), including gene expression data for mammalian tissues. The plant PromoterCAD data now contains 22 000 Arabidopsis thaliana genes, 2 200 000 microarray measurements in 20 growth conditions and 79 tissue organs and developmental stages, while the new mammalian PromoterCAD data contains 679 Mus musculus genes and 65 000 microarray measurements in 96 tissue organs and cell types ( http://promotercad.org/mammal/ ). This work presents step-by-step instructions for adding both regulatory motif and gene expression data to PromoterCAD, to illustrate how users can expand PromoterCAD functionality for their own applications and organisms.
[Microarray CGH: principle and use for constitutional disorders].
Sanlaville, D; Lapierre, J M; Coquin, A; Turleau, C; Vermeesch, J; Colleaux, L; Borck, G; Vekemans, M; Aurias, A; Romana, S P
2005-10-01
Chips technology has allowed to miniaturize process making possible to realize in one step and using the same device a lot of chemical reactions. The application of this technology to molecular cytogenetics resulted in the development of comparative genomic hybridization (CGH) on microarrays technique. Using this technique it is possible to detect very small genetic imbalances anywhere in the genome. Its usefulness has been well documented in cancer and more recently in constitutional disorders. In particular it has been used to detect interstitial and subtelomeric submicroscopic imbalances, to characterize their size at the molecular level or to define the breakpoints of translocation. The challenge today is to transfer this technology in laboratory medicine. Nevertheless this technology remains expensive and the existence of numerous sequence polymorphisms makes its interpretation difficult. Finally its is unlikely that it will make karyotyping obsolete as it does not allow to detect balanced rearrangements which after meiotic segregation might result in genome imbalance in the progeny.
Wang, David B.; Dayton, Robert D.; Zweig, Richard M.; Klein, Ronald L.
2010-01-01
Neurofibrillary tangles comprised of the microtubule-associated protein tau are pathological features of Alzheimer's disease and several other neurodegenerative diseases, such as progressive supranuclear palsy. We previously overexpressed tau in the substantia nigra of rats and mimicked some of the neurodegenerative sequelae that occur in humans such as tangle formation, loss of dopamine neurons, and microgliosis. To study molecular changes involved in the tau-induced disease state, we used DNA microarrays at an early stage of the disease process. A range of adeno-associated virus (AAV9) vector doses for tau were injected in groups of rats with a survival interval of two weeks. Specific decreases in messages for dopamine related genes validated the technique with respect to the dopaminergic cell loss observed. Of the mRNAs upregulated, there was a dose-dependent effect on multiple genes involved in immune response such as chemokines, interferon-inducible genes and leukocyte markers, only in the tau vector groups and not in dose-matched controls of either transgene-less empty vector or control green fluorescent protein vector. Histological staining for dopamine neurons and microglia matched the loss of dopaminergic markers and upregulation of immune response mRNAs in the microarray data, respectively. RT-PCR for selected markers confirmed the microarray results, with similar changes found by either technique. The mRNA data correlate well with previous findings, and underscore microgliosis and immune response in the degenerative process following tau overexpression. PMID:20346943
cDNA microarray analysis of esophageal cancer: discoveries and prospects.
Shimada, Yutaka; Sato, Fumiaki; Shimizu, Kazuharu; Tsujimoto, Gozoh; Tsukada, Kazuhiro
2009-07-01
Recent progress in molecular biology has revealed many genetic and epigenetic alterations that are involved in the development and progression of esophageal cancer. Microarray analysis has also revealed several genetic networks that are involved in esophageal cancer. However, clinical application of microarray techniques and use of microarray data have not yet occurred. In this review, we focus on the recent developments and problems with microarray analysis of esophageal cancer.
Petersen, David W; Kawasaki, Ernest S
2007-01-01
DNA microarray technology has become a powerful tool in the arsenal of the molecular biologist. Capitalizing on high precision robotics and the wealth of DNA sequences annotated from the genomes of a large number of organisms, the manufacture of microarrays is now possible for the average academic laboratory with the funds and motivation. Microarray production requires attention to both biological and physical resources, including DNA libraries, robotics, and qualified personnel. While the fabrication of microarrays is a very labor-intensive process, production of quality microarrays individually tailored on a project-by-project basis will help researchers shed light on future scientific questions.
Killion, Patrick J; Sherlock, Gavin; Iyer, Vishwanath R
2003-01-01
Background The power of microarray analysis can be realized only if data is systematically archived and linked to biological annotations as well as analysis algorithms. Description The Longhorn Array Database (LAD) is a MIAME compliant microarray database that operates on PostgreSQL and Linux. It is a fully open source version of the Stanford Microarray Database (SMD), one of the largest microarray databases. LAD is available at Conclusions Our development of LAD provides a simple, free, open, reliable and proven solution for storage and analysis of two-color microarray data. PMID:12930545
The Porcelain Crab Transcriptome and PCAD, the Porcelain Crab Microarray and Sequence Database
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tagmount, Abderrahmane; Wang, Mei; Lindquist, Erika
2010-01-27
Background: With the emergence of a completed genome sequence of the freshwater crustacean Daphnia pulex, construction of genomic-scale sequence databases for additional crustacean sequences are important for comparative genomics and annotation. Porcelain crabs, genus Petrolisthes, have been powerful crustacean models for environmental and evolutionary physiology with respect to thermal adaptation and understanding responses of marine organisms to climate change. Here, we present a large-scale EST sequencing and cDNA microarray database project for the porcelain crab Petrolisthes cinctipes. Methodology/Principal Findings: A set of ~;;30K unique sequences (UniSeqs) representing ~;;19K clusters were generated from ~;;98K high quality ESTs from a set ofmore » tissue specific non-normalized and mixed-tissue normalized cDNA libraries from the porcelain crab Petrolisthes cinctipes. Homology for each UniSeq was assessed using BLAST, InterProScan, GO and KEGG database searches. Approximately 66percent of the UniSeqs had homology in at least one of the databases. All EST and UniSeq sequences along with annotation results and coordinated cDNA microarray datasets have been made publicly accessible at the Porcelain Crab Array Database (PCAD), a feature-enriched version of the Stanford and Longhorn Array Databases.Conclusions/Significance: The EST project presented here represents the third largest sequencing effort for any crustacean, and the largest effort for any crab species. Our assembly and clustering results suggest that our porcelain crab EST data set is equally diverse to the much larger EST set generated in the Daphnia pulex genome sequencing project, and thus will be an important resource to the Daphnia research community. Our homology results support the pancrustacea hypothesis and suggest that Malacostraca may be ancestral to Branchiopoda and Hexapoda. Our results also suggest that our cDNA microarrays cover as much of the transcriptome as can reasonably be captured in EST library sequencing approaches, and thus represent a rich resource for studies of environmental genomics.« less
2001-09-01
and pathogenicity in Erwinia carotovora (rsmA) (12). Additionally, csrA has been documented to affect cell size and surface properties, which is in...machinery to cell wall 13.1 b1502 Putative adhesin; similar to FimH protein 13.0 tap Methyl-accepting chemotaxis protein IV, peptide sensor receptor...oxohexanoyl)-L-homoserine lactone 5246 DELISA ET AL. J. BACTERIOL. regulates carbapenem antibiotic production in Erwinia carotovora . Biochem. J. 288:997
High Dimensional Classification Using Features Annealed Independence Rules.
Fan, Jianqing; Fan, Yingying
2008-01-01
Classification using high-dimensional features arises frequently in many contemporary statistical studies such as tumor classification using microarray or other high-throughput data. The impact of dimensionality on classifications is largely poorly understood. In a seminal paper, Bickel and Levina (2004) show that the Fisher discriminant performs poorly due to diverging spectra and they propose to use the independence rule to overcome the problem. We first demonstrate that even for the independence classification rule, classification using all the features can be as bad as the random guessing due to noise accumulation in estimating population centroids in high-dimensional feature space. In fact, we demonstrate further that almost all linear discriminants can perform as bad as the random guessing. Thus, it is paramountly important to select a subset of important features for high-dimensional classification, resulting in Features Annealed Independence Rules (FAIR). The conditions under which all the important features can be selected by the two-sample t-statistic are established. The choice of the optimal number of features, or equivalently, the threshold value of the test statistics are proposed based on an upper bound of the classification error. Simulation studies and real data analysis support our theoretical results and demonstrate convincingly the advantage of our new classification procedure.
Zhu, Yuerong; Zhu, Yuelin; Xu, Wei
2008-01-01
Background Though microarray experiments are very popular in life science research, managing and analyzing microarray data are still challenging tasks for many biologists. Most microarray programs require users to have sophisticated knowledge of mathematics, statistics and computer skills for usage. With accumulating microarray data deposited in public databases, easy-to-use programs to re-analyze previously published microarray data are in high demand. Results EzArray is a web-based Affymetrix expression array data management and analysis system for researchers who need to organize microarray data efficiently and get data analyzed instantly. EzArray organizes microarray data into projects that can be analyzed online with predefined or custom procedures. EzArray performs data preprocessing and detection of differentially expressed genes with statistical methods. All analysis procedures are optimized and highly automated so that even novice users with limited pre-knowledge of microarray data analysis can complete initial analysis quickly. Since all input files, analysis parameters, and executed scripts can be downloaded, EzArray provides maximum reproducibility for each analysis. In addition, EzArray integrates with Gene Expression Omnibus (GEO) and allows instantaneous re-analysis of published array data. Conclusion EzArray is a novel Affymetrix expression array data analysis and sharing system. EzArray provides easy-to-use tools for re-analyzing published microarray data and will help both novice and experienced users perform initial analysis of their microarray data from the location of data storage. We believe EzArray will be a useful system for facilities with microarray services and laboratories with multiple members involved in microarray data analysis. EzArray is freely available from . PMID:18218103
A Java-based tool for the design of classification microarrays.
Meng, Da; Broschat, Shira L; Call, Douglas R
2008-08-04
Classification microarrays are used for purposes such as identifying strains of bacteria and determining genetic relationships to understand the epidemiology of an infectious disease. For these cases, mixed microarrays, which are composed of DNA from more than one organism, are more effective than conventional microarrays composed of DNA from a single organism. Selection of probes is a key factor in designing successful mixed microarrays because redundant sequences are inefficient and limited representation of diversity can restrict application of the microarray. We have developed a Java-based software tool, called PLASMID, for use in selecting the minimum set of probe sequences needed to classify different groups of plasmids or bacteria. The software program was successfully applied to several different sets of data. The utility of PLASMID was illustrated using existing mixed-plasmid microarray data as well as data from a virtual mixed-genome microarray constructed from different strains of Streptococcus. Moreover, use of data from expression microarray experiments demonstrated the generality of PLASMID. In this paper we describe a new software tool for selecting a set of probes for a classification microarray. While the tool was developed for the design of mixed microarrays-and mixed-plasmid microarrays in particular-it can also be used to design expression arrays. The user can choose from several clustering methods (including hierarchical, non-hierarchical, and a model-based genetic algorithm), several probe ranking methods, and several different display methods. A novel approach is used for probe redundancy reduction, and probe selection is accomplished via stepwise discriminant analysis. Data can be entered in different formats (including Excel and comma-delimited text), and dendrogram, heat map, and scatter plot images can be saved in several different formats (including jpeg and tiff). Weights generated using stepwise discriminant analysis can be stored for analysis of subsequent experimental data. Additionally, PLASMID can be used to construct virtual microarrays with genomes from public databases, which can then be used to identify an optimal set of probes.
THE ABRF MARG MICROARRAY SURVEY 2005: TAKING THE PULSE ON THE MICROARRAY FIELD
Over the past several years microarray technology has evolved into a critical component of any discovery based program. Since 1999, the Association of Biomolecular Resource Facilities (ABRF) Microarray Research Group (MARG) has conducted biennial surveys designed to generate a pr...
Development of a Digital Microarray with Interferometric Reflectance Imaging
NASA Astrophysics Data System (ADS)
Sevenler, Derin
This dissertation describes a new type of molecular assay for nucleic acids and proteins. We call this technique a digital microarray since it is conceptually similar to conventional fluorescence microarrays, yet it performs enumerative ('digital') counting of the number captured molecules. Digital microarrays are approximately 10,000-fold more sensitive than fluorescence microarrays, yet maintain all of the strengths of the platform including low cost and high multiplexing (i.e., many different tests on the same sample simultaneously). Digital microarrays use gold nanorods to label the captured target molecules. Each gold nanorod on the array is individually detected based on its light scattering, with an interferometric microscopy technique called SP-IRIS. Our optimized high-throughput version of SP-IRIS is able to scan a typical array of 500 spots in less than 10 minutes. Digital DNA microarrays may have utility in applications where sequencing is prohibitively expensive or slow. As an example, we describe a digital microarray assay for gene expression markers of bacterial drug resistance.
Implementation of mutual information and bayes theorem for classification microarray data
NASA Astrophysics Data System (ADS)
Dwifebri Purbolaksono, Mahendra; Widiastuti, Kurnia C.; Syahrul Mubarok, Mohamad; Adiwijaya; Aminy Ma’ruf, Firda
2018-03-01
Microarray Technology is one of technology which able to read the structure of gen. The analysis is important for this technology. It is for deciding which attribute is more important than the others. Microarray technology is able to get cancer information to diagnose a person’s gen. Preparation of microarray data is a huge problem and takes a long time. That is because microarray data contains high number of insignificant and irrelevant attributes. So, it needs a method to reduce the dimension of microarray data without eliminating important information in every attribute. This research uses Mutual Information to reduce dimension. System is built with Machine Learning approach specifically Bayes Theorem. This theorem uses a statistical and probability approach. By combining both methods, it will be powerful for Microarray Data Classification. The experiment results show that system is good to classify Microarray data with highest F1-score using Bayesian Network by 91.06%, and Naïve Bayes by 88.85%.
Zhao, Yuanshun; Zhang, Yonghong; Lin, Dongdong; Li, Kang; Yin, Chengzeng; Liu, Xiuhong; Jin, Boxun; Sun, Libo; Liu, Jinhua; Zhang, Aiying; Li, Ning
2015-10-01
To develop and evaluate a protein microarray assay with horseradish peroxidase (HRP) chemiluminescence for quantification of α-fetoprotein (AFP) in serum from patients with hepatocellular carcinoma (HCC). A protein microarray assay for AFP was developed. Serum was collected from patients with HCC and healthy control subjects. AFP was quantified using protein microarray and enzyme-linked immunosorbent assay (ELISA). Serum AFP concentrations determined via protein microarray were positively correlated (r = 0.973) with those determined via ELISA in patients with HCC (n = 60) and healthy control subjects (n = 30). Protein microarray showed 80% sensitivity and 100% specificity for HCC diagnosis. ELISA had 83.3% sensitivity and 100% specificity. Protein microarray effectively distinguished between patients with HCC and healthy control subjects (area under ROC curve 0.974; 95% CI 0.000, 1.000). Protein microarray is a rapid, simple and low-cost alternative to ELISA for detecting AFP in human serum. © The Author(s) 2015.
Hyperspectral microscopic analysis of normal, benign and carcinoma microarray tissue sections
NASA Astrophysics Data System (ADS)
Maggioni, Mauro; Davis, Gustave L.; Warner, Frederick J.; Geshwind, Frank B.; Coppi, Andreas C.; DeVerse, Richard A.; Coifman, Ronald R.
2006-02-01
We apply a unique micro-optoelectromechanical tuned light source and new algorithms to the hyper-spectral microscopic analysis of human colon biopsies. The tuned light prototype (Plain Sight Systems Inc.) transmits any combination of light frequencies, range 440nm 700nm, trans-illuminating H and E stained tissue sections of normal (N), benign adenoma (B) and malignant carcinoma (M) colon biopsies, through a Nikon Biophot microscope. Hyper-spectral photomicrographs, randomly collected 400X magnication, are obtained with a CCD camera (Sensovation) from 59 different patient biopsies (20 N, 19 B, 20 M) mounted as a microarray on a single glass slide. The spectra of each pixel are normalized and analyzed to discriminate among tissue features: gland nuclei, gland cytoplasm and lamina propria/lumens. Spectral features permit the automatic extraction of 3298 nuclei with classification as N, B or M. When nuclei are extracted from each of the 59 biopsies the average classification among N, B and M nuclei is 97.1%; classification of the biopsies, based on the average nuclei classification, is 100%. However, when the nuclei are extracted from a subset of biopsies, and the prediction is made on nuclei in the remaining biopsies, there is a marked decrement in performance to 60% across the 3 classes. Similarly the biopsy classification drops to 54%. In spite of these classification differences, which we believe are due to instrument and biopsy normalization issues, hyper-spectral analysis has the potential to achieve diagnostic efficiency needed for objective microscopic diagnosis.
The function of BTG3 in colorectal cancer cells and its possible signaling pathway.
Lv, Chi; Wang, Heling; Tong, Yuxin; Yin, Hongzhuan; Wang, Dalu; Yan, Zhaopeng; Liang, Yichao; Wu, Di; Su, Qi
2018-02-01
B-cell translocation gene 3 (BTG3) has been identified as a candidate driver gene for various cancers, but its specific role in colorectal cancer (CRC) is poorly understood. We aimed to investigate the relationship between expression of BTG3 and clinicopathological features and prognosis, as well as to explore the effects and the role of a possible BTG3 molecular mechanism on aggressive colorectal cancer behavior. BTG3 expression was assessed by immunohistochemistry (IHC) on specimens from 140 patients with CRC. The association of BTG3 expression with clinicopathological features was examined. To confirm the biological role of BTG3 in CRC, two CRC cell lines expressing BTG3 were used and BTG3 expression was knocked down by shRNA. CCK-8, cell cycle, apoptosis, migration, and invasion assays were performed. The influence of BTG3 knockdown was further investigated by genomic microarray to uncover the potential molecular mechanisms underlying BTG3-mediated CRC development and progression. BTG3 was downregulated in colorectal cancer tissues and positively correlated with pathological classification (p = 0.037), depth of invasion (p = 0.016), distant metastasis (p = 0.024), TNM stage (p = 0.007), and overall survival (OS) and disease-free survival (DFS). BTG3 knockdown promoted cell proliferation, migration, invasion, relieved G2 arrest, and inhibited apoptosis in HCT116 and LoVo cells. A genomic microarray analysis showed that numerous tumor-associated signaling pathways and oncogenes were altered by BTG3 knockdown. At the mRNA level, nine genes referred to the extracellular-regulated kinase/mitogen-activated protein kinase pathway were differentially expressed. Western blotting revealed that BTG3 knockdown upregulated PAK2, RPS6KA5, YWHAB, and signal transducer and activator of transcription (STAT)3 protein levels, but downregulated RAP1A, DUSP6, and STAT1 protein expression, which was consistent with the genomic microarray data. BTG3 expression might contribute to CRC carcinogenesis. BTG3 knockdown might strengthen the aggressive colorectal cancer behavior.
The Microarray Revolution: Perspectives from Educators
ERIC Educational Resources Information Center
Brewster, Jay L.; Beason, K. Beth; Eckdahl, Todd T.; Evans, Irene M.
2004-01-01
In recent years, microarray analysis has become a key experimental tool, enabling the analysis of genome-wide patterns of gene expression. This review approaches the microarray revolution with a focus upon four topics: 1) the early development of this technology and its application to cancer diagnostics; 2) a primer of microarray research,…
Ho, Karen S; South, Sarah T; Lortz, Amanda; Hensel, Charles H; Sdano, Mallory R; Vanzo, Rena J; Martin, Megan M; Peiffer, Andreas; Lambert, Christophe G; Calhoun, Amy; Carey, John C; Battaglia, Agatino
2016-04-01
Wolf-Hirschhorn syndrome (WHS) is a contiguous gene deletion syndrome involving variable size deletions of the 4p16.3 region. Seizures are frequently, but not always, associated with WHS. We hypothesised that the size and location of the deleted region may correlate with seizure presentation. Using chromosomal microarray analysis, we finely mapped the breakpoints of copy number variants (CNVs) in 48 individuals with WHS. Seizure phenotype data were collected through parent-reported answers to a comprehensive questionnaire and supplemented with available medical records. We observed a significant correlation between the presence of an interstitial 4p deletion and lack of a seizure phenotype (Fisher's exact test p=3.59e-6). In our cohort, there were five individuals with interstitial deletions with a distal breakpoint at least 751 kbp proximal to the 4p terminus. Four of these individuals have never had an observable seizure, and the fifth individual had a single febrile seizure at the age of 1.5 years. All other individuals in our cohort whose deletions encompass the terminal 751 kbp region report having seizures typical of WHS. Additional examples from the literature corroborate these observations and further refine the candidate seizure susceptibility region to a region 197 kbp in size, starting 368 kbp from the terminus of chromosome 4. We identify a small terminal region of chromosome 4p that represents a seizure susceptibility region. Deletion of this region in the context of WHS is sufficient for seizure occurrence. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
La Serna-Infantes, Jorge; Pastor, Miguel Chávez; Trubnykova, Milana; Velásquez, Félix Chavesta; Sotomayor, Flor Vásquez; Barriga, Hugo Abarca
2018-07-01
Trichothiodystrophy type 4 is a rare autosomal recessive and ectodermal disorder, characterized by dry, brittle, sparse and sulfur-deficient hair and other features like intellectual disability, ichthyotic skin and short stature, caused by a homozygous mutation in MPLKIP gene. Glutaric aciduria type 3 is caused by a homozygous mutation in SUGCT gene with no distinctive phenotype. Both genes are localized on chromosome 7 (7p14). We report an 8-year-old female with short stature, microcephaly, development delay, intellectual disability and hair characterized for dark, short, coarse, sparse and brittle associated to classical trichorrhexis microscopy pattern. Chromosome microarray analysis showed a 125 kb homozygous pathogenic deletion, which includes genes MPLKIP and SUGCT, not described before. This is the first case described in Peru of a novel contiguous gene deletion of Trichothiodystrophy type 4 and Glutaric aciduria type 3 performed by chromosome microarray analysis, highlighting the contribution and importance of molecular technologies on diagnosis of rare genetic conditions. Copyright © 2018 Elsevier Masson SAS. All rights reserved.
Choi, Young-Jun; Fuchs, Jeremy F.; Mayhew, George F.; Yu, Helen E.; Christensen, Bruce M.
2012-01-01
Hemocytes are integral components of mosquito immune mechanisms such as phagocytosis, melanization, and production of antimicrobial peptides. However, our understanding of hemocyte-specific molecular processes and their contribution to shaping the host immune response remains limited. To better understand the immunophysiological features distinctive of hemocytes, we conducted genome-wide analysis of hemocyte-enriched transcripts, and examined how tissue-enriched expression patterns change with the immune status of the host. Our microarray data indicate that the hemocyte-enriched trascriptome is dynamic and context-dependent. Analysis of transcripts enriched after bacterial challenge in circulating hemocytes with respect to carcass added a dimension to evaluating infection-responsive genes and immune-related gene families. We resolved patterns of transcriptional change unique to hemocytes from those that are likely shared by other immune responsive tissues, and identified clusters of genes preferentially induced in hemocytes, likely reflecting their involvement in cell type specific functions. In addition, the study revealed conserved hemocyte-enriched molecular repertoires which might be implicated in core hemocyte function by cross-species meta-analysis of microarray expression data from Anopheles gambiae and Drosophila melanogaster. PMID:22796331
Allanson, Judith; Smith, Amanda; Hare, Heather; Albrecht, Beate; Bijlsma, Emilia; Dallapiccola, Bruno; Donti, Emilio; Fitzpatrick, David; Isidor, Bertrand; Lachlan, Katherine; Le Caignec, Cedric; Prontera, Paolo; Raas-Rothschild, Annick; Rogaia, Daniela; van Bon, Bregje; Aradhya, Swaroop; Crocker, Susan F; Jarinova, Olga; McGowan-Jordan, Jean; Boycott, Kym; Bulman, Dennis; Fagerberg, Christina Ringmann
2012-09-01
Nablus mask-like facial syndrome (NMLFS) has many distinctive phenotypic features, particularly tight glistening skin with reduced facial expression, blepharophimosis, telecanthus, bulky nasal tip, abnormal external ear architecture, upswept frontal hairline, and sparse eyebrows. Over the last few years, several individuals with NMLFS have been reported to have a microdeletion of 8q21.3q22.1, demonstrated by microarray analysis. The minimal overlapping region is 93.98-96.22 Mb (hg19). Here we present clinical and microarray data from five singletons and two mother-child pairs who have heterozygous deletions significantly overlapping the region associated with NMLFS. Notably, while one mother and child were said to have mild tightening of facial skin, none of these individuals exhibited reduced facial expression or the classical facial phenotype of NMLFS. These findings indicate that deletion of the 8q21.3q22.1 region is necessary but not sufficient for development of the NMLFS. We discuss possible genetic mechanisms underlying the complex pattern of inheritance for this condition. Copyright © 2012 Wiley Periodicals, Inc.
A data model and database for high-resolution pathology analytical image informatics.
Wang, Fusheng; Kong, Jun; Cooper, Lee; Pan, Tony; Kurc, Tahsin; Chen, Wenjin; Sharma, Ashish; Niedermayr, Cristobal; Oh, Tae W; Brat, Daniel; Farris, Alton B; Foran, David J; Saltz, Joel
2011-01-01
The systematic analysis of imaged pathology specimens often results in a vast amount of morphological information at both the cellular and sub-cellular scales. While microscopy scanners and computerized analysis are capable of capturing and analyzing data rapidly, microscopy image data remain underutilized in research and clinical settings. One major obstacle which tends to reduce wider adoption of these new technologies throughout the clinical and scientific communities is the challenge of managing, querying, and integrating the vast amounts of data resulting from the analysis of large digital pathology datasets. This paper presents a data model, which addresses these challenges, and demonstrates its implementation in a relational database system. This paper describes a data model, referred to as Pathology Analytic Imaging Standards (PAIS), and a database implementation, which are designed to support the data management and query requirements of detailed characterization of micro-anatomic morphology through many interrelated analysis pipelines on whole-slide images and tissue microarrays (TMAs). (1) Development of a data model capable of efficiently representing and storing virtual slide related image, annotation, markup, and feature information. (2) Development of a database, based on the data model, capable of supporting queries for data retrieval based on analysis and image metadata, queries for comparison of results from different analyses, and spatial queries on segmented regions, features, and classified objects. The work described in this paper is motivated by the challenges associated with characterization of micro-scale features for comparative and correlative analyses involving whole-slides tissue images and TMAs. Technologies for digitizing tissues have advanced significantly in the past decade. Slide scanners are capable of producing high-magnification, high-resolution images from whole slides and TMAs within several minutes. Hence, it is becoming increasingly feasible for basic, clinical, and translational research studies to produce thousands of whole-slide images. Systematic analysis of these large datasets requires efficient data management support for representing and indexing results from hundreds of interrelated analyses generating very large volumes of quantifications such as shape and texture and of classifications of the quantified features. We have designed a data model and a database to address the data management requirements of detailed characterization of micro-anatomic morphology through many interrelated analysis pipelines. The data model represents virtual slide related image, annotation, markup and feature information. The database supports a wide range of metadata and spatial queries on images, annotations, markups, and features. We currently have three databases running on a Dell PowerEdge T410 server with CentOS 5.5 Linux operating system. The database server is IBM DB2 Enterprise Edition 9.7.2. The set of databases consists of 1) a TMA database containing image analysis results from 4740 cases of breast cancer, with 641 MB storage size; 2) an algorithm validation database, which stores markups and annotations from two segmentation algorithms and two parameter sets on 18 selected slides, with 66 GB storage size; and 3) an in silico brain tumor study database comprising results from 307 TCGA slides, with 365 GB storage size. The latter two databases also contain human-generated annotations and markups for regions and nuclei. Modeling and managing pathology image analysis results in a database provide immediate benefits on the value and usability of data in a research study. The database provides powerful query capabilities, which are otherwise difficult or cumbersome to support by other approaches such as programming languages. Standardized, semantic annotated data representation and interfaces also make it possible to more efficiently share image data and analysis results.
Recent progress in making protein microarray through BioLP
NASA Astrophysics Data System (ADS)
Yang, Rusong; Wei, Lian; Feng, Ying; Li, Xiujian; Zhou, Quan
2017-02-01
Biological laser printing (BioLP) is a promising biomaterial printing technique. It has the advantage of high resolution, high bioactivity, high printing frequency and small transported liquid amount. In this paper, a set of BioLP device is design and made, and protein microarrays are printed by this device. It's found that both laser intensity and fluid layer thickness have an influence on the microarrays acquired. Besides, two kinds of the fluid layer coating methods are compared, and the results show that blade coating method is better than well-coating method in BioLP. A microarray of 0.76pL protein microarray and a "NUDT" patterned microarray are printed to testify the printing ability of BioLP.
Aksu, Yaman; Miller, David J; Kesidis, George; Yang, Qing X
2010-05-01
Feature selection for classification in high-dimensional spaces can improve generalization, reduce classifier complexity, and identify important, discriminating feature "markers." For support vector machine (SVM) classification, a widely used technique is recursive feature elimination (RFE). We demonstrate that RFE is not consistent with margin maximization, central to the SVM learning approach. We thus propose explicit margin-based feature elimination (MFE) for SVMs and demonstrate both improved margin and improved generalization, compared with RFE. Moreover, for the case of a nonlinear kernel, we show that RFE assumes that the squared weight vector 2-norm is strictly decreasing as features are eliminated. We demonstrate this is not true for the Gaussian kernel and, consequently, RFE may give poor results in this case. MFE for nonlinear kernels gives better margin and generalization. We also present an extension which achieves further margin gains, by optimizing only two degrees of freedom--the hyperplane's intercept and its squared 2-norm--with the weight vector orientation fixed. We finally introduce an extension that allows margin slackness. We compare against several alternatives, including RFE and a linear programming method that embeds feature selection within the classifier design. On high-dimensional gene microarray data sets, University of California at Irvine (UCI) repository data sets, and Alzheimer's disease brain image data, MFE methods give promising results.
A Next-generation Tissue Microarray (ngTMA) Protocol for Biomarker Studies
Zlobec, Inti; Suter, Guido; Perren, Aurel; Lugli, Alessandro
2014-01-01
Biomarker research relies on tissue microarrays (TMA). TMAs are produced by repeated transfer of small tissue cores from a ‘donor’ block into a ‘recipient’ block and then used for a variety of biomarker applications. The construction of conventional TMAs is labor intensive, imprecise, and time-consuming. Here, a protocol using next-generation Tissue Microarrays (ngTMA) is outlined. ngTMA is based on TMA planning and design, digital pathology, and automated tissue microarraying. The protocol is illustrated using an example of 134 metastatic colorectal cancer patients. Histological, statistical and logistical aspects are considered, such as the tissue type, specific histological regions, and cell types for inclusion in the TMA, the number of tissue spots, sample size, statistical analysis, and number of TMA copies. Histological slides for each patient are scanned and uploaded onto a web-based digital platform. There, they are viewed and annotated (marked) using a 0.6-2.0 mm diameter tool, multiple times using various colors to distinguish tissue areas. Donor blocks and 12 ‘recipient’ blocks are loaded into the instrument. Digital slides are retrieved and matched to donor block images. Repeated arraying of annotated regions is automatically performed resulting in an ngTMA. In this example, six ngTMAs are planned containing six different tissue types/histological zones. Two copies of the ngTMAs are desired. Three to four slides for each patient are scanned; 3 scan runs are necessary and performed overnight. All slides are annotated; different colors are used to represent the different tissues/zones, namely tumor center, invasion front, tumor/stroma, lymph node metastases, liver metastases, and normal tissue. 17 annotations/case are made; time for annotation is 2-3 min/case. 12 ngTMAs are produced containing 4,556 spots. Arraying time is 15-20 hr. Due to its precision, flexibility and speed, ngTMA is a powerful tool to further improve the quality of TMAs used in clinical and translational research. PMID:25285857
The second phase of the MicroArray Quality Control (MAQC-II) project evaluated common practices for developing and validating microarray-based models aimed at predicting toxicological and clinical endpoints. Thirty-six teams developed classifiers for 13 endpoints - some easy, som...
Ma, Jianping; Wang, Jufang; Liu, Yanfen; Wang, Changyi; Duan, Donghui; Lu, Nanjia; Wang, Kaiyue; Zhang, Lu; Gu, Kaibo; Chen, Sihan; Zhang, Tao; You, Dingyun; Han, Liyuan
2017-02-01
The aim of this study was to compare the expression levels of serum miRNAs in diabetic retinopathy and type 2 diabetes mellitus. Serum miRNA expression profiles from diabetic retinopathy cases (type 2 diabetes mellitus patients with diabetic retinopathy) and type 2 diabetes mellitus controls (type 2 diabetes mellitus patients without diabetic retinopathy) were examined by miRNA-specific microarray analysis. Quantitative real-time polymerase chain reaction was used to validate the significantly differentially expressed serum miRNAs from the microarray analysis of 45 diabetic retinopathy cases and 45 age-, sex-, body mass index- and duration-of-diabetes-matched type 2 diabetes mellitus controls. The relative changes in serum miRNA expression levels were analyzed using the 2-ΔΔCt method. A total of 5 diabetic retinopathy cases and 5 type 2 diabetes mellitus controls were included in the miRNA-specific microarray analysis. The serum levels of miR-3939 and miR-1910-3p differed significantly between the two groups in the screening stage; however, quantitative real-time polymerase chain reaction did not reveal significant differences in miRNA expression for 45 diabetic retinopathy cases and their matched type 2 diabetes mellitus controls. Our findings indicate that miR-3939 and miR-1910-3p may not play important roles in the development of diabetic retinopathy; however, studies with a larger sample size are needed to confirm our findings.
Microarray platform for omics analysis
NASA Astrophysics Data System (ADS)
Mecklenburg, Michael; Xie, Bin
2001-09-01
Microarray technology has revolutionized genetic analysis. However, limitations in genome analysis has lead to renewed interest in establishing 'omic' strategies. As we enter the post-genomic era, new microarray technologies are needed to address these new classes of 'omic' targets, such as proteins, as well as lipids and carbohydrates. We have developed a microarray platform that combines self- assembling monolayers with the biotin-streptavidin system to provide a robust, versatile immobilization scheme. A hydrophobic film is patterned on the surface creating an array of tension wells that eliminates evaporation effects thereby reducing the shear stress to which biomolecules are exposed to during immobilization. The streptavidin linker layer makes it possible to adapt and/or develop microarray based assays using virtually any class of biomolecules including: carbohydrates, peptides, antibodies, receptors, as well as them ore traditional DNA based arrays. Our microarray technology is designed to furnish seamless compatibility across the various 'omic' platforms by providing a common blueprint for fabricating and analyzing arrays. The prototype microarray uses a microscope slide footprint patterned with 2 by 96 flat wells. Data on the microarray platform will be presented.
Um, Taewoong; Hong, Jiwoo; Im, Do Jin; Lee, Sang Joon; Kang, In Seok
2016-01-01
The dispensing of tiny droplets is a basic and crucial process in a myriad of applications, such as DNA/protein microarray, cell cultures, chemical synthesis of microparticles, and digital microfluidics. This work systematically demonstrates droplet dispensing into immiscible fluids through electric charge concentration (ECC) method. It exhibits three main modes (i.e., attaching, uniform, and bursting modes) as a function of flow rates, applied voltages, and gap distances between the nozzle and the oil surface. Through a conventional nozzle with diameter of a few millimeters, charged droplets with volumes ranging from a few μL to a few tens of nL can be uniformly dispensed into the oil chamber without reduction in nozzle size. Based on the features of the proposed method (e.g., formation of droplets with controllable polarity and amount of electric charge in water and oil system), a simple and straightforward method is developed for microparticle synthesis, including preparation of colloidosomes and fabrication of Janus microparticles with anisotropic internal structures. Finally, a combined system consisting of ECC-induced droplet dispensing and electrophoresis of charged droplet (ECD)-driven manipulation systems is constructed. This integrated platform will provide increased utility and flexibility in microfluidic applications because a charged droplet can be delivered toward the intended position by programmable electric control. PMID:27534580
Analysis of FOXF1 and the FOX gene cluster in patients with VACTERL association
Agochukwu, Nneamaka B.; Pineda-Alvarez, Daniel E.; Keaton, Amelia A.; Warren-Mora, Nicole; Raam, Manu S.; Kamat, Aparna; Chandrasekharappa, Settara C.; Solomon, Benjamin D.
2011-01-01
VACTERL association, a relatively common condition with an incidence of approximately 1 in 20,000 – 35,000 births, is a non-random association of birth defects that includes vertebral defects (V), anal atresia (A), cardiac defects (C), tracheo-esophageal fistula (TE), renal anomalies (R) and limb malformations (L). Although the etiology is unknown in the majority of patients, there is evidence that it is causally heterogeneous. Several studies have shown evidence for inheritance in VACTERL, implying a role for genetic loci. Recently, patients with component features of VACTERL and a lethal developmental pulmonary disorder, alveolar capillary dysplasia with misalignment of pulmonary veins (ACD/MPV), were found to harbor deletions or mutations affecting FOXF1 and the FOX gene cluster on chromosome 16q24. We investigated this gene through direct sequencing and high-density SNP microarray in 12 patients with VACTERL association but without ACD/MPV. Our mutational analysis of FOXF1 showed normal sequences and no genomic imbalances affecting the FOX gene cluster on chromosome 16q24 in the studied patients. Possible explanations for these results include the etiologic and clinical heterogeneity of VACTERL association, the possibility that mutations affecting this gene may occur only in more severely affected individuals, and insufficient study sample size. PMID:21315191
Seefeld, Ting H.; Halpern, Aaron R.; Corn, Robert M.
2012-01-01
Protein microarrays are fabricated from double-stranded DNA (dsDNA) microarrays by a one-step, multiplexed enzymatic synthesis in an on-chip microfluidic format and then employed for antibody biosensing measurements with surface plasmon resonance imaging (SPRI). A microarray of dsDNA elements (denoted as generator elements) that encode either a His-tagged green fluorescent protein (GFP) or a His-tagged luciferase protein is utilized to create multiple copies of messenger RNA (mRNA) in a surface RNA polymerase reaction; the mRNA transcripts are then translated into proteins by cell-free protein synthesis in a microfluidic format. The His-tagged proteins diffuse to adjacent Cu(II)-NTA microarray elements (denoted as detector elements) and are specifically adsorbed. The net result is the on-chip, cell-free synthesis of a protein microarray that can be used immediately for SPRI protein biosensing. The dual element format greatly reduces any interference from the nonspecific adsorption of enzyme or proteins. SPRI measurements for the detection of the antibodies anti-GFP and anti-luciferase were used to verify the formation of the protein microarray. This convenient on-chip protein microarray fabrication method can be implemented for multiplexed SPRI biosensing measurements in both clinical and research applications. PMID:22793370
Fully Automated Complementary DNA Microarray Segmentation using a Novel Fuzzy-based Algorithm.
Saberkari, Hamidreza; Bahrami, Sheyda; Shamsi, Mousa; Amoshahy, Mohammad Javad; Ghavifekr, Habib Badri; Sedaaghi, Mohammad Hossein
2015-01-01
DNA microarray is a powerful approach to study simultaneously, the expression of 1000 of genes in a single experiment. The average value of the fluorescent intensity could be calculated in a microarray experiment. The calculated intensity values are very close in amount to the levels of expression of a particular gene. However, determining the appropriate position of every spot in microarray images is a main challenge, which leads to the accurate classification of normal and abnormal (cancer) cells. In this paper, first a preprocessing approach is performed to eliminate the noise and artifacts available in microarray cells using the nonlinear anisotropic diffusion filtering method. Then, the coordinate center of each spot is positioned utilizing the mathematical morphology operations. Finally, the position of each spot is exactly determined through applying a novel hybrid model based on the principle component analysis and the spatial fuzzy c-means clustering (SFCM) algorithm. Using a Gaussian kernel in SFCM algorithm will lead to improving the quality in complementary DNA microarray segmentation. The performance of the proposed algorithm has been evaluated on the real microarray images, which is available in Stanford Microarray Databases. Results illustrate that the accuracy of microarray cells segmentation in the proposed algorithm reaches to 100% and 98% for noiseless/noisy cells, respectively.
Zhang, Aiying; Yin, Chengzeng; Wang, Zhenshun; Zhang, Yonghong; Zhao, Yuanshun; Li, Ang; Sun, Huanqin; Lin, Dongdong; Li, Ning
2016-12-01
Objective To develop a simple, effective, time-saving and low-cost fluorescence protein microarray method for detecting serum alpha-fetoprotein (AFP) in patients with hepatocellular carcinoma (HCC). Method Non-contact piezoelectric print techniques were applied to fluorescence protein microarray to reduce the cost of prey antibody. Serum samples from patients with HCC and healthy control subjects were collected and evaluated for the presence of AFP using a novel fluorescence protein microarray. To validate the fluorescence protein microarray, serum samples were tested for AFP using an enzyme-linked immunosorbent assay (ELISA). Results A total of 110 serum samples from patients with HCC ( n = 65) and healthy control subjects ( n = 45) were analysed. When the AFP cut-off value was set at 20 ng/ml, the fluorescence protein microarray had a sensitivity of 91.67% and a specificity of 93.24% for detecting serum AFP. Serum AFP quantified via fluorescence protein microarray had a similar diagnostic performance compared with ELISA in distinguishing patients with HCC from healthy control subjects (area under receiver operating characteristic curve: 0.906 for fluorescence protein microarray; 0.880 for ELISA). Conclusion A fluorescence protein microarray method was developed for detecting serum AFP in patients with HCC.
Zhang, Aiying; Yin, Chengzeng; Wang, Zhenshun; Zhang, Yonghong; Zhao, Yuanshun; Li, Ang; Sun, Huanqin; Lin, Dongdong
2016-01-01
Objective To develop a simple, effective, time-saving and low-cost fluorescence protein microarray method for detecting serum alpha-fetoprotein (AFP) in patients with hepatocellular carcinoma (HCC). Method Non-contact piezoelectric print techniques were applied to fluorescence protein microarray to reduce the cost of prey antibody. Serum samples from patients with HCC and healthy control subjects were collected and evaluated for the presence of AFP using a novel fluorescence protein microarray. To validate the fluorescence protein microarray, serum samples were tested for AFP using an enzyme-linked immunosorbent assay (ELISA). Results A total of 110 serum samples from patients with HCC (n = 65) and healthy control subjects (n = 45) were analysed. When the AFP cut-off value was set at 20 ng/ml, the fluorescence protein microarray had a sensitivity of 91.67% and a specificity of 93.24% for detecting serum AFP. Serum AFP quantified via fluorescence protein microarray had a similar diagnostic performance compared with ELISA in distinguishing patients with HCC from healthy control subjects (area under receiver operating characteristic curve: 0.906 for fluorescence protein microarray; 0.880 for ELISA). Conclusion A fluorescence protein microarray method was developed for detecting serum AFP in patients with HCC. PMID:27885040
García-Hoyos, María; Cortón, Marta; Ávila-Fernández, Almudena; Riveiro-Álvarez, Rosa; Giménez, Ascensión; Hernan, Inma; Carballo, Miguel; Ayuso, Carmen
2012-01-01
Purpose Presently, 22 genes have been described in association with autosomal dominant retinitis pigmentosa (adRP); however, they explain only 50% of all cases, making genetic diagnosis of this disease difficult and costly. The aim of this study was to evaluate a specific genotyping microarray for its application to the molecular diagnosis of adRP in Spanish patients. Methods We analyzed 139 unrelated Spanish families with adRP. Samples were studied by using a genotyping microarray (adRP). All mutations found were further confirmed with automatic sequencing. Rhodopsin (RHO) sequencing was performed in all negative samples for the genotyping microarray. Results The adRP genotyping microarray detected the mutation associated with the disease in 20 of the 139 families with adRP. As in other populations, RHO was found to be the most frequently mutated gene in these families (7.9% of the microarray genotyped families). The rate of false positives (microarray results not confirmed with sequencing) and false negatives (mutations in RHO detected with sequencing but not with the genotyping microarray) were established, and high levels of analytical sensitivity (95%) and specificity (100%) were found. Diagnostic accuracy was 15.1%. Conclusions The adRP genotyping microarray is a quick, cost-efficient first step in the molecular diagnosis of Spanish patients with adRP. PMID:22736939
Honoré, Paul; Granjeaud, Samuel; Tagett, Rebecca; Deraco, Stéphane; Beaudoing, Emmanuel; Rougemont, Jacques; Debono, Stéphane; Hingamp, Pascal
2006-09-20
High throughput gene expression profiling (GEP) is becoming a routine technique in life science laboratories. With experimental designs that repeatedly span thousands of genes and hundreds of samples, relying on a dedicated database infrastructure is no longer an option.GEP technology is a fast moving target, with new approaches constantly broadening the field diversity. This technology heterogeneity, compounded by the informatics complexity of GEP databases, means that software developments have so far focused on mainstream techniques, leaving less typical yet established techniques such as Nylon microarrays at best partially supported. MAF (MicroArray Facility) is the laboratory database system we have developed for managing the design, production and hybridization of spotted microarrays. Although it can support the widely used glass microarrays and oligo-chips, MAF was designed with the specific idiosyncrasies of Nylon based microarrays in mind. Notably single channel radioactive probes, microarray stripping and reuse, vector control hybridizations and spike-in controls are all natively supported by the software suite. MicroArray Facility is MIAME supportive and dynamically provides feedback on missing annotations to help users estimate effective MIAME compliance. Genomic data such as clone identifiers and gene symbols are also directly annotated by MAF software using standard public resources. The MAGE-ML data format is implemented for full data export. Journalized database operations (audit tracking), data anonymization, material traceability and user/project level confidentiality policies are also managed by MAF. MicroArray Facility is a complete data management system for microarray producers and end-users. Particular care has been devoted to adequately model Nylon based microarrays. The MAF system, developed and implemented in both private and academic environments, has proved a robust solution for shared facilities and industry service providers alike.
Honoré, Paul; Granjeaud, Samuel; Tagett, Rebecca; Deraco, Stéphane; Beaudoing, Emmanuel; Rougemont, Jacques; Debono, Stéphane; Hingamp, Pascal
2006-01-01
Background High throughput gene expression profiling (GEP) is becoming a routine technique in life science laboratories. With experimental designs that repeatedly span thousands of genes and hundreds of samples, relying on a dedicated database infrastructure is no longer an option. GEP technology is a fast moving target, with new approaches constantly broadening the field diversity. This technology heterogeneity, compounded by the informatics complexity of GEP databases, means that software developments have so far focused on mainstream techniques, leaving less typical yet established techniques such as Nylon microarrays at best partially supported. Results MAF (MicroArray Facility) is the laboratory database system we have developed for managing the design, production and hybridization of spotted microarrays. Although it can support the widely used glass microarrays and oligo-chips, MAF was designed with the specific idiosyncrasies of Nylon based microarrays in mind. Notably single channel radioactive probes, microarray stripping and reuse, vector control hybridizations and spike-in controls are all natively supported by the software suite. MicroArray Facility is MIAME supportive and dynamically provides feedback on missing annotations to help users estimate effective MIAME compliance. Genomic data such as clone identifiers and gene symbols are also directly annotated by MAF software using standard public resources. The MAGE-ML data format is implemented for full data export. Journalized database operations (audit tracking), data anonymization, material traceability and user/project level confidentiality policies are also managed by MAF. Conclusion MicroArray Facility is a complete data management system for microarray producers and end-users. Particular care has been devoted to adequately model Nylon based microarrays. The MAF system, developed and implemented in both private and academic environments, has proved a robust solution for shared facilities and industry service providers alike. PMID:16987406
DeScipio, Cheryl; Conlin, Laura; Rosenfeld, Jill; Tepperberg, James; Pasion, Romela; Patel, Ankita; McDonald, Marie T; Aradhya, Swaroop; Ho, Darlene; Goldstein, Jennifer; McGuire, Marianne; Mulchandani, Surabhi; Medne, Livija; Rupps, Rosemarie; Serrano, Alvaro H.; Thorland, Erik C; Tsai, Anne C-H; Hilhorst-Hofstee, Yvonne; Ruivenkamp, Claudia AL; Van Esch, Hilde; Addor, Marie-Claude; Martinet, Danielle; Mason, Thornton B.A.; Clark, Dinah; Spinner, Nancy B; Krantz, Ian D
2012-01-01
We describe 19 unrelated individuals with submicroscopic deletions involving 10p15.3 characterized by chromosomal microarray (CMA). Interestingly, to our knowledge, only two individuals with isolated, submicroscopic 10p15.3 deletion have been reported to date; however, only limited clinical information is available for these probands and the deleted region has not been molecularly mapped. Comprehensive clinical history was obtained for 12 of the 19 individuals described in this study. Common features among these 12 individuals include: cognitive/behavioral/developmental differences (11/11), speech delay/language disorder (10/10), motor delay (10/10), craniofacial dysmorphism (9/12), hypotonia (7/11,), brain anomalies (4/6) and seizures (3/7). Parental studies were performed for nine of the 19 individuals; the 10p15.3 deletion was de novo in seven of the probands, not maternally inherited in one proband and inherited from an apparently affected mother in one proband. Molecular mapping of the 19 individuals reported in this study has identified two genes, ZMYND11 (OMIM# 608668) and DIP2C (OMIM# 611380) (UCSC Genome Browser), mapping within 10p15.3 which are most commonly deleted. Although no single gene has been identified which is deleted in all 19 individuals studied, the deleted region in all but one individual includes ZMYND11 and the deleted region in all but one other individual includes DIP2C. There is not a clearly identifiable phenotypic difference between these two individuals and the size of the deleted region does not generally predict clinical features. Little is currently known about these genes complicating a direct genotype/phenotype correlation at this time. These data however, suggest that ZMYND11 and/or DIP2C haploinsufficiency contributes to the clinical features associated with 10p15 deletions in probands described in this study. PMID:22847950
DeScipio, Cheryl; Conlin, Laura; Rosenfeld, Jill; Tepperberg, James; Pasion, Romela; Patel, Ankita; McDonald, Marie T; Aradhya, Swaroop; Ho, Darlene; Goldstein, Jennifer; McGuire, Marianne; Mulchandani, Surabhi; Medne, Livija; Rupps, Rosemarie; Serrano, Alvaro H; Thorland, Erik C; Tsai, Anne C-H; Hilhorst-Hofstee, Yvonne; Ruivenkamp, Claudia A L; Van Esch, Hilde; Addor, Marie-Claude; Martinet, Danielle; Mason, Thornton B A; Clark, Dinah; Spinner, Nancy B; Krantz, Ian D
2012-09-01
We describe 19 unrelated individuals with submicroscopic deletions involving 10p15.3 characterized by chromosomal microarray (CMA). Interestingly, to our knowledge, only two individuals with isolated, submicroscopic 10p15.3 deletion have been reported to date; however, only limited clinical information is available for these probands and the deleted region has not been molecularly mapped. Comprehensive clinical history was obtained for 12 of the 19 individuals described in this study. Common features among these 12 individuals include: cognitive/behavioral/developmental differences (11/11), speech delay/language disorder (10/10), motor delay (10/10), craniofacial dysmorphism (9/12), hypotonia (7/11), brain anomalies (4/6) and seizures (3/7). Parental studies were performed for nine of the 19 individuals; the 10p15.3 deletion was de novo in seven of the probands, not maternally inherited in one proband and inherited from an apparently affected mother in one proband. Molecular mapping of the 19 individuals reported in this study has identified two genes, ZMYND11 (OMIM 608668) and DIP2C (OMIM 611380; UCSC Genome Browser), mapping within 10p15.3 which are most commonly deleted. Although no single gene has been identified which is deleted in all 19 individuals studied, the deleted region in all but one individual includes ZMYND11 and the deleted region in all but one other individual includes DIP2C. There is not a clearly identifiable phenotypic difference between these two individuals and the size of the deleted region does not generally predict clinical features. Little is currently known about these genes complicating a direct genotype/phenotype correlation at this time. These data however, suggest that ZMYND11 and/or DIP2C haploinsufficiency contributes to the clinical features associated with 10p15 deletions in probands described in this study. Copyright © 2012 Wiley Periodicals, Inc.
DAnTE: a statistical tool for quantitative analysis of –omics data
DOE Office of Scientific and Technical Information (OSTI.GOV)
Polpitiya, Ashoka D.; Qian, Weijun; Jaitly, Navdeep
2008-05-03
DAnTE (Data Analysis Tool Extension) is a statistical tool designed to address challenges unique to quantitative bottom-up, shotgun proteomics data. This tool has also been demonstrated for microarray data and can easily be extended to other high-throughput data types. DAnTE features selected normalization methods, missing value imputation algorithms, peptide to protein rollup methods, an extensive array of plotting functions, and a comprehensive ANOVA scheme that can handle unbalanced data and random effects. The Graphical User Interface (GUI) is designed to be very intuitive and user friendly.
Myelodysplastic syndrome in an infant with constitutional pure duplication 1q41-qter.
Morokawa, Hirokazu; Kamiya, Motoko; Wakui, Keiko; Kobayashi, Mikiko; Kurata, Takashi; Matsuda, Kazuyuki; Kawamura, Rie; Kanno, Hiroyuki; Fukushima, Yoshimitsu; Nakazawa, Yozo; Kosho, Tomoki
2018-01-01
We report on a Japanese female infant as the fourth patient with the constitutional pure duplication 1q41-qter confirmed by chromosomal microarray and as the first who developed myelodysplastic syndrome (MDS) among those with the constitutional 1q duplication. Common clinical features of the constitutional pure duplication 1q41-qter include developmental delay, craniofacial characteristics, foot malformation, hypertrichosis, and respiratory insufficiency. The association between MDS and the duplication of the genes in the 1q41-qter region remains unknown.
Pota, Pruthvi; Grammatopoulou, Vasiliki; Torti, Erin; Braddock, Stephen; Batanian, Jacqueline R
2014-01-01
Constitutional chromosome instability so far has mainly been associated with ring formation. In addition, isochromosome formation involving the short arm with translocation of the entire long arm is rarely observed. This type of rearrangement has been reported for chromosomes 4, 5, 7, 9, 10, 12, and 20. Here, we present the third patient having an isochromosome 4p with 4q translocation, but showing for the first time chromosome instability detected by FISH following chromosome microarray analysis.
Microarrays in brain research: the good, the bad and the ugly.
Mirnics, K
2001-06-01
Making sense of microarray data is a complex process, in which the interpretation of findings will depend on the overall experimental design and judgement of the investigator performing the analysis. As a result, differences in tissue harvesting, microarray types, sample labelling and data analysis procedures make post hoc sharing of microarray data a great challenge. To ensure rapid and meaningful data exchange, we need to create some order out of the existing chaos. In these ground-breaking microarray standardization and data sharing efforts, NIH agencies should take a leading role
The FDA's Experience with Emerging Genomics Technologies-Past, Present, and Future.
Xu, Joshua; Thakkar, Shraddha; Gong, Binsheng; Tong, Weida
2016-07-01
The rapid advancement of emerging genomics technologies and their application for assessing safety and efficacy of FDA-regulated products require a high standard of reliability and robustness supporting regulatory decision-making in the FDA. To facilitate the regulatory application, the FDA implemented a novel data submission program, Voluntary Genomics Data Submission (VGDS), and also to engage the stakeholders. As part of the endeavor, for the past 10 years, the FDA has led an international consortium of regulatory agencies, academia, pharmaceutical companies, and genomics platform providers, which was named MicroArray Quality Control Consortium (MAQC), to address issues such as reproducibility, precision, specificity/sensitivity, and data interpretation. Three projects have been completed so far assessing these genomics technologies: gene expression microarrays, whole genome genotyping arrays, and whole transcriptome sequencing (i.e., RNA-seq). The resultant studies provide the basic parameters for fit-for-purpose application of these new data streams in regulatory environments, and the solutions have been made available to the public through peer-reviewed publications. The latest MAQC project is also called the SEquencing Quality Control (SEQC) project focused on next-generation sequencing. Using reference samples with built-in controls, SEQC studies have demonstrated that relative gene expression can be measured accurately and reliably across laboratories and RNA-seq platforms. Besides prediction performance comparable to microarrays in clinical settings and safety assessments, RNA-seq is shown to have better sensitivity for low expression and reveal novel transcriptomic features. Future effort of MAQC will be focused on quality control of whole genome sequencing and targeted sequencing.
González-Plaza, Juan J; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R
2016-01-01
Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, were most of the existing varieties are traditional with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross pooled on the basis of their sharing architecture-related phenotypes. The microarray used, previously developed by our group has already been applied to identify candidates genes involved in regulating juvenile to adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated to differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports a functional conservation between species and potential biological relevance of the candidate genes identified. This study is the first to identify genes associated to plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species.
Liu, Ying; Ciliax, Brian J; Borges, Karin; Dasigi, Venu; Ram, Ashwin; Navathe, Shamkant B; Dingledine, Ray
2004-01-01
One of the key challenges of microarray studies is to derive biological insights from the unprecedented quatities of data on gene-expression patterns. Clustering genes by functional keyword association can provide direct information about the nature of the functional links among genes within the derived clusters. However, the quality of the keyword lists extracted from biomedical literature for each gene significantly affects the clustering results. We extracted keywords from MEDLINE that describes the most prominent functions of the genes, and used the resulting weights of the keywords as feature vectors for gene clustering. By analyzing the resulting cluster quality, we compared two keyword weighting schemes: normalized z-score and term frequency-inverse document frequency (TFIDF). The best combination of background comparison set, stop list and stemming algorithm was selected based on precision and recall metrics. In a test set of four known gene groups, a hierarchical algorithm correctly assigned 25 of 26 genes to the appropriate clusters based on keywords extracted by the TDFIDF weighting scheme, but only 23 og 26 with the z-score method. To evaluate the effectiveness of the weighting schemes for keyword extraction for gene clusters from microarray profiles, 44 yeast genes that are differentially expressed during the cell cycle were used as a second test set. Using established measures of cluster quality, the results produced from TFIDF-weighted keywords had higher purity, lower entropy, and higher mutual information than those produced from normalized z-score weighted keywords. The optimized algorithms should be useful for sorting genes from microarray lists into functionally discrete clusters.
The FDA’s Experience with Emerging Genomics Technologies—Past, Present, and Future
Xu, Joshua; Thakkar, Shraddha; Gong, Binsheng; Tong, Weida
2016-01-01
The rapid advancement of emerging genomics technologies and their application for assessing safety and efficacy of FDA-regulated products require a high standard of reliability and robustness supporting regulatory decision-making in the FDA. To facilitate the regulatory application, the FDA implemented a novel data submission program, Voluntary Genomics Data Submission (VGDS), and also to engage the stakeholders. As part of the endeavor, for the past 10 years, the FDA has led an international consortium of regulatory agencies, academia, pharmaceutical companies, and genomics platform providers, which was named MicroArray Quality Control Consortium (MAQC), to address issues such as reproducibility, precision, specificity/sensitivity, and data interpretation. Three projects have been completed so far assessing these genomics technologies: gene expression microarrays, whole genome genotyping arrays, and whole transcriptome sequencing (i.e., RNA-seq). The resultant studies provide the basic parameters for fit-for-purpose application of these new data streams in regulatory environments, and the solutions have been made available to the public through peer-reviewed publications. The latest MAQC project is also called the SEquencing Quality Control (SEQC) project focused on next-generation sequencing. Using reference samples with built-in controls, SEQC studies have demonstrated that relative gene expression can be measured accurately and reliably across laboratories and RNA-seq platforms. Besides prediction performance comparable to microarrays in clinical settings and safety assessments, RNA-seq is shown to have better sensitivity for low expression and reveal novel transcriptomic features. Future effort of MAQC will be focused on quality control of whole genome sequencing and targeted sequencing. PMID:27116022
Yi, SoJeong; An, Hyungmi; Lee, Howard; Lee, Sangin; Ieiri, Ichiro; Lee, Youngjo; Cho, Joo-Youn; Hirota, Takeshi; Fukae, Masato; Yoshida, Kenji; Nagatsuka, Shinichiro; Kimura, Miyuki; Irie, Shin; Sugiyama, Yuichi; Shin, Dong Wan; Lim, Kyoung Soo; Chung, Jae-Yong; Yu, Kyung-Sang; Jang, In-Jin
2014-10-01
Interethnic differences in genetic polymorphism in genes encoding drug-metabolizing enzymes and transporters are one of the major factors that cause ethnic differences in drug response. This study aimed to investigate genetic polymorphisms in genes involved in drug metabolism, transport, and excretion among Korean, Japanese, and Chinese populations, the three major East Asian ethnic groups. The frequencies of 1936 variants representing 225 genes encoding drug-metabolizing enzymes and transporters were determined from 786 healthy participants (448 Korean, 208 Japanese, and 130 Chinese) using the Affymetrix Drug-Metabolizing Enzymes and Transporters Plus microarray. To compare allele or genotype frequencies in the high-dimensional data among the three East Asian ethnic groups, multiple testing, principal component analysis (PCA), and regularized multinomial logit model through least absolute shrinkage and selection operator were used. On microarray analysis, 1071 of 1936 variants (>50% of markers) were found to be monomorphic. In a large number of genetic variants, the fixation index and Pearson's correlation coefficient of minor allele frequencies were less than 0.034 and greater than 0.95, respectively, among the three ethnic groups. PCA identified 47 genetic variants with multiple testing, but was unable to discriminate ethnic groups by the first three components. Multinomial least absolute shrinkage and selection operator analysis identified 269 genetic variants that showed different frequencies among the three ethnic groups. However, none of those variants distinguished between the three ethnic groups during subsequent PCA. Korean, Japanese, and Chinese populations are not pharmacogenetically distant from one another, at least with regard to drug disposition, metabolism, and elimination.
Ranjbar, Reza; Behzadi, Payam; Najafi, Ali; Roudi, Raheleh
2017-01-01
A rapid, accurate, flexible and reliable diagnostic method may significantly decrease the costs of diagnosis and treatment. Designing an appropriate microarray chip reduces noises and probable biases in the final result. The aim of this study was to design and construct a DNA Microarray Chip for a rapid detection and identification of 10 important bacterial agents. In the present survey, 10 unique genomic regions relating to 10 pathogenic bacterial agents including Escherichia coli (E.coli), Shigella boydii, Sh.dysenteriae, Sh.flexneri, Sh.sonnei, Salmonella typhi, S.typhimurium, Brucella sp., Legionella pneumophila, and Vibrio cholera were selected for designing specific long oligo microarray probes. For this reason, the in-silico operations including utilization of the NCBI RefSeq database, Servers of PanSeq and Gview, AlleleID 7.7 and Oligo Analyzer 3.1 was done. On the other hand, the in-vitro part of the study comprised stages of robotic microarray chip probe spotting, bacterial DNAs extraction and DNA labeling, hybridization and microarray chip scanning. In wet lab section, different tools and apparatus such as Nexterion® Slide E, Qarray mini spotter, NimbleGen kit, TrayMix TM S4, and Innoscan 710 were used. A DNA microarray chip including 10 long oligo microarray probes was designed and constructed for detection and identification of 10 pathogenic bacteria. The DNA microarray chip was capable to identify all 10 bacterial agents tested simultaneously. The presence of a professional bioinformatician as a probe designer is needed to design appropriate multifunctional microarray probes to increase the accuracy of the outcomes.
Richard, Arianne C; Lyons, Paul A; Peters, James E; Biasci, Daniele; Flint, Shaun M; Lee, James C; McKinney, Eoin F; Siegel, Richard M; Smith, Kenneth G C
2014-08-04
Although numerous investigations have compared gene expression microarray platforms, preprocessing methods and batch correction algorithms using constructed spike-in or dilution datasets, there remains a paucity of studies examining the properties of microarray data using diverse biological samples. Most microarray experiments seek to identify subtle differences between samples with variable background noise, a scenario poorly represented by constructed datasets. Thus, microarray users lack important information regarding the complexities introduced in real-world experimental settings. The recent development of a multiplexed, digital technology for nucleic acid measurement enables counting of individual RNA molecules without amplification and, for the first time, permits such a study. Using a set of human leukocyte subset RNA samples, we compared previously acquired microarray expression values with RNA molecule counts determined by the nCounter Analysis System (NanoString Technologies) in selected genes. We found that gene measurements across samples correlated well between the two platforms, particularly for high-variance genes, while genes deemed unexpressed by the nCounter generally had both low expression and low variance on the microarray. Confirming previous findings from spike-in and dilution datasets, this "gold-standard" comparison demonstrated signal compression that varied dramatically by expression level and, to a lesser extent, by dataset. Most importantly, examination of three different cell types revealed that noise levels differed across tissues. Microarray measurements generally correlate with relative RNA molecule counts within optimal ranges but suffer from expression-dependent accuracy bias and precision that varies across datasets. We urge microarray users to consider expression-level effects in signal interpretation and to evaluate noise properties in each dataset independently.
Gong, Wei; He, Kun; Covington, Mike; Dinesh-Kumar, S. P.; Snyder, Michael; Harmer, Stacey L.; Zhu, Yu-Xian; Deng, Xing Wang
2009-01-01
We used our collection of Arabidopsis transcription factor (TF) ORFeome clones to construct protein microarrays containing as many as 802 TF proteins. These protein microarrays were used for both protein-DNA and protein-protein interaction analyses. For protein-DNA interaction studies, we examined AP2/ERF family TFs and their cognate cis-elements. By careful comparison of the DNA-binding specificity of 13 TFs on the protein microarray with previous non-microarray data, we showed that protein microarrays provide an efficient and high throughput tool for genome-wide analysis of TF-DNA interactions. This microarray protein-DNA interaction analysis allowed us to derive a comprehensive view of DNA-binding profiles of AP2/ERF family proteins in Arabidopsis. It also revealed four TFs that bound the EE (evening element) and had the expected phased gene expression under clock-regulation, thus providing a basis for further functional analysis of their roles in clock regulation of gene expression. We also developed procedures for detecting protein interactions using this TF protein microarray and discovered four novel partners that interact with HY5, which can be validated by yeast two-hybrid assays. Thus, plant TF protein microarrays offer an attractive high-throughput alternative to traditional techniques for TF functional characterization on a global scale. PMID:19802365
Zhao, Zhengshan; Peytavi, Régis; Diaz-Quijada, Gerardo A.; Picard, Francois J.; Huletsky, Ann; Leblanc, Éric; Frenette, Johanne; Boivin, Guy; Veres, Teodor; Dumoulin, Michel M.; Bergeron, Michel G.
2008-01-01
Fabrication of microarray devices using traditional glass slides is not easily adaptable to integration into microfluidic systems. There is thus a need for the development of polymeric materials showing a high hybridization signal-to-background ratio, enabling sensitive detection of microbial pathogens. We have developed such plastic supports suitable for highly sensitive DNA microarray hybridizations. The proof of concept of this microarray technology was done through the detection of four human respiratory viruses that were amplified and labeled with a fluorescent dye via a sensitive reverse transcriptase PCR (RT-PCR) assay. The performance of the microarray hybridization with plastic supports made of PMMA [poly(methylmethacrylate)]-VSUVT or Zeonor 1060R was compared to that with high-quality glass slide microarrays by using both passive and microfluidic hybridization systems. Specific hybridization signal-to-background ratios comparable to that obtained with high-quality commercial glass slides were achieved with both polymeric substrates. Microarray hybridizations demonstrated an analytical sensitivity equivalent to approximately 100 viral genome copies per RT-PCR, which is at least 100-fold higher than the sensitivities of previously reported DNA hybridizations on plastic supports. Testing of these plastic polymers using a microfluidic microarray hybridization platform also showed results that were comparable to those with glass supports. In conclusion, PMMA-VSUVT and Zeonor 1060R are both suitable for highly sensitive microarray hybridizations. PMID:18784318
Burgarella, Sarah; Cattaneo, Dario; Masseroli, Marco
2006-01-01
We developed MicroGen, a multi-database Web based system for managing all the information characterizing spotted microarray experiments. It supports information gathering and storing according to the Minimum Information About Microarray Experiments (MIAME) standard. It also allows easy sharing of information and data among all multidisciplinary actors involved in spotted microarray experiments. PMID:17238488
2006-04-27
polysaccharide microarray platform was prepared by immobilizing Burkholderia pseudomallei and Burkholderia mallei polysaccharides . This... polysaccharide array was tested with success for detecting B. pseudomallei and B. mallei serum (human and animal) antibodies. The advantages of this microarray... Polysaccharide microarrays; Burkholderia pseudomallei; Burkholderia mallei; Glanders; Melioidosis1. Introduction There has been a great deal of emphasis on the
Microarray-integrated optoelectrofluidic immunoassay system
Han, Dongsik
2016-01-01
A microarray-based analytical platform has been utilized as a powerful tool in biological assay fields. However, an analyte depletion problem due to the slow mass transport based on molecular diffusion causes low reaction efficiency, resulting in a limitation for practical applications. This paper presents a novel method to improve the efficiency of microarray-based immunoassay via an optically induced electrokinetic phenomenon by integrating an optoelectrofluidic device with a conventional glass slide-based microarray format. A sample droplet was loaded between the microarray slide and the optoelectrofluidic device on which a photoconductive layer was deposited. Under the application of an AC voltage, optically induced AC electroosmotic flows caused by a microarray-patterned light actively enhanced the mass transport of target molecules at the multiple assay spots of the microarray simultaneously, which reduced tedious reaction time from more than 30 min to 10 min. Based on this enhancing effect, a heterogeneous immunoassay with a tiny volume of sample (5 μl) was successfully performed in the microarray-integrated optoelectrofluidic system using immunoglobulin G (IgG) and anti-IgG, resulting in improved efficiency compared to the static environment. Furthermore, the application of multiplex assays was also demonstrated by multiple protein detection. PMID:27190571
Microarray-integrated optoelectrofluidic immunoassay system.
Han, Dongsik; Park, Je-Kyun
2016-05-01
A microarray-based analytical platform has been utilized as a powerful tool in biological assay fields. However, an analyte depletion problem due to the slow mass transport based on molecular diffusion causes low reaction efficiency, resulting in a limitation for practical applications. This paper presents a novel method to improve the efficiency of microarray-based immunoassay via an optically induced electrokinetic phenomenon by integrating an optoelectrofluidic device with a conventional glass slide-based microarray format. A sample droplet was loaded between the microarray slide and the optoelectrofluidic device on which a photoconductive layer was deposited. Under the application of an AC voltage, optically induced AC electroosmotic flows caused by a microarray-patterned light actively enhanced the mass transport of target molecules at the multiple assay spots of the microarray simultaneously, which reduced tedious reaction time from more than 30 min to 10 min. Based on this enhancing effect, a heterogeneous immunoassay with a tiny volume of sample (5 μl) was successfully performed in the microarray-integrated optoelectrofluidic system using immunoglobulin G (IgG) and anti-IgG, resulting in improved efficiency compared to the static environment. Furthermore, the application of multiplex assays was also demonstrated by multiple protein detection.
Advances in cell-free protein array methods.
Yu, Xiaobo; Petritis, Brianne; Duan, Hu; Xu, Danke; LaBaer, Joshua
2018-01-01
Cell-free protein microarrays represent a special form of protein microarray which display proteins made fresh at the time of the experiment, avoiding storage and denaturation. They have been used increasingly in basic and translational research over the past decade to study protein-protein interactions, the pathogen-host relationship, post-translational modifications, and antibody biomarkers of different human diseases. Their role in the first blood-based diagnostic test for early stage breast cancer highlights their value in managing human health. Cell-free protein microarrays will continue to evolve to become widespread tools for research and clinical management. Areas covered: We review the advantages and disadvantages of different cell-free protein arrays, with an emphasis on the methods that have been studied in the last five years. We also discuss the applications of each microarray method. Expert commentary: Given the growing roles and impact of cell-free protein microarrays in research and medicine, we discuss: 1) the current technical and practical limitations of cell-free protein microarrays; 2) the biomarker discovery and verification pipeline using protein microarrays; and 3) how cell-free protein microarrays will advance over the next five years, both in their technology and applications.
Interim report on updated microarray probes for the LLNL Burkholderia pseudomallei SNP array
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, S; Jaing, C
2012-03-27
The overall goal of this project is to forensically characterize 100 unknown Burkholderia isolates in the US-Australia collaboration. We will identify genome-wide single nucleotide polymorphisms (SNPs) from B. pseudomallei and near neighbor species including B. mallei, B. thailandensis and B. oklahomensis. We will design microarray probes to detect these SNP markers and analyze 100 Burkholderia genomic DNAs extracted from environmental, clinical and near neighbor isolates from Australian collaborators on the Burkholderia SNP microarray. We will analyze the microarray genotyping results to characterize the genetic diversity of these new isolates and triage the samples for whole genome sequencing. In this interimmore » report, we described the SNP analysis and the microarray probe design for the Burkholderia SNP microarray.« less
Expression and prognostic value of miR-92a in patients with gastric cancer.
Ren, Chuanli; Wang, Wenshu; Han, Chongxu; Chen, Hui; Fu, Deyuan; Luo, Yulin; Yao, Hanyu; Wang, Daxin; Ma, Li; Zhou, Lin; Han, Dongsheng; Shen, Ming
2016-07-01
MicroRNA (miR)-92 expression is often aberrant in human cancers. However, its expression in gastric carcinoma and its relation to clinicopathological features and prognosis are unclear.Tissue microarrays were constructed from 180 patients with gastric cancer (GC), who were undergoing radical resection. MiR-92a expression was detected using miRNA-locked nucleic acid in situ hybridization, and its correlation with clinicopathological features and overall survival was analyzed. MiR-92a expression was decreased in 13.9 % (25/180) of GC, increased in 81.1 % (146/180), and unchanged in 5.0 % (9/180), compared with paracancerous normal tissue (P < 0.001). Univariate analysis showed that high miR-92a expression, tumor stage, tumor status, node status, and tumor size were significant negative prognostic predictors for overall survival in patients with GC (P < 0.001, P < 0.001, P = 0.008, P < 0.001, and P = 0.001, respectively). High miR-92a expression still remained a significant predictor of shorter survival in stage II (n = 56, P = 0.001) and stage III (n = 92, P = 0.009) GC. Multivariate regression analysis demonstrated that tumor status (hazard ratio [HR], 3.10; 95 % confidence interval [CI], 1.51-6.37; P = 0.002), stage (HR, 3.54; 95 % CI, 1.65-7.63; P = 0.000), lymph node metastasis (HR, 2.83; 95 % CI, 1.88-4.28; P = 0.000), high expression of miR-92a (HR, 2.94; 95 % CI, 2.01-4.31; P = 0.000), and tumor size (HR, 2.34; 95 % CI, 1.45-3.79; P = 0.002) predicted shorter OS.High expression of miR-92a compared with adjacent normal tissues was associated with shorter OS. MiR-92a may thus be useful for evaluating prognosis and may provide a novel treatment target in patients with GC.
Sharma, Nirmala; Anderson, Maureen; Kumar, Arvind; Zhang, Yan; Giblin, E Michael; Abrams, Suzanne R; Zaharia, L Irina; Taylor, David C; Fobert, Pierre R
2008-12-19
Seed oil accumulates primarily as triacylglycerol (TAG). While the biochemical pathway for TAG biosynthesis is known, its regulation remains unclear. Previous research identified microsomal diacylglycerol acyltransferase 1 (DGAT1, EC 2.3.1.20) as controlling a rate-limiting step in the TAG biosynthesis pathway. Of note, overexpression of DGAT1 results in substantial increases in oil content and seed size. To further analyze the global consequences of manipulating DGAT1 levels during seed development, a concerted transcriptome and metabolome analysis of transgenic B. napus prototypes was performed. Using a targeted Brassica cDNA microarray, about 200 genes were differentially expressed in two independent transgenic lines analyzed. Interestingly, 24-33% of the targets showing significant changes have no matching gene in Arabidopsis although these represent only 5% of the targets on the microarray. Further analysis of some of these novel transcripts indicated that several are inducible by ABA in microspore-derived embryos. Of the 200 Arabidopsis genes implicated in lipid biology present on the microarray, 36 were found to be differentially regulated in DGAT transgenic lines. Furthermore, kinetic reverse transcriptase Polymerase Chain Reaction (k-PCR) analysis revealed up-regulation of genes encoding enzymes of the Kennedy pathway involved in assembly of TAGs. Hormone profiling indicated that levels of auxins and cytokinins varied between transgenic lines and untransformed controls, while differences in the pool sizes of ABA and catabolites were only observed at later stages of development. Our results indicate that the increased TAG accumulation observed in transgenic DGAT1 plants is associated with modest transcriptional and hormonal changes during seed development that are not limited to the TAG biosynthesis pathway. These might be associated with feedback or feed-forward effects due to altered levels of DGAT1 activity. The fact that a large fraction of significant amplicons have no matching genes in Arabidopsis compromised our ability to draw concrete inferences from the data at this stage, but has led to the identification of novel genes of potential interest.
Bermudo, R; Abia, D; Mozos, A; García-Cruz, E; Alcaraz, A; Ortiz, Á R; Thomson, T M; Fernández, P L
2011-01-01
Introduction: Currently, final diagnosis of prostate cancer (PCa) is based on histopathological analysis of needle biopsies, but this process often bears uncertainties due to small sample size, tumour focality and pathologist's subjective assessment. Methods: Prostate cancer diagnostic signatures were generated by applying linear discriminant analysis to microarray and real-time RT–PCR (qRT–PCR) data from normal and tumoural prostate tissue samples. Additionally, after removal of biopsy tissues, material washed off from transrectal biopsy needles was used for molecular profiling and discriminant analysis. Results: Linear discriminant analysis applied to microarray data for a set of 318 genes differentially expressed between non-tumoural and tumoural prostate samples produced 26 gene signatures, which classified the 84 samples used with 100% accuracy. To identify signatures potentially useful for the diagnosis of prostate biopsies, surplus material washed off from routine biopsy needles from 53 patients was used to generate qRT–PCR data for a subset of 11 genes. This analysis identified a six-gene signature that correctly assigned the biopsies as benign or tumoural in 92.6% of the cases, with 88.8% sensitivity and 96.1% specificity. Conclusion: Surplus material from prostate needle biopsies can be used for minimal-size gene signature analysis for sensitive and accurate discrimination between non-tumoural and tumoural prostates, without interference with current diagnostic procedures. This approach could be a useful adjunct to current procedures in PCa diagnosis. PMID:22009027
2010-01-01
Background The development of DNA microarrays has facilitated the generation of hundreds of thousands of transcriptomic datasets. The use of a common reference microarray design allows existing transcriptomic data to be readily compared and re-analysed in the light of new data, and the combination of this design with large datasets is ideal for 'systems'-level analyses. One issue is that these datasets are typically collected over many years and may be heterogeneous in nature, containing different microarray file formats and gene array layouts, dye-swaps, and showing varying scales of log2- ratios of expression between microarrays. Excellent software exists for the normalisation and analysis of microarray data but many data have yet to be analysed as existing methods struggle with heterogeneous datasets; options include normalising microarrays on an individual or experimental group basis. Our solution was to develop the Batch Anti-Banana Algorithm in R (BABAR) algorithm and software package which uses cyclic loess to normalise across the complete dataset. We have already used BABAR to analyse the function of Salmonella genes involved in the process of infection of mammalian cells. Results The only input required by BABAR is unprocessed GenePix or BlueFuse microarray data files. BABAR provides a combination of 'within' and 'between' microarray normalisation steps and diagnostic boxplots. When applied to a real heterogeneous dataset, BABAR normalised the dataset to produce a comparable scaling between the microarrays, with the microarray data in excellent agreement with RT-PCR analysis. When applied to a real non-heterogeneous dataset and a simulated dataset, BABAR's performance in identifying differentially expressed genes showed some benefits over standard techniques. Conclusions BABAR is an easy-to-use software tool, simplifying the simultaneous normalisation of heterogeneous two-colour common reference design cDNA microarray-based transcriptomic datasets. We show BABAR transforms real and simulated datasets to allow for the correct interpretation of these data, and is the ideal tool to facilitate the identification of differentially expressed genes or network inference analysis from transcriptomic datasets. PMID:20128918
Speeding up the Consensus Clustering methodology for microarray data analysis
2011-01-01
Background The inference of the number of clusters in a dataset, a fundamental problem in Statistics, Data Analysis and Classification, is usually addressed via internal validation measures. The stated problem is quite difficult, in particular for microarrays, since the inferred prediction must be sensible enough to capture the inherent biological structure in a dataset, e.g., functionally related genes. Despite the rich literature present in that area, the identification of an internal validation measure that is both fast and precise has proved to be elusive. In order to partially fill this gap, we propose a speed-up of Consensus (Consensus Clustering), a methodology whose purpose is the provision of a prediction of the number of clusters in a dataset, together with a dissimilarity matrix (the consensus matrix) that can be used by clustering algorithms. As detailed in the remainder of the paper, Consensus is a natural candidate for a speed-up. Results Since the time-precision performance of Consensus depends on two parameters, our first task is to show that a simple adjustment of the parameters is not enough to obtain a good precision-time trade-off. Our second task is to provide a fast approximation algorithm for Consensus. That is, the closely related algorithm FC (Fast Consensus) that would have the same precision as Consensus with a substantially better time performance. The performance of FC has been assessed via extensive experiments on twelve benchmark datasets that summarize key features of microarray applications, such as cancer studies, gene expression with up and down patterns, and a full spectrum of dimensionality up to over a thousand. Based on their outcome, compared with previous benchmarking results available in the literature, FC turns out to be among the fastest internal validation methods, while retaining the same outstanding precision of Consensus. Moreover, it also provides a consensus matrix that can be used as a dissimilarity matrix, guaranteeing the same performance as the corresponding matrix produced by Consensus. We have also experimented with the use of Consensus and FC in conjunction with NMF (Nonnegative Matrix Factorization), in order to identify the correct number of clusters in a dataset. Although NMF is an increasingly popular technique for biological data mining, our results are somewhat disappointing and complement quite well the state of the art about NMF, shedding further light on its merits and limitations. Conclusions In summary, FC with a parameter setting that makes it robust with respect to small and medium-sized datasets, i.e, number of items to cluster in the hundreds and number of conditions up to a thousand, seems to be the internal validation measure of choice. Moreover, the technique we have developed here can be used in other contexts, in particular for the speed-up of stability-based validation measures. PMID:21235792
Li, Hua; Liu, Jia-Jia; Deng, Mei; Guo, Li; Cheng, Ying; Song, Yuan-Zong
2017-10-01
Alagille syndrome (ALGS) is an autosomal dominant disease affecting multiple systems including the liver, heart, skeleton, eyes, kidneys and face. This paper reports the clinical and genetic features of an infant with this disease. A 3-month-and-10-day-old female infant was referred to the hospital with jaundiced skin and sclera for 3 months. Physical examination revealed wide forehead and micromandible. A systolic murmur of grade 3-4/6 was heard between the 2th and 3th intercostal spaces on the left side of the sternum. The abdomen was distended, and the liver palpable 3 cm under the right subcostal margin with a medium texture. Serum biochemistry analysis revealed abnormal liver function indices, with markedly elevated bilirubin (predominantly direct bilirubin), total bile acids (TBA) and gamma-glutamyl transpeptidase (GGT). Atrial septal defect and pulmonary stenosis were detected on echocardiography. Next generation sequencing detected entire deletion of the JAG1 gene, and then chromosomal microarray analysis revealed a novel interstitial deletion of 3.0 Mb in size on chr20p12.3p12.2, involving JAG1 gene. The child had special facial features, heart malformations, and cholestasis, and based on the genetic findings, ALGS was definitively diagnosed. Thereafter, symptomatic and supportive treatment was introduced. Thus far, the infant had been followed up till his age of 11 months. The hyperbilirubinemia got improved, but GGT and TBA were persistently elevated, and the long-term outcome needs to be observed. This study extended the JAG1 mutation spectrum, and provided laboratory evidences for the diagnosis and treatment of the patient, and for the genetic counseling and prenatal diagnosis in the family.
A genome-wide 20 K citrus microarray for gene expression analysis
Martinez-Godoy, M Angeles; Mauri, Nuria; Juarez, Jose; Marques, M Carmen; Santiago, Julia; Forment, Javier; Gadea, Jose
2008-01-01
Background Understanding of genetic elements that contribute to key aspects of citrus biology will impact future improvements in this economically important crop. Global gene expression analysis demands microarray platforms with a high genome coverage. In the last years, genome-wide EST collections have been generated in citrus, opening the possibility to create new tools for functional genomics in this crop plant. Results We have designed and constructed a publicly available genome-wide cDNA microarray that include 21,081 putative unigenes of citrus. As a functional companion to the microarray, a web-browsable database [1] was created and populated with information about the unigenes represented in the microarray, including cDNA libraries, isolated clones, raw and processed nucleotide and protein sequences, and results of all the structural and functional annotation of the unigenes, like general description, BLAST hits, putative Arabidopsis orthologs, microsatellites, putative SNPs, GO classification and PFAM domains. We have performed a Gene Ontology comparison with the full set of Arabidopsis proteins to estimate the genome coverage of the microarray. We have also performed microarray hybridizations to check its usability. Conclusion This new cDNA microarray replaces the first 7K microarray generated two years ago and allows gene expression analysis at a more global scale. We have followed a rational design to minimize cross-hybridization while maintaining its utility for different citrus species. Furthermore, we also provide access to a website with full structural and functional annotation of the unigenes represented in the microarray, along with the ability to use this site to directly perform gene expression analysis using standard tools at different publicly available servers. Furthermore, we show how this microarray offers a good representation of the citrus genome and present the usefulness of this genomic tool for global studies in citrus by using it to catalogue genes expressed in citrus globular embryos. PMID:18598343
An evaluation of two-channel ChIP-on-chip and DNA methylation microarray normalization strategies
2012-01-01
Background The combination of chromatin immunoprecipitation with two-channel microarray technology enables genome-wide mapping of binding sites of DNA-interacting proteins (ChIP-on-chip) or sites with methylated CpG di-nucleotides (DNA methylation microarray). These powerful tools are the gateway to understanding gene transcription regulation. Since the goals of such studies, the sample preparation procedures, the microarray content and study design are all different from transcriptomics microarrays, the data pre-processing strategies traditionally applied to transcriptomics microarrays may not be appropriate. Particularly, the main challenge of the normalization of "regulation microarrays" is (i) to make the data of individual microarrays quantitatively comparable and (ii) to keep the signals of the enriched probes, representing DNA sequences from the precipitate, as distinguishable as possible from the signals of the un-enriched probes, representing DNA sequences largely absent from the precipitate. Results We compare several widely used normalization approaches (VSN, LOWESS, quantile, T-quantile, Tukey's biweight scaling, Peng's method) applied to a selection of regulation microarray datasets, ranging from DNA methylation to transcription factor binding and histone modification studies. Through comparison of the data distributions of control probes and gene promoter probes before and after normalization, and assessment of the power to identify known enriched genomic regions after normalization, we demonstrate that there are clear differences in performance between normalization procedures. Conclusion T-quantile normalization applied separately on the channels and Tukey's biweight scaling outperform other methods in terms of the conservation of enriched and un-enriched signal separation, as well as in identification of genomic regions known to be enriched. T-quantile normalization is preferable as it additionally improves comparability between microarrays. In contrast, popular normalization approaches like quantile, LOWESS, Peng's method and VSN normalization alter the data distributions of regulation microarrays to such an extent that using these approaches will impact the reliability of the downstream analysis substantially. PMID:22276688
Rai, Muhammad Farooq; Tycksen, Eric D; Sandell, Linda J; Brophy, Robert H
2018-01-01
Microarrays and RNA-seq are at the forefront of high throughput transcriptome analyses. Since these methodologies are based on different principles, there are concerns about the concordance of data between the two techniques. The concordance of RNA-seq and microarrays for genome-wide analysis of differential gene expression has not been rigorously assessed in clinically derived ligament tissues. To demonstrate the concordance between RNA-seq and microarrays and to assess potential benefits of RNA-seq over microarrays, we assessed differences in transcript expression in anterior cruciate ligament (ACL) tissues based on time-from-injury. ACL remnants were collected from patients with an ACL tear at the time of ACL reconstruction. RNA prepared from torn ACL remnants was subjected to Agilent microarrays (N = 24) and RNA-seq (N = 8). The correlation of biological replicates in RNA-seq and microarrays data was similar (0.98 vs. 0.97), demonstrating that each platform has high internal reproducibility. Correlations between the RNA-seq data and the individual microarrays were low, but correlations between the RNA-seq values and the geometric mean of the microarrays values were moderate. The cross-platform concordance for differentially expressed transcripts or enriched pathways was linearly correlated (r = 0.64). RNA-Seq was superior in detecting low abundance transcripts and differentiating biologically critical isoforms. Additional independent validation of transcript expression was undertaken using microfluidic PCR for selected genes. PCR data showed 100% concordance (in expression pattern) with RNA-seq and microarrays data. These findings demonstrate that RNA-seq has advantages over microarrays for transcriptome profiling of ligament tissues when available and affordable. Furthermore, these findings are likely transferable to other musculoskeletal tissues where tissue collection is challenging and cells are in low abundance. © 2017 Orthopaedic Research Society. Published by Wiley Periodicals, Inc. J Orthop Res 36:484-497, 2018. © 2017 Orthopaedic Research Society. Published by Wiley Periodicals, Inc.
Pan, Deyun; Sun, Ning; Cheung, Kei-Hoi; Guan, Zhong; Ma, Ligeng; Holford, Matthew; Deng, Xingwang; Zhao, Hongyu
2003-11-07
To date, many genomic and pathway-related tools and databases have been developed to analyze microarray data. In published web-based applications to date, however, complex pathways have been displayed with static image files that may not be up-to-date or are time-consuming to rebuild. In addition, gene expression analyses focus on individual probes and genes with little or no consideration of pathways. These approaches reveal little information about pathways that are key to a full understanding of the building blocks of biological systems. Therefore, there is a need to provide useful tools that can generate pathways without manually building images and allow gene expression data to be integrated and analyzed at pathway levels for such experimental organisms as Arabidopsis. We have developed PathMAPA, a web-based application written in Java that can be easily accessed over the Internet. An Oracle database is used to store, query, and manipulate the large amounts of data that are involved. PathMAPA allows its users to (i) upload and populate microarray data into a database; (ii) integrate gene expression with enzymes of the pathways; (iii) generate pathway diagrams without building image files manually; (iv) visualize gene expressions for each pathway at enzyme, locus, and probe levels; and (v) perform statistical tests at pathway, enzyme and gene levels. PathMAPA can be used to examine Arabidopsis thaliana gene expression patterns associated with metabolic pathways. PathMAPA provides two unique features for the gene expression analysis of Arabidopsis thaliana: (i) automatic generation of pathways associated with gene expression and (ii) statistical tests at pathway level. The first feature allows for the periodical updating of genomic data for pathways, while the second feature can provide insight into how treatments affect relevant pathways for the selected experiment(s).
Pan, Deyun; Sun, Ning; Cheung, Kei-Hoi; Guan, Zhong; Ma, Ligeng; Holford, Matthew; Deng, Xingwang; Zhao, Hongyu
2003-01-01
Background To date, many genomic and pathway-related tools and databases have been developed to analyze microarray data. In published web-based applications to date, however, complex pathways have been displayed with static image files that may not be up-to-date or are time-consuming to rebuild. In addition, gene expression analyses focus on individual probes and genes with little or no consideration of pathways. These approaches reveal little information about pathways that are key to a full understanding of the building blocks of biological systems. Therefore, there is a need to provide useful tools that can generate pathways without manually building images and allow gene expression data to be integrated and analyzed at pathway levels for such experimental organisms as Arabidopsis. Results We have developed PathMAPA, a web-based application written in Java that can be easily accessed over the Internet. An Oracle database is used to store, query, and manipulate the large amounts of data that are involved. PathMAPA allows its users to (i) upload and populate microarray data into a database; (ii) integrate gene expression with enzymes of the pathways; (iii) generate pathway diagrams without building image files manually; (iv) visualize gene expressions for each pathway at enzyme, locus, and probe levels; and (v) perform statistical tests at pathway, enzyme and gene levels. PathMAPA can be used to examine Arabidopsis thaliana gene expression patterns associated with metabolic pathways. Conclusion PathMAPA provides two unique features for the gene expression analysis of Arabidopsis thaliana: (i) automatic generation of pathways associated with gene expression and (ii) statistical tests at pathway level. The first feature allows for the periodical updating of genomic data for pathways, while the second feature can provide insight into how treatments affect relevant pathways for the selected experiment(s). PMID:14604444
Matsudate, Yoshihiro; Naruto, Takuya; Hayashi, Yumiko; Minami, Mitsuyoshi; Tohyama, Mikiko; Yokota, Kenji; Yamada, Daisuke; Imoto, Issei; Kubo, Yoshiaki
2017-06-01
Nevoid basal cell carcinoma syndrome (NBCCS) is an autosomal dominant disorder mainly caused by heterozygous mutations of PTCH1. In addition to characteristic clinical features, detection of a mutation in causative genes is reliable for the diagnosis of NBCCS; however, no mutations have been identified in some patients using conventional methods. To improve the method for the molecular diagnosis of NBCCS. We performed targeted exome sequencing (TES) analysis using a multi-gene panel, including PTCH1, PTCH2, SUFU, and other sonic hedgehog signaling pathway-related genes, based on next-generation sequencing (NGS) technology in 8 cases in whom possible causative mutations were not detected by previously performed conventional analysis and 2 recent cases of NBCCS. Subsequent analysis of gross deletion within or around PTCH1 detected by TES was performed using chromosomal microarray (CMA). Through TES analysis, specific single nucleotide variants or small indels of PTCH1 causing inferred amino acid changes were identified in 2 novel cases and 2 undiagnosed cases, whereas gross deletions within or around PTCH1, which are validated by CMA, were found in 3 undiagnosed cases. However, no mutations were detected even by TES in 3 cases. Among 3 cases with gross deletions of PTCH1, deletions containing the entire PTCH1 and additional neighboring genes were detected in 2 cases, one of which exhibited atypical clinical features, such as severe mental retardation, likely associated with genes located within the 4.3Mb deleted region, especially. TES-based simultaneous evaluation of sequences and copy number status in all targeted coding exons by NGS is likely to be more useful for the molecular diagnosis of NBCCS than conventional methods. CMA is recommended as a subsequent analysis for validation and detailed mapping of deleted regions, which may explain the atypical clinical features of NBCCS cases. Copyright © 2017 Japanese Society for Investigative Dermatology. Published by Elsevier B.V. All rights reserved.
Allantaz-Frager, Florence; Turrel-Davin, Fanny; Venet, Fabienne; Monnin, Cécile; De Saint Jean, Amélie; Barbalat, Véronique; Cerrato, Elisabeth; Pachot, Alexandre; Lepape, Alain; Monneret, Guillaume
2013-01-01
The rapid development in septic patients of features of marked immunosuppression associated with increased risk of nosocomial infections and mortality represents the rational for the initiation of immune targeted treatments in sepsis. However, as there is no clinical sign of immune dysfunctions, the current challenge is to develop biomarkers that will help clinicians identify the patients that would benefit from immunotherapy and monitor its efficacy. Using an in vitro model of endotoxin tolerance (ET), a pivotal feature of sepsis-induced immunosuppression in monocytes, we identified using gene expression profiling by microarray a panel of transcripts associated with the development of ET which expression was restored after immunostimulation with interferon-gamma (IFN-γ). These results were confirmed by qRT-PCR. Importantly, this short-list of markers was further evaluated in patients. Of these transcripts, six (TNFAIP6, FCN1, CXCL10, GBP1, CXCL5 and PID1) were differentially expressed in septic patients’ blood compared to healthy blood upon ex vivo LPS stimulation and were restored by IFN-γ. In this study, by combining a microarray approach in an in vitro model and a validation in clinical samples, we identified a panel of six new transcripts that could be used for the identification of septic patients eligible for IFNg therapy. Along with the previously identified markers TNFa, IL10 and HLA-DRA, the potential value of these markers should now be evaluated in a larger cohort of patients. Upon favorable results, they could serve as stratification tools prior to immunostimulatory treatment and to monitor drug efficacy. PMID:23874546
Allantaz-Frager, Florence; Turrel-Davin, Fanny; Venet, Fabienne; Monnin, Cécile; De Saint Jean, Amélie; Barbalat, Véronique; Cerrato, Elisabeth; Pachot, Alexandre; Lepape, Alain; Monneret, Guillaume
2013-01-01
The rapid development in septic patients of features of marked immunosuppression associated with increased risk of nosocomial infections and mortality represents the rational for the initiation of immune targeted treatments in sepsis. However, as there is no clinical sign of immune dysfunctions, the current challenge is to develop biomarkers that will help clinicians identify the patients that would benefit from immunotherapy and monitor its efficacy. Using an in vitro model of endotoxin tolerance (ET), a pivotal feature of sepsis-induced immunosuppression in monocytes, we identified using gene expression profiling by microarray a panel of transcripts associated with the development of ET which expression was restored after immunostimulation with interferon-gamma (IFN-γ). These results were confirmed by qRT-PCR. Importantly, this short-list of markers was further evaluated in patients. Of these transcripts, six (TNFAIP6, FCN1, CXCL10, GBP1, CXCL5 and PID1) were differentially expressed in septic patients' blood compared to healthy blood upon ex vivo LPS stimulation and were restored by IFN-γ. In this study, by combining a microarray approach in an in vitro model and a validation in clinical samples, we identified a panel of six new transcripts that could be used for the identification of septic patients eligible for IFNg therapy. Along with the previously identified markers TNFa, IL10 and HLA-DRA, the potential value of these markers should now be evaluated in a larger cohort of patients. Upon favorable results, they could serve as stratification tools prior to immunostimulatory treatment and to monitor drug efficacy.
Multiplex infectious disease microarrays: STAT serology on a drop of blood
NASA Astrophysics Data System (ADS)
Ewart, Tom; Tarnopolsky, Mark; Baker, Steve; Raha, Sandeep; Wong, Yuen-Yee; Ciebiera, Kathy
2009-06-01
New and resurgent viral and antibiotic-resistant bacterial diseases are being encountered worldwide. The US CDC now ranks hospital acquired infections among the top 10 leading causes of death in the US, costing $20 billion annually. Such nosocomial infections presently affect 5% - 10% of hospitalized patients leading to 2 million cases and 99,000 deaths annually. Until now, assays available to mount comprehensive surveillance of infectious disease exposure by biosecurity agencies and hospital infection control units have been too slow and too costly. In earlier clinical studies we have reported proteomic microarrays combining 13 autoimmune and 26 viral and bacterial pathogens that revealed correlations between autoimmune diseases and antecedent infections. In this work we have expanded the array to 40 viruses and bacteria and investigated a suspected role of human endogenous retroviruses in autoimmune neuropathies. Using scanning laser imaging, and fluorescence color multiplexing, serum IgG and IgM responses are measured concurrently on the same array, for 14 arrays (patient samples) per microscope slide in 15 minutes. Other advantages include internal calibration, 10 μL sample size, increased laboratory efficiency, and potential factor of 100 cost reduction.
Lin, Yii-Lih; Huang, Yen-Jun; Teerapanich, Pattamon; Leïchlé, Thierry
2016-01-01
Nanofluidic devices promise high reaction efficiency and fast kinetic responses due to the spatial constriction of transported biomolecules with confined molecular diffusion. However, parallel detection of multiple biomolecules, particularly proteins, in highly confined space remains challenging. This study integrates extended nanofluidics with embedded protein microarray to achieve multiplexed real-time biosensing and kinetics monitoring. Implementation of embedded standard-sized antibody microarray is attained by epoxy-silane surface modification and a room-temperature low-aspect-ratio bonding technique. An effective sample transport is achieved by electrokinetic pumping via electroosmotic flow. Through the nanoslit-based spatial confinement, the antigen-antibody binding reaction is enhanced with ∼100% efficiency and may be directly observed with fluorescence microscopy without the requirement of intermediate washing steps. The image-based data provide numerous spatially distributed reaction kinetic curves and are collectively modeled using a simple one-dimensional convection-reaction model. This study represents an integrated nanofluidic solution for real-time multiplexed immunosensing and kinetics monitoring, starting from device fabrication, protein immobilization, device bonding, sample transport, to data analysis at Péclet number less than 1. PMID:27375819
Rezk, Amgad R; Ramesan, Shwathy; Yeo, Leslie Y
2018-01-30
The microarray titre plate remains a fundamental workhorse in genomic, proteomic and cellomic analyses that underpin the drug discovery process. Nevertheless, liquid handling technologies for sample dispensing, processing and transfer have not progressed significantly beyond conventional robotic micropipetting techniques, which are not only at their fundamental sample size limit, but are also prone to mechanical failure and contamination. This is because alternative technologies to date suffer from a number of constraints, mainly their limitation to carry out only a single liquid operation such as dispensing or mixing at a given time, and their inability to address individual wells, particularly at high throughput. Here, we demonstrate the possibility for true sequential or simultaneous single- and multi-well addressability in a 96-well plate using a reconfigurable modular platform from which MHz-order hybrid surface and bulk acoustic waves can be coupled to drive a variety of microfluidic modes including mixing, sample preconcentration and droplet jetting/ejection in individual or multiple wells on demand, thus constituting a highly versatile yet simple setup capable of improving the functionality of existing laboratory protocols and processes.
Li, Ping; Dou, Xiaoqiu; Müller, Mareike; Feng, Chuanliang; Chang, Matthew Wook; Frettlöh, Martin; Schönherr, Holger
2017-11-01
A generally applicable strategy to obtain mechanically robust hydrogels for the incorporation and containment of functional reporter bacteria for the microarray and microparticle-based detection and signaling of N-acyl homoserine lactone autoinducers (3OC 12 HSL) at relevant concentrations is reported. For reinforcing hydrogels of 1,4-bi(phenylalanine-diglycol)-benzene (PDB), a hybrid hydrogel is formed by the combination of PDB self-assembly with Ca 2+ mediated alginate crosslinking. The different assembly mechanisms are shown not to interfere with each other and despite the more than four-fold increased moduli of the hydrogels, diffusion of autoinducers into the gels remains efficient and Escherichia coli pLuxR-green fluorescent protein (GFP) reporter bacteria are proliferating. Templating affords reporter bacteria-loaded hydrogels with controllable shape and size. Upon exposure to 3OC 12 HSL, the embedded bacteria exhibit an up to 12 ± 3 times increase in fluorescence intensity due to autoinducer-triggered GFP expression. This approach can serve as a potentially generally applicable strategy to sensitively detect bacteria via their secreted autoinducers. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Wright, George W; Simon, Richard M
2003-12-12
Microarray techniques provide a valuable way of characterizing the molecular nature of disease. Unfortunately expense and limited specimen availability often lead to studies with small sample sizes. This makes accurate estimation of variability difficult, since variance estimates made on a gene by gene basis will have few degrees of freedom, and the assumption that all genes share equal variance is unlikely to be true. We propose a model by which the within gene variances are drawn from an inverse gamma distribution, whose parameters are estimated across all genes. This results in a test statistic that is a minor variation of those used in standard linear models. We demonstrate that the model assumptions are valid on experimental data, and that the model has more power than standard tests to pick up large changes in expression, while not increasing the rate of false positives. This method is incorporated into BRB-ArrayTools version 3.0 (http://linus.nci.nih.gov/BRB-ArrayTools.html). ftp://linus.nci.nih.gov/pub/techreport/RVM_supplement.pdf
The effect of column purification on cDNA indirect labelling for microarrays
Molas, M Lia; Kiss, John Z
2007-01-01
Background The success of the microarray reproducibility is dependent upon the performance of standardized procedures. Since the introduction of microarray technology for the analysis of global gene expression, reproducibility of results among different laboratories has been a major problem. Two of the main contributors to this variability are the use of different microarray platforms and different laboratory practices. In this paper, we address the latter question in terms of how variation in one of the steps of a labelling procedure affects the cDNA product prior to microarray hybridization. Results We used a standard procedure to label cDNA for microarray hybridization and employed different types of column chromatography for cDNA purification. After purifying labelled cDNA, we used the Agilent 2100 Bioanalyzer and agarose gel electrophoresis to assess the quality of the labelled cDNA before its hybridization onto a microarray platform. There were major differences in the cDNA profile (i.e. cDNA fragment lengths and abundance) as a result of using four different columns for purification. In addition, different columns have different efficiencies to remove rRNA contamination. This study indicates that the appropriate column to use in this type of protocol has to be experimentally determined. Finally, we present new evidence establishing the importance of testing the method of purification used during an indirect labelling procedure. Our results confirm the importance of assessing the quality of the sample in the labelling procedure prior to hybridization onto a microarray platform. Conclusion Standardization of column purification systems to be used in labelling procedures will improve the reproducibility of microarray results among different laboratories. In addition, implementation of a quality control check point of the labelled samples prior to microarray hybridization will prevent hybridizing a poor quality sample to expensive micorarrays. PMID:17597522
The effect of column purification on cDNA indirect labelling for microarrays.
Molas, M Lia; Kiss, John Z
2007-06-27
The success of the microarray reproducibility is dependent upon the performance of standardized procedures. Since the introduction of microarray technology for the analysis of global gene expression, reproducibility of results among different laboratories has been a major problem. Two of the main contributors to this variability are the use of different microarray platforms and different laboratory practices. In this paper, we address the latter question in terms of how variation in one of the steps of a labelling procedure affects the cDNA product prior to microarray hybridization. We used a standard procedure to label cDNA for microarray hybridization and employed different types of column chromatography for cDNA purification. After purifying labelled cDNA, we used the Agilent 2100 Bioanalyzer and agarose gel electrophoresis to assess the quality of the labelled cDNA before its hybridization onto a microarray platform. There were major differences in the cDNA profile (i.e. cDNA fragment lengths and abundance) as a result of using four different columns for purification. In addition, different columns have different efficiencies to remove rRNA contamination. This study indicates that the appropriate column to use in this type of protocol has to be experimentally determined. Finally, we present new evidence establishing the importance of testing the method of purification used during an indirect labelling procedure. Our results confirm the importance of assessing the quality of the sample in the labelling procedure prior to hybridization onto a microarray platform. Standardization of column purification systems to be used in labelling procedures will improve the reproducibility of microarray results among different laboratories. In addition, implementation of a quality control check point of the labelled samples prior to microarray hybridization will prevent hybridizing a poor quality sample to expensive micorarrays.
McCoy, Gary R; Touzet, Nicolas; Fleming, Gerard T A; Raine, Robin
2015-07-01
The toxic microalgal species Prymnesium parvum and Prymnesium polylepis are responsible for numerous fish kills causing economic stress on the aquaculture industry and, through the consumption of contaminated shellfish, can potentially impact on human health. Monitoring of toxic phytoplankton is traditionally carried out by light microscopy. However, molecular methods of identification and quantification are becoming more common place. This study documents the optimisation of the novel Microarrays for the Detection of Toxic Algae (MIDTAL) microarray from its initial stages to the final commercial version now available from Microbia Environnement (France). Existing oligonucleotide probes used in whole-cell fluorescent in situ hybridisation (FISH) for Prymnesium species from higher group probes to species-level probes were adapted and tested on the first-generation microarray. The combination and interaction of numerous other probes specific for a whole range of phytoplankton taxa also spotted on the chip surface caused high cross reactivity, resulting in false-positive results on the microarray. The probe sequences were extended for the subsequent second-generation microarray, and further adaptations of the hybridisation protocol and incubation temperatures significantly reduced false-positive readings from the first to the second-generation chip, thereby increasing the specificity of the MIDTAL microarray. Additional refinement of the subsequent third-generation microarray protocols with the addition of a poly-T amino linker to the 5' end of each probe further enhanced the microarray performance but also highlighted the importance of optimising RNA labelling efficiency when testing with natural seawater samples from Killary Harbour, Ireland.
The Glycan Microarray Story from Construction to Applications.
Hyun, Ji Young; Pai, Jaeyoung; Shin, Injae
2017-04-18
Not only are glycan-mediated binding processes in cells and organisms essential for a wide range of physiological processes, but they are also implicated in various pathological processes. As a result, elucidation of glycan-associated biomolecular interactions and their consequences is of great importance in basic biological research and biomedical applications. In 2002, we and others were the first to utilize glycan microarrays in efforts aimed at the rapid analysis of glycan-associated recognition events. Because they contain a number of glycans immobilized in a dense and orderly manner on a solid surface, glycan microarrays enable multiple parallel analyses of glycan-protein binding events while utilizing only small amounts of glycan samples. Therefore, this microarray technology has become a leading edge tool in studies aimed at elucidating roles played by glycans and glycan binding proteins in biological systems. In this Account, we summarize our efforts on the construction of glycan microarrays and their applications in studies of glycan-associated interactions. Immobilization strategies of functionalized and unmodified glycans on derivatized glass surfaces are described. Although others have developed immobilization techniques, our efforts have focused on improving the efficiencies and operational simplicity of microarray construction. The microarray-based technology has been most extensively used for rapid analysis of the glycan binding properties of proteins. In addition, glycan microarrays have been employed to determine glycan-protein interactions quantitatively, detect pathogens, and rapidly assess substrate specificities of carbohydrate-processing enzymes. More recently, the microarrays have been employed to identify functional glycans that elicit cell surface lectin-mediated cellular responses. Owing to these efforts, it is now possible to use glycan microarrays to expand the understanding of roles played by glycans and glycan binding proteins in biological systems.
Plant-pathogen interactions: what microarray tells about it?
Lodha, T D; Basak, J
2012-01-01
Plant defense responses are mediated by elementary regulatory proteins that affect expression of thousands of genes. Over the last decade, microarray technology has played a key role in deciphering the underlying networks of gene regulation in plants that lead to a wide variety of defence responses. Microarray is an important tool to quantify and profile the expression of thousands of genes simultaneously, with two main aims: (1) gene discovery and (2) global expression profiling. Several microarray technologies are currently in use; most include a glass slide platform with spotted cDNA or oligonucleotides. Till date, microarray technology has been used in the identification of regulatory genes, end-point defence genes, to understand the signal transduction processes underlying disease resistance and its intimate links to other physiological pathways. Microarray technology can be used for in-depth, simultaneous profiling of host/pathogen genes as the disease progresses from infection to resistance/susceptibility at different developmental stages of the host, which can be done in different environments, for clearer understanding of the processes involved. A thorough knowledge of plant disease resistance using successful combination of microarray and other high throughput techniques, as well as biochemical, genetic, and cell biological experiments is needed for practical application to secure and stabilize yield of many crop plants. This review starts with a brief introduction to microarray technology, followed by the basics of plant-pathogen interaction, the use of DNA microarrays over the last decade to unravel the mysteries of plant-pathogen interaction, and ends with the future prospects of this technology.
Clustering-based spot segmentation of cDNA microarray images.
Uslan, Volkan; Bucak, Ihsan Ömür
2010-01-01
Microarrays are utilized as that they provide useful information about thousands of gene expressions simultaneously. In this study segmentation step of microarray image processing has been implemented. Clustering-based methods, fuzzy c-means and k-means, have been applied for the segmentation step that separates the spots from the background. The experiments show that fuzzy c-means have segmented spots of the microarray image more accurately than the k-means.
A Platform for Combined DNA and Protein Microarrays Based on Total Internal Reflection Fluorescence
Asanov, Alexander; Zepeda, Angélica; Vaca, Luis
2012-01-01
We have developed a novel microarray technology based on total internal reflection fluorescence (TIRF) in combination with DNA and protein bioassays immobilized at the TIRF surface. Unlike conventional microarrays that exhibit reduced signal-to-background ratio, require several stages of incubation, rinsing and stringency control, and measure only end-point results, our TIRF microarray technology provides several orders of magnitude better signal-to-background ratio, performs analysis rapidly in one step, and measures the entire course of association and dissociation kinetics between target DNA and protein molecules and the bioassays. In many practical cases detection of only DNA or protein markers alone does not provide the necessary accuracy for diagnosing a disease or detecting a pathogen. Here we describe TIRF microarrays that detect DNA and protein markers simultaneously, which reduces the probabilities of false responses. Supersensitive and multiplexed TIRF DNA and protein microarray technology may provide a platform for accurate diagnosis or enhanced research studies. Our TIRF microarray system can be mounted on upright or inverted microscopes or interfaced directly with CCD cameras equipped with a single objective, facilitating the development of portable devices. As proof-of-concept we applied TIRF microarrays for detecting molecular markers from Bacillus anthracis, the pathogen responsible for anthrax. PMID:22438738
Motakis, E S; Nason, G P; Fryzlewicz, P; Rutter, G A
2006-10-15
Many standard statistical techniques are effective on data that are normally distributed with constant variance. Microarray data typically violate these assumptions since they come from non-Gaussian distributions with a non-trivial mean-variance relationship. Several methods have been proposed that transform microarray data to stabilize variance and draw its distribution towards the Gaussian. Some methods, such as log or generalized log, rely on an underlying model for the data. Others, such as the spread-versus-level plot, do not. We propose an alternative data-driven multiscale approach, called the Data-Driven Haar-Fisz for microarrays (DDHFm) with replicates. DDHFm has the advantage of being 'distribution-free' in the sense that no parametric model for the underlying microarray data is required to be specified or estimated; hence, DDHFm can be applied very generally, not just to microarray data. DDHFm achieves very good variance stabilization of microarray data with replicates and produces transformed intensities that are approximately normally distributed. Simulation studies show that it performs better than other existing methods. Application of DDHFm to real one-color cDNA data validates these results. The R package of the Data-Driven Haar-Fisz transform (DDHFm) for microarrays is available in Bioconductor and CRAN.
Validation of MIMGO: a method to identify differentially expressed GO terms in a microarray dataset
2012-01-01
Background We previously proposed an algorithm for the identification of GO terms that commonly annotate genes whose expression is upregulated or downregulated in some microarray data compared with in other microarray data. We call these “differentially expressed GO terms” and have named the algorithm “matrix-assisted identification method of differentially expressed GO terms” (MIMGO). MIMGO can also identify microarray data in which genes annotated with a differentially expressed GO term are upregulated or downregulated. However, MIMGO has not yet been validated on a real microarray dataset using all available GO terms. Findings We combined Gene Set Enrichment Analysis (GSEA) with MIMGO to identify differentially expressed GO terms in a yeast cell cycle microarray dataset. GSEA followed by MIMGO (GSEA + MIMGO) correctly identified (p < 0.05) microarray data in which genes annotated to differentially expressed GO terms are upregulated. We found that GSEA + MIMGO was slightly less effective than, or comparable to, GSEA (Pearson), a method that uses Pearson’s correlation as a metric, at detecting true differentially expressed GO terms. However, unlike other methods including GSEA (Pearson), GSEA + MIMGO can comprehensively identify the microarray data in which genes annotated with a differentially expressed GO term are upregulated or downregulated. Conclusions MIMGO is a reliable method to identify differentially expressed GO terms comprehensively. PMID:23232071
Effect of Dissolution Kinetics on Feature Size in Dip-Pen Nanolithography
NASA Astrophysics Data System (ADS)
Weeks, B. L.; Noy, A.; Miller, A. E.; de Yoreo, J. J.
2002-06-01
We have investigated the effects of humidity, tip speed, and dwell time on feature size during dip pen nanolithography. Our results indicate a transition between two distinct deposition regimes occurs at a dwell time independent of humidity. While feature size increases with humidity, the relative increase is independent of dwell time. The results are described by a model that accounts for detachment and reattachment at the tip. The model suggests that, at short dwell times (high speed), the most important parameter controlling the feature size is the activation energy for thiol detachment.
An Introduction to MAMA (Meta-Analysis of MicroArray data) System.
Zhang, Zhe; Fenstermacher, David
2005-01-01
Analyzing microarray data across multiple experiments has been proven advantageous. To support this kind of analysis, we are developing a software system called MAMA (Meta-Analysis of MicroArray data). MAMA utilizes a client-server architecture with a relational database on the server-side for the storage of microarray datasets collected from various resources. The client-side is an application running on the end user's computer that allows the user to manipulate microarray data and analytical results locally. MAMA implementation will integrate several analytical methods, including meta-analysis within an open-source framework offering other developers the flexibility to plug in additional statistical algorithms.
Methods to study legionella transcriptome in vitro and in vivo.
Faucher, Sebastien P; Shuman, Howard A
2013-01-01
The study of transcriptome responses can provide insight into the regulatory pathways and genetic factors that contribute to a specific phenotype. For bacterial pathogens, it can identify putative new virulence systems and shed light on the mechanisms underlying the regulation of virulence factors. Microarrays have been previously used to study gene regulation in Legionella pneumophila. In the past few years a sharp reduction of the costs associated with microarray experiments together with the availability of relatively inexpensive custom-designed commercial microarrays has made microarray technology an accessible tool for the majority of researchers. Here we describe the methodologies to conduct microarray experiments from in vitro and in vivo samples.
Fluorescence-based bioassays for the detection and evaluation of food materials.
Nishi, Kentaro; Isobe, Shin-Ichiro; Zhu, Yun; Kiyama, Ryoiti
2015-10-13
We summarize here the recent progress in fluorescence-based bioassays for the detection and evaluation of food materials by focusing on fluorescent dyes used in bioassays and applications of these assays for food safety, quality and efficacy. Fluorescent dyes have been used in various bioassays, such as biosensing, cell assay, energy transfer-based assay, probing, protein/immunological assay and microarray/biochip assay. Among the arrays used in microarray/biochip assay, fluorescence-based microarrays/biochips, such as antibody/protein microarrays, bead/suspension arrays, capillary/sensor arrays, DNA microarrays/polymerase chain reaction (PCR)-based arrays, glycan/lectin arrays, immunoassay/enzyme-linked immunosorbent assay (ELISA)-based arrays, microfluidic chips and tissue arrays, have been developed and used for the assessment of allergy/poisoning/toxicity, contamination and efficacy/mechanism, and quality control/safety. DNA microarray assays have been used widely for food safety and quality as well as searches for active components. DNA microarray-based gene expression profiling may be useful for such purposes due to its advantages in the evaluation of pathway-based intracellular signaling in response to food materials.
Fluorescence-Based Bioassays for the Detection and Evaluation of Food Materials
Nishi, Kentaro; Isobe, Shin-Ichiro; Zhu, Yun; Kiyama, Ryoiti
2015-01-01
We summarize here the recent progress in fluorescence-based bioassays for the detection and evaluation of food materials by focusing on fluorescent dyes used in bioassays and applications of these assays for food safety, quality and efficacy. Fluorescent dyes have been used in various bioassays, such as biosensing, cell assay, energy transfer-based assay, probing, protein/immunological assay and microarray/biochip assay. Among the arrays used in microarray/biochip assay, fluorescence-based microarrays/biochips, such as antibody/protein microarrays, bead/suspension arrays, capillary/sensor arrays, DNA microarrays/polymerase chain reaction (PCR)-based arrays, glycan/lectin arrays, immunoassay/enzyme-linked immunosorbent assay (ELISA)-based arrays, microfluidic chips and tissue arrays, have been developed and used for the assessment of allergy/poisoning/toxicity, contamination and efficacy/mechanism, and quality control/safety. DNA microarray assays have been used widely for food safety and quality as well as searches for active components. DNA microarray-based gene expression profiling may be useful for such purposes due to its advantages in the evaluation of pathway-based intracellular signaling in response to food materials. PMID:26473869
Chockalingam, Sriram; Aluru, Maneesha; Aluru, Srinivas
2016-09-19
Pre-processing of microarray data is a well-studied problem. Furthermore, all popular platforms come with their own recommended best practices for differential analysis of genes. However, for genome-scale network inference using microarray data collected from large public repositories, these methods filter out a considerable number of genes. This is primarily due to the effects of aggregating a diverse array of experiments with different technical and biological scenarios. Here we introduce a pre-processing pipeline suitable for inferring genome-scale gene networks from large microarray datasets. We show that partitioning of the available microarray datasets according to biological relevance into tissue- and process-specific categories significantly extends the limits of downstream network construction. We demonstrate the effectiveness of our pre-processing pipeline by inferring genome-scale networks for the model plant Arabidopsis thaliana using two different construction methods and a collection of 11,760 Affymetrix ATH1 microarray chips. Our pre-processing pipeline and the datasets used in this paper are made available at http://alurulab.cc.gatech.edu/microarray-pp.
2014-01-01
Background Long noncoding RNAs (lncRNAs) constitute a major, but poorly characterized part of human transcriptome. Recent evidence indicates that many lncRNAs are involved in cancer and can be used as predictive and prognostic biomarkers. Significant fraction of lncRNAs is represented on widely used microarray platforms, however they have usually been ignored in cancer studies. Results We developed a computational pipeline to annotate lncRNAs on popular Affymetrix U133 microarrays, creating a resource allowing measurement of expression of 1581 lncRNAs. This resource can be utilized to interrogate existing microarray datasets for various lncRNA studies. We found that these lncRNAs fall into three distinct classes according to their statistical distribution by length. Remarkably, these three classes of lncRNAs were co-localized with protein coding genes exhibiting distinct gene ontology groups. This annotation was applied to microarray analysis which identified a 159 lncRNA signature that discriminates between localized and metastatic stages of neuroblastoma. Analysis of an independent patient cohort revealed that this signature differentiates also relapsing from non-relapsing primary tumors. This is the first example of the signature developed via the analysis of expression of lncRNAs solely. One of these lncRNAs, termed HOXD-AS1, is encoded in HOXD cluster. HOXD-AS1 is evolutionary conserved among hominids and has all bona fide features of a gene. Studying retinoid acid (RA) response of SH-SY5Y cell line, a model of human metastatic neuroblastoma, we found that HOXD-AS1 is a subject to morphogenic regulation, is activated by PI3K/Akt pathway and itself is involved in control of RA-induced cell differentiation. Knock-down experiments revealed that HOXD-AS1 controls expression levels of clinically significant protein-coding genes involved in angiogenesis and inflammation, the hallmarks of metastatic cancer. Conclusions Our findings greatly extend the number of noncoding RNAs functionally implicated in tumor development and patient treatment and highlight their role as potential prognostic biomarkers of neuroblastomas. PMID:25522241
Abelson, Sagi
2014-02-24
In the past decade, the field of molecular biology has become increasingly quantitative; rapid development of new technologies enables researchers to investigate and address fundamental issues quickly and in an efficient manner which were once impossible. Among these technologies, DNA microarray provides methodology for many applications such as gene discovery, diseases diagnosis, drug development and toxicological research and it has been used increasingly since it first emerged. Multiple tools have been developed to interpret the high-throughput data produced by microarrays. However, many times, less consideration has been given to the fact that an extensive and effective interpretation requires close interplay between the bioinformaticians who analyze the data and the biologists who generate it. To bridge this gap and to simplify the usability of such tools we developed Eureka-DMA - an easy-to-operate graphical user interface that allows bioinformaticians and bench-biologists alike to initiate analyses as well as to investigate the data produced by DNA microarrays. In this paper, we describe Eureka-DMA, a user-friendly software that comprises a set of methods for the interpretation of gene expression arrays. Eureka-DMA includes methods for the identification of genes with differential expression between conditions; it searches for enriched pathways and gene ontology terms and combines them with other relevant features. It thus enables the full understanding of the data for following testing as well as generating new hypotheses. Here we show two analyses, demonstrating examples of how Eureka-DMA can be used and its capability to produce relevant and reliable results. We have integrated several elementary expression analysis tools to provide a unified interface for their implementation. Eureka-DMA's simple graphical user interface provides effective and efficient framework in which the investigator has the full set of tools for the visualization and interpretation of the data with the option of exporting the analysis results for later use in other platforms. Eureka-DMA is freely available for academic users and can be downloaded at http://blue-meduza.org/Eureka-DMA.
Use of whole genome expression analysis in the toxicity screening of nanoparticles
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fröhlich, Eleonore, E-mail: eleonore.froehlich@medunigraz.at; Meindl, Claudia; Wagner, Karin
2014-10-15
The use of nanoparticles (NPs) offers exciting new options in technical and medical applications provided they do not cause adverse cellular effects. Cellular effects of NPs depend on particle parameters and exposure conditions. In this study, whole genome expression arrays were employed to identify the influence of particle size, cytotoxicity, protein coating, and surface functionalization of polystyrene particles as model particles and for short carbon nanotubes (CNTs) as particles with potential interest in medical treatment. Another aim of the study was to find out whether screening by microarray would identify other or additional targets than commonly used cell-based assays formore » NP action. Whole genome expression analysis and assays for cell viability, interleukin secretion, oxidative stress, and apoptosis were employed. Similar to conventional assays, microarray data identified inflammation, oxidative stress, and apoptosis as affected by NP treatment. Application of lower particle doses and presence of protein decreased the total number of regulated genes but did not markedly influence the top regulated genes. Cellular effects of CNTs were small; only carboxyl-functionalized single-walled CNTs caused appreciable regulation of genes. It can be concluded that regulated functions correlated well with results in cell-based assays. Presence of protein mitigated cytotoxicity but did not cause a different pattern of regulated processes. - Highlights: • Regulated functions were screened using whole genome expression assays. • Polystyrene particles regulated more genes than short carbon nanotubes. • Protein coating of polystyrene particles did not change regulation pattern. • Functions regulated by microarray were confirmed by cell-based assay.« less
Influence of time and length size feature selections for human activity sequences recognition.
Fang, Hongqing; Chen, Long; Srinivasan, Raghavendiran
2014-01-01
In this paper, Viterbi algorithm based on a hidden Markov model is applied to recognize activity sequences from observed sensors events. Alternative features selections of time feature values of sensors events and activity length size feature values are tested, respectively, and then the results of activity sequences recognition performances of Viterbi algorithm are evaluated. The results show that the selection of larger time feature values of sensor events and/or smaller activity length size feature values will generate relatively better results on the activity sequences recognition performances. © 2013 ISA Published by ISA All rights reserved.
Microfluidic microarray systems and methods thereof
West, Jay A. A. [Castro Valley, CA; Hukari, Kyle W [San Ramon, CA; Hux, Gary A [Tracy, CA
2009-04-28
Disclosed are systems that include a manifold in fluid communication with a microfluidic chip having a microarray, an illuminator, and a detector in optical communication with the microarray. Methods for using these systems for biological detection are also disclosed.
cDNA Microarray Screening in Food Safety
ROY, SASHWATI; SEN, CHANDAN K
2009-01-01
The cDNA microarray technology and related bioinformatics tools presents a wide range of novel application opportunities. The technology may be productively applied to address food safety. In this mini-review article, we present an update highlighting the late breaking discoveries that demonstrate the vitality of cDNA microarray technology as a tool to analyze food safety with reference to microbial pathogens and genetically modified foods. In order to bring the microarray technology to mainstream food safety, it is important to develop robust user-friendly tools that may be applied in a field setting. In addition, there needs to be a standardized process for regulatory agencies to interpret and act upon microarray-based data. The cDNA microarray approach is an emergent technology in diagnostics. Its values lie in being able to provide complimentary molecular insight when employed in addition to traditional tests for food safety, as part of a more comprehensive battery of tests. PMID:16466843
Li, Zhiguang; Kwekel, Joshua C; Chen, Tao
2012-01-01
Functional comparison across microarray platforms is used to assess the comparability or similarity of the biological relevance associated with the gene expression data generated by multiple microarray platforms. Comparisons at the functional level are very important considering that the ultimate purpose of microarray technology is to determine the biological meaning behind the gene expression changes under a specific condition, not just to generate a list of genes. Herein, we present a method named percentage of overlapping functions (POF) and illustrate how it is used to perform the functional comparison of microarray data generated across multiple platforms. This method facilitates the determination of functional differences or similarities in microarray data generated from multiple array platforms across all the functions that are presented on these platforms. This method can also be used to compare the functional differences or similarities between experiments, projects, or laboratories.
ArrayNinja: An Open Source Platform for Unified Planning and Analysis of Microarray Experiments.
Dickson, B M; Cornett, E M; Ramjan, Z; Rothbart, S B
2016-01-01
Microarray-based proteomic platforms have emerged as valuable tools for studying various aspects of protein function, particularly in the field of chromatin biochemistry. Microarray technology itself is largely unrestricted in regard to printable material and platform design, and efficient multidimensional optimization of assay parameters requires fluidity in the design and analysis of custom print layouts. This motivates the need for streamlined software infrastructure that facilitates the combined planning and analysis of custom microarray experiments. To this end, we have developed ArrayNinja as a portable, open source, and interactive application that unifies the planning and visualization of microarray experiments and provides maximum flexibility to end users. Array experiments can be planned, stored to a private database, and merged with the imaged results for a level of data interaction and centralization that is not currently attainable with available microarray informatics tools. © 2016 Elsevier Inc. All rights reserved.
Emerging Use of Gene Expression Microarrays in Plant Physiology
Wullschleger, Stan D.; Difazio, Stephen P.
2003-01-01
Microarrays have become an important technology for the global analysis of gene expression in humans, animals, plants, and microbes. Implemented in the context of a well-designed experiment, cDNA and oligonucleotide arrays can provide highthroughput, simultaneous analysis of transcript abundance for hundreds, if not thousands, of genes. However, despite widespread acceptance, the use of microarrays as a tool to better understand processes of interest to the plant physiologist is still being explored. To help illustrate current uses of microarrays in the plant sciences, several case studies that we believe demonstrate the emerging application of gene expression arrays in plant physiology weremore » selected from among the many posters and presentations at the 2003 Plant and Animal Genome XI Conference. Based on this survey, microarrays are being used to assess gene expression in plants exposed to the experimental manipulation of air temperature, soil water content and aluminium concentration in the root zone. Analysis often includes characterizing transcript profiles for multiple post-treatment sampling periods and categorizing genes with common patterns of response using hierarchical clustering techniques. In addition, microarrays are also providing insights into developmental changes in gene expression associated with fibre and root elongation in cotton and maize, respectively. Technical and analytical limitations of microarrays are discussed and projects attempting to advance areas of microarray design and data analysis are highlighted. Finally, although much work remains, we conclude that microarrays are a valuable tool for the plant physiologist interested in the characterization and identification of individual genes and gene families with potential application in the fields of agriculture, horticulture and forestry.« less
Profiling In Situ Microbial Community Structure with an Amplification Microarray
Knickerbocker, Christopher; Bryant, Lexi; Golova, Julia; Wiles, Cory; Williams, Kenneth H.; Peacock, Aaron D.; Long, Philip E.
2013-01-01
The objectives of this study were to unify amplification, labeling, and microarray hybridization chemistries within a single, closed microfluidic chamber (an amplification microarray) and verify technology performance on a series of groundwater samples from an in situ field experiment designed to compare U(VI) mobility under conditions of various alkalinities (as HCO3−) during stimulated microbial activity accompanying acetate amendment. Analytical limits of detection were between 2 and 200 cell equivalents of purified DNA. Amplification microarray signatures were well correlated with 16S rRNA-targeted quantitative PCR results and hybridization microarray signatures. The succession of the microbial community was evident with and consistent between the two microarray platforms. Amplification microarray analysis of acetate-treated groundwater showed elevated levels of iron-reducing bacteria (Flexibacter, Geobacter, Rhodoferax, and Shewanella) relative to the average background profile, as expected. Identical molecular signatures were evident in the transect treated with acetate plus NaHCO3, but at much lower signal intensities and with a much more rapid decline (to nondetection). Azoarcus, Thaurea, and Methylobacterium were responsive in the acetate-only transect but not in the presence of bicarbonate. Observed differences in microbial community composition or response to bicarbonate amendment likely had an effect on measured rates of U reduction, with higher rates probable in the part of the field experiment that was amended with bicarbonate. The simplification in microarray-based work flow is a significant technological advance toward entirely closed-amplicon microarray-based tests and is generally extensible to any number of environmental monitoring applications. PMID:23160129
PRACTICAL STRATEGIES FOR PROCESSING AND ANALYZING SPOTTED OLIGONUCLEOTIDE MICROARRAY DATA
Thoughtful data analysis is as important as experimental design, biological sample quality, and appropriate experimental procedures for making microarrays a useful supplement to traditional toxicology. In the present study, spotted oligonucleotide microarrays were used to profile...
DNA Microarray-based Ecotoxicological Biomarker Discovery in a Small Fish Model Species
This paper addresses several issues critical to use of zebrafish oligonucleotide microarrays for computational toxicology research on endocrine disrupting chemicals using small fish models, and more generally, the use of microarrays in aquatic toxicology.
IMPROVING THE RELIABILITY OF MICROARRAYS FOR TOXICOLOGY RESEARCH: A COLLABORATIVE APPROACH
Microarray-based gene expression profiling is a critical tool to identify molecular biomarkers of specific chemical stressors. Although current microarray technologies have progressed from their infancy, biological and technical repeatability and reliability are often still limit...
Direct labeling of serum proteins by fluorescent dye for antibody microarray.
Klimushina, M V; Gumanova, N G; Metelskaya, V A
2017-05-06
Analysis of serum proteome by antibody microarray is used to identify novel biomarkers and to study signaling pathways including protein phosphorylation and protein-protein interactions. Labeling of serum proteins is important for optimal performance of the antibody microarray. Proper choice of fluorescent label and optimal concentration of protein loaded on the microarray ensure good quality of imaging that can be reliably scanned and processed by the software. We have optimized direct serum protein labeling using fluorescent dye Arrayit Green 540 (Arrayit Corporation, USA) for antibody microarray. Optimized procedure produces high quality images that can be readily scanned and used for statistical analysis of protein composition of the serum. Copyright © 2017 Elsevier Inc. All rights reserved.
Transfection microarray and the applications.
Miyake, Masato; Yoshikawa, Tomohiro; Fujita, Satoshi; Miyake, Jun
2009-05-01
Microarray transfection has been extensively studied for high-throughput functional analysis of mammalian cells. However, control of efficiency and reproducibility are the critical issues for practical use. By using solid-phase transfection accelerators and nano-scaffold, we provide a highly efficient and reproducible microarray-transfection device, "transfection microarray". The device would be applied to the limited number of available primary cells and stem cells not only for large-scale functional analysis but also reporter-based time-lapse cellular event analysis.
A Human Lectin Microarray for Sperm Surface Glycosylation Analysis *
Sun, Yangyang; Cheng, Li; Gu, Yihua; Xin, Aijie; Wu, Bin; Zhou, Shumin; Guo, Shujuan; Liu, Yin; Diao, Hua; Shi, Huijuan; Wang, Guangyu; Tao, Sheng-ce
2016-01-01
Glycosylation is one of the most abundant and functionally important protein post-translational modifications. As such, technology for efficient glycosylation analysis is in high demand. Lectin microarrays are a powerful tool for such investigations and have been successfully applied for a variety of glycobiological studies. However, most of the current lectin microarrays are primarily constructed from plant lectins, which are not well suited for studies of human glycosylation because of the extreme complexity of human glycans. Herein, we constructed a human lectin microarray with 60 human lectin and lectin-like proteins. All of the lectins and lectin-like proteins were purified from yeast, and most showed binding to human glycans. To demonstrate the applicability of the human lectin microarray, human sperm were probed on the microarray and strong bindings were observed for several lectins, including galectin-1, 7, 8, GalNAc-T6, and ERGIC-53 (LMAN1). These bindings were validated by flow cytometry and fluorescence immunostaining. Further, mass spectrometry analysis showed that galectin-1 binds several membrane-associated proteins including heat shock protein 90. Finally, functional assays showed that binding of galectin-8 could significantly enhance the acrosome reaction within human sperms. To our knowledge, this is the first construction of a human lectin microarray, and we anticipate it will find wide use for a range of human or mammalian studies, alone or in combination with plant lectin microarrays. PMID:27364157
Barton, G; Abbott, J; Chiba, N; Huang, DW; Huang, Y; Krznaric, M; Mack-Smith, J; Saleem, A; Sherman, BT; Tiwari, B; Tomlinson, C; Aitman, T; Darlington, J; Game, L; Sternberg, MJE; Butcher, SA
2008-01-01
Background Microarray experimentation requires the application of complex analysis methods as well as the use of non-trivial computer technologies to manage the resultant large data sets. This, together with the proliferation of tools and techniques for microarray data analysis, makes it very challenging for a laboratory scientist to keep up-to-date with the latest developments in this field. Our aim was to develop a distributed e-support system for microarray data analysis and management. Results EMAAS (Extensible MicroArray Analysis System) is a multi-user rich internet application (RIA) providing simple, robust access to up-to-date resources for microarray data storage and analysis, combined with integrated tools to optimise real time user support and training. The system leverages the power of distributed computing to perform microarray analyses, and provides seamless access to resources located at various remote facilities. The EMAAS framework allows users to import microarray data from several sources to an underlying database, to pre-process, quality assess and analyse the data, to perform functional analyses, and to track data analysis steps, all through a single easy to use web portal. This interface offers distance support to users both in the form of video tutorials and via live screen feeds using the web conferencing tool EVO. A number of analysis packages, including R-Bioconductor and Affymetrix Power Tools have been integrated on the server side and are available programmatically through the Postgres-PLR library or on grid compute clusters. Integrated distributed resources include the functional annotation tool DAVID, GeneCards and the microarray data repositories GEO, CELSIUS and MiMiR. EMAAS currently supports analysis of Affymetrix 3' and Exon expression arrays, and the system is extensible to cater for other microarray and transcriptomic platforms. Conclusion EMAAS enables users to track and perform microarray data management and analysis tasks through a single easy-to-use web application. The system architecture is flexible and scalable to allow new array types, analysis algorithms and tools to be added with relative ease and to cope with large increases in data volume. PMID:19032776
Hume, Maxwell A; Barrera, Luis A; Gisselbrecht, Stephen S; Bulyk, Martha L
2015-01-01
The Universal PBM Resource for Oligonucleotide Binding Evaluation (UniPROBE) serves as a convenient source of information on published data generated using universal protein-binding microarray (PBM) technology, which provides in vitro data about the relative DNA-binding preferences of transcription factors for all possible sequence variants of a length k ('k-mers'). The database displays important information about the proteins and displays their DNA-binding specificity data in terms of k-mers, position weight matrices and graphical sequence logos. This update to the database documents the growth of UniPROBE since the last update 4 years ago, and introduces a variety of new features and tools, including a new streamlined pipeline that facilitates data deposition by universal PBM data generators in the research community, a tool that generates putative nonbinding (i.e. negative control) DNA sequences for one or more proteins and novel motifs obtained by analyzing the PBM data using the BEEML-PBM algorithm for motif inference. The UniPROBE database is available at http://uniprobe.org. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
THE MAQC PROJECT: ESTABLISHING QC METRICS AND THRESHOLDS FOR MICROARRAY QUALITY CONTROL
Microarrays represent a core technology in pharmacogenomics and toxicogenomics; however, before this technology can successfully and reliably be applied in clinical practice and regulatory decision-making, standards and quality measures need to be developed. The Microarray Qualit...
Tylee, Daniel S; Hess, Jonathan L; Quinn, Thomas P; Barve, Rahul; Huang, Hailiang; Zhang-James, Yanli; Chang, Jeffrey; Stamova, Boryana S; Sharp, Frank R; Hertz-Picciotto, Irva; Faraone, Stephen V; Kong, Sek Won; Glatt, Stephen J
2017-04-01
Blood-based microarray studies comparing individuals affected with autism spectrum disorder (ASD) and typically developing individuals help characterize differences in circulating immune cell functions and offer potential biomarker signal. We sought to combine the subject-level data from previously published studies by mega-analysis to increase the statistical power. We identified studies that compared ex vivo blood or lymphocytes from ASD-affected individuals and unrelated comparison subjects using Affymetrix or Illumina array platforms. Raw microarray data and clinical meta-data were obtained from seven studies, totaling 626 affected and 447 comparison subjects. Microarray data were processed using uniform methods. Covariate-controlled mixed-effect linear models were used to identify gene transcripts and co-expression network modules that were significantly associated with diagnostic status. Permutation-based gene-set analysis was used to identify functionally related sets of genes that were over- and under-expressed among ASD samples. Our results were consistent with diminished interferon-, EGF-, PDGF-, PI3K-AKT-mTOR-, and RAS-MAPK-signaling cascades, and increased ribosomal translation and NK-cell related activity in ASD. We explored evidence for sex-differences in the ASD-related transcriptomic signature. We also demonstrated that machine-learning classifiers using blood transcriptome data perform with moderate accuracy when data are combined across studies. Comparing our results with those from blood-based studies of protein biomarkers (e.g., cytokines and trophic factors), we propose that ASD may feature decoupling between certain circulating signaling proteins (higher in ASD samples) and the transcriptional cascades which they typically elicit within circulating immune cells (lower in ASD samples). These findings provide insight into ASD-related transcriptional differences in circulating immune cells. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
González-Plaza, Juan J.; Ortiz-Martín, Inmaculada; Muñoz-Mérida, Antonio; García-López, Carmen; Sánchez-Sevilla, José F.; Luque, Francisco; Trelles, Oswaldo; Bejarano, Eduardo R.; De La Rosa, Raúl; Valpuesta, Victoriano; Beuzón, Carmen R.
2016-01-01
Plant architecture is a critical trait in fruit crops that can significantly influence yield, pruning, planting density and harvesting. Little is known about how plant architecture is genetically determined in olive, were most of the existing varieties are traditional with an architecture poorly suited for modern growing and harvesting systems. In the present study, we have carried out microarray analysis of meristematic tissue to compare expression profiles of olive varieties displaying differences in architecture, as well as seedlings from their cross pooled on the basis of their sharing architecture-related phenotypes. The microarray used, previously developed by our group has already been applied to identify candidates genes involved in regulating juvenile to adult transition in the shoot apex of seedlings. Varieties with distinct architecture phenotypes and individuals from segregating progenies displaying opposite architecture features were used to link phenotype to expression. Here, we identify 2252 differentially expressed genes (DEGs) associated to differences in plant architecture. Microarray results were validated by quantitative RT-PCR carried out on genes with functional annotation likely related to plant architecture. Twelve of these genes were further analyzed in individual seedlings of the corresponding pool. We also examined Arabidopsis mutants in putative orthologs of these targeted candidate genes, finding altered architecture for most of them. This supports a functional conservation between species and potential biological relevance of the candidate genes identified. This study is the first to identify genes associated to plant architecture in olive, and the results obtained could be of great help in future programs aimed at selecting phenotypes adapted to modern cultivation practices in this species. PMID:26973682
Coram, Tristan E; Pang, Edwin C K
2006-11-01
Using microarray technology and a set of chickpea (Cicer arietinum L.) unigenes, grasspea (Lathyrus sativus L.) expressed sequence tags (ESTs) and lentil (Lens culinaris Med.) resistance gene analogues, the ascochyta blight (Ascochyta rabiei (Pass.) L.) resistance response was studied in four chickpea genotypes, including resistant, moderately resistant, susceptible and wild relative (Cicer echinospermum L.) genotypes. The experimental system minimized environmental effects and was conducted in reference design, in which samples from mock-inoculated controls acted as reference against post-inoculation samples. Robust data quality was achieved through the use of three biological replicates (including a dye swap), the inclusion of negative controls and strict selection criteria for differentially expressed genes, including a fold change cut-off determined by self-self hybridizations, Student's t-test and multiple testing correction (P < 0.05). Microarray observations were also validated by quantitative reverse transcriptase-polymerase chain reaction (RT-PCR). The time course expression patterns of 756 microarray features resulted in the differential expression of 97 genes in at least one genotype at one time point. k-means clustering grouped the genes into clusters of similar observations for each genotype, and comparisons between A. rabiei-resistant and A. rabiei-susceptible genotypes revealed potential gene 'signatures' predictive of effective A. rabiei resistance. These genes included several pathogenesis-related proteins, SNAKIN2 antimicrobial peptide, proline-rich protein, disease resistance response protein DRRG49-C, environmental stress-inducible protein, leucine-zipper protein, polymorphic antigen membrane protein, Ca-binding protein and several unknown proteins. The potential involvement of these genes and their pathways of induction are discussed. This study represents the first large-scale gene expression profiling in chickpea, and future work will focus on the functional validation of the genes of interest.
Hook, Sharon E; Nagler, James J; Cavileer, Tim; Verducci, Joseph; Liu, Yushi; Hayton, William; Schultz, Irvin R
2011-02-01
Normal transcriptomic patterns along the brain-pituitary-gonad-liver (BPGL) axis should be better characterized if endocrine-disrupting compound-induced changes in gene expression are to be understood. Female rainbow trout were studied over a complete year-long reproductive cycle. Tissue samples from pituitary, ovary, and liver were collected for microarray analysis using the 16K Genomic Research on Atlantic Salmon Project (GRASP) microarray and for quantitative polymerase chain reaction measures of estrogen receptor (ER) isoform messenger RNA (mRNA) levels. Plasma was collected to determine levels of circulating estradiol-17β (E2), follicle-stimulating hormone (FSH), and luteinizing hormone (LH). As an a priori hypothesis, changes in gene expression were correlated to either circulating levels of E2, FSH, and LH, or ER mRNAs quantified by quantitative polymerase chain reaction. In the liver, most transcriptomic patterns correlated to levels of either E2, LH, or ERs. Fewer ovarian transcripts could be correlated to levels of E2, ERα, or FSH. No significant associations were obvious in the pituitary. As a post hoc hypothesis, changes in transcript abundance were compared with microarray features with known roles in gonadal maturation. Many altered transcripts in the ovary correlated to transcript levels of estradiol 17-beta-dehydrogenase 8 or 17 B HSD12, or to glycoprotein alpha chain 1 or 2. In the pituitary, genes involved with the growth axis (e.g., growth hormone, insulin-related growth factor binding protein) correlated with the most transcripts. These results suggest that transcriptional networks along the BPGL axis may be regulated by factors other than circulating steroid hormones. © 2010 SETAC.
Hata, Junya; Satoh, Yuichi; Akaihata, Hidenori; Hiraki, Hiroyuki; Ogawa, Soichiro; Haga, Nobuhiro; Ishibashi, Kei; Aikawa, Ken; Kojima, Yoshiyuki
2016-07-01
To characterize the molecular features of benign prostatic hyperplasia by carrying out a gene expression profiling analysis in a rat model. Fetal urogenital sinus isolated from 20-day-old male rat embryo was implanted into a pubertal male rat ventral prostate. The implanted urogenital sinus grew time-dependently, and the pathological findings at 3 weeks after implantation showed epithelial hyperplasia as well as stromal hyperplasia. Whole-genome oligonucleotide microarray analysis utilizing approximately 30 000 oligonucleotide probes was carried out using prostate specimens during the prostate growth process (3 weeks after implantation). Microarray analyses showed 926 upregulated (>2-fold change, P < 0.01) and 3217 downregulated genes (<0.5-fold change, P < 0.01) in benign prostatic hyperplasia specimens compared with normal prostate. Gene ontology analyses of upregulated genes showed predominant genetic themes of involvement in development (162 genes, P = 2.01 × 10(-4) ), response to stimulus (163 genes, P = 7.37 × 10(-13) ) and growth (32 genes, P = 1.93 × 10(-5) ). When we used both normal prostate and non-transplanted urogenital sinuses as controls to identify benign prostatic hyperplasia-specific genes, 507 and 406 genes were upregulated and downregulated, respectively. Functional network and pathway analyses showed that genes associated with apoptosis modulation by heat shock protein 70, interleukin-1, interleukin-2 and interleukin-5 signaling pathways, KIT signaling pathway, and secretin-like G-protein-coupled receptors, class B, were relatively activated during the growth process in the benign prostatic hyperplasia specimens. In contrast, genes associated with cholesterol biosynthesis were relatively inactivated. Our microarray analyses of the benign prostatic hyperplasia model rat might aid in clarifying the molecular mechanism of benign prostatic hyperplasia progression, and identifying molecular targets for benign prostatic hyperplasia treatment. © 2016 The Japanese Urological Association.
Tylee, Daniel S.; Hess, Jonathan L.; Quinn, Thomas P.; Barve, Rahul; Huang, Hailiang; Zhang-James, Yanli; Chang, Jeffrey; Stamova, Boryana S.; Sharp, Frank R.; Hertz-Picciotto, Irva; Faraone, Stephen V.; Kong, Sek Won; Glatt, Stephen J.
2017-01-01
Blood-based microarray studies comparing individuals affected with autism spectrum disorder (ASD) and typically developing individuals help characterize differences in circulating immune cell functions and offer potential biomarker signal. We sought to combine the subject-level data from previously published studies by mega-analysis to increase the statistical power. We identified studies that compared ex-vivo blood or lymphocytes from ASD-affected individuals and unrelated comparison subjects using Affymetrix or Illumina array platforms. Raw microarray data and clinical meta-data were obtained from seven studies, totaling 626 affected and 447 comparison subjects. Microarray data were processed using uniform methods. Covariate-controlled mixed-effect linear models were used to identify gene transcripts and co-expression network modules that were significantly associated with diagnostic status. Permutation-based gene-set analysis was used to identify functionally related sets of genes that were over- and under-expressed among ASD samples. Our results were consistent with diminished interferon-, EGF-, PDGF-, PI3K-AKT-mTOR-, and RAS-MAPK-signaling cascades, and increased ribosomal translation and NK-cell related activity in ASD. We explored evidence for sex-differences in the ASD-related transcriptomic signature. We also demonstrated that machine-learning classifiers using blood transcriptome data perform with moderate accuracy when data are combined across studies. Comparing our results with those from blood-based studies of protein biomarkers (e.g., cytokines and trophic factors), we propose that ASD may feature decoupling between certain circulating signaling proteins (higher in ASD samples) and the transcriptional cascades which they typically elicit within circulating immune cells (lower in ASD samples). These findings provide insight into ASD-related transcriptional differences in circulating immune cells. PMID:27862943
A Discovery Resource of Rare Copy Number Variations in Individuals with Autism Spectrum Disorder
Prasad, Aparna; Merico, Daniele; Thiruvahindrapuram, Bhooma; Wei, John; Lionel, Anath C.; Sato, Daisuke; Rickaby, Jessica; Lu, Chao; Szatmari, Peter; Roberts, Wendy; Fernandez, Bridget A.; Marshall, Christian R.; Hatchwell, Eli; Eis, Peggy S.; Scherer, Stephen W.
2012-01-01
The identification of rare inherited and de novo copy number variations (CNVs) in human subjects has proven a productive approach to highlight risk genes for autism spectrum disorder (ASD). A variety of microarrays are available to detect CNVs, including single-nucleotide polymorphism (SNP) arrays and comparative genomic hybridization (CGH) arrays. Here, we examine a cohort of 696 unrelated ASD cases using a high-resolution one-million feature CGH microarray, the majority of which were previously genotyped with SNP arrays. Our objective was to discover new CNVs in ASD cases that were not detected by SNP microarray analysis and to delineate novel ASD risk loci via combined analysis of CGH and SNP array data sets on the ASD cohort and CGH data on an additional 1000 control samples. Of the 615 ASD cases analyzed on both SNP and CGH arrays, we found that 13,572 of 21,346 (64%) of the CNVs were exclusively detected by the CGH array. Several of the CGH-specific CNVs are rare in population frequency and impact previously reported ASD genes (e.g., NRXN1, GRM8, DPYD), as well as novel ASD candidate genes (e.g., CIB2, DAPP1, SAE1), and all were inherited except for a de novo CNV in the GPHN gene. A functional enrichment test of gene-sets in ASD cases over controls revealed nucleotide metabolism as a potential novel pathway involved in ASD, which includes several candidate genes for follow-up (e.g., DPYD, UPB1, UPP1, TYMP). Finally, this extensively phenotyped and genotyped ASD clinical cohort serves as an invaluable resource for the next step of genome sequencing for complete genetic variation detection. PMID:23275889
The Changes of Gene Expression on Human Hair during Long-Spaceflight
NASA Astrophysics Data System (ADS)
Terada, Masahiro; Mukai, Chiaki; Ishioka, Noriaki; Majima, Hideyuki J.; Yamada, Shin; Seki, Masaya; Takahashi, Rika; Higashibata, Akira; Ohshima, Hiroshi; Sudoh, Masamichi; Minamisawa, Susumu
Hair has many advantages as the experimental sample. In a hair follicle, hair matrix cells actively divide and these active changes sensitively reflect physical condition on human body. The hair shaft records the metabolic conditions of mineral elements in our body. From human hairs, we can detect physiological informations about the human health. Therefore, we focused on using hair root analysis to understand the effects of spaceflight on astronauts. In 2009, we started a research program focusing on the analysis of astronauts’ hairs to examine the effects of long-term spaceflight on the gene expression in the human body. We want to get basic information to invent the effectivly diagnostic methods to detect the health situations of astronauts during space flight by analyzing human hair. We extracted RNA form the collected samples. Then, these extracted RNA was amplified. Amplified RNA was processed and hybridized to the Whole Human Genome (4×44K) Oligo Microarray (Agilent Technologies) according to the manufacturer’s protocol. Slide scanning was performed using the Agilent DNA Microarray Scanner. Scanning data were normalized with Agilent’s Feature Extraction software. Data preprocessing and analysis were performed using GeneSpring software 11.0.1. Next, Synthesis of cDNA (1 mg) was carried out using the PrimeScript RT reagent Kit (TaKaRa Bio) following the manufacturer’s instructions. The qRT-PCR experiment was performed with SYBR Premix Ex Taq (TaKaRa Bio) using the 7500 Real-Time PCR system (Applied Biosystems). We detected the changes of some gene expressions during spaceflight from both microarray and qRT-PCR data. These genes seems to be related with the hair proliferation. We believe that these results will lead to the discovery of the important factor effected during space flight on the hair.
Benech, P D; Patatian, A
2014-12-01
There is no doubt that the DNA microarray-based technology contributed to increase our knowledge of a wide range of processes. However, integrating genes into functional networks, rather than terms describing generic characteristics, remains an important challenge. The highly context-dependent function of a given gene and feedback mechanisms complexify greatly the interpretation of the data. Moreover, it is difficult to determine whether changes in gene expression are the result or the cause of pathologies or physiological events. In both cases, the difficulty relies on the involvement of processes that, at an early stage, can be protective and later on, deleterious because of their runaway. Each individual cell has its own transcription profile that determines its behaviour and its relationships with its neighbours. This is particularly true when a mechanism such as cell cycle is concerned. Another issue concerns the analyses from samples of different donors. Whereas the statistical tools lead to determine common features among groups, they tend to smooth the overall data and consequently, the selected values represent the 'tip of the iceberg'. There is a significant overlap in the set of genes identified in the different studies on skin ageing processes described in the present review. The reason of this overlap is because most of these genes belong to the basic machinery controlling cell growth and arrest. To get a more full picture of these processes, a hard work has still to be done to determine the precise mechanisms conferring the cell type specificity of ageing. Integrative biology applied to the huge amount of existing microarray data should fulfil gaps, through the characterization of additional actors accounting for the activation of specific signalling pathways at crossing points. Furthermore, computational tools have to be developed taking into account that expression values among similar groups may not vary 'by chance' but may reflect, along with other subtle changes, specific features of one given donor. Through a better stratification, these tools will allow to recover genes from the 'bottom of the iceberg'. Identifying these genes should contribute to understand how skin ages among individuals, thus paving the way for personalized skin care. © 2014 Society of Cosmetic Scientists and the Société Française de Cosmétologie.
Li, Dongmei; Le Pape, Marc A; Parikh, Nisha I; Chen, Will X; Dye, Timothy D
2013-01-01
Microarrays are widely used for examining differential gene expression, identifying single nucleotide polymorphisms, and detecting methylation loci. Multiple testing methods in microarray data analysis aim at controlling both Type I and Type II error rates; however, real microarray data do not always fit their distribution assumptions. Smyth's ubiquitous parametric method, for example, inadequately accommodates violations of normality assumptions, resulting in inflated Type I error rates. The Significance Analysis of Microarrays, another widely used microarray data analysis method, is based on a permutation test and is robust to non-normally distributed data; however, the Significance Analysis of Microarrays method fold change criteria are problematic, and can critically alter the conclusion of a study, as a result of compositional changes of the control data set in the analysis. We propose a novel approach, combining resampling with empirical Bayes methods: the Resampling-based empirical Bayes Methods. This approach not only reduces false discovery rates for non-normally distributed microarray data, but it is also impervious to fold change threshold since no control data set selection is needed. Through simulation studies, sensitivities, specificities, total rejections, and false discovery rates are compared across the Smyth's parametric method, the Significance Analysis of Microarrays, and the Resampling-based empirical Bayes Methods. Differences in false discovery rates controls between each approach are illustrated through a preterm delivery methylation study. The results show that the Resampling-based empirical Bayes Methods offer significantly higher specificity and lower false discovery rates compared to Smyth's parametric method when data are not normally distributed. The Resampling-based empirical Bayes Methods also offers higher statistical power than the Significance Analysis of Microarrays method when the proportion of significantly differentially expressed genes is large for both normally and non-normally distributed data. Finally, the Resampling-based empirical Bayes Methods are generalizable to next generation sequencing RNA-seq data analysis.
Evaluating concentration estimation errors in ELISA microarray experiments
DOE Office of Scientific and Technical Information (OSTI.GOV)
Daly, Don S.; White, Amanda M.; Varnum, Susan M.
Enzyme-linked immunosorbent assay (ELISA) is a standard immunoassay to predict a protein concentration in a sample. Deploying ELISA in a microarray format permits simultaneous prediction of the concentrations of numerous proteins in a small sample. These predictions, however, are uncertain due to processing error and biological variability. Evaluating prediction error is critical to interpreting biological significance and improving the ELISA microarray process. Evaluating prediction error must be automated to realize a reliable high-throughput ELISA microarray system. Methods: In this paper, we present a statistical method based on propagation of error to evaluate prediction errors in the ELISA microarray process. Althoughmore » propagation of error is central to this method, it is effective only when comparable data are available. Therefore, we briefly discuss the roles of experimental design, data screening, normalization and statistical diagnostics when evaluating ELISA microarray prediction errors. We use an ELISA microarray investigation of breast cancer biomarkers to illustrate the evaluation of prediction errors. The illustration begins with a description of the design and resulting data, followed by a brief discussion of data screening and normalization. In our illustration, we fit a standard curve to the screened and normalized data, review the modeling diagnostics, and apply propagation of error.« less
The Importance of Normalization on Large and Heterogeneous Microarray Datasets
DNA microarray technology is a powerful functional genomics tool increasingly used for investigating global gene expression in environmental studies. Microarrays can also be used in identifying biological networks, as they give insight on the complex gene-to-gene interactions, ne...
O-Charoen, Sirimon; Srivannavit, Onnop; Gulari, Erdogan
2008-01-01
Microfluidic microarrays have been developed for economical and rapid parallel synthesis of oligonucleotide and peptide libraries. For a synthesis system to be reproducible and uniform, it is crucial to have a uniform reagent delivery throughout the system. Computational fluid dynamics (CFD) is used to model and simulate the microfluidic microarrays to study geometrical effects on flow patterns. By proper design geometry, flow uniformity could be obtained in every microreactor in the microarrays. PMID:17480053
The application of DNA microarrays in gene expression analysis.
van Hal, N L; Vorst, O; van Houwelingen, A M; Kok, E J; Peijnenburg, A; Aharoni, A; van Tunen, A J; Keijer, J
2000-03-31
DNA microarray technology is a new and powerful technology that will substantially increase the speed of molecular biological research. This paper gives a survey of DNA microarray technology and its use in gene expression studies. The technical aspects and their potential improvements are discussed. These comprise array manufacturing and design, array hybridisation, scanning, and data handling. Furthermore, it is discussed how DNA microarrays can be applied in the working fields of: safety, functionality and health of food and gene discovery and pathway engineering in plants.
Sandwich ELISA Microarrays: Generating Reliable and Reproducible Assays for High-Throughput Screens
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gonzalez, Rachel M.; Varnum, Susan M.; Zangar, Richard C.
The sandwich ELISA microarray is a powerful screening tool in biomarker discovery and validation due to its ability to simultaneously probe for multiple proteins in a miniaturized assay. The technical challenges of generating and processing the arrays are numerous. However, careful attention to possible pitfalls in the development of your antibody microarray assay can overcome these challenges. In this chapter, we describe in detail the steps that are involved in generating a reliable and reproducible sandwich ELISA microarray assay.
Optimal number of features as a function of sample size for various classification rules.
Hua, Jianping; Xiong, Zixiang; Lowey, James; Suh, Edward; Dougherty, Edward R
2005-04-15
Given the joint feature-label distribution, increasing the number of features always results in decreased classification error; however, this is not the case when a classifier is designed via a classification rule from sample data. Typically (but not always), for fixed sample size, the error of a designed classifier decreases and then increases as the number of features grows. The potential downside of using too many features is most critical for small samples, which are commonplace for gene-expression-based classifiers for phenotype discrimination. For fixed sample size and feature-label distribution, the issue is to find an optimal number of features. Since only in rare cases is there a known distribution of the error as a function of the number of features and sample size, this study employs simulation for various feature-label distributions and classification rules, and across a wide range of sample and feature-set sizes. To achieve the desired end, finding the optimal number of features as a function of sample size, it employs massively parallel computation. Seven classifiers are treated: 3-nearest-neighbor, Gaussian kernel, linear support vector machine, polynomial support vector machine, perceptron, regular histogram and linear discriminant analysis. Three Gaussian-based models are considered: linear, nonlinear and bimodal. In addition, real patient data from a large breast-cancer study is considered. To mitigate the combinatorial search for finding optimal feature sets, and to model the situation in which subsets of genes are co-regulated and correlation is internal to these subsets, we assume that the covariance matrix of the features is blocked, with each block corresponding to a group of correlated features. Altogether there are a large number of error surfaces for the many cases. These are provided in full on a companion website, which is meant to serve as resource for those working with small-sample classification. For the companion website, please visit http://public.tgen.org/tamu/ofs/ e-dougherty@ee.tamu.edu.
van Huet, Ramon A. C.; Pierrache, Laurence H.M.; Meester-Smoor, Magda A.; Klaver, Caroline C.W.; van den Born, L. Ingeborgh; Hoyng, Carel B.; de Wijs, Ilse J.; Collin, Rob W. J.; Hoefsloot, Lies H.
2015-01-01
Purpose To determine the efficacy of multiple versions of a commercially available arrayed primer extension (APEX) microarray chip for autosomal recessive retinitis pigmentosa (arRP). Methods We included 250 probands suspected of arRP who were genetically analyzed with the APEX microarray between January 2008 and November 2013. The mode of inheritance had to be autosomal recessive according to the pedigree (including isolated cases). If the microarray identified a heterozygous mutation, we performed Sanger sequencing of exons and exon–intron boundaries of that specific gene. The efficacy of this microarray chip with the additional Sanger sequencing approach was determined by the percentage of patients that received a molecular diagnosis. We also collected data from genetic tests other than the APEX analysis for arRP to provide a detailed description of the molecular diagnoses in our study cohort. Results The APEX microarray chip for arRP identified the molecular diagnosis in 21 (8.5%) of the patients in our cohort. Additional Sanger sequencing yielded a second mutation in 17 patients (6.8%), thereby establishing the molecular diagnosis. In total, 38 patients (15.2%) received a molecular diagnosis after analysis using the microarray and additional Sanger sequencing approach. Further genetic analyses after a negative result of the arRP microarray (n = 107) resulted in a molecular diagnosis of arRP (n = 23), autosomal dominant RP (n = 5), X-linked RP (n = 2), and choroideremia (n = 1). Conclusions The efficacy of the commercially available APEX microarray chips for arRP appears to be low, most likely caused by the limitations of this technique and the genetic and allelic heterogeneity of RP. Diagnostic yields up to 40% have been reported for next-generation sequencing (NGS) techniques that, as expected, thereby outperform targeted APEX analysis. PMID:25999674
Best practices for hybridization design in two-colour microarray analysis.
Knapen, Dries; Vergauwen, Lucia; Laukens, Kris; Blust, Ronny
2009-07-01
Two-colour microarrays are a popular platform of choice in gene expression studies. Because two different samples are hybridized on a single microarray, and several microarrays are usually needed in a given experiment, there are many possible ways to combine samples on different microarrays. The actual combination employed is commonly referred to as the 'hybridization design'. Different types of hybridization designs have been developed, all aimed at optimizing the experimental setup for the detection of differentially expressed genes while coping with technical noise. Here, we first provide an overview of the different classes of hybridization designs, discussing their advantages and limitations, and then we illustrate the current trends in the use of different hybridization design types in contemporary research.
Li, Ben; Li, Yunxiao; Qin, Zhaohui S
2017-06-01
Modern high-throughput biotechnologies such as microarray and next generation sequencing produce a massive amount of information for each sample assayed. However, in a typical high-throughput experiment, only limited amount of data are observed for each individual feature, thus the classical 'large p , small n ' problem. Bayesian hierarchical model, capable of borrowing strength across features within the same dataset, has been recognized as an effective tool in analyzing such data. However, the shrinkage effect, the most prominent feature of hierarchical features, can lead to undesirable over-correction for some features. In this work, we discuss possible causes of the over-correction problem and propose several alternative solutions. Our strategy is rooted in the fact that in the Big Data era, large amount of historical data are available which should be taken advantage of. Our strategy presents a new framework to enhance the Bayesian hierarchical model. Through simulation and real data analysis, we demonstrated superior performance of the proposed strategy. Our new strategy also enables borrowing information across different platforms which could be extremely useful with emergence of new technologies and accumulation of data from different platforms in the Big Data era. Our method has been implemented in R package "adaptiveHM", which is freely available from https://github.com/benliemory/adaptiveHM.
Li, Ben; Li, Yunxiao; Qin, Zhaohui S.
2016-01-01
Modern high-throughput biotechnologies such as microarray and next generation sequencing produce a massive amount of information for each sample assayed. However, in a typical high-throughput experiment, only limited amount of data are observed for each individual feature, thus the classical ‘large p, small n’ problem. Bayesian hierarchical model, capable of borrowing strength across features within the same dataset, has been recognized as an effective tool in analyzing such data. However, the shrinkage effect, the most prominent feature of hierarchical features, can lead to undesirable over-correction for some features. In this work, we discuss possible causes of the over-correction problem and propose several alternative solutions. Our strategy is rooted in the fact that in the Big Data era, large amount of historical data are available which should be taken advantage of. Our strategy presents a new framework to enhance the Bayesian hierarchical model. Through simulation and real data analysis, we demonstrated superior performance of the proposed strategy. Our new strategy also enables borrowing information across different platforms which could be extremely useful with emergence of new technologies and accumulation of data from different platforms in the Big Data era. Our method has been implemented in R package “adaptiveHM”, which is freely available from https://github.com/benliemory/adaptiveHM. PMID:28919931
A girl with early-onset epileptic encephalopathy associated with microdeletion involving CDKL5.
Saitsu, Hirotomo; Osaka, Hitoshi; Nishiyama, Kiyomi; Tsurusaki, Yoshinori; Doi, Hiroshi; Miyake, Noriko; Matsumoto, Naomichi
2012-05-01
Recent studies have shown that aberrations of CDKL5 in female patients cause early-onset intractable seizures, severe developmental delay or regression, and Rett syndrome-like features. We report on a Japanese girl with early-onset epileptic encephalopathy, hypotonia, developmental regression, and Rett syndrome-like features. The patient showed generalized tonic seizures, and later, massive myoclonus induced by phone and light stimuli. Brain magnetic resonance imaging showed no structural brain anomalies but cerebral atrophy. Electroencephalogram showed frontal dominant diffuse poly spikes and waves. Through copy number analysis by genomic microarray, we found a microdeletion at Xp22.13. A de novo 137-kb deletion, involving exons 5-21 of CDKL5, RS1, and part of PPEF1 gene, was confirmed by quantitative PCR and breakpoint specific PCR analyses. Our report suggests that the clinical features associated with CDKL5 deletions could be implicated in Japanese patients, and that genetic testing of CDKL5, including both sequencing and deletion analyses, should be considered in girls with early-onset epileptic encephalopathy and RTT-like features. Copyright © 2011 The Japanese Society of Child Neurology. Published by Elsevier B.V. All rights reserved.
Experimental Approaches to Microarray Analysis of Tumor Samples
ERIC Educational Resources Information Center
Furge, Laura Lowe; Winter, Michael B.; Meyers, Jacob I.; Furge, Kyle A.
2008-01-01
Comprehensive measurement of gene expression using high-density nucleic acid arrays (i.e. microarrays) has become an important tool for investigating the molecular differences in clinical and research samples. Consequently, inclusion of discussion in biochemistry, molecular biology, or other appropriate courses of microarray technologies has…
Challenges of microarray applications for microbial detection and gene expression profiling in food
USDA-ARS?s Scientific Manuscript database
Microarray technology represents one of the latest advances in molecular biology. The diverse types of microarrays have been applied to clinical and environmental microbiology, microbial ecology, and in human, veterinary, and plant diagnostics. Since multiple genes can be analyzed simultaneously, ...
CEM-designer: design of custom expression microarrays in the post-ENCODE Era.
Arnold, Christian; Externbrink, Fabian; Hackermüller, Jörg; Reiche, Kristin
2014-11-10
Microarrays are widely used in gene expression studies, and custom expression microarrays are popular to monitor expression changes of a customer-defined set of genes. However, the complexity of transcriptomes uncovered recently make custom expression microarray design a non-trivial task. Pervasive transcription and alternative processing of transcripts generate a wealth of interweaved transcripts that requires well-considered probe design strategies and is largely neglected in existing approaches. We developed the web server CEM-Designer that facilitates microarray platform independent design of custom expression microarrays for complex transcriptomes. CEM-Designer covers (i) the collection and generation of a set of unique target sequences from different sources and (ii) the selection of a set of sensitive and specific probes that optimally represents the target sequences. Probe design itself is left to third party software to ensure that probes meet provider-specific constraints. CEM-Designer is available at http://designpipeline.bioinf.uni-leipzig.de. Copyright © 2014 Elsevier B.V. All rights reserved.
Multiplex cDNA quantification method that facilitates the standardization of gene expression data
Gotoh, Osamu; Murakami, Yasufumi; Suyama, Akira
2011-01-01
Microarray-based gene expression measurement is one of the major methods for transcriptome analysis. However, current microarray data are substantially affected by microarray platforms and RNA references because of the microarray method can provide merely the relative amounts of gene expression levels. Therefore, valid comparisons of the microarray data require standardized platforms, internal and/or external controls and complicated normalizations. These requirements impose limitations on the extensive comparison of gene expression data. Here, we report an effective approach to removing the unfavorable limitations by measuring the absolute amounts of gene expression levels on common DNA microarrays. We have developed a multiplex cDNA quantification method called GEP-DEAN (Gene expression profiling by DCN-encoding-based analysis). The method was validated by using chemically synthesized DNA strands of known quantities and cDNA samples prepared from mouse liver, demonstrating that the absolute amounts of cDNA strands were successfully measured with a sensitivity of 18 zmol in a highly multiplexed manner in 7 h. PMID:21415008
Spot detection and image segmentation in DNA microarray data.
Qin, Li; Rueda, Luis; Ali, Adnan; Ngom, Alioune
2005-01-01
Following the invention of microarrays in 1994, the development and applications of this technology have grown exponentially. The numerous applications of microarray technology include clinical diagnosis and treatment, drug design and discovery, tumour detection, and environmental health research. One of the key issues in the experimental approaches utilising microarrays is to extract quantitative information from the spots, which represent genes in a given experiment. For this process, the initial stages are important and they influence future steps in the analysis. Identifying the spots and separating the background from the foreground is a fundamental problem in DNA microarray data analysis. In this review, we present an overview of state-of-the-art methods for microarray image segmentation. We discuss the foundations of the circle-shaped approach, adaptive shape segmentation, histogram-based methods and the recently introduced clustering-based techniques. We analytically show that clustering-based techniques are equivalent to the one-dimensional, standard k-means clustering algorithm that utilises the Euclidean distance.
Grubaugh, Nathan D.; Petz, Lawrence N.; Melanson, Vanessa R.; McMenamy, Scott S.; Turell, Michael J.; Long, Lewis S.; Pisarcik, Sarah E.; Kengluecha, Ampornpan; Jaichapor, Boonsong; O'Guinn, Monica L.; Lee, John S.
2013-01-01
Highly multiplexed assays, such as microarrays, can benefit arbovirus surveillance by allowing researchers to screen for hundreds of targets at once. We evaluated amplification strategies and the practicality of a portable DNA microarray platform to analyze virus-infected mosquitoes. The prototype microarray design used here targeted the non-structural protein 5, ribosomal RNA, and cytochrome b genes for the detection of flaviviruses, mosquitoes, and bloodmeals, respectively. We identified 13 of 14 flaviviruses from virus inoculated mosquitoes and cultured cells. Additionally, we differentiated between four mosquito genera and eight whole blood samples. The microarray platform was field evaluated in Thailand and successfully identified flaviviruses (Culex flavivirus, dengue-3, and Japanese encephalitis viruses), differentiated between mosquito genera (Aedes, Armigeres, Culex, and Mansonia), and detected mammalian bloodmeals (human and dog). We showed that the microarray platform and amplification strategies described here can be used to discern specific information on a wide variety of viruses and their vectors. PMID:23249687
Guo, Qingsheng; Bai, Zhixiong; Liu, Yuqian; Sun, Qingjiang
2016-03-15
In this work, we report the application of streptavidin-coated quantum dot (strAV-QD) in molecular beacon (MB) microarray assays by using the strAV-QD to label the immobilized MB, avoiding target labeling and meanwhile obviating the use of amplification. The MBs are stem-loop structured oligodeoxynucleotides, modified with a thiol and a biotin at two terminals of the stem. With the strAV-QD labeling an "opened" MB rather than a "closed" MB via streptavidin-biotin reaction, a sensitive and specific detection of label-free target DNA sequence is demonstrated by the MB microarray, with a signal-to-background ratio of 8. The immobilized MBs can be perfectly regenerated, allowing the reuse of the microarray. The MB microarray also is able to detect single nucleotide polymorphisms, exhibiting genotype-dependent fluorescence signals. It is demonstrated that the MB microarray can perform as a 4-to-2 encoder, compressing the genotype information into two outputs. Copyright © 2015 Elsevier B.V. All rights reserved.
MIGS-GPU: Microarray Image Gridding and Segmentation on the GPU.
Katsigiannis, Stamos; Zacharia, Eleni; Maroulis, Dimitris
2017-05-01
Complementary DNA (cDNA) microarray is a powerful tool for simultaneously studying the expression level of thousands of genes. Nevertheless, the analysis of microarray images remains an arduous and challenging task due to the poor quality of the images that often suffer from noise, artifacts, and uneven background. In this study, the MIGS-GPU [Microarray Image Gridding and Segmentation on Graphics Processing Unit (GPU)] software for gridding and segmenting microarray images is presented. MIGS-GPU's computations are performed on the GPU by means of the compute unified device architecture (CUDA) in order to achieve fast performance and increase the utilization of available system resources. Evaluation on both real and synthetic cDNA microarray images showed that MIGS-GPU provides better performance than state-of-the-art alternatives, while the proposed GPU implementation achieves significantly lower computational times compared to the respective CPU approaches. Consequently, MIGS-GPU can be an advantageous and useful tool for biomedical laboratories, offering a user-friendly interface that requires minimum input in order to run.
2012-01-01
Over the last decade, the introduction of microarray technology has had a profound impact on gene expression research. The publication of studies with dissimilar or altogether contradictory results, obtained using different microarray platforms to analyze identical RNA samples, has raised concerns about the reliability of this technology. The MicroArray Quality Control (MAQC) project was initiated to address these concerns, as well as other performance and data analysis issues. Expression data on four titration pools from two distinct reference RNA samples were generated at multiple test sites using a variety of microarray-based and alternative technology platforms. Here we describe the experimental design and probe mapping efforts behind the MAQC project. We show intraplatform consistency across test sites as well as a high level of interplatform concordance in terms of genes identified as differentially expressed. This study provides a resource that represents an important first step toward establishing a framework for the use of microarrays in clinical and regulatory settings. PMID:16964229
Parthasarathy, N; Saksena, R; Kováč, P; Deshazer, D; Peacock, S J; Wuthiekanun, V; Heine, H S; Friedlander, A M; Cote, C K; Welkos, S L; Adamovicz, J J; Bavari, S; Waag, D M
2008-11-03
We developed a microarray platform by immobilizing bacterial 'signature' carbohydrates onto epoxide modified glass slides. The carbohydrate microarray platform was probed with sera from non-melioidosis and melioidosis (Burkholderia pseudomallei) individuals. The platform was also probed with sera from rabbits vaccinated with Bacillus anthracis spores and Francisella tularensis bacteria. By employing this microarray platform, we were able to detect and differentiate B. pseudomallei, B. anthracis and F. tularensis antibodies in infected patients, and infected or vaccinated animals. These antibodies were absent in the sera of naïve test subjects. The advantages of the carbohydrate microarray technology over the traditional indirect hemagglutination and microagglutination tests for the serodiagnosis of melioidosis and tularemia are discussed. Furthermore, this array is a multiplex carbohydrate microarray for the detection of all three biothreat bacterial infections including melioidosis, anthrax and tularemia with one, multivalent device. The implication is that this technology could be expanded to include a wide array of infectious and biothreat agents.
Women's experiences receiving abnormal prenatal chromosomal microarray testing results.
Bernhardt, Barbara A; Soucier, Danielle; Hanson, Karen; Savage, Melissa S; Jackson, Laird; Wapner, Ronald J
2013-02-01
Genomic microarrays can detect copy-number variants not detectable by conventional cytogenetics. This technology is diffusing rapidly into prenatal settings even though the clinical implications of many copy-number variants are currently unknown. We conducted a qualitative pilot study to explore the experiences of women receiving abnormal results from prenatal microarray testing performed in a research setting. Participants were a subset of women participating in a multicenter prospective study "Prenatal Cytogenetic Diagnosis by Array-based Copy Number Analysis." Telephone interviews were conducted with 23 women receiving abnormal prenatal microarray results. We found that five key elements dominated the experiences of women who had received abnormal prenatal microarray results: an offer too good to pass up, blindsided by the results, uncertainty and unquantifiable risks, need for support, and toxic knowledge. As prenatal microarray testing is increasingly used, uncertain findings will be common, resulting in greater need for careful pre- and posttest counseling, and more education of and resources for providers so they can adequately support the women who are undergoing testing.
Separate-channel analysis of two-channel microarrays: recovering inter-spot information.
Smyth, Gordon K; Altman, Naomi S
2013-05-26
Two-channel (or two-color) microarrays are cost-effective platforms for comparative analysis of gene expression. They are traditionally analysed in terms of the log-ratios (M-values) of the two channel intensities at each spot, but this analysis does not use all the information available in the separate channel observations. Mixed models have been proposed to analyse intensities from the two channels as separate observations, but such models can be complex to use and the gain in efficiency over the log-ratio analysis is difficult to quantify. Mixed models yield test statistics for the null distributions can be specified only approximately, and some approaches do not borrow strength between genes. This article reformulates the mixed model to clarify the relationship with the traditional log-ratio analysis, to facilitate information borrowing between genes, and to obtain an exact distributional theory for the resulting test statistics. The mixed model is transformed to operate on the M-values and A-values (average log-expression for each spot) instead of on the log-expression values. The log-ratio analysis is shown to ignore information contained in the A-values. The relative efficiency of the log-ratio analysis is shown to depend on the size of the intraspot correlation. A new separate channel analysis method is proposed that assumes a constant intra-spot correlation coefficient across all genes. This approach permits the mixed model to be transformed into an ordinary linear model, allowing the data analysis to use a well-understood empirical Bayes analysis pipeline for linear modeling of microarray data. This yields statistically powerful test statistics that have an exact distributional theory. The log-ratio, mixed model and common correlation methods are compared using three case studies. The results show that separate channel analyses that borrow strength between genes are more powerful than log-ratio analyses. The common correlation analysis is the most powerful of all. The common correlation method proposed in this article for separate-channel analysis of two-channel microarray data is no more difficult to apply in practice than the traditional log-ratio analysis. It provides an intuitive and powerful means to conduct analyses and make comparisons that might otherwise not be possible.
The Effects of Royal Jelly on Fitness Traits and Gene Expression in Drosophila melanogaster
Shorter, John R.; Geisz, Matthew; Özsoy, Ergi; Magwire, Michael M.; Carbone, Mary Anna; Mackay, Trudy F. C.
2015-01-01
Royal Jelly (RJ) is a product made by honey bee workers and is required for queen differentiation and accompanying changes in queen body size, development time, lifespan and reproductive output relative to workers. Previous studies have reported similar changes in Drosophila melanogaster in response to RJ. Here, we quantified viability, development time, body size, productivity, lifespan and genome wide transcript abundance of D. melanogaster reared on standard culture medium supplemented with increasing concentrations of RJ. We found that lower concentrations of RJ do induce significant differences in body size in both sexes; higher concentrations reduce size, increase mortality, shorten lifespan and reduce productivity. Increased concentrations of RJ also consistently lengthened development time in both sexes. RJ is associated with changes in expression of 1,581 probe sets assessed using Affymetrix Drosophila 2.0 microarrays, which were enriched for genes associated with metabolism and amino acid degradation. The transcriptional changes are consistent with alterations in cellular processes to cope with excess nutrients provided by RJ, including biosynthesis and detoxification, which might contribute to accelerated senescence and reduced lifespan. PMID:26226016
Abou Assi, Hala; Gómez-Pinto, Irene; González, Carlos
2017-01-01
Abstract In situ fabricated nucleic acids microarrays are versatile and very high-throughput platforms for aptamer optimization and discovery, but the chemical space that can be probed against a given target has largely been confined to DNA, while RNA and non-natural nucleic acid microarrays are still an essentially uncharted territory. 2΄-Fluoroarabinonucleic acid (2΄F-ANA) is a prime candidate for such use in microarrays. Indeed, 2΄F-ANA chemistry is readily amenable to photolithographic microarray synthesis and its potential in high affinity aptamers has been recently discovered. We thus synthesized the first microarrays containing 2΄F-ANA and 2΄F-ANA/DNA chimeric sequences to fully map the binding affinity landscape of the TBA1 thrombin-binding G-quadruplex aptamer containing all 32 768 possible DNA-to-2΄F-ANA mutations. The resulting microarray was screened against thrombin to identify a series of promising 2΄F-ANA-modified aptamer candidates with Kds significantly lower than that of the unmodified control and which were found to adopt highly stable, antiparallel-folded G-quadruplex structures. The solution structure of the TBA1 aptamer modified with 2΄F-ANA at position T3 shows that fluorine substitution preorganizes the dinucleotide loop into the proper conformation for interaction with thrombin. Overall, our work strengthens the potential of 2΄F-ANA in aptamer research and further expands non-genomic applications of nucleic acids microarrays. PMID:28100695
Geue, Lutz; Stieber, Bettina; Monecke, Stefan; Engelmann, Ines; Gunzer, Florian; Slickers, Peter; Braun, Sascha D; Ehricht, Ralf
2014-08-01
In this study, we developed a new rapid, economic, and automated microarray-based genotyping test for the standardized subtyping of Shiga toxins 1 and 2 of Escherichia coli. The microarrays from Alere Technologies can be used in two different formats, the ArrayTube and the ArrayStrip (which enables high-throughput testing in a 96-well format). One microarray chip harbors all the gene sequences necessary to distinguish between all Stx subtypes, facilitating the identification of single and multiple subtypes within a single isolate in one experiment. Specific software was developed to automatically analyze all data obtained from the microarray. The assay was validated with 21 Shiga toxin-producing E. coli (STEC) reference strains that were previously tested by the complete set of conventional subtyping PCRs. The microarray results showed 100% concordance with the PCR results. Essentially identical results were detected when the standard DNA extraction method was replaced by a time-saving heat lysis protocol. For further validation of the microarray, we identified the Stx subtypes or combinations of the subtypes in 446 STEC field isolates of human and animal origin. In summary, this oligonucleotide array represents an excellent diagnostic tool that provides some advantages over standard PCR-based subtyping. The number of the spotted probes on the microarrays can be increased by additional probes, such as for novel alleles, species markers, or resistance genes, should the need arise. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Classical fragile-X phenotype in a female infant disclosed by comprehensive genomic studies.
Jorge, Paula; Garcia, Elsa; Gonçalves, Ana; Marques, Isabel; Maia, Nuno; Rodrigues, Bárbara; Santos, Helena; Fonseca, Jacinta; Soares, Gabriela; Correia, Cecília; Reis-Lima, Margarida; Cirigliano, Vincenzo; Santos, Rosário
2018-05-10
We describe a female infant with Fragile-X syndrome, with a fully expanded FMR1 allele and preferential inactivation of the homologous X-chromosome carrying a de novo deletion. This unusual and rare case demonstrates the importance of a detailed genomic approach, the absence of which could be misguiding, and calls for reflection on the current clinical and diagnostic workup for developmental disabilities. We present a female infant, referred for genetic testing due to psychomotor developmental delay without specific dysmorphic features or relevant family history. FMR1 mutation screening revealed a methylated full mutation and a normal but inactive FMR1 allele, which led to further investigation. Complete skewing of X-chromosome inactivation towards the paternally-inherited normal-sized FMR1 allele was found. No pathogenic variants were identified in the XIST promoter. Microarray analysis revealed a 439 kb deletion at Xq28, in a region known to be associated with extreme skewing of X-chromosome inactivation. Overall results enable us to conclude that the developmental delay is the cumulative result of a methylated FMR1 full mutation on the active X-chromosome and the inactivation of the other homologue carrying the de novo 439 kb deletion. Our findings should be taken into consideration in future guidelines for the diagnostic workup on the diagnosis of intellectual disabilities, particularly in female infant cases.
Agboola, Ayodeji O. J.; Banjo, Adekumbiola A. F.; Anunobi, Charles C.; Salami, Babatunde; Agboola, Mopelola Deji; Musa, Adewale A.; Nolan, Christopher C.; Rakha, Emad A.; Ellis, Ian O.; Green, Andrew R.
2013-01-01
Background. Black women with breast cancer (BC) in Nigeria have higher mortality rate compared with British women. This study investigated prognostic features of cell proliferation biomarker (Ki-67) in Nigerian breast cancer women. Materials and Methods. The protein expression of Ki-67 was investigated in series of 308 Nigerian women, prepared as a tissue microarray (TMA), using immunohistochemistry. Clinic-pathological parameters, biomarkers, and patient outcome of tumours expressing Ki-67 in Nigerian women were correlated with UK grade-matched series. Results. A significantly larger proportion of breast tumours from Nigerian women showed high Ki-67 expression. Those tumours were significantly correlated with negative expression of the steroid hormone receptors (ER and PgR), p21, p27, E-cadherin, BRCA-1, and Bcl-2 (all P < 0.001), but positively associated with EGFR (P = 0.003), p53, basal cytokeratins: CK56, CK14, triple negative, and basal phenotype using Nielsen's classification (all P < 0.001) compared to UK women. Multivariate analyses showed that race was also associated with BCSS independent of tumour size, lymph node status, and ER status. Conclusion. Ki-67 expression was observed to have contributed to the difference in the BCSS in Nigerian compared with British BC women. Therefore, targeting Ki-67 in the indigenous black women with BC might improve the patient outcome in the black women with BC. PMID:23691362
NASA Astrophysics Data System (ADS)
Um, Taewoong; Hong, Jiwoo; Kang, In Seok
2016-11-01
The dispensing of tiny droplets is a basic and crucial process in a myriad of applications, such as DNA/protein microarray, cell cultures, chemical synthesis of microparticles, and digital microfluidics. This work demonstrates the droplet dispensing into immiscible fluids through electric charge concentration (ECC) method. Three main modes (i.e., attaching, uniform and bursting modes) are exhibited as a function of flow rates, applied voltage and gap distance between the nozzle and the oil surface. Through a conventional nozzle with diameter of a few millimeters, charged droplets with volumes ranging from a few μL to a few tens of nL can be uniformly dispensed into the oil chamber without reduction in nozzle size. Based on the features of the proposed method (e.g., formation of droplets with controllable polarity and amount of electric charge in water and oil system), a simple and straightforward method is developed for microparticle synthesis, including preparation for colloidosomes and fabrication of Janus microparticles with anisotropic internal structures. Finally, a combined system consisting of ECC-induced droplet dispensing and electrophoresis of charged droplet (ECD)-driven manipulation systems is constructed. This work was supported by the BK21Plus Program for advanced education of creative chemical engineers of the National Research Foundation of Korea (NRF) Grant funded by the Korea government (MSIP).
Aleksiūnienė, Beata; Preiksaitiene, Egle; Morkūnienė, Aušra; Ambrozaitytė, Laima; Utkus, Algirdas
2018-01-01
Many studies have shown that molecular karyotyping is an effective diagnostic tool in individuals with developmental delay/intellectual disability. We report on a de novo interstitial 1q22q23.1 microdeletion, 1.6 Mb in size, detected in a patient with short stature, microcephaly, hypoplastic corpus callosum, cleft palate, minor facial anomalies, congenital heart defect, camptodactyly of the 4-5th fingers, and intellectual disability. Chromosomal microarray analysis revealed a 1.6-Mb deletion in the 1q22q23.1 region, arr[GRCh37] 1q22q23.1(155630752_157193893)×1. Real-time PCR analysis confirmed its de novo origin. The deleted region encompasses 50 protein-coding genes, including the morbid genes APOA1BP, ARHGEF2, LAMTOR2, LMNA, NTRK1, PRCC, RIT1, SEMA4A, and YY1AP1. Although the unique phenotype observed in our patient can arise from the haploinsufficiency of the dosage-sensitive LMNA gene, the dosage imbalance of other genes implicated in the rearrangement could also contribute to the phenotype. Further studies are required for the delineation of the phenotype associated with this rare chromosomal alteration and elucidation of the critical genes for manifestation of the specific clinical features. © 2018 S. Karger AG, Basel.
MRG15 Regulates Embryonic Development and Cell Proliferation
Tominaga, Kaoru; Kirtane, Bhakti; Jackson, James G.; Ikeno, Yuji; Ikeda, Takayoshi; Hawks, Christina; Smith, James R.; Matzuk, Martin M.; Pereira-Smith, Olivia M.
2005-01-01
MRG15 is a highly conserved protein, and orthologs exist in organisms from yeast to humans. MRG15 associates with at least two nucleoprotein complexes that include histone acetyltransferases and/or histone deacetylases, suggesting it is involved in chromatin remodeling. To study the role of MRG15 in vivo, we generated knockout mice and determined that the phenotype is embryonic lethal, with embryos and the few stillborn pups exhibiting developmental delay. Immunohistochemical analysis indicates that apoptosis in Mrg15−/− embryos is not increased compared with wild-type littermates. However, the number of proliferating cells is significantly reduced in various tissues of the smaller null embryos compared with control littermates. Cell proliferation defects are also observed in Mrg15−/− mouse embryonic fibroblasts. The hearts of the Mrg15−/− embryos exhibit some features of hypertrophic cardiomyopathy. The increase in size of the cardiomyocytes is most likely a response to decreased growth of the cells. Mrg15−/− embryos appeared pale, and microarray analysis revealed that α-globin gene expression was decreased in null versus wild-type embryos. We determined by chromatin immunoprecipitation that MRG15 was recruited to the α-globin promoter during dimethyl sulfoxide-induced mouse erythroleukemia cell differentiation. These findings demonstrate that MRG15 has an essential role in embryonic development via chromatin remodeling and transcriptional regulation. PMID:15798182
A fuzzy neural network for intelligent data processing
NASA Astrophysics Data System (ADS)
Xie, Wei; Chu, Feng; Wang, Lipo; Lim, Eng Thiam
2005-03-01
In this paper, we describe an incrementally generated fuzzy neural network (FNN) for intelligent data processing. This FNN combines the features of initial fuzzy model self-generation, fast input selection, partition validation, parameter optimization and rule-base simplification. A small FNN is created from scratch -- there is no need to specify the initial network architecture, initial membership functions, or initial weights. Fuzzy IF-THEN rules are constantly combined and pruned to minimize the size of the network while maintaining accuracy; irrelevant inputs are detected and deleted, and membership functions and network weights are trained with a gradient descent algorithm, i.e., error backpropagation. Experimental studies on synthesized data sets demonstrate that the proposed Fuzzy Neural Network is able to achieve accuracy comparable to or higher than both a feedforward crisp neural network, i.e., NeuroRule, and a decision tree, i.e., C4.5, with more compact rule bases for most of the data sets used in our experiments. The FNN has achieved outstanding results for cancer classification based on microarray data. The excellent classification result for Small Round Blue Cell Tumors (SRBCTs) data set is shown. Compared with other published methods, we have used a much fewer number of genes for perfect classification, which will help researchers directly focus their attention on some specific genes and may lead to discovery of deep reasons of the development of cancers and discovery of drugs.
Yuan, Haiming; Meng, Zhe; Zhang, Lina; Luo, Xiangyang; Liu, Liping; Chen, Mengfan; Li, Xinwei; Zhao, Weiwei; Liang, Liyang
2016-01-01
Interstitial duplications distal to 15q13 are very rare. Here, we reported a 14-year-old boy with severe short stature, delayed bone age, hypogonadism, global developmental delay and intellectual disability. His had distinctive facial features including macrocephaly, broad forehead, deep-set and widely spaced eyes, broad nose bridge, shallow philtrum and thick lips. A de novo 6.4 Mb interstitial duplication of 15q15.3q21.2 was detected by chromosomal microarray analysis. We compared our patient's clinical phenotypes with those of several individuals with overlapping duplications and several candidate genes responsible for the phenotypes were identified as well. The results suggest a novel contiguous gene duplication syndrome characterized with shared features including short stature, hypogonadism, global developmental delay and other congenital anomalies.
Over the last decade, the introduction of microarray technology has had a profound impact on gene expression research. The publication of studies with dissimilar or altogether contradictory results, obtained using different microarray platforms to analyze identical RNA samples, ...
Validation of a TFE3 break-apart FISH assay for Xp11.2 translocation renal cell carcinomas.
Mosquera, Juan-Miguel; Dal Cin, Paola; Mertz, Kirsten D; Perner, Sven; Davis, Ian J; Fisher, David E; Rubin, Mark A; Hirsch, Michelle S
2011-09-01
Renal cell carcinomas (RCCs) with an Xp11.2 translocation predominantly affect young patients, and can present at an advanced stage. However, more cases in older patients and incidentally detected cancers at earlier stages are also being identified. As the histology of Xp11.2 RCCs overlaps with clear cell and papillary RCCs, it is not infrequent that Xp11.2 RCCs are overlooked and misdiagnosed. The objective of this study was to validate the use of fluorescence in-situ hybridization (FISH) for identifying Xp11.2 RCCs. One hundred fifty-eight consecutive, unselected renal tumors were evaluated in tissue microarrays, including 109 clear cell RCCs, 20 papillary RCCs, 3 RCCs with mixed papillary and clear cell features, 1 Xp11.2 translocation RCC, 8 chromophobe RCCs, 10 oncocytomas, and 7 angiomyolipomas. FISH evaluation was performed blinded to karyotype data, available in about two-thirds of cases. Furthermore, conventional sections of 4 Xp11.2 RCCs, 4 RCCs with mixed papillary and clear cell features, and 4 cases of alveolar soft part sarcoma (the latter for control purposes) were also assessed by FISH. Break-apart signals were homogeneously identified throughout tumor cells in 2 cases from the tissue microarrays including 1 known Xp11.2 RCC and 1 misdiagnosed Xp11.2 RCC. All conventional sections from the Xp11.2 RCC and alveolar soft part sarcoma cases were positive for the TFE3 rearrangement by FISH. All remaining cases were negative. Our study shows the clinical application of FISH in formalin-fixed, paraffin-embedded tissue for detection of Xp11.2 translocation RCCs and other tumors with this genetic aberration.
Foda, Abd Al-Rahman Mohammad; Mohammad, Mie Ali; Abdel-Aziz, Azza; El-Hawary, Amira Kamal
2015-06-01
Glypican-3 (GPC3) is a member of the membrane-bound heparin sulfate proteoglycans. E-cadherin is an adhesive receptor that is believed to act as a tumor suppressor gene. Many studies had investigated E-cadherin expressions in colorectal carcinoma (CRC) while only one study had investigated GPC3 expression in CRC. This study aims to investigate expression of GCP3 and E-cadherin in colorectal mucinous carcinoma (MA) and non-mucinous adenocarcinoma (NMA) using manual tissue microarray technique. Tumor tissue specimens are collected from 75 cases of MC and 75 cases of NMA who underwent radical surgery from Jan 2007 to Jan 2012 at the Gastroenterology Centre, Mansoura University, Egypt. Their clinicopathological parameters and survival data were revised and analyzed using established statistical methodologies. High-density manual tissue microarrays were constructed using modified mechanical pencil tip technique and immunohistochemistry for GPC3 and E-cadherin was done. NMA showed higher expression of GPC3 than MA with no statistically significant relation. NMA showed a significantly higher E-cadherin expression than MA. GPC3 and E-cadherin positivity rates were significantly interrelated in NMA, but not in MA, group. In NMA group, there was no significant relation between either GPC3 or E-cadherin expression and the clinicopathological features. In a univariate analysis, neither GPC3 nor E-cadherin expression showed a significant impact on disease-free survival (DFS) or overall survival (OS). GPC3 and E-cadherin expressions are not independent prognostic factors in CRC. However, expressions of both are significantly interrelated in NMA patients, suggesting an excellent interplay between both, in contrast to MA. Further molecular studies are needed to further explore the relationship between GCP3 and E-cadherin in colorectal carcinogenesis.
Experimental study on pore structure and performance of sintered porous wick
NASA Astrophysics Data System (ADS)
He, Da; Wang, Shufan; Liu, Rutie; Wang, Zhubo; Xiong, Xiang; Zou, Jianpeng
2018-02-01
Porous wicks were prepared via powder metallurgy using NH4HCO3 powders as pore-forming agent. The pore-forming agent particle size was varied to control the pore structure and equivalent pore size distribution feature of porous wick. The effect of pore-forming agent particle size on the porosity, pore structures, equivalent pore size distribution and capillary pumping performance were investigated. Results show that with the particle size of pore-forming agent decrease, the green density and the volume shrinkage of the porous wicks gradually increase and the porosity reduces slightly. There are two types of pores inside the porous wick, large-sized prefabricated pores and small-sized gap pores. With the particle size of pore-forming agent decrease, the size of the prefabricated pores becomes smaller and the distribution tends to be uniform. Gap pores and prefabricated pores inside the wick can make up different types of pore channels. The equivalent pore size of wick is closely related to the structure of pore channels. Furthermore, the equivalent pore size distribution of wick shows an obvious double-peak feature when the pore-forming agent particle size is large. With the particle size of pore-forming agent decrease, the two peaks of equivalent pore size distribution approach gradually to each other, resulting in a single-peak feature. Porous wick with single-peak feature equivalent pore size distribution possesses the better capillary pumping performances.
USDA-ARS?s Scientific Manuscript database
The development of a fluorescent multiplexed microarray platform able to detect and quantify a wide variety of pollutants in seawater is reported. The microarray platform has been manufactured by spotting 6 different bioconjugate competitors and it uses a cocktail of 6 monoclonal and polyclonal anti...
Microarray technology is a powerful tool to investigate the gene expression profiles for thousands of genes simultaneously. In recent years, microarrays have been used to characterize environmental pollutants and identify molecular mode(s) of action of chemicals including endocri...
USDA-ARS?s Scientific Manuscript database
The amount of microarray gene expression data in public repositories has been increasing exponentially for the last couple of decades. High-throughput microarray data integration and analysis has become a critical step in exploring the large amount of expression data for biological discovery. Howeve...
Microarrays Made Simple: "DNA Chips" Paper Activity
ERIC Educational Resources Information Center
Barnard, Betsy
2006-01-01
DNA microarray technology is revolutionizing biological science. DNA microarrays (also called DNA chips) allow simultaneous screening of many genes for changes in expression between different cells. Now researchers can obtain information about genes in days or weeks that used to take months or years. The paper activity described in this article…
Over the last decade, the introduction of microarray technology has had a profound impact on gene expression research. The publication of studies with dissimilar or altogether contradictory results, obtained using different microarray platforms to analyze identical RNA samples, h...
Karampetsou, Evangelia; Morrogh, Deborah; Chitty, Lyn
2014-01-01
The advantage of microarray (array) over conventional karyotype for the diagnosis of fetal pathogenic chromosomal anomalies has prompted the use of microarrays in prenatal diagnostics. In this review we compare the performance of different array platforms (BAC, oligonucleotide CGH, SNP) and designs (targeted, whole genome, whole genome, and targeted, custom) and discuss their advantages and disadvantages in relation to prenatal testing. We also discuss the factors to consider when implementing a microarray testing service for the diagnosis of fetal chromosomal aberrations. PMID:26237396
A Perspective on DNA Microarrays in Pathology Research and Practice
Pollack, Jonathan R.
2007-01-01
DNA microarray technology matured in the mid-1990s, and the past decade has witnessed a tremendous growth in its application. DNA microarrays have provided powerful tools for pathology researchers seeking to describe, classify, and understand human disease. There has also been great expectation that the technology would advance the practice of pathology. This review highlights some of the key contributions of DNA microarrays to experimental pathology, focusing in the area of cancer research. Also discussed are some of the current challenges in translating utility to clinical practice. PMID:17600117
Bingle, Lynne; Fonseca, Felipe P; Farthing, Paula M
2017-01-01
Tissue microarrays were first constructed in the 1980s but were used by only a limited number of researchers for a considerable period of time. In the last 10 years there has been a dramatic increase in the number of publications describing the successful use of tissue microarrays in studies aimed at discovering and validating biomarkers. This, along with the increased availability of both manual and automated microarray builders on the market, has encouraged even greater use of this novel and powerful tool. This chapter describes the basic techniques required to build a tissue microarray using a manual method in order that the theory behind the practical steps can be fully explained. Guidance is given to ensure potential disadvantages of the technique are fully considered.
Identification of differentially expressed genes and false discovery rate in microarray studies.
Gusnanto, Arief; Calza, Stefano; Pawitan, Yudi
2007-04-01
To highlight the development in microarray data analysis for the identification of differentially expressed genes, particularly via control of false discovery rate. The emergence of high-throughput technology such as microarrays raises two fundamental statistical issues: multiplicity and sensitivity. We focus on the biological problem of identifying differentially expressed genes. First, multiplicity arises due to testing tens of thousands of hypotheses, rendering the standard P value meaningless. Second, known optimal single-test procedures such as the t-test perform poorly in the context of highly multiple tests. The standard approach of dealing with multiplicity is too conservative in the microarray context. The false discovery rate concept is fast becoming the key statistical assessment tool replacing the P value. We review the false discovery rate approach and argue that it is more sensible for microarray data. We also discuss some methods to take into account additional information from the microarrays to improve the false discovery rate. There is growing consensus on how to analyse microarray data using the false discovery rate framework in place of the classical P value. Further research is needed on the preprocessing of the raw data, such as the normalization step and filtering, and on finding the most sensitive test procedure.
2015-01-01
Biological assays formatted as microarrays have become a critical tool for the generation of the comprehensive data sets required for systems-level understanding of biological processes. Manual annotation of data extracted from images of microarrays, however, remains a significant bottleneck, particularly for protein microarrays due to the sensitivity of this technology to weak artifact signal. In order to automate the extraction and curation of data from protein microarrays, we describe an algorithm called Crossword that logically combines information from multiple approaches to fully automate microarray segmentation. Automated artifact removal is also accomplished by segregating structured pixels from the background noise using iterative clustering and pixel connectivity. Correlation of the location of structured pixels across image channels is used to identify and remove artifact pixels from the image prior to data extraction. This component improves the accuracy of data sets while reducing the requirement for time-consuming visual inspection of the data. Crossword enables a fully automated protocol that is robust to significant spatial and intensity aberrations. Overall, the average amount of user intervention is reduced by an order of magnitude and the data quality is increased through artifact removal and reduced user variability. The increase in throughput should aid the further implementation of microarray technologies in clinical studies. PMID:24417579
The detection and differentiation of canine respiratory pathogens using oligonucleotide microarrays.
Wang, Lih-Chiann; Kuo, Ya-Ting; Chueh, Ling-Ling; Huang, Dean; Lin, Jiunn-Horng
2017-05-01
Canine respiratory diseases are commonly seen in dogs along with co-infections with multiple respiratory pathogens, including viruses and bacteria. Virus infections in even vaccinated dogs were also reported. The clinical signs caused by different respiratory etiological agents are similar, which makes differential diagnosis imperative. An oligonucleotide microarray system was developed in this study. The wild type and vaccine strains of canine distemper virus (CDV), influenza virus, canine herpesvirus (CHV), Bordetella bronchiseptica and Mycoplasma cynos were detected and differentiated simultaneously on a microarray chip. The detection limit is 10, 10, 100, 50 and 50 copy numbers for CDV, influenza virus, CHV, B. bronchiseptica and M. cynos, respectively. The clinical test results of nasal swab samples showed that the microarray had remarkably better efficacy than the multiplex PCR-agarose gel method. The positive detection rate of microarray and agarose gel was 59.0% (n=33) and 41.1% (n=23) among the 56 samples, respectively. CDV vaccine strain and pathogen co-infections were further demonstrated by the microarray but not by the multiplex PCR-agarose gel. The oligonucleotide microarray provides a highly efficient diagnosis alternative that could be applied to clinical usage, greatly assisting in disease therapy and control. Copyright © 2017 Elsevier B.V. All rights reserved.
Gene selection for microarray data classification via subspace learning and manifold regularization.
Tang, Chang; Cao, Lijuan; Zheng, Xiao; Wang, Minhui
2017-12-19
With the rapid development of DNA microarray technology, large amount of genomic data has been generated. Classification of these microarray data is a challenge task since gene expression data are often with thousands of genes but a small number of samples. In this paper, an effective gene selection method is proposed to select the best subset of genes for microarray data with the irrelevant and redundant genes removed. Compared with original data, the selected gene subset can benefit the classification task. We formulate the gene selection task as a manifold regularized subspace learning problem. In detail, a projection matrix is used to project the original high dimensional microarray data into a lower dimensional subspace, with the constraint that the original genes can be well represented by the selected genes. Meanwhile, the local manifold structure of original data is preserved by a Laplacian graph regularization term on the low-dimensional data space. The projection matrix can serve as an importance indicator of different genes. An iterative update algorithm is developed for solving the problem. Experimental results on six publicly available microarray datasets and one clinical dataset demonstrate that the proposed method performs better when compared with other state-of-the-art methods in terms of microarray data classification. Graphical Abstract The graphical abstract of this work.
Schell-Apacik, Chayim; Hardt, Michael; Ertl-Wagner, Birgit; Klopocki, Eva; Möhrenschlager, Matthias; Heinrich, Uwe; von Voss, Hubertus
2008-09-01
Alopecia-contractures-dwarfism mental retardation syndrome (ACD syndrome; OMIM 203550) is a very rare genetic disorder with distinct features. To our knowledge, there have been four cases documented to date. In addition, another three patients, previously described as having IFAP syndrome (OMIM %308205), may also have ACD syndrome. We report on one patient with short stature, total alopecia, ichthyosis, photophobia, seizures, ectrodactyly, vertebral anomalies, scoliosis, multiple contractures, mental retardation, and striking facial and other features (e.g. microdolichocephaly, missing eyebrows and eyelashes, long nose, large ears) consistent with ACD syndrome. Results of laboratory testing in the literature case reports were normal, although in none of them, array-CGH (microarray-based comparative genomic hybridization) analysis was performed. In conclusion, the combination of specific features, including total alopecia, ichthyosis, mental retardation, and skeletal anomalies are suggestive of ACD syndrome. We propose that children with this syndrome undergo a certain social pediatric protocol including EEG diagnostics, ophthalmological investigation, psychological testing, management of dermatologic and orthopedic problems, and genetic counseling.
Sterlacci, William; Tzankov, Alexandar; Veits, Lothar; Oberaigner, Wilhelm; Schmid, Thomas; Hilbe, Wolfgang; Fiegl, Michael
2011-04-01
The increasing incidence of lung cancer in women and their supposed survival advantage over men requires clarification of the significance of sex. Age, stage, histologic features, differentiation grade, and Ki-67 index were assessed in 405 surgically resected non-small cell lung cancers (NSCLCs) using a standardized tissue microarray platform. Women were associated with well/moderate tumor differentiation, a Ki-67 index of 3% or less, and adenocarcinoma histologic features. Female sex predicted increased survival time only by univariate analysis. Stratified by sex, increased survival was noted for women older than 64 years, with a tumor at postsurgical International Union Against Cancer stage I, with adenocarcinoma histologic features, with well- or moderately differentiated tumors, or with a Ki-67 index of 3% or less. Sex is not an independent prognostic parameter for patients with surgically resected NSCLC. Sex-linked differences are associated with other factors, thus simulating a prognostic impact of sex. This study elucidates sex-specific interactions between patient and tumor characteristics, which are pivotal toward improving prognostic accuracy, individualized therapies, and screening efforts.
Pro-oncogene Pokemon promotes breast cancer progression by upregulating survivin expression.
Zu, Xuyu; Ma, Jun; Liu, Hongxia; Liu, Feng; Tan, Chunyan; Yu, Lingling; Wang, Jue; Xie, Zhenhua; Cao, Deliang; Jiang, Yuyang
2011-03-10
Pokemon is an oncogenic transcription factor involved in cell growth, differentiation and oncogenesis, but little is known about its role in human breast cancer. In this study, we aimed to reveal the role of Pokemon in breast cancer progression and patient survival and to understand its underlying mechanisms. Tissue microarray analysis of breast cancer tissues from patients with complete clinicopathological data and more than 20 years of follow-up were used to evaluate Pokemon expression and its correlation with the progression and prognosis of the disease. DNA microarray analysis of MCF-7 cells that overexpress Pokemon was used to identify Pokemon target genes. Chromatin immunoprecipitation (ChIP) and site-directed mutagenesis were utilized to determine how Pokemon regulates survivin expression, a target gene. Pokemon was found to be overexpressed in 158 (86.8%) of 182 breast cancer tissues, and its expression was correlated with tumor size (P = 0.0148) and lymph node metastasis (P = 0.0014). Pokemon expression led to worse overall (n = 175, P = 0.01) and disease-related (n = 79, P = 0.0134) patient survival. DNA microarray analyses revealed that in MCF-7 breast cancer cells, Pokemon regulates the expression of at least 121 genes involved in several signaling and metabolic pathways, including anti-apoptotic survivin. In clinical specimens, Pokemon and survivin expression were highly correlated (n = 49, r = 0.6799, P < 0.0001). ChIP and site-directed mutagenesis indicated that Pokemon induces survivin expression by binding to the GT boxes in its promoter. Pokemon promotes breast cancer progression by upregulating survivin expression and thus may be a potential target for the treatment of this malignancy.
Pro-oncogene Pokemon promotes breast cancer progression by upregulating survivin expression
2011-01-01
Introduction Pokemon is an oncogenic transcription factor involved in cell growth, differentiation and oncogenesis, but little is known about its role in human breast cancer. In this study, we aimed to reveal the role of Pokemon in breast cancer progression and patient survival and to understand its underlying mechanisms. Methods Tissue microarray analysis of breast cancer tissues from patients with complete clinicopathological data and more than 20 years of follow-up were used to evaluate Pokemon expression and its correlation with the progression and prognosis of the disease. DNA microarray analysis of MCF-7 cells that overexpress Pokemon was used to identify Pokemon target genes. Chromatin immunoprecipitation (ChIP) and site-directed mutagenesis were utilized to determine how Pokemon regulates survivin expression, a target gene. Results Pokemon was found to be overexpressed in 158 (86.8%) of 182 breast cancer tissues, and its expression was correlated with tumor size (P = 0.0148) and lymph node metastasis (P = 0.0014). Pokemon expression led to worse overall (n = 175, P = 0.01) and disease-related (n = 79, P = 0.0134) patient survival. DNA microarray analyses revealed that in MCF-7 breast cancer cells, Pokemon regulates the expression of at least 121 genes involved in several signaling and metabolic pathways, including anti-apoptotic survivin. In clinical specimens, Pokemon and survivin expression were highly correlated (n = 49, r = 0.6799, P < 0.0001). ChIP and site-directed mutagenesis indicated that Pokemon induces survivin expression by binding to the GT boxes in its promoter. Conclusions Pokemon promotes breast cancer progression by upregulating survivin expression and thus may be a potential target for the treatment of this malignancy. PMID:21392388
He, Wenyin; Sun, Xiaofang; Liu, Lian; Li, Man; Jin, Hua; Wang, Wei-Hua
2014-01-01
Chromosomal anomalies in human embryos produced by in vitro fertilization are very common, which include numerical (aneuploidy) and structural (deletion, duplication or others) anomalies. Our previous study indicated that chromosomal deletion(s) is the most common structural anomaly accounting for approximately 8% of euploid blastocysts. It is still unknown if these deletions in human euploid blastocysts have clinical significance. In this study, we analyzed 15 previously diagnosed euploid blastocysts that had chromosomal deletion(s) using Agilent oligonucleotide DNA microarray platform and localized the gene location in each deletion. Then, we used OMIM gene map and phenotype database to investigate if these deletions are related with some important genes that cause genetic diseases, especially developmental delay or intellectual disability. As results, we found that the detectable chromosomal deletion size with Agilent microarray is above 2.38 Mb, while the deletions observed in human blastocysts are between 11.6 to 103 Mb. With OMIM gene map and phenotype database information, we found that deletions can result in loss of 81-464 genes. Out of these genes, 34-149 genes are related with known genetic problems. Furthermore, we found that 5 out of 15 samples lost genes in the deleted region, which were related to developmental delay and/or intellectual disability. In conclusion, our data indicates that all human euploid blastocysts with chromosomal deletion(s) are abnormal and transfer of these embryos may cause birth defects and/or developmental and intellectual disabilities. Therefore, the embryos with chromosomal deletion revealed by DNA microarray should not be transferred to the patients, or further gene map and/or phenotype seeking is necessary before making a final decision.
Millson, Alison; Lagrave, Danielle; Willis, Mary J H; Rowe, Leslie R; Lyon, Elaine; South, Sarah T
2012-01-01
Neuroligin 1 (NLGN1) is one of five members of the neuroligin gene family and may represent a candidate gene for neurological disorders, as members of this family are involved in formation and remodeling of central nervous system synapses. NLGN1 is expressed predominantly in the central nervous system, where it dimerizes and then binds with β-neurexin to form a functional synapse. Mutations in neurexin 1 (NRXN1) as well as two other members of the neuroligin family, NLGN3 and NLGN4, have been associated with autism and mutations in NLGN4 have also been associated with intellectual disability, seizures, and EEG abnormalities. Genomic microarray is recommended for the detection of chromosomal gains or losses in patients with intellectual disability and multiple congenital anomalies. Results of uncertain significance are not uncommon. Parental studies can provide additional information by demonstrating that the imbalance is either de novo or inherited, and therefore is more or less likely to be causative of the clinical phenotype. However, the possibility that even inherited deletions and duplications may play a role in the phenotype of the proband cannot be excluded as many copy number variants associated with neurodevelopmental conditions show incomplete penetrance and may be inherited from an unaffected parent. Here, we report on a patient with a 2.2 Mb deletion at 3q26.3-3q26.32-encompassing the terminal end of NLGN1 and the entire NAALADL2 gene-detected by genomic microarray, and confirmed by FISH and real-time quantitative PCR. The same size deletion was subsequently found in her healthy, asymptomatic, adult mother. Copyright © 2011 Wiley Periodicals, Inc.
Characterization and simulation of cDNA microarray spots using a novel mathematical model
Kim, Hye Young; Lee, Seo Eun; Kim, Min Jung; Han, Jin Il; Kim, Bo Kyung; Lee, Yong Sung; Lee, Young Seek; Kim, Jin Hyuk
2007-01-01
Background The quality of cDNA microarray data is crucial for expanding its application to other research areas, such as the study of gene regulatory networks. Despite the fact that a number of algorithms have been suggested to increase the accuracy of microarray gene expression data, it is necessary to obtain reliable microarray images by improving wet-lab experiments. As the first step of a cDNA microarray experiment, spotting cDNA probes is critical to determining the quality of spot images. Results We developed a governing equation of cDNA deposition during evaporation of a drop in the microarray spotting process. The governing equation included four parameters: the surface site density on the support, the extrapolated equilibrium constant for the binding of cDNA molecules with surface sites on glass slides, the macromolecular interaction factor, and the volume constant of a drop of cDNA solution. We simulated cDNA deposition from the single model equation by varying the value of the parameters. The morphology of the resulting cDNA deposit can be classified into three types: a doughnut shape, a peak shape, and a volcano shape. The spot morphology can be changed into a flat shape by varying the experimental conditions while considering the parameters of the governing equation of cDNA deposition. The four parameters were estimated by fitting the governing equation to the real microarray images. With the results of the simulation and the parameter estimation, the phenomenon of the formation of cDNA deposits in each type was investigated. Conclusion This study explains how various spot shapes can exist and suggests which parameters are to be adjusted for obtaining a good spot. This system is able to explore the cDNA microarray spotting process in a predictable, manageable and descriptive manner. We hope it can provide a way to predict the incidents that can occur during a real cDNA microarray experiment, and produce useful data for several research applications involving cDNA microarrays. PMID:18096047
Venegas-Vega, Carlos A.; Zepeda, Luis M.; Garduño-Zarazúa, Luz M.; Berumen, Jaime; Kofman, Susana; Cervantes, Alicia
2013-01-01
The use of conventional cytogenetic techniques in combination with fluorescent in situ hybridization (FISH) and single-nucleotide polymorphism (SNP) microarrays is necessary for the identification of cryptic rearrangements in the diagnosis of chromosomal syndromes. We report two siblings, a boy of 9 years and 9 months of age and his 7-years- and 5-month-old sister, with the classic Wolf-Hirschhorn syndrome (WHS) phenotype. Using high-resolution GTG- and NOR-banding karyotypes, as well as FISH analysis, we characterized a pure 4p deletion in both sibs and a balanced rearrangement in their father, consisting in an insertion of 4p material within a nucleolar organizing region of chromosome 15. Copy number variant (CNV) analysis using SNP arrays showed that both siblings have a similar size of 4p deletion (~6.5 Mb). Our results strongly support the need for conventional cytogenetic and FISH analysis, as well as high-density microarray mapping for the optimal characterization of the genetic imbalance in patients with WHS; parents must always be studied for recognizing cryptic balanced chromosomal rearrangements for an adequate genetic counseling. PMID:23484094
NASA Astrophysics Data System (ADS)
Celen, Burcu; Demirel, Gökhan; Piskin, Erhan
2011-04-01
The rapid and sensitive detection of DNA has recently attracted worldwide attention for a variety of disease diagnoses and detection of harmful bacteria in food and drink. In this paper, we carried out a comparative study based on surface plasmon resonance enhanced ellipsometry (SPREE) for the detection of oligodeoxynucleotides (ODNs) using micro- and nano-array platforms. The micro-arrayed surfaces were fabricated by a photolithography approach using different types of mask having varying size and shape. Well-ordered arrays of high aspect ratio polymeric nanotubes were also obtained using high molecular weight polystyrene (PS) and anodic aluminum oxide (AAO) membranes having 200 nm pore diameters. The SPREE sensors were then prepared by direct coupling of thiolated probe-ODNs, which contain suitable spacer arms, on gold-coated micro- and nano-arrayed surfaces. We experimentally demonstrated that, for the first time, gold-coated free standing polymeric nano-arrayed platforms can easily be produced and lead to a significant sensor sensitivity gain compared to that of the conventional SPREE surfaces of about four times. We believe that such an enhancement in sensor response could be useful for next generation sensor systems.
Takagi, K; Ishida, T; Miki, Y; Hirakawa, H; Kakugawa, Y; Amano, G; Ebata, A; Mori, N; Nakamura, Y; Watanabe, M; Amari, M; Ohuchi, N; Sasano, H; Suzuki, T
2013-01-01
Background: Estrogens have important roles in ductal carcinoma in situ (DCIS) of the breast. However, the significance of presurgical aromatase inhibitor treatment remains unclear. Therefore, we examined intratumoral concentration of estrogens and changes of clinicopathological factors in DCIS after letrozole treatment. Methods: Ten cases of postmenopausal oestrogen receptor (ER)-positive DCIS were examined. They received oral letrozole before the surgery, and the tumour size was evaluated by ultrasonography. Surgical specimens and corresponding biopsy samples were used for immunohistochemistry. Snap-frozen specimens were also available in a subset of cases, and used for hormone assays and microarray analysis. Results: Intratumoral oestrogen levels were significantly lower in DCIS treated with letrozole compared with that in those without the therapy. A great majority of oestrogen-induced genes showed low expression levels in DCIS treated with letrozole by microarray analysis. Moreover, letrozole treatment reduced the greatest dimension of DCIS, and significantly decreased Ki-67 and progesterone receptor immunoreactivity in DCIS tissues. Conclusion: These results suggest that estrogens are mainly produced by aromatase in DCIS tissues, and aromatase inhibitors potently inhibit oestrogen actions in postmenopausal ER-positive DCIS through rapid deprivation of intratumoral estrogens. PMID:23756858
NASA Astrophysics Data System (ADS)
Reicher, Naama; Segev, Lior; Rudich, Yinon
2018-01-01
The WeIzmann Supercooled Droplets Observation on Microarray (WISDOM) is a new setup for studying ice nucleation in an array of monodisperse droplets for atmospheric implications. WISDOM combines microfluidics techniques for droplets production and a cryo-optic stage for observation and characterization of freezing events of individual droplets. This setup is designed to explore heterogeneous ice nucleation in the immersion freezing mode, down to the homogeneous freezing of water (235 K) in various cooling rates (typically 0.1-10 K min-1). It can also be used for studying homogeneous freezing of aqueous solutions in colder temperatures. Frozen fraction, ice nucleation active surface site densities and freezing kinetics can be obtained from WISDOM measurements for hundreds of individual droplets in a single freezing experiment. Calibration experiments using eutectic solutions and previously studied materials are described. WISDOM also allows repeatable cycles of cooling and heating for the same array of droplets. This paper describes the WISDOM setup, its temperature calibration, validation experiments and measurement uncertainties. Finally, application of WISDOM to study the ice nucleating particle (INP) properties of size-selected ambient Saharan dust particles is presented.
Kim, Tae Hoon; Dekker, Job
2018-05-01
ChIP-chip can be used to analyze protein-DNA interactions in a region-wide and genome-wide manner. DNA microarrays contain PCR products or oligonucleotide probes that are designed to represent genomic sequences. Identification of genomic sites that interact with a specific protein is based on competitive hybridization of the ChIP-enriched DNA and the input DNA to DNA microarrays. The ChIP-chip protocol can be divided into two main sections: Amplification of ChIP DNA and hybridization of ChIP DNA to arrays. A large amount of DNA is required to hybridize to DNA arrays, and hybridization to a set of multiple commercial arrays that represent the entire human genome requires two rounds of PCR amplifications. The relative hybridization intensity of ChIP DNA and that of the input DNA is used to determine whether the probe sequence is a potential site of protein-DNA interaction. Resolution of actual genomic sites bound by the protein is dependent on the size of the chromatin and on the genomic distance between the probes on the array. As with expression profiling using gene chips, ChIP-chip experiments require multiple replicates for reliable statistical measure of protein-DNA interactions. © 2018 Cold Spring Harbor Laboratory Press.
Huang, Shih-Hao; Hsueh, Hui-Jung; Jiang, Yeu-Long
2011-01-01
This paper describes a light-addressable electrolytic system used to perform an electrodeposition of calcium alginate hydrogels using a digital micromirror device (DMD). In this system, a patterned light illumination is projected onto a photoconductive substrate serving as a photo-anode to electrolytically produce protons, which can lead to a decreased pH gradient. The low pH generated at the anode can locally release calcium ions from insoluble calcium carbonate (CaCO3) to cause gelation of calcium alginate through sol-gel transition. By controlling the illumination pattern on the DMD, a light-addressable electrodeposition of calcium alginate hydrogels with different shapes and sizes, as well as multiplexed micropatterning was performed. The effects of the concentration of the alginate and CaCO3 solutions on the dimensional resolution of alginate hydrogel formation were experimentally examined. A 3 × 3 array of cell-encapsulated alginate hydrogels was also successfully demonstrated through light-addressable electrodeposition. Our proposed method provides a programmable method for the spatiotemporally controllable assembly of cell populations into cellular microarrays and could have a wide range of biological applications in cell-based biosensing, toxicology, and drug discovery. PMID:22685500
Huang, Shih-Hao; Hsueh, Hui-Jung; Jiang, Yeu-Long
2011-09-01
This paper describes a light-addressable electrolytic system used to perform an electrodeposition of calcium alginate hydrogels using a digital micromirror device (DMD). In this system, a patterned light illumination is projected onto a photoconductive substrate serving as a photo-anode to electrolytically produce protons, which can lead to a decreased pH gradient. The low pH generated at the anode can locally release calcium ions from insoluble calcium carbonate (CaCO(3)) to cause gelation of calcium alginate through sol-gel transition. By controlling the illumination pattern on the DMD, a light-addressable electrodeposition of calcium alginate hydrogels with different shapes and sizes, as well as multiplexed micropatterning was performed. The effects of the concentration of the alginate and CaCO(3) solutions on the dimensional resolution of alginate hydrogel formation were experimentally examined. A 3 × 3 array of cell-encapsulated alginate hydrogels was also successfully demonstrated through light-addressable electrodeposition. Our proposed method provides a programmable method for the spatiotemporally controllable assembly of cell populations into cellular microarrays and could have a wide range of biological applications in cell-based biosensing, toxicology, and drug discovery.
Clevert, Djork-Arné; Mitterecker, Andreas; Mayr, Andreas; Klambauer, Günter; Tuefferd, Marianne; De Bondt, An; Talloen, Willem; Göhlmann, Hinrich; Hochreiter, Sepp
2011-07-01
Cost-effective oligonucleotide genotyping arrays like the Affymetrix SNP 6.0 are still the predominant technique to measure DNA copy number variations (CNVs). However, CNV detection methods for microarrays overestimate both the number and the size of CNV regions and, consequently, suffer from a high false discovery rate (FDR). A high FDR means that many CNVs are wrongly detected and therefore not associated with a disease in a clinical study, though correction for multiple testing takes them into account and thereby decreases the study's discovery power. For controlling the FDR, we propose a probabilistic latent variable model, 'cn.FARMS', which is optimized by a Bayesian maximum a posteriori approach. cn.FARMS controls the FDR through the information gain of the posterior over the prior. The prior represents the null hypothesis of copy number 2 for all samples from which the posterior can only deviate by strong and consistent signals in the data. On HapMap data, cn.FARMS clearly outperformed the two most prevalent methods with respect to sensitivity and FDR. The software cn.FARMS is publicly available as a R package at http://www.bioinf.jku.at/software/cnfarms/cnfarms.html.
Mallén, Maria; Díaz-González, María; Bonilla, Diana; Salvador, Juan P; Marco, María P; Baldi, Antoni; Fernández-Sánchez, César
2014-06-17
Low-density protein microarrays are emerging tools in diagnostics whose deployment could be primarily limited by the cost of fluorescence detection schemes. This paper describes an electrical readout system of microarrays comprising an array of gold interdigitated microelectrodes and an array of polydimethylsiloxane microwells, which enabled multiplexed detection of up to thirty six biological events on the same substrate. Similarly to fluorescent readout counterparts, the microarray can be developed on disposable glass slide substrates. However, unlike them, the presented approach is compact and requires a simple and inexpensive instrumentation. The system makes use of urease labeled affinity reagents for developing the microarrays and is based on detection of conductivity changes taking place when ionic species are generated in solution due to the catalytic hydrolysis of urea. The use of a polydimethylsiloxane microwell array facilitates the positioning of the measurement solution on every spot of the microarray. Also, it ensures the liquid tightness and isolation from the surrounding ones during the microarray readout process, thereby avoiding evaporation and chemical cross-talk effects that were shown to affect the sensitivity and reliability of the system. The performance of the system is demonstrated by carrying out the readout of a microarray for boldenone anabolic androgenic steroid hormone. Analytical results are comparable to those obtained by fluorescent scanner detection approaches. The estimated detection limit is 4.0 ng mL(-1), this being below the threshold value set by the World Anti-Doping Agency and the European Community. Copyright © 2014 Elsevier B.V. All rights reserved.
Sevenler, Derin; Daaboul, George G; Ekiz Kanik, Fulya; Ünlü, Neşe Lortlar; Ünlü, M Selim
2018-05-21
DNA and protein microarrays are a high-throughput technology that allow the simultaneous quantification of tens of thousands of different biomolecular species. The mediocre sensitivity and limited dynamic range of traditional fluorescence microarrays compared to other detection techniques have been the technology's Achilles' heel and prevented their adoption for many biomedical and clinical diagnostic applications. Previous work to enhance the sensitivity of microarray readout to the single-molecule ("digital") regime have either required signal amplifying chemistry or sacrificed throughput, nixing the platform's primary advantages. Here, we report the development of a digital microarray which extends both the sensitivity and dynamic range of microarrays by about 3 orders of magnitude. This technique uses functionalized gold nanorods as single-molecule labels and an interferometric scanner which can rapidly enumerate individual nanorods by imaging them with a 10× objective lens. This approach does not require any chemical signal enhancement such as silver deposition and scans arrays with a throughput similar to commercial fluorescence scanners. By combining single-nanoparticle enumeration and ensemble measurements of spots when the particles are very dense, this system achieves a dynamic range of about 6 orders of magnitude directly from a single scan. As a proof-of-concept digital protein microarray assay, we demonstrated detection of hepatitis B virus surface antigen in buffer with a limit of detection of 3.2 pg/mL. More broadly, the technique's simplicity and high-throughput nature make digital microarrays a flexible platform technology with a wide range of potential applications in biomedical research and clinical diagnostics.
Burgarella, Sarah; Cattaneo, Dario; Pinciroli, Francesco; Masseroli, Marco
2005-12-01
Improvements of bio-nano-technologies and biomolecular techniques have led to increasing production of high-throughput experimental data. Spotted cDNA microarray is one of the most diffuse technologies, used in single research laboratories and in biotechnology service facilities. Although they are routinely performed, spotted microarray experiments are complex procedures entailing several experimental steps and actors with different technical skills and roles. During an experiment, involved actors, who can also be located in a distance, need to access and share specific experiment information according to their roles. Furthermore, complete information describing all experimental steps must be orderly collected to allow subsequent correct interpretation of experimental results. We developed MicroGen, a web system for managing information and workflow in the production pipeline of spotted microarray experiments. It is constituted of a core multi-database system able to store all data completely characterizing different spotted microarray experiments according to the Minimum Information About Microarray Experiments (MIAME) standard, and of an intuitive and user-friendly web interface able to support the collaborative work required among multidisciplinary actors and roles involved in spotted microarray experiment production. MicroGen supports six types of user roles: the researcher who designs and requests the experiment, the spotting operator, the hybridisation operator, the image processing operator, the system administrator, and the generic public user who can access the unrestricted part of the system to get information about MicroGen services. MicroGen represents a MIAME compliant information system that enables managing workflow and supporting collaborative work in spotted microarray experiment production.
DNA Microarray Detection of 18 Important Human Blood Protozoan Species
Chen, Jun-Hu; Feng, Xin-Yu; Chen, Shao-Hong; Cai, Yu-Chun; Lu, Yan; Zhou, Xiao-Nong; Chen, Jia-Xu; Hu, Wei
2016-01-01
Background Accurate detection of blood protozoa from clinical samples is important for diagnosis, treatment and control of related diseases. In this preliminary study, a novel DNA microarray system was assessed for the detection of Plasmodium, Leishmania, Trypanosoma, Toxoplasma gondii and Babesia in humans, animals, and vectors, in comparison with microscopy and PCR data. Developing a rapid, simple, and convenient detection method for protozoan detection is an urgent need. Methodology/Principal Findings The microarray assay simultaneously identified 18 species of common blood protozoa based on the differences in respective target genes. A total of 20 specific primer pairs and 107 microarray probes were selected according to conserved regions which were designed to identify 18 species in 5 blood protozoan genera. The positive detection rate of the microarray assay was 91.78% (402/438). Sensitivity and specificity for blood protozoan detection ranged from 82.4% (95%CI: 65.9% ~ 98.8%) to 100.0% and 95.1% (95%CI: 93.2% ~ 97.0%) to 100.0%, respectively. Positive predictive value (PPV) and negative predictive value (NPV) ranged from 20.0% (95%CI: 2.5% ~ 37.5%) to 100.0% and 96.8% (95%CI: 95.0% ~ 98.6%) to 100.0%, respectively. Youden index varied from 0.82 to 0.98. The detection limit of the DNA microarrays ranged from 200 to 500 copies/reaction, similar to PCR findings. The concordance rate between microarray data and DNA sequencing results was 100%. Conclusions/Significance Overall, the newly developed microarray platform provides a convenient, highly accurate, and reliable clinical assay for the determination of blood protozoan species. PMID:27911895
Goodman, Corey W.; Major, Heather J.; Walls, William D.; Sheffield, Val C.; Casavant, Thomas L.; Darbro, Benjamin W.
2016-01-01
Chromosomal microarrays (CMAs) are routinely used in both research and clinical laboratories; yet, little attention has been given to the estimation of genome-wide true and false negatives during the assessment of these assays and how such information could be used to calibrate various algorithmic metrics to improve performance. Low-throughput, locus-specific methods such as fluorescence in situ hybridization (FISH), quantitative PCR (qPCR), or multiplex ligation-dependent probe amplification (MLPA) preclude rigorous calibration of various metrics used by copy number variant (CNV) detection algorithms. To aid this task, we have established a comparative methodology, CNV-ROC, which is capable of performing a high throughput, low cost, analysis of CMAs that takes into consideration genome-wide true and false negatives. CNV-ROC uses a higher resolution microarray to confirm calls from a lower resolution microarray and provides for a true measure of genome-wide performance metrics at the resolution offered by microarray testing. CNV-ROC also provides for a very precise comparison of CNV calls between two microarray platforms without the need to establish an arbitrary degree of overlap. Comparison of CNVs across microarrays is done on a per-probe basis and receiver operator characteristic (ROC) analysis is used to calibrate algorithmic metrics, such as log2 ratio threshold, to enhance CNV calling performance. CNV-ROC addresses a critical and consistently overlooked aspect of analytical assessments of genome-wide techniques like CMAs which is the measurement and use of genome-wide true and false negative data for the calculation of performance metrics and comparison of CNV profiles between different microarray experiments. PMID:25595567
Parthasarathy, Narayanan; DeShazer, David; England, Marilyn; Waag, David M
2006-11-01
A polysaccharide microarray platform was prepared by immobilizing Burkholderia pseudomallei and Burkholderia mallei polysaccharides. This polysaccharide array was tested with success for detecting B. pseudomallei and B. mallei serum (human and animal) antibodies. The advantages of this microarray technology over the current serodiagnosis of the above bacterial infections were discussed.
EDRN Biomarker Reference Lab: Pacific Northwest National Laboratory — EDRN Public Portal
The purpose of this project is to develop antibody microarrays incorporating three major improvements compared to previous antibody microarray platforms, and to produce and disseminate these antibody microarray technologies for the Early Detection Research Network (EDRN) and the research community focusing on early detection, and risk assessment of cancer.
DEVELOPMENT AND VALIDATION OF A 2,000 GENE MICROARRAY FOR THE FATHEAD MINNOW, PIMEPHALES PROMELAS
The development of the gene microarray has provided the field of ecotoxicology a new tool to identify modes of action (MOA) of chemicals and chemical mixtures. Herein we describe the development and application of a 2,000 gene oligonucleotide microarray for the fathead minnow (P...
Fabrication of Carbohydrate Microarrays by Boronate Formation.
Adak, Avijit K; Lin, Ting-Wei; Li, Ben-Yuan; Lin, Chun-Cheng
2017-01-01
The interactions between soluble carbohydrates and/or surface displayed glycans and protein receptors are essential to many biological processes and cellular recognition events. Carbohydrate microarrays provide opportunities for high-throughput quantitative analysis of carbohydrate-protein interactions. Over the past decade, various techniques have been implemented for immobilizing glycans on solid surfaces in a microarray format. Herein, we describe a detailed protocol for fabricating carbohydrate microarrays that capitalizes on the intrinsic reactivity of boronic acid toward carbohydrates to form stable boronate diesters. A large variety of unprotected carbohydrates ranging in structure from simple disaccharides and trisaccharides to considerably more complex human milk and blood group (oligo)saccharides have been covalently immobilized in a single step on glass slides, which were derivatized with high-affinity boronic acid ligands. The immobilized ligands in these microarrays maintain the receptor-binding activities including those of lectins and antibodies according to the structures of their pendant carbohydrates for rapid analysis of a number of carbohydrate-recognition events within 30 h. This method facilitates the direct construction of otherwise difficult to obtain carbohydrate microarrays from underivatized glycans.
Laboratory Reflectance Spectra in the Middle-infrared: Effects of Grain Size on Spectral Features
NASA Astrophysics Data System (ADS)
Le Bras, A.; Erard, S.; Fulchignoni, M.
2000-10-01
Since spectral mineral features are sensitive to surface parameters, interpretation of remote-sensing asteroids spectra in terms of mineral composition is not easy nor unique, and laboratory spectra are needed in order to understand the influence of each parameter. We developped an experimental program at IAS, using the 2.5-120 microns interferometer spectrometer, to study the influence of surface parameters on mineral features. We present here the results obtained variing the grain size. We studied grain size effects with two types of terrestrial rocks: anorthosite (bright) and basalte (dark) in the 2-40 microns range. We observed variations of the spectral contrast with grain size, shifts in wavelengths and variations of the intensity of some characteristic spectral features, and appearence of transparency features at wavelengths longer than 8 microns.
Signal amplification by rolling circle amplification on DNA microarrays
Nallur, Girish; Luo, Chenghua; Fang, Linhua; Cooley, Stephanie; Dave, Varshal; Lambert, Jeremy; Kukanskis, Kari; Kingsmore, Stephen; Lasken, Roger; Schweitzer, Barry
2001-01-01
While microarrays hold considerable promise in large-scale biology on account of their massively parallel analytical nature, there is a need for compatible signal amplification procedures to increase sensitivity without loss of multiplexing. Rolling circle amplification (RCA) is a molecular amplification method with the unique property of product localization. This report describes the application of RCA signal amplification for multiplexed, direct detection and quantitation of nucleic acid targets on planar glass and gel-coated microarrays. As few as 150 molecules bound to the surface of microarrays can be detected using RCA. Because of the linear kinetics of RCA, nucleic acid target molecules may be measured with a dynamic range of four orders of magnitude. Consequently, RCA is a promising technology for the direct measurement of nucleic acids on microarrays without the need for a potentially biasing preamplification step. PMID:11726701
Cell-Based Microarrays for In Vitro Toxicology
NASA Astrophysics Data System (ADS)
Wegener, Joachim
2015-07-01
DNA/RNA and protein microarrays have proven their outstanding bioanalytical performance throughout the past decades, given the unprecedented level of parallelization by which molecular recognition assays can be performed and analyzed. Cell microarrays (CMAs) make use of similar construction principles. They are applied to profile a given cell population with respect to the expression of specific molecular markers and also to measure functional cell responses to drugs and chemicals. This review focuses on the use of cell-based microarrays for assessing the cytotoxicity of drugs, toxins, or chemicals in general. It also summarizes CMA construction principles with respect to the cell types that are used for such microarrays, the readout parameters to assess toxicity, and the various formats that have been established and applied. The review ends with a critical comparison of CMAs and well-established microtiter plate (MTP) approaches.
The use of open source bioinformatics tools to dissect transcriptomic data.
Nitsche, Benjamin M; Ram, Arthur F J; Meyer, Vera
2012-01-01
Microarrays are a valuable technology to study fungal physiology on a transcriptomic level. Various microarray platforms are available comprising both single and two channel arrays. Despite different technologies, preprocessing of microarray data generally includes quality control, background correction, normalization, and summarization of probe level data. Subsequently, depending on the experimental design, diverse statistical analysis can be performed, including the identification of differentially expressed genes and the construction of gene coexpression networks.We describe how Bioconductor, a collection of open source and open development packages for the statistical programming language R, can be used for dissecting microarray data. We provide fundamental details that facilitate the process of getting started with R and Bioconductor. Using two publicly available microarray datasets from Aspergillus niger, we give detailed protocols on how to identify differentially expressed genes and how to construct gene coexpression networks.
Ejected Particle Size Distributions from Shocked Metal Surfaces
Schauer, M. M.; Buttler, W. T.; Frayer, D. K.; ...
2017-04-12
Here, we present size distributions for particles ejected from features machined onto the surface of shocked Sn targets. The functional form of the size distributions is assumed to be log-normal, and the characteristic parameters of the distribution are extracted from the measured angular distribution of light scattered from a laser beam incident on the ejected particles. We also found strong evidence for a bimodal distribution of particle sizes with smaller particles evolved from features machined into the target surface and larger particles being produced at the edges of these features.
Ejected Particle Size Distributions from Shocked Metal Surfaces
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schauer, M. M.; Buttler, W. T.; Frayer, D. K.
Here, we present size distributions for particles ejected from features machined onto the surface of shocked Sn targets. The functional form of the size distributions is assumed to be log-normal, and the characteristic parameters of the distribution are extracted from the measured angular distribution of light scattered from a laser beam incident on the ejected particles. We also found strong evidence for a bimodal distribution of particle sizes with smaller particles evolved from features machined into the target surface and larger particles being produced at the edges of these features.
Zhang, Zhaowei; Li, Peiwu; Hu, Xiaofeng; Zhang, Qi; Ding, Xiaoxia; Zhang, Wen
2012-01-01
Chemical contaminants in food have caused serious health issues in both humans and animals. Microarray technology is an advanced technique suitable for the analysis of chemical contaminates. In particular, immuno-microarray approach is one of the most promising methods for chemical contaminants analysis. The use of microarrays for the analysis of chemical contaminants is the subject of this review. Fabrication strategies and detection methods for chemical contaminants are discussed in detail. Application to the analysis of mycotoxins, biotoxins, pesticide residues, and pharmaceutical residues is also described. Finally, future challenges and opportunities are discussed.
Enhancing Results of Microarray Hybridizations Through Microagitation
Toegl, Andreas; Kirchner, Roland; Gauer, Christoph; Wixforth, Achim
2003-01-01
Protein and DNA microarrays have become a standard tool in proteomics/genomics research. In order to guarantee fast and reproducible hybridization results, the diffusion limit must be overcome. Surface acoustic wave (SAW) micro-agitation chips efficiently agitate the smallest sample volumes (down to 10 μL and below) without introducing any dead volume. The advantages are reduced reaction time, increased signal-to-noise ratio, improved homogeneity across the microarray, and better slide-to-slide reproducibility. The SAW micromixer chips are the heart of the Advalytix ArrayBooster, which is compatible with all microarrays based on the microscope slide format. PMID:13678150
AFM 4.0: a toolbox for DNA microarray analysis
Breitkreutz, Bobby-Joe; Jorgensen, Paul; Breitkreutz, Ashton; Tyers, Mike
2001-01-01
We have developed a series of programs, collectively packaged as Array File Maker 4.0 (AFM), that manipulate and manage DNA microarray data. AFM 4.0 is simple to use, applicable to any organism or microarray, and operates within the familiar confines of Microsoft Excel. Given a database of expression ratios, AFM 4.0 generates input files for clustering, helps prepare colored figures and Venn diagrams, and can uncover aneuploidy in yeast microarray data. AFM 4.0 should be especially useful to laboratories that do not have access to specialized commercial or in-house software. PMID:11532221
Progress in the application of DNA microarrays.
Lobenhofer, E K; Bushel, P R; Afshari, C A; Hamadeh, H K
2001-01-01
Microarray technology has been applied to a variety of different fields to address fundamental research questions. The use of microarrays, or DNA chips, to study the gene expression profiles of biologic samples began in 1995. Since that time, the fundamental concepts behind the chip, the technology required for making and using these chips, and the multitude of statistical tools for analyzing the data have been extensively reviewed. For this reason, the focus of this review will be not on the technology itself but on the application of microarrays as a research tool and the future challenges of the field. PMID:11673116