Yu, Shunying; Yuan, Chengmei; Hong, Wu; Wang, Zuowei; Cui, Jian; Shi, Tieliu; Fang, Yiru
2012-01-01
Subsyndromal symptomatic depression (SSD) is a subtype of subthreshold depressive and also lead to significant psychosocial functional impairment as same as major depressive disorder (MDD). Several studies have suggested that SSD is a transitory phenomena in the depression spectrum and is thus considered a subtype of depression. However, the pathophysioloy of depression remain largely obscure and studies on SSD are limited. The present study compared the expression profile and made the classification with the leukocytes by using whole-genome cRNA microarrays among drug-free first-episode subjects with SSD, MDD, and matched controls (8 subjects in each group). Support vector machines (SVMs) were utilized for training and testing on candidate signature expression profiles from signature selection step. Firstly, we identified 63 differentially expressed SSD signatures in contrast to control (P< = 5.0E-4) and 30 differentially expressed MDD signatures in contrast to control, respectively. Then, 123 gene signatures were identified with significantly differential expression level between SSD and MDD. Secondly, in order to conduct priority selection for biomarkers for SSD and MDD together, we selected top gene signatures from each group of pair-wise comparison results, and merged the signatures together to generate better profiles used for clearly classify SSD and MDD sets in the same time. In details, we tried different combination of signatures from the three pair-wise compartmental results and finally determined 48 gene expression signatures with 100% accuracy. Our finding suggested that SSD and MDD did not exhibit the same expressed genome signature with peripheral blood leukocyte, and blood cell–derived RNA of these 48 gene models may have significant value for performing diagnostic functions and classifying SSD, MDD, and healthy controls. PMID:22348066
Glycosyltransferase Gene Expression Profiles Classify Cancer Types and Propose Prognostic Subtypes
NASA Astrophysics Data System (ADS)
Ashkani, Jahanshah; Naidoo, Kevin J.
2016-05-01
Aberrant glycosylation in tumours stem from altered glycosyltransferase (GT) gene expression but can the expression profiles of these signature genes be used to classify cancer types and lead to cancer subtype discovery? The differential structural changes to cellular glycan structures are predominantly regulated by the expression patterns of GT genes and are a hallmark of neoplastic cell metamorphoses. We found that the expression of 210 GT genes taken from 1893 cancer patient samples in The Cancer Genome Atlas (TCGA) microarray data are able to classify six cancers; breast, ovarian, glioblastoma, kidney, colon and lung. The GT gene expression profiles are used to develop cancer classifiers and propose subtypes. The subclassification of breast cancer solid tumour samples illustrates the discovery of subgroups from GT genes that match well against basal-like and HER2-enriched subtypes and correlates to clinical, mutation and survival data. This cancer type glycosyltransferase gene signature finding provides foundational evidence for the centrality of glycosylation in cancer.
Artificial neural network classifier predicts neuroblastoma patients' outcome.
Cangelosi, Davide; Pelassa, Simone; Morini, Martina; Conte, Massimo; Bosco, Maria Carla; Eva, Alessandra; Sementa, Angela Rita; Varesio, Luigi
2016-11-08
More than fifty percent of neuroblastoma (NB) patients with adverse prognosis do not benefit from treatment making the identification of new potential targets mandatory. Hypoxia is a condition of low oxygen tension, occurring in poorly vascularized tissues, which activates specific genes and contributes to the acquisition of the tumor aggressive phenotype. We defined a gene expression signature (NB-hypo), which measures the hypoxic status of the neuroblastoma tumor. We aimed at developing a classifier predicting neuroblastoma patients' outcome based on the assessment of the adverse effects of tumor hypoxia on the progression of the disease. Multi-layer perceptron (MLP) was trained on the expression values of the 62 probe sets constituting NB-hypo signature to develop a predictive model for neuroblastoma patients' outcome. We utilized the expression data of 100 tumors in a leave-one-out analysis to select and construct the classifier and the expression data of the remaining 82 tumors to test the classifier performance in an external dataset. We utilized the Gene set enrichment analysis (GSEA) to evaluate the enrichment of hypoxia related gene sets in patients predicted with "Poor" or "Good" outcome. We utilized the expression of the 62 probe sets of the NB-Hypo signature in 182 neuroblastoma tumors to develop a MLP classifier predicting patients' outcome (NB-hypo classifier). We trained and validated the classifier in a leave-one-out cross-validation analysis on 100 tumor gene expression profiles. We externally tested the resulting NB-hypo classifier on an independent 82 tumors' set. The NB-hypo classifier predicted the patients' outcome with the remarkable accuracy of 87 %. NB-hypo classifier prediction resulted in 2 % classification error when applied to clinically defined low-intermediate risk neuroblastoma patients. The prediction was 100 % accurate in assessing the death of five low/intermediated risk patients. GSEA of tumor gene expression profile demonstrated the hypoxic status of the tumor in patients with poor prognosis. We developed a robust classifier predicting neuroblastoma patients' outcome with a very low error rate and we provided independent evidence that the poor outcome patients had hypoxic tumors, supporting the potential of using hypoxia as target for neuroblastoma treatment.
Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd
Wang, Zichen; Monteiro, Caroline D.; Jagodnik, Kathleen M.; Fernandez, Nicolas F.; Gundersen, Gregory W.; Rouillard, Andrew D.; Jenkins, Sherry L.; Feldmann, Axel S.; Hu, Kevin S.; McDermott, Michael G.; Duan, Qiaonan; Clark, Neil R.; Jones, Matthew R.; Kou, Yan; Goff, Troy; Woodland, Holly; Amaral, Fabio M R.; Szeto, Gregory L.; Fuchs, Oliver; Schüssler-Fiorenza Rose, Sophia M.; Sharma, Shvetank; Schwartz, Uwe; Bausela, Xabier Bengoetxea; Szymkiewicz, Maciej; Maroulis, Vasileios; Salykin, Anton; Barra, Carolina M.; Kruth, Candice D.; Bongio, Nicholas J.; Mathur, Vaibhav; Todoric, Radmila D; Rubin, Udi E.; Malatras, Apostolos; Fulp, Carl T.; Galindo, John A.; Motiejunaite, Ruta; Jüschke, Christoph; Dishuck, Philip C.; Lahl, Katharina; Jafari, Mohieddin; Aibar, Sara; Zaravinos, Apostolos; Steenhuizen, Linda H.; Allison, Lindsey R.; Gamallo, Pablo; de Andres Segura, Fernando; Dae Devlin, Tyler; Pérez-García, Vicente; Ma'ayan, Avi
2016-01-01
Gene expression data are accumulating exponentially in public repositories. Reanalysis and integration of themed collections from these studies may provide new insights, but requires further human curation. Here we report a crowdsourcing project to annotate and reanalyse a large number of gene expression profiles from Gene Expression Omnibus (GEO). Through a massive open online course on Coursera, over 70 participants from over 25 countries identify and annotate 2,460 single-gene perturbation signatures, 839 disease versus normal signatures, and 906 drug perturbation signatures. All these signatures are unique and are manually validated for quality. Global analysis of these signatures confirms known associations and identifies novel associations between genes, diseases and drugs. The manually curated signatures are used as a training set to develop classifiers for extracting similar signatures from the entire GEO repository. We develop a web portal to serve these signatures for query, download and visualization. PMID:27667448
Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd.
Wang, Zichen; Monteiro, Caroline D; Jagodnik, Kathleen M; Fernandez, Nicolas F; Gundersen, Gregory W; Rouillard, Andrew D; Jenkins, Sherry L; Feldmann, Axel S; Hu, Kevin S; McDermott, Michael G; Duan, Qiaonan; Clark, Neil R; Jones, Matthew R; Kou, Yan; Goff, Troy; Woodland, Holly; Amaral, Fabio M R; Szeto, Gregory L; Fuchs, Oliver; Schüssler-Fiorenza Rose, Sophia M; Sharma, Shvetank; Schwartz, Uwe; Bausela, Xabier Bengoetxea; Szymkiewicz, Maciej; Maroulis, Vasileios; Salykin, Anton; Barra, Carolina M; Kruth, Candice D; Bongio, Nicholas J; Mathur, Vaibhav; Todoric, Radmila D; Rubin, Udi E; Malatras, Apostolos; Fulp, Carl T; Galindo, John A; Motiejunaite, Ruta; Jüschke, Christoph; Dishuck, Philip C; Lahl, Katharina; Jafari, Mohieddin; Aibar, Sara; Zaravinos, Apostolos; Steenhuizen, Linda H; Allison, Lindsey R; Gamallo, Pablo; de Andres Segura, Fernando; Dae Devlin, Tyler; Pérez-García, Vicente; Ma'ayan, Avi
2016-09-26
Gene expression data are accumulating exponentially in public repositories. Reanalysis and integration of themed collections from these studies may provide new insights, but requires further human curation. Here we report a crowdsourcing project to annotate and reanalyse a large number of gene expression profiles from Gene Expression Omnibus (GEO). Through a massive open online course on Coursera, over 70 participants from over 25 countries identify and annotate 2,460 single-gene perturbation signatures, 839 disease versus normal signatures, and 906 drug perturbation signatures. All these signatures are unique and are manually validated for quality. Global analysis of these signatures confirms known associations and identifies novel associations between genes, diseases and drugs. The manually curated signatures are used as a training set to develop classifiers for extracting similar signatures from the entire GEO repository. We develop a web portal to serve these signatures for query, download and visualization.
Balow, James E; Ryan, John G; Chae, Jae Jin; Booty, Matthew G; Bulua, Ariel; Stone, Deborah; Sun, Hong-Wei; Greene, James; Barham, Beverly; Goldbach-Mansky, Raphaela; Kastner, Daniel L; Aksentijevich, Ivona
2013-06-01
To analyse gene expression patterns and to define a specific gene expression signature in patients with the severe end of the spectrum of cryopyrin-associated periodic syndromes (CAPS). The molecular consequences of interleukin 1 inhibition were examined by comparing gene expression patterns in 16 CAPS patients before and after treatment with anakinra. We collected peripheral blood mononuclear cells from 22 CAPS patients with active disease and from 14 healthy children. Transcripts that passed stringent filtering criteria (p values≤false discovery rate 1%) were considered as differentially expressed genes (DEG). A set of DEG was validated by quantitative reverse transcription PCR and functional studies with primary cells from CAPS patients and healthy controls. We used 17 CAPS and 66 non-CAPS patient samples to create a set of gene expression models that differentiates CAPS patients from controls and from patients with other autoinflammatory conditions. Many DEG include transcripts related to the regulation of innate and adaptive immune responses, oxidative stress, cell death, cell adhesion and motility. A set of gene expression-based models comprising the CAPS-specific gene expression signature correctly classified all 17 samples from an independent dataset. This classifier also correctly identified 15 of 16 post-anakinra CAPS samples despite the fact that these CAPS patients were in clinical remission. We identified a gene expression signature that clearly distinguished CAPS patients from controls. A number of DEG were in common with other systemic inflammatory diseases such as systemic onset juvenile idiopathic arthritis. The CAPS-specific gene expression classifiers also suggest incomplete suppression of inflammation at low doses of anakinra.
Balow, James E; Ryan, John G; Chae, Jae Jin; Booty, Matthew G; Bulua, Ariel; Stone, Deborah; Sun, Hong-Wei; Greene, James; Barham, Beverly; Goldbach-Mansky, Raphaela; Kastner, Daniel L; Aksentijevich, Ivona
2014-01-01
Objective To analyse gene expression patterns and to define a specific gene expression signature in patients with the severe end of the spectrum of cryopyrin-associated periodic syndromes (CAPS). The molecular consequences of interleukin 1 inhibition were examined by comparing gene expression patterns in 16 CAPS patients before and after treatment with anakinra. Methods We collected peripheral blood mononuclear cells from 22 CAPS patients with active disease and from 14 healthy children. Transcripts that passed stringent filtering criteria (p values ≤ false discovery rate 1%) were considered as differentially expressed genes (DEG). A set of DEG was validated by quantitative reverse transcription PCR and functional studies with primary cells from CAPS patients and healthy controls. We used 17 CAPS and 66 non-CAPS patient samples to create a set of gene expression models that differentiates CAPS patients from controls and from patients with other autoinflammatory conditions. Results Many DEG include transcripts related to the regulation of innate and adaptive immune responses, oxidative stress, cell death, cell adhesion and motility. A set of gene expression-based models comprising the CAPS-specific gene expression signature correctly classified all 17 samples from an independent dataset. This classifier also correctly identified 15 of 16 postanakinra CAPS samples despite the fact that these CAPS patients were in clinical remission. Conclusions We identified a gene expression signature that clearly distinguished CAPS patients from controls. A number of DEG were in common with other systemic inflammatory diseases such as systemic onset juvenile idiopathic arthritis. The CAPS-specific gene expression classifiers also suggest incomplete suppression of inflammation at low doses of anakinra. PMID:23223423
A comparison of machine learning techniques for survival prediction in breast cancer
2011-01-01
Background The ability to accurately classify cancer patients into risk classes, i.e. to predict the outcome of the pathology on an individual basis, is a key ingredient in making therapeutic decisions. In recent years gene expression data have been successfully used to complement the clinical and histological criteria traditionally used in such prediction. Many "gene expression signatures" have been developed, i.e. sets of genes whose expression values in a tumor can be used to predict the outcome of the pathology. Here we investigate the use of several machine learning techniques to classify breast cancer patients using one of such signatures, the well established 70-gene signature. Results We show that Genetic Programming performs significantly better than Support Vector Machines, Multilayered Perceptrons and Random Forests in classifying patients from the NKI breast cancer dataset, and comparably to the scoring-based method originally proposed by the authors of the 70-gene signature. Furthermore, Genetic Programming is able to perform an automatic feature selection. Conclusions Since the performance of Genetic Programming is likely to be improvable compared to the out-of-the-box approach used here, and given the biological insight potentially provided by the Genetic Programming solutions, we conclude that Genetic Programming methods are worth further investigation as a tool for cancer patient classification based on gene expression data. PMID:21569330
Li, Qiyuan; Eklund, Aron C.; Juul, Nicolai; Haibe-Kains, Benjamin; Workman, Christopher T.; Richardson, Andrea L.; Szallasi, Zoltan; Swanton, Charles
2010-01-01
Background Expression of the oestrogen receptor (ER) in breast cancer predicts benefit from endocrine therapy. Minimising the frequency of false negative ER status classification is essential to identify all patients with ER positive breast cancers who should be offered endocrine therapies in order to improve clinical outcome. In routine oncological practice ER status is determined by semi-quantitative methods such as immunohistochemistry (IHC) or other immunoassays in which the ER expression level is compared to an empirical threshold[1], [2]. The clinical relevance of gene expression-based ER subtypes as compared to IHC-based determination has not been systematically evaluated. Here we attempt to reduce the frequency of false negative ER status classification using two gene expression approaches and compare these methods to IHC based ER status in terms of predictive and prognostic concordance with clinical outcome. Methodology/Principal Findings Firstly, ER status was discriminated by fitting the bimodal expression of ESR1 to a mixed Gaussian model. The discriminative power of ESR1 suggested bimodal expression as an efficient way to stratify breast cancer; therefore we identified a set of genes whose expression was both strongly bimodal, mimicking ESR expression status, and highly expressed in breast epithelial cell lines, to derive a 23-gene ER expression signature-based classifier. We assessed our classifiers in seven published breast cancer cohorts by comparing the gene expression-based ER status to IHC-based ER status as a predictor of clinical outcome in both untreated and tamoxifen treated cohorts. In untreated breast cancer cohorts, the 23 gene signature-based ER status provided significantly improved prognostic power compared to IHC-based ER status (P = 0.006). In tamoxifen-treated cohorts, the 23 gene ER expression signature predicted clinical outcome (HR = 2.20, P = 0.00035). These complementary ER signature-based strategies estimated that between 15.1% and 21.8% patients of IHC-based negative ER status would be classified with ER positive breast cancer. Conclusion/Significance Expression-based ER status classification may complement IHC to minimise false negative ER status classification and optimise patient stratification for endocrine therapies. PMID:21152022
Comparison of transcriptomic signature of post-Chernobyl and postradiotherapy thyroid tumors.
Ory, Catherine; Ugolin, Nicolas; Hofman, Paul; Schlumberger, Martin; Likhtarev, Illya A; Chevillard, Sylvie
2013-11-01
We previously identified two highly discriminating and predictive radiation-induced transcriptomic signatures by comparing series of sporadic and postradiotherapy thyroid tumors (322-gene signature), and by reanalyzing a previously published data set of sporadic and post-Chernobyl thyroid tumors (106-gene signature). The aim of the present work was (i) to compare the two signatures in terms of gene expression deregulations and molecular features/pathways, and (ii) to test the capacity of the postradiotherapy signature in classifying the post-Chernobyl series of tumors and reciprocally of the post-Chernobyl signature in classifying the postradiotherapy-induced tumors. We now explored if postradiotherapy and post-Chernobyl papillary thyroid carcinomas (PTC) display common molecular features by comparing molecular pathways deregulated in the two tumor series, and tested the potential of gene subsets of the postradiotherapy signature to classify the post-Chernobyl series (14 sporadic and 12 post-Chernobyl PTC), and reciprocally of gene subsets of the post-Chernobyl signature to classify the postradiotherapy series (15 sporadic and 12 postradiotherapy PTC), by using conventional principal component analysis. We found that the five genes common to the two signatures classified the learning/training tumors (used to search these signatures) of both the postradiotherapy (seven PTC) and the post-Chernobyl (six PTC) thyroid tumor series as compared with the sporadic tumors (seven sporadic PTC in each series). Importantly, these five genes were also effective for classifying independent series of postradiotherapy (five PTC) and post-Chernobyl (six PTC) tumors compared to independent series of sporadic tumors (eight PTC and six PTC respectively; testing tumors). Moreover, part of each postradiotherapy (32 genes) and post-Chernobyl signature (16 genes) cross-classified the respective series of thyroid tumors. Finally, several molecular pathways deregulated in post-Chernobyl tumors matched those found to be deregulated in postradiotherapy tumors. Overall, our data suggest that thyroid tumors that developed following either external exposure or internal (131)I contamination shared common molecular features, related to DNA repair, oxidative and endoplasmic reticulum stresses, allowing their classification as radiation-induced tumors in comparison with sporadic counterparts, independently of doses and dose rates, which suggests there may be a "general" radiation-induced signature of thyroid tumors.
Pangeni, Rajendra P; Zhang, Zhou; Alvarez, Angel A; Wan, Xuechao; Sastry, Namratha; Lu, Songjian; Shi, Taiping; Huang, Tianzhi; Lei, Charles X; James, C David; Kessler, John A; Brennan, Cameron W; Nakano, Ichiro; Lu, Xinghua; Hu, Bo; Zhang, Wei; Cheng, Shi-Yuan
2018-06-21
Glioma stem cells (GSCs), a subpopulation of tumor cells, contribute to tumor heterogeneity and therapy resistance. Gene expression profiling classified glioblastoma (GBM) and GSCs into four transcriptomically-defined subtypes. Here, we determined the DNA methylation signatures in transcriptomically pre-classified GSC and GBM bulk tumors subtypes. We hypothesized that these DNA methylation signatures correlate with gene expression and are uniquely associated either with only GSCs or only GBM bulk tumors. Additional methylation signatures may be commonly associated with both GSCs and GBM bulk tumors, i.e., common to non-stem-like and stem-like tumor cell populations and correlating with the clinical prognosis of glioma patients. We analyzed Illumina 450K methylation array and expression data from a panel of 23 patient-derived GSCs. We referenced these results with The Cancer Genome Atlas (TCGA) GBM datasets to generate methylomic and transcriptomic signatures for GSCs and GBM bulk tumors of each transcriptomically pre-defined tumor subtype. Survival analyses were carried out for these signature genes using publicly available datasets, including from TCGA. We report that DNA methylation signatures in proneural and mesenchymal tumor subtypes are either unique to GSCs, unique to GBM bulk tumors, or common to both. Further, dysregulated DNA methylation correlates with gene expression and clinical prognoses. Additionally, many previously identified transcriptionally-regulated markers are also dysregulated due to DNA methylation. The subtype-specific DNA methylation signatures described in this study could be useful for refining GBM sub-classification, improving prognostic accuracy, and making therapeutic decisions.
Dunne, Philip D.; Alderdice, Matthew; O'Reilly, Paul G.; Roddy, Aideen C.; McCorry, Amy M. B.; Richman, Susan; Maughan, Tim; McDade, Simon S.; Johnston, Patrick G.; Longley, Daniel B.; Kay, Elaine; McArt, Darragh G.; Lawler, Mark
2017-01-01
Stromal-derived intratumoural heterogeneity (ITH) has been shown to undermine molecular stratification of patients into appropriate prognostic/predictive subgroups. Here, using several clinically relevant colorectal cancer (CRC) gene expression signatures, we assessed the susceptibility of these signatures to the confounding effects of ITH using gene expression microarray data obtained from multiple tumour regions of a cohort of 24 patients, including central tumour, the tumour invasive front and lymph node metastasis. Sample clustering alongside correlative assessment revealed variation in the ability of each signature to cluster samples according to patient-of-origin rather than region-of-origin within the multi-region dataset. Signatures focused on cancer-cell intrinsic gene expression were found to produce more clinically useful, patient-centred classifiers, as exemplified by the CRC intrinsic signature (CRIS), which robustly clustered samples by patient-of-origin rather than region-of-origin. These findings highlight the potential of cancer-cell intrinsic signatures to reliably stratify CRC patients by minimising the confounding effects of stromal-derived ITH. PMID:28561046
A novel algorithm for simplification of complex gene classifiers in cancer
Wilson, Raphael A.; Teng, Ling; Bachmeyer, Karen M.; Bissonnette, Mei Lin Z.; Husain, Aliya N.; Parham, David M.; Triche, Timothy J.; Wing, Michele R.; Gastier-Foster, Julie M.; Barr, Frederic G.; Hawkins, Douglas S.; Anderson, James R.; Skapek, Stephen X.; Volchenboum, Samuel L.
2013-01-01
The clinical application of complex molecular classifiers as diagnostic or prognostic tools has been limited by the time and cost needed to apply them to patients. Using an existing fifty-gene expression signature known to separate two molecular subtypes of the pediatric cancer rhabdomyosarcoma, we show that an exhaustive iterative search algorithm can distill this complex classifier down to two or three features with equal discrimination. We validated the two-gene signatures using three separate and distinct data sets, including one that uses degraded RNA extracted from formalin-fixed, paraffin-embedded material. Finally, to demonstrate the generalizability of our algorithm, we applied it to a lung cancer data set to find minimal gene signatures that can distinguish survival. Our approach can easily be generalized and coupled to existing technical platforms to facilitate the discovery of simplified signatures that are ready for routine clinical use. PMID:23913937
Ao, Lu; Zhang, Zimei; Guan, Qingzhou; Guo, Yating; Guo, You; Zhang, Jiahui; Lv, Xingwei; Huang, Haiyan; Zhang, Huarong; Wang, Xianlong; Guo, Zheng
2018-04-23
Currently, using biopsy specimens to confirm suspicious liver lesions of early hepatocellular carcinoma are not entirely reliable because of insufficient sampling amount and inaccurate sampling location. It is necessary to develop a signature to aid early hepatocellular carcinoma diagnosis using biopsy specimens even when the sampling location is inaccurate. Based on the within-sample relative expression orderings of gene pairs, we identified a simple qualitative signature to distinguish both hepatocellular carcinoma and adjacent non-tumour tissues from cirrhosis tissues of non-hepatocellular carcinoma patients. A signature consisting of 19 gene pairs was identified in the training data sets and validated in 2 large collections of samples from biopsy and surgical resection specimens. For biopsy specimens, 95.7% of 141 hepatocellular carcinoma tissues and all (100%) of 108 cirrhosis tissues of non-hepatocellular carcinoma patients were correctly classified. Especially, all (100%) of 60 hepatocellular carcinoma adjacent normal tissues and 77.5% of 80 hepatocellular carcinoma adjacent cirrhosis tissues were classified to hepatocellular carcinoma. For surgical resection specimens, 99.7% of 733 hepatocellular carcinoma specimens were correctly classified to hepatocellular carcinoma, while 96.1% of 254 hepatocellular carcinoma adjacent cirrhosis tissues and 95.9% of 538 hepatocellular carcinoma adjacent normal tissues were classified to hepatocellular carcinoma. In contrast, 17.0% of 47 cirrhosis from non-hepatocellular carcinoma patients waiting for liver transplantation were classified to hepatocellular carcinoma, indicating that some patients with long-lasting cirrhosis could have already gained hepatocellular carcinoma characteristics. The signature can distinguish both hepatocellular carcinoma tissues and tumour-adjacent tissues from cirrhosis tissues of non-hepatocellular carcinoma patients even using inaccurately sampled biopsy specimens, which can aid early diagnosis of hepatocellular carcinoma. © 2018 The Authors. Liver International Published by John Wiley & Sons Ltd.
Gene Expression Signatures Based on Variability can Robustly Predict Tumor Progression and Prognosis
Dinalankara, Wikum; Bravo, Héctor Corrada
2015-01-01
Gene expression signatures are commonly used to create cancer prognosis and diagnosis methods, yet only a small number of them are successfully deployed in the clinic since many fail to replicate performance on subsequent validation. A primary reason for this lack of reproducibility is the fact that these signatures attempt to model the highly variable and unstable genomic behavior of cancer. Our group recently introduced gene expression anti-profiles as a robust methodology to derive gene expression signatures based on the observation that while gene expression measurements are highly heterogeneous across tumors of a specific cancer type relative to the normal tissue, their degree of deviation from normal tissue expression in specific genes involved in tissue differentiation is a stable tumor mark that is reproducible across experiments and cancer types. Here we show that constructing gene expression signatures based on variability and the anti-profile approach yields classifiers capable of successfully distinguishing benign growths from cancerous growths based on deviation from normal expression. We then show that this same approach generates stable and reproducible signatures that predict probability of relapse and survival based on tumor gene expression. These results suggest that using the anti-profile framework for the discovery of genomic signatures is an avenue leading to the development of reproducible signatures suitable for adoption in clinical settings. PMID:26078586
Molecular Signature for Lymphatic Invasion Associated with Survival of Epithelial Ovarian Cancer.
Paik, E Sun; Choi, Hyun Jin; Kim, Tae-Joong; Lee, Jeong-Won; Kim, Byoung-Gie; Bae, Duk-Soo; Choi, Chel Hun
2018-04-01
We aimed to develop molecular classifier that can predict lymphatic invasion and their clinical significance in epithelial ovarian cancer (EOC) patients. We analyzed gene expression (mRNA, methylated DNA) in data from The Cancer Genome Atlas. To identify molecular signatures for lymphatic invasion, we found differentially expressed genes. The performance of classifier was validated by receiver operating characteristics analysis, logistic regression, linear discriminant analysis (LDA), and support vector machine (SVM). We assessed prognostic role of classifier using random survival forest (RSF) model and pathway deregulation score (PDS). For external validation,we analyzed microarray data from 26 EOC samples of Samsung Medical Center and curatedOvarianData database. We identified 21 mRNAs, and seven methylated DNAs from primary EOC tissues that predicted lymphatic invasion and created prognostic models. The classifier predicted lymphatic invasion well, which was validated by logistic regression, LDA, and SVM algorithm (C-index of 0.90, 0.71, and 0.74 for mRNA and C-index of 0.64, 0.68, and 0.69 for DNA methylation). Using RSF model, incorporating molecular data with clinical variables improved prediction of progression-free survival compared with using only clinical variables (p < 0.001 and p=0.008). Similarly, PDS enabled us to classify patients into high-risk and low-risk group, which resulted in survival difference in mRNA profiles (log-rank p-value=0.011). In external validation, gene signature was well correlated with prediction of lymphatic invasion and patients' survival. Molecular signature model predicting lymphatic invasion was well performed and also associated with survival of EOC patients.
Cangelosi, Davide; Muselli, Marco; Parodi, Stefano; Blengio, Fabiola; Becherini, Pamela; Versteeg, Rogier; Conte, Massimo; Varesio, Luigi
2014-01-01
Cancer patient's outcome is written, in part, in the gene expression profile of the tumor. We previously identified a 62-probe sets signature (NB-hypo) to identify tissue hypoxia in neuroblastoma tumors and showed that NB-hypo stratified neuroblastoma patients in good and poor outcome 1. It was important to develop a prognostic classifier to cluster patients into risk groups benefiting of defined therapeutic approaches. Novel classification and data discretization approaches can be instrumental for the generation of accurate predictors and robust tools for clinical decision support. We explored the application to gene expression data of Rulex, a novel software suite including the Attribute Driven Incremental Discretization technique for transforming continuous variables into simplified discrete ones and the Logic Learning Machine model for intelligible rule generation. We applied Rulex components to the problem of predicting the outcome of neuroblastoma patients on the bases of 62 probe sets NB-hypo gene expression signature. The resulting classifier consisted in 9 rules utilizing mainly two conditions of the relative expression of 11 probe sets. These rules were very effective predictors, as shown in an independent validation set, demonstrating the validity of the LLM algorithm applied to microarray data and patients' classification. The LLM performed as efficiently as Prediction Analysis of Microarray and Support Vector Machine, and outperformed other learning algorithms such as C4.5. Rulex carried out a feature selection by selecting a new signature (NB-hypo-II) of 11 probe sets that turned out to be the most relevant in predicting outcome among the 62 of the NB-hypo signature. Rules are easily interpretable as they involve only few conditions. Our findings provided evidence that the application of Rulex to the expression values of NB-hypo signature created a set of accurate, high quality, consistent and interpretable rules for the prediction of neuroblastoma patients' outcome. We identified the Rulex weighted classification as a flexible tool that can support clinical decisions. For these reasons, we consider Rulex to be a useful tool for cancer classification from microarray gene expression data.
Zaas, Aimee K.; Chen, Minhua; Varkey, Jay; Veldman, Timothy; Hero, Alfred O.; Lucas, Joseph; Huang, Yongsheng; Turner, Ronald; Gilbert, Anthony; Lambkin-Williams, Robert; Øien, N. Christine; Nicholson, Bradly; Kingsmore, Stephen; Carin, Lawrence; Woods, Christopher W.; Ginsburg, Geoffrey S.
2010-01-01
Summary Acute respiratory infections (ARI) are a common reason for seeking medical attention and the threat of pandemic influenza will likely add to these numbers. Using human viral challenge studies with live rhinovirus, respiratory syncytial virus, and influenza A, we developed peripheral blood gene expression signatures that distinguish individuals with symptomatic ARI from uninfected individuals with > 95% accuracy. We validated this “acute respiratory viral” signature - encompassing genes with a known role in host defense against viral infections - across each viral challenge. We also validated the signature in an independently acquired dataset for influenza A and classified infected individuals from healthy controls with 100% accuracy. In the same dataset, we could also distinguish viral from bacterial ARIs (93% accuracy). These results demonstrate that ARIs induce changes in human peripheral blood gene expression that can be used to diagnose a viral etiology of respiratory infection and triage symptomatic individuals. PMID:19664979
Haitsma, Jack J.; Furmli, Suleiman; Masoom, Hussain; Liu, Mingyao; Imai, Yumiko; Slutsky, Arthur S.; Beyene, Joseph; Greenwood, Celia M. T.; dos Santos, Claudia
2012-01-01
Objectives To perform a meta-analysis of gene expression microarray data from animal studies of lung injury, and to identify an injury-specific gene expression signature capable of predicting the development of lung injury in humans. Methods We performed a microarray meta-analysis using 77 microarray chips across six platforms, two species and different animal lung injury models exposed to lung injury with or/and without mechanical ventilation. Individual gene chips were classified and grouped based on the strategy used to induce lung injury. Effect size (change in gene expression) was calculated between non-injurious and injurious conditions comparing two main strategies to pool chips: (1) one-hit and (2) two-hit lung injury models. A random effects model was used to integrate individual effect sizes calculated from each experiment. Classification models were built using the gene expression signatures generated by the meta-analysis to predict the development of lung injury in human lung transplant recipients. Results Two injury-specific lists of differentially expressed genes generated from our meta-analysis of lung injury models were validated using external data sets and prospective data from animal models of ventilator-induced lung injury (VILI). Pathway analysis of gene sets revealed that both new and previously implicated VILI-related pathways are enriched with differentially regulated genes. Classification model based on gene expression signatures identified in animal models of lung injury predicted development of primary graft failure (PGF) in lung transplant recipients with larger than 80% accuracy based upon injury profiles from transplant donors. We also found that better classifier performance can be achieved by using meta-analysis to identify differentially-expressed genes than using single study-based differential analysis. Conclusion Taken together, our data suggests that microarray analysis of gene expression data allows for the detection of “injury" gene predictors that can classify lung injury samples and identify patients at risk for clinically relevant lung injury complications. PMID:23071521
Blood-Based Gene Expression Signatures of Infants and Toddlers with Autism
ERIC Educational Resources Information Center
Glatt, Stephen J.; Tsuang, Ming T.; Winn, Mary; Chandler, Sharon D.; Collins, Melanie; Lopez, Linda; Weinfeld, Melanie; Carter, Cindy; Schork, Nicholas; Pierce, Karen; Courchesne, Eric
2012-01-01
Objective: Autism spectrum disorders (ASDs) are highly heritable neurodevelopmental disorders that onset clinically during the first years of life. ASD risk biomarkers expressed early in life could significantly impact diagnosis and treatment, but no transcriptome-wide biomarker classifiers derived from fresh blood samples from children with…
Clinical Value of Prognosis Gene Expression Signatures in Colorectal Cancer: A Systematic Review
Cordero, David; Riccadonna, Samantha; Solé, Xavier; Crous-Bou, Marta; Guinó, Elisabet; Sanjuan, Xavier; Biondo, Sebastiano; Soriano, Antonio; Jurman, Giuseppe; Capella, Gabriel; Furlanello, Cesare; Moreno, Victor
2012-01-01
Introduction The traditional staging system is inadequate to identify those patients with stage II colorectal cancer (CRC) at high risk of recurrence or with stage III CRC at low risk. A number of gene expression signatures to predict CRC prognosis have been proposed, but none is routinely used in the clinic. The aim of this work was to assess the prediction ability and potential clinical usefulness of these signatures in a series of independent datasets. Methods A literature review identified 31 gene expression signatures that used gene expression data to predict prognosis in CRC tissue. The search was based on the PubMed database and was restricted to papers published from January 2004 to December 2011. Eleven CRC gene expression datasets with outcome information were identified and downloaded from public repositories. Random Forest classifier was used to build predictors from the gene lists. Matthews correlation coefficient was chosen as a measure of classification accuracy and its associated p-value was used to assess association with prognosis. For clinical usefulness evaluation, positive and negative post-tests probabilities were computed in stage II and III samples. Results Five gene signatures showed significant association with prognosis and provided reasonable prediction accuracy in their own training datasets. Nevertheless, all signatures showed low reproducibility in independent data. Stratified analyses by stage or microsatellite instability status showed significant association but limited discrimination ability, especially in stage II tumors. From a clinical perspective, the most predictive signatures showed a minor but significant improvement over the classical staging system. Conclusions The published signatures show low prediction accuracy but moderate clinical usefulness. Although gene expression data may inform prognosis, better strategies for signature validation are needed to encourage their widespread use in the clinic. PMID:23145004
Classification of a large microarray data set: Algorithm comparison and analysis of drug signatures
Natsoulis, Georges; El Ghaoui, Laurent; Lanckriet, Gert R.G.; Tolley, Alexander M.; Leroy, Fabrice; Dunlea, Shane; Eynon, Barrett P.; Pearson, Cecelia I.; Tugendreich, Stuart; Jarnagin, Kurt
2005-01-01
A large gene expression database has been produced that characterizes the gene expression and physiological effects of hundreds of approved and withdrawn drugs, toxicants, and biochemical standards in various organs of live rats. In order to derive useful biological knowledge from this large database, a variety of supervised classification algorithms were compared using a 597-microarray subset of the data. Our studies show that several types of linear classifiers based on Support Vector Machines (SVMs) and Logistic Regression can be used to derive readily interpretable drug signatures with high classification performance. Both methods can be tuned to produce classifiers of drug treatments in the form of short, weighted gene lists which upon analysis reveal that some of the signature genes have a positive contribution (act as “rewards” for the class-of-interest) while others have a negative contribution (act as “penalties”) to the classification decision. The combination of reward and penalty genes enhances performance by keeping the number of false positive treatments low. The results of these algorithms are combined with feature selection techniques that further reduce the length of the drug signatures, an important step towards the development of useful diagnostic biomarkers and low-cost assays. Multiple signatures with no genes in common can be generated for the same classification end-point. Comparison of these gene lists identifies biological processes characteristic of a given class. PMID:15867433
Ensemble stump classifiers and gene expression signatures in lung cancer.
Frey, Lewis; Edgerton, Mary; Fisher, Douglas; Levy, Shawn
2007-01-01
Microarray data sets for cancer tumor tissue generally have very few samples, each sample having thousands of probes (i.e., continuous variables). The sparsity of samples makes it difficult for machine learning techniques to discover probes relevant to the classification of tumor tissue. By combining data from different platforms (i.e., data sources), data sparsity is reduced, but this typically requires normalizing data from the different platforms, which can be non-trivial. This paper proposes a variant on the idea of ensemble learners to circumvent the need for normalization. To facilitate comprehension we build ensembles of very simple classifiers known as decision stumps--decision trees of one test each. The Ensemble Stump Classifier (ESC) identifies an mRNA signature having three probes and high accuracy for distinguishing between adenocarcinoma and squamous cell carcinoma of the lung across four data sets. In terms of accuracy, ESC outperforms a decision tree classifier on all four data sets, outperforms ensemble decision trees on three data sets, and simple stump classifiers on two data sets.
Comparative analyses identify molecular signature of MRI-classified SVZ-associated glioblastoma
Lin, Chin-Hsing Annie; Rhodes, Christopher T.; Lin, ChenWei; Phillips, Joanna J.; Berger, Mitchel S.
2017-01-01
ABSTRACT Glioblastoma (GBM) is a highly aggressive brain cancer with limited therapeutic options. While efforts to identify genes responsible for GBM have revealed mutations and aberrant gene expression associated with distinct types of GBM, patients with GBM are often diagnosed and classified based on MRI features. Therefore, we seek to identify molecular representatives in parallel with MRI classification for group I and group II primary GBM associated with the subventricular zone (SVZ). As group I and II GBM contain stem-like signature, we compared gene expression profiles between these 2 groups of primary GBM and endogenous neural stem progenitor cells to reveal dysregulation of cell cycle, chromatin status, cellular morphogenesis, and signaling pathways in these 2 types of MRI-classified GBM. In the absence of IDH mutation, several genes associated with metabolism are differentially expressed in these subtypes of primary GBM, implicating metabolic reprogramming occurs in tumor microenvironment. Furthermore, histone lysine methyltransferase EZH2 was upregulated while histone lysine demethylases KDM2 and KDM4 were downregulated in both group I and II primary GBM. Lastly, we identified 9 common genes across large data sets of gene expression profiles among MRI-classified group I/II GBM, a large cohort of GBM subtypes from TCGA, and glioma stem cells by unsupervised clustering comparison. These commonly upregulated genes have known functions in cell cycle, centromere assembly, chromosome segregation, and mitotic progression. Our findings highlight altered expression of genes important in chromosome integrity across all GBM, suggesting a common mechanism of disrupted fidelity of chromosome structure in GBM. PMID:28278055
Hinchcliff, Monique; Huang, Chiang-Ching; Wood, Tammara A.; Mahoney, J. Matthew; Martyanov, Viktor; Bhattacharyya, Swati; Tamaki, Zenshiro; Lee, Jungwha; Carns, Mary; Podlusky, Sofia; Sirajuddin, Arlene; Shah, Sanjiv J; Chang, Rowland W.; Lafyatis, Robert; Varga, John; Whitfield, Michael L.
2013-01-01
Heterogeneity in systemic sclerosis/SSc confounds clinical trials. We previously identified ‘intrinsic’ gene expression subsets by analysis of SSc skin. Here we test the hypotheses that skin gene expression signatures including intrinsic subset are associated with skin score/MRSS improvement during mycophenolate mofetil (MMF) treatment. Gene expression and intrinsic subset assignment were measured in 12 SSc patients’ biopsies and ten controls at baseline, and from serial biopsies of one cyclophosphamide-treated patient, and nine MMF-treated patients. Gene expression changes during treatment were determined using paired t-tests corrected for multiple hypothesis testing. MRSS improved in four of seven MMF-treated patients classified as the inflammatory intrinsic subset. Three patients without MRSS improvement were classified as normal-like or fibroproliferative intrinsic subsets. 321 genes (FDR <5%) were differentially expressed at baseline between patients with and without MRSS improvement during treatment. Expression of 571 genes (FDR <10%) changed between pre- and post-MMF treatment biopsies for patients demonstrating MRSS improvement. Gene expression changes in skin are only seen in patients with MRSS improvement. Baseline gene expression in skin, including intrinsic subset assignment, may identify SSc patients whose MRSS will improve during MMF treatment, suggesting that gene expression in skin may allow targeted treatment in SSc. PMID:23677167
2009-01-01
Background Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for the observed discrepancies are the measurement error associated with each feature and the choice of preprocessing method. Microarray data are known to be subject to technical variation and the confidence intervals around individual point estimates of expression levels can be wide. Furthermore, the estimated expression values also vary depending on the selected preprocessing scheme. In microarray breast cancer classification studies, however, these two forms of feature variability are almost always ignored and hence their exact role is unclear. Results We have performed a comprehensive sensitivity analysis of microarray breast cancer classification under the two types of feature variability mentioned above. We used data from six state of the art preprocessing methods, using a compendium consisting of eight diferent datasets, involving 1131 hybridizations, containing data from both one and two-color array technology. For a wide range of classifiers, we performed a joint study on performance, concordance and stability. In the stability analysis we explicitly tested classifiers for their noise tolerance by using perturbed expression profiles that are based on uncertainty information directly related to the preprocessing methods. Our results indicate that signature composition is strongly influenced by feature variability, even if the array platform and the stratification of patient samples are identical. In addition, we show that there is often a high level of discordance between individual class assignments for signatures constructed on data coming from different preprocessing schemes, even if the actual signature composition is identical. Conclusion Feature variability can have a strong impact on breast cancer signature composition, as well as the classification of individual patient samples. We therefore strongly recommend that feature variability is considered in analyzing data from microarray breast cancer expression profiling experiments. PMID:19941644
Microarray analysis of miRNA expression profiles following whole body irradiation in a mouse model.
Aryankalayil, Molykutty J; Chopra, Sunita; Makinde, Adeola; Eke, Iris; Levin, Joel; Shankavaram, Uma; MacMillan, Laurel; Vanpouille-Box, Claire; Demaria, Sandra; Coleman, C Norman
2018-06-19
Accidental exposure to life-threatening radiation in a nuclear event is a major concern; there is an enormous need for identifying biomarkers for radiation biodosimetry to triage populations and treat critically exposed individuals. To identify dose-differentiating miRNA signatures from whole blood samples of whole body irradiated mice. Mice were whole body irradiated with X-rays (2 Gy-15 Gy); blood was collected at various time-points post-exposure; total RNA was isolated; miRNA microarrays were performed; miRNAs differentially expressed in irradiated vs. unirradiated controls were identified; feature extraction and classification models were applied to predict dose-differentiating miRNA signature. We observed a time and dose responsive alteration in the expression levels of miRNAs. Maximum number of miRNAs were altered at 24-h and 48-h time-points post-irradiation. A 23-miRNA signature was identified using feature selection algorithms and classifier models. An inverse correlation in the expression level changes of miR-17 members, and their targets were observed in whole body irradiated mice and non-human primates. Whole blood-based miRNA expression signatures might be used for predicting radiation exposures in a mass casualty nuclear incident.
Gao, Jianyong; Tian, Gang; Han, Xu; Zhu, Qiang
2018-01-01
Oral squamous cell carcinoma (OSCC) is the sixth most common type cancer worldwide, with poor prognosis. The present study aimed to identify gene signatures that could classify OSCC and predict prognosis in different stages. A training data set (GSE41613) and two validation data sets (GSE42743 and GSE26549) were acquired from the online Gene Expression Omnibus database. In the training data set, patients were classified based on the tumor-node-metastasis staging system, and subsequently grouped into low stage (L) or high stage (H). Signature genes between L and H stages were selected by disparity index analysis, and classification was performed by the expression of these signature genes. The established classification was compared with the L and H classification, and fivefold cross validation was used to evaluate the stability. Enrichment analysis for the signature genes was implemented by the Database for Annotation, Visualization and Integration Discovery. Two validation data sets were used to determine the precise of classification. Survival analysis was conducted followed each classification using the package ‘survival’ in R software. A set of 24 signature genes was identified based on the classification model with the Fi value of 0.47, which was used to distinguish OSCC samples in two different stages. Overall survival of patients in the H stage was higher than those in the L stage. Signature genes were primarily enriched in ‘ether lipid metabolism’ pathway and biological processes such as ‘positive regulation of adaptive immune response’ and ‘apoptotic cell clearance’. The results provided a novel 24-gene set that may be used as biomarkers to predict OSCC prognosis with high accuracy, which may be used to determine an appropriate treatment program for patients with OSCC in addition to the traditional evaluation index. PMID:29257303
Chen, Wanghao; Yu, Qiang; Chen, Bo; Lu, Xingyu; Li, Qiaoyu
2016-08-16
Glioma is often diagnosed at a later stage, and the high risk of recurrence remains a major challenge. We hypothesized that the microRNA expression profile may serve as a biomarker for the prognosis and prediction of glioblastoma recurrence. We defined microRNAs that were associated with good and poor prognosis in 300 specimens of glioblastoma from the Cancer Genome Atlas. By analyzing microarray gene expression data and clinical information from three random groups, we identified 7 microRNAs that have prognostic and prognostic accuracy: microRNA-124a, microRNA-129, microRNA-139, microRNA-15b, microRNA-21, microRNA-218 and microRNA-7. The differential expression of these miRNAs was verified using an independent set of glioma samples from the Affiliated People's Hospital of Jiangsu University. We used the log-rank test and the Kaplan-Meier method to estimate correlations between the miRNA signature and disease-free survival/overall survival. Using the LASSO model, we observed a uniform significant difference in disease-free survival and overall survival between patients with high-risk and low-risk miRNA signature scores. Furthermore, the prognostic capability of the seven-miRNA signature was demonstrated by receiver operator characteristic curve analysis. A Circos plot was generated to examine the network of genes and pathways predicted to be targeted by the seven-miRNA signature. The seven-miRNA-based classifier should be useful in the stratification and individualized management of patients with glioma.
A Host-Based RT-PCR Gene Expression Signature to Identify Acute Respiratory Viral Infection
Zaas, Aimee K.; Burke, Thomas; Chen, Minhua; McClain, Micah; Nicholson, Bradly; Veldman, Timothy; Tsalik, Ephraim L.; Fowler, Vance; Rivers, Emanuel P.; Otero, Ronny; Kingsmore, Stephen F.; Voora, Deepak; Lucas, Joseph; Hero, Alfred O.; Carin, Lawrence; Woods, Christopher W.; Ginsburg, Geoffrey S.
2014-01-01
Improved ways to diagnose acute respiratory viral infections could decrease inappropriate antibacterial use and serve as a vital triage mechanism in the event of a potential viral pandemic. Measurement of the host response to infection is an alternative to pathogen-based diagnostic testing and may improve diagnostic accuracy. We have developed a host-based assay with a reverse transcription polymerase chain reaction (RT-PCR) TaqMan low-density array (TLDA) platform for classifying respiratory viral infection. We developed the assay using two cohorts experimentally infected with influenza A H3N2/Wisconsin or influenza A H1N1/Brisbane, and validated the assay in a sample of adults presenting to the emergency department with fever (n = 102) and in healthy volunteers (n = 41). Peripheral blood RNA samples were obtained from individuals who underwent experimental viral challenge or who presented to the emergency department and had microbiologically proven viral respiratory infection or systemic bacterial infection. The selected gene set on the RT-PCR TLDA assay classified participants with experimentally induced influenza H3N2 and H1N1 infection with 100 and 87% accuracy, respectively. We validated this host gene expression signature in a cohort of 102 individuals arriving at the emergency department. The sensitivity of the RT-PCR test was 89% [95% confidence interval (CI), 72 to 98%], and the specificity was 94% (95% CI, 86 to 99%). These results show that RT-PCR–based detection of a host gene expression signature can classify individuals with respiratory viral infection and sets the stage for prospective evaluation of this diagnostic approach in a clinical setting. PMID:24048524
Target recognition based on the moment functions of radar signatures
NASA Astrophysics Data System (ADS)
Kim, Kyung-Tae; Kim, Hyo-Tae
2002-03-01
In this paper, we present the results of target recognition research based on the moment functions of various radar signatures, such as time-frequency signatures, range profiles, and scattering centers. The proposed approach utilizes geometrical moments or central moments of the obtained radar signatures. In particular, we derived exact and closed form expressions of the geometrical moments of the adaptive Gaussian representation (AGR), which is one of the adaptive joint time-frequency techniques, and also computed the central moments of range profiles and one-dimensional (1-D) scattering centers on a target, which are obtained by various super-resolution techniques. The obtained moment functions are further processed to provide small dimensional and redundancy-free feature vectors, and classified via a neural network approach or a Bayes classifier. The performances of the proposed technique are demonstrated using a simulated radar cross section (RCS) data set, or a measured RCS data set of various scaled aircraft models, obtained at the Pohang University of Science and Technology (POSTECH) compact range facility. Results show that the techniques in this paper can not only provide reliable classification accuracy, but also save computational resources.
matK-QR classifier: a patterns based approach for plant species identification.
More, Ravi Prabhakar; Mane, Rupali Chandrashekhar; Purohit, Hemant J
2016-01-01
DNA barcoding is widely used and most efficient approach that facilitates rapid and accurate identification of plant species based on the short standardized segment of the genome. The nucleotide sequences of maturaseK ( matK ) and ribulose-1, 5-bisphosphate carboxylase ( rbcL ) marker loci are commonly used in plant species identification. Here, we present a new and highly efficient approach for identifying a unique set of discriminating nucleotide patterns to generate a signature (i.e. regular expression) for plant species identification. In order to generate molecular signatures, we used matK and rbcL loci datasets, which encompass 125 plant species in 52 genera reported by the CBOL plant working group. Initially, we performed Multiple Sequence Alignment (MSA) of all species followed by Position Specific Scoring Matrix (PSSM) for both loci to achieve a percentage of discrimination among species. Further, we detected Discriminating Patterns (DP) at genus and species level using PSSM for the matK dataset. Combining DP and consecutive pattern distances, we generated molecular signatures for each species. Finally, we performed a comparative assessment of these signatures with the existing methods including BLASTn, Support Vector Machines (SVM), Jrip-RIPPER, J48 (C4.5 algorithm), and the Naïve Bayes (NB) methods against NCBI-GenBank matK dataset. Due to the higher discrimination success obtained with the matK as compared to the rbcL , we selected matK gene for signature generation. We generated signatures for 60 species based on identified discriminating patterns at genus and species level. Our comparative assessment results suggest that a total of 46 out of 60 species could be correctly identified using generated signatures, followed by BLASTn (34 species), SVM (18 species), C4.5 (7 species), NB (4 species) and RIPPER (3 species) methods As a final outcome of this study, we converted signatures into QR codes and developed a software matK -QR Classifier (http://www.neeri.res.in/matk_classifier/index.htm), which search signatures in the query matK gene sequences and predict corresponding plant species. This novel approach of employing pattern-based signatures opens new avenues for the classification of species. In addition to existing methods, we believe that matK -QR Classifier would be a valuable tool for molecular taxonomists enabling precise identification of plant species.
The purpose of this study was to develop a method of classifying cancers to specific diagnostic categories based on their gene expression signatures using artificial neural networks (ANNs). We trained the ANNs using the small, round blue-cell tumors (SRBCTs) as a model. These cancers belong to four distinct diagnostic categories and often present diagnostic dilemmas in
Lam, Lucia L.; Ghadessi, Mercedeh; Erho, Nicholas; Vergara, Ismael A.; Alshalalfa, Mohammed; Buerki, Christine; Haddad, Zaid; Sierocinski, Thomas; Triche, Timothy J.; Skinner, Eila C.; Davicioni, Elai; Daneshmand, Siamak; Black, Peter C.
2014-01-01
Background Nearly half of muscle-invasive bladder cancer patients succumb to their disease following cystectomy. Selecting candidates for adjuvant therapy is currently based on clinical parameters with limited predictive power. This study aimed to develop and validate genomic-based signatures that can better identify patients at risk for recurrence than clinical models alone. Methods Transcriptome-wide expression profiles were generated using 1.4 million feature-arrays on archival tumors from 225 patients who underwent radical cystectomy and had muscle-invasive and/or node-positive bladder cancer. Genomic (GC) and clinical (CC) classifiers for predicting recurrence were developed on a discovery set (n = 133). Performances of GC, CC, an independent clinical nomogram (IBCNC), and genomic-clinicopathologic classifiers (G-CC, G-IBCNC) were assessed in the discovery and independent validation (n = 66) sets. GC was further validated on four external datasets (n = 341). Discrimination and prognostic abilities of classifiers were compared using area under receiver-operating characteristic curves (AUCs). All statistical tests were two-sided. Results A 15-feature GC was developed on the discovery set with area under curve (AUC) of 0.77 in the validation set. This was higher than individual clinical variables, IBCNC (AUC = 0.73), and comparable to CC (AUC = 0.78). Performance was improved upon combining GC with clinical nomograms (G-IBCNC, AUC = 0.82; G-CC, AUC = 0.86). G-CC high-risk patients had elevated recurrence probabilities (P < .001), with GC being the best predictor by multivariable analysis (P = .005). Genomic-clinicopathologic classifiers outperformed clinical nomograms by decision curve and reclassification analyses. GC performed the best in validation compared with seven prior signatures. GC markers remained prognostic across four independent datasets. Conclusions The validated genomic-based classifiers outperform clinical models for predicting postcystectomy bladder cancer recurrence. This may be used to better identify patients who need more aggressive management. PMID:25344601
Forreryd, Andy; Johansson, Henrik; Albrekt, Ann-Sofie; Lindstedt, Malin
2014-05-16
Allergic contact dermatitis (ACD) develops upon exposure to certain chemical compounds termed skin sensitizers. To reduce the occurrence of skin sensitizers, chemicals are regularly screened for their capacity to induce sensitization. The recently developed Genomic Allergen Rapid Detection (GARD) assay is an in vitro alternative to animal testing for identification of skin sensitizers, classifying chemicals by evaluating transcriptional levels of a genomic biomarker signature. During assay development and biomarker identification, genome-wide expression analysis was applied using microarrays covering approximately 30,000 transcripts. However, the microarray platform suffers from drawbacks in terms of low sample throughput, high cost per sample and time consuming protocols and is a limiting factor for adaption of GARD into a routine assay for screening of potential sensitizers. With the purpose to simplify assay procedures, improve technical parameters and increase sample throughput, we assessed the performance of three high throughput gene expression platforms--nCounter®, BioMark HD™ and OpenArray®--and correlated their performance metrics against our previously generated microarray data. We measured the levels of 30 transcripts from the GARD biomarker signature across 48 samples. Detection sensitivity, reproducibility, correlations and overall structure of gene expression measurements were compared across platforms. Gene expression data from all of the evaluated platforms could be used to classify most of the sensitizers from non-sensitizers in the GARD assay. Results also showed high data quality and acceptable reproducibility for all platforms but only medium to poor correlations of expression measurements across platforms. In addition, evaluated platforms were superior to the microarray platform in terms of cost efficiency, simplicity of protocols and sample throughput. We evaluated the performance of three non-array based platforms using a limited set of transcripts from the GARD biomarker signature. We demonstrated that it was possible to achieve acceptable discriminatory power in terms of separation between sensitizers and non-sensitizers in the GARD assay while reducing assay costs, simplify assay procedures and increase sample throughput by using an alternative platform, providing a first step towards the goal to prepare GARD for formal validation and adaption of the assay for industrial screening of potential sensitizers.
Shailes, Hannah; Eleftherohorinou, Hariklia; Hoggart, Clive J; Cebey-Lopez, Miriam; Carter, Michael J; Janes, Victoria A; Gormley, Stuart; Shimizu, Chisato; Tremoulet, Adriana H; Barendregt, Anouk M; Salas, Antonio; Kanegaye, John; Pollard, Andrew J; Faust, Saul N; Patel, Sanjay; Kuijpers, Taco; Martinon-Torres, Federico; Burns, Jane C; Coin, Lachlan JM; Levin, Michael
2018-01-01
Importance As clinical features do not reliably distinguish bacterial from viral infection, many children worldwide receive unnecessary antibiotic treatment whilst bacterial infection is missed in others. Objective To identify a blood RNA expression signature that distinguishes bacterial from viral infection in febrile children. Design Febrile children presenting to participating hospitals in UK, Spain, Netherlands and USA between 2009-2013 were prospectively recruited, comprising a discovery group and validation group. Each group was classified after microbiological investigation into definite bacterial, definite viral infection or indeterminate infection. RNA expression signatures distinguishing definite bacterial from viral infection were identified in the discovery group and diagnostic performance assessed in the validation group. Additional validation was undertaken in separate studies of children with meningococcal disease (n=24) inflammatory diseases (n=48), and on published gene expression datasets. Exposures A 2-transcript RNA expression signature distinguishing bacterial infection from viral infection was evaluated against clinical and microbiological diagnosis. Main Outcomes Definite Bacterial and viral infection was confirmed by culture or molecular detection of the pathogens. Performance of the RNA signature was evaluated in the definite bacterial and viral group, and the indeterminate group. Results The discovery cohort of 240 children (median age 19 months, 62% males) included 52 with definite bacterial infection of whom 36 (69%) required intensive care; and 92 with definite viral infection of whom 32 (35%) required intensive care. 96 children had indeterminate infection. Bioinformatic analysis of RNA expression data identified a 38-transcript signature distinguishing bacterial from viral infection. A smaller (2-transcript) signature (FAM89A and IFI44L) was identified by removing highly correlated transcripts. When this 2-transcript signature was implemented as a Disease Risk Score in the validation group (130 children, including 23 bacterial, 28 viral, 79 indeterminate; median age 17 months, 57% males), bacterial infection was identified in all 23 microbiologically-confirmed definite bacterial patients, with a sensitivity of 100% (95% confidence interval [CI], 100 - 100), and in 1 of 28 definite viral patients, with specificity of 96.4% (95% CI, 89.3 – 100). When applied to additional validation datasets from patients with meningococcal and inflammatory diseases, bacterial infection was identified with a sensitivity of 91.7% (79.2-100) and 90.0% (70.0-100) respectively, and with specificity of 96.0% (88.0-100) and 95.8% (89.6-100). A minority of children in the indeterminate group were classified as having bacterial infection (63 of 136, 46.3%), although most received antibiotic treatment (129 of 136, 94.9%). Conclusions and Relevance This study provides preliminary data regarding test accuracy of a 2-transcript host RNA signature discriminating bacterial from viral infection in febrile children. Further studies are needed in diverse groups of patients to assess accuracy and clinical utility of this test in different clinical settings. PMID:27552617
Mertens, Tinne C J; van der Does, Anne M; Kistemaker, Loes E; Ninaber, Dennis K; Taube, Christian; Hiemstra, Pieter S
2017-07-01
Allergic airways inflammation in asthma is characterized by an airway epithelial gene signature composed of POSTN , CLCA1 , and SERPINB2 This Th2 gene signature is proposed as a tool to classify patients with asthma into Th2-high and Th2-low phenotypes. However, many asthmatics smoke and the effects of cigarette smoke exposure on the epithelial Th2 gene signature are largely unknown. Therefore, we investigated the combined effect of IL-13 and whole cigarette smoke (CS) on the Th2 gene signature and the mucin-related genes MUC5AC and SPDEF in air-liquid interface differentiated human bronchial (ALI-PBEC) and tracheal epithelial cells (ALI-PTEC). Cultures were exposed to IL-13 for 14 days followed by 5 days of IL-13 with CS exposure. Alternatively, cultures were exposed once daily to CS for 14 days, followed by 5 days CS with IL-13. POSTN , SERPINB2 , and CLCA1 expression were measured 24 h after the last exposure to CS and IL-13. In both models POSTN , SERPINB2 , and CLCA1 expression were increased by IL-13. CS markedly affected the IL-13-induced Th2 gene signature as indicated by a reduced POSTN , CLCA1 , and MUC5AC expression in both models. In contrast, IL-13-induced SERPINB2 expression remained unaffected by CS, whereas SPDEF expression was additively increased. Importantly, cessation of CS exposure failed to restore IL-13-induced POSTN and CLCA1 expression. We show for the first time that CS differentially affects the IL-13-induced gene signature for Th2-high asthma. These findings provide novel insights into the interaction between Th2 inflammation and cigarette smoke that is important for asthma pathogenesis and biomarker-guided therapy in asthma. © 2017 The Authors. Physiological Reports published by Wiley Periodicals, Inc. on behalf of The Physiological Society and the American Physiological Society.
Templin, Thomas; Young, Erik F.; Smilenov, Lubomir B.
2013-01-01
Purpose Previously, we showed that microRNA (miRNA) signatures derived from the peripheral blood of mice are highly specific for both radiation energy (γ-rays or high linear energy transfer [LET] 56Fe ions) and radiation dose. Here, we investigate to what extent miRNA expression signatures derived from mouse blood can be used as biomarkers for exposure to 600 MeV proton radiation. Materials and methods We exposed mice to 600 MeV protons, using doses of 0.5 or 1.0 Gy, isolated total RNA at 6 h or 24 h after irradiation, and used quantitative real-time polymerase chain reaction (PCR) to determine the changes in miRNA expression. Results A total of 26 miRNA were differentially expressed after proton irradiation, in either one (77%) or multiple conditions (23%). Statistical classifiers based on proton, γ, and 56Fe-ion miRNA expression signatures predicted radiation type and proton dose with accuracies of 81% and 88%, respectively. Importantly, gene ontology analysis for proton-irradiated cells shows that genes targeted by radiation-induced miRNA are involved in biological processes and molecular functions similar to those controlled by miRNA in γ ray- and 56Fe-irradiated cells. Conclusions Mouse blood miRNA signatures induced by proton, γ, or 56Fe irradiation are radiation type- and dose-specific. These findings underline the complexity of the miRNA-mediated radiation response. PMID:22551419
A novel multi-tissue RNA diagnostic of healthy ageing relates to cognitive health status.
Sood, Sanjana; Gallagher, Iain J; Lunnon, Katie; Rullman, Eric; Keohane, Aoife; Crossland, Hannah; Phillips, Bethan E; Cederholm, Tommy; Jensen, Thomas; van Loon, Luc J C; Lannfelt, Lars; Kraus, William E; Atherton, Philip J; Howard, Robert; Gustafsson, Thomas; Hodges, Angela; Timmons, James A
2015-09-07
Diagnostics of the human ageing process may help predict future healthcare needs or guide preventative measures for tackling diseases of older age. We take a transcriptomics approach to build the first reproducible multi-tissue RNA expression signature by gene-chip profiling tissue from sedentary normal subjects who reached 65 years of age in good health. One hundred and fifty probe-sets form an accurate classifier of young versus older muscle tissue and this healthy ageing RNA classifier performed consistently in independent cohorts of human muscle, skin and brain tissue (n = 594, AUC = 0.83-0.96) and thus represents a biomarker for biological age. Using the Uppsala Longitudinal Study of Adult Men birth-cohort (n = 108) we demonstrate that the RNA classifier is insensitive to confounding lifestyle biomarkers, while greater gene score at age 70 years is independently associated with better renal function at age 82 years and longevity. The gene score is 'up-regulated' in healthy human hippocampus with age, and when applied to blood RNA profiles from two large independent age-matched dementia case-control data sets (n = 717) the healthy controls have significantly greater gene scores than those with cognitive impairment. Alone, or when combined with our previously described prototype Alzheimer disease (AD) RNA 'disease signature', the healthy ageing RNA classifier is diagnostic for AD. We identify a novel and statistically robust multi-tissue RNA signature of human healthy ageing that can act as a diagnostic of future health, using only a peripheral blood sample. This RNA signature has great potential to assist research aimed at finding treatments for and/or management of AD and other ageing-related conditions.
Coustaty, M; Bertet, K; Visani, M; Ogier, J
2011-08-01
In this paper, we propose a new approach for symbol recognition using structural signatures and a Galois lattice as a classifier. The structural signatures are based on topological graphs computed from segments which are extracted from the symbol images by using an adapted Hough transform. These structural signatures-that can be seen as dynamic paths which carry high-level information-are robust toward various transformations. They are classified by using a Galois lattice as a classifier. The performance of the proposed approach is evaluated based on the GREC'03 symbol database, and the experimental results we obtain are encouraging.
A 16-Gene Signature Distinguishes Anaplastic Astrocytoma from Glioblastoma
Rao, Soumya Alige Mahabala; Srinivasan, Sujaya; Patric, Irene Rosita Pia; Hegde, Alangar Sathyaranjandas; Chandramouli, Bangalore Ashwathnarayanara; Arimappamagan, Arivazhagan; Santosh, Vani; Kondaiah, Paturu; Rao, Manchanahalli R. Sathyanarayana; Somasundaram, Kumaravel
2014-01-01
Anaplastic astrocytoma (AA; Grade III) and glioblastoma (GBM; Grade IV) are diffusely infiltrating tumors and are called malignant astrocytomas. The treatment regimen and prognosis are distinctly different between anaplastic astrocytoma and glioblastoma patients. Although histopathology based current grading system is well accepted and largely reproducible, intratumoral histologic variations often lead to difficulties in classification of malignant astrocytoma samples. In order to obtain a more robust molecular classifier, we analysed RT-qPCR expression data of 175 differentially regulated genes across astrocytoma using Prediction Analysis of Microarrays (PAM) and found the most discriminatory 16-gene expression signature for the classification of anaplastic astrocytoma and glioblastoma. The 16-gene signature obtained in the training set was validated in the test set with diagnostic accuracy of 89%. Additionally, validation of the 16-gene signature in multiple independent cohorts revealed that the signature predicted anaplastic astrocytoma and glioblastoma samples with accuracy rates of 99%, 88%, and 92% in TCGA, GSE1993 and GSE4422 datasets, respectively. The protein-protein interaction network and pathway analysis suggested that the 16-genes of the signature identified epithelial-mesenchymal transition (EMT) pathway as the most differentially regulated pathway in glioblastoma compared to anaplastic astrocytoma. In addition to identifying 16 gene classification signature, we also demonstrated that genes involved in epithelial-mesenchymal transition may play an important role in distinguishing glioblastoma from anaplastic astrocytoma. PMID:24475040
Peripheral Blood Signatures of Lead Exposure
LaBreche, Heather G.; Meadows, Sarah K.; Nevins, Joseph R.; Chute, John P.
2011-01-01
Background Current evidence indicates that even low-level lead (Pb) exposure can have detrimental effects, especially in children. We tested the hypothesis that Pb exposure alters gene expression patterns in peripheral blood cells and that these changes reflect dose-specific alterations in the activity of particular pathways. Methodology/Principal Finding Using Affymetrix Mouse Genome 430 2.0 arrays, we examined gene expression changes in the peripheral blood of female Balb/c mice following exposure to per os lead acetate trihydrate or plain drinking water for two weeks and after a two-week recovery period. Data sets were RMA-normalized and dose-specific signatures were generated using established methods of supervised classification and binary regression. Pathway activity was analyzed using the ScoreSignatures module from GenePattern. Conclusions/Significance The low-level Pb signature was 93% sensitive and 100% specific in classifying samples a leave-one-out crossvalidation. The high-level Pb signature demonstrated 100% sensitivity and specificity in the leave-one-out crossvalidation. These two signatures exhibited dose-specificity in their ability to predict Pb exposure and had little overlap in terms of constituent genes. The signatures also seemed to reflect current levels of Pb exposure rather than past exposure. Finally, the two doses showed differential activation of cellular pathways. Low-level Pb exposure increased activity of the interferon-gamma pathway, whereas high-level Pb exposure increased activity of the E2F1 pathway. PMID:21829687
Identification of a Genomic Signature Predicting for Recurrence in Early Stage Ovarian Cancer
2015-12-01
early stage ovarian cancer to help researchers worldwide identify biomarkers that can aid early detection and inform novel targets for therapy. This...to detect differentially expressed genes after transformation using Voom. When using the top 5 genes to build the classifier, it predicted...to analyze expression of micro-RNA in these samples. Thus, at the end of the third year of funding we started a parallel analysis of RNAseq, DNA- CNV
Release of (and lessons learned from mining) a pioneering large toxicogenomics database.
Sandhu, Komal S; Veeramachaneni, Vamsi; Yao, Xiang; Nie, Alex; Lord, Peter; Amaratunga, Dhammika; McMillian, Michael K; Verheyen, Geert R
2015-07-01
We release the Janssen Toxicogenomics database. This rat liver gene-expression database was generated using Codelink microarrays, and has been used over the past years within Janssen to derive signatures for multiple end points and to classify proprietary compounds. The release consists of gene-expression responses to 124 compounds, selected to give a broad coverage of liver-active compounds. A selection of the compounds were also analyzed on Affymetrix microarrays. The release includes results of an in-house reannotation pipeline to Entrez gene annotations, to classify probes into different confidence classes. High confidence unambiguously annotated probes were used to create gene-level data which served as starting point for cross-platform comparisons. Connectivity map-based similarity methods show excellent agreement between Codelink and Affymetrix runs of the same samples. We also compared our dataset with the Japanese Toxicogenomics Project and observed reasonable agreement, especially for compounds with stronger gene signatures. We describe an R-package containing the gene-level data and show how it can be used for expression-based similarity searches. Comparing the same biological samples run on the Affymetrix and the Codelink platform, good correspondence is observed using connectivity mapping approaches. As expected, this correspondence is smaller when the data are compared with an independent dataset such as TG-GATE. We hope that this collection of gene-expression profiles will be incorporated in toxicogenomics pipelines of users.
Identifying gnostic predictors of the vaccine response.
Haining, W Nicholas; Pulendran, Bali
2012-06-01
Molecular predictors of the response to vaccination could transform vaccine development. They would allow larger numbers of vaccine candidates to be rapidly screened, shortening the development time for new vaccines. Gene-expression based predictors of vaccine response have shown early promise. However, a limitation of gene-expression based predictors is that they often fail to reveal the mechanistic basis of their ability to classify response. Linking predictive signatures to the function of their component genes would advance basic understanding of vaccine immunity and also improve the robustness of vaccine prediction. New analytic tools now allow more biological meaning to be extracted from predictive signatures. Functional genomic approaches to perturb gene expression in mammalian cells permit the function of predictive genes to be surveyed in highly parallel experiments. The challenge for vaccinologists is therefore to use these tools to embed mechanistic insights into predictors of vaccine response. Copyright © 2012 Elsevier Ltd. All rights reserved.
Identifying gnostic predictors of the vaccine response
Haining, W. Nicholas; Pulendran, Bali
2012-01-01
Molecular predictors of the response to vaccination could transform vaccine development. They would allow larger numbers of vaccine candidates to be rapidly screened, shortening the development time for new vaccines. Gene-expression based predictors of vaccine response have shown early promise. However, a limitation of gene-expression based predictors is that they often fail to reveal the mechanistic basis for their ability to classify response. Linking predictive signatures to the function of their component genes would advance basic understanding of vaccine immunity and also improve the robustness of outcome classification. New analytic tools now allow more biological meaning to be extracted from predictive signatures. Functional genomic approaches to perturb gene expression in mammalian cells permit the function of predictive genes to be surveyed in highly parallel experiments. The challenge for vaccinologists is therefore to use these tools to embed mechanistic insights into predictors of vaccine response. PMID:22633886
Gene Expression in Parp1 Deficient Mice Exposed to a Median Lethal Dose of Gamma Rays.
Kumar, M A Suresh; Laiakis, Evagelia C; Ghandhi, Shanaz A; Morton, Shad R; Fornace, Albert J; Amundson, Sally A
2018-05-10
There is a current interest in the development of biodosimetric methods for rapidly assessing radiation exposure in the wake of a large-scale radiological event. This work was initially focused on determining the exposure dose to an individual using biological indicators. Gene expression signatures show promise for biodosimetric application, but little is known about how these signatures might translate for the assessment of radiological injury in radiosensitive individuals, who comprise a significant proportion of the general population, and who would likely require treatment after exposure to lower doses. Using Parp1 -/- mice as a model radiation-sensitive genotype, we have investigated the effect of this DNA repair deficiency on the gene expression response to radiation. Although Parp1 is known to play general roles in regulating transcription, the pattern of gene expression changes observed in Parp1 -/- mice 24 h postirradiation to a LD 50/30 was remarkably similar to that in wild-type mice after exposure to LD 50/30 . Similar levels of activation of both the p53 and NFκB radiation response pathways were indicated in both strains. In contrast, exposure of wild-type mice to a sublethal dose that was equal to the Parp1 -/- LD 50/30 , which resulted in a lower magnitude gene expression response. Thus, Parp1 -/- mice displayed a heightened gene expression response to radiation, which was more similar to the wild-type response to an equitoxic dose than to an equal absorbed dose. Gene expression classifiers trained on the wild-type data correctly identified all wild-type samples as unexposed, exposed to a sublethal dose or exposed to an LD 50/30 . All unexposed samples from Parp1 -/- mice were also correctly classified with the same gene set, and 80% of irradiated Parp1 -/- samples were identified as exposed to an LD 50/30 . The results of this study suggest that, at least for some pathways that may influence radiosensitivity in humans, specific gene expression signatures have the potential to accurately detect the extent of radiological injury, rather than serving only as a surrogate of physical radiation dose.
Detours, V; Delys, L; Libert, F; Weiss Solís, D; Bogdanova, T; Dumont, J E; Franc, B; Thomas, G; Maenhaut, C
2007-01-01
Papillary thyroid cancers (PTCs) incidence dramatically increased in the vicinity of Chernobyl. The cancer-initiating role of radiation elsewhere is debated. Therefore, we searched for a signature distinguishing radio-induced from sporadic cancers. Using microarrays, we compared the expression profiles of PTCs from the Chernobyl Tissue Bank (CTB, n=12) and from French patients with no history of exposure to ionising radiations (n=14). We also compared the transcriptional responses of human lymphocytes to the presumed aetiological agents initiating these tumours, γ-radiation and H2O2. On a global scale, the transcriptomes of CTB and French tumours are indistinguishable, and the transcriptional responses to γ-radiation and H2O2 are similar. On a finer scale, a 118 genes signature discriminated the γ-radiation and H2O2 responses. This signature could be used to classify the tumours as CTB or French with an error of 15–27%. Similar results were obtained with an independent signature of 13 genes involved in homologous recombination. Although sporadic and radio-induced PTCs represent the same disease, they are distinguishable with molecular signatures reflecting specific responses to γ-radiation and H2O2. These signatures in PTCs could reflect the susceptibility profiles of the patients, suggesting the feasibility of a radiation susceptibility test. PMID:17712314
A 15-gene signature for prediction of colon cancer recurrence and prognosis based on SVM.
Xu, Guangru; Zhang, Minghui; Zhu, Hongxing; Xu, Jinhua
2017-03-10
To screen the gene signature for distinguishing patients with high risks from those with low-risks for colon cancer recurrence and predicting their prognosis. Five microarray datasets of colon cancer samples were collected from Gene Expression Omnibus database and one was obtained from The Cancer Genome Atlas (TCGA). After preprocessing, data in GSE17537 were analyzed using the Linear Models for Microarray data (LIMMA) method to identify the differentially expressed genes (DEGs). The DEGs further underwent PPI network-based neighborhood scoring and support vector machine (SVM) analyses to screen the feature genes associated with recurrence and prognosis, which were then validated by four datasets GSE38832, GSE17538, GSE28814 and TCGA using SVM and Cox regression analyses. A total of 1207 genes were identified as DEGs between recurrence and no-recurrence samples, including 726 downregulated and 481 upregulated genes. Using SVM analysis and five gene expression profile data confirmation, a 15-gene signature (HES5, ZNF417, GLRA2, OR8D2, HOXA7, FABP6, MUSK, HTR6, GRIP2, KLRK1, VEGFA, AKAP12, RHEB, NCRNA00152 and PMEPA1) were identified as a predictor of recurrence risk and prognosis for colon cancer patients. Our identified 15-gene signature may be useful to classify colon cancer patients with different prognosis and some genes in this signature may represent new therapeutic targets. Copyright © 2016. Published by Elsevier B.V.
Automated defect spatial signature analysis for semiconductor manufacturing process
Tobin, Jr., Kenneth W.; Gleason, Shaun S.; Karnowski, Thomas P.; Sari-Sarraf, Hamed
1999-01-01
An apparatus and method for performing automated defect spatial signature alysis on a data set representing defect coordinates and wafer processing information includes categorizing data from the data set into a plurality of high level categories, classifying the categorized data contained in each high level category into user-labeled signature events, and correlating the categorized, classified signature events to a present or incipient anomalous process condition.
Combining Gene Signatures Improves Prediction of Breast Cancer Survival
Zhao, Xi; Naume, Bjørn; Langerød, Anita; Frigessi, Arnoldo; Kristensen, Vessela N.; Børresen-Dale, Anne-Lise; Lingjærde, Ole Christian
2011-01-01
Background Several gene sets for prediction of breast cancer survival have been derived from whole-genome mRNA expression profiles. Here, we develop a statistical framework to explore whether combination of the information from such sets may improve prediction of recurrence and breast cancer specific death in early-stage breast cancers. Microarray data from two clinically similar cohorts of breast cancer patients are used as training (n = 123) and test set (n = 81), respectively. Gene sets from eleven previously published gene signatures are included in the study. Principal Findings To investigate the relationship between breast cancer survival and gene expression on a particular gene set, a Cox proportional hazards model is applied using partial likelihood regression with an L2 penalty to avoid overfitting and using cross-validation to determine the penalty weight. The fitted models are applied to an independent test set to obtain a predicted risk for each individual and each gene set. Hierarchical clustering of the test individuals on the basis of the vector of predicted risks results in two clusters with distinct clinical characteristics in terms of the distribution of molecular subtypes, ER, PR status, TP53 mutation status and histological grade category, and associated with significantly different survival probabilities (recurrence: p = 0.005; breast cancer death: p = 0.014). Finally, principal components analysis of the gene signatures is used to derive combined predictors used to fit a new Cox model. This model classifies test individuals into two risk groups with distinct survival characteristics (recurrence: p = 0.003; breast cancer death: p = 0.001). The latter classifier outperforms all the individual gene signatures, as well as Cox models based on traditional clinical parameters and the Adjuvant! Online for survival prediction. Conclusion Combining the predictive strength of multiple gene signatures improves prediction of breast cancer survival. The presented methodology is broadly applicable to breast cancer risk assessment using any new identified gene set. PMID:21423775
Stromal-Based Signatures for the Classification of Gastric Cancer.
Uhlik, Mark T; Liu, Jiangang; Falcon, Beverly L; Iyer, Seema; Stewart, Julie; Celikkaya, Hilal; O'Mahony, Marguerita; Sevinsky, Christopher; Lowes, Christina; Douglass, Larry; Jeffries, Cynthia; Bodenmiller, Diane; Chintharlapalli, Sudhakar; Fischl, Anthony; Gerald, Damien; Xue, Qi; Lee, Jee-Yun; Santamaria-Pang, Alberto; Al-Kofahi, Yousef; Sui, Yunxia; Desai, Keyur; Doman, Thompson; Aggarwal, Amit; Carter, Julia H; Pytowski, Bronislaw; Jaminet, Shou-Ching; Ginty, Fiona; Nasir, Aejaz; Nagy, Janice A; Dvorak, Harold F; Benjamin, Laura E
2016-05-01
Treatment of metastatic gastric cancer typically involves chemotherapy and monoclonal antibodies targeting HER2 (ERBB2) and VEGFR2 (KDR). However, reliable methods to identify patients who would benefit most from a combination of treatment modalities targeting the tumor stroma, including new immunotherapy approaches, are still lacking. Therefore, we integrated a mouse model of stromal activation and gastric cancer genomic information to identify gene expression signatures that may inform treatment strategies. We generated a mouse model in which VEGF-A is expressed via adenovirus, enabling a stromal response marked by immune infiltration and angiogenesis at the injection site, and identified distinct stromal gene expression signatures. With these data, we designed multiplexed IHC assays that were applied to human primary gastric tumors and classified each tumor to a dominant stromal phenotype representative of the vascular and immune diversity found in gastric cancer. We also refined the stromal gene signatures and explored their relation to the dominant patient phenotypes identified by recent large-scale studies of gastric cancer genomics (The Cancer Genome Atlas and Asian Cancer Research Group), revealing four distinct stromal phenotypes. Collectively, these findings suggest that a genomics-based systems approach focused on the tumor stroma can be used to discover putative predictive biomarkers of treatment response, especially to antiangiogenesis agents and immunotherapy, thus offering an opportunity to improve patient stratification. Cancer Res; 76(9); 2573-86. ©2016 AACR. ©2016 American Association for Cancer Research.
MicroRNA signature of the human developing pancreas.
Rosero, Samuel; Bravo-Egana, Valia; Jiang, Zhijie; Khuri, Sawsan; Tsinoremas, Nicholas; Klein, Dagmar; Sabates, Eduardo; Correa-Medina, Mayrin; Ricordi, Camillo; Domínguez-Bendala, Juan; Diez, Juan; Pastori, Ricardo L
2010-09-22
MicroRNAs are non-coding RNAs that regulate gene expression including differentiation and development by either inhibiting translation or inducing target degradation. The aim of this study is to determine the microRNA expression signature during human pancreatic development and to identify potential microRNA gene targets calculating correlations between the signature microRNAs and their corresponding mRNA targets, predicted by bioinformatics, in genome-wide RNA microarray study. The microRNA signature of human fetal pancreatic samples 10-22 weeks of gestational age (wga), was obtained by PCR-based high throughput screening with Taqman Low Density Arrays. This method led to identification of 212 microRNAs. The microRNAs were classified in 3 groups: Group number I contains 4 microRNAs with the increasing profile; II, 35 microRNAs with decreasing profile and III with 173 microRNAs, which remain unchanged. We calculated Pearson correlations between the expression profile of microRNAs and target mRNAs, predicted by TargetScan 5.1 and miRBase algorithms, using genome-wide mRNA expression data. Group I correlated with the decreasing expression of 142 target mRNAs and Group II with the increasing expression of 876 target mRNAs. Most microRNAs correlate with multiple targets, just as mRNAs are targeted by multiple microRNAs. Among the identified targets are the genes and transcription factors known to play an essential role in pancreatic development. We have determined specific groups of microRNAs in human fetal pancreas that change the degree of their expression throughout the development. A negative correlative analysis suggests an intertwined network of microRNAs and mRNAs collaborating with each other. This study provides information leading to potential two-way level of combinatorial control regulating gene expression through microRNAs targeting multiple mRNAs and, conversely, target mRNAs regulated in parallel by other microRNAs as well. This study may further the understanding of gene expression regulation in the human developing pancreas.
MicroRNA signature of the human developing pancreas
2010-01-01
Background MicroRNAs are non-coding RNAs that regulate gene expression including differentiation and development by either inhibiting translation or inducing target degradation. The aim of this study is to determine the microRNA expression signature during human pancreatic development and to identify potential microRNA gene targets calculating correlations between the signature microRNAs and their corresponding mRNA targets, predicted by bioinformatics, in genome-wide RNA microarray study. Results The microRNA signature of human fetal pancreatic samples 10-22 weeks of gestational age (wga), was obtained by PCR-based high throughput screening with Taqman Low Density Arrays. This method led to identification of 212 microRNAs. The microRNAs were classified in 3 groups: Group number I contains 4 microRNAs with the increasing profile; II, 35 microRNAs with decreasing profile and III with 173 microRNAs, which remain unchanged. We calculated Pearson correlations between the expression profile of microRNAs and target mRNAs, predicted by TargetScan 5.1 and miRBase altgorithms, using genome-wide mRNA expression data. Group I correlated with the decreasing expression of 142 target mRNAs and Group II with the increasing expression of 876 target mRNAs. Most microRNAs correlate with multiple targets, just as mRNAs are targeted by multiple microRNAs. Among the identified targets are the genes and transcription factors known to play an essential role in pancreatic development. Conclusions We have determined specific groups of microRNAs in human fetal pancreas that change the degree of their expression throughout the development. A negative correlative analysis suggests an intertwined network of microRNAs and mRNAs collaborating with each other. This study provides information leading to potential two-way level of combinatorial control regulating gene expression through microRNAs targeting multiple mRNAs and, conversely, target mRNAs regulated in parallel by other microRNAs as well. This study may further the understanding of gene expression regulation in the human developing pancreas. PMID:20860821
Shi, Weiwei; Bugrim, Andrej; Nikolsky, Yuri; Nikolskya, Tatiana; Brennan, Richard J
2008-01-01
ABSTRACT The ideal toxicity biomarker is composed of the properties of prediction (is detected prior to traditional pathological signs of injury), accuracy (high sensitivity and specificity), and mechanistic relationships to the endpoint measured (biological relevance). Gene expression-based toxicity biomarkers ("signatures") have shown good predictive power and accuracy, but are difficult to interpret biologically. We have compared different statistical methods of feature selection with knowledge-based approaches, using GeneGo's database of canonical pathway maps, to generate gene sets for the classification of renal tubule toxicity. The gene set selection algorithms include four univariate analyses: t-statistics, fold-change, B-statistics, and RankProd, and their combination and overlap for the identification of differentially expressed probes. Enrichment analysis following the results of the four univariate analyses, Hotelling T-square test, and, finally out-of-bag selection, a variant of cross-validation, were used to identify canonical pathway maps-sets of genes coordinately involved in key biological processes-with classification power. Differentially expressed genes identified by the different statistical univariate analyses all generated reasonably performing classifiers of tubule toxicity. Maps identified by enrichment analysis or Hotelling T-square had lower classification power, but highlighted perturbed lipid homeostasis as a common discriminator of nephrotoxic treatments. The out-of-bag method yielded the best functionally integrated classifier. The map "ephrins signaling" performed comparably to a classifier derived using sparse linear programming, a machine learning algorithm, and represents a signaling network specifically involved in renal tubule development and integrity. Such functional descriptors of toxicity promise to better integrate predictive toxicogenomics with mechanistic analysis, facilitating the interpretation and risk assessment of predictive genomic investigations.
Hereditary family signature of facial expression
Peleg, Gili; Katzir, Gadi; Peleg, Ofer; Kamara, Michal; Brodsky, Leonid; Hel-Or, Hagit; Keren, Daniel; Nevo, Eviatar
2006-01-01
Although facial expressions of emotion are universal, individual differences create a facial expression “signature” for each person; but, is there a unique family facial expression signature? Only a few family studies on the heredity of facial expressions have been performed, none of which compared the gestalt of movements in various emotional states; they compared only a few movements in one or two emotional states. No studies, to our knowledge, have compared movements of congenitally blind subjects with their relatives to our knowledge. Using two types of analyses, we show a correlation between movements of congenitally blind subjects with those of their relatives in think-concentrate, sadness, anger, disgust, joy, and surprise and provide evidence for a unique family facial expression signature. In the analysis “in-out family test,” a particular movement was compared each time across subjects. Results show that the frequency of occurrence of a movement of a congenitally blind subject in his family is significantly higher than that outside of his family in think-concentrate, sadness, and anger. In the analysis “the classification test,” in which congenitally blind subjects were classified to their families according to the gestalt of movements, results show 80% correct classification over the entire interview and 75% in anger. Analysis of the movements' frequencies in anger revealed a correlation between the movements' frequencies of congenitally blind individuals and those of their relatives. This study anticipates discovering genes that influence facial expressions, understanding their evolutionary significance, and elucidating repair mechanisms for syndromes lacking facial expression, such as autism. PMID:17043232
Ushijima, Masaru; Mashima, Tetsuo; Tomida, Akihiro; Dan, Shingo; Saito, Sakae; Furuno, Aki; Tsukahara, Satomi; Seimiya, Hiroyuki; Yamori, Takao; Matsuura, Masaaki
2013-03-01
Genome-wide transcriptional expression analysis is a powerful strategy for characterizing the biological activity of anticancer compounds. It is often instructive to identify gene sets involved in the activity of a given drug compound for comparison with different compounds. Currently, however, there is no comprehensive gene expression database and related application system that is; (i) specialized in anticancer agents; (ii) easy to use; and (iii) open to the public. To develop a public gene expression database of antitumor agents, we first examined gene expression profiles in human cancer cells after exposure to 35 compounds including 25 clinically used anticancer agents. Gene signatures were extracted that were classified as upregulated or downregulated after exposure to the drug. Hierarchical clustering showed that drugs with similar mechanisms of action, such as genotoxic drugs, were clustered. Connectivity map analysis further revealed that our gene signature data reflected modes of action of the respective agents. Together with the database, we developed analysis programs that calculate scores for ranking changes in gene expression and for searching statistically significant pathways from the Kyoto Encyclopedia of Genes and Genomes database in order to analyze the datasets more easily. Our database and the analysis programs are available online at our website (http://scads.jfcr.or.jp/db/cs/). Using these systems, we successfully showed that proteasome inhibitors are selectively classified as endoplasmic reticulum stress inducers and induce atypical endoplasmic reticulum stress. Thus, our public access database and related analysis programs constitute a set of efficient tools to evaluate the mode of action of novel compounds and identify promising anticancer lead compounds. © 2012 Japanese Cancer Association.
Sparse feature selection for classification and prediction of metastasis in endometrial cancer.
Ahsen, Mehmet Eren; Boren, Todd P; Singh, Nitin K; Misganaw, Burook; Mutch, David G; Moore, Kathleen N; Backes, Floor J; McCourt, Carolyn K; Lea, Jayanthi S; Miller, David S; White, Michael A; Vidyasagar, Mathukumalli
2017-03-27
Metastasis via pelvic and/or para-aortic lymph nodes is a major risk factor for endometrial cancer. Lymph-node resection ameliorates risk but is associated with significant co-morbidities. Incidence in patients with stage I disease is 4-22% but no mechanism exists to accurately predict it. Therefore, national guidelines for primary staging surgery include pelvic and para-aortic lymph node dissection for all patients whose tumor exceeds 2cm in diameter. We sought to identify a robust molecular signature that can accurately classify risk of lymph node metastasis in endometrial cancer patients. 86 tumors matched for age and race, and evenly distributed between lymph node-positive and lymph node-negative cases, were selected as a training cohort. Genomic micro-RNA expression was profiled for each sample to serve as the predictive feature matrix. An independent set of 28 tumor samples was collected and similarly characterized to serve as a test cohort. A feature selection algorithm was designed for applications where the number of samples is far smaller than the number of measured features per sample. A predictive miRNA expression signature was developed using this algorithm, which was then used to predict the metastatic status of the independent test cohort. A weighted classifier, using 18 micro-RNAs, achieved 100% accuracy on the training cohort. When applied to the testing cohort, the classifier correctly predicted 90% of node-positive cases, and 80% of node-negative cases (FDR = 6.25%). Results indicate that the evaluation of the quantitative sparse-feature classifier proposed here in clinical trials may lead to significant improvement in the prediction of lymphatic metastases in endometrial cancer patients.
Sun, Jie; Chen, Xihai; Wang, Zhenzhen; Guo, Maoni; Shi, Hongbo; Wang, Xiaojun; Cheng, Liang; Zhou, Meng
2015-11-09
Long non-coding RNAs (lncRNAs) have been implicated in a variety of biological processes, and dysregulated lncRNAs have demonstrated potential roles as biomarkers and therapeutic targets for cancer prognosis and treatment. In this study, by repurposing microarray probes, we analyzed lncRNA expression profiles of 916 breast cancer patients from the Gene Expression Omnibus (GEO). Nine lncRNAs were identified to be significantly associated with metastasis-free survival (MFS) in the training dataset of 254 patients using the Cox proportional hazards regression model. These nine lncRNAs were then combined to form a single prognostic signature for predicting metastatic risk in breast cancer patients that was able to classify patients in the training dataset into high- and low-risk subgroups with significantly different MFSs (median 2.4 years versus 3.0 years, log-rank test p < 0.001). This nine-lncRNA signature was similarly effective for prognosis in a testing dataset and two independent datasets. Further analysis showed that the predictive ability of the signature was independent of clinical variables, including age, ER status, ESR1 status and ERBB2 status. Our results indicated that lncRNA signature could be a useful prognostic marker to predict metastatic risk in breast cancer patients and may improve upon our understanding of the molecular mechanisms underlying breast cancer metastasis.
A Gene Signature to Determine Metastatic Behavior in Thymomas
Gökmen-Polar, Yesim; Wilkinson, Jeff; Maetzold, Derek; Stone, John F.; Oelschlager, Kristen M.; Vladislav, Ioan Tudor; Shirar, Kristen L.; Kesler, Kenneth A.; Loehrer, Patrick J.; Badve, Sunil
2013-01-01
Purpose Thymoma represents one of the rarest of all malignancies. Stage and completeness of resection have been used to ascertain postoperative therapeutic strategies albeit with limited prognostic accuracy. A molecular classifier would be useful to improve the assessment of metastatic behaviour and optimize patient management. Methods qRT-PCR assay for 23 genes (19 test and four reference genes) was performed on multi-institutional archival primary thymomas (n = 36). Gene expression levels were used to compute a signature, classifying tumors into classes 1 and 2, corresponding to low or high likelihood for metastases. The signature was validated in an independent multi-institutional cohort of patients (n = 75). Results A nine-gene signature that can predict metastatic behavior of thymomas was developed and validated. Using radial basis machine modeling in the training set, 5-year and 10-year metastasis-free survival rates were 77% and 26% for predicted low (class 1) and high (class 2) risk of metastasis (P = 0.0047, log-rank), respectively. For the validation set, 5-year metastasis-free survival rates were 97% and 30% for predicted low- and high-risk patients (P = 0.0004, log-rank), respectively. The 5-year metastasis-free survival rates for the validation set were 49% and 41% for Masaoka stages I/II and III/IV (P = 0.0537, log-rank), respectively. In univariate and multivariate Cox models evaluating common prognostic factors for thymoma metastasis, the nine-gene signature was the only independent indicator of metastases (P = 0.036). Conclusion A nine-gene signature was established and validated which predicts the likelihood of metastasis more accurately than traditional staging. This further underscores the biologic determinants of the clinical course of thymoma and may improve patient management. PMID:23894276
NASA Technical Reports Server (NTRS)
Harston, Craig; Schumacher, Chris
1992-01-01
Automated schemes are needed to classify multispectral remotely sensed data. Human intelligence is often required to correctly interpret images from satellites and aircraft. Humans suceed because they use various types of cues about a scene to accurately define the contents of the image. Consequently, it follows that computer techniques that integrate and use different types of information would perform better than single source approaches. This research illustrated that multispectral signatures and topographical information could be used in concert. Significantly, this dual source tactic classified a remotely sensed image better than the multispectral classification alone. These classifications were accomplished by fusing spectral signatures with topographical information using neural network technology. A neural network was trained to classify Landsat mulitspectral signatures. A file of georeferenced ground truth classifications were used as the training criterion. The network was trained to classify urban, agriculture, range, and forest with an accuracy of 65.7 percent. Another neural network was programmed and trained to fuse these multispectral signature results with a file of georeferenced altitude data. This topological file contained 10 levels of elevations. When this nonspectral elevation information was fused with the spectral signatures, the classifications were improved to 73.7 and 75.7 percent.
A Label Propagation Approach for Detecting Buried Objects in Handheld GPR Data
2016-04-17
regions of interest that correspond to locations with anomalous signatures. Second, a classifier (or an ensemble of classifiers ) is used to assign a...investigated for almost two decades and several classifiers have been developed. Most of these methods are based on the supervised learning paradigm where...labeled target and clutter signatures are needed to train a classifier to discriminate between the two classes. Typically, large and diverse labeled
Multiclass cancer diagnosis using tumor gene expression signatures
Ramaswamy, S.; Tamayo, P.; Rifkin, R.; ...
2001-12-11
The optimal treatment of patients with cancer depends on establishing accurate diagnoses by using a complex combination of clinical and histopathological data. In some instances, this task is difficult or impossible because of atypical clinical presentation or histopathology. To determine whether the diagnosis of multiple common adult malignancies could be achieved purely by molecular classification, we subjected 218 tumor samples, spanning 14 common tumor types, and 90 normal tissue samples to oligonucleotide microarray gene expression analysis. The expression levels of 16,063 genes and expressed sequence tags were used to evaluate the accuracy of a multiclass classifier based on a supportmore » vector machine algorithm. Overall classification accuracy was 78%, far exceeding the accuracy of random classification (9%). Poorly differentiated cancers resulted in low-confidence predictions and could not be accurately classified according to their tissue of origin, indicating that they are molecularly distinct entities with dramatically different gene expression patterns compared with their well differentiated counterparts. Taken together, these results demonstrate the feasibility of accurate, multiclass molecular cancer classification and suggest a strategy for future clinical implementation of molecular cancer diagnostics.« less
Whole mouse blood microRNA as biomarkers for exposure to γ-rays and 56Fe ions
Templin, Thomas; Amundson, Sally A.; Brenner, David J.; Smilenov, Lubomir B.
2013-01-01
Purpose Biomarkers of ionising radiation exposure are useful in a variety of scenarios, such as medical diagnostic imaging, occupational exposures, and spaceflight. This study investigates to what extent microRNA (miRNA) expression signatures in mouse peripheral blood can be used as biomarkers for exposures to radiation with low and high linear energy transfers. Materials and methods Mice were irradiated with doses of 0.5, 1.5, or 5.0 Gy γ-rays (dose rate of 0.0136 Gy/s) or with doses of 0.1 or 0.5 Gy 56Fe ions (dose rate of 0.00208 Gy/s). Total RNA was isolated from whole blood at 6 h or 24 h after irradiation. Three animals per irradiation condition were used. Differentially expressed miRNA were determined by means of quantitative real-time polymerase chain reaction. Results miRNA expression signatures were radiation type-specific and dose- and time-dependent. The differentially expressed miRNA were expressed in either one condition (71%) or multiple conditions (29%). Classifiers based on the differentially expressed miRNA predicted radiation type or dose with accuracies between 75% and 100%. Gene-ontology analyses show that miRNA induced by irradiation are involved in the control of several biological processes, such as mRNA transcription regulation, nucleic-acid metabolism, and development. Conclusion miRNA signatures induced by ionising radiation in mouse blood are radiation type- and radiation dose-specific. These findings underline the complexity of the radiation response and the importance of miRNA in it. PMID:21271940
Age and gender-invariant features of handwritten signatures for verification systems
NASA Astrophysics Data System (ADS)
AbdAli, Sura; Putz-Leszczynska, Joanna
2014-11-01
Handwritten signature is one of the most natural biometrics, the study of human physiological and behavioral patterns. Behavioral biometrics includes signatures that may be different due to its owner gender or age because of intrinsic or extrinsic factors. This paper presents the results of the author's research on age and gender influence on verification factors. The experiments in this research were conducted using a database that contains signatures and their associated metadata. The used algorithm is based on the universal forgery feature idea, where the global classifier is able to classify a signature as a genuine one or, as a forgery, without the actual knowledge of the signature template and its owner. Additionally, the reduction of the dimensionality with the MRMR method is discussed.
Pediatric Sepsis Endotypes Among Adults With Sepsis.
Wong, Hector R; Sweeney, Timothy E; Hart, Kimberly W; Khatri, Purvesh; Lindsell, Christopher J
2017-12-01
Recent transcriptomic studies describe two subgroups of adults with sepsis differentiated by a sepsis response signature. The implied biology and related clinical associations are comparable with recently reported pediatric sepsis endotypes, labeled "A" and "B." We classified adults with sepsis using the pediatric endotyping strategy and the sepsis response signature and determined how endotype assignment, sepsis response signature membership, and age interact with respect to mortality. Retrospective analysis of publically available transcriptomic data representing critically ill adults with sepsis from which the sepsis response signature groups were derived and validated. Multiple ICUs. Adults with sepsis. None. Transcriptomic data were conormalized into a single dataset yielding 549 unique cases with sepsis response signature assignments. Each subject was assigned to endotype A or B using the expression data for the 100 endotyping genes. There were 163 subjects (30%) assigned to endotype A and 386 to endotype B. There was a weak, positive correlation between endotype assignment and sepsis response signature membership. Mortality rates were similar between patients assigned endotype A and those assigned endotype B. A multivariable logistic regression model fit to endotype assignment, sepsis response signature membership, age, and the respective two-way interactions revealed that endotype A, sepsis response signature 1 membership, older age, and the interactions between them were associated with mortality. Subjects coassigned to endotype A, and sepsis response signature 1 had the highest mortality. Combining the pediatric endotyping strategy with sepsis response signature membership might provide complementary, age-dependent, biological, and prognostic information.
Frerich, Candace A.; Brayer, Kathryn J.; Painter, Brandon M.; Kang, Huining; Mitani, Yoshitsugu; El-Naggar, Adel K.; Ness, Scott A.
2018-01-01
The relative rarity of salivary gland adenoid cystic carcinoma (ACC) and its slow growing yet aggressive nature has complicated the development of molecular markers for patient stratification. To analyze molecular differences linked to the protracted disease course of ACC and metastases that form 5 or more years after diagnosis, detailed RNA-sequencing (RNA-seq) analysis was performed on 68 ACC tumor samples, starting with archived, formalin-fixed paraffin-embedded (FFPE) samples up to 25 years old, so that clinical outcomes were available. A statistical peak-finding approach was used to classify the tumors that expressed MYB or MYBL1, which had overlapping gene expression signatures, from a group that expressed neither oncogene and displayed a unique phenotype. Expression of MYB or MYBL1 was closely correlated to the expression of the SOX4 and EN1 genes, suggesting that they are direct targets of Myb proteins in ACC tumors. Unsupervised hierarchical clustering identified a subgroup of approximately 20% of patients with exceptionally poor overall survival (median less than 30 months) and a unique gene expression signature resembling embryonic stem cells. The results provide a strategy for stratifying ACC patients and identifying the high-risk, poor-outcome group that are candidates for personalized therapies. PMID:29484115
Vargas, Teodoro; Moreno-Rubio, Juan; Herranz, Jesús; Cejas, Paloma; Molina, Susana; González-Vallinas, Margarita; Mendiola, Marta; Burgos, Emilio; Aguayo, Cristina; Custodio, Ana B.; Machado, Isidro; Ramos, David; Gironella, Meritxell; Espinosa-Salinas, Isabel; Ramos, Ricardo; Martín-Hernández, Roberto; Risueño, Alberto; De Las Rivas, Javier; Reglero, Guillermo; Yaya, Ricardo; Fernández-Martos, Carlos; Aparicio, Jorge; Maurel, Joan; Feliu, Jaime; de Molina, Ana Ramírez
2015-01-01
Lipid metabolism plays an essential role in carcinogenesis due to the requirements of tumoral cells to sustain increased structural, energetic and biosynthetic precursor demands for cell proliferation. We investigated the association between expression of lipid metabolism-related genes and clinical outcome in intermediate-stage colon cancer patients with the aim of identifying a metabolic profile associated with greater malignancy and increased risk of relapse. Expression profile of 70 lipid metabolism-related genes was determined in 77 patients with stage II colon cancer. Cox regression analyses using c-index methodology was applied to identify a metabolic-related signature associated to prognosis. The metabolic signature was further confirmed in two independent validation sets of 120 patients and additionally, in a group of 264 patients from a public database. The combined analysis of these 4 genes, ABCA1, ACSL1, AGPAT1 and SCD, constitutes a metabolic-signature (ColoLipidGene) able to accurately stratify stage II colon cancer patients with 5-fold higher risk of relapse with strong statistical power in the four independent groups of patients. The identification of a group of 4 genes that predict survival in intermediate-stage colon cancer patients allows delineation of a high-risk group that may benefit from adjuvant therapy, and avoids the toxic and unnecessary chemotherapy in patients classified as low-risk group. PMID:25749516
Transferring genomics to the clinic: distinguishing Burkitt and diffuse large B cell lymphomas.
Sha, Chulin; Barrans, Sharon; Care, Matthew A; Cunningham, David; Tooze, Reuben M; Jack, Andrew; Westhead, David R
2015-01-01
Classifiers based on molecular criteria such as gene expression signatures have been developed to distinguish Burkitt lymphoma and diffuse large B cell lymphoma, which help to explore the intermediate cases where traditional diagnosis is difficult. Transfer of these research classifiers into a clinical setting is challenging because there are competing classifiers in the literature based on different methodology and gene sets with no clear best choice; classifiers based on one expression measurement platform may not transfer effectively to another; and, classifiers developed using fresh frozen samples may not work effectively with the commonly used and more convenient formalin fixed paraffin-embedded samples used in routine diagnosis. Here we thoroughly compared two published high profile classifiers developed on data from different Affymetrix array platforms and fresh-frozen tissue, examining their transferability and concordance. Based on this analysis, a new Burkitt and diffuse large B cell lymphoma classifier (BDC) was developed and employed on Illumina DASL data from our own paraffin-embedded samples, allowing comparison with the diagnosis made in a central haematopathology laboratory and evaluation of clinical relevance. We show that both previous classifiers can be recapitulated using very much smaller gene sets than originally employed, and that the classification result is closely dependent on the Burkitt lymphoma criteria applied in the training set. The BDC classification on our data exhibits high agreement (~95 %) with the original diagnosis. A simple outcome comparison in the patients presenting intermediate features on conventional criteria suggests that the cases classified as Burkitt lymphoma by BDC have worse response to standard diffuse large B cell lymphoma treatment than those classified as diffuse large B cell lymphoma. In this study, we comprehensively investigate two previous Burkitt lymphoma molecular classifiers, and implement a new gene expression classifier, BDC, that works effectively on paraffin-embedded samples and provides useful information for treatment decisions. The classifier is available as a free software package under the GNU public licence within the R statistical software environment through the link http://www.bioinformatics.leeds.ac.uk/labpages/softwares/ or on github https://github.com/Sharlene/BDC.
Molecular differences in transition zone and peripheral zone prostate tumors
Sinnott, Jennifer A.; Rider, Jennifer R.; Carlsson, Jessica; Gerke, Travis; Tyekucheva, Svitlana; Penney, Kathryn L.; Sesso, Howard D.; Loda, Massimo; Fall, Katja; Stampfer, Meir J.; Mucci, Lorelei A.; Pawitan, Yudi; Andersson, Sven-Olof; Andrén, Ove
2015-01-01
Prostate tumors arise primarily in the peripheral zone (PZ) of the prostate, but 20–30% arise in the transition zone (TZ). Zone of origin may have prognostic value or reflect distinct molecular subtypes; however, it can be difficult to determine in practice. Using whole-genome gene expression, we built a signature of zone using normal tissue from five individuals and found that it successfully classified nine tumors of known zone. Hypothesizing that this signature captures tumor zone of origin, we assessed its relationship with clinical factors among 369 tumors of unknown zone from radical prostatectomies (RPs) and found that tumors that molecularly resembled TZ tumors showed lower mortality (P = 0.09) that was explained by lower Gleason scores (P = 0.009). We further applied the signature to an earlier study of 88 RP and 333 transurethral resection of the prostate (TURP) tumor samples, also of unknown zone, with gene expression on ~6000 genes. We had observed previously substantial expression differences between RP and TURP specimens, and hypothesized that this might be because RPs capture primarily PZ tumors, whereas TURPs capture more TZ tumors. Our signature distinguished these two groups, with an area under the receiver operating characteristic curve of 87% (P < 0.0001). Our findings that zonal differences in normal tissue persist in tumor tissue and that these differences are associated with Gleason score and sample type suggest that subtypes potentially resulting from different etiologic pathways might arise in these zones. Zone of origin may be important to consider in prostate tumor biomarker research. PMID:25870172
Gene expression profiling assigns CHEK2 1100delC breast cancers to the luminal intrinsic subtypes.
Nagel, Jord H A; Peeters, Justine K; Smid, Marcel; Sieuwerts, Anieta M; Wasielewski, Marijke; de Weerd, Vanja; Trapman-Jansen, Anita M A C; van den Ouweland, Ans; Brüggenwirth, Hennie; van I Jcken, Wilfred F J; Klijn, Jan G M; van der Spek, Peter J; Foekens, John A; Martens, John W M; Schutte, Mieke; Meijers-Heijboer, Hanne
2012-04-01
CHEK2 1100delC is a moderate-risk cancer susceptibility allele that confers a high breast cancer risk in a polygenic setting. Gene expression profiling of CHEK2 1100delC breast cancers may reveal clues to the nature of the polygenic CHEK2 model and its genes involved. Here, we report global gene expression profiles of a cohort of 155 familial breast cancers, including 26 CHEK2 1100delC mutant tumors. In line with previous work, all CHEK2 1100delC mutant tumors clustered among the hormone receptor-positive breast cancers. In the hormone receptor-positive subset, a 40-gene CHEK2 signature was subsequently defined that significantly associated with CHEK2 1100delC breast cancers. The identification of a CHEK2 gene signature implies an unexpected biological homogeneity among the CHEK2 1100delC breast cancers. In addition, all 26 CHEK2 1100delC tumors classified as luminal intrinsic subtype breast cancers, with 8 luminal A and 18 luminal B tumors. This biological make-up of CHEK2 1100delC breast cancers suggests that a relatively limited number of additional susceptibility alleles are involved in the polygenic CHEK2 model. Identification of these as-yet-unknown susceptibility alleles should be aided by clues from the 40-gene CHEK2 signature.
CrossLink: a novel method for cross-condition classification of cancer subtypes.
Ma, Chifeng; Sastry, Konduru S; Flore, Mario; Gehani, Salah; Al-Bozom, Issam; Feng, Yusheng; Serpedin, Erchin; Chouchane, Lotfi; Chen, Yidong; Huang, Yufei
2016-08-22
We considered the prediction of cancer classes (e.g. subtypes) using patient gene expression profiles that contain both systematic and condition-specific biases when compared with the training reference dataset. The conventional normalization-based approaches cannot guarantee that the gene signatures in the reference and prediction datasets always have the same distribution for all different conditions as the class-specific gene signatures change with the condition. Therefore, the trained classifier would work well under one condition but not under another. To address the problem of current normalization approaches, we propose a novel algorithm called CrossLink (CL). CL recognizes that there is no universal, condition-independent normalization mapping of signatures. In contrast, it exploits the fact that the signature is unique to its associated class under any condition and thus employs an unsupervised clustering algorithm to discover this unique signature. We assessed the performance of CL for cross-condition predictions of PAM50 subtypes of breast cancer by using a simulated dataset modeled after TCGA BRCA tumor samples with a cross-validation scheme, and datasets with known and unknown PAM50 classification. CL achieved prediction accuracy >73 %, highest among other methods we evaluated. We also applied the algorithm to a set of breast cancer tumors derived from Arabic population to assign a PAM50 classification to each tumor based on their gene expression profiles. A novel algorithm CrossLink for cross-condition prediction of cancer classes was proposed. In all test datasets, CL showed robust and consistent improvement in prediction performance over other state-of-the-art normalization and classification algorithms.
A prognostic gene signature for metastasis-free survival of triple negative breast cancer patients.
Lee, Unjin; Frankenberger, Casey; Yun, Jieun; Bevilacqua, Elena; Caldas, Carlos; Chin, Suet-Feung; Rueda, Oscar M; Reinitz, John; Rosner, Marsha Rich
2013-01-01
Although triple negative breast cancers (TNBC) are the most aggressive subtype of breast cancer, they currently lack targeted therapies. Because this classification still includes a heterogeneous collection of tumors, new tools to classify TNBCs are urgently required in order to improve our prognostic capability for high risk patients and predict response to therapy. We previously defined a gene expression signature, RKIP Pathway Metastasis Signature (RPMS), based upon a metastasis-suppressive signaling pathway initiated by Raf Kinase Inhibitory Protein (RKIP). We have now generated a new BACH1 Pathway Metastasis gene signature (BPMS) that utilizes targets of the metastasis regulator BACH1. Specifically, we substituted experimentally validated target genes to generate a new BACH1 metagene, developed an approach to optimize patient tumor stratification, and reduced the number of signature genes to 30. The BPMS significantly and selectively stratified metastasis-free survival in basal-like and, in particular, TNBC patients. In addition, the BPMS further stratified patients identified as having a good or poor prognosis by other signatures including the Mammaprint® and Oncotype® clinical tests. The BPMS is thus complementary to existing signatures and is a prognostic tool for high risk ER-HER2- patients. We also demonstrate the potential clinical applicability of the BPMS as a single sample predictor. Together, these results reveal the potential of this pathway-based BPMS gene signature to identify high risk TNBC patients that can respond effectively to targeted therapy, and highlight BPMS genes as novel drug targets for therapeutic development.
Investigation of VOCs associated with different characteristics of breast cancer cells
Lavra, Luca; Catini, Alexandro; Ulivieri, Alessandra; Capuano, Rosamaria; Baghernajad Salehi, Leila; Sciacchitano, Salvatore; Bartolazzi, Armando; Nardis, Sara; Paolesse, Roberto; Martinelli, Eugenio; Di Natale, Corrado
2015-01-01
The efficacy of breath volatile organic compounds (VOCs) analysis for the screening of patients bearing breast cancer lesions has been demonstrated by using gas chromatography and artificial olfactory systems. On the other hand, in-vitro studies suggest that VOCs detection could also give important indications regarding molecular and tumorigenic characteristics of tumor cells. Aim of this study was to analyze VOCs in the headspace of breast cancer cell lines in order to ascertain the potentiality of VOCs signatures in giving information about these cells and set-up a new sensor system able to detect breast tumor-associated VOCs. We identified by Gas Chromatography-Mass Spectrometry analysis a VOCs signature that discriminates breast cancer cells for: i) transformed condition; ii) cell doubling time (CDT); iii) Estrogen and Progesterone Receptors (ER, PgR) expression, and HER2 overexpression. Moreover, the signals obtained from a temperature modulated metal oxide semiconductor gas sensor can be classified in order to recognize VOCs signatures associated with breast cancer cells, CDT and ER expression. Our results demonstrate that VOCs analysis could give clinically relevant information about proliferative and molecular features of breast cancer cells and pose the basis for the optimization of a low-cost diagnostic device to be used for tumors characterization. PMID:26304457
Multigene signature for predicting prognosis of patients with 1p19q co-deletion diffuse glioma.
Hu, Xin; Martinez-Ledesma, Emmanuel; Zheng, Siyuan; Kim, Hoon; Barthel, Floris; Jiang, Tao; Hess, Kenneth R; Verhaak, Roel G W
2017-06-01
Co-deletion of 1p and 19q marks a diffuse glioma subtype associated with relatively favorable overall survival; however, heterogeneous clinical outcomes are observed within this category. We assembled gene expression profiles and sample annotation of 374 glioma patients carrying the 1p/19q co-deletion. We predicted 1p/19q status using gene expression when annotation was missing. A first cohort was randomly split into training (n = 170) and a validation dataset (n = 163). A second validation set consisted of 41 expression profiles. An elastic-net penalized Cox proportional hazards model was applied to build a classifier model through cross-validation within the training dataset. The selected 35-gene signature was used to identify high-risk and low-risk groups in the validation set, which showed significantly different overall survival (P = .00058, log-rank test). For time-to-death events, the high-risk group predicted by the gene signature yielded a hazard ratio of 1.78 (95% confidence interval, 1.02-3.11). The signature was also significantly associated with clinical outcome in the The Cancer Genome Atlas (CGA) IDH-mutant 1p/19q wild-type and IDH-wild-type glioma cohorts. Pathway analysis suggested that high risk was associated with increased acetylation activity and inflammatory response. Tumor purity was found to be significantly decreased in high-risk IDH-mutant with 1p/19q co-deletion gliomas and IDH-wild-type glioblastomas but not in IDH-wild-type lower grade or IDH-mutant, non-co-deleted gliomas. We identified a 35-gene signature that identifies high-risk and low-risk categories of 1p/19q positive glioma patients. We have demonstrated heterogeneity amongst a relatively new glioma subtype and provided a stepping stone towards risk stratification. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Neuro-Oncology. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Hoffheins, B.S.; Lauf, R.J.
1997-08-05
A gas detecting system is described for classifying the type of liquid fuel in a container or tank. The system includes a plurality of semiconductor gas sensors, each of which differs from the other in its response to various organic vapors. The system includes a means of processing the responses of the plurality of sensors such that the responses to any particular organic substance or mixture is sufficiently distinctive to constitute a recognizable ``signature``. The signature of known substances are collected and divided into two classes based on some other known characteristic of the substances. A pattern recognition system classifies the signature of an unknown substance with reference to the two user-defined classes, thereby classifying the unknown substance with regard to the characteristic of interest, such as its suitability for a particular use. 14 figs.
Hoffheins, Barbara S.; Lauf, Robert J.
1997-01-01
A gas detecting system for classifying the type of liquid fuel in a container or tank. The system includes a plurality of semiconductor gas sensors, each of which differs from the other in its response to various organic vapors. The system includes a means of processing the responses of the plurality of sensors such that the responses to any particular organic substance or mixture is sufficiently distinctive to constitute a recognizable "signature". The signature of known substances are collected and divided into two classes based on some other known characteristic of the substances. A pattern recognition system classifies the signature of an unknown substance with reference to the two user-defined classes, thereby classifying the unknown substance with regard to the characteristic of interest, such as its suitability for a particular use.
Lv, Yufeng; Wei, Wenhao; Huang, Zhong; Chen, Zhichao; Fang, Yuan; Pan, Lili; Han, Xueqiong; Xu, Zihai
2018-06-20
The aim of this study was to develop a novel long non-coding RNA (lncRNA) expression signature to accurately predict early recurrence for patients with hepatocellular carcinoma (HCC) after curative resection. Using expression profiles downloaded from The Cancer Genome Atlas database, we identified multiple lncRNAs with differential expression between early recurrence (ER) group and non-early recurrence (non-ER) group of HCC. Least absolute shrinkage and selection operator (LASSO) for logistic regression models were used to develop a lncRNA-based classifier for predicting ER in the training set. An independent test set was used to validated the predictive value of this classifier. Futhermore, a co-expression network based on these lncRNAs and its highly related genes was constructed and Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses of genes in the network were performed. We identified 10 differentially expressed lncRNAs, including 3 that were upregulated and 7 that were downregulated in ER group. The lncRNA-based classifier was constructed based on 7 lncRNAs (AL035661.1, PART1, AC011632.1, AC109588.1, AL365361.1, LINC00861 and LINC02084), and its accuracy was 0.83 in training set, 0.87 in test set and 0.84 in total set. And ROC curve analysis showed the AUROC was 0.741 in training set, 0.824 in the test set and 0.765 in total set. A functional enrichment analysis suggested that the genes of which is highly related to 4 lncRNAs were involved in immune system. This 7-lncRNA expression profile can effectively predict the early recurrence after surgical resection for HCC. This article is protected by copyright. All rights reserved.
Stem Cell-Like Gene Expression in Ovarian Cancer Predicts Type II Subtype and Prognosis
Schwede, Matthew; Spentzos, Dimitrios; Bentink, Stefan; Hofmann, Oliver; Haibe-Kains, Benjamin; Harrington, David; Quackenbush, John; Culhane, Aedín C.
2013-01-01
Although ovarian cancer is often initially chemotherapy-sensitive, the vast majority of tumors eventually relapse and patients die of increasingly aggressive disease. Cancer stem cells are believed to have properties that allow them to survive therapy and may drive recurrent tumor growth. Cancer stem cells or cancer-initiating cells are a rare cell population and difficult to isolate experimentally. Genes that are expressed by stem cells may characterize a subset of less differentiated tumors and aid in prognostic classification of ovarian cancer. The purpose of this study was the genomic identification and characterization of a subtype of ovarian cancer that has stem cell-like gene expression. Using human and mouse gene signatures of embryonic, adult, or cancer stem cells, we performed an unsupervised bipartition class discovery on expression profiles from 145 serous ovarian tumors to identify a stem-like and more differentiated subgroup. Subtypes were reproducible and were further characterized in four independent, heterogeneous ovarian cancer datasets. We identified a stem-like subtype characterized by a 51-gene signature, which is significantly enriched in tumors with properties of Type II ovarian cancer; high grade, serous tumors, and poor survival. Conversely, the differentiated tumors share properties with Type I, including lower grade and mixed histological subtypes. The stem cell-like signature was prognostic within high-stage serous ovarian cancer, classifying a small subset of high-stage tumors with better prognosis, in the differentiated subtype. In multivariate models that adjusted for common clinical factors (including grade, stage, age), the subtype classification was still a significant predictor of relapse. The prognostic stem-like gene signature yields new insights into prognostic differences in ovarian cancer, provides a genomic context for defining Type I/II subtypes, and potential gene targets which following further validation may be valuable in the clinical management or treatment of ovarian cancer. PMID:23536770
Rosso, Osvaldo A; Ospina, Raydonal; Frery, Alejandro C
2016-01-01
We present a new approach for handwritten signature classification and verification based on descriptors stemming from time causal information theory. The proposal uses the Shannon entropy, the statistical complexity, and the Fisher information evaluated over the Bandt and Pompe symbolization of the horizontal and vertical coordinates of signatures. These six features are easy and fast to compute, and they are the input to an One-Class Support Vector Machine classifier. The results are better than state-of-the-art online techniques that employ higher-dimensional feature spaces which often require specialized software and hardware. We assess the consistency of our proposal with respect to the size of the training sample, and we also use it to classify the signatures into meaningful groups.
A Prognostic Gene Signature for Metastasis-Free Survival of Triple Negative Breast Cancer Patients
Yun, Jieun; Bevilacqua, Elena; Caldas, Carlos; Chin, Suet-Feung; Rueda, Oscar M.; Reinitz, John; Rosner, Marsha Rich
2013-01-01
Although triple negative breast cancers (TNBC) are the most aggressive subtype of breast cancer, they currently lack targeted therapies. Because this classification still includes a heterogeneous collection of tumors, new tools to classify TNBCs are urgently required in order to improve our prognostic capability for high risk patients and predict response to therapy. We previously defined a gene expression signature, RKIP Pathway Metastasis Signature (RPMS), based upon a metastasis-suppressive signaling pathway initiated by Raf Kinase Inhibitory Protein (RKIP). We have now generated a new BACH1 Pathway Metastasis gene signature (BPMS) that utilizes targets of the metastasis regulator BACH1. Specifically, we substituted experimentally validated target genes to generate a new BACH1 metagene, developed an approach to optimize patient tumor stratification, and reduced the number of signature genes to 30. The BPMS significantly and selectively stratified metastasis-free survival in basal-like and, in particular, TNBC patients. In addition, the BPMS further stratified patients identified as having a good or poor prognosis by other signatures including the Mammaprint® and Oncotype® clinical tests. The BPMS is thus complementary to existing signatures and is a prognostic tool for high risk ER-HER2- patients. We also demonstrate the potential clinical applicability of the BPMS as a single sample predictor. Together, these results reveal the potential of this pathway-based BPMS gene signature to identify high risk TNBC patients that can respond effectively to targeted therapy, and highlight BPMS genes as novel drug targets for therapeutic development. PMID:24349199
Rue-Albrecht, Kévin; McGettigan, Paul A; Hernández, Belinda; Nalpas, Nicolas C; Magee, David A; Parnell, Andrew C; Gordon, Stephen V; MacHugh, David E
2016-03-11
Identification of gene expression profiles that differentiate experimental groups is critical for discovery and analysis of key molecular pathways and also for selection of robust diagnostic or prognostic biomarkers. While integration of differential expression statistics has been used to refine gene set enrichment analyses, such approaches are typically limited to single gene lists resulting from simple two-group comparisons or time-series analyses. In contrast, functional class scoring and machine learning approaches provide powerful alternative methods to leverage molecular measurements for pathway analyses, and to compare continuous and multi-level categorical factors. We introduce GOexpress, a software package for scoring and summarising the capacity of gene ontology features to simultaneously classify samples from multiple experimental groups. GOexpress integrates normalised gene expression data (e.g., from microarray and RNA-seq experiments) and phenotypic information of individual samples with gene ontology annotations to derive a ranking of genes and gene ontology terms using a supervised learning approach. The default random forest algorithm allows interactions between all experimental factors, and competitive scoring of expressed genes to evaluate their relative importance in classifying predefined groups of samples. GOexpress enables rapid identification and visualisation of ontology-related gene panels that robustly classify groups of samples and supports both categorical (e.g., infection status, treatment) and continuous (e.g., time-series, drug concentrations) experimental factors. The use of standard Bioconductor extension packages and publicly available gene ontology annotations facilitates straightforward integration of GOexpress within existing computational biology pipelines.
A 10-Gene Classifier for Indeterminate Thyroid Nodules: Development and Multicenter Accuracy Study
González, Hernán E.; Martínez, José R.; Vargas-Salas, Sergio; Solar, Antonieta; Veliz, Loreto; Cruz, Francisco; Arias, Tatiana; Loyola, Soledad; Horvath, Eleonora; Tala, Hernán; Traipe, Eufrosina; Meneses, Manuel; Marín, Luis; Wohllk, Nelson; Diaz, René E.; Véliz, Jesús; Pineda, Pedro; Arroyo, Patricia; Mena, Natalia; Bracamonte, Milagros; Miranda, Giovanna; Bruce, Elsa
2017-01-01
Background: In most of the world, diagnostic surgery remains the most frequent approach for indeterminate thyroid cytology. Although several molecular tests are available for testing in centralized commercial laboratories in the United States, there are no available kits for local laboratory testing. The aim of this study was to develop a prototype in vitro diagnostic (IVD) gene classifier for the further characterization of nodules with an indeterminate thyroid cytology. Methods: In a first stage, the expression of 18 genes was determined by quantitative polymerase chain reaction (qPCR) in a broad histopathological spectrum of 114 fresh-tissue biopsies. Expression data were used to train several classifiers by supervised machine learning approaches. Classifiers were tested in an independent set of 139 samples. In a second stage, the best classifier was chosen as a model to develop a multiplexed-qPCR IVD prototype assay, which was tested in a prospective multicenter cohort of fine-needle aspiration biopsies. Results: In tissue biopsies, the best classifier, using only 10 genes, reached an optimal and consistent performance in the ninefold cross-validated testing set (sensitivity 93% and specificity 81%). In the multicenter cohort of fine-needle aspiration biopsy samples, the 10-gene signature, built into a multiplexed-qPCR IVD prototype, showed an area under the curve of 0.97, a positive predictive value of 78%, and a negative predictive value of 98%. By Bayes' theorem, the IVD prototype is expected to achieve a positive predictive value of 64–82% and a negative predictive value of 97–99% in patients with a cancer prevalence range of 20–40%. Conclusions: A new multiplexed-qPCR IVD prototype is reported that accurately classifies thyroid nodules and may provide a future solution suitable for local reference laboratory testing. PMID:28521616
Ospina, Raydonal; Frery, Alejandro C.
2016-01-01
We present a new approach for handwritten signature classification and verification based on descriptors stemming from time causal information theory. The proposal uses the Shannon entropy, the statistical complexity, and the Fisher information evaluated over the Bandt and Pompe symbolization of the horizontal and vertical coordinates of signatures. These six features are easy and fast to compute, and they are the input to an One-Class Support Vector Machine classifier. The results are better than state-of-the-art online techniques that employ higher-dimensional feature spaces which often require specialized software and hardware. We assess the consistency of our proposal with respect to the size of the training sample, and we also use it to classify the signatures into meaningful groups. PMID:27907014
Blood Based Biomarkers of Early Onset Breast Cancer
2016-12-01
discretizes the data, and also using logistic elastic net – a form of linear regression - we were unable to build a classifier that could accurately...classifier for differentiating cases from controls off discretized data. The first pass analysis demonstrated a 35 gene signature that differentiated...to the discretized data for mRNA gene signature, the samples used to “train” were also included in the final samples used to “test” the algorithm
Díaz-Beyá, Marina; Brunet, Salut; Nomdedéu, Josep; Pratcorona, Marta; Cordeiro, Anna; Gallardo, David; Escoda, Lourdes; Tormo, Mar; Heras, Inmaculada; Ribera, Josep Maria; Duarte, Rafael; de Llano, María Paz Queipo; Bargay, Joan; Sampol, Antonia; Nomdedeu, Mertixell; Risueño, Ruth M.; Hoyos, Montserrat; Sierra, Jorge; Monzo, Mariano; Navarro, Alfons; Esteve, Jordi
2015-01-01
Long non-coding RNAs (lncRNAs) are deregulated in several tumors, although their role in acute myeloid leukemia (AML) is mostly unknown. We have examined the expression of the lncRNA HOX antisense intergenic RNA myeloid 1 (HOTAIRM1) in 241 AML patients. We have correlated HOTAIRM1 expression with a miRNA expression profile. We have also analyzed the prognostic value of HOTAIRM1 expression in 215 intermediate-risk AML (IR-AML) patients. The lowest expression level was observed in acute promyelocytic leukemia (P < 0.001) and the highest in t(6;9) AML (P = 0.005). In 215 IR-AML patients, high HOTAIRM1 expression was independently associated with shorter overall survival (OR:2.04;P = 0.001), shorter leukemia-free survival (OR:2.56; P < 0.001) and a higher cumulative incidence of relapse (OR:1.67; P = 0.046). Moreover, HOTAIRM1 maintained its independent prognostic value within the favorable molecular subgroup (OR: 3.43; P = 0.009). Interestingly, HOTAIRM1 was overexpressed in NPM1-mutated AML (P < 0.001) and within this group retained its prognostic value (OR: 2.21; P = 0.01). Moreover, HOTAIRM1 expression was associated with a specific 33- microRNA signature that included miR-196b (P < 0.001). miR-196b is located in the HOX genomic region and has previously been reported to have an independent prognostic value in AML. miR-196b and HOTAIRM1 in combination as a prognostic factor can classify patients as high-, intermediate-, or low-risk (5-year OS: 24% vs 42% vs 70%; P = 0.004). Determination of HOTAIRM1 level at diagnosis provided relevant prognostic information in IR-AML and allowed refinement of risk stratification based on common molecular markers. The prognostic information provided by HOTAIRM1 was strengthened when combined with miR-196b expression. Furthermore, HOTAIRM1 correlated with a 33-miRNA signature. PMID:26436590
76 FR 75461 - Express Mail Domestic Postage Refund Policy and Waiver of Signature
Federal Register 2010, 2011, 2012, 2013, 2014
2011-12-02
... POSTAL SERVICE 39 CFR Part 111 Express Mail Domestic Postage Refund Policy and Waiver of Signature... to 30 days after the date of mailing, and to change the Express Mail ``waiver of signature'' standard for domestic items by obtaining an addressee's signature only when the mailer selects the ``signature...
Ma, Xu; He, Zhijuan; Li, Ling; Yang, Daping; Liu, Guofeng
2017-09-29
Recent advancements in cancer biology have identified a large number of lncRNAs that are dysregulated expression in the development and tumorigenesis of cancers, highlighting the importance of lncRNAs as a key player for human cancers. However, the prognostic value of lncRNAs still remains unclear and needs to be further investigated. In the present study, we aim to assess the prognostic value of lncRNAs in cutaneous melanoma by integrated lncRNA expression profiles from TCGA database and matched clinical information from a large cohort of patients with cutaneous melanoma. We finally identified a set of six lncRNAs that are significantly associated with survival of patients with cutaneous melanoma. A linear combination of six lncRNAs ( LINC01260, HCP5, PIGBOS1, RP11-247L20.4, CTA-292E10.6 and CTB-113P19.5 ) was constructed as a six-lncRNA signature which classified patients of training cohort into the high-risk group and low-risk group with significantly different survival time. The prognostic value of the six-lncRNA signature was validated in both the validation cohort and entire TCGA cohort. Moreover, the six-lncRNA signature is independent of known clinic-pathological factors by multivariate Cox regression analysis and demonstrated good performance for predicting three- and five-year overall survival by time-dependent receiver operating characteristic (ROC) analysis. Our study provides novel insights into the molecular heterogeneity of cutaneous melanoma and also shows potentially important implications of lncRNAs for prognosis and therapy for cutaneous melanoma.
Promish, D I; Lester, D
1999-11-08
We attempted to match the appearance and demeanor of 27 serial killers to the postmortem 'signatures' found on their victims' bodies. Our results suggest that a link may exist between postmortem signatures and two complementary appearance-demeanor types.
Logic Learning Machine creates explicit and stable rules stratifying neuroblastoma patients
2013-01-01
Background Neuroblastoma is the most common pediatric solid tumor. About fifty percent of high risk patients die despite treatment making the exploration of new and more effective strategies for improving stratification mandatory. Hypoxia is a condition of low oxygen tension occurring in poorly vascularized areas of the tumor associated with poor prognosis. We had previously defined a robust gene expression signature measuring the hypoxic component of neuroblastoma tumors (NB-hypo) which is a molecular risk factor. We wanted to develop a prognostic classifier of neuroblastoma patients' outcome blending existing knowledge on clinical and molecular risk factors with the prognostic NB-hypo signature. Furthermore, we were interested in classifiers outputting explicit rules that could be easily translated into the clinical setting. Results Shadow Clustering (SC) technique, which leads to final models called Logic Learning Machine (LLM), exhibits a good accuracy and promises to fulfill the aims of the work. We utilized this algorithm to classify NB-patients on the bases of the following risk factors: Age at diagnosis, INSS stage, MYCN amplification and NB-hypo. The algorithm generated explicit classification rules in good agreement with existing clinical knowledge. Through an iterative procedure we identified and removed from the dataset those examples which caused instability in the rules. This workflow generated a stable classifier very accurate in predicting good and poor outcome patients. The good performance of the classifier was validated in an independent dataset. NB-hypo was an important component of the rules with a strength similar to that of tumor staging. Conclusions The novelty of our work is to identify stability, explicit rules and blending of molecular and clinical risk factors as the key features to generate classification rules for NB patients to be conveyed to the clinic and to be used to design new therapies. We derived, through LLM, a set of four stable rules identifying a new class of poor outcome patients that could benefit from new therapies potentially targeting tumor hypoxia or its consequences. PMID:23815266
Logic Learning Machine creates explicit and stable rules stratifying neuroblastoma patients.
Cangelosi, Davide; Blengio, Fabiola; Versteeg, Rogier; Eggert, Angelika; Garaventa, Alberto; Gambini, Claudio; Conte, Massimo; Eva, Alessandra; Muselli, Marco; Varesio, Luigi
2013-01-01
Neuroblastoma is the most common pediatric solid tumor. About fifty percent of high risk patients die despite treatment making the exploration of new and more effective strategies for improving stratification mandatory. Hypoxia is a condition of low oxygen tension occurring in poorly vascularized areas of the tumor associated with poor prognosis. We had previously defined a robust gene expression signature measuring the hypoxic component of neuroblastoma tumors (NB-hypo) which is a molecular risk factor. We wanted to develop a prognostic classifier of neuroblastoma patients' outcome blending existing knowledge on clinical and molecular risk factors with the prognostic NB-hypo signature. Furthermore, we were interested in classifiers outputting explicit rules that could be easily translated into the clinical setting. Shadow Clustering (SC) technique, which leads to final models called Logic Learning Machine (LLM), exhibits a good accuracy and promises to fulfill the aims of the work. We utilized this algorithm to classify NB-patients on the bases of the following risk factors: Age at diagnosis, INSS stage, MYCN amplification and NB-hypo. The algorithm generated explicit classification rules in good agreement with existing clinical knowledge. Through an iterative procedure we identified and removed from the dataset those examples which caused instability in the rules. This workflow generated a stable classifier very accurate in predicting good and poor outcome patients. The good performance of the classifier was validated in an independent dataset. NB-hypo was an important component of the rules with a strength similar to that of tumor staging. The novelty of our work is to identify stability, explicit rules and blending of molecular and clinical risk factors as the key features to generate classification rules for NB patients to be conveyed to the clinic and to be used to design new therapies. We derived, through LLM, a set of four stable rules identifying a new class of poor outcome patients that could benefit from new therapies potentially targeting tumor hypoxia or its consequences.
Yoon, Dukyong; Kim, Hyosil; Suh-Kim, Haeyoung; Park, Rae Woong; Lee, KiYoung
2011-01-01
Microarray analyses based on differentially expressed genes (DEGs) have been widely used to distinguish samples across different cellular conditions. However, studies based on DEGs have not been able to clearly determine significant differences between samples of pathophysiologically similar HIV-1 stages, e.g., between acute and chronic progressive (or AIDS) or between uninfected and clinically latent stages. We here suggest a novel approach to allow such discrimination based on stage-specific genetic features of HIV-1 infection. Our approach is based on co-expression changes of genes known to interact. The method can identify a genetic signature for a single sample as contrasted with existing protein-protein-based analyses with correlational designs. Our approach distinguishes each sample using differentially co-expressed interacting protein pairs (DEPs) based on co-expression scores of individual interacting pairs within a sample. The co-expression score has positive value if two genes in a sample are simultaneously up-regulated or down-regulated. And the score has higher absolute value if expression-changing ratios are similar between the two genes. We compared characteristics of DEPs with that of DEGs by evaluating their usefulness in separation of HIV-1 stage. And we identified DEP-based network-modules and their gene-ontology enrichment to find out the HIV-1 stage-specific gene signature. Based on the DEP approach, we observed clear separation among samples from distinct HIV-1 stages using clustering and principal component analyses. Moreover, the discrimination power of DEPs on the samples (70-100% accuracy) was much higher than that of DEGs (35-45%) using several well-known classifiers. DEP-based network analysis also revealed the HIV-1 stage-specific network modules; the main biological processes were related to "translation," "RNA splicing," "mRNA, RNA, and nucleic acid transport," and "DNA metabolism." Through the HIV-1 stage-related modules, changing stage-specific patterns of protein interactions could be observed. DEP-based method discriminated the HIV-1 infection stages clearly, and revealed a HIV-1 stage-specific gene signature. The proposed DEP-based method might complement existing DEG-based approaches in various microarray expression analyses.
Prognostic and functional role of subtype-specific tumor-stroma interaction in breast cancer.
Merlino, Giuseppe; Miodini, Patrizia; Callari, Maurizio; D'Aiuto, Francesca; Cappelletti, Vera; Daidone, Maria Grazia
2017-10-01
None of the clinically relevant gene expression signatures available for breast cancer were specifically developed to capture the influence of the microenvironment on tumor cells. Here, we attempted to build subtype-specific signatures derived from an in vitro model reproducing tumor cell modifications after interaction with activated or normal stromal cells. Gene expression signatures derived from HER2+, luminal, and basal breast cancer cell lines (treated by normal fibroblasts or cancer-associated fibroblasts conditioned media) were evaluated in clinical tumors by in silico analysis on published gene expression profiles (GEPs). Patients were classified as microenvironment-positive (μENV+ve), that is, with tumors showing molecular profiles suggesting activation by the stroma, or microenvironment-negative (μENV-ve) based on correlation of their tumors' GEP with the respective subtype-specific signature. Patients with estrogen receptor alpha (ER)+/HER2-/μENV+ve tumors were characterized by 2.5-fold higher risk of developing distant metastases (HR = 2.546; 95% CI: 1.751-3.701, P = 9.84E-07), while μENV status did not affect, or only suggested the risk of distant metastases, in women with HER2+ (HR = 1.541; 95% CI: 0.788-3.012, P = 0.206) or ER-/HER2- tumors (HR = 1.894; 95% CI: 0.938-3.824; P = 0.0747), respectively. In ER+/HER2- tumors, the μENV status remained significantly associated with metastatic progression (HR = 2.098; CI: 1.214-3.624; P = 0.00791) in multivariable analysis including size, age, and Genomic Grade Index. Validity of our in vitro model was also supported by in vitro biological endpoints such as cell growth (MTT assay) and migration/invasion (Transwell assay). In vitro-derived gene signatures tracing the bidirectional interaction with cancer activated fibroblasts are subtype-specific and add independent prognostic information to classical prognostic variables in women with ER+/HER2- tumors. © 2017 The Authors. Published by FEBS Press and John Wiley & Sons Ltd.
Manijak, Mieszko P; Nielsen, Henrik B
2011-06-11
Although, systematic analysis of gene annotation is a powerful tool for interpreting gene expression data, it sometimes is blurred by incomplete gene annotation, missing expression response of key genes and secondary gene expression responses. These shortcomings may be partially circumvented by instead matching gene expression signatures to signatures of other experiments. To facilitate this we present the Functional Association Response by Overlap (FARO) server, that match input signatures to a compendium of 242 gene expression signatures, extracted from more than 1700 Arabidopsis microarray experiments. Hereby we present a publicly available tool for robust characterization of Arabidopsis gene expression experiments which can point to similar experimental factors in other experiments. The server is available at http://www.cbs.dtu.dk/services/faro/.
NASA Astrophysics Data System (ADS)
Schmalz, M.; Ritter, G.; Key, R.
Accurate and computationally efficient spectral signature classification is a crucial step in the nonimaging detection and recognition of spaceborne objects. In classical hyperspectral recognition applications using linear mixing models, signature classification accuracy depends on accurate spectral endmember discrimination [1]. If the endmembers cannot be classified correctly, then the signatures cannot be classified correctly, and object recognition from hyperspectral data will be inaccurate. In practice, the number of endmembers accurately classified often depends linearly on the number of inputs. This can lead to potentially severe classification errors in the presence of noise or densely interleaved signatures. In this paper, we present an comparison of emerging technologies for nonimaging spectral signature classfication based on a highly accurate, efficient search engine called Tabular Nearest Neighbor Encoding (TNE) [3,4] and a neural network technology called Morphological Neural Networks (MNNs) [5]. Based on prior results, TNE can optimize its classifier performance to track input nonergodicities, as well as yield measures of confidence or caution for evaluation of classification results. Unlike neural networks, TNE does not have a hidden intermediate data structure (e.g., the neural net weight matrix). Instead, TNE generates and exploits a user-accessible data structure called the agreement map (AM), which can be manipulated by Boolean logic operations to effect accurate classifier refinement algorithms. The open architecture and programmability of TNE's agreement map processing allows a TNE programmer or user to determine classification accuracy, as well as characterize in detail the signatures for which TNE did not obtain classification matches, and why such mis-matches occurred. In this study, we will compare TNE and MNN based endmember classification, using performance metrics such as probability of correct classification (Pd) and rate of false detections (Rfa). As proof of principle, we analyze classification of multiple closely spaced signatures from a NASA database of space material signatures. Additional analysis pertains to computational complexity and noise sensitivity, which are superior to Bayesian techniques based on classical neural networks. [1] Winter, M.E. "Fast autonomous spectral end-member determination in hyperspectral data," in Proceedings of the 13th International Conference On Applied Geologic Remote Sensing, Vancouver, B.C., Canada, pp. 337-44 (1999). [2] N. Keshava, "A survey of spectral unmixing algorithms," Lincoln Laboratory Journal 14:55-78 (2003). [3] Key, G., M.S. SCHMALZ, F.M. Caimi, and G.X. Ritter. "Performance analysis of tabular nearest neighbor encoding algorithm for joint compression and ATR", in Proceedings SPIE 3814:115-126 (1999). [4] Schmalz, M.S. and G. Key. "Algorithms for hyperspectral signature classification in unresolved object detection using tabular nearest neighbor encoding" in Proceedings of the 2007 AMOS Conference, Maui HI (2007). [5] Ritter, G.X., G. Urcid, and M.S. Schmalz. "Autonomous single-pass endmember approximation using lattice auto-associative memories", Neurocomputing (Elsevier), accepted (June 2008).
Machine Learning Through Signature Trees. Applications to Human Speech.
ERIC Educational Resources Information Center
White, George M.
A signature tree is a binary decision tree used to classify unknown patterns. An attempt was made to develop a computer program for manipulating signature trees as a general research tool for exploring machine learning and pattern recognition. The program was applied to the problem of speech recognition to test its effectiveness for a specific…
Connor, Ashton A; Denroche, Robert E; Jang, Gun Ho; Timms, Lee; Kalimuthu, Sangeetha N; Selander, Iris; McPherson, Treasa; Wilson, Gavin W; Chan-Seng-Yue, Michelle A; Borozan, Ivan; Ferretti, Vincent; Grant, Robert C; Lungu, Ilinca M; Costello, Eithne; Greenhalf, William; Palmer, Daniel; Ghaneh, Paula; Neoptolemos, John P; Buchler, Markus; Petersen, Gloria; Thayer, Sarah; Hollingsworth, Michael A; Sherker, Alana; Durocher, Daniel; Dhani, Neesha; Hedley, David; Serra, Stefano; Pollett, Aaron; Roehrl, Michael H A; Bavi, Prashant; Bartlett, John M S; Cleary, Sean; Wilson, Julie M; Alexandrov, Ludmil B; Moore, Malcolm; Wouters, Bradly G; McPherson, John D; Notta, Faiyaz; Stein, Lincoln D; Gallinger, Steven
2017-06-01
Outcomes for patients with pancreatic ductal adenocarcinoma (PDAC) remain poor. Advances in next-generation sequencing provide a route to therapeutic approaches, and integrating DNA and RNA analysis with clinicopathologic data may be a crucial step toward personalized treatment strategies for this disease. To classify PDAC according to distinct mutational processes, and explore their clinical significance. We performed a retrospective cohort study of resected PDAC, using cases collected between 2008 and 2015 as part of the International Cancer Genome Consortium. The discovery cohort comprised 160 PDAC cases from 154 patients (148 primary; 12 metastases) that underwent tumor enrichment prior to whole-genome and RNA sequencing. The replication cohort comprised 95 primary PDAC cases that underwent whole-genome sequencing and expression microarray on bulk biospecimens. Somatic mutations accumulate from sequence-specific processes creating signatures detectable by DNA sequencing. Using nonnegative matrix factorization, we measured the contribution of each signature to carcinogenesis, and used hierarchical clustering to subtype each cohort. We examined expression of antitumor immunity genes across subtypes to uncover biomarkers predictive of response to systemic therapies. The discovery cohort was 53% male (n = 79) and had a median age of 67 (interquartile range, 58-74) years. The replication cohort was 50% male (n = 48) and had a median age of 68 (interquartile range, 60-75) years. Five predominant mutational subtypes were identified that clustered PDAC into 4 major subtypes: age related, double-strand break repair, mismatch repair, and 1 with unknown etiology (signature 8). These were replicated and validated. Signatures were faithfully propagated from primaries to matched metastases, implying their stability during carcinogenesis. Twelve of 27 (45%) double-strand break repair cases lacked germline or somatic events in canonical homologous recombination genes-BRCA1, BRCA2, or PALB2. Double-strand break repair and mismatch repair subtypes were associated with increased expression of antitumor immunity, including activation of CD8-positive T lymphocytes (GZMA and PRF1) and overexpression of regulatory molecules (cytotoxic T-lymphocyte antigen 4, programmed cell death 1, and indolamine 2,3-dioxygenase 1), corresponding to higher frequency of somatic mutations and tumor-specific neoantigens. Signature-based subtyping may guide personalized therapy of PDAC in the context of biomarker-driven prospective trials.
NASA Astrophysics Data System (ADS)
Song, Sutao; Huang, Yuxia; Long, Zhiying; Zhang, Jiacai; Chen, Gongxiang; Wang, Shuqing
2016-03-01
Recently, several studies have successfully applied multivariate pattern analysis methods to predict the categories of emotions. These studies are mainly focused on self-experienced emotions, such as the emotional states elicited by music or movie. In fact, most of our social interactions involve perception of emotional information from the expressions of other people, and it is an important basic skill for humans to recognize the emotional facial expressions of other people in a short time. In this study, we aimed to determine the discriminability of perceived emotional facial expressions. In a rapid event-related fMRI design, subjects were instructed to classify four categories of facial expressions (happy, disgust, angry and neutral) by pressing different buttons, and each facial expression stimulus lasted for 2s. All participants performed 5 fMRI runs. One multivariate pattern analysis method, support vector machine was trained to predict the categories of facial expressions. For feature selection, ninety masks defined from anatomical automatic labeling (AAL) atlas were firstly generated and each were treated as the input of the classifier; then, the most stable AAL areas were selected according to prediction accuracies, and comprised the final feature sets. Results showed that: for the 6 pair-wise classification conditions, the accuracy, sensitivity and specificity were all above chance prediction, among which, happy vs. neutral , angry vs. disgust achieved the lowest results. These results suggested that specific neural signatures of perceived emotional facial expressions may exist, and happy vs. neutral, angry vs. disgust might be more similar in information representation in the brain.
NASA Astrophysics Data System (ADS)
Markman, Adam; Carnicer, Artur; Javidi, Bahram
2017-05-01
We overview our recent work [1] on utilizing three-dimensional (3D) optical phase codes for object authentication using the random forest classifier. A simple 3D optical phase code (OPC) is generated by combining multiple diffusers and glass slides. This tag is then placed on a quick-response (QR) code, which is a barcode capable of storing information and can be scanned under non-uniform illumination conditions, rotation, and slight degradation. A coherent light source illuminates the OPC and the transmitted light is captured by a CCD to record the unique signature. Feature extraction on the signature is performed and inputted into a pre-trained random-forest classifier for authentication.
Autophagy-related prognostic signature for breast cancer.
Gu, Yunyan; Li, Pengfei; Peng, Fuduan; Zhang, Mengmeng; Zhang, Yuanyuan; Liang, Haihai; Zhao, Wenyuan; Qi, Lishuang; Wang, Hongwei; Wang, Chenguang; Guo, Zheng
2016-03-01
Autophagy is a process that degrades intracellular constituents, such as long-lived or damaged proteins and organelles, to buffer metabolic stress under starvation conditions. Deregulation of autophagy is involved in the progression of cancer. However, the predictive value of autophagy for breast cancer prognosis remains unclear. First, based on gene expression profiling, we found that autophagy genes were implicated in breast cancer. Then, using the Cox proportional hazard regression model, we detected autophagy prognostic signature for breast cancer in a training dataset. We identified a set of eight autophagy genes (BCL2, BIRC5, EIF4EBP1, ERO1L, FOS, GAPDH, ITPR1 and VEGFA) that were significantly associated with overall survival in breast cancer. The eight autophagy genes were assigned as a autophagy-related prognostic signature for breast cancer. Based on the autophagy-related signature, the training dataset GSE21653 could be classified into high-risk and low-risk subgroups with significantly different survival times (HR = 2.72, 95% CI = (1.91, 3.87); P = 1.37 × 10(-5)). Inactivation of autophagy was associated with shortened survival of breast cancer patients. The prognostic value of the autophagy-related signature was confirmed in the testing dataset GSE3494 (HR = 2.12, 95% CI = (1.48, 3.03); P = 1.65 × 10(-3)) and GSE7390 (HR = 1.76, 95% CI = (1.22, 2.54); P = 9.95 × 10(-4)). Further analysis revealed that the prognostic value of the autophagy signature was independent of known clinical prognostic factors, including age, tumor size, grade, estrogen receptor status, progesterone receptor status, ERBB2 status, lymph node status and TP53 mutation status. Finally, we demonstrated that the autophagy signature could also predict distant metastasis-free survival for breast cancer. © 2015 Wiley Periodicals, Inc.
Classifying lower grade glioma cases according to whole genome gene expression.
Chen, Baoshi; Liang, Tingyu; Yang, Pei; Wang, Haoyuan; Liu, Yanwei; Yang, Fan; You, Gan
2016-11-08
To identify a gene-based signature as a novel prognostic model in lower grade gliomas. A gene signature developed from HOXA7, SLC2A4RG and MN1 could segregate patients into low and high risk score groups with different overall survival (OS), and was validated in TCGA RNA-seq and GSE16011 mRNA array datasets. Receiver operating characteristic (ROC) was performed to show that the three-gene signature was more sensitive and specific than histology, grade, age, IDH1 mutation and 1p/19q co-deletion. Gene Set Enrichment Analysis (GSEA) and GO analysis showed high-risk samples were associated with tumor associated macrophages (TAMs) and highly invasive phenotypes. Moreover, HOXA7-siRNA inhibited migration and invasion in vitro, and downregulated MMP9 at the protein level in U251 glioma cells. A cohort of 164 glioma specimens from the Chinese Glioma Genome Atlas (CGGA) array database were assessed as the training group. TCGA RNA-seq and GSE16011 mRNA array datasets were used for validation. Regression analyses and linear risk score assessment were performed for the identification of the three-gene signature comprising HOXA7, SLC2A4RG and MN1. We established a three-gene signature for lower grade gliomas, which could independently predict overall survival (OS) of lower grade glioma patients with higher sensitivity and specificity compared with other clinical characteristics. These findings indicate that the three-gene signature is a new prognostic model that could provide improved OS prediction and accurate therapies for lower grade glioma patients.
76 FR 62000 - Express Mail Domestic Postage Refund Policy and Waiver of Signature
Federal Register 2010, 2011, 2012, 2013, 2014
2011-10-06
... POSTAL SERVICE 39 CFR Part 111 Express Mail Domestic Postage Refund Policy and Waiver of Signature... days to 30 days after the date of mailing, and to change the Express Mail ``waiver of signature'' standard for domestic items by obtaining an addressee's signature only when the mailer selects the...
Comparison of wheat classification accuracy using different classifiers of the image-100 system
NASA Technical Reports Server (NTRS)
Dejesusparada, N. (Principal Investigator); Chen, S. C.; Moreira, M. A.; Delima, A. M.
1981-01-01
Classification results using single-cell and multi-cell signature acquisition options, a point-by-point Gaussian maximum-likelihood classifier, and K-means clustering of the Image-100 system are presented. Conclusions reached are that: a better indication of correct classification can be provided by using a test area which contains various cover types of the study area; classification accuracy should be evaluated considering both the percentages of correct classification and error of commission; supervised classification approaches are better than K-means clustering; Gaussian distribution maximum likelihood classifier is better than Single-cell and Multi-cell Signature Acquisition Options of the Image-100 system; and in order to obtain a high classification accuracy in a large and heterogeneous crop area, using Gaussian maximum-likelihood classifier, homogeneous spectral subclasses of the study crop should be created to derive training statistics.
Statistical Methods for Passive Vehicle Classification in Urban Traffic Surveillance and Control
DOT National Transportation Integrated Search
1980-01-01
A statistical approach to passive vehicle classification using the phase-shift signature from electromagnetic presence-type vehicle detectors is developed with digitized samples of the analog phase-shift signature, the problem of classifying vehicle ...
Tsai, Yu-Shuen; Aguan, Kripamoy; Pal, Nikhil R.; Chung, I-Fang
2011-01-01
Informative genes from microarray data can be used to construct prediction model and investigate biological mechanisms. Differentially expressed genes, the main targets of most gene selection methods, can be classified as single- and multiple-class specific signature genes. Here, we present a novel gene selection algorithm based on a Group Marker Index (GMI), which is intuitive, of low-computational complexity, and efficient in identification of both types of genes. Most gene selection methods identify only single-class specific signature genes and cannot identify multiple-class specific signature genes easily. Our algorithm can detect de novo certain conditions of multiple-class specificity of a gene and makes use of a novel non-parametric indicator to assess the discrimination ability between classes. Our method is effective even when the sample size is small as well as when the class sizes are significantly different. To compare the effectiveness and robustness we formulate an intuitive template-based method and use four well-known datasets. We demonstrate that our algorithm outperforms the template-based method in difficult cases with unbalanced distribution. Moreover, the multiple-class specific genes are good biomarkers and play important roles in biological pathways. Our literature survey supports that the proposed method identifies unique multiple-class specific marker genes (not reported earlier to be related to cancer) in the Central Nervous System data. It also discovers unique biomarkers indicating the intrinsic difference between subtypes of lung cancer. We also associate the pathway information with the multiple-class specific signature genes and cross-reference to published studies. We find that the identified genes participate in the pathways directly involved in cancer development in leukemia data. Our method gives a promising way to find genes that can involve in pathways of multiple diseases and hence opens up the possibility of using an existing drug on other diseases as well as designing a single drug for multiple diseases. PMID:21909426
Bermudo, R; Abia, D; Mozos, A; García-Cruz, E; Alcaraz, A; Ortiz, Á R; Thomson, T M; Fernández, P L
2011-01-01
Introduction: Currently, final diagnosis of prostate cancer (PCa) is based on histopathological analysis of needle biopsies, but this process often bears uncertainties due to small sample size, tumour focality and pathologist's subjective assessment. Methods: Prostate cancer diagnostic signatures were generated by applying linear discriminant analysis to microarray and real-time RT–PCR (qRT–PCR) data from normal and tumoural prostate tissue samples. Additionally, after removal of biopsy tissues, material washed off from transrectal biopsy needles was used for molecular profiling and discriminant analysis. Results: Linear discriminant analysis applied to microarray data for a set of 318 genes differentially expressed between non-tumoural and tumoural prostate samples produced 26 gene signatures, which classified the 84 samples used with 100% accuracy. To identify signatures potentially useful for the diagnosis of prostate biopsies, surplus material washed off from routine biopsy needles from 53 patients was used to generate qRT–PCR data for a subset of 11 genes. This analysis identified a six-gene signature that correctly assigned the biopsies as benign or tumoural in 92.6% of the cases, with 88.8% sensitivity and 96.1% specificity. Conclusion: Surplus material from prostate needle biopsies can be used for minimal-size gene signature analysis for sensitive and accurate discrimination between non-tumoural and tumoural prostates, without interference with current diagnostic procedures. This approach could be a useful adjunct to current procedures in PCa diagnosis. PMID:22009027
Szabo, Eva; Miller, Mark Steven; Lubet, Ronald A.; You, Ming; Wang, Yian
2017-01-01
Due to exposure to environmental toxicants, a “field cancerization” effect occurs in the lung resulting in the development of a field of initiated but morphologically normal appearing cells in the damaged epithelium of bronchial airways with dysregulated gene expression patterns. Using a mouse model of lung squamous cell carcinoma (SCC), we performed transcriptome sequencing (RNA-Seq) to profile bronchial airway gene expression and found activation of the PI3K and Myc signaling networks in cytologically normal bronchial airway epithelial cells of mice with preneopastic lung SCC lesions, which was reversed by treatment with the PI3K Inhibitor XL-147 and pioglitazone, respectively. Activated MYC signaling was also present in premalignant and tumor tissues from human lung SCC patients. In addition, we identified a key microRNA, mmu-miR-449c-5p, whose suppression significantly up-regulated Myc expression in the normal bronchial airway epithelial cells of mice with early stage SCC lesions. We developed a novel bronchial genomic classifier in mice and validated it in humans. In the classifier, Ppbp (pro-platelet basic protein) was overexpressed 115 fold in the bronchial airways of mice with preneoplastic lung SCC lesions. This is the first report that demonstrates Ppbp as a novel biomarker in the bronchial airway for lung cancer diagnosis. PMID:27935865
2016-02-10
gram positive bacteria) at various time points. We carried out carrying out gene expression analysis for SEB, Dengue , Plague, VEE, Bot toxin, at...to confidently identify transcriptional responses induced by bacteria (anthrax, plague, Brucella), toxins (CT, SEB, BoNTA), or viruses ( Dengue , VEE...P, Celluzzi CM, Marovich M, Subramanian H, Eller M, Widjaja S, Palmer D, Porter K, Sun W, Burgess T: CD40 ligand enhances dengue viral infection of
A translatable predictor of human radiation exposure.
Lucas, Joseph; Dressman, Holly K; Suchindran, Sunil; Nakamura, Mai; Chao, Nelson J; Himburg, Heather; Minor, Kerry; Phillips, Gary; Ross, Joel; Abedi, Majid; Terbrueggen, Robert; Chute, John P
2014-01-01
Terrorism using radiological dirty bombs or improvised nuclear devices is recognized as a major threat to both public health and national security. In the event of a radiological or nuclear disaster, rapid and accurate biodosimetry of thousands of potentially affected individuals will be essential for effective medical management to occur. Currently, health care providers lack an accurate, high-throughput biodosimetric assay which is suitable for the triage of large numbers of radiation injury victims. Here, we describe the development of a biodosimetric assay based on the analysis of irradiated mice, ex vivo-irradiated human peripheral blood (PB) and humans treated with total body irradiation (TBI). Interestingly, a gene expression profile developed via analysis of murine PB radiation response alone was inaccurate in predicting human radiation injury. In contrast, generation of a gene expression profile which incorporated data from ex vivo irradiated human PB and human TBI patients yielded an 18-gene radiation classifier which was highly accurate at predicting human radiation status and discriminating medically relevant radiation dose levels in human samples. Although the patient population was relatively small, the accuracy of this classifier in discriminating radiation dose levels in human TBI patients was not substantially confounded by gender, diagnosis or prior exposure to chemotherapy. We have further incorporated genes from this human radiation signature into a rapid and high-throughput chemical ligation-dependent probe amplification assay (CLPA) which was able to discriminate radiation dose levels in a pilot study of ex vivo irradiated human blood and samples from human TBI patients. Our results illustrate the potential for translation of a human genetic signature for the diagnosis of human radiation exposure and suggest the basis for further testing of CLPA as a candidate biodosimetric assay.
Kim, Minseung; Zorraquino, Violeta; Tagkopoulos, Ilias
2015-03-01
A tantalizing question in cellular physiology is whether the cellular state and environmental conditions can be inferred by the expression signature of an organism. To investigate this relationship, we created an extensive normalized gene expression compendium for the bacterium Escherichia coli that was further enriched with meta-information through an iterative learning procedure. We then constructed an ensemble method to predict environmental and cellular state, including strain, growth phase, medium, oxygen level, antibiotic and carbon source presence. Results show that gene expression is an excellent predictor of environmental structure, with multi-class ensemble models achieving balanced accuracy between 70.0% (±3.5%) to 98.3% (±2.3%) for the various characteristics. Interestingly, this performance can be significantly boosted when environmental and strain characteristics are simultaneously considered, as a composite classifier that captures the inter-dependencies of three characteristics (medium, phase and strain) achieved 10.6% (±1.0%) higher performance than any individual models. Contrary to expectations, only 59% of the top informative genes were also identified as differentially expressed under the respective conditions. Functional analysis of the respective genetic signatures implicates a wide spectrum of Gene Ontology terms and KEGG pathways with condition-specific information content, including iron transport, transferases, and enterobactin synthesis. Further experimental phenotypic-to-genotypic mapping that we conducted for knock-out mutants argues for the information content of top-ranked genes. This work demonstrates the degree at which genome-scale transcriptional information can be predictive of latent, heterogeneous and seemingly disparate phenotypic and environmental characteristics, with far-reaching applications.
NASA Astrophysics Data System (ADS)
Schmalz, M.; Ritter, G.
Accurate multispectral or hyperspectral signature classification is key to the nonimaging detection and recognition of space objects. Additionally, signature classification accuracy depends on accurate spectral endmember determination [1]. Previous approaches to endmember computation and signature classification were based on linear operators or neural networks (NNs) expressed in terms of the algebra (R, +, x) [1,2]. Unfortunately, class separation in these methods tends to be suboptimal, and the number of signatures that can be accurately classified often depends linearly on the number of NN inputs. This can lead to poor endmember distinction, as well as potentially significant classification errors in the presence of noise or densely interleaved signatures. In contrast to traditional CNNs, autoassociative morphological memories (AMM) are a construct similar to Hopfield autoassociatived memories defined on the (R, +, ?,?) lattice algebra [3]. Unlimited storage and perfect recall of noiseless real valued patterns has been proven for AMMs [4]. However, AMMs suffer from sensitivity to specific noise models, that can be characterized as erosive and dilative noise. On the other hand, the prior definition of a set of endmembers corresponds to material spectra lying on vertices of the minimum convex region covering the image data. These vertices can be characterized as morphologically independent patterns. It has further been shown that AMMs can be based on dendritic computation [3,6]. These techniques yield improved accuracy and class segmentation/separation ability in the presence of highly interleaved signature data. In this paper, we present a procedure for endmember determination based on AMM noise sensitivity, which employs morphological dendritic computation. We show that detected endmembers can be exploited by AMM based classification techniques, to achieve accurate signature classification in the presence of noise, closely spaced or interleaved signatures, and simulated camera optical distortions. In particular, we examine two critical cases: (1) classification of multiple closely spaced signatures that are difficult to separate using distance measures, and (2) classification of materials in simulated hyperspectral images of spaceborne satellites. In each case, test data are derived from a NASA database of space material signatures. Additional analysis pertains to computational complexity and noise sensitivity, which are superior to classical NN based techniques.
Verhagen, Lilly M; Zomer, Aldert; Maes, Mailis; Villalba, Julian A; Del Nogal, Berenice; Eleveld, Marc; van Hijum, Sacha Aft; de Waard, Jacobus H; Hermans, Peter Wm
2013-02-01
Tuberculosis (TB) continues to cause a high toll of disease and death among children worldwide. The diagnosis of childhood TB is challenged by the paucibacillary nature of the disease and the difficulties in obtaining specimens. Whereas scientific and clinical research efforts to develop novel diagnostic tools have focused on TB in adults, childhood TB has been relatively neglected. Blood transcriptional profiling has improved our understanding of disease pathogenesis of adult TB and may offer future leads for diagnosis and treatment. No studies applying gene expression profiling of children with TB have been published so far. We identified a 116-gene signature set that showed an average prediction error of 11% for TB vs. latent TB infection (LTBI) and for TB vs. LTBI vs. healthy controls (HC) in our dataset. A minimal gene set of only 9 genes showed the same prediction error of 11% for TB vs. LTBI in our dataset. Furthermore, this minimal set showed a significant discriminatory value for TB vs. LTBI for all previously published adult studies using whole blood gene expression, with average prediction errors between 17% and 23%. In order to identify a robust representative gene set that would perform well in populations of different genetic backgrounds, we selected ten genes that were highly discriminative between TB, LTBI and HC in all literature datasets as well as in our dataset. Functional annotation of these genes highlights a possible role for genes involved in calcium signaling and calcium metabolism as biomarkers for active TB. These ten genes were validated by quantitative real-time polymerase chain reaction in an additional cohort of 54 Warao Amerindian children with LTBI, HC and non-TB pneumonia. Decision tree analysis indicated that five of the ten genes were sufficient to classify 78% of the TB cases correctly with no LTBI subjects wrongly classified as TB (100% specificity). Our data justify the further exploration of our signature set as biomarkers for potential childhood TB diagnosis. We show that, as the identification of different biomarkers in ethnically distinct cohorts is apparent, it is important to cross-validate newly identified markers in all available cohorts.
2013-01-01
Background Tuberculosis (TB) continues to cause a high toll of disease and death among children worldwide. The diagnosis of childhood TB is challenged by the paucibacillary nature of the disease and the difficulties in obtaining specimens. Whereas scientific and clinical research efforts to develop novel diagnostic tools have focused on TB in adults, childhood TB has been relatively neglected. Blood transcriptional profiling has improved our understanding of disease pathogenesis of adult TB and may offer future leads for diagnosis and treatment. No studies applying gene expression profiling of children with TB have been published so far. Results We identified a 116-gene signature set that showed an average prediction error of 11% for TB vs. latent TB infection (LTBI) and for TB vs. LTBI vs. healthy controls (HC) in our dataset. A minimal gene set of only 9 genes showed the same prediction error of 11% for TB vs. LTBI in our dataset. Furthermore, this minimal set showed a significant discriminatory value for TB vs. LTBI for all previously published adult studies using whole blood gene expression, with average prediction errors between 17% and 23%. In order to identify a robust representative gene set that would perform well in populations of different genetic backgrounds, we selected ten genes that were highly discriminative between TB, LTBI and HC in all literature datasets as well as in our dataset. Functional annotation of these genes highlights a possible role for genes involved in calcium signaling and calcium metabolism as biomarkers for active TB. These ten genes were validated by quantitative real-time polymerase chain reaction in an additional cohort of 54 Warao Amerindian children with LTBI, HC and non-TB pneumonia. Decision tree analysis indicated that five of the ten genes were sufficient to classify 78% of the TB cases correctly with no LTBI subjects wrongly classified as TB (100% specificity). Conclusions Our data justify the further exploration of our signature set as biomarkers for potential childhood TB diagnosis. We show that, as the identification of different biomarkers in ethnically distinct cohorts is apparent, it is important to cross-validate newly identified markers in all available cohorts. PMID:23375113
Albrekt, Ann-Sofie; Borrebaeck, Carl A. K.; Lindstedt, Malin
2015-01-01
Background Repeated exposure to certain low molecular weight (LMW) chemical compounds may result in development of allergic reactions in the skin or in the respiratory tract. In most cases, a certain LMW compound selectively sensitize the skin, giving rise to allergic contact dermatitis (ACD), or the respiratory tract, giving rise to occupational asthma (OA). To limit occurrence of allergic diseases, efforts are currently being made to develop predictive assays that accurately identify chemicals capable of inducing such reactions. However, while a few promising methods for prediction of skin sensitization have been described, to date no validated method, in vitro or in vivo, exists that is able to accurately classify chemicals as respiratory sensitizers. Results Recently, we presented the in vitro based Genomic Allergen Rapid Detection (GARD) assay as a novel testing strategy for classification of skin sensitizing chemicals based on measurement of a genomic biomarker signature. We have expanded the applicability domain of the GARD assay to classify also respiratory sensitizers by identifying a separate biomarker signature containing 389 differentially regulated genes for respiratory sensitizers in comparison to non-respiratory sensitizers. By using an independent data set in combination with supervised machine learning, we validated the assay, showing that the identified genomic biomarker is able to accurately classify respiratory sensitizers. Conclusions We have identified a genomic biomarker signature for classification of respiratory sensitizers. Combining this newly identified biomarker signature with our previously identified biomarker signature for classification of skin sensitizers, we have developed a novel in vitro testing strategy with a potent ability to predict both skin and respiratory sensitization in the same sample. PMID:25760038
Nsengimana, Jérémie; Laye, Jon; Filia, Anastasia; Walker, Christy; Jewell, Rosalyn; Van den Oord, Joost J; Wolter, Pascal; Patel, Poulam; Sucker, Antje; Schadendorf, Dirk; Jönsson, Göran B; Bishop, D Timothy; Newton-Bishop, Julia
2015-05-10
Development and validation of robust molecular biomarkers has so far been limited in melanoma research. In this paper we used a large population-based cohort to replicate two published gene signatures for melanoma classification. We assessed the signatures prognostic value and explored their biological significance by correlating them with factors known to be associated with survival (vitamin D) or etiological routes (nevi, sun sensitivity and telomere length). Genomewide microarray gene expressions were profiled in 300 archived tumors (224 primaries, 76 secondaries). The two gene signatures classified up to 96% of our samples and showed strong correlation with melanoma specific survival (P=3 x 10(-4)), Breslow thickness (P=5 x 10(-10)), ulceration (P=9.x10-8) and mitotic rate (P=3 x 10(-7)), adding prognostic value over AJCC stage (adjusted hazard ratio 1.79, 95%CI 1.13-2.83), as previously reported. Furthermore, molecular subtypes were associated with season-adjusted serum vitamin D at diagnosis (P=0.04) and genetically predicted telomere length (P=0.03). Specifically, molecular high-grade tumors were more frequent in patients with lower vitamin D levels whereas high immune tumors came from patients with predicted shorter telomeres. Our data confirm the utility of molecular biomarkers in melanoma prognostic estimation using tiny archived specimens and shed light on biological mechanisms likely to impact on cancer initiation and progression.
Lae, Marick; Moarii, Matahi; Sadacca, Benjamin; Pinheiro, Alice; Galliot, Marion; Abecassis, Judith; Laurent, Cecile; Reyal, Fabien
2016-01-01
Introduction HER2-positive breast cancer (BC) is a heterogeneous group of aggressive breast cancers, the prognosis of which has greatly improved since the introduction of treatments targeting HER2. However, these tumors may display intrinsic or acquired resistance to treatment, and classifiers of HER2-positive tumors are required to improve the prediction of prognosis and to develop novel therapeutic interventions. Methods We analyzed 2893 primary human breast cancer samples from 21 publicly available datasets and developed a six-metagene signature on a training set of 448 HER2-positive BC. We then used external public datasets to assess the ability of these metagenes to predict the response to chemotherapy (Ignatiadis dataset), and prognosis (METABRIC dataset). Results We identified a six-metagene signature (138 genes) containing metagenes enriched in different gene ontologies. The gene clusters were named as follows: Immunity, Tumor suppressors/proliferation, Interferon, Signal transduction, Hormone/survival and Matrix clusters. In all datasets, the Immunity metagene was less strongly expressed in ER-positive than in ER-negative tumors, and was inversely correlated with the Hormonal/survival metagene. Within the signature, multivariate analyses showed that strong expression of the “Immunity” metagene was associated with higher pCR rates after NAC (OR = 3.71[1.28–11.91], p = 0.019) than weak expression, and with a better prognosis in HER2-positive/ER-negative breast cancers (HR = 0.58 [0.36–0.94], p = 0.026). Immunity metagene expression was associated with the presence of tumor-infiltrating lymphocytes (TILs). Conclusion The identification of a predictive and prognostic immune module in HER2-positive BC confirms the need for clinical testing for immune checkpoint modulators and vaccines for this specific subtype. The inverse correlation between Immunity and hormone pathways opens research perspectives and deserves further investigation. PMID:28005906
2012-01-01
Background The criteria for choosing relevant cell lines among a vast panel of available intestinal-derived lines exhibiting a wide range of functional properties are still ill-defined. The objective of this study was, therefore, to establish objective criteria for choosing relevant cell lines to assess their appropriateness as tumor models as well as for drug absorption studies. Results We made use of publicly available expression signatures and cell based functional assays to delineate differences between various intestinal colon carcinoma cell lines and normal intestinal epithelium. We have compared a panel of intestinal cell lines with patient-derived normal and tumor epithelium and classified them according to traits relating to oncogenic pathway activity, epithelial-mesenchymal transition (EMT) and stemness, migratory properties, proliferative activity, transporter expression profiles and chemosensitivity. For example, SW480 represent an EMT-high, migratory phenotype and scored highest in terms of signatures associated to worse overall survival and higher risk of recurrence based on patient derived databases. On the other hand, differentiated HT29 and T84 cells showed gene expression patterns closest to tumor bulk derived cells. Regarding drug absorption, we confirmed that differentiated Caco-2 cells are the model of choice for active uptake studies in the small intestine. Regarding chemosensitivity we were unable to confirm a recently proposed association of chemo-resistance with EMT traits. However, a novel signature was identified through mining of NCI60 GI50 values that allowed to rank the panel of intestinal cell lines according to their drug responsiveness to commonly used chemotherapeutics. Conclusions This study presents a straightforward strategy to exploit publicly available gene expression data to guide the choice of cell-based models. While this approach does not overcome the major limitations of such models, introducing a rank order of selected features may allow selecting model cell lines that are more adapted and pertinent to the addressed biological question. PMID:22726358
Faruki, Hawazin; Mayhew, Gregory M; Fan, Cheng; Wilkerson, Matthew D; Parker, Scott; Kam-Morgan, Lauren; Eisenberg, Marcia; Horten, Bruce; Hayes, D Neil; Perou, Charles M; Lai-Goldman, Myla
2016-06-01
Context .- A histologic classification of lung cancer subtypes is essential in guiding therapeutic management. Objective .- To complement morphology-based classification of lung tumors, a previously developed lung subtyping panel (LSP) of 57 genes was tested using multiple public fresh-frozen gene-expression data sets and a prospectively collected set of formalin-fixed, paraffin-embedded lung tumor samples. Design .- The LSP gene-expression signature was evaluated in multiple lung cancer gene-expression data sets totaling 2177 patients collected from 4 platforms: Illumina RNAseq (San Diego, California), Agilent (Santa Clara, California) and Affymetrix (Santa Clara) microarrays, and quantitative reverse transcription-polymerase chain reaction. Gene centroids were calculated for each of 3 genomic-defined subtypes: adenocarcinoma, squamous cell carcinoma, and neuroendocrine, the latter of which encompassed both small cell carcinoma and carcinoid. Classification by LSP into 3 subtypes was evaluated in both fresh-frozen and formalin-fixed, paraffin-embedded tumor samples, and agreement with the original morphology-based diagnosis was determined. Results .- The LSP-based classifications demonstrated overall agreement with the original clinical diagnosis ranging from 78% (251 of 322) to 91% (492 of 538 and 869 of 951) in the fresh-frozen public data sets and 84% (65 of 77) in the formalin-fixed, paraffin-embedded data set. The LSP performance was independent of tissue-preservation method and gene-expression platform. Secondary, blinded pathology review of formalin-fixed, paraffin-embedded samples demonstrated concordance of 82% (63 of 77) with the original morphology diagnosis. Conclusions .- The LSP gene-expression signature is a reproducible and objective method for classifying lung tumors and demonstrates good concordance with morphology-based classification across multiple data sets. The LSP panel can supplement morphologic assessment of lung cancers, particularly when classification by standard methods is challenging.
Di, Chao; Yuan, Jiapei; Wu, Yue; Li, Jingrui; Lin, Huixin; Hu, Long; Zhang, Ting; Qi, Yijun; Gerstein, Mark B; Guo, Yan; Lu, Zhi John
2014-12-01
Recently, in addition to poly(A)+ long non-coding RNAs (lncRNAs), many lncRNAs without poly(A) tails, have been characterized in mammals. However, the non-polyA lncRNAs and their conserved motifs, especially those associated with environmental stresses, have not been fully investigated in plant genomes. We performed poly(A)- RNA-seq for seedlings of Arabidopsis thaliana under four stress conditions, and predicted lncRNA transcripts. We classified the lncRNAs into three confidence levels according to their expression patterns, epigenetic signatures and RNA secondary structures. Then, we further classified the lncRNAs to poly(A)+ and poly(A)- transcripts. Compared with poly(A)+ lncRNAs and coding genes, we found that poly(A)- lncRNAs tend to have shorter transcripts and lower expression levels, and they show significant expression specificity in response to stresses. In addition, their differential expression is significantly enriched in drought condition and depleted in heat condition. Overall, we identified 245 poly(A)+ and 58 poly(A)- lncRNAs that are differentially expressed under various stress stimuli. The differential expression was validated by qRT-PCR, and the signaling pathways involved were supported by specific binding of transcription factors (TFs), phytochrome-interacting factor 4 (PIF4) and PIF5. Moreover, we found many conserved sequence and structural motifs of lncRNAs from different functional groups (e.g. a UUC motif responding to salt and a AU-rich stem-loop responding to cold), indicated that the conserved elements might be responsible for the stress-responsive functions of lncRNAs. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.
Seismic signature analysis for discrimination of people from animals
NASA Astrophysics Data System (ADS)
Damarla, Thyagaraju; Mehmood, Asif; Sabatier, James M.
2013-05-01
Cadence analysis has been the main focus for discriminating between the seismic signatures of people and animals. However, cadence analysis fails when multiple targets are generating the signatures. We analyze the mechanism of human walking and the signature generated by a human walker, and compare it with the signature generated by a quadruped. We develop Fourier-based analysis to differentiate the human signatures from the animal signatures. We extract a set of basis vectors to represent the human and animal signatures using non-negative matrix factorization, and use them to separate and classify both the targets. Grazing animals such as deer, cows, etc., often produce sporadic signals as they move around from patch to patch of grass and one must characterize them so as to differentiate their signatures from signatures generated by a horse steadily walking along a path. These differences in the signatures are used in developing a robust algorithm to distinguish the signatures of animals from humans. The algorithm is tested on real data collected in a remote area.
Ion channel gene expression predicts survival in glioma patients
Wang, Rong; Gurguis, Christopher I.; Gu, Wanjun; Ko, Eun A; Lim, Inja; Bang, Hyoweon; Zhou, Tong; Ko, Jae-Hong
2015-01-01
Ion channels are important regulators in cell proliferation, migration, and apoptosis. The malfunction and/or aberrant expression of ion channels may disrupt these important biological processes and influence cancer progression. In this study, we investigate the expression pattern of ion channel genes in glioma. We designate 18 ion channel genes that are differentially expressed in high-grade glioma as a prognostic molecular signature. This ion channel gene expression based signature predicts glioma outcome in three independent validation cohorts. Interestingly, 16 of these 18 genes were down-regulated in high-grade glioma. This signature is independent of traditional clinical, molecular, and histological factors. Resampling tests indicate that the prognostic power of the signature outperforms random gene sets selected from human genome in all the validation cohorts. More importantly, this signature performs better than the random gene signatures selected from glioma-associated genes in two out of three validation datasets. This study implicates ion channels in brain cancer, thus expanding on knowledge of their roles in other cancers. Individualized profiling of ion channel gene expression serves as a superior and independent prognostic tool for glioma patients. PMID:26235283
Connectivity map identifies HDAC inhibition as a treatment option of high-risk hepatoblastoma.
Beck, Alexander; Eberherr, Corinna; Hagemann, Michaela; Cairo, Stefano; Häberle, Beate; Vokuhl, Christian; von Schweinitz, Dietrich; Kappler, Roland
2016-11-01
Hepatoblastoma (HB) is the most common liver tumor of childhood, usually occurring in children under the age of 3 y. The prognosis of patients presenting with distant metastasis, vascular invasion and advanced tumor stages remains poor and children that do survive often face severe late effects from the aggressive chemotherapy regimen. To identify potential new therapeutics for high risk HB we used a 1,000-gene expression signature as input for a Connectivity Map (CMap) analysis, which predicted histone deacetylase (HDAC) inhibitors as a promising therapy option. Subsequent expression analysis of primary HB and HB cell lines revealed a general overexpression of HDAC1 and HDAC2, which has been suggested to be predictive for the efficacy of HDAC inhibition. Accordingly, treatment of HB cells with the HDAC inhibitors SAHA and MC1568 resulted in a potent reduction of cell viability, induction of apoptosis, reactivation of epigenetically suppressed tumor suppressor genes, and the reversion of the 16-gene HB classifier toward the more favorable expression signature. Most importantly, the combination of HDAC inhibitors and cisplatin - a major chemotherapeutic agent of HB treatment - revealed a strong synergistic effect, even at significantly reduced doses of cisplatin. Our findings suggest that HDAC inhibitors skew HB cells toward a more favorable prognostic phenotype through changes in gene expression, thus indicating a targeted molecular mechanism that seems to enhance the anti-proliferative effects of conventional chemotherapy. Thus, adding HDAC inhibitors to the treatment regimen of high risk HB could potentially improve outcomes and reduce severe late effects.
L1000CDS2: LINCS L1000 characteristic direction signatures search engine.
Duan, Qiaonan; Reid, St Patrick; Clark, Neil R; Wang, Zichen; Fernandez, Nicolas F; Rouillard, Andrew D; Readhead, Ben; Tritsch, Sarah R; Hodos, Rachel; Hafner, Marc; Niepel, Mario; Sorger, Peter K; Dudley, Joel T; Bavari, Sina; Panchal, Rekha G; Ma'ayan, Avi
2016-01-01
The library of integrated network-based cellular signatures (LINCS) L1000 data set currently comprises of over a million gene expression profiles of chemically perturbed human cell lines. Through unique several intrinsic and extrinsic benchmarking schemes, we demonstrate that processing the L1000 data with the characteristic direction (CD) method significantly improves signal to noise compared with the MODZ method currently used to compute L1000 signatures. The CD processed L1000 signatures are served through a state-of-the-art web-based search engine application called L1000CDS 2 . The L1000CDS 2 search engine provides prioritization of thousands of small-molecule signatures, and their pairwise combinations, predicted to either mimic or reverse an input gene expression signature using two methods. The L1000CDS 2 search engine also predicts drug targets for all the small molecules profiled by the L1000 assay that we processed. Targets are predicted by computing the cosine similarity between the L1000 small-molecule signatures and a large collection of signatures extracted from the gene expression omnibus (GEO) for single-gene perturbations in mammalian cells. We applied L1000CDS 2 to prioritize small molecules that are predicted to reverse expression in 670 disease signatures also extracted from GEO, and prioritized small molecules that can mimic expression of 22 endogenous ligand signatures profiled by the L1000 assay. As a case study, to further demonstrate the utility of L1000CDS 2 , we collected expression signatures from human cells infected with Ebola virus at 30, 60 and 120 min. Querying these signatures with L1000CDS 2 we identified kenpaullone, a GSK3B/CDK2 inhibitor that we show, in subsequent experiments, has a dose-dependent efficacy in inhibiting Ebola infection in vitro without causing cellular toxicity in human cell lines. In summary, the L1000CDS 2 tool can be applied in many biological and biomedical settings, while improving the extraction of knowledge from the LINCS L1000 resource.
Prognostic Power of a Tumor Differentiation Gene Signature for Bladder Urothelial Carcinomas.
Mo, Qianxing; Nikolos, Fotis; Chen, Fengju; Tramel, Zoe; Lee, Yu-Cheng; Hayashi, Kazukuni; Xiao, Jing; Shen, Jianjun; Chan, Keith Syson
2018-05-01
Muscle-invasive bladder cancers (MIBCs) cause approximately 150 000 deaths per year worldwide. Survival for MIBC patients is heterogeneous, with no clinically validated molecular markers that predict clinical outcome. Non-MIBCs (NMIBCs) generally have favorable outcome; however, a portion progress to MIBC. Hence, development of a prognostic tool that can guide decision-making is crucial for improving clinical management of bladder urothelial carcinomas. Tumor grade is defined by pathologic evaluation of tumor cell differentiation, and it often associates with clinical outcome. The current study extrapolates this conventional wisdom and combines it with molecular profiling. We developed an 18-gene signature that molecularly defines urothelial cellular differentiation, thus classifying MIBCs and NMIBCs into two subgroups: basal and differentiated. We evaluated the prognostic capability of this "tumor differentiation signature" and three other existing gene signatures including the The Cancer Genome Atlas (TCGA; 2707 genes), MD Anderson Cancer Center (MDA; 2252 genes/2697 probes), and University of North Carolina at Chapel Hill (UNC; 47 genes) using five gene expression data sets derived from MIBC and NMIBC patients. All statistical tests were two-sided. The tumor differentiation signature demonstrated consistency and statistical robustness toward stratifying MIBC patients into different overall survival outcomes (TCGA cohort 1, P = .03; MDA discovery, P = .009; MDA validation, P = .01), while the other signatures were not as consistent. In addition, we analyzed the progression (Ta/T1 progressing to ≥T2) probability of NMIBCs. NMIBC patients with a basal tumor differentiation signature associated with worse progression outcome (P = .008). Gene functional term enrichment and gene set enrichment analyses revealed that genes involved in the biologic process of immune response and inflammatory response are among the most elevated within basal bladder cancers, implicating them as candidates for immune checkpoint therapies. These results provide definitive evidence that a biology-prioritizing clustering methodology generates meaningful insights into patient stratification and reveals targetable molecular pathways to impact future therapeutic approach.
A signature inferred from Drosophila mitotic genes predicts survival of breast cancer patients.
Damasco, Christian; Lembo, Antonio; Somma, Maria Patrizia; Gatti, Maurizio; Di Cunto, Ferdinando; Provero, Paolo
2011-02-28
The classification of breast cancer patients into risk groups provides a powerful tool for the identification of patients who will benefit from aggressive systemic therapy. The analysis of microarray data has generated several gene expression signatures that improve diagnosis and allow risk assessment. There is also evidence that cell proliferation-related genes have a high predictive power within these signatures. We thus constructed a gene expression signature (the DM signature) using the human orthologues of 108 Drosophila melanogaster genes required for either the maintenance of chromosome integrity (36 genes) or mitotic division (72 genes). The DM signature has minimal overlap with the extant signatures and is highly predictive of survival in 5 large breast cancer datasets. In addition, we show that the DM signature outperforms many widely used breast cancer signatures in predictive power, and performs comparably to other proliferation-based signatures. For most genes of the DM signature, an increased expression is negatively correlated with patient survival. The genes that provide the highest contribution to the predictive power of the DM signature are those involved in cytokinesis. This finding highlights cytokinesis as an important marker in breast cancer prognosis and as a possible target for antimitotic therapies.
Shin, Heesun; Günther, Oliver; Hollander, Zsuzsanna; Wilson-McManus, Janet E.; Ng, Raymond T.; Balshaw, Robert; Keown, Paul A.; McMaster, Robert; McManus, Bruce M.; Isbel, Nicole M.; Knoll, Greg; Tebbutt, Scott J.
2014-01-01
In this study, we explored a time course of peripheral whole blood transcriptomes from kidney transplantation patients who either experienced an acute rejection episode or did not in order to better delineate the immunological and biological processes measureable in blood leukocytes that are associated with acute renal allograft rejection. Using microarrays, we generated gene expression data from 24 acute rejectors and 24 nonrejectors. We filtered the data to obtain the most unambiguous and robustly expressing probe sets and selected a subset of patients with the clearest phenotype. We then performed a data-driven exploratory analysis using data reduction and differential gene expression analysis tools in order to reveal gene expression signatures associated with acute allograft rejection. Using a template-matching algorithm, we then expanded our analysis to include time course data, identifying genes whose expression is modulated leading up to acute rejection. We have identified molecular phenotypes associated with acute renal allograft rejection, including a significantly upregulated signature of neutrophil activation and accumulation following transplant surgery that is common to both acute rejectors and nonrejectors. Our analysis shows that this expression signature appears to stabilize over time in nonrejectors but persists in patients who go on to reject the transplanted organ. In addition, we describe an expression signature characteristic of lymphocyte activity and proliferation. This lymphocyte signature is significantly downregulated in both acute rejectors and nonrejectors following surgery; however, patients who go on to reject the organ show a persistent downregulation of this signature relative to the neutrophil signature. PMID:24526836
NASA Astrophysics Data System (ADS)
Khosravi, Farhad; Trainor, Patrick; Rai, Shesh N.; Kloecker, Goetz; Wickstrom, Eric; Panchapakesan, Balaji
2016-04-01
We demonstrate the rapid and label-free capture of breast cancer cells spiked in buffy coats using nanotube-antibody micro-arrays. Single wall carbon nanotube arrays were manufactured using photo-lithography, metal deposition, and etching techniques. Anti-epithelial cell adhesion molecule (EpCAM) antibodies were functionalized to the surface of the nanotube devices using 1-pyrene-butanoic acid succinimidyl ester functionalization method. Following functionalization, plain buffy coat and MCF7 cell spiked buffy coats were adsorbed on to the nanotube device and electrical signatures were recorded for differences in interaction between samples. A statistical classifier for the ‘liquid biopsy’ was developed to create a predictive model based on dynamic time warping to classify device electrical signals that corresponded to plain (control) or spiked buffy coats (case). In training test, the device electrical signals originating from buffy versus spiked buffy samples were classified with ˜100% sensitivity, ˜91% specificity and ˜96% accuracy. In the blinded test, the signals were classified with ˜91% sensitivity, ˜82% specificity and ˜86% accuracy. A heatmap was generated to visually capture the relationship between electrical signatures and the sample condition. Confocal microscopic analysis of devices that were classified as spiked buffy coats based on their electrical signatures confirmed the presence of cancer cells, their attachment to the device and overexpression of EpCAM receptors. The cell numbers were counted to be ˜1-17 cells per 5 μl per device suggesting single cell sensitivity in spiked buffy coats that is scalable to higher volumes using the micro-arrays.
NASA Technical Reports Server (NTRS)
Thadani, S. G.
1977-01-01
The Maximum Likelihood Estimation of Signature Transformation (MLEST) algorithm is used to obtain maximum likelihood estimates (MLE) of affine transformation. The algorithm has been evaluated for three sets of data: simulated (training and recognition segment pairs), consecutive-day (data gathered from Landsat images), and geographical-extension (large-area crop inventory experiment) data sets. For each set, MLEST signature extension runs were made to determine MLE values and the affine-transformed training segment signatures were used to classify the recognition segments. The classification results were used to estimate wheat proportions at 0 and 1% threshold values.
Stromal signatures in endometrioid endometrial carcinomas.
Espinosa, Iñigo; Catasus, Lluis; D' Angelo, Emanuela; Mozos, Ana; Pedrola, Nuria; Bértolo, Cristina; Ferrer, Irene; Zannoni, Gian Franco; West, Robert B; van de Rijn, Matt; Matias-Guiu, Xavier; Prat, Jaime
2014-04-01
The pattern of myometrial invasion in endometrioid endometrial carcinomas varies considerably; ie, from widely scattered glands and cell nests, often associated with a fibromyxoid stromal reaction (desmoplasia) and/or a lymphocytic infiltrate, to invasive glands with little or no stromal response. Recently, two distinct stromal signatures derived from a macrophage response (colony-stimulating factor 1, CSF1) and a fibroblastic response (desmoid-type fibromatosis, DTF) were identified in breast carcinomas and correlated with clinicopathologic features including outcome. In this study, we explored whether these stromal signatures also apply to endometrioid carcinomas and how their expression patterns correlated with morphologic changes. We studied the stromal signatures both by immunohistochemistry and in situ hybridization in 98 primary endometrioid carcinomas with (87 cases) and without (11 cases) myometrial invasion as well as in the corresponding regional lymph nodes metatases of 9 myoinvasive tumors. Desmoplasia correlated positively with the DTF expression signature. Likewise, mononuclear infiltrates were found in the stroma of tumors expressing CSF1. Twenty-four out of eighty-seven (27%) myoinvasive endometrioid carcinomas were positive for the macrophage signature and thirteen out of eighty-seven (15%) expressed the fibroblast signature. Eleven additional cases were positive for both DTF and CSF1 signatures (11/87; 13%). However, over half of the cases (39/87; 45%) and the majority of the non-myoinvasive tumors (8/11; 73%) failed to express any of the two stromal signatures. The macrophage response (CSF1) was associated with higher tumor grade, lymphovascular invasion, and PIK3CA mutations (P<0.05). There was a concordance in the expression of the CSF1 signature in the primary tumors and their corresponding lymph node metastases. This study is the first characterization of stromal signatures in endometrioid carcinomas. Our findings shed new light on the relationship between genetically different endometrioid carcinomas and various stromal responses. Preservation of the CSF1 macrophage stromal response in the metastases leds support to targeting the CSF1 pathway in endometrioid endometrial carcinomas.
Hunter, M Colby; Pozhitkov, Alex E; Noble, Peter A
2016-12-01
Conceptual models suggest that certain microorganisms (e.g., the "red" complex) are indicative of a specific disease state (e.g., periodontitis); however, recent studies have questioned the validity of these models. Here, the abundances of 500+ microbial species were determined in 16 patients with clinical signs of one of the following oral conditions: periodontitis, established caries, edentulism, and oral health. Our goal was to determine if the abundances of certain microorganisms reflect dysbiosis or a specific clinical condition that could be used as a 'signature' for dental research. Microbial abundances were determined by the analysis of 138,718 calibrated probes using Gene Meter methodology. Each 16S rRNA gene was targeted by an average of 194 unique probes (n=25nt). The calibration involved diluting pooled gene target samples, hybridizing each dilution to a DNA microarray, and fitting the probe intensities to adsorption models. The fit of the model to the experimental data was used to assess individual and aggregate probe behavior; good fits (R 2 >0.90) were retained for back-calculating microbial abundances from patient samples. The abundance of a gene was determined from the median of all calibrated individual probes or from the calibrated abundance of all aggregated probes. With the exception of genes with low abundances (<2 arbitrary units), the abundances determined by the different calibrations were highly correlated (r~1.0). Seventeen genera were classified as 'signatures of dysbiosis' because they had significantly higher abundances in patients with periodontitis and edentulism when contrasted with health. Similarly, 13 genera were classified as 'signatures of periodontitis', and 14 genera were classified as 'signatures of edentulism'. The signatures could be used, individually or in combination, to assess the clinical status of a patient (e.g., evaluating treatments such as antibiotic therapies). Comparisons of the same patient samples revealed high false negatives (45%) for next-generation-sequencing results and low false positives (7%) for Gene Meter results. Copyright © 2016 Elsevier B.V. All rights reserved.
Is interstellar archeology possible?
NASA Astrophysics Data System (ADS)
Carrigan, Richard A.
2012-09-01
Searching for signatures of cosmic-scale archeological artifacts such as Dyson spheres is an interesting alternative to conventional radio SETI. Uncovering such an artifact does not require the intentional transmission of a signal on the part of the original civilization. This type of search is called interstellar archeology or sometimes cosmic archeology. A variety of interstellar archeology signatures is discussed including non-natural planetary atmospheric constituents, stellar doping, Dyson spheres, as well as signatures of stellar, and galactic-scale engineering. The concept of a Fermi bubble due to interstellar migration is reviewed in the discussion of galactic signatures. These potential interstellar archeological signatures are classified using the Kardashev scale. A modified Drake equation is introduced. With few exceptions interstellar archeological signatures are clouded and beyond current technological capabilities. However SETI for so-called cultural transmissions and planetary atmosphere signatures are within reach.
Online signature recognition using principal component analysis and artificial neural network
NASA Astrophysics Data System (ADS)
Hwang, Seung-Jun; Park, Seung-Je; Baek, Joong-Hwan
2016-12-01
In this paper, we propose an algorithm for on-line signature recognition using fingertip point in the air from the depth image acquired by Kinect. We extract 10 statistical features from X, Y, Z axis, which are invariant to changes in shifting and scaling of the signature trajectories in three-dimensional space. Artificial neural network is adopted to solve the complex signature classification problem. 30 dimensional features are converted into 10 principal components using principal component analysis, which is 99.02% of total variances. We implement the proposed algorithm and test to actual on-line signatures. In experiment, we verify the proposed method is successful to classify 15 different on-line signatures. Experimental result shows 98.47% of recognition rate when using only 10 feature vectors.
Glioma IL13Rα2 Is Associated with Mesenchymal Signature Gene Expression and Poor Patient Prognosis
Starr, Renate; Deng, Xutao; Badie, Behnam; Yuan, Yate-Ching; Forman, Stephen J.; Barish, Michael E.
2013-01-01
A major challenge for successful immunotherapy against glioma is the identification and characterization of validated targets. We have taken a bioinformatics approach towards understanding the biological context of IL-13 receptor α2 (IL13Rα2) expression in brain tumors, and its functional significance for patient survival. Querying multiple gene expression databases, we show that IL13Rα2 expression increases with glioma malignancy grade, and expression for high-grade tumors is bimodal, with approximately 58% of WHO grade IV gliomas over-expressing this receptor. By several measures, IL13Rα2 expression in patient samples and low-passage primary glioma lines most consistently correlates with the expression of signature genes defining mesenchymal subclass tumors and negatively correlates with proneural signature genes as defined by two studies. Positive associations were also noted with proliferative signature genes, whereas no consistent associations were found with either classical or neural signature genes. Probing the potential functional consequences of this mesenchymal association through IPA analysis suggests that IL13Rα2 expression is associated with activation of proinflammatory and immune pathways characteristic of mesenchymal subclass tumors. In addition, survival analyses indicate that IL13Rα2 over-expression is associated with poor patient prognosis, a single gene correlation ranking IL13Rα2 in the top ~1% of total gene expression probes with regard to survival association with WHO IV gliomas. This study better defines the functional consequences of IL13Rα2 expression by demonstrating association with mesenchymal signature gene expression and poor patient prognosis. It thus highlights the utility of IL13Rα2 as a therapeutic target, and helps define patient populations most likely to respond to immunotherapy in present and future clinical trials. PMID:24204956
Glioma IL13Rα2 is associated with mesenchymal signature gene expression and poor patient prognosis.
Brown, Christine E; Warden, Charles D; Starr, Renate; Deng, Xutao; Badie, Behnam; Yuan, Yate-Ching; Forman, Stephen J; Barish, Michael E
2013-01-01
A major challenge for successful immunotherapy against glioma is the identification and characterization of validated targets. We have taken a bioinformatics approach towards understanding the biological context of IL-13 receptor α2 (IL13Rα2) expression in brain tumors, and its functional significance for patient survival. Querying multiple gene expression databases, we show that IL13Rα2 expression increases with glioma malignancy grade, and expression for high-grade tumors is bimodal, with approximately 58% of WHO grade IV gliomas over-expressing this receptor. By several measures, IL13Rα2 expression in patient samples and low-passage primary glioma lines most consistently correlates with the expression of signature genes defining mesenchymal subclass tumors and negatively correlates with proneural signature genes as defined by two studies. Positive associations were also noted with proliferative signature genes, whereas no consistent associations were found with either classical or neural signature genes. Probing the potential functional consequences of this mesenchymal association through IPA analysis suggests that IL13Rα2 expression is associated with activation of proinflammatory and immune pathways characteristic of mesenchymal subclass tumors. In addition, survival analyses indicate that IL13Rα2 over-expression is associated with poor patient prognosis, a single gene correlation ranking IL13Rα2 in the top ~1% of total gene expression probes with regard to survival association with WHO IV gliomas. This study better defines the functional consequences of IL13Rα2 expression by demonstrating association with mesenchymal signature gene expression and poor patient prognosis. It thus highlights the utility of IL13Rα2 as a therapeutic target, and helps define patient populations most likely to respond to immunotherapy in present and future clinical trials.
Serum microRNAs as biomarkers for recurrence in melanoma
2012-01-01
Background Identification of melanoma patients at high risk for recurrence and monitoring for recurrence are critical for informed management decisions. We hypothesized that serum microRNAs (miRNAs) could provide prognostic information at the time of diagnosis unaccounted for by the current staging system and could be useful in detecting recurrence after resection. Methods We screened 355 miRNAs in sera from 80 melanoma patients at primary diagnosis (discovery cohort) using a unique quantitative reverse transcription-PCR (qRT-PCR) panel. Cox proportional hazard models and Kaplan-Meier recurrence-free survival (RFS) curves were used to identify a miRNA signature with prognostic potential adjusting for stage. We then tested the miRNA signature in an independent cohort of 50 primary melanoma patients (validation cohort). Logistic regression analysis was performed to determine if the miRNA signature can determine risk of recurrence in both cohorts. Selected miRNAs were measured longitudinally in subsets of patients pre-/post-operatively and pre-/post-recurrence. Results A signature of 5 miRNAs successfully classified melanoma patients into high and low recurrence risk groups with significant separation of RFS in both discovery and validation cohorts (p = 0.0036, p = 0.0093, respectively). Significant separation of RFS was maintained when a logistic model containing the same signature set was used to predict recurrence risk in both discovery and validation cohorts (p < 0.0001, p = 0.033, respectively). Longitudinal expression of 4 miRNAs in a subset of patients was dynamic, suggesting miRNAs can be associated with tumor burden. Conclusion Our data demonstrate that serum miRNAs can improve accuracy in identifying primary melanoma patients with high recurrence risk and in monitoring melanoma tumor burden over time. PMID:22857597
Cochain, Clément; Vafadarnejad, Ehsan; Arampatzi, Panagiota; Jaroslav, Pelisek; Winkels, Holger; Ley, Klaus; Wolf, Dennis; Saliba, Antoine-Emmanuel; Zernecke, Alma
2018-03-15
Rationale: It is assumed that atherosclerotic arteries contain several macrophage subsets endowed with specific functions. The precise identity of these subsets is poorly characterized as they ha ve been defined by the expression of a restricted number of markers. Objective: We have applied single-cell RNA-seq as an unbiased profiling strategy to interrogate and classify aortic macrophage heterogeneity at the single-cell level in atherosclerosis. Methods and Results: We performed single-cell RNA sequencing of total aortic CD45 + cells extracted from the non-diseased (chow fed) and atherosclerotic (11 weeks of high fat diet) aorta of Ldlr -/- mice. Unsupervised clustering singled out 13 distinct aortic cell clusters. Among the myeloid cell populations, Resident-like macrophages with a gene expression profile similar to aortic resident macrophages were found in healthy and diseased aortae, whereas monocytes, monocyte-derived dendritic cells (MoDC), and two populations of macrophages were almost exclusively detectable in atherosclerotic aortae, comprising Inflammatory macrophages showing enrichment in I l1b , and previously undescribed TREM2 hi macrophages. Differential gene expression and gene ontology enrichment analyses revealed specific gene expression patterns distinguishing these three macrophage subsets and MoDC, and uncovered putative functions of each cell type. Notably, TREM2 hi macrophages appeared to be endowed with specialized functions in lipid metabolism and catabolism, and presented a gene expression signature reminiscent of osteoclasts, suggesting a role in lesion calcification. TREM2 expression was moreover detected in human lesional macrophages. Importantly, these macrophage populations were present also in advanced atherosclerosis and in Apoe -/- aortae, indicating relevance of our findings in different stages of atherosclerosis and mouse models. Conclusions: These data unprecedentedly uncovered the transcriptional landscape and phenotypic heterogeneity of aortic macrophages and MoDCs in atherosclerotic and identified previously unrecognized macrophage populations and their gene expression signature, suggesting specialized functions. Our findings will open up novel opportunities to explore distinct myeloid cell populations and their functions in atherosclerosis.
L1000CDS2: LINCS L1000 characteristic direction signatures search engine
Duan, Qiaonan; Reid, St Patrick; Clark, Neil R; Wang, Zichen; Fernandez, Nicolas F; Rouillard, Andrew D; Readhead, Ben; Tritsch, Sarah R; Hodos, Rachel; Hafner, Marc; Niepel, Mario; Sorger, Peter K; Dudley, Joel T; Bavari, Sina; Panchal, Rekha G; Ma’ayan, Avi
2016-01-01
The library of integrated network-based cellular signatures (LINCS) L1000 data set currently comprises of over a million gene expression profiles of chemically perturbed human cell lines. Through unique several intrinsic and extrinsic benchmarking schemes, we demonstrate that processing the L1000 data with the characteristic direction (CD) method significantly improves signal to noise compared with the MODZ method currently used to compute L1000 signatures. The CD processed L1000 signatures are served through a state-of-the-art web-based search engine application called L1000CDS2. The L1000CDS2 search engine provides prioritization of thousands of small-molecule signatures, and their pairwise combinations, predicted to either mimic or reverse an input gene expression signature using two methods. The L1000CDS2 search engine also predicts drug targets for all the small molecules profiled by the L1000 assay that we processed. Targets are predicted by computing the cosine similarity between the L1000 small-molecule signatures and a large collection of signatures extracted from the gene expression omnibus (GEO) for single-gene perturbations in mammalian cells. We applied L1000CDS2 to prioritize small molecules that are predicted to reverse expression in 670 disease signatures also extracted from GEO, and prioritized small molecules that can mimic expression of 22 endogenous ligand signatures profiled by the L1000 assay. As a case study, to further demonstrate the utility of L1000CDS2, we collected expression signatures from human cells infected with Ebola virus at 30, 60 and 120 min. Querying these signatures with L1000CDS2 we identified kenpaullone, a GSK3B/CDK2 inhibitor that we show, in subsequent experiments, has a dose-dependent efficacy in inhibiting Ebola infection in vitro without causing cellular toxicity in human cell lines. In summary, the L1000CDS2 tool can be applied in many biological and biomedical settings, while improving the extraction of knowledge from the LINCS L1000 resource. PMID:28413689
Yamamoto, Cindy M; Murakami, Taku; Oakes, Melanie L; Mitsuhashi, Masato; Kelly, Colleen; Henry, Robert R; Sharma, Kumar
2018-05-18
Extracellular vesicles (EVs) enclose mRNA derived from their cell of origin and are considered a source of potential biomarkers. We examined urinary EV mRNA from individuals with diabetic kidney disease (DKD), chronic kidney disease, type 2 diabetes (T2DM), and obese and healthy controls to determine if such biomarkers had the potential to classify kidney disease and predict patients at higher risk of renal function decline. A total of 242 participants enrolled in this study. Urinary EV mRNA from all subjects were isolated by a filter-based platform, and the expression of 8 target genes were determined by quantitative polymerase chain reaction (qPCR). Changes in estimated glomerular filtration rate (eGFR) in 161 T2DM patients were evaluated for 2 consecutive years and compared with EV RNA profiles at baseline. We observe that mild and severe DKD groups show a significant 3.2- and -4.4-fold increase in UMOD compared to healthy controls and expression increases linearly from healthy, diabetic, and DKD subjects. UMOD expression is significantly correlated to albumin creatinine ratio (ACR), eGFR, and HbA1c. Using linear discriminant analyses with mRNA from severe DKD and T2DM as training data, a multi-gene signature classified DKD and -non-DKD with a sensitivity of 93% and specificity of 73% with area under the receiver operating characteristic (ROC) curve (AUC) = 0.90. Although 6% of T2DM were determined to have a > 80% posterior probability of developing DKD based on this mRNA profile, eGFR changes observed within the 2-year follow-up did not reveal a decline in kidney function. Urinary EV UMOD mRNA levels are progressively elevated from T2DM to DKD groups and correlate with widely used eGFR and ACR diagnostic criteria. An EV mRNA signature could identify DKD with greater than 90% sensitivity and 70% specificity. © 2018 S. Karger AG, Basel.
An Integrated Approach Identifies Mediators of Local Recurrence in Head and Neck Squamous Carcinoma.
Citron, Francesca; Armenia, Joshua; Franchin, Giovanni; Polesel, Jerry; Talamini, Renato; D'Andrea, Sara; Sulfaro, Sandro; Croce, Carlo M; Klement, William; Otasek, David; Pastrello, Chiara; Tokar, Tomas; Jurisica, Igor; French, Deborah; Bomben, Riccardo; Vaccher, Emanuela; Serraino, Diego; Belletti, Barbara; Vecchione, Andrea; Barzan, Luigi; Baldassarre, Gustavo
2017-07-15
Purpose: Head and neck squamous cell carcinomas (HNSCCs) cause more than 300,000 deaths worldwide each year. Locoregional and distant recurrences represent worse prognostic events and accepted surrogate markers of patients' overall survival. No valid biomarker and salvage therapy exist to identify and treat patients at high-risk of recurrence. We aimed to verify if selected miRNAs could be used as biomarkers of recurrence in HNSCC. Experimental Design: A NanoString array was used to identify miRNAs associated with locoregional recurrence in 44 patients with HNSCC. Bioinformatic approaches validated the signature and identified potential miRNA targets. Validation experiments were performed using an independent cohort of primary HNSCC samples and a panel of HNSCC cell lines. In vivo experiments validated the in vitro results. Results: Our data identified a four-miRNA signature that classified HNSCC patients at high- or low-risk of recurrence. These miRNAs collectively impinge on the epithelial-mesenchymal transition process. In silico and wet lab approaches showed that miR-9, expressed at high levels in recurrent HNSCC, targets SASH1 and KRT13, whereas miR-1, miR-133, and miR-150, expressed at low levels in recurrent HNSCC, collectively target SP1 and TGFβ pathways. A six-gene signature comprising these targets identified patients at high risk of recurrences, as well. Combined pharmacological inhibition of SP1 and TGFβ pathways induced HNSCC cell death and, when timely administered, prevented recurrence formation in a preclinical model of HNSCC recurrence. Conclusions: By integrating different experimental approaches and competences, we identified critical mediators of recurrence formation in HNSCC that may merit to be considered for future clinical development. Clin Cancer Res; 23(14); 3769-80. ©2017 AACR . ©2017 American Association for Cancer Research.
Use of mRNA expression signatures to discover small molecule inhibitors of skeletal muscle atrophy
Adams, Christopher M.; Ebert, Scott M.; Dyle, Michael C.
2017-01-01
Purpose of review Here, we discuss a recently developed experimental strategy for discovering small molecules with potential to prevent and treat skeletal muscle atrophy. Recent findings Muscle atrophy involves and requires widespread changes in skeletal muscle gene expression, which generate complex but measurable patterns of positive and negative changes in skeletal muscle mRNA levels (a.k.a. mRNA expression signatures of muscle atrophy). Many bioactive small molecules generate their own characteristic mRNA expression signatures, and by identifying small molecules whose signatures approximate mirror images of muscle atrophy signatures, one may identify small molecules with potential to prevent and/or reverse muscle atrophy. Unlike a conventional drug discovery approach, this strategy does not rely on a predefined molecular target but rather exploits the complexity of muscle atrophy to identify small molecules that counter the entire spectrum of pathological changes in atrophic muscle. We discuss how this strategy has been used to identify two natural compounds, ursolic acid and tomatidine, that reduce muscle atrophy and improve skeletal muscle function. Summary Discovery strategies based on mRNA expression signatures can elucidate new approaches for preserving and restoring muscle mass and function. PMID:25807353
Use of mRNA expression signatures to discover small molecule inhibitors of skeletal muscle atrophy.
Adams, Christopher M; Ebert, Scott M; Dyle, Michael C
2015-05-01
Here, we discuss a recently developed experimental strategy for discovering small molecules with potential to prevent and treat skeletal muscle atrophy. Muscle atrophy involves and requires widespread changes in skeletal muscle gene expression, which generate complex but measurable patterns of positive and negative changes in skeletal muscle mRNA levels (a.k.a. mRNA expression signatures of muscle atrophy). Many bioactive small molecules generate their own characteristic mRNA expression signatures, and by identifying small molecules whose signatures approximate mirror images of muscle atrophy signatures, one may identify small molecules with potential to prevent and/or reverse muscle atrophy. Unlike a conventional drug discovery approach, this strategy does not rely on a predefined molecular target but rather exploits the complexity of muscle atrophy to identify small molecules that counter the entire spectrum of pathological changes in atrophic muscle. We discuss how this strategy has been used to identify two natural compounds, ursolic acid and tomatidine, that reduce muscle atrophy and improve skeletal muscle function. Discovery strategies based on mRNA expression signatures can elucidate new approaches for preserving and restoring muscle mass and function.
Anderson, Suzanne T.; Bangani, Nonzwakazi; Banwell, Claire M.; Brent, Andrew J.; Crampin, Amelia C.; Dockrell, Hazel M.; Eley, Brian; Heyderman, Robert S.; Hibberd, Martin L.; Kern, Florian; Langford, Paul R.; Ling, Ling; Mendelson, Marc; Ottenhoff, Tom H.; Zgambo, Femia; Wilkinson, Robert J.; Coin, Lachlan J.; Levin, Michael
2013-01-01
Background A major impediment to tuberculosis control in Africa is the difficulty in diagnosing active tuberculosis (TB), particularly in the context of HIV infection. We hypothesized that a unique host blood RNA transcriptional signature would distinguish TB from other diseases (OD) in HIV-infected and -uninfected patients, and that this could be the basis of a simple diagnostic test. Methods and Findings Adult case-control cohorts were established in South Africa and Malawi of HIV-infected or -uninfected individuals consisting of 584 patients with either TB (confirmed by culture of Mycobacterium tuberculosis [M.TB] from sputum or tissue sample in a patient under investigation for TB), OD (i.e., TB was considered in the differential diagnosis but then excluded), or healthy individuals with latent TB infection (LTBI). Individuals were randomized into training (80%) and test (20%) cohorts. Blood transcriptional profiles were assessed and minimal sets of significantly differentially expressed transcripts distinguishing TB from LTBI and OD were identified in the training cohort. A 27 transcript signature distinguished TB from LTBI and a 44 transcript signature distinguished TB from OD. To evaluate our signatures, we used a novel computational method to calculate a disease risk score (DRS) for each patient. The classification based on this score was first evaluated in the test cohort, and then validated in an independent publically available dataset (GSE19491). In our test cohort, the DRS classified TB from LTBI (sensitivity 95%, 95% CI [87–100]; specificity 90%, 95% CI [80–97]) and TB from OD (sensitivity 93%, 95% CI [83–100]; specificity 88%, 95% CI [74–97]). In the independent validation cohort, TB patients were distinguished both from LTBI individuals (sensitivity 95%, 95% CI [85–100]; specificity 94%, 95% CI [84–100]) and OD patients (sensitivity 100%, 95% CI [100–100]; specificity 96%, 95% CI [93–100]). Limitations of our study include the use of only culture confirmed TB patients, and the potential that TB may have been misdiagnosed in a small proportion of OD patients despite the extensive clinical investigation used to assign each patient to their diagnostic group. Conclusions In our study, blood transcriptional signatures distinguished TB from other conditions prevalent in HIV-infected and -uninfected African adults. Our DRS, based on these signatures, could be developed as a test for TB suitable for use in HIV endemic countries. Further evaluation of the performance of the signatures and DRS in prospective populations of patients with symptoms consistent with TB will be needed to define their clinical value under operational conditions. Please see later in the article for the Editors' Summary PMID:24167453
Aneurysm miRNA Signature Differs, Depending on Disease Localization and Morphology
Busch, Albert; Busch, Martin; Scholz, Claus-Jürgen; Kellersmann, Richard; Otto, Christoph; Chernogubova, Ekaterina; Maegdefessel, Lars; Zernecke, Alma; Lorenz, Udo
2016-01-01
Limited comprehension of aneurysm pathology has led to inconclusive results from clinical trials. miRNAs are key regulators of post-translational gene modification and are useful tools in elucidating key features of aneurysm pathogenesis in distinct entities of abdominal and popliteal aneurysms. Here, surgically harvested specimens from 19 abdominal aortic aneurysm (AAA) and 8 popliteal artery aneurysm (PAA) patients were analyzed for miRNA expression and histologically classified regarding extracellular matrix (ECM) remodeling and inflammation. DIANA-based computational target prediction and pathway enrichment analysis verified our results, as well as previous ones. miRNA-362, -19b-1, -194, -769, -21 and -550 were significantly down-regulated in AAA samples depending on degree of inflammation. Similar or inverse regulation was found for miR-769, 19b-1 and miR-550, -21, whereas miR-194 and -362 were unaltered in PAA. In situ hybridization verified higher expression of miR-550 and -21 in PAA compared to AAA and computational analysis for target genes and pathway enrichment affirmed signal transduction, cell-cell-interaction and cell degradation pathways, in line with previous results. Despite the vague role of miRNAs for potential diagnostic and treatment purposes, the number of candidates from tissue signature studies is increasing. Tissue morphology influences subsequent research, yet comparison of distinct entities of aneurysm disease can unravel core pathways. PMID:26771601
Weidhaas, Joanne B.; Li, Shu-Xia; Winter, Kathryn; Ryu, Janice; Jhingran, Anuja; Miller, Bridgette; Dicker, Adam P.; Gaffney, David
2009-01-01
Purpose To evaluate the potential of gene expression signatures to predict response to treatment in locally advanced cervical cancer treated with definitive chemotherapy and radiation. Experimental Design Tissue biopsies were collected from patients participating in Radiation Therapy Oncology Group (RTOG) 0128, a phase II trial evaluating the benefit of celecoxib in addition to cisplatin chemotherapy and radiation for locally advanced cervical cancer. Gene expression profiling was done and signatures of pretreatment, mid-treatment (before the first implant), and “changed” gene expression patterns between pre- and mid-treatment samples were determined. The ability of the gene signatures to predict local control versus local failure was evaluated. Two-group t test was done to identify the initial gene set separating these end points. Supervised classification methods were used to enrich the gene sets. The results were further validated by leave-one-out and 2-fold cross-validation. Results Twenty-two patients had suitable material from pretreatment samples for analysis, and 13 paired pre- and mid-treatment samples were obtained. The changed gene expression signatures between the pre- and mid-treatment biopsies predicted response to treatment, separating patients with local failures from those who achieved local control with a seven-gene signature. The in-sample prediction rate, leave-one-out prediction rate, and 2-fold prediction rate are 100% for this seven-gene signature. This signature was enriched for cell cycle genes. Conclusions Changed gene expression signatures during therapy in cervical cancer can predict outcome as measured by local control. After further validation, such findings could be applied to direct additional therapy for cervical cancer patients treated with chemotherapy and radiation. PMID:19509178
Nirmala, Nanguneri; Grom, Alexei; Gram, Hermann
2014-09-01
This review summarizes biomarkers in systemic juvenile idiopathic arthritis (sJIA). Broadly, the markers are classified under protein, cellular, gene expression and genetic markers. We also compare the biomarkers in sJIA to biomarkers in cryopyrin-associated periodic syndrome (CAPS). Recent publications showing the similarity of clinical response of sJIA and CAPS to anti-interleukin 1 therapies prompted a comparison at the biomarker level. sJIA traditionally is classified under the umbrella of juvenile idiopathic arthritis. At the clinical phenotypic level, sJIA has several features that are more similar to those seen in CAPS. In this review, we summarize biomarkers in sJIA and CAPS and draw upon the various similarities and differences between the two families of diseases. The main differences between sJIA and CAPS biomarkers are genetic markers, with CAPS being a family of monogenic diseases with mutations in NLRP3. There have been a small number of publications describing cellular biomarkers in sJIA with no such studies described for CAPS. Many of the protein marker's characteristics of sJIA are also seen to characterize CAPS. The gene expression data in both sJIA and CAPS show a strong upregulation of innate immunity pathways. In addition, we describe a strong similarity between sJIA and CAPS at the gene expression level in which several genes that form a part of the erythropoiesis signature are upregulated in both sJIA and CAPS.
Nirmala, Nanguneri; Grom, Alexei; Gram, Hermann
2015-01-01
Purpose of review This review summarizes biomarkers in Systemic Juvenile Idiopathic Arthritis (sJIA). Broadly, the markers are classified under protein, cellular, gene expression and genetic markers. We also compare the biomarkers in sJIA to biomarkers in cryopyrin associated periodic syndromes (CAPS). Recent findings Recent publications showing the similarity of clinical response of sJIA and CAPS to anti IL1 therapies prompted a comparison at the biomarker level. Summary sJIA traditionally is classified under the umbrella of juvenile idiopathic arthritis. At the clinical phenotypic level, sJIA has several features that are more similar to those seen in Cryopyrin Associated Periodic Syndromes (CAPS). In this review, we summarize biomarkers in sJIA and CAPS and draw upon the various similarities and differences between the two families of diseases. The main difference between sJIA and CAPS biomarkers are genetic markers with CAPS being a family of monogenic diseases with mutations in NLRP3. There have been a small number of publications describing cellular biomarkers in sJIA with no such studies described for CAPS. Many of the protein markers characteristic of sJIA are also seen to characterize CAPS. The gene expression data in both sJIA and CAPS show a strong upregulation of innate immunity pathways. In addition, we describe a strong similarity between sJIA and CAPS at the gene expression level where several genes that form a part of the erythropoiesis signature are upregulated in both sJIA and CAPS. PMID:25050926
GeneSigDB—a curated database of gene expression signatures
Culhane, Aedín C.; Schwarzl, Thomas; Sultana, Razvan; Picard, Kermshlise C.; Picard, Shaita C.; Lu, Tim H.; Franklin, Katherine R.; French, Simon J.; Papenhausen, Gerald; Correll, Mick; Quackenbush, John
2010-01-01
The primary objective of most gene expression studies is the identification of one or more gene signatures; lists of genes whose transcriptional levels are uniquely associated with a specific biological phenotype. Whilst thousands of experimentally derived gene signatures are published, their potential value to the community is limited by their computational inaccessibility. Gene signatures are embedded in published article figures, tables or in supplementary materials, and are frequently presented using non-standard gene or probeset nomenclature. We present GeneSigDB (http://compbio.dfci.harvard.edu/genesigdb) a manually curated database of gene expression signatures. GeneSigDB release 1.0 focuses on cancer and stem cells gene signatures and was constructed from more than 850 publications from which we manually transcribed 575 gene signatures. Most gene signatures (n = 560) were successfully mapped to the genome to extract standardized lists of EnsEMBL gene identifiers. GeneSigDB provides the original gene signature, the standardized gene list and a fully traceable gene mapping history for each gene from the original transcribed data table through to the standardized list of genes. The GeneSigDB web portal is easy to search, allows users to compare their own gene list to those in the database, and download gene signatures in most common gene identifier formats. PMID:19934259
A sensitive NanoString-based assay to score STK11 (LKB1) pathway disruption in lung adenocarcinoma
Chen, Lu; Engel, Brienne E.; Welsh, Eric A.; Yoder, Sean J.; Brantley, Stephen G.; Chen, Dung-Tsa; Beg, Amer A.; Cao, Chunxia; Kaye, Frederic J.; Haura, Eric B.; Schabath, Matthew B.; Cress, W. Douglas
2016-01-01
Introduction Serine/threonine kinase 11 (STK11), better known as LKB1, is a tumor-suppressor commonly mutated in lung adenocarcinoma (LUAD). Previous work has shown that mutational inactivation of the STK11 pathway may serve as a predictive biomarker for cancer treatments including phenformin and COX-2 inhibition. Although immunohistochemistry and diagnostic sequencing are employed to measure STK11 pathway disruption, there are serious limitations to these methods emphasizing the importance to validate a clinically useful assay. Methods An initial STK11 mutation mRNA signature was generated using cell line data and refined using three large, independent patient databases. The signature was validated as a classifier using The Cancer Genome Anatomy Project (TCGA) LUAD cohort as well as a 442-patient LUAD cohort developed at Moffitt. Finally, the signature was adapted into a NanoString -based format and validated using RNA samples isolated from FFPE tissue blocks corresponding to a cohort of 150 LUAD patients. For comparison, STK11 immunochemistry was also performed. Results The STK11 signature was found to correlate with null mutations identified by exon sequencing in multiple cohorts using both microarray and NanoString formats. While there was a statistically significant correlation between reduced STK11 protein expression by IHC and mutation status, the NanoString-based assay showed superior overall performance with a −0.1588 improvement in area under the curve in receiver-operator characteristic curve analysis (p<0.012). Conclusion The described NanoString-based STK11 assay is a sensitive biomarker to study emerging therapeutic modalities in clinical trials. PMID:26917230
Sorzano, Carlos O. S.; Pascual-Montano, Alberto; Carazo, Jose M.
2017-01-01
Benign neurofibromas, the main phenotypic manifestations of the rare neurological disorder neurofibromatosis type 1, degenerate to malignant tumors associated to poor prognosis in about 10% of patients. Despite efforts in the field of (epi)genomics, the lack of prognostic biomarkers with which to predict disease evolution frustrates the adoption of appropriate early therapeutic measures. To identify potential biomarkers of malignant neurofibroma transformation, we integrated four human experimental studies and one for mouse, using a gene score-based meta-analysis method, from which we obtained a score-ranked signature of 579 genes. Genes with the highest absolute scores were classified as promising disease biomarkers. By grouping genes with similar neurofibromatosis-related profiles, we derived panels of potential biomarkers. The addition of promoter methylation data to gene profiles indicated a panel of genes probably silenced by hypermethylation. To identify possible therapeutic treatments, we used the gene signature to query drug expression databases. Trichostatin A and other histone deacetylase inhibitors, as well as cantharidin and tamoxifen, were retrieved as putative therapeutic means to reverse the aberrant regulation that drives to malignant cell proliferation and metastasis. This in silico prediction corroborated reported experimental results that suggested the inclusion of these compounds in clinical trials. This experimental validation supported the suitability of the meta-analysis method used to integrate several sources of public genomic information, and the reliability of the gene signature associated to the malignant evolution of neurofibromas to generate working hypotheses for prognostic and drug-responsive biomarkers or therapeutic measures, thus showing the potential of this in silico approach for biomarker discovery. PMID:28542306
Reynier, Frédéric; Petit, Fabien; Paye, Malick; Turrel-Davin, Fanny; Imbert, Pierre-Emmanuel; Hot, Arnaud; Mougin, Bruno; Miossec, Pierre
2011-01-01
The analysis of gene expression data shows that many genes display similarity in their expression profiles suggesting some co-regulation. Here, we investigated the co-expression patterns in gene expression data and proposed a correlation-based research method to stratify individuals. Using blood from rheumatoid arthritis (RA) patients, we investigated the gene expression profiles from whole blood using Affymetrix microarray technology. Co-expressed genes were analyzed by a biclustering method, followed by gene ontology analysis of the relevant biclusters. Taking the type I interferon (IFN) pathway as an example, a classification algorithm was developed from the 102 RA patients and extended to 10 systemic lupus erythematosus (SLE) patients and 100 healthy volunteers to further characterize individuals. We developed a correlation-based algorithm referred to as Classification Algorithm Based on a Biological Signature (CABS), an alternative to other approaches focused specifically on the expression levels. This algorithm applied to the expression of 35 IFN-related genes showed that the IFN signature presented a heterogeneous expression between RA, SLE and healthy controls which could reflect the level of global IFN signature activation. Moreover, the monitoring of the IFN-related genes during the anti-TNF treatment identified changes in type I IFN gene activity induced in RA patients. In conclusion, we have proposed an original method to analyze genes sharing an expression pattern and a biological function showing that the activation levels of a biological signature could be characterized by its overall state of correlation.
Lee, Langho; Wang, Kai; Li, Gang; Xie, Zhi; Wang, Yuli; Xu, Jiangchun; Sun, Shaoxian; Pocalyko, David; Bhak, Jong; Kim, Chulhong; Lee, Kee-Ho; Jang, Ye Jin; Yeom, Young Il; Yoo, Hyang-Sook; Hwang, Seungwoo
2011-11-30
Hepatocellular carcinoma (HCC) is the fifth most common cancer worldwide. A number of molecular profiling studies have investigated the changes in gene and protein expression that are associated with various clinicopathological characteristics of HCC and generated a wealth of scattered information, usually in the form of gene signature tables. A database of the published HCC gene signatures would be useful to liver cancer researchers seeking to retrieve existing differential expression information on a candidate gene and to make comparisons between signatures for prioritization of common genes. A challenge in constructing such database is that a direct import of the signatures as appeared in articles would lead to a loss or ambiguity of their context information that is essential for a correct biological interpretation of a gene's expression change. This challenge arises because designation of compared sample groups is most often abbreviated, ad hoc, or even missing from published signature tables. Without manual curation, the context information becomes lost, leading to uninformative database contents. Although several databases of gene signatures are available, none of them contains informative form of signatures nor shows comprehensive coverage on liver cancer. Thus we constructed Liverome, a curated database of liver cancer-related gene signatures with self-contained context information. Liverome's data coverage is more than three times larger than any other signature database, consisting of 143 signatures taken from 98 HCC studies, mostly microarray and proteome, and involving 6,927 genes. The signatures were post-processed into an informative and uniform representation and annotated with an itemized summary so that all context information is unambiguously self-contained within the database. The signatures were further informatively named and meaningfully organized according to ten functional categories for guided browsing. Its web interface enables a straightforward retrieval of known differential expression information on a query gene and a comparison of signatures to prioritize common genes. The utility of Liverome-collected data is shown by case studies in which useful biological insights on HCC are produced. Liverome database provides a comprehensive collection of well-curated HCC gene signatures and straightforward interfaces for gene search and signature comparison as well. Liverome is available at http://liverome.kobic.re.kr.
Dennison, Jennifer B.; Shahmoradgoli, Maria; Liu, Wenbin; Ju, Zhenlin; Meric-Bernstam, Funda; Perou, Charles M.; Sahin, Aysegul A.; Welm, Alana; Oesterreich, Steffi; Sikora, Matthew J.; Brown, Robert E.; Mills, Gordon B.
2016-01-01
Purpose The current study evaluated associative effects of breast cancer cells with the tumor microenvironment and its influence on tumor behavior. Experimental design Formalin-fixed paraffin embedded tissue and matched protein lysates were evaluated from two independent breast cancer patient data sets (TCGA and MD Anderson). Reverse-phase protein arrays (RPPA) were utilized to create a proteomics signature to define breast tumor subtypes. Expression patterns of cell lines and normal breast tissues were utilized to determine markers that were differentially expressed in stroma and cancer cells. Protein localization and stromal contents were evaluated for matched cases by imaging. Results A subtype of breast cancers designated “Reactive,” previously identified by RPPA that was not predicted by mRNA profiling, was extensively characterized. These tumors were primarily estrogen receptor (ER)-positive/human epidermal growth factor receptor (HER)2-negative, low-risk cancers as determined by enrichment of low-grade nuclei, lobular or tubular histopathology, and the luminal A subtype by PAM50. Reactive breast cancers contained high numbers of stromal cells and the highest extracellular matrix content typically without infiltration of immune cells. For ER-positive/HER2-negative cancers, the Reactive classification predicted favorable clinical outcomes in the TCGA cohort (HR = 0.36, P < 0.05). Conclusions A protein stromal signature in breast cancers is associated with a highly differentiated phenotype. The stromal compartment content and proteins are an extended phenotype not predicted by mRNA expression that could be utilized to sub-classify ER-positive/HER2-negative breast cancers. PMID:27172895
2012-01-01
Background The use of growth-promoters in beef cattle, despite the EU ban, remains a frequent practice. The use of transcriptomic markers has already proposed to identify indirect evidence of anabolic hormone treatment. So far, such approach has been tested in experimentally treated animals. Here, for the first time commercial samples were analyzed. Results Quantitative determination of Dexamethasone (DEX) residues in the urine collected at the slaughterhouse was performed by Liquid Chromatography-Mass Spectrometry (LC-MS). DNA-microarray technology was used to obtain transcriptomic profiles of skeletal muscle in commercial samples and negative controls. LC-MS confirmed the presence of low level of DEX residues in the urine of the commercial samples suspect for histological classification. Principal Component Analysis (PCA) on microarray data identified two clusters of samples. One cluster included negative controls and a subset of commercial samples, while a second cluster included part of the specimens collected at the slaughterhouse together with positives for corticosteroid treatment based on thymus histology and LC-MS. Functional analysis of the differentially expressed genes (3961) between the two groups provided further evidence that animals clustering with positive samples might have been treated with corticosteroids. These suspect samples could be reliably classified with a specific classification tool (Prediction Analysis of Microarray) using just two genes. Conclusions Despite broad variation observed in gene expression profiles, the present study showed that DNA-microarrays can be used to find transcriptomic signatures of putative anabolic treatments and that gene expression markers could represent a useful screening tool. PMID:23110699
Molecular Structure-Based Large-Scale Prediction of Chemical-Induced Gene Expression Changes.
Liu, Ruifeng; AbdulHameed, Mohamed Diwan M; Wallqvist, Anders
2017-09-25
The quantitative structure-activity relationship (QSAR) approach has been used to model a wide range of chemical-induced biological responses. However, it had not been utilized to model chemical-induced genomewide gene expression changes until very recently, owing to the complexity of training and evaluating a very large number of models. To address this issue, we examined the performance of a variable nearest neighbor (v-NN) method that uses information on near neighbors conforming to the principle that similar structures have similar activities. Using a data set of gene expression signatures of 13 150 compounds derived from cell-based measurements in the NIH Library of Integrated Network-based Cellular Signatures program, we were able to make predictions for 62% of the compounds in a 10-fold cross validation test, with a correlation coefficient of 0.61 between the predicted and experimentally derived signatures-a reproducibility rivaling that of high-throughput gene expression measurements. To evaluate the utility of the predicted gene expression signatures, we compared the predicted and experimentally derived signatures in their ability to identify drugs known to cause specific liver, kidney, and heart injuries. Overall, the predicted and experimentally derived signatures had similar receiver operating characteristics, whose areas under the curve ranged from 0.71 to 0.77 and 0.70 to 0.73, respectively, across the three organ injury models. However, detailed analyses of enrichment curves indicate that signatures predicted from multiple near neighbors outperformed those derived from experiments, suggesting that averaging information from near neighbors may help improve the signal from gene expression measurements. Our results demonstrate that the v-NN method can serve as a practical approach for modeling large-scale, genomewide, chemical-induced, gene expression changes.
Kurscheid, Sebastian; Bady, Pierre; Sciuscio, Davide; Samarzija, Ivana; Shay, Tal; Vassallo, Irene; Van Criekinge, Wim; Domany, Eytan; Stupp, Roger; Delorenzi, Mauro; Hegi, Monika
2014-01-01
We previously reported a stem cell related HOX gene signature associated with resistance to chemo-radiotherapy (TMZ/RT- > TMZ) in glioblastoma. However, underlying mechanisms triggering overexpression remain mostly elusive. Interestingly, HOX genes are neither involved in the developing brain, nor expressed in normal brain, suggestive of an acquired gene expression signature during gliomagenesis. HOXA genes are located on CHR 7 that displays trisomy in most glioblastoma which strongly impacts gene expression on this chromosome, modulated by local regulatory elements. Furthermore we observed more pronounced DNA methylation across the HOXA locus as compared to non-tumoral brain (Human methylation 450K BeadChip Illumina; 59 glioblastoma, 5 non-tumoral brain sampes). CpG probes annotated for HOX-signature genes, contributing most to the variability, served as input into the analysis of DNA methylation and expression to identify key regulatory regions. The structural similarity of the observed correlation matrices between DNA methylation and gene expression in our cohort and an independent data-set from TCGA (106 glioblastoma) was remarkable (RV-coefficient, 0.84; p-value < 0.0001). We identified a CpG located in the promoter region of the HOXA10 locus exerting the strongest mean negative correlation between methylation and expression of the whole HOX-signature. Applying this analysis the same CpG emerged in the external set. We then determined the contribution of both, gene copy aberration (CNA) and methylation at the selected probe to explain expression of the HOX-signature using a linear model. Statistically significant results suggested an additive effect between gene dosage and methylation at the key CpG identified. Similarly, such an additive effect was also observed in the external data-set. Taken together, we hypothesize that overexpression of the stem-cell related HOX signature is triggered by gain of trisomy 7 and escape from compensatory DNA methylation at positions controlling the effect of enhanced gene dose on expression.
Brkic, Zana; Maria, Naomi I; van Helden-Meeuwsen, Cornelia G; van de Merwe, Joop P; van Daele, Paul L; Dalm, Virgil A; Wildenberg, Manon E; Beumer, Wouter; Drexhage, Hemmo A; Versnel, Marjan A
2013-01-01
Objective To determine the prevalence of upregulation of interferon (IFN) type I inducible genes, the so called ‘IFN type I signature’, in CD14 monocytes in 69 patients with primary Sjögren's syndrome (pSS) and 44 healthy controls (HC) and correlate it with disease manifestations and expression of B cell activating factor (BAFF). Methods Expression of IFI44L, IFI44, IFIT3, LY6E and MX1 was measured using real time quantitative PCR in monocytes. Expression values were used to calculate IFN type I scores for each subject. pSS patients positive for the IFN type I signature (IFN score≥10) and patients negative for the signature (IFN score<10) were then compared for clinical disease manifestations and BAFF expression. A bioassay using a monocytic cell line was performed to study whether BAFF mRNA expression was inducible by IFN type I activity in serum of patients with pSS. Results An IFN type I signature was present in 55% of patients with pSS compared with 4.5% of HC. Patients with the IFN type I signature showed: (a) higher EULAR Sjögren's Syndrome Disease Activity Index scores; higher anti-Ro52, anti-Ro60 and anti-La autoantibodies; higher rheumatoid factor; higher serum IgG; lower C3, lower absolute lymphocyte and neutrophil counts; (b)higher BAFF gene expression in monocytes. In addition, serum of signature-positive patients induced BAFF gene expression in monocytes. Conclusions The monocyte IFN type I signature identifies a subgroup of patients with pSS with a higher clinical disease activity together with higher BAFF mRNA expression. Such patients might benefit from treatment blocking IFN type I production or activity. PMID:22736090
Circulating neutrophil transcriptome may reveal intracranial aneurysm signature
Tutino, Vincent M.; Poppenberg, Kerry E.; Jiang, Kaiyu; Jarvis, James N.; Sun, Yijun; Sonig, Ashish; Siddiqui, Adnan H.; Snyder, Kenneth V.; Levy, Elad I.; Kolega, John
2018-01-01
Background Unruptured intracranial aneurysms (IAs) are typically asymptomatic and undetected except for incidental discovery on imaging. Blood-based diagnostic biomarkers could lead to improvements in IA management. This exploratory study examined circulating neutrophils to determine whether they carry RNA expression signatures of IAs. Methods Blood samples were collected from patients receiving cerebral angiography. Eleven samples were collected from patients with IAs and 11 from patients without IAs as controls. Samples from the two groups were paired based on demographics and comorbidities. RNA was extracted from isolated neutrophils and subjected to next-generation RNA sequencing to obtain differential expressions for identification of an IA-associated signature. Bioinformatics analyses, including gene set enrichment analysis and Ingenuity Pathway Analysis, were used to investigate the biological function of all differentially expressed transcripts. Results Transcriptome profiling identified 258 differentially expressed transcripts in patients with and without IAs. Expression differences were consistent with peripheral neutrophil activation. An IA-associated RNA expression signature was identified in 82 transcripts (p<0.05, fold-change ≥2). This signature was able to separate patients with and without IAs on hierarchical clustering. Furthermore, in an independent, unpaired, replication cohort of patients with IAs (n = 5) and controls (n = 5), the 82 transcripts separated 9 of 10 patients into their respective groups. Conclusion Preliminary findings show that RNA expression from circulating neutrophils carries an IA-associated signature. These findings highlight a potential to use predictive biomarkers from peripheral blood samples to identify patients with IAs. PMID:29342213
Cruz-Rodriguez, Nataly; Combita, Alba L; Enciso, Leonardo J; Raney, Lauren F; Pinzon, Paula L; Lozano, Olga C; Campos, Alba M; Peñaloza, Niyireth; Solano, Julio; Herrera, Maria V; Zabaleta, Jovanny; Quijano, Sandra
2017-02-28
Survival of adults with B-Acute Lymphoblastic Leukemia requires accurate risk stratification of patients in order to provide the appropriate therapy. Contemporary techniques, using clinical and cytogenetic variables are incomplete for prognosis prediction. To improve the classification of adult patients diagnosed with B-ALL into prognosis groups, two strategies were examined and combined: the expression of the ID1/ID3/IGJ gene signature by RT-PCR and the immunophenotypic profile of 19 markers proposed in the EuroFlow protocol by Flow Cytometry in bone marrow samples. Both techniques were correlated to stratify patients into prognostic groups. An inverse relationship between survival and expression of the three-genes signature was observed and an immunophenotypic profile associated with clinical outcome was identified. Markers CD10 and CD20 were correlated with simultaneous overexpression of ID1, ID3 and IGJ. Patients with simultaneous expression of the poor prognosis gene signature and overexpression of CD10 or CD20, had worse Event Free Survival and Overall Survival than patients who had either the poor prognosis gene expression signature or only CD20 or CD10 overexpressed. By utilizing the combined evaluation of these two immunophenotypic markers along with the poor prognosis gene expression signature, the risk stratification can be significantly strengthened. Further studies including a large number of patients are needed to confirm these findings.
Integrated MicroRNA and mRNA Signatures Associated with Survival in Triple Negative Breast Cancer
Lovat, Francesca; Carasi, Stefania; Pulvirenti, Alfredo; Ferro, Alfredo; Alder, Hansjuerg; He, Gang; Vecchione, Andrea; Croce, Carlo M.; Shapiro, Charles L.; Huebner, Kay
2013-01-01
Triple negative breast cancer (TNBC) is a heterogeneous disease at the molecular, pathologic and clinical levels. To stratify TNBCs, we determined microRNA (miRNA) expression profiles, as well as expression profiles of a cancer-focused mRNA panel, in tumor, adjacent non-tumor (normal) and lymph node metastatic lesion (mets) tissues, from 173 women with TNBCs; we linked specific miRNA signatures to patient survival and used miRNA/mRNA anti-correlations to identify clinically and genetically different TNBC subclasses. We also assessed miRNA signatures as potential regulators of TNBC subclass-specific gene expression networks defined by expression of canonical signal pathways. Tissue specific miRNAs and mRNAs were identified for normal vs tumor vs mets comparisons. miRNA signatures correlated with prognosis were identified and predicted anti-correlated targets within the mRNA profile were defined. Two miRNA signatures (miR-16, 155, 125b, 374a and miR-16, 125b, 374a, 374b, 421, 655, 497) predictive of overall survival (P = 0.05) and distant-disease free survival (P = 0.009), respectively, were identified for patients 50 yrs of age or younger. By multivariate analysis the risk signatures were independent predictors for overall survival and distant-disease free survival. mRNA expression profiling, using the cancer-focused mRNA panel, resulted in clustering of TNBCs into 4 molecular subclasses with different expression signatures anti-correlated with the prognostic miRNAs. Our findings suggest that miRNAs play a key role in triple negative breast cancer through their ability to regulate fundamental pathways such as: cellular growth and proliferation, cellular movement and migration, Extra Cellular Matrix degradation. The results define miRNA expression signatures that characterize and contribute to the phenotypic diversity of TNBC and its metastasis. PMID:23405235
Integrated microRNA and mRNA signatures associated with survival in triple negative breast cancer.
Cascione, Luciano; Gasparini, Pierluigi; Lovat, Francesca; Carasi, Stefania; Pulvirenti, Alfredo; Ferro, Alfredo; Alder, Hansjuerg; He, Gang; Vecchione, Andrea; Croce, Carlo M; Shapiro, Charles L; Huebner, Kay
2013-01-01
Triple negative breast cancer (TNBC) is a heterogeneous disease at the molecular, pathologic and clinical levels. To stratify TNBCs, we determined microRNA (miRNA) expression profiles, as well as expression profiles of a cancer-focused mRNA panel, in tumor, adjacent non-tumor (normal) and lymph node metastatic lesion (mets) tissues, from 173 women with TNBCs; we linked specific miRNA signatures to patient survival and used miRNA/mRNA anti-correlations to identify clinically and genetically different TNBC subclasses. We also assessed miRNA signatures as potential regulators of TNBC subclass-specific gene expression networks defined by expression of canonical signal pathways.Tissue specific miRNAs and mRNAs were identified for normal vs tumor vs mets comparisons. miRNA signatures correlated with prognosis were identified and predicted anti-correlated targets within the mRNA profile were defined. Two miRNA signatures (miR-16, 155, 125b, 374a and miR-16, 125b, 374a, 374b, 421, 655, 497) predictive of overall survival (P = 0.05) and distant-disease free survival (P = 0.009), respectively, were identified for patients 50 yrs of age or younger. By multivariate analysis the risk signatures were independent predictors for overall survival and distant-disease free survival. mRNA expression profiling, using the cancer-focused mRNA panel, resulted in clustering of TNBCs into 4 molecular subclasses with different expression signatures anti-correlated with the prognostic miRNAs. Our findings suggest that miRNAs play a key role in triple negative breast cancer through their ability to regulate fundamental pathways such as: cellular growth and proliferation, cellular movement and migration, Extra Cellular Matrix degradation. The results define miRNA expression signatures that characterize and contribute to the phenotypic diversity of TNBC and its metastasis.
Thakar, Manjusha; Howard, Jason D.; Kagohara, Luciane T.; Krigsfeld, Gabriel; Ranaweera, Ruchira S.; Hughes, Robert M.; Perez, Jimena; Jones, Siân; Favorov, Alexander V.; Carey, Jacob; Stein-O'Brien, Genevieve; Gaykalova, Daria A.; Ochs, Michael F.; Chung, Christine H.
2016-01-01
Patients with oncogene driven tumors are treated with targeted therapeutics including EGFR inhibitors. Genomic data from The Cancer Genome Atlas (TCGA) demonstrates molecular alterations to EGFR, MAPK, and PI3K pathways in previously untreated tumors. Therefore, this study uses bioinformatics algorithms to delineate interactions resulting from EGFR inhibitor use in cancer cells with these genetic alterations. We modify the HaCaT keratinocyte cell line model to simulate cancer cells with constitutive activation of EGFR, HRAS, and PI3K in a controlled genetic background. We then measure gene expression after treating modified HaCaT cells with gefitinib, afatinib, and cetuximab. The CoGAPS algorithm distinguishes a gene expression signature associated with the anticipated silencing of the EGFR network. It also infers a feedback signature with EGFR gene expression itself increasing in cells that are responsive to EGFR inhibitors. This feedback signature has increased expression of several growth factor receptors regulated by the AP-2 family of transcription factors. The gene expression signatures for AP-2alpha are further correlated with sensitivity to cetuximab treatment in HNSCC cell lines and changes in EGFR expression in HNSCC tumors with low CDKN2A gene expression. In addition, the AP-2alpha gene expression signatures are also associated with inhibition of MEK, PI3K, and mTOR pathways in the Library of Integrated Network-Based Cellular Signatures (LINCS) data. These results suggest that AP-2 transcription factors are activated as feedback from EGFR network inhibition and may mediate EGFR inhibitor resistance. PMID:27650546
Polak, Paz; Kim, Jaegil; Braunstein, Lior Z; Karlic, Rosa; Haradhavala, Nicholas J; Tiao, Grace; Rosebrock, Daniel; Livitz, Dimitri; Kübler, Kirsten; Mouw, Kent W; Kamburov, Atanas; Maruvka, Yosef E; Leshchiner, Ignaty; Lander, Eric S; Golub, Todd R; Zick, Aviad; Orthwein, Alexandre; Lawrence, Michael S; Batra, Rajbir N; Caldas, Carlos; Haber, Daniel A; Laird, Peter W; Shen, Hui; Ellisen, Leif W; D'Andrea, Alan D; Chanock, Stephen J; Foulkes, William D; Getz, Gad
2017-10-01
Biallelic inactivation of BRCA1 or BRCA2 is associated with a pattern of genome-wide mutations known as signature 3. By analyzing ∼1,000 breast cancer samples, we confirmed this association and established that germline nonsense and frameshift variants in PALB2, but not in ATM or CHEK2, can also give rise to the same signature. We were able to accurately classify missense BRCA1 or BRCA2 variants known to impair homologous recombination (HR) on the basis of this signature. Finally, we show that epigenetic silencing of RAD51C and BRCA1 by promoter methylation is strongly associated with signature 3 and, in our data set, was highly enriched in basal-like breast cancers in young individuals of African descent.
Identification of an Efficient Gene Expression Panel for Glioblastoma Classification
Zelaya, Ivette; Laks, Dan R.; Zhao, Yining; Kawaguchi, Riki; Gao, Fuying; Kornblum, Harley I.; Coppola, Giovanni
2016-01-01
We present here a novel genetic algorithm-based random forest (GARF) modeling technique that enables a reduction in the complexity of large gene disease signatures to highly accurate, greatly simplified gene panels. When applied to 803 glioblastoma multiforme samples, this method allowed the 840-gene Verhaak et al. gene panel (the standard in the field) to be reduced to a 48-gene classifier, while retaining 90.91% classification accuracy, and outperforming the best available alternative methods. Additionally, using this approach we produced a 32-gene panel which allows for better consistency between RNA-seq and microarray-based classifications, improving cross-platform classification retention from 69.67% to 86.07%. A webpage producing these classifications is available at http://simplegbm.semel.ucla.edu. PMID:27855170
ADAGE signature analysis: differential expression analysis with data-defined gene sets.
Tan, Jie; Huyck, Matthew; Hu, Dongbo; Zelaya, René A; Hogan, Deborah A; Greene, Casey S
2017-11-22
Gene set enrichment analysis and overrepresentation analyses are commonly used methods to determine the biological processes affected by a differential expression experiment. This approach requires biologically relevant gene sets, which are currently curated manually, limiting their availability and accuracy in many organisms without extensively curated resources. New feature learning approaches can now be paired with existing data collections to directly extract functional gene sets from big data. Here we introduce a method to identify perturbed processes. In contrast with methods that use curated gene sets, this approach uses signatures extracted from public expression data. We first extract expression signatures from public data using ADAGE, a neural network-based feature extraction approach. We next identify signatures that are differentially active under a given treatment. Our results demonstrate that these signatures represent biological processes that are perturbed by the experiment. Because these signatures are directly learned from data without supervision, they can identify uncurated or novel biological processes. We implemented ADAGE signature analysis for the bacterial pathogen Pseudomonas aeruginosa. For the convenience of different user groups, we implemented both an R package (ADAGEpath) and a web server ( http://adage.greenelab.com ) to run these analyses. Both are open-source to allow easy expansion to other organisms or signature generation methods. We applied ADAGE signature analysis to an example dataset in which wild-type and ∆anr mutant cells were grown as biofilms on the Cystic Fibrosis genotype bronchial epithelial cells. We mapped active signatures in the dataset to KEGG pathways and compared with pathways identified using GSEA. The two approaches generally return consistent results; however, ADAGE signature analysis also identified a signature that revealed the molecularly supported link between the MexT regulon and Anr. We designed ADAGE signature analysis to perform gene set analysis using data-defined functional gene signatures. This approach addresses an important gap for biologists studying non-traditional model organisms and those without extensive curated resources available. We built both an R package and web server to provide ADAGE signature analysis to the community.
Loboda, Andrey; Nebozhyn, Michael; Klinghoffer, Rich; Frazier, Jason; Chastain, Michael; Arthur, William; Roberts, Brian; Zhang, Theresa; Chenard, Melissa; Haines, Brian; Andersen, Jannik; Nagashima, Kumiko; Paweletz, Cloud; Lynch, Bethany; Feldman, Igor; Dai, Hongyue; Huang, Pearl; Watters, James
2010-06-30
Hyperactivation of the Ras signaling pathway is a driver of many cancers, and RAS pathway activation can predict response to targeted therapies. Therefore, optimal methods for measuring Ras pathway activation are critical. The main focus of our work was to develop a gene expression signature that is predictive of RAS pathway dependence. We used the coherent expression of RAS pathway-related genes across multiple datasets to derive a RAS pathway gene expression signature and generate RAS pathway activation scores in pre-clinical cancer models and human tumors. We then related this signature to KRAS mutation status and drug response data in pre-clinical and clinical datasets. The RAS signature score is predictive of KRAS mutation status in lung tumors and cell lines with high (> 90%) sensitivity but relatively low (50%) specificity due to samples that have apparent RAS pathway activation in the absence of a KRAS mutation. In lung and breast cancer cell line panels, the RAS pathway signature score correlates with pMEK and pERK expression, and predicts resistance to AKT inhibition and sensitivity to MEK inhibition within both KRAS mutant and KRAS wild-type groups. The RAS pathway signature is upregulated in breast cancer cell lines that have acquired resistance to AKT inhibition, and is downregulated by inhibition of MEK. In lung cancer cell lines knockdown of KRAS using siRNA demonstrates that the RAS pathway signature is a better measure of dependence on RAS compared to KRAS mutation status. In human tumors, the RAS pathway signature is elevated in ER negative breast tumors and lung adenocarcinomas, and predicts resistance to cetuximab in metastatic colorectal cancer. These data demonstrate that the RAS pathway signature is superior to KRAS mutation status for the prediction of dependence on RAS signaling, can predict response to PI3K and RAS pathway inhibitors, and is likely to have the most clinical utility in lung and breast tumors.
Gene-expression signatures can distinguish gastric cancer grades and stages.
Cui, Juan; Li, Fan; Wang, Guoqing; Fang, Xuedong; Puett, J David; Xu, Ying
2011-03-18
Microarray gene-expression data of 54 paired gastric cancer and adjacent noncancerous gastric tissues were analyzed, with the aim to establish gene signatures for cancer grades (well-, moderately-, poorly- or un-differentiated) and stages (I, II, III and IV), which have been determined by pathologists. Our statistical analysis led to the identification of a number of gene combinations whose expression patterns serve well as signatures of different grades and different stages of gastric cancer. A 19-gene signature was found to have discerning power between high- and low-grade gastric cancers in general, with overall classification accuracy at 79.6%. An expanded 198-gene panel allows the stratification of cancers into four grades and control, giving rise to an overall classification agreement of 74.2% between each grade designated by the pathologists and our prediction. Two signatures for cancer staging, consisting of 10 genes and 9 genes, respectively, provide high classification accuracies at 90.0% and 84.0%, among early-, advanced-stage cancer and control. Functional and pathway analyses on these signature genes reveal the significant relevance of the derived signatures to cancer grades and progression. To the best of our knowledge, this represents the first study on identification of genes whose expression patterns can serve as markers for cancer grades and stages.
Phenotypic Robustness and the Assortativity Signature of Human Transcription Factor Networks
Pechenick, Dov A.; Payne, Joshua L.; Moore, Jason H.
2014-01-01
Many developmental, physiological, and behavioral processes depend on the precise expression of genes in space and time. Such spatiotemporal gene expression phenotypes arise from the binding of sequence-specific transcription factors (TFs) to DNA, and from the regulation of nearby genes that such binding causes. These nearby genes may themselves encode TFs, giving rise to a transcription factor network (TFN), wherein nodes represent TFs and directed edges denote regulatory interactions between TFs. Computational studies have linked several topological properties of TFNs — such as their degree distribution — with the robustness of a TFN's gene expression phenotype to genetic and environmental perturbation. Another important topological property is assortativity, which measures the tendency of nodes with similar numbers of edges to connect. In directed networks, assortativity comprises four distinct components that collectively form an assortativity signature. We know very little about how a TFN's assortativity signature affects the robustness of its gene expression phenotype to perturbation. While recent theoretical results suggest that increasing one specific component of a TFN's assortativity signature leads to increased phenotypic robustness, the biological context of this finding is currently limited because the assortativity signatures of real-world TFNs have not been characterized. It is therefore unclear whether these earlier theoretical findings are biologically relevant. Moreover, it is not known how the other three components of the assortativity signature contribute to the phenotypic robustness of TFNs. Here, we use publicly available DNaseI-seq data to measure the assortativity signatures of genome-wide TFNs in 41 distinct human cell and tissue types. We find that all TFNs share a common assortativity signature and that this signature confers phenotypic robustness to model TFNs. Lastly, we determine the extent to which each of the four components of the assortativity signature contributes to this robustness. PMID:25121490
Kim, Jihoon; Grillo, Janice M; Boxwala, Aziz A; Jiang, Xiaoqian; Mandelbaum, Rose B; Patel, Bhakti A; Mikels, Debra; Vinterbo, Staal A; Ohno-Machado, Lucila
2011-01-01
Our objective is to facilitate semi-automated detection of suspicious access to EHRs. Previously we have shown that a machine learning method can play a role in identifying potentially inappropriate access to EHRs. However, the problem of sampling informative instances to build a classifier still remained. We developed an integrated filtering method leveraging both anomaly detection based on symbolic clustering and signature detection, a rule-based technique. We applied the integrated filtering to 25.5 million access records in an intervention arm, and compared this with 8.6 million access records in a control arm where no filtering was applied. On the training set with cross-validation, the AUC was 0.960 in the control arm and 0.998 in the intervention arm. The difference in false negative rates on the independent test set was significant, P=1.6×10(-6). Our study suggests that utilization of integrated filtering strategies to facilitate the construction of classifiers can be helpful.
Kim, Jihoon; Grillo, Janice M; Boxwala, Aziz A; Jiang, Xiaoqian; Mandelbaum, Rose B; Patel, Bhakti A; Mikels, Debra; Vinterbo, Staal A; Ohno-Machado, Lucila
2011-01-01
Our objective is to facilitate semi-automated detection of suspicious access to EHRs. Previously we have shown that a machine learning method can play a role in identifying potentially inappropriate access to EHRs. However, the problem of sampling informative instances to build a classifier still remained. We developed an integrated filtering method leveraging both anomaly detection based on symbolic clustering and signature detection, a rule-based technique. We applied the integrated filtering to 25.5 million access records in an intervention arm, and compared this with 8.6 million access records in a control arm where no filtering was applied. On the training set with cross-validation, the AUC was 0.960 in the control arm and 0.998 in the intervention arm. The difference in false negative rates on the independent test set was significant, P=1.6×10−6. Our study suggests that utilization of integrated filtering strategies to facilitate the construction of classifiers can be helpful. PMID:22195129
Guijarro, María; Pajares, Gonzalo; Herrera, P. Javier
2009-01-01
The increasing technology of high-resolution image airborne sensors, including those on board Unmanned Aerial Vehicles, demands automatic solutions for processing, either on-line or off-line, the huge amountds of image data sensed during the flights. The classification of natural spectral signatures in images is one potential application. The actual tendency in classification is oriented towards the combination of simple classifiers. In this paper we propose a combined strategy based on the Deterministic Simulated Annealing (DSA) framework. The simple classifiers used are the well tested supervised parametric Bayesian estimator and the Fuzzy Clustering. The DSA is an optimization approach, which minimizes an energy function. The main contribution of DSA is its ability to avoid local minima during the optimization process thanks to the annealing scheme. It outperforms simple classifiers used for the combination and some combined strategies, including a scheme based on the fuzzy cognitive maps and an optimization approach based on the Hopfield neural network paradigm. PMID:22399989
Early signatures of regime shifts in gene expression dynamics
NASA Astrophysics Data System (ADS)
Pal, Mainak; Pal, Amit Kumar; Ghosh, Sayantari; Bose, Indrani
2013-06-01
Recently, a large number of studies have been carried out on the early signatures of sudden regime shifts in systems as diverse as ecosystems, financial markets, population biology and complex diseases. The signatures of regime shifts in gene expression dynamics are less systematically investigated. In this paper, we consider sudden regime shifts in the gene expression dynamics described by a fold-bifurcation model involving bistability and hysteresis. We consider two alternative models, models 1 and 2, of competence development in the bacterial population B. subtilis and determine some early signatures of the regime shifts between competence and noncompetence. We use both deterministic and stochastic formalisms for the purpose of our study. The early signatures studied include the critical slowing down as a transition point is approached, rising variance and the lag-1 autocorrelation function, skewness and a ratio of two mean first passage times. Some of the signatures could provide the experimental basis for distinguishing between bistability and excitability as the correct mechanism for the development of competence.
Buick, Julie K.; Moffat, Ivy; Williams, Andrew; Swartz, Carol D.; Recio, Leslie; Hyduke, Daniel R.; Li, Heng‐Hong; Fornace, Albert J.; Aubrecht, Jiri
2015-01-01
The use of integrated approaches in genetic toxicology, including the incorporation of gene expression data to determine the molecular pathways involved in the response, is becoming more common. In a companion article, a genomic biomarker was developed in human TK6 cells to classify chemicals as genotoxic or nongenotoxic. Because TK6 cells are not metabolically competent, we set out to broaden the utility of the biomarker for use with chemicals requiring metabolic activation. Specifically, chemical exposures were conducted in the presence of rat liver S9. The ability of the biomarker to classify genotoxic (benzo[a]pyrene, BaP; aflatoxin B1, AFB1) and nongenotoxic (dexamethasone, DEX; phenobarbital, PB) agents correctly was evaluated. Cells were exposed to increasing chemical concentrations for 4 hr and collected 0 hr, 4 hr, and 20 hr postexposure. Relative survival, apoptosis, and micronucleus frequency were measured at 24 hr. Transcriptome profiles were measured with Agilent microarrays. Statistical modeling and bioinformatics tools were applied to classify each chemical using the genomic biomarker. BaP and AFB1 were correctly classified as genotoxic at the mid‐ and high concentrations at all three time points, whereas DEX was correctly classified as nongenotoxic at all concentrations and time points. The high concentration of PB was misclassified at 24 hr, suggesting that cytotoxicity at later time points may cause misclassification. The data suggest that the use of S9 does not impair the ability of the biomarker to classify genotoxicity in TK6 cells. Finally, we demonstrate that the biomarker is also able to accurately classify genotoxicity using a publicly available dataset derived from human HepaRG cells. Environ. Mol. Mutagen. 56:520–534, 2015. © 2015 The Authors. Environmental and Molecular Mutagenesis Published by Wiley Periodicals, Inc. PMID:25733247
Building gene expression signatures indicative of transcription factor activation to predict AOP modulation Adverse outcome pathways (AOPs) are a framework for predicting quantitative relationships between molecular initiatin...
Tang, Xin-Ran; Li, Ying-Qin; Liang, Shao-Bo; Jiang, Wei; Liu, Fang; Ge, Wen-Xiu; Tang, Ling-Long; Mao, Yan-Ping; He, Qing-Mei; Yang, Xiao-Jing; Zhang, Yuan; Wen, Xin; Zhang, Jian; Wang, Ya-Qin; Zhang, Pan-Pan; Sun, Ying; Yun, Jing-Ping; Zeng, Jing; Li, Li; Liu, Li-Zhi; Liu, Na; Ma, Jun
2018-03-01
Gene expression patterns can be used as prognostic biomarkers in various types of cancers. We aimed to identify a gene expression pattern for individual distant metastatic risk assessment in patients with locoregionally advanced nasopharyngeal carcinoma. In this multicentre, retrospective, cohort analysis, we included 937 patients with locoregionally advanced nasopharyngeal carcinoma from three Chinese hospitals: the Sun Yat-sen University Cancer Center (Guangzhou, China), the Affiliated Hospital of Guilin Medical University (Guilin, China), and the First People's Hospital of Foshan (Foshan, China). Using microarray analysis, we profiled mRNA gene expression between 24 paired locoregionally advanced nasopharyngeal carcinoma tumours from patients at Sun Yat-sen University Cancer Center with or without distant metastasis after radical treatment. Differentially expressed genes were examined using digital expression profiling in a training cohort (Guangzhou training cohort; n=410) to build a gene classifier using a penalised regression model. We validated the prognostic accuracy of this gene classifier in an internal validation cohort (Guangzhou internal validation cohort, n=204) and two external independent cohorts (Guilin cohort, n=165; Foshan cohort, n=158). The primary endpoint was distant metastasis-free survival. Secondary endpoints were disease-free survival and overall survival. We identified 137 differentially expressed genes between metastatic and non-metastatic locoregionally advanced nasopharyngeal carcinoma tissues. A distant metastasis gene signature for locoregionally advanced nasopharyngeal carcinoma (DMGN) that consisted of 13 genes was generated to classify patients into high-risk and low-risk groups in the training cohort. Patients with high-risk scores in the training cohort had shorter distant metastasis-free survival (hazard ratio [HR] 4·93, 95% CI 2·99-8·16; p<0·0001), disease-free survival (HR 3·51, 2·43-5·07; p<0·0001), and overall survival (HR 3·22, 2·18-4·76; p<0·0001) than patients with low-risk scores. The prognostic accuracy of DMGN was validated in the internal and external cohorts. Furthermore, among patients with low-risk scores in the combined training and internal cohorts, concurrent chemotherapy improved distant metastasis-free survival compared with those patients who did not receive concurrent chemotherapy (HR 0·40, 95% CI 0·19-0·83; p=0·011), whereas patients with high-risk scores did not benefit from concurrent chemotherapy (HR 1·03, 0·71-1·50; p=0·876). This was also validated in the two external cohorts combined. We developed a nomogram based on the DMGN and other variables that predicted an individual's risk of distant metastasis, which was strengthened by adding Epstein-Barr virus DNA status. The DMGN is a reliable prognostic tool for distant metastasis in patients with locoregionally advanced nasopharyngeal carcinoma and might be able to predict which patients benefit from concurrent chemotherapy. It has the potential to guide treatment decisions for patients at different risk of distant metastasis. The National Natural Science Foundation of China, the National Science & Technology Pillar Program during the Twelfth Five-year Plan Period, the Natural Science Foundation of Guang Dong Province, the National Key Research and Development Program of China, the Innovation Team Development Plan of the Ministry of Education, the Health & Medical Collaborative Innovation Project of Guangzhou City, China, and the Program of Introducing Talents of Discipline to Universities. Copyright © 2018 Elsevier Ltd. All rights reserved.
Garcia-Closas, Montserrat; Davis, Sean; Meltzer, Paul; Lissowska, Jolanta; Horne, Hisani N.; Sherman, Mark E.; Lee, Maxwell
2015-01-01
Identification of prognostic gene expression signatures may enable improved decisions about management of breast cancer. To identify a prognostic signature for breast cancer, we performed DNA methylation profiling and identified methylation markers that were associated with expression of ER, PR, HER2, CK5/6 and EGFR proteins. Methylation markers that were correlated with corresponding mRNA expression levels were identified using 208 invasive tumors from a population-based case-control study conducted in Poland. Using this approach, we defined the Methylation Expression Index (MEI) signature that was based on a weighted sum of mRNA levels of 57 genes. Classification of cases as low or high MEI scores were related to survival using Cox regression models. In the Polish study, women with ER-positive low MEI cancers had reduced survival at a median of 5.20 years of follow-up, HR=2.85 95%CI=1.25-6.47. Low MEI was also related to decreased survival in four independent datasets totaling over 2500 ER-positive breast cancers. These results suggest that integrated analysis of tumor expression markers, DNA methylation, and mRNA data can be an important approach for identifying breast cancer prognostic signatures. Prospective assessment of MEI along with other prognostic signatures should be evaluated in future studies. PMID:25773928
Breast masses in mammography classification with local contour features.
Li, Haixia; Meng, Xianjing; Wang, Tingwen; Tang, Yuchun; Yin, Yilong
2017-04-14
Mammography is one of the most popular tools for early detection of breast cancer. Contour of breast mass in mammography is very important information to distinguish benign and malignant mass. Contour of benign mass is smooth and round or oval, while malignant mass has irregular shape and spiculated contour. Several studies have shown that 1D signature translated from 2D contour can describe the contour features well. In this paper, we propose a new method to translate 2D contour of breast mass in mammography into 1D signature. The method can describe not only the contour features but also the regularity of breast mass. Then we segment the whole 1D signature into different subsections. We extract four local features including a new contour descriptor from the subsections. The new contour descriptor is root mean square (RMS) slope. It can describe the roughness of the contour. KNN, SVM and ANN classifier are used to classify benign breast mass and malignant mass. The proposed method is tested on a set with 323 contours including 143 benign masses and 180 malignant ones from digital database of screening mammography (DDSM). The best accuracy of classification is 99.66% using the feature of root mean square slope with SVM classifier. The performance of the proposed method is better than traditional method. In addition, RMS slope is an effective feature comparable to most of the existing features.
Sadanandam, Anguraj; Wullschleger, Stephan; Lyssiotis, Costas A.; Grötzinger, Carsten; Barbi, Stefano; Bersani, Samantha; Körner, Jan; Wafy, Ismael; Mafficini, Andrea; Lawlor, Rita T.; Simbolo, Michele; Asara, John M.; Bläker, Hendrik; Cantley, Lewis C.; Wiedenmann, Bertram; Scarpa, Aldo; Hanahan, Douglas
2016-01-01
Seeking to assess the representative and instructive value of an engineered mouse model of pancreatic neuroendocrine tumors (PanNET) for its cognate human cancer, we profiled and compared mRNA and miRNA transcriptomes of tumors from both. Mouse PanNET tumors could be classified into two distinctive subtypes, well-differentiated islet/insulinoma tumors (IT) and poorly differentiated tumors associated with liver metastases, dubbed metastasis-like primary (MLP). Human PanNETs were independently classified into these same two subtypes, along with a third, specific gene mutation–enriched subtype. The MLP subtypes in human and mouse were similar to liver metastases in terms of miRNA and mRNA transcriptome profiles and signature genes. The human/mouse MLP subtypes also similarly expressed genes known to regulate early pancreas development, whereas the IT subtypes expressed genes characteristic of mature islet cells, suggesting different tumorigenesis pathways. In addition, these subtypes exhibit distinct metabolic profiles marked by differential pyruvate metabolism, substantiating the significance of their separate identities. SIGNIFICANCE This study involves a comprehensive cross-species integrated analysis of multi-omics profiles and histology to stratify PanNETs into subtypes with distinctive characteristics. We provide support for the RIP1-TAG2 mouse model as representative of its cognate human cancer with prospects to better understand PanNET heterogeneity and consider future applications of personalized cancer therapy. PMID:26446169
Sadanandam, Anguraj; Wullschleger, Stephan; Lyssiotis, Costas A; Grötzinger, Carsten; Barbi, Stefano; Bersani, Samantha; Körner, Jan; Wafy, Ismael; Mafficini, Andrea; Lawlor, Rita T; Simbolo, Michele; Asara, John M; Bläker, Hendrik; Cantley, Lewis C; Wiedenmann, Bertram; Scarpa, Aldo; Hanahan, Douglas
2015-12-01
Seeking to assess the representative and instructive value of an engineered mouse model of pancreatic neuroendocrine tumors (PanNET) for its cognate human cancer, we profiled and compared mRNA and miRNA transcriptomes of tumors from both. Mouse PanNET tumors could be classified into two distinctive subtypes, well-differentiated islet/insulinoma tumors (IT) and poorly differentiated tumors associated with liver metastases, dubbed metastasis-like primary (MLP). Human PanNETs were independently classified into these same two subtypes, along with a third, specific gene mutation-enriched subtype. The MLP subtypes in human and mouse were similar to liver metastases in terms of miRNA and mRNA transcriptome profiles and signature genes. The human/mouse MLP subtypes also similarly expressed genes known to regulate early pancreas development, whereas the IT subtypes expressed genes characteristic of mature islet cells, suggesting different tumorigenesis pathways. In addition, these subtypes exhibit distinct metabolic profiles marked by differential pyruvate metabolism, substantiating the significance of their separate identities. This study involves a comprehensive cross-species integrated analysis of multi-omics profiles and histology to stratify PanNETs into subtypes with distinctive characteristics. We provide support for the RIP1-TAG2 mouse model as representative of its cognate human cancer with prospects to better understand PanNET heterogeneity and consider future applications of personalized cancer therapy. ©2015 American Association for Cancer Research.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leone, Angelique; Nie, Alex; Brandon Parker, J.
Previously we reported a gene expression signature in rat liver for detecting a specific type of oxidative stress (OS) related to reactive metabolites (RM). High doses of the drugs disulfiram, ethinyl estradiol and nimesulide were used with another dozen paradigm OS/RM compounds, and three other drugs flutamide, phenacetin and sulindac were identified by this signature. In a second study, antiepileptic drugs were compared for covalent binding and their effects on OS/RM; felbamate, carbamazepine, and phenobarbital produced robust OS/RM gene expression. In the present study, liver RNA samples from drug-treated rats from more recent experiments were examined for statistical fit tomore » the OS/RM signature. Of all 97 drugs examined, in addition to the nine drugs noted above, 19 more were identified as OS/RM-producing compounds—chlorpromazine, clozapine, cyproterone acetate, dantrolene, dipyridamole, glibenclamide, isoniazid, ketoconazole, methapyrilene, naltrexone, nifedipine, sulfamethoxazole, tamoxifen, coumarin, ritonavir, amitriptyline, valproic acid, enalapril, and chloramphenicol. Importantly, all of the OS/RM drugs listed above have been linked to idiosyncratic hepatotoxicity, excepting chloramphenicol, which does not have a package label for hepatotoxicity, but does have a black box warning for idiosyncratic bone marrow suppression. Most of these drugs are not acutely toxic in the rat. The OS/RM signature should be useful to avoid idiosyncratic hepatotoxicity of drug candidates. - Highlights: • 28 of 97 drugs gave a positive OS/RM gene expression signature in rat liver. • The specificity of the signature for human idiosyncratic hepatotoxicants was 98%. • The sensitivity of the signature for human idiosyncratic hepatotoxicants was 75%. • The signature can help eliminate hepatotoxicants from drug development.« less
Kaufman, Alon; Dror, Gideon; Meilijson, Isaac; Ruppin, Eytan
2006-12-08
The claim that genetic properties of neurons significantly influence their synaptic network structure is a common notion in neuroscience. The nematode Caenorhabditis elegans provides an exciting opportunity to approach this question in a large-scale quantitative manner. Its synaptic connectivity network has been identified, and, combined with cellular studies, we currently have characteristic connectivity and gene expression signatures for most of its neurons. By using two complementary analysis assays we show that the expression signature of a neuron carries significant information about its synaptic connectivity signature, and identify a list of putative genes predicting neural connectivity. The current study rigorously quantifies the relation between gene expression and synaptic connectivity signatures in the C. elegans nervous system and identifies subsets of neurons where this relation is highly marked. The results presented and the genes identified provide a promising starting point for further, more detailed computational and experimental investigations.
A 7-Gene Signature Depicts the Biochemical Profile of Early Prefibrotic Myelofibrosis
Skov, Vibe; Burton, Mark; Thomassen, Mads; Stauffer Larsen, Thomas; Riley, Caroline H.; Brinch Madelung, Ann; Kjær, Lasse; Bondo, Henrik; Stamp, Inger; Ehinger, Mats; Dahl-Sørensen, Rasmus; Brochmann, Nana; Nielsen, Karsten; Thiele, Jürgen; Jensen, Morten K.; Weis Bjerrum, Ole; Kruse, Torben A.; Hasselbalch, Hans Carl
2016-01-01
Recent studies have shown that a large proportion of patients classified as essential thrombocythemia (ET) actually have early primary prefibrotic myelofibrosis (prePMF), which implies an inferior prognosis as compared to patients being diagnosed with so-called genuine or true ET. According to the World Health Organization (WHO) 2008 classification, bone marrow histology is a major component in the distinction between these disease entities. However, the differential diagnosis between them may be challenging and several studies have not been able to distinguish between them. Most lately, it has been argued that simple blood tests, including the leukocyte count and plasma lactate dehydrogenase (LDH) may be useful tools to separate genuine ET from prePMF, the latter disease entity more often being featured by anemia, leukocytosis and elevated LDH. Whole blood gene expression profiling was performed in 17 and 9 patients diagnosed with ET and PMF, respectively. Using elevated LDH obtained at the time of diagnosis as a marker of prePMF, a 7-gene signature was identified which correctly predicted the prePMF group with a sensitivity of 100% and a specificity of 89%. The 7 genes included MPO, CEACAM8, CRISP3, MS4A3, CEACAM6, HEMGN, and MMP8, which are genes known to be involved in inflammation, cell adhesion, differentiation and proliferation. Evaluation of bone marrow biopsies and the 7-gene signature showed a concordance rate of 71%, 79%, 62%, and 38%. Our 7-gene signature may be a useful tool to differentiate between genuine ET and prePMF but needs to be validated in a larger cohort of “ET” patients. PMID:27579896
Angular relational signature-based chest radiograph image view classification.
Santosh, K C; Wendling, Laurent
2018-01-22
In a computer-aided diagnosis (CAD) system, especially for chest radiograph or chest X-ray (CXR) screening, CXR image view information is required. Automatically separating CXR image view, frontal and lateral can ease subsequent CXR screening process, since the techniques may not equally work for both views. We present a novel technique to classify frontal and lateral CXR images, where we introduce angular relational signature through force histogram to extract features and apply three different state-of-the-art classifiers: multi-layer perceptron, random forest, and support vector machine to make a decision. We validated our fully automatic technique on a set of 8100 images hosted by the U.S. National Library of Medicine (NLM), National Institutes of Health (NIH), and achieved an accuracy close to 100%. Our method outperforms the state-of-the-art methods in terms of processing time (less than or close to 2 s for the whole test data) while the accuracies can be compared, and therefore, it justifies its practicality. Graphical Abstract Interpreting chest X-ray (CXR) through the angular relational signature.
Feature-based RNN target recognition
NASA Astrophysics Data System (ADS)
Bakircioglu, Hakan; Gelenbe, Erol
1998-09-01
Detection and recognition of target signatures in sensory data obtained by synthetic aperture radar (SAR), forward- looking infrared, or laser radar, have received considerable attention in the literature. In this paper, we propose a feature based target classification methodology to detect and classify targets in cluttered SAR images, that makes use of selective signature data from sensory data, together with a neural network technique which uses a set of trained networks based on the Random Neural Network (RNN) model (Gelenbe 89, 90, 91, 93) which is trained to act as a matched filter. We propose and investigate radial features of target shapes that are invariant to rotation, translation, and scale, to characterize target and clutter signatures. These features are then used to train a set of learning RNNs which can be used to detect targets within clutter with high accuracy, and to classify the targets or man-made objects from natural clutter. Experimental data from SAR imagery is used to illustrate and validate the proposed method, and to calculate Receiver Operating Characteristics which illustrate the performance of the proposed algorithm.
Seera, Manjeevan; Lim, Chee Peng; Ishak, Dahaman; Singh, Harapajan
2012-01-01
In this paper, a novel approach to detect and classify comprehensive fault conditions of induction motors using a hybrid fuzzy min-max (FMM) neural network and classification and regression tree (CART) is proposed. The hybrid model, known as FMM-CART, exploits the advantages of both FMM and CART for undertaking data classification and rule extraction problems. A series of real experiments is conducted, whereby the motor current signature analysis method is applied to form a database comprising stator current signatures under different motor conditions. The signal harmonics from the power spectral density are extracted as discriminative input features for fault detection and classification with FMM-CART. A comprehensive list of induction motor fault conditions, viz., broken rotor bars, unbalanced voltages, stator winding faults, and eccentricity problems, has been successfully classified using FMM-CART with good accuracy rates. The results are comparable, if not better, than those reported in the literature. Useful explanatory rules in the form of a decision tree are also elicited from FMM-CART to analyze and understand different fault conditions of induction motors.
Shermeyer, Jacob S.; Haack, Barry N.
2015-01-01
Two forestry-change detection methods are described, compared, and contrasted for estimating deforestation and growth in threatened forests in southern Peru from 2000 to 2010. The methods used in this study rely on freely available data, including atmospherically corrected Landsat 5 Thematic Mapper and Moderate Resolution Imaging Spectroradiometer (MODIS) vegetation continuous fields (VCF). The two methods include a conventional supervised signature extraction method and a unique self-calibrating method called MODIS VCF guided forest/nonforest (FNF) masking. The process chain for each of these methods includes a threshold classification of MODIS VCF, training data or signature extraction, signature evaluation, k-nearest neighbor classification, analyst-guided reclassification, and postclassification image differencing to generate forest change maps. Comparisons of all methods were based on an accuracy assessment using 500 validation pixels. Results of this accuracy assessment indicate that FNF masking had a 5% higher overall accuracy and was superior to conventional supervised classification when estimating forest change. Both methods succeeded in classifying persistently forested and nonforested areas, and both had limitations when classifying forest change.
Arnold, Kelly B; Burgener, Adam; Birse, Kenzie; Romas, Laura; Dunphy, Laura J; Shahabi, Kamnoosh; Abou, Max; Westmacott, Garrett R; McCorrister, Stuart; Kwatampora, Jessie; Nyanga, Billy; Kimani, Joshua; Masson, Lindi; Liebenberg, Lenine J; Abdool Karim, Salim S; Passmore, Jo-Ann S; Lauffenburger, Douglas A; Kaul, Rupert; McKinnon, Lyle R
2016-01-01
Elevated inflammatory cytokines (EMCs) at mucosal surfaces have been associated with HIV susceptibility, but the underlying mechanisms remain unclear. We characterized the soluble mucosal proteome associated with elevated cytokine expression in the female reproductive tract. A scoring system was devised based on the elevation (upper quartile) of at least three of seven inflammatory cytokines in cervicovaginal lavage. Using this score, HIV-uninfected Kenyan women were classified as either having EMC (n=28) or not (n=68). Of 455 proteins quantified in proteomic analyses, 53 were associated with EMC (5% false discovery rate threshold). EMCs were associated with proteases, cell motility, and actin cytoskeletal pathways, whereas protease inhibitor, epidermal cell differentiation, and cornified envelope pathways were decreased. Multivariate analysis identified an optimal signature of 16 proteins that distinguished the EMC group with 88% accuracy. Three proteins in this signature were neutrophil-associated proteases that correlated with many cytokines, especially GM-CSF (granulocyte-macrophage colony-stimulating factor), IL-1β (interleukin-1β), MIP-3α (macrophage inflammatory protein-3α), IL-17, and IL-8. Gene set enrichment analyses implicated activated immune cells; we verified experimentally that EMC women had an increased frequency of endocervical CD4(+) T cells. These data reveal strong linkages between mucosal cytokines, barrier function, proteases, and immune cell movement, and propose these as potential mechanisms that increase risk of HIV acquisition.
2010-01-01
Background Hyperactivation of the Ras signaling pathway is a driver of many cancers, and RAS pathway activation can predict response to targeted therapies. Therefore, optimal methods for measuring Ras pathway activation are critical. The main focus of our work was to develop a gene expression signature that is predictive of RAS pathway dependence. Methods We used the coherent expression of RAS pathway-related genes across multiple datasets to derive a RAS pathway gene expression signature and generate RAS pathway activation scores in pre-clinical cancer models and human tumors. We then related this signature to KRAS mutation status and drug response data in pre-clinical and clinical datasets. Results The RAS signature score is predictive of KRAS mutation status in lung tumors and cell lines with high (> 90%) sensitivity but relatively low (50%) specificity due to samples that have apparent RAS pathway activation in the absence of a KRAS mutation. In lung and breast cancer cell line panels, the RAS pathway signature score correlates with pMEK and pERK expression, and predicts resistance to AKT inhibition and sensitivity to MEK inhibition within both KRAS mutant and KRAS wild-type groups. The RAS pathway signature is upregulated in breast cancer cell lines that have acquired resistance to AKT inhibition, and is downregulated by inhibition of MEK. In lung cancer cell lines knockdown of KRAS using siRNA demonstrates that the RAS pathway signature is a better measure of dependence on RAS compared to KRAS mutation status. In human tumors, the RAS pathway signature is elevated in ER negative breast tumors and lung adenocarcinomas, and predicts resistance to cetuximab in metastatic colorectal cancer. Conclusions These data demonstrate that the RAS pathway signature is superior to KRAS mutation status for the prediction of dependence on RAS signaling, can predict response to PI3K and RAS pathway inhibitors, and is likely to have the most clinical utility in lung and breast tumors. PMID:20591134
Klein, Hans-Ulrich; Ruckert, Christian; Kohlmann, Alexander; Bullinger, Lars; Thiede, Christian; Haferlach, Torsten; Dugas, Martin
2009-12-15
Multiple gene expression signatures derived from microarray experiments have been published in the field of leukemia research. A comparison of these signatures with results from new experiments is useful for verification as well as for interpretation of the results obtained. Currently, the percentage of overlapping genes is frequently used to compare published gene signatures against a signature derived from a new experiment. However, it has been shown that the percentage of overlapping genes is of limited use for comparing two experiments due to the variability of gene signatures caused by different array platforms or assay-specific influencing parameters. Here, we present a robust approach for a systematic and quantitative comparison of published gene expression signatures with an exemplary query dataset. A database storing 138 leukemia-related published gene signatures was designed. Each gene signature was manually annotated with terms according to a leukemia-specific taxonomy. Two analysis steps are implemented to compare a new microarray dataset with the results from previous experiments stored and curated in the database. First, the global test method is applied to assess gene signatures and to constitute a ranking among them. In a subsequent analysis step, the focus is shifted from single gene signatures to chromosomal aberrations or molecular mutations as modeled in the taxonomy. Potentially interesting disease characteristics are detected based on the ranking of gene signatures associated with these aberrations stored in the database. Two example analyses are presented. An implementation of the approach is freely available as web-based application. The presented approach helps researchers to systematically integrate the knowledge derived from numerous microarray experiments into the analysis of a new dataset. By means of example leukemia datasets we demonstrate that this approach detects related experiments as well as related molecular mutations and may help to interpret new microarray data.
L.R. Iverson; E.A. Cook; R.L. Graham
1989-01-01
An approach to extending high-resolution forest cover information across large regions is presented and validated. Landsat Thematic Mapper (TM) data were classified into forest and nonforest for a portion of Jackson County, Illinois. The classified TM image was then used to determine the relationship between forest cover and the spectral signature of Advanced Very High...
Van Vugt, Floris Tijmen; Jabusch, Hans-Christian; Altenmüller, Eckart
2013-01-01
Whatever we do, we do it in our own way, and we recognize master artists by small samples of their work. This study investigates individuality of temporal deviations in musical scales in pianists in the absence of deliberate expressive intention. Note-by-note timing deviations away from regularity form a remarkably consistent “pianistic fingerprint.” First, eight professional pianists played C-major scales in two sessions, separated by 15 min. Euclidian distances between deviation traces originating from different pianists were reliably larger than traces originating from the same pianist. As a result, a simple classifier that matched deviation traces by minimizing their distance was able to recognize each pianist with 100% accuracy. Furthermore, within each pianist, fingerprints produced by the same movements were more similar than fingerprints resulting in the same scale sound. This allowed us to conclude that the fingerprints are mostly neuromuscular rather than intentional or expressive in nature. However, human listeners were not able to distinguish the temporal fingerprints by ear. Next, 18 pianists played C-major scales on a normal or muted piano. Recognition rates ranged from 83 to 100%, further supporting the view that auditory feedback is not implicated in the creation of the temporal signature. Finally, 20 pianists were recognized 20 months later at above chance level, showing signature effects to be long lasting. Our results indicate that even non-expressive playing of scales reveals consistent, partially effector-unspecific, but inaudible inter-individual differences. We suggest that machine learning studies into individuality in performance will need to take into account unintentional but consistent variability below the perceptual threshold. PMID:23519688
Thakar, Juilee; Mohanty, Subhasis; West, A Phillip; Joshi, Samit R; Ueda, Ikuyo; Wilson, Jean; Meng, Hailong; Blevins, Tamara P; Tsang, Sui; Trentalange, Mark; Siconolfi, Barbara; Park, Koonam; Gill, Thomas M; Belshe, Robert B; Kaech, Susan M; Shadel, Gerald S; Kleinstein, Steven H; Shaw, Albert C
2015-01-01
To elucidate gene expression pathways underlying age-associated impairment in influenza vaccine response, we screened young (age 21-30) and older (age≥65) adults receiving influenza vaccine in two consecutive seasons and identified those with strong or absent response to vaccine, including a subset of older adults meeting criteria for frailty. PBMCs obtained prior to vaccination (Day 0) and at day 2 or 4, day 7 and day 28 post-vaccine were subjected to gene expression microarray analysis. We defined a response signature and also detected induction of a type I interferon response at day 2 and a plasma cell signature at day 7 post-vaccine in young responders. The response signature was dysregulated in older adults, with the plasma cell signature induced at day 2, and was never induced in frail subjects (who were all non-responders). We also identified a mitochondrial signature in young vaccine responders containing genes mediating mitochondrial biogenesis and oxidative phosphorylation that was consistent in two different vaccine seasons and verified by analyses of mitochondrial content and protein expression. These results represent the first genome-wide transcriptional profiling analysis of age-associated dynamics following influenza vaccination, and implicate changes in mitochondrial biogenesis and function as a critical factor in human vaccine responsiveness.
Wen, Qing; Kim, Chang-Sik; Hamilton, Peter W; Zhang, Shu-Dong
2016-05-11
Gene expression connectivity mapping has gained much popularity recently with a number of successful applications in biomedical research testifying its utility and promise. Previously methodological research in connectivity mapping mainly focused on two of the key components in the framework, namely, the reference gene expression profiles and the connectivity mapping algorithms. The other key component in this framework, the query gene signature, has been left to users to construct without much consensus on how this should be done, albeit it has been an issue most relevant to end users. As a key input to the connectivity mapping process, gene signature is crucially important in returning biologically meaningful and relevant results. This paper intends to formulate a standardized procedure for constructing high quality gene signatures from a user's perspective. We describe a two-stage process for making quality gene signatures using gene expression data as initial inputs. First, a differential gene expression analysis comparing two distinct biological states; only the genes that have passed stringent statistical criteria are considered in the second stage of the process, which involves ranking genes based on statistical as well as biological significance. We introduce a "gene signature progression" method as a standard procedure in connectivity mapping. Starting from the highest ranked gene, we progressively determine the minimum length of the gene signature that allows connections to the reference profiles (drugs) being established with a preset target false discovery rate. We use a lung cancer dataset and a breast cancer dataset as two case studies to demonstrate how this standardized procedure works, and we show that highly relevant and interesting biological connections are returned. Of particular note is gefitinib, identified as among the candidate therapeutics in our lung cancer case study. Our gene signature was based on gene expression data from Taiwan female non-smoker lung cancer patients, while there is evidence from independent studies that gefitinib is highly effective in treating women, non-smoker or former light smoker, advanced non-small cell lung cancer patients of Asian origin. In summary, we introduced a gene signature progression method into connectivity mapping, which enables a standardized procedure for constructing high quality gene signatures. This progression method is particularly useful when the number of differentially expressed genes identified is large, and when there is a need to prioritize them to be included in the query signature. The results from two case studies demonstrate that the approach we have developed is capable of obtaining pertinent candidate drugs with high precision.
Cheng, Jun; He, Jun; Liu, Huaping; Cai, Hao; Hong, Guini; Zhang, Jiahui; Li, Na; Ao, Lu; Guo, Zheng
2017-01-01
Formalin-fixed paraffin-embedded (FFPE) samples represent a valuable resource for clinical researches. However, FFPE samples are usually considered an unreliable source for gene expression analysis due to the partial RNA degradation. In this study, through comparing gene expression profiles between FFPE samples and paired fresh-frozen (FF) samples for three cancer types, we firstly showed that expression measurements of thousands of genes had at least two-fold change in FFPE samples compared with paired FF samples. Therefore, for a transcriptional signature based on risk scores summarized from the expression levels of the signature genes, the risk score thresholds trained from FFPE (or FF) samples could not be applied to FF (or FFPE) samples. On the other hand, we found that more than 90% of the relative expression orderings (REOs) of gene pairs in the FF samples were maintained in their paired FFPE samples and largely unaffected by the storage time. The result suggested that the REOs of gene pairs were highly robust against partial RNA degradation in FFPE samples. Finally, as a case study, we developed a REOs-based signature to distinguish liver cirrhosis from hepatocellular carcinoma (HCC) using FFPE samples. The signature was validated in four datasets of FFPE samples and eight datasets of FF samples. In conclusion, the valuable FFPE samples can be fully exploited to identify REOs-based diagnostic and prognostic signatures which could be robustly applicable to both FF samples and FFPE samples with degraded RNA. PMID:28036264
Authentication Based on Pole-zero Models of Signature Velocity
Rashidi, Saeid; Fallah, Ali; Towhidkhah, Farzad
2013-01-01
With the increase of communication and financial transaction through internet, on-line signature verification is an accepted biometric technology for access control and plays a significant role in authenticity and authorization in modernized society. Therefore, fast and precise algorithms for the signature verification are very attractive. The goal of this paper is modeling of velocity signal that pattern and properties is stable for persons. With using pole-zero models based on discrete cosine transform, precise method is proposed for modeling and then features is founded from strokes. With using linear, parzen window and support vector machine classifiers, the signature verification technique was tested with a large number of authentic and forgery signatures and has demonstrated the good potential of this technique. The signatures are collected from three different database include a proprietary database, the SVC2004 and the Sabanci University signature database benchmark databases. Experimental results based on Persian, SVC2004 and SUSIG databases show that our method achieves an equal error rate of 5.91%, 5.62% and 3.91% in the skilled forgeries, respectively. PMID:24696797
Nagaraju, Kanneboyina; Ghimbovschi, Svetlana; Rayavarapu, Sree; Phadke, Aditi; Rider, Lisa G.; Hoffman, Eric P.
2016-01-01
Abstract Objective. To identify muscle gene expression patterns that predict rituximab responses and assess the effects of rituximab on muscle gene expression in PM and DM. Methods. In an attempt to understand the molecular mechanism of response and non-response to rituximab therapy, we performed Affymetrix gene expression array analyses on muscle biopsy specimens taken before and after rituximab therapy from eight PM and two DM patients in the Rituximab in Myositis study. We also analysed selected muscle-infiltrating cell phenotypes in these biopsies by immunohistochemical staining. Partek and Ingenuity pathway analyses assessed the gene pathways and networks. Results. Myeloid type I IFN signature genes were expressed at higher levels at baseline in the skeletal muscle of rituximab responders than in non-responders, whereas classic non-myeloid IFN signature genes were expressed at higher levels in non-responders at baseline. Also, rituximab responders have a greater reduction of the myeloid and non-myeloid type I IFN signatures than non-responders. The decrease in the type I IFN signature following administration of rituximab may be associated with the decreases in muscle-infiltrating CD19 + B cells and CD68 + macrophages in responders. Conclusion. Our findings suggest that high levels of myeloid type I IFN gene expression in skeletal muscle predict responses to rituximab in PM/DM and that rituximab responders also have a greater decrease in the expression of these genes. These data add further evidence to recent studies defining the type I IFN signature as both a predictor of therapeutic responses and a biomarker of myositis disease activity. PMID:27215813
Wang, Zhanwei; Katsaros, Dionyssios; Shen, Yi; Fu, Yuanyuan; Canuto, Emilie Marion; Benedetto, Chiara; Lu, Lingeng; Chu, Wen-Ming; Risch, Harvey A.; Yu, Herbert
2015-01-01
To investigate the biologic relevance and clinical implication of genes involved in multiple gene expression signatures for breast cancer prognosis, we identified 16 published gene expression signatures, and selected two genes, MAD2L1 and BUB1. These genes appeared in 5 signatures and were involved in cell-cycle regulation. We analyzed the expression of these genes in relation to tumor features and disease outcomes. In vitro experiments were also performed in two breast cancer cell lines, MDA-MB-231 and MDA-MB-468, to assess cell proliferation, migration and invasion after knocking down the expression of these genes. High expression of these genes was found to be associated with aggressive tumors and poor disease-free survival of 203 breast cancer patients in our study, and the association with survival was confirmed in an online database consisting of 914 patients. In vitro experiments demonstrated that lowering the expression of these genes by siRNAs reduced tumor cell growth and inhibited cell migration and invasion. Our investigation suggests that MAD2L1 and BUB1 may play important roles in breast cancer progression, and measuring the expression of these genes may assist the prediction of breast cancer prognosis. PMID:26287798
Wang, Dong-Yu; Done, Susan J; Mc Cready, David R; Leong, Wey L
2014-07-04
Using genome-wide expression profiles of a prospective training cohort of breast cancer patients, ClinicoMolecular Triad Classification (CMTC) was recently developed to classify breast cancers into three clinically relevant groups to aid treatment decisions. CMTC was found to be both prognostic and predictive in a large external breast cancer cohort in that study. This study serves to validate the reproducibility of CMTC and its prognostic value using independent patient cohorts. An independent internal cohort (n = 284) and a new external cohort (n = 2,181) were used to validate the association of CMTC between clinicopathological factors, 12 known gene signatures, two molecular subtype classifiers, and 19 oncogenic signalling pathway activities, and to reproduce the abilities of CMTC to predict clinical outcomes of breast cancer. In addition, we also updated the outcome data of the original training cohort (n = 147). The original training cohort reached a statistically significant difference (p < 0.05) in disease-free survivals between the three CMTC groups after an additional two years of follow-up (median = 55 months). The prognostic value of the triad classification was reproduced in the second independent internal cohort and the new external validation cohort. CMTC achieved even higher prognostic significance when all available patients were analyzed (n = 4,851). Oncogenic pathways Myc, E2F1, Ras and β-catenin were again implicated in the high-risk groups. Both prospective internal cohorts and the independent external cohorts reproduced the triad classification of CMTC and its prognostic significance. CMTC is an independent prognostic predictor, and it outperformed 12 other known prognostic gene signatures, molecular subtype classifications, and all other standard prognostic clinicopathological factors. Our results support further development of CMTC portfolio into a guide for personalized breast cancer treatments.
Kim, Kyu-Tae; Lee, Hye Won; Lee, Hae-Ock; Kim, Sang Cheol; Seo, Yun Jee; Chung, Woosung; Eum, Hye Hyeon; Nam, Do-Hyun; Kim, Junhyong; Joo, Kyeung Min; Park, Woong-Yang
2015-06-19
Intra-tumoral genetic and functional heterogeneity correlates with cancer clinical prognoses. However, the mechanisms by which intra-tumoral heterogeneity impacts therapeutic outcome remain poorly understood. RNA sequencing (RNA-seq) of single tumor cells can provide comprehensive information about gene expression and single-nucleotide variations in individual tumor cells, which may allow for the translation of heterogeneous tumor cell functional responses into customized anti-cancer treatments. We isolated 34 patient-derived xenograft (PDX) tumor cells from a lung adenocarcinoma patient tumor xenograft. Individual tumor cells were subjected to single cell RNA-seq for gene expression profiling and expressed mutation profiling. Fifty tumor-specific single-nucleotide variations, including KRAS(G12D), were observed to be heterogeneous in individual PDX cells. Semi-supervised clustering, based on KRAS(G12D) mutant expression and a risk score representing expression of 69 lung adenocarcinoma-prognostic genes, classified PDX cells into four groups. PDX cells that survived in vitro anti-cancer drug treatment displayed transcriptome signatures consistent with the group characterized by KRAS(G12D) and low risk score. Single-cell RNA-seq on viable PDX cells identified a candidate tumor cell subgroup associated with anti-cancer drug resistance. Thus, single-cell RNA-seq is a powerful approach for identifying unique tumor cell-specific gene expression profiles which could facilitate the development of optimized clinical anti-cancer strategies.
Building predictive gene signatures through simultaneous assessment of transcription factor activation and gene expression Exposure to many drugs and environmentally-relevant chemicals can cause adverse outcomes. These adverse outcomes, such as cancer, have been linked to mol...
Brain amyloidosis ascertainment from cognitive, imaging, and peripheral blood protein measures
Hwang, Kristy S.; Avila, David; Elashoff, David; Kohannim, Omid; Teng, Edmond; Sokolow, Sophie; Jack, Clifford R.; Jagust, William J.; Shaw, Leslie; Trojanowski, John Q.; Weiner, Michael W.; Thompson, Paul M.
2015-01-01
Background: The goal of this study was to identify a clinical biomarker signature of brain amyloidosis in the Alzheimer's Disease Neuroimaging Initiative 1 (ADNI1) mild cognitive impairment (MCI) cohort. Methods: We developed a multimodal biomarker classifier for predicting brain amyloidosis using cognitive, imaging, and peripheral blood protein ADNI1 MCI data. We used CSF β-amyloid 1–42 (Aβ42) ≤192 pg/mL as proxy measure for Pittsburgh compound B (PiB)-PET standard uptake value ratio ≥1.5. We trained our classifier in the subcohort with CSF Aβ42 but no PiB-PET data and tested its performance in the subcohort with PiB-PET but no CSF Aβ42 data. We also examined the utility of our biomarker signature for predicting disease progression from MCI to Alzheimer dementia. Results: The CSF training classifier selected Mini-Mental State Examination, Trails B, Auditory Verbal Learning Test delayed recall, education, APOE genotype, interleukin 6 receptor, clusterin, and ApoE protein, and achieved leave-one-out accuracy of 85% (area under the curve [AUC] = 0.8). The PiB testing classifier achieved an AUC of 0.72, and when classifier self-tuning was allowed, AUC = 0.74. The 36-month disease-progression classifier achieved AUC = 0.75 and accuracy = 71%. Conclusions: Automated classifiers based on cognitive and peripheral blood protein variables can identify the presence of brain amyloidosis with a modest level of accuracy. Such methods could have implications for clinical trial design and enrollment in the near future. Classification of evidence: This study provides Class II evidence that a classification algorithm based on cognitive, imaging, and peripheral blood protein measures identifies patients with brain amyloid on PiB-PET with moderate accuracy (sensitivity 68%, specificity 78%). PMID:25609767
Brain amyloidosis ascertainment from cognitive, imaging, and peripheral blood protein measures.
Apostolova, Liana G; Hwang, Kristy S; Avila, David; Elashoff, David; Kohannim, Omid; Teng, Edmond; Sokolow, Sophie; Jack, Clifford R; Jagust, William J; Shaw, Leslie; Trojanowski, John Q; Weiner, Michael W; Thompson, Paul M
2015-02-17
The goal of this study was to identify a clinical biomarker signature of brain amyloidosis in the Alzheimer's Disease Neuroimaging Initiative 1 (ADNI1) mild cognitive impairment (MCI) cohort. We developed a multimodal biomarker classifier for predicting brain amyloidosis using cognitive, imaging, and peripheral blood protein ADNI1 MCI data. We used CSF β-amyloid 1-42 (Aβ42) ≤ 192 pg/mL as proxy measure for Pittsburgh compound B (PiB)-PET standard uptake value ratio ≥ 1.5. We trained our classifier in the subcohort with CSF Aβ42 but no PiB-PET data and tested its performance in the subcohort with PiB-PET but no CSF Aβ42 data. We also examined the utility of our biomarker signature for predicting disease progression from MCI to Alzheimer dementia. The CSF training classifier selected Mini-Mental State Examination, Trails B, Auditory Verbal Learning Test delayed recall, education, APOE genotype, interleukin 6 receptor, clusterin, and ApoE protein, and achieved leave-one-out accuracy of 85% (area under the curve [AUC] = 0.8). The PiB testing classifier achieved an AUC of 0.72, and when classifier self-tuning was allowed, AUC = 0.74. The 36-month disease-progression classifier achieved AUC = 0.75 and accuracy = 71%. Automated classifiers based on cognitive and peripheral blood protein variables can identify the presence of brain amyloidosis with a modest level of accuracy. Such methods could have implications for clinical trial design and enrollment in the near future. This study provides Class II evidence that a classification algorithm based on cognitive, imaging, and peripheral blood protein measures identifies patients with brain amyloid on PiB-PET with moderate accuracy (sensitivity 68%, specificity 78%). © 2015 American Academy of Neurology.
Starry Messages - Searching for Signatures of Interstellar Archaeology
NASA Astrophysics Data System (ADS)
Carrigan, R. A., Jr.
Searching for signatures of cosmic-scale archaeological artefacts such as Dyson spheres or Kardashev civilizations is an interesting alternative to conventional SETI. Uncovering such an artifact does not require the intentional transmission of a signal on the part of the originating civilization. This type of search is called interstellar archaeology or sometimes cosmic archaeology . The detection of intelligence elsewhere in the Universe with interstellar archaeology or SETI would have broad implications for science. For example, the constraints of the anthropic principle would have to be loosened if a different type of intelligence was discovered elsewhere. A variety of interstellar archaeology signatures are discussed including non-natural planetary atmospheric constituents, stellar doping with isotopes of nuclear wastes, Dyson spheres, as well as signatures of stellar and galactic-scale engineering. The concept of a Fermi bubble due to interstellar migration is introduced in the discussion of galactic signatures. These potential interstellar archaeological signatures are classified using the Kardashev scale. A modified Drake equation is used to evaluate the relative challenges of finding various sources. With few exceptions interstellar archaeological signatures are clouded and beyond current technological capabilities. However SETI for so-called cultural transmissions and planetary atmosphere signatures are within reach.
NASA Technical Reports Server (NTRS)
Sadowski, F. E.; Sarno, J. E.
1976-01-01
First, an analysis of forest feature signatures was used to help explain the large variation in classification accuracy that can occur among individual forest features for any one case of spatial resolution and the inconsistent changes in classification accuracy that were demonstrated among features as spatial resolution was degraded. Second, the classification rejection threshold was varied in an effort to reduce the large proportion of unclassified resolution elements that previously appeared in the processing of coarse resolution data when a constant rejection threshold was used for all cases of spatial resolution. For the signature analysis, two-channel ellipse plots showing the feature signature distributions for several cases of spatial resolution indicated that the capability of signatures to correctly identify their respective features is dependent on the amount of statistical overlap among signatures. Reductions in signature variance that occur in data of degraded spatial resolution may not necessarily decrease the amount of statistical overlap among signatures having large variance and small mean separations. Features classified by such signatures may thus continue to have similar amounts of misclassified elements in coarser resolution data, and thus, not necessarily improve in classification accuracy.
NASA Astrophysics Data System (ADS)
Kim, Youngwook; Park, Jinhee; Moon, Taesup
2017-05-01
Remote detection of human aquatic activity can be applied not only to ocean surveillance but also to rescue operations. When a human is illuminated by electromagnetic waves, a Doppler signal is generated from his or her moving parts. Indeed, bodily movements are what make humans' micro-Doppler signatures unique, offering a chance to classify human motions. Certain studies have analyzed and attempted to recognize human aquatic activity, but the topic has yet to be extensively studied. In the present research, we simulate the micro-Doppler signatures of a swimming person in an attempt to investigate those signatures' characteristics. We model human arms as point scatterers while assuming a simple arm motion. By means of such a simulation, we can obtain spectrograms from a swimming person, then extend our measurement to multiple participants. Measurements are taken from five aquatic activities featuring five participants, comprising freestyle, backstroke, and breaststroke, pulling a boat, and rowing. As suggested by the simulation study, the spectrograms for the five activities show different micro-Doppler signatures; hence, we propose to classify them using a deep convolutional neural network (DCNN). In particular, we suggest the use of a transfer-learned DCNN, which is based on a DCNN pretrained by a large-scale RGB image dataset that is, ImageNet. The classification accuracy is calculated using fivefold cross-validation on our dataset. We find that a DCNN trained through transfer learning achieves the highest accuracy while also providing a significant performance boost over the conventional classification method.
Investigations in adaptive processing of multispectral data
NASA Technical Reports Server (NTRS)
Kriegler, F. J.; Horwitz, H. M.
1973-01-01
Adaptive data processing procedures are applied to the problem of classifying objects in a scene scanned by multispectral sensor. These procedures show a performance improvement over standard nonadaptive techniques. Some sources of error in classification are identified and those correctable by adaptive processing are discussed. Experiments in adaptation of signature means by decision-directed methods are described. Some of these methods assume correlation between the trajectories of different signature means; for others this assumption is not made.
Using linear algebra for protein structural comparison and classification
2009-01-01
In this article, we describe a novel methodology to extract semantic characteristics from protein structures using linear algebra in order to compose structural signature vectors which may be used efficiently to compare and classify protein structures into fold families. These signatures are built from the pattern of hydrophobic intrachain interactions using Singular Value Decomposition (SVD) and Latent Semantic Indexing (LSI) techniques. Considering proteins as documents and contacts as terms, we have built a retrieval system which is able to find conserved contacts in samples of myoglobin fold family and to retrieve these proteins among proteins of varied folds with precision of up to 80%. The classifier is a web tool available at our laboratory website. Users can search for similar chains from a specific PDB, view and compare their contact maps and browse their structures using a JMol plug-in. PMID:21637532
Using linear algebra for protein structural comparison and classification.
Gomide, Janaína; Melo-Minardi, Raquel; Dos Santos, Marcos Augusto; Neshich, Goran; Meira, Wagner; Lopes, Júlio César; Santoro, Marcelo
2009-07-01
In this article, we describe a novel methodology to extract semantic characteristics from protein structures using linear algebra in order to compose structural signature vectors which may be used efficiently to compare and classify protein structures into fold families. These signatures are built from the pattern of hydrophobic intrachain interactions using Singular Value Decomposition (SVD) and Latent Semantic Indexing (LSI) techniques. Considering proteins as documents and contacts as terms, we have built a retrieval system which is able to find conserved contacts in samples of myoglobin fold family and to retrieve these proteins among proteins of varied folds with precision of up to 80%. The classifier is a web tool available at our laboratory website. Users can search for similar chains from a specific PDB, view and compare their contact maps and browse their structures using a JMol plug-in.
Dozmorov, Mikhail G
2015-01-01
Although age-associated gene expression and methylation changes have been reported throughout the literature, the unifying epigenomic principles of aging remain poorly understood. Recent explosion in availability and resolution of functional/regulatory genome annotation data (epigenomic data), such as that provided by the ENCODE and Roadmap Epigenomics projects, provides an opportunity for the identification of epigenomic mechanisms potentially altered by age-associated differentially methylated regions (aDMRs) and regulatory signatures in the promoters of age-associated genes (aGENs). In this study we found that aDMRs and aGENs identified in multiple independent studies share a common Polycomb Repressive Complex 2 signature marked by EZH2, SUZ12, CTCF binding sites, repressive H3K27me3, and activating H3K4me1 histone modification marks, and a “poised promoter” chromatin state. This signature is depleted in RNA Polymerase II-associated transcription factor binding sites, activating H3K79me2, H3K36me3, H3K27ac marks, and an “active promoter” chromatin state. The PRC2 signature was shown to be generally stable across cell types. When considering the directionality of methylation changes, we found the PRC2 signature to be associated with aDMRs hypermethylated with age, while hypomethylated aDMRs were associated with enhancers. In contrast, aGENs were associated with the PRC2 signature independently of the directionality of gene expression changes. In this study we demonstrate that the PRC2 signature is the common epigenomic context of genomic regions associated with hypermethylation and gene expression changes in aging. PMID:25880792
Ferreira, Ricardo C.; Guo, Hui; Coulson, Richard M.R.; Smyth, Deborah J.; Pekalski, Marcin L.; Burren, Oliver S.; Cutler, Antony J.; Doecke, James D.; Flint, Shaun; McKinney, Eoin F.; Lyons, Paul A.; Smith, Kenneth G.C.; Achenbach, Peter; Beyerlein, Andreas; Dunger, David B.; Clayton, David G.; Wicker, Linda S.; Bonifacio, Ezio
2014-01-01
Diagnosis of the autoimmune disease type 1 diabetes (T1D) is preceded by the appearance of circulating autoantibodies to pancreatic islets. However, almost nothing is known about events leading to this islet autoimmunity. Previous epidemiological and genetic data have associated viral infections and antiviral type I interferon (IFN) immune response genes with T1D. Here, we first used DNA microarray analysis to identify IFN-β–inducible genes in vitro and then used this set of genes to define an IFN-inducible transcriptional signature in peripheral blood mononuclear cells from a group of active systemic lupus erythematosus patients (n = 25). Using this predefined set of 225 IFN signature genes, we investigated the expression of the signature in cohorts of healthy controls (n = 87), patients with T1D (n = 64), and a large longitudinal birth cohort of children genetically predisposed to T1D (n = 109; 454 microarrayed samples). Expression of the IFN signature was increased in genetically predisposed children before the development of autoantibodies (P = 0.0012) but not in patients with established T1D. Upregulation of IFN-inducible genes was transient, temporally associated with a recent history of upper respiratory tract infections (P = 0.0064), and marked by increased expression of SIGLEC-1 (CD169), a lectin-like receptor expressed on CD14+ monocytes. DNA variation in IFN-inducible genes altered T1D risk (P = 0.007), as exemplified by IFIH1, one of the genes in our IFN signature for which increased expression is a known risk factor for disease. These findings identify transient increased expression of type I IFN genes in preclinical diabetes as a risk factor for autoimmunity in children with a genetic predisposition to T1D. PMID:24561305
Nagy, Ádám; Pongor, Lőrinc Sándor; Szabó, András; Santarpia, Mariacarmela; Győrffy, Balázs
2017-02-15
KRAS is the most frequently mutated oncogene in non-small cell lung cancer (NSCLC). However, the prognostic role of KRAS mutation status in NSCLC still remains controversial. We hypothesize that the expression changes of genes affected by KRAS mutation status will have the most prominent effect and could be used as a prognostic signature in lung cancer. We divided NSCLC patients with mutation and RNA-seq data into KRAS mutated and wild type groups. Mann-Whitney test was used to identify genes showing altered expression between these cohorts. Mean expression of the top five genes was designated as a "transcriptomic fingerprint" of the mutation. We evaluated the effect of this signature on clinical outcome in 2,437 NSCLC patients using univariate and multivariate Cox regression analysis. Mutation of KRAS was most common in adenocarcinoma. Mutation status and KRAS expression were not correlated to prognosis. The transcriptomic fingerprint of KRAS include FOXRED2, KRAS, TOP1, PEX3 and ABL2. The KRAS signature had a high prognostic power. Similar results were achieved when using the second and third set of strongest genes. Moreover, all cutoff values delivered significant prognostic power (p < 0.01). The KRAS signature also remained significant (p < 0.01) in a multivariate analysis including age, gender, smoking history and tumor stage. We generated a "surrogate signature" of KRAS mutation status in NSCLC patients by computationally linking genotype and gene expression. We show that secondary effects of a mutation can have a higher prognostic relevance than the primary genetic alteration itself. © 2016 The Authors International Journal of Cancer published by John Wiley & Sons Ltd on behalf of UICC.
Martínez-Terroba, Elena; Behrens, Carmen; de Miguel, Fernando J; Agorreta, Jackeline; Monsó, Eduard; Millares, Laura; Sainz, Cristina; Mesa-Guzman, Miguel; Pérez-Gracia, Jose Luis; Lozano, María Dolores; Zulueta, Javier J; Pio, Ruben; Wistuba, Ignacio I; Montuenga, Luis M; Pajares, María J
2018-05-13
Each of the pathological stages (I-IIIa) in which surgically resected non-small cell lung cancer patients are classified conceals hidden biological heterogeneity, manifested in heterogeneous outcomes within each stage. Thus, the finding of robust and precise molecular classifiers to assess individual patient risk is an unmet medical need. Here we identified and validated the clinical utility of a new prognostic signature based on three proteins (BRCA1, QKI and SLC2A1) to stratify early lung adenocarcinoma patients according to their risk of recurrence or death. Patients were staged following the new International Association for the Study of Lung Cancer (IASLC) staging criteria (8 th edition, 2018). A test cohort (n=239) was used to assess the value of this new prognostic index (PI) based on the three proteins. The prognostic signature was developed by Cox regression following stringent statistical criteria (TRIPOD: Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis). The model resulted in a highly significant predictor of five-year outcome for disease-free survival (P<0.001) and overall survival (P<0.001). The prognostic ability of the model was externally validated in an independent multi-institutional cohort of patients (n=114, P=0.021). We also demonstrated that this molecular classifier adds relevant information to the gold standard TNM-based pathological staging with a highly significant improvement of likelihood ratio. We subsequently developed a combined prognostic index (CPI) including both the molecular and the pathological data which improved the risk stratification in both cohorts (P≤0.001). Moreover, the signature may help to select stage I-IIA patients who might benefit from adjuvant chemotherapy. In summary, this protein-based signature accurately identifies those patients with high risk of recurrence and death, and adds further prognostic information to the TNM-based clinical staging, even applying the new IASLC 8 th edition staging criteria. More importantly, it may be a valuable tool for selecting patients for adjuvant therapy. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
miRNA signature associated with outcome of gastric cancer patients following chemotherapy
2011-01-01
Background Identification of patients who likely will or will not benefit from cytotoxic chemotherapy through the use of biomarkers could greatly improve clinical management by better defining appropriate treatment options for patients. microRNAs may be potentially useful biomarkers that help guide individualized therapy for cancer because microRNA expression is dysregulated in cancer. In order to identify miRNA signatures for gastric cancer and for predicting clinical resistance to cisplatin/fluorouracil (CF) chemotherapy, a comprehensive miRNA microarray analysis was performed using endoscopic biopsy samples. Methods Biopsy samples were collected prior to chemotherapy from 90 gastric cancer patients treated with CF and from 34 healthy volunteers. At the time of disease progression, post-treatment samples were additionally collected from 8 clinical responders. miRNA expression was determined using a custom-designed Agilent microarray. In order to identify a miRNA signature for chemotherapy resistance, we correlated miRNA expression levels with the time to progression (TTP) of disease after CF therapy. Results A miRNA signature distinguishing gastric cancer from normal stomach epithelium was identified. 30 miRNAs were significantly inversely correlated with TTP whereas 28 miRNAs were significantly positively correlated with TTP of 82 cancer patients (P<0.05). Prominent among the upregulated miRNAs associated with chemosensitivity were miRNAs known to regulate apoptosis, including let-7g, miR-342, miR-16, miR-181, miR-1, and miR-34. When this 58-miRNA predictor was applied to a separate set of pre- and post-treatment tumor samples from the 8 clinical responders, all of the 8 pre-treatment samples were correctly predicted as low-risk, whereas samples from the post-treatment tumors that developed chemoresistance were predicted to be in the high-risk category by the 58 miRNA signature, suggesting that selection for the expression of these miRNAs occurred as chemoresistance arose. Conclusions We have identified 1) a miRNA expression signature that distinguishes gastric cancer from normal stomach epithelium from healthy volunteers, and 2) a chemoreresistance miRNA expression signature that is correlated with TTP after CF therapy. The chemoresistance miRNA expression signature includes several miRNAs previously shown to regulate apoptosis in vitro, and warrants further validation. PMID:22112324
2013-01-01
Background The prostate is divided into three glandular zones, the peripheral zone (PZ), the transition zone (TZ), and the central zone. Most prostate tumors arise in the peripheral zone (70-75%) and in the transition zone (20-25%) while only 10% arise in the central zone. The aim of this study was to investigate if differences in miRNA expression could be a possible explanation for the difference in propensity of tumors in the zones of the prostate. Methods Patients with prostate cancer were included in the study if they had a tumor with Gleason grade 3 in the PZ, the TZ, or both (n=16). Normal prostate tissue was collected from men undergoing cystoprostatectomy (n=20). The expression of 667 unique miRNAs was investigated using TaqMan low density arrays for miRNAs. Student’s t-test was used in order to identify differentially expressed miRNAs, followed by hierarchical clustering and principal component analysis (PCA) to study the separation of the tissues. The ADtree algorithm was used to identify markers for classification of tissues and a cross-validation procedure was used to test the generality of the identified miRNA-based classifiers. Results The t-tests revealed that the major differences in miRNA expression are found between normal and malignant tissues. Hierarchical clustering and PCA based on differentially expressed miRNAs between normal and malignant tissues showed perfect separation between samples, while the corresponding analyses based on differentially expressed miRNAs between the two zones showed several misplaced samples. A classification and cross-validation procedure confirmed these results and several potential miRNA markers were identified. Conclusions The results of this study indicate that the major differences in the transcription program are those arising during tumor development, rather than during normal tissue development. In addition, tumors arising in the TZ have more unique differentially expressed miRNAs compared to the PZ. The results also indicate that separate miRNA expression signatures for diagnosis might be needed for tumors arising in the different zones. MicroRNA signatures that are specific for PZ and TZ tumors could also lead to more accurate prognoses, since tumors arising in the PZ tend to be more aggressive than tumors arising in the TZ. PMID:23890084
Gene-expression signatures of Atlantic salmon's plastic life cycle.
Aubin-Horth, Nadia; Letcher, Benjamin H; Hofmann, Hans A
2009-09-15
How genomic expression differs as a function of life history variation is largely unknown. Atlantic salmon exhibits extreme alternative life histories. We defined the gene-expression signatures of wild-caught salmon at two different life stages by comparing the brain expression profiles of mature sneaker males and immature males, and early migrants and late migrants. In addition to life-stage-specific signatures, we discovered a surprisingly large gene set that was differentially regulated-at similar magnitudes, yet in opposite direction-in both life history transitions. We suggest that this co-variation is not a consequence of many independent cellular and molecular switches in the same direction but rather represents the molecular equivalent of a physiological shift orchestrated by one or very few master regulators.
Application of connectivity mapping in predictive toxicology based on gene-expression similarity.
Smalley, Joshua L; Gant, Timothy W; Zhang, Shu-Dong
2010-02-09
Connectivity mapping is the process of establishing connections between different biological states using gene-expression profiles or signatures. There are a number of applications but in toxicology the most pertinent is for understanding mechanisms of toxicity. In its essence the process involves comparing a query gene signature generated as a result of exposure of a biological system to a chemical to those in a database that have been previously derived. In the ideal situation the query gene-expression signature is characteristic of the event and will be matched to similar events in the database. Key criteria are therefore the means of choosing the signature to be matched and the means by which the match is made. In this article we explore these concepts with examples applicable to toxicology. (c) 2009 Elsevier Ireland Ltd. All rights reserved.
Díaz-Gay, Marcos; Vila-Casadesús, Maria; Franch-Expósito, Sebastià; Hernández-Illán, Eva; Lozano, Juan José; Castellví-Bel, Sergi
2018-06-14
Mutational signatures have been proved as a valuable pattern in somatic genomics, mainly regarding cancer, with a potential application as a biomarker in clinical practice. Up to now, several bioinformatic packages to address this topic have been developed in different languages/platforms. MutationalPatterns has arisen as the most efficient tool for the comparison with the signatures currently reported in the Catalogue of Somatic Mutations in Cancer (COSMIC) database. However, the analysis of mutational signatures is nowadays restricted to a small community of bioinformatic experts. In this work we present Mutational Signatures in Cancer (MuSiCa), a new web tool based on MutationalPatterns and built using the Shiny framework in R language. By means of a simple interface suited to non-specialized researchers, it provides a comprehensive analysis of the somatic mutational status of the supplied cancer samples. It permits characterizing the profile and burden of mutations, as well as quantifying COSMIC-reported mutational signatures. It also allows classifying samples according to the above signature contributions. MuSiCa is a helpful web application to characterize mutational signatures in cancer samples. It is accessible online at http://bioinfo.ciberehd.org/GPtoCRC/en/tools.html and source code is freely available at https://github.com/marcos-diazg/musica .
Nagaraju, Kanneboyina; Ghimbovschi, Svetlana; Rayavarapu, Sree; Phadke, Aditi; Rider, Lisa G; Hoffman, Eric P; Miller, Frederick W
2016-09-01
To identify muscle gene expression patterns that predict rituximab responses and assess the effects of rituximab on muscle gene expression in PM and DM. In an attempt to understand the molecular mechanism of response and non-response to rituximab therapy, we performed Affymetrix gene expression array analyses on muscle biopsy specimens taken before and after rituximab therapy from eight PM and two DM patients in the Rituximab in Myositis study. We also analysed selected muscle-infiltrating cell phenotypes in these biopsies by immunohistochemical staining. Partek and Ingenuity pathway analyses assessed the gene pathways and networks. Myeloid type I IFN signature genes were expressed at higher levels at baseline in the skeletal muscle of rituximab responders than in non-responders, whereas classic non-myeloid IFN signature genes were expressed at higher levels in non-responders at baseline. Also, rituximab responders have a greater reduction of the myeloid and non-myeloid type I IFN signatures than non-responders. The decrease in the type I IFN signature following administration of rituximab may be associated with the decreases in muscle-infiltrating CD19(+) B cells and CD68(+) macrophages in responders. Our findings suggest that high levels of myeloid type I IFN gene expression in skeletal muscle predict responses to rituximab in PM/DM and that rituximab responders also have a greater decrease in the expression of these genes. These data add further evidence to recent studies defining the type I IFN signature as both a predictor of therapeutic responses and a biomarker of myositis disease activity. Published by Oxford University Press on behalf British Society for Rheumatology 2016. This work is written by US Government employees and is in the public domain in the US.
Analysis of blood-based gene expression in idiopathic Parkinson disease.
Shamir, Ron; Klein, Christine; Amar, David; Vollstedt, Eva-Juliane; Bonin, Michael; Usenovic, Marija; Wong, Yvette C; Maver, Ales; Poths, Sven; Safer, Hershel; Corvol, Jean-Christophe; Lesage, Suzanne; Lavi, Ofer; Deuschl, Günther; Kuhlenbaeumer, Gregor; Pawlack, Heike; Ulitsky, Igor; Kasten, Meike; Riess, Olaf; Brice, Alexis; Peterlin, Borut; Krainc, Dimitri
2017-10-17
To examine whether gene expression analysis of a large-scale Parkinson disease (PD) patient cohort produces a robust blood-based PD gene signature compared to previous studies that have used relatively small cohorts (≤220 samples). Whole-blood gene expression profiles were collected from a total of 523 individuals. After preprocessing, the data contained 486 gene profiles (n = 205 PD, n = 233 controls, n = 48 other neurodegenerative diseases) that were partitioned into training, validation, and independent test cohorts to identify and validate a gene signature. Batch-effect reduction and cross-validation were performed to ensure signature reliability. Finally, functional and pathway enrichment analyses were applied to the signature to identify PD-associated gene networks. A gene signature of 100 probes that mapped to 87 genes, corresponding to 64 upregulated and 23 downregulated genes differentiating between patients with idiopathic PD and controls, was identified with the training cohort and successfully replicated in both an independent validation cohort (area under the curve [AUC] = 0.79, p = 7.13E-6) and a subsequent independent test cohort (AUC = 0.74, p = 4.2E-4). Network analysis of the signature revealed gene enrichment in pathways, including metabolism, oxidation, and ubiquitination/proteasomal activity, and misregulation of mitochondria-localized genes, including downregulation of COX4I1 , ATP5A1 , and VDAC3 . We present a large-scale study of PD gene expression profiling. This work identifies a reliable blood-based PD signature and highlights the importance of large-scale patient cohorts in developing potential PD biomarkers. © 2017 American Academy of Neurology.
Molecular profiling identifies prognostic markers of stage IA lung adenocarcinoma.
Zhang, Jie; Shao, Jinchen; Zhu, Lei; Zhao, Ruiying; Xing, Jie; Wang, Jun; Guo, Xiaohui; Tu, Shichun; Han, Baohui; Yu, Keke
2017-09-26
We previously showed that different pathologic subtypes were associated with different prognostic values in patients with stage IA lung adenocarcinoma (AC). We hypothesize that differential gene expression profiles of different subtypes may be valuable factors for prognosis in stage IA lung adenocarcinoma. We performed microarray gene expression profiling on tumor tissues micro-dissected from patients with acinar and solid predominant subtypes of stage IA lung adenocarcinoma. These patients had undergone a lobectomy and mediastinal lymph node dissection at the Shanghai Chest Hospital, Shanghai, China in 2012. No patient had preoperative treatment. We performed the Gene Set Enrichment Analysis (GSEA) analysis to look for gene expression signatures associated with tumor subtypes. The histologic subtypes of all patients were classified according to the 2015 WHO lung Adenocarcinoma classification. We found that patients with the solid predominant subtype are enriched for genes involved in RNA polymerase activity as well as inactivation of the p53 pathway. Further, we identified a list of genes that may serve as prognostic markers for stage IA lung adenocarcinoma. Validation in the TCGA database shows that these genes are correlated with survival, suggesting that they are novel prognostic factors for stage IA lung adenocarcinoma. In conclusion, we have uncovered novel prognostic factors for stage IA lung adenocarcinoma using gene expression profiling in combination with histopathology subtyping.
Gao, She-Gan; Liu, Rui-Min; Zhao, Yun-Gang; Wang, Pei; Ward, Douglas G.; Wang, Guang-Chao; Guo, Xiang-Qian; Gu, Juan; Niu, Wan-Bin; Zhang, Tian; Martin, Ashley; Guo, Zhi-Peng; Feng, Xiao-Shan; Qi, Yi-Jun; Ma, Yuan-Fang
2016-01-01
Combining MS-based proteomic data with network and topological features of such network would identify more clinically relevant molecules and meaningfully expand the repertoire of proteins derived from MS analysis. The integrative topological indexes representing 95.96% information of seven individual topological measures of node proteins were calculated within a protein-protein interaction (PPI) network, built using 244 differentially expressed proteins (DEPs) identified by iTRAQ 2D-LC-MS/MS. Compared with DEPs, differentially expressed genes (DEGs) and comprehensive features (CFs), structurally dominant nodes (SDNs) based on integrative topological index distribution produced comparable classification performance in three different clinical settings using five independent gene expression data sets. The signature molecules of SDN-based classifier for distinction of early from late clinical TNM stages were enriched in biological traits of protein synthesis, intracellular localization and ribosome biogenesis, which suggests that ribosome biogenesis represents a promising therapeutic target for treating ESCC. In addition, ITGB1 expression selected exclusively by integrative topological measures correlated with clinical stages and prognosis, which was further validated with two independent cohorts of ESCC samples. Thus the integrative topological analysis of PPI networks proposed in this study provides an alternative approach to identify potential biomarkers and therapeutic targets from MS/MS data with functional insights in ESCC. PMID:26898710
An integrated mRNA and microRNA expression signature for glioblastoma multiforme prognosis.
Xiong, Jie; Bing, Zhitong; Su, Yanlin; Deng, Defeng; Peng, Xiaoning
2014-01-01
Although patients with Glioblastoma multiforme (GBM) have grave prognosis, significant variability in patient outcome is observed. The objective of this study is to identify a molecular signature for GBM prognosis. We subjected 355 mRNA and microRNA expression profiles to elastic net-regulated Cox regression for identification of an integrated RNA signature for GBM prognosis. A prognostic index (PI) was generated for patient stratification. Survival comparison was conducted by Kaplan-Meier method and a general multivariate Cox regression procedure was applied to evaluate the independence of the PI. The abilities and efficiencies of signatures to predict GBM patient outcome was assessed and compared by the area under the curve (AUC) of the receiver-operator characteristic (ROC). An integrated RNA prognostic signature consisted by 4 protective mRNAs, 12 risky mRNAs, and 1 risky microRNA was identified. Decreased survival was associated with being in the high-risk group (hazard ratio = 2.864, P<0.0001). The prognostic value of the integrated signature was validated in five independent GBM expression datasets (n = 201, hazard ratio = 2.453, P<0.0001). The PI outperformed the known clinical factors, mRNA-only, and miRNA-only prognostic signatures for GBM prognosis (area under the ROC curve for the integrated RNA, mRNA-only, and miRNA-only signatures were 0.828, 0.742, and 0.757 at 3 years of overall survival, respectively, P<0.0001 by permutation test). We describe the first, to our knowledge, robust transcriptome-based integrated RNA signature that improves the current GBM prognosis based on clinical variables, mRNA-only, and miRNA-only signatures.
An Integrated mRNA and microRNA Expression Signature for Glioblastoma Multiforme Prognosis
Xiong, Jie; Bing, Zhitong; Su, Yanlin; Deng, Defeng; Peng, Xiaoning
2014-01-01
Although patients with Glioblastoma multiforme (GBM) have grave prognosis, significant variability in patient outcome is observed. The objective of this study is to identify a molecular signature for GBM prognosis. We subjected 355 mRNA and microRNA expression profiles to elastic net-regulated Cox regression for identification of an integrated RNA signature for GBM prognosis. A prognostic index (PI) was generated for patient stratification. Survival comparison was conducted by Kaplan-Meier method and a general multivariate Cox regression procedure was applied to evaluate the independence of the PI. The abilities and efficiencies of signatures to predict GBM patient outcome was assessed and compared by the area under the curve (AUC) of the receiver-operator characteristic (ROC). An integrated RNA prognostic signature consisted by 4 protective mRNAs, 12 risky mRNAs, and 1 risky microRNA was identified. Decreased survival was associated with being in the high-risk group (hazard ratio = 2.864, P<0.0001). The prognostic value of the integrated signature was validated in five independent GBM expression datasets (n = 201, hazard ratio = 2.453, P<0.0001). The PI outperformed the known clinical factors, mRNA-only, and miRNA-only prognostic signatures for GBM prognosis (area under the ROC curve for the integrated RNA, mRNA-only, and miRNA-only signatures were 0.828, 0.742, and 0.757 at 3 years of overall survival, respectively, P<0.0001 by permutation test). We describe the first, to our knowledge, robust transcriptome-based integrated RNA signature that improves the current GBM prognosis based on clinical variables, mRNA-only, and miRNA-only signatures. PMID:24871302
Scherer, Christina A.; Magness, Charles L.; Steiger, Kathryn V.; Poitinger, Nicholas D.; Caputo, Christine M.; Miner, Douglas G.; Winokur, Patricia L.; Klinzman, Donna; McKee, Janice; Pilar, Christine; Ward, Patricia A.; Gillham, Martha H.; Haulman, N. Jean; Stapleton, Jack T.; Iadonato, Shawn P.
2007-01-01
Gene expression in human peripheral blood mononuclear cells was systematically evaluated following smallpox and yellow fever vaccination, and naturally occurring upper respiratory infection (URI). All three infections were characterized by the induction of many interferon stimulated genes, as well as enhanced expression of genes involved in proteolysis and antigen presentation. Vaccinia infection was also characterized by a distinct expression signature composed of up-regulation of monocyte response genes, with repression of genes expressed by B and T-cells. In contrast, the yellow fever host response was characterized by a suppression of ribosomal and translation factors, distinguishing this infection from vaccinia and URI. No significant URI-specific signature was observed, perhaps reflecting greater heterogeneity in the study population and etiological agents. Taken together, these data suggest that specific host gene expression signatures may be identified that distinguish one or a small number of virus agents. PMID:17651872
Place recognition using batlike sonar.
Vanderelst, Dieter; Steckel, Jan; Boen, Andre; Peremans, Herbert; Holderied, Marc W
2016-08-02
Echolocating bats have excellent spatial memory and are able to navigate to salient locations using bio-sonar. Navigating and route-following require animals to recognize places. Currently, it is mostly unknown how bats recognize places using echolocation. In this paper, we propose template based place recognition might underlie sonar-based navigation in bats. Under this hypothesis, bats recognize places by remembering their echo signature - rather than their 3D layout. Using a large body of ensonification data collected in three different habitats, we test the viability of this hypothesis assessing two critical properties of the proposed echo signatures: (1) they can be uniquely classified and (2) they vary continuously across space. Based on the results presented, we conclude that the proposed echo signatures satisfy both criteria. We discuss how these two properties of the echo signatures can support navigation and building a cognitive map.
The terrain signatures of administrative units: a tool for environmental assessment.
Miliaresis, George Ch
2009-03-01
The quantification of knowledge related to the terrain and the landuse/landcover of administrative units in Southern Greece (Peloponnesus) is performed from the CGIAR-CSI SRTM digital elevation model and the CORINE landuse/landcover database. Each administrative unit is parametrically represented by a set of attributes related to its relief. Administrative units are classified on the basis of K-means cluster analysis in an attempt to see how they are organized into groups and cluster derived geometric signatures are defined. Finally each cluster is parametrically represented on the basis of the occurrence of the Corine landuse/landcover classes included and thus, landcover signatures are derived. The geometric and the landuse/landcover signatures revealed a terrain dependent landuse/landcover organization that was used in the assessment of the forest fires impact at moderate resolution scale.
Gaydos, Leonard
1978-01-01
The cost of classifying 5,607 square kilometers (2,165 sq. mi.) in the Portland area was less than 8 cents per square kilometer ($0.0788, or $0.2041 per square mile). Besides saving in costs, this and other signature extension techniques may be useful in completing land use and land cover mapping in other large areas where multispectral and multitemporal Landsat data are available in digital form but other source materials are generally lacking.
Autonomous target recognition using remotely sensed surface vibration measurements
NASA Astrophysics Data System (ADS)
Geurts, James; Ruck, Dennis W.; Rogers, Steven K.; Oxley, Mark E.; Barr, Dallas N.
1993-09-01
The remotely measured surface vibration signatures of tactical military ground vehicles are investigated for use in target classification and identification friend or foe (IFF) systems. The use of remote surface vibration sensing by a laser radar reduces the effects of partial occlusion, concealment, and camouflage experienced by automatic target recognition systems using traditional imagery in a tactical battlefield environment. Linear Predictive Coding (LPC) efficiently represents the vibration signatures and nearest neighbor classifiers exploit the LPC feature set using a variety of distortion metrics. Nearest neighbor classifiers achieve an 88 percent classification rate in an eight class problem, representing a classification performance increase of thirty percent from previous efforts. A novel confidence figure of merit is implemented to attain a 100 percent classification rate with less than 60 percent rejection. The high classification rates are achieved on a target set which would pose significant problems to traditional image-based recognition systems. The targets are presented to the sensor in a variety of aspects and engine speeds at a range of 1 kilometer. The classification rates achieved demonstrate the benefits of using remote vibration measurement in a ground IFF system. The signature modeling and classification system can also be used to identify rotary and fixed-wing targets.
On Hunting Animals of the Biometric Menagerie for Online Signature.
Houmani, Nesma; Garcia-Salicetti, Sonia
2016-01-01
Individuals behave differently regarding to biometric authentication systems. This fact was formalized in the literature by the concept of Biometric Menagerie, defining and labeling user groups with animal names in order to reflect their characteristics with respect to biometric systems. This concept was illustrated for face, fingerprint, iris, and speech modalities. The present study extends the Biometric Menagerie to online signatures, by proposing a novel methodology that ties specific quality measures for signatures to categories of the Biometric Menagerie. Such measures are combined for retrieving automatically writer categories of the extended version of the Biometric Menagerie. Performance analysis with different types of classifiers shows the pertinence of our approach on the well-known MCYT-100 database.
Gillet, Jean-Pierre; Molina, Thierry Jo; Jamart, Jacques; Gaulard, Philippe; Leroy, Karen; Briere, Josette; Theate, Ivan; Thieblemont, Catherine; Bosly, Andre; Herin, Michel; Hamels, Jacques; Remacle, Jose
2009-03-01
Lymphomas are classified according to the World Health Organisation (WHO) classification which defines subtypes on the basis of clinical, morphological, immunophenotypic, molecular and cytogenetic criteria. Differential diagnosis of the subtypes is sometimes difficult, especially for small B-cell lymphoma (SBCL). Standardisation of molecular genetic assays using multiple gene expression analysis by microarrays could be a useful complement to the current diagnosis. The aim of the present study was to develop a low density DNA microarray for the analysis of 107 genes associated with B-cell non-Hodgkin lymphoma and to evaluate its performance in the diagnosis of SBCL. A predictive tool based on Fisher discriminant analysis using a training set of 40 patients including four different subtypes (follicular lymphoma n = 15, mantle cell lymphoma n = 7, B-cell chronic lymphocytic leukemia n = 6 and splenic marginal zone lymphoma n = 12) was designed. A short additional preliminary analysis to gauge the accuracy of this signature was then performed on an external set of nine patients. Using this model, eight of nine of those samples were classified successfully. This pilot study demonstrates that such a microarray tool may be a promising diagnostic approach for small B-cell non-Hodgkin lymphoma.
Yang, Chengqing; Hu, Guoqin; Li, Zezhi; Wang, Qingzhong; Wang, Xuemei; Yuan, Chengmei; Wang, Zuowei; Hong, Wu; Lu, Weihong; Cao, Lan; Chen, Jun; Wang, Yong; Yu, Shunying; Zhou, Yimin; Yi, Zhenghui; Fang, Yiru
2017-01-01
Subsyndromal symptomatic depression (SSD) is a subtype of subthreshold depressive and can lead to significant psychosocial functional impairment. Although the pathogenesis of major depressive disorder (MDD) and SSD still remains poorly understood, a set of studies have found that many same genetic factors play important roles in the etiology of these two disorders. Nowadays, the differential gene expression between MDD and SSD is still unknown. In our previous study, we compared the expression profile and made the classification with the leukocytes by using whole-genome cRNA microarrays among drug-free first-episode subjects with SSD, MDD and matched healthy controls (8 subjects in each group), and finally determined 48 gene expression signatures. Based on these findings, we further clarify whether these genes mRNA was different expressed in peripheral blood in patients with SSD, MDD and healthy controls (60 subjects respectively). With the help of the quantitative real-time reverse transcription-polymerase chain reaction (RT-qPCR), we gained gene relative expression levels among the three groups. We found that there are three of the forty eight co-regulated genes had differential expression in peripheral blood among the three groups, which are CD84, STRN, CTNS gene (F = 3.528, p = 0.034; F = 3.382, p = 0.039; F = 3.801, p = 0.026, respectively) while there were no significant differences for other genes. CD84, STRN, CTNS gene may have significant value for performing diagnostic functions and classifying SSD, MDD and healthy controls.
Araf, Shamzah; Korfi, Koorosh; Rahim, Tahrima; Davies, Andrew; Fitzgibbon, Jude
2016-10-01
The adoption of high-throughput technologies has led to a transformation in our ability to classify diffuse large B-cell lymphoma (DLBCL) into unique molecular subtypes. In parallel, the expansion of agents targeting key genetic and gene expression signatures has led to an unprecedented opportunity to personalize cancer therapies, paving the way for precision medicine. Areas covered: This review summarizes the key molecular subtypes of DLBCL and outlines the novel technology platforms in development to discriminate clinically relevant subtypes. Expert commentary: The application of emerging diagnostic tests into routine clinical practise is gaining momentum following the demonstration of subtype specific activity by novel agents. Co-ordinated efforts are required to ensure that these state of the art technologies provide reliable and clinically meaningful results accessible to the wider haematology community.
SELDI-TOF-based serum proteomic pattern diagnostics for early detection of cancer.
Petricoin, Emanuel F; Liotta, Lance A
2004-02-01
Proteomics is more than just generating lists of proteins that increase or decrease in expression as a cause or consequence of pathology. The goal should be to characterize the information flow through the intercellular protein circuitry that communicates with the extracellular microenvironment and then ultimately to the serum/plasma macroenvironment. The nature of this information can be a cause, or a consequence, of disease and toxicity-based processes. Serum proteomic pattern diagnostics is a new type of proteomic platform in which patterns of proteomic signatures from high dimensional mass spectrometry data are used as a diagnostic classifier. This approach has recently shown tremendous promise in the detection of early-stage cancers. The biomarkers found by SELDI-TOF-based pattern recognition analysis are mostly low molecular weight fragments produced at the specific tumor microenvironment.
Gene-expression signatures of Atlantic salmon's plastic life cycle
Aubin-Horth, N.; Letcher, B.H.; Hofmann, H.A.
2009-01-01
How genomic expression differs as a function of life history variation is largely unknown. Atlantic salmon exhibits extreme alternative life histories. We defined the gene-expression signatures of wild-caught salmon at two different life stages by comparing the brain expression profiles of mature sneaker males and immature males, and early migrants and late migrants. In addition to life-stage-specific signatures, we discovered a surprisingly large gene set that was differentially regulated-at similar magnitudes, yet in opposite direction-in both life history transitions. We suggest that this co-variation is not a consequence of many independent cellular and molecular switches in the same direction but rather represents the molecular equivalent of a physiological shift orchestrated by one or very few master regulators. ?? 2009 Elsevier Inc. All rights reserved.
Gene-expression signatures of Atlantic salmon’s plastic life cycle
Aubin-Horth, Nadia; Letcher, Benjamin H.; Hofmann, Hans A.
2009-01-01
How genomic expression differs as a function of life history variation is largely unknown. Atlantic salmon exhibits extreme alternative life histories. We defined the gene-expression signatures of wild-caught salmon at two different life stages by comparing the brain expression profiles of mature sneaker males and immature males, and early migrants and late migrants. In addition to life-stage-specific signatures, we discovered a surprisingly large gene set that was differentially regulated - at similar magnitudes, yet in opposite direction - in both life history transitions. We suggest that this co-variation is not a consequence of many independent cellular and molecular switches in the same direction but rather represents the molecular equivalent of a physiological shift orchestrated by one or very few master regulators. PMID:19401203
Umadevi, P; Soumya, M; George, Johnson K; Anandaraj, M
2018-05-01
Plant antimicrobial peptides are the interesting source of studies in defense response as they are essential components of innate immunity which exert rapid defense response. In spite of abundant reports on the isolation of antimicrobial peptides (AMPs) from many sources, the profile of AMPs expressed/identified from single crop species under certain stress/physiological condition is still unknown. This work describes the AMP signature profile of black pepper and their expression upon Phytophthora infection using label-free quantitative proteomics strategy. The differential expression of 24 AMPs suggests that a combinatorial strategy is working in the defense network. The 24 AMP signatures belonged to the cationic, anionic, cysteine-rich and cysteine-free group. As the first report on the possible involvement of AMP signature in Phytophthora infection, our results offer a platform for further study on regulation, evolutionary importance and exploitation of theses AMPs as next generation molecules against pathogens.
NASA Technical Reports Server (NTRS)
Cook, J. P. (Principal Investigator); Stringer, W. J.
1974-01-01
The author has identified the following significant results. The objective is to determine the feasibility of detecting large Alaskan archaeological sites by satellite remote sensing techniques and mapping such sites. The approach used is to develop digital multispectral signatures of dominant surface features including vegetation, exposed soils and rock, hydrological patterns and known archaeological sites. ERTS-1 scenes are then printed out digitally in a map-like array with a letter reflecting the most appropriate classification representing each pixel. Preliminary signatures were developed and tested. It was determined that there was a need to tighten up the archaeological site signature by developing accurate signatures for all naturally-occurring vegetation and surface conditions in the vicinity of the test area. These second generation signatures have been tested by means of computer printouts and classified tape displays on the University of Alaska CDU-200 and by comparison with aerial photography. It has been concluded that the archaeological signatures now in use are as good as can be developed. Plans are to print out signatures for the entire test area and locate on topographic maps the likely locations of archaeological sites within the test area.
Starry messages: Searching for signatures of interstellar archaeology
DOE Office of Scientific and Technical Information (OSTI.GOV)
Carrigan, Richard A., Jr.; /Fermilab
2009-12-01
Searching for signatures of cosmic-scale archaeological artifacts such as Dyson spheres or Kardashev civilizations is an interesting alternative to conventional SETI. Uncovering such an artifact does not require the intentional transmission of a signal on the part of the original civilization. This type of search is called interstellar archaeology or sometimes cosmic archaeology. The detection of intelligence elsewhere in the Universe with interstellar archaeology or SETI would have broad implications for science. For example, the constraints of the anthropic principle would have to be loosened if a different type of intelligence was discovered elsewhere. A variety of interstellar archaeology signaturesmore » are discussed including non-natural planetary atmospheric constituents, stellar doping with isotopes of nuclear wastes, Dyson spheres, as well as signatures of stellar and galactic-scale engineering. The concept of a Fermi bubble due to interstellar migration is introduced in the discussion of galactic signatures. These potential interstellar archaeological signatures are classified using the Kardashev scale. A modified Drake equation is used to evaluate the relative challenges of finding various sources. With few exceptions interstellar archaeological signatures are clouded and beyond current technological capabilities. However SETI for so-called cultural transmissions and planetary atmosphere signatures are within reach.« less
Distinct types of primary cutaneous large B-cell lymphoma identified by gene expression profiling.
Hoefnagel, Juliette J; Dijkman, Remco; Basso, Katia; Jansen, Patty M; Hallermann, Christian; Willemze, Rein; Tensen, Cornelis P; Vermeer, Maarten H
2005-05-01
In the European Organization for Research and Treatment of Cancer (EORTC) classification 2 types of primary cutaneous large B-cell lymphoma (PCLBCL) are distinguished: primary cutaneous follicle center cell lymphomas (PCFCCL) and PCLBCL of the leg (PCLBCL-leg). Distinction between both groups is considered important because of differences in prognosis (5-year survival > 95% and 52%, respectively) and the first choice of treatment (radiotherapy or systemic chemotherapy, respectively), but is not generally accepted. To establish a molecular basis for this subdivision in the EORTC classification, we investigated the gene expression profiles of 21 PCLBCLs by oligonucleotide microarray analysis. Hierarchical clustering based on a B-cell signature (7450 genes) classified PCLBCL into 2 distinct subgroups consisting of, respectively, 8 PCFCCLs and 13 PCLBCLsleg. PCLBCLs-leg showed increased expression of genes associated with cell proliferation; the proto-oncogenes Pim-1, Pim-2, and c-Myc; and the transcription factors Mum1/IRF4 and Oct-2. In the group of PCFCCL high expression of SPINK2 was observed. Further analysis suggested that PCFCCLs and PCLBCLs-leg have expression profiles similar to that of germinal center B-cell-like and activated B-cell-like diffuse large B-cell lymphoma, respectively. The results of this study suggest that different pathogenetic mechanisms are involved in the development of PCFCCLs and PCLBCLs-leg and provide molecular support for the subdivision used in the EORTC classification.
Yadav, Saurabh; Kumari, Pragati; Kushwaha, Hemant Ritturaj
2013-01-01
Glutaredoxins are enzymatic antioxidants which are small, ubiquitous, glutathione dependent and essentially classified under thioredoxin-fold superfamily. Glutaredoxins are classified into two types: dithiol and monothiol. Monothiol glutaredoxins which carry the signature "CGFS" as a redox active motif is known for its role in oxidative stress, inside the cell. In the present analysis, the 138 amino acid long monothiol glutaredoxin, AgGRX1 from Ashbya gossypii was identified and has been used for the analysis. The multiple sequence alignment of the AgGRX1 protein sequence revealed the characteristic motif of typical monothiol glutaredoxin as observed in various other organisms. The proposed structure of the AgGRX1 protein was used to analyze signature folds related to the thioredoxin superfamily. Further, the study highlighted the structural features pertaining to the complex mechanism of glutathione docking and interacting residues.
Cross-Platform Toxicogenomics for the Prediction of Non-Genotoxic Hepatocarcinogenesis in Rat
Metzger, Ute; Templin, Markus F.; Plummer, Simon; Ellinger-Ziegelbauer, Heidrun; Zell, Andreas
2014-01-01
In the area of omics profiling in toxicology, i.e. toxicogenomics, characteristic molecular profiles have previously been incorporated into prediction models for early assessment of a carcinogenic potential and mechanism-based classification of compounds. Traditionally, the biomarker signatures used for model construction were derived from individual high-throughput techniques, such as microarrays designed for monitoring global mRNA expression. In this study, we built predictive models by integrating omics data across complementary microarray platforms and introduced new concepts for modeling of pathway alterations and molecular interactions between multiple biological layers. We trained and evaluated diverse machine learning-based models, differing in the incorporated features and learning algorithms on a cross-omics dataset encompassing mRNA, miRNA, and protein expression profiles obtained from rat liver samples treated with a heterogeneous set of substances. Most of these compounds could be unambiguously classified as genotoxic carcinogens, non-genotoxic carcinogens, or non-hepatocarcinogens based on evidence from published studies. Since mixed characteristics were reported for the compounds Cyproterone acetate, Thioacetamide, and Wy-14643, we reclassified these compounds as either genotoxic or non-genotoxic carcinogens based on their molecular profiles. Evaluating our toxicogenomics models in a repeated external cross-validation procedure, we demonstrated that the prediction accuracy of our models could be increased by joining the biomarker signatures across multiple biological layers and by adding complex features derived from cross-platform integration of the omics data. Furthermore, we found that adding these features resulted in a better separation of the compound classes and a more confident reclassification of the three undefined compounds as non-genotoxic carcinogens. PMID:24830643
Moreso, Francesc; Torres, Irina B; Martínez-Gallo, Monica; Benlloch, Susana; Cantarell, Carme; Perelló, Manel; Jimeno, José; Pujol-Borrell, Ricardo; Seron, Daniel
2014-06-01
In kidney transplants operational tolerance has been associated with up-regulation of B cell differentiation genes and an increased number of total, naive and transitional peripheral B cells. The aim is to evaluate tolerance biomarkers in different cohorts of stable renal transplants under immunosuppression. This is a cross-sectional study conducted in renal transplants. We evaluate genetic tolerance signature and lymphocyte subsets in stable transplants treated with calcineurin inhibitors (CNI) at 1 (n=15), 5 (n=14) and 10 (n=16) years, and azathioprine-treated transplants followed 30 years (n=8). Healthy volunteers (n=10) and patients with chronic rejection (n=15) served as controls. We confirm that peripheral expression of IGKV1D-13 and IGKV4-1 genes by RT-PCR distinguish tolerant (n=10) from stable transplants (n=10) provided by the International Tolerance Network. Tolerance signature was defined as the lowest expression for both genes in tolerant patients. In CNI-treated patients, genetic signature of tolerance and B cells showed a time-dependent increase not observed in azathioprine-treated patients (p<0.01). Genetic tolerance signature was observed in 0% at 1, 7% at 5 and 25% at 10-years while it was not observed in azathioprine-treated and chronic rejection patients. Fifteen out of 16 CNI-treated transplants at 10 years were revaluated 3 months apart. Nine did not show the tolerance signature in any determination, 4 in one and 2 in both determinations. Genetic signature of tolerance was associated with an increase of total, naive and transitional B cells (p<0.05). IGKV1D-13 and IGKV4-1 gene expression and its linked B cell populations increase during follow up in CNI-treated patients. At 10 years, 2 out of 15 CNI treated patients consistently express biomarkers associated with true tolerance. In azathioprine-treated patients these biomarkers were down-regulated. Copyright © 2014 Elsevier B.V. All rights reserved.
Mvubu, Nontobeko Eunice; Pillay, Balakrishna; Gamieldien, Junaid; Bishai, William; Pillay, Manormoney
2016-12-01
Although pulmonary epithelial cells are integral to innate and adaptive immune responses during Mycobacterium tuberculosis infection, global transcriptomic changes in these cells remain largely unknown. Changes in gene expression induced in pulmonary epithelial cells infected with M. tuberculosis F15/LAM4/KZN, F11, F28, Beijing and Unique genotypes were investigated by RNA sequencing (RNA-Seq). The Illumina HiSeq 2000 platform generated 50 bp reads that were mapped to the human genome (Hg19) using Tophat (2.0.10). Differential gene expression induced by the different strains in infected relative to the uninfected cells was quantified and compared using Cufflinks (2.1.0) and MeV (4.0.9), respectively. Gene expression varied among the strains with the total number of genes as follows: F15/LAM4/KZN (1187), Beijing (1252), F11 (1639), F28 (870), Unique (886) and H37Rv (1179). A subset of 292 genes was commonly induced by all strains, where 52 genes were down-regulated while 240 genes were up-regulated. Differentially expressed genes were compared among the strains and the number of induced strain-specific gene signatures were as follows: F15/LAM4/KZN (138), Beijing (52), F11 (255), F28 (55), Unique (186) and H37Rv (125). Strain-specific molecular gene signatures associated with functional pathways were observed only for the Unique and H37Rv strains while certain biological functions may be associated with other strain signatures. This study demonstrated that strains of M. tuberculosis induce differential gene expression and strain-specific molecular signatures in pulmonary epithelial cells. Specific signatures induced by clinical strains of M. tuberculosis can be further explored for novel host-associated biomarkers and adjunctive immunotherapies. Copyright © 2016 Elsevier Ltd. All rights reserved.
Human relevance of an in vitro gene signature in HaCaT for skin sensitization.
van der Veen, Jochem W; Hodemaekers, Henny; Reus, Astrid A; Maas, Wilfred J M; van Loveren, Henk; Ezendam, Janine
2015-02-01
The skin sensitizing potential of chemicals is mainly assessed using animal methods, such as the murine local lymph node assay. Recently, an in vitro assay based on a gene expression signature in the HaCaT keratinocyte cell line was proposed as an alternative to these animal methods. Here, the human relevance of this gene signature is assessed through exposure of freshly isolated human skin to the chemical allergens dinitrochlorobenzene (DNCB) and diphenylcyclopropenone (DCP). In human skin, the gene signature shows similar direction of regulation as was previously observed in vitro, suggesting that the molecular processes that drive expression of these genes are similar between the HaCaT cell line and freshly isolated skin, providing evidence for the human relevance of the gene signature. Copyright © 2014 Elsevier Ltd. All rights reserved.
Mengual, Lourdes; Burset, Moisès; Ribal, María José; Ars, Elisabet; Marín-Aguilera, Mercedes; Fernández, Manuel; Ingelmo-Torres, Mercedes; Villavicencio, Humberto; Alcaraz, Antonio
2010-05-01
To develop an accurate and noninvasive method for bladder cancer diagnosis and prediction of disease aggressiveness based on the gene expression patterns of urine samples. Gene expression patterns of 341 urine samples from bladder urothelial cell carcinoma (UCC) patients and 235 controls were analyzed via TaqMan Arrays. In a first phase of the study, three consecutive gene selection steps were done to identify a gene set expression signature to detect and stratify UCC in urine. Subsequently, those genes more informative for UCC diagnosis and prediction of tumor aggressiveness were combined to obtain a classification system of bladder cancer samples. In a second phase, the obtained gene set signature was evaluated in a routine clinical scenario analyzing only voided urine samples. We have identified a 12+2 gene expression signature for UCC diagnosis and prediction of tumor aggressiveness on urine samples. Overall, this gene set panel had 98% sensitivity (SN) and 99% specificity (SP) in discriminating between UCC and control samples and 79% SN and 92% SP in predicting tumor aggressiveness. The translation of the model to the clinically applicable format corroborates that the 12+2 gene set panel described maintains a high accuracy for UCC diagnosis (SN = 89% and SP = 95%) and tumor aggressiveness prediction (SN = 79% and SP = 91%) in voided urine samples. The 12+2 gene expression signature described in urine is able to identify patients suffering from UCC and predict tumor aggressiveness. We show that a panel of molecular markers may improve the schedule for diagnosis and follow-up in UCC patients. Copyright 2010 AACR.
A MicroRNA Signature Associated With Metastasis of T1 Colorectal Cancers to Lymph Nodes.
Ozawa, Tsuyoshi; Kandimalla, Raju; Gao, Feng; Nozawa, Hiroaki; Hata, Keisuke; Nagata, Hiroshi; Okada, Satoshi; Izumi, Daisuke; Baba, Hideo; Fleshman, James; Wang, Xin; Watanabe, Toshiaki; Goel, Ajay
2018-03-01
Most T1 colorectal cancers treated by radical surgery can now be cured by endoscopic submucosal dissection. Although 70%-80% of T1 colorectal cancers are classified as high risk, <16% of these patients actually have lymph node metastases. Biomarkers are needed to identify patients with T1 cancers with the highest risk of metastasis, to prevent unnecessary radical surgery. We collected data from The Cancer Genome Atlas and identified 5 microRNAs (MIR32, MIR181B, MIR193B, MIR195, and MIR411) with significant changes in expression in T1 and T2 colorectal cancers with vs without lymph node metastases. Levels of the 5 microRNAs identified patients with lymph node invasion by T1 or T2 cancers with an area under the receiver operating characteristic curve (AUROC) value of 0.84. We validated these findings in 2 cohorts of patients with T1 cancers, using findings from histology as the reference. The 5-microRNA signature identified T1 cancers with lymph node invasion in cohort 1 with an AUROC value of 0.83, and in cohort 2 with an AUROC value of 0.74. When we analyzed biopsy samples from untreated patients, the 5-microRNA signature identified cancers with lymph node metastases with an AUROC value of 0.77. The 5-microRNA therefore identifies high-risk T1 colorectal cancers with a greater degree of accuracy than currently used pathologic features. Copyright © 2018 AGA Institute. Published by Elsevier Inc. All rights reserved.
Weston, David J; Gunter, Lee E; Rogers, Alistair; Wullschleger, Stan D
2008-01-01
Background One of the eminent opportunities afforded by modern genomic technologies is the potential to provide a mechanistic understanding of the processes by which genetic change translates to phenotypic variation and the resultant appearance of distinct physiological traits. Indeed much progress has been made in this area, particularly in biomedicine where functional genomic information can be used to determine the physiological state (e.g., diagnosis) and predict phenotypic outcome (e.g., patient survival). Ecology currently lacks an analogous approach where genomic information can be used to diagnose the presence of a given physiological state (e.g., stress response) and then predict likely phenotypic outcomes (e.g., stress duration and tolerance, fitness). Results Here, we demonstrate that a compendium of genomic signatures can be used to classify the plant abiotic stress phenotype in Arabidopsis according to the architecture of the transcriptome, and then be linked with gene coexpression network analysis to determine the underlying genes governing the phenotypic response. Using this approach, we confirm the existence of known stress responsive pathways and marker genes, report a common abiotic stress responsive transcriptome and relate phenotypic classification to stress duration. Conclusion Linking genomic signatures to gene coexpression analysis provides a unique method of relating an observed plant phenotype to changes in gene expression that underlie that phenotype. Such information is critical to current and future investigations in plant biology and, in particular, to evolutionary ecology, where a mechanistic understanding of adaptive physiological responses to abiotic stress can provide researchers with a tool of great predictive value in understanding species and population level adaptation to climate change. PMID:18248680
Scherer, Christina A; Magness, Charles L; Steiger, Kathryn V; Poitinger, Nicholas D; Caputo, Christine M; Miner, Douglas G; Winokur, Patricia L; Klinzman, Donna; McKee, Janice; Pilar, Christine; Ward, Patricia A; Gillham, Martha H; Haulman, N Jean; Stapleton, Jack T; Iadonato, Shawn P
2007-08-29
Gene expression in human peripheral blood mononuclear cells was systematically evaluated following smallpox and yellow fever vaccination, and naturally occurring upper respiratory infection (URI). All three infections were characterized by the induction of many interferon stimulated genes, as well as enhanced expression of genes involved in proteolysis and antigen presentation. Vaccinia infection was also characterized by a distinct expression signature composed of up-regulation of monocyte response genes, with repression of genes expressed by B and T-cells. In contrast, the yellow fever host response was characterized by a suppression of ribosomal and translation factors, distinguishing this infection from vaccinia and URI. No significant URI-specific signature was observed, perhaps reflecting greater heterogeneity in the study population and etiological agents. Taken together, these data suggest that specific host gene expression signatures may be identified that distinguish one or a small number of virus agents.
Metabolomic analysis of insulin resistance across different mouse strains and diets.
Stöckli, Jacqueline; Fisher-Wellman, Kelsey H; Chaudhuri, Rima; Zeng, Xiao-Yi; Fazakerley, Daniel J; Meoli, Christopher C; Thomas, Kristen C; Hoffman, Nolan J; Mangiafico, Salvatore P; Xirouchaki, Chrysovalantou E; Yang, Chieh-Hsin; Ilkayeva, Olga; Wong, Kari; Cooney, Gregory J; Andrikopoulos, Sofianos; Muoio, Deborah M; James, David E
2017-11-24
Insulin resistance is a major risk factor for many diseases. However, its underlying mechanism remains unclear in part because it is triggered by a complex relationship between multiple factors, including genes and the environment. Here, we used metabolomics combined with computational methods to identify factors that classified insulin resistance across individual mice derived from three different mouse strains fed two different diets. Three inbred ILSXISS strains were fed high-fat or chow diets and subjected to metabolic phenotyping and metabolomics analysis of skeletal muscle. There was significant metabolic heterogeneity between strains, diets, and individual animals. Distinct metabolites were changed with insulin resistance, diet, and between strains. Computational analysis revealed 113 metabolites that were correlated with metabolic phenotypes. Using these 113 metabolites, combined with machine learning to segregate mice based on insulin sensitivity, we identified C22:1-CoA, C2-carnitine, and C16-ceramide as the best classifiers. Strikingly, when these three metabolites were combined into one signature, they classified mice based on insulin sensitivity more accurately than each metabolite on its own or other published metabolic signatures. Furthermore, C22:1-CoA was 2.3-fold higher in insulin-resistant mice and correlated significantly with insulin resistance. We have identified a metabolomic signature composed of three functionally unrelated metabolites that accurately predicts whole-body insulin sensitivity across three mouse strains. These data indicate the power of simultaneous analysis of individual, genetic, and environmental variance in mice for identifying novel factors that accurately predict metabolic phenotypes like whole-body insulin sensitivity. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
NASA Astrophysics Data System (ADS)
Alkilani, Amjad; Shirkhodaie, Amir
2013-05-01
Handling, manipulation, and placement of objects, hereon called Human-Object Interaction (HOI), in the environment generate sounds. Such sounds are readily identifiable by the human hearing. However, in the presence of background environment noises, recognition of minute HOI sounds is challenging, though vital for improvement of multi-modality sensor data fusion in Persistent Surveillance Systems (PSS). Identification of HOI sound signatures can be used as precursors to detection of pertinent threats that otherwise other sensor modalities may miss to detect. In this paper, we present a robust method for detection and classification of HOI events via clustering of extracted features from training of HOI acoustic sound waves. In this approach, salient sound events are preliminary identified and segmented from background via a sound energy tracking method. Upon this segmentation, frequency spectral pattern of each sound event is modeled and its features are extracted to form a feature vector for training. To reduce dimensionality of training feature space, a Principal Component Analysis (PCA) technique is employed to expedite fast classification of test feature vectors, a kd-tree and Random Forest classifiers are trained for rapid classification of training sound waves. Each classifiers employs different similarity distance matching technique for classification. Performance evaluations of classifiers are compared for classification of a batch of training HOI acoustic signatures. Furthermore, to facilitate semantic annotation of acoustic sound events, a scheme based on Transducer Mockup Language (TML) is proposed. The results demonstrate the proposed approach is both reliable and effective, and can be extended to future PSS applications.
Wild-type APC predicts poor prognosis in microsatellite-stable proximal colon cancer.
Jorissen, Robert N; Christie, Michael; Mouradov, Dmitri; Sakthianandeswaren, Anuratha; Li, Shan; Love, Christopher; Xu, Zheng-Zhou; Molloy, Peter L; Jones, Ian T; McLaughlin, Stephen; Ward, Robyn L; Hawkins, Nicholas J; Ruszkiewicz, Andrew R; Moore, James; Burgess, Antony W; Busam, Dana; Zhao, Qi; Strausberg, Robert L; Lipton, Lara; Desai, Jayesh; Gibbs, Peter; Sieber, Oliver M
2015-09-15
APC mutations (APC-mt) occur in ∼70% of colorectal cancers (CRCs), but their relationship to prognosis is unclear. APC prognostic value was evaluated in 746 stage I-IV CRC patients, stratifying for tumour location and microsatellite instability (MSI). Microarrays were used to identify a gene signature that could classify APC mutation status, and classifier ability to predict prognosis was examined in an independent cohort. Wild-type APC microsatellite stable (APC-wt/MSS) tumours from the proximal colon showed poorer overall and recurrence-free survival (OS, RFS) than APC-mt/MSS proximal, APC-wt/MSS distal and APC-mt/MSS distal tumours (OS HR⩾1.79, P⩽0.015; RFS HR⩾1.88, P⩽0.026). APC was a stronger prognostic indicator than BRAF, KRAS, PIK3CA, TP53, CpG island methylator phenotype or chromosomal instability status (P⩽0.036). Microarray analysis similarly revealed poorer survival in MSS proximal cancers with an APC-wt-like signature (P=0.019). APC status did not affect outcomes in MSI tumours. In a validation on 206 patients with proximal colon cancer, APC-wt-like signature MSS cases showed poorer survival than APC-mt-like signature MSS or MSI cases (OS HR⩾2.50, P⩽0.010; RFS HR⩾2.14, P⩽0.025). Poor prognosis APC-wt/MSS proximal tumours exhibited features of the sessile serrated neoplasia pathway (P⩽0.016). APC-wt status is a marker of poor prognosis in MSS proximal colon cancer.
McMullin, Ryan P; Wittner, Ben S; Yang, Chuanwei; Denton-Schneider, Benjamin R; Hicks, Daniel; Singavarapu, Raj; Moulis, Sharon; Lee, Jeongeun; Akbari, Mohammad R; Narod, Steven A; Aldape, Kenneth D; Steeg, Patricia S; Ramaswamy, Sridhar; Sgroi, Dennis C
2014-03-14
There is an unmet clinical need for biomarkers to identify breast cancer patients at an increased risk of developing brain metastases. The objective is to identify gene signatures and biological pathways associated with human epidermal growth factor receptor 2-positive (HER2+) brain metastasis. We combined laser capture microdissection and gene expression microarrays to analyze malignant epithelium from HER2+ breast cancer brain metastases with that from HER2+ nonmetastatic primary tumors. Differential gene expression was performed including gene set enrichment analysis (GSEA) using publicly available breast cancer gene expression data sets. In a cohort of HER2+ breast cancer brain metastases, we identified a gene expression signature that anti-correlates with overexpression of BRCA1. Sequence analysis of the HER2+ brain metastases revealed no pathogenic mutations of BRCA1, and therefore the aforementioned signature was designated BRCA1 Deficient-Like (BD-L). Evaluation of an independent cohort of breast cancer metastases demonstrated that BD-L values are significantly higher in brain metastases as compared to other metastatic sites. Although the BD-L signature is present in all subtypes of breast cancer, it is significantly higher in BRCA1 mutant primary tumors as compared with sporadic breast tumors. Additionally, BD-L signature values are significantly higher in HER2-/ER- primary tumors as compared with HER2+/ER + and HER2-/ER + tumors. The BD-L signature correlates with breast cancer cell line pharmacologic response to a combination of poly (ADP-ribose) polymerase (PARP) inhibitor and temozolomide, and the signature outperformed four published gene signatures of BRCA1/2 deficiency. A BD-L signature is enriched in HER2+ breast cancer brain metastases without pathogenic BRCA1 mutations. Unexpectedly, elevated BD-L values are found in a subset of primary tumors across all breast cancer subtypes. Evaluation of pharmacological sensitivity in breast cancer cell lines representing all breast cancer subtypes suggests the BD-L signature may serve as a biomarker to identify sporadic breast cancer patients who might benefit from a therapeutic combination of PARP inhibitor and temozolomide and may be indicative of a dysfunctional BRCA1-associated pathway.
2014-01-01
Introduction There is an unmet clinical need for biomarkers to identify breast cancer patients at an increased risk of developing brain metastases. The objective is to identify gene signatures and biological pathways associated with human epidermal growth factor receptor 2-positive (HER2+) brain metastasis. Methods We combined laser capture microdissection and gene expression microarrays to analyze malignant epithelium from HER2+ breast cancer brain metastases with that from HER2+ nonmetastatic primary tumors. Differential gene expression was performed including gene set enrichment analysis (GSEA) using publicly available breast cancer gene expression data sets. Results In a cohort of HER2+ breast cancer brain metastases, we identified a gene expression signature that anti-correlates with overexpression of BRCA1. Sequence analysis of the HER2+ brain metastases revealed no pathogenic mutations of BRCA1, and therefore the aforementioned signature was designated BRCA1 Deficient-Like (BD-L). Evaluation of an independent cohort of breast cancer metastases demonstrated that BD-L values are significantly higher in brain metastases as compared to other metastatic sites. Although the BD-L signature is present in all subtypes of breast cancer, it is significantly higher in BRCA1 mutant primary tumors as compared with sporadic breast tumors. Additionally, BD-L signature values are significantly higher in HER2-/ER- primary tumors as compared with HER2+/ER + and HER2-/ER + tumors. The BD-L signature correlates with breast cancer cell line pharmacologic response to a combination of poly (ADP-ribose) polymerase (PARP) inhibitor and temozolomide, and the signature outperformed four published gene signatures of BRCA1/2 deficiency. Conclusions A BD-L signature is enriched in HER2+ breast cancer brain metastases without pathogenic BRCA1 mutations. Unexpectedly, elevated BD-L values are found in a subset of primary tumors across all breast cancer subtypes. Evaluation of pharmacological sensitivity in breast cancer cell lines representing all breast cancer subtypes suggests the BD-L signature may serve as a biomarker to identify sporadic breast cancer patients who might benefit from a therapeutic combination of PARP inhibitor and temozolomide and may be indicative of a dysfunctional BRCA1-associated pathway. PMID:24625110
Hoffman, Robert W; Merrill, Joan T; Alarcón-Riquelme, Marta M E; Petri, Michelle; Dow, Ernst R; Nantz, Eric; Nisenbaum, Laura K; Schroeder, Krista M; Komocsar, Wendy J; Perumal, Narayanan B; Linnik, Matthew D; Airey, David C; Liu, Yushi; Rocha, Guilherme V; Higgs, Richard E
2017-03-01
To characterize baseline gene expression and pharmacodynamically induced changes in whole blood gene expression in 1,760 systemic lupus erythematosus (SLE) patients from 2 phase III, 52-week, randomized, placebo-controlled, double-blind studies in which patients were treated with the BAFF-blocking IgG4 monoclonal antibody tabalumab. Patient samples were obtained from SLE patients from the ILLUMINATE-1 and ILLUMINATE-2 studies, and control samples were obtained from healthy donors. Blood was collected in Tempus tubes at baseline, week 16, and week 52. RNA was analyzed using Affymetrix Human Transcriptome Array 2.0 and NanoString. At baseline, expression of the interferon (IFN) response gene was elevated in patients compared with controls, with 75% of patients being positive for this IFN response gene signature. There was, however, substantial heterogeneity of IFN response gene expression and complex relationships among gene networks. The IFN response gene signature was a predictor of time to disease flare, independent of anti-double-stranded DNA (anti-dsDNA) antibody and C3 and C4 levels, and overall disease activity. Pharmacodynamically induced changes in gene expression following tabalumab treatment were extensive, occurring predominantly in B cell-related and immunoglobulin genes, and were consistent with other pharmacodynamic changes including anti-dsDNA antibody, C3, and immunoglobulin levels. SLE patients demonstrated increased expression of an IFN response gene signature (75% of patients had an elevated IFN response gene signature) at baseline in ILLUMINATE-1 and ILLUMINATE-2. Substantial heterogeneity of gene expression was detected among individual patients and in gene networks. The IFN response gene signature was an independent risk factor for future disease flares. Pharmacodynamic changes in gene expression were consistent with the mechanism of BAFF blockade by tabalumab. © 2016, American College of Rheumatology.
Khan, Haseeb Ahmad
2005-01-28
Due to versatile diagnostic and prognostic fidelity molecular signatures or fingerprints are anticipated as the most powerful tools for cancer management in the near future. Notwithstanding the experimental advancements in microarray technology, methods for analyzing either whole arrays or gene signatures have not been firmly established. Recently, an algorithm, ArraySolver has been reported by Khan for two-group comparison of microarray gene expression data using two-tailed Wilcoxon signed-rank test. Most of the molecular signatures are composed of two sets of genes (hybrid signatures) wherein up-regulation of one set and down-regulation of the other set collectively define the purpose of a gene signature. Since the direction of a selected gene's expression (positive or negative) with respect to a particular disease condition is known, application of one-tailed statistics could be a more relevant choice. A novel method, ArrayVigil, is described for comparing hybrid signatures using segregated-one-tailed (SOT) Wilcoxon signed-rank test and the results compared with integrated-two-tailed (ITT) procedures (SPSS and ArraySolver). ArrayVigil resulted in lower P values than those obtained from ITT statistics while comparing real data from four signatures.
Yuan, Yinyin; Failmezger, Henrik; Rueda, Oscar M; Ali, H Raza; Gräf, Stefan; Chin, Suet-Feung; Schwarz, Roland F; Curtis, Christina; Dunning, Mark J; Bardwell, Helen; Johnson, Nicola; Doyle, Sarah; Turashvili, Gulisa; Provenzano, Elena; Aparicio, Sam; Caldas, Carlos; Markowetz, Florian
2012-10-24
Solid tumors are heterogeneous tissues composed of a mixture of cancer and normal cells, which complicates the interpretation of their molecular profiles. Furthermore, tissue architecture is generally not reflected in molecular assays, rendering this rich information underused. To address these challenges, we developed a computational approach based on standard hematoxylin and eosin-stained tissue sections and demonstrated its power in a discovery and validation cohort of 323 and 241 breast tumors, respectively. To deconvolute cellular heterogeneity and detect subtle genomic aberrations, we introduced an algorithm based on tumor cellularity to increase the comparability of copy number profiles between samples. We next devised a predictor for survival in estrogen receptor-negative breast cancer that integrated both image-based and gene expression analyses and significantly outperformed classifiers that use single data types, such as microarray expression signatures. Image processing also allowed us to describe and validate an independent prognostic factor based on quantitative analysis of spatial patterns between stromal cells, which are not detectable by molecular assays. Our quantitative, image-based method could benefit any large-scale cancer study by refining and complementing molecular assays of tumor samples.
Johnsen, Hans Erik; Bergkvist, Kim Steve; Schmitz, Alexander; Kjeldsen, Malene Krag; Hansen, Steen Møller; Gaihede, Michael; Nørgaard, Martin Agge; Bæch, John; Grønholdt, Marie-Louise; Jensen, Frank Svendsen; Johansen, Preben; Bødker, Julie Støve; Bøgsted, Martin; Dybkær, Karen
2014-06-01
Recent findings have suggested biological classification of B-cell malignancies as exemplified by the "activated B-cell-like" (ABC), the "germinal-center B-cell-like" (GCB) and primary mediastinal B-cell lymphoma (PMBL) subtypes of diffuse large B-cell lymphoma and "recurrent translocation and cyclin D" (TC) classification of multiple myeloma. Biological classification of B-cell derived cancers may be refined by a direct and systematic strategy where identification and characterization of normal B-cell differentiation subsets are used to define the cancer cell of origin phenotype. Here we propose a strategy combining multiparametric flow cytometry, global gene expression profiling and biostatistical modeling to generate B-cell subset specific gene signatures from sorted normal human immature, naive, germinal centrocytes and centroblasts, post-germinal memory B-cells, plasmablasts and plasma cells from available lymphoid tissues including lymph nodes, tonsils, thymus, peripheral blood and bone marrow. This strategy will provide an accurate image of the stage of differentiation, which prospectively can be used to classify any B-cell malignancy and eventually purify tumor cells. This report briefly describes the current models of the normal B-cell subset differentiation in multiple tissues and the pathogenesis of malignancies originating from the normal germinal B-cell hierarchy.
Blakely, Collin M; Stoddard, Alexander J; Belka, George K; Dugan, Katherine D; Notarfrancesco, Kathleen L; Moody, Susan E; D'Cruz, Celina M; Chodosh, Lewis A
2006-06-15
Women who have their first child early in life have a substantially lower lifetime risk of breast cancer. The mechanism for this is unknown. Similar to humans, rats exhibit parity-induced protection against mammary tumorigenesis. To explore the basis for this phenomenon, we identified persistent pregnancy-induced changes in mammary gene expression that are tightly associated with protection against tumorigenesis in multiple inbred rat strains. Four inbred rat strains that exhibit marked differences in their intrinsic susceptibilities to carcinogen-induced mammary tumorigenesis were each shown to display significant protection against methylnitrosourea-induced mammary tumorigenesis following treatment with pregnancy levels of estradiol and progesterone. Microarray expression profiling of parous and nulliparous mammary tissue from these four strains yielded a common 70-gene signature. Examination of the genes constituting this signature implicated alterations in transforming growth factor-beta signaling, the extracellular matrix, amphiregulin expression, and the growth hormone/insulin-like growth factor I axis in pregnancy-induced alterations in breast cancer risk. Notably, related molecular changes have been associated with decreased mammographic density, which itself is strongly associated with decreased breast cancer risk. Our findings show that hormone-induced protection against mammary tumorigenesis is widely conserved among divergent rat strains and define a gene expression signature that is tightly correlated with reduced mammary tumor susceptibility as a consequence of a normal developmental event. Given the conservation of this signature, these pathways may contribute to pregnancy-induced protection against breast cancer.
Lu, Dengsheng; Batistella, Mateus; de Miranda, Evaristo E; Moran, Emilio
2008-01-01
Complex forest structure and abundant tree species in the moist tropical regions often cause difficulties in classifying vegetation classes with remotely sensed data. This paper explores improvement in vegetation classification accuracies through a comparative study of different image combinations based on the integration of Landsat Thematic Mapper (TM) and SPOT High Resolution Geometric (HRG) instrument data, as well as the combination of spectral signatures and textures. A maximum likelihood classifier was used to classify the different image combinations into thematic maps. This research indicated that data fusion based on HRG multispectral and panchromatic data slightly improved vegetation classification accuracies: a 3.1 to 4.6 percent increase in the kappa coefficient compared with the classification results based on original HRG or TM multispectral images. A combination of HRG spectral signatures and two textural images improved the kappa coefficient by 6.3 percent compared with pure HRG multispectral images. The textural images based on entropy or second-moment texture measures with a window size of 9 pixels × 9 pixels played an important role in improving vegetation classification accuracy. Overall, optical remote-sensing data are still insufficient for accurate vegetation classifications in the Amazon basin.
Lu, Dengsheng; Batistella, Mateus; de Miranda, Evaristo E.; Moran, Emilio
2009-01-01
Complex forest structure and abundant tree species in the moist tropical regions often cause difficulties in classifying vegetation classes with remotely sensed data. This paper explores improvement in vegetation classification accuracies through a comparative study of different image combinations based on the integration of Landsat Thematic Mapper (TM) and SPOT High Resolution Geometric (HRG) instrument data, as well as the combination of spectral signatures and textures. A maximum likelihood classifier was used to classify the different image combinations into thematic maps. This research indicated that data fusion based on HRG multispectral and panchromatic data slightly improved vegetation classification accuracies: a 3.1 to 4.6 percent increase in the kappa coefficient compared with the classification results based on original HRG or TM multispectral images. A combination of HRG spectral signatures and two textural images improved the kappa coefficient by 6.3 percent compared with pure HRG multispectral images. The textural images based on entropy or second-moment texture measures with a window size of 9 pixels × 9 pixels played an important role in improving vegetation classification accuracy. Overall, optical remote-sensing data are still insufficient for accurate vegetation classifications in the Amazon basin. PMID:19789716
Place recognition using batlike sonar
Vanderelst, Dieter; Steckel, Jan; Boen, Andre; Peremans, Herbert; Holderied, Marc W
2016-01-01
Echolocating bats have excellent spatial memory and are able to navigate to salient locations using bio-sonar. Navigating and route-following require animals to recognize places. Currently, it is mostly unknown how bats recognize places using echolocation. In this paper, we propose template based place recognition might underlie sonar-based navigation in bats. Under this hypothesis, bats recognize places by remembering their echo signature - rather than their 3D layout. Using a large body of ensonification data collected in three different habitats, we test the viability of this hypothesis assessing two critical properties of the proposed echo signatures: (1) they can be uniquely classified and (2) they vary continuously across space. Based on the results presented, we conclude that the proposed echo signatures satisfy both criteria. We discuss how these two properties of the echo signatures can support navigation and building a cognitive map. DOI: http://dx.doi.org/10.7554/eLife.14188.001 PMID:27481189
Zambernardi, Agustina; Chiodetti, Ana; Meier, Dominik; Cabanne, Ana; Nachman, Fabio; Solar, Héctor; Rumbo, Carolina; Gondolesi, Gabriel E; Rumbo, Martin
2014-12-01
Acute cellular rejection (ACR) and infections are leading causes of graft loss and death in intestinal transplant patients. Our aim was to evaluate the impact of maintenance immunosuppressive therapies on the expression of pro-inflammatory mediators in small bowel at ACR diagnosis. We analyzed expression levels of Th1-associated genes, IFNG, CXCL10, and CXCL11 by qPCR in 46 selected graft biopsies unequivocally assigned to mild ACR (n = 14) or normal histopathology and clinical condition (n = 32) from 15 patients receiving two different immunosuppressive (IS) schemes. Double treatment: corticosteroids and tacrolimus (n = 17) and triple treatment: sirolimus or mycophenolate mofetil in addition to the basal therapy (n = 29). IFNG, CXCL10, and CXCL11 were induced during rejection (p < 0.05; p < 0.005, and p < 0.05, respectively). However, when rejection and control groups were classified according to immunosuppressive treatment, in the rejection group, significant differences of IFNG, CXCL10, and CXCL11 expression (p < 0.001; p < 0.005, and 0.01, respectively) were detected, whereas no differences were observed in the control group. Gene expression of Th1 response mediators is higher during ACR. Triple IS group showed significantly lower expression of pro-inflammatory Th1 mediators during mild ACR indicating that use of these markers to monitor rejection can be affected by the IS treatment used. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Dyer, Betsey D.; Kahn, Michael J.; LeBlanc, Mark D.
2008-01-01
Classification and regression tree (CART) analysis was applied to genome-wide tetranucleotide frequencies (genomic signatures) of 195 archaea and bacteria. Although genomic signatures have typically been used to classify evolutionary divergence, in this study, convergent evolution was the focus. Temperature optima for most of the organisms examined could be distinguished by CART analyses of tetranucleotide frequencies. This suggests that pervasive (nonlinear) qualities of genomes may reflect certain environmental conditions (such as temperature) in which those genomes evolved. The predominant use of GAGA and AGGA as the discriminating tetramers in CART models suggests that purine-loading and codon biases of thermophiles may explain some of the results. PMID:19054742
On Hunting Animals of the Biometric Menagerie for Online Signature
Houmani, Nesma; Garcia-Salicetti, Sonia
2016-01-01
Individuals behave differently regarding to biometric authentication systems. This fact was formalized in the literature by the concept of Biometric Menagerie, defining and labeling user groups with animal names in order to reflect their characteristics with respect to biometric systems. This concept was illustrated for face, fingerprint, iris, and speech modalities. The present study extends the Biometric Menagerie to online signatures, by proposing a novel methodology that ties specific quality measures for signatures to categories of the Biometric Menagerie. Such measures are combined for retrieving automatically writer categories of the extended version of the Biometric Menagerie. Performance analysis with different types of classifiers shows the pertinence of our approach on the well-known MCYT-100 database. PMID:27054836
MISR Level 2 TOA/Cloud Classifier parameters (MIL2TCCL_V3)
NASA Technical Reports Server (NTRS)
Diner, David J. (Principal Investigator)
The TOA/Cloud Classifiers contain the Angular Signature Cloud Mask (ASCM), a scene classifier calculated using support vector machine technology (SVM) both of which are on a 1.1 km grid, and cloud fractions at 17.6 km resolution that are available in different height bins (low, middle, high) and are also calculated on an angle-by-angle basis. [Temporal_Coverage: Start_Date=2000-02-24; Stop_Date=] [Spatial_Coverage: Southernmost_Latitude=-90; Northernmost_Latitude=90; Westernmost_Longitude=-180; Easternmost_Longitude=180] [Data_Resolution: Latitude_Resolution=1.1 km; Longitude_Resolution=1.1 km; Temporal_Resolution=about 15 orbits/day].
Lu, Yan; Wang, Liang; Liu, Pengyuan; Yang, Ping; You, Ming
2012-01-01
About 30% stage I non-small cell lung cancer (NSCLC) patients undergoing resection will recur. Robust prognostic markers are required to better manage therapy options. The purpose of this study is to develop and validate a novel gene-expression signature that can predict tumor recurrence of stage I NSCLC patients. Cox proportional hazards regression analysis was performed to identify recurrence-related genes and a partial Cox regression model was used to generate a gene signature of recurrence in the training dataset −142 stage I lung adenocarcinomas without adjunctive therapy from the Director's Challenge Consortium. Four independent validation datasets, including GSE5843, GSE8894, and two other datasets provided by Mayo Clinic and Washington University, were used to assess the prediction accuracy by calculating the correlation between risk score estimated from gene expression and real recurrence-free survival time and AUC of time-dependent ROC analysis. Pathway-based survival analyses were also performed. 104 probesets correlated with recurrence in the training dataset. They are enriched in cell adhesion, apoptosis and regulation of cell proliferation. A 51-gene expression signature was identified to distinguish patients likely to develop tumor recurrence (Dxy = −0.83, P<1e-16) and this signature was validated in four independent datasets with AUC >85%. Multiple pathways including leukocyte transendothelial migration and cell adhesion were highly correlated with recurrence-free survival. The gene signature is highly predictive of recurrence in stage I NSCLC patients, which has important prognostic and therapeutic implications for the future management of these patients. PMID:22292069
Watanabe, Kazuhide; Biesinger, Jacob; Salmans, Michael L.; Roberts, Brian S.; Arthur, William T.; Cleary, Michele; Andersen, Bogi; Xie, Xiaohui; Dai, Xing
2014-01-01
Background Deregulation of canonical Wnt/CTNNB1 (beta-catenin) pathway is one of the earliest events in the pathogenesis of colon cancer. Mutations in APC or CTNNB1 are highly frequent in colon cancer and cause aberrant stabilization of CTNNB1, which activates the transcription of Wnt target genes by binding to chromatin via the TCF/LEF transcription factors. Here we report an integrative analysis of genome-wide chromatin occupancy of CTNNB1 by chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) and gene expression profiling by microarray analysis upon RNAi-mediated knockdown of CTNNB1 in colon cancer cells. Results We observed 3629 CTNNB1 binding peaks across the genome and a significant correlation between CTNNB1 binding and knockdown-induced gene expression change. Our integrative analysis led to the discovery of a direct Wnt target signature composed of 162 genes. Gene ontology analysis of this signature revealed a significant enrichment of Wnt pathway genes, suggesting multiple feedback regulations of the pathway. We provide evidence that this gene signature partially overlaps with the Lgr5+ intestinal stem cell signature, and is significantly enriched in normal intestinal stem cells as well as in clinical colorectal cancer samples. Interestingly, while the expression of the CTNNB1 target gene set does not correlate with survival, elevated expression of negative feedback regulators within the signature predicts better prognosis. Conclusion Our data provide a genome-wide view of chromatin occupancy and gene regulation of Wnt/CTNNB1 signaling in colon cancer cells. PMID:24651522
Watanabe, Kazuhide; Biesinger, Jacob; Salmans, Michael L; Roberts, Brian S; Arthur, William T; Cleary, Michele; Andersen, Bogi; Xie, Xiaohui; Dai, Xing
2014-01-01
Deregulation of canonical Wnt/CTNNB1 (beta-catenin) pathway is one of the earliest events in the pathogenesis of colon cancer. Mutations in APC or CTNNB1 are highly frequent in colon cancer and cause aberrant stabilization of CTNNB1, which activates the transcription of Wnt target genes by binding to chromatin via the TCF/LEF transcription factors. Here we report an integrative analysis of genome-wide chromatin occupancy of CTNNB1 by chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) and gene expression profiling by microarray analysis upon RNAi-mediated knockdown of CTNNB1 in colon cancer cells. We observed 3629 CTNNB1 binding peaks across the genome and a significant correlation between CTNNB1 binding and knockdown-induced gene expression change. Our integrative analysis led to the discovery of a direct Wnt target signature composed of 162 genes. Gene ontology analysis of this signature revealed a significant enrichment of Wnt pathway genes, suggesting multiple feedback regulations of the pathway. We provide evidence that this gene signature partially overlaps with the Lgr5+ intestinal stem cell signature, and is significantly enriched in normal intestinal stem cells as well as in clinical colorectal cancer samples. Interestingly, while the expression of the CTNNB1 target gene set does not correlate with survival, elevated expression of negative feedback regulators within the signature predicts better prognosis. Our data provide a genome-wide view of chromatin occupancy and gene regulation of Wnt/CTNNB1 signaling in colon cancer cells.
Four-miRNA signature as a prognostic tool for lung adenocarcinoma.
Lin, Yan; Lv, Yufeng; Liang, Rong; Yuan, Chunling; Zhang, Jinyan; He, Dan; Zheng, Xiaowen; Zhang, Jianfeng
2018-01-01
The aim of this study was to generate a novel miRNA expression signature to accurately predict prognosis for patients with lung adenocarcinoma (LUAD). Using expression profiles downloaded from The Cancer Genome Atlas database, we identified multiple miRNAs with differential expression between LUAD and paired healthy tissues. We then evaluated the prognostic values of the differentially expressed miRNAs using univariate/multivariate Cox regression analysis. This analysis was ultimately used to construct a four-miRNA signature that effectively predicted patient survival. Finally, we analyzed potential functional roles of the target genes for these four miRNAs using Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses. Based on our cutoff criteria ( P <0.05 and |log2FC| >1.0), we identified a total of 187 differentially expressed miRNAs, including 148 that were upregulated in LUAD tissues and 39 that were downregulated. Four miRNAs (miR-148a-5p, miR-31-5p, miR-548v, and miR-550a-5p) were independently associated with survival based on Kaplan-Meier analysis. We generated a signature index based on the expression of these four miRNAs and stratified patients into low- and high-risk groups. Patients in the high-risk group had significantly shorter survival times than those in the low-risk group ( P =0.002). A functional enrichment analysis suggested that the target genes of these four miRNAs were involved in protein phosphorylation and the Hippo and sphingolipid signaling pathways. Taken together, our results suggest that our four-miRNA signature can be used as a prognostic tool for patients with LUAD.
Grayson, Peter C.; Carmona-Rivera, Carmelo; Xu, Lijing; Lim, Noha; Gao, Zhong; Asare, Adam L.; Specks, Ulrich; Stone, John H.; Seo, Philip; Spiera, Robert F.; Langford, Carol A.; Hoffman, Gary S.; Kallenberg, Cees G.M.; St Clair, E. William; Tchao, Nadia K.; Ytterberg, Steven R.; Phippard, Deborah J.; Merkel, Peter A.; Kaplan, Mariana J.; Monach, Paul A.
2015-01-01
Objectives To discover biomarkers involved in the pathophysiology of ANCA-associated vasculitis (AAV) and determine if low-density granulocytes (LDGs) contribute to gene expression signatures in AAV. Methods The source of clinical data and linked biospecimens was a randomized controlled treatment trial in AAV. RNA-sequencing of whole blood from patients with AAV was performed during active disease at the baseline visit (BL) and during remission 6 months later (6M). Gene expression was compared between patients who met versus did not meet the primary trial outcome of clinical remission at 6M (responders vs. nonresponders). Measurement of neutrophil-related gene expression was confirmed in PBMCs to validate findings in whole blood. A negative selection strategy isolated LDGs from PBMC fractions. Results Differential expression between responders (n=77) and nonresponders (n=35) was detected in 2,346 transcripts at BL visit (p<0.05). Unsupervised hierarchical clustering demonstrated a cluster of granulocyte-related genes, including myeloperoxidase (MPO) and proteinase 3 (PR3). A granulocyte multi-gene composite score was significantly higher in nonresponders than responders (p<0.01) and during active disease compared to remission (p<0.01). This signature strongly overlapped an LDG signature identified previously in lupus (FDRGSEA<0.01). Transcription of PR3 measured in PBMCs was associated with active disease and treatment response (p<0.01). LDGs isolated from patients with AAV spontaneously formed neutrophil extracellular traps containing PR3 and MPO. Conclusions In AAV an increased expression of a granulocyte gene signature is associated with disease activity and decreased response to treatment. The source of this signature is likely LDGs, a potentially pathogenic cell type in AAV. PMID:25891759
CoBOP: Electro-Optic Identification Laser Line Sean Sensors
1998-01-01
Electro - Optic Identification Sensors Project[1] is to develop and demonstrate high resolution underwater electro - optic (EO) imaging sensors, and associated image processing/analysis methods, for rapid visual identification of mines and mine-like contacts (MLCs). Identification of MLCs is a pressing Fleet need. During MCM operations, sonar contacts are classified as mine-like if they are sufficiently similar to signatures of mines. Each contact classified as mine-like must be identified as a mine or not a mine. During MCM operations in littoral areas,
A Transcriptional Signature of Fatigue Derived from Patients with Primary Sjögren's Syndrome.
James, Katherine; Al-Ali, Shereen; Tarn, Jessica; Cockell, Simon J; Gillespie, Colin S; Hindmarsh, Victoria; Locke, James; Mitchell, Sheryl; Lendrem, Dennis; Bowman, Simon; Price, Elizabeth; Pease, Colin T; Emery, Paul; Lanyon, Peter; Hunter, John A; Gupta, Monica; Bombardieri, Michele; Sutcliffe, Nurhan; Pitzalis, Costantino; McLaren, John; Cooper, Annie; Regan, Marian; Giles, Ian; Isenberg, David; Saravanan, Vadivelu; Coady, David; Dasgupta, Bhaskar; McHugh, Neil; Young-Min, Steven; Moots, Robert; Gendi, Nagui; Akil, Mohammed; Griffiths, Bridget; Wipat, Anil; Newton, Julia; Jones, David E; Isaacs, John; Hallinan, Jennifer; Ng, Wan-Fai
2015-01-01
Fatigue is a debilitating condition with a significant impact on patients' quality of life. Fatigue is frequently reported by patients suffering from primary Sjögren's Syndrome (pSS), a chronic autoimmune condition characterised by dryness of the eyes and the mouth. However, although fatigue is common in pSS, it does not manifest in all sufferers, providing an excellent model with which to explore the potential underpinning biological mechanisms. Whole blood samples from 133 fully-phenotyped pSS patients stratified for the presence of fatigue, collected by the UK primary Sjögren's Syndrome Registry, were used for whole genome microarray. The resulting data were analysed both on a gene by gene basis and using pre-defined groups of genes. Finally, gene set enrichment analysis (GSEA) was used as a feature selection technique for input into a support vector machine (SVM) classifier. Classification was assessed using area under curve (AUC) of receiver operator characteristic and standard error of Wilcoxon statistic, SE(W). Although no genes were individually found to be associated with fatigue, 19 metabolic pathways were enriched in the high fatigue patient group using GSEA. Analysis revealed that these enrichments arose from the presence of a subset of 55 genes. A radial kernel SVM classifier with this subset of genes as input displayed significantly improved performance over classifiers using all pathway genes as input. The classifiers had AUCs of 0.866 (SE(W) 0.002) and 0.525 (SE(W) 0.006), respectively. Systematic analysis of gene expression data from pSS patients discordant for fatigue identified 55 genes which are predictive of fatigue level using SVM classification. This list represents the first step in understanding the underlying pathophysiological mechanisms of fatigue in patients with pSS.
Smid, Marcel; Rodríguez-González, F. Germán; Sieuwerts, Anieta M.; Salgado, Roberto; Prager-Van der Smissen, Wendy J. C.; Vlugt-Daane, Michelle van der; van Galen, Anne; Nik-Zainal, Serena; Staaf, Johan; Brinkman, Arie B.; van de Vijver, Marc J.; Richardson, Andrea L.; Fatima, Aquila; Berentsen, Kim; Butler, Adam; Martin, Sancha; Davies, Helen R.; Debets, Reno; Gelder, Marion E. Meijer-Van; van Deurzen, Carolien H. M.; MacGrogan, Gaëtan; Van den Eynden, Gert G. G. M.; Purdie, Colin; Thompson, Alastair M.; Caldas, Carlos; Span, Paul N.; Simpson, Peter T.; Lakhani, Sunil R.; Van Laere, Steven; Desmedt, Christine; Ringnér, Markus; Tommasi, Stefania; Eyford, Jorunn; Broeks, Annegien; Vincent-Salomon, Anne; Futreal, P. Andrew; Knappskog, Stian; King, Tari; Thomas, Gilles; Viari, Alain; Langerød, Anita; Børresen-Dale, Anne-Lise; Birney, Ewan; Stunnenberg, Hendrik G.; Stratton, Mike; Foekens, John A.; Martens, John W. M.
2016-01-01
A recent comprehensive whole genome analysis of a large breast cancer cohort was used to link known and novel drivers and substitution signatures to the transcriptome of 266 cases. Here, we validate that subtype-specific aberrations show concordant expression changes for, for example, TP53, PIK3CA, PTEN, CCND1 and CDH1. We find that CCND3 expression levels do not correlate with amplification, while increased GATA3 expression in mutant GATA3 cancers suggests GATA3 is an oncogene. In luminal cases the total number of substitutions, irrespective of type, associates with cell cycle gene expression and adverse outcome, whereas the number of mutations of signatures 3 and 13 associates with immune-response specific gene expression, increased numbers of tumour-infiltrating lymphocytes and better outcome. Thus, while earlier reports imply that the sheer number of somatic aberrations could trigger an immune-response, our data suggests that substitutions of a particular type are more effective in doing so than others. PMID:27666519
Fukutani, Eduardo; Rodrigues, Moreno; Kasprzykowski, José Irahe; Araujo, Cintia Figueiredo de; Paschoal, Alexandre Rossi; Ramos, Pablo Ivan Pereira; Fukutani, Kiyoshi Ferreira; Queiroz, Artur Trancoso Lopo de
2018-01-01
The mosquito Aedes aegypti is the main vector of several arthropod-borne diseases that have global impacts. In a previous meta-analysis, our group identified a vector gene set containing 110 genes strongly associated with infections of dengue, West Nile and yellow fever viruses. Of these 110 genes, four genes allowed a highly accurate classification of infected status. More recently, a new study of Ae. aegypti infected with Zika virus (ZIKV) was published, providing new data to investigate whether this "infection" gene set is also altered during a ZIKV infection. Our hypothesis is that the infection-associated signature may also serve as a proxy to classify the ZIKV infection in the vector. Raw data associated with the NCBI/BioProject were downloaded and re-analysed. A total of 18 paired-end replicates corresponding to three ZIKV-infected samples and three controls were included in this study. The nMDS technique with a logistic regression was used to obtain the probabilities of belonging to a given class. Thus, to compare both gene sets, we used the area under the curve and performed a comparison using the bootstrap method. Our meta-signature was able to separate the infected mosquitoes from the controls with good predictive power to classify the Zika-infected mosquitoes.
Mak, Milena P; Tong, Pan; Diao, Lixia; Cardnell, Robert J; Gibbons, Don L; William, William N; Skoulidis, Ferdinandos; Parra, Edwin R; Rodriguez-Canales, Jaime; Wistuba, Ignacio I; Heymach, John V; Weinstein, John N; Coombes, Kevin R; Wang, Jing; Byers, Lauren Averett
2016-02-01
We previously demonstrated the association between epithelial-to-mesenchymal transition (EMT) and drug response in lung cancer using an EMT signature derived in cancer cell lines. Given the contribution of tumor microenvironments to EMT, we extended our investigation of EMT to patient tumors from 11 cancer types to develop a pan-cancer EMT signature. Using the pan-cancer EMT signature, we conducted an integrated, global analysis of genomic and proteomic profiles associated with EMT across 1,934 tumors including breast, lung, colon, ovarian, and bladder cancers. Differences in outcome and in vitro drug response corresponding to expression of the pan-cancer EMT signature were also investigated. Compared with the lung cancer EMT signature, the patient-derived, pan-cancer EMT signature encompasses a set of core EMT genes that correlate even more strongly with known EMT markers across diverse tumor types and identifies differences in drug sensitivity and global molecular alterations at the DNA, RNA, and protein levels. Among those changes associated with EMT, pathway analysis revealed a strong correlation between EMT and immune activation. Further supervised analysis demonstrated high expression of immune checkpoints and other druggable immune targets, such as PD1, PD-L1, CTLA4, OX40L, and PD-L2, in tumors with the most mesenchymal EMT scores. Elevated PD-L1 protein expression in mesenchymal tumors was confirmed by IHC in an independent lung cancer cohort. This new signature provides a novel, patient-based, histology-independent tool for the investigation of EMT and offers insights into potential novel therapeutic targets for mesenchymal tumors, independent of cancer type, including immune checkpoints. ©2015 American Association for Cancer Research.
Wongchenko, Matthew J; McArthur, Grant A; Dréno, Brigitte; Larkin, James; Ascierto, Paolo A; Sosman, Jeffrey; Andries, Luc; Kockx, Mark; Hurst, Stephen D; Caro, Ivor; Rooney, Isabelle; Hegde, Priti S; Molinero, Luciana; Yue, Huibin; Chang, Ilsung; Amler, Lukas; Yan, Yibing; Ribas, Antoni
2017-09-01
Purpose: The association of tumor gene expression profiles with progression-free survival (PFS) outcomes in patients with BRAF V600 -mutated melanoma treated with vemurafenib or cobimetinib combined with vemurafenib was evaluated. Experimental Design: Gene expression of archival tumor samples from patients in four trials (BRIM-2, BRIM-3, BRIM-7, and coBRIM) was evaluated. Genes significantly associated with PFS ( P < 0.05) were identified by univariate Cox proportional hazards modeling, then subjected to unsupervised hierarchical clustering, principal component analysis, and recursive partitioning to develop optimized gene signatures. Results: Forty-six genes were identified as significantly associated with PFS in both BRIM-2 ( n = 63) and the vemurafenib arm of BRIM-3 ( n = 160). Two distinct signatures were identified: cell cycle and immune. Among vemurafenib-treated patients, the cell-cycle signature was associated with shortened PFS compared with the immune signature in the BRIM-2/BRIM-3 training set [hazard ratio (HR) 1.8; 95% confidence interval (CI), 1.3-2.6, P = 0.0001] and in the coBRIM validation set ( n = 101; HR, 1.6; 95% CI, 1.0-2.5; P = 0.08). The adverse impact of the cell-cycle signature on PFS was not observed in patients treated with cobimetinib combined with vemurafenib ( n = 99; HR, 1.1; 95% CI, 0.7-1.8; P = 0.66). Conclusions: In vemurafenib-treated patients, the cell-cycle gene signature was associated with shorter PFS. However, in cobimetinib combined with vemurafenib-treated patients, both cell cycle and immune signature subgroups had comparable PFS. Cobimetinib combined with vemurafenib may abrogate the adverse impact of the cell-cycle signature. Clin Cancer Res; 23(17); 5238-45. ©2017 AACR . ©2017 American Association for Cancer Research.
Martin, F; Talikka, M; Hoeng, J; Peitsch, M C
2015-12-01
Gene expression profiling data can be used in toxicology to assess both the level and impact of toxicant exposure, aligned with a vision of 21st century toxicology. Here, we present a whole blood-derived gene signature that can distinguish current smokers from either nonsmokers or former smokers with high specificity and sensitivity. Such a signature that can be measured in a surrogate tissue (whole blood) may help in monitoring smoking exposure as well as discontinuation of exposure when the primarily impacted tissue (e.g., lung) is not readily accessible. The signature consisted of LRRN3, SASH1, PALLD, RGL1, TNFRSF17, CDKN1C, IGJ, RRM2, ID3, SERPING1, and FUCA1. Several members of this signature have been previously described in the context of smoking. The signature translated well across species and could distinguish mice that were exposed to cigarette smoke from ones exposed to air only or had been withdrawn from cigarette smoke exposure. Finally, the small signature of only 11 genes could be converted into a polymerase chain reaction-based assay that could serve as a marker to monitor compliance with a smoking abstinence protocol. © The Author(s) 2015.
Phycoerythrin Signatures in the Littoral Zone
2007-06-29
TITLE. Enter title and subtitle with volume Enter report number as assigned by the sponsoring/ number and part number, if applicable . On classified...forms of chromatic adaptation known in some freshwater cyanobacteria that differentially regulate the synthesis of PE and phycocyanin (a blue green
Zhu, Jianguo; Pan, Cong; Jiang, Jun; Deng, Mingsen; Gao, Hengjun; Men, Bozhao; McClelland, Michael; Mercola, Dan; Zhong, Wei-De; Jia, Zhenyu
2015-01-01
We previously analyzed human prostate tissue containing stroma near to tumor and from cancer-negative tissues of volunteers. Over 100 candidate gene expression differences were identified and used to develop a classifier that could detect nearby tumor with an accuracy of 97% (sensitivity = 98% and specificity = 88%) based on 364 independent test cases from primarily European American cases. These stroma-based gene signatures have the potential to identify cancer patients among those with negative biopsies. In this study, we used prostate tissues from Chinese cases to validate six of these markers (CAV1, COL4A2, HSPB1, ITGB3, MAP1A and MCAM). In validation by real-time PCR, four genes (COL4A2, HSPB1, ITGB3, and MAP1A) demonstrated significantly lower expression in tumor-adjacent stroma compared to normal stroma (p value ≤ 0.05). Next, we tested whether these expression differences could be extended to the protein level. In IHC assays, all six selected proteins showed lower expression in tumor-adjacent stroma compared to the normal stroma, of which COL4A2, HSPB1 and ITGB3 showed significant differences (p value ≤ 0.05). These results suggest that biomarkers for diagnosing prostate cancer based on tumor microenvironment may be applicable across multiple racial groups. PMID:26158290
Hippo Cascade Controls Lineage Commitment of Liver Tumors in Mice and Humans.
Zhang, Shanshan; Wang, Jingxiao; Wang, Haichuan; Fan, Lingling; Fan, Biao; Zeng, Billy; Tao, Junyan; Li, Xiaolei; Che, Li; Cigliano, Antonio; Ribback, Silvia; Dombrowski, Frank; Chen, Bin; Cong, Wenming; Wei, Lixin; Calvisi, Diego F; Chen, Xin
2018-04-01
Primary liver cancer consists mainly of hepatocellular carcinoma (HCC) and intrahepatic cholangiocarcinoma (ICC). A subset of human HCCs expresses a ICC-like gene signature and is classified as ICC-like HCC. The Hippo pathway is a critical regulator of normal and malignant liver development. However, the precise function(s) of the Hippo cascade along liver carcinogenesis remain to be fully delineated. The role of the Hippo pathway in a murine mixed HCC/ICC model induced by activated forms of AKT and Ras oncogenes (AKT/Ras) was investigated. The authors demonstrated the inactivation of Hippo in AKT/Ras liver tumors leading to nuclear localization of Yap and TAZ. Coexpression of AKT/Ras with Lats2, which activates Hippo, or the dominant negative form of TEAD2 (dnTEAD2), which blocks Yap/TAZ activity, resulted in delayed hepatocarcinogenesis and elimination of ICC-like lesions in the liver. Mechanistically, Notch2 expression was found to be down-regulated by the Hippo pathway in liver tumors. Overexpression of Lats2 or dnTEAD2 in human HCC cell lines inhibited their growth and led to the decreased expression of ICC-like markers, as well as Notch2 expression. Altogether, this study supports the key role of the Hippo cascade in regulating the differentiation status of liver tumors. Copyright © 2018 American Society for Investigative Pathology. Published by Elsevier Inc. All rights reserved.
Quality Assurance of RNA Expression Profiling in Clinical Laboratories
Tang, Weihua; Hu, Zhiyuan; Muallem, Hind; Gulley, Margaret L.
2012-01-01
RNA expression profiles are increasingly used to diagnose and classify disease, based on expression patterns of as many as several thousand RNAs. To ensure quality of expression profiling services in clinical settings, a standard operating procedure incorporates multiple quality indicators and controls, beginning with preanalytic specimen preparation and proceeding thorough analysis, interpretation, and reporting. Before testing, histopathological examination of each cellular specimen, along with optional cell enrichment procedures, ensures adequacy of the input tissue. Other tactics include endogenous controls to evaluate adequacy of RNA and exogenous or spiked controls to evaluate run- and patient-specific performance of the test system, respectively. Unique aspects of quality assurance for array-based tests include controls for the pertinent outcome signatures that often supersede controls for each individual analyte, built-in redundancy for critical analytes or biochemical pathways, and software-supported scrutiny of abundant data by a laboratory physician who interprets the findings in a manner facilitating appropriate medical intervention. Access to high-quality reagents, instruments, and software from commercial sources promotes standardization and adoption in clinical settings, once an assay is vetted in validation studies as being analytically sound and clinically useful. Careful attention to the well-honed principles of laboratory medicine, along with guidance from government and professional groups on strategies to preserve RNA and manage large data sets, promotes clinical-grade assay performance. PMID:22020152
GESearch: An Interactive GUI Tool for Identifying Gene Expression Signature.
Ye, Ning; Yin, Hengfu; Liu, Jingjing; Dai, Xiaogang; Yin, Tongming
2015-01-01
The huge amount of gene expression data generated by microarray and next-generation sequencing technologies present challenges to exploit their biological meanings. When searching for the coexpression genes, the data mining process is largely affected by selection of algorithms. Thus, it is highly desirable to provide multiple options of algorithms in the user-friendly analytical toolkit to explore the gene expression signatures. For this purpose, we developed GESearch, an interactive graphical user interface (GUI) toolkit, which is written in MATLAB and supports a variety of gene expression data files. This analytical toolkit provides four models, including the mean, the regression, the delegate, and the ensemble models, to identify the coexpression genes, and enables the users to filter data and to select gene expression patterns by browsing the display window or by importing knowledge-based genes. Subsequently, the utility of this analytical toolkit is demonstrated by analyzing two sets of real-life microarray datasets from cell-cycle experiments. Overall, we have developed an interactive GUI toolkit that allows for choosing multiple algorithms for analyzing the gene expression signatures.
Adipose Gene Expression Prior to Weight Loss Can Differentiate and Weakly Predict Dietary Responders
Mutch, David M.; Temanni, M. Ramzi; Henegar, Corneliu; Combes, Florence; Pelloux, Véronique; Holst, Claus; Sørensen, Thorkild I. A.; Astrup, Arne; Martinez, J. Alfredo; Saris, Wim H. M.; Viguerie, Nathalie; Langin, Dominique; Zucker, Jean-Daniel; Clément, Karine
2007-01-01
Background The ability to identify obese individuals who will successfully lose weight in response to dietary intervention will revolutionize disease management. Therefore, we asked whether it is possible to identify subjects who will lose weight during dietary intervention using only a single gene expression snapshot. Methodology/Principal Findings The present study involved 54 female subjects from the Nutrient-Gene Interactions in Human Obesity-Implications for Dietary Guidelines (NUGENOB) trial to determine whether subcutaneous adipose tissue gene expression could be used to predict weight loss prior to the 10-week consumption of a low-fat hypocaloric diet. Using several statistical tests revealed that the gene expression profiles of responders (8–12 kgs weight loss) could always be differentiated from non-responders (<4 kgs weight loss). We also assessed whether this differentiation was sufficient for prediction. Using a bottom-up (i.e. black-box) approach, standard class prediction algorithms were able to predict dietary responders with up to 61.1%±8.1% accuracy. Using a top-down approach (i.e. using differentially expressed genes to build a classifier) improved prediction accuracy to 80.9%±2.2%. Conclusion Adipose gene expression profiling prior to the consumption of a low-fat diet is able to differentiate responders from non-responders as well as serve as a weak predictor of subjects destined to lose weight. While the degree of prediction accuracy currently achieved with a gene expression snapshot is perhaps insufficient for clinical use, this work reveals that the comprehensive molecular signature of adipose tissue paves the way for the future of personalized nutrition. PMID:18094752
Extensive Transcriptomic and Genomic Analysis Provides New Insights about Luminal Breast Cancers
Tishchenko, Inna; Milioli, Heloisa Helena; Riveros, Carlos; Moscato, Pablo
2016-01-01
Despite constituting approximately two thirds of all breast cancers, the luminal A and B tumours are poorly classified at both clinical and molecular levels. There are contradictory reports on the nature of these subtypes: some define them as intrinsic entities, others as a continuum. With the aim of addressing these uncertainties and identifying molecular signatures of patients at risk, we conducted a comprehensive transcriptomic and genomic analysis of 2,425 luminal breast cancer samples. Our results indicate that the separation between the molecular luminal A and B subtypes—per definition—is not associated with intrinsic characteristics evident in the differentiation between other subtypes. Moreover, t-SNE and MST-kNN clustering approaches based on 10,000 probes, associated with luminal tumour initiation and/or development, revealed the close connections between luminal A and B tumours, with no evidence of a clear boundary between them. Thus, we considered all luminal tumours as a single heterogeneous group for analysis purposes. We first stratified luminal tumours into two distinct groups by their HER2 gene cluster co-expression: HER2-amplified luminal and ordinary-luminal. The former group is associated with distinct transcriptomic and genomic profiles, and poor prognosis; it comprises approximately 8% of all luminal cases. For the remaining ordinary-luminal tumours we further identified the molecular signature correlated with disease outcomes, exhibiting an approximately continuous gene expression range from low to high risk. Thus, we employed four virtual quantiles to segregate the groups of patients. The clinico-pathological characteristics and ratios of genomic aberrations are concordant with the variations in gene expression profiles, hinting at a progressive staging. The comparison with the current separation into luminal A and B subtypes revealed a substantially improved survival stratification. Concluding, we suggest a review of the definition of luminal A and B subtypes. A proposition for a revisited delineation is provided in this study. PMID:27341628
Taroni, Jaclyn N; Greene, Casey S; Martyanov, Viktor; Wood, Tammara A; Christmann, Romy B; Farber, Harrison W; Lafyatis, Robert A; Denton, Christopher P; Hinchcliff, Monique E; Pioli, Patricia A; Mahoney, J Matthew; Whitfield, Michael L
2017-03-23
Systemic sclerosis (SSc) is a multi-organ autoimmune disease characterized by skin fibrosis. Internal organ involvement is heterogeneous. It is unknown whether disease mechanisms are common across all involved affected tissues or if each manifestation has a distinct underlying pathology. We used consensus clustering to compare gene expression profiles of biopsies from four SSc-affected tissues (skin, lung, esophagus, and peripheral blood) from patients with SSc, and the related conditions pulmonary fibrosis (PF) and pulmonary arterial hypertension, and derived a consensus disease-associate signature across all tissues. We used this signature to query tissue-specific functional genomic networks. We performed novel network analyses to contrast the skin and lung microenvironments and to assess the functional role of the inflammatory and fibrotic genes in each organ. Lastly, we tested the expression of macrophage activation state-associated gene sets for enrichment in skin and lung using a Wilcoxon rank sum test. We identified a common pathogenic gene expression signature-an immune-fibrotic axis-indicative of pro-fibrotic macrophages (MØs) in multiple tissues (skin, lung, esophagus, and peripheral blood mononuclear cells) affected by SSc. While the co-expression of these genes is common to all tissues, the functional consequences of this upregulation differ by organ. We used this disease-associated signature to query tissue-specific functional genomic networks to identify common and tissue-specific pathologies of SSc and related conditions. In contrast to skin, in the lung-specific functional network we identify a distinct lung-resident MØ signature associated with lipid stimulation and alternative activation. In keeping with our network results, we find distinct MØ alternative activation transcriptional programs in SSc-associated PF lung and in the skin of patients with an "inflammatory" SSc gene expression signature. Our results suggest that the innate immune system is central to SSc disease processes but that subtle distinctions exist between tissues. Our approach provides a framework for examining molecular signatures of disease in fibrosis and autoimmune diseases and for leveraging publicly available data to understand common and tissue-specific disease processes in complex human diseases.
The goal of this project is to use an eight-gene expression profile to define functional signatures for small molecules and natural products with heretofore undefined mechanism of action. Two genes in the eight gene set are used as internal controls and do not vary across gene expression array data collected from the public domain. The remaining six genes are found to vary independently across a large collection of publically available gene expression array datasets. Read the abstract
Automatic Organ Segmentation for CT Scans Based on Super-Pixel and Convolutional Neural Networks.
Liu, Xiaoming; Guo, Shuxu; Yang, Bingtao; Ma, Shuzhi; Zhang, Huimao; Li, Jing; Sun, Changjian; Jin, Lanyi; Li, Xueyan; Yang, Qi; Fu, Yu
2018-04-20
Accurate segmentation of specific organ from computed tomography (CT) scans is a basic and crucial task for accurate diagnosis and treatment. To avoid time-consuming manual optimization and to help physicians distinguish diseases, an automatic organ segmentation framework is presented. The framework utilized convolution neural networks (CNN) to classify pixels. To reduce the redundant inputs, the simple linear iterative clustering (SLIC) of super-pixels and the support vector machine (SVM) classifier are introduced. To establish the perfect boundary of organs in one-pixel-level, the pixels need to be classified step-by-step. First, the SLIC is used to cut an image into grids and extract respective digital signatures. Next, the signature is classified by the SVM, and the rough edges are acquired. Finally, a precise boundary is obtained by the CNN, which is based on patches around each pixel-point. The framework is applied to abdominal CT scans of livers and high-resolution computed tomography (HRCT) scans of lungs. The experimental CT scans are derived from two public datasets (Sliver 07 and a Chinese local dataset). Experimental results show that the proposed method can precisely and efficiently detect the organs. This method consumes 38 s/slice for liver segmentation. The Dice coefficient of the liver segmentation results reaches to 97.43%. For lung segmentation, the Dice coefficient is 97.93%. This finding demonstrates that the proposed framework is a favorable method for lung segmentation of HRCT scans.
Bisarro Dos Reis, Mariana; Barros-Filho, Mateus Camargo; Marchi, Fábio Albuquerque; Beltrami, Caroline Moraes; Kuasne, Hellen; Pinto, Clóvis Antônio Lopes; Ambatipudi, Srikant; Herceg, Zdenko; Kowalski, Luiz Paulo; Rogatto, Silvia Regina
2017-11-01
Even though the majority of well-differentiated thyroid carcinoma (WDTC) is indolent, a number of cases display an aggressive behavior. Cumulative evidence suggests that the deregulation of DNA methylation has the potential to point out molecular markers associated with worse prognosis. To identify a prognostic epigenetic signature in thyroid cancer. Genome-wide DNA methylation assays (450k platform, Illumina) were performed in a cohort of 50 nonneoplastic thyroid tissues (NTs), 17 benign thyroid lesions (BTLs), and 74 thyroid carcinomas (60 papillary, 8 follicular, 2 Hürthle cell, 1 poorly differentiated, and 3 anaplastic). A prognostic classifier for WDTC was developed via diagonal linear discriminant analysis. The results were compared with The Cancer Genome Atlas (TCGA) database. A specific epigenetic profile was detected according to each histological subtype. BTLs and follicular carcinomas showed a greater number of methylated CpG in comparison with NTs, whereas hypomethylation was predominant in papillary and undifferentiated carcinomas. A prognostic classifier based on 21 DNA methylation probes was able to predict poor outcome in patients with WDTC (sensitivity 63%, specificity 92% for internal data; sensitivity 64%, specificity 88% for TCGA data). High-risk score based on the classifier was considered an independent factor of poor outcome (Cox regression, P < 0.001). The methylation profile of thyroid lesions exhibited a specific signature according to the histological subtype. A meaningful algorithm composed of 21 probes was capable of predicting the recurrence in WDTC. Copyright © 2017 Endocrine Society
Pescatori, Mario; Broccolini, Aldobrando; Minetti, Carlo; Bertini, Enrico; Bruno, Claudio; D'amico, Adele; Bernardini, Camilla; Mirabella, Massimiliano; Silvestri, Gabriella; Giglio, Vincenzo; Modoni, Anna; Pedemonte, Marina; Tasca, Giorgio; Galluzzi, Giuliana; Mercuri, Eugenio; Tonali, Pietro A; Ricci, Enzo
2007-04-01
Genome-wide gene expression profiling of skeletal muscle from Duchenne muscular dystrophy (DMD) patients has been used to describe muscle tissue alterations in DMD children older than 5 years. By studying the expression profile of 19 patients younger than 2 years, we describe with high resolution the gene expression signature that characterizes DMD muscle during the initial or "presymptomatic" phase of the disease. We show that in the first 2 years of the disease, DMD muscle is already set to express a distinctive gene expression pattern considerably different from the one expressed by normal, age-matched muscle. This "dystrophic" molecular signature is characterized by a coordinate induction of genes involved in the inflammatory response, extracellular matrix (ECM) remodeling and muscle regeneration, and the reduced transcription of those involved in energy metabolism. Despite the lower degree of muscle dysfunction experienced, our younger patients showed abnormal expression of most of the genes reported as differentially expressed in more advanced stages of the disease. By analyzing our patients as a time series, we provide evidence that some genes, including members of three pathways involved in morphogenetic signaling-Wnt, Notch, and BMP-are progressively induced or repressed in the natural history of DMD.
Distinguishing signatures of determinism and stochasticity in spiking complex systems
Aragoneses, Andrés; Rubido, Nicolás; Tiana-Alsina, Jordi; Torrent, M. C.; Masoller, Cristina
2013-01-01
We describe a method to infer signatures of determinism and stochasticity in the sequence of apparently random intensity dropouts emitted by a semiconductor laser with optical feedback. The method uses ordinal time-series analysis to classify experimental data of inter-dropout-intervals (IDIs) in two categories that display statistically significant different features. Despite the apparent randomness of the dropout events, one IDI category is consistent with waiting times in a resting state until noise triggers a dropout, and the other is consistent with dropouts occurring during the return to the resting state, which have a clear deterministic component. The method we describe can be a powerful tool for inferring signatures of determinism in the dynamics of complex systems in noisy environments, at an event-level description of their dynamics.
Kim, W T; Kim, J; Yan, C; Jeong, P; Choi, S Y; Lee, O J; Chae, Y B; Yun, S J; Lee, S C; Kim, W J
2014-05-01
In our previous gene expression profile analysis, IL1B, S100A8, S100A9, and EGFR were shown to be important mediators of muscle invasive bladder cancer (MIBC) progression. The aim of the present study was to investigate the ability of these gene signatures to predict disease progression after chemotherapy in patients with locally recurrent or metastatic MIBC. Patients with locally advanced MIBC who received chemotherapy were enrolled. The expression signatures of four genes were measured and carried out further functional analysis to confirm our findings. Two of the four genes, S100A9 and EGFR, were determined to significantly influence disease progression (P = 0.023, 0.045, respectively). Based on a receiver operating characteristic curve, a cut-off value for disease progression was determined. Patients with the good-prognostic signature group had a significantly longer time to progression and cancer-specific survival time than those with the poor-prognostic signature group (P < 0.001, 0.042, respectively). In the multivariate Cox regression analysis, gene signature was the only factor that significantly influenced disease progression [hazard ratio: 4.726, confidence interval: 1.623-13.763, P = 0.004]. In immunohistochemical analysis, S100A9 and EGFR positivity were associated with disease progression after chemotherapy. Protein expression of S100A9/EGFR showed modest correlation with gene expression of S100A9/EGFR (r = 0.395, P = 0.014 and r = 0.453, P = 0.004). Our functional analysis provided the evidence demonstrating that expression of S100A9 and EGFR closely associated chemoresistance, and that inhibition of S100A9 and EGFR may sensitize bladder tumor cells to the cisplatin-based chemotherapy. The S100A9/EGFR level is a novel prognostic marker to predict the chemoresponsiveness of patients with locally recurrent or metastatic MIBC.
Sarkar, Shreya; Alam, Neyaz; Mandal, Syam Sundar; Chatterjee, Kabita; Ghosh, Supratim; Roychoudhury, Susanta; Panda, Chinmay Kumar
2018-01-01
Head and neck squamous cell carcinoma (HNSCC) is a global disease and mortality burden, necessitating the elucidation of its molecular progression for effective disease management. The study aims to understand the molecular profile of three candidate cell cycle regulatory genes, RBSP3, LIMD1 and CDC25A in the basal/ parabasal versus spinous layer of normal oral epithelium and during head and neck tumorigenesis. Immunohistochemical expression and promoter methylation was used to determine the molecular signature in normal oral epithelium. The mechanism of alteration transmission of this profile during tumorigenesis was then explored through additional deletion and mutation in HPV/ tobacco etiological groups, followed byclinico-pathological correlation. In basal/parabasal layer, the molecular signature of the genes was low protein expression/ high promoter methylation of RBSP3, high expression/ low methylation of LIMD1 and high expression of CDC25A. Dysplastic epithelium maintained the signature of RBSP3 through high methylation/ additional deletion with loss of the signatures of LIMD1 and CDC25A via deletion/ additional methylation. Similarly, maintenance and / or loss of signature in invasive tumors was by recurrent deletion/ methylation. Thus, differential patterns of alteration of the genes might be pre-requisite for the development of dysplastic and invasive lesions. Etiological factors played a key role in promoting genetic alterations and determining prognosis. Tobacco negative HNSCC patients had significantly lower alterations of LIMD1 and CDC25A, along with better survival among tobacco negative/ HPV positive patients. Our data suggests the necessity for perturbation of normal molecular profile of RBSP3, LIMD1 and CDC25A in conjunction with etiological factors for head and neck tumorigenesis, implying their diagnostic and prognostic significance.
Kwei, Kevin A; Baker, Joffre B; Pelham, Robert J
2012-01-01
The phosphoinositide 3-kinase (PI3K) signaling pathway is significantly altered in a wide variety of human cancers, driving cancer cell growth and survival. Consequently, a large number of PI3K inhibitors are now in clinical development. To begin to improve the selection of patients for treatment with PI3K inhibitors and to identify de novo determinants of patient response, we sought to identify and characterize candidate genomic and phosphoproteomic biomarkers predictive of response to the selective PI3K inhibitor, GDC-0941, using the NCI-60 human tumor cell line collection. In this study, sixty diverse tumor cell lines were exposed to GDC-0941 and classified by GI(50) value as sensitive or resistant. The most sensitive and resistant cell lines were analyzed for their baseline levels of gene expression and phosphorylation of key signaling nodes. Phosphorylation or activation status of both the PI3K-Akt signaling axis and PARP were correlated with in vitro response to GDC-0941. A gene expression signature associated with in vitro sensitivity to GDC-0941 was also identified. Furthermore, in vitro siRNA-mediated silencing of two genes in this signature, OGT and DDN, validated their role in modulating sensitivity to GDC-0941 in numerous cell lines and begins to provide biological insights into their role as chemosensitizers. These candidate biomarkers will offer useful tools to begin a more thorough understanding of determinants of patient response to PI3K inhibitors and merit exploration in human cancer patients treated with PI3K inhibitors.
Kwei, Kevin A.; Baker, Joffre B.; Pelham, Robert J.
2012-01-01
The phosphoinositide 3-kinase (PI3K) signaling pathway is significantly altered in a wide variety of human cancers, driving cancer cell growth and survival. Consequently, a large number of PI3K inhibitors are now in clinical development. To begin to improve the selection of patients for treatment with PI3K inhibitors and to identify de novo determinants of patient response, we sought to identify and characterize candidate genomic and phosphoproteomic biomarkers predictive of response to the selective PI3K inhibitor, GDC-0941, using the NCI-60 human tumor cell line collection. In this study, sixty diverse tumor cell lines were exposed to GDC-0941 and classified by GI50 value as sensitive or resistant. The most sensitive and resistant cell lines were analyzed for their baseline levels of gene expression and phosphorylation of key signaling nodes. Phosphorylation or activation status of both the PI3K-Akt signaling axis and PARP were correlated with in vitro response to GDC-0941. A gene expression signature associated with in vitro sensitivity to GDC-0941 was also identified. Furthermore, in vitro siRNA-mediated silencing of two genes in this signature, OGT and DDN, validated their role in modulating sensitivity to GDC-0941 in numerous cell lines and begins to provide biological insights into their role as chemosensitizers. These candidate biomarkers will offer useful tools to begin a more thorough understanding of determinants of patient response to PI3K inhibitors and merit exploration in human cancer patients treated with PI3K inhibitors. PMID:23029544
Flytzanis, Nicholas C.; Balsamo, Michele; Condeelis, John S.; Oktay, Maja H.; Burge, Christopher B.; Gertler, Frank B.
2011-01-01
Epithelial-mesenchymal transition (EMT), a mechanism important for embryonic development, plays a critical role during malignant transformation. While much is known about transcriptional regulation of EMT, alternative splicing of several genes has also been correlated with EMT progression, but the extent of splicing changes and their contributions to the morphological conversion accompanying EMT have not been investigated comprehensively. Using an established cell culture model and RNA–Seq analyses, we determined an alternative splicing signature for EMT. Genes encoding key drivers of EMT–dependent changes in cell phenotype, such as actin cytoskeleton remodeling, regulation of cell–cell junction formation, and regulation of cell migration, were enriched among EMT–associated alternatively splicing events. Our analysis suggested that most EMT–associated alternative splicing events are regulated by one or more members of the RBFOX, MBNL, CELF, hnRNP, or ESRP classes of splicing factors. The EMT alternative splicing signature was confirmed in human breast cancer cell lines, which could be classified into basal and luminal subtypes based exclusively on their EMT–associated splicing pattern. Expression of EMT–associated alternative mRNA transcripts was also observed in primary breast cancer samples, indicating that EMT–dependent splicing changes occur commonly in human tumors. The functional significance of EMT–associated alternative splicing was tested by expression of the epithelial-specific splicing factor ESRP1 or by depletion of RBFOX2 in mesenchymal cells, both of which elicited significant changes in cell morphology and motility towards an epithelial phenotype, suggesting that splicing regulation alone can drive critical aspects of EMT–associated phenotypic changes. The molecular description obtained here may aid in the development of new diagnostic and prognostic markers for analysis of breast cancer progression. PMID:21876675
ToxRefDB: Classifying ToxCast™ Phase I Chemicals Utilizing Structured Toxicity Information
There is an essential need for highly detailed chemicals classifications within the ToxCast™ research program. In order to develop predictive models and biological signatures utilizing high-throughput screening (HTS) and in vitro genomic data, relevant endpoints and toxicities m...
Schwalbe, E C; Hicks, D; Rafiee, G; Bashton, M; Gohlke, H; Enshaei, A; Potluri, S; Matthiesen, J; Mather, M; Taleongpong, P; Chaston, R; Silmon, A; Curtis, A; Lindsey, J C; Crosier, S; Smith, A J; Goschzik, T; Doz, F; Rutkowski, S; Lannering, B; Pietsch, T; Bailey, S; Williamson, D; Clifford, S C
2017-10-18
Rapid and reliable detection of disease-associated DNA methylation patterns has major potential to advance molecular diagnostics and underpin research investigations. We describe the development and validation of minimal methylation classifier (MIMIC), combining CpG signature design from genome-wide datasets, multiplex-PCR and detection by single-base extension and MALDI-TOF mass spectrometry, in a novel method to assess multi-locus DNA methylation profiles within routine clinically-applicable assays. We illustrate the application of MIMIC to successfully identify the methylation-dependent diagnostic molecular subgroups of medulloblastoma (the most common malignant childhood brain tumour), using scant/low-quality samples remaining from the most recently completed pan-European medulloblastoma clinical trial, refractory to analysis by conventional genome-wide DNA methylation analysis. Using this approach, we identify critical DNA methylation patterns from previously inaccessible cohorts, and reveal novel survival differences between the medulloblastoma disease subgroups with significant potential for clinical exploitation.
Common patterns and disease-related signatures in tuberculosis and sarcoidosis.
Maertzdorf, Jeroen; Weiner, January; Mollenkopf, Hans-Joachim; Bauer, Torsten; Prasse, Antje; Müller-Quernheim, Joachim; Kaufmann, Stefan H E
2012-05-15
In light of the marked global health impact of tuberculosis (TB), strong focus has been on identifying biosignatures. Gene expression profiles in blood cells identified so far are indicative of a persistent activation of the immune system and chronic inflammatory pathology in active TB. Definition of a biosignature with unique specificity for TB demands that identified profiles can differentiate diseases with similar pathology, like sarcoidosis (SARC). Here, we present a detailed comparison between pulmonary TB and SARC, including whole-blood gene expression profiling, microRNA expression, and multiplex serum analytes. Our analysis reveals that previously disclosed gene expression signatures in TB show highly similar patterns in SARC, with a common up-regulation of proinflammatory pathways and IFN signaling and close similarity to TB-related signatures. microRNA expression also presented a highly similar pattern in both diseases, whereas cytokines in the serum of TB patients revealed a slightly elevated proinflammatory pattern compared with SARC and controls. Our results indicate several differences in expression between the two diseases, with increased metabolic activity and significantly higher antimicrobial defense responses in TB. However, matrix metallopeptidase 14 was identified as the most distinctive marker of SARC. Described communalities as well as unique signatures in blood profiles of two distinct inflammatory pulmonary diseases not only have considerable implications for the design of TB biosignatures and future diagnosis, but they also provide insights into biological processes underlying chronic inflammatory disease entities of different etiology.
van Beers, Erik H; van Vliet, Martin H; Kuiper, Rowan; de Best, Leonie; Anderson, Kenneth C; Chari, Ajai; Jagannath, Sundar; Jakubowiak, Andrzej; Kumar, Shaji K; Levy, Joan B; Auclair, Daniel; Lonial, Sagar; Reece, Donna; Richardson, Paul; Siegel, David S; Stewart, A Keith; Trudel, Suzanne; Vij, Ravi; Zimmerman, Todd M; Fonseca, Rafael
2017-09-01
High risk and low risk multiple myeloma patients follow a very different clinical course as reflected in their PFS and OS. To be clinically useful, methodologies used to identify high and low risk disease must be validated in representative independent clinical data and available so that patients can be managed appropriately. A recent analysis has indicated that SKY92 combined with the International Staging System (ISS) identifies patients with different risk disease with high sensitivity. Here we computed the performance of eight gene expression based classifiers SKY92, UAMS70, UAMS80, IFM15, Proliferation Index, Centrosome Index, Cancer Testis Antigen and HM19 as well as the combination of SKY92/ISS in an independent cohort of 91 newly diagnosed MM patients. The classifiers identified between 9%-21% of patients as high risk, with hazard ratios (HRs) between 1.9 and 8.2. Among the eight signatures, SKY92 identified the largest proportion of patients (21%) also with the highest HR (8.2). Our analysis also validated the combination SKY92/ISS for identification of three classes; low risk (42%), intermediate risk (37%) and high risk (21%). Between low risk and high risk classes the HR is >10. Copyright © 2017 Elsevier Inc. All rights reserved.
On Expression Patterns and Developmental Origin of Human Brain Regions.
Kirsch, Lior; Chechik, Gal
2016-08-01
Anatomical substructures of the human brain have characteristic cell-types, connectivity and local circuitry, which are reflected in area-specific transcriptome signatures, but the principles governing area-specific transcription and their relation to brain development are still being studied. In adult rodents, areal transcriptome patterns agree with the embryonic origin of brain regions, but the processes and genes that preserve an embryonic signature in regional expression profiles were not quantified. Furthermore, it is not clear how embryonic-origin signatures of adult-brain expression interplay with changes in expression patterns during development. Here we first quantify which genes have regional expression-patterns related to the developmental origin of brain regions, using genome-wide mRNA expression from post-mortem adult human brains. We find that almost all human genes (92%) exhibit an expression pattern that agrees with developmental brain-region ontology, but that this agreement changes at multiple phases during development. Agreement is particularly strong in neuron-specific genes, but also in genes that are not spatially correlated with neuron-specific or glia-specific markers. Surprisingly, agreement is also stronger in early-evolved genes. We further find that pairs of similar genes having high agreement to developmental region ontology tend to be more strongly correlated or anti-correlated, and that the strength of spatial correlation changes more strongly in gene pairs with stronger embryonic signatures. These results suggest that transcription regulation of most genes in the adult human brain is spatially tuned in a way that changes through life, but in agreement with development-determined brain regions.
On Expression Patterns and Developmental Origin of Human Brain Regions
Kirsch, Lior; Chechik, Gal
2016-01-01
Anatomical substructures of the human brain have characteristic cell-types, connectivity and local circuitry, which are reflected in area-specific transcriptome signatures, but the principles governing area-specific transcription and their relation to brain development are still being studied. In adult rodents, areal transcriptome patterns agree with the embryonic origin of brain regions, but the processes and genes that preserve an embryonic signature in regional expression profiles were not quantified. Furthermore, it is not clear how embryonic-origin signatures of adult-brain expression interplay with changes in expression patterns during development. Here we first quantify which genes have regional expression-patterns related to the developmental origin of brain regions, using genome-wide mRNA expression from post-mortem adult human brains. We find that almost all human genes (92%) exhibit an expression pattern that agrees with developmental brain-region ontology, but that this agreement changes at multiple phases during development. Agreement is particularly strong in neuron-specific genes, but also in genes that are not spatially correlated with neuron-specific or glia-specific markers. Surprisingly, agreement is also stronger in early-evolved genes. We further find that pairs of similar genes having high agreement to developmental region ontology tend to be more strongly correlated or anti-correlated, and that the strength of spatial correlation changes more strongly in gene pairs with stronger embryonic signatures. These results suggest that transcription regulation of most genes in the adult human brain is spatially tuned in a way that changes through life, but in agreement with development-determined brain regions. PMID:27564987
Poussin, Carine; Belcastro, Vincenzo; Martin, Florian; Boué, Stéphanie; Peitsch, Manuel C; Hoeng, Julia
2017-04-17
Systems toxicology intends to quantify the effect of toxic molecules in biological systems and unravel their mechanisms of toxicity. The development of advanced computational methods is required for analyzing and integrating high throughput data generated for this purpose as well as for extrapolating predictive toxicological outcomes and risk estimates. To ensure the performance and reliability of the methods and verify conclusions from systems toxicology data analysis, it is important to conduct unbiased evaluations by independent third parties. As a case study, we report here the results of an independent verification of methods and data in systems toxicology by crowdsourcing. The sbv IMPROVER systems toxicology computational challenge aimed to evaluate computational methods for the development of blood-based gene expression signature classification models with the ability to predict smoking exposure status. Participants created/trained models on blood gene expression data sets including smokers/mice exposed to 3R4F (a reference cigarette) or noncurrent smokers/Sham (mice exposed to air). Participants applied their models on unseen data to predict whether subjects classify closer to smoke-exposed or nonsmoke exposed groups. The data sets also included data from subjects that had been exposed to potential modified risk tobacco products (MRTPs) or that had switched to a MRTP after exposure to conventional cigarette smoke. The scoring of anonymized participants' predictions was done using predefined metrics. The top 3 performers' methods predicted class labels with area under the precision recall scores above 0.9. Furthermore, although various computational approaches were used, the crowd's results confirmed our own data analysis outcomes with regards to the classification of MRTP-related samples. Mice exposed directly to a MRTP were classified closer to the Sham group. After switching to a MRTP, the confidence that subjects belonged to the smoke-exposed group decreased significantly. Smoking exposure gene signatures that contributed to the group separation included a core set of genes highly consistent across teams such as AHRR, LRRN3, SASH1, and P2RY6. In conclusion, crowdsourcing constitutes a pertinent approach, in complement to the classical peer review process, to independently and unbiasedly verify computational methods and data for risk assessment using systems toxicology.
Ali, Mohamed A E; Naka, Kazuhito; Yoshida, Akiyo; Fuse, Kyoko; Kasada, Atsuo; Hoshii, Takayuki; Tadokoro, Yuko; Ueno, Masaya; Ohta, Kumiko; Kobayashi, Masahiko; Takahashi, Chiaki; Hirao, Atsushi
2014-07-18
Acute myeloid leukaemia (AML) is a heterogeneous neoplastic disorder in which a subset of cells function as leukaemia-initiating cells (LICs). In this study, we prospectively evaluated the leukaemia-initiating capacity of AML cells fractionated according to the expression of a nucleolar GTP binding protein, nucleostemin (NS). To monitor NS expression in living AML cells, we generated a mouse AML model in which green fluorescent protein (GFP) is expressed under the control of a region of the NS promoter (NS-GFP). In AML cells, NS-GFP levels were correlated with endogenous NS mRNA. AML cells with the highest expression of NS-GFP were very immature blast-like cells, efficiently formed leukaemia colonies in vitro, and exhibited the highest leukaemia-initiating capacity in vivo. Gene expression profiling analysis revealed that cell cycle regulators and nucleotide metabolism-related genes were highly enriched in a gene set associated with leukaemia-initiating capacity that we termed the 'leukaemia stem cell gene signature'. This gene signature stratified human AML patients into distinct clusters that reflected prognosis, demonstrating that the mouse leukaemia stem cell gene signature is significantly associated with the malignant properties of human AML. Further analyses of gene regulation in leukaemia stem cells could provide novel insights into diagnostic and therapeutic approaches to AML. Copyright © 2014 Elsevier Inc. All rights reserved.
Mehdi, Ahmed M; Hamilton-Williams, Emma E; Cristino, Alexandre; Ziegler, Anette; Bonifacio, Ezio; Le Cao, Kim-Anh; Harris, Mark; Thomas, Ranjeny
2018-03-08
Autoimmune-mediated destruction of pancreatic islet β cells results in type 1 diabetes (T1D). Serum islet autoantibodies usually develop in genetically susceptible individuals in early childhood before T1D onset, with multiple islet autoantibodies predicting diabetes development. However, most at-risk children remain islet-antibody negative, and no test currently identifies those likely to seroconvert. We sought a genomic signature predicting seroconversion risk by integrating longitudinal peripheral blood gene expression profiles collected in high-risk children included in the BABYDIET and DIPP cohorts, of whom 50 seroconverted. Subjects were followed for 10 years to determine time of seroconversion. Any cohort effect and the time of seroconversion were corrected to uncover genes differentially expressed (DE) in seroconverting children. Gene expression signatures associated with seroconversion were evident during the first year of life, with 67 DE genes identified in seroconverting children relative to those remaining antibody negative. These genes contribute to T cell-, DC-, and B cell-related immune responses. Near-birth expression of ADCY9, PTCH1, MEX3B, IL15RA, ZNF714, TENM1, and PLEKHA5, along with HLA risk score predicted seroconversion (AUC 0.85). The ubiquitin-proteasome pathway linked DE genes and T1D susceptibility genes. Therefore, a gene expression signature in infancy predicts risk of seroconversion. Ubiquitination may play a mechanistic role in diabetes progression.
Mehdi, Ahmed M.; Hamilton-Williams, Emma E.; Cristino, Alexandre; Ziegler, Anette; Harris, Mark
2018-01-01
Autoimmune-mediated destruction of pancreatic islet β cells results in type 1 diabetes (T1D). Serum islet autoantibodies usually develop in genetically susceptible individuals in early childhood before T1D onset, with multiple islet autoantibodies predicting diabetes development. However, most at-risk children remain islet-antibody negative, and no test currently identifies those likely to seroconvert. We sought a genomic signature predicting seroconversion risk by integrating longitudinal peripheral blood gene expression profiles collected in high-risk children included in the BABYDIET and DIPP cohorts, of whom 50 seroconverted. Subjects were followed for 10 years to determine time of seroconversion. Any cohort effect and the time of seroconversion were corrected to uncover genes differentially expressed (DE) in seroconverting children. Gene expression signatures associated with seroconversion were evident during the first year of life, with 67 DE genes identified in seroconverting children relative to those remaining antibody negative. These genes contribute to T cell–, DC-, and B cell–related immune responses. Near-birth expression of ADCY9, PTCH1, MEX3B, IL15RA, ZNF714, TENM1, and PLEKHA5, along with HLA risk score predicted seroconversion (AUC 0.85). The ubiquitin-proteasome pathway linked DE genes and T1D susceptibility genes. Therefore, a gene expression signature in infancy predicts risk of seroconversion. Ubiquitination may play a mechanistic role in diabetes progression. PMID:29515040
An 8-gene qRT-PCR-based gene expression score that has prognostic value in early breast cancer
2010-01-01
Background Gene expression profiling may improve prognostic accuracy in patients with early breast cancer. Our objective was to demonstrate that it is possible to develop a simple molecular signature to predict distant relapse. Methods We included 153 patients with stage I-II hormonal receptor-positive breast cancer. RNA was isolated from formalin-fixed paraffin-embedded samples and qRT-PCR amplification of 83 genes was performed with gene expression assays. The genes we analyzed were those included in the 70-Gene Signature, the Recurrence Score and the Two-Gene Index. The association among gene expression, clinical variables and distant metastasis-free survival was analyzed using Cox regression models. Results An 8-gene prognostic score was defined. Distant metastasis-free survival at 5 years was 97% for patients defined as low-risk by the prognostic score versus 60% for patients defined as high-risk. The 8-gene score remained a significant factor in multivariate analysis and its performance was similar to that of two validated gene profiles: the 70-Gene Signature and the Recurrence Score. The validity of the signature was verified in independent cohorts obtained from the GEO database. Conclusions This study identifies a simple gene expression score that complements histopathological prognostic factors in breast cancer, and can be determined in paraffin-embedded samples. PMID:20584321
Predicting ionizing radiation exposure using biochemically-inspired genomic machine learning.
Zhao, Jonathan Z L; Mucaki, Eliseos J; Rogan, Peter K
2018-01-01
Background: Gene signatures derived from transcriptomic data using machine learning methods have shown promise for biodosimetry testing. These signatures may not be sufficiently robust for large scale testing, as their performance has not been adequately validated on external, independent datasets. The present study develops human and murine signatures with biochemically-inspired machine learning that are strictly validated using k-fold and traditional approaches. Methods: Gene Expression Omnibus (GEO) datasets of exposed human and murine lymphocytes were preprocessed via nearest neighbor imputation and expression of genes implicated in the literature to be responsive to radiation exposure (n=998) were then ranked by Minimum Redundancy Maximum Relevance (mRMR). Optimal signatures were derived by backward, complete, and forward sequential feature selection using Support Vector Machines (SVM), and validated using k-fold or traditional validation on independent datasets. Results: The best human signatures we derived exhibit k-fold validation accuracies of up to 98% ( DDB2 , PRKDC , TPP2 , PTPRE , and GADD45A ) when validated over 209 samples and traditional validation accuracies of up to 92% ( DDB2 , CD8A , TALDO1 , PCNA , EIF4G2 , LCN2 , CDKN1A , PRKCH , ENO1 , and PPM1D ) when validated over 85 samples. Some human signatures are specific enough to differentiate between chemotherapy and radiotherapy. Certain multi-class murine signatures have sufficient granularity in dose estimation to inform eligibility for cytokine therapy (assuming these signatures could be translated to humans). We compiled a list of the most frequently appearing genes in the top 20 human and mouse signatures. More frequently appearing genes among an ensemble of signatures may indicate greater impact of these genes on the performance of individual signatures. Several genes in the signatures we derived are present in previously proposed signatures. Conclusions: Gene signatures for ionizing radiation exposure derived by machine learning have low error rates in externally validated, independent datasets, and exhibit high specificity and granularity for dose estimation.
Li, Shuangshuang; Wu, Huan; Wang, Yi; Li, Xiaoqing; Guo, Yuxia; Liang, Shaoyan
2017-01-01
All-trans retinoic acid (ATRA) induces complete remission in almost all patients with acute promyelocytic leukemia (APL) via its ability to induce the in vivo differentiation of APL blasts. However, prolonged ATRA treatment can result in drug resistance. In previous studies, we generated a multi-drug-resistant HL60/ATRA cell line and found it to contain a new drug resistance-related gene segment, HA117. In this study, we demonstrate that ATRA induces multi-drug-resistant subpopulations of HL60 cells with a putative stem-like signature by up-regulating the expression of the new gene segment HA117. Western blot analysis and quantitative real-time PCR demonstrated that HA117 causes alternative splicing of regulator of G-protein signaling 6 (RGS6) and down-regulation of the expression of the GGL domain of RGS6, which plays an important role in DNA methyltransferase 1 (DNMT1) degradation. Moreover, DNMT1 expression was increased in multi-drug resistance HL60/ATRA cells. Knockdown of HA117 restored expression of the GGL domain and blocked DNMT1 expression. Moreover, resistant cells displayed a putative stem-like signature with increased expression of cancer steam cell markers CD133 and CD123. The stem cell marker, Nanog, was significantly up-regulated. In conclusion, our study shows that HA117 potentially promotes the stem-like signature of the HL60/ATRA cell line by inhibiting by the ubiquitination and degradation of DNMT1 and by down-regulating the expression of the GGL domain of RGS6. These results throw light on the cellular events associated with the ATRA-induced multi-drug resistance phenotype in acute leukemia. PMID:28665981
Clarke, Loren E; Flake, Darl D; Busam, Klaus; Cockerell, Clay; Helm, Klaus; McNiff, Jennifer; Reed, Jon; Tschen, Jaime; Kim, Jinah; Barnhill, Raymond; Elenitsas, Rosalie; Prieto, Victor G; Nelson, Jonathan; Kimbrell, Hillary; Kolquist, Kathryn A; Brown, Krystal L; Warf, M Bryan; Roa, Benjamin B; Wenstrup, Richard J
2017-02-15
Recently, a 23-gene signature was developed to produce a melanoma diagnostic score capable of differentiating malignant and benign melanocytic lesions. The primary objective of this study was to independently assess the ability of the gene signature to differentiate melanoma from benign nevi in clinically relevant lesions. A set of 1400 melanocytic lesions was selected from samples prospectively submitted for gene expression testing at a clinical laboratory. Each sample was tested and subjected to an independent histopathologic evaluation by 3 experienced dermatopathologists. A primary diagnosis (benign or malignant) was assigned to each sample, and diagnostic concordance among the 3 dermatopathologists was required for inclusion in analyses. The sensitivity and specificity of the score in differentiating benign and malignant melanocytic lesions were calculated to assess the association between the score and the pathologic diagnosis. The gene expression signature differentiated benign nevi from malignant melanoma with a sensitivity of 91.5% and a specificity of 92.5%. These results reflect the performance of the gene signature in a diverse array of samples encountered in routine clinical practice. Cancer 2017;123:617-628. © 2016 American Cancer Society. © 2016 Myriad Genetics, Inc. Cancer published by Wiley Periodicals, Inc. on behalf of American Cancer Society.
Elbahesh, Husni; Schughart, Klaus
2016-05-19
Influenza A viruses (IAV) are zoonotic pathogens that pose a major threat to human and animal health. Influenza virus disease severity is influenced by viral virulence factors as well as individual differences in host response. We analyzed gene expression changes in the blood of infected mice using a previously defined set of signature genes that was derived from changes in the blood transcriptome of IAV-infected human volunteers. We found that the human signature was reproduced well in the founder strains of the Collaborative Cross (CC) mice, thus demonstrating the relevance and importance of mouse experimental model systems for studying human influenza disease.
NASA Astrophysics Data System (ADS)
Figueroa, M. C.; Gregory, D. D.; Lyons, T. W.; Williford, K. H.
2017-12-01
Life processes affect trace element abundances in pyrite such that sedimentary and hydrothermal pyrite have significantly different trace element signatures. Thus, we propose that these biogeochemical data could be used to identify pyrite that formed biogenetically either early in our planet's history or on other planets, particularly Mars. The potential for this approach is elevated because pyrite is common in diverse sedimentary settings, and its trace element content can be preserved despite secondary overprints up to greenschist facies, thus minimizing the concerns about remobilization that can plague traditional whole rock studies. We are also including in-situ sulfur isotope analysis to further refine our understanding of the complex signatures of ancient pyrite. Sulfur isotope data can point straightforwardly to the involvement of life, because pyrite in sediments is inextricably linked to bacterial sulfate reduction and its diagnostic isotopic expressions. In addition to analyzing pyrite of known biological origin formed in the modern and ancient oceans under a range of conditions, we are building a data set for pyrite formed by hydrothermal and metamorphic processes to minimize the risk of false positives in life detection. We have used Random Forests (RF), a machine learning statistical technique with proven efficiency for classifying large geological datasets, to classify pyrite into biotic and abiotic end members. Coupling the trace element and sulfur isotope data from our analyses with a large existing dataset from diverse settings has yielded 4500 analyses with 18 different variables. Our initial results reveal the promise of the RF approach, correctly identifying biogenic pyrite 97 percent of the time. We will continue to couple new in-situ S-isotope and trace element analyses of biogenic pyrite grains from modern and ancient environments, using cutting-edge microanalytical techniques, with new data from high temperature settings. Our ultimately goal is a refined search tool with straightforward application in the search for early life on Earth and distant life recorded in meteorites, returned samples, and in situ measurements.
2014-01-01
Introduction Using genome-wide expression profiles of a prospective training cohort of breast cancer patients, ClinicoMolecular Triad Classification (CMTC) was recently developed to classify breast cancers into three clinically relevant groups to aid treatment decisions. CMTC was found to be both prognostic and predictive in a large external breast cancer cohort in that study. This study serves to validate the reproducibility of CMTC and its prognostic value using independent patient cohorts. Methods An independent internal cohort (n = 284) and a new external cohort (n = 2,181) were used to validate the association of CMTC between clinicopathological factors, 12 known gene signatures, two molecular subtype classifiers, and 19 oncogenic signalling pathway activities, and to reproduce the abilities of CMTC to predict clinical outcomes of breast cancer. In addition, we also updated the outcome data of the original training cohort (n = 147). Results The original training cohort reached a statistically significant difference (p < 0.05) in disease-free survivals between the three CMTC groups after an additional two years of follow-up (median = 55 months). The prognostic value of the triad classification was reproduced in the second independent internal cohort and the new external validation cohort. CMTC achieved even higher prognostic significance when all available patients were analyzed (n = 4,851). Oncogenic pathways Myc, E2F1, Ras and β-catenin were again implicated in the high-risk groups. Conclusions Both prospective internal cohorts and the independent external cohorts reproduced the triad classification of CMTC and its prognostic significance. CMTC is an independent prognostic predictor, and it outperformed 12 other known prognostic gene signatures, molecular subtype classifications, and all other standard prognostic clinicopathological factors. Our results support further development of CMTC portfolio into a guide for personalized breast cancer treatments. PMID:24996446
Scheid, Adam D; Van Keulen, Virginia P; Felts, Sara J; Neier, Steven C; Middha, Sumit; Nair, Asha A; Techentin, Robert W; Gilbert, Barry K; Jen, Jin; Neuhauser, Claudia; Zhang, Yuji; Pease, Larry R
2018-03-01
Human immunity exhibits remarkable heterogeneity among individuals, which engenders variable responses to immune perturbations in human populations. Population studies reveal that, in addition to interindividual heterogeneity, systemic immune signatures display longitudinal stability within individuals, and these signatures may reliably dictate how given individuals respond to immune perturbations. We hypothesize that analyzing relationships among these signatures at the population level may uncover baseline immune phenotypes that correspond with response outcomes to immune stimuli. To test this, we quantified global gene expression in peripheral blood CD4 + cells from healthy individuals at baseline and following CD3/CD28 stimulation at two time points 1 mo apart. Systemic CD4 + cell baseline and poststimulation molecular immune response signatures (MIRS) were defined by identifying genes expressed at levels that were stable between time points within individuals and differential among individuals in each state. Iterative differential gene expression analyses between all possible phenotypic groupings of at least three individuals using the baseline and stimulated MIRS gene sets revealed shared baseline and response phenotypic groupings, indicating the baseline MIRS contained determinants of immune responsiveness. Furthermore, significant numbers of shared phenotype-defining sets of determinants were identified in baseline data across independent healthy cohorts. Combining the cohorts and repeating the analyses resulted in identification of over 6000 baseline immune phenotypic groups, implying that the MIRS concept may be useful in many immune perturbation contexts. These findings demonstrate that patterns in complex gene expression variability can be used to define immune phenotypes and discover determinants of immune responsiveness. Copyright © 2018 by The American Association of Immunologists, Inc.
Siano, Marco; Espeli, Vittoria; Mach, Nicolas; Bossi, Paolo; Licitra, Lisa; Ghielmini, Michele; Frattini, Milo; Canevari, Silvana; De Cecco, Loris
2018-07-01
Platinum-based chemotherapy plus the anti-EGFR monoclonal antibody (mAb) cetuximab is used to treat recurrent/metastatic (RM) head-neck squamous cell carcinoma (HNSCC). Recently, we defined Cluster3 gene-expression signature as a potential predictor of favorable progression-free survival (PFS) in cetuximab-treated RM-HNSCC patients and predictor of partial metabolic FDG-PET response in an afatinib window-of-opportunity trial. Another anti-EGFR-mAb (panitumumab) was used as the treatment agent in RM-HNSCC patients in the phase II PANI01trial. PANI01 tumor samples were analyzed using functional genomics to explore response predictors to anti-EGFR therapy. Whole-gene expression and real-time PCR analyses were applied to pre-treatment samples from 25 PANI01 patients. Three gene signatures (Cluster3 score, RAS onco-signature, microenvironment score) and seven selected miRNAs were separately analyzed for association with panitumumab efficacy. Cluster3 expression levels had a profile with a significant bimodal separation of samples (P = 3.08 E-13). Higher RAS activation, microenvironment score, and miRNA expression were associated with low-Cluster3 patients. The same biomarkers were separately associated with PFS. Patients with high-Cluster3 had significantly longer PFS than patients with low-Cluster3 (median PFS: 174 versus 51 days; log-rank P = 0.0021). ROC analysis demonstrated accuracy in predicting PFS (AUC = 0.877). Despite differences in clinical settings and anti-EGFR inhibitors used for treatment, response prediction by the Cluster3 signature and selected miRNAs was essentially the same. Translation into a useful clinical assay requires validation in a broader setting. Copyright © 2018 Elsevier Ltd. All rights reserved.
Fuzzy Intervals for Designing Structural Signature: An Application to Graphic Symbol Recognition
NASA Astrophysics Data System (ADS)
Luqman, Muhammad Muzzamil; Delalandre, Mathieu; Brouard, Thierry; Ramel, Jean-Yves; Lladós, Josep
The motivation behind our work is to present a new methodology for symbol recognition. The proposed method employs a structural approach for representing visual associations in symbols and a statistical classifier for recognition. We vectorize a graphic symbol, encode its topological and geometrical information by an attributed relational graph and compute a signature from this structural graph. We have addressed the sensitivity of structural representations to noise, by using data adapted fuzzy intervals. The joint probability distribution of signatures is encoded by a Bayesian network, which serves as a mechanism for pruning irrelevant features and choosing a subset of interesting features from structural signatures of underlying symbol set. The Bayesian network is deployed in a supervised learning scenario for recognizing query symbols. The method has been evaluated for robustness against degradations & deformations on pre-segmented 2D linear architectural & electronic symbols from GREC databases, and for its recognition abilities on symbols with context noise i.e. cropped symbols.
Long-wave infrared profile feature extractor (PFx) sensor
NASA Astrophysics Data System (ADS)
Sartain, Ronald B.; Aliberti, Keith; Alexander, Troy; Chiu, David
2009-05-01
The Long Wave Infrared (LWIR) Profile Feature Extractor (PFx) sensor has evolved from the initial profiling sensor that was developed by the University of Memphis (Near IR) and the Army Research Laboratory (visible). This paper presents the initial signatures of the LWIR PFx for human with and without backpacks, human with animal (dog), and a number of other animals. The current version of the LWIR PFx sensor is a diverging optical tripwire sensor. The LWIR PFx signatures are compared to the signatures of the Profile Sensor in the visible and Near IR spectral regions. The LWIR PFx signatures were collected with two different un-cooled micro bolometer focal plane array cameras, where the individual pixels were used as stand alone detectors (a non imaging sensor). This approach results in a completely passive, much lower bandwidth, much longer battery life, low weight, small volume sensor that provides sufficient information to classify objects into human Vs non human categories with a 98.5% accuracy.
GeneSigDB: a manually curated database and resource for analysis of gene expression signatures
Culhane, Aedín C.; Schröder, Markus S.; Sultana, Razvan; Picard, Shaita C.; Martinelli, Enzo N.; Kelly, Caroline; Haibe-Kains, Benjamin; Kapushesky, Misha; St Pierre, Anne-Alyssa; Flahive, William; Picard, Kermshlise C.; Gusenleitner, Daniel; Papenhausen, Gerald; O'Connor, Niall; Correll, Mick; Quackenbush, John
2012-01-01
GeneSigDB (http://www.genesigdb.org or http://compbio.dfci.harvard.edu/genesigdb/) is a database of gene signatures that have been extracted and manually curated from the published literature. It provides a standardized resource of published prognostic, diagnostic and other gene signatures of cancer and related disease to the community so they can compare the predictive power of gene signatures or use these in gene set enrichment analysis. Since GeneSigDB release 1.0, we have expanded from 575 to 3515 gene signatures, which were collected and transcribed from 1604 published articles largely focused on gene expression in cancer, stem cells, immune cells, development and lung disease. We have made substantial upgrades to the GeneSigDB website to improve accessibility and usability, including adding a tag cloud browse function, facetted navigation and a ‘basket’ feature to store genes or gene signatures of interest. Users can analyze GeneSigDB gene signatures, or upload their own gene list, to identify gene signatures with significant gene overlap and results can be viewed on a dynamic editable heatmap that can be downloaded as a publication quality image. All data in GeneSigDB can be downloaded in numerous formats including .gmt file format for gene set enrichment analysis or as a R/Bioconductor data file. GeneSigDB is available from http://www.genesigdb.org. PMID:22110038
Reliable classification of high explosive and chemical/biological artillery using acoustic sensors
NASA Astrophysics Data System (ADS)
Desai, Sachi V.; Hohil, Myron E.; Bass, Henry E.; Chambers, Jim
2005-05-01
Feature extraction methods based on the discrete wavelet transform and multiresolution analysis are used to develop a robust classification algorithm that reliably discriminates between conventional and simulated chemical/biological artillery rounds via acoustic signals produced during detonation utilizing a generic acoustic sensor. Based on the transient properties of the signature blast distinct characteristics arise within the different acoustic signatures because high explosive warheads emphasize concussive and shrapnel effects, while chemical/biological warheads are designed to disperse their contents over large areas, therefore employing a slower burning, less intense explosive to mix and spread their contents. The ensuing blast waves are readily characterized by variations in the corresponding peak pressure and rise time of the blast, differences in the ratio of positive pressure amplitude to the negative amplitude, and variations in the overall duration of the resulting waveform. Unique attributes can also be identified that depend upon the properties of the gun tube, projectile speed at the muzzle, and the explosive burn rates of the warhead. The algorithm enables robust classification of various airburst signatures using acoustics. It is capable of being integrated within an existing chemical/biological sensor, a stand-alone generic sensor, or a part of a disparate sensor suite. When emplaced in high-threat areas, this added capability would further provide field personal with advanced battlefield knowledge without the aide of so-called "sniffer" sensors that rely upon air particle information based on direct contact with possible contaminated air. In this work, the discrete wavelet transform is used to extract the predominant components of these characteristics from air burst signatures at ranges exceeding 2km while maintaining temporal sequence of the data to keep relevance to the transient differences of the airburst signatures. Highly reliable discrimination is achieved with a feedforward neural network classifier trained on a feature space derived from the distribution of wavelet coefficients and higher frequency details found within different levels of the multiresolution decomposition the neural network then is capable of classifying new airburst signatures as Chemical/Biological or High Explosive.
NASA Astrophysics Data System (ADS)
da Silva, Flávio Altinier Maximiano; Pedrini, Helio
2015-03-01
Facial expressions are an important demonstration of humanity's humors and emotions. Algorithms capable of recognizing facial expressions and associating them with emotions were developed and employed to compare the expressions that different cultural groups use to show their emotions. Static pictures of predominantly occidental and oriental subjects from public datasets were used to train machine learning algorithms, whereas local binary patterns, histogram of oriented gradients (HOGs), and Gabor filters were employed to describe the facial expressions for six different basic emotions. The most consistent combination, formed by the association of HOG filter and support vector machines, was then used to classify the other cultural group: there was a strong drop in accuracy, meaning that the subtle differences of facial expressions of each culture affected the classifier performance. Finally, a classifier was trained with images from both occidental and oriental subjects and its accuracy was higher on multicultural data, evidencing the need of a multicultural training set to build an efficient classifier.
NASA Astrophysics Data System (ADS)
Zhou, Zhiheng; Liu, Haibai; Wang, Caixia; Lu, Qian; Huang, Qinhai; Zheng, Chanjiao; Lei, Yixiong
2015-10-01
Increasing evidence suggests that long non-coding RNAs (lncRNAs) are involved in a variety of physiological and pathophysiological processes. Our study was to investigate whether lncRNAs as novel expression signatures are able to modulate DNA damage and repair in cadmium(Cd) toxicity. There were aberrant expression profiles of lncRNAs in 35th Cd-induced cells as compared to untreated 16HBE cells. siRNA-mediated knockdown of ENST00000414355 inhibited the growth of DNA-damaged cells and decreased the expressions of DNA-damage related genes (ATM, ATR and ATRIP), while increased the expressions of DNA-repair related genes (DDB1, DDB2, OGG1, ERCC1, MSH2, RAD50, XRCC1 and BARD1). Cadmium increased ENST00000414355 expression in the lung of Cd-exposed rats in a dose-dependent manner. A significant positive correlation was observed between blood ENST00000414355 expression and urinary/blood Cd concentrations, and there were significant correlations of lncRNA-ENST00000414355 expression with the expressions of target genes in the lung of Cd-exposed rats and the blood of Cd exposed workers. These results indicate that some lncRNAs are aberrantly expressed in Cd-treated 16HBE cells. lncRNA-ENST00000414355 may serve as a signature for DNA damage and repair related to the epigenetic mechanisms underlying the cadmium toxicity and become a novel biomarker of cadmium toxicity.
Classification of spatially unresolved objects
NASA Technical Reports Server (NTRS)
Nalepka, R. F.; Horwitz, H. M.; Hyde, P. D.; Morgenstern, J. P.
1972-01-01
A proportion estimation technique for classification of multispectral scanner images is reported that uses data point averaging to extract and compute estimated proportions for a single average data point to classify spatial unresolved areas. Example extraction calculations of spectral signatures for bare soil, weeds, alfalfa, and barley prove quite accurate.
MicroRNA dynamics in the stages of tumorigenesis correlate with hallmark capabilities of cancer.
Olson, Peter; Lu, Jun; Zhang, Hao; Shai, Anny; Chun, Matthew G; Wang, Yucheng; Libutti, Steven K; Nakakura, Eric K; Golub, Todd R; Hanahan, Douglas
2009-09-15
While altered expression of microRNAs (miRs) in tumors has been well documented, it remains unclear how the miR transcriptome intersects neoplastic progression. By profiling the miR transcriptome we identified miR expression signatures associated with steps in tumorigenesis and the acquisition of hallmark capabilities in a prototypical mouse model of cancer. Metastases and a rare subset of primary tumors shared a distinct miR signature, implicating a discrete lineage for metastatic tumors. The miR-200 family is strongly down-regulated in metastases and met-like primary tumors, thereby relieving repression of the mesenchymal transcription factor Zeb1, which in turn suppresses E-cadherin. Treatment with a clinically approved angiogenesis inhibitor normalized angiogenic signature miRs in primary tumors, while altering expression of metastatic signature miRs similarly to liver metastases, suggesting their involvement in adaptive resistance to anti-angiogenic therapy via enhanced metastasis. Many of the miR changes associated with specific stages and hallmark capabilities in the mouse model are similarly altered in human tumors, including cognate pancreatic neuroendocrine tumors, implying a generality.
Repliscan: a tool for classifying replication timing regions.
Zynda, Gregory J; Song, Jawon; Concia, Lorenzo; Wear, Emily E; Hanley-Bowdoin, Linda; Thompson, William F; Vaughn, Matthew W
2017-08-07
Replication timing experiments that use label incorporation and high throughput sequencing produce peaked data similar to ChIP-Seq experiments. However, the differences in experimental design, coverage density, and possible results make traditional ChIP-Seq analysis methods inappropriate for use with replication timing. To accurately detect and classify regions of replication across the genome, we present Repliscan. Repliscan robustly normalizes, automatically removes outlying and uninformative data points, and classifies Repli-seq signals into discrete combinations of replication signatures. The quality control steps and self-fitting methods make Repliscan generally applicable and more robust than previous methods that classify regions based on thresholds. Repliscan is simple and effective to use on organisms with different genome sizes. Even with analysis window sizes as small as 1 kilobase, reliable profiles can be generated with as little as 2.4x coverage.
Safari, Mohammad Taghi; Chaleshi, Vahid; Tarban, Peyman; Nourian, Mahyar; Balaii, Hedieh; Shahrokh, Shabnam; Asadzadeh Aghdaei, Hamid
2017-01-01
In this study, we determined the gene expression analysis of IL-17 gene family for early detection of subclinical inflammation among IBD patients. Cytokines have a vital role in the pathogenesis of inflammatory bowel disease (IBD). Interleukin-17 is the signature cytokine of the recently identified T helper 17 (Th17) cell subset. IL-17F is mainly involved in mucosal host defense mechanisms whereas the functions of IL-17B remain largely elusive. In this cross-sectional study, IBD patients divided into two active and inactive groups. Peripheral blood mononuclear cells (PBMCs) from 38 IBD patients which 20 inactive samples and 18 active individuals were collected. Changes of IL-17 F and IL-17B mRNA expression level evaluated by quantitative-real time-PCR. mRNA expression level of IL-17B and IL-17F in CD, UC, active and inactive groups have been assessed and there were no significant differences (P>0.05). Patients were classified into five different categories as follows: i) 5ASA; ii) 5ASA + Pred; iii) 5ASA + AZA; iv) 5ASA + Pred + AZA; v) 5ASA + Pred + AZA + IFX according to medication usage, expression of IL-17F and IL-17B had no differences (p>0.05). Evaluation of IL-17B and IL-17F mRNA expression level illustrate no difference among active and inactive patients. Therefore, IL-17B and IL-17F are not biomarkers in an Iranian IBD patients.
Safari, Mohammad Taghi; Chaleshi, Vahid; Tarban, Peyman; Nourian, Mahyar; Balaii, Hedieh; Shahrokh, Shabnam; Asadzadeh Aghdaei, Hamid
2017-01-01
Aim: In this study, we determined the gene expression analysis of IL-17 gene family for early detection of subclinical inflammation among IBD patients. Background: Cytokines have a vital role in the pathogenesis of inflammatory bowel disease (IBD). Interleukin-17 is the signature cytokine of the recently identified T helper 17 (Th17) cell subset. IL-17F is mainly involved in mucosal host defense mechanisms whereas the functions of IL-17B remain largely elusive. Methods: In this cross-sectional study, IBD patients divided into two active and inactive groups. Peripheral blood mononuclear cells (PBMCs) from 38 IBD patients which 20 inactive samples and 18 active individuals were collected. Changes of IL-17 F and IL-17B mRNA expression level evaluated by quantitative-real time-PCR. Results: mRNA expression level of IL-17B and IL-17F in CD, UC, active and inactive groups have been assessed and there were no significant differences (P>0.05). Patients were classified into five different categories as follows: i) 5ASA; ii) 5ASA + Pred; iii) 5ASA + AZA; iv) 5ASA + Pred + AZA; v) 5ASA + Pred + AZA + IFX according to medication usage, expression of IL-17F and IL-17B had no differences (p>0.05). Conclusion: Evaluation of IL-17B and IL-17F mRNA expression level illustrate no difference among active and inactive patients. Therefore, IL-17B and IL-17F are not biomarkers in an Iranian IBD patients. PMID:29511476
Electronic Derivative Classifier/Reviewing Official
DOE Office of Scientific and Technical Information (OSTI.GOV)
Harris, Joshua C; McDuffie, Gregory P; Light, Ken L
2017-02-17
The electronic Derivative Classifier, Reviewing Official (eDC/RO) is a web based document management and routing system that reduces security risks and increases workflow efficiencies. The system automates the upload, notification review request, and document status tracking of documents for classification review on a secure server. It supports a variety of document formats (i.e., pdf, doc, docx, xls, xlsx, xlsm, ppt, pptx, vsd, vsdx and txt), and allows for the dynamic placement of classification markings such as the classification level, category and caveats on the document, in addition to a document footer and digital signature.
The Human Airway Epithelial Basal Cell Transcriptome
Wang, Rui; Zwick, Rachel K.; Ferris, Barbara; Witover, Bradley; Salit, Jacqueline; Crystal, Ronald G.
2011-01-01
Background The human airway epithelium consists of 4 major cell types: ciliated, secretory, columnar and basal cells. During natural turnover and in response to injury, the airway basal cells function as stem/progenitor cells for the other airway cell types. The objective of this study is to better understand human airway epithelial basal cell biology by defining the gene expression signature of this cell population. Methodology/Principal Findings Bronchial brushing was used to obtain airway epithelium from healthy nonsmokers. Microarrays were used to assess the transcriptome of basal cells purified from the airway epithelium in comparison to the transcriptome of the differentiated airway epithelium. This analysis identified the “human airway basal cell signature” as 1,161 unique genes with >5-fold higher expression level in basal cells compared to differentiated epithelium. The basal cell signature was suppressed when the basal cells differentiated into a ciliated airway epithelium in vitro. The basal cell signature displayed overlap with genes expressed in basal-like cells from other human tissues and with that of murine airway basal cells. Consistent with self-modulation as well as signaling to other airway cell types, the human airway basal cell signature was characterized by genes encoding extracellular matrix components, growth factors and growth factor receptors, including genes related to the EGF and VEGF pathways. Interestingly, while the basal cell signature overlaps that of basal-like cells of other organs, the human airway basal cell signature has features not previously associated with this cell type, including a unique pattern of genes encoding extracellular matrix components, G protein-coupled receptors, neuroactive ligands and receptors, and ion channels. Conclusion/Significance The human airway epithelial basal cell signature identified in the present study provides novel insights into the molecular phenotype and biology of the stem/progenitor cells of the human airway epithelium. PMID:21572528
Immune signatures of protective spleen memory CD8 T cells.
Brinza, Lilia; Djebali, Sophia; Tomkowiak, Martine; Mafille, Julien; Loiseau, Céline; Jouve, Pierre-Emmanuel; de Bernard, Simon; Buffat, Laurent; Lina, Bruno; Ottmann, Michèle; Rosa-Calatrava, Manuel; Schicklin, Stéphane; Bonnefoy, Nathalie; Lauvau, Grégoire; Grau, Morgan; Wencker, Mélanie; Arpin, Christophe; Walzer, Thierry; Leverrier, Yann; Marvel, Jacqueline
2016-11-24
Memory CD8 T lymphocyte populations are remarkably heterogeneous and differ in their ability to protect the host. In order to identify the whole range of qualities uniquely associated with protective memory cells we compared the gene expression signatures of two qualities of memory CD8 T cells sharing the same antigenic-specificity: protective (Influenza-induced, Flu-TM) and non-protective (peptide-induced, TIM) spleen memory CD8 T cells. Although Flu-TM and TIM express classical phenotypic memory markers and are polyfunctional, only Flu-TM protects against a lethal viral challenge. Protective memory CD8 T cells express a unique set of genes involved in migration and survival that correlate with their unique capacity to rapidly migrate within the infected lung parenchyma in response to influenza infection. We also enlighten a new set of poised genes expressed by protective cells that is strongly enriched in cytokines and chemokines such as Ccl1, Ccl9 and Gm-csf. CCL1 and GM-CSF genes are also poised in human memory CD8 T cells. These immune signatures are also induced by two other pathogens (vaccinia virus and Listeria monocytogenes). The immune signatures associated with immune protection were identified on circulating cells, i.e. those that are easily accessible for immuno-monitoring and could help predict vaccines efficacy.
The goal of this project is to use an eight-gene expression profile to define functional signatures for small molecules and natural products with heretofore undefined mechanism of action. Two genes in the eight gene set are used as internal controls and do not vary across gene expression array data collected from the public domain. The remaining six genes are found to vary independently across a large collection of publically available gene expression array datasets. Read the abstract
Schlessinger, Karni; Li, Wenyu; Tan, Yejun; Liu, Franklin; Souza, Sandra C; Tozzo, Effie; Liu, Kevin; Thompson, John R; Wang, Liangsu; Muise, Eric S
2015-09-01
Identify a gene expression signature in white adipose tissue (WAT) that reports on WAT browning and is associated with a healthy phenotype. RNA from several different adipose depots across three species were analyzed by whole transcriptome profiling, including 1) mouse subcutaneous white fat, brown fat, and white fat after in vivo treatment with FGF21; 2) human subcutaneous and omental fat from insulin-sensitive and insulin-resistant patients; and 3) rhesus monkey subcutaneous fat from healthy and dysmetabolic individuals. A "browning" signature in mice was identified by cross-referencing the FGF21-induced signature in WAT with the brown adipose tissue (BAT) vs. WAT comparison. In addition, gene expression levels in WAT from insulin-sensitive/healthy vs. insulin-resistant/dysmetabolic humans and rhesus monkeys, respectively, correlated with the gene expression levels in mouse BAT vs. WAT. A subset of 49 genes were identified that were consistently regulated or differentially expressed in the mouse and human data sets that could be used to monitor browning of WAT across species. Gene expression profiles of WATs from healthy insulin-sensitive individuals correlate with those of BAT and FGF21-induced browning of WAT. © 2015 The Obesity Society.
Algorithms exploiting ultrasonic sensors for subject classification
NASA Astrophysics Data System (ADS)
Desai, Sachi; Quoraishee, Shafik
2009-09-01
Proposed here is a series of techniques exploiting micro-Doppler ultrasonic sensors capable of characterizing various detected mammalian targets based on their physiological movements captured a series of robust features. Employed is a combination of unique and conventional digital signal processing techniques arranged in such a manner they become capable of classifying a series of walkers. These processes for feature extraction develops a robust feature space capable of providing discrimination of various movements generated from bipeds and quadrupeds and further subdivided into large or small. These movements can be exploited to provide specific information of a given signature dividing it in a series of subset signatures exploiting wavelets to generate start/stop times. After viewing a series spectrograms of the signature we are able to see distinct differences and utilizing kurtosis, we generate an envelope detector capable of isolating each of the corresponding step cycles generated during a walk. The walk cycle is defined as one complete sequence of walking/running from the foot pushing off the ground and concluding when returning to the ground. This time information segments the events that are readily seen in the spectrogram but obstructed in the temporal domain into individual walk sequences. This walking sequence is then subsequently translated into a three dimensional waterfall plot defining the expected energy value associated with the motion at particular instance of time and frequency. The value is capable of being repeatable for each particular class and employable to discriminate the events. Highly reliable classification is realized exploiting a classifier trained on a candidate sample space derived from the associated gyrations created by motion from actors of interest. The classifier developed herein provides a capability to classify events as an adult humans, children humans, horses, and dogs at potentially high rates based on the tested sample space. The algorithm developed and described will provide utility to an underused sensor modality for human intrusion detection because of the current high-rate of generated false alarms. The active ultrasonic sensor coupled in a multi-modal sensor suite with binary, less descriptive sensors like seismic devices realizing a greater accuracy rate for detection of persons of interest for homeland purposes.
Axelsson, Annika S; Tubbs, Emily; Mecham, Brig; Chacko, Shaji; Nenonen, Hannah A; Tang, Yunzhao; Fahey, Jed W; Derry, Jonathan M J; Wollheim, Claes B; Wierup, Nils; Haymond, Morey W; Friend, Stephen H; Mulder, Hindrik; Rosengren, Anders H
2017-06-14
A potentially useful approach for drug discovery is to connect gene expression profiles of disease-affected tissues ("disease signatures") to drug signatures, but it remains to be shown whether it can be used to identify clinically relevant treatment options. We analyzed coexpression networks and genetic data to identify a disease signature for type 2 diabetes in liver tissue. By interrogating a library of 3800 drug signatures, we identified sulforaphane as a compound that may reverse the disease signature. Sulforaphane suppressed glucose production from hepatic cells by nuclear translocation of nuclear factor erythroid 2-related factor 2 (NRF2) and decreased expression of key enzymes in gluconeogenesis. Moreover, sulforaphane reversed the disease signature in the livers from diabetic animals and attenuated exaggerated glucose production and glucose intolerance by a magnitude similar to that of metformin. Finally, sulforaphane, provided as concentrated broccoli sprout extract, reduced fasting blood glucose and glycated hemoglobin (HbA1c) in obese patients with dysregulated type 2 diabetes. Copyright © 2017 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.
Esposito, Gianluca; Del Carmen Rostagno, Maria; Venuti, Paola; Haltigan, John D; Messinger, Daniel S
2014-04-01
Previous studies have provided preliminary evidence that disruptions in cry acoustics may be part of an atypical vocal signature of autism early in life. We examined the acoustic characteristics of cries extracted from the separation phase of the strange situation procedure in a sample of toddler of younger siblings of a child with autism spectrum disorder-autism spectrum disorders (ASD) (high risk, HR) and a low risk (LR) group. Cry samples derived from vocal recordings of 15-month-old HR (n = 13) and LR infants (n = 14) were subjected to acoustic analyses. HR toddlers, compared to those with LR, produced cries that were shorter and had a higher fundamental frequency (F0). Three HR toddlers later classified with an ASD at 36 months (autistic disorder in all cases) produced cries that had among the highest F0 and shortest durations. Taken together these results indicate that toddlers at high risk for ASD (and those with an ASD) express atypical patterns of distress in response a social stressor. Implications for early diagnosis and parenting are discussed.
Kimura, S; Yamakami-Kimura, M; Obata, Y; Hase, K; Kitamura, H; Ohno, H; Iwanaga, T
2015-05-01
The microfold (M) cell residing in the follicle-associated epithelium is a specialized epithelial cell that initiates mucosal immune responses by sampling luminal antigens. The differentiation process of M cells remains unclear due to limitations of analytical methods. Here we found that M cells were classified into two functionally different subtypes based on the expression of Glycoprotein 2 (GP2) by newly developed image cytometric analysis. GP2-high M cells actively took up luminal microbeads, whereas GP2-negative or low cells scarcely ingested them, even though both subsets equally expressed the other M-cell signature genes, suggesting that GP2-high M cells represent functionally mature M cells. Further, the GP2-high mature M cells were abundant in Peyer's patch but sparse in the cecal patch: this was most likely due to a decrease in the nuclear translocation of RelB, a downstream transcription factor for the receptor activator of nuclear factor-κB signaling. Given that murine cecum contains a protrusion of beneficial commensals, the restriction of M-cell activity might contribute to preventing the onset of any excessive immune response to the commensals through decelerating the M-cell-dependent uptake of microorganisms.
eXpression2Kinases (X2K) Web: linking expression signatures to upstream cell signaling networks.
Clarke, Daniel J B; Kuleshov, Maxim V; Schilder, Brian M; Torre, Denis; Duffy, Mary E; Keenan, Alexandra B; Lachmann, Alexander; Feldmann, Axel S; Gundersen, Gregory W; Silverstein, Moshe C; Wang, Zichen; Ma'ayan, Avi
2018-05-25
While gene expression data at the mRNA level can be globally and accurately measured, profiling the activity of cell signaling pathways is currently much more difficult. eXpression2Kinases (X2K) computationally predicts involvement of upstream cell signaling pathways, given a signature of differentially expressed genes. X2K first computes enrichment for transcription factors likely to regulate the expression of the differentially expressed genes. The next step of X2K connects these enriched transcription factors through known protein-protein interactions (PPIs) to construct a subnetwork. The final step performs kinase enrichment analysis on the members of the subnetwork. X2K Web is a new implementation of the original eXpression2Kinases algorithm with important enhancements. X2K Web includes many new transcription factor and kinase libraries, and PPI networks. For demonstration, thousands of gene expression signatures induced by kinase inhibitors, applied to six breast cancer cell lines, are provided for fetching directly into X2K Web. The results are displayed as interactive downloadable vector graphic network images and bar graphs. Benchmarking various settings via random permutations enabled the identification of an optimal set of parameters to be used as the default settings in X2K Web. X2K Web is freely available from http://X2K.cloud.
Developing processing techniques for Skylab data
NASA Technical Reports Server (NTRS)
Nalepka, R. F. (Principal Investigator); Malila, W. A.; Morgenstern, J. P.
1975-01-01
The author has identified the following significant results. The effects of misregistration and the scan-line-straightening algorithm on multispectral data were found to be: (1) there is greatly increased misregistration in scan-line-straightening data over conic data; (2) scanner caused misregistration between any pairs of channels may not be corrected for in scan-line-straightened data; and (3) this data will have few pure field center pixels than will conic data. A program SIMSIG was developed implementing the signature simulation model. Data processing stages of the experiment were carried out, and an analysis was made of the effects of spatial misregistration on field center classification accuracy. Fifteen signatures originally used for classifying the data were analyzed, showing the following breakdown: corn (4 signatures), trees (2), brush (1), grasses, weeds, etc. (5), bare soil (1), soybeans (1), and alfalfa (1).
Modelling of celestial backgrounds
NASA Astrophysics Data System (ADS)
Hickman, Duncan L.; Smith, Moira I.; Lim, Jae-Wan; Jeon, Yun-Ho
2018-05-01
For applications where a sensor's image includes the celestial background, stars and Solar System Bodies compromise the ability of the sensor system to correctly classify a target. Such false targets are particularly significant for the detection of weak target signatures which only have a small relative angular motion. The detection of celestial features is well established in the visible spectral band. However, given the increasing sensitivity and low noise afforded by emergent infrared focal plane array technology together with larger and more efficient optics, the signatures of celestial features can also impact performance at infrared wavelengths. A methodology has been developed which allows the rapid generation of celestial signatures in any required spectral band using star data from star catalogues and other open-source information. Within this paper, the radiometric calculations are presented to determine the irradiance values of stars and planets in any spectral band.
Establishment of a 12-gene expression signature to predict colon cancer prognosis
Zhao, Guangxi; Dong, Pingping; Wu, Bingrui
2018-01-01
A robust and accurate gene expression signature is essential to assist oncologists to determine which subset of patients at similar Tumor-Lymph Node-Metastasis (TNM) stage has high recurrence risk and could benefit from adjuvant therapies. Here we applied a two-step supervised machine-learning method and established a 12-gene expression signature to precisely predict colon adenocarcinoma (COAD) prognosis by using COAD RNA-seq transcriptome data from The Cancer Genome Atlas (TCGA). The predictive performance of the 12-gene signature was validated with two independent gene expression microarray datasets: GSE39582 includes 566 COAD cases for the development of six molecular subtypes with distinct clinical, molecular and survival characteristics; GSE17538 is a dataset containing 232 colon cancer patients for the generation of a metastasis gene expression profile to predict recurrence and death in COAD patients. The signature could effectively separate the poor prognosis patients from good prognosis group (disease specific survival (DSS): Kaplan Meier (KM) Log Rank p = 0.0034; overall survival (OS): KM Log Rank p = 0.0336) in GSE17538. For patients with proficient mismatch repair system (pMMR) in GSE39582, the signature could also effectively distinguish high risk group from low risk group (OS: KM Log Rank p = 0.005; Relapse free survival (RFS): KM Log Rank p = 0.022). Interestingly, advanced stage patients were significantly enriched in high 12-gene score group (Fisher’s exact test p = 0.0003). After stage stratification, the signature could still distinguish poor prognosis patients in GSE17538 from good prognosis within stage II (Log Rank p = 0.01) and stage II & III (Log Rank p = 0.017) in the outcome of DFS. Within stage III or II/III pMMR patients treated with Adjuvant Chemotherapies (ACT) and patients with higher 12-gene score showed poorer prognosis (III, OS: KM Log Rank p = 0.046; III & II, OS: KM Log Rank p = 0.041). Among stage II/III pMMR patients with lower 12-gene scores in GSE39582, the subgroup receiving ACT showed significantly longer OS time compared with those who received no ACT (Log Rank p = 0.021), while there is no obvious difference between counterparts among patients with higher 12-gene scores (Log Rank p = 0.12). Besides COAD, our 12-gene signature is multifunctional in several other cancer types including kidney cancer, lung cancer, uveal and skin melanoma, brain cancer, and pancreatic cancer. Functional classification showed that seven of the twelve genes are involved in immune system function and regulation, so our 12-gene signature could potentially be used to guide decisions about adjuvant therapy for patients with stage II/III and pMMR COAD.
Asmal, Mohammed; Hellmann, Ina; Liu, Weimin; Keele, Brandon F.; Perelson, Alan S.; Bhattacharya, Tanmoy; Gnanakaran, S.; Daniels, Marcus; Haynes, Barton F.; Korber, Bette T.; Hahn, Beatrice H.; Shaw, George M.; Letvin, Norman L.
2011-01-01
Mucosal transmission of the human immunodeficiency virus (HIV) results in a bottleneck in viral genetic diversity. Gnanakaran and colleagues used a computational strategy to identify signature amino acids at particular positions in Envelope that were associated either with transmitted sequences sampled very early in infection, or sequences sampled during chronic infection. Among the strongest signatures observed was an enrichment for the stable presence of histidine at position 12 at transmission and in early infection, and a recurrent loss of histidine at position 12 in chronic infection. This amino acid lies within the leader peptide of Envelope, a region of the protein that has been shown to influence envelope glycoprotein expression and virion infectivity. We show a strong association between a positively charged amino acid like histidine at position 12 in transmitted/founder viruses with more efficient trafficking of the nascent envelope polypeptide to the endoplasmic reticulum and higher steady-state glycoprotein expression compared to viruses that have a non-basic position 12 residue, a substitution that was enriched among viruses sampled from chronically infected individuals. When expressed in the context of other viral proteins, transmitted envelopes with a basic amino acid position 12 were incorporated at higher density into the virus and exhibited higher infectious titers than did non-signature envelopes. These results support the potential utility of using a computational approach to examine large viral sequence data sets for functional signatures and indicate the importance of Envelope expression levels for efficient HIV transmission. PMID:21876761
Tamez-Peña, Jose-Gerardo; Rodriguez-Rojas, Juan-Andrés; Gomez-Rueda, Hugo; Celaya-Padilla, Jose-Maria; Rivera-Prieto, Roxana-Alicia; Palacios-Corona, Rebeca; Garza-Montemayor, Margarita; Cardona-Huerta, Servando; Treviño, Victor
2018-01-01
In breast cancer, well-known gene expression subtypes have been related to a specific clinical outcome. However, their impact on the breast tissue phenotype has been poorly studied. Here, we investigate the association of imaging data of tumors to gene expression signatures from 71 patients with breast cancer that underwent pre-treatment digital mammograms and tumor biopsies. From digital mammograms, a semi-automated radiogenomics analysis generated 1,078 features describing the shape, signal distribution, and texture of tumors along their contralateral image used as control. From tumor biopsy, we estimated the OncotypeDX and PAM50 recurrence scores using gene expression microarrays. Then, we used multivariate analysis under stringent cross-validation to train models predicting recurrence scores. Few univariate features reached Spearman correlation coefficients above 0.4. Nevertheless, multivariate analysis yielded significantly correlated models for both signatures (correlation of OncotypeDX = 0.49 ± 0.07 and PAM50 = 0.32 ± 0.10 in stringent cross-validation and OncotypeDX = 0.83 and PAM50 = 0.78 for a unique model). Equivalent models trained from the unaffected contralateral breast were not correlated suggesting that the image signatures were tumor-specific and that overfitting was not a considerable issue. We also noted that models were improved by combining clinical information (triple negative status and progesterone receptor). The models used mostly wavelets and fractal features suggesting their importance to capture tumor information. Our results suggest that molecular-based recurrence risk and breast cancer subtypes have observable radiographic phenotypes. To our knowledge, this is the first study associating mammographic information to gene expression recurrence signatures.
Tamez-Peña, Jose-Gerardo; Rodriguez-Rojas, Juan-Andrés; Gomez-Rueda, Hugo; Celaya-Padilla, Jose-Maria; Rivera-Prieto, Roxana-Alicia; Palacios-Corona, Rebeca; Garza-Montemayor, Margarita; Cardona-Huerta, Servando
2018-01-01
In breast cancer, well-known gene expression subtypes have been related to a specific clinical outcome. However, their impact on the breast tissue phenotype has been poorly studied. Here, we investigate the association of imaging data of tumors to gene expression signatures from 71 patients with breast cancer that underwent pre-treatment digital mammograms and tumor biopsies. From digital mammograms, a semi-automated radiogenomics analysis generated 1,078 features describing the shape, signal distribution, and texture of tumors along their contralateral image used as control. From tumor biopsy, we estimated the OncotypeDX and PAM50 recurrence scores using gene expression microarrays. Then, we used multivariate analysis under stringent cross-validation to train models predicting recurrence scores. Few univariate features reached Spearman correlation coefficients above 0.4. Nevertheless, multivariate analysis yielded significantly correlated models for both signatures (correlation of OncotypeDX = 0.49 ± 0.07 and PAM50 = 0.32 ± 0.10 in stringent cross-validation and OncotypeDX = 0.83 and PAM50 = 0.78 for a unique model). Equivalent models trained from the unaffected contralateral breast were not correlated suggesting that the image signatures were tumor-specific and that overfitting was not a considerable issue. We also noted that models were improved by combining clinical information (triple negative status and progesterone receptor). The models used mostly wavelets and fractal features suggesting their importance to capture tumor information. Our results suggest that molecular-based recurrence risk and breast cancer subtypes have observable radiographic phenotypes. To our knowledge, this is the first study associating mammographic information to gene expression recurrence signatures. PMID:29596496
MISR Level 2 TOA/Cloud Classifier parameters (MIL2TCCL_V2)
NASA Technical Reports Server (NTRS)
Diner, David J. (Principal Investigator)
The TOA/Cloud Classifiers contain the Angular Signature Cloud Mask (ASCM), a scene classifier calculated using support vector machine technology (SVM) both of which are on a 1.1 km grid, and cloud fractions at 17.6 km resolution that are available in different height bins (low, middle, high) and are also calculated on an angle-by-angle basis. [Location=GLOBAL] [Temporal_Coverage: Start_Date=2000-02-24; Stop_Date=] [Spatial_Coverage: Southernmost_Latitude=-90; Northernmost_Latitude=90; Westernmost_Longitude=-180; Easternmost_Longitude=180] [Data_Resolution: Latitude_Resolution=17.6 km; Longitude_Resolution=17.6 km; Horizontal_Resolution_Range=10 km - < 50 km or approximately .09 degree - < .5 degree; Temporal_Resolution=about 15 orbits/day; Temporal_Resolution_Range=Daily - < Weekly, Daily - < Weekly].
Targeted proteomics identifies liquid-biopsy signatures for extracapsular prostate cancer
Kim, Yunee; Jeon, Jouhyun; Mejia, Salvador; Yao, Cindy Q; Ignatchenko, Vladimir; Nyalwidhe, Julius O; Gramolini, Anthony O; Lance, Raymond S; Troyer, Dean A; Drake, Richard R; Boutros, Paul C; Semmes, O. John; Kislinger, Thomas
2016-01-01
Biomarkers are rapidly gaining importance in personalized medicine. Although numerous molecular signatures have been developed over the past decade, there is a lack of overlap and many biomarkers fail to validate in independent patient cohorts and hence are not useful for clinical application. For these reasons, identification of novel and robust biomarkers remains a formidable challenge. We combine targeted proteomics with computational biology to discover robust proteomic signatures for prostate cancer. Quantitative proteomics conducted in expressed prostatic secretions from men with extraprostatic and organ-confined prostate cancers identified 133 differentially expressed proteins. Using synthetic peptides, we evaluate them by targeted proteomics in a 74-patient cohort of expressed prostatic secretions in urine. We quantify a panel of 34 candidates in an independent 207-patient cohort. We apply machine-learning approaches to develop clinical predictive models for prostate cancer diagnosis and prognosis. Our results demonstrate that computationally guided proteomics can discover highly accurate non-invasive biomarkers. PMID:27350604
Nakaoka, Hirofumi; Tajima, Atsushi; Yoneyama, Taku; Hosomichi, Kazuyoshi; Kasuya, Hidetoshi; Mizutani, Tohru; Inoue, Ituro
2014-08-01
The rupture of intracranial aneurysm (IA) causes subarachnoid hemorrhage associated with high morbidity and mortality. We compared gene expression profiles in aneurysmal domes between unruptured IAs and ruptured IAs (RIAs) to elucidate biological mechanisms predisposing to the rupture of IA. We determined gene expression levels of 8 RIAs, 5 unruptured IAs, and 10 superficial temporal arteries with the Agilent microarrays. To explore biological heterogeneity of IAs, we classified the samples into subgroups showing similar gene expression patterns, using clustering methods. The clustering analysis identified 4 groups: superficial temporal arteries and unruptured IAs were aggregated into their own clusters, whereas RIAs segregated into 2 distinct subgroups (early and late RIAs). Comparing gene expression levels between early RIAs and unruptured IAs, we identified 430 upregulated and 617 downregulated genes in early RIAs. The upregulated genes were associated with inflammatory and immune responses and phagocytosis including S100/calgranulin genes (S100A8, S100A9, and S100A12). The downregulated genes suggest mechanical weakness of aneurysm walls. The expressions of Krüppel-like family of transcription factors (KLF2, KLF12, and KLF15), which were anti-inflammatory regulators, and CDKN2A, which was located on chromosome 9p21 that was the most consistently replicated locus in genome-wide association studies of IA, were also downregulated. We demonstrate that gene expression patterns of RIAs were different according to the age of patients. The results suggest that macrophage-mediated inflammation is a key biological pathway for IA rupture. The identified genes can be good candidates for molecular markers of rupture-prone IAs and therapeutic targets. © 2014 American Heart Association, Inc.
Genome-wide screen identifies a novel prognostic signature for breast cancer survival
Mao, Xuan Y.; Lee, Matthew J.; Zhu, Jeffrey; ...
2017-01-21
Large genomic datasets in combination with clinical data can be used as an unbiased tool to identify genes important in patient survival and discover potential therapeutic targets. We used a genome-wide screen to identify 587 genes significantly and robustly deregulated across four independent breast cancer (BC) datasets compared to normal breast tissue. Gene expression of 381 genes was significantly associated with relapse-free survival (RFS) in BC patients. We used a gene co-expression network approach to visualize the genetic architecture in normal breast and BCs. In normal breast tissue, co-expression cliques were identified enriched for cell cycle, gene transcription, cell adhesion,more » cytoskeletal organization and metabolism. In contrast, in BC, only two major co-expression cliques were identified enriched for cell cycle-related processes or blood vessel development, cell adhesion and mammary gland development processes. Interestingly, gene expression levels of 7 genes were found to be negatively correlated with many cell cycle related genes, highlighting these genes as potential tumor suppressors and novel therapeutic targets. A forward-conditional Cox regression analysis was used to identify a 12-gene signature associated with RFS. A prognostic scoring system was created based on the 12-gene signature. This scoring system robustly predicted BC patient RFS in 60 sampling test sets and was further validated in TCGA and METABRIC BC data. Our integrated study identified a 12-gene prognostic signature that could guide adjuvant therapy for BC patients and includes novel potential molecular targets for therapy.« less
Genome-wide screen identifies a novel prognostic signature for breast cancer survival
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mao, Xuan Y.; Lee, Matthew J.; Zhu, Jeffrey
Large genomic datasets in combination with clinical data can be used as an unbiased tool to identify genes important in patient survival and discover potential therapeutic targets. We used a genome-wide screen to identify 587 genes significantly and robustly deregulated across four independent breast cancer (BC) datasets compared to normal breast tissue. Gene expression of 381 genes was significantly associated with relapse-free survival (RFS) in BC patients. We used a gene co-expression network approach to visualize the genetic architecture in normal breast and BCs. In normal breast tissue, co-expression cliques were identified enriched for cell cycle, gene transcription, cell adhesion,more » cytoskeletal organization and metabolism. In contrast, in BC, only two major co-expression cliques were identified enriched for cell cycle-related processes or blood vessel development, cell adhesion and mammary gland development processes. Interestingly, gene expression levels of 7 genes were found to be negatively correlated with many cell cycle related genes, highlighting these genes as potential tumor suppressors and novel therapeutic targets. A forward-conditional Cox regression analysis was used to identify a 12-gene signature associated with RFS. A prognostic scoring system was created based on the 12-gene signature. This scoring system robustly predicted BC patient RFS in 60 sampling test sets and was further validated in TCGA and METABRIC BC data. Our integrated study identified a 12-gene prognostic signature that could guide adjuvant therapy for BC patients and includes novel potential molecular targets for therapy.« less
Sohn, Bo Hwa; Shim, Jae-Jun; Kim, Sang-Bae; Jang, Kyu Yun; Kim, Soo Mi; Kim, Ji Hoon; Hwang, Jun Eul; Jang, Hee-Jin; Lee, Hyun-Sung; Kim, Sang-Cheol; Jeong, Woojin; Kim, Sung Soo; Park, Eun Sung; Heo, Jeonghoon; Kim, Yoon Jun; Kim, Dae-Ghon; Leem, Sun-Hee; Kaseb, Ahmed; Hassan, Manal M; Cha, Minse; Chu, In-Sun; Johnson, Randy L; Park, Yun-Yong; Lee, Ju-Seog
2016-03-01
The Hippo pathway is a tumor suppressor in the liver. However, the clinical significance of Hippo pathway inactivation in HCC is not clearly defined. We analyzed genomic data from human and mouse tissues to determine clinical relevance of Hippo pathway inactivation in HCC. We analyzed gene expression data from Mst1/2(-/-) and Sav1(-/-) mice and identified a 610-gene expression signature reflecting Hippo pathway inactivation in the liver [silence of Hippo (SOH) signature]. By integrating gene expression data from mouse models with those from human HCC tissues, we developed a prediction model that could identify HCC patients with an inactivated Hippo pathway and used it to test its significance in HCC patients, via univariate and multivariate Cox analyses. HCC patients (National Cancer Institute cohort, n = 113) with the SOH signature had a significantly poorer prognosis than those without the SOH signature [P < 0.001 for overall survival (OS)]. The significant association of the signature with poor prognosis was further validated in the Korean (n = 100, P = 0.006 for OS) and Fudan University cohorts (n = 242, P = 0.001 for OS). On multivariate analysis, the signature was an independent predictor of recurrence-free survival (HR, 1.6; 95% confidence interval, 1.12-2.28: P = 0.008). We also demonstrated significant concordance between the SOH HCC subtype and the hepatic stem cell HCC subtype that had been identified in a previous study (P < 0.001). Inactivation of the Hippo pathway in HCC is significantly associated with poor prognosis. ©2015 American Association for Cancer Research.
Auerbach, Scott S; Phadke, Dhiral P; Mav, Deepak; Holmgren, Stephanie; Gao, Yuan; Xie, Bin; Shin, Joo Heon; Shah, Ruchir R; Merrick, B Alex; Tice, Raymond R
2015-07-01
Formalin-fixed, paraffin-embedded (FFPE) pathology specimens represent a potentially vast resource for transcriptomic-based biomarker discovery. We present here a comparison of results from a whole transcriptome RNA-Seq analysis of RNA extracted from fresh frozen and FFPE livers. The samples were derived from rats exposed to aflatoxin B1 (AFB1 ) and a corresponding set of control animals. Principal components analysis indicated that samples were separated in the two groups representing presence or absence of chemical exposure, both in fresh frozen and FFPE sample types. Sixty-five percent of the differentially expressed transcripts (AFB1 vs. controls) in fresh frozen samples were also differentially expressed in FFPE samples (overlap significance: P < 0.0001). Genomic signature and gene set analysis of AFB1 differentially expressed transcript lists indicated highly similar results between fresh frozen and FFPE at the level of chemogenomic signatures (i.e., single chemical/dose/duration elicited transcriptomic signatures), mechanistic and pathology signatures, biological processes, canonical pathways and transcription factor networks. Overall, our results suggest that similar hypotheses about the biological mechanism of toxicity would be formulated from fresh frozen and FFPE samples. These results indicate that phenotypically anchored archival specimens represent a potentially informative resource for signature-based biomarker discovery and mechanistic characterization of toxicity. Copyright © 2014 John Wiley & Sons, Ltd.
Chakravarthy, Ankur; Henderson, Stephen; Thirdborough, Stephen M.; Ottensmeier, Christian H.; Su, Xiaoping; Lechner, Matt; Feber, Andrew; Thomas, Gareth J.
2016-01-01
Purpose In squamous cell carcinomas of the head and neck (HNSCC), the increasing incidence of oropharyngeal squamous cell carcinomas (OPSCCs) is attributable to human papillomavirus (HPV) infection. Despite commonly presenting at late stage, HPV-driven OPSCCs are associated with improved prognosis compared with HPV-negative disease. HPV DNA is also detectable in nonoropharyngeal (non-OPSCC), but its pathogenic role and clinical significance are unclear. The objectives of this study were to determine whether HPV plays a causal role in non-OPSCC and to investigate whether HPV confers a survival benefit in these tumors. Methods Meta-analysis was used to build a cross-tissue gene-expression signature for HPV-driven cancer. Classifiers trained by machine-learning approaches were used to predict the HPV status of 520 HNSCCs profiled by The Cancer Genome Atlas project. DNA methylation data were similarly used to classify 464 HNSCCs and these analyses were integrated with genomic, histopathology, and survival data to permit a comprehensive comparison of HPV transcript-positive OPSCC and non-OPSCC. Results HPV-driven tumors accounted for 4.1% of non-OPSCCs. Regardless of anatomic site, HPV+ HNSCCs shared highly similar gene expression and DNA methylation profiles; nonkeratinizing, basaloid histopathological features; and lack of TP53 or CDKN2A alterations. Improved overall survival, however, was largely restricted to HPV-driven OPSCCs, which were associated with increased levels of tumor-infiltrating lymphocytes compared with HPV-driven non-OPSCCs. Conclusion Our analysis identified a causal role for HPV in transcript-positive non-OPSCCs throughout the head and neck. Notably, however, HPV-driven non-OPSCCs display a distinct immune microenvironment and clinical behavior compared with HPV-driven OPSCCs. PMID:27863190
A Self-Directed Method for Cell-Type Identification and Separation of Gene Expression Microarrays
Zuckerman, Neta S.; Noam, Yair; Goldsmith, Andrea J.; Lee, Peter P.
2013-01-01
Gene expression analysis is generally performed on heterogeneous tissue samples consisting of multiple cell types. Current methods developed to separate heterogeneous gene expression rely on prior knowledge of the cell-type composition and/or signatures - these are not available in most public datasets. We present a novel method to identify the cell-type composition, signatures and proportions per sample without need for a-priori information. The method was successfully tested on controlled and semi-controlled datasets and performed as accurately as current methods that do require additional information. As such, this method enables the analysis of cell-type specific gene expression using existing large pools of publically available microarray datasets. PMID:23990767
Authentication of gold nanoparticle encoded pharmaceutical tablets using polarimetric signatures.
Carnicer, Artur; Arteaga, Oriol; Suñé-Negre, Josep M; Javidi, Bahram
2016-10-01
The counterfeiting of pharmaceutical products represents concerns for both industry and the safety of the general public. Falsification produces losses to companies and poses health risks for patients. In order to detect fake pharmaceutical tablets, we propose producing film-coated tablets with gold nanoparticle encoding. These coated tablets contain unique polarimetric signatures. We present experiments to show that ellipsometric optical techniques, in combination with machine learning algorithms, can be used to distinguish genuine and fake samples. To the best of our knowledge, this is the first report using gold nanoparticles encoded with optical polarimetric classifiers to prevent the counterfeiting of pharmaceutical products.
Feng, Juerong; Zhou, Rui; Chang, Ying; Liu, Jing; Zhao, Qiu
2017-01-01
Hepatocellular carcinoma (HCC) has a high incidence and mortality worldwide, and its carcinogenesis and progression are influenced by a complex network of gene interactions. A weighted gene co-expression network was constructed to identify gene modules associated with the clinical traits in HCC (n = 214). Among the 13 modules, high correlation was only found between the red module and metastasis risk (classified by the HCC metastasis gene signature) (R2 = −0.74). Moreover, in the red module, 34 network hub genes for metastasis risk were identified, six of which (ABAT, AGXT, ALDH6A1, CYP4A11, DAO and EHHADH) were also hub nodes in the protein-protein interaction network of the module genes. Thus, a total of six hub genes were identified. In validation, all hub genes showed a negative correlation with the four-stage HCC progression (P for trend < 0.05) in the test set. Furthermore, in the training set, HCC samples with any hub gene lowly expressed demonstrated a higher recurrence rate and poorer survival rate (hazard ratios with 95% confidence intervals > 1). RNA-sequencing data of 142 HCC samples showed consistent results in the prognosis. Gene set enrichment analysis (GSEA) demonstrated that in the samples with any hub gene highly expressed, a total of 24 functional gene sets were enriched, most of which focused on amino acid metabolism and oxidation. In conclusion, co-expression network analysis identified six hub genes in association with HCC metastasis risk and prognosis, which might improve the prognosis by influencing amino acid metabolism and oxidation. PMID:28430663
Lopez Marulanda, Juliana; Adam, Olivier; Delfour, Fabienne
2016-11-01
Bottlenose dolphins are highly social cetaceans with an extensive sound production including clicks, burst-pulsed sounds, and whistles. Some whistles, known as signature whistles, are individually specific. These acoustic signatures are commonly described as being emitted in contexts of stress during forced isolation and as group cohesion calls. Interactions between humans and captive dolphins is largely based on positive reinforcement conditioning within several training/feeding sessions per day. Vocal behavior of dolphins during these interactions might vary. To investigate this, we recorded 10 bottlenose dolphins of Parc Asterix dolphinarium (France) before, during and after 10 training sessions for a total duration of 7 hr and 32 min. We detected 3,272 whistles with 2,884 presenting a quality good enough to be categorized. We created a catalog of whistle types by visual categorization verified by five naive judges (Fleiss' Kappa Test). We then applied the SIGID method to identify the signatures whistles present in our recordings. We found 279 whistles belonging to one of the four identified signature whistle types. The remaining 2,605 were classified as non-signature whistles. The non-signature whistles emission rate was higher during and after the training sessions than before. Emission rate of three signature whistles types significantly increased afterwards as compared to before the training sessions. We suggest that dolphins use their signature whistles when they return to their intraspecific social interactions succeeding scheduled and human-organized training sessions. More observations are needed to make conclusions about the function of signature whistles in relation to training sessions. Zoo Biol. 35:495-504, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
The Biology of Linguistic Expression Impacts Neural Correlates for Spatial Language
Emmorey, Karen; McCullough, Stephen; Mehta, Sonya; Ponto, Laura L. B.; Grabowski, Thomas J.
2013-01-01
Biological differences between signed and spoken languages may be most evident in the expression of spatial information. PET was used to investigate the neural substrates supporting the production of spatial language in American Sign Language as expressed by classifier constructions, in which handshape indicates object type and the location/motion of the hand iconically depicts the location/motion of a referent object. Deaf native signers performed a picture description task in which they overtly named objects or produced classifier constructions that varied in location, motion, or object type. In contrast to the expression of location and motion, the production of both lexical signs and object type classifier morphemes engaged left inferior frontal cortex and left inferior temporal cortex, supporting the hypothesis that unlike the location and motion components of a classifier construction, classifier handshapes are categorical morphemes that are retrieved via left hemisphere language regions. In addition, lexical signs engaged the anterior temporal lobes to a greater extent than classifier constructions, which we suggest reflects increased semantic processing required to name individual objects compared with simply indicating the type of object. Both location and motion classifier constructions engaged bilateral superior parietal cortex, with some evidence that the expression of static locations differentially engaged the left intraparietal sulcus. We argue that bilateral parietal activation reflects the biological underpinnings of sign language. To express spatial information, signers must transform visual–spatial representations into a body-centered reference frame and reach toward target locations within signing space. PMID:23249348
Deep convolutional neural networks for classifying GPR B-scans
NASA Astrophysics Data System (ADS)
Besaw, Lance E.; Stimac, Philip J.
2015-05-01
Symmetric and asymmetric buried explosive hazards (BEHs) present real, persistent, deadly threats on the modern battlefield. Current approaches to mitigate these threats rely on highly trained operatives to reliably detect BEHs with reasonable false alarm rates using handheld Ground Penetrating Radar (GPR) and metal detectors. As computers become smaller, faster and more efficient, there exists greater potential for automated threat detection based on state-of-the-art machine learning approaches, reducing the burden on the field operatives. Recent advancements in machine learning, specifically deep learning artificial neural networks, have led to significantly improved performance in pattern recognition tasks, such as object classification in digital images. Deep convolutional neural networks (CNNs) are used in this work to extract meaningful signatures from 2-dimensional (2-D) GPR B-scans and classify threats. The CNNs skip the traditional "feature engineering" step often associated with machine learning, and instead learn the feature representations directly from the 2-D data. A multi-antennae, handheld GPR with centimeter-accurate positioning data was used to collect shallow subsurface data over prepared lanes containing a wide range of BEHs. Several heuristics were used to prevent over-training, including cross validation, network weight regularization, and "dropout." Our results show that CNNs can extract meaningful features and accurately classify complex signatures contained in GPR B-scans, complementing existing GPR feature extraction and classification techniques.
Alsop, Eric B; Raymond, Jason
2013-01-01
Oligonucleotide signatures, especially tetranucleotide signatures, have been used as method for homology binning by exploiting an organism's inherent biases towards the use of specific oligonucleotide words. Tetranucleotide signatures have been especially useful in environmental metagenomics samples as many of these samples contain organisms from poorly classified phyla which cannot be easily identified using traditional homology methods, including NCBI BLAST. This study examines oligonucleotide signatures across 1,424 completed genomes from across the tree of life, substantially expanding upon previous work. A comprehensive analysis of mononucleotide through nonanucleotide word lengths suggests that longer word lengths substantially improve the classification of DNA fragments across a range of sizes of relevance to high throughput sequencing. We find that, at present, heptanucleotide signatures represent an optimal balance between prediction accuracy and computational time for resolving taxonomy using both genomic and metagenomic fragments. We directly compare the ability of tetranucleotide and heptanucleotide world lengths (tetranucleotide signatures are the current standard for oligonucleotide word usage analyses) for taxonomic binning of metagenome reads. We present evidence that heptanucleotide word lengths consistently provide more taxonomic resolving power, particularly in distinguishing between closely related organisms that are often present in metagenomic samples. This implies that longer oligonucleotide word lengths should replace tetranucleotide signatures for most analyses. Finally, we show that the application of longer word lengths to metagenomic datasets leads to more accurate taxonomic binning of DNA scaffolds and have the potential to substantially improve taxonomic assignment and assembly of metagenomic data.
Labaj, Wojciech; Papiez, Anna; Polanski, Andrzej; Polanska, Joanna
2017-03-01
Large collections of data in studies on cancer such as leukaemia provoke the necessity of applying tailored analysis algorithms to ensure supreme information extraction. In this work, a custom-fit pipeline is demonstrated for thorough investigation of the voluminous MILE gene expression data set. Three analyses are accomplished, each for gaining a deeper understanding of the processes underlying leukaemia types and subtypes. First, the main disease groups are tested for differential expression against the healthy control as in a standard case-control study. Here, the basic knowledge on molecular mechanisms is confirmed quantitatively and by literature references. Second, pairwise comparison testing is performed for juxtaposing the main leukaemia types among each other. In this case by means of the Dice coefficient similarity measure the general relations are pointed out. Moreover, lists of candidate main leukaemia group biomarkers are proposed. Finally, with this approach being successful, the third analysis provides insight into all of the studied subtypes, followed by the emergence of four leukaemia subtype biomarkers. In addition, the class enhanced DEG signature obtained on the basis of novel pipeline processing leads to significantly better classification power of multi-class data classifiers. The developed methodology consisting of batch effect adjustment, adaptive noise and feature filtration coupled with adequate statistical testing and biomarker definition proves to be an effective approach towards knowledge discovery in high-throughput molecular biology experiments.
Using Public Data for Comparative Proteome Analysis in Precision Medicine Programs.
Hughes, Christopher S; Morin, Gregg B
2018-03-01
Maximizing the clinical utility of information obtained in longitudinal precision medicine programs would benefit from robust comparative analyses to known information to assess biological features of patient material toward identifying the underlying features driving their disease phenotype. Herein, the potential for utilizing publically deposited mass-spectrometry-based proteomics data to perform inter-study comparisons of cell-line or tumor-tissue materials is investigated. To investigate the robustness of comparison between MS-based proteomics studies carried out with different methodologies, deposited data representative of label-free (MS1) and isobaric tagging (MS2 and MS3 quantification) are utilized. In-depth quantitative proteomics data acquired from analysis of ovarian cancer cell lines revealed the robust recapitulation of observable gene expression dynamics between individual studies carried out using significantly different methodologies. The observed signatures enable robust inter-study clustering of cell line samples. In addition, the ability to classify and cluster tumor samples based on observed gene expression trends when using a single patient sample is established. With this analysis, relevant gene expression dynamics are obtained from a single patient tumor, in the context of a precision medicine analysis, by leveraging a large cohort of repository data as a comparator. Together, these data establish the potential for state-of-the-art MS-based proteomics data to serve as resources for robust comparative analyses in precision medicine applications. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Oberhauser, Felix; Kornhuber, Johannes; Wiltfang, Jens; Fassbender, Klaus; Schroeter, Matthias L.; Volk, Alexander E.; Diehl-Schmid, Janine; Prudlo, Johannes; Danek, Adrian; Landwehrmeyer, Bernhard; Lauer, Martin; Otto, Markus; Jahn, Holger
2018-01-01
Information on circulating miRNAs in frontotemporal lobar degeneration is very limited and conflicting results have complicated an interpretation in Alzheimer’s disease thus far. In the present study we I) collected samples from multiple clinical centers across Germany, II) defined 3 homogenous patient groups with high sample sizes (bvFTD n = 48, AD n = 48 and cognitively healthy controls n = 44), III) compared expression levels in both CSF and serum samples and IV) detected a limited set of miRNAs by using a MIQE compliant protocol based on SYBR-green miRCURY assays that have proven reliable to generate reproducible results. We included several quality controls that identified and reduced technical variation to increase the reliability of our data. We showed that the expression levels of circulating miRNAs measured in CSF did not correlate with levels in serum. Using cluster analysis we found expression pattern in serum that, in part, reflects the genomic organization and affiliation to a specific miRNA family and that were specifically altered in bvFTD, AD, and control groups. Applying factor analysis we identified a 3-factor model characterized by a miRNA signature that explained 80% of the variance classifying healthy controls with 97%, bvFTD with 77% and AD with 72% accuracy. MANOVA confirmed signals like miR-320a and miR-26b-5p at BH corrected significance that contributed most to discriminate bvFTD cases with 96% sensitivity and 90% specificity and AD cases with 89% sensitivity and specificity compared to healthy controls, respectively. Correlation analysis revealed that miRNAs from the 3-factor model also correlated with levels of protein biomarker amyloid-beta1-42 and phosphorylated neurofilament heavy chain, indicating their potential role in the monitoring of progressive neuronal degeneration. Our data show that miRNAs can be reproducibly measured in serum and CSF without pre-amplification and that serum includes higher expressed signals that demonstrate an overall better ability to classify bvFTD, AD and healthy controls compared to signals detected in CSF. PMID:29746584
This study examined inter-analyst classification variability based on training site signature selection only for six classifications from a 10 km2 Landsat ETM+ image centered over a highly heterogeneous area in south-central Virginia. Six analysts classified the image...
Final Technical Report: PV Fault Detection Tool.
DOE Office of Scientific and Technical Information (OSTI.GOV)
King, Bruce Hardison; Jones, Christian Birk
The PV Fault Detection Tool project plans to demonstrate that the FDT can (a) detect catastrophic and degradation faults and (b) identify the type of fault. This will be accomplished by collecting fault signatures using different instruments and integrating this information to establish a logical controller for detecting, diagnosing and classifying each fault.
Pattern classifier for health monitoring of helicopter gearboxes
NASA Technical Reports Server (NTRS)
Chin, Hsinyung; Danai, Kourosh; Lewicki, David G.
1993-01-01
The application of a newly developed diagnostic method to a helicopter gearbox is demonstrated. This method is a pattern classifier which uses a multi-valued influence matrix (MVIM) as its diagnostic model. The method benefits from a fast learning algorithm, based on error feedback, that enables it to estimate gearbox health from a small set of measurement-fault data. The MVIM method can also assess the diagnosability of the system and variability of the fault signatures as the basis to improve fault signatures. This method was tested on vibration signals reflecting various faults in an OH-58A main rotor transmission gearbox. The vibration signals were then digitized and processed by a vibration signal analyzer to enhance and extract various features of the vibration data. The parameters obtained from this analyzer were utilized to train and test the performance of the MVIM method in both detection and diagnosis. The results indicate that the MVIM method provided excellent detection results when the full range of faults effects on the measurements were included in training, and it had a correct diagnostic rate of 95 percent when the faults were included in training.
Discrete Neural Signatures of Basic Emotions.
Saarimäki, Heini; Gotsopoulos, Athanasios; Jääskeläinen, Iiro P; Lampinen, Jouko; Vuilleumier, Patrik; Hari, Riitta; Sams, Mikko; Nummenmaa, Lauri
2016-06-01
Categorical models of emotions posit neurally and physiologically distinct human basic emotions. We tested this assumption by using multivariate pattern analysis (MVPA) to classify brain activity patterns of 6 basic emotions (disgust, fear, happiness, sadness, anger, and surprise) in 3 experiments. Emotions were induced with short movies or mental imagery during functional magnetic resonance imaging. MVPA accurately classified emotions induced by both methods, and the classification generalized from one induction condition to another and across individuals. Brain regions contributing most to the classification accuracy included medial and inferior lateral prefrontal cortices, frontal pole, precentral and postcentral gyri, precuneus, and posterior cingulate cortex. Thus, specific neural signatures across these regions hold representations of different emotional states in multimodal fashion, independently of how the emotions are induced. Similarity of subjective experiences between emotions was associated with similarity of neural patterns for the same emotions, suggesting a direct link between activity in these brain regions and the subjective emotional experience. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Hassane, Duane C.; Guzman, Monica L.; Corbett, Cheryl; Li, Xiaojie; Abboud, Ramzi; Young, Fay; Liesveld, Jane L.; Carroll, Martin
2008-01-01
Increasing evidence indicates that malignant stem cells are important for the pathogenesis of acute myelogenous leukemia (AML) and represent a reservoir of cells that drive the development of AML and relapse. Therefore, new treatment regimens are necessary to prevent relapse and improve therapeutic outcomes. Previous studies have shown that the sesquiterpene lactone, parthenolide (PTL), ablates bulk, progenitor, and stem AML cells while causing no appreciable toxicity to normal hematopoietic cells. Thus, PTL must evoke cellular responses capable of mediating AML selective cell death. Given recent advances in chemical genomics such as gene expression-based high-throughput screening (GE-HTS) and the Connectivity Map, we hypothesized that the gene expression signature resulting from treatment of primary AML with PTL could be used to search for similar signatures in publicly available gene expression profiles deposited into the Gene Expression Omnibus (GEO). We therefore devised a broad in silico screen of the GEO database using the PTL gene expression signature as a template and discovered 2 new agents, celastrol and 4-hydroxy-2-nonenal, that effectively eradicate AML at the bulk, progenitor, and stem cell level. These findings suggest the use of multicenter collections of high-throughput data to facilitate discovery of leukemia drugs and drug targets. PMID:18305216
Wu, Jing-Shan; Lo, Hsin-Yi; Li, Chia-Cheng; Chen, Feng-Yuan; Hsiang, Chien-Yun; Ho, Tin-Yun
2017-08-15
Electroacupuncture (EA) has been applied to treat and prevent diseases for years. However, molecular events happened in both the acupunctured site and the internal organs after EA stimulation have not been clarified. Here we applied transcriptomic analysis to explore the gene expression signatures after EA stimulation. Mice were applied EA stimulation at ST36 for 15 min and nine tissues were collected three hours later for microarray analysis. We found that EA affected the expression of genes not only in the acupunctured site but also in the internal organs. EA commonly affected biological networks involved in cytoskeleton and cell adhesion, and also regulated unique process networks in specific organs, such as γ-aminobutyric acid-ergic neurotransmission in brain and inflammation process in lung. In addition, EA affected the expression of genes related to various diseases, such as neurodegenerative diseases in brain and obstructive pulmonary diseases in lung. This report applied, for the first time, a global comprehensive genome-wide approach to analyze the gene expression profiling of acupunctured site and internal organs after EA stimulation. The connection between gene expression signatures, biological processes, and diseases might provide a basis for prediction and explanation on the therapeutic potentials of acupuncture in organs.
NASA Technical Reports Server (NTRS)
Prill, J. C. (Principal Investigator)
1979-01-01
The author has identified the following significant results. Level 2 forest features (softwood, hardwood, clear-cut, and water) can be classified with an overall accuracy of 71.6 percent plus or minus 6.7 percent at the 90 percent confidence level for the particular data and conditions existing at the time of the study. Signatures derived from training fields taken from only 10 percent of the site are not sufficient to adequately classify the site. The level 3 softwood age group classification appears reasonable, although no statistical evaluation was performed.
Altobelli, Gioia; Bogdarina, Irina G; Stupka, Elia; Clark, Adrian J L; Langley-Evans, Simon
2013-01-01
A large body of evidence from human and animal studies demonstrates that the maternal diet during pregnancy can programme physiological and metabolic functions in the developing fetus, effectively determining susceptibility to later disease. The mechanistic basis of such programming is unclear but may involve resetting of epigenetic marks and fetal gene expression. The aim of this study was to evaluate genome-wide DNA methylation and gene expression in the livers of newborn rats exposed to maternal protein restriction. On day one postnatally, there were 618 differentially expressed genes and 1183 differentially methylated regions (FDR 5%). The functional analysis of differentially expressed genes indicated a significant effect on DNA repair/cycle/maintenance functions and of lipid, amino acid metabolism and circadian functions. Enrichment for known biological functions was found to be associated with differentially methylated regions. Moreover, these epigenetically altered regions overlapped genetic loci associated with metabolic and cardiovascular diseases. Both expression changes and DNA methylation changes were largely reversed by supplementing the protein restricted diet with folic acid. Although the epigenetic and gene expression signatures appeared to underpin largely different biological processes, the gene expression profile of DNA methyl transferases was altered, providing a potential link between the two molecular signatures. The data showed that maternal protein restriction is associated with widespread differential gene expression and DNA methylation across the genome, and that folic acid is able to reset both molecular signatures.
Hu, Shimin; Xu-Monette, Zijun Y.; Balasubramanyam, Aarthi; Manyam, Ganiraju C.; Visco, Carlo; Tzankov, Alexander; Liu, Wei-min; Miranda, Roberto N.; Zhang, Li; Montes-Moreno, Santiago; Dybkær, Karen; Chiu, April; Orazi, Attilio; Zu, Youli; Bhagat, Govind; Richards, Kristy L.; Hsi, Eric D.; Choi, William W. L.; Han van Krieken, J.; Huang, Qin; Huh, Jooryung; Ai, Weiyun; Ponzoni, Maurilio; Ferreri, Andrés J. M.; Zhao, Xiaoying; Winter, Jane N.; Zhang, Mingzhi; Li, Ling; Møller, Michael B.; Piris, Miguel A.; Li, Yong; Go, Ronald S.; Wu, Lin; Medeiros, L. Jeffrey; Young, Ken H.
2013-01-01
CD30, originally identified as a cell-surface marker of Reed-Sternberg and Hodgkin cells of classical Hodgkin lymphoma, is also expressed by several types of non-Hodgkin lymphoma, including a subset of diffuse large B-cell lymphoma (DLBCL). However, the prognostic and biological importance of CD30 expression in DLBCL is unknown. Here we report that CD30 expression is a favorable prognostic factor in a cohort of 903 de novo DLBCL patients. CD30 was expressed in ∼14% of DLBCL patients. Patients with CD30+ DLBCL had superior 5-year overall survival (CD30+, 79% vs CD30–, 59%; P = .001) and progression-free survival (P = .003). The favorable outcome of CD30 expression was maintained in both the germinal center B-cell and activated B-cell subtypes. Gene expression profiling revealed the upregulation of genes encoding negative regulators of nuclear factor κB activation and lymphocyte survival, and downregulation of genes encoding B-cell receptor signaling and proliferation, as well as prominent cytokine and stromal signatures in CD30+ DLBCL patients, suggesting a distinct molecular basis for its favorable outcome. Given the superior prognostic value, unique gene expression signature, and significant value of CD30 as a therapeutic target for brentuximab vedotin in ongoing successful clinical trials, it seems appropriate to consider CD30+ DLBCL as a distinct subgroup of DLBCL. PMID:23343832
Ben Salem, Samira; Bacha, Khmais; Chaari, Abdelkader
2012-09-01
In this work we suggest an original fault signature based on an improved combination of Hilbert and Park transforms. Starting from this combination we can create two fault signatures: Hilbert modulus current space vector (HMCSV) and Hilbert phase current space vector (HPCSV). These two fault signatures are subsequently analysed using the classical fast Fourier transform (FFT). The effects of mechanical faults on the HMCSV and HPCSV spectrums are described, and the related frequencies are determined. The magnitudes of spectral components, relative to the studied faults (air-gap eccentricity and outer raceway ball bearing defect), are extracted in order to develop the input vector necessary for learning and testing the support vector machine with an aim of classifying automatically the various states of the induction motor. Copyright © 2012 ISA. Published by Elsevier Ltd. All rights reserved.
Millimeter-wave micro-Doppler measurements of small UAVs
NASA Astrophysics Data System (ADS)
Rahman, Samiur; Robertson, Duncan A.
2017-05-01
This paper discusses the micro-Doppler signatures of small UAVs obtained from a millimeter-wave radar system. At first, simulation results are shown to demonstrate the theoretical concept. It is illustrated that whilst the propeller rotation rate of the small UAVs is quite high, millimeter-wave radar systems are capable of capturing the full micro-Doppler spread. Measurements of small UAVs have been performed with both CW and FMCW radars operating at 94 GHz. The CW radar was used for obtaining micro-Doppler signatures of individual propellers. The field test data of a flying small UAV was collected with the FMCW radar and was processed to extract micro-Doppler signatures. The high fidelity results clearly reveal features such as blade flashes and propeller rotation modulation lines which can be used to classify targets. This work confirms that millimeter-wave radar is suitable for the detection and classification of small UAVs at usefully long ranges.
BMI1 induces an invasive signature in melanoma that promotes metastasis and chemoresistance
Ferretti, Roberta; Bhutkar, Arjun; McNamara, Molly C.; Lees, Jacqueline A.
2016-01-01
Melanoma can switch between proliferative and invasive states, which have identifying gene expression signatures that correlate with good and poor prognosis, respectively. However, the mechanisms controlling these signatures are poorly understood. In this study, we identify BMI1 as a key determinant of melanoma metastasis by which its overexpression enhanced and its deletion impaired dissemination. Remarkably, in this tumor type, BMI1 had no effect on proliferation or primary tumor growth but enhanced every step of the metastatic cascade. Consistent with the broad spectrum of effects, BMI1 activated widespread gene expression changes, which are characteristic of melanoma progression and also chemoresistance. Accordingly, we showed that up-regulation or down-regulation of BMI1 induced resistance or sensitivity to BRAF inhibitor treatment and that induction of noncanonical Wnt by BMI1 is required for this resistance. Finally, we showed that our BMI1-induced gene signature encompasses all of the hallmarks of the previously described melanoma invasive signature. Moreover, our signature is predictive of poor prognosis in human melanoma and is able to identify primary tumors that are likely to become metastatic. These data yield key insights into melanoma biology and establish BMI1 as a compelling drug target whose inhibition would suppress both metastasis and chemoresistance of melanoma. PMID:26679841
A four-gene signature predicts survival in clear-cell renal-cell carcinoma.
Dai, Jun; Lu, Yuchao; Wang, Jinyu; Yang, Lili; Han, Yingyan; Wang, Ying; Yan, Dan; Ruan, Qiurong; Wang, Shaogang
2016-12-13
Clear-cell renal-cell carcinoma (ccRCC) is the most common pathological subtype of renal cell carcinoma (RCC), accounting for about 80% of RCC. In order to find potential prognostic biomarkers in ccRCC, we presented a four-gene signature to evaluate the prognosis of ccRCC. SurvExpress and immunohistochemical (IHC) staining of tissue microarrays were used to analyze the association between the four genes and the prognosis of ccRCC. Data from TCGA dataset revealed a prognostic prompt function of the four genes (PTEN, PIK3C2A, ITPA and BCL3). Further discovery suggested that the four-gene signature predicted survival better than any of the four genes alone. Moreover, IHC staining demonstrated a consistent result with TCGA, indicating that the signature was an independent prognostic factor of survival in ccRCC. Univariate and multivariate Cox proportional hazard regression analysis were conducted to verify the association of clinicopathological variables and the four genes' expression levels with survival. The results further testified that the risk (four-gene signature) was an independent prognostic factors of both Overall Survival (OS) and Disease-free Survival (DFS) (P<0.05). In conclusion, the four-gene signature was correlated with the survival of ccRCC, and therefore, may help to provide significant clinical implications for predicting the prognosis of patients.
Peters, Linda M.; Belyantseva, Inna A.; Lagziel, Ayala; Battey, James F.; Friedman, Thomas B.; Morell, Robert J.
2007-01-01
Specialization in cell function and morphology is influenced by the differential expression of mRNAs, many of which are expressed at low abundance and restricted to certain cell types. Detecting such transcripts in cDNA libraries may require sequencing millions of clones. Massively parallel signature sequencing (MPSS) is well-suited for identifying transcripts that are expressed in discrete cell types and in low abundance. We have made MPSS libraries from microdissections of three inner ear tissues. By comparing these MPSS libraries to those of 87 other tissues included in the Mouse Reference Transcriptome (MRT) online resource, we have identified genes that are highly enriched in, or specific to, the inner ear. We show by RT-PCR and in situ hybridization that signatures unique to the inner ear libraries identify transcripts with highly specific cell-type localizations. These transcripts serve to illustrate the utility of a resource that is available to the research community. Utilization of these resources will increase the number of known transcription units and expand our knowledge of the tissue-specific regulation of the transcriptome. PMID:17049805
Santuario-Facio, Sandra K; Cardona-Huerta, Servando; Perez-Paramo, Yadira X; Trevino, Victor; Hernandez-Cabrera, Francisco; Rojas-Martinez, Augusto; Uscanga-Perales, Grecia; Martinez-Rodriguez, Jorge L; Martinez-Jacobo, Lizeth; Padilla-Rivas, Gerardo; Muñoz-Maldonado, Gerardo; Gonzalez-Guerrero, Juan Francisco; Valero-Gomez, Javier; Vazquez-Guerrero, Ana L; Martinez-Rodriguez, Herminia G; Barboza-Quintana, Alvaro; Barboza-Quintana, Oralia; Garza-Guajardo, Raquel; Ortiz-Lopez, Rocio
2017-01-01
Triple-negative breast cancer (TNBC) is an aggressive subtype of breast cancer tumors. Comparisons between TNBC and non–triple-negative breast cancer (nTNBC) may help to differentiate key components involved in TNBC neoplasms. The purpose of the study was to analyze the expression profile of TNBC versus nTNBC tumors in a homogeneous population from northeastern Mexico. A prospective study of 50 patients (25 TNBC and 25 nTNBC) was conducted. Clinic parameters were equally distributed for TNBC and nTNBC: age at diagnosis (51 versus 47 years, p = 0.1), glucose level (107 mg/dl versus 104 mg/dl, p = 0.64), and body mass index (28 versus 29, p = 0.14). Core biopsies were collected for histopathological diagnosis and gene expression analysis. Total RNA was isolated and expression profiling was performed. Forty genes showed differential expression pattern in TNBC tumors. Among these, nine overexpressed genes (PRKX/PRKY, UGT8, HMGA1, LPIN1, HAPLN3, FAM171A1, BCL141A, FOXC1, and ANKRD11), and one underexpressed gene (ANX9) are involved in general metabolism. Based on this biochemical peculiarity and the overexpression of BCL11A and FOXC1 (involved in tumor growth and metastasis, respectively), we validated by quantitative polymerase chain reaction the expression profiles of seven genes out of the signature. In this report, a new gene signature for TNBC is proposed. To our knowledge, this is the first TNBC signature that describes genes involved in general metabolism. The findings may be pertinent for Mexican patients and require evaluation in other ethnic groups and populations. PMID:28474731
Yan, Bin; Yang, Xinping; Lee, Tin-Lap; Friedman, Jay; Tang, Jun; Van Waes, Carter; Chen, Zhong
2007-01-01
Background Differentially expressed gene profiles have previously been observed among pathologically defined cancers by microarray technologies, including head and neck squamous cell carcinomas (HNSCCs). However, the molecular expression signatures and transcriptional regulatory controls that underlie the heterogeneity in HNSCCs are not well defined. Results Genome-wide cDNA microarray profiling of ten HNSCC cell lines revealed novel gene expression signatures that distinguished cancer cell subsets associated with p53 status. Three major clusters of over-expressed genes (A to C) were defined through hierarchical clustering, Gene Ontology, and statistical modeling. The promoters of genes in these clusters exhibited different patterns and prevalence of transcription factor binding sites for p53, nuclear factor-κB (NF-κB), activator protein (AP)-1, signal transducer and activator of transcription (STAT)3 and early growth response (EGR)1, as compared with the frequency in vertebrate promoters. Cluster A genes involved in chromatin structure and function exhibited enrichment for p53 and decreased AP-1 binding sites, whereas clusters B and C, containing cytokine and antiapoptotic genes, exhibited a significant increase in prevalence of NF-κB binding sites. An increase in STAT3 and EGR1 binding sites was distributed among the over-expressed clusters. Novel regulatory modules containing p53 or NF-κB concomitant with other transcription factor binding motifs were identified, and experimental data supported the predicted transcriptional regulation and binding activity. Conclusion The transcription factors p53, NF-κB, and AP-1 may be important determinants of the heterogeneous pattern of gene expression, whereas STAT3 and EGR1 may broadly enhance gene expression in HNSCCs. Defining these novel gene signatures and regulatory mechanisms will be important for establishing new molecular classifications and subtyping, which in turn will promote development of targeted therapeutics for HNSCC. PMID:17498291
Wang, Weining; Lim, Weng Khong; Leong, Hui Sun; Chong, Fui Teen; Lim, Tony K H; Tan, Daniel S W; Teh, Bin Tean; Iyer, N Gopalakrishna
2015-04-01
Extracapsular spread (ECS) is an important prognostic factor for oral squamous cell carcinoma (OSCC) and is used to guide management. In this study, we aimed to identify an expression profile signature for ECS in node-positive OSCC using data derived from two different sources: a cohort of OSCC patients from our institution (National Cancer Centre Singapore) and The Cancer Genome Atlas (TCGA) head and neck squamous cell carcinoma (HNSCC) cohort. We also sought to determine if this signature could serve as a prognostic factor in node negative cancers. Patients with a histological diagnosis of OSCC were identified from an institutional database and fresh tumor samples were retrieved. RNA was extracted and gene expression profiling was performed using the Affymetrix GeneChip Human Genome U133 Plus 2.0 microarray platform. RNA sequence data and corresponding clinical data for the TCGA HNSCC cohort were downloaded from the TCGA Data Portal. All data analyses were conducted using R package and SPSS. We identified an 11 gene signature (GGH, MTFR1, CDKN3, PSRC1, SMIM3, CA9, IRX4, CPA3, ZSCAN16, CBX7 and ZFP3) which was robust in segregating tumors by ECS status. In node negative patients, patients harboring this ECS signature had a significantly worse overall survival (p=0.04). An eleven gene signature for ECS was derived. Our results also suggest that this signature is prognostic in a separate subset of patients with no nodal metastasis Further validation of this signature on other datasets and immunohistochemical studies are required to establish utility of this signature in stratifying early stage OSCC patients. Copyright © 2014 Elsevier Ltd. All rights reserved.
Electromagnetic signatures of the preclinical and prodromal stages of Alzheimer's disease.
Nakamura, Akinori; Cuesta, Pablo; Fernández, Alberto; Arahata, Yutaka; Iwata, Kaori; Kuratsubo, Izumi; Bundo, Masahiko; Hattori, Hideyuki; Sakurai, Takashi; Fukuda, Koji; Washimi, Yukihiko; Endo, Hidetoshi; Takeda, Akinori; Diers, Kersten; Bajo, Ricardo; Maestú, Fernando; Ito, Kengo; Kato, Takashi
2018-05-01
Biomarkers useful for the predementia stages of Alzheimer's disease are needed. Electroencephalography and magnetoencephalography (MEG) are expected to provide potential biomarker candidates for evaluating the predementia stages of Alzheimer's disease. However, the physiological relevance of EEG/MEG signal changes and their role in pathophysiological processes such as amyloid-β deposition and neurodegeneration need to be elucidated. We evaluated 28 individuals with mild cognitive impairment and 38 cognitively normal individuals, all of whom were further classified into amyloid-β-positive mild cognitive impairment (n = 17, mean age 74.7 ± 5.4 years, nine males), amyloid-β-negative mild cognitive impairment (n = 11, mean age 73.8 ± 8.8 years, eight males), amyloid-β-positive cognitively normal (n = 13, mean age 71.8 ± 4.4 years, seven males), and amyloid-β-negative cognitively normal (n = 25, mean age 72.5 ± 3.4 years, 11 males) individuals using Pittsburgh compound B-PET. We measured resting state MEG for 5 min with the eyes closed, and investigated regional spectral patterns of MEG signals using atlas-based region of interest analysis. Then, the relevance of the regional spectral patterns and their associations with pathophysiological backgrounds were analysed by integrating information from Pittsburgh compound B-PET, fluorodeoxyglucose-PET, structural MRI, and cognitive tests. The results demonstrated that regional spectral patterns of resting state activity could be separated into several types of MEG signatures as follows: (i) the effects of amyloid-β deposition were expressed as the alpha band power augmentation in medial frontal areas; (ii) the delta band power increase in the same region was associated with disease progression within the Alzheimer's disease continuum and was correlated with entorhinal atrophy and an Alzheimer's disease-like regional decrease in glucose metabolism; and (iii) the global theta power augmentation, which was previously considered to be an Alzheimer's disease-related EEG/MEG signature, was associated with general cognitive decline and hippocampal atrophy, but was not specific to Alzheimer's disease because these changes could be observed in the absence of amyloid-β deposition. The results suggest that these MEG signatures may be useful as unique biomarkers for the predementia stages of Alzheimer's disease.
Genome-wide expression profiling in the peripheral blood of patients with fibromyalgia
Jones, Kim D.; Gelbart, Terri; Whisenant, Thomas C.; Waalen, Jill; Mondala, Tony S.; Iklé, David N.; Salomon, Daniel R.; Bennett, Robert M.; Kurian, Sunil M.
2016-01-01
Objective Fibromyalgia (FM) is a common pain disorder characterised by nociceptive dysregulation. The basic biology of FM is poorly understood. Herein we have used agnostic gene expression as a potential probe for informing its underlying biology and the development of a proof-of-concept diagnostic gene expression signature. Methods We analysed RNA expression in 70 FM patients and 70 healthy controls. The isolated RNA was amplified and hybridised to Affymetrix® Human Gene 1.1 ST Peg arrays. The data was analysed using Partek Genomics Suite v. 6.6. Results Fibromyalgia patients exhibited a differential expression of 421 genes (p<0.001), several relevant to pathways for pain processing, such as glutamine/glutamate signaling and axonal development. There was also an upregulation of several inflammatory pathways and downregulation of pathways related to hypersensitivity and allergy. Using rigorous diagnostic modeling strategies, we show “locked” gene signatures discovered on Training and Test cohorts, that have a mean Area Under the Curve (AUC) of 0.81 on randomised, independent external data cohorts. Lastly, we identified a subset of 10 probesets that provided a diagnostic sensitivity for FM of 95% and a specificity of 96%. We also show that the signatures for FM were very specific to FM rather than common FM comorbidities. Conclusion These findings provide new insights relevant to the pathogenesis of FM, and provide several testable hypotheses that warrant further exploration and also establish the foundation for a first blood-based molecular signature in FM that needs to be validated in larger cohorts of patients. PMID:27157394
Clinical proteomics: Applications for prostate cancer biomarker discovery and detection.
Petricoin, Emanuel F; Ornstein, David K; Liotta, Lance A
2004-01-01
The science of proteomics comprises much more than simply generating lists of proteins that change in expression as a cause of or consequence of pathophysiology. The goal of proteomics should be to characterize the information flow through the intercellular protein circuitry that communicates with the extracellular microenvironment and then ultimately to the serum/plasma macroenvironment. Serum proteomic pattern diagnostics is a new type of proteomic concept in which patterns of ion signatures generated from high dimensional mass spectrometry data are used as diagnostic classifiers. This recent approach has exciting potential for clinical utility of diagnostic patterns because low molecular weight metabolites, peptides, and protein fragments may have higher accuracy than traditional biomarkers of cancer detection. Intriguingly, we now have discovered that this diagnostic information exists in a bound state, complexed with circulating highly abundant carrier proteins. These diagnostic fragments may one day be harvested by circulating nanoparticles, designed to absorb, enrich, and amplify the repertoire of diagnostic biomarkers generated-even at the critical, initial stages of carcinogenesis. Copyright 2004 Elsevier Inc.
Bradford, James R; Cox, Angela; Bernard, Philip; Camp, Nicola J
2016-01-01
Long non-coding RNAs (lncRNAs) are emerging as crucial regulators of cellular processes and diseases such as cancer; however, their functions remain poorly characterised. Several studies have demonstrated that lncRNAs are typically disease and tumour subtype specific, particularly in breast cancer where lncRNA expression alone is sufficient to discriminate samples based on hormone status and molecular intrinsic subtype. However, little attempt has been made to assess the reproducibility of lncRNA signatures across more than one dataset. In this work, we derive consensus lncRNA signatures indicative of breast cancer subtype based on two clinical RNA-Seq datasets: the Utah Breast Cancer Study and The Cancer Genome Atlas, through integration of differential expression and hypothesis-free clustering analyses. The most consistent signature is associated with breast cancers of the basal-like subtype, leading us to generate a putative set of six lncRNA basal-like breast cancer markers, at least two of which may have a role in cis-regulation of known poor prognosis markers. Through in silico functional characterization of individual signatures and integration of expression data from pre-clinical cancer models, we discover that discordance between signatures derived from different clinical cohorts can arise from the strong influence of non-cancerous cells in tumour samples. As a consequence, we identify nine lncRNAs putatively associated with breast cancer associated fibroblasts, or the immune response. Overall, our study establishes the confounding effects of tumour purity on lncRNA signature derivation, and generates several novel hypotheses on the role of lncRNAs in basal-like breast cancers and the tumour microenvironment.
Wruck, Wasco; Schröter, Friederike; Adjaye, James
2016-01-01
Although the incidence of Alzheimer's disease (AD) is continuously increasing in the aging population worldwide, effective therapies are not available. The interplay between causative genetic and environmental factors is partially understood. Meta-analyses have been performed on aspects such as polymorphisms, cytokines, and cognitive training. Here, we propose a meta-analysis approach based on hierarchical clustering analysis of a reliable training set of hippocampus biopsies, which is condensed to a gene expression signature. This gene expression signature was applied to various test sets of brain biopsies and iPSC-derived neuronal cell models to demonstrate its ability to distinguish AD samples from control. Thus, our identified AD-gene signature may form the basis for determination of biomarkers that are urgently needed to overcome current diagnostic shortfalls. Intriguingly, the well-described AD-related genes APP and APOE are not within the signature because their gene expression profiles show a lower correlation to the disease phenotype than genes from the signature. This is in line with the differing characteristics of the disease as early-/late-onset or with/without genetic predisposition. To investigate the gene signature's systemic role(s), signaling pathways, gene ontologies, and transcription factors were analyzed which revealed over-representation of response to stress, regulation of cellular metabolic processes, and reactive oxygen species. Additionally, our results clearly point to an important role of FOXA1 and FOXA2 gene regulatory networks in the etiology of AD. This finding is in corroboration with the recently reported major role of the dopaminergic system in the development of AD and its regulation by FOXA1 and FOXA2.
NASA Astrophysics Data System (ADS)
Leakey, Chris D. B.; Attrill, Martin J.; Jennings, Simon; Fitzsimons, Mark F.
2008-04-01
Estuaries are regarded as valuable nursery habitats for many commercially important marine fishes, potentially providing a thermal resource, refuge from predators and a source of abundant prey. Stable isotope analysis may be used to assess relative resource use from isotopically distinct sources. This study comprised two major components: (1) development of a spatial map and discriminant function model of stable isotope variation in selected invertebrate groups inhabiting the Thames Estuary and adjacent coastal regions; and (2) analysis of stable isotope signatures of juvenile bass ( Dicentrarchus labrax), sole ( Solea solea) and whiting ( Merlangius merlangus) for assessment of resource use and feeding strategies. The data were also used to consider anthropogenic enrichment of the estuary and potential energetic benefits of feeding in estuarine nursery habitat. Analysis of carbon (δ 13C), nitrogen (δ 15N) and sulphur (δ 34S) isotope data identified significant differences in the 'baseline' isotopic signatures between estuarine and coastal invertebrates, and discriminant function analysis allowed samples to be re-classified to estuarine and coastal regions with 98.8% accuracy. Using invertebrate signatures as source indicators, stable isotope data classified juvenile fishes to the region in which they fed. Feeding signals appear to reflect physiological (freshwater tolerance) and functional (mobility) differences between species. Juvenile sole were found to exist as two isotopically-discrete sub-populations, with no evidence of mixing between the two. An apparent energetic benefit of estuarine feeding was only found for sole.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mayer, B. P.; Mew, D. A.; DeHope, A.
Attribution of the origin of an illicit drug relies on identification of compounds indicative of its clandestine production and is a key component of many modern forensic investigations. The results of these studies can yield detailed information on method of manufacture, starting material source, and final product - all critical forensic evidence. In the present work, chemical attribution signatures (CAS) associated with the synthesis of the analgesic fentanyl, N-(1-phenylethylpiperidin-4-yl)-N-phenylpropanamide, were investigated. Six synthesis methods, all previously published fentanyl synthetic routes or hybrid versions thereof, were studied in an effort to identify and classify route-specific signatures. 160 distinct compounds and inorganicmore » species were identified using gas and liquid chromatographies combined with mass spectrometric methods (GC-MS and LCMS/ MS-TOF) in conjunction with inductively coupled plasma mass spectrometry (ICPMS). The complexity of the resultant data matrix urged the use of multivariate statistical analysis. Using partial least squares discriminant analysis (PLS-DA), 87 route-specific CAS were classified and a statistical model capable of predicting the method of fentanyl synthesis was validated and tested against CAS profiles from crude fentanyl products deposited and later extracted from two operationally relevant surfaces: stainless steel and vinyl tile. This work provides the most detailed fentanyl CAS investigation to date by using orthogonal mass spectral data to identify CAS of forensic significance for illicit drug detection, profiling, and attribution.« less
A Transcriptional Signature of Fatigue Derived from Patients with Primary Sjögren’s Syndrome
James, Katherine; Al-Ali, Shereen; Tarn, Jessica; Cockell, Simon J.; Gillespie, Colin S.; Hindmarsh, Victoria; Locke, James; Mitchell, Sheryl; Lendrem, Dennis; Bowman, Simon; Price, Elizabeth; Pease, Colin T.; Emery, Paul; Lanyon, Peter; Hunter, John A.; Gupta, Monica; Bombardieri, Michele; Sutcliffe, Nurhan; Pitzalis, Costantino; McLaren, John; Cooper, Annie; Regan, Marian; Giles, Ian; Isenberg, David; Saravanan, Vadivelu; Coady, David; Dasgupta, Bhaskar; McHugh, Neil; Young-Min, Steven; Moots, Robert; Gendi, Nagui; Akil, Mohammed; Griffiths, Bridget; Wipat, Anil; Newton, Julia; Jones, David E.; Isaacs, John; Hallinan, Jennifer; Ng, Wan-Fai
2015-01-01
Background Fatigue is a debilitating condition with a significant impact on patients’ quality of life. Fatigue is frequently reported by patients suffering from primary Sjögren’s Syndrome (pSS), a chronic autoimmune condition characterised by dryness of the eyes and the mouth. However, although fatigue is common in pSS, it does not manifest in all sufferers, providing an excellent model with which to explore the potential underpinning biological mechanisms. Methods Whole blood samples from 133 fully-phenotyped pSS patients stratified for the presence of fatigue, collected by the UK primary Sjögren’s Syndrome Registry, were used for whole genome microarray. The resulting data were analysed both on a gene by gene basis and using pre-defined groups of genes. Finally, gene set enrichment analysis (GSEA) was used as a feature selection technique for input into a support vector machine (SVM) classifier. Classification was assessed using area under curve (AUC) of receiver operator characteristic and standard error of Wilcoxon statistic, SE(W). Results Although no genes were individually found to be associated with fatigue, 19 metabolic pathways were enriched in the high fatigue patient group using GSEA. Analysis revealed that these enrichments arose from the presence of a subset of 55 genes. A radial kernel SVM classifier with this subset of genes as input displayed significantly improved performance over classifiers using all pathway genes as input. The classifiers had AUCs of 0.866 (SE(W) 0.002) and 0.525 (SE(W) 0.006), respectively. Conclusions Systematic analysis of gene expression data from pSS patients discordant for fatigue identified 55 genes which are predictive of fatigue level using SVM classification. This list represents the first step in understanding the underlying pathophysiological mechanisms of fatigue in patients with pSS. PMID:26694930
2011-06-01
8w DC in patients treated with erlotinib, but not sorafenib, indicating that it is not merely a prognostic signature; D) Both the 5-gene signature...disease-free, progression-free, and overall survival will vary across prognostically distinct groups. 3. Specific molecular signatures in primary tumors...therapeutic strategies at relapse. Specific Aims: Aim 1: To define characteristic TTF/gene expression profiles of prognostically distinct
Identifying prognostic signature in ovarian cancer using DirGenerank
Wang, Jian-Yong; Chen, Ling-Ling; Zhou, Xiong-Hui
2017-01-01
Identifying the prognostic genes in cancer is essential not only for the treatment of cancer patients, but also for drug discovery. However, it's still a big challenge to select the prognostic genes that can distinguish the risk of cancer patients across various data sets because of tumor heterogeneity. In this situation, the selected genes whose expression levels are statistically related to prognostic risks may be passengers. In this paper, based on gene expression data and prognostic data of ovarian cancer patients, we used conditional mutual information to construct gene dependency network in which the nodes (genes) with more out-degrees have more chances to be the modulators of cancer prognosis. After that, we proposed DirGenerank (Generank in direct netowrk) algorithm, which concerns both the gene dependency network and genes’ correlations to prognostic risks, to identify the gene signature that can predict the prognostic risks of ovarian cancer patients. Using ovarian cancer data set from TCGA (The Cancer Genome Atlas) as training data set, 40 genes with the highest importance were selected as prognostic signature. Survival analysis of these patients divided by the prognostic signature in testing data set and four independent data sets showed the signature can distinguish the prognostic risks of cancer patients significantly. Enrichment analysis of the signature with curated cancer genes and the drugs selected by CMAP showed the genes in the signature may be drug targets for therapy. In summary, we have proposed a useful pipeline to identify prognostic genes of cancer patients. PMID:28615526
Linel, Patrice; Wu, Shuang; Deng, Nan; Wu, Hulin
2014-10-01
Recent studies demonstrate that human blood transcriptional signatures may be used to support diagnosis and clinical decisions for acute respiratory viral infections such as influenza. In this article, we propose to use a newly developed systems biology approach for time course gene expression data to identify significant dynamically response genes and dynamic gene network responses to viral infection. We illustrate the methodological pipeline by reanalyzing the time course gene expression data from a study with healthy human subjects challenged by live influenza virus. We observed clear differences in the number of significant dynamic response genes (DRGs) between the symptomatic and asymptomatic subjects and also identified DRG signatures for symptomatic subjects with influenza infection. The 505 common DRGs shared by the symptomatic subjects have high consistency with the signature genes for predicting viral infection identified in previous works. The temporal response patterns and network response features were carefully analyzed and investigated.
Ai, Rizi; Hammaker, Deepa; Boyle, David L.; Morgan, Rachel; Walsh, Alice M.; Fan, Shicai; Firestein, Gary S.; Wang, Wei
2016-01-01
Stratifying patients on the basis of molecular signatures could facilitate development of therapeutics that target pathways specific to a particular disease or tissue location. Previous studies suggest that pathogenesis of rheumatoid arthritis (RA) is similar in all affected joints. Here we show that distinct DNA methylation and transcriptome signatures not only discriminate RA fibroblast-like synoviocytes (FLS) from osteoarthritis FLS, but also distinguish RA FLS isolated from knees and hips. Using genome-wide methods, we show differences between RA knee and hip FLS in the methylation of genes encoding biological pathways, such as IL-6 signalling via JAK-STAT pathway. Furthermore, differentially expressed genes are identified between knee and hip FLS using RNA-sequencing. Double-evidenced genes that are both differentially methylated and expressed include multiple HOX genes. Joint-specific DNA signatures suggest that RA disease mechanisms might vary from joint to joint, thus potentially explaining some of the diversity of drug responses in RA patients. PMID:27282753
Iandolino, Alberto; Nobuta, Kan; da Silva, Francisco Goes; Cook, Douglas R; Meyers, Blake C
2008-05-12
Vitis vinifera (V. vinifera) is the primary grape species cultivated for wine production, with an industry valued annually in the billions of dollars worldwide. In order to sustain and increase grape production, it is necessary to understand the genetic makeup of grape species. Here we performed mRNA profiling using Massively Parallel Signature Sequencing (MPSS) and combined it with available Expressed Sequence Tag (EST) data. These tag-based technologies, which do not require a priori knowledge of genomic sequence, are well-suited for transcriptional profiling. The sequence depth of MPSS allowed us to capture and quantify almost all the transcripts at a specific stage in the development of the grape berry. The number and relative abundance of transcripts from stage II grape berries was defined using Massively Parallel Signature Sequencing (MPSS). A total of 2,635,293 17-base and 2,259,286 20-base signatures were obtained, representing at least 30,737 and 26,878 distinct sequences. The average normalized abundance per signature was approximately 49 TPM (Transcripts Per Million). Comparisons of the MPSS signatures with available Vitis species' ESTs and a unigene set demonstrated that 6,430 distinct contigs and 2,190 singletons have a perfect match to at least one MPSS signature. Among the matched sequences, ESTs were identified from tissues other than berries or from berries at different developmental stages. Additional MPSS signatures not matching to known grape ESTs can extend our knowledge of the V. vinifera transcriptome, particularly when these data are used to assist in annotation of whole genome sequences from Vitis vinifera. The MPSS data presented here not only achieved a higher level of saturation than previous EST based analyses, but in doing so, expand the known set of transcripts of grape berries during the unique stage in development that immediately precedes the onset of ripening. The MPSS dataset also revealed evidence of antisense expression not previously reported in grapes but comparable to that reported in other plant species. Finally, we developed a novel web-based, public resource for utilization of the grape MPSS data [1].
Reinke, Sarah; Richter, Julia; Fend, Falko; Feller, Alfred; Hansmann, Martin-Leo; Hüttl, Katrin; Oschlies, Ilske; Ott, German; Möller, Peter; Rosenwald, Andreas; Stein, Harald; Altenbuchinger, Michael; Spang, Rainer; Klapper, Wolfram
2018-05-05
Diffuse large B-cell lymphoma (DLBCL) is subdivided by gene expression analysis (GEP) into two molecular subtypes named germinal center B-cell-like (GCB) and activated B-cell-like (ABC) after their putative cell-of-origin (COO). Determination of the COO is considered mandatory in any new-diagnosed DLBCL, not otherwise specified according to the updated WHO classification. Despite the fact that pathologists are free to choose the method for COO classification, immunohistochemical (IHC) assays are most widely used. However, to the best of our knowledge, no round-robin test to evaluate the interlaboratory variability has been published so far. Eight hematopathology laboratories participated in an interlaboratory test for COO classification of 10 DLBCL tumors using the IHC classifier comprising the expression of CD10, BCL6, and MUM1 (so-called Hans classifier). The results were compared with GEP for COO signature and, in a subset, with results obtained by image analysis. In 7/10 cases (70%), at least seven laboratories assigned a given case to the same COO subtype (one center assessed one sample as not analyzable), which was in agreement with the COO subtype determined by GEP. The results in 3/10 cases (30%) revealed discrepancies between centers and/or between IHC and GEP subtype. Whereas the CD10 staining results were highly reproducible, staining for MUM1 was inconsistent in 50% and for BCL6 in 40% of cases. Image analysis of 16 slides stained for BCL6 (N = 8) and MUM1 (N = 8) of the two cases with the highest disagreement in COO classification were in line with the score of the pathologists in 14/16 stainings analyzed (87.5%). This study describes the first round-robin test for COO subtyping in DLBCL using IHC and demonstrates that COO classification using the Hans classifier yields consistent results among experienced hematopathologists, even when variable staining protocols are used. Data from this small feasibility study need to be validated in larger cohorts.
Study of the microdoppler signature of a bicyclist for different directions of approach
NASA Astrophysics Data System (ADS)
Rodriguez-Hervas, Berta; Maile, Michael; Flores, Benjamin C.
2015-05-01
The successful implementation of autonomous driving in an urban setting depends on the ability of the environment perception system to correctly classify vulnerable road users such as pedestrians and bicyclists in dense, complex scenarios. Self-driving vehicles include sensor systems such as cameras, lidars, and radars to enable decision making. Among these systems, radars are particularly relevant due to their operational robustness under adverse weather and night light conditions. Classification of pedestrian and car in urban settings using automotive radar has been widely investigated, suggesting that micro-Doppler signatures are useful for target discrimination. Our objective is to analyze and study the micro-Doppler signature of bicyclists approaching a vehicle from different directions in order to establish the basis of a classification criterion to distinguish bicycles from other targets including clutter. The micro-Doppler signature is obtained by grouping individual reflecting points using a clustering algorithm and observing the evolution of all the points belonging to an object in the Doppler domain over time. A comparison is then made with simulated data that uses a kinematic model of bicyclists' movement. The suitability of the micro-Doppler bicyclist signature as a classification feature is determined by comparing it to those belonging to cars and pedestrians approaching the automotive radar system.
Buick, Julie K.; Williams, Andrew; Swartz, Carol D.; Recio, Leslie; Li, Heng‐Hong; Fornace, Albert J.; Thomson, Errol M.; Aubrecht, Jiri
2016-01-01
In vitro transcriptional signatures that predict toxicities can facilitate chemical screening. We previously developed a transcriptomic biomarker (known as TGx‐28.65) for classifying agents as genotoxic (DNA damaging) and non‐genotoxic in human lymphoblastoid TK6 cells. Because TK6 cells do not express cytochrome P450s, we confirmed accurate classification by the biomarker in cells co‐exposed to 1% 5,6 benzoflavone/phenobarbital‐induced rat liver S9 for metabolic activation. However, chemicals may require different types of S9 for activation. Here we investigated the response of TK6 cells to higher percentages of Aroclor‐, benzoflavone/phenobarbital‐, or ethanol‐induced rat liver S9 to expand TGx‐28.65 biomarker applicability. Transcriptional profiles were derived 3 to 4 hr following a 4 hr co‐exposure of TK6 cells to test chemicals and S9. Preliminary studies established that 10% Aroclor‐ and 5% ethanol‐induced S9 alone did not induce the TGx‐28.65 biomarker genes. Seven genotoxic and two non‐genotoxic chemicals (and concurrent solvent and positive controls) were then tested with one of the S9s (selected based on cell survival and micronucleus induction). Relative survival and micronucleus frequency was assessed by flow cytometry in cells 20 hr post‐exposure. Genotoxic/non‐genotoxic chemicals were accurately classified using the different S9s. One technical replicate of cells co‐treated with dexamethasone and 10% Aroclor‐induced S9 was falsely classified as genotoxic, suggesting caution in using high S9 concentrations. Even low concentrations of genotoxic chemicals (those not causing cytotoxicity) were correctly classified, demonstrating that TGx‐28.65 is a sensitive biomarker of genotoxicity. A meta‐analysis of datasets from 13 chemicals supports that different S9s can be used in TK6 cells, without impairing classification using the TGx‐28.65 biomarker. Environ. Mol. Mutagen. 57:243–260, 2016. © 2016 Her Majesty the Queen in Right of Canada. Environmental and Molecular Mutagenesis © 2016 Environmental Mutagen Society PMID:26946220
Kunkel, Steven D.; Suneja, Manish; Ebert, Scott M.; Bongers, Kale S.; Fox, Daniel K.; Malmberg, Sharon E.; Alipour, Fariborz; Shields, Richard K.; Adams, Christopher M.
2011-01-01
SUMMARY Skeletal muscle atrophy is a common and debilitating condition that lacks a pharmacologic therapy. To develop a potential therapy, we identified 63 mRNAs that were regulated by fasting in both human and mouse muscle, and 29 mRNAs that were regulated by both fasting and spinal cord injury in human muscle. We used these two unbiased mRNA expression signatures of muscle atrophy to query the Connectivity Map, which singled out ursolic acid as a compound whose signature was opposite to those of atrophy-inducing stresses. A natural compound enriched in apples, ursolic acid reduced muscle atrophy and stimulated muscle hypertrophy in mice. It did so by enhancing skeletal muscle insulin/IGF-I signaling, and inhibiting atrophy-associated skeletal muscle mRNA expression. Importantly, ursolic acid’s effects on muscle were accompanied by reductions in adiposity, fasting blood glucose and plasma cholesterol and triglycerides. These findings identify a potential therapy for muscle atrophy and perhaps other metabolic diseases. PMID:21641545
Can specific transcriptional regulators assemble a universal cancer signature?
NASA Astrophysics Data System (ADS)
Roy, Janine; Isik, Zerrin; Pilarsky, Christian; Schroeder, Michael
2013-10-01
Recently, there is a lot of interest in using biomarker signatures derived from gene expression data to predict cancer progression. We assembled signatures of 25 published datasets covering 13 types of cancers. How do these signatures compare with each other? On one hand signatures answering the same biological question should overlap, whereas signatures predicting different cancer types should differ. On the other hand, there could also be a Universal Cancer Signature that is predictive independently of the cancer type. Initially, we generate signatures for all datasets using classical approaches such as t-test and fold change and then, we explore signatures resulting from a network-based method, that applies the random surfer model of Google's PageRank algorithm. We show that the signatures as published by the authors and the signatures generated with classical methods do not overlap - not even for the same cancer type - whereas the network-based signatures strongly overlap. Selecting 10 out of 37 universal cancer genes gives the optimal prediction for all cancers thus taking a first step towards a Universal Cancer Signature. We furthermore analyze and discuss the involved genes in terms of the Hallmarks of cancer and in particular single out SP1, JUN/FOS and NFKB1 and examine their specific role in cancer progression.
Fisher classifier and its probability of error estimation
NASA Technical Reports Server (NTRS)
Chittineni, C. B.
1979-01-01
Computationally efficient expressions are derived for estimating the probability of error using the leave-one-out method. The optimal threshold for the classification of patterns projected onto Fisher's direction is derived. A simple generalization of the Fisher classifier to multiple classes is presented. Computational expressions are developed for estimating the probability of error of the multiclass Fisher classifier.
Vallejo, Abbe N; Hamel, David L; Mueller, Robert G; Ives, Diane G; Michel, Joshua J; Boudreau, Robert M; Newman, Anne B
2011-01-01
Exceptional aging has been defined as maintenance of physical and cognitive function beyond the median lifespan despite a history of diseases and/or concurrent subclinical conditions. Since immunity is vital to individual fitness, we examined immunologic fingerprint(s) of highly functional elders. Therefore, survivors of the Cardiovascular Health Study in Pittsburgh, Pennsylvania, USA were recruited (n = 140; mean age = 86 years) and underwent performance testing. Blood samples were collected and examined blindly for humoral factors and T cell phenotypes. Based on results of physical and cognitive performance testing, elders were classified as "impaired" or "unimpaired", accuracy of group assignment was verified by discriminant function analysis. The two groups showed distinct immune profiles as determined by factor analysis. The dominant immune signature of impaired elders consisted of interferon (IFN)-γ, interleukin (IL)-6, tumor necrosis factor-α, and T cells expressing inhibitory natural killer-related receptors (NKR) CD158a, CD158e, and NKG2A. In contrast, the dominant signature of unimpaired elders consisted of IL-5, IL-12p70, and IL-13 with co-expression of IFN-γ, IL-4, and IL-17, and T cells expressing stimulatory NKRs CD56, CD16, and NKG2D. In logistic regression models, unimpaired phenotype was predicted independently by IL-5 and by CD4(+)CD28(null)CD56(+)CD57(+) T cells. All elders had high antibody titers to common viruses including cytomegalovirus. In cellular bioassays, T cell receptor (TCR)-independent ligation of either CD56 or NKG2D elicited activation of T cells. Collectively, these data demonstrate the importance of immunological parameters in distinguishing between health phenotypes of older adults. NKR(+) T cells and cytokine upregulation indicate a unique physiologic environment in old age. Correlation of particular NKR(+) T cell subsets and IL-5 with unimpaired performance, and NKR-driven TCR-independent activation of T cells suggest novel immunopathway(s) that could be exploited to improve immunity in old age.
Time-Frequency Analysis of a Moving Human Doppler Signature
2009-02-01
Spectrogram at 1 GHz...........................8 3.3 Spectrograms of a Walking Human Carrying an AK47 Rifle.......................................12...carrying an AK47 rifle, at 1 GHz...section peaks out at 1 GHz. 3.3 Spectrograms of a Walking Human Carrying an AK47 Rifle One scenario of great interest is trying to classify a person
Yang, Jun; Hou, Ziming; Wang, Changjiang; Wang, Hao; Zhang, Hongbing
2018-04-23
Adamantinomatous craniopharyngioma (ACP) is an aggressive brain tumor that occurs predominantly in the pediatric population. Conventional diagnosis method and standard therapy cannot treat ACPs effectively. In this paper, we aimed to identify key genes for ACP early diagnosis and treatment. Datasets GSE94349 and GSE68015 were obtained from Gene Expression Omnibus database. Consensus clustering was applied to discover the gene clusters in the expression data of GSE94349 and functional enrichment analysis was performed on gene set in each cluster. The protein-protein interaction (PPI) network was built by the Search Tool for the Retrieval of Interacting Genes, and hubs were selected. Support vector machine (SVM) model was built based on the signature genes identified from enrichment analysis and PPI network. Dataset GSE94349 was used for training and testing, and GSE68015 was used for validation. Besides, RT-qPCR analysis was performed to analyze the expression of signature genes in ACP samples compared with normal controls. Seven gene clusters were discovered in the differentially expressed genes identified from GSE94349 dataset. Enrichment analysis of each cluster identified 25 pathways that highly associated with ACP. PPI network was built and 46 hubs were determined. Twenty-five pathway-related genes that overlapped with the hubs in PPI network were used as signatures to establish the SVM diagnosis model for ACP. The prediction accuracy of SVM model for training, testing, and validation data were 94, 85, and 74%, respectively. The expression of CDH1, CCL2, ITGA2, COL8A1, COL6A2, and COL6A3 were significantly upregulated in ACP tumor samples, while CAMK2A, RIMS1, NEFL, SYT1, and STX1A were significantly downregulated, which were consistent with the differentially expressed gene analysis. SVM model is a promising classification tool for screening and early diagnosis of ACP. The ACP-related pathways and signature genes will advance our knowledge of ACP pathogenesis and benefit the therapy improvement.
Namani, Akhileshwar; Matiur Rahaman, Md; Chen, Ming; Tang, Xiuwen
2018-01-06
NRF2 is the key regulator of oxidative stress in normal cells and aberrant expression of the NRF2 pathway due to genetic alterations in the KEAP1 (Kelch-like ECH-associated protein 1)-NRF2 (nuclear factor erythroid 2 like 2)-CUL3 (cullin 3) axis leads to tumorigenesis and drug resistance in many cancers including head and neck squamous cell cancer (HNSCC). The main goal of this study was to identify specific genes regulated by the KEAP1-NRF2-CUL3 axis in HNSCC patients, to assess the prognostic value of this gene signature in different cohorts, and to reveal potential biomarkers. RNA-Seq V2 level 3 data from 279 tumor samples along with 37 adjacent normal samples from patients enrolled in the The Cancer Genome Atlas (TCGA)-HNSCC study were used to identify upregulated genes using two methods (altered KEAP1-NRF2-CUL3 versus normal, and altered KEAP1-NRF2-CUL3 versus wild-type). We then used a new approach to identify the combined gene signature by integrating both datasets and subsequently tested this signature in 4 independent HNSCC datasets to assess its prognostic value. In addition, functional annotation using the DAVID v6.8 database and protein-protein interaction (PPI) analysis using the STRING v10 database were performed on the signature. A signature composed of a subset of 17 genes regulated by the KEAP1-NRF2-CUL3 axis was identified by overlapping both the upregulated genes of altered versus normal (251 genes) and altered versus wild-type (25 genes) datasets. We showed that increased expression was significantly associated with poor survival in 4 independent HNSCC datasets, including the TCGA-HNSCC dataset. Furthermore, Gene Ontology, Kyoto Encyclopedia of Genes and Genomes, and PPI analysis revealed that most of the genes in this signature are associated with drug metabolism and glutathione metabolic pathways. Altogether, our study emphasizes the discovery of a gene signature regulated by the KEAP1-NRF2-CUL3 axis which is strongly associated with tumorigenesis and drug resistance in HNSCC. This 17-gene signature provides potential biomarkers and therapeutic targets for HNSCC cases in which the NRF2 pathway is activated.
Nowak, Anna; Lock, Dominik; Bacher, Petra; Hohnstein, Thordis; Vogt, Katrin; Gottfreund, Judith; Giehr, Pascal; Polansky, Julia K.; Sawitzki, Birgit; Kaiser, Andrew; Walter, Jörn; Scheffold, Alexander
2018-01-01
Regulatory T cells (Tregs) are an attractive therapeutic tool for several different immune pathologies. Therapeutic Treg application often requires prolonged in vitro culture to generate sufficient Treg numbers or to optimize their functionality, e.g., via genetic engineering of their antigen receptors. However, purity of clinical Treg expansion cultures is highly variable, and currently, it is impossible to identify and separate stable Tregs from contaminating effector T cells, either ex vivo or after prior expansion. This represents a major obstacle for quality assurance of expanded Tregs and raises significant safety concerns. Here, we describe a Treg activation signature that allows identification and sorting of epigenetically imprinted Tregs even after prolonged in vitro culture. We show that short-term reactivation resulted in expression of CD137 but not CD154 on stable FoxP3+ Tregs that displayed a demethylated Treg-specific demethylated region, high suppressive potential, and lack of inflammatory cytokine expression. We also applied this Treg activation signature for rapid testing of chimeric antigen receptor functionality in human Tregs and identified major differences in the signaling requirements regarding CD137 versus CD28 costimulation. Taken together, CD137+CD154− expression emerges as a universal Treg activation signature ex vivo and upon in vitro expansion allowing the identification and isolation of epigenetically stable antigen-activated Tregs and providing a means for their rapid functional testing in vitro. PMID:29467769
GO-PCA: An Unsupervised Method to Explore Gene Expression Data Using Prior Knowledge
Wagner, Florian
2015-01-01
Method Genome-wide expression profiling is a widely used approach for characterizing heterogeneous populations of cells, tissues, biopsies, or other biological specimen. The exploratory analysis of such data typically relies on generic unsupervised methods, e.g. principal component analysis (PCA) or hierarchical clustering. However, generic methods fail to exploit prior knowledge about the molecular functions of genes. Here, I introduce GO-PCA, an unsupervised method that combines PCA with nonparametric GO enrichment analysis, in order to systematically search for sets of genes that are both strongly correlated and closely functionally related. These gene sets are then used to automatically generate expression signatures with functional labels, which collectively aim to provide a readily interpretable representation of biologically relevant similarities and differences. The robustness of the results obtained can be assessed by bootstrapping. Results I first applied GO-PCA to datasets containing diverse hematopoietic cell types from human and mouse, respectively. In both cases, GO-PCA generated a small number of signatures that represented the majority of lineages present, and whose labels reflected their respective biological characteristics. I then applied GO-PCA to human glioblastoma (GBM) data, and recovered signatures associated with four out of five previously defined GBM subtypes. My results demonstrate that GO-PCA is a powerful and versatile exploratory method that reduces an expression matrix containing thousands of genes to a much smaller set of interpretable signatures. In this way, GO-PCA aims to facilitate hypothesis generation, design of further analyses, and functional comparisons across datasets. PMID:26575370
GO-PCA: An Unsupervised Method to Explore Gene Expression Data Using Prior Knowledge.
Wagner, Florian
2015-01-01
Genome-wide expression profiling is a widely used approach for characterizing heterogeneous populations of cells, tissues, biopsies, or other biological specimen. The exploratory analysis of such data typically relies on generic unsupervised methods, e.g. principal component analysis (PCA) or hierarchical clustering. However, generic methods fail to exploit prior knowledge about the molecular functions of genes. Here, I introduce GO-PCA, an unsupervised method that combines PCA with nonparametric GO enrichment analysis, in order to systematically search for sets of genes that are both strongly correlated and closely functionally related. These gene sets are then used to automatically generate expression signatures with functional labels, which collectively aim to provide a readily interpretable representation of biologically relevant similarities and differences. The robustness of the results obtained can be assessed by bootstrapping. I first applied GO-PCA to datasets containing diverse hematopoietic cell types from human and mouse, respectively. In both cases, GO-PCA generated a small number of signatures that represented the majority of lineages present, and whose labels reflected their respective biological characteristics. I then applied GO-PCA to human glioblastoma (GBM) data, and recovered signatures associated with four out of five previously defined GBM subtypes. My results demonstrate that GO-PCA is a powerful and versatile exploratory method that reduces an expression matrix containing thousands of genes to a much smaller set of interpretable signatures. In this way, GO-PCA aims to facilitate hypothesis generation, design of further analyses, and functional comparisons across datasets.
Nowak, Anna; Lock, Dominik; Bacher, Petra; Hohnstein, Thordis; Vogt, Katrin; Gottfreund, Judith; Giehr, Pascal; Polansky, Julia K; Sawitzki, Birgit; Kaiser, Andrew; Walter, Jörn; Scheffold, Alexander
2018-01-01
Regulatory T cells (Tregs) are an attractive therapeutic tool for several different immune pathologies. Therapeutic Treg application often requires prolonged in vitro culture to generate sufficient Treg numbers or to optimize their functionality, e.g., via genetic engineering of their antigen receptors. However, purity of clinical Treg expansion cultures is highly variable, and currently, it is impossible to identify and separate stable Tregs from contaminating effector T cells, either ex vivo or after prior expansion. This represents a major obstacle for quality assurance of expanded Tregs and raises significant safety concerns. Here, we describe a Treg activation signature that allows identification and sorting of epigenetically imprinted Tregs even after prolonged in vitro culture. We show that short-term reactivation resulted in expression of CD137 but not CD154 on stable FoxP3+ Tregs that displayed a demethylated Treg-specific demethylated region, high suppressive potential, and lack of inflammatory cytokine expression. We also applied this Treg activation signature for rapid testing of chimeric antigen receptor functionality in human Tregs and identified major differences in the signaling requirements regarding CD137 versus CD28 costimulation. Taken together, CD137+CD154- expression emerges as a universal Treg activation signature ex vivo and upon in vitro expansion allowing the identification and isolation of epigenetically stable antigen-activated Tregs and providing a means for their rapid functional testing in vitro .
Hudson, Marie; Bernatsky, Sasha; Colmegna, Ines; Lora, Maximilien; Pastinen, Tomi; Klein Oros, Kathleen; Greenwood, Celia M T
2017-06-03
We undertook this study to identify DNA methylation signatures of three systemic autoimmune rheumatic diseases (SARDs), namely rheumatoid arthritis, systemic lupus erythematosus, and systemic sclerosis, compared to healthy controls. Using a careful design to minimize confounding, we restricted our study to subjects with incident disease and performed our analyses on purified CD4 + T cells, key effector cells in SARD. We identified differentially methylated (using the Illumina Infinium HumanMethylation450 BeadChip array) and expressed (using the Illumina TruSeq stranded RNA-seq protocol) sites between cases and controls, and investigated the biological significance of this SARD signature using gene annotation databases. We recruited 13 seropositive rheumatoid arthritis, 19 systemic sclerosis, 12 systemic lupus erythematosus subjects, and 8 healthy controls. We identified 33 genes that were both differentially methylated and expressed (26 over- and 7 under-expressed) in SARD cases versus controls. The most highly overexpressed gene was CD1C (log fold change in expression = 1.85, adjusted P value = 0.009). In functional analysis (Ingenuity Pathway Analysis), the top network identified was lipid metabolism, molecular transport, small molecule biochemistry. The top canonical pathways included the mitochondrial L-carnitine shuttle pathway (P = 5E-03) and PTEN signaling (P = 8E-03). The top upstream regulator was HNF4A (P = 3E-05). This novel SARD signature contributes to ongoing work to further our understanding of the molecular mechanisms underlying SARD and provides novel targets of interest.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hermsen, Sanne A.B., E-mail: Sanne.Hermsen@rivm.nl; Department of Toxicogenomics, Maastricht University, P.O. Box 616, 6200 MD, Maastricht; Institute for Risk Assessment Sciences
2013-10-01
The zebrafish embryotoxicity test is a promising alternative assay for developmental toxicity. Classically, morphological assessment of the embryos is applied to evaluate the effects of compound exposure. However, by applying differential gene expression analysis the sensitivity and predictability of the test may be increased. For defining gene expression signatures of developmental toxicity, we explored the possibility of using gene expression signatures of compound exposures based on commonly expressed individual genes as well as based on regulated gene pathways. Four developmental toxic compounds were tested in concentration-response design, caffeine, carbamazepine, retinoic acid and valproic acid, and two non-embryotoxic compounds, D-mannitol andmore » saccharin, were included. With transcriptomic analyses we were able to identify commonly expressed genes, which were mostly development related, after exposure to the embryotoxicants. We also identified gene pathways regulated by the embryotoxicants, suggestive of their modes of action. Furthermore, whereas pathways may be regulated by all compounds, individual gene expression within these pathways can differ for each compound. Overall, the present study suggests that the use of individual gene expression signatures as well as pathway regulation may be useful starting points for defining gene biomarkers for predicting embryotoxicity. - Highlights: • The zebrafish embryotoxicity test in combination with transcriptomics was used. • We explored two approaches of defining gene biomarkers for developmental toxicity. • Four compounds in concentration-response design were tested. • We identified commonly expressed individual genes as well as regulated gene pathways. • Both approaches seem suitable starting points for defining gene biomarkers.« less
A high resolution atlas of gene expression in the domestic sheep (Ovis aries)
Farquhar, Iseabail L.; Young, Rachel; Lefevre, Lucas; Pridans, Clare; Tsang, Hiu G.; Afrasiabi, Cyrus; Watson, Mick; Whitelaw, C. Bruce; Freeman, Tom C.; Archibald, Alan L.; Hume, David A.
2017-01-01
Sheep are a key source of meat, milk and fibre for the global livestock sector, and an important biomedical model. Global analysis of gene expression across multiple tissues has aided genome annotation and supported functional annotation of mammalian genes. We present a large-scale RNA-Seq dataset representing all the major organ systems from adult sheep and from several juvenile, neonatal and prenatal developmental time points. The Ovis aries reference genome (Oar v3.1) includes 27,504 genes (20,921 protein coding), of which 25,350 (19,921 protein coding) had detectable expression in at least one tissue in the sheep gene expression atlas dataset. Network-based cluster analysis of this dataset grouped genes according to their expression pattern. The principle of ‘guilt by association’ was used to infer the function of uncharacterised genes from their co-expression with genes of known function. We describe the overall transcriptional signatures present in the sheep gene expression atlas and assign those signatures, where possible, to specific cell populations or pathways. The findings are related to innate immunity by focusing on clusters with an immune signature, and to the advantages of cross-breeding by examining the patterns of genes exhibiting the greatest expression differences between purebred and crossbred animals. This high-resolution gene expression atlas for sheep is, to our knowledge, the largest transcriptomic dataset from any livestock species to date. It provides a resource to improve the annotation of the current reference genome for sheep, presenting a model transcriptome for ruminants and insight into gene, cell and tissue function at multiple developmental stages. PMID:28915238
A high resolution atlas of gene expression in the domestic sheep (Ovis aries).
Clark, Emily L; Bush, Stephen J; McCulloch, Mary E B; Farquhar, Iseabail L; Young, Rachel; Lefevre, Lucas; Pridans, Clare; Tsang, Hiu G; Wu, Chunlei; Afrasiabi, Cyrus; Watson, Mick; Whitelaw, C Bruce; Freeman, Tom C; Summers, Kim M; Archibald, Alan L; Hume, David A
2017-09-01
Sheep are a key source of meat, milk and fibre for the global livestock sector, and an important biomedical model. Global analysis of gene expression across multiple tissues has aided genome annotation and supported functional annotation of mammalian genes. We present a large-scale RNA-Seq dataset representing all the major organ systems from adult sheep and from several juvenile, neonatal and prenatal developmental time points. The Ovis aries reference genome (Oar v3.1) includes 27,504 genes (20,921 protein coding), of which 25,350 (19,921 protein coding) had detectable expression in at least one tissue in the sheep gene expression atlas dataset. Network-based cluster analysis of this dataset grouped genes according to their expression pattern. The principle of 'guilt by association' was used to infer the function of uncharacterised genes from their co-expression with genes of known function. We describe the overall transcriptional signatures present in the sheep gene expression atlas and assign those signatures, where possible, to specific cell populations or pathways. The findings are related to innate immunity by focusing on clusters with an immune signature, and to the advantages of cross-breeding by examining the patterns of genes exhibiting the greatest expression differences between purebred and crossbred animals. This high-resolution gene expression atlas for sheep is, to our knowledge, the largest transcriptomic dataset from any livestock species to date. It provides a resource to improve the annotation of the current reference genome for sheep, presenting a model transcriptome for ruminants and insight into gene, cell and tissue function at multiple developmental stages.
Continuous-variable quantum homomorphic signature
NASA Astrophysics Data System (ADS)
Li, Ke; Shang, Tao; Liu, Jian-wei
2017-10-01
Quantum cryptography is believed to be unconditionally secure because its security is ensured by physical laws rather than computational complexity. According to spectrum characteristic, quantum information can be classified into two categories, namely discrete variables and continuous variables. Continuous-variable quantum protocols have gained much attention for their ability to transmit more information with lower cost. To verify the identities of different data sources in a quantum network, we propose a continuous-variable quantum homomorphic signature scheme. It is based on continuous-variable entanglement swapping and provides additive and subtractive homomorphism. Security analysis shows the proposed scheme is secure against replay, forgery and repudiation. Even under nonideal conditions, it supports effective verification within a certain verification threshold.
Signatures in Martian Volatiles and the Magma Sources of NC Meteorites
NASA Technical Reports Server (NTRS)
Marti, K.; Mathew, K. J.
2004-01-01
We report nitrogen and xenon isotopic signatures in Yamato nakhlites and use the data to assess properties of the magma source of NC meteorites in planet Mars. The Chassigny meteorite was investigated by Floran et al, who classified it as a cumulate dunite with hydrous amphibole-bearing melt inclusions with no preferred orientation of the olivines. Their inferred composition of the parent magma, which was based on electron microprobe analyses, has been questioned. The trace and minor elements in minerals were analyzed in nakhlites and in Chassigny and the authors conclude that nakhlites may represent samples from different horizons of the same lithologic unit, but that Chassigny was not co-magmatic with the nakhlites.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Renner, J.; Farbin, A.; Vidal, J. Muñoz
Here, we investigate the potential of using deep learning techniques to reject background events in searches for neutrinoless double beta decay with high pressure xenon time projection chambers capable of detailed track reconstruction. The differences in the topological signatures of background and signal events can be learned by deep neural networks via training over many thousands of events. These networks can then be used to classify further events as signal or background, providing an additional background rejection factor at an acceptable loss of efficiency. The networks trained in this study performed better than previous methods developed based on the usemore » of the same topological signatures by a factor of 1.2 to 1.6, and there is potential for further improvement.« less
NASA Technical Reports Server (NTRS)
Ohtani, S.; Takahashi, K.; Zanetti, L. J.; Potemra, T. A.; Mcentire, R. W.; Iijima, T.
1992-01-01
The initial signatures of tail field reconfiguration observed in the near-earth magnetotail are examined using data obtained by the AMPTE/CCE magnetometer and the Medium Energy Particle Analyzer. It is found that the tail reconfiguration events could be classified as belonging to two types, Type I and Type II. In Type I events, a current disruption is immersed in a hot plasma region expanding from inward (earthward/equatorward) of the spacecraft; consequently, the spacecraft is immersed in a hot plasma region expanding from inward. The Type II reconfiguration event is characterized by a distinctive interval (explosive growth phase) just prior to the local commencement of tail phase.
Binato, Renata; Santos, Everton Cruz; Boroni, Mariana; Demachki, Samia; Assumpção, Paulo; Abdelhay, Eliana
2018-01-26
Gastric carcinoma (GC) is one of the most aggressive cancers and the second leading cause of cancer death in the world. According to the Lauren classification, this adenocarcinoma is divided into two subtypes, intestinal and diffuse, which differ in their clinical, epidemiological and molecular features. Several studies have attempted to delineate the molecular signature of gastric cancer to develop new and non-invasive screening tests that improve diagnosis and lead to new treatment strategies. However, a consensus signature has not yet been identified for each condition. Thus, this work aimed to analyze the gene expression profile of Brazilian intestinal-type GC tissues using microarrays and compare the results to those of non-tumor tissue samples. Moreover, we compared our intestinal-type gastric carcinoma profile with those obtained from populations worldwide to assess their similarity. The results identified a molecular signature for intestinal-type GC and revealed that 38 genes differentially expressed in Brazilian intestinal-type gastric carcinoma samples can successfully distinguish gastric tumors from non-tumor tissue in the global population. These differentially expressed genes participate in biological processes important to cell homeostasis. Furthermore, Kaplan-Meier analysis suggested that 7 of these genes could individually be able to predict overall survival in intestinal-type gastric cancer patients.
Duan, Qiaonan; Flynn, Corey; Niepel, Mario; Hafner, Marc; Muhlich, Jeremy L; Fernandez, Nicolas F; Rouillard, Andrew D; Tan, Christopher M; Chen, Edward Y; Golub, Todd R; Sorger, Peter K; Subramanian, Aravind; Ma'ayan, Avi
2014-07-01
For the Library of Integrated Network-based Cellular Signatures (LINCS) project many gene expression signatures using the L1000 technology have been produced. The L1000 technology is a cost-effective method to profile gene expression in large scale. LINCS Canvas Browser (LCB) is an interactive HTML5 web-based software application that facilitates querying, browsing and interrogating many of the currently available LINCS L1000 data. LCB implements two compacted layered canvases, one to visualize clustered L1000 expression data, and the other to display enrichment analysis results using 30 different gene set libraries. Clicking on an experimental condition highlights gene-sets enriched for the differentially expressed genes from the selected experiment. A search interface allows users to input gene lists and query them against over 100 000 conditions to find the top matching experiments. The tool integrates many resources for an unprecedented potential for new discoveries in systems biology and systems pharmacology. The LCB application is available at http://www.maayanlab.net/LINCS/LCB. Customized versions will be made part of the http://lincscloud.org and http://lincs.hms.harvard.edu websites. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Molina-Pinelo, Sonia; Suárez, Rocío; Pastor, María Dolores; Nogal, Ana; Márquez-Martín, Eduardo; Martín-Juan, José; Carnero, Amancio; Paz-Ares, Luis
2012-01-01
The identification of new less invasive biomarkers is necessary to improve the detection and prognostic outcome of respiratory pathological processes. The measurement of miRNA expression through less invasive techniques such as plasma and serum have been suggested to analysis of several lung malignancies including lung cancer. These studies are assuming a common deregulated miRNA expression both in blood and lung tissue. The present study aimed to obtain miRNA representative signatures both in plasma and bronchoalveolar cell fraction that could serve as biomarker in respiratory diseases. Ten patients were evaluated to assess the expression levels of 381 miRNAs. We found that around 50% miRNAs were no detected in both plasma and bronchoalveolar cell fraction and only 20% of miRNAs showed similar expression in both samples. These results show a lack of association of miRNA signatures between plasma and bronchoalveolar cytology in the same patient. The profiles are not comparable; however, there is a similarity in the relative expression in a very small subset of miRNAs (miR-17, miR-19b, miR-195 and miR-20b) between both biological samples in all patients. This finding supports that the miRNAs profiles obtained from different biological samples have to be carefully validated to link with respiratory diseases.
Flechner, Stuart M.; Kurian, Sunil M.; Head, Steven R.; Sharp, Starlette M.; Whisenant, Thomas C.; Zhang, Jie; Chismar, Jeffrey D.; Horvath, Steve; Mondala, Tony; Gilmartin, Timothy; Cook, Daniel J.; Kay, Steven A.; Walker, John R.; Salomon, Daniel R.
2007-01-01
A major challenge for kidney transplantation is balancing the need for immunosuppression to prevent rejection, while minimizing drug-induced toxicities. We used DNA microarrays (HG-U95Av2 GeneChips, Affymetrix) to determine gene expression profiles for kidney biopsies and peripheral blood lymphocytes (PBLs) in transplant patients including normal donor kidneys, well-functioning transplants without rejection, kidneys undergoing acute rejection, and transplants with renal dysfunction without rejection. We developed a data analysis schema based on expression signal determination, class comparison and prediction, hierarchical clustering, statistical power analysis and real-time quantitative PCR validation. We identified distinct gene expression signatures for both biopsies and PBLs that correlated significantly with each of the different classes of transplant patients. This is the most complete report to date using commercial arrays to identify unique expression signatures in transplant biopsies distinguishing acute rejection, acute dysfunction without rejection and well-functioning transplants with no rejection history. We demonstrate for the first time the successful application of high density DNA chip analysis of PBL as a diagnostic tool for transplantation. The significance of these results, if validated in a multicenter prospective trial, would be the establishment of a metric based on gene expression signatures for monitoring the immune status and immunosuppression of transplanted patients. PMID:15307835
Pediatric Crohn disease patients exhibit specific ileal transcriptome and microbiome signature.
Haberman, Yael; Tickle, Timothy L; Dexheimer, Phillip J; Kim, Mi-Ok; Tang, Dora; Karns, Rebekah; Baldassano, Robert N; Noe, Joshua D; Rosh, Joel; Markowitz, James; Heyman, Melvin B; Griffiths, Anne M; Crandall, Wallace V; Mack, David R; Baker, Susan S; Huttenhower, Curtis; Keljo, David J; Hyams, Jeffrey S; Kugathasan, Subra; Walters, Thomas D; Aronow, Bruce; Xavier, Ramnik J; Gevers, Dirk; Denson, Lee A
2014-08-01
Interactions between the host and gut microbial community likely contribute to Crohn disease (CD) pathogenesis; however, direct evidence for these interactions at the onset of disease is lacking. Here, we characterized the global pattern of ileal gene expression and the ileal microbial community in 359 treatment-naive pediatric patients with CD, patients with ulcerative colitis (UC), and control individuals. We identified core gene expression profiles and microbial communities in the affected CD ilea that are preserved in the unaffected ilea of patients with colon-only CD but not present in those with UC or control individuals; therefore, this signature is specific to CD and independent of clinical inflammation. An abnormal increase of antimicrobial dual oxidase (DUOX2) expression was detected in association with an expansion of Proteobacteria in both UC and CD, while expression of lipoprotein APOA1 gene was downregulated and associated with CD-specific alterations in Firmicutes. The increased DUOX2 and decreased APOA1 gene expression signature favored oxidative stress and Th1 polarization and was maximally altered in patients with more severe mucosal injury. A regression model that included APOA1 gene expression and microbial abundance more accurately predicted month 6 steroid-free remission than a model using clinical factors alone. These CD-specific host and microbe profiles identify the ileum as the primary inductive site for all forms of CD and may direct prognostic and therapeutic approaches.
Mina, Eleni; van Roon-Mom, Willeke; Hettne, Kristina; van Zwet, Erik; Goeman, Jelle; Neri, Christian; A C 't Hoen, Peter; Mons, Barend; Roos, Marco
2016-08-01
Huntington's disease (HD) is a devastating brain disorder with no effective treatment or cure available. The scarcity of brain tissue makes it hard to study changes in the brain and impossible to perform longitudinal studies. However, peripheral pathology in HD suggests that it is possible to study the disease using peripheral tissue as a monitoring tool for disease progression and/or efficacy of novel therapies. In this study, we investigated if blood can be used to monitor disease severity and progression in brain. Since previous attempts using only gene expression proved unsuccessful, we compared blood and brain Huntington's disease signatures in a functional context. Microarray HD gene expression profiles from three brain regions were compared to the transcriptome of HD blood generated by next generation sequencing. The comparison was performed with a combination of weighted gene co-expression network analysis and literature based functional analysis (Concept Profile Analysis). Uniquely, our comparison of blood and brain datasets was not based on (the very limited) gene overlap but on the similarity between the gene annotations in four different semantic categories: "biological process", "cellular component", "molecular function" and "disease or syndrome". We identified signatures in HD blood reflecting a broad pathophysiological spectrum, including alterations in the immune response, sphingolipid biosynthetic processes, lipid transport, cell signaling, protein modification, spliceosome, RNA splicing, vesicle transport, cell signaling and synaptic transmission. Part of this spectrum was reminiscent of the brain pathology. The HD signatures in caudate nucleus and BA4 exhibited the highest similarity with blood, irrespective of the category of semantic annotations used. BA9 exhibited an intermediate similarity, while cerebellum had the least similarity. We present two signatures that were shared between blood and brain: immune response and spinocerebellar ataxias. Our results demonstrate that HD blood exhibits dysregulation that is similar to brain at a functional level, but not necessarily at the level of individual genes. We report two common signatures that can be used to monitor the pathology in brain of HD patients in a non-invasive manner. Our results are an exemplar of how signals in blood data can be used to represent brain disorders. Our methodology can be used to study disease specific signatures in diseases where heterogeneous tissues are involved in the pathology.
2013-01-01
Background Alterations in epigenetic marks, including methylation or acetylation, are common in human cancers. For many epigenetic pathways, however, direct measures of activity are unknown, making their role in various cancers difficult to assess. Gene expression signatures facilitate the examination of patterns of epigenetic pathway activation across and within human cancer types allowing better understanding of the relationships between these pathways. Methods We used Bayesian regression to generate gene expression signatures from normal epithelial cells before and after epigenetic pathway activation. Signatures were applied to datasets from TCGA, GEO, CaArray, ArrayExpress, and the cancer cell line encyclopedia. For TCGA data, signature results were correlated with copy number variation and DNA methylation changes. GSEA was used to identify biologic pathways related to the signatures. Results We developed and validated signatures reflecting downstream effects of enhancer of zeste homolog 2(EZH2), histone deacetylase(HDAC) 1, HDAC4, sirtuin 1(SIRT1), and DNA methyltransferase 2(DNMT2). By applying these signatures to data from cancer cell lines and tumors in large public repositories, we identify those cancers that have the highest and lowest activation of each of these pathways. Highest EZH2 activation is seen in neuroblastoma, hepatocellular carcinoma, small cell lung cancer, and melanoma, while highest HDAC activity is seen in pharyngeal cancer, kidney cancer, and pancreatic cancer. Across all datasets studied, activation of both EZH2 and HDAC4 is significantly underrepresented. Using breast cancer and glioblastoma as examples to examine intrinsic subtypes of particular cancers, EZH2 activation was highest in luminal breast cancers and proneural glioblastomas, while HDAC4 activation was highest in basal breast cancer and mesenchymal glioblastoma. EZH2 and HDAC4 activation are associated with particular chromosome abnormalities: EZH2 activation with aberrations in genes from the TGF and phosphatidylinositol pathways and HDAC4 activation with aberrations in inflammatory and chemokine related genes. Conclusion Gene expression patterns can reveal the activation level of epigenetic pathways. Epigenetic pathways define biologically relevant subsets of human cancers. EZH2 activation and HDAC4 activation correlate with growth factor signaling and inflammation, respectively, and represent two distinct states for cancer cells. This understanding may allow us to identify targetable drivers in these cancer subsets. PMID:24079712
Optimization Of Engine Heat Transfer Mechanisms For Ground Combat Vehicle Signature Models
NASA Astrophysics Data System (ADS)
Gonda, T.; Rogers, P.; Gerhart, G.; Reynolds, W. R.
1988-08-01
A thermodynamic model for predicting the behavior of selected internal thermal sources of an M2 Bradley Infantry Fighting Vehicle is described. The modeling methodology is expressed in terms of first principle heat transfer equations along with a brief history of TACOM's experience with thermal signature modeling techniques. The dynamic operation of the internal thermal sources is presented along with limited test data and an examination of their effect on the vehicle signature.
Geology and forestry classification from ERTS-1 digital data
NASA Technical Reports Server (NTRS)
Lawrence, R. D.; Herzog, J. H.
1975-01-01
Computer classifications into seven and ten classes of two areas in central Oregon of interest to geology and forestry demonstrate the extraction of information from ERTS-1 data. The area around Newberry Caldera was classified into basalt, rhyolite obsidian, pumice flats, Newberry pumice, ponderosa pine, lodgepole pine, and water classes. The area around Mt. Washington was classified into two basalts, three forest, two clearcut, burn, snow, and water classes. Both also include an unclassified category. Significant details that cannot be extracted from photographic reconstitutions of the data emerge from these classifications, such as moraine locations and paleowind directions. Spectral signatures for the various rocks are comparable to those published elsewhere.
Specific molecular signatures predict decitabine response in chronic myelomonocytic leukemia
Meldi, Kristen; Qin, Tingting; Buchi, Francesca; Droin, Nathalie; Sotzen, Jason; Micol, Jean-Baptiste; Selimoglu-Buet, Dorothée; Masala, Erico; Allione, Bernardino; Gioia, Daniela; Poloni, Antonella; Lunghi, Monia; Solary, Eric; Abdel-Wahab, Omar; Santini, Valeria; Figueroa, Maria E.
2015-01-01
Myelodysplastic syndromes and chronic myelomonocytic leukemia (CMML) are characterized by mutations in genes encoding epigenetic modifiers and aberrant DNA methylation. DNA methyltransferase inhibitors (DMTis) are used to treat these disorders, but response is highly variable, with few means to predict which patients will benefit. Here, we examined baseline differences in mutations, DNA methylation, and gene expression in 40 CMML patients who were responsive or resistant to decitabine (DAC) in order to develop a molecular means of predicting response at diagnosis. While somatic mutations did not differentiate responders from nonresponders, we identified 167 differentially methylated regions (DMRs) of DNA at baseline that distinguished responders from nonresponders using next-generation sequencing. These DMRs were primarily localized to nonpromoter regions and overlapped with distal regulatory enhancers. Using the methylation profiles, we developed an epigenetic classifier that accurately predicted DAC response at the time of diagnosis. Transcriptional analysis revealed differences in gene expression at diagnosis between responders and nonresponders. In responders, the upregulated genes included those that are associated with the cell cycle, potentially contributing to effective DAC incorporation. Treatment with CXCL4 and CXCL7, which were overexpressed in nonresponders, blocked DAC effects in isolated normal CD34+ and primary CMML cells, suggesting that their upregulation contributes to primary DAC resistance. PMID:25822018
Raherison, Elie S M; Giguère, Isabelle; Caron, Sébastien; Lamara, Mebarek; MacKay, John J
2015-07-01
Transcript profiling has shown the molecular bases of several biological processes in plants but few studies have developed an understanding of overall transcriptome variation. We investigated transcriptome structure in white spruce (Picea glauca), aiming to delineate its modular organization and associated functional and evolutionary attributes. Microarray analyses were used to: identify and functionally characterize groups of co-expressed genes; investigate expressional and functional diversity of vascular tissue preferential genes which were conserved among Picea species, and identify expression networks underlying wood formation. We classified 22 857 genes as variable (79%; 22 coexpression groups) or invariant (21%) by profiling across several vegetative tissues. Modular organization and complex transcriptome restructuring among vascular tissue preferential genes was revealed by their assignment to coexpression groups with partially overlapping profiles and partially distinct functions. Integrated analyses of tissue-based and temporally variable profiles identified secondary xylem gene networks, showed their remodelling over a growing season and identified PgNAC-7 (no apical meristerm (NAM), Arabidopsis transcription activation factor (ATAF) and cup-shaped cotyledon (CUC) transcription factor 007 in Picea glauca) as a major hub gene specific to earlywood formation. Reference profiling identified comprehensive, statistically robust coexpressed groups, revealing that modular organization underpins the evolutionary conservation of the transcriptome structure. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
A 3-Protein Expression Signature of Neuroblastoma for Outcome Prediction.
Xie, Yi; Xu, Hua; Fang, Fang; Li, Zhiheng; Zhou, Huiting; Pan, Jian; Guo, Wanliang; Zhu, Xueming; Wang, Jian; Wu, Yi
2018-05-22
Neuroblastoma (NB) is the most common extracranial solid tumor in children with contrasting outcomes. Precise risk assessment contributes to prognosis prediction, which is critical for treatment strategy decisions. In this study, we developed a 3-protein predictor model, including the neural stem cell marker Msi1, neural differentiation marker ID1, and proliferation marker proliferating cell nuclear antigen (PCNA), to improve clinical risk assessment of patients with NB. Kaplan-Meier analysis in the microarray data (GSE16476) revealed that low expression of ID1 and high expression of Msi1 and PCNA were associated with poor prognosis in NB patients. Combined application of these 3 markers to constitute a signature further stratified NB patients into different risk subgroups can help obtain more accurate prediction performance. Survival prognostic power of age and Msi1_ID1_PCNA signature by receiver operating characteristics analysis showed that this signature predicted more effectively and sensitively compared with classic risk stratification system, compensating for the deficiency of the prediction function of the age. Furthermore, we validated the expressions of these 3 proteins in neuroblastic tumor spectrum tissues by immunohistochemistry revealed that Msi1 and PCNA exhibited increased expression in NB compared with intermedial ganglioneuroblastoma and benign ganglioneuroma, whereas ID1 levels were reduced in NB. In conclusion, we established a robust risk assessment predictor model based on simple immunohistochemistry for therapeutic decisions of NB patients.This is an open-access article distributed under the terms of the Creative Commons Attribution-Non Commercial-No Derivatives License 4.0 (CCBY-NC-ND), where it is permissible to download and share the work provided it is properly cited. The work cannot be changed in any way or used commercially without permission from the journal. http://creativecommons.org/licenses/by-nc-nd/4.0/.
Bouquet, Jerome; Soloski, Mark J.; Swei, Andrea; Cheadle, Chris; Federman, Scot; Billaud, Jean-Noel; Rebman, Alison W.; Kabre, Beniwende; Halpert, Richard; Boorgula, Meher
2016-01-01
ABSTRACT Lyme disease is a tick-borne illness caused by the bacterium Borrelia burgdorferi, and approximately 10 to 20% of patients report persistent symptoms lasting months to years despite appropriate treatment with antibiotics. To gain insights into the molecular basis of acute Lyme disease and the ensuing development of post-treatment symptoms, we conducted a longitudinal transcriptome study of 29 Lyme disease patients (and 13 matched controls) enrolled at the time of diagnosis and followed for up to 6 months. The differential gene expression signature of Lyme disease following the acute phase of infection persisted for at least 3 weeks and had fewer than 44% differentially expressed genes (DEGs) in common with other infectious or noninfectious syndromes. Early Lyme disease prior to antibiotic therapy was characterized by marked upregulation of Toll-like receptor signaling but lack of activation of the inflammatory T-cell apoptotic and B-cell developmental pathways seen in other acute infectious syndromes. Six months after completion of therapy, Lyme disease patients were found to have 31 to 60% of their pathways in common with three different immune-mediated chronic diseases. No differential gene expression signature was observed between Lyme disease patients with resolved illness to those with persistent symptoms at 6 months post-treatment. The identification of a sustained differential gene expression signature in Lyme disease suggests that a panel of selected human host-based biomarkers may address the need for sensitive clinical diagnostics during the “window period” of infection prior to the appearance of a detectable antibody response and may also inform the development of new therapeutic targets. PMID:26873097
Hingorani, Pooja; Missiaglia, Edoardo; Shipley, Janet; Anderson, James R; Triche, Timothy J; Delorenzi, Mauro; Gastier-Foster, Julie; Wing, Michele; Hawkins, Douglas S; Skapek, Stephen X
2015-10-15
Pediatric rhabdomyosarcoma (RMS) has two common histologic subtypes: embryonal (ERMS) and alveolar (ARMS). PAX-FOXO1 fusion gene status is a more reliable prognostic marker than alveolar histology, whereas fusion gene-negative (FN) ARMS patients are clinically similar to ERMS patients. A five-gene expression signature (MG5) previously identified two diverse risk groups within the fusion gene-negative RMS (FN-RMS) patients, but this has not been independently validated. The goal of this study was to test whether expression of the MG5 metagene, measured using a technical platform that can be applied to routine pathology material, would correlate with outcome in a new cohort of patients with FN-RMS. Cases were taken from the Children's Oncology Group (COG) D9803 study of children with intermediate-risk RMS, and gene expression profiling for the MG5 genes was performed using the nCounter assay. The MG5 score was correlated with clinical and pathologic characteristics as well as overall and event-free survival. MG5 standardized score showed no significant association with any of the available clinicopathologic variables. The MG5 signature score showed a significant correlation with overall (N = 57; HR, 7.3; 95% CI, 1.9-27.0; P = 0.003) and failure-free survival (N = 57; HR, 6.1; 95% CI, 1.9-19.7; P = 0.002). This represents the first, validated molecular prognostic signature for children with FN-RMS who otherwise have intermediate-risk disease. The capacity to measure the expression of a small number of genes in routine pathology material and apply a simple mathematical formula to calculate the MG5 metagene score provides a clear path toward better risk stratification in future prospective clinical trials. ©2015 American Association for Cancer Research.
Signatures of tumour immunity distinguish Asian and non-Asian gastric adenocarcinomas
Lin, Suling J; Gagnon-Bartsch, Johann A; Tan, Iain Beehuat; Earle, Sophie; Ruff, Louise; Pettinger, Katherine; Ylstra, Bauke; van Grieken, Nicole; Rha, Sun Young; Chung, Hyun Cheol; Lee, Ju-Seog; Cheong, Jae Ho; Noh, Sung Hoon; Aoyama, Toru; Miyagi, Yohei; Tsuburaya, Akira; Yoshikawa, Takaki; Ajani, Jaffer A; Boussioutas, Alex; Yeoh, Khay Guan; Yong, Wei Peng; So, Jimmy; Lee, Jeeyun; Kang, Won Ki; Kim, Sung; Kameda, Yoichi; Arai, Tomio; zur Hausen, Axel; Speed, Terence P; Grabsch, Heike I; Tan, Patrick
2015-01-01
Objective Differences in gastric cancer (GC) clinical outcomes between patients in Asian and non-Asian countries has been historically attributed to variability in clinical management. However, recent international Phase III trials suggest that even with standardised treatments, GC outcomes differ by geography. Here, we investigated gene expression differences between Asian and non-Asian GCs, and if these molecular differences might influence clinical outcome. Design We compared gene expression profiles of 1016 GCs from six Asian and three non-Asian GC cohorts, using a two-stage meta-analysis design and a novel biostatistical method (RUV-4) to adjust for technical variation between cohorts. We further validated our findings by computerised immunohistochemical analysis on two independent tissue microarray (TMA) cohorts from Asian and non-Asian localities (n=665). Results Gene signatures differentially expressed between Asians and non-Asian GCs were related to immune function and inflammation. Non-Asian GCs were significantly enriched in signatures related to T-cell biology, including CTLA-4 signalling. Similarly, in the TMA cohorts, non-Asian GCs showed significantly higher expression of T-cell markers (CD3, CD45R0, CD8) and lower expression of the immunosuppressive T-regulatory cell marker FOXP3 compared to Asian GCs (p<0.05). Inflammatory cell markers CD66b and CD68 also exhibited significant cohort differences (p<0.05). Exploratory analyses revealed a significant relationship between tumour immunity factors, geographic locality-specific prognosis, and postchemotherapy outcomes. Conclusions Analyses of >1600 GCs suggest that Asian and non-Asian GCs exhibit distinct tumour immunity signatures related to T-cell function. These differences may influence geographical differences in clinical outcome, and the design of future trials particularly in immuno-oncology. PMID:25385008
Homuth, Georg; Wahl, Simone; Müller, Christian; Schurmann, Claudia; Mäder, Ulrike; Blankenberg, Stefan; Carstensen, Maren; Dörr, Marcus; Endlich, Karlhans; Englbrecht, Christian; Felix, Stephan B; Gieger, Christian; Grallert, Harald; Herder, Christian; Illig, Thomas; Kruppa, Jochen; Marzi, Carola S; Mayerle, Julia; Meitinger, Thomas; Metspalu, Andres; Nauck, Matthias; Peters, Annette; Rathmann, Wolfgang; Reinmaa, Eva; Rettig, Rainer; Roden, Michael; Schillert, Arne; Schramm, Katharina; Steil, Leif; Strauch, Konstantin; Teumer, Alexander; Völzke, Henry; Wallaschofski, Henri; Wild, Philipp S; Ziegler, Andreas; Völker, Uwe; Prokisch, Holger; Zeller, Tanja
2015-10-15
Obesity, defined as pathologically increased body mass index (BMI), is strongly related to an increased risk for numerous common cardiovascular and metabolic diseases. It is particularly associated with insulin resistance, hyperglycemia, and systemic oxidative stress and represents the most important risk factor for type 2 diabetes (T2D). However, the pathophysiological mechanisms underlying these associations are still not completely understood. Therefore, in order to identify potentially disease-relevant BMI-associated gene expression signatures, a transcriptome-wide association study (TWAS) on BMI was performed. Whole-blood mRNA levels determined by array-based transcriptional profiling were correlated with BMI in two large independent population-based cohort studies (KORA F4 and SHIP-TREND) comprising a total of 1977 individuals. Extensive alterations of the whole-blood transcriptome were associated with BMI: More than 3500 transcripts exhibited significant positive or negative BMI-correlation. Three major whole-blood gene expression signatures associated with increased BMI were identified. The three signatures suggested: i) a ratio shift from mature erythrocytes towards reticulocytes, ii) decreased expression of several genes essentially involved in the transmission and amplification of the insulin signal, and iii) reduced expression of several key genes involved in the defence against reactive oxygen species (ROS). Whereas the first signature confirms published results, the other two provide possible mechanistic explanations for well-known epidemiological findings under conditions of increased BMI, namely attenuated insulin signaling and increased oxidative stress. The putatively causative BMI-dependent down-regulation of the expression of numerous genes on the mRNA level represents a novel finding. BMI-associated negative transcriptional regulation of insulin signaling and oxidative stress management provide new insights into the pathogenesis of metabolic syndrome and T2D.
Cross-study projections of genomic biomarkers: an evaluation in cancer genomics.
Lucas, Joseph E; Carvalho, Carlos M; Chen, Julia Ling-Yu; Chi, Jen-Tsan; West, Mike
2009-01-01
Human disease studies using DNA microarrays in both clinical/observational and experimental/controlled studies are having increasing impact on our understanding of the complexity of human diseases. A fundamental concept is the use of gene expression as a "common currency" that links the results of in vitro controlled experiments to in vivo observational human studies. Many studies--in cancer and other diseases--have shown promise in using in vitro cell manipulations to improve understanding of in vivo biology, but experiments often simply fail to reflect the enormous phenotypic variation seen in human diseases. We address this with a framework and methods to dissect, enhance and extend the in vivo utility of in vitro derived gene expression signatures. From an experimentally defined gene expression signature we use statistical factor analysis to generate multiple quantitative factors in human cancer gene expression data. These factors retain their relationship to the original, one-dimensional in vitro signature but better describe the diversity of in vivo biology. In a breast cancer analysis, we show that factors can reflect fundamentally different biological processes linked to molecular and clinical features of human cancers, and that in combination they can improve prediction of clinical outcomes.
Molecular Pathways: Extracting Medical Knowledge from High Throughput Genomic Data
Goldstein, Theodore; Paull, Evan O.; Ellis, Matthew J.; Stuart, Joshua M.
2013-01-01
High-throughput genomic data that measures RNA expression, DNA copy number, mutation status and protein levels provide us with insights into the molecular pathway structure of cancer. Genomic lesions (amplifications, deletions, mutations) and epigenetic modifications disrupt biochemical cellular pathways. While the number of possible lesions is vast, different genomic alterations may result in concordant expression and pathway activities, producing common tumor subtypes that share similar phenotypic outcomes. How can these data be translated into medical knowledge that provides prognostic and predictive information? First generation mRNA expression signatures such as Genomic Health's Oncotype DX already provide prognostic information, but do not provide therapeutic guidance beyond the current standard of care – which is often inadequate in high-risk patients. Rather than building molecular signatures based on gene expression levels, evidence is growing that signatures based on higher-level quantities such as from genetic pathways may provide important prognostic and diagnostic cues. We provide examples of how activities for molecular entities can be predicted from pathway analysis and how the composite of all such activities, referred to here as the “activitome,” help connect genomic events to clinical factors in order to predict the drivers of poor outcome. PMID:23430023
Kurashige, Junji; Hasegawa, Takanori; Niida, Atsushi; Sugimachi, Keishi; Deng, Niantao; Mima, Kosuke; Uchi, Ryutaro; Sawada, Genta; Takahashi, Yusuke; Eguchi, Hidetoshi; Inomata, Masashi; Kitano, Seigo; Fukagawa, Takeo; Sasako, Mitsuru; Sasaki, Hiroki; Sasaki, Shin; Mori, Masaki; Yanagihara, Kazuyoshi; Baba, Hideo; Miyano, Satoru; Tan, Patrick; Mimori, Koshi
2016-03-03
Peritoneal dissemination is the most frequent, incurable metastasis occurring in patients with advanced gastric cancer (GC). However, molecular mechanisms driving peritoneal dissemination still remain poorly understood. Here, we aimed to provide novel insights into the molecular mechanisms that drive the peritoneal dissemination of GC. We performed combined expression analysis with in vivo-selected metastatic cell lines and samples from 200 GC patients to identify driver genes of peritoneal dissemination. The driver-gene functions associated with GC dissemination were examined using a mouse xenograft model. We identified a peritoneal dissemination-associated expression signature, whose profile correlated with those of genes related to development, focal adhesion, and the extracellular matrix. Among the genes comprising the expression signature, we identified that discoidin-domain receptor 2 (DDR2) as a potential regulator of peritoneal dissemination. The DDR2 was upregulated by the loss of DNA methylation and that DDR2 knockdown reduced peritoneal metastasis in a xenograft model. Dasatinib, an inhibitor of the DDR2 signaling pathway, effectively suppressed peritoneal dissemination. DDR2 was identified as a driver gene for GC dissemination from the combined expression signature and can potentially serve as a novel therapeutic target for inhibiting GC peritoneal dissemination.
Tuhtan, Jeffrey A; Fuentes-Perez, Juan Francisco; Toming, Gert; Schneider, Matthias; Schwarzenberger, Richard; Schletterer, Martin; Kruusmaa, Maarja
2018-05-25
The lateral line system provides fish with advanced mechanoreception over a wide range of flow conditions. Inspired by the abilities of their biological counterparts, artificial lateral lines have been developed and tested exclusively under laboratory settings. Motivated by the lack of flow measurements taken in the field which consider fluid-body interactions, we built a fish-shaped lateral line probe. The device is outfitted with 11 high-speed (2.5 kHz) time-synchronized pressure transducers, and designed to capture and classify flows in fish passage structures. A total of 252 field measurements, each with a sample size of 132 000 discrete sensor readings were recorded in the slots and across the pools of vertical slot fishways. These data were used to estimate the time-averaged flow velocity (R 2 = 0.952), which represents the most common metric to assess fishway flows. The significant contribution of this work is the creation and application of hydrodynamic signatures generated by the spatial distribution of pressure fluctuations on the fish-shaped body. The signatures are based on the collection of the pressure fluctuations' probability distributions, and it is shown that they can be used to automatically classify distinct flow regions within the pools of three different vertical slot fishways. For the first time, field data from operational fishway measurements are sampled and classified using an artificial lateral line, providing a completely new source of bioinspired flow information.
NASA Astrophysics Data System (ADS)
Wolf, Nils; Hof, Angela
2012-10-01
Urban sprawl driven by shifts in tourism development produces new suburban landscapes of water consumption on Mediterranean coasts. Golf courses, ornamental, 'Atlantic' gardens and swimming pools are the most striking artefacts of this transformation, threatening the local water supply systems and exacerbating water scarcity. In the face of climate change, urban landscape irrigation is becoming increasingly important from a resource management point of view. This paper adopts urban remote sensing towards a targeted mapping approach using machine learning techniques and highresolution satellite imagery (WorldView-2) to generate GIS-ready information for urban water consumption studies. Swimming pools, vegetation and - as a subgroup of vegetation - turf grass are extracted as important determinants of water consumption. For image analysis, the complex nature of urban environments suggests spatial-spectral classification, i.e. the complementary use of the spectral signature and spatial descriptors. Multiscale image segmentation provides means to extract the spatial descriptors - namely object feature layers - which can be concatenated at pixel level to the spectral signature. This study assesses the value of object features using different machine learning techniques and amounts of labeled information for learning. The results indicate the benefit of the spatial-spectral approach if combined with appropriate classifiers like tree-based ensembles or support vector machines, which can handle high dimensionality. Finally, a Random Forest classifier was chosen to deliver the classified input data for the estimation of evaporative water loss and net landscape irrigation requirements.
Guo, Lin; Frost, Michael R; Siegwart, John T; Norton, Thomas T
2014-01-01
During postnatal refractive development, the sclera receives retinally generated signals that regulate its biochemical properties. Hyperopic refractive error causes the retina to produce "GO" signals that, through the direct emmetropization pathway, cause scleral remodeling that increases the axial elongation rate of the eye, reducing the hyperopia. Myopia causes the retina to generate "STOP" signals that produce scleral remodeling, slowing the axial elongation rate and reducing the myopia. Our aim was to compare the pattern of gene expression produced in the sclera by the STOP signals with the GO gene expression signature we described previously. The GO gene expression signature was produced by monocular -5 diopter (D) lens wear for 2 days (ML-2) or 4 days (ML-4); an additional "STAY" condition was examined after eyes had fully compensated for a -5 D lens after 11 days of lens wear (ML-11). After 11 days of -5 D lens wear had produced full refractive compensation, gene expression in the STOP condition was examined during recovery (without the lens) for 2 days (REC-2) or 4 days (REC-4). The untreated contralateral eyes served as a control in all groups. Two age-matched normal groups provided a comparison with the treated groups. Quantitative real-time PCR was used to measure mRNA levels for 55 candidate genes. The STAY group compensated fully for the lens (treated eye versus control eye, -5.1±0.2 D). Wearing the lens, the hyperopic signal for elongation had dissipated (-0.3±0.3 D). In the STOP groups, the refraction in the recovering eyes became less myopic relative to the control eyes (REC-2, +1.3±0.3 D; REC-4, +2.6±0.4 D). In the STAY group, three genes showed significant downregulation. However, many genes that were significantly altered in GO showed smaller, nonsignificant, expression differences in the same direction in STAY, suggesting the gene expression signature in STAY is a greatly weakened form of the GO signature. In the STOP groups, a different gene expression pattern was observed, characterized by mostly upregulation with larger fold differences after 4 days than after 2 days of recovery. Eleven of the 55 genes examined showed significant bidirectional GO/STOP regulation in the ML-2 and REC-2 groups, and 13 genes showed bidirectional regulation in the ML-4 and REC-4 groups. Eight of these genes (NPR3, CAPNS1, NGEF, TGFB1, CTGF, NOV, TIMP1, and HS6ST1) were bidirectionally regulated at both time points in the GO and STOP conditions. An additional 15 genes showed significant regulation in either GO or STOP conditions but not in both. Many genes are involved in scleral remodeling and the control of axial length. The STOP (recovery) gene expression signature in the sclera involves some of the same genes, bidirectionally regulated, as the GO signature. However, other genes, regulated in GO, are not differentially regulated in STOP, and others show differential regulation only in STOP.
NASA Technical Reports Server (NTRS)
Kriegler, F. J.
1974-01-01
The MIDAS System is described as a third-generation fast multispectral recognition system able to keep pace with the large quantity and high rates of data acquisition from present and projected sensors. A principal objective of the MIDAS program is to provide a system well interfaced with the human operator and thus to obtain large overall reductions in turnaround time and significant gains in throughput. The hardware and software are described. The system contains a mini-computer to control the various high-speed processing elements in the data path, and a classifier which implements an all-digital prototype multivariate-Gaussian maximum likelihood decision algorithm operating at 200,000 pixels/sec. Sufficient hardware was developed to perform signature extraction from computer-compatible tapes, compute classifier coefficients, control the classifier operation, and diagnose operation.
NASA Technical Reports Server (NTRS)
Freeman, A.; Villasenor, J.; Klein, J. D.
1991-01-01
We describe the calibration and analysis of multi-frequency, multi-polarization radar backscatter signatures over an agriculture test site in the Netherlands. The calibration procedure involved two stages: in the first stage, polarimetric and radiometric calibrations (ignoring noise) were carried out using square-base trihedral corner reflector signatures and some properties of the clutter background. In the second stage, a novel algorithm was used to estimate the noise level in the polarimetric data channels by using the measured signature of an idealized rough surface with Bragg scattering (the ocean in this case). This estimated noise level was then used to correct the measured backscatter signatures from the agriculture fields. We examine the significance of several key parameters extracted from the calibrated and noise-corrected backscatter signatures. The significance is assessed in terms of the ability to uniquely separate among classes from 13 different backscatter types selected from the test site data, including eleven different crops, one forest and one ocean area. Using the parameters with the highest separation for a given class, we use a hierarchical algorithm to classify the entire image. We find that many classes, including ocean, forest, potato, and beet, can be identified with high reliability, while the classes for which no single parameter exhibits sufficient separation have higher rates of misclassification. We expect that modified decision criteria involving simultaneous consideration of several parameters increase performance for these classes.
A time-frequency classifier for human gait recognition
NASA Astrophysics Data System (ADS)
Mobasseri, Bijan G.; Amin, Moeness G.
2009-05-01
Radar has established itself as an effective all-weather, day or night sensor. Radar signals can penetrate walls and provide information on moving targets. Recently, radar has been used as an effective biometric sensor for classification of gait. The return from a coherent radar system contains a frequency offset in the carrier frequency, known as the Doppler Effect. The movements of arms and legs give rise to micro Doppler which can be clearly detailed in the time-frequency domain using traditional or modern time-frequency signal representation. In this paper we propose a gait classifier based on subspace learning using principal components analysis(PCA). The training set consists of feature vectors defined as either time or frequency snapshots taken from the spectrogram of radar backscatter. We show that gait signature is captured effectively in feature vectors. Feature vectors are then used in training a minimum distance classifier based on Mahalanobis distance metric. Results show that gait classification with high accuracy and short observation window is achievable using the proposed classifier.
Histogram Curve Matching Approaches for Object-based Image Classification of Land Cover and Land Use
Toure, Sory I.; Stow, Douglas A.; Weeks, John R.; Kumar, Sunil
2013-01-01
The classification of image-objects is usually done using parametric statistical measures of central tendency and/or dispersion (e.g., mean or standard deviation). The objectives of this study were to analyze digital number histograms of image objects and evaluate classifications measures exploiting characteristic signatures of such histograms. Two histograms matching classifiers were evaluated and compared to the standard nearest neighbor to mean classifier. An ADS40 airborne multispectral image of San Diego, California was used for assessing the utility of curve matching classifiers in a geographic object-based image analysis (GEOBIA) approach. The classifications were performed with data sets having 0.5 m, 2.5 m, and 5 m spatial resolutions. Results show that histograms are reliable features for characterizing classes. Also, both histogram matching classifiers consistently performed better than the one based on the standard nearest neighbor to mean rule. The highest classification accuracies were produced with images having 2.5 m spatial resolution. PMID:24403648
Suárez-Arroyo, Ivette J; Feliz-Mosquea, Yismeilin R; Pérez-Laspiur, Juliana; Arju, Rezina; Giashuddin, Shah; Maldonado-Martínez, Gerónimo; Cubano, Luis A; Schneider, Robert J; Martínez-Montemayor, Michelle M
2016-01-01
Inflammatory Breast Cancer (IBC) is the most lethal form of breast cancer with a 35% 5-year survival rate. The accurate and early diagnosis of IBC and the development of targeted therapy against this deadly disease remain a great medical challenge. Plasma membrane proteins (PMPs) such as E-cadherin and EGFR, play an important role in the progression of IBC. Because the critical role of PMPs in the oncogenic processes they are the perfect candidates as molecular markers and targets for cancer therapies. In the present study, Stable Isotope Labeling with Amino Acids in Cell Culture (SILAC) followed by mass spectrometry analysis was used to compare the relative expression levels of membrane proteins (MP) between non-cancerous mammary epithelial and IBC cells, MCF-10A and SUM-149, respectively. Six of the identified PMPs were validated by immunoblotting using the membrane fractions of non-IBC and IBC cell lines, compared with MCF-10A cells. Immunohistochemical analysis using IBC, invasive ductal carcinoma or normal mammary tissue samples was carried out to complete the validation method in nine of the PMPs. We identified and quantified 278 MPs, 76% of which classified as PMPs with 1.3-fold or higher change. We identified for the first time the overexpression of the novel plasminogen receptor, PLGRKT in IBC and of the carrier protein, SCAMP3. Furthermore, we describe the positive relationship between L1CAM expression and metastasis in IBC patients and the role of SCAMP3 as a tumor-related protein. Overall, the membrane proteomic signature of IBC reflects a global change in cellular organization and suggests additional strategies for cancer progression. Together, this study provides insight into the specialized IBC plasma membrane proteome with the potential to identify a number of novel therapeutic targets for IBC. PMID:27648361
Roy, Janine; Aust, Daniela; Knösel, Thomas; Rümmele, Petra; Jahnke, Beatrix; Hentrich, Vera; Rückert, Felix; Niedergethmann, Marco; Weichert, Wilko; Bahra, Marcus; Schlitt, Hans J.; Settmacher, Utz; Friess, Helmut; Büchler, Markus; Saeger, Hans-Detlev; Schroeder, Michael; Pilarsky, Christian; Grützmann, Robert
2012-01-01
Predicting the clinical outcome of cancer patients based on the expression of marker genes in their tumors has received increasing interest in the past decade. Accurate predictors of outcome and response to therapy could be used to personalize and thereby improve therapy. However, state of the art methods used so far often found marker genes with limited prediction accuracy, limited reproducibility, and unclear biological relevance. To address this problem, we developed a novel computational approach to identify genes prognostic for outcome that couples gene expression measurements from primary tumor samples with a network of known relationships between the genes. Our approach ranks genes according to their prognostic relevance using both expression and network information in a manner similar to Google's PageRank. We applied this method to gene expression profiles which we obtained from 30 patients with pancreatic cancer, and identified seven candidate marker genes prognostic for outcome. Compared to genes found with state of the art methods, such as Pearson correlation of gene expression with survival time, we improve the prediction accuracy by up to 7%. Accuracies were assessed using support vector machine classifiers and Monte Carlo cross-validation. We then validated the prognostic value of our seven candidate markers using immunohistochemistry on an independent set of 412 pancreatic cancer samples. Notably, signatures derived from our candidate markers were independently predictive of outcome and superior to established clinical prognostic factors such as grade, tumor size, and nodal status. As the amount of genomic data of individual tumors grows rapidly, our algorithm meets the need for powerful computational approaches that are key to exploit these data for personalized cancer therapies in clinical practice. PMID:22615549
Association of coral algal symbionts with a diverse viral community responsive to heat shock.
Brüwer, Jan D; Agrawal, Shobhit; Liew, Yi Jin; Aranda, Manuel; Voolstra, Christian R
2017-08-17
Stony corals provide the structural foundation of coral reef ecosystems and are termed holobionts given they engage in symbioses, in particular with photosynthetic dinoflagellates of the genus Symbiodinium. Besides Symbiodinium, corals also engage with bacteria affecting metabolism, immunity, and resilience of the coral holobiont, but the role of associated viruses is largely unknown. In this regard, the increase of studies using RNA sequencing (RNA-Seq) to assess gene expression provides an opportunity to elucidate viral signatures encompassed within the data via careful delineation of sequence reads and their source of origin. Here, we re-analyzed an RNA-Seq dataset from a cultured coral symbiont (Symbiodinium microadriaticum, Clade A1) across four experimental treatments (control, cold shock, heat shock, dark shock) to characterize associated viral diversity, abundance, and gene expression. Our approach comprised the filtering and removal of host sequence reads, subsequent phylogenetic assignment of sequence reads of putative viral origin, and the assembly and analysis of differentially expressed viral genes. About 15.46% (123 million) of all sequence reads were non-host-related, of which <1% could be classified as archaea, bacteria, or virus. Of these, 18.78% were annotated as virus and comprised a diverse community consistent across experimental treatments. Further, non-host related sequence reads assembled into 56,064 contigs, including 4856 contigs of putative viral origin that featured 43 differentially expressed genes during heat shock. The differentially expressed genes included viral kinases, ubiquitin, and ankyrin repeat proteins (amongst others), which are suggested to help the virus proliferate and inhibit the algal host's antiviral response. Our results suggest that a diverse viral community is associated with coral algal endosymbionts of the genus Symbiodinium, which prompts further research on their ecological role in coral health and resilience.
Ao, Lu; Guo, You; Song, Xuekun; Guan, Qingzhou; Zheng, Weicheng; Zhang, Jiahui; Huang, Haiyan; Zou, Yi; Guo, Zheng; Wang, Xianlong
2017-11-01
Concerns are raised about the representativeness of cell lines for tumours due to the culture environment and misidentification. Liver is a major metastatic destination of many cancers, which might further confuse the origin of hepatocellular carcinoma cell lines. Therefore, it is of crucial importance to understand how well they can represent hepatocellular carcinoma. The HCC-specific gene pairs with highly stable relative expression orderings in more than 99% of hepatocellular carcinoma but with reversed relative expression orderings in at least 99% of one of the six types of cancer, colorectal carcinoma, breast carcinoma, non-small-cell lung cancer, gastric carcinoma, pancreatic carcinoma and ovarian carcinoma, were identified. With the simple majority rule, the HCC-specific relative expression orderings from comparisons with colorectal carcinoma and breast carcinoma could exactly discriminate primary hepatocellular carcinoma samples from both primary colorectal carcinoma and breast carcinoma samples. Especially, they correctly classified more than 90% of liver metastatic samples from colorectal carcinoma and breast carcinoma to their original tumours. Finally, using these HCC-specific relative expression orderings from comparisons with six cancer types, we identified eight of 24 hepatocellular carcinoma cell lines in the Cancer Cell Line Encyclopedia (Huh-7, Huh-1, HepG2, Hep3B, JHH-5, JHH-7, C3A and Alexander cells) that are highly representative of hepatocellular carcinoma. Evaluated with a REOs-based prognostic signature for hepatocellular carcinoma, all these eight cell lines showed the same metastatic properties of the high-risk metastatic hepatocellular carcinoma tissues. Caution should be taken for using hepatocellular carcinoma cell lines. Our results should be helpful to select proper hepatocellular carcinoma cell lines for biological experiments. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Peripheral Blood Gene Expression as a Novel Genomic Biomarker in Complicated Sarcoidosis
Sweiss, Nadera J.; Chen, Edward S.; Moller, David R.; Knox, Kenneth S.; Ma, Shwu-Fan; Wade, Michael S.; Noth, Imre; Machado, Roberto F.; Garcia, Joe G. N.
2012-01-01
Sarcoidosis, a systemic granulomatous syndrome invariably affecting the lung, typically spontaneously remits but in ∼20% of cases progresses with severe lung dysfunction or cardiac and neurologic involvement (complicated sarcoidosis). Unfortunately, current biomarkers fail to distinguish patients with remitting (uncomplicated) sarcoidosis from other fibrotic lung disorders, and fail to identify individuals at risk for complicated sarcoidosis. We utilized genome-wide peripheral blood gene expression analysis to identify a 20-gene sarcoidosis biomarker signature distinguishing sarcoidosis (n = 39) from healthy controls (n = 35, 86% classification accuracy) and which served as a molecular signature for complicated sarcoidosis (n = 17). As aberrancies in T cell receptor (TCR) signaling, JAK-STAT (JS) signaling, and cytokine-cytokine receptor (CCR) signaling are implicated in sarcoidosis pathogenesis, a 31-gene signature comprised of T cell signaling pathway genes associated with sarcoidosis (TCR/JS/CCR) was compared to the unbiased 20-gene biomarker signature but proved inferior in prediction accuracy in distinguishing complicated from uncomplicated sarcoidosis. Additional validation strategies included significant association of single nucleotide polymorphisms (SNPs) in signature genes with sarcoidosis susceptibility and severity (unbiased signature genes - CX3CR1, FKBP1A, NOG, RBM12B, SENS3, TSHZ2; T cell/JAK-STAT pathway genes such as AKT3, CBLB, DLG1, IFNG, IL2RA, IL7R, ITK, JUN, MALT1, NFATC2, PLCG1, SPRED1). In summary, this validated peripheral blood molecular gene signature appears to be a valuable biomarker in identifying cases with sarcoidoisis and predicting risk for complicated sarcoidosis. PMID:22984568
DNA Copy Number Signature to Predict Recurrence in Early Stage Ovarian Cancer
2016-08-01
AWARD NUMBER: W81XWH-14-1-0194 TITLE: DNA Copy Number Signature to Predict Recurrence in Early-Stage Ovarian Cancer PRINCIPAL INVESTIGATOR...SUBTITLE 5a. CONTRACT NUMBER DNA Copy Number Signature to Predict Recurrence in Early-Stage Ovarian Cancer 5b. GRANT NUMBER W81XWH-14-1-0194 5c. PROGRAM...determine the copy number gain and loss for early stage high grade ovarian cancers through IlluminaHumanOmniExpress-FFPE BeadChip system • Subtask 1 DNA
Discovering causal signaling pathways through gene-expression patterns
Parikh, Jignesh R.; Klinger, Bertram; Xia, Yu; Marto, Jarrod A.; Blüthgen, Nils
2010-01-01
High-throughput gene-expression studies result in lists of differentially expressed genes. Most current meta-analyses of these gene lists include searching for significant membership of the translated proteins in various signaling pathways. However, such membership enrichment algorithms do not provide insight into which pathways caused the genes to be differentially expressed in the first place. Here, we present an intuitive approach for discovering upstream signaling pathways responsible for regulating these differentially expressed genes. We identify consistently regulated signature genes specific for signal transduction pathways from a panel of single-pathway perturbation experiments. An algorithm that detects overrepresentation of these signature genes in a gene group of interest is used to infer the signaling pathway responsible for regulation. We expose our novel resource and algorithm through a web server called SPEED: Signaling Pathway Enrichment using Experimental Data sets. SPEED can be freely accessed at http://speed.sys-bio.net/. PMID:20494976
Evaluating Machine Learning Classifiers for Hybrid Network Intrusion Detection Systems
2015-03-26
7 VRT Vulnerability Research Team...and the Talos (formerly the Vulnerability Research Team ( VRT )) [7] 7 ruleset libraries are the two leading rulesets in use. Both libraries offer paid...rule sets to load for the signature-based IDS. Snort is selected as the IDS engine using the “ VRT and ET No/GPL” rule set. The total rule count in the
Jinlong, Shi; Lin, Fu; Yonghui, Li; Li, Yu; Weidong, Wang
2015-01-01
Cytogenetically normal acute myeloid leukemia (CN-AML) is the largest and most heterogeneous AML subgroup. It lacks sensitive and specific biomarkers. Emerging evidences have suggested that microRNAs are involved in the pathogenesis of various leukemias. This paper evaluated the association between microRNA expression and prognostic outcome for CN-AML, based on the RNA/microRNA sequencing data of CN-AML patients. High let-7a-2-3p expression and low miR-188-5p expression were identified to be significantly associated with longer overall survival (OS) and event free survival (EFS) for CN-AML, independently or in a combined way. Their prognostic values were further confirmed in European Leukemia Net (ELN) genetic categories. Also, in multivariable analysis with other known risk factors, high let-7a-2-3p and low miR-188-5p expression remained to be associated with longer OS and EFS. In addition, the prognostic value of these two microRNAs was confirmed in patients who received hematopoietic stem cell transplantation (HSCT). To gain more biological insights of the underlying mechanisms, we derived the genome-wide differential gene/microRNA signatures associated with the expression of let-7a-2-3p and miR-188-5p. Several common microRNA signatures indicating favorable outcome in previous studies were up-regulated in both high let-7a-2-3p expressers and low miR-188-5p expressers, including miR-135a, miR-335 and miR-125b and all members of miR-181 family. Additionally, common up-regulated genes included FOSB, IGJ, SNORD50A and ZNF502, and FOSB was a known favorable signature in AML. These common signatures further confirmed the underlying common mechanisms for these two microRNAs value as favorable prognostic biomarkers. We concluded that high let-7a-2-3p and low miR-188-5p expression could be potentially used as favorably prognostic biomarkers independently or in a combined way in CN-AML patients, whether they received HSCT or not. PMID:25646775
Jinlong, Shi; Lin, Fu; Yonghui, Li; Li, Yu; Weidong, Wang
2015-01-01
Cytogenetically normal acute myeloid leukemia (CN-AML) is the largest and most heterogeneous AML subgroup. It lacks sensitive and specific biomarkers. Emerging evidences have suggested that microRNAs are involved in the pathogenesis of various leukemias. This paper evaluated the association between microRNA expression and prognostic outcome for CN-AML, based on the RNA/microRNA sequencing data of CN-AML patients. High let-7a-2-3p expression and low miR-188-5p expression were identified to be significantly associated with longer overall survival (OS) and event free survival (EFS) for CN-AML, independently or in a combined way. Their prognostic values were further confirmed in European Leukemia Net (ELN) genetic categories. Also, in multivariable analysis with other known risk factors, high let-7a-2-3p and low miR-188-5p expression remained to be associated with longer OS and EFS. In addition, the prognostic value of these two microRNAs was confirmed in patients who received hematopoietic stem cell transplantation (HSCT). To gain more biological insights of the underlying mechanisms, we derived the genome-wide differential gene/microRNA signatures associated with the expression of let-7a-2-3p and miR-188-5p. Several common microRNA signatures indicating favorable outcome in previous studies were up-regulated in both high let-7a-2-3p expressers and low miR-188-5p expressers, including miR-135a, miR-335 and miR-125b and all members of miR-181 family. Additionally, common up-regulated genes included FOSB, IGJ, SNORD50A and ZNF502, and FOSB was a known favorable signature in AML. These common signatures further confirmed the underlying common mechanisms for these two microRNAs value as favorable prognostic biomarkers. We concluded that high let-7a-2-3p and low miR-188-5p expression could be potentially used as favorably prognostic biomarkers independently or in a combined way in CN-AML patients, whether they received HSCT or not.
RIDES: Robust Intrusion Detection System for IP-Based Ubiquitous Sensor Networks
Amin, Syed Obaid; Siddiqui, Muhammad Shoaib; Hong, Choong Seon; Lee, Sungwon
2009-01-01
The IP-based Ubiquitous Sensor Network (IP-USN) is an effort to build the “Internet of things”. By utilizing IP for low power networks, we can benefit from existing well established tools and technologies of IP networks. Along with many other unresolved issues, securing IP-USN is of great concern for researchers so that future market satisfaction and demands can be met. Without proper security measures, both reactive and proactive, it is hard to envisage an IP-USN realm. In this paper we present a design of an IDS (Intrusion Detection System) called RIDES (Robust Intrusion DEtection System) for IP-USN. RIDES is a hybrid intrusion detection system, which incorporates both Signature and Anomaly based intrusion detection components. For signature based intrusion detection this paper only discusses the implementation of distributed pattern matching algorithm with the help of signature-code, a dynamically created attack-signature identifier. Other aspects, such as creation of rules are not discussed. On the other hand, for anomaly based detection we propose a scoring classifier based on the SPC (Statistical Process Control) technique called CUSUM charts. We also investigate the settings and their effects on the performance of related parameters for both of the components. PMID:22412321
RIDES: Robust Intrusion Detection System for IP-Based Ubiquitous Sensor Networks.
Amin, Syed Obaid; Siddiqui, Muhammad Shoaib; Hong, Choong Seon; Lee, Sungwon
2009-01-01
The IP-based Ubiquitous Sensor Network (IP-USN) is an effort to build the "Internet of things". By utilizing IP for low power networks, we can benefit from existing well established tools and technologies of IP networks. Along with many other unresolved issues, securing IP-USN is of great concern for researchers so that future market satisfaction and demands can be met. Without proper security measures, both reactive and proactive, it is hard to envisage an IP-USN realm. In this paper we present a design of an IDS (Intrusion Detection System) called RIDES (Robust Intrusion DEtection System) for IP-USN. RIDES is a hybrid intrusion detection system, which incorporates both Signature and Anomaly based intrusion detection components. For signature based intrusion detection this paper only discusses the implementation of distributed pattern matching algorithm with the help of signature-code, a dynamically created attack-signature identifier. Other aspects, such as creation of rules are not discussed. On the other hand, for anomaly based detection we propose a scoring classifier based on the SPC (Statistical Process Control) technique called CUSUM charts. We also investigate the settings and their effects on the performance of related parameters for both of the components.
Fluorescent polymer sensor array for detection and discrimination of explosives in water.
Woodka, Marc D; Schnee, Vincent P; Polcha, Michael P
2010-12-01
A fluorescent polymer sensor array (FPSA) was made from commercially available fluorescent polymers coated onto glass beads and was tested to assess the ability of the array to discriminate between different analytes in aqueous solution. The array was challenged with exposures to 17 different analytes, including the explosives trinitrotoluene (TNT), tetryl, and RDX, various explosive-related compounds (ERCs), and nonexplosive electron-withdrawing compounds (EWCs). The array exhibited a natural selectivity toward EWCs, while the non-electron-withdrawing explosive 1,3,5-trinitroperhydro-1,3,5-triazine (RDX) produced no response. Response signatures were visualized by principal component analysis (PCA), and classified by linear discriminant analysis (LDA). RDX produced the same response signature as the sampled blanks and was classified accordingly. The array exhibited excellent discrimination toward all other compounds, with the exception of the isomers of nitrotoluene and aminodinitrotoluene. Of particular note was the ability of the array to discriminate between the three isomers of dinitrobenzene. The natural selectivity of the FPSA toward EWCs, plus the ability of the FPSA to discriminate between different EWCs, could be used to design a sensor with a low false alarm rate and an excellent ability to discriminate between explosives and explosive-related compounds.
Fast and effective characterization of 3D region of interest in medical image data
NASA Astrophysics Data System (ADS)
Kontos, Despina; Megalooikonomou, Vasileios
2004-05-01
We propose a framework for detecting, characterizing and classifying spatial Regions of Interest (ROIs) in medical images, such as tumors and lesions in MRI or activation regions in fMRI. A necessary step prior to classification is efficient extraction of discriminative features. For this purpose, we apply a characterization technique especially designed for spatial ROIs. The main idea of this technique is to extract a k-dimensional feature vector using concentric spheres in 3D (or circles in 2D) radiating out of the ROI's center of mass. These vectors form characterization signatures that can be used to represent the initial ROIs. We focus on classifying fMRI ROIs obtained from a study that explores neuroanatomical correlates of semantic processing in Alzheimer's disease (AD). We detect a ROI highly associated with AD and apply the feature extraction technique with different experimental settings. We seek to distinguish control from patient samples. We study how classification can be performed using the extracted signatures as well as how different experimental parameters affect classification accuracy. The obtained classification accuracy ranged from 82% to 87% (based on the selected ROI) suggesting that the proposed classification framework can be potentially useful in supporting medical decision-making.
Yitzhak, Neta; Giladi, Nir; Gurevich, Tanya; Messinger, Daniel S; Prince, Emily B; Martin, Katherine; Aviezer, Hillel
2017-12-01
According to dominant theories of affect, humans innately and universally express a set of emotions using specific configurations of prototypical facial activity. Accordingly, thousands of studies have tested emotion recognition using sets of highly intense and stereotypical facial expressions, yet their incidence in real life is virtually unknown. In fact, a commonplace experience is that emotions are expressed in subtle and nonprototypical forms. Such facial expressions are at the focus of the current study. In Experiment 1, we present the development and validation of a novel stimulus set consisting of dynamic and subtle emotional facial displays conveyed without constraining expressers to using prototypical configurations. Although these subtle expressions were more challenging to recognize than prototypical dynamic expressions, they were still well recognized by human raters, and perhaps most importantly, they were rated as more ecological and naturalistic than the prototypical expressions. In Experiment 2, we examined the characteristics of subtle versus prototypical expressions by subjecting them to a software classifier, which used prototypical basic emotion criteria. Although the software was highly successful at classifying prototypical expressions, it performed very poorly at classifying the subtle expressions. Further validation was obtained from human expert face coders: Subtle stimuli did not contain many of the key facial movements present in prototypical expressions. Together, these findings suggest that emotions may be successfully conveyed to human viewers using subtle nonprototypical expressions. Although classic prototypical facial expressions are well recognized, they appear less naturalistic and may not capture the richness of everyday emotional communication. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
NASA Astrophysics Data System (ADS)
Praskievicz, S. J.; Luo, C.
2017-12-01
Classification of rivers is useful for a variety of purposes, such as generating and testing hypotheses about watershed controls on hydrology, predicting hydrologic variables for ungaged rivers, and setting goals for river management. In this research, we present a bottom-up (based on machine learning) river classification designed to investigate the underlying physical processes governing rivers' hydrologic regimes. The classification was developed for the entire state of Alabama, based on 248 United States Geological Survey (USGS) stream gages that met criteria for length and completeness of records. Five dimensionless hydrologic signatures were derived for each gage: slope of the flow duration curve (indicator of flow variability), baseflow index (ratio of baseflow to average streamflow), rising limb density (number of rising limbs per unit time), runoff ratio (ratio of long-term average streamflow to long-term average precipitation), and streamflow elasticity (sensitivity of streamflow to precipitation). We used a Bayesian clustering algorithm to classify the gages, based on the five hydrologic signatures, into distinct hydrologic regimes. We then used classification and regression trees (CART) to predict each gaged river's membership in different hydrologic regimes based on climatic and watershed variables. Using existing geospatial data, we applied the CART analysis to classify ungaged streams in Alabama, with the National Hydrography Dataset Plus (NHDPlus) catchment (average area 3 km2) as the unit of classification. The results of the classification can be used for meeting management and conservation objectives in Alabama, such as developing statewide standards for environmental instream flows. Such hydrologic classification approaches are promising for contributing to process-based understanding of river systems.
Gordon, Bradley S; Steiner, Jennifer L; Rossetti, Michael L; Qiao, Shuxi; Ellisen, Leif W; Govindarajan, Subramaniam S; Eroshkin, Alexey M; Williamson, David L; Coen, Paul M
2017-12-01
The metabolic stress placed on skeletal muscle by aerobic exercise promotes acute and long-term health benefits in part through changes in gene expression. However, the transducers that mediate altered gene expression signatures have not been completely elucidated. Regulated in development and DNA damage 1 (REDD1) is a stress-induced protein whose expression is transiently increased in skeletal muscle following acute aerobic exercise. However, the role of this induction remains unclear. Because REDD1 altered gene expression in other model systems, we sought to determine whether REDD1 induction following acute exercise altered the gene expression signature in muscle. To do this, wild-type and REDD1-null mice were randomized to remain sedentary or undergo a bout of acute treadmill exercise. Exercised mice recovered for 1, 3, or 6 h before euthanization. Acute exercise induced a transient increase in REDD1 protein expression within the plantaris only at 1 h postexercise, and the induction occurred in both cytosolic and nuclear fractions. At this time point, global changes in gene expression were surveyed using microarray. REDD1 induction was required for the exercise-induced change in expression of 24 genes. Validation by RT-PCR confirmed that the exercise-mediated changes in genes related to exercise capacity, muscle protein metabolism, neuromuscular junction remodeling, and Metformin action were negated in REDD1-null mice. Finally, the exercise-mediated induction of REDD1 was partially dependent upon glucocorticoid receptor activation. In all, these data show that REDD1 induction regulates the exercise-mediated change in a distinct set of genes within skeletal muscle. Copyright © 2017 the American Physiological Society.
Silkoff, Philip E; Laviolette, Michel; Singh, Dave; FitzGerald, J Mark; Kelsen, Steven; Backer, Vibeke; Porsbjerg, Celeste M; Girodet, Pierre-Olivier; Berger, Patrick; Kline, Joel N; Chupp, Geoffrey; Susulic, Vedrana S; Barnathan, Elliot S; Baribaud, Frédéric; Loza, Matthew J
2017-09-01
The Airways Disease Endotyping for Personalized Therapeutics (ADEPT) study profiled patients with mild, moderate, and severe asthma and nonatopic healthy control subjects. We explored this data set to define type 2 inflammation based on airway mucosal IL-13-driven gene expression and how this related to clinically accessible biomarkers. IL-13-driven gene expression was evaluated in several human cell lines. We then defined type 2 status in 25 healthy subjects, 28 patients with mild asthma, 29 patients with moderate asthma, and 26 patients with severe asthma based on airway mucosal expression of (1) CCL26 (the most differentially expressed gene), (2) periostin, or (3) a multigene IL-13 in vitro signature (IVS). Clinically accessible biomarkers included fraction of exhaled nitric oxide (Feno) values, blood eosinophil (bEOS) counts, serum CCL26 expression, and serum CCL17 expression. Expression of airway mucosal CCL26, periostin, and IL-13-IVS all facilitated segregation of subjects into type 2-high and type 2-low asthmatic groups, but in the ADEPT study population CCL26 expression was optimal. All subjects with high airway mucosal CCL26 expression and moderate-to-severe asthma had Feno values (≥35 ppb) and/or high bEOS counts (≥300 cells/mm 3 ) compared with a minority (36%) of subjects with low airway mucosal CCL26 expression. A combination of Feno values, bEOS counts, and serum CCL17 and CCL26 expression had 100% positive predictive value and 87% negative predictive value for airway mucosal CCL26-high status. Clinical variables did not differ between subjects with type 2-high and type 2-low status. Eosinophilic inflammation was associated with but not limited to airway mucosal type 2 gene expression. A panel of clinical biomarkers accurately classified type 2 status based on airway mucosal CCL26, periostin, or IL-13-IVS gene expression. Use of Feno values, bEOS counts, and serum marker levels (eg, CCL26 and CCL17) in combination might allow patient selection for novel type 2 therapeutics. Copyright © 2017 American Academy of Allergy, Asthma & Immunology. All rights reserved.
Ants regulate colony spatial organization using multiple chemical road-signs
Heyman, Yael; Shental, Noam; Brandis, Alexander; Hefetz, Abraham; Feinerman, Ofer
2017-01-01
Communication provides the basis for social life. In ant colonies, the prevalence of local, often chemically mediated, interactions introduces strong links between communication networks and the spatial distribution of ants. It is, however, unknown how ants identify and maintain nest chambers with distinct functions. Here, we combine individual tracking, chemical analysis and machine learning to decipher the chemical signatures present on multiple nest surfaces. We present evidence for several distinct chemical ‘road-signs' that guide the ants' movements within the dark nest. These chemical signatures can be used to classify nest chambers with different functional roles. Using behavioural manipulations, we demonstrate that at least three of these chemical signatures are functionally meaningful and allow ants from different task groups to identify their specific nest destinations, thus facilitating colony coordination and stabilization. The use of multiple chemicals that assist spatiotemporal guidance, segregation and pattern formation is abundant in multi-cellular organisms. Here, we provide a rare example for the use of these principles in the ant colony. PMID:28569746
NASA Technical Reports Server (NTRS)
Hlavka, C. A. (Principal Investigator); Carlyle, S. M.; Haralick, R. M.; Yokoyama, R.
1978-01-01
The author has identified the following significant results. The phenological method of crop identification involves the creation of crop signatures which characterize multispectral observations as phenological growth states. The phenological signature models spectral reflectance explicitly as a function of crop maturity rather than as a function of date. A correspondence of time to growth state is established which minimizes the smallest difference between the given multispectral multitemporal vector and a category mean vector. The application of the method to the identification of winter wheat and corn shows (1) the method is capable of discriminating crop type with about the same degree of accuracy as more traditional classifiers; (2) the use of LANDSAT observations on two or more dates yields better results than the use of a single observation; and (3) some potential is demonstrated for labeling the degree of maturity of the crop, as well as the crop type.
Ants regulate colony spatial organization using multiple chemical road-signs.
Heyman, Yael; Shental, Noam; Brandis, Alexander; Hefetz, Abraham; Feinerman, Ofer
2017-06-01
Communication provides the basis for social life. In ant colonies, the prevalence of local, often chemically mediated, interactions introduces strong links between communication networks and the spatial distribution of ants. It is, however, unknown how ants identify and maintain nest chambers with distinct functions. Here, we combine individual tracking, chemical analysis and machine learning to decipher the chemical signatures present on multiple nest surfaces. We present evidence for several distinct chemical 'road-signs' that guide the ants' movements within the dark nest. These chemical signatures can be used to classify nest chambers with different functional roles. Using behavioural manipulations, we demonstrate that at least three of these chemical signatures are functionally meaningful and allow ants from different task groups to identify their specific nest destinations, thus facilitating colony coordination and stabilization. The use of multiple chemicals that assist spatiotemporal guidance, segregation and pattern formation is abundant in multi-cellular organisms. Here, we provide a rare example for the use of these principles in the ant colony.
NASA Astrophysics Data System (ADS)
Bauer, K.; Pratt, R. G.; Haberland, C.; Weber, M.
2008-10-01
Crosshole seismic experiments were conducted to study the in-situ properties of gas hydrate bearing sediments (GHBS) in the Mackenzie Delta (NW Canada). Seismic tomography provided images of P velocity, anisotropy, and attenuation. Self-organizing maps (SOM) are powerful neural network techniques to classify and interpret multi-attribute data sets. The coincident tomographic images are translated to a set of data vectors in order to train a Kohonen layer. The total gradient of the model vectors is determined for the trained SOM and a watershed segmentation algorithm is used to visualize and map the lithological clusters with well-defined seismic signatures. Application to the Mallik data reveals four major litho-types: (1) GHBS, (2) sands, (3) shale/coal interlayering, and (4) silt. The signature of seismic P wave characteristics distinguished for the GHBS (high velocities, strong anisotropy and attenuation) is new and can be used for new exploration strategies to map and quantify gas hydrates.
Huang, H; Coatrieux, G; Shu, H Z; Luo, L M; Roux, Ch
2011-01-01
In this paper we present a medical image integrity verification system that not only allows detecting and approximating malevolent local image alterations (e.g. removal or addition of findings) but is also capable to identify the nature of global image processing applied to the image (e.g. lossy compression, filtering …). For that purpose, we propose an image signature derived from the geometric moments of pixel blocks. Such a signature is computed over regions of interest of the image and then watermarked in regions of non interest. Image integrity analysis is conducted by comparing embedded and recomputed signatures. If any, local modifications are approximated through the determination of the parameters of the nearest generalized 2D Gaussian. Image moments are taken as image features and serve as inputs to one classifier we learned to discriminate the type of global image processing. Experimental results with both local and global modifications illustrate the overall performances of our approach.
Epigenetic signatures of invasive status in populations of marine invertebrates
NASA Astrophysics Data System (ADS)
Ardura, Alba; Zaiko, Anastasija; Morán, Paloma; Planes, Serge; Garcia-Vazquez, Eva
2017-02-01
Epigenetics, as a DNA signature that affects gene expression and enables rapid reaction of an organism to environmental changes, is likely involved in the process of biological invasions. DNA methylation is an epigenetic mechanism common to plants and animals for regulating gene expression. In this study we show, for the first time in any marine species, significant reduction of global methylation levels during the expansive phase of a pygmy mussel (Xenostrobus securis) recent invasion in Europe (two-year old), while in older introductions such epigenetic signature of invasion was progressively reduced. Decreased methylation was interpreted as a rapid way of increasing phenotypic plasticity that would help invasive populations to thrive. This epigenetic signature of early invasion was stronger than the expected environmental signature of environmental stress in younger populations sampled from ports, otherwise detected in a much older population (>90 year old) of the also invasive tubeworm Ficopomatus enigmaticus established in similar locations. Higher epigenetic than genetic diversity found in X. securis was confirmed from F. enigmaticus samples. As reported for introduced plants and vertebrates, epigenetic variation could compensate for relatively lower genetic variation caused by founder effects. These phenomena were compared with epigenetic mechanisms involved in metastasis, as parallel processes of community (biological invasion) and organism (cancer) invasions.
Amambua-Ngwa, Alfred; Tetteh, Kevin K A; Manske, Magnus; Gomez-Escobar, Natalia; Stewart, Lindsay B; Deerhake, M Elizabeth; Cheeseman, Ian H; Newbold, Christopher I; Holder, Anthony A; Knuepfer, Ellen; Janha, Omar; Jallow, Muminatou; Campino, Susana; Macinnis, Bronwyn; Kwiatkowski, Dominic P; Conway, David J
2012-01-01
Acquired immunity in vertebrates maintains polymorphisms in endemic pathogens, leading to identifiable signatures of balancing selection. To comprehensively survey for genes under such selection in the human malaria parasite Plasmodium falciparum, we generated paired-end short-read sequences of parasites in clinical isolates from an endemic Gambian population, which were mapped to the 3D7 strain reference genome to yield high-quality genome-wide coding sequence data for 65 isolates. A minority of genes did not map reliably, including the hypervariable var, rifin, and stevor families, but 5,056 genes (90.9% of all in the genome) had >70% sequence coverage with minimum read depth of 5 for at least 50 isolates, of which 2,853 genes contained 3 or more single nucleotide polymorphisms (SNPs) for analysis of polymorphic site frequency spectra. Against an overall background of negatively skewed frequencies, as expected from historical population expansion combined with purifying selection, the outlying minority of genes with signatures indicating exceptionally intermediate frequencies were identified. Comparing genes with different stage-specificity, such signatures were most common in those with peak expression at the merozoite stage that invades erythrocytes. Members of clag, PfMC-2TM, surfin, and msp3-like gene families were highly represented, the strongest signature being in the msp3-like gene PF10_0355. Analysis of msp3-like transcripts in 45 clinical and 11 laboratory adapted isolates grown to merozoite-containing schizont stages revealed surprisingly low expression of PF10_0355. In diverse clonal parasite lines the protein product was expressed in a minority of mature schizonts (<1% in most lines and ∼10% in clone HB3), and eight sub-clones of HB3 cultured separately had an intermediate spectrum of positive frequencies (0.9 to 7.5%), indicating phase variable expression of this polymorphic antigen. This and other identified targets of balancing selection are now prioritized for functional study.
USDA-ARS?s Scientific Manuscript database
A potentially useful approach for drug discovery is to connect gene expression profiles of disease-affected tissues ("disease signatures") to drug signatures, but it remains to be shown whether it can be used to identify clinically relevant treatment options. We analyzed coexpression networks and ge...
Taube, Joseph H; Herschkowitz, Jason I; Komurov, Kakajan; Zhou, Alicia Y; Gupta, Supriya; Yang, Jing; Hartwell, Kimberly; Onder, Tamer T; Gupta, Piyush B; Evans, Kurt W; Hollier, Brett G; Ram, Prahlad T; Lander, Eric S; Rosen, Jeffrey M; Weinberg, Robert A; Mani, Sendurai A
2010-08-31
The epithelial-to-mesenchymal transition (EMT) produces cancer cells that are invasive, migratory, and exhibit stem cell characteristics, hallmarks of cells that have the potential to generate metastases. Inducers of the EMT include several transcription factors (TFs), such as Goosecoid, Snail, and Twist, as well as the secreted TGF-beta1. Each of these factors is capable, on its own, of inducing an EMT in the human mammary epithelial (HMLE) cell line. However, the interactions between these regulators are poorly understood. Overexpression of each of the above EMT inducers up-regulates a subset of other EMT-inducing TFs, with Twist, Zeb1, Zeb2, TGF-beta1, and FOXC2 being commonly induced. Up-regulation of Slug and FOXC2 by either Snail or Twist does not depend on TGF-beta1 signaling. Gene expression signatures (GESs) derived by overexpressing EMT-inducing TFs reveal that the Twist GES and Snail GES are the most similar, although the Goosecoid GES is the least similar to the others. An EMT core signature was derived from the changes in gene expression shared by up-regulation of Gsc, Snail, Twist, and TGF-beta1 and by down-regulation of E-cadherin, loss of which can also trigger an EMT in certain cell types. The EMT core signature associates closely with the claudin-low and metaplastic breast cancer subtypes and correlates negatively with pathological complete response. Additionally, the expression level of FOXC1, another EMT inducer, correlates strongly with poor survival of breast cancer patients.
Taube, Joseph H.; Herschkowitz, Jason I.; Komurov, Kakajan; Zhou, Alicia Y.; Gupta, Supriya; Yang, Jing; Hartwell, Kimberly; Onder, Tamer T.; Gupta, Piyush B.; Evans, Kurt W.; Hollier, Brett G.; Ram, Prahlad T.; Lander, Eric S.; Rosen, Jeffrey M.; Weinberg, Robert A.; Mani, Sendurai A.
2010-01-01
The epithelial-to-mesenchymal transition (EMT) produces cancer cells that are invasive, migratory, and exhibit stem cell characteristics, hallmarks of cells that have the potential to generate metastases. Inducers of the EMT include several transcription factors (TFs), such as Goosecoid, Snail, and Twist, as well as the secreted TGF-β1. Each of these factors is capable, on its own, of inducing an EMT in the human mammary epithelial (HMLE) cell line. However, the interactions between these regulators are poorly understood. Overexpression of each of the above EMT inducers up-regulates a subset of other EMT-inducing TFs, with Twist, Zeb1, Zeb2, TGF-β1, and FOXC2 being commonly induced. Up-regulation of Slug and FOXC2 by either Snail or Twist does not depend on TGF-β1 signaling. Gene expression signatures (GESs) derived by overexpressing EMT-inducing TFs reveal that the Twist GES and Snail GES are the most similar, although the Goosecoid GES is the least similar to the others. An EMT core signature was derived from the changes in gene expression shared by up-regulation of Gsc, Snail, Twist, and TGF-β1 and by down-regulation of E-cadherin, loss of which can also trigger an EMT in certain cell types. The EMT core signature associates closely with the claudin-low and metaplastic breast cancer subtypes and correlates negatively with pathological complete response. Additionally, the expression level of FOXC1, another EMT inducer, correlates strongly with poor survival of breast cancer patients. PMID:20713713
Molecular and clinical characterization of IDH associated immune signature in lower-grade gliomas.
Qian, Zenghui; Li, Yiming; Fan, Xing; Zhang, Chuanbao; Wang, Yinyan; Jiang, Tao; Liu, Xing
2018-01-01
Background : Mutations in isocitrate dehydrogenase (IDH) affect the development and prognosis of gliomas. We investigated the role of IDH mutations in the regulation of immune phenotype in lower-grade gliomas (LGGs). Method and patients : A total of 1,008 cases with clinical and IDH mutation data from five cohorts were enrolled. Samples with RNA sequencing data from the Chinese Glioma Genome Atlas (CGGA) were used as training set, whereas RNA data from the Cancer Genome Atlas, Repository for Molecular Brain Neoplasia, GSE16011, and CGGA microarray databases were used for validation. R language tools and bioinformatics analysis were used for gene signature construction and biological function annotation. Results : We found that IDH mutations caused down-regulation of local immune response as among 332 immune system-related genes, 196(59.0%) were differentially expressed according to IDH mutation status. Nearly 70% of those differentially expressed genes exhibited prognostic value in LGGs. An immune response-based gene signature was constructed that distinguished cases with high- or low-risk of unfavorable prognosis and remained an independent prognostic factor in multivariate analyses in both training and validation cohorts. Samples from high-risk cases exhibited elevated expression of genes involved in immune response and NF-κB pathway activation. Furthermore, we found a strong correlation between the risk score and T cells, macrophage-related immune response, and expression of several prominent immune checkpoints. Conclusion : Our results indicated that mutant IDH is highly associated with the regulation of the immune microenvironment in LGGs. The observed immune system gene signature, which was sensitive to IDH mutation status, efficiently predicted patient survival.
Colaprico, Antonio; Bontempi, Gianluca; Castiglioni, Isabella
2018-01-01
Like other cancer diseases, prostate cancer (PC) is caused by the accumulation of genetic alterations in the cells that drives malignant growth. These alterations are revealed by gene profiling and copy number alteration (CNA) analysis. Moreover, recent evidence suggests that also microRNAs have an important role in PC development. Despite efforts to profile PC, the alterations (gene, CNA, and miRNA) and biological processes that correlate with disease development and progression remain partially elusive. Many gene signatures proposed as diagnostic or prognostic tools in cancer poorly overlap. The identification of co-expressed genes, that are functionally related, can identify a core network of genes associated with PC with a better reproducibility. By combining different approaches, including the integration of mRNA expression profiles, CNAs, and miRNA expression levels, we identified a gene signature of four genes overlapping with other published gene signatures and able to distinguish, in silico, high Gleason-scored PC from normal human tissue, which was further enriched to 19 genes by gene co-expression analysis. From the analysis of miRNAs possibly regulating this network, we found that hsa-miR-153 was highly connected to the genes in the network. Our results identify a four-gene signature with diagnostic and prognostic value in PC and suggest an interesting gene network that could play a key regulatory role in PC development and progression. Furthermore, hsa-miR-153, controlling this network, could be a potential biomarker for theranostics in high Gleason-scored PC. PMID:29562723
NASA Astrophysics Data System (ADS)
Litjens, G. J. S.; Elliott, R.; Shih, N.; Feldman, M.; Barentsz, J. O.; Hulsbergen-van de Kaa, C. A.; Kovacs, I.; Huisman, H. J.; Madabhushi, A.
2014-03-01
Learning how to separate benign confounders from prostate cancer is important because the imaging characteristics of these confounders are poorly understood. Furthermore, the typical representations of the MRI parameters might not be enough to allow discrimination. The diagnostic uncertainty this causes leads to a lower diagnostic accuracy. In this paper a new cascaded classifier is introduced to separate prostate cancer and benign confounders on MRI in conjunction with specific computer-extracted features to distinguish each of the benign classes (benign prostatic hyperplasia (BPH), inflammation, atrophy or prostatic intra-epithelial neoplasia (PIN). In this study we tried to (1) calculate different mathematical representations of the MRI parameters which more clearly express subtle differences between different classes, (2) learn which of the MRI image features will allow to distinguish specific benign confounders from prostate cancer, and (2) find the combination of computer-extracted MRI features to best discriminate cancer from the confounding classes using a cascaded classifier. One of the most important requirements for identifying MRI signatures for adenocarcinoma, BPH, atrophy, inflammation, and PIN is accurate mapping of the location and spatial extent of the confounder and cancer categories from ex vivo histopathology to MRI. Towards this end we employed an annotated prostatectomy data set of 31 patients, all of whom underwent a multi-parametric 3 Tesla MRI prior to radical prostatectomy. The prostatectomy slides were carefully co-registered to the corresponding MRI slices using an elastic registration technique. We extracted texture features from the T2-weighted imaging, pharmacokinetic features from the dynamic contrast enhanced imaging and diffusion features from the diffusion-weighted imaging for each of the confounder classes and prostate cancer. These features were selected because they form the mainstay of clinical diagnosis. Relevant features for each of the classes were selected using maximum relevance minimum redundancy feature selection, allowing us to perform classifier independent feature selection. The selected features were then incorporated in a cascading classifier, which can focus on easier sub-tasks at each stage, leaving the more difficult classification tasks for later stages. Results show that distinct features are relevant for each of the benign classes, for example the fraction of extra-vascular, extra-cellular space in a voxel is a clear discriminator for inflammation. Furthermore, the cascaded classifier outperforms both multi-class and one-shot classifiers in overall accuracy for discriminating confounders from cancer: 0.76 versus 0.71 and 0.62.
Yang, Xinan Holly; Tang, Fangming; Shin, Jisu; Cunningham, John M
2017-10-03
Previous studies suggested that cancer cells possess traits reminiscent of the biological mechanisms ascribed to normal embryonic stem cells (ESCs) regulated by MYC and Polycomb repressive complex 2 (PRC2). Several poorly differentiated adult tumors showed preferentially high expression levels in targets of MYC, coincident with low expression levels in targets of PRC2. This paper will reveal this ESC-like cancer signature in high-risk neuroblastoma (HR-NB), the most common extracranial solid tumor in children. We systematically assembled genomic variants, gene expression changes, priori knowledge of gene functions, and clinical outcomes to identify prognostic multigene signatures. First, we assigned a new, individualized prognostic index using the relative expressions between the poor- and good-outcome signature genes. We then characterized HR-NB aggressiveness beyond these prognostic multigene signatures through the imbalanced effects of MYC and PRC2 signaling. We further analyzed Retinoic acid (RA)-induced HR-NB cells to model tumor cell differentiation. Finally, we performed in vitro validation on ZFHX3, a cell differentiation marker silenced by PRC2, and compared cell morphology changes before and after blocking PRC2 in HR-NB cells. A significant concurrence existed between exons with verified variants and genes showing MYCN-dependent expression in HR-NB. From these biomarker candidates, we identified two novel prognostic gene-set pairs with multi-scale oncogenic defects. Intriguingly, MYC targets over-represented an unfavorable component of the identified prognostic signatures while PRC2 targets over-represented a favorable component. The cell cycle arrest and neuronal differentiation marker ZFHX3 was identified as one of PRC2-silenced tumor suppressor candidates. Blocking PRC2 reduced tumor cell growth and increased the mRNA expression levels of ZFHX3 in an early treatment stage. This hypothesis-driven systems bioinformatics work offered novel insights into the PRC2-mediated tumor cell growth and differentiation in neuroblastoma, which may exert oncogenic effects together with MYC regulation. Our results propose a prognostic effect of imbalanced MYC and PRC2 moderations in pediatric HR-NB for the first time. This study demonstrates an incorporation of genomic landscapes and transcriptomic profiles into the hypothesis-driven precision prognosis and biomarker discovery. The application of this approach to neuroblastoma, as well as other cancer more broadly, could contribute to reduced relapse and mortality rates in the long term.
Zhang, Shu-Dong; Gant, Timothy W
2009-07-31
Connectivity mapping is a process to recognize novel pharmacological and toxicological properties in small molecules by comparing their gene expression signatures with others in a database. A simple and robust method for connectivity mapping with increased specificity and sensitivity was recently developed, and its utility demonstrated using experimentally derived gene signatures. This paper introduces sscMap (statistically significant connections' map), a Java application designed to undertake connectivity mapping tasks using the recently published method. The software is bundled with a default collection of reference gene-expression profiles based on the publicly available dataset from the Broad Institute Connectivity Map 02, which includes data from over 7000 Affymetrix microarrays, for over 1000 small-molecule compounds, and 6100 treatment instances in 5 human cell lines. In addition, the application allows users to add their custom collections of reference profiles and is applicable to a wide range of other 'omics technologies. The utility of sscMap is two fold. First, it serves to make statistically significant connections between a user-supplied gene signature and the 6100 core reference profiles based on the Broad Institute expanded dataset. Second, it allows users to apply the same improved method to custom-built reference profiles which can be added to the database for future referencing. The software can be freely downloaded from http://purl.oclc.org/NET/sscMap.
Mills, Kevin J; Robinson, Michael K; Sherrill, Joseph D; Schnell, Daniel J; Xu, Jun
2018-05-28
Triggers of skin disease pathogenesis vary, but events associated with the elicitation of a lesion share many features in common. Our objective was to examine gene expression patterns in skin disease to develop a molecular signature of disruption of cutaneous homeostasis. Gene expression data from common inflammatory skin diseases (e.g., psoriasis, atopic dermatitis, seborrheic dermatitis and acne), and a novel statistical algorithm were used to define a unifying molecular signature referred to as the "Unhealthy Skin Signature" (USS). Using a pattern matching algorithm, analysis of public data repositories revealed that the USS is found in diverse epithelial diseases. Studies of milder disruptions of epidermal homeostasis have also shown that these conditions converge, to varying degrees, on the USS and that the degree of convergence is related directly to the severity of homeostatic disruption. The USS contains genes that had no prior published association with skin, but that play important roles in many different disease processes, supporting the importance of the USS to homeostasis. Finally, we show through pattern matching that the USS can be used to discover new potential dermatologic therapeutics. The USS provides a new means to further interrogate epithelial homeostasis and potentially develop novel therapeutics with efficacy across a spectrum of skin conditions. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Anomaly Detection in Host Signaling Pathways for the Early Prognosis of Acute Infection.
Wang, Kun; Langevin, Stanley; O'Hern, Corey S; Shattuck, Mark D; Ogle, Serenity; Forero, Adriana; Morrison, Juliet; Slayden, Richard; Katze, Michael G; Kirby, Michael
2016-01-01
Clinical diagnosis of acute infectious diseases during the early stages of infection is critical to administering the appropriate treatment to improve the disease outcome. We present a data driven analysis of the human cellular response to respiratory viruses including influenza, respiratory syncytia virus, and human rhinovirus, and compared this with the response to the bacterial endotoxin, Lipopolysaccharides (LPS). Using an anomaly detection framework we identified pathways that clearly distinguish between asymptomatic and symptomatic patients infected with the four different respiratory viruses and that accurately diagnosed patients exposed to a bacterial infection. Connectivity pathway analysis comparing the viral and bacterial diagnostic signatures identified host cellular pathways that were unique to patients exposed to LPS endotoxin indicating this type of analysis could be used to identify host biomarkers that can differentiate clinical etiologies of acute infection. We applied the Multivariate State Estimation Technique (MSET) on two human influenza (H1N1 and H3N2) gene expression data sets to define host networks perturbed in the asymptomatic phase of infection. Our analysis identified pathways in the respiratory virus diagnostic signature as prognostic biomarkers that triggered prior to clinical presentation of acute symptoms. These early warning pathways correctly predicted that almost half of the subjects would become symptomatic in less than forty hours post-infection and that three of the 18 subjects would become symptomatic after only 8 hours. These results provide a proof-of-concept for utility of anomaly detection algorithms to classify host pathway signatures that can identify presymptomatic signatures of acute diseases and differentiate between etiologies of infection. On a global scale, acute respiratory infections cause a significant proportion of human co-morbidities and account for 4.25 million deaths annually. The development of clinical diagnostic tools to distinguish between acute viral and bacterial respiratory infections is critical to improve patient care and limit the overuse of antibiotics in the medical community. The identification of prognostic respiratory virus biomarkers provides an early warning system that is capable of predicting which subjects will become symptomatic to expand our medical diagnostic capabilities and treatment options for acute infectious diseases. The host response to acute infection may be viewed as a deterministic signaling network responsible for maintaining the health of the host organism. We identify pathway signatures that reflect the very earliest perturbations in the host response to acute infection. These pathways provide a monitor the health state of the host using anomaly detection to quantify and predict health outcomes to pathogens.
Anomaly Detection in Host Signaling Pathways for the Early Prognosis of Acute Infection
O’Hern, Corey S.; Shattuck, Mark D.; Ogle, Serenity; Forero, Adriana; Morrison, Juliet; Slayden, Richard; Katze, Michael G.
2016-01-01
Clinical diagnosis of acute infectious diseases during the early stages of infection is critical to administering the appropriate treatment to improve the disease outcome. We present a data driven analysis of the human cellular response to respiratory viruses including influenza, respiratory syncytia virus, and human rhinovirus, and compared this with the response to the bacterial endotoxin, Lipopolysaccharides (LPS). Using an anomaly detection framework we identified pathways that clearly distinguish between asymptomatic and symptomatic patients infected with the four different respiratory viruses and that accurately diagnosed patients exposed to a bacterial infection. Connectivity pathway analysis comparing the viral and bacterial diagnostic signatures identified host cellular pathways that were unique to patients exposed to LPS endotoxin indicating this type of analysis could be used to identify host biomarkers that can differentiate clinical etiologies of acute infection. We applied the Multivariate State Estimation Technique (MSET) on two human influenza (H1N1 and H3N2) gene expression data sets to define host networks perturbed in the asymptomatic phase of infection. Our analysis identified pathways in the respiratory virus diagnostic signature as prognostic biomarkers that triggered prior to clinical presentation of acute symptoms. These early warning pathways correctly predicted that almost half of the subjects would become symptomatic in less than forty hours post-infection and that three of the 18 subjects would become symptomatic after only 8 hours. These results provide a proof-of-concept for utility of anomaly detection algorithms to classify host pathway signatures that can identify presymptomatic signatures of acute diseases and differentiate between etiologies of infection. On a global scale, acute respiratory infections cause a significant proportion of human co-morbidities and account for 4.25 million deaths annually. The development of clinical diagnostic tools to distinguish between acute viral and bacterial respiratory infections is critical to improve patient care and limit the overuse of antibiotics in the medical community. The identification of prognostic respiratory virus biomarkers provides an early warning system that is capable of predicting which subjects will become symptomatic to expand our medical diagnostic capabilities and treatment options for acute infectious diseases. The host response to acute infection may be viewed as a deterministic signaling network responsible for maintaining the health of the host organism. We identify pathway signatures that reflect the very earliest perturbations in the host response to acute infection. These pathways provide a monitor the health state of the host using anomaly detection to quantify and predict health outcomes to pathogens. PMID:27532264
Using a Stem Cell-Based Signature to Guide Therapeutic Selection in Cancer
Shats, Igor; Gatza, Michael L.; Chang, Jeffrey T.; Mori, Seiichi; Wang, Jialiang; Rich, Jeremy; Nevins, Joseph R.
2010-01-01
Given the very substantial heterogeneity of most human cancers, it is likely that most cancer therapeutics will be active in only a small fraction of any population of patients. As such, the development of new therapeutics, coupled with methods to match a therapy with the individual patient, will be critical to achieving significant gains in disease outcome. One such opportunity is the use of expression signatures to identify key oncogenic phenotypes that can serve not only as biomarkers but also as a means of identifying therapeutic compounds that might specifically target these phenotypes. Given the potential importance of targeting tumors exhibiting a stem-like phenotype, we have developed an expression signature that reflects common biological aspects of various stem-like characteristics. The Consensus Stemness Ranking (CSR) signature is upregulated in cancer stem cell enriched samples, at advanced tumor stages and is associated with poor prognosis in multiple cancer types. Using two independent computational approaches we utilized the CSR signature to identify clinically useful compounds that could target the CSR phenotype. In vitro assays confirmed selectivity of several predicted compounds including topoisomerase inhibitors and resveratrol towards breast cancer cell lines that exhibit a high-CSR phenotype. Importantly, the CSR signature could predict clinical response of breast cancer patients to a neoadjuvant regimen that included a CSR-specific agent. Collectively, these results suggest therapeutic opportunities to target the CSR phenotype in a relevant cohort of cancer patients. PMID:21169407
Distinct Host Tropism Protein Signatures to Identify Possible Zoonotic Influenza A Viruses.
Eng, Christine L P; Tong, Joo Chuan; Tan, Tin Wee
2016-01-01
Zoonotic influenza A viruses constantly pose a health threat to humans as novel strains occasionally emerge from the avian population to cause human infections. Many past epidemic as well as pandemic strains have originated from avian species. While most viruses are restricted to their primary hosts, zoonotic strains can sometimes arise from mutations or reassortment, leading them to acquire the capability to escape host species barrier and successfully infect a new host. Phylogenetic analyses and genetic markers are useful in tracing the origins of zoonotic infections, but there are still no effective means to identify high risk strains prior to an outbreak. Here we show that distinct host tropism protein signatures can be used to identify possible zoonotic strains in avian species which have the potential to cause human infections. We have discovered that influenza A viruses can now be classified into avian, human, or zoonotic strains based on their host tropism protein signatures. Analysis of all influenza A viruses with complete proteome using the host tropism prediction system, based on machine learning classifications of avian and human viral proteins has uncovered distinct signatures of zoonotic strains as mosaics of avian and human viral proteins. This is in contrast with typical avian or human strains where they show mostly avian or human viral proteins in their signatures respectively. Moreover, we have found that zoonotic strains from the same influenza outbreaks carry similar host tropism protein signatures characteristic of a common ancestry. Our results demonstrate that the distinct host tropism protein signature in zoonotic strains may prove useful in influenza surveillance to rapidly identify potential high risk strains circulating in avian species, which may grant us the foresight in anticipating an impending influenza outbreak.
The E2F3—Oncomir 1 axis is activated in Wilms Tumor
Kort, Eric J.; Farber, Leslie; Tretiakova, Maria; Petillo, David; Furge, Kyle A.; Yang, Ximing J.; Cornelius, Albert; Teh, Bin T.
2008-01-01
Oncomir-1 is an oncogenic cluster of microRNAs located on chromosome 13. Previous in vitro studies demonstrated that it is transcriptionally regulated by the transcription factor E2F3. In this report we combine expression profiling of both messenger RNA (mRNA) and micro RNAs (miRNA) in Wilms tumor (WT) samples to provide the first evidence that the E2F3—Oncomir 1 axis, previously identified in cell culture, is deregulated in primary human tumors. Analysis of RNA expression signatures demonstrated that an E2F3 gene signature was activated in all Wilms tumor samples analyzed, in contrast to other kidney tumors. This finding was validated by immunohistochemistry (IHC) on the protein level. Expression of E2F3 was lowest in early stage tumors, and highest in metastatic tissue. Expression profiling of miRNAs in WT showed that expression of each measured member of the Oncomir-1 family was highest in WT relative to other kidney tumor subtypes. Quantitative polymerase chain reaction (PCR) confirmed that these microRNAs were overexpressed in Wilms tumor relative to normal kidney tissue. These results suggest that the E2F3—Oncomir-1 axis is activated in Wilms tumor. Our study also demonstrates the utility of integrated genomics combining gene signature analysis with miRNA expression profiling to identify protein-miRNA interactions that are perturbed in disease states. PMID:18519660
Laser vibrometry exploitation for vehicle identification
NASA Astrophysics Data System (ADS)
Nolan, Adam; Lingg, Andrew; Goley, Steve; Sigmund, Kevin; Kangas, Scott
2014-06-01
Vibration signatures sensed from distant vehicles using laser vibrometry systems provide valuable information that may be used to help identify key vehicle features such as engine type, engine speed, and number of cylinders. Through the use of physics models of the vibration phenomenology, features are chosen to support classification algorithms. Various individual exploitation algorithms were developed using these models to classify vibration signatures into engine type (piston vs. turbine), engine configuration (Inline 4 vs. Inline 6 vs. V6 vs. V8 vs. V12) and vehicle type. The results of these algorithms will be presented for an 8 class problem. Finally, the benefits of using a factor graph representation to link these independent algorithms together will be presented which constructs a classification hierarchy for the vibration exploitation problem.
Admittance Survey of Type 1 Coronae on Venus: Implications for Elastic Thickness
NASA Technical Reports Server (NTRS)
Hoogenboom, T.; Smrekar, S. E.; Anderson, F. S.; Houseman, G.
2003-01-01
Coronae are volcano-tectonic features on Venus which range from 60km to 2600km and are defined by their nearly circular patterns of fractures. Type 1 (regular) coronae are classified as having >50% complete fracture annuli. Previous work has examined the factors controlling the morphology, size, and fracture pattern of coronae, using lithospheric properties, loading signature and geologic characteristics. However, these studies have been limited to Type 2 (topographic) coronae (e.g. coronaes with <50% fracture annuli), and the factors controlling the formation of Type 1 coronae remain poorly understood. In this study, we apply the methodology of to survey the admittance signature for Type 1 coronae to determine the controlling parameters which govern Type 1 coronae formation.
Background rejection in NEXT using deep neural networks
Renner, J.; Farbin, A.; Vidal, J. Muñoz; ...
2017-01-16
Here, we investigate the potential of using deep learning techniques to reject background events in searches for neutrinoless double beta decay with high pressure xenon time projection chambers capable of detailed track reconstruction. The differences in the topological signatures of background and signal events can be learned by deep neural networks via training over many thousands of events. These networks can then be used to classify further events as signal or background, providing an additional background rejection factor at an acceptable loss of efficiency. The networks trained in this study performed better than previous methods developed based on the usemore » of the same topological signatures by a factor of 1.2 to 1.6, and there is potential for further improvement.« less
Molina-Pinelo, Sonia; Suárez, Rocío; Pastor, María Dolores; Nogal, Ana; Márquez-Martín, Eduardo; Martín-Juan, José; Carnero, Amancio; Paz-Ares, Luis
2012-01-01
The identification of new less invasive biomarkers is necessary to improve the detection and prognostic outcome of respiratory pathological processes. The measurement of miRNA expression through less invasive techniques such as plasma and serum have been suggested to analysis of several lung malignancies including lung cancer. These studies are assuming a common deregulated miRNA expression both in blood and lung tissue. The present study aimed to obtain miRNA representative signatures both in plasma and bronchoalveolar cell fraction that could serve as biomarker in respiratory diseases. Ten patients were evaluated to assess the expression levels of 381 miRNAs. We found that around 50% miRNAs were no detected in both plasma and bronchoalveolar cell fraction and only 20% of miRNAs showed similar expression in both samples. These results show a lack of association of miRNA signatures between plasma and bronchoalveolar cytology in the same patient. The profiles are not comparable; however, there is a similarity in the relative expression in a very small subset of miRNAs (miR-17, miR-19b, miR-195 and miR-20b) between both biological samples in all patients. This finding supports that the miRNAs profiles obtained from different biological samples have to be carefully validated to link with respiratory diseases. PMID:22430188
Won, Kyeong-Hye; Song, Ki-Duk; Park, Jong-Eun; Kim, Duk-Kyung; Na, Chong-Sam
2016-01-01
Anethole and garlic have an immune modulatory effects on avian coccidiosis, and these effects are correlated with gene expression changes in intestinal epithelial lymphocytes (IELs). In this study, we integrated gene expression datasets from two independent experiments and investigated gene expression profile changes by anethole and garlic respectively, and identified gene expression signatures, which are common targets of these herbs as they might be used for the evaluation of the effect of plant herbs on immunity toward avian coccidiosis. We identified 4,382 and 371 genes, which were differentially expressed in IELs of chickens supplemented with garlic and anethole respectively. The gene ontology (GO) term of differentially expressed genes (DEGs) from garlic treatment resulted in the biological processes (BPs) related to proteolysis, e.g., “modification-dependent protein catabolic process”, “proteolysis involved in cellular protein catabolic process”, “cellular protein catabolic process”, “protein catabolic process”, and “ubiquitin-dependent protein catabolic process”. In GO analysis, one BP term, “Proteolysis”, was obtained. Among DEGs, 300 genes were differentially regulated in response to both garlic and anethole, and 234 and 59 genes were either up- or down-regulated in supplementation with both herbs. Pathway analysis resulted in enrichment of the pathways related to digestion such as “Starch and sucrose metabolism” and “Insulin signaling pathway”. Taken together, the results obtained in the present study could contribute to the effective development of evaluation system of plant herbs based on molecular signatures related with their immunological functions in chicken IELs. PMID:26954117
Piao, Yongjun; Piao, Minghao; Ryu, Keun Ho
2017-01-01
Cancer classification has been a crucial topic of research in cancer treatment. In the last decade, messenger RNA (mRNA) expression profiles have been widely used to classify different types of cancers. With the discovery of a new class of small non-coding RNAs; known as microRNAs (miRNAs), various studies have shown that the expression patterns of miRNA can also accurately classify human cancers. Therefore, there is a great demand for the development of machine learning approaches to accurately classify various types of cancers using miRNA expression data. In this article, we propose a feature subset-based ensemble method in which each model is learned from a different projection of the original feature space to classify multiple cancers. In our method, the feature relevance and redundancy are considered to generate multiple feature subsets, the base classifiers are learned from each independent miRNA subset, and the average posterior probability is used to combine the base classifiers. To test the performance of our method, we used bead-based and sequence-based miRNA expression datasets and conducted 10-fold and leave-one-out cross validations. The experimental results show that the proposed method yields good results and has higher prediction accuracy than popular ensemble methods. The Java program and source code of the proposed method and the datasets in the experiments are freely available at https://sourceforge.net/projects/mirna-ensemble/. Copyright © 2016 Elsevier Ltd. All rights reserved.
Molecular Biology In Young Women With Breast Cancer: From Tumor Gene Expression To DNA Mutations.
Gómez-Flores-Ramos, Liliana; Castro-Sánchez, Andrea; Peña-Curiel, Omar; Mohar-Betancourt, Alejandro
2017-01-01
Young women with breast cancer (YWBC) represent roughly 15% of breast cancer (BC) cases in Latin America and other developing regions. Breast tumors occurring at an early age are more aggressive and have an overall worse prognosis compared to breast tumors in postmenopausal women. The expression of relevant proliferation biomarkers such as endocrine receptors and human epidermal growth factor receptor 2 appears to be unique in YWBC. Moreover, histopathological, molecular, genetic, and genomic studies have shown that YWBC exhibit a higher frequency of aggressive subtypes, differential tumor gene expression, increased genetic susceptibility, and specific genomic signatures, compared to older women with BC. This article reviews the current knowledge on tumor biology and genomic signatures in YWBC.
Classifying threats with a 14-MeV neutron interrogation system.
Strellis, Dan; Gozani, Tsahi
2005-01-01
SeaPODDS (Sea Portable Drug Detection System) is a non-intrusive tool for detecting concealed threats in hidden compartments of maritime vessels. This system consists of an electronic neutron generator, a gamma-ray detector, a data acquisition computer, and a laptop computer user-interface. Although initially developed to detect narcotics, recent algorithm developments have shown that the system is capable of correctly classifying a threat into one of four distinct categories: narcotic, explosive, chemical weapon, or radiological dispersion device (RDD). Detection of narcotics, explosives, and chemical weapons is based on gamma-ray signatures unique to the chemical elements. Elements are identified by their characteristic prompt gamma-rays induced by fast and thermal neutrons. Detection of RDD is accomplished by detecting gamma-rays emitted by common radioisotopes and nuclear reactor fission products. The algorithm phenomenology for classifying threats into the proper categories is presented here.
Least Squares Neural Network-Based Wireless E-Nose System Using an SnO₂ Sensor Array.
Shahid, Areej; Choi, Jong-Hyeok; Rana, Abu Ul Hassan Sarwar; Kim, Hyun-Seok
2018-05-06
Over the last few decades, the development of the electronic nose (E-nose) for detection and quantification of dangerous and odorless gases, such as methane (CH₄) and carbon monoxide (CO), using an array of SnO₂ gas sensors has attracted considerable attention. This paper addresses sensor cross sensitivity by developing a classifier and estimator using an artificial neural network (ANN) and least squares regression (LSR), respectively. Initially, the ANN was implemented using a feedforward pattern recognition algorithm to learn the collective behavior of an array as the signature of a particular gas. In the second phase, the classified gas was quantified by minimizing the mean square error using LSR. The combined approach produced 98.7% recognition probability, with 95.5 and 94.4% estimated gas concentration accuracies for CH₄ and CO, respectively. The classifier and estimator parameters were deployed in a remote microcontroller for the actualization of a wireless E-nose system.
NASA Technical Reports Server (NTRS)
Kriegler, F. J.; Christenson, D.; Gordon, M.; Kistler, R.; Lampert, S.; Marshall, R.; Mclaughlin, R.
1974-01-01
The Midas System is a third-generation, fast, multispectral recognition system able to keep pace with the large quantity and high rates of data acquisition from present and projected sensors. A principal objective of the MIDAS Program is to provide a system well interfaced with the human operator and thus to obtain large overall reductions in turn-around time and significant gains in throughput. The hardware and software generated in Phase I of the overall program are described. The system contains a mini-computer to control the various high-speed processing elements in the data path and a classifier which implements an all-digital prototype multivariate-Gaussian maximum likelihood decision algorithm operating at 2 x 100,000 pixels/sec. Sufficient hardware was developed to perform signature extraction from computer-compatible tapes, compute classifier coefficients, control the classifier operation, and diagnose operation. The MIDAS construction and wiring diagrams are given.
NASA Technical Reports Server (NTRS)
Kriegler, F. J.; Christenson, D.; Gordon, M.; Kistler, R.; Lampert, S.; Marshall, R.; Mclaughlin, R.
1974-01-01
The MIDAS System is a third-generation, fast, multispectral recognition system able to keep pace with the large quantity and high rates of data acquisition from present and projected sensors. A principal objective of the MIDAS Program is to provide a system well interfaced with the human operator and thus to obtain large overall reductions in turn-around time and significant gains in throughout. The hardware and software generated in Phase I of the over-all program are described. The system contains a mini-computer to control the various high-speed processing elements in the data path and a classifier which implements an all-digital prototype multivariate-Gaussian maximum likelihood decision algorithm operating 2 x 105 pixels/sec. Sufficient hardware was developed to perform signature extraction from computer-compatible tapes, compute classifier coefficients, control the classifier operation, and diagnose operation. Diagnostic programs used to test MIDAS' operations are presented.
Discrete Biogeography Based Optimization for Feature Selection in Molecular Signatures.
Liu, Bo; Tian, Meihong; Zhang, Chunhua; Li, Xiangtao
2015-04-01
Biomarker discovery from high-dimensional data is a complex task in the development of efficient cancer diagnoses and classification. However, these data are usually redundant and noisy, and only a subset of them present distinct profiles for different classes of samples. Thus, selecting high discriminative genes from gene expression data has become increasingly interesting in the field of bioinformatics. In this paper, a discrete biogeography based optimization is proposed to select the good subset of informative gene relevant to the classification. In the proposed algorithm, firstly, the fisher-markov selector is used to choose fixed number of gene data. Secondly, to make biogeography based optimization suitable for the feature selection problem; discrete migration model and discrete mutation model are proposed to balance the exploration and exploitation ability. Then, discrete biogeography based optimization, as we called DBBO, is proposed by integrating discrete migration model and discrete mutation model. Finally, the DBBO method is used for feature selection, and three classifiers are used as the classifier with the 10 fold cross-validation method. In order to show the effective and efficiency of the algorithm, the proposed algorithm is tested on four breast cancer dataset benchmarks. Comparison with genetic algorithm, particle swarm optimization, differential evolution algorithm and hybrid biogeography based optimization, experimental results demonstrate that the proposed method is better or at least comparable with previous method from literature when considering the quality of the solutions obtained. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
The 3′UTR signature defines a highly metastatic subgroup of triple-negative breast cancer
Wang, Lei; Hu, Xin; Wang, Peng; Shao, Zhi-Ming
2016-01-01
Triple-negative breast cancer (TNBC) is a highly heterogeneous disease with an aggressive clinical course. Prognostic models are needed to chart potential patient outcomes. To address this, we used alternative 3′UTR patterns to improve postoperative risk stratification. We collected 327 publicly available microarrays and generated the 3′UTR landscape based on expression ratios of alternative 3′UTR. After initial feature filtering, we built a 17-3′UTR-based classifier using an elastic net model. Time-dependent ROC comparisons and Kaplan–Meier analyses confirmed an outstanding discriminating power of our prognostic model for TNBC patients. In the training cohort, 5-year event-free survival (EFS) was 78.6% (95% CI 71.2–86.0) for the low-risk group, and 16.3% (95% CI 2.3–30.4) for the high-risk group (log-rank p<0.0001; hazard ratio [HR] 8.29, 95% CI 4.78–14.4), In the validation set, 5-year EFS was 75.6% (95% CI 68.0–83.2) for the low-risk group, and 33.2% (95% CI 17.1–49.3) for the high-risk group (log-rank p<0.0001; HR 3.17, 95% CI 1.66–5.42). In conclusion, the 17-3′UTR-based classifier provides a superior prognostic performance for estimating disease recurrence and metastasis in TNBC patients and it may permit personalized management strategies. PMID:27494850
A cDNA microarray gene expression data classifier for clinical diagnostics based on graph theory.
Benso, Alfredo; Di Carlo, Stefano; Politano, Gianfranco
2011-01-01
Despite great advances in discovering cancer molecular profiles, the proper application of microarray technology to routine clinical diagnostics is still a challenge. Current practices in the classification of microarrays' data show two main limitations: the reliability of the training data sets used to build the classifiers, and the classifiers' performances, especially when the sample to be classified does not belong to any of the available classes. In this case, state-of-the-art algorithms usually produce a high rate of false positives that, in real diagnostic applications, are unacceptable. To address this problem, this paper presents a new cDNA microarray data classification algorithm based on graph theory and is able to overcome most of the limitations of known classification methodologies. The classifier works by analyzing gene expression data organized in an innovative data structure based on graphs, where vertices correspond to genes and edges to gene expression relationships. To demonstrate the novelty of the proposed approach, the authors present an experimental performance comparison between the proposed classifier and several state-of-the-art classification algorithms.
Bouquet, Jerome; Soloski, Mark J; Swei, Andrea; Cheadle, Chris; Federman, Scot; Billaud, Jean-Noel; Rebman, Alison W; Kabre, Beniwende; Halpert, Richard; Boorgula, Meher; Aucott, John N; Chiu, Charles Y
2016-02-12
Lyme disease is a tick-borne illness caused by the bacterium Borrelia burgdorferi, and approximately 10 to 20% of patients report persistent symptoms lasting months to years despite appropriate treatment with antibiotics. To gain insights into the molecular basis of acute Lyme disease and the ensuing development of post-treatment symptoms, we conducted a longitudinal transcriptome study of 29 Lyme disease patients (and 13 matched controls) enrolled at the time of diagnosis and followed for up to 6 months. The differential gene expression signature of Lyme disease following the acute phase of infection persisted for at least 3 weeks and had fewer than 44% differentially expressed genes (DEGs) in common with other infectious or noninfectious syndromes. Early Lyme disease prior to antibiotic therapy was characterized by marked upregulation of Toll-like receptor signaling but lack of activation of the inflammatory T-cell apoptotic and B-cell developmental pathways seen in other acute infectious syndromes. Six months after completion of therapy, Lyme disease patients were found to have 31 to 60% of their pathways in common with three different immune-mediated chronic diseases. No differential gene expression signature was observed between Lyme disease patients with resolved illness to those with persistent symptoms at 6 months post-treatment. The identification of a sustained differential gene expression signature in Lyme disease suggests that a panel of selected human host-based biomarkers may address the need for sensitive clinical diagnostics during the "window period" of infection prior to the appearance of a detectable antibody response and may also inform the development of new therapeutic targets. Lyme disease is the most common tick-borne infection in the United States, and some patients report lingering symptoms lasting months to years despite antibiotic treatment. To better understand the role of the human host response in acute Lyme disease and the development of post-treatment symptoms, we conducted the first longitudinal gene expression (transcriptome) study of patients enrolled at the time of diagnosis and followed up for up to 6 months after treatment. Importantly, we found that the gene expression signature of early Lyme disease is distinct from that of other acute infectious diseases and persists for at least 3 weeks following infection. This study also uncovered multiple previously undescribed pathways and genes that may be useful in the future as human host biomarkers for diagnosis and that constitute potential targets for the development of new therapies. Copyright © 2016 Bouquet et al.
Switching benchmarks in cancer of unknown primary: from autopsy to microarray.
Pentheroudakis, George; Golfinopoulos, Vassilios; Pavlidis, Nicholas
2007-09-01
Cancer of unknown primary (CUP) is associated with unknown biology and dismal prognosis. Information on the primary site of origin is scant and has never been analysed. We systematically reviewed all published evidence on the CUP primary site identified by two different approaches, either autopsy or microarray gene expression profiling. Published reports on identification of CUP primary site by autopsy or microarray-based multigene expression platforms were retrieved and analysed for year of publication, primary site, patient age, gender, histology, rate of primary identification, manifestations and metastatic deposits, microarray chip technology, training and validation sets, mathematical modelling, classification accuracy and number of classifying genes. From 1944 to 2000, a total of 884 CUP patients (66% males) underwent autopsy in 12 studies after presenting with metastatic or systemic symptoms and succumbing to their disease. A primary was identified in 644 (73%) of them, mostly in the lung (27%), pancreas (24%), hepatobiliary tree (8%), kidneys (8%), bowel, genital system and stomach, as a small focus of adenocarcinoma or poorly differentiated carcinoma. An unpredictable systemic dissemination was evident with high frequency of lung (46%), nodal (35%), bone (17%), brain (16%) and uncommon (18%) deposits. Between the 1944-1980 and the 1980-2000 series, female representation increased, 'undetermined neoplasm' diagnosis became rarer, pancreatic primaries were found less often while colonic ones were identified more frequently. Four studies using microarray technology profiled more than 500 CUP cases using classifier set of genes (ranging from 10 to 495) and reported strikingly dissimilar frequencies of assigned primary sites (lung 11.5%, pancreas 12.5%, bowel 12%, breast 15%, hepatobiliary tree 8%, kidneys 6%, genital system 9%, bladder 5%) in 75-90% of the cases. Evolution in medical imaging technology, diet and lifestyle habits probably account for changing epidemiology of CUP primaries in autopsies. Discrepant assignment of primary sites by microarrays may be due to the presence of 'sanctuary sites' in autopsies, molecular misclassification and the postulated presence of a pro-metastatic genetic signature. In view of the absence of patient therapeutic or prognostic benefit with primary identification, gene expression profiling should be re-orientated towards unraveling the complex pathophysiology of metastases.
Feng, Quanzhou; Liu, Z Lewis; Weber, Scott A; Li, Shizhong
2018-01-01
Haploid laboratory strains of Saccharomyces cerevisiae are commonly used for genetic engineering to enable their xylose utilization but little is known about the industrial yeast which is often recognized as diploid and as well as haploid and tetraploid. Here we report three unique signature pathway expression patterns and gene interactions in the centre metabolic pathways that signify xylose utilization of genetically engineered industrial yeast S. cerevisiae NRRL Y-50463, a diploid yeast. Quantitative expression analysis revealed outstanding high levels of constitutive expression of YXI, a synthesized yeast codon-optimized xylose isomerase gene integrated into chromosome XV of strain Y-50463. Comparative expression analysis indicated that the YXI was necessary to initiate the xylose metabolic pathway along with a set of heterologous xylose transporter and utilization facilitating genes including XUT4, XUT6, XKS1 and XYL2. The highly activated transketolase and transaldolase genes TKL1, TKL2, TAL1 and NQM1 as well as their complex interactions in the non-oxidative pentose phosphate pathway branch were critical for the serial of sugar transformation to drive the metabolic flow into glycolysis for increased ethanol production. The significantly increased expression of the entire PRS gene family facilitates functions of the life cycle and biosynthesis superpathway for the yeast. The outstanding higher levels of constitutive expression of YXI and the first insight into the signature pathway expression and the gene interactions in the closely related centre metabolic pathways from the industrial yeast aid continued efforts for development of the next-generation biocatalyst. Our results further suggest the industrial yeast is a desirable delivery vehicle for new strain development for efficient lignocellulose-to-advanced biofuels production.
Feng, Quanzhou; Weber, Scott A.; Li, Shizhong
2018-01-01
Haploid laboratory strains of Saccharomyces cerevisiae are commonly used for genetic engineering to enable their xylose utilization but little is known about the industrial yeast which is often recognized as diploid and as well as haploid and tetraploid. Here we report three unique signature pathway expression patterns and gene interactions in the centre metabolic pathways that signify xylose utilization of genetically engineered industrial yeast S. cerevisiae NRRL Y-50463, a diploid yeast. Quantitative expression analysis revealed outstanding high levels of constitutive expression of YXI, a synthesized yeast codon-optimized xylose isomerase gene integrated into chromosome XV of strain Y-50463. Comparative expression analysis indicated that the YXI was necessary to initiate the xylose metabolic pathway along with a set of heterologous xylose transporter and utilization facilitating genes including XUT4, XUT6, XKS1 and XYL2. The highly activated transketolase and transaldolase genes TKL1, TKL2, TAL1 and NQM1 as well as their complex interactions in the non-oxidative pentose phosphate pathway branch were critical for the serial of sugar transformation to drive the metabolic flow into glycolysis for increased ethanol production. The significantly increased expression of the entire PRS gene family facilitates functions of the life cycle and biosynthesis superpathway for the yeast. The outstanding higher levels of constitutive expression of YXI and the first insight into the signature pathway expression and the gene interactions in the closely related centre metabolic pathways from the industrial yeast aid continued efforts for development of the next-generation biocatalyst. Our results further suggest the industrial yeast is a desirable delivery vehicle for new strain development for efficient lignocellulose-to-advanced biofuels production. PMID:29621349
Cruz-Rodriguez, Nataly; Combita, Alba L; Enciso, Leonardo J; Quijano, Sandra M; Pinzon, Paula L; Lozano, Olga C; Castillo, Juan S; Li, Li; Bareño, Jose; Cardozo, Claudia; Solano, Julio; Herrera, Maria V; Cudris, Jennifer; Zabaleta, Jovanny
2016-04-05
B-Acute lymphoblastic leukemia (B-ALL) represents a hematologic malignancy with poor clinical outcome and low survival rates in adult patients. Remission rates in Hispanic population are almost 30% lower and Overall Survival (OS) nearly two years inferior than those reported in other ethnic groups. Only 61% of Colombian adult patients with ALL achieve complete remission (CR), median overall survival is 11.3 months and event-free survival (EFS) is 7.34 months. Identification of prognostic factors is crucial for the application of proper treatment strategies and subsequently for successful outcome. Our goal was to identify a gene expression signature that might correlate with response to therapy and evaluate the utility of these as prognostic tool in hispanic patients. We included 43 adult patients newly diagnosed with B-ALL. We used microarray analysis in order to identify genes that distinguish poor from good response to treatment using differential gene expression analysis. The expression profile was validated by real-time PCR (RT-PCT). We identified 442 differentially expressed genes between responders and non-responders to induction treatment. Hierarchical analysis according to the expression of a 7-gene signature revealed 2 subsets of patients that differed in their clinical characteristics and outcome. Our study suggests that response to induction treatment and clinical outcome of Hispanic patients can be predicted from the onset of the disease and that gene expression profiles can be used to stratify patient risk adequately and accurately. The present study represents the first that shows the gene expression profiling of B-ALL Colombian adults and its relevance for stratification in the early course of disease.
Land use and land cover mapping: City of Palm Bay, Florida
NASA Technical Reports Server (NTRS)
Barile, D. D.; Pierce, R.
1977-01-01
Two different computer systems were compared for use in making land use and land cover maps. The Honeywell 635 with the LANDSAT signature development program (LSDP) produced a map depicting general patterns, but themes were difficult to classify as specific land use. Urban areas were unclassified. The General Electric Image 100 produced a map depicting eight land cover categories classifying 68 percent of the total area. Ground truth, LSDP, and Image 100 maps were all made to the same scale for comparison. LSDP agreed with the ground truth 60 percent and 64 percent within the two test areas compared and Image 100 was in agreement 70 percent and 80 percent.
Lequerré, Thierry; Bansard, Carine; Vittecoq, Olivier; Derambure, Céline; Hiron, Martine; Daveau, Maryvonne; Tron, François; Ayral, Xavier; Biga, Norman; Auquit-Auckbur, Isabelle; Chiocchia, Gilles; Le Loët, Xavier; Salier, Jean-Philippe
2009-01-01
Introduction Rheumatoid arthritis (RA) is a heterogeneous disease and its underlying molecular mechanisms are still poorly understood. Because previous microarray studies have only focused on long-standing (LS) RA compared to osteoarthritis, we aimed to compare the molecular profiles of early and LS RA versus control synovia. Methods Synovial biopsies were obtained by arthroscopy from 15 patients (4 early untreated RA, 4 treated LS RA and 7 controls, who had traumatic or mechanical lesions). Extracted mRNAs were used for large-scale gene-expression profiling. The different gene-expression combinations identified by comparison of profiles of early, LS RA and healthy synovia were linked to the biological processes involved in each situation. Results Three combinations of 719, 116 and 52 transcripts discriminated, respectively, early from LS RA, and early or LS RA from healthy synovia. We identified several gene clusters and distinct molecular signatures specifically expressed during early or LS RA, thereby suggesting the involvement of different pathophysiological mechanisms during the course of RA. Conclusions Early and LS RA have distinct molecular signatures with different biological processes participating at different times during the course of the disease. These results suggest that better knowledge of the main biological processes involved at a given RA stage might help to choose the most appropriate treatment. PMID:19563633
van Kessel, Kim E M; van de Werken, Harmen J G; Lurkin, Irene; Ziel-van der Made, Angelique C J; Zwarthoff, Ellen C; Boormans, Joost L
2017-01-01
Neoadjuvant chemotherapy (NAC) for muscle-invasive bladder cancer (MIBC) provides a small but significant survival benefit. Nevertheless, controversies on applying NAC remain because the limited benefit must be weight against chemotherapy-related toxicity and the delay of definitive local treatment. Therefore, there is a clear clinical need for tools to guide treatment decisions on NAC in MIBC. Here, we aimed to validate a previously reported 20-gene expression signature that predicted lymph node-positive disease at radical cystectomy in clinically node-negative MIBC patients, which would be a justification for upfront chemotherapy. We studied diagnostic transurethral resection of bladder tumors (dTURBT) of 150 MIBC patients (urothelial carcinoma) who were subsequently treated by radical cystectomy and pelvic lymph node dissection. RNA was isolated and the expression level of the 20 genes was determined on a qRT-PCR platform. Normalized Ct values were used to calculate a risk score to predict the presence of node-positive disease. The Cancer Genome Atlas (TCGA) RNA expression data was analyzed to subsequently validate the results. In a univariate regression analysis, none of the 20 genes significantly correlated with node-positive disease. The area under the curve of the risk score calculated by the 20-gene expression signature was 0.54 (95% Confidence Interval: 0.44-0.65) versus 0.67 for the model published by Smith et al. Node-negative patients had a significantly lower tumor grade at TURBT (p = 0.03), a lower pT stage (p<0.01) and less frequent lymphovascular invasion (13% versus 38%, p<0.01) at radical cystectomy than node-positive patients. In addition, in the TCGA data, none of the 20 genes was differentially expressed in node-negative versus node-positive patients. We conclude that a 20-gene expression signature developed for nodal staging of MIBC at radical cystectomy could not be validated on a qRT-PCR platform in a large cohort of dTURBT specimens.
Rogic, Sanja; Wong, Albertina; Pavlidis, Paul
2017-01-01
Background Prenatal alcohol exposure (PAE) can result in an array of morphological, behavioural and neurobiological deficits that can range in their severity. Despite extensive research in the field and a significant progress made, especially in understanding the range of possible malformations and neurobehavioral abnormalities, the molecular mechanisms of alcohol responses in development are still not well understood. There have been multiple transcriptomic studies looking at the changes in gene expression after PAE in animal models, however there is a limited apparent consensus among the reported findings. In an effort to address this issue, we performed a comprehensive re-analysis and meta-analysis of all suitable, publically available expression data sets. Methods We assembled ten microarray data sets of gene expression after PAE in mouse and rat models consisting of samples from a total of 63 ethanol-exposed and 80 control animals. We re-analyzed each data set for differential expression and then used the results to perform meta-analyses considering all data sets together or grouping them by time or duration of exposure (pre- and post-natal, acute and chronic, respectively). We performed network and Gene Ontology enrichment analysis to further characterize the identified signatures. Results For each sub-analysis we identified signatures of differential expressed genes that show support from multiple studies. Overall, the changes in gene expression were more extensive after acute ethanol treatment during prenatal development than in other models. Considering the analysis of all the data together, we identified a robust core signature of 104 genes down-regulated after PAE, with no up-regulated genes. Functional analysis reveals over-representation of genes involved in protein synthesis, mRNA splicing and chromatin organization. Conclusions Our meta-analysis shows that existing studies, despite superficial dissimilarity in findings, share features that allow us to identify a common core signature set of transcriptome changes in PAE. This is an important step to identifying the biological processes that underlie the etiology of FASD. PMID:26996386
Tuller, T; Atar, S; Ruppin, E; Gurevich, M; Achiron, A
2013-03-01
The aim of this study is to understand intracellular regulatory mechanisms in peripheral blood mononuclear cells (PBMCs), which are either common to many autoimmune diseases or specific to some of them. We incorporated large-scale data such as protein-protein interactions, gene expression and demographical information of hundreds of patients and healthy subjects, related to six autoimmune diseases with available large-scale gene expression measurements: multiple sclerosis (MS), systemic lupus erythematosus (SLE), juvenile rheumatoid arthritis (JRA), Crohn's disease (CD), ulcerative colitis (UC) and type 1 diabetes (T1D). These data were analyzed concurrently by statistical and systems biology approaches tailored for this purpose. We found that chemokines such as CXCL1-3, 5, 6 and the interleukin (IL) IL8 tend to be differentially expressed in PBMCs of patients with the analyzed autoimmune diseases. In addition, the anti-apoptotic gene BCL3, interferon-γ (IFNG), and the vitamin D receptor (VDR) gene physically interact with significantly many genes that tend to be differentially expressed in PBMCs of patients with the analyzed autoimmune diseases. In general, similar cellular processes tend to be differentially expressed in PBMC in the analyzed autoimmune diseases. Specifically, the cellular processes related to cell proliferation (for example, epidermal growth factor, platelet-derived growth factor, nuclear factor-κB, Wnt/β-catenin signaling, stress-activated protein kinase c-Jun NH2-terminal kinase), inflammatory response (for example, interleukins IL2 and IL6, the cytokine granulocyte-macrophage colony-stimulating factor and the B-cell receptor), general signaling cascades (for example, mitogen-activated protein kinase, extracellular signal-regulated kinase, p38 and TRK) and apoptosis are activated in most of the analyzed autoimmune diseases. However, our results suggest that in each of the analyzed diseases, apoptosis and chemotaxis are activated via different subsignaling pathways. Analyses of the expression levels of dozens of genes and the protein-protein interactions among them demonstrated that CD and UC have relatively similar gene expression signatures, whereas the gene expression signatures of T1D and JRA relatively differ from the signatures of the other autoimmune diseases. These diseases are the only ones activated via the Fcɛ pathway. The relevant genes and pathways reported in this study are discussed at length, and may be helpful in the diagnoses and understanding of autoimmunity and/or specific autoimmune diseases.
Cloud classification in polar regions using AVHRR textural and spectral signatures
NASA Technical Reports Server (NTRS)
Welch, R. M.; Sengupta, S. K.; Weger, R. C.; Christopher, S. A.; Kuo, K. S.; Carsey, F. D.
1990-01-01
Arctic clouds and ice-covered surfaces are classified on the basis of textural and spectral features obtained with AVHRR 1.1-km spatial resolution imagery over the Beaufort Sea during May-October, 1989. Scenes were acquired about every 5 days, for a total of 38 cases. A list comprising 20 arctic-surface and cloud classes is compiled using spectral measures defined by Garand (1988).