Sample records for cancer classification based

  1. Application of wavelet transformation and adaptive neighborhood based modified backpropagation (ANMBP) for classification of brain cancer

    NASA Astrophysics Data System (ADS)

    Werdiningsih, Indah; Zaman, Badrus; Nuqoba, Barry

    2017-08-01

    This paper presents classification of brain cancer using wavelet transformation and Adaptive Neighborhood Based Modified Backpropagation (ANMBP). Three stages of the processes, namely features extraction, features reduction, and classification process. Wavelet transformation is used for feature extraction and ANMBP is used for classification process. The result of features extraction is feature vectors. Features reduction used 100 energy values per feature and 10 energy values per feature. Classifications of brain cancer are normal, alzheimer, glioma, and carcinoma. Based on simulation results, 10 energy values per feature can be used to classify brain cancer correctly. The correct classification rate of proposed system is 95 %. This research demonstrated that wavelet transformation can be used for features extraction and ANMBP can be used for classification of brain cancer.

  2. Pathohistological classification systems in gastric cancer: Diagnostic relevance and prognostic value

    PubMed Central

    Berlth, Felix; Bollschweiler, Elfriede; Drebber, Uta; Hoelscher, Arnulf H; Moenig, Stefan

    2014-01-01

    Several pathohistological classification systems exist for the diagnosis of gastric cancer. Many studies have investigated the correlation between the pathohistological characteristics in gastric cancer and patient characteristics, disease specific criteria and overall outcome. It is still controversial as to which classification system imparts the most reliable information, and therefore, the choice of system may vary in clinical routine. In addition to the most common classification systems, such as the Laurén and the World Health Organization (WHO) classifications, other authors have tried to characterize and classify gastric cancer based on the microscopic morphology and in reference to the clinical outcome of the patients. In more than 50 years of systematic classification of the pathohistological characteristics of gastric cancer, there is no sole classification system that is consistently used worldwide in diagnostics and research. However, several national guidelines for the treatment of gastric cancer refer to the Laurén or the WHO classifications regarding therapeutic decision-making, which underlines the importance of a reliable classification system for gastric cancer. The latest results from gastric cancer studies indicate that it might be useful to integrate DNA- and RNA-based features of gastric cancer into the classification systems to establish prognostic relevance. This article reviews the diagnostic relevance and the prognostic value of different pathohistological classification systems in gastric cancer. PMID:24914328

  3. Cancer classification in the genomic era: five contemporary problems.

    PubMed

    Song, Qingxuan; Merajver, Sofia D; Li, Jun Z

    2015-10-19

    Classification is an everyday instinct as well as a full-fledged scientific discipline. Throughout the history of medicine, disease classification is central to how we develop knowledge, make diagnosis, and assign treatment. Here, we discuss the classification of cancer and the process of categorizing cancer subtypes based on their observed clinical and biological features. Traditionally, cancer nomenclature is primarily based on organ location, e.g., "lung cancer" designates a tumor originating in lung structures. Within each organ-specific major type, finer subgroups can be defined based on patient age, cell type, histological grades, and sometimes molecular markers, e.g., hormonal receptor status in breast cancer or microsatellite instability in colorectal cancer. In the past 15+ years, high-throughput technologies have generated rich new data regarding somatic variations in DNA, RNA, protein, or epigenomic features for many cancers. These data, collected for increasingly large tumor cohorts, have provided not only new insights into the biological diversity of human cancers but also exciting opportunities to discover previously unrecognized cancer subtypes. Meanwhile, the unprecedented volume and complexity of these data pose significant challenges for biostatisticians, cancer biologists, and clinicians alike. Here, we review five related issues that represent contemporary problems in cancer taxonomy and interpretation. (1) How many cancer subtypes are there? (2) How can we evaluate the robustness of a new classification system? (3) How are classification systems affected by intratumor heterogeneity and tumor evolution? (4) How should we interpret cancer subtypes? (5) Can multiple classification systems co-exist? While related issues have existed for a long time, we will focus on those aspects that have been magnified by the recent influx of complex multi-omics data. Exploration of these problems is essential for data-driven refinement of cancer classification and the successful application of these concepts in precision medicine.

  4. Cancer Pain: A Critical Review of Mechanism-based Classification and Physical Therapy Management in Palliative Care

    PubMed Central

    Kumar, Senthil P

    2011-01-01

    Mechanism-based classification and physical therapy management of pain is essential to effectively manage painful symptoms in patients attending palliative care. The objective of this review is to provide a detailed review of mechanism-based classification and physical therapy management of patients with cancer pain. Cancer pain can be classified based upon pain symptoms, pain mechanisms and pain syndromes. Classification based upon mechanisms not only addresses the underlying pathophysiology but also provides us with an understanding behind patient's symptoms and treatment responses. Existing evidence suggests that the five mechanisms – central sensitization, peripheral sensitization, sympathetically maintained pain, nociceptive and cognitive-affective – operate in patients with cancer pain. Summary of studies showing evidence for physical therapy treatment methods for cancer pain follows with suggested therapeutic implications. Effective palliative physical therapy care using a mechanism-based classification model should be tailored to suit each patient's findings, using a biopsychosocial model of pain. PMID:21976851

  5. Molecular cancer classification using a meta-sample-based regularized robust coding method.

    PubMed

    Wang, Shu-Lin; Sun, Liuchao; Fang, Jianwen

    2014-01-01

    Previous studies have demonstrated that machine learning based molecular cancer classification using gene expression profiling (GEP) data is promising for the clinic diagnosis and treatment of cancer. Novel classification methods with high efficiency and prediction accuracy are still needed to deal with high dimensionality and small sample size of typical GEP data. Recently the sparse representation (SR) method has been successfully applied to the cancer classification. Nevertheless, its efficiency needs to be improved when analyzing large-scale GEP data. In this paper we present the meta-sample-based regularized robust coding classification (MRRCC), a novel effective cancer classification technique that combines the idea of meta-sample-based cluster method with regularized robust coding (RRC) method. It assumes that the coding residual and the coding coefficient are respectively independent and identically distributed. Similar to meta-sample-based SR classification (MSRC), MRRCC extracts a set of meta-samples from the training samples, and then encodes a testing sample as the sparse linear combination of these meta-samples. The representation fidelity is measured by the l2-norm or l1-norm of the coding residual. Extensive experiments on publicly available GEP datasets demonstrate that the proposed method is more efficient while its prediction accuracy is equivalent to existing MSRC-based methods and better than other state-of-the-art dimension reduction based methods.

  6. [Evaluation of traditional pathological classification at molecular classification era for gastric cancer].

    PubMed

    Yu, Yingyan

    2014-01-01

    Histopathological classification is in a pivotal position in both basic research and clinical diagnosis and treatment of gastric cancer. Currently, there are different classification systems in basic science and clinical application. In medical literatures, different classifications are used including Lauren and WHO systems, which have confused many researchers. Lauren classification has been proposed for half a century, but is still used worldwide. It shows many advantages of simple, easy handling with prognostic significance. The WHO classification scheme is better than Lauren classification in that it is continuously being revised according to the progress of gastric cancer, and is always used in the clinical and pathological diagnosis of common scenarios. Along with the progression of genomics, transcriptomics, proteomics, metabolomics researches, molecular classification of gastric cancer becomes the current hot topics. The traditional therapeutic approach based on phenotypic characteristics of gastric cancer will most likely be replaced with a gene variation mode. The gene-targeted therapy against the same molecular variation seems more reasonable than traditional chemical treatment based on the same morphological change.

  7. Classification of cancerous cells based on the one-class problem approach

    NASA Astrophysics Data System (ADS)

    Murshed, Nabeel A.; Bortolozzi, Flavio; Sabourin, Robert

    1996-03-01

    One of the most important factors in reducing the effect of cancerous diseases is the early diagnosis, which requires a good and a robust method. With the advancement of computer technologies and digital image processing, the development of a computer-based system has become feasible. In this paper, we introduce a new approach for the detection of cancerous cells. This approach is based on the one-class problem approach, through which the classification system need only be trained with patterns of cancerous cells. This reduces the burden of the training task by about 50%. Based on this approach, a computer-based classification system is developed, based on the Fuzzy ARTMAP neural networks. Experimental results were performed using a set of 542 patterns taken from a sample of breast cancer. Results of the experiment show 98% correct identification of cancerous cells and 95% correct identification of non-cancerous cells.

  8. A discrete wavelet based feature extraction and hybrid classification technique for microarray data analysis.

    PubMed

    Bennet, Jaison; Ganaprakasam, Chilambuchelvan Arul; Arputharaj, Kannan

    2014-01-01

    Cancer classification by doctors and radiologists was based on morphological and clinical features and had limited diagnostic ability in olden days. The recent arrival of DNA microarray technology has led to the concurrent monitoring of thousands of gene expressions in a single chip which stimulates the progress in cancer classification. In this paper, we have proposed a hybrid approach for microarray data classification based on nearest neighbor (KNN), naive Bayes, and support vector machine (SVM). Feature selection prior to classification plays a vital role and a feature selection technique which combines discrete wavelet transform (DWT) and moving window technique (MWT) is used. The performance of the proposed method is compared with the conventional classifiers like support vector machine, nearest neighbor, and naive Bayes. Experiments have been conducted on both real and benchmark datasets and the results indicate that the ensemble approach produces higher classification accuracy than conventional classifiers. This paper serves as an automated system for the classification of cancer and can be applied by doctors in real cases which serve as a boon to the medical community. This work further reduces the misclassification of cancers which is highly not allowed in cancer detection.

  9. Lauren classification and individualized chemotherapy in gastric cancer.

    PubMed

    Ma, Junli; Shen, Hong; Kapesa, Linda; Zeng, Shan

    2016-05-01

    Gastric cancer is one of the most common malignancies worldwide. During the last 50 years, the histological classification of gastric carcinoma has been largely based on Lauren's criteria, in which gastric cancer is classified into two major histological subtypes, namely intestinal type and diffuse type adenocarcinoma. This classification was introduced in 1965, and remains currently widely accepted and employed, since it constitutes a simple and robust classification approach. The two histological subtypes of gastric cancer proposed by the Lauren classification exhibit a number of distinct clinical and molecular characteristics, including histogenesis, cell differentiation, epidemiology, etiology, carcinogenesis, biological behaviors and prognosis. Gastric cancer exhibits varied sensitivity to chemotherapy drugs and significant heterogeneity; therefore, the disease may be a target for individualized therapy. The Lauren classification may provide the basis for individualized treatment for advanced gastric cancer, which is increasingly gaining attention in the scientific field. However, few studies have investigated individualized treatment that is guided by pathological classification. The aim of the current review is to analyze the two major histological subtypes of gastric cancer, as proposed by the Lauren classification, and to discuss the implications of this for personalized chemotherapy.

  10. A New Direction of Cancer Classification: Positive Effect of Low-Ranking MicroRNAs.

    PubMed

    Li, Feifei; Piao, Minghao; Piao, Yongjun; Li, Meijing; Ryu, Keun Ho

    2014-10-01

    Many studies based on microRNA (miRNA) expression profiles showed a new aspect of cancer classification. Because one characteristic of miRNA expression data is the high dimensionality, feature selection methods have been used to facilitate dimensionality reduction. The feature selection methods have one shortcoming thus far: they just consider the problem of where feature to class is 1:1 or n:1. However, because one miRNA may influence more than one type of cancer, human miRNA is considered to be ranked low in traditional feature selection methods and are removed most of the time. In view of the limitation of the miRNA number, low-ranking miRNAs are also important to cancer classification. We considered both high- and low-ranking features to cover all problems (1:1, n:1, 1:n, and m:n) in cancer classification. First, we used the correlation-based feature selection method to select the high-ranking miRNAs, and chose the support vector machine, Bayes network, decision tree, k-nearest-neighbor, and logistic classifier to construct cancer classification. Then, we chose Chi-square test, information gain, gain ratio, and Pearson's correlation feature selection methods to build the m:n feature subset, and used the selected miRNAs to determine cancer classification. The low-ranking miRNA expression profiles achieved higher classification accuracy compared with just using high-ranking miRNAs in traditional feature selection methods. Our results demonstrate that the m:n feature subset made a positive impression of low-ranking miRNAs in cancer classification.

  11. Applicability of the Proposed Japanese Model for the Classification of Gastric Cancer Location: The "PROTRADIST" Retrospective Study.

    PubMed

    Marano, Luigi; Petrillo, Marianna; Pezzella, Modestino; Patriti, Alberto; Braccio, Bartolomeo; Esposito, Giuseppe; Grassia, Michele; Romano, Angela; Torelli, Francesco; De Luca, Raffaele; Fabozzi, Alessio; Falco, Giuseppe; Di Martino, Natale

    2017-06-01

    The extension of lymphadenectomy for surgical treatment of gastric cancer remains discordant among European and Japanese surgeons. Kinami et al. (Kinami S, Fujimura T, Ojima E, et al. PTD classification: proposal for a new classification of gastric cancer location based on physiological lymphatic flow. Int. J. Clin. Oncol. 2008;13:320-329) proposed a new experimental classification, the "Proximal zone, Transitional zone, Distal zone" (PTD) classification, based on the physiological lymphatic flow of gastric cancer site. The aim of the present retrospective study is to assess the applicability of PTD Japanese model in gastric cancer patients of our Western surgical department. Two groups of patients with histologically documented adenocarcinoma of the stomach were retrospectively obtained: In the first group were categorized 89 patients with T1a-T1b tumor invasion; and in the second group were 157 patients with T2-T3 category. The data collected were then categorized according to the PTD classification. In the T1a-T1b group there were no lymph node metastases within the r-GA or r-GEA compartments for tumors located in the P portion, and similarly there were no lymphatic metastases within the l-GEA or p-GA compartments for tumors located in the D portion. On the contrary, in the T2-T3 group the lymph node metastases presented a diffused spreading with no statistical significance between the two classification models. Our results show that the PTD classification based on physiological lymphatic flow of the gastric cancer site is a more physiological and clinical version than the Upper, Medium And Lower classification. It represents a valuable and applicable model of cancer location that could be a guide to a tailored surgical approach in Italian patients with neoplasm confined to submucosa. Nevertheless, in order to confirm our findings, larger and prospective studies are needed.

  12. Classifying breast cancer surgery: a novel, complexity-based system for oncological, oncoplastic and reconstructive procedures, and proof of principle by analysis of 1225 operations in 1166 patients.

    PubMed

    Hoffmann, Jürgen; Wallwiener, Diethelm

    2009-04-08

    One of the basic prerequisites for generating evidence-based data is the availability of classification systems. Attempts to date to classify breast cancer operations have focussed on specific problems, e.g. the avoidance of secondary corrective surgery for surgical defects, rather than taking a generic approach. Starting from an existing, simpler empirical scheme based on the complexity of breast surgical procedures, which was used in-house primarily in operative report-writing, a novel classification of ablative and breast-conserving procedures initially needed to be developed and elaborated systematically. To obtain proof of principle, a prospectively planned analysis of patient records for all major breast cancer-related operations performed at our breast centre in 2005 and 2006 was conducted using the new classification. Data were analysed using basic descriptive statistics such as frequency tables. A novel two-type, six-tier classification system comprising 12 main categories, 13 subcategories and 39 sub-subcategories of oncological, oncoplastic and reconstructive breast cancer-related surgery was successfully developed. Our system permitted unequivocal classification, without exception, of all 1225 procedures performed in 1166 breast cancer patients in 2005 and 2006. Breast cancer-related surgical procedures can be generically classified according to their surgical complexity. Analysis of all major procedures performed at our breast centre during the study period provides proof of principle for this novel classification system. We envisage various applications for this classification, including uses in randomised clinical trials, guideline development, specialist surgical training, continuing professional development as well as quality of care and public health research.

  13. Quantum Cascade Laser-Based Infrared Microscopy for Label-Free and Automated Cancer Classification in Tissue Sections.

    PubMed

    Kuepper, Claus; Kallenbach-Thieltges, Angela; Juette, Hendrik; Tannapfel, Andrea; Großerueschkamp, Frederik; Gerwert, Klaus

    2018-05-16

    A feasibility study using a quantum cascade laser-based infrared microscope for the rapid and label-free classification of colorectal cancer tissues is presented. Infrared imaging is a reliable, robust, automated, and operator-independent tissue classification method that has been used for differential classification of tissue thin sections identifying tumorous regions. However, long acquisition time by the so far used FT-IR-based microscopes hampered the clinical translation of this technique. Here, the used quantum cascade laser-based microscope provides now infrared images for precise tissue classification within few minutes. We analyzed 110 patients with UICC-Stage II and III colorectal cancer, showing 96% sensitivity and 100% specificity of this label-free method as compared to histopathology, the gold standard in routine clinical diagnostics. The main hurdle for the clinical translation of IR-Imaging is overcome now by the short acquisition time for high quality diagnostic images, which is in the same time range as frozen sections by pathologists.

  14. Regression-Based Approach For Feature Selection In Classification Issues. Application To Breast Cancer Detection And Recurrence

    NASA Astrophysics Data System (ADS)

    Belciug, Smaranda; Serbanescu, Mircea-Sebastian

    2015-09-01

    Feature selection is considered a key factor in classifications/decision problems. It is currently used in designing intelligent decision systems to choose the best features which allow the best performance. This paper proposes a regression-based approach to select the most important predictors to significantly increase the classification performance. Application to breast cancer detection and recurrence using publically available datasets proved the efficiency of this technique.

  15. Epithelial cancer detection by oblique-incidence optical spectroscopy

    NASA Astrophysics Data System (ADS)

    Garcia-Uribe, Alejandro; Balareddy, Karthik C.; Zou, Jun; Wang, Kenneth K.; Duvic, Madeleine; Wang, Lihong V.

    2009-02-01

    This paper presents a study on non-invasive detection of two common epithelial cancers (skin and esophagus) based on oblique incidence diffuse reflectance spectroscopy (OIDRS). An OIDRS measurement system, which combines fiber optics and MEMS technologies, was developed. In our pilot studies, a total number of 137 cases have been measured in-vivo for skin cancer detection and a total number of 20 biopsy samples have been measured ex-vivo for esophageal cancer detection. To automatically differentiate the cancerous cases from benign ones, a statistical software classification program was also developed. An overall classification accuracy of 90% and 100% has been achieved for skin and esophageal cancer classification, respectively.

  16. MO-DE-207B-03: Improved Cancer Classification Using Patient-Specific Biological Pathway Information Via Gene Expression Data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Young, M; Craft, D

    Purpose: To develop an efficient, pathway-based classification system using network biology statistics to assist in patient-specific response predictions to radiation and drug therapies across multiple cancer types. Methods: We developed PICS (Pathway Informed Classification System), a novel two-step cancer classification algorithm. In PICS, a matrix m of mRNA expression values for a patient cohort is collapsed into a matrix p of biological pathways. The entries of p, which we term pathway scores, are obtained from either principal component analysis (PCA), normal tissue centroid (NTC), or gene expression deviation (GED). The pathway score matrix is clustered using both k-means and hierarchicalmore » clustering, and a clustering is judged by how well it groups patients into distinct survival classes. The most effective pathway scoring/clustering combination, per clustering p-value, thus generates various ‘signatures’ for conventional and functional cancer classification. Results: PICS successfully regularized large dimension gene data, separated normal and cancerous tissues, and clustered a large patient cohort spanning six cancer types. Furthermore, PICS clustered patient cohorts into distinct, statistically-significant survival groups. For a suboptimally-debulked ovarian cancer set, the pathway-classified Kaplan-Meier survival curve (p = .00127) showed significant improvement over that of a prior gene expression-classified study (p = .0179). For a pancreatic cancer set, the pathway-classified Kaplan-Meier survival curve (p = .00141) showed significant improvement over that of a prior gene expression-classified study (p = .04). Pathway-based classification confirmed biomarkers for the pyrimidine, WNT-signaling, glycerophosphoglycerol, beta-alanine, and panthothenic acid pathways for ovarian cancer. Despite its robust nature, PICS requires significantly less run time than current pathway scoring methods. Conclusion: This work validates the PICS method to improve cancer classification using biological pathways. Patients are classified with greater specificity and physiological relevance as compared to current gene-specific approaches. Focus now moves to utilizing PICS for pan-cancer patient-specific treatment response prediction.« less

  17. Influence of nuclei segmentation on breast cancer malignancy classification

    NASA Astrophysics Data System (ADS)

    Jelen, Lukasz; Fevens, Thomas; Krzyzak, Adam

    2009-02-01

    Breast Cancer is one of the most deadly cancers affecting middle-aged women. Accurate diagnosis and prognosis are crucial to reduce the high death rate. Nowadays there are numerous diagnostic tools for breast cancer diagnosis. In this paper we discuss a role of nuclear segmentation from fine needle aspiration biopsy (FNA) slides and its influence on malignancy classification. Classification of malignancy plays a very important role during the diagnosis process of breast cancer. Out of all cancer diagnostic tools, FNA slides provide the most valuable information about the cancer malignancy grade which helps to choose an appropriate treatment. This process involves assessing numerous nuclear features and therefore precise segmentation of nuclei is very important. In this work we compare three powerful segmentation approaches and test their impact on the classification of breast cancer malignancy. The studied approaches involve level set segmentation, fuzzy c-means segmentation and textural segmentation based on co-occurrence matrix. Segmented nuclei were used to extract nuclear features for malignancy classification. For classification purposes four different classifiers were trained and tested with previously extracted features. The compared classifiers are Multilayer Perceptron (MLP), Self-Organizing Maps (SOM), Principal Component-based Neural Network (PCA) and Support Vector Machines (SVM). The presented results show that level set segmentation yields the best results over the three compared approaches and leads to a good feature extraction with a lowest average error rate of 6.51% over four different classifiers. The best performance was recorded for multilayer perceptron with an error rate of 3.07% using fuzzy c-means segmentation.

  18. Molecular Classification and Correlates in Colorectal Cancer

    PubMed Central

    Ogino, Shuji; Goel, Ajay

    2008-01-01

    Molecular classification of colorectal cancer is evolving. As our understanding of colorectal carcinogenesis improves, we are incorporating new knowledge into the classification system. In particular, global genomic status [microsatellite instability (MSI) status and chromosomal instability (CIN) status] and epigenomic status [CpG island methylator phenotype (CIMP) status] play a significant role in determining clinical, pathological and biological characteristics of colorectal cancer. In this review, we discuss molecular classification and molecular correlates based on MSI status and CIMP status in colorectal cancer. Studying molecular correlates is important in cancer research because it can 1) provide clues to pathogenesis, 2) propose or support the existence of a new molecular subtype, 3) alert investigators to be aware of potential confounding factors in association studies, and 4) suggest surrogate markers in clinical or research settings. PMID:18165277

  19. Classification of microscopic images of breast tissue

    NASA Astrophysics Data System (ADS)

    Ballerini, Lucia; Franzen, Lennart

    2004-05-01

    Breast cancer is the most common form of cancer among women. The diagnosis is usually performed by the pathologist, that subjectively evaluates tissue samples. The aim of our research is to develop techniques for the automatic classification of cancerous tissue, by analyzing histological samples of intact tissue taken with a biopsy. In our study, we considered 200 images presenting four different conditions: normal tissue, fibroadenosis, ductal cancer and lobular cancer. Methods to extract features have been investigated and described. One method is based on granulometries, which are size-shape descriptors widely used in mathematical morphology. Applications of granulometries lead to distribution functions whose moments are used as features. A second method is based on fractal geometry, that seems very suitable to quantify biological structures. The fractal dimension of binary images has been computed using the euclidean distance mapping. Image classification has then been performed using the extracted features as input of a back-propagation neural network. A new method that combines genetic algorithms and morphological filters has been also investigated. In this case, the classification is based on a correlation measure. Very encouraging results have been obtained with pilot experiments using a small subset of images as training set. Experimental results indicate the effectiveness of the proposed methods. Cancerous tissue was correctly classified in 92.5% of the cases.

  20. A Minimum Spanning Forest Based Method for Noninvasive Cancer Detection with Hyperspectral Imaging

    PubMed Central

    Pike, Robert; Lu, Guolan; Wang, Dongsheng; Chen, Zhuo Georgia; Fei, Baowei

    2016-01-01

    Goal The purpose of this paper is to develop a classification method that combines both spectral and spatial information for distinguishing cancer from healthy tissue on hyperspectral images in an animal model. Methods An automated algorithm based on a minimum spanning forest (MSF) and optimal band selection has been proposed to classify healthy and cancerous tissue on hyperspectral images. A support vector machine (SVM) classifier is trained to create a pixel-wise classification probability map of cancerous and healthy tissue. This map is then used to identify markers that are used to compute mutual information for a range of bands in the hyperspectral image and thus select the optimal bands. An MSF is finally grown to segment the image using spatial and spectral information. Conclusion The MSF based method with automatically selected bands proved to be accurate in determining the tumor boundary on hyperspectral images. Significance Hyperspectral imaging combined with the proposed classification technique has the potential to provide a noninvasive tool for cancer detection. PMID:26285052

  1. A probabilistic approach to segmentation and classification of neoplasia in uterine cervix images using color and geometric features

    NASA Astrophysics Data System (ADS)

    Srinivasan, Yeshwanth; Hernes, Dana; Tulpule, Bhakti; Yang, Shuyu; Guo, Jiangling; Mitra, Sunanda; Yagneswaran, Sriraja; Nutter, Brian; Jeronimo, Jose; Phillips, Benny; Long, Rodney; Ferris, Daron

    2005-04-01

    Automated segmentation and classification of diagnostic markers in medical imagery are challenging tasks. Numerous algorithms for segmentation and classification based on statistical approaches of varying complexity are found in the literature. However, the design of an efficient and automated algorithm for precise classification of desired diagnostic markers is extremely image-specific. The National Library of Medicine (NLM), in collaboration with the National Cancer Institute (NCI), is creating an archive of 60,000 digitized color images of the uterine cervix. NLM is developing tools for the analysis and dissemination of these images over the Web for the study of visual features correlated with precancerous neoplasia and cancer. To enable indexing of images of the cervix, it is essential to develop algorithms for the segmentation of regions of interest, such as acetowhitened regions, and automatic identification and classification of regions exhibiting mosaicism and punctation. Success of such algorithms depends, primarily, on the selection of relevant features representing the region of interest. We present color and geometric features based statistical classification and segmentation algorithms yielding excellent identification of the regions of interest. The distinct classification of the mosaic regions from the non-mosaic ones has been obtained by clustering multiple geometric and color features of the segmented sections using various morphological and statistical approaches. Such automated classification methodologies will facilitate content-based image retrieval from the digital archive of uterine cervix and have the potential of developing an image based screening tool for cervical cancer.

  2. Cloud-Scale Genomic Signals Processing for Robust Large-Scale Cancer Genomic Microarray Data Analysis.

    PubMed

    Harvey, Benjamin Simeon; Ji, Soo-Yeon

    2017-01-01

    As microarray data available to scientists continues to increase in size and complexity, it has become overwhelmingly important to find multiple ways to bring forth oncological inference to the bioinformatics community through the analysis of large-scale cancer genomic (LSCG) DNA and mRNA microarray data that is useful to scientists. Though there have been many attempts to elucidate the issue of bringing forth biological interpretation by means of wavelet preprocessing and classification, there has not been a research effort that focuses on a cloud-scale distributed parallel (CSDP) separable 1-D wavelet decomposition technique for denoising through differential expression thresholding and classification of LSCG microarray data. This research presents a novel methodology that utilizes a CSDP separable 1-D method for wavelet-based transformation in order to initialize a threshold which will retain significantly expressed genes through the denoising process for robust classification of cancer patients. Additionally, the overall study was implemented and encompassed within CSDP environment. The utilization of cloud computing and wavelet-based thresholding for denoising was used for the classification of samples within the Global Cancer Map, Cancer Cell Line Encyclopedia, and The Cancer Genome Atlas. The results proved that separable 1-D parallel distributed wavelet denoising in the cloud and differential expression thresholding increased the computational performance and enabled the generation of higher quality LSCG microarray datasets, which led to more accurate classification results.

  3. Symbolic rule-based classification of lung cancer stages from free-text pathology reports.

    PubMed

    Nguyen, Anthony N; Lawley, Michael J; Hansen, David P; Bowman, Rayleen V; Clarke, Belinda E; Duhig, Edwina E; Colquist, Shoni

    2010-01-01

    To classify automatically lung tumor-node-metastases (TNM) cancer stages from free-text pathology reports using symbolic rule-based classification. By exploiting report substructure and the symbolic manipulation of systematized nomenclature of medicine-clinical terms (SNOMED CT) concepts in reports, statements in free text can be evaluated for relevance against factors relating to the staging guidelines. Post-coordinated SNOMED CT expressions based on templates were defined and populated by concepts in reports, and tested for subsumption by staging factors. The subsumption results were used to build logic according to the staging guidelines to calculate the TNM stage. The accuracy measure and confusion matrices were used to evaluate the TNM stages classified by the symbolic rule-based system. The system was evaluated against a database of multidisciplinary team staging decisions and a machine learning-based text classification system using support vector machines. Overall accuracy on a corpus of pathology reports for 718 lung cancer patients against a database of pathological TNM staging decisions were 72%, 78%, and 94% for T, N, and M staging, respectively. The system's performance was also comparable to support vector machine classification approaches. A system to classify lung TNM stages from free-text pathology reports was developed, and it was verified that the symbolic rule-based approach using SNOMED CT can be used for the extraction of key lung cancer characteristics from free-text reports. Future work will investigate the applicability of using the proposed methodology for extracting other cancer characteristics and types.

  4. Classification of TP53 Mutations and HPV Predict Survival in Advanced Larynx Cancer

    PubMed Central

    Scheel, Adam; Bellile, Emily; McHugh, Jonathan B.; Walline, Heather M.; Prince, Mark E.; Urba, Susan; Wolf, Gregory T.; Eisbruch, Avraham; Worden, Francis; Carey, Thomas E.; Bradford, Carol

    2016-01-01

    OBJECTIVE Assess TP53 functional mutations in the context of other biomarkers in advanced larynx cancer. STUDY DESIGN Prospective analysis of pretreatment tumor TP53, HPV, Bcl-xL and cyclin D1 status in stage III and IV larynx cancer patients in a clinical trial. METHODS TP53 exons 4-9 from 58 tumors were sequenced. Mutations were grouped using three classifications based on their expected function. Each functional group was analyzed for response to induction chemotherapy, time to surgery, survival, HPV status, p16INK4a, Bcl-xl and cyclin D1 expression. RESULTS TP53 Mutations were found in 22/58 (37.9%) patients with advanced larynx cancer, including missense mutations in 13/58 (22.4%) patients, nonsense mutations in 4/58 (6.9%), and deletions in 5/58 (8.6%). High risk HPV was found in 20/52 (38.5%) tumors. A classification based on crystal Evolutionary Action score of p53 (EAp53) distinguished missense mutations with high risk for decreased survival from low risk mutations (p=0.0315). A model including this TP53 classification, HPV status, cyclin D1 and Bcl-xL staining significantly predicts survival (p=0.0017). CONCLUSION EAp53 functional classification of TP53 mutants and biomarkers predict survival in advanced larynx cancer. PMID:27345657

  5. Minimising Immunohistochemical False Negative ER Classification Using a Complementary 23 Gene Expression Signature of ER Status

    PubMed Central

    Li, Qiyuan; Eklund, Aron C.; Juul, Nicolai; Haibe-Kains, Benjamin; Workman, Christopher T.; Richardson, Andrea L.; Szallasi, Zoltan; Swanton, Charles

    2010-01-01

    Background Expression of the oestrogen receptor (ER) in breast cancer predicts benefit from endocrine therapy. Minimising the frequency of false negative ER status classification is essential to identify all patients with ER positive breast cancers who should be offered endocrine therapies in order to improve clinical outcome. In routine oncological practice ER status is determined by semi-quantitative methods such as immunohistochemistry (IHC) or other immunoassays in which the ER expression level is compared to an empirical threshold[1], [2]. The clinical relevance of gene expression-based ER subtypes as compared to IHC-based determination has not been systematically evaluated. Here we attempt to reduce the frequency of false negative ER status classification using two gene expression approaches and compare these methods to IHC based ER status in terms of predictive and prognostic concordance with clinical outcome. Methodology/Principal Findings Firstly, ER status was discriminated by fitting the bimodal expression of ESR1 to a mixed Gaussian model. The discriminative power of ESR1 suggested bimodal expression as an efficient way to stratify breast cancer; therefore we identified a set of genes whose expression was both strongly bimodal, mimicking ESR expression status, and highly expressed in breast epithelial cell lines, to derive a 23-gene ER expression signature-based classifier. We assessed our classifiers in seven published breast cancer cohorts by comparing the gene expression-based ER status to IHC-based ER status as a predictor of clinical outcome in both untreated and tamoxifen treated cohorts. In untreated breast cancer cohorts, the 23 gene signature-based ER status provided significantly improved prognostic power compared to IHC-based ER status (P = 0.006). In tamoxifen-treated cohorts, the 23 gene ER expression signature predicted clinical outcome (HR = 2.20, P = 0.00035). These complementary ER signature-based strategies estimated that between 15.1% and 21.8% patients of IHC-based negative ER status would be classified with ER positive breast cancer. Conclusion/Significance Expression-based ER status classification may complement IHC to minimise false negative ER status classification and optimise patient stratification for endocrine therapies. PMID:21152022

  6. CAMUR: Knowledge extraction from RNA-seq cancer data through equivalent classification rules.

    PubMed

    Cestarelli, Valerio; Fiscon, Giulia; Felici, Giovanni; Bertolazzi, Paola; Weitschek, Emanuel

    2016-03-01

    Nowadays, knowledge extraction methods from Next Generation Sequencing data are highly requested. In this work, we focus on RNA-seq gene expression analysis and specifically on case-control studies with rule-based supervised classification algorithms that build a model able to discriminate cases from controls. State of the art algorithms compute a single classification model that contains few features (genes). On the contrary, our goal is to elicit a higher amount of knowledge by computing many classification models, and therefore to identify most of the genes related to the predicted class. We propose CAMUR, a new method that extracts multiple and equivalent classification models. CAMUR iteratively computes a rule-based classification model, calculates the power set of the genes present in the rules, iteratively eliminates those combinations from the data set, and performs again the classification procedure until a stopping criterion is verified. CAMUR includes an ad-hoc knowledge repository (database) and a querying tool.We analyze three different types of RNA-seq data sets (Breast, Head and Neck, and Stomach Cancer) from The Cancer Genome Atlas (TCGA) and we validate CAMUR and its models also on non-TCGA data. Our experimental results show the efficacy of CAMUR: we obtain several reliable equivalent classification models, from which the most frequent genes, their relationships, and the relation with a particular cancer are deduced. dmb.iasi.cnr.it/camur.php emanuel@iasi.cnr.it Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  7. Image classification of human carcinoma cells using complex wavelet-based covariance descriptors.

    PubMed

    Keskin, Furkan; Suhre, Alexander; Kose, Kivanc; Ersahin, Tulin; Cetin, A Enis; Cetin-Atalay, Rengul

    2013-01-01

    Cancer cell lines are widely used for research purposes in laboratories all over the world. Computer-assisted classification of cancer cells can alleviate the burden of manual labeling and help cancer research. In this paper, we present a novel computerized method for cancer cell line image classification. The aim is to automatically classify 14 different classes of cell lines including 7 classes of breast and 7 classes of liver cancer cells. Microscopic images containing irregular carcinoma cell patterns are represented by subwindows which correspond to foreground pixels. For each subwindow, a covariance descriptor utilizing the dual-tree complex wavelet transform (DT-[Formula: see text]WT) coefficients and several morphological attributes are computed. Directionally selective DT-[Formula: see text]WT feature parameters are preferred primarily because of their ability to characterize edges at multiple orientations which is the characteristic feature of carcinoma cell line images. A Support Vector Machine (SVM) classifier with radial basis function (RBF) kernel is employed for final classification. Over a dataset of 840 images, we achieve an accuracy above 98%, which outperforms the classical covariance-based methods. The proposed system can be used as a reliable decision maker for laboratory studies. Our tool provides an automated, time- and cost-efficient analysis of cancer cell morphology to classify different cancer cell lines using image-processing techniques, which can be used as an alternative to the costly short tandem repeat (STR) analysis. The data set used in this manuscript is available as supplementary material through http://signal.ee.bilkent.edu.tr/cancerCellLineClassificationSampleImages.html.

  8. Image Classification of Human Carcinoma Cells Using Complex Wavelet-Based Covariance Descriptors

    PubMed Central

    Keskin, Furkan; Suhre, Alexander; Kose, Kivanc; Ersahin, Tulin; Cetin, A. Enis; Cetin-Atalay, Rengul

    2013-01-01

    Cancer cell lines are widely used for research purposes in laboratories all over the world. Computer-assisted classification of cancer cells can alleviate the burden of manual labeling and help cancer research. In this paper, we present a novel computerized method for cancer cell line image classification. The aim is to automatically classify 14 different classes of cell lines including 7 classes of breast and 7 classes of liver cancer cells. Microscopic images containing irregular carcinoma cell patterns are represented by subwindows which correspond to foreground pixels. For each subwindow, a covariance descriptor utilizing the dual-tree complex wavelet transform (DT-WT) coefficients and several morphological attributes are computed. Directionally selective DT-WT feature parameters are preferred primarily because of their ability to characterize edges at multiple orientations which is the characteristic feature of carcinoma cell line images. A Support Vector Machine (SVM) classifier with radial basis function (RBF) kernel is employed for final classification. Over a dataset of 840 images, we achieve an accuracy above 98%, which outperforms the classical covariance-based methods. The proposed system can be used as a reliable decision maker for laboratory studies. Our tool provides an automated, time- and cost-efficient analysis of cancer cell morphology to classify different cancer cell lines using image-processing techniques, which can be used as an alternative to the costly short tandem repeat (STR) analysis. The data set used in this manuscript is available as supplementary material through http://signal.ee.bilkent.edu.tr/cancerCellLineClassificationSampleImages.html. PMID:23341908

  9. Molecular classification of gastric adenocarcinoma: translating new insights from the cancer genome atlas research network.

    PubMed

    Sunakawa, Yu; Lenz, Heinz-Josef

    2015-04-01

    Gastric cancer is a heterogenous cancer, which may be classified into several distinct subtypes based on pathology and epidemiology, each with different initiating pathological processes and each possibly having different tumor biology. A classification of gastric cancer should be important to select patients who can benefit from the targeted therapies or to precisely predict prognosis. The Cancer Genome Atlas (TCGA) study collaborated with previous reports regarding subtyping gastric cancer but also proposed a refined classification based on molecular characteristics. The addition of the new molecular classification strategy to a current classical subtyping may be a promising option, particularly stratification by Epstein-Barr virus (EBV) and microsatellite instability (MSI) statuses. According to TCGA study, EBV gastric cancer patients may benefit the programmed cell death protein 1 (PD-1)/programmed death-ligand 1 (PD-L1) antibodies or phosphoinositide 3-kinase (PI3K) inhibitors which are now being developed. The discoveries of predictive biomarkers should improve patient care and individualized medicine in the management since the targeted therapies may have the potential to change the landscape of gastric cancer treatment, moreover leading to both better understanding of the heterogeneity and better outcomes. Patient enrichment by predictive biomarkers for new treatment strategies will be critical to improve clinical outcomes. Additionally, liquid biopsies will be able to enable us to monitor in real-time molecular escape mechanism, resulting in better treatment strategies.

  10. BCDForest: a boosting cascade deep forest model towards the classification of cancer subtypes based on gene expression data.

    PubMed

    Guo, Yang; Liu, Shuhui; Li, Zhanhuai; Shang, Xuequn

    2018-04-11

    The classification of cancer subtypes is of great importance to cancer disease diagnosis and therapy. Many supervised learning approaches have been applied to cancer subtype classification in the past few years, especially of deep learning based approaches. Recently, the deep forest model has been proposed as an alternative of deep neural networks to learn hyper-representations by using cascade ensemble decision trees. It has been proved that the deep forest model has competitive or even better performance than deep neural networks in some extent. However, the standard deep forest model may face overfitting and ensemble diversity challenges when dealing with small sample size and high-dimensional biology data. In this paper, we propose a deep learning model, so-called BCDForest, to address cancer subtype classification on small-scale biology datasets, which can be viewed as a modification of the standard deep forest model. The BCDForest distinguishes from the standard deep forest model with the following two main contributions: First, a named multi-class-grained scanning method is proposed to train multiple binary classifiers to encourage diversity of ensemble. Meanwhile, the fitting quality of each classifier is considered in representation learning. Second, we propose a boosting strategy to emphasize more important features in cascade forests, thus to propagate the benefits of discriminative features among cascade layers to improve the classification performance. Systematic comparison experiments on both microarray and RNA-Seq gene expression datasets demonstrate that our method consistently outperforms the state-of-the-art methods in application of cancer subtype classification. The multi-class-grained scanning and boosting strategy in our model provide an effective solution to ease the overfitting challenge and improve the robustness of deep forest model working on small-scale data. Our model provides a useful approach to the classification of cancer subtypes by using deep learning on high-dimensional and small-scale biology data.

  11. Semi-Supervised Projective Non-Negative Matrix Factorization for Cancer Classification.

    PubMed

    Zhang, Xiang; Guan, Naiyang; Jia, Zhilong; Qiu, Xiaogang; Luo, Zhigang

    2015-01-01

    Advances in DNA microarray technologies have made gene expression profiles a significant candidate in identifying different types of cancers. Traditional learning-based cancer identification methods utilize labeled samples to train a classifier, but they are inconvenient for practical application because labels are quite expensive in the clinical cancer research community. This paper proposes a semi-supervised projective non-negative matrix factorization method (Semi-PNMF) to learn an effective classifier from both labeled and unlabeled samples, thus boosting subsequent cancer classification performance. In particular, Semi-PNMF jointly learns a non-negative subspace from concatenated labeled and unlabeled samples and indicates classes by the positions of the maximum entries of their coefficients. Because Semi-PNMF incorporates statistical information from the large volume of unlabeled samples in the learned subspace, it can learn more representative subspaces and boost classification performance. We developed a multiplicative update rule (MUR) to optimize Semi-PNMF and proved its convergence. The experimental results of cancer classification for two multiclass cancer gene expression profile datasets show that Semi-PNMF outperforms the representative methods.

  12. Molecular classification of gastric cancer.

    PubMed

    Röcken, Christoph

    2017-03-01

    Gastric cancer is among the most common cancers worldwide. Despite declining incidences, the prognosis remains dismal in Western countries and is better in Asian countries with national cancer screening programs. Complete endoscopic or surgical resection of the primary tumor with or without lymphadenectomy offers the only chance of cure in the early stage of the disease. Survival of more locally advanced gastric cancers was improved by the introduction of perioperative, adjuvant and palliative chemotherapy. However, the identification and usage of novel predictive and diagnostic targets is urgently needed. Areas covered: Recent comprehensive molecular profiling of gastric cancer proposed four molecular subtypes, i.e. Epstein-Barr virus-associated, microsatellite instable, chromosomal instable and genomically stable carcinomas. The new molecular classification will spur clinical trials exploring novel targeted therapeutics. This review summarizes recent advancements of the molecular classification, and based on that, putative pitfalls for the development of tissue-based companion diagnostics, i.e. prevalence of actionable targets and therapeutic efficacy, tumor heterogeneity and tumor evolution, impact of ethnicity on gastric cancer biology, and standards of care in the East and West. Expert commentary: The overall low prevalence of actionable targets and tumor heterogeneity are the two main obstacles of precision medicine for gastric cancer.

  13. Diagnostic classification scheme in Iranian breast cancer patients using a decision tree.

    PubMed

    Malehi, Amal Saki

    2014-01-01

    The objective of this study was to determine a diagnostic classification scheme using a decision tree based model. The study was conducted as a retrospective case-control study in Imam Khomeini hospital in Tehran during 2001 to 2009. Data, including demographic and clinical-pathological characteristics, were uniformly collected from 624 females, 312 of them were referred with positive diagnosis of breast cancer (cases) and 312 healthy women (controls). The decision tree was implemented to develop a diagnostic classification scheme using CART 6.0 Software. The AUC (area under curve), was measured as the overall performance of diagnostic classification of the decision tree. Five variables as main risk factors of breast cancer and six subgroups as high risk were identified. The results indicated that increasing age, low age at menarche, single and divorced statues, irregular menarche pattern and family history of breast cancer are the important diagnostic factors in Iranian breast cancer patients. The sensitivity and specificity of the analysis were 66% and 86.9% respectively. The high AUC (0.82) also showed an excellent classification and diagnostic performance of the model. Decision tree based model appears to be suitable for identifying risk factors and high or low risk subgroups. It can also assists clinicians in making a decision, since it can identify underlying prognostic relationships and understanding the model is very explicit.

  14. The 7th lung cancer TNM classification and staging system: Review of the changes and implications.

    PubMed

    Mirsadraee, Saeed; Oswal, Dilip; Alizadeh, Yalda; Caulo, Andrea; van Beek, Edwin

    2012-04-28

    Lung cancer is the most common cause of death from cancer in males, accounting for more than 1.4 million deaths in 2008. It is a growing concern in China, Asia and Africa as well. Accurate staging of the disease is an important part of the management as it provides estimation of patient's prognosis and identifies treatment sterategies. It also helps to build a database for future staging projects. A major revision of lung cancer staging has been announced with effect from January 2010. The new classification is based on a larger surgical and non-surgical cohort of patients, and thus more accurate in terms of outcome prediction compared to the previous classification. There are several original papers regarding this new classification which give comprehensive description of the methodology, the changes in the staging and the statistical analysis. This overview is a simplified description of the changes in the new classification and their potential impact on patients' treatment and prognosis.

  15. Integrating Colon Cancer Microarray Data: Associating Locus-Specific Methylation Groups to Gene Expression-Based Classifications.

    PubMed

    Barat, Ana; Ruskin, Heather J; Byrne, Annette T; Prehn, Jochen H M

    2015-11-23

    Recently, considerable attention has been paid to gene expression-based classifications of colorectal cancers (CRC) and their association with patient prognosis. In addition to changes in gene expression, abnormal DNA-methylation is known to play an important role in cancer onset and development, and colon cancer is no exception to this rule. Large-scale technologies, such as methylation microarray assays and specific sequencing of methylated DNA, have been used to determine whole genome profiles of CpG island methylation in tissue samples. In this article, publicly available microarray-based gene expression and methylation data sets are used to characterize expression subtypes with respect to locus-specific methylation. A major objective was to determine whether integration of these data types improves previously characterized subtypes, or provides evidence for additional subtypes. We used unsupervised clustering techniques to determine methylation-based subgroups, which are subsequently annotated with three published expression-based classifications, comprising from three to six subtypes. Our results showed that, while methylation profiles provide a further basis for segregation of certain (Inflammatory and Goblet-like) finer-grained expression-based subtypes, they also suggest that other finer-grained subtypes are not distinctive and can be considered as a single subtype.

  16. Integrating Colon Cancer Microarray Data: Associating Locus-Specific Methylation Groups to Gene Expression-Based Classifications

    PubMed Central

    Barat, Ana; Ruskin, Heather J.; Byrne, Annette T.; Prehn, Jochen H. M.

    2015-01-01

    Recently, considerable attention has been paid to gene expression-based classifications of colorectal cancers (CRC) and their association with patient prognosis. In addition to changes in gene expression, abnormal DNA-methylation is known to play an important role in cancer onset and development, and colon cancer is no exception to this rule. Large-scale technologies, such as methylation microarray assays and specific sequencing of methylated DNA, have been used to determine whole genome profiles of CpG island methylation in tissue samples. In this article, publicly available microarray-based gene expression and methylation data sets are used to characterize expression subtypes with respect to locus-specific methylation. A major objective was to determine whether integration of these data types improves previously characterized subtypes, or provides evidence for additional subtypes. We used unsupervised clustering techniques to determine methylation-based subgroups, which are subsequently annotated with three published expression-based classifications, comprising from three to six subtypes. Our results showed that, while methylation profiles provide a further basis for segregation of certain (Inflammatory and Goblet-like) finer-grained expression-based subtypes, they also suggest that other finer-grained subtypes are not distinctive and can be considered as a single subtype. PMID:27600244

  17. Spectral-spatial classification using tensor modeling for cancer detection with hyperspectral imaging

    NASA Astrophysics Data System (ADS)

    Lu, Guolan; Halig, Luma; Wang, Dongsheng; Chen, Zhuo Georgia; Fei, Baowei

    2014-03-01

    As an emerging technology, hyperspectral imaging (HSI) combines both the chemical specificity of spectroscopy and the spatial resolution of imaging, which may provide a non-invasive tool for cancer detection and diagnosis. Early detection of malignant lesions could improve both survival and quality of life of cancer patients. In this paper, we introduce a tensor-based computation and modeling framework for the analysis of hyperspectral images to detect head and neck cancer. The proposed classification method can distinguish between malignant tissue and healthy tissue with an average sensitivity of 96.97% and an average specificity of 91.42% in tumor-bearing mice. The hyperspectral imaging and classification technology has been demonstrated in animal models and can have many potential applications in cancer research and management.

  18. Patch-based Convolutional Neural Network for Whole Slide Tissue Image Classification

    PubMed Central

    Hou, Le; Samaras, Dimitris; Kurc, Tahsin M.; Gao, Yi; Davis, James E.; Saltz, Joel H.

    2016-01-01

    Convolutional Neural Networks (CNN) are state-of-the-art models for many image classification tasks. However, to recognize cancer subtypes automatically, training a CNN on gigapixel resolution Whole Slide Tissue Images (WSI) is currently computationally impossible. The differentiation of cancer subtypes is based on cellular-level visual features observed on image patch scale. Therefore, we argue that in this situation, training a patch-level classifier on image patches will perform better than or similar to an image-level classifier. The challenge becomes how to intelligently combine patch-level classification results and model the fact that not all patches will be discriminative. We propose to train a decision fusion model to aggregate patch-level predictions given by patch-level CNNs, which to the best of our knowledge has not been shown before. Furthermore, we formulate a novel Expectation-Maximization (EM) based method that automatically locates discriminative patches robustly by utilizing the spatial relationships of patches. We apply our method to the classification of glioma and non-small-cell lung carcinoma cases into subtypes. The classification accuracy of our method is similar to the inter-observer agreement between pathologists. Although it is impossible to train CNNs on WSIs, we experimentally demonstrate using a comparable non-cancer dataset of smaller images that a patch-based CNN can outperform an image-based CNN. PMID:27795661

  19. Development and external validation of a risk-prediction model to predict 5-year overall survival in advanced larynx cancer.

    PubMed

    Petersen, Japke F; Stuiver, Martijn M; Timmermans, Adriana J; Chen, Amy; Zhang, Hongzhen; O'Neill, James P; Deady, Sandra; Vander Poorten, Vincent; Meulemans, Jeroen; Wennerberg, Johan; Skroder, Carl; Day, Andrew T; Koch, Wayne; van den Brekel, Michiel W M

    2018-05-01

    TNM-classification inadequately estimates patient-specific overall survival (OS). We aimed to improve this by developing a risk-prediction model for patients with advanced larynx cancer. Cohort study. We developed a risk prediction model to estimate the 5-year OS rate based on a cohort of 3,442 patients with T3T4N0N+M0 larynx cancer. The model was internally validated using bootstrapping samples and externally validated on patient data from five external centers (n = 770). The main outcome was performance of the model as tested by discrimination, calibration, and the ability to distinguish risk groups based on tertiles from the derivation dataset. The model performance was compared to a model based on T and N classification only. We included age, gender, T and N classification, and subsite as prognostic variables in the standard model. After external validation, the standard model had a significantly better fit than a model based on T and N classification alone (C statistic, 0.59 vs. 0.55, P < .001). The model was able to distinguish well among three risk groups based on tertiles of the risk score. Adding treatment modality to the model did not decrease the predictive power. As a post hoc analysis, we tested the added value of comorbidity as scored by American Society of Anesthesiologists score in a subsample, which increased the C statistic to 0.68. A risk prediction model for patients with advanced larynx cancer, consisting of readily available clinical variables, gives more accurate estimations of the estimated 5-year survival rate when compared to a model based on T and N classification alone. 2c. Laryngoscope, 128:1140-1145, 2018. © 2017 The American Laryngological, Rhinological and Otological Society, Inc.

  20. [Changes of 2015 WHO Histological Classification of Lung Cancer 
and the Clinical Significance].

    PubMed

    Yang, Xin; Lin, Dongmei

    2016-06-20

    Due in part to remarkable advances over the past decade in our understanding of lung cancer, particularly in area of medical oncology, molecular biology, and radiology, there is a pressing need for a revised classification, based not on pathology alone, but rather on an integrated multidisciplinary approach to classification of lung cancer. The 2015 World Health Organization (WHO) Classification of Tumors of the Lung, Pleura, Thymus and Heart has just been published with numerous important changes from the 2004 WHO classification. The revised classification has been greatly improved in helping advance the field, increasing the impact of research, improving patient care and assisting in predicting outcome. The most significant changes will be summarized in this paper as follows: (1) main changes of lung adenocarcinoma as proposed by the 2011 International Association for the Study of Lung Cancer/American Thoracic Society/European Respiratory Society (IASLC/ATS/ERS) classification, (2) reclassifying squamous cell carcinomas into keratinizing, nonkeratinizing, and basaloid subtypes with the nonkeratinizing tumors requiring immunohistochemistry proof of squamous differentiation, (3) restricting the diagnosis of large cell carcinoma only to resected tumors that lack any clear morphologic or immunohistochemical differentiation with reclassification of the remaining former large cell carcinoma subtypes into different categories, (4) grouping of neuroendocrine tumors together in one category, (5) and the current viewpoint of histologic grading of lung cancer.

  1. Aided diagnosis methods of breast cancer based on machine learning

    NASA Astrophysics Data System (ADS)

    Zhao, Yue; Wang, Nian; Cui, Xiaoyu

    2017-08-01

    In the field of medicine, quickly and accurately determining whether the patient is malignant or benign is the key to treatment. In this paper, K-Nearest Neighbor, Linear Discriminant Analysis, Logistic Regression were applied to predict the classification of thyroid,Her-2,PR,ER,Ki67,metastasis and lymph nodes in breast cancer, in order to recognize the benign and malignant breast tumors and achieve the purpose of aided diagnosis of breast cancer. The results showed that the highest classification accuracy of LDA was 88.56%, while the classification effect of KNN and Logistic Regression were better than that of LDA, the best accuracy reached 96.30%.

  2. Superpixel-based spectral classification for the detection of head and neck cancer with hyperspectral imaging

    NASA Astrophysics Data System (ADS)

    Chung, Hyunkoo; Lu, Guolan; Tian, Zhiqiang; Wang, Dongsheng; Chen, Zhuo Georgia; Fei, Baowei

    2016-03-01

    Hyperspectral imaging (HSI) is an emerging imaging modality for medical applications. HSI acquires two dimensional images at various wavelengths. The combination of both spectral and spatial information provides quantitative information for cancer detection and diagnosis. This paper proposes using superpixels, principal component analysis (PCA), and support vector machine (SVM) to distinguish regions of tumor from healthy tissue. The classification method uses 2 principal components decomposed from hyperspectral images and obtains an average sensitivity of 93% and an average specificity of 85% for 11 mice. The hyperspectral imaging technology and classification method can have various applications in cancer research and management.

  3. [Role of contemporary pathological diagnostics in the personalized treatment of cancer].

    PubMed

    Tímár, József

    2013-03-01

    Due to the developments of pathology in the past decades (immunohistochemistry and molecular pathology) classification of cancers changed fundamentally, laying a ground for personalized management of cancer patients. Our picture of cancer is more complex today, identifying the genetic basis of the morphological variants. On the other hand, this picture has a much higher resolution enabling us to subclassify similar histological cancer types based on molecular markers. This redefined classification of cancers helps us to better predict the possible biological behavior of the disease and/or the therapeutic sensitivity, opening the way toward a more personalized treatment of this disease. The redefined molecular classification of cancer may affect the universal application of treatment protocols. To achieve this goal molecular diagnostics must be an integral and reimbursed part of the routine pathological diagnostics. On the other hand, it is time to extend the multidisciplinary team with molecular pathologist to improve the decision making process of the management of cancer patients.

  4. A protein and mRNA expression-based classification of gastric cancer.

    PubMed

    Setia, Namrata; Agoston, Agoston T; Han, Hye S; Mullen, John T; Duda, Dan G; Clark, Jeffrey W; Deshpande, Vikram; Mino-Kenudson, Mari; Srivastava, Amitabh; Lennerz, Jochen K; Hong, Theodore S; Kwak, Eunice L; Lauwers, Gregory Y

    2016-07-01

    The overall survival of gastric carcinoma patients remains poor despite improved control over known risk factors and surveillance. This highlights the need for new classifications, driven towards identification of potential therapeutic targets. Using sophisticated molecular technologies and analysis, three groups recently provided genetic and epigenetic molecular classifications of gastric cancer (The Cancer Genome Atlas, 'Singapore-Duke' study, and Asian Cancer Research Group). Suggested by these classifications, here, we examined the expression of 14 biomarkers in a cohort of 146 gastric adenocarcinomas and performed unsupervised hierarchical clustering analysis using less expensive and widely available immunohistochemistry and in situ hybridization. Ultimately, we identified five groups of gastric cancers based on Epstein-Barr virus (EBV) positivity, microsatellite instability, aberrant E-cadherin, and p53 expression; the remaining cases constituted a group characterized by normal p53 expression. In addition, the five categories correspond to the reported molecular subgroups by virtue of clinicopathologic features. Furthermore, evaluation between these clusters and survival using the Cox proportional hazards model showed a trend for superior survival in the EBV and microsatellite-instable related adenocarcinomas. In conclusion, we offer as a proposal a simplified algorithm that is able to reproduce the recently proposed molecular subgroups of gastric adenocarcinoma, using immunohistochemical and in situ hybridization techniques.

  5. Improved Classification of Lung Cancer Using Radial Basis Function Neural Network with Affine Transforms of Voss Representation.

    PubMed

    Adetiba, Emmanuel; Olugbara, Oludayo O

    2015-01-01

    Lung cancer is one of the diseases responsible for a large number of cancer related death cases worldwide. The recommended standard for screening and early detection of lung cancer is the low dose computed tomography. However, many patients diagnosed die within one year, which makes it essential to find alternative approaches for screening and early detection of lung cancer. We present computational methods that can be implemented in a functional multi-genomic system for classification, screening and early detection of lung cancer victims. Samples of top ten biomarker genes previously reported to have the highest frequency of lung cancer mutations and sequences of normal biomarker genes were respectively collected from the COSMIC and NCBI databases to validate the computational methods. Experiments were performed based on the combinations of Z-curve and tetrahedron affine transforms, Histogram of Oriented Gradient (HOG), Multilayer perceptron and Gaussian Radial Basis Function (RBF) neural networks to obtain an appropriate combination of computational methods to achieve improved classification of lung cancer biomarker genes. Results show that a combination of affine transforms of Voss representation, HOG genomic features and Gaussian RBF neural network perceptibly improves classification accuracy, specificity and sensitivity of lung cancer biomarker genes as well as achieving low mean square error.

  6. Study design requirements for RNA sequencing-based breast cancer diagnostics.

    PubMed

    Mer, Arvind Singh; Klevebring, Daniel; Grönberg, Henrik; Rantalainen, Mattias

    2016-02-01

    Sequencing-based molecular characterization of tumors provides information required for individualized cancer treatment. There are well-defined molecular subtypes of breast cancer that provide improved prognostication compared to routine biomarkers. However, molecular subtyping is not yet implemented in routine breast cancer care. Clinical translation is dependent on subtype prediction models providing high sensitivity and specificity. In this study we evaluate sample size and RNA-sequencing read requirements for breast cancer subtyping to facilitate rational design of translational studies. We applied subsampling to ascertain the effect of training sample size and the number of RNA sequencing reads on classification accuracy of molecular subtype and routine biomarker prediction models (unsupervised and supervised). Subtype classification accuracy improved with increasing sample size up to N = 750 (accuracy = 0.93), although with a modest improvement beyond N = 350 (accuracy = 0.92). Prediction of routine biomarkers achieved accuracy of 0.94 (ER) and 0.92 (Her2) at N = 200. Subtype classification improved with RNA-sequencing library size up to 5 million reads. Development of molecular subtyping models for cancer diagnostics requires well-designed studies. Sample size and the number of RNA sequencing reads directly influence accuracy of molecular subtyping. Results in this study provide key information for rational design of translational studies aiming to bring sequencing-based diagnostics to the clinic.

  7. DeepGene: an advanced cancer type classifier based on deep learning and somatic point mutations.

    PubMed

    Yuan, Yuchen; Shi, Yi; Li, Changyang; Kim, Jinman; Cai, Weidong; Han, Zeguang; Feng, David Dagan

    2016-12-23

    With the developments of DNA sequencing technology, large amounts of sequencing data have become available in recent years and provide unprecedented opportunities for advanced association studies between somatic point mutations and cancer types/subtypes, which may contribute to more accurate somatic point mutation based cancer classification (SMCC). However in existing SMCC methods, issues like high data sparsity, small volume of sample size, and the application of simple linear classifiers, are major obstacles in improving the classification performance. To address the obstacles in existing SMCC studies, we propose DeepGene, an advanced deep neural network (DNN) based classifier, that consists of three steps: firstly, the clustered gene filtering (CGF) concentrates the gene data by mutation occurrence frequency, filtering out the majority of irrelevant genes; secondly, the indexed sparsity reduction (ISR) converts the gene data into indexes of its non-zero elements, thereby significantly suppressing the impact of data sparsity; finally, the data after CGF and ISR is fed into a DNN classifier, which extracts high-level features for accurate classification. Experimental results on our curated TCGA-DeepGene dataset, which is a reformulated subset of the TCGA dataset containing 12 selected types of cancer, show that CGF, ISR and DNN all contribute in improving the overall classification performance. We further compare DeepGene with three widely adopted classifiers and demonstrate that DeepGene has at least 24% performance improvement in terms of testing accuracy. Based on deep learning and somatic point mutation data, we devise DeepGene, an advanced cancer type classifier, which addresses the obstacles in existing SMCC studies. Experiments indicate that DeepGene outperforms three widely adopted existing classifiers, which is mainly attributed to its deep learning module that is able to extract the high level features between combinatorial somatic point mutations and cancer types.

  8. Classification of TP53 mutations and HPV predict survival in advanced larynx cancer.

    PubMed

    Scheel, Adam; Bellile, Emily; McHugh, Jonathan B; Walline, Heather M; Prince, Mark E; Urba, Susan; Wolf, Gregory T; Eisbruch, Avraham; Worden, Francis; Carey, Thomas E; Bradford, Carol

    2016-09-01

    Assess tumor suppressor p53 (TP53) functional mutations in the context of other biomarkers in advanced larynx cancer. Prospective analysis of pretreatment tumor TP53, human papillomavirus (HPV), Bcl-xL, and cyclin D1 status in stage III and IV larynx cancer patients in a clinical trial. TP53 exons 4 through 9 from 58 tumors were sequenced. Mutations were grouped using three classifications based on their expected function. Each functional group was analyzed for response to induction chemotherapy, time to surgery, survival, HPV status, p16INK4a, Bcl-xl, and cyclin D1 expression. TP53 mutations were found in 22 of 58 (37.9%) patients with advanced larynx cancer, including missense mutations in 13 of 58 (22.4%) patients, nonsense mutations in four of 58 (6.9%), and deletions in five of 58 (8.6%). High-risk HPV was found in 20 of 52 (38.5%) tumors. A classification based on Evolutionary Action score of p53 (EAp53) distinguished missense mutations with high risk for decreased survival from low-risk mutations (P = 0.0315). A model including this TP53 classification, HPV status, cyclin D1, and Bcl-xL staining significantly predicts survival (P = 0.0017). EAp53 functional classification of TP53 mutants and biomarkers predict survival in advanced larynx cancer. NA. Laryngoscope, 126:E292-E299, 2016. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.

  9. A comparison of blood vessel features and local binary patterns for colorectal polyp classification

    NASA Astrophysics Data System (ADS)

    Gross, Sebastian; Stehle, Thomas; Behrens, Alexander; Auer, Roland; Aach, Til; Winograd, Ron; Trautwein, Christian; Tischendorf, Jens

    2009-02-01

    Colorectal cancer is the third leading cause of cancer deaths in the United States of America for both women and men. By means of early detection, the five year survival rate can be up to 90%. Polyps can to be grouped into three different classes: hyperplastic, adenomatous, and carcinomatous polyps. Hyperplastic polyps are benign and are not likely to develop into cancer. Adenomas, on the other hand, are known to grow into cancer (adenoma-carcinoma sequence). Carcinomas are fully developed cancers and can be easily distinguished from adenomas and hyperplastic polyps. A recent narrow band imaging (NBI) study by Tischendorf et al. has shown that hyperplastic polyps and adenomas can be discriminated by their blood vessel structure. We designed a computer-aided system for the differentiation between hyperplastic and adenomatous polyps. Our development aim is to provide the medical practitioner with an additional objective interpretation of the available image data as well as a confidence measure for the classification. We propose classification features calculated on the basis of the extracted blood vessel structure. We use the combined length of the detected blood vessels, the average perimeter of the vessels and their average gray level value. We achieve a successful classification rate of more than 90% on 102 polyps from our polyp data base. The classification results based on these features are compared to the results of Local Binary Patterns (LBP). The results indicate that the implemented features are superior to LBP.

  10. A new hybrid method based on fuzzy-artificial immune system and k-nn algorithm for breast cancer diagnosis.

    PubMed

    Sahan, Seral; Polat, Kemal; Kodaz, Halife; Güneş, Salih

    2007-03-01

    The use of machine learning tools in medical diagnosis is increasing gradually. This is mainly because the effectiveness of classification and recognition systems has improved in a great deal to help medical experts in diagnosing diseases. Such a disease is breast cancer, which is a very common type of cancer among woman. As the incidence of this disease has increased significantly in the recent years, machine learning applications to this problem have also took a great attention as well as medical consideration. This study aims at diagnosing breast cancer with a new hybrid machine learning method. By hybridizing a fuzzy-artificial immune system with k-nearest neighbour algorithm, a method was obtained to solve this diagnosis problem via classifying Wisconsin Breast Cancer Dataset (WBCD). This data set is a very commonly used data set in the literature relating the use of classification systems for breast cancer diagnosis and it was used in this study to compare the classification performance of our proposed method with regard to other studies. We obtained a classification accuracy of 99.14%, which is the highest one reached so far. The classification accuracy was obtained via 10-fold cross validation. This result is for WBCD but it states that this method can be used confidently for other breast cancer diagnosis problems, too.

  11. Prognostication in eye cancer: the latest tumor, node, metastasis classification and beyond

    PubMed Central

    Kivelä, T; Kujala, E

    2013-01-01

    The tumour, node, metastasis (TNM) classification is a universal cancer staging system, which has been used for five decades. The current seventh edition became effective in 2010 and covers six ophthalmic sites: eyelids, conjunctiva, uvea, retina, orbit, and lacrimal gland; and five cancer types: carcinoma, sarcoma, melanoma, retinoblastoma, and lymphoma. The TNM categories are based on the anatomic extent of the primary tumour (T), regional lymph node metastases (N), and systemic metastases (M). The T categories of ophthalmic cancers are based on the size of the primary tumour and any invasion of periocular structures. The anatomic category is used to determine the TNM stage that correlates with survival. Such staging is currently implemented only for carcinoma of the eyelid and melanoma of the uvea. The classification of ciliary body and choroidal melanoma is the only one based on clinical evidence so far: a database of 7369 patients analysed by the European Ophthalmic Oncology Group. It spans a prognosis from 96% 5-year survival for stage I to 97% 5-year mortality for stage IV. The most accurate criterion for prognostication in uveal melanoma is, however, analysis of chromosomal alterations and gene expression. When such data are available, the TNM stage may be used for further stratification. Prognosis in retinoblastoma is frequently assigned by using an international classification, which predicts conservation of the eye and vision, and an international staging separate from the TNM system, which predicts survival. The TNM cancer staging manual is a useful tool for all ophthalmologists managing eye cancer. PMID:23258307

  12. A Global Covariance Descriptor for Nuclear Atypia Scoring in Breast Histopathology Images.

    PubMed

    Khan, Adnan Mujahid; Sirinukunwattana, Korsuk; Rajpoot, Nasir

    2015-09-01

    Nuclear atypia scoring is a diagnostic measure commonly used to assess tumor grade of various cancers, including breast cancer. It provides a quantitative measure of deviation in visual appearance of cell nuclei from those in normal epithelial cells. In this paper, we present a novel image-level descriptor for nuclear atypia scoring in breast cancer histopathology images. The method is based on the region covariance descriptor that has recently become a popular method in various computer vision applications. The descriptor in its original form is not suitable for classification of histopathology images as cancerous histopathology images tend to possess diversely heterogeneous regions in a single field of view. Our proposed image-level descriptor, which we term as the geodesic mean of region covariance descriptors, possesses all the attractive properties of covariance descriptors lending itself to tractable geodesic-distance-based k-nearest neighbor classification using efficient kernels. The experimental results suggest that the proposed image descriptor yields high classification accuracy compared to a variety of widely used image-level descriptors.

  13. Sequence variant classification and reporting: recommendations for improving the interpretation of cancer susceptibility genetic test results

    PubMed Central

    Plon, Sharon E.; Eccles, Diana M.; Easton, Douglas; Foulkes, William D.; Genuardi, Maurizio; Greenblatt, Marc S.; Hogervorst, Frans B.L.; Hoogerbrugge, Nicoline; Spurdle, Amanda B.; Tavtigian, Sean

    2011-01-01

    Genetic testing of cancer susceptibility genes is now widely applied in clinical practice to predict risk of developing cancer. In general, sequence-based testing of germline DNA is used to determine whether an individual carries a change that is clearly likely to disrupt normal gene function. Genetic testing may detect changes that are clearly pathogenic, clearly neutral or variants of unclear clinical significance. Such variants present a considerable challenge to the diagnostic laboratory and the receiving clinician in terms of interpretation and clear presentation of the implications of the result to the patient. There does not appear to be a consistent approach to interpreting and reporting the clinical significance of variants either among genes or among laboratories. The potential for confusion among clinicians and patients is considerable and misinterpretation may lead to inappropriate clinical consequences. In this article we review the current state of sequence-based genetic testing, describe other standardized reporting systems used in oncology and propose a standardized classification system for application to sequence based results for cancer predisposition genes. We suggest a system of five classes of variants based on the degree of likelihood of pathogenicity. Each class is associated with specific recommendations for clinical management of at-risk relatives that will depend on the syndrome. We propose that panels of experts on each cancer predisposition syndrome facilitate the classification scheme and designate appropriate surveillance and cancer management guidelines. The international adoption of a standardized reporting system should improve the clinical utility of sequence-based genetic tests to predict cancer risk. PMID:18951446

  14. Deep convolutional neural networks for automatic classification of gastric carcinoma using whole slide images in digital histopathology.

    PubMed

    Sharma, Harshita; Zerbe, Norman; Klempert, Iris; Hellwich, Olaf; Hufnagl, Peter

    2017-11-01

    Deep learning using convolutional neural networks is an actively emerging field in histological image analysis. This study explores deep learning methods for computer-aided classification in H&E stained histopathological whole slide images of gastric carcinoma. An introductory convolutional neural network architecture is proposed for two computerized applications, namely, cancer classification based on immunohistochemical response and necrosis detection based on the existence of tumor necrosis in the tissue. Classification performance of the developed deep learning approach is quantitatively compared with traditional image analysis methods in digital histopathology requiring prior computation of handcrafted features, such as statistical measures using gray level co-occurrence matrix, Gabor filter-bank responses, LBP histograms, gray histograms, HSV histograms and RGB histograms, followed by random forest machine learning. Additionally, the widely known AlexNet deep convolutional framework is comparatively analyzed for the corresponding classification problems. The proposed convolutional neural network architecture reports favorable results, with an overall classification accuracy of 0.6990 for cancer classification and 0.8144 for necrosis detection. Copyright © 2017 Elsevier Ltd. All rights reserved.

  15. Cervical cancer survival prediction using hybrid of SMOTE, CART and smooth support vector machine

    NASA Astrophysics Data System (ADS)

    Purnami, S. W.; Khasanah, P. M.; Sumartini, S. H.; Chosuvivatwong, V.; Sriplung, H.

    2016-04-01

    According to the WHO, every two minutes there is one patient who died from cervical cancer. The high mortality rate is due to the lack of awareness of women for early detection. There are several factors that supposedly influence the survival of cervical cancer patients, including age, anemia status, stage, type of treatment, complications and secondary disease. This study wants to classify/predict cervical cancer survival based on those factors. Various classifications methods: classification and regression tree (CART), smooth support vector machine (SSVM), three order spline SSVM (TSSVM) were used. Since the data of cervical cancer are imbalanced, synthetic minority oversampling technique (SMOTE) is used for handling imbalanced dataset. Performances of these methods are evaluated using accuracy, sensitivity and specificity. Results of this study show that balancing data using SMOTE as preprocessing can improve performance of classification. The SMOTE-SSVM method provided better result than SMOTE-TSSVM and SMOTE-CART.

  16. Multiclass cancer diagnosis using tumor gene expression signatures

    DOE PAGES

    Ramaswamy, S.; Tamayo, P.; Rifkin, R.; ...

    2001-12-11

    The optimal treatment of patients with cancer depends on establishing accurate diagnoses by using a complex combination of clinical and histopathological data. In some instances, this task is difficult or impossible because of atypical clinical presentation or histopathology. To determine whether the diagnosis of multiple common adult malignancies could be achieved purely by molecular classification, we subjected 218 tumor samples, spanning 14 common tumor types, and 90 normal tissue samples to oligonucleotide microarray gene expression analysis. The expression levels of 16,063 genes and expressed sequence tags were used to evaluate the accuracy of a multiclass classifier based on a supportmore » vector machine algorithm. Overall classification accuracy was 78%, far exceeding the accuracy of random classification (9%). Poorly differentiated cancers resulted in low-confidence predictions and could not be accurately classified according to their tissue of origin, indicating that they are molecularly distinct entities with dramatically different gene expression patterns compared with their well differentiated counterparts. Taken together, these results demonstrate the feasibility of accurate, multiclass molecular cancer classification and suggest a strategy for future clinical implementation of molecular cancer diagnostics.« less

  17. Combining multiple decisions: applications to bioinformatics

    NASA Astrophysics Data System (ADS)

    Yukinawa, N.; Takenouchi, T.; Oba, S.; Ishii, S.

    2008-01-01

    Multi-class classification is one of the fundamental tasks in bioinformatics and typically arises in cancer diagnosis studies by gene expression profiling. This article reviews two recent approaches to multi-class classification by combining multiple binary classifiers, which are formulated based on a unified framework of error-correcting output coding (ECOC). The first approach is to construct a multi-class classifier in which each binary classifier to be aggregated has a weight value to be optimally tuned based on the observed data. In the second approach, misclassification of each binary classifier is formulated as a bit inversion error with a probabilistic model by making an analogy to the context of information transmission theory. Experimental studies using various real-world datasets including cancer classification problems reveal that both of the new methods are superior or comparable to other multi-class classification methods.

  18. Cancer of the esophagus and esophagogastric junction: data-driven staging for the seventh edition of the American Joint Committee on Cancer/International Union Against Cancer Cancer Staging Manuals.

    PubMed

    Rice, Thomas W; Rusch, Valerie W; Ishwaran, Hemant; Blackstone, Eugene H

    2010-08-15

    Previous American Joint Committee on Cancer/International Union Against Cancer (AJCC/UICC) stage groupings for esophageal cancer have not been data driven or harmonized with stomach cancer. At the request of the AJCC, worldwide data from 3 continents were assembled to develop data-driven, harmonized esophageal staging for the seventh edition of the AJCC/UICC cancer staging manuals. All-cause mortality among 4627 patients with esophageal and esophagogastric junction cancer who underwent surgery alone (no preoperative or postoperative adjuvant therapy) was analyzed by using novel random forest methodology to produce stage groups for which survival was monotonically decreasing, distinctive, and homogeneous. For lymph node-negative pN0M0 cancers, risk-adjusted 5-year survival was dominated by pathologic tumor classification (pT) but was modulated by histopathologic cell type, histologic grade, and location. For lymph node-positive, pN+M0 cancers, the number of cancer-positive lymph nodes (a new pN classification) dominated survival. Resulting stage groupings departed from a simple, logical arrangement of TNM. Stage groupings for stage I and II adenocarcinoma were based on pT, pN, and histologic grade; and groupings for squamous cell carcinoma were based on pT, pN, histologic grade, and location. Stage III was similar for histopathologic cell types and was based only on pT and pN. Stage 0 and stage IV, by definition, were categorized as tumor in situ (Tis) (high-grade dysplasia) and pM1, respectively. The prognosis for patients with esophageal and esophagogastric junction cancer depends on the complex interplay of TNM classifications as well as nonanatomic factors, including histopathologic cell type, histologic grade, and cancer location. These features were incorporated into a data-driven staging of these cancers for the seventh edition of the AJCC/UICC cancer staging manuals. Copyright (c) 2010 American Cancer Society.

  19. Computer-aided detection of early cancer in the esophagus using HD endoscopy images

    NASA Astrophysics Data System (ADS)

    van der Sommen, Fons; Zinger, Svitlana; Schoon, Erik J.; de With, Peter H. N.

    2013-02-01

    Esophageal cancer is the fastest rising type of cancer in the Western world. The recent development of High-Definition (HD) endoscopy has enabled the specialist physician to identify cancer at an early stage. Nevertheless, it still requires considerable effort and training to be able to recognize these irregularities associated with early cancer. As a first step towards a Computer-Aided Detection (CAD) system that supports the physician in finding these early stages of cancer, we propose an algorithm that is able to identify irregularities in the esophagus automatically, based on HD endoscopic images. The concept employs tile-based processing, so our system is not only able to identify that an endoscopic image contains early cancer, but it can also locate it. The identification is based on the following steps: (1) preprocessing, (2) feature extraction with dimensionality reduction, (3) classification. We evaluate the detection performance in RGB, HSI and YCbCr color space using the Color Histogram (CH) and Gabor features and we compare with other well-known features to describe texture. For classification, we employ a Support Vector Machine (SVM) and evaluate its performance using different parameters and kernel functions. In experiments, our system achieves a classification accuracy of 95.9% on 50×50 pixel tiles of tumorous and normal tissue and reaches an Area Under the Curve (AUC) of 0.990. In 22 clinical examples our algorithm was able to identify all (pre-)cancerous regions and annotate those regions reasonably well. The experimental and clinical validation are considered promising for a CAD system that supports the physician in finding early stage cancer.

  20. Perspectives on current tumor-node-metastasis (TNM) staging of cancers of the colon and rectum.

    PubMed

    Hu, Huankai; Krasinskas, Alyssa; Willis, Joseph

    2011-08-01

    Improvements in classifications of cancers based on discovery and validation of important histopathological parameters and new molecular markers continue unabated. Though still not perfect, recent updates of classification schemes in gastrointestinal oncology by the American Joint Commission on Cancer (tumor-node-metastasis [TNM] staging) and the World Health Organization further stratify patients and guide optimization of treatment strategies and better predict patient outcomes. These updates recognize the heterogeneity of patient populations with significant subgrouping of each tumor stage and use of tumor deposits to significantly "up-stage" some cancers; change staging parameters for subsets of IIIB and IIIC cancers; and introduce of several new subtypes of colon carcinomas. By the nature of the process, recent discoveries that are important to improving even routine standards of patient care, especially new advances in molecular medicine, are not incorporated into these systems. Nonetheless, these classifications significantly advance clinical standards and are welcome enhancements to our current methods of cancer reporting. Copyright © 2011 Elsevier Inc. All rights reserved.

  1. Histological, molecular and functional subtypes of breast cancers

    PubMed Central

    Malhotra, Gautam K; Zhao, Xiangshan; Band, Hamid

    2010-01-01

    Increased understanding of the molecular heterogeneity that is intrinsic to the various subtypes of breast cancer will likely shape the future of breast cancer diagnosis, prognosis and treatment. Advances in the field over the last several decades have been remarkable and have clearly translated into better patient care as evidenced by the earlier detection, better prognosis and new targeted therapies. There have been two recent advances in the breast cancer research field that have lead to paradigm shifts: first, the identification of intrinsic breast tumor subtypes, which has changed the way we think about breast cancer and second, the recent characterization of cancer stem cells (CSCs), which are suspected to be responsible for tumor initiation, recurrence and resistance to therapy. These findings have opened new exciting avenues to think about breast cancer therapeutic strategies. While these advances constitute major paradigm shifts within the research realm, the clinical arena has yet to adopt and apply our understanding of the molecular basis of the disease to early diagnosis, prognosis and therapy of breast cancers. Here, we will review the current clinical approach to classification of breast cancers, newer molecular-based classification schemes and potential future of biomarkers representing a functional classification of breast cancer. PMID:21057215

  2. Association between gastric cancer and the Kyoto classification of gastritis.

    PubMed

    Shichijo, Satoki; Hirata, Yoshihiro; Niikura, Ryota; Hayakawa, Yoku; Yamada, Atsuo; Koike, Kazuhiko

    2017-09-01

    Histological gastritis is associated with gastric cancer, but its diagnosis requires biopsy. Many classifications of endoscopic gastritis are available, but not all are useful for risk stratification of gastric cancer. The Kyoto Classification of Gastritis was proposed at the 85th Congress of the Japan Gastroenterological Endoscopy Society. This cross-sectional study evaluated the usefulness of the Kyoto Classification of Gastritis for risk stratification of gastric cancer. From August 2013 to September 2014, esophagogastroduodenoscopy was performed and the gastric findings evaluated according to the Kyoto Classification of Gastritis in a total of 4062 patients. The following five endoscopic findings were selected based on previous reports: atrophy, intestinal metaplasia, enlarged folds, nodularity, and diffuse redness. A total of 3392 patients (1746 [51%] men and 1646 [49%] women) were analyzed. Among them, 107 gastric cancers were diagnosed. Atrophy was found in 2585 (78%) and intestinal metaplasia in 924 (27%). Enlarged folds, nodularity, and diffuse redness were found in 197 (5.8%), 22 (0.6%), and 573 (17%), respectively. In univariate analyses, the severity of atrophy, intestinal metaplasia, diffuse redness, age, and male sex were associated with gastric cancer. In a multivariate analysis, atrophy and male sex were found to be independent risk factors. Younger age and severe atrophy were determined to be associated with diffuse-type gastric cancer. Endoscopic detection of atrophy was associated with the risk of gastric cancer. Thus, patients with severe atrophy should be examined carefully and may require intensive follow-up. © 2017 Journal of Gastroenterology and Hepatology Foundation and John Wiley & Sons Australia, Ltd.

  3. Cell type classifiers for breast cancer microscopic images based on fractal dimension texture analysis of image color layers.

    PubMed

    Jitaree, Sirinapa; Phinyomark, Angkoon; Boonyaphiphat, Pleumjit; Phukpattaranont, Pornchai

    2015-01-01

    Having a classifier of cell types in a breast cancer microscopic image (BCMI), obtained with immunohistochemical staining, is required as part of a computer-aided system that counts the cancer cells in such BCMI. Such quantitation by cell counting is very useful in supporting decisions and planning of the medical treatment of breast cancer. This study proposes and evaluates features based on texture analysis by fractal dimension (FD), for the classification of histological structures in a BCMI into either cancer cells or non-cancer cells. The cancer cells include positive cells (PC) and negative cells (NC), while the normal cells comprise stromal cells (SC) and lymphocyte cells (LC). The FD feature values were calculated with the box-counting method from binarized images, obtained by automatic thresholding with Otsu's method of the grayscale images for various color channels. A total of 12 color channels from four color spaces (RGB, CIE-L*a*b*, HSV, and YCbCr) were investigated, and the FD feature values from them were used with decision tree classifiers. The BCMI data consisted of 1,400, 1,200, and 800 images with pixel resolutions 128 × 128, 192 × 192, and 256 × 256, respectively. The best cross-validated classification accuracy was 93.87%, for distinguishing between cancer and non-cancer cells, obtained using the Cr color channel with window size 256. The results indicate that the proposed algorithm, based on fractal dimension features extracted from a color channel, performs well in the automatic classification of the histology in a BCMI. This might support accurate automatic cell counting in a computer-assisted system for breast cancer diagnosis. © Wiley Periodicals, Inc.

  4. Thermogram breast cancer prediction approach based on Neutrosophic sets and fuzzy c-means algorithm.

    PubMed

    Gaber, Tarek; Ismail, Gehad; Anter, Ahmed; Soliman, Mona; Ali, Mona; Semary, Noura; Hassanien, Aboul Ella; Snasel, Vaclav

    2015-08-01

    The early detection of breast cancer makes many women survive. In this paper, a CAD system classifying breast cancer thermograms to normal and abnormal is proposed. This approach consists of two main phases: automatic segmentation and classification. For the former phase, an improved segmentation approach based on both Neutrosophic sets (NS) and optimized Fast Fuzzy c-mean (F-FCM) algorithm was proposed. Also, post-segmentation process was suggested to segment breast parenchyma (i.e. ROI) from thermogram images. For the classification, different kernel functions of the Support Vector Machine (SVM) were used to classify breast parenchyma into normal or abnormal cases. Using benchmark database, the proposed CAD system was evaluated based on precision, recall, and accuracy as well as a comparison with related work. The experimental results showed that our system would be a very promising step toward automatic diagnosis of breast cancer using thermograms as the accuracy reached 100%.

  5. Dual-modal cancer detection based on optical pH sensing and Raman spectroscopy

    NASA Astrophysics Data System (ADS)

    Kim, Soogeun; Lee, Seung Ho; Min, Sun Young; Byun, Kyung Min; Lee, Soo Yeol

    2017-10-01

    A dual-modal approach using Raman spectroscopy and optical pH sensing was investigated to discriminate between normal and cancerous tissues. Raman spectroscopy has demonstrated the potential for in vivo cancer detection. However, Raman spectroscopy has suffered from strong fluorescence background of biological samples and subtle spectral differences between normal and disease tissues. To overcome those issues, pH sensing is adopted to Raman spectroscopy as a dual-modal approach. Based on the fact that the pH level in cancerous tissues is lower than that in normal tissues due to insufficient vasculature formation, the dual-modal approach combining the chemical information of Raman spectrum and the metabolic information of pH level can improve the specificity of cancer diagnosis. From human breast tissue samples, Raman spectra and pH levels are measured using fiber-optic-based Raman and pH probes, respectively. The pH sensing is based on the dependence of pH level on optical transmission spectrum. Multivariate statistical analysis is performed to evaluate the classification capability of the dual-modal method. The analytical results show that the dual-modal method based on Raman spectroscopy and optical pH sensing can improve the performance of cancer classification.

  6. Evaluation of a Web-Based App Demonstrating an Exclusionary Algorithmic Approach to TNM Cancer Staging

    PubMed Central

    2015-01-01

    Background TNM staging plays a critical role in the evaluation and management of a range of different types of cancers. The conventional combinatorial approach to the determination of an anatomic stage relies on the identification of distinct tumor (T), node (N), and metastasis (M) classifications to generate a TNM grouping. This process is inherently inefficient due to the need for scrupulous review of the criteria specified for each classification to ensure accurate assignment. An exclusionary approach to TNM staging based on sequential constraint of options may serve to minimize the number of classifications that need to be reviewed to accurately determine an anatomic stage. Objective Our aim was to evaluate the usability and utility of a Web-based app configured to demonstrate an exclusionary approach to TNM staging. Methods Internal medicine residents, surgery residents, and oncology fellows engaged in clinical training were asked to evaluate a Web-based app developed as an instructional aid incorporating (1) an exclusionary algorithm that polls tabulated classifications and sorts them into ranked order based on frequency counts, (2) reconfiguration of classification criteria to generate disambiguated yes/no questions that function as selection and exclusion prompts, and (3) a selectable grid of TNM groupings that provides dynamic graphic demonstration of the effects of sequentially selecting or excluding specific classifications. Subjects were asked to evaluate the performance of this app after completing exercises simulating the staging of different types of cancers encountered during training. Results Survey responses indicated high levels of agreement with statements supporting the usability and utility of this app. Subjects reported that its user interface provided a clear display with intuitive controls and that the exclusionary approach to TNM staging it demonstrated represented an efficient process of assignment that helped to clarify distinctions between tumor, node, and metastasis classifications. High overall usefulness ratings were bolstered by supplementary comments suggesting that this app might be readily adopted for use in clinical practice. Conclusions A Web-based app that utilizes an exclusionary algorithm to prompt the assignment of tumor, node, and metastasis classifications may serve as an effective instructional aid demonstrating an efficient and informative approach to TNM staging. PMID:28410163

  7. New Image-Based Techniques for Prostate Biopsy and Treatment

    DTIC Science & Technology

    2012-04-01

    C-arm fluoroscopy, MICCAI 2011, Toronto, Canada, 2011. 4) Poster Presentation: Prostate Cancer Probability Estimation Based on DCE- DTI Features...and P. Kozlowski, “Prostate Cancer Probability Estimation Based on DCE- DTI Features and Support Vector Machine Classification,” Annual Meeting of... DTI ), which characterize the de-phasing of the MR signal caused by molecular diffusion. Prostate cancer causes a pathological change in the tissue

  8. Identifying Nursing Interventions in a Cancer Screening Program Using Nursing Interventions Classification Taxonomy.

    PubMed

    Benito, Llucia; Lluch, María Teresa; Falcó, Anna Marta; García, Montse; Puig, Montse

    2017-04-01

    This study aimed to investigate which Nursing Interventions Classification (NIC) labels correspond to specific nursing interventions provided during cancer screening to establish a nursing documentation system. This descriptive study was conducted to identify and classify the interventions that cancer screening nurses perform based on an initial list. The initial list was grouped into 15 interventions that corresponded to four domains and eight classes. The study found expert consensus regarding the duties of cancer screening nurses and identified 15 interventions that should be implemented in clinical practice for cancer screening care, according to the NIC taxonomy. This study is the first step in developing indicators to assess nursing performance in cancer screening, and it helps to establish the core competency requirements for cancer screening nurses. © 2015 NANDA International, Inc.

  9. Fluorescent marker-based and marker-free discrimination between healthy and cancerous human tissues using hyper-spectral imaging

    NASA Astrophysics Data System (ADS)

    Arnold, Thomas; De Biasio, Martin; Leitner, Raimund

    2015-06-01

    Two problems are addressed in this paper (i) the fluorescent marker-based and the (ii) marker-free discrimination between healthy and cancerous human tissues. For both applications the performance of hyper-spectral methods are quantified. Fluorescent marker-based tissue classification uses a number of fluorescent markers to dye specific parts of a human cell. The challenge is that the emission spectra of the fluorescent dyes overlap considerably. They are, furthermore disturbed by the inherent auto-fluorescence of human tissue. This results in ambiguities and decreased image contrast causing difficulties for the treatment decision. The higher spectral resolution introduced by tunable-filter-based spectral imaging in combination with spectral unmixing techniques results in an improvement of the image contrast and therefore more reliable information for the physician to choose the treatment decision. Marker-free tissue classification is based solely on the subtle spectral features of human tissue without the use of artificial markers. The challenge in this case is that the spectral differences between healthy and cancerous tissues are subtle and embedded in intra- and inter-patient variations of these features. The contributions of this paper are (i) the evaluation of hyper-spectral imaging in combination with spectral unmixing techniques for fluorescence marker-based tissue classification, (ii) the evaluation of spectral imaging for marker-free intra surgery tissue classification. Within this paper, we consider real hyper-spectral fluorescence and endoscopy data sets to emphasize the practical capability of the proposed methods. It is shown that the combination of spectral imaging with multivariate statistical methods can improve the sensitivity and specificity of the detection and the staging of cancerous tissues compared to standard procedures.

  10. Classification of ground glass opacity lesion characteristic based on texture feature using lung CT image

    NASA Astrophysics Data System (ADS)

    Sebatubun, M. M.; Haryawan, C.; Windarta, B.

    2018-03-01

    Lung cancer causes a high mortality rate in the world than any other cancers. That can be minimised if the symptoms and cancer cells have been detected early. One of the techniques used to detect lung cancer is by computed tomography (CT) scan. CT scan images have been used in this study to identify one of the lesion characteristics named ground glass opacity (GGO). It has been used to determine the level of malignancy of the lesion. There were three phases in identifying GGO: image cropping, feature extraction using grey level co-occurrence matrices (GLCM) and classification using Naïve Bayes Classifier. In order to improve the classification results, the most significant feature was sought by feature selection using gain ratio evaluation. Based on the results obtained, the most significant features could be identified by using feature selection method used in this research. The accuracy rate increased from 83.33% to 91.67%, the sensitivity from 82.35% to 94.11% and the specificity from 84.21% to 89.47%.

  11. Training sample selection based on self-training for liver cirrhosis classification using ultrasound images

    NASA Astrophysics Data System (ADS)

    Fujita, Yusuke; Mitani, Yoshihiro; Hamamoto, Yoshihiko; Segawa, Makoto; Terai, Shuji; Sakaida, Isao

    2017-03-01

    Ultrasound imaging is a popular and non-invasive tool used in the diagnoses of liver disease. Cirrhosis is a chronic liver disease and it can advance to liver cancer. Early detection and appropriate treatment are crucial to prevent liver cancer. However, ultrasound image analysis is very challenging, because of the low signal-to-noise ratio of ultrasound images. To achieve the higher classification performance, selection of training regions of interest (ROIs) is very important that effect to classification accuracy. The purpose of our study is cirrhosis detection with high accuracy using liver ultrasound images. In our previous works, training ROI selection by MILBoost and multiple-ROI classification based on the product rule had been proposed, to achieve high classification performance. In this article, we propose self-training method to select training ROIs effectively. Evaluation experiments were performed to evaluate effect of self-training, using manually selected ROIs and also automatically selected ROIs. Experimental results show that self-training for manually selected ROIs achieved higher classification performance than other approaches, including our conventional methods. The manually ROI definition and sample selection are important to improve classification accuracy in cirrhosis detection using ultrasound images.

  12. Validation of the Lung Subtyping Panel in Multiple Fresh-Frozen and Formalin-Fixed, Paraffin-Embedded Lung Tumor Gene Expression Data Sets.

    PubMed

    Faruki, Hawazin; Mayhew, Gregory M; Fan, Cheng; Wilkerson, Matthew D; Parker, Scott; Kam-Morgan, Lauren; Eisenberg, Marcia; Horten, Bruce; Hayes, D Neil; Perou, Charles M; Lai-Goldman, Myla

    2016-06-01

    Context .- A histologic classification of lung cancer subtypes is essential in guiding therapeutic management. Objective .- To complement morphology-based classification of lung tumors, a previously developed lung subtyping panel (LSP) of 57 genes was tested using multiple public fresh-frozen gene-expression data sets and a prospectively collected set of formalin-fixed, paraffin-embedded lung tumor samples. Design .- The LSP gene-expression signature was evaluated in multiple lung cancer gene-expression data sets totaling 2177 patients collected from 4 platforms: Illumina RNAseq (San Diego, California), Agilent (Santa Clara, California) and Affymetrix (Santa Clara) microarrays, and quantitative reverse transcription-polymerase chain reaction. Gene centroids were calculated for each of 3 genomic-defined subtypes: adenocarcinoma, squamous cell carcinoma, and neuroendocrine, the latter of which encompassed both small cell carcinoma and carcinoid. Classification by LSP into 3 subtypes was evaluated in both fresh-frozen and formalin-fixed, paraffin-embedded tumor samples, and agreement with the original morphology-based diagnosis was determined. Results .- The LSP-based classifications demonstrated overall agreement with the original clinical diagnosis ranging from 78% (251 of 322) to 91% (492 of 538 and 869 of 951) in the fresh-frozen public data sets and 84% (65 of 77) in the formalin-fixed, paraffin-embedded data set. The LSP performance was independent of tissue-preservation method and gene-expression platform. Secondary, blinded pathology review of formalin-fixed, paraffin-embedded samples demonstrated concordance of 82% (63 of 77) with the original morphology diagnosis. Conclusions .- The LSP gene-expression signature is a reproducible and objective method for classifying lung tumors and demonstrates good concordance with morphology-based classification across multiple data sets. The LSP panel can supplement morphologic assessment of lung cancers, particularly when classification by standard methods is challenging.

  13. Comparison of hand-craft feature based SVM and CNN based deep learning framework for automatic polyp classification.

    PubMed

    Younghak Shin; Balasingham, Ilangko

    2017-07-01

    Colonoscopy is a standard method for screening polyps by highly trained physicians. Miss-detected polyps in colonoscopy are potential risk factor for colorectal cancer. In this study, we investigate an automatic polyp classification framework. We aim to compare two different approaches named hand-craft feature method and convolutional neural network (CNN) based deep learning method. Combined shape and color features are used for hand craft feature extraction and support vector machine (SVM) method is adopted for classification. For CNN approach, three convolution and pooling based deep learning framework is used for classification purpose. The proposed framework is evaluated using three public polyp databases. From the experimental results, we have shown that the CNN based deep learning framework shows better classification performance than the hand-craft feature based methods. It achieves over 90% of classification accuracy, sensitivity, specificity and precision.

  14. Normed kernel function-based fuzzy possibilistic C-means (NKFPCM) algorithm for high-dimensional breast cancer database classification with feature selection is based on Laplacian Score

    NASA Astrophysics Data System (ADS)

    Lestari, A. W.; Rustam, Z.

    2017-07-01

    In the last decade, breast cancer has become the focus of world attention as this disease is one of the primary leading cause of death for women. Therefore, it is necessary to have the correct precautions and treatment. In previous studies, Fuzzy Kennel K-Medoid algorithm has been used for multi-class data. This paper proposes an algorithm to classify the high dimensional data of breast cancer using Fuzzy Possibilistic C-means (FPCM) and a new method based on clustering analysis using Normed Kernel Function-Based Fuzzy Possibilistic C-Means (NKFPCM). The objective of this paper is to obtain the best accuracy in classification of breast cancer data. In order to improve the accuracy of the two methods, the features candidates are evaluated using feature selection, where Laplacian Score is used. The results show the comparison accuracy and running time of FPCM and NKFPCM with and without feature selection.

  15. MR Imaging Based Treatment Planning for Radiotherapy of Prostate Cancer

    DTIC Science & Technology

    2008-02-01

    Radiotherapy, MR-based treatment planning, dosimetry, Monte Carlo dose verification, Prostate Cancer, MRI -based DRRs 16. SECURITY CLASSIFICATION...AcQPlan system Version 5 was used for the study , which is capable of performing dose calculation on both CT and MRI . A four field 3D conformal planning...prostate motion studies for 3DCRT and IMRT of prostate cancer; (2) to investigate and improve the accuracy of MRI -based treatment planning dose calculation

  16. Inference of combinatorial Boolean rules of synergistic gene sets from cancer microarray datasets.

    PubMed

    Park, Inho; Lee, Kwang H; Lee, Doheon

    2010-06-15

    Gene set analysis has become an important tool for the functional interpretation of high-throughput gene expression datasets. Moreover, pattern analyses based on inferred gene set activities of individual samples have shown the ability to identify more robust disease signatures than individual gene-based pattern analyses. Although a number of approaches have been proposed for gene set-based pattern analysis, the combinatorial influence of deregulated gene sets on disease phenotype classification has not been studied sufficiently. We propose a new approach for inferring combinatorial Boolean rules of gene sets for a better understanding of cancer transcriptome and cancer classification. To reduce the search space of the possible Boolean rules, we identify small groups of gene sets that synergistically contribute to the classification of samples into their corresponding phenotypic groups (such as normal and cancer). We then measure the significance of the candidate Boolean rules derived from each group of gene sets; the level of significance is based on the class entropy of the samples selected in accordance with the rules. By applying the present approach to publicly available prostate cancer datasets, we identified 72 significant Boolean rules. Finally, we discuss several identified Boolean rules, such as the rule of glutathione metabolism (down) and prostaglandin synthesis regulation (down), which are consistent with known prostate cancer biology. Scripts written in Python and R are available at http://biosoft.kaist.ac.kr/~ihpark/. The refined gene sets and the full list of the identified Boolean rules are provided in the Supplementary Material. Supplementary data are available at Bioinformatics online.

  17. Classification of cancer cell lines using matrix-assisted laser desorption/ionization time‑of‑flight mass spectrometry and statistical analysis.

    PubMed

    Serafim, Vlad; Shah, Ajit; Puiu, Maria; Andreescu, Nicoleta; Coricovac, Dorina; Nosyrev, Alexander; Spandidos, Demetrios A; Tsatsakis, Aristides M; Dehelean, Cristina; Pinzaru, Iulia

    2017-10-01

    Over the past decade, matrix-assisted laser desorption/ionization time‑of‑flight mass spectrometry (MALDI‑TOF MS) has been established as a valuable platform for microbial identification, and it is also frequently applied in biology and clinical studies to identify new markers expressed in pathological conditions. The aim of the present study was to assess the potential of using this approach for the classification of cancer cell lines as a quantifiable method for the proteomic profiling of cellular organelles. Intact protein extracts isolated from different tumor cell lines (human and murine) were analyzed using MALDI‑TOF MS and the obtained mass lists were processed using principle component analysis (PCA) within Bruker Biotyper® software. Furthermore, reference spectra were created for each cell line and were used for classification. Based on the intact protein profiles, we were able to differentiate and classify six cancer cell lines: two murine melanoma (B16‑F0 and B164A5), one human melanoma (A375), two human breast carcinoma (MCF7 and MDA‑MB‑231) and one human liver carcinoma (HepG2). The cell lines were classified according to cancer type and the species they originated from, as well as by their metastatic potential, offering the possibility to differentiate non‑invasive from invasive cells. The obtained results pave the way for developing a broad‑based strategy for the identification and classification of cancer cells.

  18. On-the-spot lung cancer differential diagnosis by label-free, molecular vibrational imaging and knowledge-based classification

    NASA Astrophysics Data System (ADS)

    Gao, Liang; Li, Fuhai; Thrall, Michael J.; Yang, Yaliang; Xing, Jiong; Hammoudi, Ahmad A.; Zhao, Hong; Massoud, Yehia; Cagle, Philip T.; Fan, Yubo; Wong, Kelvin K.; Wang, Zhiyong; Wong, Stephen T. C.

    2011-09-01

    We report the development and application of a knowledge-based coherent anti-Stokes Raman scattering (CARS) microscopy system for label-free imaging, pattern recognition, and classification of cells and tissue structures for differentiating lung cancer from non-neoplastic lung tissues and identifying lung cancer subtypes. A total of 1014 CARS images were acquired from 92 fresh frozen lung tissue samples. The established pathological workup and diagnostic cellular were used as prior knowledge for establishment of a knowledge-based CARS system using a machine learning approach. This system functions to separate normal, non-neoplastic, and subtypes of lung cancer tissues based on extracted quantitative features describing fibrils and cell morphology. The knowledge-based CARS system showed the ability to distinguish lung cancer from normal and non-neoplastic lung tissue with 91% sensitivity and 92% specificity. Small cell carcinomas were distinguished from nonsmall cell carcinomas with 100% sensitivity and specificity. As an adjunct to submitting tissue samples to routine pathology, our novel system recognizes the patterns of fibril and cell morphology, enabling medical practitioners to perform differential diagnosis of lung lesions in mere minutes. The demonstration of the strategy is also a necessary step toward in vivo point-of-care diagnosis of precancerous and cancerous lung lesions with a fiber-based CARS microendoscope.

  19. Mammographic features and subsequent risk of breast cancer: a comparison of qualitative and quantitative evaluations in the Guernsey prospective studies.

    PubMed

    Torres-Mejía, Gabriela; De Stavola, Bianca; Allen, Diane S; Pérez-Gavilán, Juan J; Ferreira, Jorge M; Fentiman, Ian S; Dos Santos Silva, Isabel

    2005-05-01

    Mammographic features are known to be associated with breast cancer but the magnitude of the effect differs markedly from study to study. Methods to assess mammographic features range from subjective qualitative classifications to computer-automated quantitative measures. We used data from the UK Guernsey prospective studies to examine the relative value of these methods in predicting breast cancer risk. In all, 3,211 women ages > or =35 years who had a mammogram taken in 1986 to 1989 were followed-up to the end of October 2003, with 111 developing breast cancer during this period. Mammograms were classified using the subjective qualitative Wolfe classification and several quantitative mammographic features measured using computer-based techniques. Breast cancer risk was positively associated with high-grade Wolfe classification, percent breast density and area of dense tissue, and negatively associated with area of lucent tissue, fractal dimension, and lacunarity. Inclusion of the quantitative measures in the same model identified area of dense tissue and lacunarity as the best predictors of breast cancer, with risk increasing by 59% [95% confidence interval (95% CI), 29-94%] per SD increase in total area of dense tissue but declining by 39% (95% CI, 53-22%) per SD increase in lacunarity, after adjusting for each other and for other confounders. Comparison of models that included both the qualitative Wolfe classification and these two quantitative measures to models that included either the qualitative or the two quantitative variables showed that they all made significant contributions to prediction of breast cancer risk. These findings indicate that breast cancer risk is affected not only by the amount of mammographic density but also by the degree of heterogeneity of the parenchymal pattern and, presumably, by other features captured by the Wolfe classification.

  20. Pathological Bases for a Robust Application of Cancer Molecular Classification

    PubMed Central

    Diaz-Cano, Salvador J.

    2015-01-01

    Any robust classification system depends on its purpose and must refer to accepted standards, its strength relying on predictive values and a careful consideration of known factors that can affect its reliability. In this context, a molecular classification of human cancer must refer to the current gold standard (histological classification) and try to improve it with key prognosticators for metastatic potential, staging and grading. Although organ-specific examples have been published based on proteomics, transcriptomics and genomics evaluations, the most popular approach uses gene expression analysis as a direct correlate of cellular differentiation, which represents the key feature of the histological classification. RNA is a labile molecule that varies significantly according with the preservation protocol, its transcription reflect the adaptation of the tumor cells to the microenvironment, it can be passed through mechanisms of intercellular transference of genetic information (exosomes), and it is exposed to epigenetic modifications. More robust classifications should be based on stable molecules, at the genetic level represented by DNA to improve reliability, and its analysis must deal with the concept of intratumoral heterogeneity, which is at the origin of tumor progression and is the byproduct of the selection process during the clonal expansion and progression of neoplasms. The simultaneous analysis of multiple DNA targets and next generation sequencing offer the best practical approach for an analytical genomic classification of tumors. PMID:25898411

  1. Parameters selection in gene selection using Gaussian kernel support vector machines by genetic algorithm.

    PubMed

    Mao, Yong; Zhou, Xiao-Bo; Pi, Dao-Ying; Sun, You-Xian; Wong, Stephen T C

    2005-10-01

    In microarray-based cancer classification, gene selection is an important issue owing to the large number of variables and small number of samples as well as its non-linearity. It is difficult to get satisfying results by using conventional linear statistical methods. Recursive feature elimination based on support vector machine (SVM RFE) is an effective algorithm for gene selection and cancer classification, which are integrated into a consistent framework. In this paper, we propose a new method to select parameters of the aforementioned algorithm implemented with Gaussian kernel SVMs as better alternatives to the common practice of selecting the apparently best parameters by using a genetic algorithm to search for a couple of optimal parameter. Fast implementation issues for this method are also discussed for pragmatic reasons. The proposed method was tested on two representative hereditary breast cancer and acute leukaemia datasets. The experimental results indicate that the proposed method performs well in selecting genes and achieves high classification accuracies with these genes.

  2. Clinical application of a microfluidic chip for immunocapture and quantification of circulating exosomes to assist breast cancer diagnosis and molecular classification.

    PubMed

    Fang, Shimeng; Tian, Hongzhu; Li, Xiancheng; Jin, Dong; Li, Xiaojie; Kong, Jing; Yang, Chun; Yang, Xuesong; Lu, Yao; Luo, Yong; Lin, Bingcheng; Niu, Weidong; Liu, Tingjiao

    2017-01-01

    Increasing attention has been attracted by exosomes in blood-based diagnosis because cancer cells release more exosomes in serum than normal cells and these exosomes overexpress a certain number of cancer-related biomarkers. However, capture and biomarker analysis of exosomes for clinical application are technically challenging. In this study, we developed a microfluidic chip for immunocapture and quantification of circulating exosomes from small sample volume and applied this device in clinical study. Circulating EpCAM-positive exosomes were measured in 6 cases breast cancer patients and 3 healthy controls to assist diagnosis. A significant increase in the EpCAM-positive exosome level in these patients was detected, compared to healthy controls. Furthermore, we quantified circulating HER2-positive exosomes in 19 cases of breast cancer patients for molecular classification. We demonstrated that the exosomal HER2 expression levels were almost consistent with that in tumor tissues assessed by immunohistochemical staining. The microfluidic chip might provide a new platform to assist breast cancer diagnosis and molecular classification.

  3. Morphological feature extraction for the classification of digital images of cancerous tissues.

    PubMed

    Thiran, J P; Macq, B

    1996-10-01

    This paper presents a new method for automatic recognition of cancerous tissues from an image of a microscopic section. Based on the shape and the size analysis of the observed cells, this method provides the physician with nonsubjective numerical values for four criteria of malignancy. This automatic approach is based on mathematical morphology, and more specifically on the use of Geodesy. This technique is used first to remove the background noise from the image and then to operate a segmentation of the nuclei of the cells and an analysis of their shape, their size, and their texture. From the values of the extracted criteria, an automatic classification of the image (cancerous or not) is finally operated.

  4. Overview of the 8th Edition TNM Classification for Head and Neck Cancer.

    PubMed

    Huang, Shao Hui; O'Sullivan, Brian

    2017-07-01

    The main purpose of the TNM system is to provide an anatomic-based classification to adequately depict cancer prognosis. Accurate cancer staging is important for treatment selection and outcome prediction, research design, and cancer control activities. To maintain clinical relevance, periodical updates to TNM are necessary. The recently published 8th edition TNM classification institutes the following changes to the staging of head and neck (excluding thyroid cancer): new stage classifications [HPV-related oropharyngeal cancer (HPV+ OPC) and soft tissue sarcoma of the head and neck (HN-STS)] and modification of T and N categories [T and N categories for nasopharyngeal cancer (NPC), T categories for oral cavity squamous cell carcinomas (OSCC), N categories for non-viral related head and neck cancer and unknown primary (CUP), and T categories for head and neck cutaneous carcinoma]. These changes reflect better understanding tumor biology and clinical behavior (e.g., HPV+ OPC and HN-STS), improved outcomes associated with technical advances in diagnosis and treatment (e.g., NPC), evolving knowledge about additional prognostic factors and risk stratification from research and observation (e.g., inclusion of depth of invasion variable for OSCC, inclusion of extranodal extension variable for all non-viral head and neck cancer, and reintroduction of size criteria for non-Merkel cell cutaneous carcinoma of the head and neck). This review summarizes the changes and potential advantages and limitations/caveats associated with them. Further evidence is needed to evaluate whether these changes would result in improvement in TNM stage performance to better serve the needs for clinical care, research, and cancer control.

  5. Confocal Raman imaging for cancer cell classification

    NASA Astrophysics Data System (ADS)

    Mathieu, Evelien; Van Dorpe, Pol; Stakenborg, Tim; Liu, Chengxun; Lagae, Liesbet

    2014-05-01

    We propose confocal Raman imaging as a label-free single cell characterization method that can be used as an alternative for conventional cell identification techniques that typically require labels, long incubation times and complex sample preparation. In this study it is investigated whether cancer and blood cells can be distinguished based on their Raman spectra. 2D Raman scans are recorded of 114 single cells, i.e. 60 breast (MCF-7), 5 cervix (HeLa) and 39 prostate (LNCaP) cancer cells and 10 monocytes (from healthy donors). For each cell an average spectrum is calculated and principal component analysis is performed on all average cell spectra. The main features of these principal components indicate that the information for cell identification based on Raman spectra mainly comes from the fatty acid composition in the cell. Based on the second and third principal component, blood cells could be distinguished from cancer cells; and prostate cancer cells could be distinguished from breast and cervix cancer cells. However, it was not possible to distinguish breast and cervix cancer cells. The results obtained in this study, demonstrate the potential of confocal Raman imaging for cell type classification and identification purposes.

  6. Classification of breast cancer cytological specimen using convolutional neural network

    NASA Astrophysics Data System (ADS)

    Żejmo, Michał; Kowal, Marek; Korbicz, Józef; Monczak, Roman

    2017-01-01

    The paper presents a deep learning approach for automatic classification of breast tumors based on fine needle cytology. The main aim of the system is to distinguish benign from malignant cases based on microscopic images. Experiment was carried out on cytological samples derived from 50 patients (25 benign cases + 25 malignant cases) diagnosed in Regional Hospital in Zielona Góra. To classify microscopic images, we used convolutional neural networks (CNN) of two types: GoogLeNet and AlexNet. Due to the very large size of images of cytological specimen (on average 200000 × 100000 pixels), they were divided into smaller patches of size 256 × 256 pixels. Breast cancer classification usually is based on morphometric features of nuclei. Therefore, training and validation patches were selected using Support Vector Machine (SVM) so that suitable amount of cell material was depicted. Neural classifiers were tuned using GPU accelerated implementation of gradient descent algorithm. Training error was defined as a cross-entropy classification loss. Classification accuracy was defined as the percentage ratio of successfully classified validation patches to the total number of validation patches. The best accuracy rate of 83% was obtained by GoogLeNet model. We observed that more misclassified patches belong to malignant cases.

  7. Automated noninvasive classification of renal cancer on multiphase CT

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Linguraru, Marius George; Wang, Shijun; Shah, Furhawn

    2011-10-15

    Purpose: To explore the added value of the shape of renal lesions for classifying renal neoplasms. To investigate the potential of computer-aided analysis of contrast-enhanced computed-tomography (CT) to quantify and classify renal lesions. Methods: A computer-aided clinical tool based on adaptive level sets was employed to analyze 125 renal lesions from contrast-enhanced abdominal CT studies of 43 patients. There were 47 cysts and 78 neoplasms: 22 Von Hippel-Lindau (VHL), 16 Birt-Hogg-Dube (BHD), 19 hereditary papillary renal carcinomas (HPRC), and 21 hereditary leiomyomatosis and renal cell cancers (HLRCC). The technique quantified the three-dimensional size and enhancement of lesions. Intrapatient and interphasemore » registration facilitated the study of lesion serial enhancement. The histograms of curvature-related features were used to classify the lesion types. The areas under the curve (AUC) were calculated for receiver operating characteristic curves. Results: Tumors were robustly segmented with 0.80 overlap (0.98 correlation) between manual and semi-automated quantifications. The method further identified morphological discrepancies between the types of lesions. The classification based on lesion appearance, enhancement and morphology between cysts and cancers showed AUC = 0.98; for BHD + VHL (solid cancers) vs. HPRC + HLRCC AUC = 0.99; for VHL vs. BHD AUC = 0.82; and for HPRC vs. HLRCC AUC = 0.84. All semi-automated classifications were statistically significant (p < 0.05) and superior to the analyses based solely on serial enhancement. Conclusions: The computer-aided clinical tool allowed the accurate quantification of cystic, solid, and mixed renal tumors. Cancer types were classified into four categories using their shape and enhancement. Comprehensive imaging biomarkers of renal neoplasms on abdominal CT may facilitate their noninvasive classification, guide clinical management, and monitor responses to drugs or interventions.« less

  8. Cardiovascular, diabetes, and cancer strips: evidences, mechanisms, and classifications

    PubMed Central

    Wu, Qing-Hua; Hu, Da-Yi

    2014-01-01

    Objectives To report and name firstly that there are cardiovascular disease (CVD), diabetes mellitus (DM) and cancers (CDC) strips; and disclose their mechanisms, classifications, and clinical significances. Study design Narrative and systematic review study and interpretive analysis. Methods Data sources and study selection: to collect and present related evidences on CDC strips from evidence-based, open-access, both Chinese- and English-language literatures in recent 10 years on clinical trials from PubMed according to keywords “CVD, DM and cancers” as well as authors’ extensive clinical experience with the treatment of more than fifty thousands of patients with CVD, diabetes and cancers over the past decades, and analyze their related mechanisms and categories which based on authors’ previous works. Data extraction: data were mainly extracted from 48 articles which are listed in the reference section of this review. Qualitative, quantitative and mixed data were included, narratively and systematically reviewed. Results With several conceptual and technical breakthrough, authors present related evidences on CDC strips, these are, CVD and DM, DM and cancers, cancers and CVD linked, respectively; And “Bad SEED” +/– “bad soil” theory or doctrine may explain this phenomenon due to “internal environmental injure, abnormal or unbalance” in human body resulting from the role of risk factors (RFs) related multi-pathways and multi-targets, which including organ & tissue (e.g., vascular-specific), cell and gene-based mechanisms. Their classifications include main strips/type B, and Branches/type A as showed by tables and figures in this article. Conclusions There are CDC strips and related mechanisms and classifications. CDC strips may help us to understand, prevent, and control related common non-communicable diseases (NCDs) as well as these high risk strips. PMID:25276377

  9. Dual-modal cancer detection based on optical pH sensing and Raman spectroscopy.

    PubMed

    Kim, Soogeun; Lee, Seung Ho; Min, Sun Young; Byun, Kyung Min; Lee, Soo Yeol

    2017-10-01

    A dual-modal approach using Raman spectroscopy and optical pH sensing was investigated to discriminate between normal and cancerous tissues. Raman spectroscopy has demonstrated the potential for in vivo cancer detection. However, Raman spectroscopy has suffered from strong fluorescence background of biological samples and subtle spectral differences between normal and disease tissues. To overcome those issues, pH sensing is adopted to Raman spectroscopy as a dual-modal approach. Based on the fact that the pH level in cancerous tissues is lower than that in normal tissues due to insufficient vasculature formation, the dual-modal approach combining the chemical information of Raman spectrum and the metabolic information of pH level can improve the specificity of cancer diagnosis. From human breast tissue samples, Raman spectra and pH levels are measured using fiber-optic-based Raman and pH probes, respectively. The pH sensing is based on the dependence of pH level on optical transmission spectrum. Multivariate statistical analysis is performed to evaluate the classification capability of the dual-modal method. The analytical results show that the dual-modal method based on Raman spectroscopy and optical pH sensing can improve the performance of cancer classification. (2017) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE).

  10. An efficient ensemble learning method for gene microarray classification.

    PubMed

    Osareh, Alireza; Shadgar, Bita

    2013-01-01

    The gene microarray analysis and classification have demonstrated an effective way for the effective diagnosis of diseases and cancers. However, it has been also revealed that the basic classification techniques have intrinsic drawbacks in achieving accurate gene classification and cancer diagnosis. On the other hand, classifier ensembles have received increasing attention in various applications. Here, we address the gene classification issue using RotBoost ensemble methodology. This method is a combination of Rotation Forest and AdaBoost techniques which in turn preserve both desirable features of an ensemble architecture, that is, accuracy and diversity. To select a concise subset of informative genes, 5 different feature selection algorithms are considered. To assess the efficiency of the RotBoost, other nonensemble/ensemble techniques including Decision Trees, Support Vector Machines, Rotation Forest, AdaBoost, and Bagging are also deployed. Experimental results have revealed that the combination of the fast correlation-based feature selection method with ICA-based RotBoost ensemble is highly effective for gene classification. In fact, the proposed method can create ensemble classifiers which outperform not only the classifiers produced by the conventional machine learning but also the classifiers generated by two widely used conventional ensemble learning methods, that is, Bagging and AdaBoost.

  11. The classification of lung cancers and their degree of malignancy by FTIR, PCA-LDA analysis, and a physics-based computational model.

    PubMed

    Kaznowska, E; Depciuch, J; Łach, K; Kołodziej, M; Koziorowska, A; Vongsvivut, J; Zawlik, I; Cholewa, M; Cebulski, J

    2018-08-15

    Lung cancer has the highest mortality rate of all malignant tumours. The current effects of cancer treatment, as well as its diagnostics, are unsatisfactory. Therefore it is very important to introduce modern diagnostic tools, which will allow for rapid classification of lung cancers and their degree of malignancy. For this purpose, the authors propose the use of Fourier Transform InfraRed (FTIR) spectroscopy combined with Principal Component Analysis-Linear Discriminant Analysis (PCA-LDA) and a physics-based computational model. The results obtained for lung cancer tissues, adenocarcinoma and squamous cell carcinoma FTIR spectra, show a shift in wavenumbers compared to control tissue FTIR spectra. Furthermore, in the FTIR spectra of adenocarcinoma there are no peaks corresponding to glutamate or phospholipid functional groups. Moreover, in the case of G2 and G3 malignancy of adenocarcinoma lung cancer, the absence of an OH groups peak was noticed. Thus, it seems that FTIR spectroscopy is a valuable tool to classify lung cancer and to determine the degree of its malignancy. Copyright © 2018 Elsevier B.V. All rights reserved.

  12. Inhibition of Breast Cancer Progression by Blocking Heterocellular Contact Between Epithelial Cells and Fibroblasts

    DTIC Science & Technology

    2013-04-01

    by employing a microfluidic -based compartmentalized 3D co-culture platform enabling both contact-free and contact-associated co-cultures. 15...SUBJECT TERMS Heterocellular contact between cancer cells and stromal fibroblasts, Microfluidics , 3D 16. SECURITY CLASSIFICATION OF: 17. LIMITATION...and human mammary fibroblasts (HMFs) in breast cancer progression by employing a microfluidic - based compartmentalized 3D co-culture platform

  13. A "TNM" classification system for cancer pain: the Edmonton Classification System for Cancer Pain (ECS-CP).

    PubMed

    Fainsinger, Robin L; Nekolaichuk, Cheryl L

    2008-06-01

    The purpose of this paper is to provide an overview of the development of a "TNM" cancer pain classification system for advanced cancer patients, the Edmonton Classification System for Cancer Pain (ECS-CP). Until we have a common international language to discuss cancer pain, understanding differences in clinical and research experience in opioid rotation and use remains problematic. The complexity of the cancer pain experience presents unique challenges for the classification of pain. To date, no universally accepted pain classification measure can accurately predict the complexity of pain management, particularly for patients with cancer pain that is difficult to treat. In response to this gap in clinical assessment, the Edmonton Staging System (ESS), a classification system for cancer pain, was developed. Difficulties in definitions and interpretation of some aspects of the ESS restricted acceptance and widespread use. Construct, inter-rater reliability, and predictive validity evidence have contributed to the development of the ECS-CP. The five features of the ECS-CP--Pain Mechanism, Incident Pain, Psychological Distress, Addictive Behavior and Cognitive Function--have demonstrated value in predicting pain management complexity. The development of a standardized classification system that is comprehensive, prognostic and simple to use could provide a common language for clinical management and research of cancer pain. An international study to assess the inter-rater reliability and predictive value of the ECS-CP is currently in progress.

  14. Two-dimensional shape classification using generalized Fourier representation and neural networks

    NASA Astrophysics Data System (ADS)

    Chodorowski, Artur; Gustavsson, Tomas; Mattsson, Ulf

    2000-04-01

    A shape-based classification method is developed based upon the Generalized Fourier Representation (GFR). GFR can be regarded as an extension of traditional polar Fourier descriptors, suitable for description of closed objects, both convex and concave, with or without holes. Explicit relations of GFR coefficients to regular moments, moment invariants and affine moment invariants are given in the paper. The dual linear relation between GFR coefficients and regular moments was used to compare shape features derive from GFR descriptors and Hu's moment invariants. the GFR was then applied to a clinical problem within oral medicine and used to represent the contours of the lesions in the oral cavity. The lesions studied were leukoplakia and different forms of lichenoid reactions. Shape features were extracted from GFR coefficients in order to classify potentially cancerous oral lesions. Alternative classifiers were investigated based on a multilayer perceptron with different architectures and extensions. The overall classification accuracy for recognition of potentially cancerous oral lesions when using neural network classifier was 85%, while the classification between leukoplakia and reticular lichenoid reactions gave 96% (5-fold cross-validated) recognition rate.

  15. Enhancing Breast Cancer Recurrence Algorithms Through Selective Use of Medical Record Data.

    PubMed

    Kroenke, Candyce H; Chubak, Jessica; Johnson, Lisa; Castillo, Adrienne; Weltzien, Erin; Caan, Bette J

    2016-03-01

    The utility of data-based algorithms in research has been questioned because of errors in identification of cancer recurrences. We adapted previously published breast cancer recurrence algorithms, selectively using medical record (MR) data to improve classification. We evaluated second breast cancer event (SBCE) and recurrence-specific algorithms previously published by Chubak and colleagues in 1535 women from the Life After Cancer Epidemiology (LACE) and 225 women from the Women's Health Initiative cohorts and compared classification statistics to published values. We also sought to improve classification with minimal MR examination. We selected pairs of algorithms-one with high sensitivity/high positive predictive value (PPV) and another with high specificity/high PPV-using MR information to resolve discrepancies between algorithms, properly classifying events based on review; we called this "triangulation." Finally, in LACE, we compared associations between breast cancer survival risk factors and recurrence using MR data, single Chubak algorithms, and triangulation. The SBCE algorithms performed well in identifying SBCE and recurrences. Recurrence-specific algorithms performed more poorly than published except for the high-specificity/high-PPV algorithm, which performed well. The triangulation method (sensitivity = 81.3%, specificity = 99.7%, PPV = 98.1%, NPV = 96.5%) improved recurrence classification over two single algorithms (sensitivity = 57.1%, specificity = 95.5%, PPV = 71.3%, NPV = 91.9%; and sensitivity = 74.6%, specificity = 97.3%, PPV = 84.7%, NPV = 95.1%), with 10.6% MR review. Triangulation performed well in survival risk factor analyses vs analyses using MR-identified recurrences. Use of multiple recurrence algorithms in administrative data, in combination with selective examination of MR data, may improve recurrence data quality and reduce research costs. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  16. Enhancing Breast Cancer Recurrence Algorithms Through Selective Use of Medical Record Data

    PubMed Central

    Chubak, Jessica; Johnson, Lisa; Castillo, Adrienne; Weltzien, Erin; Caan, Bette J.

    2016-01-01

    Abstract Background: The utility of data-based algorithms in research has been questioned because of errors in identification of cancer recurrences. We adapted previously published breast cancer recurrence algorithms, selectively using medical record (MR) data to improve classification. Methods: We evaluated second breast cancer event (SBCE) and recurrence-specific algorithms previously published by Chubak and colleagues in 1535 women from the Life After Cancer Epidemiology (LACE) and 225 women from the Women’s Health Initiative cohorts and compared classification statistics to published values. We also sought to improve classification with minimal MR examination. We selected pairs of algorithms—one with high sensitivity/high positive predictive value (PPV) and another with high specificity/high PPV—using MR information to resolve discrepancies between algorithms, properly classifying events based on review; we called this “triangulation.” Finally, in LACE, we compared associations between breast cancer survival risk factors and recurrence using MR data, single Chubak algorithms, and triangulation. Results: The SBCE algorithms performed well in identifying SBCE and recurrences. Recurrence-specific algorithms performed more poorly than published except for the high-specificity/high-PPV algorithm, which performed well. The triangulation method (sensitivity = 81.3%, specificity = 99.7%, PPV = 98.1%, NPV = 96.5%) improved recurrence classification over two single algorithms (sensitivity = 57.1%, specificity = 95.5%, PPV = 71.3%, NPV = 91.9%; and sensitivity = 74.6%, specificity = 97.3%, PPV = 84.7%, NPV = 95.1%), with 10.6% MR review. Triangulation performed well in survival risk factor analyses vs analyses using MR-identified recurrences. Conclusions: Use of multiple recurrence algorithms in administrative data, in combination with selective examination of MR data, may improve recurrence data quality and reduce research costs. PMID:26582243

  17. Revision concepts and distinctive points of the new Japanese classification for biliary tract cancers in comparison with the 7(th) edition of the Union for International Cancer Control and the American Joint Committee on Cancer staging system.

    PubMed

    Ohtsuka, Masayuki; Miyakawa, Shuichi; Nagino, Masato; Takada, Tadahiro; Miyazaki, Masaru

    2015-03-01

    The 3(rd) English edition of the Japanese classification of the biliary tract cancers (JC) is now available in this journal. The primary aim of this revision is to provide all clinicians and researchers with a common language of cancer staging at an international level. On the other hand, there are several important issues that should be solved for the optimization of the staging system. Revision concepts and major revision points of the 3(rd) English edition of the JC were reviewed. Furthermore, comparing with the 7(th) edition of staging system developed by the Union for International Cancer Control (UICC) and the American Joint Committee on Cancer (AJCC), distinctive points in the JC was discussed. In this edition of the JC, the same stage groupings as those in the UICC/AJCC staging system were basically adopted. T, N, and M categories were also identical in principle with those in the UICC/AJCC staging system, although slight modifications were proposed as the "Japanese rules". As distinctive points, perihilar cholangiocarcinomas and ampullary region carcinomas were clearly defined. Intraepithelial tumor was discriminated from invasive carcinoma at ductal resection margins. Classifications of site-specific surgical margin status remained in this edition. Histological classification was based on that in the former editions of the JC, but adopted some parts of the World Health Organization classification. The JC now share its staging system of the biliary tact carcinomas with the UICC/AJCC staging system. Future validation of the "Japanese rules" could provide important evidence to make globally standardized staging system. © 2015 Japanese Society of Hepato-Biliary-Pancreatic Surgery.

  18. Recursive Partitioning Analysis for New Classification of Patients With Esophageal Cancer Treated by Chemoradiotherapy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nomura, Motoo, E-mail: excell@hkg.odn.ne.jp; Department of Clinical Oncology, Aichi Cancer Center Hospital, Nagoya; Department of Radiation Oncology, Aichi Cancer Center Hospital, Nagoya

    2012-11-01

    Background: The 7th edition of the American Joint Committee on Cancer staging system does not include lymph node size in the guidelines for staging patients with esophageal cancer. The objectives of this study were to determine the prognostic impact of the maximum metastatic lymph node diameter (ND) on survival and to develop and validate a new staging system for patients with esophageal squamous cell cancer who were treated with definitive chemoradiotherapy (CRT). Methods: Information on 402 patients with esophageal cancer undergoing CRT at two institutions was reviewed. Univariate and multivariate analyses of data from one institution were used to assessmore » the impact of clinical factors on survival, and recursive partitioning analysis was performed to develop the new staging classification. To assess its clinical utility, the new classification was validated using data from the second institution. Results: By multivariate analysis, gender, T, N, and ND stages were independently and significantly associated with survival (p < 0.05). The resulting new staging classification was based on the T and ND. The four new stages led to good separation of survival curves in both the developmental and validation datasets (p < 0.05). Conclusions: Our results showed that lymph node size is a strong independent prognostic factor and that the new staging system, which incorporated lymph node size, provided good prognostic power, and discriminated effectively for patients with esophageal cancer undergoing CRT.« less

  19. Pulsed terahertz imaging of breast cancer in freshly excised murine tumors

    NASA Astrophysics Data System (ADS)

    Bowman, Tyler; Chavez, Tanny; Khan, Kamrul; Wu, Jingxian; Chakraborty, Avishek; Rajaram, Narasimhan; Bailey, Keith; El-Shenawee, Magda

    2018-02-01

    This paper investigates terahertz (THz) imaging and classification of freshly excised murine xenograft breast cancer tumors. These tumors are grown via injection of E0771 breast adenocarcinoma cells into the flank of mice maintained on high-fat diet. Within 1 h of excision, the tumor and adjacent tissues are imaged using a pulsed THz system in the reflection mode. The THz images are classified using a statistical Bayesian mixture model with unsupervised and supervised approaches. Correlation with digitized pathology images is conducted using classification images assigned by a modal class decision rule. The corresponding receiver operating characteristic curves are obtained based on the classification results. A total of 13 tumor samples obtained from 9 tumors are investigated. The results show good correlation of THz images with pathology results in all samples of cancer and fat tissues. For tumor samples of cancer, fat, and muscle tissues, THz images show reasonable correlation with pathology where the primary challenge lies in the overlapping dielectric properties of cancer and muscle tissues. The use of a supervised regression approach shows improvement in the classification images although not consistently in all tissue regions. Advancing THz imaging of breast tumors from mice and the development of accurate statistical models will ultimately progress the technique for the assessment of human breast tumor margins.

  20. A hybrid approach to select features and classify diseases based on medical data

    NASA Astrophysics Data System (ADS)

    AbdelLatif, Hisham; Luo, Jiawei

    2018-03-01

    Feature selection is popular problem in the classification of diseases in clinical medicine. Here, we developing a hybrid methodology to classify diseases, based on three medical datasets, Arrhythmia, Breast cancer, and Hepatitis datasets. This methodology called k-means ANOVA Support Vector Machine (K-ANOVA-SVM) uses K-means cluster with ANOVA statistical to preprocessing data and selection the significant features, and Support Vector Machines in the classification process. To compare and evaluate the performance, we choice three classification algorithms, decision tree Naïve Bayes, Support Vector Machines and applied the medical datasets direct to these algorithms. Our methodology was a much better classification accuracy is given of 98% in Arrhythmia datasets, 92% in Breast cancer datasets and 88% in Hepatitis datasets, Compare to use the medical data directly with decision tree Naïve Bayes, and Support Vector Machines. Also, the ROC curve and precision with (K-ANOVA-SVM) Achieved best results than other algorithms

  1. Hybrid analysis for indicating patients with breast cancer using temperature time series.

    PubMed

    Silva, Lincoln F; Santos, Alair Augusto S M D; Bravo, Renato S; Silva, Aristófanes C; Muchaluat-Saade, Débora C; Conci, Aura

    2016-07-01

    Breast cancer is the most common cancer among women worldwide. Diagnosis and treatment in early stages increase cure chances. The temperature of cancerous tissue is generally higher than that of healthy surrounding tissues, making thermography an option to be considered in screening strategies of this cancer type. This paper proposes a hybrid methodology for analyzing dynamic infrared thermography in order to indicate patients with risk of breast cancer, using unsupervised and supervised machine learning techniques, which characterizes the methodology as hybrid. The dynamic infrared thermography monitors or quantitatively measures temperature changes on the examined surface, after a thermal stress. In the dynamic infrared thermography execution, a sequence of breast thermograms is generated. In the proposed methodology, this sequence is processed and analyzed by several techniques. First, the region of the breasts is segmented and the thermograms of the sequence are registered. Then, temperature time series are built and the k-means algorithm is applied on these series using various values of k. Clustering formed by k-means algorithm, for each k value, is evaluated using clustering validation indices, generating values treated as features in the classification model construction step. A data mining tool was used to solve the combined algorithm selection and hyperparameter optimization (CASH) problem in classification tasks. Besides the classification algorithm recommended by the data mining tool, classifiers based on Bayesian networks, neural networks, decision rules and decision tree were executed on the data set used for evaluation. Test results support that the proposed analysis methodology is able to indicate patients with breast cancer. Among 39 tested classification algorithms, K-Star and Bayes Net presented 100% classification accuracy. Furthermore, among the Bayes Net, multi-layer perceptron, decision table and random forest classification algorithms, an average accuracy of 95.38% was obtained. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  2. CrossLink: a novel method for cross-condition classification of cancer subtypes.

    PubMed

    Ma, Chifeng; Sastry, Konduru S; Flore, Mario; Gehani, Salah; Al-Bozom, Issam; Feng, Yusheng; Serpedin, Erchin; Chouchane, Lotfi; Chen, Yidong; Huang, Yufei

    2016-08-22

    We considered the prediction of cancer classes (e.g. subtypes) using patient gene expression profiles that contain both systematic and condition-specific biases when compared with the training reference dataset. The conventional normalization-based approaches cannot guarantee that the gene signatures in the reference and prediction datasets always have the same distribution for all different conditions as the class-specific gene signatures change with the condition. Therefore, the trained classifier would work well under one condition but not under another. To address the problem of current normalization approaches, we propose a novel algorithm called CrossLink (CL). CL recognizes that there is no universal, condition-independent normalization mapping of signatures. In contrast, it exploits the fact that the signature is unique to its associated class under any condition and thus employs an unsupervised clustering algorithm to discover this unique signature. We assessed the performance of CL for cross-condition predictions of PAM50 subtypes of breast cancer by using a simulated dataset modeled after TCGA BRCA tumor samples with a cross-validation scheme, and datasets with known and unknown PAM50 classification. CL achieved prediction accuracy >73 %, highest among other methods we evaluated. We also applied the algorithm to a set of breast cancer tumors derived from Arabic population to assign a PAM50 classification to each tumor based on their gene expression profiles. A novel algorithm CrossLink for cross-condition prediction of cancer classes was proposed. In all test datasets, CL showed robust and consistent improvement in prediction performance over other state-of-the-art normalization and classification algorithms.

  3. A comparative study of breast cancer diagnosis based on neural network ensemble via improved training algorithms.

    PubMed

    Azami, Hamed; Escudero, Javier

    2015-08-01

    Breast cancer is one of the most common types of cancer in women all over the world. Early diagnosis of this kind of cancer can significantly increase the chances of long-term survival. Since diagnosis of breast cancer is a complex problem, neural network (NN) approaches have been used as a promising solution. Considering the low speed of the back-propagation (BP) algorithm to train a feed-forward NN, we consider a number of improved NN trainings for the Wisconsin breast cancer dataset: BP with momentum, BP with adaptive learning rate, BP with adaptive learning rate and momentum, Polak-Ribikre conjugate gradient algorithm (CGA), Fletcher-Reeves CGA, Powell-Beale CGA, scaled CGA, resilient BP (RBP), one-step secant and quasi-Newton methods. An NN ensemble, which is a learning paradigm to combine a number of NN outputs, is used to improve the accuracy of the classification task. Results demonstrate that NN ensemble-based classification methods have better performance than NN-based algorithms. The highest overall average accuracy is 97.68% obtained by NN ensemble trained by RBP for 50%-50% training-test evaluation method.

  4. Recurrent neural networks for breast lesion classification based on DCE-MRIs

    NASA Astrophysics Data System (ADS)

    Antropova, Natasha; Huynh, Benjamin; Giger, Maryellen

    2018-02-01

    Dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) plays a significant role in breast cancer screening, cancer staging, and monitoring response to therapy. Recently, deep learning methods are being rapidly incorporated in image-based breast cancer diagnosis and prognosis. However, most of the current deep learning methods make clinical decisions based on 2-dimentional (2D) or 3D images and are not well suited for temporal image data. In this study, we develop a deep learning methodology that enables integration of clinically valuable temporal components of DCE-MRIs into deep learning-based lesion classification. Our work is performed on a database of 703 DCE-MRI cases for the task of distinguishing benign and malignant lesions, and uses the area under the ROC curve (AUC) as the performance metric in conducting that task. We train a recurrent neural network, specifically a long short-term memory network (LSTM), on sequences of image features extracted from the dynamic MRI sequences. These features are extracted with VGGNet, a convolutional neural network pre-trained on a large dataset of natural images ImageNet. The features are obtained from various levels of the network, to capture low-, mid-, and high-level information about the lesion. Compared to a classification method that takes as input only images at a single time-point (yielding an AUC = 0.81 (se = 0.04)), our LSTM method improves lesion classification with an AUC of 0.85 (se = 0.03).

  5. The prognostic efficacy and improvements of the 7th edition Union for International Cancer Control tumor-node-metastasis classifications for Chinese patients with gastric cancer: Results based on a retrospective three-decade population study.

    PubMed

    Gu, Huizi; Li, Dongmei; Zhu, Haitao; Zhang, Hao; Yu, Ying; Qin, Dongxue; Yi, Mei; Li, Xiang; Lu, Ping

    2017-03-01

    This study aimed to evaluate survival trends for patients with gastric cancer in northeast China in the most recent three decades and analyze the applicability of the UICC tumor-node-metastasis (TNM) classification 7th edition for Chinese patients with gastric cancer. A review of all inpatient and outpatient records of patients with gastric cancer was conducted in the first hospital of China Medical University and the Liaoning Cancer Hospital and Institute. All patients who met the inclusion criteria and were seen from January 1980 through December 2009 were included in the study. The primary outcome was 5-year survival, which was analyzed according to decade of diagnosis and TNM classifications. From 1980 through 2009, the 5-year survival rates for patients with gastric cancer (n=2414) increased from 39.1% to 57.3%. Decade of diagnosis was significantly associated with patient survival (p = 0.013), and the 5-year survival rate in the 2000s was remarkably higher than that in the 1980s and 1990s (p = 0.004 and 0.049, respectively). When classified according to the UICC TNM classification of gastric cancer 7th edition, the prognoses of stage IIIA and stage IIIB patients were not significantly different (p = 0.077). However, if stage T4b and stage N0 patients were classified as stage IIIA, the prognoses of stage IIIA and stage IIIB patients were significantly different (p < 0.001). Hence, there was a significant difference in survival during the three time periods in Northeast China. Classifying stage T4b and stage N0 patients as stage IIIA according to the 7th edition of UICC gastric cancer TNM classifications better stratified Chinese patients and predicted prognoses.

  6. Recent time trends in cancer of the oesophagus and gastric cardia in the region of Calvados in France, 1978-1995: a population based study.

    PubMed

    Desoubeaux, N; Le Prieur, A; Launoy, G; Maurel, J; Lefevre, H; Guillois, J M; Gignoux, M

    1999-12-01

    The incidence of oesophageal cancer differs from country to country, and even between areas of the same country. Many studies in recent years have shown an upward trend of a particular histologic type: adenocarcinoma of the oesophagus. It is difficult to precisely locate adenocarcinomas situated at the junction between the oesophagus and the gastric cardia. Clear criteria to define and classify such tumours are essential in order to analyse their evolution. The present study describes the changing incidence of cancers of the oesophagus and gastric cardia according to histologic type from 1978 to 1995 in Calvados, the highest-risk French region with two different topographic classifications of adenocarcinomas: one based on Misumi's criteria and the other based on local extension of cancer. In total, 1835 cancers of the oesophagus and gastric cardia were diagnosed in this period. Incidence rates for oesophageal and gastric cardia cancers standardized on the world population were 24.4/10(5) and 2.4/10(5) in men and 1.4/10(5) and 0.4/10(5) in women, respectively. The time trend in the incidence of squamous cell cancers was downward in men -0.74 (P < 10(-6)) and stable in women +0.04 (P = 0.65). Regarding adenocarcinomas, with the classification based on Misumi's categories, there was a slight but significant upward trend for oesophageal adenocarcinoma in men [mean annual variation of +0.09 (P < 10(-5))] while the tendency was downward and significant for gastric cardia adenocarcinoma [mean annual variation of -0.09 (P < 10(-4))]. When adenocarcinomas of the oesophagus and those of the gastric cardia with oesophageal involvement are taken together (second classification), there was an upward trend which was not significant in men and was significant in women. There was no such upward trend in adenocarcinomas limited to the gastric cardia and/or involving the stomach. Because of the difficulties in determining accurate localization routinely in population-based studies, it seems sensible to preclude classification biases in recommending the grouping together of gastric cardia adenocarcinomas with oesophageal adenocarcinomas, at least with those among the latter occurring in the lower third of the oesophagus.

  7. Automatic classification of ovarian cancer types from cytological images using deep convolutional neural networks.

    PubMed

    Wu, Miao; Yan, Chuanbo; Liu, Huiqiang; Liu, Qian

    2018-06-29

    Ovarian cancer is one of the most common gynecologic malignancies. Accurate classification of ovarian cancer types (serous carcinoma, mucous carcinoma, endometrioid carcinoma, transparent cell carcinoma) is an essential part in the different diagnosis. Computer-aided diagnosis (CADx) can provide useful advice for pathologists to determine the diagnosis correctly. In our study, we employed a Deep Convolutional Neural Networks (DCNN) based on AlexNet to automatically classify the different types of ovarian cancers from cytological images. The DCNN consists of five convolutional layers, three max pooling layers, and two full reconnect layers. Then we trained the model by two group input data separately, one was original image data and the other one was augmented image data including image enhancement and image rotation. The testing results are obtained by the method of 10-fold cross-validation, showing that the accuracy of classification models has been improved from 72.76 to 78.20% by using augmented images as training data. The developed scheme was useful for classifying ovarian cancers from cytological images. © 2018 The Author(s).

  8. Application of linear discriminant analysis and Attenuated Total Reflectance Fourier Transform Infrared microspectroscopy for diagnosis of colon cancer.

    PubMed

    Khanmohammadi, Mohammadreza; Bagheri Garmarudi, Amir; Samani, Simin; Ghasemi, Keyvan; Ashuri, Ahmad

    2011-06-01

    Attenuated Total Reflectance Fourier Transform Infrared (ATR-FTIR) microspectroscopy was applied for detection of colon cancer according to the spectral features of colon tissues. Supervised classification models can be trained to identify the tissue type based on the spectroscopic fingerprint. A total of 78 colon tissues were used in spectroscopy studies. Major spectral differences were observed in 1,740-900 cm(-1) spectral region. Several chemometric methods such as analysis of variance (ANOVA), cluster analysis (CA) and linear discriminate analysis (LDA) were applied for classification of IR spectra. Utilizing the chemometric techniques, clear and reproducible differences were observed between the spectra of normal and cancer cases, suggesting that infrared microspectroscopy in conjunction with spectral data processing would be useful for diagnostic classification. Using LDA technique, the spectra were classified into cancer and normal tissue classes with an accuracy of 95.8%. The sensitivity and specificity was 100 and 93.1%, respectively.

  9. Challenges in projecting clustering results across gene expression-profiling datasets.

    PubMed

    Lusa, Lara; McShane, Lisa M; Reid, James F; De Cecco, Loris; Ambrogi, Federico; Biganzoli, Elia; Gariboldi, Manuela; Pierotti, Marco A

    2007-11-21

    Gene expression microarray studies for several types of cancer have been reported to identify previously unknown subtypes of tumors. For breast cancer, a molecular classification consisting of five subtypes based on gene expression microarray data has been proposed. These subtypes have been reported to exist across several breast cancer microarray studies, and they have demonstrated some association with clinical outcome. A classification rule based on the method of centroids has been proposed for identifying the subtypes in new collections of breast cancer samples; the method is based on the similarity of the new profiles to the mean expression profile of the previously identified subtypes. Previously identified centroids of five breast cancer subtypes were used to assign 99 breast cancer samples, including a subset of 65 estrogen receptor-positive (ER+) samples, to five breast cancer subtypes based on microarray data for the samples. The effect of mean centering the genes (i.e., transforming the expression of each gene so that its mean expression is equal to 0) on subtype assignment by method of centroids was assessed. Further studies of the effect of mean centering and of class prevalence in the test set on the accuracy of method of centroids classifications of ER status were carried out using training and test sets for which ER status had been independently determined by ligand-binding assay and for which the proportion of ER+ and ER- samples were systematically varied. When all 99 samples were considered, mean centering before application of the method of centroids appeared to be helpful for correctly assigning samples to subtypes, as evidenced by the expression of genes that had previously been used as markers to identify the subtypes. However, when only the 65 ER+ samples were considered for classification, many samples appeared to be misclassified, as evidenced by an unexpected distribution of ER+ samples among the resultant subtypes. When genes were mean centered before classification of samples for ER status, the accuracy of the ER subgroup assignments was highly dependent on the proportion of ER+ samples in the test set; this effect of subtype prevalence was not seen when gene expression data were not mean centered. Simple corrections such as mean centering of genes aimed at microarray platform or batch effect correction can have undesirable consequences because patient population effects can easily be confused with these assay-related effects. Careful thought should be given to the comparability of the patient populations before attempting to force data comparability for purposes of assigning subtypes to independent subjects.

  10. Multiclass cancer classification using a feature subset-based ensemble from microRNA expression profiles.

    PubMed

    Piao, Yongjun; Piao, Minghao; Ryu, Keun Ho

    2017-01-01

    Cancer classification has been a crucial topic of research in cancer treatment. In the last decade, messenger RNA (mRNA) expression profiles have been widely used to classify different types of cancers. With the discovery of a new class of small non-coding RNAs; known as microRNAs (miRNAs), various studies have shown that the expression patterns of miRNA can also accurately classify human cancers. Therefore, there is a great demand for the development of machine learning approaches to accurately classify various types of cancers using miRNA expression data. In this article, we propose a feature subset-based ensemble method in which each model is learned from a different projection of the original feature space to classify multiple cancers. In our method, the feature relevance and redundancy are considered to generate multiple feature subsets, the base classifiers are learned from each independent miRNA subset, and the average posterior probability is used to combine the base classifiers. To test the performance of our method, we used bead-based and sequence-based miRNA expression datasets and conducted 10-fold and leave-one-out cross validations. The experimental results show that the proposed method yields good results and has higher prediction accuracy than popular ensemble methods. The Java program and source code of the proposed method and the datasets in the experiments are freely available at https://sourceforge.net/projects/mirna-ensemble/. Copyright © 2016 Elsevier Ltd. All rights reserved.

  11. Serrated colorectal cancer: Molecular classification, prognosis, and response to chemotherapy

    PubMed Central

    Murcia, Oscar; Juárez, Miriam; Hernández-Illán, Eva; Egoavil, Cecilia; Giner-Calabuig, Mar; Rodríguez-Soler, María; Jover, Rodrigo

    2016-01-01

    Molecular advances support the existence of an alternative pathway of colorectal carcinogenesis that is based on the hypermethylation of specific DNA regions that silences tumor suppressor genes. This alternative pathway has been called the serrated pathway due to the serrated appearance of tumors in histological analysis. New classifications for colorectal cancer (CRC) were proposed recently based on genetic profiles that show four types of molecular alterations: BRAF gene mutations, KRAS gene mutations, microsatellite instability, and hypermethylation of CpG islands. This review summarizes what is known about the serrated pathway of CRC, including CRC molecular and clinical features, prognosis, and response to chemotherapy. PMID:27053844

  12. The road map towards providing a robust Raman spectroscopy-based cancer diagnostic platform and integration into clinic

    NASA Astrophysics Data System (ADS)

    Lau, Katherine; Isabelle, Martin; Lloyd, Gavin R.; Old, Oliver; Shepherd, Neil; Bell, Ian M.; Dorney, Jennifer; Lewis, Aaran; Gaifulina, Riana; Rodriguez-Justo, Manuel; Kendall, Catherine; Stone, Nicolas; Thomas, Geraint; Reece, David

    2016-03-01

    Despite the demonstrated potential as an accurate cancer diagnostic tool, Raman spectroscopy (RS) is yet to be adopted by the clinic for histopathology reviews. The Stratified Medicine through Advanced Raman Technologies (SMART) consortium has begun to address some of the hurdles in its adoption for cancer diagnosis. These hurdles include awareness and acceptance of the technology, practicality of integration into the histopathology workflow, data reproducibility and availability of transferrable models. We have formed a consortium, in joint efforts, to develop optimised protocols for tissue sample preparation, data collection and analysis. These protocols will be supported by provision of suitable hardware and software tools to allow statistically sound classification models to be built and transferred for use on different systems. In addition, we are building a validated gastrointestinal (GI) cancers model, which can be trialled as part of the histopathology workflow at hospitals, and a classification tool. At the end of the project, we aim to deliver a robust Raman based diagnostic platform to enable clinical researchers to stage cancer, define tumour margin, build cancer diagnostic models and discover novel disease bio markers.

  13. Changes in classification of genetic variants in BRCA1 and BRCA2.

    PubMed

    Kast, Karin; Wimberger, Pauline; Arnold, Norbert

    2018-02-01

    Classification of variants of unknown significance (VUS) in the breast cancer genes BRCA1 and BRCA2 changes with accumulating evidence for clinical relevance. In most cases down-staging towards neutral variants without clinical significance is possible. We searched the database of the German Consortium for Hereditary Breast and Ovarian Cancer (GC-HBOC) for changes in classification of genetic variants as an update to our earlier publication on genetic variants in the Centre of Dresden. Changes between 2015 and 2017 were recorded. In the group of variants of unclassified significance (VUS, Class 3, uncertain), only changes of classification towards neutral genetic variants were noted. In BRCA1, 25% of the Class 3 variants (n = 2/8) changed to Class 2 (likely benign) and Class 1 (benign). In BRCA2, in 50% of the Class 3 variants (n = 16/32), a change to Class 2 (n = 10/16) or Class 1 (n = 6/16) was observed. No change in classification was noted in Class 4 (likely pathogenic) and Class 5 (pathogenic) genetic variants in both genes. No up-staging from Class 1, Class 2 or Class 3 to more clinical significance was observed. All variants with a change in classification in our cohort were down-staged towards no clinical significance by a panel of experts of the German Consortium for Hereditary Breast and Ovarian Cancer (GC-HBOC). Prevention in families with Class 3 variants should be based on pedigree based risks and should not be guided by the presence of a VUS.

  14. Cancer Therapy (Preclinical and Clinical): A Decimal Classification, (Categories 51.1, 51.2, and 51.3).

    ERIC Educational Resources Information Center

    Schneider, John H.

    This hierarchical decimal classification of information related to cancer therapy in humans and animals (preceeded by a few general categories) is a working draft of categories taken from an extensive classification of biomedical information. Because the classification identifies very small areas of cancer information, it can be used for precise…

  15. A systematic approach to prioritize drug targets using machine learning, a molecular descriptor-based classification model, and high-throughput screening of plant derived molecules: a case study in oral cancer.

    PubMed

    Randhawa, Vinay; Kumar Singh, Anil; Acharya, Vishal

    2015-12-01

    Systems-biology inspired identification of drug targets and machine learning-based screening of small molecules which modulate their activity have the potential to revolutionize modern drug discovery by complementing conventional methods. To utilize the effectiveness of such pipelines, we first analyzed the dysregulated gene pairs between control and tumor samples and then implemented an ensemble-based feature selection approach to prioritize targets in oral squamous cell carcinoma (OSCC) for therapeutic exploration. Based on the structural information of known inhibitors of CXCR4-one of the best targets identified in this study-a feature selection was implemented for the identification of optimal structural features (molecular descriptor) based on which a classification model was generated. Furthermore, the CXCR4-centered descriptor-based classification model was finally utilized to screen a repository of plant derived small-molecules to obtain potential inhibitors. The application of our methodology may assist effective selection of the best targets which may have previously been overlooked, that in turn will lead to the development of new oral cancer medications. The small molecules identified in this study can be ideal candidates for trials as potential novel anti-oral cancer agents. Importantly, distinct steps of this whole study may provide reference for the analysis of other complex human diseases.

  16. Similarity-balanced discriminant neighbor embedding and its application to cancer classification based on gene expression data.

    PubMed

    Zhang, Li; Qian, Liqiang; Ding, Chuntao; Zhou, Weida; Li, Fanzhang

    2015-09-01

    The family of discriminant neighborhood embedding (DNE) methods is typical graph-based methods for dimension reduction, and has been successfully applied to face recognition. This paper proposes a new variant of DNE, called similarity-balanced discriminant neighborhood embedding (SBDNE) and applies it to cancer classification using gene expression data. By introducing a novel similarity function, SBDNE deals with two data points in the same class and the different classes with different ways. The homogeneous and heterogeneous neighbors are selected according to the new similarity function instead of the Euclidean distance. SBDNE constructs two adjacent graphs, or between-class adjacent graph and within-class adjacent graph, using the new similarity function. According to these two adjacent graphs, we can generate the local between-class scatter and the local within-class scatter, respectively. Thus, SBDNE can maximize the between-class scatter and simultaneously minimize the within-class scatter to find the optimal projection matrix. Experimental results on six microarray datasets show that SBDNE is a promising method for cancer classification. Copyright © 2015 Elsevier Ltd. All rights reserved.

  17. Case base classification on digital mammograms: improving the performance of case base classifier

    NASA Astrophysics Data System (ADS)

    Raman, Valliappan; Then, H. H.; Sumari, Putra; Venkatesa Mohan, N.

    2011-10-01

    Breast cancer continues to be a significant public health problem in the world. Early detection is the key for improving breast cancer prognosis. The aim of the research presented here is in twofold. First stage of research involves machine learning techniques, which segments and extracts features from the mass of digital mammograms. Second level is on problem solving approach which includes classification of mass by performance based case base classifier. In this paper we build a case-based Classifier in order to diagnose mammographic images. We explain different methods and behaviors that have been added to the classifier to improve the performance of the classifier. Currently the initial Performance base Classifier with Bagging is proposed in the paper and it's been implemented and it shows an improvement in specificity and sensitivity.

  18. Classification algorithm of lung lobe for lung disease cases based on multislice CT images

    NASA Astrophysics Data System (ADS)

    Matsuhiro, M.; Kawata, Y.; Niki, N.; Nakano, Y.; Mishima, M.; Ohmatsu, H.; Tsuchida, T.; Eguchi, K.; Kaneko, M.; Moriyama, N.

    2011-03-01

    With the development of multi-slice CT technology, to obtain an accurate 3D image of lung field in a short time is possible. To support that, a lot of image processing methods need to be developed. In clinical setting for diagnosis of lung cancer, it is important to study and analyse lung structure. Therefore, classification of lung lobe provides useful information for lung cancer analysis. In this report, we describe algorithm which classify lungs into lung lobes for lung disease cases from multi-slice CT images. The classification algorithm of lung lobes is efficiently carried out using information of lung blood vessel, bronchus, and interlobar fissure. Applying the classification algorithms to multi-slice CT images of 20 normal cases and 5 lung disease cases, we demonstrate the usefulness of the proposed algorithms.

  19. Cross-Disciplinary Analysis of Lymph Node Classification in Lung Cancer on CT Scanning.

    PubMed

    El-Sherief, Ahmed H; Lau, Charles T; Obuchowski, Nancy A; Mehta, Atul C; Rice, Thomas W; Blackstone, Eugene H

    2017-04-01

    Accurate and consistent regional lymph node classification is an important element in the staging and multidisciplinary management of lung cancer. Regional lymph node definition sets-lymph node maps-have been created to standardize regional lymph node classification. In 2009, the International Association for the Study of Lung Cancer (IASLC) introduced a lymph node map to supersede all preexisting lymph node maps. Our aim was to study if and how lung cancer specialists apply the IASLC lymph node map when classifying thoracic lymph nodes encountered on CT scans during lung cancer staging. From April 2013 through July 2013, invitations were distributed to all members of the Fleischner Society, Society of Thoracic Radiology, General Thoracic Surgical Club, and the American Association of Bronchology and Interventional Pulmonology to participate in an anonymous online image-based and text-based 20-question survey regarding lymph node classification for lung cancer staging on CT imaging. Three hundred thirty-seven people responded (approximately 25% participation). Respondents consisted of self-reported thoracic radiologists (n = 158), thoracic surgeons (n = 102), and pulmonologists who perform endobronchial ultrasonography (n = 77). Half of the respondents (50%; 95% CI, 44%-55%) reported using the IASLC lymph node map in daily practice, with no significant differences between subspecialties. A disparity was observed between the IASLC definition sets and their interpretation and application on CT scans, in particular for lymph nodes near the thoracic inlet, anterior to the trachea, anterior to the tracheal bifurcation, near the ligamentum arteriosum, between the bronchus intermedius and esophagus, in the internal mammary space, and adjacent to the heart. Use of older lymph node maps and inconsistencies in interpretation and application of definitions in the IASLC lymph node map may potentially lead to misclassification of stage and suboptimal management of lung cancer in some patients. Published by Elsevier Inc.

  20. Automated Extraction and Classification of Cancer Stage Mentions fromUnstructured Text Fields in a Central Cancer Registry

    PubMed Central

    AAlAbdulsalam, Abdulrahman K.; Garvin, Jennifer H.; Redd, Andrew; Carter, Marjorie E.; Sweeny, Carol; Meystre, Stephane M.

    2018-01-01

    Cancer stage is one of the most important prognostic parameters in most cancer subtypes. The American Joint Com-mittee on Cancer (AJCC) specifies criteria for staging each cancer type based on tumor characteristics (T), lymph node involvement (N), and tumor metastasis (M) known as TNM staging system. Information related to cancer stage is typically recorded in clinical narrative text notes and other informal means of communication in the Electronic Health Record (EHR). As a result, human chart-abstractors (known as certified tumor registrars) have to search through volu-minous amounts of text to extract accurate stage information and resolve discordance between different data sources. This study proposes novel applications of natural language processing and machine learning to automatically extract and classify TNM stage mentions from records at the Utah Cancer Registry. Our results indicate that TNM stages can be extracted and classified automatically with high accuracy (extraction sensitivity: 95.5%–98.4% and classification sensitivity: 83.5%–87%). PMID:29888032

  1. Parenclitic Network Analysis of Methylation Data for Cancer Identification

    PubMed Central

    Karsakov, Alexander; Bartlett, Thomas; Ryblov, Artem; Meyerov, Iosif; Ivanchenko, Mikhail; Zaikin, Alexey

    2017-01-01

    We make use of ideas from the theory of complex networks to implement a machine learning classification of human DNA methylation data, that carry signatures of cancer development. The data were obtained from patients with various kinds of cancers and represented as parenclictic networks, wherein nodes correspond to genes, and edges are weighted according to pairwise variation from control group subjects. We demonstrate that for the 10 types of cancer under study, it is possible to obtain a high performance of binary classification between cancer-positive and negative samples based on network measures. Remarkably, an accuracy as high as 93−99% is achieved with only 12 network topology indices, in a dramatic reduction of complexity from the original 15295 gene methylation levels. Moreover, it was found that the parenclictic networks are scale-free in cancer-negative subjects, and deviate from the power-law node degree distribution in cancer. The node centrality ranking and arising modular structure could provide insights into the systems biology of cancer. PMID:28107365

  2. Automated Extraction and Classification of Cancer Stage Mentions fromUnstructured Text Fields in a Central Cancer Registry.

    PubMed

    AAlAbdulsalam, Abdulrahman K; Garvin, Jennifer H; Redd, Andrew; Carter, Marjorie E; Sweeny, Carol; Meystre, Stephane M

    2018-01-01

    Cancer stage is one of the most important prognostic parameters in most cancer subtypes. The American Joint Com-mittee on Cancer (AJCC) specifies criteria for staging each cancer type based on tumor characteristics (T), lymph node involvement (N), and tumor metastasis (M) known as TNM staging system. Information related to cancer stage is typically recorded in clinical narrative text notes and other informal means of communication in the Electronic Health Record (EHR). As a result, human chart-abstractors (known as certified tumor registrars) have to search through volu-minous amounts of text to extract accurate stage information and resolve discordance between different data sources. This study proposes novel applications of natural language processing and machine learning to automatically extract and classify TNM stage mentions from records at the Utah Cancer Registry. Our results indicate that TNM stages can be extracted and classified automatically with high accuracy (extraction sensitivity: 95.5%-98.4% and classification sensitivity: 83.5%-87%).

  3. An NRG Oncology/GOG study of molecular classification for risk prediction in endometrioid endometrial cancer

    PubMed Central

    Cosgrove, Casey M; Tritchler, David L; Cohn, David E; Mutch, David G; Rush, Craig M; Lankes, Heather A; Creasman, William T.; Miller, David S; Ramirez, Nilsa C; Geller, Melissa A; Powell, Matthew A; Backes, Floor J; Landrum, Lisa M; Timmers, Cynthia; Suarez, Adrian A; Zaino, Richard J; Pearl, Michael L; DiSilvestro, Paul A; Lele, Shashikant B; Goodfellow, Paul J

    2017-01-01

    Objectives The purpose of this study was to assess the prognostic significance of a simplified, clinically accessible classification system for endometrioid endometrial cancers combining Lynch syndrome screening and molecular risk stratification. Methods Tumors from NRG/GOG GOG210 were evaluated for mismatch repair defects (MSI, MMR IHC, and MLH1 methylation), POLE mutations, and loss of heterozygosity. TP53 was evaluated in a subset of cases. Tumors were assigned to four molecular classes. Relationships between molecular classes and clinicopathologic variables were assessed using contingency tests and Cox proportional methods. Results Molecular classification was successful for 982 tumors. Based on the NCI consensus MSI panel assessing MSI and loss of heterozygosity combined with POLE testing, 49% of tumors were classified copy number stable (CNS), 39% MMR deficient, 8% copy number altered (CNA) and 4% POLE mutant. Cancer-specific mortality occurred in 5% of patients with CNS tumors; 2.6% with POLE tumors; 7.6% with MMR deficient tumors and 19% with CNA tumors. The CNA group had worse progression-free (HR 2.31, 95%CI 1.53–3.49) and cancer-specific survival (HR 3.95; 95%CI 2.10–7.44). The POLE group had improved outcomes, but the differences were not statistically significant. CNA class remained significant for cancer-specific survival (HR 2.11; 95%CI 1.04–4.26) in multivariable analysis. The CNA molecular class was associated with TP53 mutation and expression status. Conclusions A simple molecular classification for endometrioid endometrial cancers that can be easily combined with Lynch syndrome screening provides important prognostic information. These findings support prospective clinical validation and further studies on the predictive value of a simplified molecular classification system. PMID:29132872

  4. Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks | Center for Cancer Research

    Cancer.gov

    The purpose of this study was to develop a method of classifying cancers to specific diagnostic categories based on their gene expression signatures using artificial neural networks (ANNs). We trained the ANNs using the small, round blue-cell tumors (SRBCTs) as a model. These cancers belong to four distinct diagnostic categories and often present diagnostic dilemmas in

  5. 3D texture analysis for classification of second harmonic generation images of human ovarian cancer

    NASA Astrophysics Data System (ADS)

    Wen, Bruce; Campbell, Kirby R.; Tilbury, Karissa; Nadiarnykh, Oleg; Brewer, Molly A.; Patankar, Manish; Singh, Vikas; Eliceiri, Kevin. W.; Campagnola, Paul J.

    2016-10-01

    Remodeling of the collagen architecture in the extracellular matrix (ECM) has been implicated in ovarian cancer. To quantify these alterations we implemented a form of 3D texture analysis to delineate the fibrillar morphology observed in 3D Second Harmonic Generation (SHG) microscopy image data of normal (1) and high risk (2) ovarian stroma, benign ovarian tumors (3), low grade (4) and high grade (5) serous tumors, and endometrioid tumors (6). We developed a tailored set of 3D filters which extract textural features in the 3D image sets to build (or learn) statistical models of each tissue class. By applying k-nearest neighbor classification using these learned models, we achieved 83-91% accuracies for the six classes. The 3D method outperformed the analogous 2D classification on the same tissues, where we suggest this is due the increased information content. This classification based on ECM structural changes will complement conventional classification based on genetic profiles and can serve as an additional biomarker. Moreover, the texture analysis algorithm is quite general, as it does not rely on single morphological metrics such as fiber alignment, length, and width but their combined convolution with a customizable basis set.

  6. The Consensus Molecular Subtypes of Colorectal Cancer

    PubMed Central

    Guinney, Justin; Dienstmann, Rodrigo; Wang, Xin; de Reyniès, Aurélien; Schlicker, Andreas; Soneson, Charlotte; Marisa, Laetitia; Roepman, Paul; Nyamundanda, Gift; Angelino, Paolo; Bot, Brian M.; Morris, Jeffrey S.; Simon, Iris M.; Gerster, Sarah; Fessler, Evelyn; de Sousa e Melo, Felipe; Missiaglia, Edoardo; Ramay, Hena; Barras, David; Homicsko, Krisztian; Maru, Dipen; Manyam, Ganiraju C.; Broom, Bradley; Boige, Valerie; Perez-Villamil, Beatriz; Laderas, Ted; Salazar, Ramon; Gray, Joe W.; Hanahan, Douglas; Tabernero, Josep; Bernards, Rene; Friend, Stephen H.; Laurent-Puig, Pierre; Medema, Jan Paul; Sadanandam, Anguraj; Wessels, Lodewyk; Delorenzi, Mauro; Kopetz, Scott; Vermeulen, Louis; Tejpar, Sabine

    2015-01-01

    Colorectal cancer (CRC) is a frequently lethal disease with heterogeneous outcomes and drug responses. To resolve inconsistencies among the reported gene expression–based CRC classifications and facilitate clinical translation, we formed an international consortium dedicated to large-scale data sharing and analytics across expert groups. We show marked interconnectivity between six independent classification systems coalescing into four consensus molecular subtypes (CMS) with distinguishing features: CMS1 (MSI Immune, 14%), hypermutated, microsatellite unstable, strong immune activation; CMS2 (Canonical, 37%), epithelial, chromosomally unstable, marked WNT and MYC signaling activation; CMS3 (Metabolic, 13%), epithelial, evident metabolic dysregulation; and CMS4 (Mesenchymal, 23%), prominent transforming growth factor β activation, stromal invasion, and angiogenesis. Samples with mixed features (13%) possibly represent a transition phenotype or intra-tumoral heterogeneity. We consider the CMS groups the most robust classification system currently available for CRC – with clear biological interpretability – and the basis for future clinical stratification and subtype–based targeted interventions. PMID:26457759

  7. A measure of association for ordered categorical data in population-based studies

    PubMed Central

    Nelson, Kerrie P; Edwards, Don

    2016-01-01

    Ordinal classification scales are commonly used to define a patient’s disease status in screening and diagnostic tests such as mammography. Challenges arise in agreement studies when evaluating the association between many raters’ classifications of patients’ disease or health status when an ordered categorical scale is used. In this paper, we describe a population-based approach and chance-corrected measure of association to evaluate the strength of relationship between multiple raters’ ordinal classifications where any number of raters can be accommodated. In contrast to Shrout and Fleiss’ intraclass correlation coefficient, the proposed measure of association is invariant with respect to changes in disease prevalence. We demonstrate how unique characteristics of individual raters can be explored using random effects. Simulation studies are conducted to demonstrate the properties of the proposed method under varying assumptions. The methods are applied to two large-scale agreement studies of breast cancer screening and prostate cancer severity. PMID:27184590

  8. [A systematic review of worldwide natural history models of colorectal cancer: classification, transition rate and a recommendation for developing Chinese population-specific model].

    PubMed

    Li, Z F; Huang, H Y; Shi, J F; Guo, C G; Zou, S M; Liu, C C; Wang, Y; Wang, L; Zhu, S L; Wu, S L; Dai, M

    2017-02-10

    Objective: To review the worldwide studies on natural history models among colorectal cancer (CRC), and to inform building a Chinese population-specific CRC model and developing a platform for further evaluation of CRC screening and other interventions in population in China. Methods: A structured literature search process was conducted in PubMed and the target publication dates were from January 1995 to December 2014. Information about classification systems on both colorectal cancer and precancer on corresponding transition rate, were extracted and summarized. Indicators were mainly expressed by the medians and ranges of annual progression or regression rate. Results: A total of 24 studies were extracted from 1 022 studies, most were from America ( n =9), but 2 from China including 1 from the mainland area, mainly based on Markov model ( n =22). Classification systems for adenomas included progression risk ( n =9) and the sizes of adenoma ( n =13, divided into two ways) as follows: 1) Based on studies where adenoma was risk-dependent, the median annual transition rates, from ' normal status' to ' non-advanced adenoma', 'non-advanced' to ' advanced' and ' advanced adenoma' to CRC were 0.016 0 (range: 0.002 2-0.020 0), 0.020 (range: 0.002-0.177) and 0.044 (range: 0.005-0.063), respectively. 2) Median annual transition rates, based on studies where adenoma were classified by sizes, into <10 mm and ≥10 mm ( n =7), from ' normal' to adenoma <10 mm, from adenoma <10 mm to adenoma ≥10 mm and adenoma ≥ 10 mm to CRC, were 0.016 7 (range: 0.015 0-0.037 0), 0.020 (range: 0.015-0.035) and 0.040 0 (range: 0.008 5-0.050 0), respectively. 3) Median annual transition rates, based on studies where adenoma, were classified by sizes into diminutive (≤5 mm), small (6-9 mm) and large adenoma (≥10 mm) ( n =6), from ' normal' to diminutive adenoma,'diminutive' to ' small','small' to ' large', and large adenoma to CRC were 0.013 (range: 0.009-0.019), 0.043 (range: 0.020-0.085), 0.044 (range: 0.020-0.125) and 0.033 5 (range: 0.030-0.040), respectively. Staging system of CRC mainly included LRD (localized/regional/distant, n =10), Dukes' ( n =7) and TNM ( n =3). When using the LRD classification, the median annual transition rates from ' localized' to ' regional' and ' regional' to 'distant' were 0.28 (range: 0.20-0.33) and 0.40 (range: 0.24-0.63), respectively. Under the Dukes' classification, the median annual transition rates appeared as 0.583 (range: 0.050-0.910), 0.656 (range: 0.280-0.720) and 0.830 (range: 0.630-0.865) from Dukes' A to B, B to C and C to Dukes' D, respectively. Again, when using the TNM classification, very limited transition rate was reported. Serrated pathway was only described in one study. Conclusions: Studies on the natural history model of colorectal cancer was still limited worldwide. Adenoma seemed the most common status setting for precancer model, and the risk-dependent classification for adenoma was consistent with the most commonly used system in clinical practice as well as major cancer screening programs in China. Since the staging systems of cancers varied, and shortage of transition rates based on TNM classification (commonly used in China), there will be a challenge for building Chinese population-specific natural history model of colorectal cancer, information from other classification systems could be conditionally applied.

  9. A deep learning-based multi-model ensemble method for cancer prediction.

    PubMed

    Xiao, Yawen; Wu, Jun; Lin, Zongli; Zhao, Xiaodong

    2018-01-01

    Cancer is a complex worldwide health problem associated with high mortality. With the rapid development of the high-throughput sequencing technology and the application of various machine learning methods that have emerged in recent years, progress in cancer prediction has been increasingly made based on gene expression, providing insight into effective and accurate treatment decision making. Thus, developing machine learning methods, which can successfully distinguish cancer patients from healthy persons, is of great current interest. However, among the classification methods applied to cancer prediction so far, no one method outperforms all the others. In this paper, we demonstrate a new strategy, which applies deep learning to an ensemble approach that incorporates multiple different machine learning models. We supply informative gene data selected by differential gene expression analysis to five different classification models. Then, a deep learning method is employed to ensemble the outputs of the five classifiers. The proposed deep learning-based multi-model ensemble method was tested on three public RNA-seq data sets of three kinds of cancers, Lung Adenocarcinoma, Stomach Adenocarcinoma and Breast Invasive Carcinoma. The test results indicate that it increases the prediction accuracy of cancer for all the tested RNA-seq data sets as compared to using a single classifier or the majority voting algorithm. By taking full advantage of different classifiers, the proposed deep learning-based multi-model ensemble method is shown to be accurate and effective for cancer prediction. Copyright © 2017 Elsevier B.V. All rights reserved.

  10. Comparison of the current AJCC-TNM numeric-based with a new anatomical location-based lymph node staging system for gastric cancer: A western experience.

    PubMed

    Galizia, Gennaro; Lieto, Eva; Auricchio, Annamaria; Cardella, Francesca; Mabilia, Andrea; Diana, Anna; Castellano, Paolo; De Vita, Ferdinando; Orditura, Michele

    2017-01-01

    In gastric cancer, the current AJCC numeric-based lymph node staging does not provide information on the anatomical extent of the disease and lymphadenectomy. A new anatomical location-based node staging, proposed by Choi, has shown better prognostic performance, thus soliciting Western world validation. Data from 284 gastric cancers undergoing radical surgery at the Second University of Naples from 2000 to 2014 were reviewed. The lymph nodes were reclassified into three groups (lesser and greater curvature, and extraperigastric nodes); presence of any metastatic lymph node in a given group was considered positive, prompting a new N and TNM stage classification. Receiver-operating-characteristic (ROC) curves for censored survival data and bootstrap methods were used to compare the capability of the two models to predict tumor recurrence. More than one third of node positive patients were reclassified into different N and TNM stages by the new system. Compared to the current staging system, the new classification significantly correlated with tumor recurrence rates and displayed improved indices of prognostic performance, such as the Bayesian information criterion and the Harrell C-index. Higher values at survival ROC analysis demonstrated a significantly better stratification of patients by the new system, mostly in the early phase of the follow-up, with a worse prognosis in more advanced new N stages, despite the same current N stage. This study suggests that the anatomical location-based classification of lymph node metastasis may be an important tool for gastric cancer prognosis and should be considered for future revision of the TNM staging system.

  11. Toward automated classification of consumers' cancer-related questions with a new taxonomy of expected answer types.

    PubMed

    McRoy, Susan; Jones, Sean; Kurmally, Adam

    2016-09-01

    This article examines methods for automated question classification applied to cancer-related questions that people have asked on the web. This work is part of a broader effort to provide automated question answering for health education. We created a new corpus of consumer-health questions related to cancer and a new taxonomy for those questions. We then compared the effectiveness of different statistical methods for developing classifiers, including weighted classification and resampling. Basic methods for building classifiers were limited by the high variability in the natural distribution of questions and typical refinement approaches of feature selection and merging categories achieved only small improvements to classifier accuracy. Best performance was achieved using weighted classification and resampling methods, the latter yielding an accuracy of F1 = 0.963. Thus, it would appear that statistical classifiers can be trained on natural data, but only if natural distributions of classes are smoothed. Such classifiers would be useful for automated question answering, for enriching web-based content, or assisting clinical professionals to answer questions. © The Author(s) 2015.

  12. Laser Raman detection for oral cancer based on an adaptive Gaussian process classification method with posterior probabilities

    NASA Astrophysics Data System (ADS)

    Du, Zhanwei; Yang, Yongjian; Bai, Yuan; Wang, Lijun; Su, Le; Chen, Yong; Li, Xianchang; Zhou, Xiaodong; Jia, Jun; Shen, Aiguo; Hu, Jiming

    2013-03-01

    The existing methods for early and differential diagnosis of oral cancer are limited due to the unapparent early symptoms and the imperfect imaging examination methods. In this paper, the classification models of oral adenocarcinoma, carcinoma tissues and a control group with just four features are established by utilizing the hybrid Gaussian process (HGP) classification algorithm, with the introduction of the mechanisms of noise reduction and posterior probability. HGP shows much better performance in the experimental results. During the experimental process, oral tissues were divided into three groups, adenocarcinoma (n = 87), carcinoma (n = 100) and the control group (n = 134). The spectral data for these groups were collected. The prospective application of the proposed HGP classification method improved the diagnostic sensitivity to 56.35% and the specificity to about 70.00%, and resulted in a Matthews correlation coefficient (MCC) of 0.36. It is proved that the utilization of HGP in LRS detection analysis for the diagnosis of oral cancer gives accurate results. The prospect of application is also satisfactory.

  13. Improved Sparse Multi-Class SVM and Its Application for Gene Selection in Cancer Classification

    PubMed Central

    Huang, Lingkang; Zhang, Hao Helen; Zeng, Zhao-Bang; Bushel, Pierre R.

    2013-01-01

    Background Microarray techniques provide promising tools for cancer diagnosis using gene expression profiles. However, molecular diagnosis based on high-throughput platforms presents great challenges due to the overwhelming number of variables versus the small sample size and the complex nature of multi-type tumors. Support vector machines (SVMs) have shown superior performance in cancer classification due to their ability to handle high dimensional low sample size data. The multi-class SVM algorithm of Crammer and Singer provides a natural framework for multi-class learning. Despite its effective performance, the procedure utilizes all variables without selection. In this paper, we propose to improve the procedure by imposing shrinkage penalties in learning to enforce solution sparsity. Results The original multi-class SVM of Crammer and Singer is effective for multi-class classification but does not conduct variable selection. We improved the method by introducing soft-thresholding type penalties to incorporate variable selection into multi-class classification for high dimensional data. The new methods were applied to simulated data and two cancer gene expression data sets. The results demonstrate that the new methods can select a small number of genes for building accurate multi-class classification rules. Furthermore, the important genes selected by the methods overlap significantly, suggesting general agreement among different variable selection schemes. Conclusions High accuracy and sparsity make the new methods attractive for cancer diagnostics with gene expression data and defining targets of therapeutic intervention. Availability: The source MATLAB code are available from http://math.arizona.edu/~hzhang/software.html. PMID:23966761

  14. Classification of hepatocellular carcinoma stages from free-text clinical and radiology reports

    PubMed Central

    Yim, Wen-wai; Kwan, Sharon W; Johnson, Guy; Yetisgen, Meliha

    2017-01-01

    Cancer stage information is important for clinical research. However, they are not always explicitly noted in electronic medical records. In this paper, we present our work on automatic classification of hepatocellular carcinoma (HCC) stages from free-text clinical and radiology notes. To accomplish this, we defined 11 stage parameters used in the three HCC staging systems, American Joint Committee on Cancer (AJCC), Barcelona Clinic Liver Cancer (BCLC), and Cancer of the Liver Italian Program (CLIP). After aggregating stage parameters to the patient-level, the final stage classifications were achieved using an expert-created decision logic. Each stage parameter relevant for staging was extracted using several classification methods, e.g. sentence classification and automatic information structuring, to identify and normalize text as cancer stage parameter values. Stage parameter extraction for the test set performed at 0.81 F1. Cancer stage prediction for AJCC, BCLC, and CLIP stage classifications were 0.55, 0.50, and 0.43 F1.

  15. Cancer Biochemistry and Host-Tumor Interactions: A Decimal Classification, (Categories 51.6, 51.7, and 51.8).

    ERIC Educational Resources Information Center

    Schneider, John H.

    This is a hierarchical decimal classification of information related to cancer biochemistry, to host-tumor interactions (including cancer immunology), and to occurrence of cancer in special types of animals and plants. It is a working draft of categories taken from an extensive classification of many fields of biomedical information. Because the…

  16. The wisdom of the commons: ensemble tree classifiers for prostate cancer prognosis.

    PubMed

    Koziol, James A; Feng, Anne C; Jia, Zhenyu; Wang, Yipeng; Goodison, Seven; McClelland, Michael; Mercola, Dan

    2009-01-01

    Classification and regression trees have long been used for cancer diagnosis and prognosis. Nevertheless, instability and variable selection bias, as well as overfitting, are well-known problems of tree-based methods. In this article, we investigate whether ensemble tree classifiers can ameliorate these difficulties, using data from two recent studies of radical prostatectomy in prostate cancer. Using time to progression following prostatectomy as the relevant clinical endpoint, we found that ensemble tree classifiers robustly and reproducibly identified three subgroups of patients in the two clinical datasets: non-progressors, early progressors and late progressors. Moreover, the consensus classifications were independent predictors of time to progression compared to known clinical prognostic factors.

  17. Applying Data Mining Techniques to Improve Breast Cancer Diagnosis.

    PubMed

    Diz, Joana; Marreiros, Goreti; Freitas, Alberto

    2016-09-01

    In the field of breast cancer research, and more than ever, new computer aided diagnosis based systems have been developed aiming to reduce diagnostic tests false-positives. Within this work, we present a data mining based approach which might support oncologists in the process of breast cancer classification and diagnosis. The present study aims to compare two breast cancer datasets and find the best methods in predicting benign/malignant lesions, breast density classification, and even for finding identification (mass / microcalcification distinction). To carry out these tasks, two matrices of texture features extraction were implemented using Matlab, and classified using data mining algorithms, on WEKA. Results revealed good percentages of accuracy for each class: 89.3 to 64.7 % - benign/malignant; 75.8 to 78.3 % - dense/fatty tissue; 71.0 to 83.1 % - finding identification. Among the different tests classifiers, Naive Bayes was the best to identify masses texture, and Random Forests was the first or second best classifier for the majority of tested groups.

  18. Computed tomographic atlas for the new international lymph node map for lung cancer: A radiation oncologist perspective.

    PubMed

    Lynch, Rod; Pitson, Graham; Ball, David; Claude, Line; Sarrut, David

    2013-01-01

    To develop a reproducible definition for each mediastinal lymph node station based on the new TNM classification for lung cancer. This paper proposes an atlas using the new international lymph node map used in the seventh edition of the TNM classification for lung cancer. Four radiation oncologists and 1 diagnostic radiologist were involved in the project to put forward a reproducible radiologic description for the lung lymph node stations. The International Association for the Study of Lung Cancer lymph node definitions for stations 1 to 11 have been described and illustrated on axial computed tomographic scan images using a certified radiotherapy planning system. This atlas will assist both diagnostic radiologists and radiation oncologists in accurately defining the lymph node stations on computed tomographic scan in patients diagnosed with lung cancer. Copyright © 2013 American Society for Radiation Oncology. Published by Elsevier Inc. All rights reserved.

  19. Deep learning based tissue analysis predicts outcome in colorectal cancer.

    PubMed

    Bychkov, Dmitrii; Linder, Nina; Turkki, Riku; Nordling, Stig; Kovanen, Panu E; Verrill, Clare; Walliander, Margarita; Lundin, Mikael; Haglund, Caj; Lundin, Johan

    2018-02-21

    Image-based machine learning and deep learning in particular has recently shown expert-level accuracy in medical image classification. In this study, we combine convolutional and recurrent architectures to train a deep network to predict colorectal cancer outcome based on images of tumour tissue samples. The novelty of our approach is that we directly predict patient outcome, without any intermediate tissue classification. We evaluate a set of digitized haematoxylin-eosin-stained tumour tissue microarray (TMA) samples from 420 colorectal cancer patients with clinicopathological and outcome data available. The results show that deep learning-based outcome prediction with only small tissue areas as input outperforms (hazard ratio 2.3; CI 95% 1.79-3.03; AUC 0.69) visual histological assessment performed by human experts on both TMA spot (HR 1.67; CI 95% 1.28-2.19; AUC 0.58) and whole-slide level (HR 1.65; CI 95% 1.30-2.15; AUC 0.57) in the stratification into low- and high-risk patients. Our results suggest that state-of-the-art deep learning techniques can extract more prognostic information from the tissue morphology of colorectal cancer than an experienced human observer.

  20. Comparison of clinical and survival characteristics between prostate cancer patients of PSA-based screening and clinical diagnosis in China.

    PubMed

    Xu, Libo; Wang, Jinguo; Guo, Baofeng; Zhang, Haixia; Wang, Kaichen; Wang, Ding; Dai, Chang; Zhang, Ling; Zhao, Xuejian

    2018-01-02

    Prostate-specific antigen (PSA)-based mass screening remains the most controversial topic in prostate cancer. PSA-based mass screening has not been widely used in China yet. The aim of our study was to evaluate the effect of the PSA-based screening in China. The cohort consisted of 1,012 prostate cancer patients. Data were retrospectively collected and clinical characteristics of the cohorts were investigated. Survival was analyzed for prostatic carcinoma of both PSA screened and clinically diagnosed patients according to clinical characteristics and the National Comprehensive Cancer Network (NCCN) risk classification. Cox Proportional Hazards Model analysis was done for risk predictor identification. The median age was 71 years old. Five-year overall and prostate-cancer-specific survival in prostatic adenocarcinoma patients were 77.52% and 79.65%; 10-year survivals were 62.57% and 68.60%, respectively. Survival was significantly poorer in patients with metastases and non-curative management. T staging and Gleason score by NCCN classification effectively stratified prostatic adenocarcinoma patients into different risk groups. T staging was a significant predictor of survival by COX Proportional Hazard Model. PSA screened patients had a significantly higher percentage diagnosed in early stage. PSA screened prostatic adenocarcinoma patients had a better prognosis in both overall and prostate cancer-specific survivals. This Chinese cohort had a lower overall and prostate cancer survival rate than it is reported in western countries. The incidence of early-stage prostate cancer found in PSA-based mass screening was high and there were significant differences in both overall and prostate cancer-specific survival between the PSA-screened and clinically diagnosed patients.

  1. Comparison of clinical and survival characteristics between prostate cancer patients of PSA-based screening and clinical diagnosis in China

    PubMed Central

    Xu, Libo; Wang, Jinguo; Guo, Baofeng; Zhang, Haixia; Wang, Kaichen; Wang, Ding; Dai, Chang; Zhang, Ling; Zhao, Xuejian

    2018-01-01

    Prostate-specific antigen (PSA)-based mass screening remains the most controversial topic in prostate cancer. PSA-based mass screening has not been widely used in China yet. The aim of our study was to evaluate the effect of the PSA-based screening in China. The cohort consisted of 1,012 prostate cancer patients. Data were retrospectively collected and clinical characteristics of the cohorts were investigated. Survival was analyzed for prostatic carcinoma of both PSA screened and clinically diagnosed patients according to clinical characteristics and the National Comprehensive Cancer Network (NCCN) risk classification. Cox Proportional Hazards Model analysis was done for risk predictor identification. The median age was 71 years old. Five-year overall and prostate-cancer-specific survival in prostatic adenocarcinoma patients were 77.52% and 79.65%; 10-year survivals were 62.57% and 68.60%, respectively. Survival was significantly poorer in patients with metastases and non-curative management. T staging and Gleason score by NCCN classification effectively stratified prostatic adenocarcinoma patients into different risk groups. T staging was a significant predictor of survival by COX Proportional Hazard Model. PSA screened patients had a significantly higher percentage diagnosed in early stage. PSA screened prostatic adenocarcinoma patients had a better prognosis in both overall and prostate cancer-specific survivals. This Chinese cohort had a lower overall and prostate cancer survival rate than it is reported in western countries. The incidence of early-stage prostate cancer found in PSA-based mass screening was high and there were significant differences in both overall and prostate cancer-specific survival between the PSA-screened and clinically diagnosed patients. PMID:29416625

  2. Performance of Four Frailty Classifications in Older Patients With Cancer: Prospective Elderly Cancer Patients Cohort Study.

    PubMed

    Ferrat, Emilie; Paillaud, Elena; Caillet, Philippe; Laurent, Marie; Tournigand, Christophe; Lagrange, Jean-Léon; Droz, Jean-Pierre; Balducci, Lodovico; Audureau, Etienne; Canouï-Poitrine, Florence; Bastuji-Garin, Sylvie

    2017-03-01

    Purpose Frailty classifications of older patients with cancer have been developed to assist physicians in selecting cancer treatments and geriatric interventions. They have not been compared, and their performance in predicting outcomes has not been assessed. Our objectives were to assess agreement among four classifications and to compare their predictive performance in a large cohort of in- and outpatients with various cancers. Patients and Methods We prospectively included 1,021 patients age 70 years or older who had solid or hematologic malignancies and underwent a geriatric assessment in one of two French teaching hospitals between 2007 and 2012. Among them, 763 were assessed using four classifications: Balducci, International Society of Geriatric Oncology (SIOG) 1, SIOG2, and a latent class typology. Agreement was assessed using the κ statistic. Outcomes were 1-year mortality and 6-month unscheduled admissions. Results All four classifications had good discrimination for 1-year mortality (C-index ≥ 0.70); discrimination was best with SIOG1. For 6-month unscheduled admissions, discrimination was good with all four classifications (C-index ≥ 0.70). For classification into three (fit, vulnerable, or frail) or two categories (fit v vulnerable or frail and fit or vulnerable v frail), agreement among the four classifications ranged from very poor (κ ≤ 0.20) to good (0.60 < κ ≤ 0.80). Agreement was best between SIOG1 and the latent class typology and between SIOG1 and Balducci. Conclusion These four frailty classifications have good prognostic performance among older in- and outpatients with various cancers. They may prove useful in decision making about cancer treatments and geriatric interventions and/or in stratifying older patients with cancer in clinical trials.

  3. Ultrasonographic characteristics and BI-RADS-US classification of BRCA1 mutation-associated breast cancer in Guangxi, China.

    PubMed

    Li, Cheng; Liu, Junjie; Wang, Sida; Chen, Yuanyuan; Yuan, Zhigang; Zeng, Jian; Li, Zhixian

    2015-01-01

    To retrospectively analyze and compare the ultrasonographic characteristics and BI-RADS-US classification between patients with BRCA1 mutation-associated breast cancer and those without BRCA1 gene mutation in Guangxi, China. The study was performed in 36 lesions from 34 BRCA1 mutation-associated breast cancer patients. A total of 422 lesions from 422 breast cancer patients without BRCA1 mutations served as control group. The comparison of the ultrasonographic features and BI-RADS-US classification between two the groups were reviewed. More complex inner echo was disclosed in BRCA1 mutation-associated breast cancer patients (x(2) = 4.741, P = 0.029). The BI-RADS classification of BRCA1 mutation-associated breast cancer was lower (U = 6094.0, P = 0.022). BRCA1 mutation-associated breast cancer frequently displays as microlobulated margin and complex echo. It also shows more benign characteristics in morphology, and the BI-RADS classification is prone to be underestimated.

  4. Metabolic profiles are principally different between cancers of the liver, pancreas and breast.

    PubMed

    Budhu, Anuradha; Terunuma, Atsushi; Zhang, Geng; Hussain, S Perwez; Ambs, Stefan; Wang, Xin Wei

    2014-01-01

    Molecular profiling of primary tumors may facilitate the classification of patients with cancer into more homogenous biological groups to aid clinical management. Metabolomic profiling has been shown to be a powerful tool in characterizing the biological mechanisms underlying a disease but has not been evaluated for its ability to classify cancers by their tissue of origin. Thus, we assessed metabolomic profiling as a novel tool for multiclass cancer characterization. Global metabolic profiling was employed to identify metabolites in paired tumor and non-tumor liver (n=60), breast (n=130) and pancreatic (n=76) tissue specimens. Unsupervised principal component analysis showed that metabolites are principally unique to each tissue and cancer type. Such a difference can also be observed even among early stage cancers, suggesting a significant and unique alteration of global metabolic pathways associated with each cancer type. Our global high-throughput metabolomic profiling study shows that specific biochemical alterations distinguish liver, pancreatic and breast cancer and could be applied as cancer classification tools to differentiate tumors based on tissue of origin.

  5. The impact of OCR accuracy on automated cancer classification of pathology reports.

    PubMed

    Zuccon, Guido; Nguyen, Anthony N; Bergheim, Anton; Wickman, Sandra; Grayson, Narelle

    2012-01-01

    To evaluate the effects of Optical Character Recognition (OCR) on the automatic cancer classification of pathology reports. Scanned images of pathology reports were converted to electronic free-text using a commercial OCR system. A state-of-the-art cancer classification system, the Medical Text Extraction (MEDTEX) system, was used to automatically classify the OCR reports. Classifications produced by MEDTEX on the OCR versions of the reports were compared with the classification from a human amended version of the OCR reports. The employed OCR system was found to recognise scanned pathology reports with up to 99.12% character accuracy and up to 98.95% word accuracy. Errors in the OCR processing were found to minimally impact on the automatic classification of scanned pathology reports into notifiable groups. However, the impact of OCR errors is not negligible when considering the extraction of cancer notification items, such as primary site, histological type, etc. The automatic cancer classification system used in this work, MEDTEX, has proven to be robust to errors produced by the acquisition of freetext pathology reports from scanned images through OCR software. However, issues emerge when considering the extraction of cancer notification items.

  6. Hierarchical Gene Selection and Genetic Fuzzy System for Cancer Microarray Data Classification

    PubMed Central

    Nguyen, Thanh; Khosravi, Abbas; Creighton, Douglas; Nahavandi, Saeid

    2015-01-01

    This paper introduces a novel approach to gene selection based on a substantial modification of analytic hierarchy process (AHP). The modified AHP systematically integrates outcomes of individual filter methods to select the most informative genes for microarray classification. Five individual ranking methods including t-test, entropy, receiver operating characteristic (ROC) curve, Wilcoxon and signal to noise ratio are employed to rank genes. These ranked genes are then considered as inputs for the modified AHP. Additionally, a method that uses fuzzy standard additive model (FSAM) for cancer classification based on genes selected by AHP is also proposed in this paper. Traditional FSAM learning is a hybrid process comprising unsupervised structure learning and supervised parameter tuning. Genetic algorithm (GA) is incorporated in-between unsupervised and supervised training to optimize the number of fuzzy rules. The integration of GA enables FSAM to deal with the high-dimensional-low-sample nature of microarray data and thus enhance the efficiency of the classification. Experiments are carried out on numerous microarray datasets. Results demonstrate the performance dominance of the AHP-based gene selection against the single ranking methods. Furthermore, the combination of AHP-FSAM shows a great accuracy in microarray data classification compared to various competing classifiers. The proposed approach therefore is useful for medical practitioners and clinicians as a decision support system that can be implemented in the real medical practice. PMID:25823003

  7. Hierarchical gene selection and genetic fuzzy system for cancer microarray data classification.

    PubMed

    Nguyen, Thanh; Khosravi, Abbas; Creighton, Douglas; Nahavandi, Saeid

    2015-01-01

    This paper introduces a novel approach to gene selection based on a substantial modification of analytic hierarchy process (AHP). The modified AHP systematically integrates outcomes of individual filter methods to select the most informative genes for microarray classification. Five individual ranking methods including t-test, entropy, receiver operating characteristic (ROC) curve, Wilcoxon and signal to noise ratio are employed to rank genes. These ranked genes are then considered as inputs for the modified AHP. Additionally, a method that uses fuzzy standard additive model (FSAM) for cancer classification based on genes selected by AHP is also proposed in this paper. Traditional FSAM learning is a hybrid process comprising unsupervised structure learning and supervised parameter tuning. Genetic algorithm (GA) is incorporated in-between unsupervised and supervised training to optimize the number of fuzzy rules. The integration of GA enables FSAM to deal with the high-dimensional-low-sample nature of microarray data and thus enhance the efficiency of the classification. Experiments are carried out on numerous microarray datasets. Results demonstrate the performance dominance of the AHP-based gene selection against the single ranking methods. Furthermore, the combination of AHP-FSAM shows a great accuracy in microarray data classification compared to various competing classifiers. The proposed approach therefore is useful for medical practitioners and clinicians as a decision support system that can be implemented in the real medical practice.

  8. Genomic Classification of Tumors | Center for Cancer Research

    Cancer.gov

    CCR investigators were among the first to classify tumors based on genetics, laying the groundwork for today’s common practice to molecularly characterize tumors based on their genetic fingerprints for personalized treatments.

  9. On the classification techniques in data mining for microarray data classification

    NASA Astrophysics Data System (ADS)

    Aydadenta, Husna; Adiwijaya

    2018-03-01

    Cancer is one of the deadly diseases, according to data from WHO by 2015 there are 8.8 million more deaths caused by cancer, and this will increase every year if not resolved earlier. Microarray data has become one of the most popular cancer-identification studies in the field of health, since microarray data can be used to look at levels of gene expression in certain cell samples that serve to analyze thousands of genes simultaneously. By using data mining technique, we can classify the sample of microarray data thus it can be identified with cancer or not. In this paper we will discuss some research using some data mining techniques using microarray data, such as Support Vector Machine (SVM), Artificial Neural Network (ANN), Naive Bayes, k-Nearest Neighbor (kNN), and C4.5, and simulation of Random Forest algorithm with technique of reduction dimension using Relief. The result of this paper show performance measure (accuracy) from classification algorithm (SVM, ANN, Naive Bayes, kNN, C4.5, and Random Forets).The results in this paper show the accuracy of Random Forest algorithm higher than other classification algorithms (Support Vector Machine (SVM), Artificial Neural Network (ANN), Naive Bayes, k-Nearest Neighbor (kNN), and C4.5). It is hoped that this paper can provide some information about the speed, accuracy, performance and computational cost generated from each Data Mining Classification Technique based on microarray data.

  10. The Eighth Edition AJCC Cancer Staging Manual: Continuing to build a bridge from a population-based to a more "personalized" approach to cancer staging.

    PubMed

    Amin, Mahul B; Greene, Frederick L; Edge, Stephen B; Compton, Carolyn C; Gershenwald, Jeffrey E; Brookland, Robert K; Meyer, Laura; Gress, Donna M; Byrd, David R; Winchester, David P

    2017-03-01

    The American Joint Committee on Cancer (AJCC) staging manual has become the benchmark for classifying patients with cancer, defining prognosis, and determining the best treatment approaches. Many view the primary role of the tumor, lymph node, metastasis (TNM) system as that of a standardized classification system for evaluating cancer at a population level in terms of the extent of disease, both at initial presentation and after surgical treatment, and the overall impact of improvements in cancer treatment. The rapid evolution of knowledge in cancer biology and the discovery and validation of biologic factors that predict cancer outcome and response to treatment with better accuracy have led some cancer experts to question the utility of a TNM-based approach in clinical care at an individualized patient level. In the Eighth Edition of the AJCC Cancer Staging Manual, the goal of including relevant, nonanatomic (including molecular) factors has been foremost, although changes are made only when there is strong evidence for inclusion. The editorial board viewed this iteration as a proactive effort to continue to build the important bridge from a "population-based" to a more "personalized" approach to patient classification, one that forms the conceptual framework and foundation of cancer staging in the era of precision molecular oncology. The AJCC promulgates best staging practices through each new edition in an effort to provide cancer care providers with a powerful, knowledge-based resource for the battle against cancer. In this commentary, the authors highlight the overall organizational and structural changes as well as "what's new" in the Eighth Edition. It is hoped that this information will provide the reader with a better understanding of the rationale behind the aggregate proposed changes and the exciting developments in the upcoming edition. CA Cancer J Clin 2017;67:93-99. © 2017 American Cancer Society. © 2017 American Cancer Society.

  11. Biobank classification in an Australian setting.

    PubMed

    Rush, Amanda; Christiansen, Jeffrey H; Farrell, Jake P; Goode, Susan M; Scott, Rodney J; Spring, Kevin J; Byrne, Jennifer A

    2015-06-01

    In 2011, Watson and Barnes proposed a schema for classifying biobanks into 3 groups (mono-, oligo-, and poly-user), primarily based upon biospecimen access policies. We used results from a recent comprehensive survey of cancer biobanks in New South Wales, Australia to assess the applicability of this biobank classification schema in an Australian setting. Cancer biobanks were identified using publically available data, and by consulting with research managers. A comprehensive survey was developed and administered through a face-to-face setting. Data were analyzed using Microsoft Excel™ 2010 and IBM SPSS Statistics™ version 21.0. The cancer biobank cohort (n=23) represented 5 mono-user biobanks, 7 oligo-user biobanks, and 11 poly-user biobanks, and was analyzed as two groups (mono-/oligo- versus poly-user biobanks). Poly-user biobanks employed significantly more full-time equivalent staff, and were significantly more likely to have a website, share staff between biobanks, access governance support, utilize quality control measures, be aware of biobanking best practice documents, and offer staff training. Mono-/oligo-user biobanks were significantly more likely to seek advice from other biobanks. Our results further delineate a biobank classification system that is primarily based on access policy, and demonstrate its relevance in an Australian setting.

  12. Microfluidic microscopy-assisted label-free approach for cancer screening: automated microfluidic cytology for cancer screening.

    PubMed

    Jagannadh, Veerendra Kalyan; Gopakumar, G; Subrahmanyam, Gorthi R K Sai; Gorthi, Sai Siva

    2017-05-01

    Each year, about 7-8 million deaths occur due to cancer around the world. More than half of the cancer-related deaths occur in the less-developed parts of the world. Cancer mortality rate can be reduced with early detection and subsequent treatment of the disease. In this paper, we introduce a microfluidic microscopy-based cost-effective and label-free approach for identification of cancerous cells. We outline a diagnostic framework for the same and detail an instrumentation layout. We have employed classical computer vision techniques such as 2D principal component analysis-based cell type representation followed by support vector machine-based classification. Analogous to criminal face recognition systems implemented with help of surveillance cameras, a signature-based approach for cancerous cell identification using microfluidic microscopy surveillance is demonstrated. Such a platform would facilitate affordable mass screening camps in the developing countries and therefore help decrease cancer mortality rate.

  13. Colorectal Cancer and Colitis Diagnosis Using Fourier Transform Infrared Spectroscopy and an Improved K-Nearest-Neighbour Classifier.

    PubMed

    Li, Qingbo; Hao, Can; Kang, Xue; Zhang, Jialin; Sun, Xuejun; Wang, Wenbo; Zeng, Haishan

    2017-11-27

    Combining Fourier transform infrared spectroscopy (FTIR) with endoscopy, it is expected that noninvasive, rapid detection of colorectal cancer can be performed in vivo in the future. In this study, Fourier transform infrared spectra were collected from 88 endoscopic biopsy colorectal tissue samples (41 colitis and 47 cancers). A new method, viz., entropy weight local-hyperplane k-nearest-neighbor (EWHK), which is an improved version of K-local hyperplane distance nearest-neighbor (HKNN), is proposed for tissue classification. In order to avoid limiting high dimensions and small values of the nearest neighbor, the new EWHK method calculates feature weights based on information entropy. The average results of the random classification showed that the EWHK classifier for differentiating cancer from colitis samples produced a sensitivity of 81.38% and a specificity of 92.69%.

  14. Sensitivity of Breast Tumors to Oncolytic Viruses

    DTIC Science & Technology

    2006-08-01

    therapies for breast cancer based on the oncolytic virus, vesicular stomatitis virus (VSV). Studies have shown that matrix (M) protein mutants of VSV, such...more resistant to VSV-induced cytopathic effect than breast cancer cells. However, in syngeneic breast cancer system in vivo, rM51R-M virus is only...interleukin 12, breast cancer , interferon 16. SECURITY CLASSIFICATION OF: 17. LIMITATION OF ABSTRACT 18. NUMBER OF PAGES 19a. NAME OF RESPONSIBLE

  15. Comparison of the current AJCC-TNM numeric-based with a new anatomical location-based lymph node staging system for gastric cancer: A western experience

    PubMed Central

    Auricchio, Annamaria; Cardella, Francesca; Mabilia, Andrea; Diana, Anna; Castellano, Paolo; De Vita, Ferdinando; Orditura, Michele

    2017-01-01

    Background In gastric cancer, the current AJCC numeric-based lymph node staging does not provide information on the anatomical extent of the disease and lymphadenectomy. A new anatomical location-based node staging, proposed by Choi, has shown better prognostic performance, thus soliciting Western world validation. Study design Data from 284 gastric cancers undergoing radical surgery at the Second University of Naples from 2000 to 2014 were reviewed. The lymph nodes were reclassified into three groups (lesser and greater curvature, and extraperigastric nodes); presence of any metastatic lymph node in a given group was considered positive, prompting a new N and TNM stage classification. Receiver-operating-characteristic (ROC) curves for censored survival data and bootstrap methods were used to compare the capability of the two models to predict tumor recurrence. Results More than one third of node positive patients were reclassified into different N and TNM stages by the new system. Compared to the current staging system, the new classification significantly correlated with tumor recurrence rates and displayed improved indices of prognostic performance, such as the Bayesian information criterion and the Harrell C-index. Higher values at survival ROC analysis demonstrated a significantly better stratification of patients by the new system, mostly in the early phase of the follow-up, with a worse prognosis in more advanced new N stages, despite the same current N stage. Conclusions This study suggests that the anatomical location-based classification of lymph node metastasis may be an important tool for gastric cancer prognosis and should be considered for future revision of the TNM staging system. PMID:28380037

  16. Breast cancer surgery and diagnosis-related groups (DRGs): patient classification and hospital reimbursement in 11 European countries.

    PubMed

    Scheller-Kreinsen, David; Quentin, Wilm; Geissler, Alexander; Busse, Reinhard

    2013-10-01

    Researchers from eleven countries (i.e. Austria, England, Estonia, Finland, France, Germany, Ireland, Netherlands, Poland, Spain, and Sweden) compared how their DRG systems deal with breast cancer surgery patients. DRG algorithms and indicators of resource consumption were assessed for those DRGs that individually contain at least 1% of all breast cancer surgery patients. Six standardised case vignettes were defined and quasi prices according to national DRG-based hospital payment systems were ascertained. European DRG systems classify breast cancer surgery patients according to different sets of classification variables into three to seven DRGs. Quasi prices for an index case treated with partial mastectomy range from €577 in Poland to €5780 in the Netherlands. Countries award their highest payments for very different kinds of patients. Breast cancer specialists and national DRG authorities should consider how other countries' DRG systems classify breast cancer patients in order to identify potential scope for improvement and to ensure fair and appropriate reimbursement. Copyright © 2012 Elsevier Ltd. All rights reserved.

  17. The wisdom of the commons: ensemble tree classifiers for prostate cancer prognosis

    PubMed Central

    Koziol, James A.; Feng, Anne C.; Jia, Zhenyu; Wang, Yipeng; Goodison, Seven; McClelland, Michael; Mercola, Dan

    2009-01-01

    Motivation: Classification and regression trees have long been used for cancer diagnosis and prognosis. Nevertheless, instability and variable selection bias, as well as overfitting, are well-known problems of tree-based methods. In this article, we investigate whether ensemble tree classifiers can ameliorate these difficulties, using data from two recent studies of radical prostatectomy in prostate cancer. Results: Using time to progression following prostatectomy as the relevant clinical endpoint, we found that ensemble tree classifiers robustly and reproducibly identified three subgroups of patients in the two clinical datasets: non-progressors, early progressors and late progressors. Moreover, the consensus classifications were independent predictors of time to progression compared to known clinical prognostic factors. Contact: dmercola@uci.edu PMID:18628288

  18. Markerless gating for lung cancer radiotherapy based on machine learning techniques

    NASA Astrophysics Data System (ADS)

    Lin, Tong; Li, Ruijiang; Tang, Xiaoli; Dy, Jennifer G.; Jiang, Steve B.

    2009-03-01

    In lung cancer radiotherapy, radiation to a mobile target can be delivered by respiratory gating, for which we need to know whether the target is inside or outside a predefined gating window at any time point during the treatment. This can be achieved by tracking one or more fiducial markers implanted inside or near the target, either fluoroscopically or electromagnetically. However, the clinical implementation of marker tracking is limited for lung cancer radiotherapy mainly due to the risk of pneumothorax. Therefore, gating without implanted fiducial markers is a promising clinical direction. We have developed several template-matching methods for fluoroscopic marker-less gating. Recently, we have modeled the gating problem as a binary pattern classification problem, in which principal component analysis (PCA) and support vector machine (SVM) are combined to perform the classification task. Following the same framework, we investigated different combinations of dimensionality reduction techniques (PCA and four nonlinear manifold learning methods) and two machine learning classification methods (artificial neural networks—ANN and SVM). Performance was evaluated on ten fluoroscopic image sequences of nine lung cancer patients. We found that among all combinations of dimensionality reduction techniques and classification methods, PCA combined with either ANN or SVM achieved a better performance than the other nonlinear manifold learning methods. ANN when combined with PCA achieves a better performance than SVM in terms of classification accuracy and recall rate, although the target coverage is similar for the two classification methods. Furthermore, the running time for both ANN and SVM with PCA is within tolerance for real-time applications. Overall, ANN combined with PCA is a better candidate than other combinations we investigated in this work for real-time gated radiotherapy.

  19. Anatomical classification of breast sentinel lymph nodes using computed tomography-lymphography.

    PubMed

    Fujita, Tamaki; Miura, Hiroyuki; Seino, Hiroko; Ono, Shuichi; Nishi, Takashi; Nishimura, Akimasa; Hakamada, Kenichi; Aoki, Masahiko

    2018-05-03

    To evaluate the anatomical classification and location of breast sentinel lymph nodes, preoperative computed tomography-lymphography examinations were retrospectively reviewed for sentinel lymph nodes in 464 cases clinically diagnosed with node-negative breast cancer between July 2007 and June 2016. Anatomical classification was performed based on the numbers of lymphatic routes and sentinel lymph nodes, the flow direction of lymphatic routes, and the location of sentinel lymph nodes. Of the 464 cases reviewed, anatomical classification could be performed in 434 (93.5 %). The largest number of cases showed single route/single sentinel lymph node (n = 296, 68.2 %), followed by multiple routes/multiple sentinel lymph nodes (n = 59, 13.6 %), single route/multiple sentinel lymph nodes (n = 53, 12.2 %), and multiple routes/single sentinel lymph node (n = 26, 6.0 %). Classification based on the flow direction of lymphatic routes showed that 429 cases (98.8 %) had outward flow on the superficial fascia toward axillary lymph nodes, whereas classification based on the height of sentinel lymph nodes showed that 323 cases (74.4 %) belonged to the upper pectoral group of axillary lymph nodes. There was wide variation in the number of lymphatic routes and their branching patterns and in the number, location, and direction of flow of sentinel lymph nodes. It is clinically very important to preoperatively understand the anatomical morphology of lymphatic routes and sentinel lymph nodes for optimal treatment of breast cancer, and computed tomography-lymphography is suitable for this purpose.

  20. Challenging the Cancer Molecular Stratification Dogma: Intratumoral Heterogeneity Undermines Consensus Molecular Subtypes and Potential Diagnostic Value in Colorectal Cancer.

    PubMed

    Dunne, Philip D; McArt, Darragh G; Bradley, Conor A; O'Reilly, Paul G; Barrett, Helen L; Cummins, Robert; O'Grady, Tony; Arthur, Ken; Loughrey, Maurice B; Allen, Wendy L; McDade, Simon S; Waugh, David J; Hamilton, Peter W; Longley, Daniel B; Kay, Elaine W; Johnston, Patrick G; Lawler, Mark; Salto-Tellez, Manuel; Van Schaeybroeck, Sandra

    2016-08-15

    A number of independent gene expression profiling studies have identified transcriptional subtypes in colorectal cancer with potential diagnostic utility, culminating in publication of a colorectal cancer Consensus Molecular Subtype classification. The worst prognostic subtype has been defined by genes associated with stem-like biology. Recently, it has been shown that the majority of genes associated with this poor prognostic group are stromal derived. We investigated the potential for tumor misclassification into multiple diagnostic subgroups based on tumoral region sampled. We performed multiregion tissue RNA extraction/transcriptomic analysis using colorectal-specific arrays on invasive front, central tumor, and lymph node regions selected from tissue samples from 25 colorectal cancer patients. We identified a consensus 30-gene list, which represents the intratumoral heterogeneity within a cohort of primary colorectal cancer tumors. Using a series of online datasets, we showed that this gene list displays prognostic potential HR = 2.914 (confidence interval 0.9286-9.162) in stage II/III colorectal cancer patients, but in addition, we demonstrated that these genes are stromal derived, challenging the assumption that poor prognosis tumors with stem-like biology have undergone a widespread epithelial-mesenchymal transition. Most importantly, we showed that patients can be simultaneously classified into multiple diagnostically relevant subgroups based purely on the tumoral region analyzed. Gene expression profiles derived from the nonmalignant stromal region can influence assignment of colorectal cancer transcriptional subtypes, questioning the current molecular classification dogma and highlighting the need to consider pathology sampling region and degree of stromal infiltration when employing transcription-based classifiers to underpin clinical decision making in colorectal cancer. Clin Cancer Res; 22(16); 4095-104. ©2016 AACRSee related commentary by Morris and Kopetz, p. 3989. ©2016 American Association for Cancer Research.

  1. Automated texture-based identification of ovarian cancer in confocal microendoscope images

    NASA Astrophysics Data System (ADS)

    Srivastava, Saurabh; Rodriguez, Jeffrey J.; Rouse, Andrew R.; Brewer, Molly A.; Gmitro, Arthur F.

    2005-03-01

    The fluorescence confocal microendoscope provides high-resolution, in-vivo imaging of cellular pathology during optical biopsy. There are indications that the examination of human ovaries with this instrument has diagnostic implications for the early detection of ovarian cancer. The purpose of this study was to develop a computer-aided system to facilitate the identification of ovarian cancer from digital images captured with the confocal microendoscope system. To achieve this goal, we modeled the cellular-level structure present in these images as texture and extracted features based on first-order statistics, spatial gray-level dependence matrices, and spatial-frequency content. Selection of the best features for classification was performed using traditional feature selection techniques including stepwise discriminant analysis, forward sequential search, a non-parametric method, principal component analysis, and a heuristic technique that combines the results of these methods. The best set of features selected was used for classification, and performance of various machine classifiers was compared by analyzing the areas under their receiver operating characteristic curves. The results show that it is possible to automatically identify patients with ovarian cancer based on texture features extracted from confocal microendoscope images and that the machine performance is superior to that of the human observer.

  2. Co-trained convolutional neural networks for automated detection of prostate cancer in multi-parametric MRI.

    PubMed

    Yang, Xin; Liu, Chaoyue; Wang, Zhiwei; Yang, Jun; Min, Hung Le; Wang, Liang; Cheng, Kwang-Ting Tim

    2017-12-01

    Multi-parameter magnetic resonance imaging (mp-MRI) is increasingly popular for prostate cancer (PCa) detection and diagnosis. However, interpreting mp-MRI data which typically contains multiple unregistered 3D sequences, e.g. apparent diffusion coefficient (ADC) and T2-weighted (T2w) images, is time-consuming and demands special expertise, limiting its usage for large-scale PCa screening. Therefore, solutions to computer-aided detection of PCa in mp-MRI images are highly desirable. Most recent advances in automated methods for PCa detection employ a handcrafted feature based two-stage classification flow, i.e. voxel-level classification followed by a region-level classification. This work presents an automated PCa detection system which can concurrently identify the presence of PCa in an image and localize lesions based on deep convolutional neural network (CNN) features and a single-stage SVM classifier. Specifically, the developed co-trained CNNs consist of two parallel convolutional networks for ADC and T2w images respectively. Each network is trained using images of a single modality in a weakly-supervised manner by providing a set of prostate images with image-level labels indicating only the presence of PCa without priors of lesions' locations. Discriminative visual patterns of lesions can be learned effectively from clutters of prostate and surrounding tissues. A cancer response map with each pixel indicating the likelihood to be cancerous is explicitly generated at the last convolutional layer of the network for each modality. A new back-propagated error E is defined to enforce both optimized classification results and consistent cancer response maps for different modalities, which help capture highly representative PCa-relevant features during the CNN feature learning process. The CNN features of each modality are concatenated and fed into a SVM classifier. For images which are classified to contain cancers, non-maximum suppression and adaptive thresholding are applied to the corresponding cancer response maps for PCa foci localization. Evaluation based on 160 patient data with 12-core systematic TRUS-guided prostate biopsy as the reference standard demonstrates that our system achieves a sensitivity of 0.46, 0.92 and 0.97 at 0.1, 1 and 10 false positives per normal/benign patient which is significantly superior to two state-of-the-art CNN-based methods (Oquab et al., 2015; Zhou et al., 2015) and 6-core systematic prostate biopsies. Copyright © 2017 Elsevier B.V. All rights reserved.

  3. Tuning to optimize SVM approach for assisting ovarian cancer diagnosis with photoacoustic imaging.

    PubMed

    Wang, Rui; Li, Rui; Lei, Yanyan; Zhu, Quing

    2015-01-01

    Support vector machine (SVM) is one of the most effective classification methods for cancer detection. The efficiency and quality of a SVM classifier depends strongly on several important features and a set of proper parameters. Here, a series of classification analyses, with one set of photoacoustic data from ovarian tissues ex vivo and a widely used breast cancer dataset- the Wisconsin Diagnostic Breast Cancer (WDBC), revealed the different accuracy of a SVM classification in terms of the number of features used and the parameters selected. A pattern recognition system is proposed by means of SVM-Recursive Feature Elimination (RFE) with the Radial Basis Function (RBF) kernel. To improve the effectiveness and robustness of the system, an optimized tuning ensemble algorithm called as SVM-RFE(C) with correlation filter was implemented to quantify feature and parameter information based on cross validation. The proposed algorithm is first demonstrated outperforming SVM-RFE on WDBC. Then the best accuracy of 94.643% and sensitivity of 94.595% were achieved when using SVM-RFE(C) to test 57 new PAT data from 19 patients. The experiment results show that the classifier constructed with SVM-RFE(C) algorithm is able to learn additional information from new data and has significant potential in ovarian cancer diagnosis.

  4. Classification of ductal carcinoma in situ by gene expression profiling.

    PubMed

    Hannemann, Juliane; Velds, Arno; Halfwerk, Johannes B G; Kreike, Bas; Peterse, Johannes L; van de Vijver, Marc J

    2006-01-01

    Ductal carcinoma in situ (DCIS) is characterised by the intraductal proliferation of malignant epithelial cells. Several histological classification systems have been developed, but assessing the histological type/grade of DCIS lesions is still challenging, making treatment decisions based on these features difficult. To obtain insight in the molecular basis of the development of different types of DCIS and its progression to invasive breast cancer, we have studied differences in gene expression between different types of DCIS and between DCIS and invasive breast carcinomas. Gene expression profiling using microarray analysis has been performed on 40 in situ and 40 invasive breast cancer cases. DCIS cases were classified as well- (n = 6), intermediately (n = 18), and poorly (n = 14) differentiated type. Of the 40 invasive breast cancer samples, five samples were grade I, 11 samples were grade II, and 24 samples were grade III. Using two-dimensional hierarchical clustering, the basal-like type, ERB-B2 type, and the luminal-type tumours originally described for invasive breast cancer could also be identified in DCIS. Using supervised classification, we identified a gene expression classifier of 35 genes, which differed between DCIS and invasive breast cancer; a classifier of 43 genes could be identified separating between well- and poorly differentiated DCIS samples.

  5. Classification of ductal carcinoma in situ by gene expression profiling

    PubMed Central

    Hannemann, Juliane; Velds, Arno; Halfwerk, Johannes BG; Kreike, Bas; Peterse, Johannes L; van de Vijver, Marc J

    2006-01-01

    Introduction Ductal carcinoma in situ (DCIS) is characterised by the intraductal proliferation of malignant epithelial cells. Several histological classification systems have been developed, but assessing the histological type/grade of DCIS lesions is still challenging, making treatment decisions based on these features difficult. To obtain insight in the molecular basis of the development of different types of DCIS and its progression to invasive breast cancer, we have studied differences in gene expression between different types of DCIS and between DCIS and invasive breast carcinomas. Methods Gene expression profiling using microarray analysis has been performed on 40 in situ and 40 invasive breast cancer cases. Results DCIS cases were classified as well- (n = 6), intermediately (n = 18), and poorly (n = 14) differentiated type. Of the 40 invasive breast cancer samples, five samples were grade I, 11 samples were grade II, and 24 samples were grade III. Using two-dimensional hierarchical clustering, the basal-like type, ERB-B2 type, and the luminal-type tumours originally described for invasive breast cancer could also be identified in DCIS. Conclusion Using supervised classification, we identified a gene expression classifier of 35 genes, which differed between DCIS and invasive breast cancer; a classifier of 43 genes could be identified separating between well- and poorly differentiated DCIS samples. PMID:17069663

  6. Molecular Characterization of Epithelial Ovarian Cancer: Implications for Diagnosis and Treatment.

    PubMed

    Rojas, Veronica; Hirshfield, Kim M; Ganesan, Shridar; Rodriguez-Rodriguez, Lorna

    2016-12-15

    Epithelial ovarian cancer is a highly heterogeneous disease characterized by multiple histological subtypes. Molecular diversity has been shown to occur within specific histological subtypes of epithelial ovarian cancer, between different tumors of an individual patient, as well as within individual tumors. Recent advances in the molecular characterization of epithelial ovarian cancer tumors have provided the basis for a simplified classification scheme in which these cancers are classified as either type I or type II tumors, and these two categories have implications regarding disease pathogenesis and prognosis. Molecular analyses, primarily based on next-generation sequencing, otherwise known as high-throughput sequencing, are allowing for further refinement of ovarian cancer classification, facilitating the elucidation of the site(s) of precursor lesions of high-grade serous ovarian cancer, and providing insight into the processes of clonal selection and evolution that may be associated with development of chemoresistance. Potential therapeutic targets have been identified from recent molecular profiling studies of these tumors, and the effectiveness and safety of a number of specific targeted therapies have been evaluated or are currently being studied for the treatment of women with this disease.

  7. Molecular Characterization of Epithelial Ovarian Cancer: Implications for Diagnosis and Treatment

    PubMed Central

    Rojas, Veronica; Hirshfield, Kim M.; Ganesan, Shridar; Rodriguez-Rodriguez, Lorna

    2016-01-01

    Epithelial ovarian cancer is a highly heterogeneous disease characterized by multiple histological subtypes. Molecular diversity has been shown to occur within specific histological subtypes of epithelial ovarian cancer, between different tumors of an individual patient, as well as within individual tumors. Recent advances in the molecular characterization of epithelial ovarian cancer tumors have provided the basis for a simplified classification scheme in which these cancers are classified as either type I or type II tumors, and these two categories have implications regarding disease pathogenesis and prognosis. Molecular analyses, primarily based on next-generation sequencing, otherwise known as high-throughput sequencing, are allowing for further refinement of ovarian cancer classification, facilitating the elucidation of the site(s) of precursor lesions of high-grade serous ovarian cancer, and providing insight into the processes of clonal selection and evolution that may be associated with development of chemoresistance. Potential therapeutic targets have been identified from recent molecular profiling studies of these tumors, and the effectiveness and safety of a number of specific targeted therapies have been evaluated or are currently being studied for the treatment of women with this disease. PMID:27983698

  8. Skin lesion computational diagnosis of dermoscopic images: Ensemble models based on input feature manipulation.

    PubMed

    Oliveira, Roberta B; Pereira, Aledir S; Tavares, João Manuel R S

    2017-10-01

    The number of deaths worldwide due to melanoma has risen in recent times, in part because melanoma is the most aggressive type of skin cancer. Computational systems have been developed to assist dermatologists in early diagnosis of skin cancer, or even to monitor skin lesions. However, there still remains a challenge to improve classifiers for the diagnosis of such skin lesions. The main objective of this article is to evaluate different ensemble classification models based on input feature manipulation to diagnose skin lesions. Input feature manipulation processes are based on feature subset selections from shape properties, colour variation and texture analysis to generate diversity for the ensemble models. Three subset selection models are presented here: (1) a subset selection model based on specific feature groups, (2) a correlation-based subset selection model, and (3) a subset selection model based on feature selection algorithms. Each ensemble classification model is generated using an optimum-path forest classifier and integrated with a majority voting strategy. The proposed models were applied on a set of 1104 dermoscopic images using a cross-validation procedure. The best results were obtained by the first ensemble classification model that generates a feature subset ensemble based on specific feature groups. The skin lesion diagnosis computational system achieved 94.3% accuracy, 91.8% sensitivity and 96.7% specificity. The input feature manipulation process based on specific feature subsets generated the greatest diversity for the ensemble classification model with very promising results. Copyright © 2017 Elsevier B.V. All rights reserved.

  9. [Incidence of melanoma and changes in stage-specific incidence after implementation of skin cancer screening in Schleswig-Holstein].

    PubMed

    Eisemann, N; Waldmann, A; Katalinic, A

    2014-01-01

    A pilot project in skin cancer screening (SCREEN) was conducted in Schleswig-Holstein from July 2003 to June 2004. Although the impact of this screening on the stage-specific incidence of melanoma is of great importance for screening evaluation, it remains unknown. In theory, an effective skin cancer screening program should result in a medium-term incidence decrease of melanomas with a prognostically unfavorable stage. This is studied on a population-based level by using cancer registry data. Based on data from the Cancer Registry of Schleswig-Holstein for 1999-2009, stage-specific (T-category of the TNM-classification system) age-standardized incidence rates were calculated. After implementation of the SCREEN project, the incidence of prognostically favorable melanomas (in situ and T1) was higher than before, while the incidence of advanced melanomas (T2, T3, and for women also T4) decreased considerably. The classification of tumor stages changed during the project period, which may have contributed to an artificial decrease of the stages with a poor prognosis. Nevertheless, the results are in agreement with the observed decrease of melanoma mortality in the screening region.

  10. Prostatic cancers: understanding their molecular pathology and the 2016 WHO classification

    PubMed Central

    Inamura, Kentaro

    2018-01-01

    Accumulating evidence suggests that prostatic cancers represent a group of histologically and molecularly heterogeneous diseases with variable clinical courses. In accordance with the increased knowledge of their clinicopathologies and genetics, the World Health Organization (WHO) classification of prostatic cancers has been revised. Additionally, recent data on their comprehensive molecular characterization have increased our understanding of the genomic basis of prostatic cancers and enabled us to classify them into subtypes with distinct molecular pathologies and clinical features. Our increased understanding of the molecular pathologies of prostatic cancers has permitted their evolution from a poorly understood, heterogeneous group of diseases with variable clinical courses to characteristic molecular subtypes that allow the implementation of personalized therapies and better patient management. This review provides perspectives on the new 2016 WHO classification of prostatic cancers as well as recent knowledge of their molecular pathologies. The WHO classification of prostatic cancers will require additional revisions to allow for reliable and clinically meaningful cancer diagnoses as a better understanding of their molecular characteristics is obtained. PMID:29581876

  11. Prostate segmentation by sparse representation based classification

    PubMed Central

    Gao, Yaozong; Liao, Shu; Shen, Dinggang

    2012-01-01

    Purpose: The segmentation of prostate in CT images is of essential importance to external beam radiotherapy, which is one of the major treatments for prostate cancer nowadays. During the radiotherapy, the prostate is radiated by high-energy x rays from different directions. In order to maximize the dose to the cancer and minimize the dose to the surrounding healthy tissues (e.g., bladder and rectum), the prostate in the new treatment image needs to be accurately localized. Therefore, the effectiveness and efficiency of external beam radiotherapy highly depend on the accurate localization of the prostate. However, due to the low contrast of the prostate with its surrounding tissues (e.g., bladder), the unpredicted prostate motion, and the large appearance variations across different treatment days, it is challenging to segment the prostate in CT images. In this paper, the authors present a novel classification based segmentation method to address these problems. Methods: To segment the prostate, the proposed method first uses sparse representation based classification (SRC) to enhance the prostate in CT images by pixel-wise classification, in order to overcome the limitation of poor contrast of the prostate images. Then, based on the classification results, previous segmented prostates of the same patient are used as patient-specific atlases to align onto the current treatment image and the majority voting strategy is finally adopted to segment the prostate. In order to address the limitations of the traditional SRC in pixel-wise classification, especially for the purpose of segmentation, the authors extend SRC from the following four aspects: (1) A discriminant subdictionary learning method is proposed to learn a discriminant and compact representation of training samples for each class so that the discriminant power of SRC can be increased and also SRC can be applied to the large-scale pixel-wise classification. (2) The L1 regularized sparse coding is replaced by the elastic net in order to obtain a smooth and clear prostate boundary in the classification result. (3) Residue-based linear regression is incorporated to improve the classification performance and to extend SRC from hard classification to soft classification. (4) Iterative SRC is proposed by using context information to iteratively refine the classification results. Results: The proposed method has been comprehensively evaluated on a dataset consisting of 330 CT images from 24 patients. The effectiveness of the extended SRC has been validated by comparing it with the traditional SRC based on the proposed four extensions. The experimental results show that our extended SRC can obtain not only more accurate classification results but also smoother and clearer prostate boundary than the traditional SRC. Besides, the comparison with other five state-of-the-art prostate segmentation methods indicates that our method can achieve better performance than other methods under comparison. Conclusions: The authors have proposed a novel prostate segmentation method based on the sparse representation based classification, which can achieve considerably accurate segmentation results in CT prostate segmentation. PMID:23039673

  12. Prostate segmentation by sparse representation based classification.

    PubMed

    Gao, Yaozong; Liao, Shu; Shen, Dinggang

    2012-10-01

    The segmentation of prostate in CT images is of essential importance to external beam radiotherapy, which is one of the major treatments for prostate cancer nowadays. During the radiotherapy, the prostate is radiated by high-energy x rays from different directions. In order to maximize the dose to the cancer and minimize the dose to the surrounding healthy tissues (e.g., bladder and rectum), the prostate in the new treatment image needs to be accurately localized. Therefore, the effectiveness and efficiency of external beam radiotherapy highly depend on the accurate localization of the prostate. However, due to the low contrast of the prostate with its surrounding tissues (e.g., bladder), the unpredicted prostate motion, and the large appearance variations across different treatment days, it is challenging to segment the prostate in CT images. In this paper, the authors present a novel classification based segmentation method to address these problems. To segment the prostate, the proposed method first uses sparse representation based classification (SRC) to enhance the prostate in CT images by pixel-wise classification, in order to overcome the limitation of poor contrast of the prostate images. Then, based on the classification results, previous segmented prostates of the same patient are used as patient-specific atlases to align onto the current treatment image and the majority voting strategy is finally adopted to segment the prostate. In order to address the limitations of the traditional SRC in pixel-wise classification, especially for the purpose of segmentation, the authors extend SRC from the following four aspects: (1) A discriminant subdictionary learning method is proposed to learn a discriminant and compact representation of training samples for each class so that the discriminant power of SRC can be increased and also SRC can be applied to the large-scale pixel-wise classification. (2) The L1 regularized sparse coding is replaced by the elastic net in order to obtain a smooth and clear prostate boundary in the classification result. (3) Residue-based linear regression is incorporated to improve the classification performance and to extend SRC from hard classification to soft classification. (4) Iterative SRC is proposed by using context information to iteratively refine the classification results. The proposed method has been comprehensively evaluated on a dataset consisting of 330 CT images from 24 patients. The effectiveness of the extended SRC has been validated by comparing it with the traditional SRC based on the proposed four extensions. The experimental results show that our extended SRC can obtain not only more accurate classification results but also smoother and clearer prostate boundary than the traditional SRC. Besides, the comparison with other five state-of-the-art prostate segmentation methods indicates that our method can achieve better performance than other methods under comparison. The authors have proposed a novel prostate segmentation method based on the sparse representation based classification, which can achieve considerably accurate segmentation results in CT prostate segmentation.

  13. An iterated Laplacian based semi-supervised dimensionality reduction for classification of breast cancer on ultrasound images.

    PubMed

    Liu, Xiao; Shi, Jun; Zhou, Shichong; Lu, Minhua

    2014-01-01

    The dimensionality reduction is an important step in ultrasound image based computer-aided diagnosis (CAD) for breast cancer. A newly proposed l2,1 regularized correntropy algorithm for robust feature selection (CRFS) has achieved good performance for noise corrupted data. Therefore, it has the potential to reduce the dimensions of ultrasound image features. However, in clinical practice, the collection of labeled instances is usually expensive and time costing, while it is relatively easy to acquire the unlabeled or undetermined instances. Therefore, the semi-supervised learning is very suitable for clinical CAD. The iterated Laplacian regularization (Iter-LR) is a new regularization method, which has been proved to outperform the traditional graph Laplacian regularization in semi-supervised classification and ranking. In this study, to augment the classification accuracy of the breast ultrasound CAD based on texture feature, we propose an Iter-LR-based semi-supervised CRFS (Iter-LR-CRFS) algorithm, and then apply it to reduce the feature dimensions of ultrasound images for breast CAD. We compared the Iter-LR-CRFS with LR-CRFS, original supervised CRFS, and principal component analysis. The experimental results indicate that the proposed Iter-LR-CRFS significantly outperforms all other algorithms.

  14. Study of support vector machine and serum surface-enhanced Raman spectroscopy for noninvasive esophageal cancer detection

    NASA Astrophysics Data System (ADS)

    Li, Shao-Xin; Zeng, Qiu-Yao; Li, Lin-Fang; Zhang, Yan-Jiao; Wan, Ming-Ming; Liu, Zhi-Ming; Xiong, Hong-Lian; Guo, Zhou-Yi; Liu, Song-Hao

    2013-02-01

    The ability of combining serum surface-enhanced Raman spectroscopy (SERS) with support vector machine (SVM) for improving classification esophageal cancer patients from normal volunteers is investigated. Two groups of serum SERS spectra based on silver nanoparticles (AgNPs) are obtained: one group from patients with pathologically confirmed esophageal cancer (n=30) and the other group from healthy volunteers (n=31). Principal components analysis (PCA), conventional SVM (C-SVM) and conventional SVM combination with PCA (PCA-SVM) methods are implemented to classify the same spectral dataset. Results show that a diagnostic accuracy of 77.0% is acquired for PCA technique, while diagnostic accuracies of 83.6% and 85.2% are obtained for C-SVM and PCA-SVM methods based on radial basis functions (RBF) models. The results prove that RBF SVM models are superior to PCA algorithm in classification serum SERS spectra. The study demonstrates that serum SERS in combination with SVM technique has great potential to provide an effective and accurate diagnostic schema for noninvasive detection of esophageal cancer.

  15. An ensemble predictive modeling framework for breast cancer classification.

    PubMed

    Nagarajan, Radhakrishnan; Upreti, Meenakshi

    2017-12-01

    Molecular changes often precede clinical presentation of diseases and can be useful surrogates with potential to assist in informed clinical decision making. Recent studies have demonstrated the usefulness of modeling approaches such as classification that can predict the clinical outcomes from molecular expression profiles. While useful, a majority of these approaches implicitly use all molecular markers as features in the classification process often resulting in sparse high-dimensional projection of the samples often comparable to that of the sample size. In this study, a variant of the recently proposed ensemble classification approach is used for predicting good and poor-prognosis breast cancer samples from their molecular expression profiles. In contrast to traditional single and ensemble classifiers, the proposed approach uses multiple base classifiers with varying feature sets obtained from two-dimensional projection of the samples in conjunction with a majority voting strategy for predicting the class labels. In contrast to our earlier implementation, base classifiers in the ensembles are chosen based on maximal sensitivity and minimal redundancy by choosing only those with low average cosine distance. The resulting ensemble sets are subsequently modeled as undirected graphs. Performance of four different classification algorithms is shown to be better within the proposed ensemble framework in contrast to using them as traditional single classifier systems. Significance of a subset of genes with high-degree centrality in the network abstractions across the poor-prognosis samples is also discussed. Copyright © 2017 Elsevier Inc. All rights reserved.

  16. North American Magazine Coverage of Skin Cancer and Recreational Tanning Before and After the WHO/IARC 2009 Classification of Indoor Tanning Devices as Carcinogenic.

    PubMed

    McWhirter, Jennifer E; Hoffman-Goetz, Laurie

    2015-09-01

    The mass media is an influential source of skin cancer information for the public. In 2009, the World Health Organization's International Agency for Research on Cancer classified UV radiation from tanning devices as carcinogenic. Our objective was to determine if media coverage of skin cancer and recreational tanning increased in volume or changed in nature after this classification. We conducted a directed content analysis on 29 North American popular magazines (2007-2012) to investigate the overall volume of articles on skin cancer and recreational tanning and, more specifically, the presence of skin cancer risk factors, UV behaviors, and early detection information in article text (n = 410) and images (n = 714). The volume of coverage on skin cancer and recreational tanning did not increase significantly after the 2009 classification of tanning beds as carcinogenic. Key-related messages, including that UV exposure is a risk factor for skin cancer and that indoor tanning should be avoided, were not reported more frequently after the classification, but the promotion of the tanned look as attractive was conveyed more often in images afterwards (p < .01). Content promoting high-SPF sunscreen use increased after the classification (p < .01), but there were no significant positive changes in the frequency of coverage of skin cancer risk factors, other UV behaviors, or early detection information over time. The classification of indoor tanning beds as carcinogenic had no significant impact on the volume or nature of skin cancer and recreational tanning coverage in magazines.

  17. Her2Net: A Deep Framework for Semantic Segmentation and Classification of Cell Membranes and Nuclei in Breast Cancer Evaluation.

    PubMed

    Saha, Monjoy; Chakraborty, Chandan

    2018-05-01

    We present an efficient deep learning framework for identifying, segmenting, and classifying cell membranes and nuclei from human epidermal growth factor receptor-2 (HER2)-stained breast cancer images with minimal user intervention. This is a long-standing issue for pathologists because the manual quantification of HER2 is error-prone, costly, and time-consuming. Hence, we propose a deep learning-based HER2 deep neural network (Her2Net) to solve this issue. The convolutional and deconvolutional parts of the proposed Her2Net framework consisted mainly of multiple convolution layers, max-pooling layers, spatial pyramid pooling layers, deconvolution layers, up-sampling layers, and trapezoidal long short-term memory (TLSTM). A fully connected layer and a softmax layer were also used for classification and error estimation. Finally, HER2 scores were calculated based on the classification results. The main contribution of our proposed Her2Net framework includes the implementation of TLSTM and a deep learning framework for cell membrane and nucleus detection, segmentation, and classification and HER2 scoring. Our proposed Her2Net achieved 96.64% precision, 96.79% recall, 96.71% F-score, 93.08% negative predictive value, 98.33% accuracy, and a 6.84% false-positive rate. Our results demonstrate the high accuracy and wide applicability of the proposed Her2Net in the context of HER2 scoring for breast cancer evaluation.

  18. Clinical study of noninvasive in vivo melanoma and nonmelanoma skin cancers using multimodal spectral diagnosis

    PubMed Central

    Lim, Liang; Nichols, Brandon; Migden, Michael R.; Rajaram, Narasimhan; Reichenberg, Jason S.; Markey, Mia K.; Ross, Merrick I.; Tunnell, James W.

    2014-01-01

    Abstract. The goal of this study was to determine the diagnostic capability of a multimodal spectral diagnosis (SD) for in vivo noninvasive disease diagnosis of melanoma and nonmelanoma skin cancers. We acquired reflectance, fluorescence, and Raman spectra from 137 lesions in 76 patients using custom-built optical fiber-based clinical systems. Biopsies of lesions were classified using standard histopathology as malignant melanoma (MM), nonmelanoma pigmented lesion (PL), basal cell carcinoma (BCC), actinic keratosis (AK), and squamous cell carcinoma (SCC). Spectral data were analyzed using principal component analysis. Using multiple diagnostically relevant principal components, we built leave-one-out logistic regression classifiers. Classification results were compared with histopathology of the lesion. Sensitivity/specificity for classifying MM versus PL (12 versus 17 lesions) was 100%/100%, for SCC and BCC versus AK (57 versus 14 lesions) was 95%/71%, and for AK and SCC and BCC versus normal skin (71 versus 71 lesions) was 90%/85%. The best classification for nonmelanoma skin cancers required multiple modalities; however, the best melanoma classification occurred with Raman spectroscopy alone. The high diagnostic accuracy for classifying both melanoma and nonmelanoma skin cancer lesions demonstrates the potential for SD as a clinical diagnostic device. PMID:25375350

  19. Clinical study of noninvasive in vivo melanoma and nonmelanoma skin cancers using multimodal spectral diagnosis

    NASA Astrophysics Data System (ADS)

    Lim, Liang; Nichols, Brandon; Migden, Michael R.; Rajaram, Narasimhan; Reichenberg, Jason S.; Markey, Mia K.; Ross, Merrick I.; Tunnell, James W.

    2014-11-01

    The goal of this study was to determine the diagnostic capability of a multimodal spectral diagnosis (SD) for in vivo noninvasive disease diagnosis of melanoma and nonmelanoma skin cancers. We acquired reflectance, fluorescence, and Raman spectra from 137 lesions in 76 patients using custom-built optical fiber-based clinical systems. Biopsies of lesions were classified using standard histopathology as malignant melanoma (MM), nonmelanoma pigmented lesion (PL), basal cell carcinoma (BCC), actinic keratosis (AK), and squamous cell carcinoma (SCC). Spectral data were analyzed using principal component analysis. Using multiple diagnostically relevant principal components, we built leave-one-out logistic regression classifiers. Classification results were compared with histopathology of the lesion. Sensitivity/specificity for classifying MM versus PL (12 versus 17 lesions) was 100%;/100%;, for SCC and BCC versus AK (57 versus 14 lesions) was 95%;/71%, and for AK and SCC and BCC versus normal skin (71 versus 71 lesions) was 90%/85%. The best classification for nonmelanoma skin cancers required multiple modalities; however, the best melanoma classification occurred with Raman spectroscopy alone. The high diagnostic accuracy for classifying both melanoma and nonmelanoma skin cancer lesions demonstrates the potential for SD as a clinical diagnostic device.

  20. Numeric pathologic lymph node classification shows prognostic superiority to topographic pN classification in esophageal squamous cell carcinoma.

    PubMed

    Sugawara, Kotaro; Yamashita, Hiroharu; Uemura, Yukari; Mitsui, Takashi; Yagi, Koichi; Nishida, Masato; Aikou, Susumu; Mori, Kazuhiko; Nomura, Sachiyo; Seto, Yasuyuki

    2017-10-01

    The current eighth tumor node metastasis lymph node category pathologic lymph node staging system for esophageal squamous cell carcinoma is based solely on the number of metastatic nodes and does not consider anatomic distribution. We aimed to assess the prognostic capability of the eighth tumor node metastasis pathologic lymph node staging system (numeric-based) compared with the 11th Japan Esophageal Society (topography-based) pathologic lymph node staging system in patients with esophageal squamous cell carcinoma. We retrospectively reviewed the clinical records of 289 patients with esophageal squamous cell carcinoma who underwent esophagectomy with extended lymph node dissection during the period from January 2006 through June 2016. We compared discrimination abilities for overall survival, recurrence-free survival, and cancer-specific survival between these 2 staging systems using C-statistics. The median number of dissected and metastatic nodes was 61 (25% to 75% quartile range, 45 to 79) and 1 (25% to 75% quartile range, 0 to 3), respectively. The eighth tumor node metastasis pathologic lymph node staging system had a greater ability to accurately determine overall survival (C-statistics: tumor node metastasis classification, 0.69, 95% confidence interval, 0.62-0.76; Japan Esophageal Society classification; 0.65, 95% confidence interval, 0.58-0.71; P = .014) and cancer-specific survival (C-statistics: tumor node metastasis classification, 0.78, 95% confidence interval, 0.70-0.87; Japan Esophageal Society classification; 0.72, 95% confidence interval, 0.64-0.80; P = .018). Rates of total recurrence rose as the eighth tumor node metastasis pathologic lymph node stage increased, while stratification of patients according to the topography-based node classification system was not feasible. Numeric nodal staging is an essential tool for stratifying the oncologic outcomes of patients with esophageal squamous cell carcinoma even in the cohort in which adequate numbers of lymph nodes were harvested. Copyright © 2017 Elsevier Inc. All rights reserved.

  1. Classifying brain metastases by their primary site of origin using a radiomics approach based on texture analysis: a feasibility study.

    PubMed

    Ortiz-Ramón, Rafael; Larroza, Andrés; Ruiz-España, Silvia; Arana, Estanislao; Moratal, David

    2018-05-14

    To examine the capability of MRI texture analysis to differentiate the primary site of origin of brain metastases following a radiomics approach. Sixty-seven untreated brain metastases (BM) were found in 3D T1-weighted MRI of 38 patients with cancer: 27 from lung cancer, 23 from melanoma and 17 from breast cancer. These lesions were segmented in 2D and 3D to compare the discriminative power of 2D and 3D texture features. The images were quantized using different number of gray-levels to test the influence of quantization. Forty-three rotation-invariant texture features were examined. Feature selection and random forest classification were implemented within a nested cross-validation structure. Classification was evaluated with the area under receiver operating characteristic curve (AUC) considering two strategies: multiclass and one-versus-one. In the multiclass approach, 3D texture features were more discriminative than 2D features. The best results were achieved for images quantized with 32 gray-levels (AUC = 0.873 ± 0.064) using the top four features provided by the feature selection method based on the p-value. In the one-versus-one approach, high accuracy was obtained when differentiating lung cancer BM from breast cancer BM (four features, AUC = 0.963 ± 0.054) and melanoma BM (eight features, AUC = 0.936 ± 0.070) using the optimal dataset (3D features, 32 gray-levels). Classification of breast cancer and melanoma BM was unsatisfactory (AUC = 0.607 ± 0.180). Volumetric MRI texture features can be useful to differentiate brain metastases from different primary cancers after quantizing the images with the proper number of gray-levels. • Texture analysis is a promising source of biomarkers for classifying brain neoplasms. • MRI texture features of brain metastases could help identifying the primary cancer. • Volumetric texture features are more discriminative than traditional 2D texture features.

  2. Breast cancer in the 21st century: from early detection to new therapies.

    PubMed

    Merino Bonilla, J A; Torres Tabanera, M; Ros Mendoza, L H

    The analysis of the causes that have given rise to a change in tendency in the incidence and mortality rates of breast cancer in the last few decades generates important revelations regarding the role of breast screening, the regular application of adjuvant therapies and the change of risk factors. The benefits of early detection have been accompanied by certain adverse effects, even in terms of an excessive number of prophylactic mastectomies. Recently, several updates have been published on the recommendations in breast cancer screening at an international level. On the other hand, the advances in genomics have made it possible to establish a new molecular classification of breast cancer. Our aim is to present an updated overview of the epidemiological situation of breast cancer, as well as some relevant issues from the point of view of diagnosis, such as molecular classification and different strategies for both population-based and opportunistic screening. Copyright © 2017 SERAM. Publicado por Elsevier España, S.L.U. All rights reserved.

  3. An auxiliary classification diagnosis software development of cervical cancer medical data based on various artificial neural networks

    NASA Astrophysics Data System (ADS)

    Qi, Yong; Lei, Kai; Zhang, Lizeqing; Xing, Ximing; Gou, Wenyue

    2018-06-01

    This paper introduced the development of a self-serving medical data assisted diagnosis software of cervical cancer on the basis of artificial neural network (SVN, FNN, KNN). The system is developed based on the idea of self-service platform, supported by the application and innovation of neural network algorithm in medical data identification. Furthermore, it combined the advanced methods in various fields to effectively solve the complicated and inaccurate problem of cervical canceration data in the traditional manual treatment.

  4. Molecular subtypes of metastatic colorectal cancer are associated with patient response to irinotecan-based therapies.

    PubMed

    Del Rio, M; Mollevi, C; Bibeau, F; Vie, N; Selves, J; Emile, J-F; Roger, P; Gongora, C; Robert, J; Tubiana-Mathieu, N; Ychou, M; Martineau, P

    2017-05-01

    Currently, metastatic colorectal cancer is treated as a homogeneous disease and only RAS mutational status has been approved as a negative predictive factor in patients treated with cetuximab. The aim of this study was to evaluate if recently identified molecular subtypes of colon cancer are associated with response of metastatic patients to first-line therapy. We collected and analysed 143 samples of human colorectal tumours with complete clinical annotations, including the response to treatment. Gene expression profiling was used to classify patients in three to six classes using four different molecular classifications. Correlations between molecular subtypes, response to treatment, progression-free and overall survival were analysed. We first demonstrated that the four previously described molecular classifications of colorectal cancer defined in non-metastatic patients also correctly classify stage IV patients. One of the classifications is strongly associated with response to FOLFIRI (P=0.003), but not to FOLFOX (P=0.911) and FOLFIRI + Bevacizumab (P=0.190). In particular, we identify a molecular subtype representing 28% of the patients that shows an exceptionally high response rate to FOLFIRI (87.5%). These patients have a two-fold longer overall survival (40.1 months) when treated with FOLFIRI, as first-line regimen, instead of FOLFOX (18.6 months). Our results demonstrate the interest of molecular classifications to develop tailored therapies for patients with metastatic colorectal cancer and a strong impact of the first-line regimen on the overall survival of some patients. This however remains to be confirmed in a large prospective clinical trial. Copyright © 2017 Elsevier Ltd. All rights reserved.

  5. Fuzzy support vector machine: an efficient rule-based classification technique for microarrays.

    PubMed

    Hajiloo, Mohsen; Rabiee, Hamid R; Anooshahpour, Mahdi

    2013-01-01

    The abundance of gene expression microarray data has led to the development of machine learning algorithms applicable for tackling disease diagnosis, disease prognosis, and treatment selection problems. However, these algorithms often produce classifiers with weaknesses in terms of accuracy, robustness, and interpretability. This paper introduces fuzzy support vector machine which is a learning algorithm based on combination of fuzzy classifiers and kernel machines for microarray classification. Experimental results on public leukemia, prostate, and colon cancer datasets show that fuzzy support vector machine applied in combination with filter or wrapper feature selection methods develops a robust model with higher accuracy than the conventional microarray classification models such as support vector machine, artificial neural network, decision trees, k nearest neighbors, and diagonal linear discriminant analysis. Furthermore, the interpretable rule-base inferred from fuzzy support vector machine helps extracting biological knowledge from microarray data. Fuzzy support vector machine as a new classification model with high generalization power, robustness, and good interpretability seems to be a promising tool for gene expression microarray classification.

  6. Preventable Exposures Associated With Human Cancers

    PubMed Central

    Baan, Robert; Straif, Kurt; Grosse, Yann; Lauby-Secretan, Béatrice; El Ghissassi, Fatiha; Bouvard, Véronique; Benbrahim-Tallaa, Lamia; Guha, Neela; Freeman, Crystal; Galichet, Laurent; Wild, Christopher P.

    2011-01-01

    Information on the causes of cancer at specific sites is important to cancer control planners, cancer researchers, cancer patients, and the general public. The International Agency for Research on Cancer (IARC) Monograph series, which has classified human carcinogens for more than 40 years, recently completed a review to provide up-to-date information on the cancer sites associated with more than 100 carcinogenic agents. Based on IARC’s review, we listed the cancer sites associated with each agent and then rearranged this information to list the known and suspected causes of cancer at each site. We also summarized the rationale for classifications that were based on mechanistic data. This information, based on the forthcoming IARC Monographs Volume 100, offers insights into the current state-of-the-science of carcinogen identification. Use of mechanistic data to identify carcinogens is increasing, and epidemiological research is identifying additional carcinogens and cancer sites or confirming carcinogenic potential under conditions of lower exposure. Nevertheless, some common human cancers still have few (or no) identified causal agents. PMID:22158127

  7. Call for a Computer-Aided Cancer Detection and Classification Research Initiative in Oman.

    PubMed

    Mirzal, Andri; Chaudhry, Shafique Ahmad

    2016-01-01

    Cancer is a major health problem in Oman. It is reported that cancer incidence in Oman is the second highest after Saudi Arabia among Gulf Cooperation Council countries. Based on GLOBOCAN estimates, Oman is predicted to face an almost two-fold increase in cancer incidence in the period 2008-2020. However, cancer research in Oman is still in its infancy. This is due to the fact that medical institutions and infrastructure that play central roles in data collection and analysis are relatively new developments in Oman. We believe the country requires an organized plan and efforts to promote local cancer research. In this paper, we discuss current research progress in cancer diagnosis using machine learning techniques to optimize computer aided cancer detection and classification (CAD). We specifically discuss CAD using two major medical data, i.e., medical imaging and microarray gene expression profiling, because medical imaging like mammography, MRI, and PET have been widely used in Oman for assisting radiologists in early cancer diagnosis and microarray data have been proven to be a reliable source for differential diagnosis. We also discuss future cancer research directions and benefits to Oman economy for entering the cancer research and treatment business as it is a multi-billion dollar industry worldwide.

  8. Quality of race, Hispanic ethnicity, and immigrant status in population-based cancer registry data: implications for health disparity studies.

    PubMed

    Clegg, Limin X; Reichman, Marsha E; Hankey, Benjamin F; Miller, Barry A; Lin, Yi D; Johnson, Norman J; Schwartz, Stephen M; Bernstein, Leslie; Chen, Vivien W; Goodman, Marc T; Gomez, Scarlett L; Graff, John J; Lynch, Charles F; Lin, Charles C; Edwards, Brenda K

    2007-03-01

    Population-based cancer registry data from the Surveillance, Epidemiology, and End Results (SEER) Program at the National Cancer Institute are based on medical records and administrative information. Although SEER data have been used extensively in health disparities research, the quality of information concerning race, Hispanic ethnicity, and immigrant status has not been systematically evaluated. The quality of this information was determined by comparing SEER data with self-reported data among 13,538 cancer patients diagnosed between 1973-2001 in the SEER--National Longitudinal Mortality Study linked database. The overall agreement was excellent on race (kappa = 0.90, 95% CI = 0.88-0.91), moderate to substantial on Hispanic ethnicity (kappa = 0.61, 95% CI = 0.58-0.64), and low on immigrant status (kappa = 0.21. 95% CI = 0.10, 0.23). The effect of these disagreements was that SEER data tended to under-classify patient numbers when compared to self-identifications, except for the non-Hispanic group which was slightly over-classified. These disagreements translated into varying racial-, ethnic-, and immigrant status-specific cancer statistics, depending on whether self-reported or SEER data were used. In particular, the 5-year Kaplan-Meier survival and the median survival time from all causes for American Indians/Alaska Natives were substantially higher when based on self-classification (59% and 140 months, respectively) than when based on SEER classification (44% and 53 months, respectively), although the number of patients is small. These results can serve as a useful guide to researchers contemplating the use of population-based registry data to ascertain disparities in cancer burden. In particular, the study results caution against evaluating health disparities by using birthplace as a measure of immigrant status and race information for American Indians/Alaska Natives.

  9. Optimal aggregation of binary classifiers for multiclass cancer diagnosis using gene expression profiles.

    PubMed

    Yukinawa, Naoto; Oba, Shigeyuki; Kato, Kikuya; Ishii, Shin

    2009-01-01

    Multiclass classification is one of the fundamental tasks in bioinformatics and typically arises in cancer diagnosis studies by gene expression profiling. There have been many studies of aggregating binary classifiers to construct a multiclass classifier based on one-versus-the-rest (1R), one-versus-one (11), or other coding strategies, as well as some comparison studies between them. However, the studies found that the best coding depends on each situation. Therefore, a new problem, which we call the "optimal coding problem," has arisen: how can we determine which coding is the optimal one in each situation? To approach this optimal coding problem, we propose a novel framework for constructing a multiclass classifier, in which each binary classifier to be aggregated has a weight value to be optimally tuned based on the observed data. Although there is no a priori answer to the optimal coding problem, our weight tuning method can be a consistent answer to the problem. We apply this method to various classification problems including a synthesized data set and some cancer diagnosis data sets from gene expression profiling. The results demonstrate that, in most situations, our method can improve classification accuracy over simple voting heuristics and is better than or comparable to state-of-the-art multiclass predictors.

  10. Optical biopsy using fluorescence spectroscopy for prostate cancer diagnosis

    NASA Astrophysics Data System (ADS)

    Wu, Binlin; Gao, Xin; Smith, Jason; Bailin, Jacob

    2017-02-01

    Native fluorescence spectra are acquired from fresh normal and cancerous human prostate tissues. The fluorescence data are analyzed using a multivariate analysis algorithm such as non-negative matrix factorization. The nonnegative spectral components are retrieved and attributed to the native fluorophores such as collagen, reduced nicotinamide adenine dinucleotide (NADH), and flavin adenine dinucleotide (FAD) in tissue. The retrieved weights of the components, e.g. NADH and FAD are used to estimate the relative concentrations of the native fluorophores and the redox ratio. A machine learning algorithm such as support vector machine (SVM) is used for classification to distinguish normal and cancerous tissue samples based on either the relative concentrations of NADH and FAD or the redox ratio alone. The classification performance is shown based on statistical measures such as sensitivity, specificity, and accuracy, along with the area under receiver operating characteristic (ROC) curve. A cross validation method such as leave-one-out is used to evaluate the predictive performance of the SVM classifier to avoid bias due to overfitting.

  11. Deep learning based classification for head and neck cancer detection with hyperspectral imaging in an animal model

    NASA Astrophysics Data System (ADS)

    Ma, Ling; Lu, Guolan; Wang, Dongsheng; Wang, Xu; Chen, Zhuo Georgia; Muller, Susan; Chen, Amy; Fei, Baowei

    2017-03-01

    Hyperspectral imaging (HSI) is an emerging imaging modality that can provide a noninvasive tool for cancer detection and image-guided surgery. HSI acquires high-resolution images at hundreds of spectral bands, providing big data to differentiating different types of tissue. We proposed a deep learning based method for the detection of head and neck cancer with hyperspectral images. Since the deep learning algorithm can learn the feature hierarchically, the learned features are more discriminative and concise than the handcrafted features. In this study, we adopt convolutional neural networks (CNN) to learn the deep feature of pixels for classifying each pixel into tumor or normal tissue. We evaluated our proposed classification method on the dataset containing hyperspectral images from 12 tumor-bearing mice. Experimental results show that our method achieved an average accuracy of 91.36%. The preliminary study demonstrated that our deep learning method can be applied to hyperspectral images for detecting head and neck tumors in animal models.

  12. hemaClass.org: Online One-By-One Microarray Normalization and Classification of Hematological Cancers for Precision Medicine.

    PubMed

    Falgreen, Steffen; Ellern Bilgrau, Anders; Brøndum, Rasmus Froberg; Hjort Jakobsen, Lasse; Have, Jonas; Lindblad Nielsen, Kasper; El-Galaly, Tarec Christoffer; Bødker, Julie Støve; Schmitz, Alexander; H Young, Ken; Johnsen, Hans Erik; Dybkær, Karen; Bøgsted, Martin

    2016-01-01

    Dozens of omics based cancer classification systems have been introduced with prognostic, diagnostic, and predictive capabilities. However, they often employ complex algorithms and are only applicable on whole cohorts of patients, making them difficult to apply in a personalized clinical setting. This prompted us to create hemaClass.org, an online web application providing an easy interface to one-by-one RMA normalization of microarrays and subsequent risk classifications of diffuse large B-cell lymphoma (DLBCL) into cell-of-origin and chemotherapeutic sensitivity classes. Classification results for one-by-one array pre-processing with and without a laboratory specific RMA reference dataset were compared to cohort based classifiers in 4 publicly available datasets. Classifications showed high agreement between one-by-one and whole cohort pre-processsed data when a laboratory specific reference set was supplied. The website is essentially the R-package hemaClass accompanied by a Shiny web application. The well-documented package can be used to run the website locally or to use the developed methods programmatically. The website and R-package is relevant for biological and clinical lymphoma researchers using affymetrix U-133 Plus 2 arrays, as it provides reliable and swift methods for calculation of disease subclasses. The proposed one-by-one pre-processing method is relevant for all researchers using microarrays.

  13. The p27Kip1 Tumor Suppressor and Multi-Step Tumorigenesis

    DTIC Science & Technology

    2001-08-01

    Breast Cancer , Cell cycle, tumor suppressor 33 16. PRICE CODE 17. SECURITY CLASSIFICATION 18. SECURITY CLASSIFICATION 19. SECURITY CLASSIFICATION 20...in many cancers , including carcinomas of the breast , colon, lung and prostate, and lymphoma. Although these studies of p27 expression in primary...of DMBA-induced pituitary tumors in p27-/- mice precluded determination of breast cancer risk in these mice. Nevertheless, the extensive mammary tissue

  14. Applications of Support Vector Machine (SVM) Learning in Cancer Genomics

    PubMed Central

    HUANG, SHUJUN; CAI, NIANGUANG; PACHECO, PEDRO PENZUTI; NARANDES, SHAVIRA; WANG, YANG; XU, WAYNE

    2017-01-01

    Machine learning with maximization (support) of separating margin (vector), called support vector machine (SVM) learning, is a powerful classification tool that has been used for cancer genomic classification or subtyping. Today, as advancements in high-throughput technologies lead to production of large amounts of genomic and epigenomic data, the classification feature of SVMs is expanding its use in cancer genomics, leading to the discovery of new biomarkers, new drug targets, and a better understanding of cancer driver genes. Herein we reviewed the recent progress of SVMs in cancer genomic studies. We intend to comprehend the strength of the SVM learning and its future perspective in cancer genomic applications. PMID:29275361

  15. History of the FIGO cancer staging system.

    PubMed

    Odicino, Franco; Pecorelli, Sergio; Zigliani, Lucia; Creasman, William T

    2008-05-01

    The main objectives of any good staging system - essential to an evidence-based approach to cancer - are: to aid the clinician in planning treatment; to provide indication of prognosis; to assist the physician in evaluating the results of treatment; to facilitate the exchange of information between treatment centers, thus disseminating knowledge; and to contribute to continuing investigations into human malignancies. A good staging system must have 3 basic characteristics: it must be valid, reliable, and practical. The first staging system for gynecological cancers appeared around the turn of the 20th century and applied to the carcinoma of the cervix uteri-the most common cancer affecting women in high income countries at that time. The classification and staging of the other gynecological malignancies was not put forward until the 1950s. Over the years, these staging classifications - with the exception of cervical cancer and gestational trophoblastic neoplasia - have shifted from a clinical to a surgical-pathological basis. This paper reviews the history of the International Federation of Gynecology and Obstetrics (FIGO) cancer staging system, how it was developed, and why.

  16. Hierarchical Decimal Classification of Information Related to Cancer Research.

    ERIC Educational Resources Information Center

    Schneider, John H.

    The classification may be used (1) to identify cancer research efforts supported by NCI in selected areas of research (at any general or specific level desired), (2) to store information related to cancer research and retrieve this information on request, and (3) to match interests of cancer research scientists against information in published…

  17. A Multi-Label Classification Approach for Coding Cancer Information Service Chat Transcripts

    PubMed Central

    Rios, Anthony; Vanderpool, Robin; Shaw, Pam

    2017-01-01

    National Cancer Institute's (NCI) Cancer Information Service (CIS) offers online instant messaging based information service called LiveHelp to patients, family members, friends, and other cancer information consumers. A cancer information specialist (IS) ‘chats’ with a consumer and provides information on a variety of topics including clinical trials. After a LiveHelp chat session is finished, the IS codes about 20 different elements of metadata about the session in electronic contact record forms (ECRF), which are to be later used for quality control and reporting. Besides straightforward elements like age and gender, more specific elements to be coded include the purpose of contact, the subjects of interaction, and the different responses provided to the consumer, the latter two often taking on multiple values. As such, ECRF coding is a time consuming task and automating this process could help ISs to focus more on their primary goal of helping consumers with valuable cancer related information. As a first attempt in this task, we explored multi-label and multi-class text classification approaches to code the purpose, subjects of interaction, and the responses provided based on the chat transcripts. With a sample dataset of about 673 transcripts, we achieved example-based F-scores of 0.67 (for subjects) and 0.58 (responses). We also achieved label-based micro F-scores of 0.65 (for subjects), 0.62 (for responses), and 0.61 (for purpose). To our knowledge this is the first attempt in automatic coding of Live-Help transcripts and our initial results on the smaller corpus indicate promising future directions in this task. PMID:28736775

  18. CFS-SMO based classification of breast density using multiple texture models.

    PubMed

    Sharma, Vipul; Singh, Sukhwinder

    2014-06-01

    It is highly acknowledged in the medical profession that density of breast tissue is a major cause for the growth of breast cancer. Increased breast density was found to be linked with an increased risk of breast cancer growth, as high density makes it difficult for radiologists to see an abnormality which leads to false negative results. Therefore, there is need for the development of highly efficient techniques for breast tissue classification based on density. This paper presents a hybrid scheme for classification of fatty and dense mammograms using correlation-based feature selection (CFS) and sequential minimal optimization (SMO). In this work, texture analysis is done on a region of interest selected from the mammogram. Various texture models have been used to quantify the texture of parenchymal patterns of breast. To reduce the dimensionality and to identify the features which differentiate between breast tissue densities, CFS is used. Finally, classification is performed using SMO. The performance is evaluated using 322 images of mini-MIAS database. Highest accuracy of 96.46% is obtained for two-class problem (fatty and dense) using proposed approach. Performance of selected features by CFS is also evaluated by Naïve Bayes, Multilayer Perceptron, RBF Network, J48 and kNN classifier. The proposed CFS-SMO method outperforms all other classifiers giving a sensitivity of 100%. This makes it suitable to be taken as a second opinion in classifying breast tissue density.

  19. Implementation of several mathematical algorithms to breast tissue density classification

    NASA Astrophysics Data System (ADS)

    Quintana, C.; Redondo, M.; Tirao, G.

    2014-02-01

    The accuracy of mammographic abnormality detection methods is strongly dependent on breast tissue characteristics, where a dense breast tissue can hide lesions causing cancer to be detected at later stages. In addition, breast tissue density is widely accepted to be an important risk indicator for the development of breast cancer. This paper presents the implementation and the performance of different mathematical algorithms designed to standardize the categorization of mammographic images, according to the American College of Radiology classifications. These mathematical techniques are based on intrinsic properties calculations and on comparison with an ideal homogeneous image (joint entropy, mutual information, normalized cross correlation and index Q) as categorization parameters. The algorithms evaluation was performed on 100 cases of the mammographic data sets provided by the Ministerio de Salud de la Provincia de Córdoba, Argentina—Programa de Prevención del Cáncer de Mama (Department of Public Health, Córdoba, Argentina, Breast Cancer Prevention Program). The obtained breast classifications were compared with the expert medical diagnostics, showing a good performance. The implemented algorithms revealed a high potentiality to classify breasts into tissue density categories.

  20. Classification of skin cancer images using local binary pattern and SVM classifier

    NASA Astrophysics Data System (ADS)

    Adjed, Faouzi; Faye, Ibrahima; Ababsa, Fakhreddine; Gardezi, Syed Jamal; Dass, Sarat Chandra

    2016-11-01

    In this paper, a classification method for melanoma and non-melanoma skin cancer images has been presented using the local binary patterns (LBP). The LBP computes the local texture information from the skin cancer images, which is later used to compute some statistical features that have capability to discriminate the melanoma and non-melanoma skin tissues. Support vector machine (SVM) is applied on the feature matrix for classification into two skin image classes (malignant and benign). The method achieves good classification accuracy of 76.1% with sensitivity of 75.6% and specificity of 76.7%.

  1. Texture analysis applied to second harmonic generation image data for ovarian cancer classification

    NASA Astrophysics Data System (ADS)

    Wen, Bruce L.; Brewer, Molly A.; Nadiarnykh, Oleg; Hocker, James; Singh, Vikas; Mackie, Thomas R.; Campagnola, Paul J.

    2014-09-01

    Remodeling of the extracellular matrix has been implicated in ovarian cancer. To quantitate the remodeling, we implement a form of texture analysis to delineate the collagen fibrillar morphology observed in second harmonic generation microscopy images of human normal and high grade malignant ovarian tissues. In the learning stage, a dictionary of "textons"-frequently occurring texture features that are identified by measuring the image response to a filter bank of various shapes, sizes, and orientations-is created. By calculating a representative model based on the texton distribution for each tissue type using a training set of respective second harmonic generation images, we then perform classification between images of normal and high grade malignant ovarian tissues. By optimizing the number of textons and nearest neighbors, we achieved classification accuracy up to 97% based on the area under receiver operating characteristic curves (true positives versus false positives). The local analysis algorithm is a more general method to probe rapidly changing fibrillar morphologies than global analyses such as FFT. It is also more versatile than other texture approaches as the filter bank can be highly tailored to specific applications (e.g., different disease states) by creating customized libraries based on common image features.

  2. Static micro-array isolation, dynamic time series classification, capture and enumeration of spiked breast cancer cells in blood: the nanotube-CTC chip

    NASA Astrophysics Data System (ADS)

    Khosravi, Farhad; Trainor, Patrick J.; Lambert, Christopher; Kloecker, Goetz; Wickstrom, Eric; Rai, Shesh N.; Panchapakesan, Balaji

    2016-11-01

    We demonstrate the rapid and label-free capture of breast cancer cells spiked in blood using nanotube-antibody micro-arrays. 76-element single wall carbon nanotube arrays were manufactured using photo-lithography, metal deposition, and etching techniques. Anti-epithelial cell adhesion molecule (anti-EpCAM), Anti-human epithelial growth factor receptor 2 (anti-Her2) and non-specific IgG antibodies were functionalized to the surface of the nanotube devices using 1-pyrene-butanoic acid succinimidyl ester. Following device functionalization, blood spiked with SKBR3, MCF7 and MCF10A cells (100/1000 cells per 5 μl per device, 170 elements totaling 0.85 ml of whole blood) were adsorbed on to the nanotube device arrays. Electrical signatures were recorded from each device to screen the samples for differences in interaction (specific or non-specific) between samples and devices. A zone classification scheme enabled the classification of all 170 elements in a single map. A kernel-based statistical classifier for the ‘liquid biopsy’ was developed to create a predictive model based on dynamic time warping series to classify device electrical signals that corresponded to plain blood (control) or SKBR3 spiked blood (case) on anti-Her2 functionalized devices with ˜90% sensitivity, and 90% specificity in capture of 1000 SKBR3 breast cancer cells in blood using anti-Her2 functionalized devices. Screened devices that gave positive electrical signatures were confirmed using optical/confocal microscopy to hold spiked cancer cells. Confocal microscopic analysis of devices that were classified to hold spiked blood based on their electrical signatures confirmed the presence of cancer cells through staining for DAPI (nuclei), cytokeratin (cancer cells) and CD45 (hematologic cells) with single cell sensitivity. We report 55%-100% cancer cell capture yield depending on the active device area for blood adsorption with mean of 62% (˜12 500 captured off 20 000 spiked cells in 0.1 ml blood) in this first nanotube-CTC chip study.

  3. Efficacy of hidden markov model over support vector machine on multiclass classification of healthy and cancerous cervical tissues

    NASA Astrophysics Data System (ADS)

    Mukhopadhyay, Sabyasachi; Kurmi, Indrajit; Pratiher, Sawon; Mukherjee, Sukanya; Barman, Ritwik; Ghosh, Nirmalya; Panigrahi, Prasanta K.

    2018-02-01

    In this paper, a comparative study between SVM and HMM has been carried out for multiclass classification of cervical healthy and cancerous tissues. In our study, the HMM methodology is more promising to produce higher accuracy in classification.

  4. Classification of oral cancers using Raman spectroscopy of serum

    NASA Astrophysics Data System (ADS)

    Sahu, Aditi; Talathi, Sneha; Sawant, Sharada; Krishna, C. Murali

    2014-03-01

    Oral cancers are the sixth most common malignancy worldwide, with low 5-year disease free survival rates, attributable to late detection due to lack of reliable screening modalities. Our in vivo Raman spectroscopy studies have demonstrated classification of normal and tumor as well as cancer field effects (CFE), the earliest events in oral cancers. In view of limitations such as requirement of on-site instrumentation and stringent experimental conditions of this approach, feasibility of classification of normal and cancer using serum was explored using 532 nm excitation. In this study, strong resonance features of β-carotenes, present differentially in normal and pathological conditions, were observed. In the present study, Raman spectra of sera of 36 buccal mucosa, 33 tongue cancers and 17 healthy subjects were recorded using Raman microprobe coupled with 40X objective using 785 nm excitation, a known source of excitation for biomedical applications. To eliminate heterogeneity, average of 3 spectra recorded from each sample was subjected to PC-LDA followed by leave-one-out-cross-validation. Findings indicate average classification efficiency of ~70% for normal and cancer. Buccal mucosa and tongue cancer serum could also be classified with an efficiency of ~68%. Of the two cancers, buccal mucosa cancer and normal could be classified with a higher efficiency. Findings of the study are quite comparable to that of our earlier study, which suggest that there exist significant differences, other than β- carotenes, between normal and cancerous samples which can be exploited for the classification. Prospectively, extensive validation studies will be undertaken to confirm the findings.

  5. A new set of wavelet- and fractals-based features for Gleason grading of prostate cancer histopathology images

    NASA Astrophysics Data System (ADS)

    Mosquera Lopez, Clara; Agaian, Sos

    2013-02-01

    Prostate cancer detection and staging is an important step towards patient treatment selection. Advancements in digital pathology allow the application of new quantitative image analysis algorithms for computer-assisted diagnosis (CAD) on digitized histopathology images. In this paper, we introduce a new set of features to automatically grade pathological images using the well-known Gleason grading system. The goal of this study is to classify biopsy images belonging to Gleason patterns 3, 4, and 5 by using a combination of wavelet and fractal features. For image classification we use pairwise coupling Support Vector Machine (SVM) classifiers. The accuracy of the system, which is close to 97%, is estimated through three different cross-validation schemes. The proposed system offers the potential for automating classification of histological images and supporting prostate cancer diagnosis.

  6. MicroRNA Expression Profile Selection for Cancer Staging Classification Using Backpropagation

    NASA Astrophysics Data System (ADS)

    Anjarwati; Wibowo, Adi; Adhy, Satriyo; Kusumaningrum, Retno

    2018-05-01

    Ovarian cancer, breast cancer, and lung cancer are deadly diseases and require serious treatment. The cancers are among the fifth most common causes of cancer-induced deaths especially for woman. The high mortality rate of cancer is caused by the lack of effective strategies for early detection of the cancer, whereas if its detected in the early stages, the life survival of cancer patients will be 90%, otherwise the survival rate only 30% when the cancers detected on metastasis stages or cancer cells have spread from a primary site of cancer. MicroRNAs can be used as potential biomarkers for cancer due to their profile expression on the cancers. In this paper, we proposed the feature selection of microRNA expression profiles for classification of the cancers stages using Backpropagation Neural Network. The Cancer stages are classified into before metastasis and after metastasis. Several combinations of the microRNA expression profiles from medical references are compared to find the best features for the classification. The accuracy and the mean square errors are used as basis testing the comparison.

  7. Applications of Support Vector Machine (SVM) Learning in Cancer Genomics.

    PubMed

    Huang, Shujun; Cai, Nianguang; Pacheco, Pedro Penzuti; Narrandes, Shavira; Wang, Yang; Xu, Wayne

    2018-01-01

    Machine learning with maximization (support) of separating margin (vector), called support vector machine (SVM) learning, is a powerful classification tool that has been used for cancer genomic classification or subtyping. Today, as advancements in high-throughput technologies lead to production of large amounts of genomic and epigenomic data, the classification feature of SVMs is expanding its use in cancer genomics, leading to the discovery of new biomarkers, new drug targets, and a better understanding of cancer driver genes. Herein we reviewed the recent progress of SVMs in cancer genomic studies. We intend to comprehend the strength of the SVM learning and its future perspective in cancer genomic applications. Copyright© 2018, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.

  8. Classification of lung cancer patients and controls by chromatography of modified nucleosides in serum

    USGS Publications Warehouse

    McEntire, John E.; Kuo, Kenneth C.; Smith, Mark E.; Stalling, David L.; Richens, Jack W.; Zumwalt, Robert W.; Gehrke, Charles W.; Papermaster, Ben W.

    1989-01-01

    A wide spectrum of modified nucleosides has been quantified by high-performance liquid chromatography in serum of 49 male lung cancer patients, 35 patients with other cancers, and 48 patients hospitalized for nonneoplastic diseases. Data for 29 modified nucleoside peaks were normalized to an internal standard and analyzed by discriminant analysis and stepwise discriminant analysis. A model based on peaks selected by a stepwise discriminant procedure correctly classified 79% of the cancer and 75% of the noncancer subjects. It also demonstrated 84% sensitivity and 79% specificity when comparing lung cancer to noncancer subjects, and 80% sensitivity and 55% specificity in comparing lung cancer to other cancers. The nucleoside peaks having the greatest influence on the models varied dependent on the subgroups compared, confirming the importance of quantifying a wide array of nucleosides. These data support and expand previous studies which reported the utility of measuring modified nucleoside levels in serum and show that precise measurement of an array of 29 modified nucleosides in serum by high-performance liquid chromatography with UV scanning with subsequent data modeling may provide a clinically useful approach to patient classification in diagnosis and subsequent therapeutic monitoring.

  9. Application of machine learning on brain cancer multiclass classification

    NASA Astrophysics Data System (ADS)

    Panca, V.; Rustam, Z.

    2017-07-01

    Classification of brain cancer is a problem of multiclass classification. One approach to solve this problem is by first transforming it into several binary problems. The microarray gene expression dataset has the two main characteristics of medical data: extremely many features (genes) and only a few number of samples. The application of machine learning on microarray gene expression dataset mainly consists of two steps: feature selection and classification. In this paper, the features are selected using a method based on support vector machine recursive feature elimination (SVM-RFE) principle which is improved to solve multiclass classification, called multiple multiclass SVM-RFE. Instead of using only the selected features on a single classifier, this method combines the result of multiple classifiers. The features are divided into subsets and SVM-RFE is used on each subset. Then, the selected features on each subset are put on separate classifiers. This method enhances the feature selection ability of each single SVM-RFE. Twin support vector machine (TWSVM) is used as the method of the classifier to reduce computational complexity. While ordinary SVM finds single optimum hyperplane, the main objective Twin SVM is to find two non-parallel optimum hyperplanes. The experiment on the brain cancer microarray gene expression dataset shows this method could classify 71,4% of the overall test data correctly, using 100 and 1000 genes selected from multiple multiclass SVM-RFE feature selection method. Furthermore, the per class results show that this method could classify data of normal and MD class with 100% accuracy.

  10. The IASLC Lung Cancer Staging Project: A Renewed Call to Participation.

    PubMed

    Giroux, Dorothy J; Van Schil, Paul; Asamura, Hisao; Rami-Porta, Ramón; Chansky, Kari; Crowley, John J; Rusch, Valerie W; Kernstine, Kemp

    2018-06-01

    Over the past two decades, the International Association for the Study of Lung Cancer (IASLC) Staging Project has been a steady source of evidence-based recommendations for the TNM classification for lung cancer published by the Union for International Cancer Control and the American Joint Committee on Cancer. The Staging and Prognostic Factors Committee of the IASLC is now issuing a call for participation in the next phase of the project, which is designed to inform the ninth edition of the TNM classification for lung cancer. Following the case recruitment model for the eighth edition database, volunteer site participants are asked to submit data on patients whose lung cancer was diagnosed between January 1, 2011, and December 31, 2019, to the project by means of a secure, electronic data capture system provided by Cancer Research And Biostatistics in Seattle, Washington. Alternatively, participants may transfer existing data sets. The continued success of the IASLC Staging Project in achieving its objectives will depend on the extent of international participation, the degree to which cases are entered directly into the electronic data capture system, and how closely externally submitted cases conform to the data elements for the project. Copyright © 2018 International Association for the Study of Lung Cancer. Published by Elsevier Inc. All rights reserved.

  11. An opto-electronic joint detection system based on DSP aiming at early cervical cancer screening

    NASA Astrophysics Data System (ADS)

    Wang, Weiya; Jia, Mengyu; Gao, Feng; Yang, Lihong; Qu, Pengpeng; Zou, Changping; Liu, Pengxi; Zhao, Huijuan

    2015-02-01

    The cervical cancer screening at a pre-cancer stage is beneficial to reduce the mortality of women. An opto-electronic joint detection system based on DSP aiming at early cervical cancer screening is introduced in this paper. In this system, three electrodes alternately discharge to the cervical tissue and three light emitting diodes in different wavelengths alternately irradiate the cervical tissue. Then the relative optical reflectance and electrical voltage attenuation curve are obtained by optical and electrical detection, respectively. The system is based on DSP to attain the portable and cheap instrument. By adopting the relative reflectance and the voltage attenuation constant, the classification algorithm based on Support Vector Machine (SVM) discriminates abnormal cervical tissue from normal. We use particle swarm optimization to optimize the two key parameters of SVM, i.e. nuclear factor and cost factor. The clinical data were collected on 313 patients to build a clinical database of tissue responses under optical and electrical stimulations with the histopathologic examination as the gold standard. The classification result shows that the opto-electronic joint detection has higher total coincidence rate than separate optical detection or separate electrical detection. The sensitivity, specificity, and total coincidence rate increase with the increasing of sample numbers in the training set. The average total coincidence rate of the system can reach 85.1% compared with the histopathologic examination.

  12. Biological classification with RNA-Seq data: Can alternatively spliced transcript expression enhance machine learning classifier?

    PubMed

    Johnson, Nathan T; Dhroso, Andi; Hughes, Katelyn J; Korkin, Dmitry

    2018-06-25

    The extent to which the genes are expressed in the cell can be simplistically defined as a function of one or more factors of the environment, lifestyle, and genetics. RNA sequencing (RNA-Seq) is becoming a prevalent approach to quantify gene expression, and is expected to gain better insights to a number of biological and biomedical questions, compared to the DNA microarrays. Most importantly, RNA-Seq allows to quantify expression at the gene and alternative splicing isoform levels. However, leveraging the RNA-Seq data requires development of new data mining and analytics methods. Supervised machine learning methods are commonly used approaches for biological data analysis, and have recently gained attention for their applications to the RNA-Seq data. In this work, we assess the utility of supervised learning methods trained on RNA-Seq data for a diverse range of biological classification tasks. We hypothesize that the isoform-level expression data is more informative for biological classification tasks than the gene-level expression data. Our large-scale assessment is done through utilizing multiple datasets, organisms, lab groups, and RNA-Seq analysis pipelines. Overall, we performed and assessed 61 biological classification problems that leverage three independent RNA-Seq datasets and include over 2,000 samples that come from multiple organisms, lab groups, and RNA-Seq analyses. These 61 problems include predictions of the tissue type, sex, or age of the sample, healthy or cancerous phenotypes and, the pathological tumor stage for the samples from the cancerous tissue. For each classification problem, the performance of three normalization techniques and six machine learning classifiers was explored. We find that for every single classification problem, the isoform-based classifiers outperform or are comparable with gene expression based methods. The top-performing supervised learning techniques reached a near perfect classification accuracy, demonstrating the utility of supervised learning for RNA-Seq based data analysis. Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  13. Recognition of multiple imbalanced cancer types based on DNA microarray data using ensemble classifiers.

    PubMed

    Yu, Hualong; Hong, Shufang; Yang, Xibei; Ni, Jun; Dan, Yuanyuan; Qin, Bin

    2013-01-01

    DNA microarray technology can measure the activities of tens of thousands of genes simultaneously, which provides an efficient way to diagnose cancer at the molecular level. Although this strategy has attracted significant research attention, most studies neglect an important problem, namely, that most DNA microarray datasets are skewed, which causes traditional learning algorithms to produce inaccurate results. Some studies have considered this problem, yet they merely focus on binary-class problem. In this paper, we dealt with multiclass imbalanced classification problem, as encountered in cancer DNA microarray, by using ensemble learning. We utilized one-against-all coding strategy to transform multiclass to multiple binary classes, each of them carrying out feature subspace, which is an evolving version of random subspace that generates multiple diverse training subsets. Next, we introduced one of two different correction technologies, namely, decision threshold adjustment or random undersampling, into each training subset to alleviate the damage of class imbalance. Specifically, support vector machine was used as base classifier, and a novel voting rule called counter voting was presented for making a final decision. Experimental results on eight skewed multiclass cancer microarray datasets indicate that unlike many traditional classification approaches, our methods are insensitive to class imbalance.

  14. Automated detection of breast cancer in resected specimens with fluorescence lifetime imaging

    NASA Astrophysics Data System (ADS)

    Phipps, Jennifer E.; Gorpas, Dimitris; Unger, Jakob; Darrow, Morgan; Bold, Richard J.; Marcu, Laura

    2018-01-01

    Re-excision rates for breast cancer lumpectomy procedures are currently nearly 25% due to surgeons relying on inaccurate or incomplete methods of evaluating specimen margins. The objective of this study was to determine if cancer could be automatically detected in breast specimens from mastectomy and lumpectomy procedures by a classification algorithm that incorporated parameters derived from fluorescence lifetime imaging (FLIm). This study generated a database of co-registered histologic sections and FLIm data from breast cancer specimens (N  =  20) and a support vector machine (SVM) classification algorithm able to automatically detect cancerous, fibrous, and adipose breast tissue. Classification accuracies were greater than 97% for automated detection of cancerous, fibrous, and adipose tissue from breast cancer specimens. The classification worked equally well for specimens scanned by hand or with a mechanical stage, demonstrating that the system could be used during surgery or on excised specimens. The ability of this technique to simply discriminate between cancerous and normal breast tissue, in particular to distinguish fibrous breast tissue from tumor, which is notoriously challenging for optical techniques, leads to the conclusion that FLIm has great potential to assess breast cancer margins. Identification of positive margins before waiting for complete histologic analysis could significantly reduce breast cancer re-excision rates.

  15. Atypia and DNA methylation in nipple duct lavage in relation to predicted breast cancer risk.

    PubMed

    Euhus, David M; Bu, Dawei; Ashfaq, Raheela; Xie, Xian-Jin; Bian, Aihua; Leitch, A Marilyn; Lewis, Cheryl M

    2007-09-01

    Tumor suppressor gene (TSG) methylation is identified more frequently in random periareolar fine needle aspiration samples from women at high risk for breast cancer than women at lower risk. It is not known whether TSG methylation or atypia in nipple duct lavage (NDL) samples is related to predicted breast cancer risk. 514 NDL samples obtained from 150 women selected to represent a wide range of breast cancer risk were evaluated cytologically and by quantitative multiplex methylation-specific PCR for methylation of cyclin D2, APC, HIN1, RASSF1A, and RAR-beta2. Based on methylation patterns and cytology, NDL retrieved cancer cells from only 9% of breasts ipsilateral to a breast cancer. Methylation of >/=2 genes correlated with marked atypia by univariate analysis, but not multivariate analysis, that adjusted for sample cellularity and risk group classification. Both marked atypia and TSG methylation independently predicted abundant cellularity in multivariate analyses. Discrimination between Gail lower-risk ducts and Gail high-risk ducts was similar for marked atypia [odds ratio (OR), 3.48; P = 0.06] and measures of TSG methylation (OR, 3.51; P = 0.03). However, marked atypia provided better discrimination between Gail lower-risk ducts and ducts contralateral to a breast cancer (OR, 6.91; P = 0.003, compared with methylation OR, 4.21; P = 0.02). TSG methylation in NDL samples does not predict marked atypia after correcting for sample cellularity and risk group classification. Rather, both methylation and marked atypia are independently associated with highly cellular samples, Gail model risk classifications, and a personal history of breast cancer. This suggests the existence of related, but independent, pathogenic pathways in breast epithelium.

  16. Assessment of nucleosides as putative tumor biomarkers in prostate cancer screening by CE-UV.

    PubMed

    Buzatto, Adriana Zardini; de Oliveira Silva, Mariana; Poppi, Ronei Jesus; Simionato, Ana Valéria Colnaghi

    2017-05-01

    Cancer is responsible for millions of deaths worldwide, but most base diseases may be cured if detected early. Screening tests may be used to identify early-stage malignant neoplasms. However, the major screening tool for prostate cancer, the prostate-specific antigen test, has unsuitable sensitivity. Since cancer cells may affect the pattern of consumption and excretion of nucleosides, such biomolecules are putative biomarkers that can be used for diagnosis and treatment evaluation. Using a previously validated method for the analysis of nucleosides in blood serum by capillary electrophoresis with UV-vis spectroscopy detection, we investigated 60 samples from healthy individuals and 42 samples from prostate cancer patients. The concentrations of nucleosides in both groups were compared and a multivariate partial least squares-discriminant analysis classification model was optimized for prediction of prostate cancer. The validation of the model with an independent sample set resulted in the correct classification of 82.4% of the samples, with sensitivity of 90.5% and specificity of 76.7%. A significant downregulation of 5-methyluridine and inosine was observed, which can be indicative of the carcinogenic process. Therefore, such analytes are potential candidates for prostate cancer screening. Graphical Abstract Separation of the studied nucleosides and the internal standard 8-Bromoguanosine by CE-UV (a); classification of the external validation samples (30 from healthy volunteers and 21 from prostate cancer patients) by the developed Partial Least Square - Discriminant Analysis (PLS-DA) model with accuracy of 82.4% (b); Receiver Operating Characteristics (ROC) curve (c); and Variable Importance in the Projection (VIP) values for the studied nucleosides (d). A significant down-regulation of 5- methyluridine (5mU) and inosine (I) was observed, which can be indicative of the presence of prostate tumors.

  17. Accuracy assessment and characterization of x-ray coded aperture coherent scatter spectral imaging for breast cancer classification

    PubMed Central

    Lakshmanan, Manu N.; Greenberg, Joel A.; Samei, Ehsan; Kapadia, Anuj J.

    2017-01-01

    Abstract. Although transmission-based x-ray imaging is the most commonly used imaging approach for breast cancer detection, it exhibits false negative rates higher than 15%. To improve cancer detection accuracy, x-ray coherent scatter computed tomography (CSCT) has been explored to potentially detect cancer with greater consistency. However, the 10-min scan duration of CSCT limits its possible clinical applications. The coded aperture coherent scatter spectral imaging (CACSSI) technique has been shown to reduce scan time through enabling single-angle imaging while providing high detection accuracy. Here, we use Monte Carlo simulations to test analytical optimization studies of the CACSSI technique, specifically for detecting cancer in ex vivo breast samples. An anthropomorphic breast tissue phantom was modeled, a CACSSI imaging system was virtually simulated to image the phantom, a diagnostic voxel classification algorithm was applied to all reconstructed voxels in the phantom, and receiver-operator characteristics analysis of the voxel classification was used to evaluate and characterize the imaging system for a range of parameters that have been optimized in a prior analytical study. The results indicate that CACSSI is able to identify the distribution of cancerous and healthy tissues (i.e., fibroglandular, adipose, or a mix of the two) in tissue samples with a cancerous voxel identification area-under-the-curve of 0.94 through a scan lasting less than 10 s per slice. These results show that coded aperture scatter imaging has the potential to provide scatter images that automatically differentiate cancerous and healthy tissue within ex vivo samples. Furthermore, the results indicate potential CACSSI imaging system configurations for implementation in subsequent imaging development studies. PMID:28331884

  18. [New TNM classification of lung tumors].

    PubMed

    Wittekind, C

    2014-11-01

    The TNM classification of lung tumors has undergone many changes in the seventh edition published in 2010. These changes reflect current data and are based on the findings of the International Association for the Study of Lung Cancer (IASLC) from 81,495 patients and concern definitions of the T and M categories as well as stage grouping. They include a better description of regional lymph nodes of the lungs based on uniformly accepted definitions by the IASLC. The changes can lead to problems in the use of the definitions and will be discussed.

  19. Impact of geographic area level on measuring socioeconomic disparities in cancer survival in New South Wales, Australia: A period analysis.

    PubMed

    Stanbury, Julia F; Baade, Peter D; Yu, Yan; Yu, Xue Qin

    2016-08-01

    Area-based socioeconomic measures are widely used in health research. In theory, the larger the area used the more individual misclassification is introduced, thus biasing the association between such area level measures and health outcomes. In this study, we examined the socioeconomic disparities in cancer survival using two geographic area-based measures to see if the size of the area matters. We used population-based cancer registry data for patients diagnosed with one of 10 major cancers in New South Wales (NSW), Australia during 2004-2008. Patients were assigned index measures of socioeconomic status (SES) based on two area-level units, census Collection District (CD) and Local Government Area (LGA) of their address at diagnosis. Five-year relative survival was estimated using the period approach for patients alive during 2004-2008, for each socioeconomic quintile at each area-level for each cancer. Poisson-regression modelling was used to adjust for socioeconomic quintile, sex, age-group at diagnosis and disease stage at diagnosis. The relative excess risk of death (RER) by socioeconomic quintile derived from this modelling was compared between area-units. We found extensive disagreement in SES classification between CD and LGA levels across all socioeconomic quintiles, particularly for more disadvantaged groups. In general, more disadvantaged patients had significantly lower survival than the least disadvantaged group for both CD and LGA classifications. The socioeconomic survival disparities detected by CD classification were larger than those detected by LGA. Adjusted RER estimates by SES were similar for most cancers when measured at both area levels. We found that classifying patient SES by the widely used Australian geographic unit LGA results in underestimation of survival disparities for several cancers compared to when SES is classified at the geographically smaller CD level. Despite this, our RER of death estimates derived from these survival estimates were generally similar for both CD and LGA level analyses, suggesting that LGAs remain a valuable spatial unit for use in Australian health and social research, though the potential for misclassification must be considered when interpreting research. While data confidentiality concerns increase with the level of geographical precision, the use of smaller area-level health and census data in the future, with appropriate allowance for confidentiality. Copyright © 2016. Published by Elsevier Ltd.

  20. Japanese Society for Cancer of the Colon and Rectum (JSCCR) guidelines 2016 for the treatment of colorectal cancer.

    PubMed

    Watanabe, Toshiaki; Muro, Kei; Ajioka, Yoichi; Hashiguchi, Yojiro; Ito, Yoshinori; Saito, Yutaka; Hamaguchi, Tetsuya; Ishida, Hideyuki; Ishiguro, Megumi; Ishihara, Soichiro; Kanemitsu, Yukihide; Kawano, Hiroshi; Kinugasa, Yusuke; Kokudo, Norihiro; Murofushi, Keiko; Nakajima, Takako; Oka, Shiro; Sakai, Yoshiharu; Tsuji, Akihito; Uehara, Keisuke; Ueno, Hideki; Yamazaki, Kentaro; Yoshida, Masahiro; Yoshino, Takayuki; Boku, Narikazu; Fujimori, Takahiro; Itabashi, Michio; Koinuma, Nobuo; Morita, Takayuki; Nishimura, Genichi; Sakata, Yuh; Shimada, Yasuhiro; Takahashi, Keiichi; Tanaka, Shinji; Tsuruta, Osamu; Yamaguchi, Toshiharu; Yamaguchi, Naohiko; Tanaka, Toshiaki; Kotake, Kenjiro; Sugihara, Kenichi

    2018-02-01

    Japanese mortality due to colorectal cancer is on the rise, surpassing 49,000 in 2015. Many new treatment methods have been developed during recent decades. The Japanese Society for Cancer of the Colon and Rectum Guidelines 2016 for the treatment of colorectal cancer (JSCCR Guidelines 2016) were prepared to show standard treatment strategies for colorectal cancer, to eliminate disparities among institutions in terms of treatment, to eliminate unnecessary treatment and insufficient treatment, and to deepen mutual understanding between health-care professionals and patients by making these Guidelines available to the general public. These Guidelines were prepared by consensus reached by the JSCCR Guideline Committee, based on a careful review of the evidence retrieved by literature searches, and in view of the medical health insurance system and actual clinical practice settings in Japan. Therefore, these Guidelines can be used as a tool for treating colorectal cancer in actual clinical practice settings. More specifically, they can be used as a guide to obtaining informed consent from patients and choosing the method of treatment for each patient. As a result of the discussions held by the Guideline Committee, controversial issues were selected as Clinical Questions, and recommendations were made. Each recommendation is accompanied by a classification of the evidence and a classification of recommendation categories based on the consensus reached by the Guideline Committee members. Here we present the English version of the JSCCR Guidelines 2016.

  1. Performance comparison of quantitative semantic features and lung-RADS in the National Lung Screening Trial

    NASA Astrophysics Data System (ADS)

    Li, Qian; Balagurunathan, Yoganand; Liu, Ying; Schabath, Matthew; Gillies, Robert J.

    2016-03-01

    Background: Lung-RADS is the new oncology classification guideline proposed by American College of Radiology (ACR), which provides recommendation for further follow up in lung cancer screening. However, only two features (solidity and size) are included in this system. We hypothesize that additional sematic features can be used to better characterize lung nodules and diagnose cancer. Objective: We propose to develop and characterize a systematic methodology based on semantic image traits to more accurately predict occurrence of cancerous nodules. Methods: 24 radiological image traits were systematically scored on a point scale (up to 5) by a trained radiologist, and lung-RADS was independently scored. A linear discriminant model was used on the semantic features to access their performance in predicting cancer status. The semantic predictors were then compared to lung-RADS classification in 199 patients (60 cancers, 139 normal controls) obtained from the National Lung Screening Trial. Result: There were different combinations of semantic features that were strong predictors of cancer status. Of these, contour, border definition, size, solidity, focal emphysema, focal fibrosis and location emerged as top candidates. The performance of two semantic features (short axial diameter and contour) had an AUC of 0.945, and was comparable to that of lung-RADS (AUC: 0.871). Conclusion: We propose that a semantics-based discrimination approach may act as a complement to the lung-RADS to predict cancer status.

  2. A comparative analysis of swarm intelligence techniques for feature selection in cancer classification.

    PubMed

    Gunavathi, Chellamuthu; Premalatha, Kandasamy

    2014-01-01

    Feature selection in cancer classification is a central area of research in the field of bioinformatics and used to select the informative genes from thousands of genes of the microarray. The genes are ranked based on T-statistics, signal-to-noise ratio (SNR), and F-test values. The swarm intelligence (SI) technique finds the informative genes from the top-m ranked genes. These selected genes are used for classification. In this paper the shuffled frog leaping with Lévy flight (SFLLF) is proposed for feature selection. In SFLLF, the Lévy flight is included to avoid premature convergence of shuffled frog leaping (SFL) algorithm. The SI techniques such as particle swarm optimization (PSO), cuckoo search (CS), SFL, and SFLLF are used for feature selection which identifies informative genes for classification. The k-nearest neighbour (k-NN) technique is used to classify the samples. The proposed work is applied on 10 different benchmark datasets and examined with SI techniques. The experimental results show that the results obtained from k-NN classifier through SFLLF feature selection method outperform PSO, CS, and SFL.

  3. Automated classification of cell morphology by coherence-controlled holographic microscopy

    NASA Astrophysics Data System (ADS)

    Strbkova, Lenka; Zicha, Daniel; Vesely, Pavel; Chmelik, Radim

    2017-08-01

    In the last few years, classification of cells by machine learning has become frequently used in biology. However, most of the approaches are based on morphometric (MO) features, which are not quantitative in terms of cell mass. This may result in poor classification accuracy. Here, we study the potential contribution of coherence-controlled holographic microscopy enabling quantitative phase imaging for the classification of cell morphologies. We compare our approach with the commonly used method based on MO features. We tested both classification approaches in an experiment with nutritionally deprived cancer tissue cells, while employing several supervised machine learning algorithms. Most of the classifiers provided higher performance when quantitative phase features were employed. Based on the results, it can be concluded that the quantitative phase features played an important role in improving the performance of the classification. The methodology could be valuable help in refining the monitoring of live cells in an automated fashion. We believe that coherence-controlled holographic microscopy, as a tool for quantitative phase imaging, offers all preconditions for the accurate automated analysis of live cell behavior while enabling noninvasive label-free imaging with sufficient contrast and high-spatiotemporal phase sensitivity.

  4. Automated classification of cell morphology by coherence-controlled holographic microscopy.

    PubMed

    Strbkova, Lenka; Zicha, Daniel; Vesely, Pavel; Chmelik, Radim

    2017-08-01

    In the last few years, classification of cells by machine learning has become frequently used in biology. However, most of the approaches are based on morphometric (MO) features, which are not quantitative in terms of cell mass. This may result in poor classification accuracy. Here, we study the potential contribution of coherence-controlled holographic microscopy enabling quantitative phase imaging for the classification of cell morphologies. We compare our approach with the commonly used method based on MO features. We tested both classification approaches in an experiment with nutritionally deprived cancer tissue cells, while employing several supervised machine learning algorithms. Most of the classifiers provided higher performance when quantitative phase features were employed. Based on the results, it can be concluded that the quantitative phase features played an important role in improving the performance of the classification. The methodology could be valuable help in refining the monitoring of live cells in an automated fashion. We believe that coherence-controlled holographic microscopy, as a tool for quantitative phase imaging, offers all preconditions for the accurate automated analysis of live cell behavior while enabling noninvasive label-free imaging with sufficient contrast and high-spatiotemporal phase sensitivity. (2017) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE).

  5. Relationship between solitary pulmonary nodule lung cancer and CT image features based on gradual clustering

    NASA Astrophysics Data System (ADS)

    Zhang, Weipeng

    2017-06-01

    The relationship between the medical characteristics of lung cancers and computer tomography (CT) images are explored so as to improve the early diagnosis rate of lung cancers. This research collected CT images of patients with solitary pulmonary nodule lung cancer, and used gradual clustering methodology to classify them. Preliminary classifications were made, followed by continuous modification and iteration to determine the optimal condensation point, until iteration stability was achieved. Reasonable classification results were obtained. the clustering results fell into 3 categories. The first type of patients was mostly female, with ages between 50 and 65 years. CT images of solitary pulmonary nodule lung cancer for this group contain complete lobulation and burr, with pleural indentation; The second type of patients was mostly male with ages between 50 and 80 years. CT images of solitary pulmonary nodule lung cancer for this group contain complete lobulation and burr, but with no pleural indentation; The third type of patients was also mostly male with ages between 50 and 80 years. CT images for this group showed no abnormalities. the application of gradual clustering methodology can scientifically classify CT image features of patients with lung cancer in the initial lesion stage. These findings provide the basis for early detection and treatment of malignant lesions in patients with lung cancer.

  6. iACP-GAEnsC: Evolutionary genetic algorithm based ensemble classification of anticancer peptides by utilizing hybrid feature space.

    PubMed

    Akbar, Shahid; Hayat, Maqsood; Iqbal, Muhammad; Jan, Mian Ahmad

    2017-06-01

    Cancer is a fatal disease, responsible for one-quarter of all deaths in developed countries. Traditional anticancer therapies such as, chemotherapy and radiation, are highly expensive, susceptible to errors and ineffective techniques. These conventional techniques induce severe side-effects on human cells. Due to perilous impact of cancer, the development of an accurate and highly efficient intelligent computational model is desirable for identification of anticancer peptides. In this paper, evolutionary intelligent genetic algorithm-based ensemble model, 'iACP-GAEnsC', is proposed for the identification of anticancer peptides. In this model, the protein sequences are formulated, using three different discrete feature representation methods, i.e., amphiphilic Pseudo amino acid composition, g-Gap dipeptide composition, and Reduce amino acid alphabet composition. The performance of the extracted feature spaces are investigated separately and then merged to exhibit the significance of hybridization. In addition, the predicted results of individual classifiers are combined together, using optimized genetic algorithm and simple majority technique in order to enhance the true classification rate. It is observed that genetic algorithm-based ensemble classification outperforms than individual classifiers as well as simple majority voting base ensemble. The performance of genetic algorithm-based ensemble classification is highly reported on hybrid feature space, with an accuracy of 96.45%. In comparison to the existing techniques, 'iACP-GAEnsC' model has achieved remarkable improvement in terms of various performance metrics. Based on the simulation results, it is observed that 'iACP-GAEnsC' model might be a leading tool in the field of drug design and proteomics for researchers. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. Performance assessment of automated tissue characterization for prostate H and E stained histopathology

    NASA Astrophysics Data System (ADS)

    DiFranco, Matthew D.; Reynolds, Hayley M.; Mitchell, Catherine; Williams, Scott; Allan, Prue; Haworth, Annette

    2015-03-01

    Reliable automated prostate tumor detection and characterization in whole-mount histology images is sought in many applications, including post-resection tumor staging and as ground-truth data for multi-parametric MRI interpretation. In this study, an ensemble-based supervised classification algorithm for high-resolution histology images was trained on tile-based image features including histogram and gray-level co-occurrence statistics. The algorithm was assessed using different combinations of H and E prostate slides from two separate medical centers and at two different magnifications (400x and 200x), with the aim of applying tumor classification models to new data. Slides from both datasets were annotated by expert pathologists in order to identify homogeneous cancerous and non-cancerous tissue regions of interest, which were then categorized as (1) low-grade tumor (LG-PCa), including Gleason 3 and high-grade prostatic intraepithelial neoplasia (HG-PIN), (2) high-grade tumor (HG-PCa), including various Gleason 4 and 5 patterns, or (3) non-cancerous, including benign stroma and benign prostatic hyperplasia (BPH). Classification models for both LG-PCa and HG-PCa were separately trained using a support vector machine (SVM) approach, and per-tile tumor prediction maps were generated from the resulting ensembles. Results showed high sensitivity for predicting HG-PCa with an AUC up to 0.822 using training data from both medical centres, while LG-PCa showed a lower sensitivity of 0.763 with the same training data. Visual inspection of cancer probability heatmaps from 9 patients showed that 17/19 tumors were detected, and HG-PCa generally reported less false positives than LG-PCa.

  8. Scattering features for lung cancer detection in fibered confocal fluorescence microscopy images.

    PubMed

    Rakotomamonjy, Alain; Petitjean, Caroline; Salaün, Mathieu; Thiberville, Luc

    2014-06-01

    To assess the feasibility of lung cancer diagnosis using fibered confocal fluorescence microscopy (FCFM) imaging technique and scattering features for pattern recognition. FCFM imaging technique is a new medical imaging technique for which interest has yet to be established for diagnosis. This paper addresses the problem of lung cancer detection using FCFM images and, as a first contribution, assesses the feasibility of computer-aided diagnosis through these images. Towards this aim, we have built a pattern recognition scheme which involves a feature extraction stage and a classification stage. The second contribution relies on the features used for discrimination. Indeed, we have employed the so-called scattering transform for extracting discriminative features, which are robust to small deformations in the images. We have also compared and combined these features with classical yet powerful features like local binary patterns (LBP) and their variants denoted as local quinary patterns (LQP). We show that scattering features yielded to better recognition performances than classical features like LBP and their LQP variants for the FCFM image classification problems. Another finding is that LBP-based and scattering-based features provide complementary discriminative information and, in some situations, we empirically establish that performance can be improved when jointly using LBP, LQP and scattering features. In this work we analyze the joint capability of FCFM images and scattering features for lung cancer diagnosis. The proposed method achieves a good recognition rate for such a diagnosis problem. It also performs well when used in conjunction with other features for other classical medical imaging classification problems. Copyright © 2014 Elsevier B.V. All rights reserved.

  9. A novel classification of prostate specific antigen (PSA) biosensors based on transducing elements.

    PubMed

    Najeeb, Mansoor Ani; Ahmad, Zubair; Shakoor, R A; Mohamed, A M A; Kahraman, Ramazan

    2017-06-01

    During the last few decades, there has been a tremendous rise in the number of research studies dedicated towards the development of diagnostic tools based on bio-sensing technology for the early detection of various diseases like cardiovascular diseases (CVD), many types of cancer, diabetes mellitus (DM) and many infectious diseases. Many breakthroughs have been developed in the areas of improving specificity, selectivity and repeatability of the biosensor devices. Innovations in the interdisciplinary areas like biotechnology, genetics, organic electronics and nanotechnology also had a great positive impact on the growth of bio-sensing technology. As a product of these improvements, fast and consistent sensing policies have been productively created for precise and ultrasensitive biomarker-based disease diagnostics. Prostate-specific antigen (PSA) is widely considered as an important biomarker used for diagnosing prostate cancer. There have been many publications based on various biosensors used for PSA detection, but a limited review was available for the classification of these biosensors used for the detection of PSA. This review highlights the various biosensors used for PSA detection and proposes a novel classification for PSA biosensors based on the transducer type used. We also highlight the advantages, disadvantages and limitations of each technique used for PSA biosensing which will make this article a complete reference tool for the future researches in PSA biosensing. Copyright © 2017 Elsevier B.V. All rights reserved.

  10. Optical diagnosis of cervical cancer by higher order spectra and boosting

    NASA Astrophysics Data System (ADS)

    Pratiher, Sawon; Mukhopadhyay, Sabyasachi; Barman, Ritwik; Pratiher, Souvik; Pradhan, Asima; Ghosh, Nirmalya; Panigrahi, Prasanta K.

    2017-03-01

    In this contribution, we report the application of higher order statistical moments using decision tree and ensemble based learning methodology for the development of diagnostic algorithms for optical diagnosis of cancer. The classification results were compared to those obtained with an independent feature extractors like linear discriminant analysis (LDA). The performance and efficacy of these methodology using higher order statistics as a classifier using boosting has higher specificity and sensitivity while being much faster as compared to other time-frequency domain based methods.

  11. Optical diagnosis of cervical cancer by intrinsic mode functions

    NASA Astrophysics Data System (ADS)

    Mukhopadhyay, Sabyasachi; Pratiher, Sawon; Pratiher, Souvik; Pradhan, Asima; Ghosh, Nirmalya; Panigrahi, Prasanta K.

    2017-03-01

    In this paper, we make use of the empirical mode decomposition (EMD) to discriminate the cervical cancer tissues from normal ones based on elastic scattering spectroscopy. The phase space has been reconstructed through decomposing the optical signal into a finite set of bandlimited signals known as intrinsic mode functions (IMFs). It has been shown that the area measure of the analytic IMFs provides a good discrimination performance. Simulation results validate the efficacy of the IMFs followed by SVM based classification.

  12. Rational bases for the use of the Immunoscore in routine clinical settings as a prognostic and predictive biomarker in cancer patients

    PubMed Central

    Kirilovsky, Amos; Marliot, Florence; El Sissy, Carine; Haicheur, Nacilla; Galon, Jérôme

    2016-01-01

    The American Joint Committee on Cancer/Union Internationale Contre le Cancer (AJCC/UICC) tumor, nodes, metastasis (TNM) classification system based on tumor features is used for prognosis estimation and treatment recommendations in most cancers. However, the clinical outcome can vary significantly among patients within the same tumor stage and TNM classification does not predict response to therapy. Therefore, many efforts have been focused on the identification of new markers. Multiple tumor cell-based approaches have been proposed but very few have been translated into the clinic. The recent demonstration of the essential role of the immune system in tumor progression has allowed great advances in the understanding of this complex disease and in the design of novel therapies. The analysis of the immune infiltrate by imaging techniques in large patient cohorts highlighted the prognostic impact of the in situ immune cell infiltrate in tumors. Moreover, the characterization of the immune infiltrates (e.g. type, density, distribution within the tumor, phenotype, activation status) in patients treated with checkpoint-blockade strategies could provide information to predict the disease outcome. In colorectal cancer, we have developed a prognostic score (‘Immunoscore’) that takes into account the distribution of the density of both CD3+ lymphocytes and CD8+ cytotoxic T cells in the tumor core and the invasive margin that could outperform TNM staging. Currently, an international retrospective study is under way to validate the Immunoscore prognostic performance in patients with colon cancer. The use of Immunoscore in clinical practice could improve the patients’ prognostic assessment and therapeutic management. PMID:27121213

  13. A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability

    PubMed Central

    2009-01-01

    Background Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for the observed discrepancies are the measurement error associated with each feature and the choice of preprocessing method. Microarray data are known to be subject to technical variation and the confidence intervals around individual point estimates of expression levels can be wide. Furthermore, the estimated expression values also vary depending on the selected preprocessing scheme. In microarray breast cancer classification studies, however, these two forms of feature variability are almost always ignored and hence their exact role is unclear. Results We have performed a comprehensive sensitivity analysis of microarray breast cancer classification under the two types of feature variability mentioned above. We used data from six state of the art preprocessing methods, using a compendium consisting of eight diferent datasets, involving 1131 hybridizations, containing data from both one and two-color array technology. For a wide range of classifiers, we performed a joint study on performance, concordance and stability. In the stability analysis we explicitly tested classifiers for their noise tolerance by using perturbed expression profiles that are based on uncertainty information directly related to the preprocessing methods. Our results indicate that signature composition is strongly influenced by feature variability, even if the array platform and the stratification of patient samples are identical. In addition, we show that there is often a high level of discordance between individual class assignments for signatures constructed on data coming from different preprocessing schemes, even if the actual signature composition is identical. Conclusion Feature variability can have a strong impact on breast cancer signature composition, as well as the classification of individual patient samples. We therefore strongly recommend that feature variability is considered in analyzing data from microarray breast cancer expression profiling experiments. PMID:19941644

  14. Wavelet-based multiscale analysis of bioimpedance data measured by electric cell-substrate impedance sensing for classification of cancerous and normal cells.

    PubMed

    Das, Debanjan; Shiladitya, Kumar; Biswas, Karabi; Dutta, Pranab Kumar; Parekh, Aditya; Mandal, Mahitosh; Das, Soumen

    2015-12-01

    The paper presents a study to differentiate normal and cancerous cells using label-free bioimpedance signal measured by electric cell-substrate impedance sensing. The real-time-measured bioimpedance data of human breast cancer cells and human epithelial normal cells employs fluctuations of impedance value due to cellular micromotions resulting from dynamic structural rearrangement of membrane protrusions under nonagitated condition. Here, a wavelet-based multiscale quantitative analysis technique has been applied to analyze the fluctuations in bioimpedance. The study demonstrates a method to classify cancerous and normal cells from the signature of their impedance fluctuations. The fluctuations associated with cellular micromotion are quantified in terms of cellular energy, cellular power dissipation, and cellular moments. The cellular energy and power dissipation are found higher for cancerous cells associated with higher micromotions in cancer cells. The initial study suggests that proposed wavelet-based quantitative technique promises to be an effective method to analyze real-time bioimpedance signal for distinguishing cancer and normal cells.

  15. Efficacy of the Kyoto Classification of Gastritis in Identifying Patients at High Risk for Gastric Cancer.

    PubMed

    Sugimoto, Mitsushige; Ban, Hiromitsu; Ichikawa, Hitomi; Sahara, Shu; Otsuka, Taketo; Inatomi, Osamu; Bamba, Shigeki; Furuta, Takahisa; Andoh, Akira

    2017-01-01

    Objective The Kyoto gastritis classification categorizes the endoscopic characteristics of Helicobacter pylori (H. pylori) infection-associated gastritis and identifies patterns associated with a high risk of gastric cancer. We investigated its efficacy, comparing scores in patients with H. pylori-associated gastritis and with gastric cancer. Methods A total of 1,200 patients with H. pylori-positive gastritis alone (n=932), early-stage H. pylori-positive gastric cancer (n=189), and successfully treated H. pylori-negative cancer (n=79) were endoscopically graded according to the Kyoto gastritis classification for atrophy, intestinal metaplasia, fold hypertrophy, nodularity, and diffuse redness. Results The prevalence of O-II/O-III-type atrophy according to the Kimura-Takemoto classification in early-stage H. pylori-positive gastric cancer and successfully treated H. pylori-negative cancer groups was 45.1%, which was significantly higher than in subjects with gastritis alone (12.7%, p<0.001). Kyoto gastritis scores of atrophy and intestinal metaplasia in the H. pylori-positive cancer group were significantly higher than in subjects with gastritis alone (all p<0.001). No significant differences were noted in the rates of gastric fold hypertrophy or diffuse redness between the two groups. In a multivariate analysis, the risks for H. pylori-positive gastric cancer increased with intestinal metaplasia (odds ratio: 4.453, 95% confidence interval: 3.332-5.950, <0.001) and male sex (1.737, 1.102-2.739, p=0.017). Conclusion Making an appropriate diagnosis and detecting patients at high risk is crucial for achieving total eradication of gastric cancer. The scores of intestinal metaplasia and atrophy of the scoring system in the Kyoto gastritis classification may thus be useful for detecting these patients.

  16. Efficacy of the Kyoto Classification of Gastritis in Identifying Patients at High Risk for Gastric Cancer

    PubMed Central

    Sugimoto, Mitsushige; Ban, Hiromitsu; Ichikawa, Hitomi; Sahara, Shu; Otsuka, Taketo; Inatomi, Osamu; Bamba, Shigeki; Furuta, Takahisa; Andoh, Akira

    2017-01-01

    Objective The Kyoto gastritis classification categorizes the endoscopic characteristics of Helicobacter pylori (H. pylori) infection-associated gastritis and identifies patterns associated with a high risk of gastric cancer. We investigated its efficacy, comparing scores in patients with H. pylori-associated gastritis and with gastric cancer. Methods A total of 1,200 patients with H. pylori-positive gastritis alone (n=932), early-stage H. pylori-positive gastric cancer (n=189), and successfully treated H. pylori-negative cancer (n=79) were endoscopically graded according to the Kyoto gastritis classification for atrophy, intestinal metaplasia, fold hypertrophy, nodularity, and diffuse redness. Results The prevalence of O-II/O-III-type atrophy according to the Kimura-Takemoto classification in early-stage H. pylori-positive gastric cancer and successfully treated H. pylori-negative cancer groups was 45.1%, which was significantly higher than in subjects with gastritis alone (12.7%, p<0.001). Kyoto gastritis scores of atrophy and intestinal metaplasia in the H. pylori-positive cancer group were significantly higher than in subjects with gastritis alone (all p<0.001). No significant differences were noted in the rates of gastric fold hypertrophy or diffuse redness between the two groups. In a multivariate analysis, the risks for H. pylori-positive gastric cancer increased with intestinal metaplasia (odds ratio: 4.453, 95% confidence interval: 3.332-5.950, <0.001) and male sex (1.737, 1.102-2.739, p=0.017). Conclusion Making an appropriate diagnosis and detecting patients at high risk is crucial for achieving total eradication of gastric cancer. The scores of intestinal metaplasia and atrophy of the scoring system in the Kyoto gastritis classification may thus be useful for detecting these patients. PMID:28321054

  17. Classification and Compression of Multi-Resolution Vectors: A Tree Structured Vector Quantizer Approach

    DTIC Science & Technology

    2002-01-01

    their expression profile and for classification of cells into tumerous and non- tumerous classes. Then we will present a parallel tree method for... cancerous cells. We will use the same dataset and use tree structured classifiers with multi-resolution analysis for classifying cancerous from non- cancerous ...cells. We have the expressions of 4096 genes from 98 different cell types. Of these 98, 72 are cancerous while 26 are non- cancerous . We are interested

  18. Fluorescently labeled bevacizumab in human breast cancer: defining the classification threshold

    NASA Astrophysics Data System (ADS)

    Koch, Maximilian; de Jong, Johannes S.; Glatz, Jürgen; Symvoulidis, Panagiotis; Lamberts, Laetitia E.; Adams, Arthur L. L.; Kranendonk, Mariëtte E. G.; Terwisscha van Scheltinga, Anton G. T.; Aichler, Michaela; Jansen, Liesbeth; de Vries, Jakob; Lub-de Hooge, Marjolijn N.; Schröder, Carolien P.; Jorritsma-Smit, Annelies; Linssen, Matthijs D.; de Boer, Esther; van der Vegt, Bert; Nagengast, Wouter B.; Elias, Sjoerd G.; Oliveira, Sabrina; Witkamp, Arjen J.; Mali, Willem P. Th. M.; Van der Wall, Elsken; Garcia-Allende, P. Beatriz; van Diest, Paul J.; de Vries, Elisabeth G. E.; Walch, Axel; van Dam, Gooitzen M.; Ntziachristos, Vasilis

    2017-07-01

    In-vivo fluorescently labelled drug (bevacizumab) breast cancer specimen where obtained from patients. We propose a new structured method to determine the optimal classification threshold in targeted fluorescence intra-operative imaging.

  19. DNA methylation-based classification of central nervous system tumours.

    PubMed

    Capper, David; Jones, David T W; Sill, Martin; Hovestadt, Volker; Schrimpf, Daniel; Sturm, Dominik; Koelsche, Christian; Sahm, Felix; Chavez, Lukas; Reuss, David E; Kratz, Annekathrin; Wefers, Annika K; Huang, Kristin; Pajtler, Kristian W; Schweizer, Leonille; Stichel, Damian; Olar, Adriana; Engel, Nils W; Lindenberg, Kerstin; Harter, Patrick N; Braczynski, Anne K; Plate, Karl H; Dohmen, Hildegard; Garvalov, Boyan K; Coras, Roland; Hölsken, Annett; Hewer, Ekkehard; Bewerunge-Hudler, Melanie; Schick, Matthias; Fischer, Roger; Beschorner, Rudi; Schittenhelm, Jens; Staszewski, Ori; Wani, Khalida; Varlet, Pascale; Pages, Melanie; Temming, Petra; Lohmann, Dietmar; Selt, Florian; Witt, Hendrik; Milde, Till; Witt, Olaf; Aronica, Eleonora; Giangaspero, Felice; Rushing, Elisabeth; Scheurlen, Wolfram; Geisenberger, Christoph; Rodriguez, Fausto J; Becker, Albert; Preusser, Matthias; Haberler, Christine; Bjerkvig, Rolf; Cryan, Jane; Farrell, Michael; Deckert, Martina; Hench, Jürgen; Frank, Stephan; Serrano, Jonathan; Kannan, Kasthuri; Tsirigos, Aristotelis; Brück, Wolfgang; Hofer, Silvia; Brehmer, Stefanie; Seiz-Rosenhagen, Marcel; Hänggi, Daniel; Hans, Volkmar; Rozsnoki, Stephanie; Hansford, Jordan R; Kohlhof, Patricia; Kristensen, Bjarne W; Lechner, Matt; Lopes, Beatriz; Mawrin, Christian; Ketter, Ralf; Kulozik, Andreas; Khatib, Ziad; Heppner, Frank; Koch, Arend; Jouvet, Anne; Keohane, Catherine; Mühleisen, Helmut; Mueller, Wolf; Pohl, Ute; Prinz, Marco; Benner, Axel; Zapatka, Marc; Gottardo, Nicholas G; Driever, Pablo Hernáiz; Kramm, Christof M; Müller, Hermann L; Rutkowski, Stefan; von Hoff, Katja; Frühwald, Michael C; Gnekow, Astrid; Fleischhack, Gudrun; Tippelt, Stephan; Calaminus, Gabriele; Monoranu, Camelia-Maria; Perry, Arie; Jones, Chris; Jacques, Thomas S; Radlwimmer, Bernhard; Gessi, Marco; Pietsch, Torsten; Schramm, Johannes; Schackert, Gabriele; Westphal, Manfred; Reifenberger, Guido; Wesseling, Pieter; Weller, Michael; Collins, Vincent Peter; Blümcke, Ingmar; Bendszus, Martin; Debus, Jürgen; Huang, Annie; Jabado, Nada; Northcott, Paul A; Paulus, Werner; Gajjar, Amar; Robinson, Giles W; Taylor, Michael D; Jaunmuktane, Zane; Ryzhova, Marina; Platten, Michael; Unterberg, Andreas; Wick, Wolfgang; Karajannis, Matthias A; Mittelbronn, Michel; Acker, Till; Hartmann, Christian; Aldape, Kenneth; Schüller, Ulrich; Buslei, Rolf; Lichter, Peter; Kool, Marcel; Herold-Mende, Christel; Ellison, David W; Hasselblatt, Martin; Snuderl, Matija; Brandner, Sebastian; Korshunov, Andrey; von Deimling, Andreas; Pfister, Stefan M

    2018-03-22

    Accurate pathological diagnosis is crucial for optimal management of patients with cancer. For the approximately 100 known tumour types of the central nervous system, standardization of the diagnostic process has been shown to be particularly challenging-with substantial inter-observer variability in the histopathological diagnosis of many tumour types. Here we present a comprehensive approach for the DNA methylation-based classification of central nervous system tumours across all entities and age groups, and demonstrate its application in a routine diagnostic setting. We show that the availability of this method may have a substantial impact on diagnostic precision compared to standard methods, resulting in a change of diagnosis in up to 12% of prospective cases. For broader accessibility, we have designed a free online classifier tool, the use of which does not require any additional onsite data processing. Our results provide a blueprint for the generation of machine-learning-based tumour classifiers across other cancer entities, with the potential to fundamentally transform tumour pathology.

  20. IDM-PhyChm-Ens: intelligent decision-making ensemble methodology for classification of human breast cancer using physicochemical properties of amino acids.

    PubMed

    Ali, Safdar; Majid, Abdul; Khan, Asifullah

    2014-04-01

    Development of an accurate and reliable intelligent decision-making method for the construction of cancer diagnosis system is one of the fast growing research areas of health sciences. Such decision-making system can provide adequate information for cancer diagnosis and drug discovery. Descriptors derived from physicochemical properties of protein sequences are very useful for classifying cancerous proteins. Recently, several interesting research studies have been reported on breast cancer classification. To this end, we propose the exploitation of the physicochemical properties of amino acids in protein primary sequences such as hydrophobicity (Hd) and hydrophilicity (Hb) for breast cancer classification. Hd and Hb properties of amino acids, in recent literature, are reported to be quite effective in characterizing the constituent amino acids and are used to study protein foldings, interactions, structures, and sequence-order effects. Especially, using these physicochemical properties, we observed that proline, serine, tyrosine, cysteine, arginine, and asparagine amino acids offer high discrimination between cancerous and healthy proteins. In addition, unlike traditional ensemble classification approaches, the proposed 'IDM-PhyChm-Ens' method was developed by combining the decision spaces of a specific classifier trained on different feature spaces. The different feature spaces used were amino acid composition, split amino acid composition, and pseudo amino acid composition. Consequently, we have exploited different feature spaces using Hd and Hb properties of amino acids to develop an accurate method for classification of cancerous protein sequences. We developed ensemble classifiers using diverse learning algorithms such as random forest (RF), support vector machines (SVM), and K-nearest neighbor (KNN) trained on different feature spaces. We observed that ensemble-RF, in case of cancer classification, performed better than ensemble-SVM and ensemble-KNN. Our analysis demonstrates that ensemble-RF, ensemble-SVM and ensemble-KNN are more effective than their individual counterparts. The proposed 'IDM-PhyChm-Ens' method has shown improved performance compared to existing techniques.

  1. Identifying metastatic breast tumors using textural kinetic features of a contrast based habitat in DCE-MRI

    NASA Astrophysics Data System (ADS)

    Chaudhury, Baishali; Zhou, Mu; Goldgof, Dmitry B.; Hall, Lawrence O.; Gatenby, Robert A.; Gillies, Robert J.; Drukteinis, Jennifer S.

    2015-03-01

    The ability to identify aggressive tumors from indolent tumors using quantitative analysis on dynamic contrast enhanced magnetic resonance imaging (DCE-MRI) would dramatically change the breast cancer treatment paradigm. With this prognostic information, patients with aggressive tumors that have the ability to spread to distant sites outside of the breast could be selected for more aggressive treatment and surveillance regimens. Conversely, patients with tumors that do not have the propensity to metastasize could be treated less aggressively, avoiding some of the morbidity associated with surgery, radiation and chemotherapy. We propose a computer aided detection framework to determine which breast cancers will metastasize to the loco-regional lymph nodes as well as which tumors will eventually go on to develop distant metastses using quantitative image analysis and radiomics. We defined a new contrast based tumor habitat and analyzed textural kinetic features from this habitat for classification purposes. The proposed tumor habitat, which we call combined-habitat, is derived from the intersection of two individual tumor sub-regions: one that exhibits rapid initial contrast uptake and the other that exhibits rapid delayed contrast washout. Hence the combined-habitat represents the tumor sub-region within which the pixels undergo both rapid initial uptake and rapid delayed washout. We analyzed a dataset of twenty-seven representative two dimensional (2D) images from volumetric DCE-MRI of breast tumors, for classification of tumors with no lymph nodes from tumors with positive number of axillary lymph nodes. For this classification an accuracy of 88.9% was achieved. Twenty of the twenty-seven patients were analyzed for classification of distant metastatic tumors from indolent cancers (tumors with no lymph nodes), for which the accuracy was 84.3%.

  2. Human Papillomavirus Type 16 Genetic Variants: Phylogeny and Classification Based on E6 and LCR

    PubMed Central

    Gheit, Tarik; Franceschi, Silvia; Vignat, Jerome; Burk, Robert D.; Sylla, Bakary S.; Tommasino, Massimo; Clifford, Gary M.

    2012-01-01

    Naturally occurring genetic variants of human papillomavirus type 16 (HPV16) are common and have previously been classified into 4 major lineages; European-Asian (EAS), including the sublineages European (EUR) and Asian (As), African 1 (AFR1), African 2 (AFR2), and North-American/Asian-American (NA/AA). We aimed to improve the classification of HPV16 variant lineages by using a large resource of HPV16-positive cervical samples collected from geographically diverse populations in studies on HPV and/or cervical cancer undertaken by the International Agency for Research on Cancer. In total, we sequenced the entire E6 genes and long control regions (LCRs) of 953 HPV16 isolates from 27 different countries worldwide. Phylogenetic analyses confirmed previously described variant lineages and subclassifications. We characterized two new sublineages within each of the lineages AFR1 and AFR2 that are robustly classified using E6 and/or the LCR. We could differentiate previously identified AA1, AA2, and NA sublineages, although they could not be distinguished by E6 alone, requiring the LCR for correct phylogenetic classification. We thus provide a classification system for HPV16 genomes based on 13 and 32 phylogenetically distinguishing positions in E6 and the LCR, respectively, that distinguish nine HPV16 variant sublineages (EUR, As, AFR1a, AFR1b, AFR2a, AFR2b, NA, AA1, and AA2). Ninety-seven percent of all 953 samples fitted this classification perfectly. Other positions were frequently polymorphic within one or more lineages but did not define phylogenetic subgroups. Such a standardized classification of HPV16 variants is important for future epidemiological and biological studies of the carcinogenic potential of HPV16 variant lineages. PMID:22491459

  3. Human papillomavirus type 16 genetic variants: phylogeny and classification based on E6 and LCR.

    PubMed

    Cornet, Iris; Gheit, Tarik; Franceschi, Silvia; Vignat, Jerome; Burk, Robert D; Sylla, Bakary S; Tommasino, Massimo; Clifford, Gary M

    2012-06-01

    Naturally occurring genetic variants of human papillomavirus type 16 (HPV16) are common and have previously been classified into 4 major lineages; European-Asian (EAS), including the sublineages European (EUR) and Asian (As), African 1 (AFR1), African 2 (AFR2), and North-American/Asian-American (NA/AA). We aimed to improve the classification of HPV16 variant lineages by using a large resource of HPV16-positive cervical samples collected from geographically diverse populations in studies on HPV and/or cervical cancer undertaken by the International Agency for Research on Cancer. In total, we sequenced the entire E6 genes and long control regions (LCRs) of 953 HPV16 isolates from 27 different countries worldwide. Phylogenetic analyses confirmed previously described variant lineages and subclassifications. We characterized two new sublineages within each of the lineages AFR1 and AFR2 that are robustly classified using E6 and/or the LCR. We could differentiate previously identified AA1, AA2, and NA sublineages, although they could not be distinguished by E6 alone, requiring the LCR for correct phylogenetic classification. We thus provide a classification system for HPV16 genomes based on 13 and 32 phylogenetically distinguishing positions in E6 and the LCR, respectively, that distinguish nine HPV16 variant sublineages (EUR, As, AFR1a, AFR1b, AFR2a, AFR2b, NA, AA1, and AA2). Ninety-seven percent of all 953 samples fitted this classification perfectly. Other positions were frequently polymorphic within one or more lineages but did not define phylogenetic subgroups. Such a standardized classification of HPV16 variants is important for future epidemiological and biological studies of the carcinogenic potential of HPV16 variant lineages.

  4. Research on a pulmonary nodule segmentation method combining fast self-adaptive FCM and classification.

    PubMed

    Liu, Hui; Zhang, Cai-Ming; Su, Zhi-Yuan; Wang, Kai; Deng, Kai

    2015-01-01

    The key problem of computer-aided diagnosis (CAD) of lung cancer is to segment pathologically changed tissues fast and accurately. As pulmonary nodules are potential manifestation of lung cancer, we propose a fast and self-adaptive pulmonary nodules segmentation method based on a combination of FCM clustering and classification learning. The enhanced spatial function considers contributions to fuzzy membership from both the grayscale similarity between central pixels and single neighboring pixels and the spatial similarity between central pixels and neighborhood and improves effectively the convergence rate and self-adaptivity of the algorithm. Experimental results show that the proposed method can achieve more accurate segmentation of vascular adhesion, pleural adhesion, and ground glass opacity (GGO) pulmonary nodules than other typical algorithms.

  5. Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer Metabolomics Data.

    PubMed

    Alakwaa, Fadhl M; Chaudhary, Kumardeep; Garmire, Lana X

    2018-01-05

    Metabolomics holds the promise as a new technology to diagnose highly heterogeneous diseases. Conventionally, metabolomics data analysis for diagnosis is done using various statistical and machine learning based classification methods. However, it remains unknown if deep neural network, a class of increasingly popular machine learning methods, is suitable to classify metabolomics data. Here we use a cohort of 271 breast cancer tissues, 204 positive estrogen receptor (ER+), and 67 negative estrogen receptor (ER-) to test the accuracies of feed-forward networks, a deep learning (DL) framework, as well as six widely used machine learning models, namely random forest (RF), support vector machines (SVM), recursive partitioning and regression trees (RPART), linear discriminant analysis (LDA), prediction analysis for microarrays (PAM), and generalized boosted models (GBM). DL framework has the highest area under the curve (AUC) of 0.93 in classifying ER+/ER- patients, compared to the other six machine learning algorithms. Furthermore, the biological interpretation of the first hidden layer reveals eight commonly enriched significant metabolomics pathways (adjusted P-value <0.05) that cannot be discovered by other machine learning methods. Among them, protein digestion and absorption and ATP-binding cassette (ABC) transporters pathways are also confirmed in integrated analysis between metabolomics and gene expression data in these samples. In summary, deep learning method shows advantages for metabolomics based breast cancer ER status classification, with both the highest prediction accuracy (AUC = 0.93) and better revelation of disease biology. We encourage the adoption of feed-forward networks based deep learning method in the metabolomics research community for classification.

  6. Exceptions to the rule: case studies in the prediction of pathogenicity for genetic variants in hereditary cancer genes.

    PubMed

    Rosenthal, E T; Bowles, K R; Pruss, D; van Kan, A; Vail, P J; McElroy, H; Wenstrup, R J

    2015-12-01

    Based on current consensus guidelines and standard practice, many genetic variants detected in clinical testing are classified as disease causing based on their predicted impact on the normal expression or function of the gene in the absence of additional data. However, our laboratory has identified a subset of such variants in hereditary cancer genes for which compelling contradictory evidence emerged after the initial evaluation following the first observation of the variant. Three representative examples of variants in BRCA1, BRCA2 and MSH2 that are predicted to disrupt splicing, prematurely truncate the protein, or remove the start codon were evaluated for pathogenicity by analyzing clinical data with multiple classification algorithms. Available clinical data for all three variants contradicts the expected pathogenic classification. These variants illustrate potential pitfalls associated with standard approaches to variant classification as well as the challenges associated with monitoring data, updating classifications, and reporting potentially contradictory interpretations to the clinicians responsible for translating test outcomes to appropriate clinical action. It is important to address these challenges now as the model for clinical testing moves toward the use of large multi-gene panels and whole exome/genome analysis, which will dramatically increase the number of genetic variants identified. © 2015 The Authors. Clinical Genetics published by John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  7. Long-term outcome of 2920 patients with cancers of the esophagus and esophagogastric junction: evaluation of the New Union Internationale Contre le Cancer/American Joint Cancer Committee staging system.

    PubMed

    Gertler, Ralf; Stein, Hubert J; Langer, Rupert; Nettelmann, Marc; Schuster, Tibor; Hoefler, Heinz; Siewert, Joerg-Ruediger; Feith, Marcus

    2011-04-01

    We analyzed the long-term outcome of patients operated for esophageal cancer and evaluated the new seventh edition of the tumor-node-metastasis classification for cancers of the esophagus. Retrospective analysis and new classification. Data of a single-center cohort of 2920 patients operated for cancers of the esophagus according to the seventh edition are presented. Statistical methods to evaluate survival and the prognostic performance of the staging systems included Kaplan-Meier analyses and time-dependent receiver-operating-characteristic-analysis. Union Internationale Contre le Cancer stage, R-status, histologic tumor type and age were identified as independent prognostic factors for cancers of the esophagus. Grade and tumor site, additional parameters in the new American Joint Cancer Committee prognostic groupings, were not significantly correlated with survival. Esophageal adenocarcinoma showed a significantly better long-term prognosis after resection than squamous cell carcinoma (P < 0.0001). The new number-dependent N-classification proved superior to the former site-dependent classification with significantly decreasing prognosis with the increasing number of lymph node metastases (P < 0.001). The new subclassification of T1 tumors also revealed significant differences in prognosis between pT1a and pT1b patients (P < 0.001). However, the multiple new Union Internationale Contre le Cancer and American Joint Cancer Committee subgroupings did not prove distinctive for survival between stages IIA and IIB, between IIIA and IIIB, and between IIIC and IV. The new seventh edition of the tumor-node-metastasis classification improved the predictive ability for cancers of the esophagus; however, stage groups could be condensed to a clinically relevant number. Differences in patient characteristics, pathogenesis, and especially survival clearly identify adenocarcinomas and squamous cell carcinoma of the esophagus as 2 separate tumor entities requiring differentiated therapeutic concepts.

  8. Computer-Aided Breast Cancer Diagnosis with Optimal Feature Sets: Reduction Rules and Optimization Techniques.

    PubMed

    Mathieson, Luke; Mendes, Alexandre; Marsden, John; Pond, Jeffrey; Moscato, Pablo

    2017-01-01

    This chapter introduces a new method for knowledge extraction from databases for the purpose of finding a discriminative set of features that is also a robust set for within-class classification. Our method is generic and we introduce it here in the field of breast cancer diagnosis from digital mammography data. The mathematical formalism is based on a generalization of the k-Feature Set problem called (α, β)-k-Feature Set problem, introduced by Cotta and Moscato (J Comput Syst Sci 67(4):686-690, 2003). This method proceeds in two steps: first, an optimal (α, β)-k-feature set of minimum cardinality is identified and then, a set of classification rules using these features is obtained. We obtain the (α, β)-k-feature set in two phases; first a series of extremely powerful reduction techniques, which do not lose the optimal solution, are employed; and second, a metaheuristic search to identify the remaining features to be considered or disregarded. Two algorithms were tested with a public domain digital mammography dataset composed of 71 malignant and 75 benign cases. Based on the results provided by the algorithms, we obtain classification rules that employ only a subset of these features.

  9. On the International Agency for Research on Cancer classification of glyphosate as a probable human carcinogen.

    PubMed

    Tarone, Robert E

    2018-01-01

    The recent classification by International Agency for Research on Cancer (IARC) of the herbicide glyphosate as a probable human carcinogen has generated considerable discussion. The classification is at variance with evaluations of the carcinogenic potential of glyphosate by several national and international regulatory bodies. The basis for the IARC classification is examined under the assumptions that the IARC criteria are reasonable and that the body of scientific studies determined by IARC staff to be relevant to the evaluation of glyphosate by the Monograph Working Group is sufficiently complete. It is shown that the classification of glyphosate as a probable human carcinogen was the result of a flawed and incomplete summary of the experimental evidence evaluated by the Working Group. Rational and effective cancer prevention activities depend on scientifically sound and unbiased assessments of the carcinogenic potential of suspected agents. Implications of the erroneous classification of glyphosate with respect to the IARC Monograph Working Group deliberative process are discussed.

  10. Detection and classification of Breast Cancer in Wavelet Sub-bands of Fractal Segmented Cancerous Zones.

    PubMed

    Shirazinodeh, Alireza; Noubari, Hossein Ahmadi; Rabbani, Hossein; Dehnavi, Alireza Mehri

    2015-01-01

    Recent studies on wavelet transform and fractal modeling applied on mammograms for the detection of cancerous tissues indicate that microcalcifications and masses can be utilized for the study of the morphology and diagnosis of cancerous cases. It is shown that the use of fractal modeling, as applied to a given image, can clearly discern cancerous zones from noncancerous areas. In this paper, for fractal modeling, the original image is first segmented into appropriate fractal boxes followed by identifying the fractal dimension of each windowed section using a computationally efficient two-dimensional box-counting algorithm. Furthermore, using appropriate wavelet sub-bands and image Reconstruction based on modified wavelet coefficients, it is shown that it is possible to arrive at enhanced features for detection of cancerous zones. In this paper, we have attempted to benefit from the advantages of both fractals and wavelets by introducing a new algorithm. By using a new algorithm named F1W2, the original image is first segmented into appropriate fractal boxes, and the fractal dimension of each windowed section is extracted. Following from that, by applying a maximum level threshold on fractal dimensions matrix, the best-segmented boxes are selected. In the next step, the segmented Cancerous zones which are candidates are then decomposed by utilizing standard orthogonal wavelet transform and db2 wavelet in three different resolution levels, and after nullifying wavelet coefficients of the image at the first scale and low frequency band of the third scale, the modified reconstructed image is successfully utilized for detection of breast cancer regions by applying an appropriate threshold. For detection of cancerous zones, our simulations indicate the accuracy of 90.9% for masses and 88.99% for microcalcifications detection results using the F1W2 method. For classification of detected mictocalcification into benign and malignant cases, eight features are identified and utilized in radial basis function neural network. Our simulation results indicate the accuracy of 92% classification using F1W2 method.

  11. Maastricht Delphi Consensus on Event Definitions for Classification of Recurrence in Breast Cancer Research

    PubMed Central

    van Roozendaal, Lori M.; Strobbe, Luc J. A.; Aebi, Stefan; Cameron, David A.; Dixon, J. Michael; Giuliano, Armando E.; Haffty, Bruce G.; Hickey, Brigid E.; Hudis, Clifford A.; Klimberg, V. Suzanne; Koczwara, Bogda; Kühn, Thorsten; Lippman, Marc E.; Lucci, Anthony; Piccart, Martine; Smith, Benjamin D.; Tjan-Heijnen, Vivianne C. G.; van de Velde, Cornelis J. H.; Van Zee, Kimberly J.; Vermorken, Jan B.; Viale, Giuseppe; Voogd, Adri C.; Wapnir, Irene L.; White, Julia R.; Smidt, Marjolein L.

    2014-01-01

    Background In breast cancer studies, many different endpoints are used. Definitions are often not provided or vary between studies. For instance, “local recurrence” may include different components in similar studies. This limits transparency and comparability of results. This project aimed to reach consensus on the definitions of local event, second primary breast cancer, regional and distant event for breast cancer studies. Methods The RAND-UCLA Appropriateness method (modified Delphi method) was used. A Consensus Group of international breast cancer experts was formed, including representatives of all involved clinical disciplines. Consensus was reached in two rounds of online questionnaires and one meeting. Results Twenty-four international breast cancer experts participated. Consensus was reached on 134 items in four categories. Local event is defined as any epithelial breast cancer or ductal carcinoma in situ (DCIS) in the ipsilateral breast, or skin and subcutaneous tissue on the ipsilateral thoracic wall. Second primary breast cancer is defined as epithelial breast cancer in the contralateral breast. Regional events are breast cancer in ipsilateral lymph nodes. A distant event is breast cancer in any other location. Therefore, this includes metastasis in contralateral lymph nodes and breast cancer involving the sternal bone. If feasible, tissue sampling of a first, solitary, lesion suspected for metastasis is highly recommended. Conclusion This project resulted in consensus-based event definitions for classification of recurrence in breast cancer research. Future breast cancer research projects should adopt these definitions to increase transparency. This should facilitate comparison of results and conducting reviews as well as meta-analysis. PMID:25381395

  12. SVM classifier on chip for melanoma detection.

    PubMed

    Afifi, Shereen; GholamHosseini, Hamid; Sinha, Roopak

    2017-07-01

    Support Vector Machine (SVM) is a common classifier used for efficient classification with high accuracy. SVM shows high accuracy for classifying melanoma (skin cancer) clinical images within computer-aided diagnosis systems used by skin cancer specialists to detect melanoma early and save lives. We aim to develop a medical low-cost handheld device that runs a real-time embedded SVM-based diagnosis system for use in primary care for early detection of melanoma. In this paper, an optimized SVM classifier is implemented onto a recent FPGA platform using the latest design methodology to be embedded into the proposed device for realizing online efficient melanoma detection on a single system on chip/device. The hardware implementation results demonstrate a high classification accuracy of 97.9% and a significant acceleration factor of 26 from equivalent software implementation on an embedded processor, with 34% of resources utilization and 2 watts for power consumption. Consequently, the implemented system meets crucial embedded systems constraints of high performance and low cost, resources utilization and power consumption, while achieving high classification accuracy.

  13. A Comprehensive Repository of Normal and Tumor Human Breast Tissues and Cells

    DTIC Science & Technology

    1999-07-01

    mother was reported to have had cancer of the uterine cervix at the age of 22. Both maternal grandparents had died of colon cancer in their sixties...1 mutation). The repository also includes breast epithelial and stromal cell strains derived from non cancerous breast tissue as well as peripheral...tissue banks. 14. SUBJECT TERMS Breast Cancer 17. SECURITY CLASSIFICATION OF REPORT Unclassified 18. SECURITY CLASSIFICATION OF THIS PAGE

  14. Analysis of framelets for breast cancer diagnosis.

    PubMed

    Thivya, K S; Sakthivel, P; Venkata Sai, P M

    2016-01-01

    Breast cancer is the second threatening tumor among the women. The effective way of reducing breast cancer is its early detection which helps to improve the diagnosing process. Digital mammography plays a significant role in mammogram screening at earlier stage of breast carcinoma. Even though, it is very difficult to find accurate abnormality in prevalent screening by radiologists. But the possibility of precise breast cancer screening is encouraged by predicting the accurate type of abnormality through Computer Aided Diagnosis (CAD) systems. The two most important indicators of breast malignancy are microcalcifications and masses. In this study, framelet transform, a multiresolutional analysis is investigated for the classification of the above mentioned two indicators. The statistical and co-occurrence features are extracted from the framelet decomposed mammograms with different resolution levels and support vector machine is employed for classification with k-fold cross validation. This system achieves 94.82% and 100% accuracy in normal/abnormal classification (stage I) and benign/malignant classification (stage II) of mass classification system and 98.57% and 100% for microcalcification system when using the MIAS database.

  15. Cascaded discrimination of normal, abnormal, and confounder classes in histopathology: Gleason grading of prostate cancer

    PubMed Central

    2012-01-01

    Background Automated classification of histopathology involves identification of multiple classes, including benign, cancerous, and confounder categories. The confounder tissue classes can often mimic and share attributes with both the diseased and normal tissue classes, and can be particularly difficult to identify, both manually and by automated classifiers. In the case of prostate cancer, they may be several confounding tissue types present in a biopsy sample, posing as major sources of diagnostic error for pathologists. Two common multi-class approaches are one-shot classification (OSC), where all classes are identified simultaneously, and one-versus-all (OVA), where a “target” class is distinguished from all “non-target” classes. OSC is typically unable to handle discrimination of classes of varying similarity (e.g. with images of prostate atrophy and high grade cancer), while OVA forces several heterogeneous classes into a single “non-target” class. In this work, we present a cascaded (CAS) approach to classifying prostate biopsy tissue samples, where images from different classes are grouped to maximize intra-group homogeneity while maximizing inter-group heterogeneity. Results We apply the CAS approach to categorize 2000 tissue samples taken from 214 patient studies into seven classes: epithelium, stroma, atrophy, prostatic intraepithelial neoplasia (PIN), and prostate cancer Gleason grades 3, 4, and 5. A series of increasingly granular binary classifiers are used to split the different tissue classes until the images have been categorized into a single unique class. Our automatically-extracted image feature set includes architectural features based on location of the nuclei within the tissue sample as well as texture features extracted on a per-pixel level. The CAS strategy yields a positive predictive value (PPV) of 0.86 in classifying the 2000 tissue images into one of 7 classes, compared with the OVA (0.77 PPV) and OSC approaches (0.76 PPV). Conclusions Use of the CAS strategy increases the PPV for a multi-category classification system over two common alternative strategies. In classification problems such as histopathology, where multiple class groups exist with varying degrees of heterogeneity, the CAS system can intelligently assign class labels to objects by performing multiple binary classifications according to domain knowledge. PMID:23110677

  16. Prioritization of reproductive toxicants in unconventional oil and gas operations using a multi-country regulatory data-driven hazard assessment.

    PubMed

    Inayat-Hussain, Salmaan H; Fukumura, Masao; Muiz Aziz, A; Jin, Chai Meng; Jin, Low Wei; Garcia-Milian, Rolando; Vasiliou, Vasilis; Deziel, Nicole C

    2018-08-01

    Recent trends have witnessed the global growth of unconventional oil and gas (UOG) production. Epidemiologic studies have suggested associations between proximity to UOG operations with increased adverse birth outcomes and cancer, though specific potential etiologic agents have not yet been identified. To perform effective risk assessment of chemicals used in UOG production, the first step of hazard identification followed by prioritization specifically for reproductive toxicity, carcinogenicity and mutagenicity is crucial in an evidence-based risk assessment approach. To date, there is no single hazard classification list based on the United Nations Globally Harmonized System (GHS), with countries applying the GHS standards to generate their own chemical hazard classification lists. A current challenge for chemical prioritization, particularly for a multi-national industry, is inconsistent hazard classification which may result in misjudgment of the potential public health risks. We present a novel approach for hazard identification followed by prioritization of reproductive toxicants found in UOG operations using publicly available regulatory databases. GHS classification for reproductive toxicity of 157 UOG-related chemicals identified as potential reproductive or developmental toxicants in a previous publication was assessed using eleven governmental regulatory agency databases. If there was discordance in classifications across agencies, the most stringent classification was assigned. Chemicals in the category of known or presumed human reproductive toxicants were further evaluated for carcinogenicity and germ cell mutagenicity based on government classifications. A scoring system was utilized to assign numerical values for reproductive health, cancer and germ cell mutation hazard endpoints. Using a Cytoscape analysis, both qualitative and quantitative results were presented visually to readily identify high priority UOG chemicals with evidence of multiple adverse effects. We observed substantial inconsistencies in classification among the 11 databases. By adopting the most stringent classification within and across countries, 43 chemicals were classified as known or presumed human reproductive toxicants (GHS Category 1), while 31 chemicals were classified as suspected human reproductive toxicants (GHS Category 2). The 43 reproductive toxicants were further subjected to analysis for carcinogenic and mutagenic properties. Calculated hazard scores and Cytoscape visualization yielded several high priority chemicals including potassium dichromate, cadmium, benzene and ethylene oxide. Our findings reveal diverging GHS classification outcomes for UOG chemicals across regulatory agencies. Adoption of the most stringent classification with application of hazard scores provides a useful approach to prioritize reproductive toxicants in UOG and other industries for exposure assessments and selection of safer alternatives. Copyright © 2018 Elsevier Ltd. All rights reserved.

  17. Automatic breast density classification using a convolutional neural network architecture search procedure

    NASA Astrophysics Data System (ADS)

    Fonseca, Pablo; Mendoza, Julio; Wainer, Jacques; Ferrer, Jose; Pinto, Joseph; Guerrero, Jorge; Castaneda, Benjamin

    2015-03-01

    Breast parenchymal density is considered a strong indicator of breast cancer risk and therefore useful for preventive tasks. Measurement of breast density is often qualitative and requires the subjective judgment of radiologists. Here we explore an automatic breast composition classification workflow based on convolutional neural networks for feature extraction in combination with a support vector machines classifier. This is compared to the assessments of seven experienced radiologists. The experiments yielded an average kappa value of 0.58 when using the mode of the radiologists' classifications as ground truth. Individual radiologist performance against this ground truth yielded kappa values between 0.56 and 0.79.

  18. Translating genomic information into clinical medicine: lung cancer as a paradigm.

    PubMed

    Levy, Mia A; Lovly, Christine M; Pao, William

    2012-11-01

    We are currently in an era of rapidly expanding knowledge about the genetic landscape and architectural blueprints of various cancers. These discoveries have led to a new taxonomy of malignant diseases based upon clinically relevant molecular alterations in addition to histology or tissue of origin. The new molecularly based classification holds the promise of rational rather than empiric approaches for the treatment of cancer patients. However, the accelerated pace of discovery and the expanding number of targeted anti-cancer therapies present a significant challenge for healthcare practitioners to remain informed and up-to-date on how to apply cutting-edge discoveries into daily clinical practice. In this Perspective, we use lung cancer as a paradigm to discuss challenges related to translating genomic information into the clinic, and we present one approach we took at Vanderbilt-Ingram Cancer Center to address these challenges.

  19. A Deep Convolutional Neural Network for segmenting and classifying epithelial and stromal regions in histopathological images

    PubMed Central

    Xu, Jun; Luo, Xiaofei; Wang, Guanhao; Gilmore, Hannah; Madabhushi, Anant

    2016-01-01

    Epithelial (EP) and stromal (ST) are two types of tissues in histological images. Automated segmentation or classification of EP and ST tissues is important when developing computerized system for analyzing the tumor microenvironment. In this paper, a Deep Convolutional Neural Networks (DCNN) based feature learning is presented to automatically segment or classify EP and ST regions from digitized tumor tissue microarrays (TMAs). Current approaches are based on handcraft feature representation, such as color, texture, and Local Binary Patterns (LBP) in classifying two regions. Compared to handcrafted feature based approaches, which involve task dependent representation, DCNN is an end-to-end feature extractor that may be directly learned from the raw pixel intensity value of EP and ST tissues in a data driven fashion. These high-level features contribute to the construction of a supervised classifier for discriminating the two types of tissues. In this work we compare DCNN based models with three handcraft feature extraction based approaches on two different datasets which consist of 157 Hematoxylin and Eosin (H&E) stained images of breast cancer and 1376 immunohistological (IHC) stained images of colorectal cancer, respectively. The DCNN based feature learning approach was shown to have a F1 classification score of 85%, 89%, and 100%, accuracy (ACC) of 84%, 88%, and 100%, and Matthews Correlation Coefficient (MCC) of 86%, 77%, and 100% on two H&E stained (NKI and VGH) and IHC stained data, respectively. Our DNN based approach was shown to outperform three handcraft feature extraction based approaches in terms of the classification of EP and ST regions. PMID:28154470

  20. A Deep Convolutional Neural Network for segmenting and classifying epithelial and stromal regions in histopathological images.

    PubMed

    Xu, Jun; Luo, Xiaofei; Wang, Guanhao; Gilmore, Hannah; Madabhushi, Anant

    2016-05-26

    Epithelial (EP) and stromal (ST) are two types of tissues in histological images. Automated segmentation or classification of EP and ST tissues is important when developing computerized system for analyzing the tumor microenvironment. In this paper, a Deep Convolutional Neural Networks (DCNN) based feature learning is presented to automatically segment or classify EP and ST regions from digitized tumor tissue microarrays (TMAs). Current approaches are based on handcraft feature representation, such as color, texture, and Local Binary Patterns (LBP) in classifying two regions. Compared to handcrafted feature based approaches, which involve task dependent representation, DCNN is an end-to-end feature extractor that may be directly learned from the raw pixel intensity value of EP and ST tissues in a data driven fashion. These high-level features contribute to the construction of a supervised classifier for discriminating the two types of tissues. In this work we compare DCNN based models with three handcraft feature extraction based approaches on two different datasets which consist of 157 Hematoxylin and Eosin (H&E) stained images of breast cancer and 1376 immunohistological (IHC) stained images of colorectal cancer, respectively. The DCNN based feature learning approach was shown to have a F1 classification score of 85%, 89%, and 100%, accuracy (ACC) of 84%, 88%, and 100%, and Matthews Correlation Coefficient (MCC) of 86%, 77%, and 100% on two H&E stained (NKI and VGH) and IHC stained data, respectively. Our DNN based approach was shown to outperform three handcraft feature extraction based approaches in terms of the classification of EP and ST regions.

  1. Comparison of the prevalence of malnutrition diagnosis in head and neck, gastrointestinal and lung cancer patients by three classification methods

    PubMed Central

    Platek, Mary E.; Popp KPf, Johann V.; Possinger, Candi S.; DeNysschen, Carol A.; Horvath, Peter; Brown, Jean K.

    2011-01-01

    Background Malnutrition is prevalent among patients within certain cancer types. There is lack of universal standard of care for nutrition screening, lack of agreement on an operational definition and on validity of malnutrition indicators. Objective In a secondary data analysis, we investigated prevalence of malnutrition diagnosis by three classification methods using data from medical records of a National Cancer Institute (NCI)-designated comprehensive cancer center. Interventions/Methods Records of 227 patients hospitalized during 1998 with head and neck, gastrointestinal or lung cancer were reviewed for malnutrition based on three methods: 1) physician diagnosed malnutrition related ICD-9 codes; 2) in-hospital nutritional assessment summary conducted by Registered Dietitians; and 3) body mass index (BMI). For patients with multiple admissions, only data from the first hospitalization was included. Results Prevalence of malnutrition diagnosis ranged from 8.8% based on BMI to approximately 26% of all cases based on dietitian assessment. Kappa coefficients between any methods indicated a weak (kappa=0.23, BMI and Dietitians and kappa=0.28, Dietitians and Physicians) to fair strength of agreement (kappa=0.38, BMI and Physicians). Conclusions Available methods to identify patients with malnutrition in an NCI designated comprehensive cancer center resulted in varied prevalence of malnutrition diagnosis. Universal standard of care for nutrition screening that utilizes validated tools is needed. Implications for Practice The Joint Commission on the Accreditation of Healthcare Organizations requires nutritional screening of patients within 24 hours of admission. For this purpose, implementation of a validated tool that can be used by various healthcare practitioners, including nurses, needs to be considered. PMID:21242767

  2. Nomogram for predicting the benefit of neoadjuvant chemoradiotherapy for patients with esophageal cancer: a SEER-Medicare analysis.

    PubMed

    Eil, Robert; Diggs, Brian S; Wang, Samuel J; Dolan, James P; Hunter, John G; Thomas, Charles R

    2014-02-15

    The survival impact of neoadjuvant chemoradiotherapy (CRT) on esophageal cancer remains difficult to establish for specific patients. The aim of the current study was to create a Web-based prediction tool providing individualized survival projections based on tumor and treatment data. Patients diagnosed with esophageal cancer between 1997 and 2005 were selected from the Surveillance, Epidemiology, and End Results (SEER)-Medicare database. The covariates analyzed were sex, T and N classification, histology, total number of lymph nodes examined, and treatment with esophagectomy or CRT followed by esophagectomy. After propensity score weighting, a log-logistic regression model for overall survival was selected based on the Akaike information criterion. A total of 824 patients with esophageal cancer who were treated with esophagectomy or trimodal therapy met the selection criteria. On multivariate analysis, age, sex, T and N classification, number of lymph nodes examined, treatment, and histology were found to be significantly associated with overall survival and were included in the regression analysis. Preoperative staging data and final surgical margin status were not available within the SEER-Medicare data set and therefore were not included. The model predicted that patients with T4 or lymph node disease benefitted from CRT. The internally validated concordance index was 0.72. The SEER-Medicare database of patients with esophageal cancer can be used to produce a survival prediction tool that: 1) serves as a counseling and decision aid to patients and 2) assists in risk modeling. Patients with T4 or lymph node disease appeared to benefit from CRT. This nomogram may underestimate the benefit of CRT due to its variable downstaging effect on pathologic stage. It is available at skynet.ohsu.edu/nomograms. © 2013 American Cancer Society.

  3. Rational bases for the use of the Immunoscore in routine clinical settings as a prognostic and predictive biomarker in cancer patients.

    PubMed

    Kirilovsky, Amos; Marliot, Florence; El Sissy, Carine; Haicheur, Nacilla; Galon, Jérôme; Pagès, Franck

    2016-08-01

    The American Joint Committee on Cancer/Union Internationale Contre le Cancer (AJCC/UICC) tumor, nodes, metastasis (TNM) classification system based on tumor features is used for prognosis estimation and treatment recommendations in most cancers. However, the clinical outcome can vary significantly among patients within the same tumor stage and TNM classification does not predict response to therapy. Therefore, many efforts have been focused on the identification of new markers. Multiple tumor cell-based approaches have been proposed but very few have been translated into the clinic. The recent demonstration of the essential role of the immune system in tumor progression has allowed great advances in the understanding of this complex disease and in the design of novel therapies. The analysis of the immune infiltrate by imaging techniques in large patient cohorts highlighted the prognostic impact of the in situ immune cell infiltrate in tumors. Moreover, the characterization of the immune infiltrates (e.g. type, density, distribution within the tumor, phenotype, activation status) in patients treated with checkpoint-blockade strategies could provide information to predict the disease outcome. In colorectal cancer, we have developed a prognostic score ('Immunoscore') that takes into account the distribution of the density of both CD3(+) lymphocytes and CD8(+) cytotoxic T cells in the tumor core and the invasive margin that could outperform TNM staging. Currently, an international retrospective study is under way to validate the Immunoscore prognostic performance in patients with colon cancer. The use of Immunoscore in clinical practice could improve the patients' prognostic assessment and therapeutic management. © The Japanese Society for Immunology. 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  4. FSR: feature set reduction for scalable and accurate multi-class cancer subtype classification based on copy number.

    PubMed

    Wong, Gerard; Leckie, Christopher; Kowalczyk, Adam

    2012-01-15

    Feature selection is a key concept in machine learning for microarray datasets, where features represented by probesets are typically several orders of magnitude larger than the available sample size. Computational tractability is a key challenge for feature selection algorithms in handling very high-dimensional datasets beyond a hundred thousand features, such as in datasets produced on single nucleotide polymorphism microarrays. In this article, we present a novel feature set reduction approach that enables scalable feature selection on datasets with hundreds of thousands of features and beyond. Our approach enables more efficient handling of higher resolution datasets to achieve better disease subtype classification of samples for potentially more accurate diagnosis and prognosis, which allows clinicians to make more informed decisions in regards to patient treatment options. We applied our feature set reduction approach to several publicly available cancer single nucleotide polymorphism (SNP) array datasets and evaluated its performance in terms of its multiclass predictive classification accuracy over different cancer subtypes, its speedup in execution as well as its scalability with respect to sample size and array resolution. Feature Set Reduction (FSR) was able to reduce the dimensions of an SNP array dataset by more than two orders of magnitude while achieving at least equal, and in most cases superior predictive classification performance over that achieved on features selected by existing feature selection methods alone. An examination of the biological relevance of frequently selected features from FSR-reduced feature sets revealed strong enrichment in association with cancer. FSR was implemented in MATLAB R2010b and is available at http://ww2.cs.mu.oz.au/~gwong/FSR.

  5. The applicability of new TNM classification for humanpapilloma virus-related oropharyngeal cancer in the 8th edition of the AJCC/UICC TNM staging system in Japan: A single-centre study.

    PubMed

    Sano, Daisuke; Yabuki, Kenichiro; Arai, Yasuhiro; Tanabe, Teruhiko; Chiba, Yoshihiro; Nishimura, Goshi; Takahashi, Hideaki; Yamanaka, Shoji; Oridate, Nobuhiko

    2018-06-01

    The purpose of this study is to validate the applicability of new TNM classification for human papillomavirus (HPV)-related oropharyngeal cancer (OPC) in the 8th edition of the American Joint Committee on Cancer (AJCC)/Union for International Cancer Control (UICC) TNM staging system in Japan. A total of 91 OPC patients treated with radiation-based therapy between November 2001 and July 2015 were analyzed retrospectively in this study. HPV infection status was evaluated using tumor p16 expression. 40 OPC patients (44.0%) had HPV-positive disease in this study. The distribution of disease stage of HPV-positive OPC patients dramatically changed from the 7th edition to the 8th edition of AJCC/UICC TNM classification. However, neither the 8th edition nor the 7th edition of the AJCC/UICC TNM staging system could adequately predict outcomes of HPV-positive OPC patients in our patient series. On the other hand, our multivariate analysis indicated that matted nodes and age ≥63 were independent prognostic factors for progression-free survival. In addition, HPV-positive OPC patients with stage I without matted nodes showed significantly better overall and progression-free survival compared with those with stage I with matted nodes and stages II and III in the 8th edition of the AJCC/UICC TNM staging system (P=0.008, and P=0.043, respectively). Our results suggested that matted nodes of HPV-positive OPC patients might be additionally examined to apply the 8th edition of AJCC/UICC TNM classification for more adequate predicting outcomes of HPV-positive OPC patients. Copyright © 2017 Elsevier B.V. All rights reserved.

  6. Breast cancer statistics and prediction methodology: a systematic review and analysis.

    PubMed

    Dubey, Ashutosh Kumar; Gupta, Umesh; Jain, Sonal

    2015-01-01

    Breast cancer is a menacing cancer, primarily affecting women. Continuous research is going on for detecting breast cancer in the early stage as the possibility of cure in early stages is bright. There are two main objectives of this current study, first establish statistics for breast cancer and second to find methodologies which can be helpful in the early stage detection of the breast cancer based on previous studies. The breast cancer statistics for incidence and mortality of the UK, US, India and Egypt were considered for this study. The finding of this study proved that the overall mortality rates of the UK and US have been improved because of awareness, improved medical technology and screening, but in case of India and Egypt the condition is less positive because of lack of awareness. The methodological findings of this study suggest a combined framework based on data mining and evolutionary algorithms. It provides a strong bridge in improving the classification and detection accuracy of breast cancer data.

  7. GSNFS: Gene subnetwork biomarker identification of lung cancer expression data.

    PubMed

    Doungpan, Narumol; Engchuan, Worrawat; Chan, Jonathan H; Meechai, Asawin

    2016-12-05

    Gene expression has been used to identify disease gene biomarkers, but there are ongoing challenges. Single gene or gene-set biomarkers are inadequate to provide sufficient understanding of complex disease mechanisms and the relationship among those genes. Network-based methods have thus been considered for inferring the interaction within a group of genes to further study the disease mechanism. Recently, the Gene-Network-based Feature Set (GNFS), which is capable of handling case-control and multiclass expression for gene biomarker identification, has been proposed, partly taking into account of network topology. However, its performance relies on a greedy search for building subnetworks and thus requires further improvement. In this work, we establish a new approach named Gene Sub-Network-based Feature Selection (GSNFS) by implementing the GNFS framework with two proposed searching and scoring algorithms, namely gene-set-based (GS) search and parent-node-based (PN) search, to identify subnetworks. An additional dataset is used to validate the results. The two proposed searching algorithms of the GSNFS method for subnetwork expansion are concerned with the degree of connectivity and the scoring scheme for building subnetworks and their topology. For each iteration of expansion, the neighbour genes of a current subnetwork, whose expression data improved the overall subnetwork score, is recruited. While the GS search calculated the subnetwork score using an activity score of a current subnetwork and the gene expression values of its neighbours, the PN search uses the expression value of the corresponding parent of each neighbour gene. Four lung cancer expression datasets were used for subnetwork identification. In addition, using pathway data and protein-protein interaction as network data in order to consider the interaction among significant genes were discussed. Classification was performed to compare the performance of the identified gene subnetworks with three subnetwork identification algorithms. The two searching algorithms resulted in better classification and gene/gene-set agreement compared to the original greedy search of the GNFS method. The identified lung cancer subnetwork using the proposed searching algorithm resulted in an improvement of the cross-dataset validation and an increase in the consistency of findings between two independent datasets. The homogeneity measurement of the datasets was conducted to assess dataset compatibility in cross-dataset validation. The lung cancer dataset with higher homogeneity showed a better result when using the GS search while the dataset with low homogeneity showed a better result when using the PN search. The 10-fold cross-dataset validation on the independent lung cancer datasets showed higher classification performance of the proposed algorithms when compared with the greedy search in the original GNFS method. The proposed searching algorithms provide a higher number of genes in the subnetwork expansion step than the greedy algorithm. As a result, the performance of the subnetworks identified from the GSNFS method was improved in terms of classification performance and gene/gene-set level agreement depending on the homogeneity of the datasets used in the analysis. Some common genes obtained from the four datasets using different searching algorithms are genes known to play a role in lung cancer. The improvement of classification performance and the gene/gene-set level agreement, and the biological relevance indicated the effectiveness of the GSNFS method for gene subnetwork identification using expression data.

  8. Suggestions for Lymph Node Classification of UICC/AJCC Staging System: A Retrospective Study Based on 1197 Nasopharyngeal Carcinoma Patients Treated With Intensity-Modulated Radiation Therapy

    PubMed Central

    Guo, Qiaojuan; Pan, Jianji; Zong, Jingfeng; Zheng, Wei; Zhang, Chun; Tang, Linbo; Chen, Bijuan; Cui, Xiaofei; Xiao, Youping; Chen, Yunbin; Lin, Shaojun

    2015-01-01

    Abstract This article provides suggestions for N classification of Union for International Cancer Control/American Joint Committee on Cancer (UICC/AJCC) staging system of nasopharyngeal carcinoma (NPC), purely based on magnetic resonance imaging (MRI) in intensity-modulated radiation therapy (IMRT) era. A total of 1197 nonmetastatic NPC patients treated with IMRT were enrolled, and all were scanned by MRI at nasopharynx and neck before treatment. MRI-based nodal variables including level, laterality, maximal axial diameter (MAD), extracapsular spread (ECS), and necrosis were analyzed as potential prognostic factors. Modifications of N classification were then proposed and verified. Only nodal level and laterality were considered to be significant variables affecting the treatment outcome. N classification was thus proposed accordingly: N0, no regional lymph node (LN) metastasis; N1, retropharyngeal LNs involvement (regardless of laterality), and/or unilateral levels I, II, III, and/or Va involvement; N2, bilateral levels I, II, III, and/or Va involvement; and N3, levels IV, Vb, and Vc involvement. This proposal showed significant predicting value in multivariate analysis. N3 patients indicated relatively inferior overall survival (OS) and distant metastasis-free survival (DMFS) than N2 patients; however, the difference showed no statistical significance (P = 0.673 and 0.265 for OS and DMFS, respectively), and this was considered to be correlated with the small sample sizes of N3 patients (79 patients, 6.6%). Nodal level and laterality, but not MAD, ECS, and necrosis, were considered to be significant predicting factors for NPC. The proposed N classification was proved to be powerfully predictive in our cohort; however, treatment outcome of the proposed N2 and N3 patients could not differ significantly from each other. This insignificance may be because of the small sample sizes of N3 patients. Our results are based on a single-center data, to develop a new N classification that is universally acceptable; further verification by data from multicenter is warranted. PMID:25997052

  9. Use of multivariate analysis to suggest a new molecular classification of colorectal cancer

    PubMed Central

    Domingo, Enric; Ramamoorthy, Rajarajan; Oukrif, Dahmane; Rosmarin, Daniel; Presz, Michal; Wang, Haitao; Pulker, Hannah; Lockstone, Helen; Hveem, Tarjei; Cranston, Treena; Danielsen, Havard; Novelli, Marco; Davidson, Brian; Xu, Zheng-Zhou; Molloy, Peter; Johnstone, Elaine; Holmes, Christopher; Midgley, Rachel; Kerr, David; Sieber, Oliver; Tomlinson, Ian

    2013-01-01

    Abstract Molecular classification of colorectal cancer (CRC) is currently based on microsatellite instability (MSI), KRAS or BRAF mutation and, occasionally, chromosomal instability (CIN). Whilst useful, these categories may not fully represent the underlying molecular subgroups. We screened 906 stage II/III CRCs from the VICTOR clinical trial for somatic mutations. Multivariate analyses (logistic regression, clustering, Bayesian networks) identified the primary molecular associations. Positive associations occurred between: CIN and TP53 mutation; MSI and BRAF mutation; and KRAS and PIK3CA mutations. Negative associations occurred between: MSI and CIN; MSI and NRAS mutation; and KRAS mutation, and each of NRAS, TP53 and BRAF mutations. Some complex relationships were elucidated: KRAS and TP53 mutations had both a direct negative association and a weaker, confounding, positive association via TP53–CIN–MSI–BRAF–KRAS. Our results suggested a new molecular classification of CRCs: (1) MSI+ and/or BRAF-mutant; (2) CIN+ and/or TP53– mutant, with wild-type KRAS and PIK3CA; (3) KRAS- and/or PIK3CA-mutant, CIN+, TP53-wild-type; (4) KRAS– and/or PIK3CA-mutant, CIN–, TP53-wild-type; (5) NRAS-mutant; (6) no mutations; (7) others. As expected, group 1 cancers were mostly proximal and poorly differentiated, usually occurring in women. Unexpectedly, two different types of CIN+ CRC were found: group 2 cancers were usually distal and occurred in men, whereas group 3 showed neither of these associations but were of higher stage. CIN+ cancers have conventionally been associated with all three of these variables, because they have been tested en masse. Our classification also showed potentially improved prognostic capabilities, with group 3, and possibly group 1, independently predicting disease-free survival. Copyright © 2012 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd. PMID:23165447

  10. Classification of glioblastoma and metastasis for neuropathology intraoperative diagnosis: a multi-resolution textural approach to model the background

    NASA Astrophysics Data System (ADS)

    Ahmad Fauzi, Mohammad Faizal; Gokozan, Hamza Numan; Elder, Brad; Puduvalli, Vinay K.; Otero, Jose J.; Gurcan, Metin N.

    2014-03-01

    Brain cancer surgery requires intraoperative consultation by neuropathology to guide surgical decisions regarding the extent to which the tumor undergoes gross total resection. In this context, the differential diagnosis between glioblastoma and metastatic cancer is challenging as the decision must be made during surgery in a short time-frame (typically 30 minutes). We propose a method to classify glioblastoma versus metastatic cancer based on extracting textural features from the non-nuclei region of cytologic preparations. For glioblastoma, these regions of interest are filled with glial processes between the nuclei, which appear as anisotropic thin linear structures. For metastasis, these regions correspond to a more homogeneous appearance, thus suitable texture features can be extracted from these regions to distinguish between the two tissue types. In our work, we use the Discrete Wavelet Frames to characterize the underlying texture due to its multi-resolution capability in modeling underlying texture. The textural characterization is carried out in primarily the non-nuclei regions after nuclei regions are segmented by adapting our visually meaningful decomposition segmentation algorithm to this problem. k-nearest neighbor method was then used to classify the features into glioblastoma or metastasis cancer class. Experiment on 53 images (29 glioblastomas and 24 metastases) resulted in average accuracy as high as 89.7% for glioblastoma, 87.5% for metastasis and 88.7% overall. Further studies are underway to incorporate nuclei region features into classification on an expanded dataset, as well as expanding the classification to more types of cancers.

  11. Assessment and classification of cancer breakthrough pain: a systematic literature review.

    PubMed

    Haugen, Dagny Faksvåg; Hjermstad, Marianne Jensen; Hagen, Neil; Caraceni, Augusto; Kaasa, Stein

    2010-06-01

    Temporal variations in cancer pain intensity are highly prevalent, and are often difficult to manage. However, the phenomenon is not well understood: several definitions and approaches to classification and bedside assessment of cancer breakthrough pain (BTP) have been described. The present study is a systematic review of published literature on cancer BTP to answer the following questions: which terms and definitions have been used; are there validated assessment tools; which domains of BTP do the tools delineate, and which items do they contain; how have assessment tools been applied within clinical studies; and are there validated classification systems for BTP. A systematic search of the peer-reviewed literature was performed using five major databases. Of 375 titles and abstracts initially identified, 51 articles were examined in detail. Analysis of these publications indicates a range of overlapping but distinct definitions have been used to characterize BTP; 42 of the included papers presented one or more ways of classifying BTP; and while 10 tools to assess patients' experience of BTP were identified, only 2 have been partially validated. We conclude that there is no widely accepted definition, classification system or well-validated assessment tool for cancer-related breakthrough pain, but there is strong concurrence on most of its key attributes. With further work in this area, an internationally agreed upon definition and classification system for cancer-related breakthrough pain, and a standard approach on how to measure it, hold the promise to improve patient care and support research in this poor-prognosis cancer pain syndrome.

  12. Research on a Pulmonary Nodule Segmentation Method Combining Fast Self-Adaptive FCM and Classification

    PubMed Central

    Liu, Hui; Zhang, Cai-Ming; Su, Zhi-Yuan; Wang, Kai; Deng, Kai

    2015-01-01

    The key problem of computer-aided diagnosis (CAD) of lung cancer is to segment pathologically changed tissues fast and accurately. As pulmonary nodules are potential manifestation of lung cancer, we propose a fast and self-adaptive pulmonary nodules segmentation method based on a combination of FCM clustering and classification learning. The enhanced spatial function considers contributions to fuzzy membership from both the grayscale similarity between central pixels and single neighboring pixels and the spatial similarity between central pixels and neighborhood and improves effectively the convergence rate and self-adaptivity of the algorithm. Experimental results show that the proposed method can achieve more accurate segmentation of vascular adhesion, pleural adhesion, and ground glass opacity (GGO) pulmonary nodules than other typical algorithms. PMID:25945120

  13. [Treatment of testicular cancer].

    PubMed

    Droz, Jean-Pierre; Boyle, Helen; Culine, Stéphane; Fizazi, Karim; Fléchon, Aude; Massard, Christophe

    2013-12-01

    Germ-cell tumours (GCTs) are the most common type of cancer in young men. Since the late 1970s, disseminated GCT have been a paradigm for curable metastatic cancer and metastatic GCTs are highly curable with cisplatin-based chemotherapy followed by surgical resection of residual masses. Patients' prognosis is currently assessed using the International Germ-Cell Consensus Classification (IGCCC) and used to adapt the burden of chemotherapy. Approximately 20% of patients still do not achieve cure after first-line cisplatin-based chemotherapy, and need salvage chemotherapy (high dose or standard dose chemotherapy). Clinical stage I testicular cancer is the most common presentation and different strategies are proposed: adjuvant therapies, surgery or surveillance. During the last three decades, clinical trials and strong international collaborations lead to the development of a consensus in the management of GCTs.

  14. Cancer classification using the Immunoscore: a worldwide task force.

    PubMed

    Galon, Jérôme; Pagès, Franck; Marincola, Francesco M; Angell, Helen K; Thurin, Magdalena; Lugli, Alessandro; Zlobec, Inti; Berger, Anne; Bifulco, Carlo; Botti, Gerardo; Tatangelo, Fabiana; Britten, Cedrik M; Kreiter, Sebastian; Chouchane, Lotfi; Delrio, Paolo; Arndt, Hartmann; Asslaber, Martin; Maio, Michele; Masucci, Giuseppe V; Mihm, Martin; Vidal-Vanaclocha, Fernando; Allison, James P; Gnjatic, Sacha; Hakansson, Leif; Huber, Christoph; Singh-Jasuja, Harpreet; Ottensmeier, Christian; Zwierzina, Heinz; Laghi, Luigi; Grizzi, Fabio; Ohashi, Pamela S; Shaw, Patricia A; Clarke, Blaise A; Wouters, Bradly G; Kawakami, Yutaka; Hazama, Shoichi; Okuno, Kiyotaka; Wang, Ena; O'Donnell-Tormey, Jill; Lagorce, Christine; Pawelec, Graham; Nishimura, Michael I; Hawkins, Robert; Lapointe, Réjean; Lundqvist, Andreas; Khleif, Samir N; Ogino, Shuji; Gibbs, Peter; Waring, Paul; Sato, Noriyuki; Torigoe, Toshihiko; Itoh, Kyogo; Patel, Prabhu S; Shukla, Shilin N; Palmqvist, Richard; Nagtegaal, Iris D; Wang, Yili; D'Arrigo, Corrado; Kopetz, Scott; Sinicrope, Frank A; Trinchieri, Giorgio; Gajewski, Thomas F; Ascierto, Paolo A; Fox, Bernard A

    2012-10-03

    Prediction of clinical outcome in cancer is usually achieved by histopathological evaluation of tissue samples obtained during surgical resection of the primary tumor. Traditional tumor staging (AJCC/UICC-TNM classification) summarizes data on tumor burden (T), presence of cancer cells in draining and regional lymph nodes (N) and evidence for metastases (M). However, it is now recognized that clinical outcome can significantly vary among patients within the same stage. The current classification provides limited prognostic information, and does not predict response to therapy. Recent literature has alluded to the importance of the host immune system in controlling tumor progression. Thus, evidence supports the notion to include immunological biomarkers, implemented as a tool for the prediction of prognosis and response to therapy. Accumulating data, collected from large cohorts of human cancers, has demonstrated the impact of immune-classification, which has a prognostic value that may add to the significance of the AJCC/UICC TNM-classification. It is therefore imperative to begin to incorporate the 'Immunoscore' into traditional classification, thus providing an essential prognostic and potentially predictive tool. Introduction of this parameter as a biomarker to classify cancers, as part of routine diagnostic and prognostic assessment of tumors, will facilitate clinical decision-making including rational stratification of patient treatment. Equally, the inherent complexity of quantitative immunohistochemistry, in conjunction with protocol variation across laboratories, analysis of different immune cell types, inconsistent region selection criteria, and variable ways to quantify immune infiltration, all underline the urgent requirement to reach assay harmonization. In an effort to promote the Immunoscore in routine clinical settings, an international task force was initiated. This review represents a follow-up of the announcement of this initiative, and of the J Transl Med. editorial from January 2012. Immunophenotyping of tumors may provide crucial novel prognostic information. The results of this international validation may result in the implementation of the Immunoscore as a new component for the classification of cancer, designated TNM-I (TNM-Immune).

  15. Grouped gene selection and multi-classification of acute leukemia via new regularized multinomial regression.

    PubMed

    Li, Juntao; Wang, Yanyan; Jiang, Tao; Xiao, Huimin; Song, Xuekun

    2018-05-09

    Diagnosing acute leukemia is the necessary prerequisite to treating it. Multi-classification on the gene expression data of acute leukemia is help for diagnosing it which contains B-cell acute lymphoblastic leukemia (BALL), T-cell acute lymphoblastic leukemia (TALL) and acute myeloid leukemia (AML). However, selecting cancer-causing genes is a challenging problem in performing multi-classification. In this paper, weighted gene co-expression networks are employed to divide the genes into groups. Based on the dividing groups, a new regularized multinomial regression with overlapping group lasso penalty (MROGL) has been presented to simultaneously perform multi-classification and select gene groups. By implementing this method on three-class acute leukemia data, the grouped genes which work synergistically are identified, and the overlapped genes shared by different groups are also highlighted. Moreover, MROGL outperforms other five methods on multi-classification accuracy. Copyright © 2017. Published by Elsevier B.V.

  16. A cDNA microarray gene expression data classifier for clinical diagnostics based on graph theory.

    PubMed

    Benso, Alfredo; Di Carlo, Stefano; Politano, Gianfranco

    2011-01-01

    Despite great advances in discovering cancer molecular profiles, the proper application of microarray technology to routine clinical diagnostics is still a challenge. Current practices in the classification of microarrays' data show two main limitations: the reliability of the training data sets used to build the classifiers, and the classifiers' performances, especially when the sample to be classified does not belong to any of the available classes. In this case, state-of-the-art algorithms usually produce a high rate of false positives that, in real diagnostic applications, are unacceptable. To address this problem, this paper presents a new cDNA microarray data classification algorithm based on graph theory and is able to overcome most of the limitations of known classification methodologies. The classifier works by analyzing gene expression data organized in an innovative data structure based on graphs, where vertices correspond to genes and edges to gene expression relationships. To demonstrate the novelty of the proposed approach, the authors present an experimental performance comparison between the proposed classifier and several state-of-the-art classification algorithms.

  17. PCA based feature reduction to improve the accuracy of decision tree c4.5 classification

    NASA Astrophysics Data System (ADS)

    Nasution, M. Z. F.; Sitompul, O. S.; Ramli, M.

    2018-03-01

    Splitting attribute is a major process in Decision Tree C4.5 classification. However, this process does not give a significant impact on the establishment of the decision tree in terms of removing irrelevant features. It is a major problem in decision tree classification process called over-fitting resulting from noisy data and irrelevant features. In turns, over-fitting creates misclassification and data imbalance. Many algorithms have been proposed to overcome misclassification and overfitting on classifications Decision Tree C4.5. Feature reduction is one of important issues in classification model which is intended to remove irrelevant data in order to improve accuracy. The feature reduction framework is used to simplify high dimensional data to low dimensional data with non-correlated attributes. In this research, we proposed a framework for selecting relevant and non-correlated feature subsets. We consider principal component analysis (PCA) for feature reduction to perform non-correlated feature selection and Decision Tree C4.5 algorithm for the classification. From the experiments conducted using available data sets from UCI Cervical cancer data set repository with 858 instances and 36 attributes, we evaluated the performance of our framework based on accuracy, specificity and precision. Experimental results show that our proposed framework is robust to enhance classification accuracy with 90.70% accuracy rates.

  18. Substrate Induced Conformational Studies of the Hormone Binding Domain of the Human Estrogen Receptor by Fluorine NMR

    DTIC Science & Technology

    1998-07-01

    the progression of breast cancer and the estrogen receptor (ER) has been implicated in reproductive cancers . Our laboratory would like to understand how...function. ൖ. SUBJECT TERMS 15. NUMBER OF PAGES Breast Cancer 41 16. PRICE CODE 17. SECURITY CLASSIFICATION 18. SECURITY CLASSIFICATION OF THIS 19...production of estrogen or estrogen like materials. Estrogen has been shown to be involved in the progression of breast cancer and the estrogen receptor (ER

  19. Predicting Long-Term Cognitive Outcome Following Breast Cancer with Pre-Treatment Resting State fMRI and Random Forest Machine Learning.

    PubMed

    Kesler, Shelli R; Rao, Arvind; Blayney, Douglas W; Oakley-Girvan, Ingrid A; Karuturi, Meghan; Palesh, Oxana

    2017-01-01

    We aimed to determine if resting state functional magnetic resonance imaging (fMRI) acquired at pre-treatment baseline could accurately predict breast cancer-related cognitive impairment at long-term follow-up. We evaluated 31 patients with breast cancer (age 34-65) prior to any treatment, post-chemotherapy and 1 year later. Cognitive testing scores were normalized based on data obtained from 43 healthy female controls and then used to categorize patients as impaired or not based on longitudinal changes. We measured clustering coefficient, a measure of local connectivity, by applying graph theory to baseline resting state fMRI and entered these metrics along with relevant patient-related and medical variables into random forest classification. Incidence of cognitive impairment at 1 year follow-up was 55% and was predicted by classification algorithms with up to 100% accuracy ( p < 0.0001). The neuroimaging-based model was significantly more accurate than a model involving patient-related and medical variables ( p = 0.005). Hub regions belonging to several distinct functional networks were the most important predictors of cognitive outcome. Characteristics of these hubs indicated potential spread of brain injury from default mode to other networks over time. These findings suggest that resting state fMRI is a promising tool for predicting future cognitive impairment associated with breast cancer. This information could inform treatment decision making by identifying patients at highest risk for long-term cognitive impairment.

  20. Predicting Long-Term Cognitive Outcome Following Breast Cancer with Pre-Treatment Resting State fMRI and Random Forest Machine Learning

    PubMed Central

    Kesler, Shelli R.; Rao, Arvind; Blayney, Douglas W.; Oakley-Girvan, Ingrid A.; Karuturi, Meghan; Palesh, Oxana

    2017-01-01

    We aimed to determine if resting state functional magnetic resonance imaging (fMRI) acquired at pre-treatment baseline could accurately predict breast cancer-related cognitive impairment at long-term follow-up. We evaluated 31 patients with breast cancer (age 34–65) prior to any treatment, post-chemotherapy and 1 year later. Cognitive testing scores were normalized based on data obtained from 43 healthy female controls and then used to categorize patients as impaired or not based on longitudinal changes. We measured clustering coefficient, a measure of local connectivity, by applying graph theory to baseline resting state fMRI and entered these metrics along with relevant patient-related and medical variables into random forest classification. Incidence of cognitive impairment at 1 year follow-up was 55% and was predicted by classification algorithms with up to 100% accuracy (p < 0.0001). The neuroimaging-based model was significantly more accurate than a model involving patient-related and medical variables (p = 0.005). Hub regions belonging to several distinct functional networks were the most important predictors of cognitive outcome. Characteristics of these hubs indicated potential spread of brain injury from default mode to other networks over time. These findings suggest that resting state fMRI is a promising tool for predicting future cognitive impairment associated with breast cancer. This information could inform treatment decision making by identifying patients at highest risk for long-term cognitive impairment. PMID:29187817

  1. Selecting relevant 3D image features of margin sharpness and texture for lung nodule retrieval.

    PubMed

    Ferreira, José Raniery; de Azevedo-Marques, Paulo Mazzoncini; Oliveira, Marcelo Costa

    2017-03-01

    Lung cancer is the leading cause of cancer-related deaths in the world. Its diagnosis is a challenge task to specialists due to several aspects on the classification of lung nodules. Therefore, it is important to integrate content-based image retrieval methods on the lung nodule classification process, since they are capable of retrieving similar cases from databases that were previously diagnosed. However, this mechanism depends on extracting relevant image features in order to obtain high efficiency. The goal of this paper is to perform the selection of 3D image features of margin sharpness and texture that can be relevant on the retrieval of similar cancerous and benign lung nodules. A total of 48 3D image attributes were extracted from the nodule volume. Border sharpness features were extracted from perpendicular lines drawn over the lesion boundary. Second-order texture features were extracted from a cooccurrence matrix. Relevant features were selected by a correlation-based method and a statistical significance analysis. Retrieval performance was assessed according to the nodule's potential malignancy on the 10 most similar cases and by the parameters of precision and recall. Statistical significant features reduced retrieval performance. Correlation-based method selected 2 margin sharpness attributes and 6 texture attributes and obtained higher precision compared to all 48 extracted features on similar nodule retrieval. Feature space dimensionality reduction of 83 % obtained higher retrieval performance and presented to be a computationaly low cost method of retrieving similar nodules for the diagnosis of lung cancer.

  2. Assessment of the Activation State of RAS and Map Kinase in Human Breast Cancer Specimens (96Breast)

    DTIC Science & Technology

    1999-09-01

    Cancer 16. PRICE CODE 17. SECURITY CLASSIFICATION 18 . SECURITY CLASSIFICATION 19. SECURITY CLASSIFICATION 20. LIMITATION OF ABSTRACT OF REPORT OF...THIS PAGE OF ABSTRACT Unclassified Unclassified Unclassified Unlimited NSN 7640-01-280-5500 Standard Form 298 (Rev. 2-89) Prescribed by ANSI Std. Z39- 18 ...transformation and regulate cell morphology, adhesion and motility through cytoskeletal dynamics and play an important role in carcinogenesis ( 18 ). Rho

  3. Inhibition of Retinoblastoma Protein Inactivation

    DTIC Science & Technology

    2016-09-01

    Retinoblastoma protein, E2F transcription factor, high throughput screen, drug discovery, x-ray crystallography 16. SECURITY CLASSIFICATION OF: 17...developed a method to perform fragment based screening by x-ray crystallography . 2.0 KEYWORDS Retinoblastoma (Rb) pathway, E2F transcription factor...cancer, cell-cycle inhibition, activation, modulation, inhibition, high throughput screening, fragment-based screening, x-ray crystallography

  4. Visualization and tissue classification of human breast cancer images using ultrahigh-resolution OCT.

    PubMed

    Yao, Xinwen; Gan, Yu; Chang, Ernest; Hibshoosh, Hanina; Feldman, Sheldon; Hendon, Christine

    2017-03-01

    Breast cancer is one of the most common cancers, and recognized as the third leading cause of mortality in women. Optical coherence tomography (OCT) enables three dimensional visualization of biological tissue with micrometer level resolution at high speed, and can play an important role in early diagnosis and treatment guidance of breast cancer. In particular, ultra-high resolution (UHR) OCT provides images with better histological correlation. This paper compared UHR OCT performance with standard OCT in breast cancer imaging qualitatively and quantitatively. Automatic tissue classification algorithms were used to automatically detect invasive ductal carcinoma in ex vivo human breast tissue. Human breast tissues, including non-neoplastic/normal tissues from breast reduction and tumor samples from mastectomy specimens, were excised from patients at Columbia University Medical Center. The tissue specimens were imaged by two spectral domain OCT systems at different wavelengths: a home-built ultra-high resolution (UHR) OCT system at 800 nm (measured as 2.72 μm axial and 5.52 μm lateral) and a commercial OCT system at 1,300 nm with standard resolution (measured as 6.5 μm axial and 15 μm lateral), and their imaging performances were analyzed qualitatively. Using regional features derived from OCT images produced by the two systems, we developed an automated classification algorithm based on relevance vector machine (RVM) to differentiate hollow-structured adipose tissue against solid tissue. We further developed B-scan based features for RVM to classify invasive ductal carcinoma (IDC) against normal fibrous stroma tissue among OCT datasets produced by the two systems. For adipose classification, 32 UHR OCT B-scans from 9 normal specimens, and 28 standard OCT B-scans from 6 normal and 4 IDC specimens were employed. For IDC classification, 152 UHR OCT B-scans from 6 normal and 13 IDC specimens, and 104 standard OCT B-scans from 5 normal and 8 IDC specimens were employed. We have demonstrated that UHR OCT images can produce images with better feature delineation compared with images produced by 1,300 nm OCT system. UHR OCT images of a variety of tissue types found in human breast tissue were presented. With a limited number of datasets, we showed that both OCT systems can achieve a good accuracy in identifying adipose tissue. Classification in UHR OCT images achieved higher sensitivity (94%) and specificity (93%) of adipose tissue than the sensitivity (91%) and specificity (76%) in 1,300 nm OCT images. In IDC classification, similarly, we achieved better results with UHR OCT images, featured an overall accuracy of 84%, sensitivity of 89% and specificity of 71% in this preliminary study. In this study, we provided UHR OCT images of different normal and malignant breast tissue types, and qualitatively and quantitatively studied the texture and optical features from OCT images of human breast tissue at different resolutions. We developed an automated approach to differentiate adipose tissue, fibrous stroma, and IDC within human breast tissues. Our work may open the door toward automatic intraoperative OCT evaluation of early-stage breast cancer. Lasers Surg. Med. 49:258-269, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.

  5. Molecular classification of gastric cancer: a new paradigm.

    PubMed

    Shah, Manish A; Khanin, Raya; Tang, Laura; Janjigian, Yelena Y; Klimstra, David S; Gerdes, Hans; Kelsen, David P

    2011-05-01

    Gastric cancer may be subdivided into 3 distinct subtypes--proximal, diffuse, and distal gastric cancer--based on histopathologic and anatomic criteria. Each subtype is associated with unique epidemiology. Our aim is to test the hypothesis that these distinct gastric cancer subtypes may also be distinguished by gene expression analysis. Patients with localized gastric adenocarcinoma being screened for a phase II preoperative clinical trial (National Cancer Institute, NCI #5917) underwent endoscopic biopsy for fresh tumor procurement. Four to 6 targeted biopsies of the primary tumor were obtained. Macrodissection was carried out to ensure more than 80% carcinoma in the sample. HG-U133A GeneChip (Affymetrix) was used for cDNA expression analysis, and all arrays were processed and analyzed using the Bioconductor R-package. Between November 2003 and January 2006, 57 patients were screened to identify 36 patients with localized gastric cancer who had adequate RNA for expression analysis. Using supervised analysis, we built a classifier to distinguish the 3 gastric cancer subtypes, successfully classifying each into tightly grouped clusters. Leave-one-out cross-validation error was 0.14, suggesting that more than 85% of samples were classified correctly. Gene set analysis with the false discovery rate set at 0.25 identified several pathways that were differentially regulated when comparing each gastric cancer subtype to adjacent normal stomach. Subtypes of gastric cancer that have epidemiologic and histologic distinctions are also distinguished by gene expression data. These preliminary data suggest a new classification of gastric cancer with implications for improving our understanding of disease biology and identification of unique molecular drivers for each gastric cancer subtype. ©2011 AACR.

  6. Classification of radiation effects for dose limitation purposes: history, current situation and future prospects

    PubMed Central

    Hamada, Nobuyuki; Fujimichi, Yuki

    2014-01-01

    Radiation exposure causes cancer and non-cancer health effects, each of which differs greatly in the shape of the dose–response curve, latency, persistency, recurrence, curability, fatality and impact on quality of life. In recent decades, for dose limitation purposes, the International Commission on Radiological Protection has divided such diverse effects into tissue reactions (formerly termed non-stochastic and deterministic effects) and stochastic effects. On the one hand, effective dose limits aim to reduce the risks of stochastic effects (cancer/heritable effects) and are based on the detriment-adjusted nominal risk coefficients, assuming a linear-non-threshold dose response and a dose and dose rate effectiveness factor of 2. On the other hand, equivalent dose limits aim to avoid tissue reactions (vision-impairing cataracts and cosmetically unacceptable non-cancer skin changes) and are based on a threshold dose. However, the boundary between these two categories is becoming vague. Thus, we review the changes in radiation effect classification, dose limitation concepts, and the definition of detriment and threshold. Then, the current situation is overviewed focusing on (i) stochastic effects with a threshold, (ii) tissue reactions without a threshold, (iii) target organs/tissues for circulatory disease, (iv) dose levels for limitation of cancer risks vs prevention of non-life-threatening tissue reactions vs prevention of life-threatening tissue reactions, (v) mortality or incidence of thyroid cancer, and (vi) the detriment for tissue reactions. For future discussion, one approach is suggested that classifies radiation effects according to whether effects are life threatening, and radiobiological research needs are also briefly discussed. PMID:24794798

  7. An efficient abnormal cervical cell detection system based on multi-instance extreme learning machine

    NASA Astrophysics Data System (ADS)

    Zhao, Lili; Yin, Jianping; Yuan, Lihuan; Liu, Qiang; Li, Kuan; Qiu, Minghui

    2017-07-01

    Automatic detection of abnormal cells from cervical smear images is extremely demanded in annual diagnosis of women's cervical cancer. For this medical cell recognition problem, there are three different feature sections, namely cytology morphology, nuclear chromatin pathology and region intensity. The challenges of this problem come from feature combination s and classification accurately and efficiently. Thus, we propose an efficient abnormal cervical cell detection system based on multi-instance extreme learning machine (MI-ELM) to deal with above two questions in one unified framework. MI-ELM is one of the most promising supervised learning classifiers which can deal with several feature sections and realistic classification problems analytically. Experiment results over Herlev dataset demonstrate that the proposed method outperforms three traditional methods for two-class classification in terms of well accuracy and less time.

  8. A fuzzy neural network for intelligent data processing

    NASA Astrophysics Data System (ADS)

    Xie, Wei; Chu, Feng; Wang, Lipo; Lim, Eng Thiam

    2005-03-01

    In this paper, we describe an incrementally generated fuzzy neural network (FNN) for intelligent data processing. This FNN combines the features of initial fuzzy model self-generation, fast input selection, partition validation, parameter optimization and rule-base simplification. A small FNN is created from scratch -- there is no need to specify the initial network architecture, initial membership functions, or initial weights. Fuzzy IF-THEN rules are constantly combined and pruned to minimize the size of the network while maintaining accuracy; irrelevant inputs are detected and deleted, and membership functions and network weights are trained with a gradient descent algorithm, i.e., error backpropagation. Experimental studies on synthesized data sets demonstrate that the proposed Fuzzy Neural Network is able to achieve accuracy comparable to or higher than both a feedforward crisp neural network, i.e., NeuroRule, and a decision tree, i.e., C4.5, with more compact rule bases for most of the data sets used in our experiments. The FNN has achieved outstanding results for cancer classification based on microarray data. The excellent classification result for Small Round Blue Cell Tumors (SRBCTs) data set is shown. Compared with other published methods, we have used a much fewer number of genes for perfect classification, which will help researchers directly focus their attention on some specific genes and may lead to discovery of deep reasons of the development of cancers and discovery of drugs.

  9. Bladder segmentation in MR images with watershed segmentation and graph cut algorithm

    NASA Astrophysics Data System (ADS)

    Blaffert, Thomas; Renisch, Steffen; Schadewaldt, Nicole; Schulz, Heinrich; Wiemker, Rafael

    2014-03-01

    Prostate and cervix cancer diagnosis and treatment planning that is based on MR images benefit from superior soft tissue contrast compared to CT images. For these images an automatic delineation of the prostate or cervix and the organs at risk such as the bladder is highly desirable. This paper describes a method for bladder segmentation that is based on a watershed transform on high image gradient values and gray value valleys together with the classification of watershed regions into bladder contents and tissue by a graph cut algorithm. The obtained results are superior if compared to a simple region-after-region classification.

  10. Alterations in inflammatory biomarkers and energy intake in cancer cachexia: a prospective study in patients with inoperable pancreatic cancer.

    PubMed

    Bye, Asta; Wesseltoft-Rao, Nima; Iversen, Per Ole; Skjegstad, Grete; Holven, Kirsten B; Ulven, Stine; Hjermstad, Marianne J

    2016-06-01

    Chronic systemic inflammatory response is proposed as an underlying mechanism for development of cancer cachexia. We conducted a prospective study to examine changes in inflammatory biomarkers during the disease course and the relationship between inflammatory biomarkers and cachexia in patients with inoperable pancreatic cancer. Twenty patients, median (range) age 67.5 (35-79) years, 5 females, were followed for median 5.5 (1-12) months. Cachexia was diagnosed according to the 2011 consensus-based classification system (weight loss >5 % past six months, BMI < 20 kg/m(2) and weight loss >2 %, or sarcopenia) and the modified Glasgow Prognostic score (mGPS) that combines CRP and albumin levels. Inflammatory biomarkers were measured by enzyme immunoassays. The patients had increased levels of most inflammatory biomarkers, albeit not all statistically significant, both at study entry and close to death, indicating ongoing inflammation. According to the consensus-based classification system, eleven (55 %) patients were classified as cachectic upon inclusion. They did not differ from non-cachectic patients with regard to inflammatory biomarkers or energy intake. According to the mGPS, seven (35 %) were defined as cachectic and had a higher IL-6 (p < 0.001) than the non-cachectic patients. They also had a slightly, but insignificantly longer survival than non-cachectic patients (p = 0.08). The mGPS should be considered as an additional framework for identification of cancer cachexia.

  11. Influence of Texture and Colour in Breast TMA Classification

    PubMed Central

    Fernández-Carrobles, M. Milagro; Bueno, Gloria; Déniz, Oscar; Salido, Jesús; García-Rojo, Marcial; González-López, Lucía

    2015-01-01

    Breast cancer diagnosis is still done by observation of biopsies under the microscope. The development of automated methods for breast TMA classification would reduce diagnostic time. This paper is a step towards the solution for this problem and shows a complete study of breast TMA classification based on colour models and texture descriptors. The TMA images were divided into four classes: i) benign stromal tissue with cellularity, ii) adipose tissue, iii) benign and benign anomalous structures, and iv) ductal and lobular carcinomas. A relevant set of features was obtained on eight different colour models from first and second order Haralick statistical descriptors obtained from the intensity image, Fourier, Wavelets, Multiresolution Gabor, M-LBP and textons descriptors. Furthermore, four types of classification experiments were performed using six different classifiers: (1) classification per colour model individually, (2) classification by combination of colour models, (3) classification by combination of colour models and descriptors, and (4) classification by combination of colour models and descriptors with a previous feature set reduction. The best result shows an average of 99.05% accuracy and 98.34% positive predictive value. These results have been obtained by means of a bagging tree classifier with combination of six colour models and the use of 1719 non-correlated (correlation threshold of 97%) textural features based on Statistical, M-LBP, Gabor and Spatial textons descriptors. PMID:26513238

  12. Proposed morphologic classification of prostate cancer with neuroendocrine differentiation.

    PubMed

    Epstein, Jonathan I; Amin, Mahul B; Beltran, Himisha; Lotan, Tamara L; Mosquera, Juan-Miguel; Reuter, Victor E; Robinson, Brian D; Troncoso, Patricia; Rubin, Mark A

    2014-06-01

    On July 31, 2013, the Prostate Cancer Foundation assembled a working committee on the molecular biology and pathologic classification of neuroendocrine (NE) differentiation in prostate cancer. New clinical and molecular data emerging from prostate cancers treated by contemporary androgen deprivation therapies, as well as primary lesions, have highlighted the need for refinement of diagnostic terminology to encompass the full spectrum of NE differentiation. The classification system consists of: Usual prostate adenocarcinoma with NE differentiation; 2) Adenocarcinoma with Paneth cell NE differentiation; 3) Carcinoid tumor; 4) Small cell carcinoma; 5) Large cell NE carcinoma; and 5) Mixed NE carcinoma - acinar adenocarcinoma. The article also highlights "prostate carcinoma with overlapping features of small cell carcinoma and acinar adenocarcinoma" and "castrate-resistant prostate cancer with small cell cancer-like clinical presentation". It is envisioned that specific criteria associated with the refined diagnostic terminology will lead to clinically relevant pathologic diagnoses that will stimulate further clinical and molecular investigation and identification of appropriate targeted therapies.

  13. Optimal Anti-cancer Drug Profiles for Effective Penetration of the Anti-cancer Drug Market by Generic Drugs in Japan.

    PubMed

    Shibata, Shoyo; Matsushita, Maiko; Saito, Yoshimasa; Suzuki, Takeshi

    2017-01-01

    The increased use of generic drugs is a good indicator of the need to reduce the increasing costs of prescription drugs. Since there are more expensive drugs compared with other therapeutic areas, "oncology" is an important one for generic drugs. The primary objective of this article was to quantify the extent to which generic drugs in Japan occupy each level of the Anatomical Therapeutic Chemical (ATC) classification system. The dataset used in this study was created from publicly available information obtained from the IMS Japan Pharmaceutical Market database. Data on the total amount of sales and number of prescriptions for anti-cancer drugs between 2010 and 2016 in Japan were selected. The data were categorized according to the third level of the ATC classification system. All categories of the ATC classification system had increased market shares in Japan between 2010 and 2016. The barriers to market entry were relatively low in L01F (platinum anti-neoplastics), L01C (plant-based neoplastics), L02B (cytostatic hormone antagonists), and L01D (anti-neoplastic antibiotics) but were high in L02A (cytostatic hormones), L01H (protein kinase inhibitors), and L01B (anti-metabolites). Generic cancer drugs could bring savings to Japanese health care systems. Therefore, their development should be directed toward niche markets, such as L02A, L01H, and L01B, and not competitive markets.

  14. Classification of Microcalcification of the Diagnosis of Breast Cancer using Artificial Neural Networks.

    DTIC Science & Technology

    1995-09-01

    employed to classify benign and malignant microcalcifications in the radiographs of pathological specimen. Digital images were acquired by digitizing...associated with benign and malignant processes. The classification of microcalcifications for the diagnosis of breast cancer was achieved at a high level in

  15. Spectral-spatial classification for noninvasive cancer detection using hyperspectral imaging

    NASA Astrophysics Data System (ADS)

    Lu, Guolan; Halig, Luma; Wang, Dongsheng; Qin, Xulei; Chen, Zhuo Georgia; Fei, Baowei

    2014-10-01

    Early detection of malignant lesions could improve both survival and quality of life of cancer patients. Hyperspectral imaging (HSI) has emerged as a powerful tool for noninvasive cancer detection and diagnosis, with the advantage of avoiding tissue biopsy and providing diagnostic signatures without the need of a contrast agent in real time. We developed a spectral-spatial classification method to distinguish cancer from normal tissue on hyperspectral images. We acquire hyperspectral reflectance images from 450 to 900 nm with a 2-nm increment from tumor-bearing mice. In our animal experiments, the HSI and classification method achieved a sensitivity of 93.7% and a specificity of 91.3%. The preliminary study demonstrated that HSI has the potential to be applied in vivo for noninvasive detection of tumors.

  16. Translational toxicology in setting occupational exposure limits for dusts and hazard classification - a critical evaluation of a recent approach to translate dust overload findings from rats to humans.

    PubMed

    Morfeld, Peter; Bruch, Joachim; Levy, Len; Ngiewih, Yufanyi; Chaudhuri, Ishrat; Muranko, Henry J; Myerson, Ross; McCunney, Robert J

    2015-04-23

    We analyze the scientific basis and methodology used by the German MAK Commission in their recommendations for exposure limits and carcinogen classification of "granular biopersistent particles without known specific toxicity" (GBS). These recommendations are under review at the European Union level. We examine the scientific assumptions in an attempt to reproduce the results. MAK's human equivalent concentrations (HECs) are based on a particle mass and on a volumetric model in which results from rat inhalation studies are translated to derive occupational exposure limits (OELs) and a carcinogen classification. We followed the methods as proposed by the MAK Commission and Pauluhn 2011. We also examined key assumptions in the metrics, such as surface area of the human lung, deposition fractions of inhaled dusts, human clearance rates; and risk of lung cancer among workers, presumed to have some potential for lung overload, the physiological condition in rats associated with an increase in lung cancer risk. The MAK recommendations on exposure limits for GBS have numerous incorrect assumptions that adversely affect the final results. The procedures to derive the respirable occupational exposure limit (OEL) could not be reproduced, a finding raising considerable scientific uncertainty about the reliability of the recommendations. Moreover, the scientific basis of using the rat model is confounded by the fact that rats and humans show different cellular responses to inhaled particles as demonstrated by bronchoalveolar lavage (BAL) studies in both species. Classifying all GBS as carcinogenic to humans based on rat inhalation studies in which lung overload leads to chronic inflammation and cancer is inappropriate. Studies of workers, who have been exposed to relevant levels of dust, have not indicated an increase in lung cancer risk. Using the methods proposed by the MAK, we were unable to reproduce the OEL for GBS recommended by the Commission, but identified substantial errors in the models. Considerable shortcomings in the use of lung surface area, clearance rates, deposition fractions; as well as using the mass and volumetric metrics as opposed to the particle surface area metric limit the scientific reliability of the proposed GBS OEL and carcinogen classification.

  17. Tasked-based quantification of measurement utility for ex vivo multi-spectral Mueller polarimetry of the uterine cervix

    NASA Astrophysics Data System (ADS)

    Kupinski, Meredith; Rehbinder, Jean; Haddad, Huda; Deby, Stanislas; Vizet, Jérémy; Teig, Benjamin; Nazac, André; Pierangelo, Angelo; Moreau, François; Novikova, Tatiana

    2017-07-01

    Significant contrast in visible wavelength Mueller matrix images for healthy and pre-cancerous regions of excised cervical tissue is shown. A novel classification algorithm is used to compute a test statistic from a small patient population.

  18. Cancer Biomarkers from Genome-Scale DNA Methylation: Comparison of Evolutionary and Semantic Analysis Methods

    PubMed Central

    Valavanis, Ioannis; Pilalis, Eleftherios; Georgiadis, Panagiotis; Kyrtopoulos, Soterios; Chatziioannou, Aristotelis

    2015-01-01

    DNA methylation profiling exploits microarray technologies, thus yielding a wealth of high-volume data. Here, an intelligent framework is applied, encompassing epidemiological genome-scale DNA methylation data produced from the Illumina’s Infinium Human Methylation 450K Bead Chip platform, in an effort to correlate interesting methylation patterns with cancer predisposition and, in particular, breast cancer and B-cell lymphoma. Feature selection and classification are employed in order to select, from an initial set of ~480,000 methylation measurements at CpG sites, predictive cancer epigenetic biomarkers and assess their classification power for discriminating healthy versus cancer related classes. Feature selection exploits evolutionary algorithms or a graph-theoretic methodology which makes use of the semantics information included in the Gene Ontology (GO) tree. The selected features, corresponding to methylation of CpG sites, attained moderate-to-high classification accuracies when imported to a series of classifiers evaluated by resampling or blindfold validation. The semantics-driven selection revealed sets of CpG sites performing similarly with evolutionary selection in the classification tasks. However, gene enrichment and pathway analysis showed that it additionally provides more descriptive sets of GO terms and KEGG pathways regarding the cancer phenotypes studied here. Results support the expediency of this methodology regarding its application in epidemiological studies. PMID:27600245

  19. Abnormality detection of mammograms by discriminative dictionary learning on DSIFT descriptors.

    PubMed

    Tavakoli, Nasrin; Karimi, Maryam; Nejati, Mansour; Karimi, Nader; Reza Soroushmehr, S M; Samavi, Shadrokh; Najarian, Kayvan

    2017-07-01

    Detection and classification of breast lesions using mammographic images are one of the most difficult studies in medical image processing. A number of learning and non-learning methods have been proposed for detecting and classifying these lesions. However, the accuracy of the detection/classification still needs improvement. In this paper we propose a powerful classification method based on sparse learning to diagnose breast cancer in mammograms. For this purpose, a supervised discriminative dictionary learning approach is applied on dense scale invariant feature transform (DSIFT) features. A linear classifier is also simultaneously learned with the dictionary which can effectively classify the sparse representations. Our experimental results show the superior performance of our method compared to existing approaches.

  20. Challenges of the Oral Cancer Burden in India

    PubMed Central

    Coelho, Ken Russell

    2012-01-01

    Oral cancer ranks in the top three of all cancers in India, which accounts for over thirty per cent of all cancers reported in the country and oral cancer control is quickly becoming a global health priority. This paper provides a synopsis of the incidence of oral cancer in India by focusing on its measurement in cancer registries across the country. Based on the International Classification of Disease case definition adopted by the World Health Organisation, and the International Agency for Research on Cancer, this review systematically examines primary and secondary data where the incidence or prevalence of oral cancer is known to be directly reported. Variability in age-adjusted incidence with crude incidence is projected to increase by 2030. Challenges focus on measurement of disease incidence and disease-specific risk behavior, predominantly, alcohol, and tobacco use. Future research should be aimed at improving quality of data for early detection and prevention of oral cancer. PMID:23093961

  1. Significance and Application of Digital Breast Tomosynthesis for the BI-RADS Classification of Breast Cancer.

    PubMed

    Cai, Si-Qing; Yan, Jian-Xiang; Chen, Qing-Shi; Huang, Mei-Ling; Cai, Dong-Lu

    2015-01-01

    Full-field digital mammography (FFDM) with dense breasts has a high rate of missed diagnosis, and digital breast tomosynthesis (DBT) could reduce organization overlapping and provide more reliable images for BI-RADS classification. This study aims to explore application of COMBO (FFDM+DBT) for effect and significance of BI-RADS classification of breast cancer. In this study, we selected 832 patients who had been treated from May 2013 to November 2013. Classify FFDM and COMBO examination according to BI-RADS separately and compare the differences for glands in the image of the same patient in judgment, mass characteristics display and indirect signs. Employ Paired Wilcoxon rank sum test was used in 79 breast cancer patients to find differences between two examine methods. The results indicated that COMBO pattern is able to observe more details in distribution of glands when estimating content. Paired Wilcoxon rank sum test showed that overall classification level of COMBO is higher significantly compared to FFDM to BI-RADS diagnosis and classification of breast (P<0.05). The area under FFDM ROC curve is 0.805, while that is 0.941 in COMBO pattern. COMBO shows relation of mass with the surrounding tissues, the calcification in the mass, and multiple foci clearly in breast cancer tissues. The optimal sensitivity of cut-off value in COMBO pattern is 82.9%, which is higher than that in FFDM (60%). They share the same specificity which is both 93.2%. Digital Breast Tomosynthesis (DBT) could be used for the BI-RADS classification in breast cancer in clinical.

  2. Circulating microRNA-based screening tool for breast cancer

    PubMed Central

    Boukerroucha, Meriem; Fasquelle, Corinne; Thiry, Jérôme; Bovy, Nicolas; Struman, Ingrid; Geurts, Pierre; Collignon, Joëlle; Schroeder, Hélène; Kridelka, Frédéric; Lifrange, Eric; Jossa, Véronique

    2016-01-01

    Circulating microRNAs (miRNAs) are increasingly recognized as powerful biomarkers in several pathologies, including breast cancer. Here, their plasmatic levels were measured to be used as an alternative screening procedure to mammography for breast cancer diagnosis. A plasma miRNA profile was determined by RT-qPCR in a cohort of 378 women. A diagnostic model was designed based on the expression of 8 miRNAs measured first in a profiling cohort composed of 41 primary breast cancers and 45 controls, and further validated in diverse cohorts composed of 108 primary breast cancers, 88 controls, 35 breast cancers in remission, 31 metastatic breast cancers and 30 gynecologic tumors. A receiver operating characteristic curve derived from the 8-miRNA random forest based diagnostic tool exhibited an area under the curve of 0.81. The accuracy of the diagnostic tool remained unchanged considering age and tumor stage. The miRNA signature correctly identified patients with metastatic breast cancer. The use of the classification model on cohorts of patients with breast cancers in remission and with gynecologic cancers yielded prediction distributions similar to that of the control group. Using a multivariate supervised learning method and a set of 8 circulating miRNAs, we designed an accurate, minimally invasive screening tool for breast cancer. PMID:26734993

  3. A Developmental Approach to Characterizing the Tissue-Invasion Gene Program in Breast Cancer

    DTIC Science & Technology

    2001-09-01

    OF PAGES Breast Cancer 24 16. PRICE CODE 17. SECURITY CLASSIFICATION 18. SECURITY CLASSIFICATION 19. SECURITY CLASSIFICATION 20. LIMITATION OF...induced host response. Am J. Pathol. 149:273-282, 1996. 4. Wolf, c., Rouyer, N., Lutz, Y., Adida , C., Loriot, M., Bellocq, J.P., Chambon, P., and Basset...following a 5 d incubation period. (upper left and right panels). In contrast, MT1-MMP-transfected cells perforated the BM in representative TEM and

  4. Acute promyelocytic leukaemia is highly frequent among acute myeloid leukaemias in Brazil: a hospital-based cancer registry study from 2001 to 2012.

    PubMed

    Thuler, Luiz Claudio Santos; Pombo-de-Oliveira, Maria S

    2017-03-01

    The WHO classification that defines subtypes of acute myeloid leukaemias (AMLs) is relatively unexplored at the population-based level. This study aimed to examine the frequency of acute promyelocytic leukaemia (APL or AML-M3) in Brazil. Data were extracted from 239 cancer centres (2001-2012) and categorized according to the International Classification of Diseases for Oncology (CID-O 3.0) and WHO classification (n = 9116). CID-O3 code 9866 identified 614 APL patients. AML not otherwise specified (NOS) was frequent, and the APL group represented the main subtype specified. The mean age of APL was lower than that of other AMLs (31.5, standard deviation (SD) 18.6 versus 40.9, SD 24.6; p < 0.001); there was a high frequency of APL in the 13-21-year-old (11.8 %) and ≤12.9-year-old (6.4 %) age groups. Time taken to begin treatment (as ≤14 days versus >14 days) and induction death rate were lower in APL than in other AML subtypes (p < 0.001). This report provides additional evidence on the distribution of APL among cases of AML in Brazil.

  5. Classification of follicular lymphoma images: a holistic approach with symbol-based machine learning methods.

    PubMed

    Zorman, Milan; Sánchez de la Rosa, José Luis; Dinevski, Dejan

    2011-12-01

    It is not very often to see a symbol-based machine learning approach to be used for the purpose of image classification and recognition. In this paper we will present such an approach, which we first used on the follicular lymphoma images. Lymphoma is a broad term encompassing a variety of cancers of the lymphatic system. Lymphoma is differentiated by the type of cell that multiplies and how the cancer presents itself. It is very important to get an exact diagnosis regarding lymphoma and to determine the treatments that will be most effective for the patient's condition. Our work was focused on the identification of lymphomas by finding follicles in microscopy images provided by the Laboratory of Pathology in the University Hospital of Tenerife, Spain. We divided our work in two stages: in the first stage we did image pre-processing and feature extraction, and in the second stage we used different symbolic machine learning approaches for pixel classification. Symbolic machine learning approaches are often neglected when looking for image analysis tools. They are not only known for a very appropriate knowledge representation, but also claimed to lack computational power. The results we got are very promising and show that symbolic approaches can be successful in image analysis applications.

  6. Cloud-scale genomic signals processing classification analysis for gene expression microarray data.

    PubMed

    Harvey, Benjamin; Soo-Yeon Ji

    2014-01-01

    As microarray data available to scientists continues to increase in size and complexity, it has become overwhelmingly important to find multiple ways to bring inference though analysis of DNA/mRNA sequence data that is useful to scientists. Though there have been many attempts to elucidate the issue of bringing forth biological inference by means of wavelet preprocessing and classification, there has not been a research effort that focuses on a cloud-scale classification analysis of microarray data using Wavelet thresholding in a Cloud environment to identify significantly expressed features. This paper proposes a novel methodology that uses Wavelet based Denoising to initialize a threshold for determination of significantly expressed genes for classification. Additionally, this research was implemented and encompassed within cloud-based distributed processing environment. The utilization of Cloud computing and Wavelet thresholding was used for the classification 14 tumor classes from the Global Cancer Map (GCM). The results proved to be more accurate than using a predefined p-value for differential expression classification. This novel methodology analyzed Wavelet based threshold features of gene expression in a Cloud environment, furthermore classifying the expression of samples by analyzing gene patterns, which inform us of biological processes. Moreover, enabling researchers to face the present and forthcoming challenges that may arise in the analysis of data in functional genomics of large microarray datasets.

  7. Utility Estimates of Disease-Specific Health States in Prostate Cancer from Three Different Perspectives.

    PubMed

    Gries, Katharine S; Regier, Dean A; Ramsey, Scott D; Patrick, Donald L

    2017-06-01

    To develop a statistical model generating utility estimates for prostate cancer specific health states, using preference weights derived from the perspectives of prostate cancer patients, men at risk for prostate cancer, and society. Utility estimate values were calculated using standard gamble (SG) methodology. Study participants valued 18 prostate-specific health states with the five attributes: sexual function, urinary function, bowel function, pain, and emotional well-being. Appropriateness of model (linear regression, mixed effects, or generalized estimating equation) to generate prostate cancer utility estimates was determined by paired t-tests to compare observed and predicted values. Mixed-corrected standard SG utility estimates to account for loss aversion were calculated based on prospect theory. 132 study participants assigned values to the health states (n = 40 men at risk for prostate cancer; n = 43 men with prostate cancer; n = 49 general population). In total, 792 valuations were elicited (six health states for each 132 participants). The most appropriate model for the classification system was a mixed effects model; correlations between the mean observed and predicted utility estimates were greater than 0.80 for each perspective. Developing a health-state classification system with preference weights for three different perspectives demonstrates the relative importance of main effects between populations. The predicted values for men with prostate cancer support the hypothesis that patients experiencing the disease state assign higher utility estimates to health states and there is a difference in valuations made by patients and the general population.

  8. 29 CFR 1990.112 - Classification of potential carcinogens.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 29 Labor 9 2012-07-01 2012-07-01 false Classification of potential carcinogens. 1990.112 Section... CARCINOGENS The Osha Cancer Policy § 1990.112 Classification of potential carcinogens. The following criteria for identification, classification and regulation of potential occupational carcinogens will be...

  9. 29 CFR 1990.112 - Classification of potential carcinogens.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 29 Labor 9 2013-07-01 2013-07-01 false Classification of potential carcinogens. 1990.112 Section... CARCINOGENS The Osha Cancer Policy § 1990.112 Classification of potential carcinogens. The following criteria for identification, classification and regulation of potential occupational carcinogens will be...

  10. 29 CFR 1990.112 - Classification of potential carcinogens.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 29 Labor 9 2014-07-01 2014-07-01 false Classification of potential carcinogens. 1990.112 Section... CARCINOGENS The Osha Cancer Policy § 1990.112 Classification of potential carcinogens. The following criteria for identification, classification and regulation of potential occupational carcinogens will be...

  11. Insights into next developments in advanced gastric cancer.

    PubMed

    Obermannová, Radka; Lordick, Florian

    2016-07-01

    The purpose of the review is to delineate novel approaches for biology-based treatment in advanced gastric cancer. We reviewed the latest translational and clinical research articles and congress presentations. A new molecular classification of gastric cancer based on histology, genetic and proteomic alterations has evolved. It provides a roadmap for development of new drugs and combinations and for patient stratification. Anti-HER2 treatment, which is an effective strategy in metastatic gastric cancer, is now also being studied in the perioperative setting. However, resistance mechanisms in advanced disease are poorly understood and optimal patient selection remains challenging. Targeting angiogenesis is an emerging concept in the management of advanced gastric cancer, and ramucirumab has prolonged survival in the second line either as a monotherapy or in combination with paclitaxel. Biomarkers for selecting patients who benefit from ramucirumab are still lacking. Immune checkpoint blockade and inhibition of cancer stemness targets are other emerging directions for the medical treatment of gastric cancer. Large-scale international studies are ongoing. Promising biology-based treatment strategies are evolving. But tumor heterogeneity which is an inherent feature of gastric cancer challenges the development of molecularly targeted and personalized treatment strategies.

  12. Gene-expression signatures can distinguish gastric cancer grades and stages.

    PubMed

    Cui, Juan; Li, Fan; Wang, Guoqing; Fang, Xuedong; Puett, J David; Xu, Ying

    2011-03-18

    Microarray gene-expression data of 54 paired gastric cancer and adjacent noncancerous gastric tissues were analyzed, with the aim to establish gene signatures for cancer grades (well-, moderately-, poorly- or un-differentiated) and stages (I, II, III and IV), which have been determined by pathologists. Our statistical analysis led to the identification of a number of gene combinations whose expression patterns serve well as signatures of different grades and different stages of gastric cancer. A 19-gene signature was found to have discerning power between high- and low-grade gastric cancers in general, with overall classification accuracy at 79.6%. An expanded 198-gene panel allows the stratification of cancers into four grades and control, giving rise to an overall classification agreement of 74.2% between each grade designated by the pathologists and our prediction. Two signatures for cancer staging, consisting of 10 genes and 9 genes, respectively, provide high classification accuracies at 90.0% and 84.0%, among early-, advanced-stage cancer and control. Functional and pathway analyses on these signature genes reveal the significant relevance of the derived signatures to cancer grades and progression. To the best of our knowledge, this represents the first study on identification of genes whose expression patterns can serve as markers for cancer grades and stages.

  13. [Exposed workers to lung cancer risk. An estimation using the ISPESL database of enterprises].

    PubMed

    Scarselli, Alberto; Marinaccio, Alessandro; Nesti, Massimo

    2007-01-01

    lung cancer is the first cause of death in the industrialized country among males and is increasing among females. In 2001 a uniform and standardised list of occupations or jobs known or suspected to be associated with lung cancer has been prepared. The aim of this study is to set up a database of Italian enterprises corresponding to activities related to this list and to assess the number of potentially exposed workers. a detailed and unique list of codes, referred to Ateco91 ISTAT classification with exclusion of the State Railways and the public administration sectors, has been developed. The list is divided into two categories: respectively for occupations or jobs definitely entailing carcinogenic risk and for those which probably/possibly entail a risk. Firms have been selected from the ISPESL database of enterprises and the number of workers has been estimated on the basis of this list. Italy. assessment of the number of workers potentially exposed to lung cancer risk and creation of a register of involved firms. the number of potentially exposed workers in the industrial and services sector related to lung cancer risk is 650,886 blue collars and the number of firms censused in Italy is 117,006 units. Corresponding figures in the agriculture sector are 163,340 and 84,839. This type of evaluation, being based on administrative sources rather then on direct measures of exposure, certainly includes an overestimation of exposed workers. the lists based on a standard classification which have been created allow for the creation of databases which can be used to control occupational exposure to carcinogens and to increase comparability between epidemiologic studies based on job-exposure matrix.

  14. Cytokines Synergize to Combat Metastatic Neuroblastoma | Center for Cancer Research

    Cancer.gov

    Neuroblastoma is the most common extracranial solid tumor in children, and clinical outcomes of patients with this disease are quite variable. Prognosis is particularly poor for patients with high-risk tumors (classification based on patients’ age, extent of disease spread, and other biological features).

  15. Classification of current anticancer immunotherapies

    PubMed Central

    Vacchelli, Erika; Pedro, José-Manuel Bravo-San; Buqué, Aitziber; Senovilla, Laura; Baracco, Elisa Elena; Bloy, Norma; Castoldi, Francesca; Abastado, Jean-Pierre; Agostinis, Patrizia; Apte, Ron N.; Aranda, Fernando; Ayyoub, Maha; Beckhove, Philipp; Blay, Jean-Yves; Bracci, Laura; Caignard, Anne; Castelli, Chiara; Cavallo, Federica; Celis, Estaban; Cerundolo, Vincenzo; Clayton, Aled; Colombo, Mario P.; Coussens, Lisa; Dhodapkar, Madhav V.; Eggermont, Alexander M.; Fearon, Douglas T.; Fridman, Wolf H.; Fučíková, Jitka; Gabrilovich, Dmitry I.; Galon, Jérôme; Garg, Abhishek; Ghiringhelli, François; Giaccone, Giuseppe; Gilboa, Eli; Gnjatic, Sacha; Hoos, Axel; Hosmalin, Anne; Jäger, Dirk; Kalinski, Pawel; Kärre, Klas; Kepp, Oliver; Kiessling, Rolf; Kirkwood, John M.; Klein, Eva; Knuth, Alexander; Lewis, Claire E.; Liblau, Roland; Lotze, Michael T.; Lugli, Enrico; Mach, Jean-Pierre; Mattei, Fabrizio; Mavilio, Domenico; Melero, Ignacio; Melief, Cornelis J.; Mittendorf, Elizabeth A.; Moretta, Lorenzo; Odunsi, Adekunke; Okada, Hideho; Palucka, Anna Karolina; Peter, Marcus E.; Pienta, Kenneth J.; Porgador, Angel; Prendergast, George C.; Rabinovich, Gabriel A.; Restifo, Nicholas P.; Rizvi, Naiyer; Sautès-Fridman, Catherine; Schreiber, Hans; Seliger, Barbara; Shiku, Hiroshi; Silva-Santos, Bruno; Smyth, Mark J.; Speiser, Daniel E.; Spisek, Radek; Srivastava, Pramod K.; Talmadge, James E.; Tartour, Eric; Van Der Burg, Sjoerd H.; Van Den Eynde, Benoît J.; Vile, Richard; Wagner, Hermann; Weber, Jeffrey S.; Whiteside, Theresa L.; Wolchok, Jedd D.; Zitvogel, Laurence; Zou, Weiping

    2014-01-01

    During the past decades, anticancer immunotherapy has evolved from a promising therapeutic option to a robust clinical reality. Many immunotherapeutic regimens are now approved by the US Food and Drug Administration and the European Medicines Agency for use in cancer patients, and many others are being investigated as standalone therapeutic interventions or combined with conventional treatments in clinical studies. Immunotherapies may be subdivided into “passive” and “active” based on their ability to engage the host immune system against cancer. Since the anticancer activity of most passive immunotherapeutics (including tumor-targeting monoclonal antibodies) also relies on the host immune system, this classification does not properly reflect the complexity of the drug-host-tumor interaction. Alternatively, anticancer immunotherapeutics can be classified according to their antigen specificity. While some immunotherapies specifically target one (or a few) defined tumor-associated antigen(s), others operate in a relatively non-specific manner and boost natural or therapy-elicited anticancer immune responses of unknown and often broad specificity. Here, we propose a critical, integrated classification of anticancer immunotherapies and discuss the clinical relevance of these approaches. PMID:25537519

  16. Spectral areas and ratios classifier algorithm for pancreatic tissue classification using optical spectroscopy

    NASA Astrophysics Data System (ADS)

    Chandra, Malavika; Scheiman, James; Simeone, Diane; McKenna, Barbara; Purdy, Julianne; Mycek, Mary-Ann

    2010-01-01

    Pancreatic adenocarcinoma is one of the leading causes of cancer death, in part because of the inability of current diagnostic methods to reliably detect early-stage disease. We present the first assessment of the diagnostic accuracy of algorithms developed for pancreatic tissue classification using data from fiber optic probe-based bimodal optical spectroscopy, a real-time approach that would be compatible with minimally invasive diagnostic procedures for early cancer detection in the pancreas. A total of 96 fluorescence and 96 reflectance spectra are considered from 50 freshly excised tissue sites-including human pancreatic adenocarcinoma, chronic pancreatitis (inflammation), and normal tissues-on nine patients. Classification algorithms using linear discriminant analysis are developed to distinguish among tissues, and leave-one-out cross-validation is employed to assess the classifiers' performance. The spectral areas and ratios classifier (SpARC) algorithm employs a combination of reflectance and fluorescence data and has the best performance, with sensitivity, specificity, negative predictive value, and positive predictive value for correctly identifying adenocarcinoma being 85, 89, 92, and 80%, respectively.

  17. Quantifying heterogeneity of lesion uptake in dynamic contrast enhanced MRI for breast cancer diagnosis

    NASA Astrophysics Data System (ADS)

    Karahaliou, A.; Vassiou, K.; Skiadopoulos, S.; Kanavou, T.; Yiakoumelos, A.; Costaridou, L.

    2009-07-01

    The current study investigates whether texture features extracted from lesion kinetics feature maps can be used for breast cancer diagnosis. Fifty five women with 57 breast lesions (27 benign, 30 malignant) were subjected to dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) on 1.5T system. A linear-slope model was fitted pixel-wise to a representative lesion slice time series and fitted parameters were used to create three kinetic maps (wash out, time to peak enhancement and peak enhancement). 28 grey level co-occurrence matrices features were extracted from each lesion kinetic map. The ability of texture features per map in discriminating malignant from benign lesions was investigated using a Probabilistic Neural Network classifier. Additional classification was performed by combining classification outputs of most discriminating feature subsets from the three maps, via majority voting. The combined scheme outperformed classification based on individual maps achieving area under Receiver Operating Characteristics curve 0.960±0.029. Results suggest that heterogeneity of breast lesion kinetics, as quantified by texture analysis, may contribute to computer assisted tissue characterization in DCE-MRI.

  18. Cancer 2015: a longitudinal whole-of-system study of genomic cancer medicine.

    PubMed

    Thomas, David M; Fox, Stephen; Lorgelly, Paula K; Ashley, David; Richardson, Gary; Lipton, Lara; Parisot, John P; Lucas, Mark; McNeil, John; Wright, Michael

    2015-12-01

    Genomic cancer medicine promises revolutionary change in oncology. The impacts of 'personalized medicine', based upon a molecular classification of cancer and linked to targeted therapies, will extend from individual patient outcomes to the health economy at large. To address the 'whole-of-system' impact of genomic cancer medicine, we have established a prospective cohort of patients with newly diagnosed cancer in the state of Victoria, Australia, about whom we have collected a broad range of clinical, demographic, molecular, and patient-reported data, as well as data on health resource utilization. Our goal is to create a model for investigating public investment in genomic medicine that maximizes the cost:benefit ratio for the Australian community at large. Copyright © 2015 Elsevier Ltd. All rights reserved.

  19. Detection of mitotic nuclei in breast histopathology images using localized ACM and Random Kitchen Sink based classifier.

    PubMed

    Beevi, K Sabeena; Nair, Madhu S; Bindu, G R

    2016-08-01

    The exact measure of mitotic nuclei is a crucial parameter in breast cancer grading and prognosis. This can be achieved by improving the mitotic detection accuracy by careful design of segmentation and classification techniques. In this paper, segmentation of nuclei from breast histopathology images are carried out by Localized Active Contour Model (LACM) utilizing bio-inspired optimization techniques in the detection stage, in order to handle diffused intensities present along object boundaries. Further, the application of a new optimal machine learning algorithm capable of classifying strong non-linear data such as Random Kitchen Sink (RKS), shows improved classification performance. The proposed method has been tested on Mitosis detection in breast cancer histological images (MITOS) dataset provided for MITOS-ATYPIA CONTEST 2014. The proposed framework achieved 95% recall, 98% precision and 96% F-score.

  20. Clinical staging: its importance in therapeutic decisions and clinical trials.

    PubMed

    Denis, L J

    1992-02-01

    International collaboration has resulted in a revised and unified 1987 formulation for the TNM classification in solid tumors. The simplification and eliminations of most variables caused difficulties for the clinical use of the system in some tumors such as bladder cancer. The approval of the proposed adaptation covering the tumor mass, subdividing the T4 category and adapting the stage grouping, resolves these difficulties. Published reports demonstrate support for the TNM system as a clinical base for treatment decisions and prognosis. The TNMG stage and grade are important basic prognostic factors, but other prognostic factors, especially biologic tumor activity, are under clinical investigation. The TNM classification is the initial evaluation after histologic confirmation of cancer to guide treatment and prognosis. The quality of the evaluation is enhanced by precise communication on the employed methodology.

  1. Next-Generation Sequencing in Oncology: Genetic Diagnosis, Risk Prediction and Cancer Classification

    PubMed Central

    Kamps, Rick; Brandão, Rita D.; van den Bosch, Bianca J.; Paulussen, Aimee D. C.; Xanthoulea, Sofia; Blok, Marinus J.; Romano, Andrea

    2017-01-01

    Next-generation sequencing (NGS) technology has expanded in the last decades with significant improvements in the reliability, sequencing chemistry, pipeline analyses, data interpretation and costs. Such advances make the use of NGS feasible in clinical practice today. This review describes the recent technological developments in NGS applied to the field of oncology. A number of clinical applications are reviewed, i.e., mutation detection in inherited cancer syndromes based on DNA-sequencing, detection of spliceogenic variants based on RNA-sequencing, DNA-sequencing to identify risk modifiers and application for pre-implantation genetic diagnosis, cancer somatic mutation analysis, pharmacogenetics and liquid biopsy. Conclusive remarks, clinical limitations, implications and ethical considerations that relate to the different applications are provided. PMID:28146134

  2. Classification of human carcinoma cells using multispectral imagery

    NASA Astrophysics Data System (ADS)

    Ćinar, Umut; Y. Ćetin, Yasemin; Ćetin-Atalay, Rengul; Ćetin, Enis

    2016-03-01

    In this paper, we present a technique for automatically classifying human carcinoma cell images using textural features. An image dataset containing microscopy biopsy images from different patients for 14 distinct cancer cell line type is studied. The images are captured using a RGB camera attached to an inverted microscopy device. Texture based Gabor features are extracted from multispectral input images. SVM classifier is used to generate a descriptive model for the purpose of cell line classification. The experimental results depict satisfactory performance, and the proposed method is versatile for various microscopy magnification options.

  3. A new breast cancer risk analysis approach using features extracted from multiple sub-regions on bilateral mammograms

    NASA Astrophysics Data System (ADS)

    Sun, Wenqing; Tseng, Tzu-Liang B.; Zheng, Bin; Zhang, Jianying; Qian, Wei

    2015-03-01

    A novel breast cancer risk analysis approach is proposed for enhancing performance of computerized breast cancer risk analysis using bilateral mammograms. Based on the intensity of breast area, five different sub-regions were acquired from one mammogram, and bilateral features were extracted from every sub-region. Our dataset includes 180 bilateral mammograms from 180 women who underwent routine screening examinations, all interpreted as negative and not recalled by the radiologists during the original screening procedures. A computerized breast cancer risk analysis scheme using four image processing modules, including sub-region segmentation, bilateral feature extraction, feature selection, and classification was designed to detect and compute image feature asymmetry between the left and right breasts imaged on the mammograms. The highest computed area under the curve (AUC) is 0.763 ± 0.021 when applying the multiple sub-region features to our testing dataset. The positive predictive value and the negative predictive value were 0.60 and 0.73, respectively. The study demonstrates that (1) features extracted from multiple sub-regions can improve the performance of our scheme compared to using features from whole breast area only; (2) a classifier using asymmetry bilateral features can effectively predict breast cancer risk; (3) incorporating texture and morphological features with density features can boost the classification accuracy.

  4. Java Web Start based software for automated quantitative nuclear analysis of prostate cancer and benign prostate hyperplasia.

    PubMed

    Singh, Swaroop S; Kim, Desok; Mohler, James L

    2005-05-11

    Androgen acts via androgen receptor (AR) and accurate measurement of the levels of AR protein expression is critical for prostate research. The expression of AR in paired specimens of benign prostate and prostate cancer from 20 African and 20 Caucasian Americans was compared to demonstrate an application of this system. A set of 200 immunopositive and 200 immunonegative nuclei were collected from the images using a macro developed in Image Pro Plus. Linear Discriminant and Logistic Regression analyses were performed on the data to generate classification coefficients. Classification coefficients render the automated image analysis software independent of the type of immunostaining or image acquisition system used. The image analysis software performs local segmentation and uses nuclear shape and size to detect prostatic epithelial nuclei. AR expression is described by (a) percentage of immunopositive nuclei; (b) percentage of immunopositive nuclear area; and (c) intensity of AR expression among immunopositive nuclei or areas. The percent positive nuclei and percent nuclear area were similar by race in both benign prostate hyperplasia and prostate cancer. In prostate cancer epithelial nuclei, African Americans exhibited 38% higher levels of AR immunostaining than Caucasian Americans (two sided Student's t-tests; P < 0.05). Intensity of AR immunostaining was similar between races in benign prostate. The differences measured in the intensity of AR expression in prostate cancer were consistent with previous studies. Classification coefficients are required due to non-standardized immunostaining and image collection methods across medical institutions and research laboratories and helps customize the software for the specimen under study. The availability of a free, automated system creates new opportunities for testing, evaluation and use of this image analysis system by many research groups who study nuclear protein expression.

  5. Contour classification in thermographic images for detection of breast cancer

    NASA Astrophysics Data System (ADS)

    Okuniewski, Rafał; Nowak, Robert M.; Cichosz, Paweł; Jagodziński, Dariusz; Matysiewicz, Mateusz; Neumann, Łukasz; Oleszkiewicz, Witold

    2016-09-01

    Thermographic images of breast taken by the Braster device are uploaded into web application which uses different classification algorithms to automatically decide whether a patient should be more thoroughly examined. This article presents the approach to the task of classifying contours visible on thermographic images of breast taken by the Braster device in order to make the decision about the existence of cancerous tumors in breast. It presents the results of the researches conducted on the different classification algorithms.

  6. [S3 guidelines on diagnostics and treatment of cervical cancer: Demands on pathology].

    PubMed

    Horn, L-C; Beckmann, M W; Follmann, M; Koch, M C; Mallmann, P; Marnitz, S; Schmidt, D

    2015-11-01

    Between 2011 and the end of 2014 the former consensus S2k guidelines for the diagnostics and treatment of cervical cancer were updated and upgraded to S3 level, methodologically based on the regulations of the German Cancer Society (DKG). The present article summarizes the relevant aspects for the sectioning, histopathological workup, diagnostics and reporting for the pathology of invasive cancer of the uterine cervix. The recommendations are based on the most recent World Health Organization (WHO) and TNM classification systems and consider the needs of the clinician for appropriate surgical and radiotherapeutic treatment of patients. Detailed processing rules of colposcopy-guided diagnostic biopsies, conization and trachelectomy as well as for radical hysterectomy specimens and lymph node resection (including sentinel lymph node resection) are given. In the guidelines deep stromal invasion in macroinvasive cervical cancer is defined for the first time as tumor infiltration of > 66% of the cervical stromal wall. Furthermore, morphological prognostic factors for microinvasive and macroinvasive cervical cancer are summarized.

  7. Median Filter Noise Reduction of Image and Backpropagation Neural Network Model for Cervical Cancer Classification

    NASA Astrophysics Data System (ADS)

    Wutsqa, D. U.; Marwah, M.

    2017-06-01

    In this paper, we consider spatial operation median filter to reduce the noise in the cervical images yielded by colposcopy tool. The backpropagation neural network (BPNN) model is applied to the colposcopy images to classify cervical cancer. The classification process requires an image extraction by using a gray level co-occurrence matrix (GLCM) method to obtain image features that are used as inputs of BPNN model. The advantage of noise reduction is evaluated by comparing the performances of BPNN models with and without spatial operation median filter. The experimental result shows that the spatial operation median filter can improve the accuracy of the BPNN model for cervical cancer classification.

  8. International Association for the Study of Lung Cancer/American Thoracic Society/European Respiratory Society International Multidisciplinary Classification of Lung Adenocarcinoma

    PubMed Central

    Travis, William D.; Brambilla, Elisabeth; Noguchi, Masayuki; Nicholson, Andrew G.; Geisinger, Kim R.; Yatabe, Yasushi; Beer, David G.; Powell, Charles A.; Riely, Gregory J.; Van Schil, Paul E.; Garg, Kavita; Austin, John H. M.; Asamura, Hisao; Rusch, Valerie W.; Hirsch, Fred R.; Scagliotti, Giorgio; Mitsudomi, Tetsuya; Huber, Rudolf M.; Ishikawa, Yuichi; Jett, James; Sanchez-Cespedes, Montserrat; Sculier, Jean-Paul; Takahashi, Takashi; Tsuboi, Masahiro; Vansteenkiste, Johan; Wistuba, Ignacio; Yang, Pan-Chyr; Aberle, Denise; Brambilla, Christian; Flieder, Douglas; Franklin, Wilbur; Gazdar, Adi; Gould, Michael; Hasleton, Philip; Henderson, Douglas; Johnson, Bruce; Johnson, David; Kerr, Keith; Kuriyama, Keiko; Lee, Jin Soo; Miller, Vincent A.; Petersen, Iver; Roggli, Victor; Rosell, Rafael; Saijo, Nagahiro; Thunnissen, Erik; Tsao, Ming; Yankelewitz, David

    2015-01-01

    Introduction Adenocarcinoma is the most common histologic type of lung cancer. To address advances in oncology, molecular biology, pathology, radiology, and surgery of lung adenocarcinoma, an international multidisciplinary classification was sponsored by the International Association for the Study of Lung Cancer, American Thoracic Society, and European Respiratory Society. This new adenocarcinoma classification is needed to provide uniform terminology and diagnostic criteria, especially for bronchioloalveolar carcinoma (BAC), the overall approach to small nonresection cancer specimens, and for multidisciplinary strategic management of tissue for molecular and immunohistochemical studies. Methods An international core panel of experts representing all three societies was formed with oncologists/pulmonologists, pathologists, radiologists, molecular biologists, and thoracic surgeons. A systematic review was performed under the guidance of the American Thoracic Society Documents Development and Implementation Committee. The search strategy identified 11,368 citations of which 312 articles met specified eligibility criteria and were retrieved for full text review. A series of meetings were held to discuss the development of the new classification, to develop the recommendations, and to write the current document. Recommendations for key questions were graded by strength and quality of the evidence according to the Grades of Recommendation, Assessment, Development, and Evaluation approach. Results The classification addresses both resection specimens, and small biopsies and cytology. The terms BAC and mixed subtype adenocarcinoma are no longer used. For resection specimens, new concepts are introduced such as adenocarcinoma in situ (AIS) and minimally invasive adenocarcinoma (MIA) for small solitary adenocarcinomas with either pure lepidic growth (AIS) or predominant lepidic growth with ≤5 mm invasion (MIA) to define patients who, if they undergo complete resection, will have 100% or near 100% disease-specific survival, respectively. AIS and MIA are usually nonmucinous but rarely may be mucinous. Invasive adenocarcinomas are classified by predominant pattern after using comprehensive histologic subtyping with lepidic (formerly most mixed subtype tumors with nonmucinous BAC), acinar, papillary, and solid patterns; micropapillary is added as a new histologic subtype. Variants include invasive mucinous adenocarcinoma (formerly mucinous BAC), colloid, fetal, and enteric adenocarcinoma. This classification provides guidance for small biopsies and cytology specimens, as approximately 70% of lung cancers are diagnosed in such samples. Non-small cell lung carcinomas (NSCLCs), in patients with advanced-stage disease, are to be classified into more specific types such as adenocarcinoma or squamous cell carcinoma, whenever possible for several reasons: (1) adenocarcinoma or NSCLC not otherwise specified should be tested for epidermal growth factor receptor (EGFR) mutations as the presence of these mutations is predictive of responsiveness to EGFR tyrosine kinase inhibitors, (2) adenocarcinoma histology is a strong predictor for improved outcome with pemetrexed therapy compared with squamous cell carcinoma, and (3) potential life-threatening hemorrhage may occur in patients with squamous cell carcinoma who receive bevacizumab. If the tumor cannot be classified based on light microscopy alone, special studies such as immunohistochemistry and/or mucin stains should be applied to classify the tumor further. Use of the term NSCLC not otherwise specified should be minimized. Conclusions This new classification strategy is based on a multidisciplinary approach to diagnosis of lung adenocarcinoma that incorporates clinical, molecular, radiologic, and surgical issues, but it is primarily based on histology. This classification is intended to support clinical practice, and research investigation and clinical trials. As EGFR mutation is a validated predictive marker for response and progression-free survival with EGFR tyrosine kinase inhibitors in advanced lung adenocarcinoma, we recommend that patients with advanced adenocarcinomas be tested for EGFR mutation. This has implications for strategic management of tissue, particularly for small biopsies and cytology samples, to maximize high-quality tissue available for molecular studies. Potential impact for tumor, node, and metastasis staging include adjustment of the size T factor according to only the invasive component (1) pathologically in invasive tumors with lepidic areas or (2) radiologically by measuring the solid component of part-solid nodules. PMID:21252716

  9. International association for the study of lung cancer/american thoracic society/european respiratory society international multidisciplinary classification of lung adenocarcinoma.

    PubMed

    Travis, William D; Brambilla, Elisabeth; Noguchi, Masayuki; Nicholson, Andrew G; Geisinger, Kim R; Yatabe, Yasushi; Beer, David G; Powell, Charles A; Riely, Gregory J; Van Schil, Paul E; Garg, Kavita; Austin, John H M; Asamura, Hisao; Rusch, Valerie W; Hirsch, Fred R; Scagliotti, Giorgio; Mitsudomi, Tetsuya; Huber, Rudolf M; Ishikawa, Yuichi; Jett, James; Sanchez-Cespedes, Montserrat; Sculier, Jean-Paul; Takahashi, Takashi; Tsuboi, Masahiro; Vansteenkiste, Johan; Wistuba, Ignacio; Yang, Pan-Chyr; Aberle, Denise; Brambilla, Christian; Flieder, Douglas; Franklin, Wilbur; Gazdar, Adi; Gould, Michael; Hasleton, Philip; Henderson, Douglas; Johnson, Bruce; Johnson, David; Kerr, Keith; Kuriyama, Keiko; Lee, Jin Soo; Miller, Vincent A; Petersen, Iver; Roggli, Victor; Rosell, Rafael; Saijo, Nagahiro; Thunnissen, Erik; Tsao, Ming; Yankelewitz, David

    2011-02-01

    Adenocarcinoma is the most common histologic type of lung cancer. To address advances in oncology, molecular biology, pathology, radiology, and surgery of lung adenocarcinoma, an international multidisciplinary classification was sponsored by the International Association for the Study of Lung Cancer, American Thoracic Society, and European Respiratory Society. This new adenocarcinoma classification is needed to provide uniform terminology and diagnostic criteria, especially for bronchioloalveolar carcinoma (BAC), the overall approach to small nonresection cancer specimens, and for multidisciplinary strategic management of tissue for molecular and immunohistochemical studies. An international core panel of experts representing all three societies was formed with oncologists/pulmonologists, pathologists, radiologists, molecular biologists, and thoracic surgeons. A systematic review was performed under the guidance of the American Thoracic Society Documents Development and Implementation Committee. The search strategy identified 11,368 citations of which 312 articles met specified eligibility criteria and were retrieved for full text review. A series of meetings were held to discuss the development of the new classification, to develop the recommendations, and to write the current document. Recommendations for key questions were graded by strength and quality of the evidence according to the Grades of Recommendation, Assessment, Development, and Evaluation approach. The classification addresses both resection specimens, and small biopsies and cytology. The terms BAC and mixed subtype adenocarcinoma are no longer used. For resection specimens, new concepts are introduced such as adenocarcinoma in situ (AIS) and minimally invasive adenocarcinoma (MIA) for small solitary adenocarcinomas with either pure lepidic growth (AIS) or predominant lepidic growth with ≤ 5 mm invasion (MIA) to define patients who, if they undergo complete resection, will have 100% or near 100% disease-specific survival, respectively. AIS and MIA are usually nonmucinous but rarely may be mucinous. Invasive adenocarcinomas are classified by predominant pattern after using comprehensive histologic subtyping with lepidic (formerly most mixed subtype tumors with nonmucinous BAC), acinar, papillary, and solid patterns; micropapillary is added as a new histologic subtype. Variants include invasive mucinous adenocarcinoma (formerly mucinous BAC), colloid, fetal, and enteric adenocarcinoma. This classification provides guidance for small biopsies and cytology specimens, as approximately 70% of lung cancers are diagnosed in such samples. Non-small cell lung carcinomas (NSCLCs), in patients with advanced-stage disease, are to be classified into more specific types such as adenocarcinoma or squamous cell carcinoma, whenever possible for several reasons: (1) adenocarcinoma or NSCLC not otherwise specified should be tested for epidermal growth factor receptor (EGFR) mutations as the presence of these mutations is predictive of responsiveness to EGFR tyrosine kinase inhibitors, (2) adenocarcinoma histology is a strong predictor for improved outcome with pemetrexed therapy compared with squamous cell carcinoma, and (3) potential life-threatening hemorrhage may occur in patients with squamous cell carcinoma who receive bevacizumab. If the tumor cannot be classified based on light microscopy alone, special studies such as immunohistochemistry and/or mucin stains should be applied to classify the tumor further. Use of the term NSCLC not otherwise specified should be minimized. This new classification strategy is based on a multidisciplinary approach to diagnosis of lung adenocarcinoma that incorporates clinical, molecular, radiologic, and surgical issues, but it is primarily based on histology. This classification is intended to support clinical practice, and research investigation and clinical trials. As EGFR mutation is a validated predictive marker for response and progression-free survival with EGFR tyrosine kinase inhibitors in advanced lung adenocarcinoma, we recommend that patients with advanced adenocarcinomas be tested for EGFR mutation. This has implications for strategic management of tissue, particularly for small biopsies and cytology samples, to maximize high-quality tissue available for molecular studies. Potential impact for tumor, node, and metastasis staging include adjustment of the size T factor according to only the invasive component (1) pathologically in invasive tumors with lepidic areas or (2) radiologically by measuring the solid component of part-solid nodules.

  10. Hyperspectral imaging of neoplastic progression in a mouse model of oral carcinogenesis

    NASA Astrophysics Data System (ADS)

    Lu, Guolan; Qin, Xulei; Wang, Dongsheng; Muller, Susan; Zhang, Hongzheng; Chen, Amy; Chen, Zhuo Georgia; Fei, Baowei

    2016-03-01

    Hyperspectral imaging (HSI) is an emerging modality for medical applications and holds great potential for noninvasive early detection of cancer. It has been reported that early cancer detection can improve the survival and quality of life of head and neck cancer patients. In this paper, we explored the possibility of differentiating between premalignant lesions and healthy tongue tissue using hyperspectral imaging in a chemical induced oral cancer animal model. We proposed a novel classification algorithm for cancer detection using hyperspectral images. The method detected the dysplastic tissue with an average area under the curve (AUC) of 0.89. The hyperspectral imaging and classification technique may provide a new tool for oral cancer detection.

  11. Investigating Sociodemographic Disparities in Cancer Risk Using Web-Based Informatics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yoon, Hong-Jun; Tourassi, Georgia

    Cancer health disparities due to demographic and socioeconomic factors are an area of great interest in the epidemiological community. Adjusting for such factors is important when developing cancer risk models. However, for digital epidemiology studies relying on online sources such information is not readily available. This paper presents a novel method for extracting demographic and socioeconomic information from openly available online obituaries. The method relies on tailored language processing rules and a probabilistic scheme to map subjects’ occupation history to the occupation classification codes and related earnings provided by the U.S. Census Bureau. Using this information, a case-control study ismore » executed fully in silico to investigate how age, gender, parity, and income level impact breast and lung cancer risk. Based on 48,368 online obituaries (4,643 for breast cancer, 6,274 for lung cancer, and 37,451 cancer-free) collected automatically and a generalized cancer risk model, our study shows strong association between age, parity, and socioeconomic status and cancer risk. Although for breast cancer the observed trends are very consistent with traditional epidemiological studies, some inconsistency is observed for lung cancer with respect to socioeconomic status.« less

  12. Investigating Sociodemographic Disparities in Cancer Risk Using Web-Based Informatics

    DOE PAGES

    Yoon, Hong-Jun; Tourassi, Georgia

    2018-01-24

    Cancer health disparities due to demographic and socioeconomic factors are an area of great interest in the epidemiological community. Adjusting for such factors is important when developing cancer risk models. However, for digital epidemiology studies relying on online sources such information is not readily available. This paper presents a novel method for extracting demographic and socioeconomic information from openly available online obituaries. The method relies on tailored language processing rules and a probabilistic scheme to map subjects’ occupation history to the occupation classification codes and related earnings provided by the U.S. Census Bureau. Using this information, a case-control study ismore » executed fully in silico to investigate how age, gender, parity, and income level impact breast and lung cancer risk. Based on 48,368 online obituaries (4,643 for breast cancer, 6,274 for lung cancer, and 37,451 cancer-free) collected automatically and a generalized cancer risk model, our study shows strong association between age, parity, and socioeconomic status and cancer risk. Although for breast cancer the observed trends are very consistent with traditional epidemiological studies, some inconsistency is observed for lung cancer with respect to socioeconomic status.« less

  13. Computer Based Melanocytic and Nevus Image Enhancement and Segmentation.

    PubMed

    Jamil, Uzma; Akram, M Usman; Khalid, Shehzad; Abbas, Sarmad; Saleem, Kashif

    2016-01-01

    Digital dermoscopy aids dermatologists in monitoring potentially cancerous skin lesions. Melanoma is the 5th common form of skin cancer that is rare but the most dangerous. Melanoma is curable if it is detected at an early stage. Automated segmentation of cancerous lesion from normal skin is the most critical yet tricky part in computerized lesion detection and classification. The effectiveness and accuracy of lesion classification are critically dependent on the quality of lesion segmentation. In this paper, we have proposed a novel approach that can automatically preprocess the image and then segment the lesion. The system filters unwanted artifacts including hairs, gel, bubbles, and specular reflection. A novel approach is presented using the concept of wavelets for detection and inpainting the hairs present in the cancer images. The contrast of lesion with the skin is enhanced using adaptive sigmoidal function that takes care of the localized intensity distribution within a given lesion's images. We then present a segmentation approach to precisely segment the lesion from the background. The proposed approach is tested on the European database of dermoscopic images. Results are compared with the competitors to demonstrate the superiority of the suggested approach.

  14. OC-2-KB: A software pipeline to build an evidence-based obesity and cancer knowledge base.

    PubMed

    Lossio-Ventura, Juan Antonio; Hogan, William; Modave, François; Guo, Yi; He, Zhe; Hicks, Amanda; Bian, Jiang

    2017-11-01

    Obesity has been linked to several types of cancer. Access to adequate health information activates people's participation in managing their own health, which ultimately improves their health outcomes. Nevertheless, the existing online information about the relationship between obesity and cancer is heterogeneous and poorly organized. A formal knowledge representation can help better organize and deliver quality health information. Currently, there are several efforts in the biomedical domain to convert unstructured data to structured data and store them in Semantic Web knowledge bases (KB). In this demo paper, we present, OC-2-KB (Obesity and Cancer to Knowledge Base), a system that is tailored to guide the automatic KB construction for managing obesity and cancer knowledge from free-text scientific literature (i.e., PubMed abstracts) in a systematic way. OC-2-KB has two important modules which perform the acquisition of entities and the extraction then classification of relationships among these entities. We tested the OC-2-KB system on a data set with 23 manually annotated obesity and cancer PubMed abstracts and created a preliminary KB with 765 triples. We conducted a preliminary evaluation on this sample of triples and reported our evaluation results.

  15. Molecular Classification of Gastric Cancer: A new paradigm

    PubMed Central

    Shah, Manish A.; Khanin, Raya; Tang, Laura; Janjigian, Yelena Y.; Klimstra, David S.; Gerdes, Hans; Kelsen, David P.

    2011-01-01

    Purpose Gastric cancer may be subdivided into three distinct subtypes –proximal, diffuse, and distal gastric cancer– based on histopathologic and anatomic criteria. Each subtype is associated with unique epidemiology. Our aim is to test the hypothesis that these distinct gastric cancer subtypes may also be distinguished by gene expression analysis. Experimental Design Patients with localized gastric adenocarcinoma being screened for a phase II preoperative clinical trial (NCI 5917) underwent endoscopic biopsy for fresh tumor procurement. 4–6 targeted biopsies of the primary tumor were obtained. Macrodissection was performed to ensure >80% carcinoma in the sample. HG-U133A GeneChip (Affymetrix) was used for cDNA expression analysis, and all arrays were processed and analyzed using the Bioconductor R-package. Results Between November 2003 and January 2006, 57 patients were screened to identify 36 patients with localized gastric cancer who had adequate RNA for expression analysis. Using supervised analysis, we built a classifier to distinguish the three gastric cancer subtypes, successfully classifying each into tightly grouped clusters. Leave-one-out cross validation error was 0.14, suggesting that >85% of samples were classified correctly. Gene set analysis with the False Discovery Rate set at 0.25 identified several pathways that were differentially regulated when comparing each gastric cancer subtype to adjacent normal stomach. Conclusions Subtypes of gastric cancer that have epidemiologic and histologic distinction are also distinguished by gene expression data. These preliminary data suggest a new classification of gastric cancer with implications for improving our understanding of disease biology and identification of unique molecular drivers for each gastric cancer subtype. PMID:21430069

  16. Crowdsourcing the General Public for Large Scale Molecular Pathology Studies in Cancer

    PubMed Central

    Candido dos Reis, Francisco J.; Lynn, Stuart; Ali, H. Raza; Eccles, Diana; Hanby, Andrew; Provenzano, Elena; Caldas, Carlos; Howat, William J.; McDuffus, Leigh-Anne; Liu, Bin; Daley, Frances; Coulson, Penny; Vyas, Rupesh J.; Harris, Leslie M.; Owens, Joanna M.; Carton, Amy F.M.; McQuillan, Janette P.; Paterson, Andy M.; Hirji, Zohra; Christie, Sarah K.; Holmes, Amber R.; Schmidt, Marjanka K.; Garcia-Closas, Montserrat; Easton, Douglas F.; Bolla, Manjeet K.; Wang, Qin; Benitez, Javier; Milne, Roger L.; Mannermaa, Arto; Couch, Fergus; Devilee, Peter; Tollenaar, Robert A.E.M.; Seynaeve, Caroline; Cox, Angela; Cross, Simon S.; Blows, Fiona M.; Sanders, Joyce; de Groot, Renate; Figueroa, Jonine; Sherman, Mark; Hooning, Maartje; Brenner, Hermann; Holleczek, Bernd; Stegmaier, Christa; Lintott, Chris; Pharoah, Paul D.P.

    2015-01-01

    Background Citizen science, scientific research conducted by non-specialists, has the potential to facilitate biomedical research using available large-scale data, however validating the results is challenging. The Cell Slider is a citizen science project that intends to share images from tumors with the general public, enabling them to score tumor markers independently through an internet-based interface. Methods From October 2012 to June 2014, 98,293 Citizen Scientists accessed the Cell Slider web page and scored 180,172 sub-images derived from images of 12,326 tissue microarray cores labeled for estrogen receptor (ER). We evaluated the accuracy of Citizen Scientist's ER classification, and the association between ER status and prognosis by comparing their test performance against trained pathologists. Findings The area under ROC curve was 0.95 (95% CI 0.94 to 0.96) for cancer cell identification and 0.97 (95% CI 0.96 to 0.97) for ER status. ER positive tumors scored by Citizen Scientists were associated with survival in a similar way to that scored by trained pathologists. Survival probability at 15 years were 0.78 (95% CI 0.76 to 0.80) for ER-positive and 0.72 (95% CI 0.68 to 0.77) for ER-negative tumors based on Citizen Scientists classification. Based on pathologist classification, survival probability was 0.79 (95% CI 0.77 to 0.81) for ER-positive and 0.71 (95% CI 0.67 to 0.74) for ER-negative tumors. The hazard ratio for death was 0.26 (95% CI 0.18 to 0.37) at diagnosis and became greater than one after 6.5 years of follow-up for ER scored by Citizen Scientists, and 0.24 (95% CI 0.18 to 0.33) at diagnosis increasing thereafter to one after 6.7 (95% CI 4.1 to 10.9) years of follow-up for ER scored by pathologists. Interpretation Crowdsourcing of the general public to classify cancer pathology data for research is viable, engages the public and provides accurate ER data. Crowdsourced classification of research data may offer a valid solution to problems of throughput requiring human input. PMID:26288840

  17. Color edges extraction using statistical features and automatic threshold technique: application to the breast cancer cells.

    PubMed

    Ben Chaabane, Salim; Fnaiech, Farhat

    2014-01-23

    Color image segmentation has been so far applied in many areas; hence, recently many different techniques have been developed and proposed. In the medical imaging area, the image segmentation may be helpful to provide assistance to doctor in order to follow-up the disease of a certain patient from the breast cancer processed images. The main objective of this work is to rebuild and also to enhance each cell from the three component images provided by an input image. Indeed, from an initial segmentation obtained using the statistical features and histogram threshold techniques, the resulting segmentation may represent accurately the non complete and pasted cells and enhance them. This allows real help to doctors, and consequently, these cells become clear and easy to be counted. A novel method for color edges extraction based on statistical features and automatic threshold is presented. The traditional edge detector, based on the first and the second order neighborhood, describing the relationship between the current pixel and its neighbors, is extended to the statistical domain. Hence, color edges in an image are obtained by combining the statistical features and the automatic threshold techniques. Finally, on the obtained color edges with specific primitive color, a combination rule is used to integrate the edge results over the three color components. Breast cancer cell images were used to evaluate the performance of the proposed method both quantitatively and qualitatively. Hence, a visual and a numerical assessment based on the probability of correct classification (PC), the false classification (Pf), and the classification accuracy (Sens(%)) are presented and compared with existing techniques. The proposed method shows its superiority in the detection of points which really belong to the cells, and also the facility of counting the number of the processed cells. Computer simulations highlight that the proposed method substantially enhances the segmented image with smaller error rates better than other existing algorithms under the same settings (patterns and parameters). Moreover, it provides high classification accuracy, reaching the rate of 97.94%. Additionally, the segmentation method may be extended to other medical imaging types having similar properties.

  18. Clear-cell differentiation and lymphatic invasion, but not the revised TNM classification, predict lymph node metastases in pT1 penile cancer: a clinicopathologic study of 76 patients from a low incidence area.

    PubMed

    Mannweiler, Sebastian; Sygulla, Stephan; Tsybrovskyy, Oleksiy; Razmara, Yas; Pummer, Karl; Regauer, Sigrid

    2013-10-01

    Prediction of lymph node (LN) metastases in penile invasive cancer relies on clinical features and histologic characteristics of the primary tumor. Correct prediction, however, is difficult, as only 50% patients undergoing lymphadenectomies will have LN metastases. In 2009, the tumor, nodes, metastases (TNM) classification for staging of early penile cancers was revised. We tested the predictive accuracy of the revised TNM classification in a low incidence area for penile carcinoma. The presence of LN metastases in 76 men with pT1 penile cancers was correlated with the 2009 TNM subclassification, which is based on a combined evaluation of tumor grade and lymphatic invasion, but also with individual parameters, such as histologic grade, lymphatic invasion, perineural invasion, invasion depth, growth pattern and human papilloma virus (HPV) status. 76pT1 penile cancers were reclassified into 31pT1a squamous cell carcinomas (SCC) and 45pT1b (41 SCC; 4 clear-cell carcinomas); 12/22 men (55%; 8 SCC, 4 clear-cell carcinomas) undergoing lymphadenectomy for enlarged inguinal lymph nodes had metastases, 54 patients without enlarged LN and lymphadenectomies had no LN metastases during follow-up of median 47 months. Statistically, clear cell differentiation of the primary carcinoma was highly associated with metastases (100% clear-cell carcinomas vs. 11% SCC) and poor survival (50% vs. 5.5%). Among conventional SCC, only lymphatic invasion showed a highly significant association with metastases with 100% specificity. The 2009 TNM classification, tumor grade alone, perineural invasion, growth pattern, invasion depth or HPV status could not predict LN status. Lymphadenectomy for enlarged LN resulted in 100% sensitivity and 42% predictive probability for identifying metastases and a 16% false positive rate. Statistically, survival correlated significantly with clear-cell differentiation and with lymphatic invasion in both clear-cell carcinomas and conventional SCC. Penile clear-cell carcinomas are more aggressive cancers than SCC. Our observation suggests a benefit of a prophylactic lymphadenectomy for patients with clear-cell carcinomas. Among conventional SCC, only lymphatic invasion predicted LN metastases. Neither tumor grade alone nor perineural invasion, growth pattern, depth of invasion, and subgrouping according to the revised TNM classification correlated with metastases. Clinical evaluation of the LN status was superior to histologic risk stratification. Copyright © 2013 Elsevier Inc. All rights reserved.

  19. CANcer-specific Evaluation System (CANES): a high-accuracy platform, for preclinical single/multi-biomarker discovery

    PubMed Central

    Kwon, Min-Seok; Nam, Seungyoon; Lee, Sungyoung; Ahn, Young Zoo; Chang, Hae Ryung; Kim, Yon Hui; Park, Taesung

    2017-01-01

    The recent creation of enormous, cancer-related “Big Data” public depositories represents a powerful means for understanding tumorigenesis. However, a consistently accurate system for clinically evaluating single/multi-biomarkers remains lacking, and it has been asserted that oft-failed clinical advancement of biomarkers occurs within the very early stages of biomarker assessment. To address these challenges, we developed a clinically testable, web-based tool, CANcer-specific single/multi-biomarker Evaluation System (CANES), to evaluate biomarker effectiveness, across 2,134 whole transcriptome datasets, from 94,147 biological samples (from 18 tumor types). For user-provided single/multi-biomarkers, CANES evaluates the performance of single/multi-biomarker candidates, based on four classification methods, support vector machine, random forest, neural networks, and classification and regression trees. In addition, CANES offers several advantages over earlier analysis tools, including: 1) survival analysis; 2) evaluation of mature miRNAs as markers for user-defined diagnostic or prognostic purposes; and 3) provision of a “pan-cancer” summary view, based on each single marker. We believe that such “landscape” evaluation of single/multi-biomarkers, for diagnostic therapeutic/prognostic decision-making, will be highly valuable for the discovery and “repurposing” of existing biomarkers (and their specific targeted therapies), leading to improved patient therapeutic stratification, a key component of targeted therapy success for the avoidance of therapy resistance. PMID:29050243

  20. Oral cancer screening: serum Raman spectroscopic approach

    NASA Astrophysics Data System (ADS)

    Sahu, Aditi K.; Dhoot, Suyash; Singh, Amandeep; Sawant, Sharada S.; Nandakumar, Nikhila; Talathi-Desai, Sneha; Garud, Mandavi; Pagare, Sandeep; Srivastava, Sanjeeva; Nair, Sudhir; Chaturvedi, Pankaj; Murali Krishna, C.

    2015-11-01

    Serum Raman spectroscopy (RS) has previously shown potential in oral cancer diagnosis and recurrence prediction. To evaluate the potential of serum RS in oral cancer screening, premalignant and cancer-specific detection was explored in the present study using 328 subjects belonging to healthy controls, premalignant, disease controls, and oral cancer groups. Spectra were acquired using a Raman microprobe. Spectral findings suggest changes in amino acids, lipids, protein, DNA, and β-carotene across the groups. A patient-wise approach was employed for data analysis using principal component linear discriminant analysis. In the first step, the classification among premalignant, disease control (nonoral cancer), oral cancer, and normal samples was evaluated in binary classification models. Thereafter, two screening-friendly classification approaches were explored to further evaluate the clinical utility of serum RS: a single four-group model and normal versus abnormal followed by determining the type of abnormality model. Results demonstrate the feasibility of premalignant and specific cancer detection. The normal versus abnormal model yields better sensitivity and specificity rates of 64 and 80% these rates are comparable to standard screening approaches. Prospectively, as the current screening procedure of visual inspection is useful mainly for high-risk populations, serum RS may serve as a useful adjunct for early and specific detection of oral precancers and cancer.

  1. Lesion classification using clinical and visual data fusion by multiple kernel learning

    NASA Astrophysics Data System (ADS)

    Kisilev, Pavel; Hashoul, Sharbell; Walach, Eugene; Tzadok, Asaf

    2014-03-01

    To overcome operator dependency and to increase diagnosis accuracy in breast ultrasound (US), a lot of effort has been devoted to developing computer-aided diagnosis (CAD) systems for breast cancer detection and classification. Unfortunately, the efficacy of such CAD systems is limited since they rely on correct automatic lesions detection and localization, and on robustness of features computed based on the detected areas. In this paper we propose a new approach to boost the performance of a Machine Learning based CAD system, by combining visual and clinical data from patient files. We compute a set of visual features from breast ultrasound images, and construct the textual descriptor of patients by extracting relevant keywords from patients' clinical data files. We then use the Multiple Kernel Learning (MKL) framework to train SVM based classifier to discriminate between benign and malignant cases. We investigate different types of data fusion methods, namely, early, late, and intermediate (MKL-based) fusion. Our database consists of 408 patient cases, each containing US images, textual description of complaints and symptoms filled by physicians, and confirmed diagnoses. We show experimentally that the proposed MKL-based approach is superior to other classification methods. Even though the clinical data is very sparse and noisy, its MKL-based fusion with visual features yields significant improvement of the classification accuracy, as compared to the image features only based classifier.

  2. Evaluation of different radon guideline values based on characterization of ecological risk and visualization of lung cancer mortality trends in British Columbia, Canada.

    PubMed

    Branion-Calles, Michael C; Nelson, Trisalyn A; Henderson, Sarah B

    2015-11-19

    There is no safe concentration of radon gas, but guideline values provide threshold concentrations that are used to map areas at higher risk. These values vary between different regions, countries, and organizations, which can lead to differential classification of risk. For example the World Health Organization suggests a 100 Bq m(-3)value, while Health Canada recommends 200 Bq m(-3). Our objective was to describe how different thresholds characterized ecological radon risk and their visual association with lung cancer mortality trends in British Columbia, Canada. Eight threshold values between 50 and 600 Bq m(-3) were identified, and classes of radon vulnerability were defined based on whether the observed 95(th) percentile radon concentration was above or below each value. A balanced random forest algorithm was used to model vulnerability, and the results were mapped. We compared high vulnerability areas, their estimated populations, and differences in lung cancer mortality trends stratified by smoking prevalence and sex. Classification accuracy improved as the threshold concentrations decreased and the area classified as high vulnerability increased. Majority of the population lived within areas of lower vulnerability regardless of the threshold value. Thresholds as low as 50 Bq m(-3) were associated with higher lung cancer mortality, even in areas with low smoking prevalence. Temporal trends in lung cancer mortality were increasing for women, while decreasing for men. Radon contributes to lung cancer in British Columbia. The results of the study contribute evidence supporting the use of a reference level lower than the current guideline of 200 Bq m(-3) for the province.

  3. Cloud-Based NoSQL Open Database of Pulmonary Nodules for Computer-Aided Lung Cancer Diagnosis and Reproducible Research.

    PubMed

    Ferreira Junior, José Raniery; Oliveira, Marcelo Costa; de Azevedo-Marques, Paulo Mazzoncini

    2016-12-01

    Lung cancer is the leading cause of cancer-related deaths in the world, and its main manifestation is pulmonary nodules. Detection and classification of pulmonary nodules are challenging tasks that must be done by qualified specialists, but image interpretation errors make those tasks difficult. In order to aid radiologists on those hard tasks, it is important to integrate the computer-based tools with the lesion detection, pathology diagnosis, and image interpretation processes. However, computer-aided diagnosis research faces the problem of not having enough shared medical reference data for the development, testing, and evaluation of computational methods for diagnosis. In order to minimize this problem, this paper presents a public nonrelational document-oriented cloud-based database of pulmonary nodules characterized by 3D texture attributes, identified by experienced radiologists and classified in nine different subjective characteristics by the same specialists. Our goal with the development of this database is to improve computer-aided lung cancer diagnosis and pulmonary nodule detection and classification research through the deployment of this database in a cloud Database as a Service framework. Pulmonary nodule data was provided by the Lung Image Database Consortium and Image Database Resource Initiative (LIDC-IDRI), image descriptors were acquired by a volumetric texture analysis, and database schema was developed using a document-oriented Not only Structured Query Language (NoSQL) approach. The proposed database is now with 379 exams, 838 nodules, and 8237 images, 4029 of them are CT scans and 4208 manually segmented nodules, and it is allocated in a MongoDB instance on a cloud infrastructure.

  4. Simultaneous detection and classification of breast masses in digital mammograms via a deep learning YOLO-based CAD system.

    PubMed

    Al-Masni, Mohammed A; Al-Antari, Mugahed A; Park, Jeong-Min; Gi, Geon; Kim, Tae-Yeon; Rivera, Patricio; Valarezo, Edwin; Choi, Mun-Taek; Han, Seung-Moo; Kim, Tae-Seong

    2018-04-01

    Automatic detection and classification of the masses in mammograms are still a big challenge and play a crucial role to assist radiologists for accurate diagnosis. In this paper, we propose a novel Computer-Aided Diagnosis (CAD) system based on one of the regional deep learning techniques, a ROI-based Convolutional Neural Network (CNN) which is called You Only Look Once (YOLO). Although most previous studies only deal with classification of masses, our proposed YOLO-based CAD system can handle detection and classification simultaneously in one framework. The proposed CAD system contains four main stages: preprocessing of mammograms, feature extraction utilizing deep convolutional networks, mass detection with confidence, and finally mass classification using Fully Connected Neural Networks (FC-NNs). In this study, we utilized original 600 mammograms from Digital Database for Screening Mammography (DDSM) and their augmented mammograms of 2,400 with the information of the masses and their types in training and testing our CAD. The trained YOLO-based CAD system detects the masses and then classifies their types into benign or malignant. Our results with five-fold cross validation tests show that the proposed CAD system detects the mass location with an overall accuracy of 99.7%. The system also distinguishes between benign and malignant lesions with an overall accuracy of 97%. Our proposed system even works on some challenging breast cancer cases where the masses exist over the pectoral muscles or dense regions. Copyright © 2018 Elsevier B.V. All rights reserved.

  5. Thermographic image analysis as a pre-screening tool for the detection of canine bone cancer

    NASA Astrophysics Data System (ADS)

    Subedi, Samrat; Umbaugh, Scott E.; Fu, Jiyuan; Marino, Dominic J.; Loughin, Catherine A.; Sackman, Joseph

    2014-09-01

    Canine bone cancer is a common type of cancer that grows fast and may be fatal. It usually appears in the limbs which is called "appendicular bone cancer." Diagnostic imaging methods such as X-rays, computed tomography (CT scan), and magnetic resonance imaging (MRI) are more common methods in bone cancer detection than invasive physical examination such as biopsy. These imaging methods have some disadvantages; including high expense, high dose of radiation, and keeping the patient (canine) motionless during the imaging procedures. This project study identifies the possibility of using thermographic images as a pre-screening tool for diagnosis of bone cancer in dogs. Experiments were performed with thermographic images from 40 dogs exhibiting the disease bone cancer. Experiments were performed with color normalization using temperature data provided by the Long Island Veterinary Specialists. The images were first divided into four groups according to body parts (Elbow/Knee, Full Limb, Shoulder/Hip and Wrist). Each of the groups was then further divided into three sub-groups according to views (Anterior, Lateral and Posterior). Thermographic pattern of normal and abnormal dogs were analyzed using feature extraction and pattern classification tools. Texture features, spectral feature and histogram features were extracted from the thermograms and were used for pattern classification. The best classification success rate in canine bone cancer detection is 90% with sensitivity of 100% and specificity of 80% produced by anterior view of full-limb region with nearest neighbor classification method and normRGB-lum color normalization method. Our results show that it is possible to use thermographic imaging as a pre-screening tool for detection of canine bone cancer.

  6. A Model-Free Machine Learning Method for Risk Classification and Survival Probability Prediction.

    PubMed

    Geng, Yuan; Lu, Wenbin; Zhang, Hao Helen

    2014-01-01

    Risk classification and survival probability prediction are two major goals in survival data analysis since they play an important role in patients' risk stratification, long-term diagnosis, and treatment selection. In this article, we propose a new model-free machine learning framework for risk classification and survival probability prediction based on weighted support vector machines. The new procedure does not require any specific parametric or semiparametric model assumption on data, and is therefore capable of capturing nonlinear covariate effects. We use numerous simulation examples to demonstrate finite sample performance of the proposed method under various settings. Applications to a glioma tumor data and a breast cancer gene expression survival data are shown to illustrate the new methodology in real data analysis.

  7. Model selection for anomaly detection

    NASA Astrophysics Data System (ADS)

    Burnaev, E.; Erofeev, P.; Smolyakov, D.

    2015-12-01

    Anomaly detection based on one-class classification algorithms is broadly used in many applied domains like image processing (e.g. detection of whether a patient is "cancerous" or "healthy" from mammography image), network intrusion detection, etc. Performance of an anomaly detection algorithm crucially depends on a kernel, used to measure similarity in a feature space. The standard approaches (e.g. cross-validation) for kernel selection, used in two-class classification problems, can not be used directly due to the specific nature of a data (absence of a second, abnormal, class data). In this paper we generalize several kernel selection methods from binary-class case to the case of one-class classification and perform extensive comparison of these approaches using both synthetic and real-world data.

  8. Prediction of Depression in Cancer Patients With Different Classification Criteria, Linear Discriminant Analysis versus Logistic Regression.

    PubMed

    Shayan, Zahra; Mohammad Gholi Mezerji, Naser; Shayan, Leila; Naseri, Parisa

    2015-11-03

    Logistic regression (LR) and linear discriminant analysis (LDA) are two popular statistical models for prediction of group membership. Although they are very similar, the LDA makes more assumptions about the data. When categorical and continuous variables used simultaneously, the optimal choice between the two models is questionable. In most studies, classification error (CE) is used to discriminate between subjects in several groups, but this index is not suitable to predict the accuracy of the outcome. The present study compared LR and LDA models using classification indices. This cross-sectional study selected 243 cancer patients. Sample sets of different sizes (n = 50, 100, 150, 200, 220) were randomly selected and the CE, B, and Q classification indices were calculated by the LR and LDA models. CE revealed the a lack of superiority for one model over the other, but the results showed that LR performed better than LDA for the B and Q indices in all situations. No significant effect for sample size on CE was noted for selection of an optimal model. Assessment of the accuracy of prediction of real data indicated that the B and Q indices are appropriate for selection of an optimal model. The results of this study showed that LR performs better in some cases and LDA in others when based on CE. The CE index is not appropriate for classification, although the B and Q indices performed better and offered more efficient criteria for comparison and discrimination between groups.

  9. Quantitative diffusion weighted imaging parameters in tumor and peritumoral stroma for prediction of molecular subtypes in breast cancer

    NASA Astrophysics Data System (ADS)

    He, Ting; Fan, Ming; Zhang, Peng; Li, Hui; Zhang, Juan; Shao, Guoliang; Li, Lihua

    2018-03-01

    Breast cancer can be classified into four molecular subtypes of Luminal A, Luminal B, HER2 and Basal-like, which have significant differences in treatment and survival outcomes. We in this study aim to predict immunohistochemistry (IHC) determined molecular subtypes of breast cancer using image features derived from tumor and peritumoral stroma region based on diffusion weighted imaging (DWI). A dataset of 126 breast cancer patients were collected who underwent preoperative breast MRI with a 3T scanner. The apparent diffusion coefficients (ADCs) were recorded from DWI, and breast image was segmented into regions comprising the tumor and the surrounding stromal. Statistical characteristics in various breast tumor and peritumoral regions were computed, including mean, minimum, maximum, variance, interquartile range, range, skewness, and kurtosis of ADC values. Additionally, the difference of features between each two regions were also calculated. The univariate logistic based classifier was performed for evaluating the performance of the individual features for discriminating subtypes. For multi-class classification, multivariate logistic regression model was trained and validated. The results showed that the tumor boundary and proximal peritumoral stroma region derived features have a higher performance in classification compared to that of the other regions. Furthermore, the prediction model using statistical features, difference features and all the features combined from these regions generated AUC values of 0.774, 0.796 and 0.811, respectively. The results in this study indicate that ADC feature in tumor and peritumoral stromal region would be valuable for estimating the molecular subtype in breast cancer.

  10. Recent developments in tissue-type imaging (TTI) for planning and monitoring treatment of prostate cancer.

    PubMed

    Feleppa, Ernest J; Porter, Christopher R; Ketterling, Jeffrey; Lee, Paul; Dasgupta, Shreedevi; Urban, Stella; Kalisz, Andrew

    2004-07-01

    Because current methods of imaging prostate cancer are inadequate, biopsies cannot be effectively guided and treatment cannot be effectively planned and targeted. Therefore, our research is aimed at ultrasonically characterizing cancerous prostate tissue so that we can image it more effectively and thereby provide improved means of detecting, treating and monitoring prostate cancer. We base our characterization methods on spectrum analysis of radiofrequency (rf) echo signals combined with clinical variables such as prostate-specific antigen (PSA). Tissue typing using these parameters is performed by artificial neural networks. We employed and evaluated different approaches to data partitioning into training, validation, and test sets and different neural network configuration options. In this manner, we sought to determine what neural network configuration is optimal for these data and also to assess possible bias that might exist due to correlations among different data entries among the data for a given patient. The classification efficacy of each neural network configuration and data-partitioning method was measured using relative-operating-characteristic (ROC) methods. Neural network classification based on spectral parameters combined with clinical data generally produced ROC-curve areas of 0.80 compared to curve areas of 0.64 for conventional transrectal ultrasound imaging combined with clinical data. We then used the optimal neural network configuration to generate lookup tables that translate local spectral parameter values and global clinical-variable values into pixel values in tissue-type images (TTIs). TTIs continue to show cancerous regions successfully, and may prove to be particularly useful clinically in combination with other ultrasonic and nonultrasonic methods, e.g., magnetic-resonance spectroscopy.

  11. Towards an evidence-based approach for diagnosis and management of adnexal masses: findings of the International Ovarian Tumour Analysis (IOTA) studies.

    PubMed

    Kaijser, J

    2015-01-01

    Whilst the outcomes for patients with ovarian cancer clearly benefit from centralised, comprehensive care in dedicated cancer centres, unfortunately the majority of patients still do not receive appropriate specialist treatment. Any improvement in the accuracy of current triaging and referral pathways whether using new imaging tests or biomarkers would therefore be of value in order to optimise the appropriate selection of patients for such care. An analysis of current evidence shows that such tests are now available, but still await recognition, acceptance and widespread adoption. It is therefore to be hoped that present guidance relating to the classification of ovarian masses will soon become more "evidence-based". These promising tests include the International Ovarian Tumour Analysis (IOTA) LR2 model and ultrasound-based Simple Rules (SR). Based on a comprehensive recent meta-analysis both currently offer the optimal "evidence-based" approach to discriminating between cancer and benign conditions in women with adnexal tumours needing surgery. LR2 and SR are reliable tests having been shown to maintain a high sensitivity for cancer after independent external and temporal validation by the IOTA group in the hands of examiners with various levels of ultrasound expertise. They also offer more accurate triage compared to the existing Risk of Malignancy Index (RMI). The development of the IOTA ADNEX model represents an important step forward towards more individualised patient care in this area. ADNEX is a novel test that enables the more specific subtyping of adnexal cancers (i.e. borderline, stage 1 invasive, stage II-IV invasive, and secondary metastatic malignant tumours) and shares similar levels of accuracy to IOTA LR2 and SR for basic discrimination between cancer and benign disease. The IOTA study has made significant progress in relation to the classification of adnexal masses, however what is now needed is to see if these or new diagnostic tools can assist clinicians to select patients with adnexal masses that are suitable for expectant management, and that will work in all health care settings (i.e. primary vs secondary vs tertiary care). These important themes will likely control the future agenda of the IOTA project.

  12. GTA: a game theoretic approach to identifying cancer subnetwork markers.

    PubMed

    Farahmand, S; Goliaei, S; Ansari-Pour, N; Razaghi-Moghadam, Z

    2016-03-01

    The identification of genetic markers (e.g. genes, pathways and subnetworks) for cancer has been one of the most challenging research areas in recent years. A subset of these studies attempt to analyze genome-wide expression profiles to identify markers with high reliability and reusability across independent whole-transcriptome microarray datasets. Therefore, the functional relationships of genes are integrated with their expression data. However, for a more accurate representation of the functional relationships among genes, utilization of the protein-protein interaction network (PPIN) seems to be necessary. Herein, a novel game theoretic approach (GTA) is proposed for the identification of cancer subnetwork markers by integrating genome-wide expression profiles and PPIN. The GTA method was applied to three distinct whole-transcriptome breast cancer datasets to identify the subnetwork markers associated with metastasis. To evaluate the performance of our approach, the identified subnetwork markers were compared with gene-based, pathway-based and network-based markers. We show that GTA is not only capable of identifying robust metastatic markers, it also provides a higher classification performance. In addition, based on these GTA-based subnetworks, we identified a new bonafide candidate gene for breast cancer susceptibility.

  13. Automatic Gleason grading of H and E stained microscopic prostate images using deep convolutional neural networks

    NASA Astrophysics Data System (ADS)

    Gummeson, Anna; Arvidsson, Ida; Ohlsson, Mattias; Overgaard, Niels C.; Krzyzanowska, Agnieszka; Heyden, Anders; Bjartell, Anders; Aström, Kalle

    2017-03-01

    Prostate cancer is the most diagnosed cancer in men. The diagnosis is confirmed by pathologists based on ocular inspection of prostate biopsies in order to classify them according to Gleason score. The main goal of this paper is to automate the classification using convolutional neural networks (CNNs). The introduction of CNNs has broadened the field of pattern recognition. It replaces the classical way of designing and extracting hand-made features used for classification with the substantially different strategy of letting the computer itself decide which features are of importance. For automated prostate cancer classification into the classes: Benign, Gleason grade 3, 4 and 5 we propose a CNN with small convolutional filters that has been trained from scratch using stochastic gradient descent with momentum. The input consists of microscopic images of haematoxylin and eosin stained tissue, the output is a coarse segmentation into regions of the four different classes. The dataset used consists of 213 images, each considered to be of one class only. Using four-fold cross-validation we obtained an error rate of 7.3%, which is significantly better than previous state of the art using the same dataset. Although the dataset was rather small, good results were obtained. From this we conclude that CNN is a promising method for this problem. Future work includes obtaining a larger dataset, which potentially could diminish the error margin.

  14. Ensemble of sparse classifiers for high-dimensional biological data.

    PubMed

    Kim, Sunghan; Scalzo, Fabien; Telesca, Donatello; Hu, Xiao

    2015-01-01

    Biological data are often high in dimension while the number of samples is small. In such cases, the performance of classification can be improved by reducing the dimension of data, which is referred to as feature selection. Recently, a novel feature selection method has been proposed utilising the sparsity of high-dimensional biological data where a small subset of features accounts for most variance of the dataset. In this study we propose a new classification method for high-dimensional biological data, which performs both feature selection and classification within a single framework. Our proposed method utilises a sparse linear solution technique and the bootstrap aggregating algorithm. We tested its performance on four public mass spectrometry cancer datasets along with two other conventional classification techniques such as Support Vector Machines and Adaptive Boosting. The results demonstrate that our proposed method performs more accurate classification across various cancer datasets than those conventional classification techniques.

  15. [Histological and molecular classification of endometrial carcinoma and therapeutical implications].

    PubMed

    Genestie, Catherine; Leary, Alexandra; Devouassoux, Mojgan; Auguste, Aurélie

    2017-12-01

    Endometrial cancer is the fourth cause of cancer in women in France and is the second most common cancer of the gynecologic cancer after breast cancer with 7275 new cases in 2012. The incidence of this neoplasm tends to increase with population aging, diabetes and obesity's augmentation. In rare cases, a hereditary factor has been described: Lynch's syndrome. The therapeutic management of the patient depends on the endometrial biopsy which specifies the histological type and the histo-prognostic grade as well as the MRI which allow the tumor staging. Within the last decade, improvement in technologies such as genomic, transcriptomic and histological analyses, allowed the establishment of new and finer classifications of endometrial carcinomas. The latest classification proposed by The Cancer Genomic Atlas (TCGA), has been made routinely applicable through the international consortium TransPORTEC. It consists of 4 groups listed from good to poor prognosis: (1) ultra-mutated "POLE"; (2) hyper-mutated "MSI"; (3) low copy number "NSMP" and (4) high number of copies "TP53 mutated" (serous-like). This integrated characterization combined with mutational data opens new opportunities for therapeutic strategies. Copyright © 2017 Société Française du Cancer. Published by Elsevier Masson SAS. All rights reserved.

  16. Application of local binary pattern and human visual Fibonacci texture features for classification different medical images

    NASA Astrophysics Data System (ADS)

    Sanghavi, Foram; Agaian, Sos

    2017-05-01

    The goal of this paper is to (a) test the nuclei based Computer Aided Cancer Detection system using Human Visual based system on the histopathology images and (b) Compare the results of the proposed system with the Local Binary Pattern and modified Fibonacci -p pattern systems. The system performance is evaluated using different parameters such as accuracy, specificity, sensitivity, positive predictive value, and negative predictive value on 251 prostate histopathology images. The accuracy of 96.69% was observed for cancer detection using the proposed human visual based system compared to 87.42% and 94.70% observed for Local Binary patterns and the modified Fibonacci p patterns.

  17. Leukemia and colon tumor detection based on microarray data classification using momentum backpropagation and genetic algorithm as a feature selection method

    NASA Astrophysics Data System (ADS)

    Wisesty, Untari N.; Warastri, Riris S.; Puspitasari, Shinta Y.

    2018-03-01

    Cancer is one of the major causes of mordibility and mortality problems in the worldwide. Therefore, the need of a system that can analyze and identify a person suffering from a cancer by using microarray data derived from the patient’s Deoxyribonucleic Acid (DNA). But on microarray data has thousands of attributes, thus making the challenges in data processing. This is often referred to as the curse of dimensionality. Therefore, in this study built a system capable of detecting a patient whether contracted cancer or not. The algorithm used is Genetic Algorithm as feature selection and Momentum Backpropagation Neural Network as a classification method, with data used from the Kent Ridge Bio-medical Dataset. Based on system testing that has been done, the system can detect Leukemia and Colon Tumor with best accuracy equal to 98.33% for colon tumor data and 100% for leukimia data. Genetic Algorithm as feature selection algorithm can improve system accuracy, which is from 64.52% to 98.33% for colon tumor data and 65.28% to 100% for leukemia data, and the use of momentum parameters can accelerate the convergence of the system in the training process of Neural Network.

  18. Validation of the prognostic gene portfolio, ClinicoMolecular Triad Classification, using an independent prospective breast cancer cohort and external patient populations.

    PubMed

    Wang, Dong-Yu; Done, Susan J; Mc Cready, David R; Leong, Wey L

    2014-07-04

    Using genome-wide expression profiles of a prospective training cohort of breast cancer patients, ClinicoMolecular Triad Classification (CMTC) was recently developed to classify breast cancers into three clinically relevant groups to aid treatment decisions. CMTC was found to be both prognostic and predictive in a large external breast cancer cohort in that study. This study serves to validate the reproducibility of CMTC and its prognostic value using independent patient cohorts. An independent internal cohort (n = 284) and a new external cohort (n = 2,181) were used to validate the association of CMTC between clinicopathological factors, 12 known gene signatures, two molecular subtype classifiers, and 19 oncogenic signalling pathway activities, and to reproduce the abilities of CMTC to predict clinical outcomes of breast cancer. In addition, we also updated the outcome data of the original training cohort (n = 147). The original training cohort reached a statistically significant difference (p < 0.05) in disease-free survivals between the three CMTC groups after an additional two years of follow-up (median = 55 months). The prognostic value of the triad classification was reproduced in the second independent internal cohort and the new external validation cohort. CMTC achieved even higher prognostic significance when all available patients were analyzed (n = 4,851). Oncogenic pathways Myc, E2F1, Ras and β-catenin were again implicated in the high-risk groups. Both prospective internal cohorts and the independent external cohorts reproduced the triad classification of CMTC and its prognostic significance. CMTC is an independent prognostic predictor, and it outperformed 12 other known prognostic gene signatures, molecular subtype classifications, and all other standard prognostic clinicopathological factors. Our results support further development of CMTC portfolio into a guide for personalized breast cancer treatments.

  19. Prediction of pathologic staging with magnetic resonance imaging after preoperative chemoradiotherapy in rectal cancer: pooled analysis of KROG 10-01 and 11-02.

    PubMed

    Lee, Jong Hoon; Jang, Hong Seok; Kim, Jun-Gi; Lee, Myung Ah; Kim, Dae Yong; Kim, Tae Hyun; Oh, Jae Hwan; Park, Sung Chan; Kim, Sun Young; Baek, Ji Yeon; Park, Hee Chul; Kim, Hee Cheol; Nam, Taek-Keun; Chie, Eui Kyu; Jung, Ji-Han; Oh, Seong Taek

    2014-10-01

    The reported overall accuracy of MRI in predicting the pathologic stage of nonirradiated rectal cancer is high. However, the role of MRI in restaging rectal tumors after neoadjuvant CRT is contentious. Thus, we evaluate the accuracy of restaging magnetic resonance imaging (MRI) for rectal cancer patients who receive preoperative chemoradiotherapy (CRT). We analyzed 150 patients with locally advanced rectal cancer (T3-4N0-2) who had received preoperative CRT. Pre-CRT MRI was performed for local tumor and nodal staging. All patients underwent restaging MRI followed by total mesorectal excision after the end of radiotherapy. The primary endpoint of the present study was to estimate the accuracy of post-CRT MRI as compared with pathologic staging. Pathologic T classification matched the post-CRT MRI findings in 97 (64.7%) of 150 patients. 36 (24.0%) of 150 patients were overstaged in T classification, and the concordance degree was moderate (k=0.33, p<0.01). Pathologic N classification matched the post-CRI MRI findings in 85 (56.6%) of 150 patients. 54 (36.0%) of 150 patients were overstaged in N classification. 26 patients achieved downstaging (ycT0-2N0) on restaging MRI after CRT. 23 (88.5%) of 26 patients who had been downstaged on MRI after CRT were confirmed on the pathological staging, and the concordance degree was good (k=0.72, p<0.01). Restaging MRI has low accuracy for the prediction of the pathologic T and N classifications in rectal cancer patients who received preoperative CRT. The diagnostic accuracy of restaging MRI is relatively high in rectal cancer patients who achieved clinical downstaging after CRT. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  20. Prognostic Performance and Reproducibility of the 1973 and 2004/2016 World Health Organization Grading Classification Systems in Non-muscle-invasive Bladder Cancer: A European Association of Urology Non-muscle Invasive Bladder Cancer Guidelines Panel Systematic Review.

    PubMed

    Soukup, Viktor; Čapoun, Otakar; Cohen, Daniel; Hernández, Virginia; Babjuk, Marek; Burger, Max; Compérat, Eva; Gontero, Paolo; Lam, Thomas; MacLennan, Steven; Mostafid, A Hugh; Palou, Joan; van Rhijn, Bas W G; Rouprêt, Morgan; Shariat, Shahrokh F; Sylvester, Richard; Yuan, Yuhong; Zigeuner, Richard

    2017-11-01

    Tumour grade is an important prognostic indicator in non-muscle-invasive bladder cancer (NMIBC). Histopathological classifications are limited by interobserver variability (reproducibility), which may have prognostic implications. European Association of Urology NMIBC guidelines suggest concurrent use of both 1973 and 2004/2016 World Health Organization (WHO) classifications. To compare the prognostic performance and reproducibility of the 1973 and 2004/2016 WHO grading systems for NMIBC. A systematic literature search was undertaken incorporating Medline, Embase, and the Cochrane Library. Studies were critically appraised for risk of bias (QUIPS). For prognosis, the primary outcome was progression to muscle-invasive or metastatic disease. Secondary outcomes were disease recurrence, and overall and cancer-specific survival. For reproducibility, the primary outcome was interobserver variability between pathologists. Secondary outcome was intraobserver variability (repeatability) by the same pathologist. Of 3593 articles identified, 20 were included in the prognostic review; three were eligible for the reproducibility review. Increasing tumour grade in both classifications was associated with higher disease progression and recurrence rates. Progression rates in grade 1 patients were similar to those in low-grade patients; progression rates in grade 3 patients were higher than those in high-grade patients. Survival data were limited. Reproducibility of the 2004/2016 system was marginally better than that of the 1973 system. Two studies on repeatability showed conflicting results. Most studies had a moderate to high risk of bias. Current grading classifications in NMIBC are suboptimal. The 1973 system identifies more aggressive tumours. Intra- and interobserver variability was slightly less in the 2004/2016 classification. We could not confirm that the 2004/2016 classification outperforms the 1973 classification in prediction of recurrence and progression. This article summarises the utility of two different grading systems for non-muscle-invasive bladder cancer. Both systems predict progression and recurrence, although pathologists vary in their reporting; suggestions for further improvements are made. Copyright © 2017 European Association of Urology. Published by Elsevier B.V. All rights reserved.

  1. Evaluation Methodology between Globalization and Localization Features Approaches for Skin Cancer Lesions Classification

    NASA Astrophysics Data System (ADS)

    Ahmed, H. M.; Al-azawi, R. J.; Abdulhameed, A. A.

    2018-05-01

    Huge efforts have been put in the developing of diagnostic methods to skin cancer disease. In this paper, two different approaches have been addressed for detection the skin cancer in dermoscopy images. The first approach uses a global method that uses global features for classifying skin lesions, whereas the second approach uses a local method that uses local features for classifying skin lesions. The aim of this paper is selecting the best approach for skin lesion classification. The dataset has been used in this paper consist of 200 dermoscopy images from Pedro Hispano Hospital (PH2). The achieved results are; sensitivity about 96%, specificity about 100%, precision about 100%, and accuracy about 97% for globalization approach while, sensitivity about 100%, specificity about 100%, precision about 100%, and accuracy about 100% for Localization Approach, these results showed that the localization approach achieved acceptable accuracy and better than globalization approach for skin cancer lesions classification.

  2. A hybrid cost-sensitive ensemble for imbalanced breast thermogram classification.

    PubMed

    Krawczyk, Bartosz; Schaefer, Gerald; Woźniak, Michał

    2015-11-01

    Early recognition of breast cancer, the most commonly diagnosed form of cancer in women, is of crucial importance, given that it leads to significantly improved chances of survival. Medical thermography, which uses an infrared camera for thermal imaging, has been demonstrated as a particularly useful technique for early diagnosis, because it detects smaller tumors than the standard modality of mammography. In this paper, we analyse breast thermograms by extracting features describing bilateral symmetries between the two breast areas, and present a classification system for decision making. Clearly, the costs associated with missing a cancer case are much higher than those for mislabelling a benign case. At the same time, datasets contain significantly fewer malignant cases than benign ones. Standard classification approaches fail to consider either of these aspects. In this paper, we introduce a hybrid cost-sensitive classifier ensemble to address this challenging problem. Our approach entails a pool of cost-sensitive decision trees which assign a higher misclassification cost to the malignant class, thereby boosting its recognition rate. A genetic algorithm is employed for simultaneous feature selection and classifier fusion. As an optimisation criterion, we use a combination of misclassification cost and diversity to achieve both a high sensitivity and a heterogeneous ensemble. Furthermore, we prune our ensemble by discarding classifiers that contribute minimally to the decision making. For a challenging dataset of about 150 thermograms, our approach achieves an excellent sensitivity of 83.10%, while maintaining a high specificity of 89.44%. This not only signifies improved recognition of malignant cases, it also statistically outperforms other state-of-the-art algorithms designed for imbalanced classification, and hence provides an effective approach for analysing breast thermograms. Our proposed hybrid cost-sensitive ensemble can facilitate a highly accurate early diagnostic of breast cancer based on thermogram features. It overcomes the difficulties posed by the imbalanced distribution of patients in the two analysed groups. Copyright © 2015 Elsevier B.V. All rights reserved.

  3. Molecular approaches for classifying endometrial carcinoma.

    PubMed

    Piulats, Josep M; Guerra, Esther; Gil-Martín, Marta; Roman-Canal, Berta; Gatius, Sonia; Sanz-Pamplona, Rebeca; Velasco, Ana; Vidal, August; Matias-Guiu, Xavier

    2017-04-01

    Endometrial carcinoma is the most common cancer of the female genital tract. This review article discusses the usefulness of molecular techniques to classify endometrial carcinoma. Any proposal for molecular classification of neoplasms should integrate morphological features of the tumors. For that reason, we start with the current histological classification of endometrial carcinoma, by discussing the correlation between genotype and phenotype, and the most significant recent improvements. Then, we comment on some of the possible flaws of this classification, by discussing also the value of molecular pathology in improving them, including interobserver variation in pathologic interpretation of high grade tumors. Third, we discuss the importance of applying TCGA molecular approach to clinical practice. We also comment on the impact of intratumor heterogeneity in classification, and finally, we will discuss briefly, the usefulness of TCGA classification in tailoring immunotherapy in endometrial cancer patients. We suggest combining pathologic classification and the surrogate TCGA molecular classification for high-grade endometrial carcinomas, as an option to improve assessment of prognosis. Copyright © 2016 Elsevier Inc. All rights reserved.

  4. Algorithm for lung cancer detection based on PET/CT images

    NASA Astrophysics Data System (ADS)

    Saita, Shinsuke; Ishimatsu, Keita; Kubo, Mitsuru; Kawata, Yoshiki; Niki, Noboru; Ohtsuka, Hideki; Nishitani, Hiromu; Ohmatsu, Hironobu; Eguchi, Kenji; Kaneko, Masahiro; Moriyama, Noriyuki

    2009-02-01

    The five year survival rate of the lung cancer is low with about twenty-five percent. In addition it is an obstinate lung cancer wherein three out of four people die within five years. Then, the early stage detection and treatment of the lung cancer are important. Recently, we can obtain CT and PET image at the same time because PET/CT device has been developed. PET/CT is possible for a highly accurate cancer diagnosis because it analyzes quantitative shape information from CT image and FDG distribution from PET image. However, neither benign-malignant classification nor staging intended for lung cancer have been established still enough by using PET/CT images. In this study, we detect lung nodules based on internal organs extracted from CT image, and we also develop algorithm which classifies benignmalignant and metastatic or non metastatic lung cancer using lung structure and FDG distribution(one and two hour after administering FDG). We apply the algorithm to 59 PET/CT images (malignant 43 cases [Ad:31, Sq:9, sm:3], benign 16 cases) and show the effectiveness of this algorithm.

  5. Analysis of microarray leukemia data using an efficient MapReduce-based K-nearest-neighbor classifier.

    PubMed

    Kumar, Mukesh; Rath, Nitish Kumar; Rath, Santanu Kumar

    2016-04-01

    Microarray-based gene expression profiling has emerged as an efficient technique for classification, prognosis, diagnosis, and treatment of cancer. Frequent changes in the behavior of this disease generates an enormous volume of data. Microarray data satisfies both the veracity and velocity properties of big data, as it keeps changing with time. Therefore, the analysis of microarray datasets in a small amount of time is essential. They often contain a large amount of expression, but only a fraction of it comprises genes that are significantly expressed. The precise identification of genes of interest that are responsible for causing cancer are imperative in microarray data analysis. Most existing schemes employ a two-phase process such as feature selection/extraction followed by classification. In this paper, various statistical methods (tests) based on MapReduce are proposed for selecting relevant features. After feature selection, a MapReduce-based K-nearest neighbor (mrKNN) classifier is also employed to classify microarray data. These algorithms are successfully implemented in a Hadoop framework. A comparative analysis is done on these MapReduce-based models using microarray datasets of various dimensions. From the obtained results, it is observed that these models consume much less execution time than conventional models in processing big data. Copyright © 2016 Elsevier Inc. All rights reserved.

  6. SU-E-J-107: Supervised Learning Model of Aligned Collagen for Human Breast Carcinoma Prognosis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bredfeldt, J; Liu, Y; Conklin, M

    Purpose: Our goal is to develop and apply a set of optical and computational tools to enable large-scale investigations of the interaction between collagen and tumor cells. Methods: We have built a novel imaging system for automating the capture of whole-slide second harmonic generation (SHG) images of collagen in registry with bright field (BF) images of hematoxylin and eosin stained tissue. To analyze our images, we have integrated a suite of supervised learning tools that semi-automatically model and score collagen interactions with tumor cells via a variety of metrics, a method we call Electronic Tumor Associated Collagen Signatures (eTACS). Thismore » group of tools first segments regions of epithelial cells and collagen fibers from BF and SHG images respectively. We then associate fibers with groups of epithelial cells and finally compute features based on the angle of interaction and density of the collagen surrounding the epithelial cell clusters. These features are then processed with a support vector machine to separate cancer patients into high and low risk groups. Results: We validated our model by showing that eTACS produces classifications that have statistically significant correlation with manual classifications. In addition, our system generated classification scores that accurately predicted breast cancer patient survival in a cohort of 196 patients. Feature rank analysis revealed that TACS positive fibers are more well aligned with each other, generally lower density, and terminate within or near groups of epithelial cells. Conclusion: We are working to apply our model to predict survival in larger cohorts of breast cancer patients with a diversity of breast cancer types, predict response to treatments such as COX2 inhibitors, and to study collagen architecture changes in other cancer types. In the future, our system may be used to provide metastatic potential information to cancer patients to augment existing clinical assays.« less

  7. Proposal of a new staging system for intrahepatic cholangiocarcinoma: Analysis of surgical patients from a nationwide survey of the Liver Cancer Study Group of Japan.

    PubMed

    Sakamoto, Yoshihiro; Kokudo, Norihiro; Matsuyama, Yutaka; Sakamoto, Michiie; Izumi, Namiki; Kadoya, Masumi; Kaneko, Shuichi; Ku, Yonson; Kudo, Masatoshi; Takayama, Tadatoshi; Nakashima, Osamu

    2016-01-01

    In the current American Joint Committee on Cancer/International Union Against Cancer staging system (seventh edition) for intrahepatic cholangiocarcinoma (ICC), tumor size was excluded, and periductal invasion was added as a new tumor classification-defining factor. The objective of the current report was to propose a new staging system for ICC that would be better for stratifying the survival of patients based on data from the nationwide Liver Cancer Study Group of Japan database. Of 756 patients who underwent surgical resection for ICC between 2000 and 2005, multivariate analyses of the clinicopathologic factors of 419 patients who had complete data sets were performed to elucidate relevant factors for inclusion in a new tumor classification and staging system. Overall survival data were best stratified using a cutoff value of 2 cm using a minimal P value approach to discriminate patient survival. The 5-year survival rate of 15 patients who had ICC measuring ≤ 2 cm in greatest dimension without lymph node metastasis or vascular invasion was 100%, and this cohort was defined as T1. Multivariate analysis of prognostic factors for 267 patients with lymph node-negative and metastasis-negative (N0M0) disease indicated that the number of tumors, the presence arterial invasion, and the presence major biliary invasion were independent and significant prognostic factors. The proposed new system, which included tumor number, tumor size, arterial invasion, and major biliary invasion for tumor classification, provided good stratification of overall patient survival according to disease stage. Macroscopic periductal invasion was associated with major biliary invasion and an inferior prognosis. The proposed new staging system, which includes a tumor cutoff size of 2 cm and major biliary invasion, may be useful for assigning patients to surgery. © 2015 The Authors. Cancer published by Wiley Periodicals, Inc. on behalf of American Cancer Society.

  8. The new Epstein gleason score classification significantly reduces upgrading in prostate cancer patients.

    PubMed

    De Nunzio, Cosimo; Pastore, Antonio Luigi; Lombardo, Riccardo; Simone, Giuseppe; Leonardo, Costantino; Mastroianni, Riccardo; Collura, Devis; Muto, Giovanni; Gallucci, Michele; Carbone, Antonio; Fuschi, Andrea; Dutto, Lorenzo; Witt, Joern Heinrich; De Dominicis, Carlo; Tubaro, Andrea

    2018-06-01

    To evaluate the differences between the old and the new Gleason score classification systems in upgrading and downgrading rates. Between 2012 and 2015, we identified 9703 patients treated with retropubic radical prostatectomy (RP) in four tertiary centers. Biopsy specimens as well as radical prostatectomy specimens were graded according to both 2005 Gleason and 2014 ISUP five-tier Gleason grading system (five-tier GG system). Upgrading and downgrading rates on radical prostatectomy were first recorded for both classifications and then compared. The accuracy of the biopsy for each histological classification was determined by using the kappa coefficient of agreement and by assessing sensitivity, specificity, positive and negative predictive value. The five-tier GG system presented a lower clinically significant upgrading rate (1895/9703: 19,5% vs 2332/9703:24.0%; p = .001) and a similar clinically significant downgrading rate (756/9703: 7,7% vs 779/9703: 8%; p = .267) when compared to the 2005 ISUP classification. When evaluating their accuracy, the new five-tier GG system presented a better specificity (91% vs 83%) and a better negative predictive value (78% vs 60%). The kappa-statistics measures of agreement between needle biopsy and radical prostatectomy specimens were poor and good respectively for the five-tier GG system and for the 2005 Gleason score (k = 0.360 ± 0.007 vs k = 0.426 ± 0.007). The new Epstein classification significantly reduces upgrading events. The implementation of this new classification could better define prostate cancer aggressiveness with important clinical implications, particularly in prostate cancer management. Copyright © 2018 Elsevier Ltd, BASO ~ The Association for Cancer Surgery, and the European Society of Surgical Oncology. All rights reserved.

  9. Foundation and methodologies in computer-aided diagnosis systems for breast cancer detection.

    PubMed

    Jalalian, Afsaneh; Mashohor, Syamsiah; Mahmud, Rozi; Karasfi, Babak; Saripan, M Iqbal B; Ramli, Abdul Rahman B

    2017-01-01

    Breast cancer is the most prevalent cancer that affects women all over the world. Early detection and treatment of breast cancer could decline the mortality rate. Some issues such as technical reasons, which related to imaging quality and human error, increase misdiagnosis of breast cancer by radiologists. Computer-aided detection systems (CADs) are developed to overcome these restrictions and have been studied in many imaging modalities for breast cancer detection in recent years. The CAD systems improve radiologists' performance in finding and discriminating between the normal and abnormal tissues. These procedures are performed only as a double reader but the absolute decisions are still made by the radiologist. In this study, the recent CAD systems for breast cancer detection on different modalities such as mammography, ultrasound, MRI, and biopsy histopathological images are introduced. The foundation of CAD systems generally consist of four stages: Pre-processing, Segmentation, Feature extraction, and Classification. The approaches which applied to design different stages of CAD system are summarised. Advantages and disadvantages of different segmentation, feature extraction and classification techniques are listed. In addition, the impact of imbalanced datasets in classification outcomes and appropriate methods to solve these issues are discussed. As well as, performance evaluation metrics for various stages of breast cancer detection CAD systems are reviewed.

  10. Foundation and methodologies in computer-aided diagnosis systems for breast cancer detection

    PubMed Central

    Jalalian, Afsaneh; Mashohor, Syamsiah; Mahmud, Rozi; Karasfi, Babak; Saripan, M. Iqbal B.; Ramli, Abdul Rahman B.

    2017-01-01

    Breast cancer is the most prevalent cancer that affects women all over the world. Early detection and treatment of breast cancer could decline the mortality rate. Some issues such as technical reasons, which related to imaging quality and human error, increase misdiagnosis of breast cancer by radiologists. Computer-aided detection systems (CADs) are developed to overcome these restrictions and have been studied in many imaging modalities for breast cancer detection in recent years. The CAD systems improve radiologists' performance in finding and discriminating between the normal and abnormal tissues. These procedures are performed only as a double reader but the absolute decisions are still made by the radiologist. In this study, the recent CAD systems for breast cancer detection on different modalities such as mammography, ultrasound, MRI, and biopsy histopathological images are introduced. The foundation of CAD systems generally consist of four stages: Pre-processing, Segmentation, Feature extraction, and Classification. The approaches which applied to design different stages of CAD system are summarised. Advantages and disadvantages of different segmentation, feature extraction and classification techniques are listed. In addition, the impact of imbalanced datasets in classification outcomes and appropriate methods to solve these issues are discussed. As well as, performance evaluation metrics for various stages of breast cancer detection CAD systems are reviewed. PMID:28435432

  11. 29 CFR 1990.112 - Classification of potential carcinogens.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 1990.112 Labor Regulations Relating to Labor (Continued) OCCUPATIONAL SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR (CONTINUED) IDENTIFICATION, CLASSIFICATION, AND REGULATION OF POTENTIAL OCCUPATIONAL CARCINOGENS The Osha Cancer Policy § 1990.112 Classification of potential carcinogens. The following criteria...

  12. 29 CFR 1990.112 - Classification of potential carcinogens.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 1990.112 Labor Regulations Relating to Labor (Continued) OCCUPATIONAL SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR (CONTINUED) IDENTIFICATION, CLASSIFICATION, AND REGULATION OF POTENTIAL OCCUPATIONAL CARCINOGENS The Osha Cancer Policy § 1990.112 Classification of potential carcinogens. The following criteria...

  13. Combined Raman and autofluorescence ex vivo diagnostics of skin cancer in near-infrared and visible regions

    NASA Astrophysics Data System (ADS)

    Bratchenko, Ivan A.; Artemyev, Dmitry N.; Myakinin, Oleg O.; Khristoforova, Yulia A.; Moryatov, Alexander A.; Kozlov, Sergey V.; Zakharov, Valery P.

    2017-02-01

    The differentiation of skin melanomas and basal cell carcinomas (BCCs) was demonstrated based on combined analysis of Raman and autofluorescence spectra stimulated by visible and NIR lasers. It was ex vivo tested on 39 melanomas and 40 BCCs. Six spectroscopic criteria utilizing information about alteration of melanin, porphyrins, flavins, lipids, and collagen content in tumor with a comparison to healthy skin were proposed. The measured correlation between the proposed criteria makes it possible to define weakly correlated criteria groups for discriminant analysis and principal components analysis application. It was shown that the accuracy of cancerous tissues classification reaches 97.3% for a combined 6-criteria multimodal algorithm, while the accuracy determined separately for each modality does not exceed 79%. The combined 6-D method is a rapid and reliable tool for malignant skin detection and classification.

  14. Failure of evidence-based cancer care in the United States: the association between rectal cancer treatment, cancer center volume, and geography.

    PubMed

    Monson, John R T; Probst, Christian P; Wexner, Steven D; Remzi, Feza H; Fleshman, James W; Garcia-Aguilar, Julio; Chang, George J; Dietz, David W

    2014-10-01

    This study examines recent adherence to recommended neoadjuvant chemoradiotherapy guidelines for patients with rectal cancer across geographic regions and institution volume and assesses trends over time. A recent report by the Institute of Medicine described US cancer care as chaotic. Cited deficiencies included wide variation in adherence to evidence-based guidelines even where clear consensus exists. Patients operated on for clinical stage II and III rectal cancer were selected from the 2006-2011 National Cancer Data Base. Multivariable logistic regressions were used to assess variation in chemotherapy and radiation use by cancer center type, geographical location, and hospital volume. The analysis controlled for patient age at diagnosis, sex, race/ethnicity, primary payer, average household income, average education, urban/rural classification of patient residence, comorbidity, and oncologic stage. There were 30,994 patients who met the inclusion criteria. Use of neoadjuvant radiation therapy and chemotherapy varied significantly by type of cancer center. The highest rates of adherence were observed in high-volume centers compared with low-volume centers (78% vs 69%; adjusted odds ratio = 1.46; P < 0.001). This variation is mirrored by hospital geographic location. Primary payer and year of diagnosis were not predictive of rates of neoadjuvant chemoradiotherapy. Adherence to evidence-based treatment guidelines in rectal cancer is suboptimal in the United States, with significant differences based on hospital volume and geographic regions. Little improvement has occurred in the last 5 years. These results support the implementation of standardized care pathways and a Centers of Excellence program for US patients with rectal cancer.

  15. Gene selection in cancer classification using sparse logistic regression with Bayesian regularization.

    PubMed

    Cawley, Gavin C; Talbot, Nicola L C

    2006-10-01

    Gene selection algorithms for cancer classification, based on the expression of a small number of biomarker genes, have been the subject of considerable research in recent years. Shevade and Keerthi propose a gene selection algorithm based on sparse logistic regression (SLogReg) incorporating a Laplace prior to promote sparsity in the model parameters, and provide a simple but efficient training procedure. The degree of sparsity obtained is determined by the value of a regularization parameter, which must be carefully tuned in order to optimize performance. This normally involves a model selection stage, based on a computationally intensive search for the minimizer of the cross-validation error. In this paper, we demonstrate that a simple Bayesian approach can be taken to eliminate this regularization parameter entirely, by integrating it out analytically using an uninformative Jeffrey's prior. The improved algorithm (BLogReg) is then typically two or three orders of magnitude faster than the original algorithm, as there is no longer a need for a model selection step. The BLogReg algorithm is also free from selection bias in performance estimation, a common pitfall in the application of machine learning algorithms in cancer classification. The SLogReg, BLogReg and Relevance Vector Machine (RVM) gene selection algorithms are evaluated over the well-studied colon cancer and leukaemia benchmark datasets. The leave-one-out estimates of the probability of test error and cross-entropy of the BLogReg and SLogReg algorithms are very similar, however the BlogReg algorithm is found to be considerably faster than the original SLogReg algorithm. Using nested cross-validation to avoid selection bias, performance estimation for SLogReg on the leukaemia dataset takes almost 48 h, whereas the corresponding result for BLogReg is obtained in only 1 min 24 s, making BLogReg by far the more practical algorithm. BLogReg also demonstrates better estimates of conditional probability than the RVM, which are of great importance in medical applications, with similar computational expense. A MATLAB implementation of the sparse logistic regression algorithm with Bayesian regularization (BLogReg) is available from http://theoval.cmp.uea.ac.uk/~gcc/cbl/blogreg/

  16. A fuzzy-based data transformation for feature extraction to increase classification performance with small medical data sets.

    PubMed

    Li, Der-Chiang; Liu, Chiao-Wen; Hu, Susan C

    2011-05-01

    Medical data sets are usually small and have very high dimensionality. Too many attributes will make the analysis less efficient and will not necessarily increase accuracy, while too few data will decrease the modeling stability. Consequently, the main objective of this study is to extract the optimal subset of features to increase analytical performance when the data set is small. This paper proposes a fuzzy-based non-linear transformation method to extend classification related information from the original data attribute values for a small data set. Based on the new transformed data set, this study applies principal component analysis (PCA) to extract the optimal subset of features. Finally, we use the transformed data with these optimal features as the input data for a learning tool, a support vector machine (SVM). Six medical data sets: Pima Indians' diabetes, Wisconsin diagnostic breast cancer, Parkinson disease, echocardiogram, BUPA liver disorders dataset, and bladder cancer cases in Taiwan, are employed to illustrate the approach presented in this paper. This research uses the t-test to evaluate the classification accuracy for a single data set; and uses the Friedman test to show the proposed method is better than other methods over the multiple data sets. The experiment results indicate that the proposed method has better classification performance than either PCA or kernel principal component analysis (KPCA) when the data set is small, and suggest creating new purpose-related information to improve the analysis performance. This paper has shown that feature extraction is important as a function of feature selection for efficient data analysis. When the data set is small, using the fuzzy-based transformation method presented in this work to increase the information available produces better results than the PCA and KPCA approaches. Copyright © 2011 Elsevier B.V. All rights reserved.

  17. Mutation-profile-based methods for understanding selection forces in cancer somatic mutations: a comparative analysis.

    PubMed

    Zhou, Zhan; Zou, Yangyun; Liu, Gangbiao; Zhou, Jingqi; Wu, Jingcheng; Zhao, Shimin; Su, Zhixi; Gu, Xun

    2017-08-29

    Human genes exhibit different effects on fitness in cancer and normal cells. Here, we present an evolutionary approach to measure the selection pressure on human genes, using the well-known ratio of the nonsynonymous to synonymous substitution rate in both cancer genomes ( C N / C S ) and normal populations ( p N / p S ). A new mutation-profile-based method that adopts sample-specific mutation rate profiles instead of conventional substitution models was developed. We found that cancer-specific selection pressure is quite different from the selection pressure at the species and population levels. Both the relaxation of purifying selection on passenger mutations and the positive selection of driver mutations may contribute to the increased C N / C S values of human genes in cancer genomes compared with the p N / p S values in human populations. The C N / C S values also contribute to the improved classification of cancer genes and a better understanding of the onco-functionalization of cancer genes during oncogenesis. The use of our computational pipeline to identify cancer-specific positively and negatively selected genes may provide useful information for understanding the evolution of cancers and identifying possible targets for therapeutic intervention.

  18. Assay based on electrical impedance spectroscopy to discriminate between normal and cancerous mammalian cells

    NASA Astrophysics Data System (ADS)

    Giana, Fabián Eduardo; Bonetto, Fabián José; Bellotti, Mariela Inés

    2018-03-01

    In this work we present an assay to discriminate between normal and cancerous cells. The method is based on the measurement of electrical impedance spectra of in vitro cell cultures. We developed a protocol consisting on four consecutive measurement phases, each of them designed to obtain different information about the cell cultures. Through the analysis of the measured data, 26 characteristic features were obtained for both cell types. From the complete set of features, we selected the most relevant in terms of their discriminant capacity by means of conventional statistical tests. A linear discriminant analysis was then carried out on the selected features, allowing the classification of the samples in normal or cancerous with 4.5% of false positives and no false negatives.

  19. Assay based on electrical impedance spectroscopy to discriminate between normal and cancerous mammalian cells.

    PubMed

    Giana, Fabián Eduardo; Bonetto, Fabián José; Bellotti, Mariela Inés

    2018-03-01

    In this work we present an assay to discriminate between normal and cancerous cells. The method is based on the measurement of electrical impedance spectra of in vitro cell cultures. We developed a protocol consisting on four consecutive measurement phases, each of them designed to obtain different information about the cell cultures. Through the analysis of the measured data, 26 characteristic features were obtained for both cell types. From the complete set of features, we selected the most relevant in terms of their discriminant capacity by means of conventional statistical tests. A linear discriminant analysis was then carried out on the selected features, allowing the classification of the samples in normal or cancerous with 4.5% of false positives and no false negatives.

  20. Compensatory neurofuzzy model for discrete data classification in biomedical

    NASA Astrophysics Data System (ADS)

    Ceylan, Rahime

    2015-03-01

    Biomedical data is separated to two main sections: signals and discrete data. So, studies in this area are about biomedical signal classification or biomedical discrete data classification. There are artificial intelligence models which are relevant to classification of ECG, EMG or EEG signals. In same way, in literature, many models exist for classification of discrete data taken as value of samples which can be results of blood analysis or biopsy in medical process. Each algorithm could not achieve high accuracy rate on classification of signal and discrete data. In this study, compensatory neurofuzzy network model is presented for classification of discrete data in biomedical pattern recognition area. The compensatory neurofuzzy network has a hybrid and binary classifier. In this system, the parameters of fuzzy systems are updated by backpropagation algorithm. The realized classifier model is conducted to two benchmark datasets (Wisconsin Breast Cancer dataset and Pima Indian Diabetes dataset). Experimental studies show that compensatory neurofuzzy network model achieved 96.11% accuracy rate in classification of breast cancer dataset and 69.08% accuracy rate was obtained in experiments made on diabetes dataset with only 10 iterations.

  1. Study Points to Genetic Subtypes of Esophageal Cancer

    Cancer.gov

    A Cancer Currents blog post about a study by The Cancer Genome Atlas Research Network that identified distinct genetic and molecular changes in esophageal cancers that could improve their classification and identify potential new treatments.

  2. Chaotic particle swarm optimization with mutation for classification.

    PubMed

    Assarzadeh, Zahra; Naghsh-Nilchi, Ahmad Reza

    2015-01-01

    In this paper, a chaotic particle swarm optimization with mutation-based classifier particle swarm optimization is proposed to classify patterns of different classes in the feature space. The introduced mutation operators and chaotic sequences allows us to overcome the problem of early convergence into a local minima associated with particle swarm optimization algorithms. That is, the mutation operator sharpens the convergence and it tunes the best possible solution. Furthermore, to remove the irrelevant data and reduce the dimensionality of medical datasets, a feature selection approach using binary version of the proposed particle swarm optimization is introduced. In order to demonstrate the effectiveness of our proposed classifier, mutation-based classifier particle swarm optimization, it is checked out with three sets of data classifications namely, Wisconsin diagnostic breast cancer, Wisconsin breast cancer and heart-statlog, with different feature vector dimensions. The proposed algorithm is compared with different classifier algorithms including k-nearest neighbor, as a conventional classifier, particle swarm-classifier, genetic algorithm, and Imperialist competitive algorithm-classifier, as more sophisticated ones. The performance of each classifier was evaluated by calculating the accuracy, sensitivity, specificity and Matthews's correlation coefficient. The experimental results show that the mutation-based classifier particle swarm optimization unequivocally performs better than all the compared algorithms.

  3. Contextual convolutional neural networks for lung nodule classification using Gaussian-weighted average image patches

    NASA Astrophysics Data System (ADS)

    Lee, Haeil; Lee, Hansang; Park, Minseok; Kim, Junmo

    2017-03-01

    Lung cancer is the most common cause of cancer-related death. To diagnose lung cancers in early stages, numerous studies and approaches have been developed for cancer screening with computed tomography (CT) imaging. In recent years, convolutional neural networks (CNN) have become one of the most common and reliable techniques in computer aided detection (CADe) and diagnosis (CADx) by achieving state-of-the-art-level performances for various tasks. In this study, we propose a CNN classification system for false positive reduction of initially detected lung nodule candidates. First, image patches of lung nodule candidates are extracted from CT scans to train a CNN classifier. To reflect the volumetric contextual information of lung nodules to 2D image patch, we propose a weighted average image patch (WAIP) generation by averaging multiple slice images of lung nodule candidates. Moreover, to emphasize central slices of lung nodules, slice images are locally weighted according to Gaussian distribution and averaged to generate the 2D WAIP. With these extracted patches, 2D CNN is trained to achieve the classification of WAIPs of lung nodule candidates into positive and negative labels. We used LUNA 2016 public challenge database to validate the performance of our approach for false positive reduction in lung CT nodule classification. Experiments show our approach improves the classification accuracy of lung nodules compared to the baseline 2D CNN with patches from single slice image.

  4. [New molecular classification of colorectal cancer, pancreatic cancer and stomach cancer: Towards "à la carte" treatment?].

    PubMed

    Dreyer, Chantal; Afchain, Pauline; Trouilloud, Isabelle; André, Thierry

    2016-01-01

    This review reports 3 of recently published molecular classifications of the 3 main gastro-intestinal cancers: gastric, pancreatic and colorectal adenocarcinoma. In colorectal adenocarcinoma, 6 independent classifications were combined to finally hold 4 molecular sub-groups, Consensus Molecular Subtypes (CMS 1-4), linked to various clinical, molecular and survival data. CMS1 (14% MSI with immune activation); CMS2 (37%: canonical with epithelial differentiation and activation of the WNT/MYC pathway); CMS3 (13% metabolic with epithelial differentiation and RAS mutation); CMS4 (23%: mesenchymal with activation of TGFβ pathway and angiogenesis with stromal invasion). In gastric adenocarcinoma, 4 groups were established: subtype "EBV" (9%, high frequency of PIK3CA mutations, hypermetylation and amplification of JAK2, PD-L1 and PD-L2), subtype "MSI" (22%, high rate of mutation), subtype "genomically stable tumor" (20%, diffuse histology type and mutations of RAS and genes encoding integrins and adhesion proteins including CDH1) and subtype "tumors with chromosomal instability" (50%, intestinal type, aneuploidy and receptor tyrosine kinase amplification). In pancreatic adenocarcinomas, a classification in four sub-groups has been proposed, stable subtype (20%, aneuploidy), locally rearranged subtype (30%, focal event on one or two chromosoms), scattered subtype (36%,<200 structural variation events), and unstable subtype (14%,>200 structural variation events, defects in DNA maintenance). Although currently away from the care of patients, these classifications open the way to "à la carte" treatment depending on molecular biology. Copyright © 2016 Société Française du Cancer. Published by Elsevier Masson SAS. All rights reserved.

  5. Computer-aided Prognosis of Neuroblastoma on Whole-slide Images: Classification of Stromal Development

    PubMed Central

    Sertel, O.; Kong, J.; Shimada, H.; Catalyurek, U.V.; Saltz, J.H.; Gurcan, M.N.

    2009-01-01

    We are developing a computer-aided prognosis system for neuroblastoma (NB), a cancer of the nervous system and one of the most malignant tumors affecting children. Histopathological examination is an important stage for further treatment planning in routine clinical diagnosis of NB. According to the International Neuroblastoma Pathology Classification (the Shimada system), NB patients are classified into favorable and unfavorable histology based on the tissue morphology. In this study, we propose an image analysis system that operates on digitized H&E stained whole-slide NB tissue samples and classifies each slide as either stroma-rich or stroma-poor based on the degree of Schwannian stromal development. Our statistical framework performs the classification based on texture features extracted using co-occurrence statistics and local binary patterns. Due to the high resolution of digitized whole-slide images, we propose a multi-resolution approach that mimics the evaluation of a pathologist such that the image analysis starts from the lowest resolution and switches to higher resolutions when necessary. We employ an offine feature selection step, which determines the most discriminative features at each resolution level during the training step. A modified k-nearest neighbor classifier is used to determine the confidence level of the classification to make the decision at a particular resolution level. The proposed approach was independently tested on 43 whole-slide samples and provided an overall classification accuracy of 88.4%. PMID:20161324

  6. Application of Fuzzy c-Means and Joint-Feature-Clustering to Detect Redundancies of Image-Features in Drug Combinations Studies of Breast Cancer

    NASA Astrophysics Data System (ADS)

    Brandl, Miriam B.; Beck, Dominik; Pham, Tuan D.

    2011-06-01

    The high dimensionality of image-based dataset can be a drawback for classification accuracy. In this study, we propose the application of fuzzy c-means clustering, cluster validity indices and the notation of a joint-feature-clustering matrix to find redundancies of image-features. The introduced matrix indicates how frequently features are grouped in a mutual cluster. The resulting information can be used to find data-derived feature prototypes with a common biological meaning, reduce data storage as well as computation times and improve the classification accuracy.

  7. Extracting genetic alteration information for personalized cancer therapy from ClinicalTrials.gov

    PubMed Central

    Xu, Jun; Lee, Hee-Jin; Zeng, Jia; Wu, Yonghui; Zhang, Yaoyun; Huang, Liang-Chin; Johnson, Amber; Holla, Vijaykumar; Bailey, Ann M; Cohen, Trevor; Meric-Bernstam, Funda; Bernstam, Elmer V

    2016-01-01

    Objective: Clinical trials investigating drugs that target specific genetic alterations in tumors are important for promoting personalized cancer therapy. The goal of this project is to create a knowledge base of cancer treatment trials with annotations about genetic alterations from ClinicalTrials.gov. Methods: We developed a semi-automatic framework that combines advanced text-processing techniques with manual review to curate genetic alteration information in cancer trials. The framework consists of a document classification system to identify cancer treatment trials from ClinicalTrials.gov and an information extraction system to extract gene and alteration pairs from the Title and Eligibility Criteria sections of clinical trials. By applying the framework to trials at ClinicalTrials.gov, we created a knowledge base of cancer treatment trials with genetic alteration annotations. We then evaluated each component of the framework against manually reviewed sets of clinical trials and generated descriptive statistics of the knowledge base. Results and Discussion: The automated cancer treatment trial identification system achieved a high precision of 0.9944. Together with the manual review process, it identified 20 193 cancer treatment trials from ClinicalTrials.gov. The automated gene-alteration extraction system achieved a precision of 0.8300 and a recall of 0.6803. After validation by manual review, we generated a knowledge base of 2024 cancer trials that are labeled with specific genetic alteration information. Analysis of the knowledge base revealed the trend of increased use of targeted therapy for cancer, as well as top frequent gene-alteration pairs of interest. We expect this knowledge base to be a valuable resource for physicians and patients who are seeking information about personalized cancer therapy. PMID:27013523

  8. Extracting genetic alteration information for personalized cancer therapy from ClinicalTrials.gov.

    PubMed

    Xu, Jun; Lee, Hee-Jin; Zeng, Jia; Wu, Yonghui; Zhang, Yaoyun; Huang, Liang-Chin; Johnson, Amber; Holla, Vijaykumar; Bailey, Ann M; Cohen, Trevor; Meric-Bernstam, Funda; Bernstam, Elmer V; Xu, Hua

    2016-07-01

    Clinical trials investigating drugs that target specific genetic alterations in tumors are important for promoting personalized cancer therapy. The goal of this project is to create a knowledge base of cancer treatment trials with annotations about genetic alterations from ClinicalTrials.gov. We developed a semi-automatic framework that combines advanced text-processing techniques with manual review to curate genetic alteration information in cancer trials. The framework consists of a document classification system to identify cancer treatment trials from ClinicalTrials.gov and an information extraction system to extract gene and alteration pairs from the Title and Eligibility Criteria sections of clinical trials. By applying the framework to trials at ClinicalTrials.gov, we created a knowledge base of cancer treatment trials with genetic alteration annotations. We then evaluated each component of the framework against manually reviewed sets of clinical trials and generated descriptive statistics of the knowledge base. The automated cancer treatment trial identification system achieved a high precision of 0.9944. Together with the manual review process, it identified 20 193 cancer treatment trials from ClinicalTrials.gov. The automated gene-alteration extraction system achieved a precision of 0.8300 and a recall of 0.6803. After validation by manual review, we generated a knowledge base of 2024 cancer trials that are labeled with specific genetic alteration information. Analysis of the knowledge base revealed the trend of increased use of targeted therapy for cancer, as well as top frequent gene-alteration pairs of interest. We expect this knowledge base to be a valuable resource for physicians and patients who are seeking information about personalized cancer therapy. © The Author 2016. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  9. An enhancement of binary particle swarm optimization for gene selection in classifying cancer classes

    PubMed Central

    2013-01-01

    Background Gene expression data could likely be a momentous help in the progress of proficient cancer diagnoses and classification platforms. Lately, many researchers analyze gene expression data using diverse computational intelligence methods, for selecting a small subset of informative genes from the data for cancer classification. Many computational methods face difficulties in selecting small subsets due to the small number of samples compared to the huge number of genes (high-dimension), irrelevant genes, and noisy genes. Methods We propose an enhanced binary particle swarm optimization to perform the selection of small subsets of informative genes which is significant for cancer classification. Particle speed, rule, and modified sigmoid function are introduced in this proposed method to increase the probability of the bits in a particle’s position to be zero. The method was empirically applied to a suite of ten well-known benchmark gene expression data sets. Results The performance of the proposed method proved to be superior to other previous related works, including the conventional version of binary particle swarm optimization (BPSO) in terms of classification accuracy and the number of selected genes. The proposed method also requires lower computational time compared to BPSO. PMID:23617960

  10. Support vector machine and principal component analysis for microarray data classification

    NASA Astrophysics Data System (ADS)

    Astuti, Widi; Adiwijaya

    2018-03-01

    Cancer is a leading cause of death worldwide although a significant proportion of it can be cured if it is detected early. In recent decades, technology called microarray takes an important role in the diagnosis of cancer. By using data mining technique, microarray data classification can be performed to improve the accuracy of cancer diagnosis compared to traditional techniques. The characteristic of microarray data is small sample but it has huge dimension. Since that, there is a challenge for researcher to provide solutions for microarray data classification with high performance in both accuracy and running time. This research proposed the usage of Principal Component Analysis (PCA) as a dimension reduction method along with Support Vector Method (SVM) optimized by kernel functions as a classifier for microarray data classification. The proposed scheme was applied on seven data sets using 5-fold cross validation and then evaluation and analysis conducted on term of both accuracy and running time. The result showed that the scheme can obtained 100% accuracy for Ovarian and Lung Cancer data when Linear and Cubic kernel functions are used. In term of running time, PCA greatly reduced the running time for every data sets.

  11. A proposal for the annotation of recurrent colorectal cancer: the 'Sheffield classification'.

    PubMed

    Majeed, A W; Shorthouse, A J; Blakeborough, A; Bird, N C

    2011-11-01

    Current classification systems of large bowel cancer only refer to metastatic disease as M0, M1 or Mx. Recurrent colorectal cancer primarily occurs in the liver, lungs, nodes or peritoneum. The management of each of these sites of recurrence has made significant advances and each is a subspecialty in its own right. The aim of this paper was to devise a classification system which accurately describes the site and extent of metastatic spread. An amendment of the current system is proposed in which liver, lung and peritoneal metastases are annotated by 'Liv 0,1', 'Pul 0,1' and 'Per 0,1' in describing the primary presentation. These are then subclassified, taking into account the chronology, size, number and geographical distribution of metastatic disease or logoregional recurrence and its K-Ras status. This discussion document proposes a classification system which is logical and simple to use. We plan to validate it prospectively. © 2011 The Authors. Colorectal Disease © 2011 The Association of Coloproctology of Great Britain and Ireland.

  12. The IASLC Lung Cancer Staging Project: Background Data and Proposals for the Classification of Lung Cancer with Separate Tumor Nodules in the Forthcoming Eighth Edition of the TNM Classification for Lung Cancer.

    PubMed

    Detterbeck, Frank C; Bolejack, Vanessa; Arenberg, Douglas A; Crowley, John; Donington, Jessica S; Franklin, Wilbur A; Girard, Nicolas; Marom, Edith M; Mazzone, Peter J; Nicholson, Andrew G; Rusch, Valerie W; Tanoue, Lynn T; Travis, William D; Asamura, Hisao; Rami-Porta, Ramón

    2016-05-01

    Separate tumor nodules with the same histologic appearance occur in the lungs in a small proportion of patients with primary lung cancer. This article addresses how such tumors can be classified to inform the eighth edition of the anatomic classification of lung cancer. Separate tumor nodules should be distinguished from second primary lung cancer, multifocal ground glass/lepidic tumors, and pneumonic-type lung cancer, which are addressed in separate analyses. Survival of patients with separate tumor nodules in the International Association for the Study of Lung Cancer database were analyzed. This was compared with a systematic literature review. Survival of clinically staged patients decreased according to the location of the separate tumor nodule relative to the index tumor (same lobe > same side > other side) in N0 and N-any cohorts (all M0 except possible other-side nodules). However, there was also a decrease in the proportion of patients resected; among only surgically resected or among nonresected patients no survival differences were noted. There were no survival differences between patients with same-lobe nodules and those with other T3 tumors, between patients with same-side nodules and those with T4 tumors, and patients with other-side nodules and those with other M1a tumors. The data correlated with those identified in a literature review. Tumors with same-lobe separate tumor nodules (with the same histologic appearance) are recommended to be classified as T3, same-side nodules as T4, and other-side nodules as M1a. Thus, there is no recommended change between the seventh and eighth edition of the TNM classification of lung cancer. Copyright © 2016 International Association for the Study of Lung Cancer. Published by Elsevier Inc. All rights reserved.

  13. Evaluation of the 7(th) edition of the UICC-AJCC tumor, node, metastasis classification for esophageal cancer in a Chinese cohort.

    PubMed

    Huang, Yan; Guo, Weigang; Shi, Shiming; He, Jian

    2016-07-01

    To assess and evaluate the prognostic value of the 7(th) edition of the Union for International Cancer Control-American Joint Committee on Cancer (UICC-AJCC) tumor, node, metastasis (TNM) staging system for Chinese patients with esophageal cancer in comparison with the 6(th) edition. A retrospective review was performed on 766 consecutive esophageal cancer patients treated with esophagectomy between 2008 and 2012. Patients were staged according to the 6(th) and 7(th) editions for esophageal cancer respectively. Survival was calculated by the Kaplan-Meier method, and multivariate analysis was performed using Cox regression model. Overall 3-year survival rate was 59.5%. There were significant differences in 3-year survival rates among T stages both according to the 6(th) edition and the 7(th) edition (P<0.001). According to the 7(th) edition, the 3-year survival rates of N0 (75.4%), N1 (65.2%), N2 (39.7%) and N3 (27.3%) patients were significant differences (P<0.001). Kaplan-Meier curve revealed a good discriminatory ability from stage I to IV, except for stage IB, IIA and IIB in the 7(th) edition staging system. Based on the 7(th) edition, the degree of differentiation, tumor length and tumor location were not independent prognostic factors on multivariate analysis. The multivariate analyses suggested that pT-, pN-, pTNM-category were all the independent prognostic factors based on the 6(th) and 7(th) edition staging system. The 7(th) edition of AJCC TNM staging system of esophageal cancer should discriminate pT2-3N0M0 (stage IB, IIA and IIB) better when considering the esophageal squamous cell cancer patients. Therefore, to improve and optimize the AJCC TNM classification for Chinese patients with esophageal cancer, more considerations about the value of tumor grade and tumor location in pT2-3N0M0 esophageal squamous cell cancer should be taken in the next new TNM staging system.

  14. mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling

    PubMed Central

    Alshamlan, Hala; Badr, Ghada; Alohali, Yousef

    2015-01-01

    An artificial bee colony (ABC) is a relatively recent swarm intelligence optimization approach. In this paper, we propose the first attempt at applying ABC algorithm in analyzing a microarray gene expression profile. In addition, we propose an innovative feature selection algorithm, minimum redundancy maximum relevance (mRMR), and combine it with an ABC algorithm, mRMR-ABC, to select informative genes from microarray profile. The new approach is based on a support vector machine (SVM) algorithm to measure the classification accuracy for selected genes. We evaluate the performance of the proposed mRMR-ABC algorithm by conducting extensive experiments on six binary and multiclass gene expression microarray datasets. Furthermore, we compare our proposed mRMR-ABC algorithm with previously known techniques. We reimplemented two of these techniques for the sake of a fair comparison using the same parameters. These two techniques are mRMR when combined with a genetic algorithm (mRMR-GA) and mRMR when combined with a particle swarm optimization algorithm (mRMR-PSO). The experimental results prove that the proposed mRMR-ABC algorithm achieves accurate classification performance using small number of predictive genes when tested using both datasets and compared to previously suggested methods. This shows that mRMR-ABC is a promising approach for solving gene selection and cancer classification problems. PMID:25961028

  15. mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling.

    PubMed

    Alshamlan, Hala; Badr, Ghada; Alohali, Yousef

    2015-01-01

    An artificial bee colony (ABC) is a relatively recent swarm intelligence optimization approach. In this paper, we propose the first attempt at applying ABC algorithm in analyzing a microarray gene expression profile. In addition, we propose an innovative feature selection algorithm, minimum redundancy maximum relevance (mRMR), and combine it with an ABC algorithm, mRMR-ABC, to select informative genes from microarray profile. The new approach is based on a support vector machine (SVM) algorithm to measure the classification accuracy for selected genes. We evaluate the performance of the proposed mRMR-ABC algorithm by conducting extensive experiments on six binary and multiclass gene expression microarray datasets. Furthermore, we compare our proposed mRMR-ABC algorithm with previously known techniques. We reimplemented two of these techniques for the sake of a fair comparison using the same parameters. These two techniques are mRMR when combined with a genetic algorithm (mRMR-GA) and mRMR when combined with a particle swarm optimization algorithm (mRMR-PSO). The experimental results prove that the proposed mRMR-ABC algorithm achieves accurate classification performance using small number of predictive genes when tested using both datasets and compared to previously suggested methods. This shows that mRMR-ABC is a promising approach for solving gene selection and cancer classification problems.

  16. Application of biomarkers in cancer risk management: evaluation from stochastic clonal evolutionary and dynamic system optimization points of view.

    PubMed

    Li, Xiaohong; Blount, Patricia L; Vaughan, Thomas L; Reid, Brian J

    2011-02-01

    Aside from primary prevention, early detection remains the most effective way to decrease mortality associated with the majority of solid cancers. Previous cancer screening models are largely based on classification of at-risk populations into three conceptually defined groups (normal, cancer without symptoms, and cancer with symptoms). Unfortunately, this approach has achieved limited successes in reducing cancer mortality. With advances in molecular biology and genomic technologies, many candidate somatic genetic and epigenetic "biomarkers" have been identified as potential predictors of cancer risk. However, none have yet been validated as robust predictors of progression to cancer or shown to reduce cancer mortality. In this Perspective, we first define the necessary and sufficient conditions for precise prediction of future cancer development and early cancer detection within a simple physical model framework. We then evaluate cancer risk prediction and early detection from a dynamic clonal evolution point of view, examining the implications of dynamic clonal evolution of biomarkers and the application of clonal evolution for cancer risk management in clinical practice. Finally, we propose a framework to guide future collaborative research between mathematical modelers and biomarker researchers to design studies to investigate and model dynamic clonal evolution. This approach will allow optimization of available resources for cancer control and intervention timing based on molecular biomarkers in predicting cancer among various risk subsets that dynamically evolve over time.

  17. Gene selection for tumor classification using neighborhood rough sets and entropy measures.

    PubMed

    Chen, Yumin; Zhang, Zunjun; Zheng, Jianzhong; Ma, Ying; Xue, Yu

    2017-03-01

    With the development of bioinformatics, tumor classification from gene expression data becomes an important useful technology for cancer diagnosis. Since a gene expression data often contains thousands of genes and a small number of samples, gene selection from gene expression data becomes a key step for tumor classification. Attribute reduction of rough sets has been successfully applied to gene selection field, as it has the characters of data driving and requiring no additional information. However, traditional rough set method deals with discrete data only. As for the gene expression data containing real-value or noisy data, they are usually employed by a discrete preprocessing, which may result in poor classification accuracy. In this paper, we propose a novel gene selection method based on the neighborhood rough set model, which has the ability of dealing with real-value data whilst maintaining the original gene classification information. Moreover, this paper addresses an entropy measure under the frame of neighborhood rough sets for tackling the uncertainty and noisy of gene expression data. The utilization of this measure can bring about a discovery of compact gene subsets. Finally, a gene selection algorithm is designed based on neighborhood granules and the entropy measure. Some experiments on two gene expression data show that the proposed gene selection is an effective method for improving the accuracy of tumor classification. Copyright © 2017 Elsevier Inc. All rights reserved.

  18. Review of mass spectrometry-based metabolomics in cancer research

    PubMed Central

    Liesenfeld, David B.; Habermann, Nina; Owen, Robert W.; Scalbert, Augustin; Ulrich, Cornelia M.

    2014-01-01

    Metabolomics, the systematic investigation of all metabolites present within a biological system, is used in biomarker development for many human diseases, including cancer. In this review we investigate the current role of mass spectrometry-based metabolomics in cancer research. A literature review was carried out within the databases PubMed, Embase and Web of Knowledge. We included 106 studies reporting on 21 different types of cancer in 7 different sample types. Metabolomics in cancer research is most often used for case-control comparisons. Secondary applications include translational areas, such as patient prognosis, therapy control and tumor classification or grading. Metabolomics is at a developmental stage with respect to epidemiology, with the majority of studies including <100 patients. Standardization is required especially concerning sample preparation and data analysis. In a second part of this review, we reconstructed a metabolic network of cancer patients by quantitatively extracting all reports of altered metabolites: Alterations in energy metabolism, membrane and fatty acid synthesis emerged, with tryptophan levels changed most frequently in various cancers. Metabolomics has the potential to evolve into a standard tool for future applications in epidemiology and translational cancer research, but further, large-scale studies including prospective validation are needed. PMID:24096148

  19. Comparison of staging diagnosis by two magnifying endoscopy classification for superficial oesophageal cancer.

    PubMed

    Ebi, Masahide; Shimura, Takaya; Murakami, Kenji; Yamada, Tomonori; Hirata, Yoshikazu; Tsukamoto, Hironobu; Mizoshita, Tsutomu; Tanida, Satoshi; Kataoka, Hiromi; Kamiya, Takeshi; Joh, Takashi

    2012-11-01

    Due to the possibility of lymph node metastasis, surgical resection is indicated for superficial oesophageal cancer with invasion to a depth greater than the muscularis mucosa. Although two magnifying endoscopy classifications are currently used to diagnose the depth of invasion, which classification is more suitable remains controversial. To compare and evaluate the clinical outcomes of two classifications for superficial oesophageal squamous cell carcinoma. This cross-sectional study consists of 44 superficial oesophageal squamous cell carcinoma lesions with magnification image-enhanced endoscopy images. Only magnifying endoscopic images were displayed to two experienced endoscopists who independently diagnosed the depth of invasion according to both classifications. The sensitivity of invasion greater than the muscularis mucosa tended to be higher in Inoue's classification than Arima's classification (78.3±6.2% vs. 50.0±3.0%; P=0.144), whereas the specificity was significantly lower in Inoue's classification than in Arima's classification (61.9±0.0% vs. 97.6±3.4%; P=0.043). For both classifications, rates of concordance were 90.9% and 84.4%, and κ statistics were 0.81 and 0.66, respectively. Our results suggest that Arima's classification is suitable for general screening before treatment to avoid unnecessary surgery. Inoue's classification is appropriate for assessing wide lesion. Copyright © 2012 Editrice Gastroenterologica Italiana S.r.l. Published by Elsevier Ltd. All rights reserved.

  20. CRISPR/Cas9-mediated noncoding RNA editing in human cancers.

    PubMed

    Yang, Jie; Meng, Xiaodan; Pan, Jinchang; Jiang, Nan; Zhou, Chengwei; Wu, Zhenhua; Gong, Zhaohui

    2018-01-02

    Cancer is characterized by multiple genetic and epigenetic alterations, including a higher prevalence of mutations of oncogenes and/or tumor suppressors. Mounting evidences have shown that noncoding RNAs (ncRNAs) are involved in the epigenetic regulation of cancer genes and their associated pathways. The clustered regularly interspaced short palindromic repeats (CRISPR)-associated nuclease 9 (CRISPR/Cas9) system, a revolutionary genome-editing technology, has shed light on ncRNA-based cancer therapy. Here, we briefly introduce the classifications and mechanisms of CRISPR/Cas9 system. Importantly, we mainly focused on the applications of CRISPR/Cas9 system as a molecular tool for ncRNA (microRNA, long noncoding RNA and circular RNA, etc.) editing in human cancers, and the novel techniques that are based on CRISPR/Cas9 system. Additionally, the off-target effects and the corresponding solutions as well as the challenges toward CRISPR/Cas9 were also evaluated and discussed. Long- and short-ncRNAs have been employed as targets in precision oncology, and CRISPR/Cas9-mediated ncRNA editing may provide an excellent way to cure cancer.

  1. Analysis of classifiers performance for classification of potential microcalcification

    NASA Astrophysics Data System (ADS)

    M. N., Arun K.; Sheshadri, H. S.

    2013-07-01

    Breast cancer is a significant public health problem in the world. According to the literature early detection improve breast cancer prognosis. Mammography is a screening tool used for early detection of breast cancer. About 10-30% cases are missed during the routine check as it is difficult for the radiologists to make accurate analysis due to large amount of data. The Microcalcifications (MCs) are considered to be important signs of breast cancer. It has been reported in literature that 30% - 50% of breast cancer detected radio graphically show MCs on mammograms. Histologic examinations report 62% to 79% of breast carcinomas reveals MCs. MC are tiny, vary in size, shape, and distribution, and MC may be closely connected to surrounding tissues. There is a major challenge using the traditional classifiers in the classification of individual potential MCs as the processing of mammograms in appropriate stage generates data sets with an unequal amount of information for both classes (i.e., MC, and Not-MC). Most of the existing state-of-the-art classification approaches are well developed by assuming the underlying training set is evenly distributed. However, they are faced with a severe bias problem when the training set is highly imbalanced in distribution. This paper addresses this issue by using classifiers which handle the imbalanced data sets. In this paper, we also compare the performance of classifiers which are used in the classification of potential MC.

  2. An integrated method for cancer classification and rule extraction from microarray data

    PubMed Central

    Huang, Liang-Tsung

    2009-01-01

    Different microarray techniques recently have been successfully used to investigate useful information for cancer diagnosis at the gene expression level due to their ability to measure thousands of gene expression levels in a massively parallel way. One important issue is to improve classification performance of microarray data. However, it would be ideal that influential genes and even interpretable rules can be explored at the same time to offer biological insight. Introducing the concepts of system design in software engineering, this paper has presented an integrated and effective method (named X-AI) for accurate cancer classification and the acquisition of knowledge from DNA microarray data. This method included a feature selector to systematically extract the relative important genes so as to reduce the dimension and retain as much as possible of the class discriminatory information. Next, diagonal quadratic discriminant analysis (DQDA) was combined to classify tumors, and generalized rule induction (GRI) was integrated to establish association rules which can give an understanding of the relationships between cancer classes and related genes. Two non-redundant datasets of acute leukemia were used to validate the proposed X-AI, showing significantly high accuracy for discriminating different classes. On the other hand, I have presented the abilities of X-AI to extract relevant genes, as well as to develop interpretable rules. Further, a web server has been established for cancer classification and it is freely available at . PMID:19272192

  3. Refining the definition of mandibular osteoradionecrosis in clinical trials: The cancer research UK HOPON trial (Hyperbaric Oxygen for the Prevention of Osteoradionecrosis).

    PubMed

    Shaw, Richard; Tesfaye, Binyam; Bickerstaff, Matt; Silcocks, Paul; Butterworth, Christopher

    2017-01-01

    Mandibular osteoradionecrosis (ORN) is a common and serious complication of head and neck radiotherapy for which there is little reliable evidence for prevention or treatment. The diagnosis and classification of ORN have been inconsistently and imprecisely defined, even in clinical trials. A systematic review of diagnosis and classifications of ORN with specific focus on clinical trials is presented. The most suitable classification was evaluated for consistency using blinded independent review of outcome data (clinical photographs and radiographs) in the HOPON trial. Of 16 ORN classifications found, only one (Notani) appeared suitable as an endpoint in clinical trials. Clinical records of 217 timepoints were analysed amongst 94 randomised patients in the HOPON trial. The only inconsistency in classification arose where minor bone spicules (MBS) were apparent, which occurred in 19% of patients. Some trial investigators judged MBS as clinically unimportant and not reflecting ORN, others classified as ORN based on rigid definitions in common clinical use. When MBS was added as a distinct category to the Notani classification this ambiguity was resolved and agreement between observers was achieved. Most definitions and clinical classifications are based on retrospective case series and may be unsuitable for prospective interventional trials of ORN prevention or treatment. When ORN is used as a primary or secondary outcome in prospective clinical trials, the use of Notani classification with the additional category of MBS is recommended as it avoids subjectivity and enhances reliability and consistency of reporting. Copyright © 2016 Elsevier Ltd. All rights reserved.

  4. Comparison of Clinical and Automated Breast Density Measurements: Implications for Risk Prediction and Supplemental Screening

    PubMed Central

    Brandt, Kathleen R.; Scott, Christopher G.; Ma, Lin; Mahmoudzadeh, Amir P.; Jensen, Matthew R.; Whaley, Dana H.; Wu, Fang Fang; Malkov, Serghei; Hruska, Carrie B.; Norman, Aaron D.; Heine, John; Shepherd, John; Pankratz, V. Shane; Kerlikowske, Karla

    2016-01-01

    Purpose To compare the classification of breast density with two automated methods, Volpara (version 1.5.0; Matakina Technology, Wellington, New Zealand) and Quantra (version 2.0; Hologic, Bedford, Mass), with clinical Breast Imaging Reporting and Data System (BI-RADS) density classifications and to examine associations of these measures with breast cancer risk. Materials and Methods In this study, 1911 patients with breast cancer and 4170 control subjects matched for age, race, examination date, and mammography machine were evaluated. Participants underwent mammography at Mayo Clinic or one of four sites within the San Francisco Mammography Registry between 2006 and 2012 and provided informed consent or a waiver for research, in compliance with HIPAA regulations and institutional review board approval. Digital mammograms were retrieved a mean of 2.1 years (range, 6 months to 6 years) before cancer diagnosis, with the corresponding clinical BI-RADS density classifications, and Volpara and Quantra density estimates were generated. Agreement was assessed with weighted κ statistics among control subjects. Breast cancer associations were evaluated with conditional logistic regression, adjusted for age and body mass index. Odds ratios, C statistics, and 95% confidence intervals (CIs) were estimated. Results Agreement between clinical BI-RADS density classifications and Volpara and Quantra BI-RADS estimates was moderate, with κ values of 0.57 (95% CI: 0.55, 0.59) and 0.46 (95% CI: 0.44, 0.47), respectively. Differences of up to 14% in dense tissue classification were found, with Volpara classifying 51% of women as having dense breasts, Quantra classifying 37%, and clinical BI-RADS assessment used to classify 43%. Clinical and automated measures showed similar breast cancer associations; odds ratios for extremely dense breasts versus scattered fibroglandular densities were 1.8 (95% CI: 1.5, 2.2), 1.9 (95% CI: 1.5, 2.5), and 2.3 (95% CI: 1.9, 2.8) for Volpara, Quantra, and BI-RADS classifications, respectively. Clinical BI-RADS assessment showed better discrimination of case status (C = 0.60; 95% CI: 0.58, 0.61) than did Volpara (C = 0.58; 95% CI: 0.56, 0.59) and Quantra (C = 0.56; 95% CI: 0.54, 0.58) BI-RADS classifications. Conclusion Automated and clinical assessments of breast density are similarly associated with breast cancer risk but differ up to 14% in the classification of women with dense breasts. This could have substantial effects on clinical practice patterns. © RSNA, 2015 Online supplemental material is available for this article. PMID:26694052

  5. Toward improving fine needle aspiration cytology by applying Raman microspectroscopy

    NASA Astrophysics Data System (ADS)

    Becker-Putsche, Melanie; Bocklitz, Thomas; Clement, Joachim; Rösch, Petra; Popp, Jürgen

    2013-04-01

    Medical diagnosis of biopsies performed by fine needle aspiration has to be very reliable. Therefore, pathologists/cytologists need additional biochemical information on single cancer cells for an accurate diagnosis. Accordingly, we applied three different classification models for discriminating various features of six breast cancer cell lines by analyzing Raman microspectroscopic data. The statistical evaluations are implemented by linear discriminant analysis (LDA) and support vector machines (SVM). For the first model, a total of 61,580 Raman spectra from 110 single cells are discriminated at the cell-line level with an accuracy of 99.52% using an SVM. The LDA classification based on Raman data achieved an accuracy of 94.04% by discriminating cell lines by their origin (solid tumor versus pleural effusion). In the third model, Raman cell spectra are classified by their cancer subtypes. LDA results show an accuracy of 97.45% and specificities of 97.78%, 99.11%, and 98.97% for the subtypes basal-like, HER2+/ER-, and luminal, respectively. These subtypes are confirmed by gene expression patterns, which are important prognostic features in diagnosis. This work shows the applicability of Raman spectroscopy and statistical data handling in analyzing cancer-relevant biochemical information for advanced medical diagnosis on the single-cell level.

  6. Molecular classification of gastric cancer.

    PubMed

    Chia, N-Y; Tan, P

    2016-05-01

    Gastric cancer (GC), a heterogeneous disease characterized by epidemiologic and histopathologic differences across countries, is a leading cause of cancer-related death. Treatment of GC patients is currently suboptimal due to patients being commonly treated in a uniform fashion irrespective of disease subtype. With the advent of next-generation sequencing and other genomic technologies, GCs are now being investigated in great detail at the molecular level. High-throughput technologies now allow a comprehensive study of genomic and epigenomic alterations associated with GC. Gene mutations, chromosomal aberrations, differential gene expression and epigenetic alterations are some of the genetic/epigenetic influences on GC pathogenesis. In addition, integrative analyses of molecular profiling data have led to the identification of key dysregulated pathways and importantly, the establishment of GC molecular classifiers. Recently, The Cancer Genome Atlas (TCGA) network proposed a four subtype classification scheme for GC based on the underlying tumor molecular biology of each subtype. This landmark study, together with other studies, has expanded our understanding on the characteristics of GC at the molecular level. Such knowledge may improve the medical management of GC in the future. © The Author 2016. Published by Oxford University Press on behalf of the European Society for Medical Oncology. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  7. Gene selection for microarray cancer classification using a new evolutionary method employing artificial intelligence concepts.

    PubMed

    Dashtban, M; Balafar, Mohammadali

    2017-03-01

    Gene selection is a demanding task for microarray data analysis. The diverse complexity of different cancers makes this issue still challenging. In this study, a novel evolutionary method based on genetic algorithms and artificial intelligence is proposed to identify predictive genes for cancer classification. A filter method was first applied to reduce the dimensionality of feature space followed by employing an integer-coded genetic algorithm with dynamic-length genotype, intelligent parameter settings, and modified operators. The algorithmic behaviors including convergence trends, mutation and crossover rate changes, and running time were studied, conceptually discussed, and shown to be coherent with literature findings. Two well-known filter methods, Laplacian and Fisher score, were examined considering similarities, the quality of selected genes, and their influences on the evolutionary approach. Several statistical tests concerning choice of classifier, choice of dataset, and choice of filter method were performed, and they revealed some significant differences between the performance of different classifiers and filter methods over datasets. The proposed method was benchmarked upon five popular high-dimensional cancer datasets; for each, top explored genes were reported. Comparing the experimental results with several state-of-the-art methods revealed that the proposed method outperforms previous methods in DLBCL dataset. Copyright © 2017 Elsevier Inc. All rights reserved.

  8. Multiclass classification for skin cancer profiling based on the integration of heterogeneous gene expression series.

    PubMed

    Gálvez, Juan Manuel; Castillo, Daniel; Herrera, Luis Javier; San Román, Belén; Valenzuela, Olga; Ortuño, Francisco Manuel; Rojas, Ignacio

    2018-01-01

    Most of the research studies developed applying microarray technology to the characterization of different pathological states of any disease may fail in reaching statistically significant results. This is largely due to the small repertoire of analysed samples, and to the limitation in the number of states or pathologies usually addressed. Moreover, the influence of potential deviations on the gene expression quantification is usually disregarded. In spite of the continuous changes in omic sciences, reflected for instance in the emergence of new Next-Generation Sequencing-related technologies, the existing availability of a vast amount of gene expression microarray datasets should be properly exploited. Therefore, this work proposes a novel methodological approach involving the integration of several heterogeneous skin cancer series, and a later multiclass classifier design. This approach is thus a way to provide the clinicians with an intelligent diagnosis support tool based on the use of a robust set of selected biomarkers, which simultaneously distinguishes among different cancer-related skin states. To achieve this, a multi-platform combination of microarray datasets from Affymetrix and Illumina manufacturers was carried out. This integration is expected to strengthen the statistical robustness of the study as well as the finding of highly-reliable skin cancer biomarkers. Specifically, the designed operation pipeline has allowed the identification of a small subset of 17 differentially expressed genes (DEGs) from which to distinguish among 7 involved skin states. These genes were obtained from the assessment of a number of potential batch effects on the gene expression data. The biological interpretation of these genes was inspected in the specific literature to understand their underlying information in relation to skin cancer. Finally, in order to assess their possible effectiveness in cancer diagnosis, a cross-validation Support Vector Machines (SVM)-based classification including feature ranking was performed. The accuracy attained exceeded the 92% in overall recognition of the 7 different cancer-related skin states. The proposed integration scheme is expected to allow the co-integration with other state-of-the-art technologies such as RNA-seq.

  9. Superpixel-based classification of gastric chromoendoscopy images

    NASA Astrophysics Data System (ADS)

    Boschetto, Davide; Grisan, Enrico

    2017-03-01

    Chromoendoscopy (CH) is a gastroenterology imaging modality that involves the staining of tissues with methylene blue, which reacts with the internal walls of the gastrointestinal tract, improving the visual contrast in mucosal surfaces and thus enhancing a doctor's ability to screen precancerous lesions or early cancer. This technique helps identify areas that can be targeted for biopsy or treatment and in this work we will focus on gastric cancer detection. Gastric chromoendoscopy for cancer detection has several taxonomies available, one of which classifies CH images into three classes (normal, metaplasia, dysplasia) based on color, shape and regularity of pit patterns. Computer-assisted diagnosis is desirable to help us improve the reliability of the tissue classification and abnormalities detection. However, traditional computer vision methodologies, mainly segmentation, do not translate well to the specific visual characteristics of a gastroenterology imaging scenario. We propose the exploitation of a first unsupervised segmentation via superpixel, which groups pixels into perceptually meaningful atomic regions, used to replace the rigid structure of the pixel grid. For each superpixel, a set of features is extracted and then fed to a random forest based classifier, which computes a model used to predict the class of each superpixel. The average general accuracy of our model is 92.05% in the pixel domain (86.62% in the superpixel domain), while detection accuracies on the normal and abnormal class are respectively 85.71% and 95%. Eventually, the whole image class can be predicted image through a majority vote on each superpixel's predicted class.

  10. CoReCG: a comprehensive database of genes associated with colon-rectal cancer

    PubMed Central

    Agarwal, Rahul; Kumar, Binayak; Jayadev, Msk; Raghav, Dhwani; Singh, Ashutosh

    2016-01-01

    Cancer of large intestine is commonly referred as colorectal cancer, which is also the third most frequently prevailing neoplasm across the globe. Though, much of work is being carried out to understand the mechanism of carcinogenesis and advancement of this disease but, fewer studies has been performed to collate the scattered information of alterations in tumorigenic cells like genes, mutations, expression changes, epigenetic alteration or post translation modification, genetic heterogeneity. Earlier findings were mostly focused on understanding etiology of colorectal carcinogenesis but less emphasis were given for the comprehensive review of the existing findings of individual studies which can provide better diagnostics based on the suggested markers in discrete studies. Colon Rectal Cancer Gene Database (CoReCG), contains 2056 colon-rectal cancer genes information involved in distinct colorectal cancer stages sourced from published literature with an effective knowledge based information retrieval system. Additionally, interactive web interface enriched with various browsing sections, augmented with advance search facility for querying the database is provided for user friendly browsing, online tools for sequence similarity searches and knowledge based schema ensures a researcher friendly information retrieval mechanism. Colorectal cancer gene database (CoReCG) is expected to be a single point source for identification of colorectal cancer-related genes, thereby helping with the improvement of classification, diagnosis and treatment of human cancers. Database URL: lms.snu.edu.in/corecg PMID:27114494

  11. Assessing Unmet Information Needs of Breast Cancer Survivors: Exploratory Study of Online Health Forums Using Text Classification and Retrieval.

    PubMed

    McRoy, Susan; Rastegar-Mojarad, Majid; Wang, Yanshan; Ruddy, Kathryn J; Haddad, Tufia C; Liu, Hongfang

    2018-05-15

    Patient education materials given to breast cancer survivors may not be a good fit for their information needs. Needs may change over time, be forgotten, or be misreported, for a variety of reasons. An automated content analysis of survivors' postings to online health forums can identify expressed information needs over a span of time and be repeated regularly at low cost. Identifying these unmet needs can guide improvements to existing education materials and the creation of new resources. The primary goals of this project are to assess the unmet information needs of breast cancer survivors from their own perspectives and to identify gaps between information needs and current education materials. This approach employs computational methods for content modeling and supervised text classification to data from online health forums to identify explicit and implicit requests for health-related information. Potential gaps between needs and education materials are identified using techniques from information retrieval. We provide a new taxonomy for the classification of sentences in online health forum data. 260 postings from two online health forums were selected, yielding 4179 sentences for coding. After annotation of data and training alternative one-versus-others classifiers, a random forest-based approach achieved F1 scores from 66% (Other, dataset2) to 90% (Medical, dataset1) on the primary information types. 136 expressions of need were used to generate queries to indexed education materials. Upon examination of the best two pages retrieved for each query, 12% (17/136) of queries were found to have relevant content by all coders, and 33% (45/136) were judged to have relevant content by at least one. Text from online health forums can be analyzed effectively using automated methods. Our analysis confirms that breast cancer survivors have many information needs that are not covered by the written documents they typically receive, as our results suggest that at most a third of breast cancer survivors' questions would be addressed by the materials currently provided to them. ©Susan McRoy, Majid Rastegar-Mojarad, Yanshan Wang, Kathryn J. Ruddy, Tufia C. Haddad, Hongfang Liu. Originally published in JMIR Cancer (http://cancer.jmir.org), 15.05.2018.

  12. ‘I–We’ boundary fluctuations in couple adjustment to rectal cancer and life with a permanent colostomy

    PubMed Central

    McCarthy, Molly; Fergus, Karen; Miller, Debbie

    2016-01-01

    This study investigates couples’ adjustment to rectal cancer and a colostomy using the ‘Classification System of Couple Adjustment to Cancer’, a framework delineating fluctuations in couples’ sense of ‘I’ and ‘We’ in response to cancer. Nine couples affected by rectal cancer and adjusting to life with a colostomy were interviewed. A theoretical thematic analysis of the transcripts was conducted; nearly all ‘I–We’ shifts of the Classification System of Couple Adjustment to Cancer were observed – often in unique ways in response to rectal cancer–specific challenges – and one new shift was described. The results provide a novel and experientially grounded means of conceptualizing complex dyadic coping processes. PMID:28070388

  13. Effects of the length of central cancer registry operations on identification of subsequent cancers and on survival estimates.

    PubMed

    Qiao, Baozhen; Schymura, Maria J; Kahn, Amy R

    2016-10-01

    Population-based cancer survival analyses have traditionally been based on the first primary cancer. Recent studies have brought this practice into question, arguing that varying registry reference dates affect the ability to identify earlier cancers, resulting in selection bias. We used a theoretical approach to evaluate the extent to which the length of registry operations affects the classification of first versus subsequent cancers and consequently survival estimates. Sequence number central was used to classify tumors from the New York State Cancer Registry, diagnosed 2001-2010, as either first primaries (value=0 or 1) or subsequent primaries (≥2). A set of three sequence numbers, each based on an assumed reference year (1976, 1986 or 1996), was assigned to each tumor. Percent of subsequent cancers was evaluated by reference year, cancer site and age. 5-year relative survival estimates were compared under four different selection scenarios. The percent of cancer cases classified as subsequent primaries was 15.3%, 14.3% and 11.2% for reference years 1976, 1986 and 1996, respectively; and varied by cancer site and age. When only the first primary was included, shorter registry operation time was associated with slightly lower 5-year survival estimates. When all primary cancers were included, survival estimates decreased, with the largest decreases seen for the earliest reference year. Registry operation length affected the identification of subsequent cancers, but the overall effect of this misclassification on survival estimates was small. Survival estimates based on all primary cancers were slightly lower, but might be more comparable across registries. Copyright © 2016 Elsevier Ltd. All rights reserved.

  14. SDL: Saliency-Based Dictionary Learning Framework for Image Similarity.

    PubMed

    Sarkar, Rituparna; Acton, Scott T

    2018-02-01

    In image classification, obtaining adequate data to learn a robust classifier has often proven to be difficult in several scenarios. Classification of histological tissue images for health care analysis is a notable application in this context due to the necessity of surgery, biopsy or autopsy. To adequately exploit limited training data in classification, we propose a saliency guided dictionary learning method and subsequently an image similarity technique for histo-pathological image classification. Salient object detection from images aids in the identification of discriminative image features. We leverage the saliency values for the local image regions to learn a dictionary and respective sparse codes for an image, such that the more salient features are reconstructed with smaller error. The dictionary learned from an image gives a compact representation of the image itself and is capable of representing images with similar content, with comparable sparse codes. We employ this idea to design a similarity measure between a pair of images, where local image features of one image, are encoded with the dictionary learned from the other and vice versa. To effectively utilize the learned dictionary, we take into account the contribution of each dictionary atom in the sparse codes to generate a global image representation for image comparison. The efficacy of the proposed method was evaluated using three tissue data sets that consist of mammalian kidney, lung and spleen tissue, breast cancer, and colon cancer tissue images. From the experiments, we observe that our methods outperform the state of the art with an increase of 14.2% in the average classification accuracy over all data sets.

  15. Predicting Time-to-Relapse in Breast Cancer Using Neural Networks

    DTIC Science & Technology

    1997-12-01

    CODE 17. SECURITY CLASSIFICATION OF REPORT Unclassified 118. SECURITY CLASSIFICATION OF THIS PAGE Unclassified 19. SECURITY CLASSIFICATION OF...Lowell WE, and Davis GL. A neural network that predicts psychiatric length of stay. MD Computing 10:87-92, 1993. Ebell MH. Artificial neural netowrks

  16. Risk Stratification of Neck Lesions Detected Sonographically During the Follow-Up of Differentiated Thyroid Cancer.

    PubMed

    Lamartina, Livia; Grani, Giorgio; Biffoni, Marco; Giacomelli, Laura; Costante, Giuseppe; Lupo, Stefania; Maranghi, Marianna; Plasmati, Katia; Sponziello, Marialuisa; Trulli, Fabiana; Verrienti, Antonella; Filetti, Sebastiano; Durante, Cosimo

    2016-08-01

    The European Thyroid Association (ETA) has classified posttreatment cervical ultrasound findings in thyroid cancer patients based on their association with disease persistence/recurrence. The objective of the study was to assess this classification's ability to predict the growth and persistence of such lesions during active posttreatment surveillance of patients with differentiated thyroid cancer (DTC). This was a retrospective, observational study. The study was conducted at a thyroid cancer center in a large Italian teaching hospital. Center referrals (2005-2014) were reviewed and patients selected with pathologically-confirmed DTC; total thyroidectomy, with or without neck dissection and/or radioiodine remnant ablation; abnormal findings on two or more consecutive posttreatment neck sonograms; and subsequent follow-up consisting of active surveillance. Baseline ultrasound abnormalities (thyroid bed masses, lymph nodes) were classified according to the ETA system. Patients were divided into group S (those with one or more lesions classified as suspicious) and group I (indeterminate lesions only). We recorded baseline and follow-up clinical data through June 30, 2015. The main outcomes were patients with growth (>3 mm, largest diameter) of one or more lesions during follow-up and patients with one or more persistent lesions at the final visit. The cohort included 58 of the 637 DTC cases screened (9%). A total of 113 lesions were followed up (18 thyroid bed masses, 95 lymph nodes). During surveillance (median 3.7 y), group I had significantly lower rates than group S of lesion growth (8% vs 36%, P = .01) and persistence (64% vs 97%, P = .014). The median time to scan normalization was 2.9 years. The ETA's evidence-based classification of sonographically detected neck abnormalities can help identify papillary thyroid cancer patients eligible for more relaxed follow-up.

  17. Label-free image-based detection of drug resistance with optofluidic time-stretch microscopy (Conference Presentation)

    NASA Astrophysics Data System (ADS)

    Kobayashi, Hirofumi; Lei, Cheng; Mao, Ailin; Jiang, Yiyue; Guo, Baoshan; Ozeki, Yasuyuki; Goda, Keisuke

    2017-02-01

    Acquired drug resistance is a fundamental predicament in cancer therapy. Early detection of drug-resistant cancer cells during or after treatment is expected to benefit patients from unnecessary drug administration and thus play a significant role in the development of a therapeutic strategy. However, the development of an effective method of detecting drug-resistant cancer cells is still in its infancy due to their complex mechanism in drug resistance. To address this problem, we propose and experimentally demonstrate label-free image-based drug resistance detection with optofluidic time-stretch microscopy using leukemia cells (K562 and K562/ADM). By adding adriamycin (ADM) to both K562 and K562/ADM (ADM-resistant K562 cells) cells, both types of cells express unique morphological changes, which are subsequently captured by an optofluidic time-stretch microscope. These unique morphological changes are extracted as image features and are subjected to supervised machine learning for cell classification. We hereby have successfully differentiated K562 and K562/ADM solely with label-free images, which suggests that our technique is capable of detecting drug-resistant cancer cells. Our optofluidic time-stretch microscope consists of a time-stretch microscope with a high spatial resolution of 780 nm at a 1D frame rate of 75 MHz and a microfluidic device that focuses and orders cells. We compare various machine learning algorithms as well as various concentrations of ADM for cell classification. Owing to its unprecedented versatility of using label-free image and its independency from specific molecules, our technique holds great promise for detecting drug resistance of cancer cells for which its underlying mechanism is still unknown or chemical probes are still unavailable.

  18. Identification of immune cell infiltration in hematoxylin-eosin stained breast cancer samples: texture-based classification of tissue morphologies

    NASA Astrophysics Data System (ADS)

    Turkki, Riku; Linder, Nina; Kovanen, Panu E.; Pellinen, Teijo; Lundin, Johan

    2016-03-01

    The characteristics of immune cells in the tumor microenvironment of breast cancer capture clinically important information. Despite the heterogeneity of tumor-infiltrating immune cells, it has been shown that the degree of infiltration assessed by visual evaluation of hematoxylin-eosin (H and E) stained samples has prognostic and possibly predictive value. However, quantification of the infiltration in H and E-stained tissue samples is currently dependent on visual scoring by an expert. Computer vision enables automated characterization of the components of the tumor microenvironment, and texture-based methods have successfully been used to discriminate between different tissue morphologies and cell phenotypes. In this study, we evaluate whether local binary pattern texture features with superpixel segmentation and classification with support vector machine can be utilized to identify immune cell infiltration in H and E-stained breast cancer samples. Guided with the pan-leukocyte CD45 marker, we annotated training and test sets from 20 primary breast cancer samples. In the training set of arbitrary sized image regions (n=1,116) a 3-fold cross-validation resulted in 98% accuracy and an area under the receiver-operating characteristic curve (AUC) of 0.98 to discriminate between immune cell -rich and - poor areas. In the test set (n=204), we achieved an accuracy of 96% and AUC of 0.99 to label cropped tissue regions correctly into immune cell -rich and -poor categories. The obtained results demonstrate strong discrimination between immune cell -rich and -poor tissue morphologies. The proposed method can provide a quantitative measurement of the degree of immune cell infiltration and applied to digitally scanned H and E-stained breast cancer samples for diagnostic purposes.

  19. Automated computer-aided detection of prostate cancer in MR images: from a whole-organ to a zone-based approach

    NASA Astrophysics Data System (ADS)

    Litjens, G. J. S.; Barentsz, J. O.; Karssemeijer, N.; Huisman, H. J.

    2012-03-01

    MRI has shown to have great potential in prostate cancer localization and grading, but interpreting those exams requires expertise that is not widely available. Therefore, CAD applications are being developed to aid radiologists in detecting prostate cancer. Existing CAD applications focus on the prostate as a whole. However, in clinical practice transition zone cancer and peripheral zone cancer are considered to have different appearances. In this paper we present zone-specific CAD, in addition to an atlas based segmentation technique which includes zonal segmentation. Our CAD system consists of a detection and a classification stage. Prior to the detection stage the prostate is segmented into two zones. After segmentation features are extracted. Subsequently a likelihood map is generated on which local maxima detection is performed. For each local maximum a region is segmented. In the classification stage additional shape features are calculated, after which the regions are classified. Validation was performed on 288 data sets with MR-guided biopsy results as ground truth. Freeresponse Receiver Operating Characteristic (FROC) analysis was used for statistical evaluation. The difference between whole-prostate and zone-specific CAD was assessed using the difference between the FROCs. Our results show that evaluating the two zones separately results in an increase in performance compared to whole-prostate CAD. The FROC curves at .1, 1 and 3 false positives have a sensitivity of 0.0, 0.55 and 0.72 for whole-prostate and 0.08, 0.57 and 0.80 for zone-specific CAD. The FROC curve of the zone-specific CAD also showed significantly better performance overall (p < 0.05).

  20. Cell nuclei attributed relational graphs for efficient representation and classification of gastric cancer in digital histopathology

    NASA Astrophysics Data System (ADS)

    Sharma, Harshita; Zerbe, Norman; Heim, Daniel; Wienert, Stephan; Lohmann, Sebastian; Hellwich, Olaf; Hufnagl, Peter

    2016-03-01

    This paper describes a novel graph-based method for efficient representation and subsequent classification in histological whole slide images of gastric cancer. Her2/neu immunohistochemically stained and haematoxylin and eosin stained histological sections of gastric carcinoma are digitized. Immunohistochemical staining is used in practice by pathologists to determine extent of malignancy, however, it is laborious to visually discriminate the corresponding malignancy levels in the more commonly used haematoxylin and eosin stain, and this study attempts to solve this problem using a computer-based method. Cell nuclei are first isolated at high magnification using an automatic cell nuclei segmentation strategy, followed by construction of cell nuclei attributed relational graphs of the tissue regions. These graphs represent tissue architecture comprehensively, as they contain information about cell nuclei morphology as vertex attributes, along with knowledge of neighborhood in the form of edge linking and edge attributes. Global graph characteristics are derived and ensemble learning is used to discriminate between three types of malignancy levels, namely, non-tumor, Her2/neu positive tumor and Her2/neu negative tumor. Performance is compared with state of the art methods including four texture feature groups (Haralick, Gabor, Local Binary Patterns and Varma Zisserman features), color and intensity features, and Voronoi diagram and Delaunay triangulation. Texture, color and intensity information is also combined with graph-based knowledge, followed by correlation analysis. Quantitative assessment is performed using two cross validation strategies. On investigating the experimental results, it can be concluded that the proposed method provides a promising way for computer-based analysis of histopathological images of gastric cancer.

  1. TNM and Modified Dukes staging along with the demographic characteristics of patients with colorectal carcinoma

    PubMed Central

    Akkoca, Ayşe Neslin; Yanık, Serdar; Özdemir, Zeynep Tuğba; Cihan, Fatma Gökşin; Sayar, Süleyman; Cincin, Tarık Gandi; Çam, Akın; Özer, Cahit

    2014-01-01

    Aim: Colon adenocarcinoma, is the most common cancer in gastrointesinal system (GIS). The whole world is an important cause of morbidity and mortality. TNM and modified Dukes classification which has great importance in the diagnosis and treatment of Colorectal cancer (CRC). TNM and Modified Dukes classification results of histopathological examination and the demographic characteristics of patients and their relation were investigated. Materials and methods: Lower gastrointestinal operation results of 85 patients were examined accepted to clinical Pathology between January 1997-November 2013. Colon cancer had been diagnosed at 85 patients with pathology materials and staging was done according to the TNM and Modified Duke classification. The demographic characteristics of patients, differentiation grade, lymph node involvement, serous involvement were evaluated retrospectively. Results: In this study 37 patients (43.52%) were men and 48 (56.47%) were women. Ages of patients were between 19 and 87 with a mean age of 57.31 ± 15.31. Lymph node, differentiation, serosa involvement, Modified Dukes and TNM classification was assessed according to sex and age. TNM classification by sex was not statistically significant (p > 0.05). There was no statistically significant relationship between age and differentiation (p = 0.085). Value of differentiation increased towards from 1 to 3 inversely proportional to age. So young patients defined as well-differentiated at the conclusion. Negative relationship was evaluated between age and TNM Class variables. As a result, the relationship between age and TNM was not significant (p > 0.05). However, with increasing age the degree of staging was also found to increase. TNM classification was associated with the differentiation and it was significant (p = 0.043). Conclusion: Colon cancer, when contracted at an early stage, it is suitable for surgery and curative treatment can be done with minimal morbidity and mortality. However, some of the patients have advanced disease at diagnosis and their 5-year survival rate is only 8%. Every year there is prolongation of overall survival of colon cancer. It is so common cancer type so that determination of prognostic factors, disease staging and treatment strategy which affects survival is significant. PMID:25356145

  2. Classification of normal and abnormal images of lung cancer

    NASA Astrophysics Data System (ADS)

    Bhatnagar, Divyesh; Tiwari, Amit Kumar; Vijayarajan, V.; Krishnamoorthy, A.

    2017-11-01

    To find the exact symptoms of lung cancer is difficult, because of the formation of the most cancers tissues, wherein large structure of tissues is intersect in a different way. This problem can be evaluated with the help of digital images. In this strategy images will be examined with basic operation of PCA Algorithm. In this paper, GLCM method is used for pre-processing of the snap shots and function extraction system and to test the level of diseases of a patient in its premature stage get to know it is regular or unusual. With the help of result stage of cancer will be evaluated. With the help of dataset and result survival rate of cancer patient can be estimated. Result is based totally on the precise and wrong arrangement of the patterns of tissues.

  3. On the effect of experimental noise on the classification of biological samples using Raman micro-spectroscopy

    NASA Astrophysics Data System (ADS)

    Barton, Sinead J.; Kerr, Laura T.; Domijan, Katarina; Hennelly, Bryan M.

    2016-04-01

    Raman micro-spectroscopy is an optoelectronic technique that can be used to evaluate the chemical composition of biological samples and has been shown to be a powerful diagnostic tool for the investigation of various cancer related diseases including bladder, breast, and cervical cancer. Raman scattering is an inherently weak process with approximately 1 in 107 photons undergoing scattering and for this reason, noise from the recording system can have a significant impact on the quality of the signal, and its suitability for diagnostic classification. The main sources of noise in the recorded signal are shot noise, CCD dark current, and CCD readout noise. Shot noise results from the low signal photon count while dark current results from thermally generated electrons in the semiconductor pixels. Both of these noise sources are time dependent; readout noise is time independent but is inherent in each individual recording and results in the fundamental limit of measurement, arising from the internal electronics of the camera. In this paper, each of the aforementioned noise sources are analysed in isolation, and used to experimentally validate a mathematical model. This model is then used to simulate spectra that might be acquired under various experimental conditions including the use of different cameras, different source wavelength, and power etc. Simulated noisy datasets of T24 and RT112 cell line spectra are generated based on true cell Raman spectrum irradiance values (recorded using very long exposure times) and the addition of simulated noise. These datasets are then input to multivariate classification using Principal Components Analysis and Linear Discriminant Analysis. This method enables an investigation into the effect of noise on the sensitivity and specificity of Raman based classification under various experimental conditions and using different equipment.

  4. [Diagnosis, prognosis, and prediction of non-small cell lung cancer. Importance of morphology, immunohistochemistry and molecular pathology].

    PubMed

    Warth, A

    2015-11-01

    Tumor diagnostics are based on histomorphology, immunohistochemistry and molecular pathological analysis of mutations, translocations and amplifications which are of diagnostic, prognostic and/or predictive value. In recent decades only histomorphology was used to classify lung cancer as either small (SCLC) or non-small cell lung cancer (NSCLC), although NSCLC was further subdivided in different entities; however, as no specific therapy options were available classification of specific subtypes was not clinically meaningful. This fundamentally changed with the discovery of specific molecular alterations in adenocarcinoma (ADC), e.g. mutations in KRAS, EGFR and BRAF or translocations of the ALK and ROS1 gene loci, which now form the basis of targeted therapies and have led to a significantly improved patient outcome. The diagnostic, prognostic and predictive value of imaging, morphological, immunohistochemical and molecular characteristics as well as their interaction were systematically assessed in a large cohort with available clinical data including patient survival. Specific and sensitive diagnostic markers and marker panels were defined and diagnostic test algorithms for predictive biomarker assessment were optimized. It was demonstrated that the semi-quantitative assessment of ADC growth patterns is a stage-independent predictor of survival and is reproducibly applicable in the routine setting. Specific histomorphological characteristics correlated with computed tomography (CT) imaging features and thus allowed an improved interdisciplinary classification, especially in the preoperative or palliative setting. Moreover, specific molecular characteristics, for example BRAF mutations and the proliferation index (Ki-67) were identified as clinically relevant prognosticators. Comprehensive clinical, morphological, immunohistochemical and molecular assessment of NSCLCs allow an optimized patient stratification. Respective algorithms now form the backbone of the 2015 lung cancer World Health Organization (WHO) classification.

  5. Pathological and Molecular Evaluation of Pancreatic Neoplasms

    PubMed Central

    Rishi, Arvind; Goggins, Michael; Wood, Laura D.; Hruban, Ralph H.

    2015-01-01

    Pancreatic neoplasms are morphologically and genetically heterogeneous and include wide variety of neoplasms ranging from benign to malignant with an extremely poor clinical outcome. Our understanding of these pancreatic neoplasms has improved significantly with recent advances in cancer sequencing. Awareness of molecular pathogenesis brings in new opportunities for early detection, improved prognostication, and personalized gene-specific therapies. Here we review the pathological classification of pancreatic neoplasms from their molecular and genetic perspective. All of the major tumor types that arise in the pancreas have been sequenced, and a new classification that incorporates molecular findings together with pathological findings is now possible (Table 1). This classification has significant implications for our understanding of why tumors aggregate in some families, for the development of early detection tests, and for the development of personalized therapies for patients with established cancers. Here we describe this new classification using the framework of the standard histological classification. PMID:25726050

  6. Use of routinely available clinical, nutritional, and functional criteria to classify cachexia in advanced cancer patients.

    PubMed

    Vigano, Antonio A L; Morais, José A; Ciutto, Lorella; Rosenthall, Leonard; di Tomasso, Jonathan; Khan, Sarah; Olders, Henry; Borod, Manuel; Kilgour, Robert D

    2017-10-01

    Cachexia is a highly prevalent syndrome in cancer and chronic diseases. However, due to the heterogeneous features of cancer cachexia, its identification and classification challenge clinical practitioners. To determine the clinical relevance of a cancer cachexia classification system in advanced cancer patients. Beginning with the four-stage classification system proposed for cachexia [non-cachexia (NCa), pre-cachexia (PCa), cachexia (Ca) and refractory cachexia (RCa)], we assigned patients to these cachexia stages according to five classification criteria available in clinical practice: 1) biochemistry (high C-reactive protein or leukocytes, or hypoalbuminemia, or anemia), 2) food intake (normal/decreased), weight loss: 3) moderate (≤5%) or 4) significant (>5%/past six months) and 5) performance status (Eastern Cooperative Oncology Group Performance Status ≥ 3). We then determined if symptom severity, body composition changes, functional levels, hospitalizations and survival rates varied significantly across cachexia stages. Two-hundred and ninety-seven advanced cancer patients with primary gastrointestinal and lung tumors were included. Patients were classified into Ca (36%), PCa and RCa (21%, respectively) and NCa (15%). Significant (p < 0.05) differences were observed among cachexia stages for most of the outcome measures (symptoms, body composition, handgrip strength, emergency room visits and length of hospital stays) according to cachexia severity. Survival also differed between cachexia stages (except between PCa and Ca). Five clinical criteria can be used to stage cancer cachexia patients and predict important clinical, nutritional and functional outcomes. The lack of statistical difference between PCa and Ca in almost all clinical outcomes examined suggests either that the PCa group includes patients already affected by early cachexia or that more precise criteria are needed to differentiate PCa from Ca patients. More studies are required to validate these findings. Copyright © 2016. Published by Elsevier Ltd.

  7. Classification of melanoma lesions using sparse coded features and random forests

    NASA Astrophysics Data System (ADS)

    Rastgoo, Mojdeh; Lemaître, Guillaume; Morel, Olivier; Massich, Joan; Garcia, Rafael; Meriaudeau, Fabrice; Marzani, Franck; Sidibé, Désiré

    2016-03-01

    Malignant melanoma is the most dangerous type of skin cancer, yet it is the most treatable kind of cancer, conditioned by its early diagnosis which is a challenging task for clinicians and dermatologists. In this regard, CAD systems based on machine learning and image processing techniques are developed to differentiate melanoma lesions from benign and dysplastic nevi using dermoscopic images. Generally, these frameworks are composed of sequential processes: pre-processing, segmentation, and classification. This architecture faces mainly two challenges: (i) each process is complex with the need to tune a set of parameters, and is specific to a given dataset; (ii) the performance of each process depends on the previous one, and the errors are accumulated throughout the framework. In this paper, we propose a framework for melanoma classification based on sparse coding which does not rely on any pre-processing or lesion segmentation. Our framework uses Random Forests classifier and sparse representation of three features: SIFT, Hue and Opponent angle histograms, and RGB intensities. The experiments are carried out on the public PH2 dataset using a 10-fold cross-validation. The results show that SIFT sparse-coded feature achieves the highest performance with sensitivity and specificity of 100% and 90.3% respectively, with a dictionary size of 800 atoms and a sparsity level of 2. Furthermore, the descriptor based on RGB intensities achieves similar results with sensitivity and specificity of 100% and 71.3%, respectively for a smaller dictionary size of 100 atoms. In conclusion, dictionary learning techniques encode strong structures of dermoscopic images and provide discriminant descriptors.

  8. Genomic landscape of gastric cancer: molecular classification and potential targets.

    PubMed

    Guo, Jiawei; Yu, Weiwei; Su, Hui; Pang, Xiufeng

    2017-02-01

    Gastric cancer imposes a considerable health burden worldwide, and its mortality ranks as the second highest for all types of cancers. The limited knowledge of the molecular mechanisms underlying gastric cancer tumorigenesis hinders the development of therapeutic strategies. However, ongoing collaborative sequencing efforts facilitate molecular classification and unveil the genomic landscape of gastric cancer. Several new drivers and tumorigenic pathways in gastric cancer, including chromatin remodeling genes, RhoA-related pathways, TP53 dysregulation, activation of receptor tyrosine kinases, stem cell pathways and abnormal DNA methylation, have been revealed. These newly identified genomic alterations await translation into clinical diagnosis and targeted therapies. Considering that loss-of-function mutations are intractable, synthetic lethality could be employed when discussing feasible therapeutic strategies. Although many challenges remain to be tackled, we are optimistic regarding improvements in the prognosis and treatment of gastric cancer in the near future.

  9. Mixture of learners for cancer stem cell detection using CD13 and H and E stained images

    NASA Astrophysics Data System (ADS)

    Oǧuz, Oǧuzhan; Akbaş, Cem Emre; Mallah, Maen; Taşdemir, Kasım.; Akhan Güzelcan, Ece; Muenzenmayer, Christian; Wittenberg, Thomas; Üner, Ayşegül; Cetin, A. E.; ćetin Atalay, Rengül

    2016-03-01

    In this article, algorithms for cancer stem cell (CSC) detection in liver cancer tissue images are developed. Conventionally, a pathologist examines of cancer cell morphologies under microscope. Computer aided diagnosis systems (CAD) aims to help pathologists in this tedious and repetitive work. The first algorithm locates CSCs in CD13 stained liver tissue images. The method has also an online learning algorithm to improve the accuracy of detection. The second family of algorithms classify the cancer tissues stained with H and E which is clinically routine and cost effective than immunohistochemistry (IHC) procedure. The algorithms utilize 1D-SIFT and Eigen-analysis based feature sets as descriptors. Normal and cancerous tissues can be classified with 92.1% accuracy in H and E stained images. Classification accuracy of low and high-grade cancerous tissue images is 70.4%. Therefore, this study paves the way for diagnosing the cancerous tissue and grading the level of it using H and E stained microscopic tissue images.

  10. Gene Selection and Cancer Classification: A Rough Sets Based Approach

    NASA Astrophysics Data System (ADS)

    Sun, Lijun; Miao, Duoqian; Zhang, Hongyun

    Indentification of informative gene subsets responsible for discerning between available samples of gene expression data is an important task in bioinformatics. Reducts, from rough sets theory, corresponding to a minimal set of essential genes for discerning samples, is an efficient tool for gene selection. Due to the compuational complexty of the existing reduct algoritms, feature ranking is usually used to narrow down gene space as the first step and top ranked genes are selected . In this paper,we define a novel certierion based on the expression level difference btween classes and contribution to classification of the gene for scoring genes and present a algorithm for generating all possible reduct from informative genes.The algorithm takes the whole attribute sets into account and find short reduct with a significant reduction in computational complexity. An exploration of this approach on benchmark gene expression data sets demonstrates that this approach is successful for selecting high discriminative genes and the classification accuracy is impressive.

  11. Mass type-specific sparse representation for mass classification in computer-aided detection on mammograms

    PubMed Central

    2013-01-01

    Background Breast cancer is the leading cause of both incidence and mortality in women population. For this reason, much research effort has been devoted to develop Computer-Aided Detection (CAD) systems for early detection of the breast cancers on mammograms. In this paper, we propose a new and novel dictionary configuration underpinning sparse representation based classification (SRC). The key idea of the proposed algorithm is to improve the sparsity in terms of mass margins for the purpose of improving classification performance in CAD systems. Methods The aim of the proposed SRC framework is to construct separate dictionaries according to the types of mass margins. The underlying idea behind our method is that the separated dictionaries can enhance the sparsity of mass class (true-positive), leading to an improved performance for differentiating mammographic masses from normal tissues (false-positive). When a mass sample is given for classification, the sparse solutions based on corresponding dictionaries are separately solved and combined at score level. Experiments have been performed on both database (DB) named as Digital Database for Screening Mammography (DDSM) and clinical Full Field Digital Mammogram (FFDM) DBs. In our experiments, sparsity concentration in the true class (SCTC) and area under the Receiver operating characteristic (ROC) curve (AUC) were measured for the comparison between the proposed method and a conventional single dictionary based approach. In addition, a support vector machine (SVM) was used for comparing our method with state-of-the-arts classifier extensively used for mass classification. Results Comparing with the conventional single dictionary configuration, the proposed approach is able to improve SCTC of up to 13.9% and 23.6% on DDSM and FFDM DBs, respectively. Moreover, the proposed method is able to improve AUC with 8.2% and 22.1% on DDSM and FFDM DBs, respectively. Comparing to SVM classifier, the proposed method improves AUC with 2.9% and 11.6% on DDSM and FFDM DBs, respectively. Conclusions The proposed dictionary configuration is found to well improve the sparsity of dictionaries, resulting in an enhanced classification performance. Moreover, the results show that the proposed method is better than conventional SVM classifier for classifying breast masses subject to various margins from normal tissues. PMID:24564973

  12. Evaluation of gene expression classification studies: factors associated with classification performance.

    PubMed

    Novianti, Putri W; Roes, Kit C B; Eijkemans, Marinus J C

    2014-01-01

    Classification methods used in microarray studies for gene expression are diverse in the way they deal with the underlying complexity of the data, as well as in the technique used to build the classification model. The MAQC II study on cancer classification problems has found that performance was affected by factors such as the classification algorithm, cross validation method, number of genes, and gene selection method. In this paper, we study the hypothesis that the disease under study significantly determines which method is optimal, and that additionally sample size, class imbalance, type of medical question (diagnostic, prognostic or treatment response), and microarray platform are potentially influential. A systematic literature review was used to extract the information from 48 published articles on non-cancer microarray classification studies. The impact of the various factors on the reported classification accuracy was analyzed through random-intercept logistic regression. The type of medical question and method of cross validation dominated the explained variation in accuracy among studies, followed by disease category and microarray platform. In total, 42% of the between study variation was explained by all the study specific and problem specific factors that we studied together.

  13. A Quantum Hybrid PSO Combined with Fuzzy k-NN Approach to Feature Selection and Cell Classification in Cervical Cancer Detection.

    PubMed

    Iliyasu, Abdullah M; Fatichah, Chastine

    2017-12-19

    A quantum hybrid (QH) intelligent approach that blends the adaptive search capability of the quantum-behaved particle swarm optimisation (QPSO) method with the intuitionistic rationality of traditional fuzzy k -nearest neighbours (Fuzzy k -NN) algorithm (known simply as the Q-Fuzzy approach) is proposed for efficient feature selection and classification of cells in cervical smeared (CS) images. From an initial multitude of 17 features describing the geometry, colour, and texture of the CS images, the QPSO stage of our proposed technique is used to select the best subset features (i.e., global best particles) that represent a pruned down collection of seven features. Using a dataset of almost 1000 images, performance evaluation of our proposed Q-Fuzzy approach assesses the impact of our feature selection on classification accuracy by way of three experimental scenarios that are compared alongside two other approaches: the All-features (i.e., classification without prior feature selection) and another hybrid technique combining the standard PSO algorithm with the Fuzzy k -NN technique (P-Fuzzy approach). In the first and second scenarios, we further divided the assessment criteria in terms of classification accuracy based on the choice of best features and those in terms of the different categories of the cervical cells. In the third scenario, we introduced new QH hybrid techniques, i.e., QPSO combined with other supervised learning methods, and compared the classification accuracy alongside our proposed Q-Fuzzy approach. Furthermore, we employed statistical approaches to establish qualitative agreement with regards to the feature selection in the experimental scenarios 1 and 3. The synergy between the QPSO and Fuzzy k -NN in the proposed Q-Fuzzy approach improves classification accuracy as manifest in the reduction in number cell features, which is crucial for effective cervical cancer detection and diagnosis.

  14. Context aware decision support in neurosurgical oncology based on an efficient classification of endomicroscopic data.

    PubMed

    Li, Yachun; Charalampaki, Patra; Liu, Yong; Yang, Guang-Zhong; Giannarou, Stamatia

    2018-06-13

    Probe-based confocal laser endomicroscopy (pCLE) enables in vivo, in situ tissue characterisation without changes in the surgical setting and simplifies the oncological surgical workflow. The potential of this technique in identifying residual cancer tissue and improving resection rates of brain tumours has been recently verified in pilot studies. The interpretation of endomicroscopic information is challenging, particularly for surgeons who do not themselves routinely review histopathology. Also, the diagnosis can be examiner-dependent, leading to considerable inter-observer variability. Therefore, automatic tissue characterisation with pCLE would support the surgeon in establishing diagnosis as well as guide robot-assisted intervention procedures. The aim of this work is to propose a deep learning-based framework for brain tissue characterisation for context aware diagnosis support in neurosurgical oncology. An efficient representation of the context information of pCLE data is presented by exploring state-of-the-art CNN models with different tuning configurations. A novel video classification framework based on the combination of convolutional layers with long-range temporal recursion has been proposed to estimate the probability of each tumour class. The video classification accuracy is compared for different network architectures and data representation and video segmentation methods. We demonstrate the application of the proposed deep learning framework to classify Glioblastoma and Meningioma brain tumours based on endomicroscopic data. Results show significant improvement of our proposed image classification framework over state-of-the-art feature-based methods. The use of video data further improves the classification performance, achieving accuracy equal to 99.49%. This work demonstrates that deep learning can provide an efficient representation of pCLE data and accurately classify Glioblastoma and Meningioma tumours. The performance evaluation analysis shows the potential clinical value of the technique.

  15. Assessment of the Hong Kong Liver Cancer Staging System in Europe.

    PubMed

    Kolly, Philippe; Reeves, Helen; Sangro, Bruno; Knöpfli, Marina; Candinas, Daniel; Dufour, Jean-François

    2016-06-01

    European and American guidelines have endorsed the Barcelona Clinic Liver Cancer (BCLC) staging system. The aim of this study was to assess the performance of the recently developed Hong Kong Liver Cancer (HKLC) classification as a staging system for hepatocellular carcinoma (HCC) in Europe. We used a pooled set of 1693 HCC patients combining three prospective European cohorts. Discrimination ability between the nine substages and five stages of the HKLC classification system was assessed. To evaluate the predictive power of the HKLC and BCLC staging systems on overall survival, Nagelkerke pseudo R2, Bayesian Information Criterion and Harrell's concordance index were calculated. The number of patients who would benefit from a curative therapy was assessed for both staging systems. The HKLC classification in nine substages shows suboptimal discrimination between the staging groups. The classification in five stages shows better discrimination between groups. However, the BCLC classification performs better than the HKLC classification in the ability to predict overall survival (OS). The HKLC treatment algorithm tags significantly more patients to curative therapy than the BCLC. The BCLC staging system performs better for European patients than the HKLC staging system in predicting OS. Twice more patients are eligible for a curative therapy with the HKLC algorithm; whether this translates in survival benefit remains to be investigated. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  16. Examining applying high performance genetic data feature selection and classification algorithms for colon cancer diagnosis.

    PubMed

    Al-Rajab, Murad; Lu, Joan; Xu, Qiang

    2017-07-01

    This paper examines the accuracy and efficiency (time complexity) of high performance genetic data feature selection and classification algorithms for colon cancer diagnosis. The need for this research derives from the urgent and increasing need for accurate and efficient algorithms. Colon cancer is a leading cause of death worldwide, hence it is vitally important for the cancer tissues to be expertly identified and classified in a rapid and timely manner, to assure both a fast detection of the disease and to expedite the drug discovery process. In this research, a three-phase approach was proposed and implemented: Phases One and Two examined the feature selection algorithms and classification algorithms employed separately, and Phase Three examined the performance of the combination of these. It was found from Phase One that the Particle Swarm Optimization (PSO) algorithm performed best with the colon dataset as a feature selection (29 genes selected) and from Phase Two that the Support Vector Machine (SVM) algorithm outperformed other classifications, with an accuracy of almost 86%. It was also found from Phase Three that the combined use of PSO and SVM surpassed other algorithms in accuracy and performance, and was faster in terms of time analysis (94%). It is concluded that applying feature selection algorithms prior to classification algorithms results in better accuracy than when the latter are applied alone. This conclusion is important and significant to industry and society. Copyright © 2017 Elsevier B.V. All rights reserved.

  17. Breast cancer Ki67 expression preoperative discrimination by DCE-MRI radiomics features

    NASA Astrophysics Data System (ADS)

    Ma, Wenjuan; Ji, Yu; Qin, Zhuanping; Guo, Xinpeng; Jian, Xiqi; Liu, Peifang

    2018-02-01

    To investigate whether quantitative radiomics features extracted from dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) are associated with Ki67 expression of breast cancer. In this institutional review board approved retrospective study, we collected 377 cases Chinese women who were diagnosed with invasive breast cancer in 2015. This cohort included 53 low-Ki67 expression (Ki67 proliferation index less than 14%) and 324 cases with high-Ki67 expression (Ki67 proliferation index more than 14%). A binary-classification of low- vs. high- Ki67 expression was performed. A set of 52 quantitative radiomics features, including morphological, gray scale statistic, and texture features, were extracted from the segmented lesion area. Three most common machine learning classification methods, including Naive Bayes, k-Nearest Neighbor and support vector machine with Gaussian kernel, were employed for the classification and the least absolute shrink age and selection operator (LASSO) method was used to select most predictive features set for the classifiers. Classification performance was evaluated by the area under receiver operating characteristic curve (AUC), accuracy, sensitivity and specificity. The model that used Naive Bayes classification method achieved the best performance than the other two methods, yielding 0.773 AUC value, 0.757 accuracy, 0.777 sensitivity and 0.769 specificity. Our study showed that quantitative radiomics imaging features of breast tumor extracted from DCE-MRI are associated with breast cancer Ki67 expression. Future larger studies are needed in order to further evaluate the findings.

  18. DESNT: A Poor Prognosis Category of Human Prostate Cancer.

    PubMed

    Luca, Bogdan-Alexandru; Brewer, Daniel S; Edwards, Dylan R; Edwards, Sandra; Whitaker, Hayley C; Merson, Sue; Dennis, Nening; Cooper, Rosalin A; Hazell, Steven; Warren, Anne Y; Eeles, Rosalind; Lynch, Andy G; Ross-Adams, Helen; Lamb, Alastair D; Neal, David E; Sethia, Krishna; Mills, Robert D; Ball, Richard Y; Curley, Helen; Clark, Jeremy; Moulton, Vincent; Cooper, Colin S

    2017-03-06

    A critical problem in the clinical management of prostate cancer is that it is highly heterogeneous. Accurate prediction of individual cancer behaviour is therefore not achievable at the time of diagnosis leading to substantial overtreatment. It remains an enigma that, in contrast to breast cancer, unsupervised analyses of global expression profiles have not currently defined robust categories of prostate cancer with distinct clinical outcomes. To devise a novel classification framework for human prostate cancer based on unsupervised mathematical approaches. Our analyses are based on the hypothesis that previous attempts to classify prostate cancer have been unsuccessful because individual samples of prostate cancer frequently have heterogeneous compositions. To address this issue, we applied an unsupervised Bayesian procedure called Latent Process Decomposition to four independent prostate cancer transcriptome datasets obtained using samples from prostatectomy patients and containing between 78 and 182 participants. Biochemical failure was assessed using log-rank analysis and Cox regression analysis. Application of Latent Process Decomposition identified a common process in all four independent datasets examined. Cancers assigned to this process (designated DESNT cancers) are characterized by low expression of a core set of 45 genes, many encoding proteins involved in the cytoskeleton machinery, ion transport, and cell adhesion. For the three datasets with linked prostate-specific antigen failure data following prostatectomy, patients with DESNT cancer exhibited poor outcome relative to other patients (p=2.65×10 -5 , p=4.28×10 -5 , and p=2.98×10 -8 ). When these three datasets were combined the independent predictive value of DESNT membership was p=1.61×10 -7 compared with p=1.00×10 -5 for Gleason sum. A limitation of the study is that only prediction of prostate-specific antigen failure was examined. Our results demonstrate the existence of a novel poor prognosis category of human prostate cancer and will assist in the targeting of therapy, helping avoid treatment-associated morbidity in men with indolent disease. Prostate cancer, unlike breast cancer, does not have a robust classification framework. We propose that this failure has occurred because prostate cancer samples selected for analysis frequently have heterozygous compositions (individual samples are made up of many different parts that each have different characteristics). Applying a mathematical approach that can overcome this problem we identify a novel poor prognosis category of human prostate cancer called DESNT. Copyright © 2017 European Association of Urology. Published by Elsevier B.V. All rights reserved.

  19. Inferring gene dependency network specific to phenotypic alteration based on gene expression data and clinical information of breast cancer.

    PubMed

    Zhou, Xionghui; Liu, Juan

    2014-01-01

    Although many methods have been proposed to reconstruct gene regulatory network, most of them, when applied in the sample-based data, can not reveal the gene regulatory relations underlying the phenotypic change (e.g. normal versus cancer). In this paper, we adopt phenotype as a variable when constructing the gene regulatory network, while former researches either neglected it or only used it to select the differentially expressed genes as the inputs to construct the gene regulatory network. To be specific, we integrate phenotype information with gene expression data to identify the gene dependency pairs by using the method of conditional mutual information. A gene dependency pair (A,B) means that the influence of gene A on the phenotype depends on gene B. All identified gene dependency pairs constitute a directed network underlying the phenotype, namely gene dependency network. By this way, we have constructed gene dependency network of breast cancer from gene expression data along with two different phenotype states (metastasis and non-metastasis). Moreover, we have found the network scale free, indicating that its hub genes with high out-degrees may play critical roles in the network. After functional investigation, these hub genes are found to be biologically significant and specially related to breast cancer, which suggests that our gene dependency network is meaningful. The validity has also been justified by literature investigation. From the network, we have selected 43 discriminative hubs as signature to build the classification model for distinguishing the distant metastasis risks of breast cancer patients, and the result outperforms those classification models with published signatures. In conclusion, we have proposed a promising way to construct the gene regulatory network by using sample-based data, which has been shown to be effective and accurate in uncovering the hidden mechanism of the biological process and identifying the gene signature for phenotypic change.

  20. Clinical relevance of rare germline sequence variants in cancer genes: evolution and application of classification models.

    PubMed

    Spurdle, Amanda B

    2010-06-01

    Multifactorial models developed for BRCA1/2 variant classification have proved very useful for delineating BRCA1/2 variants associated with very high risk of cancer, or with little clinical significance. Recent linkage of this quantitative assessment of risk to clinical management guidelines has provided a basis to standardize variant reporting, variant classification and management of families with such variants, and can theoretically be applied to any disease gene. As proof of principle, the multifactorial approach already shows great promise for application to the evaluation of mismatch repair gene variants identified in families with suspected Lynch syndrome. However there is need to be cautious of the noted limitations and caveats of the current model, some of which may be exacerbated by differences in ascertainment and biological pathways to disease for different cancer syndromes.

  1. 21 CFR 866.6040 - Gene expression profiling test system for breast cancer prognosis.

    Code of Federal Regulations, 2010 CFR

    2010-04-01

    ... cancer prognosis. 866.6040 Section 866.6040 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF... cancer prognosis. (a) Identification. A gene expression profiling test system for breast cancer prognosis... previously diagnosed breast cancer. (b) Classification. Class II (special controls). The special control is...

  2. 21 CFR 866.6040 - Gene expression profiling test system for breast cancer prognosis.

    Code of Federal Regulations, 2011 CFR

    2011-04-01

    ... cancer prognosis. 866.6040 Section 866.6040 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF... cancer prognosis. (a) Identification. A gene expression profiling test system for breast cancer prognosis... previously diagnosed breast cancer. (b) Classification. Class II (special controls). The special control is...

  3. 21 CFR 866.6040 - Gene expression profiling test system for breast cancer prognosis.

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ... cancer prognosis. 866.6040 Section 866.6040 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF... cancer prognosis. (a) Identification. A gene expression profiling test system for breast cancer prognosis... previously diagnosed breast cancer. (b) Classification. Class II (special controls). The special control is...

  4. 21 CFR 866.6040 - Gene expression profiling test system for breast cancer prognosis.

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ... cancer prognosis. 866.6040 Section 866.6040 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF... cancer prognosis. (a) Identification. A gene expression profiling test system for breast cancer prognosis... previously diagnosed breast cancer. (b) Classification. Class II (special controls). The special control is...

  5. 21 CFR 866.6040 - Gene expression profiling test system for breast cancer prognosis.

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ... cancer prognosis. 866.6040 Section 866.6040 Food and Drugs FOOD AND DRUG ADMINISTRATION, DEPARTMENT OF... cancer prognosis. (a) Identification. A gene expression profiling test system for breast cancer prognosis... previously diagnosed breast cancer. (b) Classification. Class II (special controls). The special control is...

  6. The Anti-Cancer Effect of Polyphenols against Breast Cancer and Cancer Stem Cells: Molecular Mechanisms

    PubMed Central

    Abdal Dayem, Ahmed; Choi, Hye Yeon; Yang, Gwang-Mo; Kim, Kyeongseok; Saha, Subbroto Kumar; Cho, Ssang-Goo

    2016-01-01

    The high incidence of breast cancer in developed and developing countries, and its correlation to cancer-related deaths, has prompted concerned scientists to discover novel alternatives to deal with this challenge. In this review, we will provide a brief overview of polyphenol structures and classifications, as well as on the carcinogenic process. The biology of breast cancer cells will also be discussed. The molecular mechanisms involved in the anti-cancer activities of numerous polyphenols, against a wide range of breast cancer cells, in vitro and in vivo, will be explained in detail. The interplay between autophagy and apoptosis in the anti-cancer activity of polyphenols will also be highlighted. In addition, the potential of polyphenols to target cancer stem cells (CSCs) via various mechanisms will be explained. Recently, the use of natural products as chemotherapeutics and chemopreventive drugs to overcome the side effects and resistance that arise from using chemical-based agents has garnered the attention of the scientific community. Polyphenol research is considered a promising field in the treatment and prevention of breast cancer. PMID:27657126

  7. Building a model for disease classification integration in oncology, an approach based on the national cancer institute thesaurus.

    PubMed

    Jouhet, Vianney; Mougin, Fleur; Bréchat, Bérénice; Thiessard, Frantz

    2017-02-07

    Identifying incident cancer cases within a population remains essential for scientific research in oncology. Data produced within electronic health records can be useful for this purpose. Due to the multiplicity of providers, heterogeneous terminologies such as ICD-10 and ICD-O-3 are used for oncology diagnosis recording purpose. To enable disease identification based on these diagnoses, there is a need for integrating disease classifications in oncology. Our aim was to build a model integrating concepts involved in two disease classifications, namely ICD-10 (diagnosis) and ICD-O-3 (topography and morphology), despite their structural heterogeneity. Based on the NCIt, a "derivative" model for linking diagnosis and topography-morphology combinations was defined and built. ICD-O-3 and ICD-10 codes were then used to instantiate classes of the "derivative" model. Links between terminologies obtained through the model were then compared to mappings provided by the Surveillance, Epidemiology, and End Results (SEER) program. The model integrated 42% of neoplasm ICD-10 codes (excluding metastasis), 98% of ICD-O-3 morphology codes (excluding metastasis) and 68% of ICD-O-3 topography codes. For every codes instantiating at least a class in the "derivative" model, comparison with SEER mappings reveals that all mappings were actually available in the model as a link between the corresponding codes. We have proposed a method to automatically build a model for integrating ICD-10 and ICD-O-3 based on the NCIt. The resulting "derivative" model is a machine understandable resource that enables an integrated view of these heterogeneous terminologies. The NCIt structure and the available relationships can help to bridge disease classifications taking into account their structural and granular heterogeneities. However, (i) inconsistencies exist within the NCIt leading to misclassifications in the "derivative" model, (ii) the "derivative" model only integrates a part of ICD-10 and ICD-O-3. The NCIt is not sufficient for integration purpose and further work based on other termino-ontological resources is needed in order to enrich the model and avoid identified inconsistencies.

  8. [Feature extraction for breast cancer data based on geometric algebra theory and feature selection using differential evolution].

    PubMed

    Li, Jing; Hong, Wenxue

    2014-12-01

    The feature extraction and feature selection are the important issues in pattern recognition. Based on the geometric algebra representation of vector, a new feature extraction method using blade coefficient of geometric algebra was proposed in this study. At the same time, an improved differential evolution (DE) feature selection method was proposed to solve the elevated high dimension issue. The simple linear discriminant analysis was used as the classifier. The result of the 10-fold cross-validation (10 CV) classification of public breast cancer biomedical dataset was more than 96% and proved superior to that of the original features and traditional feature extraction method.

  9. Cancer classification through filtering progressive transductive support vector machine based on gene expression data

    NASA Astrophysics Data System (ADS)

    Lu, Xinguo; Chen, Dan

    2017-08-01

    Traditional supervised classifiers neglect a large amount of data which not have sufficient follow-up information, only work with labeled data. Consequently, the small sample size limits the advancement of design appropriate classifier. In this paper, a transductive learning method which combined with the filtering strategy in transductive framework and progressive labeling strategy is addressed. The progressive labeling strategy does not need to consider the distribution of labeled samples to evaluate the distribution of unlabeled samples, can effective solve the problem of evaluate the proportion of positive and negative samples in work set. Our experiment result demonstrate that the proposed technique have great potential in cancer prediction based on gene expression.

  10. [Guideline development for rehabilitation of breast cancer patients - phase 2: findings from the classification of therapeutic procedures, KTL-data-analysis].

    PubMed

    Domann, U; Brüggemann, S; Klosterhuis, H; Weis, J

    2007-08-01

    Aim of this project is the development of an evidence based guideline for the rehabilitation of breast cancer patients, funded by the German Pension Insurance scheme. The project consists of four phases. This paper is focused on the 2nd phase, i.e., analysis of procedures in rehabilitation based on evidence based therapeutic modules. As a result of a systematic literature review 14 therapeutic modules were defined. From a total of 840 possible KTL Codes (Klassifikation Therapeutischer Leistungen, Classification of therapeutic procedures), 229 could be assigned to these modules. These analyses are based on 24685 patients in 57 rehabilitation clinics, who had been treated in 2003. For these modules the number of patients having received those interventions as well as the duration of the modules were calculated. The data were analysed with respect to the influence of age and comorbidity. Moreover, differences between rehabilitation clinics were investigated according to the category of interventions. Our findings show great variability in the use of the therapeutic modules. Therapeutic modules like Physiotherapy (91.6%), Training Therapy (85.2%) and Information (97.8%) are provided to most of the patients. Younger patients receive more treatments than older patients, and patients with higher comorbidity receive more Physiotherapie, Lymphoedema Therapy and Psychological Interventions than patients without comorbidities. Data analysis shows wide interindividual variability with regard to the therapeutic modules. This variability is related to age and comorbidity of the patients. Furthermore, great differences were found between the rehabilitation clinics concerning the use of the various interventions. This variability supports the necessity of developing and implementing an evidence based guideline for the rehabilitation of breast cancer patients. The next step will be discussing these findings with experts from science and clinical practice.

  11. Computer-aided diagnostics of screening mammography using content-based image retrieval

    NASA Astrophysics Data System (ADS)

    Deserno, Thomas M.; Soiron, Michael; de Oliveira, Júlia E. E.; de A. Araújo, Arnaldo

    2012-03-01

    Breast cancer is one of the main causes of death among women in occidental countries. In the last years, screening mammography has been established worldwide for early detection of breast cancer, and computer-aided diagnostics (CAD) is being developed to assist physicians reading mammograms. A promising method for CAD is content-based image retrieval (CBIR). Recently, we have developed a classification scheme of suspicious tissue pattern based on the support vector machine (SVM). In this paper, we continue moving towards automatic CAD of screening mammography. The experiments are based on in total 10,509 radiographs that have been collected from different sources. From this, 3,375 images are provided with one and 430 radiographs with more than one chain code annotation of cancerous regions. In different experiments, this data is divided into 12 and 20 classes, distinguishing between four categories of tissue density, three categories of pathology and in the 20 class problem two categories of different types of lesions. Balancing the number of images in each class yields 233 and 45 images remaining in each of the 12 and 20 classes, respectively. Using a two-dimensional principal component analysis, features are extracted from small patches of 128 x 128 pixels and classified by means of a SVM. Overall, the accuracy of the raw classification was 61.6 % and 52.1 % for the 12 and the 20 class problem, respectively. The confusion matrices are assessed for detailed analysis. Furthermore, an implementation of a SVM-based CBIR system for CADx in screening mammography is presented. In conclusion, with a smarter patch extraction, the CBIR approach might reach precision rates that are helpful for the physicians. This, however, needs more comprehensive evaluation on clinical data.

  12. Validation of the prognostic gene portfolio, ClinicoMolecular Triad Classification, using an independent prospective breast cancer cohort and external patient populations

    PubMed Central

    2014-01-01

    Introduction Using genome-wide expression profiles of a prospective training cohort of breast cancer patients, ClinicoMolecular Triad Classification (CMTC) was recently developed to classify breast cancers into three clinically relevant groups to aid treatment decisions. CMTC was found to be both prognostic and predictive in a large external breast cancer cohort in that study. This study serves to validate the reproducibility of CMTC and its prognostic value using independent patient cohorts. Methods An independent internal cohort (n = 284) and a new external cohort (n = 2,181) were used to validate the association of CMTC between clinicopathological factors, 12 known gene signatures, two molecular subtype classifiers, and 19 oncogenic signalling pathway activities, and to reproduce the abilities of CMTC to predict clinical outcomes of breast cancer. In addition, we also updated the outcome data of the original training cohort (n = 147). Results The original training cohort reached a statistically significant difference (p < 0.05) in disease-free survivals between the three CMTC groups after an additional two years of follow-up (median = 55 months). The prognostic value of the triad classification was reproduced in the second independent internal cohort and the new external validation cohort. CMTC achieved even higher prognostic significance when all available patients were analyzed (n = 4,851). Oncogenic pathways Myc, E2F1, Ras and β-catenin were again implicated in the high-risk groups. Conclusions Both prospective internal cohorts and the independent external cohorts reproduced the triad classification of CMTC and its prognostic significance. CMTC is an independent prognostic predictor, and it outperformed 12 other known prognostic gene signatures, molecular subtype classifications, and all other standard prognostic clinicopathological factors. Our results support further development of CMTC portfolio into a guide for personalized breast cancer treatments. PMID:24996446

  13. Early developed ASD (adjacent segmental disease) in patients after surgical treatment of the spine due to cancer metastases.

    PubMed

    Guzik, Grzegorz

    2017-05-12

    The causes of ASD are still relatively unknown. Correlation between clinical status of patients and radiological MRI findings is of primary importance. The radiological classifications proposed by Pfirmann and Oner are most commonly used to assess intradiscal degenerative changes. The aim of the study was to assess the influence of the extension of spine fixation on the risk of developing ASD in a short time after surgery. A total of 332 patients with spinal tumors were treated in our hospital between 2010 and 2013. Of these patients, 287 underwent surgeries. A follow-up MRI examination was performed 12 months after surgical treatment. The study population comprised of 194 patients. Among metastases, breast cancer was predominant (29%); neurological deficits were detected in 76 patients. Metastases were seen in the thoracic (45%) and lumbar (30%) spine; in 25% of cases, they were of multisegmental character. Pathological fractures concerned 88% of the patients. Statistical calculations were made using the χ2 test. Statistical analysis was done using the Statistica v. 10 software. A p value <0.05 was accepted as statistically significant. The study population was divided on seven groups according to applied treatment. Clinical signs of ASD were noted in only seven patients. Two patients had symptoms of nerve root irritation in the lumbar spine. Twenty-two patients (11%) were diagnosed with ASD according to the MRI classifications by Oner, Rijt, and Ramos, while the more sensitive Pfirmann classification allowed to detect the disease in 46 patients (24%). Healthy or almost healthy discs of Oner type I correlated with the criteria of Pfirmann types II and III. The percentage of the incidence of ASD diagnosed 1 year after the surgery using the Pfirmann classifications was significantly higher than diagnosed according to the clinical examination. The incidence of ASD in patients after spine surgeries due to cancer metastases does not differ between the study groups. ASD detectability based on clinical signs is significantly lower than ASD detectability based on MR images according to the system by Pfirrmann et.al. ASD risk increase among patients with multilevel fixation.

  14. Textural pattern classification for oral squamous cell carcinoma.

    PubMed

    Rahman, T Y; Mahanta, L B; Chakraborty, C; DAS, A K; Sarma, J D

    2018-01-01

    Despite being an area of cancer with highest worldwide incidence, oral cancer yet remains to be widely researched. Studies on computer-aided analysis of pathological slides of oral cancer contribute a lot to the diagnosis and treatment of the disease. Some researches in this direction have been carried out on oral submucous fibrosis. In this work an approach for analysing abnormality based on textural features present in squamous cell carcinoma histological slides have been considered. Histogram and grey-level co-occurrence matrix approaches for extraction of textural features from biopsy images with normal and malignant cells are used here. Further, we have used linear support vector machine classifier for automated diagnosis of the oral cancer, which gives 100% accuracy. © 2017 The Authors Journal of Microscopy © 2017 Royal Microscopical Society.

  15. Tissues from population-based cancer registries: a novel approach to increasing research potential.

    PubMed

    Goodman, Marc T; Hernandez, Brenda Y; Hewitt, Stephen; Lynch, Charles F; Coté, Timothy R; Frierson, Henry F; Moskaluk, Christopher A; Killeen, Jeffrey L; Cozen, Wendy; Key, Charles R; Clegg, Limin; Reichman, Marsha; Hankey, Benjamin F; Edwards, Brenda

    2005-07-01

    Population-based cancer registries, such as those included in the Surveillance, Epidemiology, and End-Results (SEER) Program, offer tremendous research potential beyond traditional surveillance activities. We describe the expansion of SEER registries to gather formalin-fixed, paraffin-embedded tissue from cancer patients on a population basis. Population-based tissue banks have the advantage of providing an unbiased sampling frame for evaluating the public health impact of genes or protein targets that may be used for therapeutic or diagnostic purposes in defined communities. Such repositories provide a unique resource for testing new molecular classification schemes for cancer, validating new biologic markers of malignancy, prognosis and progression, assessing therapeutic targets, and measuring allele frequencies of cancer-associated genetic polymorphisms or germline mutations in representative samples. The assembly of tissue microarrays will allow for the use of rapid, large-scale protein-expression profiling of tumor samples while limiting depletion of this valuable resource. Access to biologic specimens through SEER registries will provide researchers with demographic, clinical, and risk factor information on cancer patients with assured data quality and completeness. Clinical outcome data, such as disease-free survival, can be correlated with previously validated prognostic markers. Furthermore, the anonymity of the study subject can be protected through rigorous standards of confidentiality. SEER-based tissue resources represent a step forward in true, population-based tissue repositories of tumors from US patients and may serve as a foundation for molecular epidemiology studies of cancer in this country.

  16. A novel artificial neural network method for biomedical prediction based on matrix pseudo-inversion.

    PubMed

    Cai, Binghuang; Jiang, Xia

    2014-04-01

    Biomedical prediction based on clinical and genome-wide data has become increasingly important in disease diagnosis and classification. To solve the prediction problem in an effective manner for the improvement of clinical care, we develop a novel Artificial Neural Network (ANN) method based on Matrix Pseudo-Inversion (MPI) for use in biomedical applications. The MPI-ANN is constructed as a three-layer (i.e., input, hidden, and output layers) feed-forward neural network, and the weights connecting the hidden and output layers are directly determined based on MPI without a lengthy learning iteration. The LASSO (Least Absolute Shrinkage and Selection Operator) method is also presented for comparative purposes. Single Nucleotide Polymorphism (SNP) simulated data and real breast cancer data are employed to validate the performance of the MPI-ANN method via 5-fold cross validation. Experimental results demonstrate the efficacy of the developed MPI-ANN for disease classification and prediction, in view of the significantly superior accuracy (i.e., the rate of correct predictions), as compared with LASSO. The results based on the real breast cancer data also show that the MPI-ANN has better performance than other machine learning methods (including support vector machine (SVM), logistic regression (LR), and an iterative ANN). In addition, experiments demonstrate that our MPI-ANN could be used for bio-marker selection as well. Copyright © 2013 Elsevier Inc. All rights reserved.

  17. [Research progress in molecular classification of gastric cancer].

    PubMed

    Zhou, Menglong; Li, Guichao; Zhang, Zhen

    2016-09-25

    Gastric cancer(GC) is a highly heterogeneous malignancy. The present widely used histopathological classifications have gradually failed to meet the needs of individualized diagnosis and treatment. Development of technologies such as microarray and next-generation sequencing (NGS) has allowed GC to be studied at the molecular level. Mechanisms about tumorigenesis and progression of GC can be elucidated in the aspects of gene mutations, chromosomal alterations, transcriptional and epigenetic changes, on the basis of which GC can be divided into several subtypes. The classifications of Tan's, Lei's, TCGA and ACRG are relatively comprehensive. Especially the TCGA and ACRG classifications have large sample size and abundant molecular profiling data, thus, the genomic characteristics of GC can be depicted more accurately. However, significant differences between both classifications still exist so that they cannot be substituted for each other. So far there is no widely accepted molecular classification of GC. Compared with TCGA classification, ACRG system may have more clinical significance in Chinese GC patients since the samples are mostly from Asian population and show better association with prognosis. The molecular classification of GC may provide the theoretical and experimental basis for early diagnosis, therapeutic efficacy prediction and treatment stratification while their clinical application is still limited. Future work should involve the application of molecular classifications in the clinical settings for improving the medical management of GC.

  18. Logistic regression model for diagnosis of transition zone prostate cancer on multi-parametric MRI.

    PubMed

    Dikaios, Nikolaos; Alkalbani, Jokha; Sidhu, Harbir Singh; Fujiwara, Taiki; Abd-Alazeez, Mohamed; Kirkham, Alex; Allen, Clare; Ahmed, Hashim; Emberton, Mark; Freeman, Alex; Halligan, Steve; Taylor, Stuart; Atkinson, David; Punwani, Shonit

    2015-02-01

    We aimed to develop logistic regression (LR) models for classifying prostate cancer within the transition zone on multi-parametric magnetic resonance imaging (mp-MRI). One hundred and fifty-five patients (training cohort, 70 patients; temporal validation cohort, 85 patients) underwent mp-MRI and transperineal-template-prostate-mapping (TPM) biopsy. Positive cores were classified by cancer definitions: (1) any-cancer; (2) definition-1 [≥Gleason 4 + 3 or ≥ 6 mm cancer core length (CCL)] [high risk significant]; and (3) definition-2 (≥Gleason 3 + 4 or ≥ 4 mm CCL) cancer [intermediate-high risk significant]. For each, logistic-regression mp-MRI models were derived from the training cohort and validated internally and with the temporal cohort. Sensitivity/specificity and the area under the receiver operating characteristic (ROC-AUC) curve were calculated. LR model performance was compared to radiologists' performance. Twenty-eight of 70 patients from the training cohort, and 25/85 patients from the temporal validation cohort had significant cancer on TPM. The ROC-AUC of the LR model for classification of cancer was 0.73/0.67 at internal/temporal validation. The radiologist A/B ROC-AUC was 0.65/0.74 (temporal cohort). For patients scored by radiologists as Prostate Imaging Reporting and Data System (Pi-RADS) score 3, sensitivity/specificity of radiologist A 'best guess' and LR model was 0.14/0.54 and 0.71/0.61, respectively; and radiologist B 'best guess' and LR model was 0.40/0.34 and 0.50/0.76, respectively. LR models can improve classification of Pi-RADS score 3 lesions similar to experienced radiologists. • MRI helps find prostate cancer in the anterior of the gland • Logistic regression models based on mp-MRI can classify prostate cancer • Computers can help confirm cancer in areas doctors are uncertain about.

  19. Multivariate classification of infrared spectra of cell and tissue samples

    DOEpatents

    Haaland, David M.; Jones, Howland D. T.; Thomas, Edward V.

    1997-01-01

    Multivariate classification techniques are applied to spectra from cell and tissue samples irradiated with infrared radiation to determine if the samples are normal or abnormal (cancerous). Mid and near infrared radiation can be used for in vivo and in vitro classifications using at least different wavelengths.

  20. Hyaluronic Acid and Hyaluronidase in Prostate Cancer: Evaluation of Their Therapeutic and Prognostic Potential

    DTIC Science & Technology

    2005-01-01

    PAGES No subject terms provided. 75 16. PRICE CODE 17. SECURITY CLASSIFICATION 18 . SECURITY CLASSIFICATION 19. SECURITY CLASSIFICATION 20. LIMITATION OF...Prescribed by ANSI Std. Z39- 18 298-102 Lokeshwar, Vinata B Table of Contents Cover...1 Body ................................................................................................. 2- 18 Key Research

  1. In Vivo Testing of Chemopreventive Agents Using the Dog Model of Spontaneous Prostate Carcinogenesis

    DTIC Science & Technology

    2001-03-01

    SECURITY CLASSIFICATION 18. SECURITY CLASSIFICATION 19. SECURITY CLASSIFICATION 20. LIMITATION OF ABSTRACT OF REPORT OF THIS PAGE OF ABSTRACT...retarded tate Cancer Research Program (PC-970492, awarded to population. J Gerontol 1969;24:395-411. 19. Hayflick LH. How and why we age. Exp Gerontol

  2. Automatic machine learning based prediction of cardiovascular events in lung cancer screening data

    NASA Astrophysics Data System (ADS)

    de Vos, Bob D.; de Jong, Pim A.; Wolterink, Jelmer M.; Vliegenthart, Rozemarijn; Wielingen, Geoffrey V. F.; Viergever, Max A.; Išgum, Ivana

    2015-03-01

    Calcium burden determined in CT images acquired in lung cancer screening is a strong predictor of cardiovascular events (CVEs). This study investigated whether subjects undergoing such screening who are at risk of a CVE can be identified using automatic image analysis and subject characteristics. Moreover, the study examined whether these individuals can be identified using solely image information, or if a combination of image and subject data is needed. A set of 3559 male subjects undergoing Dutch-Belgian lung cancer screening trial was included. Low-dose non-ECG synchronized chest CT images acquired at baseline were analyzed (1834 scanned in the University Medical Center Groningen, 1725 in the University Medical Center Utrecht). Aortic and coronary calcifications were identified using previously developed automatic algorithms. A set of features describing number, volume and size distribution of the detected calcifications was computed. Age of the participants was extracted from image headers. Features describing participants' smoking status, smoking history and past CVEs were obtained. CVEs that occurred within three years after the imaging were used as outcome. Support vector machine classification was performed employing different feature sets using sets of only image features, or a combination of image and subject related characteristics. Classification based solely on the image features resulted in the area under the ROC curve (Az) of 0.69. A combination of image and subject features resulted in an Az of 0.71. The results demonstrate that subjects undergoing lung cancer screening who are at risk of CVE can be identified using automatic image analysis. Adding subject information slightly improved the performance.

  3. Recent Developments in Tissue-type Imaging(TTI) for Planning and Monitoring Treatment of Prostate Cancer

    PubMed Central

    Feleppa, Ernest J.; Porter, Christopher R.; Ketterling, Jeffrey; Lee, Paul; Dasgupta, Shreedevi; Urban, Stella; Kalisz, Andrew

    2006-01-01

    Because current methods of imaging prostate cancer are inadequate, biopsies cannot be effectively guided and treatment cannot be effectively planned and targeted. Therefore, our research is aimed at ultrasonically characterizing cancerous prostate tissue so that we can image it more effectively and thereby provide improved means of detecting, treating and monitoring prostate cancer. We base our characterization methods on spectrum analysis of radio frequency (rf) echo signals combined with clinical variables such as prostate-specific antigen (PSA). Tissue typing using these parameters is performed by artificial neural networks. We employedand evaluated different approaches to data partitioning into training, validation, and test sets and different neural network configuration options. In this manner, we sought to determine what neural network configuration is optimal for these data and also to assess possible bias that might exist due to correlations among different data entries among the data for a given patient. The classification efficacy of each neural network configuration and data-partitioning method was measured using relative-operating-characteristic (ROC) methods. Neural network classification based on spectral parameters combined with clinical data generally produced ROC-curve areas of 0.80 compared to curve areas of 0.64 for conventional transrectal ultrasound imaging combined with clinical data. We then used the optimal neural network configuration to generate lookup tables that translate local spectral parameter values and global clinical-variable values into pixel values in tissue-type images (TTIs). TTIs continue to show can cerous regions successfully, and may prove to be particularly useful clinically in combination with other ultrasonic and nonultrasonic methods, e.g., magnetic-resonance spectroscopy. PMID:15754797

  4. The seventh tumour-node-metastasis staging system for lung cancer: Sequel or prequel?

    PubMed

    van Meerbeeck, Jan P; Janssens, Annelies

    2013-09-01

    Anatomical cancer extent is an important predictor of prognosis and determines treatment choices. In non-small-cell lung cancer (NSCLC) the tumour-node-metastasis (TNM) classification developed by Pierre Denoix replaced in 1968 the Veterans Administration Lung cancer Group (VALG) classification, which was still in use for small-cell lung cancer (SCLC). Clifton Mountain suggested several improvements based on a database of mostly surgically treated United States (US) patients from a limited number of centres. This database was pivotal for a uniform reporting of lung cancer extent by the American Joint Committee of Cancer (AJCC) and the International Union against Cancer (IUCC), but it suffered increasingly from obsolete diagnostic and staging procedures and did not reflect new treatment modalities. Moreover, its findings were not externally validated in large Japanese and European databases, resulting in persisting controversies which could not be solved with the available database. The use of different mediastinal lymph-node maps in Japan, the (US) and Europe facilitated neither the exchange nor the comparison of treatment results. Peter Goldstraw, a United Kingdom (UK) thoracic surgeon, started the process of updating the sixth version in 1996 and brought it to a good end 10 years later. His goals were to improve the TNM system in lung cancer by addressing the ongoing controversies, to validate the modifications and additional descriptors, to validate the TNM for use in staging SCLC and carcinoid tumours, to propose a new uniform lymph-node map and to investigate the prognostic value of non-anatomical factors. A staging committee was formed within the International Association for the Study of Lung Cancer (IASLC) - which supervised the collection of the retrospective data from >100,000 patients with lung cancer - treated throughout the world between 1990 and 2000, analyse them with the help of solid statistics and validate externally with the Surveillance, Epidemiology and End Results (SEER) database. The ten modifications and the mediastinal lymph-node map - which were proposed in 2007 and adopted by the AJCC and IUCC in their respective seventh revision of the TNM system - were implemented as of 2010 and were rapidly adopted by the thoracic oncology community and cancer registries. As expected, not all controversies could be fully addressed, and the need for a prospective data set containing more granular information was felt early on. This data set of 25,000 consecutive incident cases will form the base for the eighth revision in 2017 and is currently being collected. Other threats are the role of stage migration and the increasing number of biological factors interfering with disease extent for prognostication. The latter issue will be addressed by the creation of a prognostic index, including several prognostic factors, of which stage will be one. For the time being, the seventh TNM classification is considered the gold standard for the description of disease extent, initial treatment allocation and the reporting of treatment results. The uniform use of the TNM descriptors and the lymph-node map by all involved in lung cancer care is to be considered a process indicator of quality.

  5. Application of time series discretization using evolutionary programming for classification of precancerous cervical lesions.

    PubMed

    Acosta-Mesa, Héctor-Gabriel; Rechy-Ramírez, Fernando; Mezura-Montes, Efrén; Cruz-Ramírez, Nicandro; Hernández Jiménez, Rodolfo

    2014-06-01

    In this work, we present a novel application of time series discretization using evolutionary programming for the classification of precancerous cervical lesions. The approach optimizes the number of intervals in which the length and amplitude of the time series should be compressed, preserving the important information for classification purposes. Using evolutionary programming, the search for a good discretization scheme is guided by a cost function which considers three criteria: the entropy regarding the classification, the complexity measured as the number of different strings needed to represent the complete data set, and the compression rate assessed as the length of the discrete representation. This discretization approach is evaluated using a time series data based on temporal patterns observed during a classical test used in cervical cancer detection; the classification accuracy reached by our method is compared with the well-known times series discretization algorithm SAX and the dimensionality reduction method PCA. Statistical analysis of the classification accuracy shows that the discrete representation is as efficient as the complete raw representation for the present application, reducing the dimensionality of the time series length by 97%. This representation is also very competitive in terms of classification accuracy when compared with similar approaches. Copyright © 2014 Elsevier Inc. All rights reserved.

  6. Super-Penetrant Androgen Receptor: Overcoming Enzalutamide Sensitivity in Castration-Resistant Prostate Cancer

    DTIC Science & Technology

    2016-07-01

    Prostate Cancer Research Symposium- Prostate Cancer Epigenetic Reprogramming of the Androgen Receptor in Castration Resistant Prostate Cancer , May19... cancer cells rely critically on the androgen receptor (AR) for initiation, growth and progression to castration resistant prostate cancer (CRPC...Androgen receptor, castration resistant prostate cancer , Enzalutamide , kinases. 16. SECURITY CLASSIFICATION OF: 17. LIMITATION OF ABSTRACT 18. NUMBER

  7. Multiplex profiling of tumor-associated proteolytic activity in serum of colorectal cancer patients.

    PubMed

    Yepes, Diego; Costina, Victor; Pilz, Lothar R; Hofheinz, Ralf; Neumaier, Michael; Findeisen, Peter

    2014-06-01

    The monitoring of tumor-associated protease activity in blood specimens has recently been proposed as new diagnostic tool in cancer research. In this paper, we describe the screening of a peptide library for identification of reporter peptides (RPs) that are selectively cleaved in serum specimens from colorectal cancer patients and investigate the benefits of RP multiplexing. A library of 144 RPs was constructed that contained amino acid sequences of abundant plasma proteins. Proteolytic cleavage of RPs was monitored with MS. Five RPs that were selectively cleaved in serum specimens from tumor patients were selected for further validation in serum specimens of colorectal tumor patients (n = 30) and nonmalignant controls (n = 60). RP spiking and subsequent quantification of proteolytic fragments with LC-MS showed good reproducibility with CVs always below 26%. The linear discriminant analysis and PCA revealed that a combination of RPs for diagnostic classification is superior to single markers. Classification accuracy reached 88% (79/90) when all five markers were combined. Functional protease profiling with RPs might improve the laboratory-based diagnosis, monitoring and prognosis of malignant disease, and has to be evaluated thoroughly in future studies. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  8. Optical coherence tomography (OCT) imaging and computer aided diagnosis of human cervical tissue specimens

    NASA Astrophysics Data System (ADS)

    Bazant-Hegemark, F.; Stone, N.; Read, M. D.; McCarthy, K.; Wang, R. K.

    2007-07-01

    The keyword for management of cervical cancer is prevention. The present program within the UK, the 'National Health Service (NHS) cervical screening programme' (NHSCSP), is based on cytology. Although the program has reduced the incidence of cervical cancer, this program requires patient follow ups and relies on diagnostic biopsying. There is potential for reducing costs and workload within the NHS, and relieving anxiety of patients. In this study, Optical Coherence Tomography (OCT) was investigated for its capability to improve this situation. Our time domain bench top system used a superluminescent diode (Superlum), centre wave length ~1.3 μm, resolution (air) ~15 μm. Tissue samples were obtained according to the ethics approval by Gloucestershire LREC, Nr. 05/Q2005/123. 1387 images of 199 participants have been compared with histopathology results and categorized accordingly. Our OCT images do not reach the clarity and resolution of histopathology. Further, establishing and recognizing features of diagnostic significance seems difficult. Automated classification would allow one to take decision-making to move from the subjective appraisal of a physician to an objective assessment. Hence we investigated a classification algorithm for its ability in recognizing pre-cancerous stages from OCT images. The initial results show promise.

  9. American Thyroid Association Guidelines on the Management of Thyroid Nodules and Differentiated Thyroid Cancer Task Force Review and Recommendation on the Proposed Renaming of Encapsulated Follicular Variant Papillary Thyroid Carcinoma Without Invasion to Noninvasive Follicular Thyroid Neoplasm with Papillary-Like Nuclear Features.

    PubMed

    Haugen, Bryan R; Sawka, Anna M; Alexander, Erik K; Bible, Keith C; Caturegli, Patrizio; Doherty, Gerard M; Mandel, Susan J; Morris, John C; Nassar, Aziza; Pacini, Furio; Schlumberger, Martin; Schuff, Kathryn; Sherman, Steven I; Somerset, Hilary; Sosa, Julie Ann; Steward, David L; Wartofsky, Leonard; Williams, Michelle D

    2017-04-01

    American Thyroid Association (ATA) leadership asked the ATA Thyroid Nodules and Differentiated Thyroid Cancer Guidelines Task Force to review, comment on, and make recommendations related to the suggested new classification of encapsulated follicular variant papillary thyroid carcinoma (eFVPTC) without capsular or vascular invasion to noninvasive follicular thyroid neoplasm with papillary-like nuclear features (NIFTP). The task force consists of members from the 2015 guidelines task force with the recusal of three members who were authors on the paper under review. Four pathologists and one endocrinologist were added for this specific review. The manuscript proposing the new classification and related literature were assessed. It is recommended that the histopathologic nomenclature for eFVPTC without invasion be reclassified as a NIFTP, given the excellent prognosis of this neoplastic variant. This is a weak recommendation based on moderate-quality evidence. It is also noted that prospective studies are needed to validate the observed patient outcomes (and test performance in predicting thyroid cancer outcomes), as well as implications on patients' psychosocial health and economics.

  10. Evaluation of a multi-fibre needle Raman probe for tissue analysis

    NASA Astrophysics Data System (ADS)

    Fullwood, Leanne M.; Iping Petterson, Ingeborg E.; Dudgeon, Alexander P.; Lloyd, Gavin R.; Kendall, Catherine; Hall, Charlie; Day, John C. C.; Stone, Nick

    2016-03-01

    Raman spectroscopy is a rapid technique for the identification of cancers. Its coupling with a hypodermic needle provides a minimally invasive instrument with the potential to aid real time assessment of suspicious lesions in vivo and guide surgery. A fibre optic Raman needle probe was utilised in this study to evaluate the classification ability of the instrument as a diagnostic tool together with multivariate analysis, through measurements of tissues from different animal species as well as various different porcine tissue types. Cross validation was performed and preliminary classification accuracies were calculated as 100% for the identification of tissue type and 97.5% for the identification of animal species. A lymph node sample was also measured using the needle probe to assess the use of the technique for human tissue and hence its efficiency as a clinical instrument. This needle probe has been demonstrated to have the capabilities to classify tissue samples based on their biochemical components. The Raman needle probe also has the potential to act as a diagnostic and surgical tool to delineate cancerous from non-cancerous cells in real time, thus assisting complete removal of a tumour.

  11. Colorectal Cancer Classification and Cell Heterogeneity: A Systems Oncology Approach

    PubMed Central

    Blanco-Calvo, Moisés; Concha, Ángel; Figueroa, Angélica; Garrido, Federico; Valladares-Ayerbes, Manuel

    2015-01-01

    Colorectal cancer is a heterogeneous disease that manifests through diverse clinical scenarios. During many years, our knowledge about the variability of colorectal tumors was limited to the histopathological analysis from which generic classifications associated with different clinical expectations are derived. However, currently we are beginning to understand that under the intense pathological and clinical variability of these tumors there underlies strong genetic and biological heterogeneity. Thus, with the increasing available information of inter-tumor and intra-tumor heterogeneity, the classical pathological approach is being displaced in favor of novel molecular classifications. In the present article, we summarize the most relevant proposals of molecular classifications obtained from the analysis of colorectal tumors using powerful high throughput techniques and devices. We also discuss the role that cancer systems biology may play in the integration and interpretation of the high amount of data generated and the challenges to be addressed in the future development of precision oncology. In addition, we review the current state of implementation of these novel tools in the pathological laboratory and in clinical practice. PMID:26084042

  12. Rough set soft computing cancer classification and network: one stone, two birds.

    PubMed

    Zhang, Yue

    2010-07-15

    Gene expression profiling provides tremendous information to help unravel the complexity of cancer. The selection of the most informative genes from huge noise for cancer classification has taken centre stage, along with predicting the function of such identified genes and the construction of direct gene regulatory networks at different system levels with a tuneable parameter. A new study by Wang and Gotoh described a novel Variable Precision Rough Sets-rooted robust soft computing method to successfully address these problems and has yielded some new insights. The significance of this progress and its perspectives will be discussed in this article.

  13. Binary Classification using Decision Tree based Genetic Programming and Its Application to Analysis of Bio-mass Data

    NASA Astrophysics Data System (ADS)

    To, Cuong; Pham, Tuan D.

    2010-01-01

    In machine learning, pattern recognition may be the most popular task. "Similar" patterns identification is also very important in biology because first, it is useful for prediction of patterns associated with disease, for example cancer tissue (normal or tumor); second, similarity or dissimilarity of the kinetic patterns is used to identify coordinately controlled genes or proteins involved in the same regulatory process. Third, similar genes (proteins) share similar functions. In this paper, we present an algorithm which uses genetic programming to create decision tree for binary classification problem. The application of the algorithm was implemented on five real biological databases. Base on the results of comparisons with well-known methods, we see that the algorithm is outstanding in most of cases.

  14. Functional proteomics outlines the complexity of breast cancer molecular subtypes.

    PubMed

    Gámez-Pozo, Angelo; Trilla-Fuertes, Lucía; Berges-Soria, Julia; Selevsek, Nathalie; López-Vacas, Rocío; Díaz-Almirón, Mariana; Nanni, Paolo; Arevalillo, Jorge M; Navarro, Hilario; Grossmann, Jonas; Gayá Moreno, Francisco; Gómez Rioja, Rubén; Prado-Vázquez, Guillermo; Zapater-Moros, Andrea; Main, Paloma; Feliú, Jaime; Martínez Del Prado, Purificación; Zamora, Pilar; Ciruelos, Eva; Espinosa, Enrique; Fresno Vara, Juan Ángel

    2017-08-30

    Breast cancer is a heterogeneous disease comprising a variety of entities with various genetic backgrounds. Estrogen receptor-positive, human epidermal growth factor receptor 2-negative tumors typically have a favorable outcome; however, some patients eventually relapse, which suggests some heterogeneity within this category. In the present study, we used proteomics and miRNA profiling techniques to characterize a set of 102 either estrogen receptor-positive (ER+)/progesterone receptor-positive (PR+) or triple-negative formalin-fixed, paraffin-embedded breast tumors. Protein expression-based probabilistic graphical models and flux balance analyses revealed that some ER+/PR+ samples had a protein expression profile similar to that of triple-negative samples and had a clinical outcome similar to those with triple-negative disease. This probabilistic graphical model-based classification had prognostic value in patients with luminal A breast cancer. This prognostic information was independent of that provided by standard genomic tests for breast cancer, such as MammaPrint, OncoType Dx and the 8-gene Score.

  15. Multiple Biomarker Panels for Early Detection of Breast Cancer in Peripheral Blood

    PubMed Central

    Zhang, Fan; Deng, Youping; Drabier, Renee

    2013-01-01

    Detecting breast cancer at early stages can be challenging. Traditional mammography and tissue microarray that have been studied for early breast cancer detection and prediction have many drawbacks. Therefore, there is a need for more reliable diagnostic tools for early detection of breast cancer due to a number of factors and challenges. In the paper, we presented a five-marker panel approach based on SVM for early detection of breast cancer in peripheral blood and show how to use SVM to model the classification and prediction problem of early detection of breast cancer in peripheral blood. We found that the five-marker panel can improve the prediction performance (area under curve) in the testing data set from 0.5826 to 0.7879. Further pathway analysis showed that the top four five-marker panels are associated with signaling, steroid hormones, metabolism, immune system, and hemostasis, which are consistent with previous findings. Our prediction model can serve as a general model for multibiomarker panel discovery in early detection of other cancers. PMID:24371830

  16. Multiple biomarker panels for early detection of breast cancer in peripheral blood.

    PubMed

    Zhang, Fan; Deng, Youping; Drabier, Renee

    2013-01-01

    Detecting breast cancer at early stages can be challenging. Traditional mammography and tissue microarray that have been studied for early breast cancer detection and prediction have many drawbacks. Therefore, there is a need for more reliable diagnostic tools for early detection of breast cancer due to a number of factors and challenges. In the paper, we presented a five-marker panel approach based on SVM for early detection of breast cancer in peripheral blood and show how to use SVM to model the classification and prediction problem of early detection of breast cancer in peripheral blood. We found that the five-marker panel can improve the prediction performance (area under curve) in the testing data set from 0.5826 to 0.7879. Further pathway analysis showed that the top four five-marker panels are associated with signaling, steroid hormones, metabolism, immune system, and hemostasis, which are consistent with previous findings. Our prediction model can serve as a general model for multibiomarker panel discovery in early detection of other cancers.

  17. [Implementation of cytology images classification--the Bethesda 2001 System--in a group of screened women from Podlaskie region--effect evaluation].

    PubMed

    Zbroch, Tomasz; Knapp, Paweł Grzegorz; Knapp, Piotr Andrzej

    2007-09-01

    Increasing knowledge concerning carcinogenesis within cervical epithelium has forced us to make continues modifications of cytology classification of the cervical smears. Eventually, new descriptions of the submicroscopic cytomorphological abnormalities have enabled the implementation of Bethesda System which was meant to take place of the former Papanicolaou classification although temporarily both are sometimes used simultaneously. The aim of this study was to compare results of these two classification systems in the aspect of diagnostic accuracy verified by further tests of the diagnostic algorithm for the cervical lesion evaluation. The study was conducted in the group of women selected from general population, the criteria being the place of living and cervical cancer age risk group, in the consecutive periods of mass screening in Podlaski region. The performed diagnostic tests have been based on the commonly used algorithm, as well as identical laboratory and methodological conditions. Performed assessment revealed comparable diagnostic accuracy of both analyzing classifications, verified by histological examination, although with marked higher specificity for dysplastic lesions with decreased number of HSIL results and increased diagnosis of LSILs. Higher number of performed colposcopies and biopsies were an additional consequence of TBS classification. Results based on Bethesda System made it possible to find the sources and reasons of abnormalities with much greater precision, which enabled causing agent treatment. Two evaluated cytology classification systems, although not much different, depicted higher potential of TBS and better, more effective communication between cytology laboratory and gynecologist, making reasonable implementation of The Bethesda System in the daily cytology screening work.

  18. SVM and SVM Ensembles in Breast Cancer Prediction.

    PubMed

    Huang, Min-Wei; Chen, Chih-Wen; Lin, Wei-Chao; Ke, Shih-Wen; Tsai, Chih-Fong

    2017-01-01

    Breast cancer is an all too common disease in women, making how to effectively predict it an active research problem. A number of statistical and machine learning techniques have been employed to develop various breast cancer prediction models. Among them, support vector machines (SVM) have been shown to outperform many related techniques. To construct the SVM classifier, it is first necessary to decide the kernel function, and different kernel functions can result in different prediction performance. However, there have been very few studies focused on examining the prediction performances of SVM based on different kernel functions. Moreover, it is unknown whether SVM classifier ensembles which have been proposed to improve the performance of single classifiers can outperform single SVM classifiers in terms of breast cancer prediction. Therefore, the aim of this paper is to fully assess the prediction performance of SVM and SVM ensembles over small and large scale breast cancer datasets. The classification accuracy, ROC, F-measure, and computational times of training SVM and SVM ensembles are compared. The experimental results show that linear kernel based SVM ensembles based on the bagging method and RBF kernel based SVM ensembles with the boosting method can be the better choices for a small scale dataset, where feature selection should be performed in the data pre-processing stage. For a large scale dataset, RBF kernel based SVM ensembles based on boosting perform better than the other classifiers.

  19. SVM and SVM Ensembles in Breast Cancer Prediction

    PubMed Central

    Huang, Min-Wei; Chen, Chih-Wen; Lin, Wei-Chao; Ke, Shih-Wen; Tsai, Chih-Fong

    2017-01-01

    Breast cancer is an all too common disease in women, making how to effectively predict it an active research problem. A number of statistical and machine learning techniques have been employed to develop various breast cancer prediction models. Among them, support vector machines (SVM) have been shown to outperform many related techniques. To construct the SVM classifier, it is first necessary to decide the kernel function, and different kernel functions can result in different prediction performance. However, there have been very few studies focused on examining the prediction performances of SVM based on different kernel functions. Moreover, it is unknown whether SVM classifier ensembles which have been proposed to improve the performance of single classifiers can outperform single SVM classifiers in terms of breast cancer prediction. Therefore, the aim of this paper is to fully assess the prediction performance of SVM and SVM ensembles over small and large scale breast cancer datasets. The classification accuracy, ROC, F-measure, and computational times of training SVM and SVM ensembles are compared. The experimental results show that linear kernel based SVM ensembles based on the bagging method and RBF kernel based SVM ensembles with the boosting method can be the better choices for a small scale dataset, where feature selection should be performed in the data pre-processing stage. For a large scale dataset, RBF kernel based SVM ensembles based on boosting perform better than the other classifiers. PMID:28060807

  20. Microscopic medical image classification framework via deep learning and shearlet transform.

    PubMed

    Rezaeilouyeh, Hadi; Mollahosseini, Ali; Mahoor, Mohammad H

    2016-10-01

    Cancer is the second leading cause of death in US after cardiovascular disease. Image-based computer-aided diagnosis can assist physicians to efficiently diagnose cancers in early stages. Existing computer-aided algorithms use hand-crafted features such as wavelet coefficients, co-occurrence matrix features, and recently, histogram of shearlet coefficients for classification of cancerous tissues and cells in images. These hand-crafted features often lack generalizability since every cancerous tissue and cell has a specific texture, structure, and shape. An alternative approach is to use convolutional neural networks (CNNs) to learn the most appropriate feature abstractions directly from the data and handle the limitations of hand-crafted features. A framework for breast cancer detection and prostate Gleason grading using CNN trained on images along with the magnitude and phase of shearlet coefficients is presented. Particularly, we apply shearlet transform on images and extract the magnitude and phase of shearlet coefficients. Then we feed shearlet features along with the original images to our CNN consisting of multiple layers of convolution, max pooling, and fully connected layers. Our experiments show that using the magnitude and phase of shearlet coefficients as extra information to the network can improve the accuracy of detection and generalize better compared to the state-of-the-art methods that rely on hand-crafted features. This study expands the application of deep neural networks into the field of medical image analysis, which is a difficult domain considering the limited medical data available for such analysis.

  1. Understanding the influences and impact of patient-clinician communication in cancer care.

    PubMed

    Lafata, Jennifer Elston; Shay, Laura A; Winship, Jodi M

    2017-12-01

    Patient-clinician communication is thought to be central to care outcomes, but when and how communication affects patient outcomes is not well understood. We propose a conceptual model and classification framework upon which the empirical evidence base for the impact of patient-clinician communication can be summarized and further built. We use the proposed model and framework to summarize findings from two recent systematic reviews, one evaluating the use of shared decision making (SDM) on cancer care outcomes and the other evaluating the role of physician recommendation in cancer screening use. Using this approach, we identified clusters of studies with positive findings, including those relying on the measurement of SDM from the patients' perspective and affective-cognitive outcomes, particularly in the context of surgical treatment decision making. We also identify important gaps in the literature, including the role of SDM in post-surgical treatment and end-of-life care decisions, and those specifying particular physician communication strategies when recommending cancer screening. Transparent linkages between key conceptual domains and the influence of methodological approaches on observed patient outcomes are needed to advance our understanding of how and when patient-clinician communication influences patient outcomes. The proposed conceptual model and classification framework can be used to facilitate the translation of empirical evidence into practice and to identify critical gaps in knowledge regarding how and when patient-clinician communication impacts care outcomes in the context of cancer and health care more broadly. © 2017 The Authors Health Expectations Published by John Wiley & Sons Ltd.

  2. Chaotic Particle Swarm Optimization with Mutation for Classification

    PubMed Central

    Assarzadeh, Zahra; Naghsh-Nilchi, Ahmad Reza

    2015-01-01

    In this paper, a chaotic particle swarm optimization with mutation-based classifier particle swarm optimization is proposed to classify patterns of different classes in the feature space. The introduced mutation operators and chaotic sequences allows us to overcome the problem of early convergence into a local minima associated with particle swarm optimization algorithms. That is, the mutation operator sharpens the convergence and it tunes the best possible solution. Furthermore, to remove the irrelevant data and reduce the dimensionality of medical datasets, a feature selection approach using binary version of the proposed particle swarm optimization is introduced. In order to demonstrate the effectiveness of our proposed classifier, mutation-based classifier particle swarm optimization, it is checked out with three sets of data classifications namely, Wisconsin diagnostic breast cancer, Wisconsin breast cancer and heart-statlog, with different feature vector dimensions. The proposed algorithm is compared with different classifier algorithms including k-nearest neighbor, as a conventional classifier, particle swarm-classifier, genetic algorithm, and Imperialist competitive algorithm-classifier, as more sophisticated ones. The performance of each classifier was evaluated by calculating the accuracy, sensitivity, specificity and Matthews's correlation coefficient. The experimental results show that the mutation-based classifier particle swarm optimization unequivocally performs better than all the compared algorithms. PMID:25709937

  3. The use of a gas chromatography-sensor system combined with advanced statistical methods, towards the diagnosis of urological malignancies

    PubMed Central

    Aggio, Raphael B. M.; de Lacy Costello, Ben; White, Paul; Khalid, Tanzeela; Ratcliffe, Norman M.; Persad, Raj; Probert, Chris S. J.

    2016-01-01

    Prostate cancer is one of the most common cancers. Serum prostate-specific antigen (PSA) is used to aid the selection of men undergoing biopsies. Its use remains controversial. We propose a GC-sensor algorithm system for classifying urine samples from patients with urological symptoms. This pilot study includes 155 men presenting to urology clinics, 58 were diagnosed with prostate cancer, 24 with bladder cancer and 73 with haematuria and or poor stream, without cancer. Principal component analysis (PCA) was applied to assess the discrimination achieved, while linear discriminant analysis (LDA) and support vector machine (SVM) were used as statistical models for sample classification. Leave-one-out cross-validation (LOOCV), repeated 10-fold cross-validation (10FoldCV), repeated double cross-validation (DoubleCV) and Monte Carlo permutations were applied to assess performance. Significant separation was found between prostate cancer and control samples, bladder cancer and controls and between bladder and prostate cancer samples. For prostate cancer diagnosis, the GC/SVM system classified samples with 95% sensitivity and 96% specificity after LOOCV. For bladder cancer diagnosis, the SVM reported 96% sensitivity and 100% specificity after LOOCV, while the DoubleCV reported 87% sensitivity and 99% specificity, with SVM showing 78% and 98% sensitivity between prostate and bladder cancer samples. Evaluation of the results of the Monte Carlo permutation of class labels obtained chance-like accuracy values around 50% suggesting the observed results for bladder cancer and prostate cancer detection are not due to over fitting. The results of the pilot study presented here indicate that the GC system is able to successfully identify patterns that allow classification of urine samples from patients with urological cancers. An accurate diagnosis based on urine samples would reduce the number of negative prostate biopsies performed, and the frequency of surveillance cystoscopy for bladder cancer patients. Larger cohort studies are planned to investigate the potential of this system. Future work may lead to non-invasive breath analyses for diagnosing urological conditions. PMID:26865331

  4. Optimization of breast mass classification using sequential forward floating selection (SFFS) and a support vector machine (SVM) model

    PubMed Central

    Tan, Maxine; Pu, Jiantao; Zheng, Bin

    2014-01-01

    Purpose: Improving radiologists’ performance in classification between malignant and benign breast lesions is important to increase cancer detection sensitivity and reduce false-positive recalls. For this purpose, developing computer-aided diagnosis (CAD) schemes has been attracting research interest in recent years. In this study, we investigated a new feature selection method for the task of breast mass classification. Methods: We initially computed 181 image features based on mass shape, spiculation, contrast, presence of fat or calcifications, texture, isodensity, and other morphological features. From this large image feature pool, we used a sequential forward floating selection (SFFS)-based feature selection method to select relevant features, and analyzed their performance using a support vector machine (SVM) model trained for the classification task. On a database of 600 benign and 600 malignant mass regions of interest (ROIs), we performed the study using a ten-fold cross-validation method. Feature selection and optimization of the SVM parameters were conducted on the training subsets only. Results: The area under the receiver operating characteristic curve (AUC) = 0.805±0.012 was obtained for the classification task. The results also showed that the most frequently-selected features by the SFFS-based algorithm in 10-fold iterations were those related to mass shape, isodensity and presence of fat, which are consistent with the image features frequently used by radiologists in the clinical environment for mass classification. The study also indicated that accurately computing mass spiculation features from the projection mammograms was difficult, and failed to perform well for the mass classification task due to tissue overlap within the benign mass regions. Conclusions: In conclusion, this comprehensive feature analysis study provided new and valuable information for optimizing computerized mass classification schemes that may have potential to be useful as a “second reader” in future clinical practice. PMID:24664267

  5. Machine learning models in breast cancer survival prediction.

    PubMed

    Montazeri, Mitra; Montazeri, Mohadeseh; Montazeri, Mahdieh; Beigzadeh, Amin

    2016-01-01

    Breast cancer is one of the most common cancers with a high mortality rate among women. With the early diagnosis of breast cancer survival will increase from 56% to more than 86%. Therefore, an accurate and reliable system is necessary for the early diagnosis of this cancer. The proposed model is the combination of rules and different machine learning techniques. Machine learning models can help physicians to reduce the number of false decisions. They try to exploit patterns and relationships among a large number of cases and predict the outcome of a disease using historical cases stored in datasets. The objective of this study is to propose a rule-based classification method with machine learning techniques for the prediction of different types of Breast cancer survival. We use a dataset with eight attributes that include the records of 900 patients in which 876 patients (97.3%) and 24 (2.7%) patients were females and males respectively. Naive Bayes (NB), Trees Random Forest (TRF), 1-Nearest Neighbor (1NN), AdaBoost (AD), Support Vector Machine (SVM), RBF Network (RBFN), and Multilayer Perceptron (MLP) machine learning techniques with 10-cross fold technique were used with the proposed model for the prediction of breast cancer survival. The performance of machine learning techniques were evaluated with accuracy, precision, sensitivity, specificity, and area under ROC curve. Out of 900 patients, 803 patients and 97 patients were alive and dead, respectively. In this study, Trees Random Forest (TRF) technique showed better results in comparison to other techniques (NB, 1NN, AD, SVM and RBFN, MLP). The accuracy, sensitivity and the area under ROC curve of TRF are 96%, 96%, 93%, respectively. However, 1NN machine learning technique provided poor performance (accuracy 91%, sensitivity 91% and area under ROC curve 78%). This study demonstrates that Trees Random Forest model (TRF) which is a rule-based classification model was the best model with the highest level of accuracy. Therefore, this model is recommended as a useful tool for breast cancer survival prediction as well as medical decision making.

  6. Electronic Risk Assessment System as an Appropriate Tool for the Prevention of Cancer: a Qualitative Study.

    PubMed

    Javan Amoli, Amir Hossein; Maserat, Elham; Safdari, Reza; Zali, Mohammad Reza

    2015-01-01

    Decision making modalities for screening for many cancer conditions and different stages have become increasingly complex. Computer-based risk assessment systems facilitate scheduling and decision making and support the delivery of cancer screening services. The aim of this article was to survey electronic risk assessment system as an appropriate tool for the prevention of cancer. A qualitative design was used involving 21 face-to-face interviews. Interviewing involved asking questions and getting answers from exclusive managers of cancer screening. Of the participants 6 were female and 15 were male, and ages ranged from 32 to 78 years. The study was based on a grounded theory approach and the tool was a semi- structured interview. Researchers studied 5 dimensions, comprising electronic guideline standards of colorectal cancer screening, work flow of clinical and genetic activities, pathways of colorectal cancer screening and functionality of computer based guidelines and barriers. Electronic guideline standards of colorectal cancer screening were described in the s3 categories of content standard, telecommunications and technical standards and nomenclature and classification standards. According to the participations' views, workflow and genetic pathways of colorectal cancer screening were identified. The study demonstrated an effective role of computer-guided consultation for screening management. Electronic based systems facilitate real-time decision making during a clinical interaction. Electronic pathways have been applied for clinical and genetic decision support, workflow management, update recommendation and resource estimates. A suitable technical and clinical infrastructure is an integral part of clinical practice guidline of screening. As a conclusion, it is recommended to consider the necessity of architecture assessment and also integration standards.

  7. Clinical characteristics and oncological outcomes of testicular cancer patients registered in 2005 and 2008: the first large-scale study from the Cancer Registration Committee of the Japanese Urological Association.

    PubMed

    Miki, Tsuneharu; Kamoi, Kazumi; Fujimoto, Hiroyuki; Kanayama, Hiro-omi; Ohyama, Chikara; Suzuki, Kazuhiro; Nishiyama, Hiroyuki; Eto, Masatoshi; Naito, Seiji; Fukumori, Tomoharu; Kubota, Yoshinobu; Takahashi, Satoru; Mikami, Kazuya; Homma, Yukio

    2014-08-01

    To describe the clinical and pathological characteristics and oncological outcomes of testicular cancer diagnosed in Japan, we report the results of the testicular cancer registration carried out by the Japanese Urological Association. Testicular cancer survey was conducted by the Japanese Urological Association in 2011 to register newly diagnosed testicular cancers in 2005 and 2008. The survey included details such as age, presenting symptoms, physical examination findings, tumor markers, histopathology, clinical stage, initial treatment and clinical outcomes. We analyzed 1121 cases of testicular primary germ cell tumor among 1157 registered patients. The median age was 37.0 years. Seminomas and non-seminomatous germ cell tumors accounted for 61.9% and 38.1%, respectively. Measurements of tumor markers were documented in 98.6% of the patients; however, there was an unsatisfactory uniform measurement of human chorionic gonadotropin, which made it difficult to evaluate the International Germ Cell Consensus Classification in all patients. The 1- and 3-year overall survival rates from the entire cohort were 98.3% and 96.8%, respectively. According to the International Germ Cell Consensus Classification, 3-year overall survival rates in the good, intermediate, and poor prognosis group were 99.1%, 100% and 79.9%, respectively. The present report is the first large-scale study of the characteristics and survival of testicular cancer patients in Japan based on multi-institutional registry data, and showed a good prognosis even in an advanced stage. The improved survival attributed substantially to accurate diagnosis and effective multimodal treatment. © 2014 The Japanese Urological Association.

  8. Population-based mammography screening: comparison of screen-film and full-field digital mammography with soft-copy reading--Oslo I study.

    PubMed

    Skaane, Per; Young, Kari; Skjennald, Arnulf

    2003-12-01

    To compare screen-film and full-field digital mammography with soft-copy reading in a population-based screening program. Full-field digital and screen-film mammography were performed in 3,683 women aged 50-69 years. Two standard views of each breast were acquired with each modality. Images underwent independent double reading with use of a five-point rating scale for probability of cancer. Recall rates and positive predictive values were calculated. Cancer detection rates determined with both modalities were compared by using the McNemar test for paired proportions. Retrospective side-by-side analysis for conspicuity of cancers was performed by an external independent radiologist group with experience in both modalities. In 3,683 cases, 31 cancers were detected. Screen-film mammography depicted 28 (0.76%) malignancies, and full-field digital mammography depicted 23 (0.62%) malignancies. The difference between cancer detection rates was not significant (P =.23). The recall rate for full-field digital mammography (4.6%; 168 of 3,683 cases) was slightly higher than that for screen-film mammography (3.5%; 128 of 3,683 cases). The positive predictive value based on needle biopsy results was 46% for screen-film mammography and 39% for full-field digital mammography. Side-by-side image comparison for cancer conspicuity led to classification of 19 cancers as equal for probability of malignancy, six cancers as slightly better demonstrated at screen-film mammography, and six cancers as slightly better demonstrated at full-field digital mammography. There was no statistically significant difference in cancer detection rate between screen-film and full-field digital mammography. Cancer conspicuity was equal with both modalities. Full-field digital mammography with soft-copy reading is comparable to screen-film mammography in population-based screening.

  9. Ultra-sensitive high performance liquid chromatography-laser-induced fluorescence based proteomics for clinical applications.

    PubMed

    Patil, Ajeetkumar; Bhat, Sujatha; Pai, Keerthilatha M; Rai, Lavanya; Kartha, V B; Chidangil, Santhosh

    2015-09-08

    An ultra-sensitive high performance liquid chromatography-laser induced fluorescence (HPLC-LIF) based technique has been developed by our group at Manipal, for screening, early detection, and staging for various cancers, using protein profiling of clinical samples like, body fluids, cellular specimens, and biopsy-tissue. More than 300 protein profiles of different clinical samples (serum, saliva, cellular samples and tissue homogenates) from volunteers (normal, and different pre-malignant/malignant conditions) were recorded using this set-up. The protein profiles were analyzed using principal component analysis (PCA) to achieve objective detection and classification of malignant, premalignant and healthy conditions with high sensitivity and specificity. The HPLC-LIF protein profiling combined with PCA, as a routine method for screening, diagnosis, and staging of cervical cancer and oral cancer, is discussed in this paper. In recent years, proteomics techniques have advanced tremendously in life sciences and medical sciences for the detection and identification of proteins in body fluids, tissue homogenates and cellular samples to understand biochemical mechanisms leading to different diseases. Some of the methods include techniques like high performance liquid chromatography, 2D-gel electrophoresis, MALDI-TOF-MS, SELDI-TOF-MS, CE-MS and LC-MS techniques. We have developed an ultra-sensitive high performance liquid chromatography-laser induced fluorescence (HPLC-LIF) based technique, for screening, early detection, and staging for various cancers, using protein profiling of clinical samples like, body fluids, cellular specimens, and biopsy-tissue. More than 300 protein profiles of different clinical samples (serum, saliva, cellular samples and tissue homogenates) from healthy and volunteers with different malignant conditions were recorded by using this set-up. The protein profile data were analyzed using principal component analysis (PCA) for objective classification and detection of malignant, premalignant and healthy conditions. The method is extremely sensitive to detect proteins with limit of detection of the order of femto-moles. The HPLC-LIF combined with PCA as a potential proteomic method for the diagnosis of oral cancer and cervical cancer has been discussed in this paper. This article is part of a Special Issue entitled: Proteomics in India. Copyright © 2015 Elsevier B.V. All rights reserved.

  10. Prostate cancer detection using machine learning techniques by employing combination of features extracting strategies.

    PubMed

    Hussain, Lal; Ahmed, Adeel; Saeed, Sharjil; Rathore, Saima; Awan, Imtiaz Ahmed; Shah, Saeed Arif; Majid, Abdul; Idris, Adnan; Awan, Anees Ahmed

    2018-02-06

    Prostate is a second leading causes of cancer deaths among men. Early detection of cancer can effectively reduce the rate of mortality caused by Prostate cancer. Due to high and multiresolution of MRIs from prostate cancer require a proper diagnostic systems and tools. In the past researchers developed Computer aided diagnosis (CAD) systems that help the radiologist to detect the abnormalities. In this research paper, we have employed novel Machine learning techniques such as Bayesian approach, Support vector machine (SVM) kernels: polynomial, radial base function (RBF) and Gaussian and Decision Tree for detecting prostate cancer. Moreover, different features extracting strategies are proposed to improve the detection performance. The features extracting strategies are based on texture, morphological, scale invariant feature transform (SIFT), and elliptic Fourier descriptors (EFDs) features. The performance was evaluated based on single as well as combination of features using Machine Learning Classification techniques. The Cross validation (Jack-knife k-fold) was performed and performance was evaluated in term of receiver operating curve (ROC) and specificity, sensitivity, Positive predictive value (PPV), negative predictive value (NPV), false positive rate (FPR). Based on single features extracting strategies, SVM Gaussian Kernel gives the highest accuracy of 98.34% with AUC of 0.999. While, using combination of features extracting strategies, SVM Gaussian kernel with texture + morphological, and EFDs + morphological features give the highest accuracy of 99.71% and AUC of 1.00.

  11. Artificial neural networks for processing fluorescence spectroscopy data in skin cancer diagnostics

    NASA Astrophysics Data System (ADS)

    Lenhardt, L.; Zeković, I.; Dramićanin, T.; Dramićanin, M. D.

    2013-11-01

    Over the years various optical spectroscopic techniques have been widely used as diagnostic tools in the discrimination of many types of malignant diseases. Recently, synchronous fluorescent spectroscopy (SFS) coupled with chemometrics has been applied in cancer diagnostics. The SFS method involves simultaneous scanning of both emission and excitation wavelengths while keeping the interval of wavelengths (constant-wavelength mode) or frequencies (constant-energy mode) between them constant. This method is fast, relatively inexpensive, sensitive and non-invasive. Total synchronous fluorescence spectra of normal skin, nevus and melanoma samples were used as input for training of artificial neural networks. Two different types of artificial neural networks were trained, the self-organizing map and the feed-forward neural network. Histopathology results of investigated skin samples were used as the gold standard for network output. Based on the obtained classification success rate of neural networks, we concluded that both networks provided high sensitivity with classification errors between 2 and 4%.

  12. Can-CSC-GBE: Developing Cost-sensitive Classifier with Gentleboost Ensemble for breast cancer classification using protein amino acids and imbalanced data.

    PubMed

    Ali, Safdar; Majid, Abdul; Javed, Syed Gibran; Sattar, Mohsin

    2016-06-01

    Early prediction of breast cancer is important for effective treatment and survival. We developed an effective Cost-Sensitive Classifier with GentleBoost Ensemble (Can-CSC-GBE) for the classification of breast cancer using protein amino acid features. In this work, first, discriminant information of the protein sequences related to breast tissue is extracted. Then, the physicochemical properties hydrophobicity and hydrophilicity of amino acids are employed to generate molecule descriptors in different feature spaces. For comparison, we obtained results by combining Cost-Sensitive learning with conventional ensemble of AdaBoostM1 and Bagging. The proposed Can-CSC-GBE system has effectively reduced the misclassification costs and thereby improved the overall classification performance. Our novel approach has highlighted promising results as compared to the state-of-the-art ensemble approaches. Copyright © 2016 Elsevier Ltd. All rights reserved.

  13. Enhanced Patient Expectant and Antiemetic Drug Efficacy

    DTIC Science & Technology

    1999-07-01

    Breast Cancer Nausea and Vomiting Expectancy Patient Information Antiemetic Side Effect 15. NUMBER OF PAGES 15 16. PRICE CODE 17. SECURITY ...CLASSIFICATION OF REPORT Unclassified 18. SECURITY CLASSIFICATION OF THIS PAGE Unclassified 19. SECURITY CLASSIFICATION OF ABSTRACT...5-HT3 receptor antagonist class of antiemetics (ondansetron, granisetron , tropisitron) have greatly reduced chemotherapy-related vomiting, this has

  14. [Turcot's syndrome confirmed by molecular biological tests].

    PubMed

    Jeannin, S; Lebrun, C; Van Den Bos, F; Olschwang, S; Bourg, V; Frenay, M

    2006-06-01

    Turcot's syndrome is characterized clinically by the concurrence of a primary brain tumor and a familial adenomatous polyposis or a hereditary nonpolyposis colorectal cancer. We report a case of a 45-year-old woman who underwent in 1995 neuro-oncological treatment for an anaplastic astrocytoma (grade III according to the World Health Organization classification). Treatment included complete surgery, radiotherapy, a first-line nitrosourea-based chemotherapy regimen and a second-line platinium salt-based regimen. It was then noted that the patient's brother had colorectal cancer. A genetic study detected a germ-line mutation on the hMSH2 gene specific of HNPCC syndrome (Human Non Polyposis Colorectal Cancer). Colonoscopy was normal. Eight years after the diagnosis, the patient developed a gliomatosis cerebri and died. Relevant personal and familial history can provide the clue to the diagnosis of Turcot's syndrome. Molecular diagnosis may contribute to appropriate care of affected patients.

  15. Comparison of breast cancer survival in two populations: Ardabil, Iran and British Columbia, Canada.

    PubMed

    Sadjadi, Alireza; Hislop, T Gregory; Bajdik, Chris; Bashash, Morteza; Ghorbani, Anahita; Nouraie, Mehdi; Babaei, Masoud; Malekzadeh, Reza; Yavari, Parvin

    2009-10-28

    Patterns in survival can provide information about the burden and severity of cancer, help uncover gaps in systemic policy and program delivery, and support the planning of enhanced cancer control systems. The aim of this paper is to describe the one-year survival rates for breast cancer in two populations using population-based cancer registries: Ardabil, Iran, and British Columbia (BC), Canada. All newly diagnosed cases of female breast cancer were identified in the Ardabil cancer registry from 2003 to 2005 and the BC cancer registry for 2003. The International Classification of Disease for Oncology (ICDO) was used for coding cancer morphology and topography. Survival time was determined from cancer diagnosis to death. Age-specific one-year survival rates, relative survival rates and weighted standard errors were calculated using life-tables for each country. Breast cancer patients in BC had greater one-year survival rates than patients in Ardabil overall and for each age group under 60. These findings support the need for breast cancer screening programs (including regular clinical breast examinations and mammography), public education and awareness regarding early detection of breast cancer, and education of health care providers.

  16. Cancer Liquid Biopsy: Is It Ready for Clinic?

    PubMed

    Pan, Ying; Ji, John S; Jin, Jason Gang; Kuo, Winston Patrick; Kang, Hongjun

    2017-01-01

    The management of cancer relies on a combination of imaging and tissue biopsy for diagnosis, monitoring, and molecular classification-based patient stratification to ensure appropriate treatment. Conventional tissue biopsy harvests tumor samples with invasive procedures, which are often difficult for patients with advanced disease. Given the well-recognized intratumor genetic heterogeneity [1], the biopsy of small tumor fragments does not necessarily represent all the genetic aberrations in the tumor, but sampling the entire tumor in each patient is not realistic. Moreover, tumors evolve all the time from local to advanced disease and by adapting to selective pressure from treatment.

  17. A prognostic classifier for patients with colorectal cancer liver metastasis, based on AURKA, PTGS2 and MMP9.

    PubMed

    Goos, Jeroen A C M; Coupé, Veerle M H; van de Wiel, Mark A; Diosdado, Begoña; Delis-Van Diemen, Pien M; Hiemstra, Annemieke C; de Cuba, Erienne M V; Beliën, Jeroen A M; Menke-van der Houven van Oordt, C Willemien; Geldof, Albert A; Meijer, Gerrit A; Hoekstra, Otto S; Fijneman, Remond J A

    2016-01-12

    Prognosis of patients with colorectal cancer liver metastasis (CRCLM) is estimated based on clinicopathological models. Stratifying patients based on tumor biology may have additional value. Tissue micro-arrays (TMAs), containing resected CRCLM and corresponding primary tumors from a multi-institutional cohort of 507 patients, were immunohistochemically stained for 18 candidate biomarkers. Cross-validated hazard rate ratios (HRRs) for overall survival (OS) and the proportion of HRRs with opposite effect (P(HRR < 1) or P(HRR > 1)) were calculated. A classifier was constructed by classification and regression tree (CART) analysis and its prognostic value determined by permutation analysis. Correlations between protein expression in primary tumor-CRCLM pairs were calculated. Based on their putative prognostic value, EGFR (P(HRR < 1) = .02), AURKA (P(HRR < 1) = .02), VEGFA (P(HRR < 1) = .02), PTGS2 (P(HRR < 1) = .01), SLC2A1 (P(HRR > 1) < 01), HIF1α (P(HRR > 1) = .06), KCNQ1 (P(HRR > 1) = .09), CEA (P (HRR > 1) = .05) and MMP9 (P(HRR < 1) = .07) were included in the CART analysis (n = 201). The resulting classifier was based on AURKA, PTGS2 and MMP9 expression and was associated with OS (HRR 2.79, p < .001), also after multivariate analysis (HRR 3.57, p < .001). The prognostic value of the biomarker-based classifier was superior to the clinicopathological model (p = .001). Prognostic value was highest for colon cancer patients (HRR 5.71, p < .001) and patients not treated with systemic therapy (HRR 3.48, p < .01). Classification based on protein expression in primary tumors could be based on AURKA expression only (HRR 2.59, p = .04). A classifier was generated for patients with CRCLM with improved prognostic value compared to the standard clinicopathological prognostic parameters, which may aid selection of patients who may benefit from adjuvant systemic therapy.

  18. Supervised machine learning and active learning in classification of radiology reports.

    PubMed

    Nguyen, Dung H M; Patrick, Jon D

    2014-01-01

    This paper presents an automated system for classifying the results of imaging examinations (CT, MRI, positron emission tomography) into reportable and non-reportable cancer cases. This system is part of an industrial-strength processing pipeline built to extract content from radiology reports for use in the Victorian Cancer Registry. In addition to traditional supervised learning methods such as conditional random fields and support vector machines, active learning (AL) approaches were investigated to optimize training production and further improve classification performance. The project involved two pilot sites in Victoria, Australia (Lake Imaging (Ballarat) and Peter MacCallum Cancer Centre (Melbourne)) and, in collaboration with the NSW Central Registry, one pilot site at Westmead Hospital (Sydney). The reportability classifier performance achieved 98.25% sensitivity and 96.14% specificity on the cancer registry's held-out test set. Up to 92% of training data needed for supervised machine learning can be saved by AL. AL is a promising method for optimizing the supervised training production used in classification of radiology reports. When an AL strategy is applied during the data selection process, the cost of manual classification can be reduced significantly. The most important practical application of the reportability classifier is that it can dramatically reduce human effort in identifying relevant reports from the large imaging pool for further investigation of cancer. The classifier is built on a large real-world dataset and can achieve high performance in filtering relevant reports to support cancer registries. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

  19. AUCTSP: an improved biomarker gene pair class predictor.

    PubMed

    Kagaris, Dimitri; Khamesipour, Alireza; Yiannoutsos, Constantin T

    2018-06-26

    The Top Scoring Pair (TSP) classifier, based on the concept of relative ranking reversals in the expressions of pairs of genes, has been proposed as a simple, accurate, and easily interpretable decision rule for classification and class prediction of gene expression profiles. The idea that differences in gene expression ranking are associated with presence or absence of disease is compelling and has strong biological plausibility. Nevertheless, the TSP formulation ignores significant available information which can improve classification accuracy and is vulnerable to selecting genes which do not have differential expression in the two conditions ("pivot" genes). We introduce the AUCTSP classifier as an alternative rank-based estimator of the magnitude of the ranking reversals involved in the original TSP. The proposed estimator is based on the Area Under the Receiver Operating Characteristic (ROC) Curve (AUC) and as such, takes into account the separation of the entire distribution of gene expression levels in gene pairs under the conditions considered, as opposed to comparing gene rankings within individual subjects as in the original TSP formulation. Through extensive simulations and case studies involving classification in ovarian, leukemia, colon, breast and prostate cancers and diffuse large b-cell lymphoma, we show the superiority of the proposed approach in terms of improving classification accuracy, avoiding overfitting and being less prone to selecting non-informative (pivot) genes. The proposed AUCTSP is a simple yet reliable and robust rank-based classifier for gene expression classification. While the AUCTSP works by the same principle as TSP, its ability to determine the top scoring gene pair based on the relative rankings of two marker genes across all subjects as opposed to each individual subject results in significant performance gains in classification accuracy. In addition, the proposed method tends to avoid selection of non-informative (pivot) genes as members of the top-scoring pair.

  20. Big genomics and clinical data analytics strategies for precision cancer prognosis.

    PubMed

    Ow, Ghim Siong; Kuznetsov, Vladimir A

    2016-11-07

    The field of personalized and precise medicine in the era of big data analytics is growing rapidly. Previously, we proposed our model of patient classification termed Prognostic Signature Vector Matching (PSVM) and identified a 37 variable signature comprising 36 let-7b associated prognostic significant mRNAs and the age risk factor that stratified large high-grade serous ovarian cancer patient cohorts into three survival-significant risk groups. Here, we investigated the predictive performance of PSVM via optimization of the prognostic variable weights, which represent the relative importance of one prognostic variable over the others. In addition, we compared several multivariate prognostic models based on PSVM with classical machine learning techniques such as K-nearest-neighbor, support vector machine, random forest, neural networks and logistic regression. Our results revealed that negative log-rank p-values provides more robust weight values as opposed to the use of other quantities such as hazard ratios, fold change, or a combination of those factors. PSVM, together with the classical machine learning classifiers were combined in an ensemble (multi-test) voting system, which collectively provides a more precise and reproducible patient stratification. The use of the multi-test system approach, rather than the search for the ideal classification/prediction method, might help to address limitations of the individual classification algorithm in specific situation.

Top