NASA Astrophysics Data System (ADS)
Chen, Zhe; Parker, B. J.; Feng, D. D.; Fulton, R.
2004-10-01
In this paper, we compare various temporal analysis schemes applied to dynamic PET for improved quantification, image quality and temporal compression purposes. We compare an optimal sampling schedule (OSS) design, principal component analysis (PCA) applied in the image domain, and principal component analysis applied in the sinogram domain; for region-of-interest quantification, sinogram-domain PCA is combined with the Huesman algorithm to quantify from the sinograms directly without requiring reconstruction of all PCA channels. Using a simulated phantom FDG brain study and three clinical studies, we evaluate the fidelity of the compressed data for estimation of local cerebral metabolic rate of glucose by a four-compartment model. Our results show that using a noise-normalized PCA in the sinogram domain gives similar compression ratio and quantitative accuracy to OSS, but with substantially better precision. These results indicate that sinogram-domain PCA for dynamic PET can be a useful preprocessing stage for PET compression and quantification applications.
NASA Astrophysics Data System (ADS)
Aida, S.; Matsuno, T.; Hasegawa, T.; Tsuji, K.
2017-07-01
Micro X-ray fluorescence (micro-XRF) analysis is repeated as a means of producing elemental maps. In some cases, however, the XRF images of trace elements that are obtained are not clear due to high background intensity. To solve this problem, we applied principal component analysis (PCA) to XRF spectra. We focused on improving the quality of XRF images by applying PCA. XRF images of the dried residue of standard solution on the glass substrate were taken. The XRF intensities for the dried residue were analyzed before and after PCA. Standard deviations of XRF intensities in the PCA-filtered images were improved, leading to clear contrast of the images. This improvement of the XRF images was effective in cases where the XRF intensity was weak.
GO-PCA: An Unsupervised Method to Explore Gene Expression Data Using Prior Knowledge
Wagner, Florian
2015-01-01
Method Genome-wide expression profiling is a widely used approach for characterizing heterogeneous populations of cells, tissues, biopsies, or other biological specimen. The exploratory analysis of such data typically relies on generic unsupervised methods, e.g. principal component analysis (PCA) or hierarchical clustering. However, generic methods fail to exploit prior knowledge about the molecular functions of genes. Here, I introduce GO-PCA, an unsupervised method that combines PCA with nonparametric GO enrichment analysis, in order to systematically search for sets of genes that are both strongly correlated and closely functionally related. These gene sets are then used to automatically generate expression signatures with functional labels, which collectively aim to provide a readily interpretable representation of biologically relevant similarities and differences. The robustness of the results obtained can be assessed by bootstrapping. Results I first applied GO-PCA to datasets containing diverse hematopoietic cell types from human and mouse, respectively. In both cases, GO-PCA generated a small number of signatures that represented the majority of lineages present, and whose labels reflected their respective biological characteristics. I then applied GO-PCA to human glioblastoma (GBM) data, and recovered signatures associated with four out of five previously defined GBM subtypes. My results demonstrate that GO-PCA is a powerful and versatile exploratory method that reduces an expression matrix containing thousands of genes to a much smaller set of interpretable signatures. In this way, GO-PCA aims to facilitate hypothesis generation, design of further analyses, and functional comparisons across datasets. PMID:26575370
GO-PCA: An Unsupervised Method to Explore Gene Expression Data Using Prior Knowledge.
Wagner, Florian
2015-01-01
Genome-wide expression profiling is a widely used approach for characterizing heterogeneous populations of cells, tissues, biopsies, or other biological specimen. The exploratory analysis of such data typically relies on generic unsupervised methods, e.g. principal component analysis (PCA) or hierarchical clustering. However, generic methods fail to exploit prior knowledge about the molecular functions of genes. Here, I introduce GO-PCA, an unsupervised method that combines PCA with nonparametric GO enrichment analysis, in order to systematically search for sets of genes that are both strongly correlated and closely functionally related. These gene sets are then used to automatically generate expression signatures with functional labels, which collectively aim to provide a readily interpretable representation of biologically relevant similarities and differences. The robustness of the results obtained can be assessed by bootstrapping. I first applied GO-PCA to datasets containing diverse hematopoietic cell types from human and mouse, respectively. In both cases, GO-PCA generated a small number of signatures that represented the majority of lineages present, and whose labels reflected their respective biological characteristics. I then applied GO-PCA to human glioblastoma (GBM) data, and recovered signatures associated with four out of five previously defined GBM subtypes. My results demonstrate that GO-PCA is a powerful and versatile exploratory method that reduces an expression matrix containing thousands of genes to a much smaller set of interpretable signatures. In this way, GO-PCA aims to facilitate hypothesis generation, design of further analyses, and functional comparisons across datasets.
Farnell, D J J; Popat, H; Richmond, S
2016-06-01
Methods used in image processing should reflect any multilevel structures inherent in the image dataset or they run the risk of functioning inadequately. We wish to test the feasibility of multilevel principal components analysis (PCA) to build active shape models (ASMs) for cases relevant to medical and dental imaging. Multilevel PCA was used to carry out model fitting to sets of landmark points and it was compared to the results of "standard" (single-level) PCA. Proof of principle was tested by applying mPCA to model basic peri-oral expressions (happy, neutral, sad) approximated to the junction between the mouth/lips. Monte Carlo simulations were used to create this data which allowed exploration of practical implementation issues such as the number of landmark points, number of images, and number of groups (i.e., "expressions" for this example). To further test the robustness of the method, mPCA was subsequently applied to a dental imaging dataset utilising landmark points (placed by different clinicians) along the boundary of mandibular cortical bone in panoramic radiographs of the face. Changes of expression that varied between groups were modelled correctly at one level of the model and changes in lip width that varied within groups at another for the Monte Carlo dataset. Extreme cases in the test dataset were modelled adequately by mPCA but not by standard PCA. Similarly, variations in the shape of the cortical bone were modelled by one level of mPCA and variations between the experts at another for the panoramic radiographs dataset. Results for mPCA were found to be comparable to those of standard PCA for point-to-point errors via miss-one-out testing for this dataset. These errors reduce with increasing number of eigenvectors/values retained, as expected. We have shown that mPCA can be used in shape models for dental and medical image processing. mPCA was found to provide more control and flexibility when compared to standard "single-level" PCA. Specifically, mPCA is preferable to "standard" PCA when multiple levels occur naturally in the dataset. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Principal Component Analysis of Thermographic Data
NASA Technical Reports Server (NTRS)
Winfree, William P.; Cramer, K. Elliott; Zalameda, Joseph N.; Howell, Patricia A.; Burke, Eric R.
2015-01-01
Principal Component Analysis (PCA) has been shown effective for reducing thermographic NDE data. While a reliable technique for enhancing the visibility of defects in thermal data, PCA can be computationally intense and time consuming when applied to the large data sets typical in thermography. Additionally, PCA can experience problems when very large defects are present (defects that dominate the field-of-view), since the calculation of the eigenvectors is now governed by the presence of the defect, not the "good" material. To increase the processing speed and to minimize the negative effects of large defects, an alternative method of PCA is being pursued where a fixed set of eigenvectors, generated from an analytic model of the thermal response of the material under examination, is used to process the thermal data from composite materials. This method has been applied for characterization of flaws.
Investigation of domain walls in PPLN by confocal raman microscopy and PCA analysis
NASA Astrophysics Data System (ADS)
Shur, Vladimir Ya.; Zelenovskiy, Pavel; Bourson, Patrice
2017-07-01
Confocal Raman microscopy (CRM) is a powerful tool for investigation of ferroelectric domains. Mechanical stresses and electric fields existed in the vicinity of neutral and charged domain walls modify frequency, intensity and width of spectral lines [1], thus allowing to visualize micro- and nanodomain structures both at the surface and in the bulk of the crystal [2,3]. Stresses and fields are naturally coupled in ferroelectrics due to inverse piezoelectric effect and hardly can be separated in Raman spectra. PCA is a powerful statistical method for analysis of large data matrix providing a set of orthogonal variables, called principal components (PCs). PCA is widely used for classification of experimental data, for example, in crystallization experiments, for detection of small amounts of components in solid mixtures etc. [4,5]. In Raman spectroscopy PCA was applied for analysis of phase transitions and provided critical pressure with good accuracy [6]. In the present work we for the first time applied Principal Component Analysis (PCA) method for analysis of Raman spectra measured in periodically poled lithium niobate (PPLN). We found that principal components demonstrate different sensitivity to mechanical stresses and electric fields in the vicinity of the domain walls. This allowed us to separately visualize spatial distribution of fields and electric fields at the surface and in the bulk of PPLN.
Q-mode versus R-mode principal component analysis for linear discriminant analysis (LDA)
NASA Astrophysics Data System (ADS)
Lee, Loong Chuen; Liong, Choong-Yeun; Jemain, Abdul Aziz
2017-05-01
Many literature apply Principal Component Analysis (PCA) as either preliminary visualization or variable con-struction methods or both. Focus of PCA can be on the samples (R-mode PCA) or variables (Q-mode PCA). Traditionally, R-mode PCA has been the usual approach to reduce high-dimensionality data before the application of Linear Discriminant Analysis (LDA), to solve classification problems. Output from PCA composed of two new matrices known as loadings and scores matrices. Each matrix can then be used to produce a plot, i.e. loadings plot aids identification of important variables whereas scores plot presents spatial distribution of samples on new axes that are also known as Principal Components (PCs). Fundamentally, the scores matrix always be the input variables for building classification model. A recent paper uses Q-mode PCA but the focus of analysis was not on the variables but instead on the samples. As a result, the authors have exchanged the use of both loadings and scores plots in which clustering of samples was studied using loadings plot whereas scores plot has been used to identify important manifest variables. Therefore, the aim of this study is to statistically validate the proposed practice. Evaluation is based on performance of external error obtained from LDA models according to number of PCs. On top of that, bootstrapping was also conducted to evaluate the external error of each of the LDA models. Results show that LDA models produced by PCs from R-mode PCA give logical performance and the matched external error are also unbiased whereas the ones produced with Q-mode PCA show the opposites. With that, we concluded that PCs produced from Q-mode is not statistically stable and thus should not be applied to problems of classifying samples, but variables. We hope this paper will provide some insights on the disputable issues.
Bouhlel, Jihéne; Jouan-Rimbaud Bouveresse, Delphine; Abouelkaram, Said; Baéza, Elisabeth; Jondreville, Catherine; Travel, Angélique; Ratel, Jérémy; Engel, Erwan; Rutledge, Douglas N
2018-02-01
The aim of this work is to compare a novel exploratory chemometrics method, Common Components Analysis (CCA), with Principal Components Analysis (PCA) and Independent Components Analysis (ICA). CCA consists in adapting the multi-block statistical method known as Common Components and Specific Weights Analysis (CCSWA or ComDim) by applying it to a single data matrix, with one variable per block. As an application, the three methods were applied to SPME-GC-MS volatolomic signatures of livers in an attempt to reveal volatile organic compounds (VOCs) markers of chicken exposure to different types of micropollutants. An application of CCA to the initial SPME-GC-MS data revealed a drift in the sample Scores along CC2, as a function of injection order, probably resulting from time-related evolution in the instrument. This drift was eliminated by orthogonalization of the data set with respect to CC2, and the resulting data are used as the orthogonalized data input into each of the three methods. Since the first step in CCA is to norm-scale all the variables, preliminary data scaling has no effect on the results, so that CCA was applied only to orthogonalized SPME-GC-MS data, while, PCA and ICA were applied to the "orthogonalized", "orthogonalized and Pareto-scaled", and "orthogonalized and autoscaled" data. The comparison showed that PCA results were highly dependent on the scaling of variables, contrary to ICA where the data scaling did not have a strong influence. Nevertheless, for both PCA and ICA the clearest separations of exposed groups were obtained after autoscaling of variables. The main part of this work was to compare the CCA results using the orthogonalized data with those obtained with PCA and ICA applied to orthogonalized and autoscaled variables. The clearest separations of exposed chicken groups were obtained by CCA. CCA Loadings also clearly identified the variables contributing most to the Common Components giving separations. The PCA Loadings did not highlight the most influencing variables for each separation, whereas the ICA Loadings highlighted the same variables as did CCA. This study shows the potential of CCA for the extraction of pertinent information from a data matrix, using a procedure based on an original optimisation criterion, to produce results that are complementary, and in some cases may be superior, to those of PCA and ICA. Copyright © 2017 Elsevier B.V. All rights reserved.
A two-stage linear discriminant analysis via QR-decomposition.
Ye, Jieping; Li, Qi
2005-06-01
Linear Discriminant Analysis (LDA) is a well-known method for feature extraction and dimension reduction. It has been used widely in many applications involving high-dimensional data, such as image and text classification. An intrinsic limitation of classical LDA is the so-called singularity problems; that is, it fails when all scatter matrices are singular. Many LDA extensions were proposed in the past to overcome the singularity problems. Among these extensions, PCA+LDA, a two-stage method, received relatively more attention. In PCA+LDA, the LDA stage is preceded by an intermediate dimension reduction stage using Principal Component Analysis (PCA). Most previous LDA extensions are computationally expensive, and not scalable, due to the use of Singular Value Decomposition or Generalized Singular Value Decomposition. In this paper, we propose a two-stage LDA method, namely LDA/QR, which aims to overcome the singularity problems of classical LDA, while achieving efficiency and scalability simultaneously. The key difference between LDA/QR and PCA+LDA lies in the first stage, where LDA/QR applies QR decomposition to a small matrix involving the class centroids, while PCA+LDA applies PCA to the total scatter matrix involving all training data points. We further justify the proposed algorithm by showing the relationship among LDA/QR and previous LDA methods. Extensive experiments on face images and text documents are presented to show the effectiveness of the proposed algorithm.
Mueller, Daniela; Ferrão, Marco Flôres; Marder, Luciano; da Costa, Adilson Ben; de Cássia de Souza Schneider, Rosana
2013-01-01
The main objective of this study was to use infrared spectroscopy to identify vegetable oils used as raw material for biodiesel production and apply multivariate analysis to the data. Six different vegetable oil sources—canola, cotton, corn, palm, sunflower and soybeans—were used to produce biodiesel batches. The spectra were acquired by Fourier transform infrared spectroscopy using a universal attenuated total reflectance sensor (FTIR-UATR). For the multivariate analysis principal component analysis (PCA), hierarchical cluster analysis (HCA), interval principal component analysis (iPCA) and soft independent modeling of class analogy (SIMCA) were used. The results indicate that is possible to develop a methodology to identify vegetable oils used as raw material in the production of biodiesel by FTIR-UATR applying multivariate analysis. It was also observed that the iPCA found the best spectral range for separation of biodiesel batches using FTIR-UATR data, and with this result, the SIMCA method classified 100% of the soybean biodiesel samples. PMID:23539030
Reese, Sarah E; Archer, Kellie J; Therneau, Terry M; Atkinson, Elizabeth J; Vachon, Celine M; de Andrade, Mariza; Kocher, Jean-Pierre A; Eckel-Passow, Jeanette E
2013-11-15
Batch effects are due to probe-specific systematic variation between groups of samples (batches) resulting from experimental features that are not of biological interest. Principal component analysis (PCA) is commonly used as a visual tool to determine whether batch effects exist after applying a global normalization method. However, PCA yields linear combinations of the variables that contribute maximum variance and thus will not necessarily detect batch effects if they are not the largest source of variability in the data. We present an extension of PCA to quantify the existence of batch effects, called guided PCA (gPCA). We describe a test statistic that uses gPCA to test whether a batch effect exists. We apply our proposed test statistic derived using gPCA to simulated data and to two copy number variation case studies: the first study consisted of 614 samples from a breast cancer family study using Illumina Human 660 bead-chip arrays, whereas the second case study consisted of 703 samples from a family blood pressure study that used Affymetrix SNP Array 6.0. We demonstrate that our statistic has good statistical properties and is able to identify significant batch effects in two copy number variation case studies. We developed a new statistic that uses gPCA to identify whether batch effects exist in high-throughput genomic data. Although our examples pertain to copy number data, gPCA is general and can be used on other data types as well. The gPCA R package (Available via CRAN) provides functionality and data to perform the methods in this article. reesese@vcu.edu
Detecting most influencing courses on students grades using block PCA
NASA Astrophysics Data System (ADS)
Othman, Osama H.; Gebril, Rami Salah
2014-12-01
One of the modern solutions adopted in dealing with the problem of large number of variables in statistical analyses is the Block Principal Component Analysis (Block PCA). This modified technique can be used to reduce the vertical dimension (variables) of the data matrix Xn×p by selecting a smaller number of variables, (say m) containing most of the statistical information. These selected variables can then be employed in further investigations and analyses. Block PCA is an adapted multistage technique of the original PCA. It involves the application of Cluster Analysis (CA) and variable selection throughout sub principal components scores (PC's). The application of Block PCA in this paper is a modified version of the original work of Liu et al (2002). The main objective was to apply PCA on each group of variables, (established using cluster analysis), instead of involving the whole large pack of variables which was proved to be unreliable. In this work, the Block PCA is used to reduce the size of a huge data matrix ((n = 41) × (p = 251)) consisting of Grade Point Average (GPA) of the students in 251 courses (variables) in the faculty of science in Benghazi University. In other words, we are constructing a smaller analytical data matrix of the GPA's of the students with less variables containing most variation (statistical information) in the original database. By applying the Block PCA, (12) courses were found to `absorb' most of the variation or influence from the original data matrix, and hence worth to be keep for future statistical exploring and analytical studies. In addition, the course Independent Study (Math.) was found to be the most influencing course on students GPA among the 12 selected courses.
Roquigny, Roxane; Novinscak, Amy; Arseneault, Tanya; Joly, David L; Filion, Martin
2018-06-19
Phytophthora infestans is responsible for late blight, one of the most important potato diseases. Phenazine-1-carboxylic acid (PCA)-producing Pseudomonas fluorescens strain LBUM223 isolated in our laboratory shows biocontrol potential against various plant pathogens. To characterize the effect of LBUM223 on the transcriptome of P. infestans, we conducted an in vitro time-course study. Confrontational assay was performed using P. infestans inoculated alone (control) or with LBUM223, its phzC- isogenic mutant (not producing PCA), or exogenically applied PCA. Destructive sampling was performed at 6, 9 and 12 days and the transcriptome of P. infestans was analysed using RNA-Seq. The expression of a subset of differentially expressed genes was validated by RT-qPCR. Both LBUM223 and exogenically applied PCA significantly repressed P. infestans' growth at all times. Compared to the control treatment, transcriptomic analyses showed that the percentages of all P. infestans' genes significantly altered by LBUM223 and exogenically applied PCA increased as time progressed, from 50 to 61% and from to 32 to 46%, respectively. When applying an absolute cut-off value of 3 fold change or more for all three harvesting times, 207 genes were found significantly differentially expressed by PCA, either produced by LBUM223 or exogenically applied. Gene ontology analysis revealed that both treatments altered the expression of key functional genes involved in major functions like phosphorylation mechanisms, transmembrane transport and oxidoreduction activities. Interestingly, even though no host plant tissue was present in the in vitro system, PCA also led to the overexpression of several genes encoding effectors. The mutant only slightly repressed P. infestans' growth and barely altered its transcriptome. Our study suggests that PCA is involved in P. infestans' growth repression and led to important transcriptomic changes by both up- and down-regulating gene expression in P. infestans over time. Different metabolic functions were altered and many effectors were found to be upregulated, suggesting their implication in biocontrol.
NASA Astrophysics Data System (ADS)
Hristian, L.; Ostafe, M. M.; Manea, L. R.; Apostol, L. L.
2017-06-01
The work pursued the distribution of combed wool fabrics destined to manufacturing of external articles of clothing in terms of the values of durability and physiological comfort indices, using the mathematical model of Principal Component Analysis (PCA). Principal Components Analysis (PCA) applied in this study is a descriptive method of the multivariate analysis/multi-dimensional data, and aims to reduce, under control, the number of variables (columns) of the matrix data as much as possible to two or three. Therefore, based on the information about each group/assortment of fabrics, it is desired that, instead of nine inter-correlated variables, to have only two or three new variables called components. The PCA target is to extract the smallest number of components which recover the most of the total information contained in the initial data.
NASA Astrophysics Data System (ADS)
Gharibnezhad, Fahit; Mujica, Luis E.; Rodellar, José
2015-01-01
Using Principal Component Analysis (PCA) for Structural Health Monitoring (SHM) has received considerable attention over the past few years. PCA has been used not only as a direct method to identify, classify and localize damages but also as a significant primary step for other methods. Despite several positive specifications that PCA conveys, it is very sensitive to outliers. Outliers are anomalous observations that can affect the variance and the covariance as vital parts of PCA method. Therefore, the results based on PCA in the presence of outliers are not fully satisfactory. As a main contribution, this work suggests the use of robust variant of PCA not sensitive to outliers, as an effective way to deal with this problem in SHM field. In addition, the robust PCA is compared with the classical PCA in the sense of detecting probable damages. The comparison between the results shows that robust PCA can distinguish the damages much better than using classical one, and even in many cases allows the detection where classic PCA is not able to discern between damaged and non-damaged structures. Moreover, different types of robust PCA are compared with each other as well as with classical counterpart in the term of damage detection. All the results are obtained through experiments with an aircraft turbine blade using piezoelectric transducers as sensors and actuators and adding simulated damages.
Liu, Tsang-Sen; Lin, Jhen-Nan; Peng, Tsung-Ren
2018-01-16
Isotopic compositions of δ 2 H, δ 18 O, δ 13 C, and δ 15 N and concentrations of 22 trace elements from garlic samples were analyzed and processed with stepwise principal component analysis (PCA) to discriminate garlic's country of origin among Asian regions including South Korea, Vietnam, Taiwan, and China. Results indicate that there is no single trace-element concentration or isotopic composition that can accomplish the study's purpose and the stepwise PCA approach proposed does allow for discrimination between countries on a regional basis. Sequentially, Step-1 PCA distinguishes garlic's country of origin among Taiwanese, South Korean, and Vietnamese samples; Step-2 PCA discriminates Chinese garlic from South Korean garlic; and Step-3 and Step-4 PCA, Chinese garlic from Vietnamese garlic. In model tests, countries of origin of all audit samples were correctly discriminated by stepwise PCA. Consequently, this study demonstrates that stepwise PCA as applied is a simple and effective approach to discriminating country of origin among Asian garlics. © 2018 American Academy of Forensic Sciences.
Zeemering, Stef; Bonizzi, Pietro; Maesen, Bart; Peeters, Ralf; Schotten, Ulrich
2015-01-01
Spatiotemporal complexity of atrial fibrillation (AF) patterns is often quantified by annotated intracardiac contact mapping. We introduce a new approach that applies recurrence plot (RP) construction followed by recurrence quantification analysis (RQA) to epicardial atrial electrograms, recorded with a high-density grid of electrodes. In 32 patients with no history of AF (aAF, n=11), paroxysmal AF (PAF, n=12) and persistent AF (persAF, n=9), RPs were constructed using a phase space electrogram embedding dimension equal to the estimated AF cycle length. Spatial information was incorporated by 1) averaging the recurrence over all electrodes, and 2) by applying principal component analysis (PCA) to the matrix of embedded electrograms and selecting the first principal component as a representation of spatial diversity. Standard RQA parameters were computed on the constructed RPs and correlated to the number of fibrillation waves per AF cycle (NW). Averaged RP RQA parameters showed no correlation with NW. Correlations improved when applying PCA, with maximum correlation achieved between RP threshold and NW (RR1%, r=0.68, p <; 0.001) and RP determinism (DET, r=-0.64, p <; 0.001). All studied RQA parameters based on the PCA RP were able to discriminate between persAF and aAF/PAF (DET persAF 0.40 ± 0.11 vs. 0.59 ± 0.14/0.62 ± 0.16, p <; 0.01). RP construction and RQA combined with PCA provide a quick and reliable tool to visualize dynamical behaviour and to assess the complexity of contact mapping patterns in AF.
Descriptive Characteristics of Surface Water Quality in Hong Kong by a Self-Organising Map
An, Yan; Zou, Zhihong; Li, Ranran
2016-01-01
In this study, principal component analysis (PCA) and a self-organising map (SOM) were used to analyse a complex dataset obtained from the river water monitoring stations in the Tolo Harbor and Channel Water Control Zone (Hong Kong), covering the period of 2009–2011. PCA was initially applied to identify the principal components (PCs) among the nonlinear and complex surface water quality parameters. SOM followed PCA, and was implemented to analyze the complex relationships and behaviors of the parameters. The results reveal that PCA reduced the multidimensional parameters to four significant PCs which are combinations of the original ones. The positive and inverse relationships of the parameters were shown explicitly by pattern analysis in the component planes. It was found that PCA and SOM are efficient tools to capture and analyze the behavior of multivariable, complex, and nonlinear related surface water quality data. PMID:26761018
Descriptive Characteristics of Surface Water Quality in Hong Kong by a Self-Organising Map.
An, Yan; Zou, Zhihong; Li, Ranran
2016-01-08
In this study, principal component analysis (PCA) and a self-organising map (SOM) were used to analyse a complex dataset obtained from the river water monitoring stations in the Tolo Harbor and Channel Water Control Zone (Hong Kong), covering the period of 2009-2011. PCA was initially applied to identify the principal components (PCs) among the nonlinear and complex surface water quality parameters. SOM followed PCA, and was implemented to analyze the complex relationships and behaviors of the parameters. The results reveal that PCA reduced the multidimensional parameters to four significant PCs which are combinations of the original ones. The positive and inverse relationships of the parameters were shown explicitly by pattern analysis in the component planes. It was found that PCA and SOM are efficient tools to capture and analyze the behavior of multivariable, complex, and nonlinear related surface water quality data.
ToF-SIMS PCA analysis of Myrtus communis L.
NASA Astrophysics Data System (ADS)
Piras, F. M.; Dettori, M. F.; Magnani, A.
2009-06-01
Nowadays there is a growing interest of researchers for the application of sophisticated analytical techniques in conjunction with statistical data analysis methods to the characterization of natural products to assure their authenticity and quality, and for the possibility of direct analysis of food to obtain maximum information. In this work, time-of-flight secondary ion mass spectrometry (ToF-SIMS) in conjunction with principal components analysis (PCA) are applied to study the chemical composition and variability of Sardinian myrtle ( Myrtus communis L.) through the analysis of both berries alcoholic extracts and berries epicarp. ToF-SIMS spectra of berries epicarp show that the epicuticular waxes consist mainly of carboxylic acids with chain length ranging from C20 to C30, or identical species formed from fragmentation of long-chain esters. PCA of ToF-SIMS data from myrtle berries epicarp distinguishes two groups characterized by a different surface concentration of triacontanoic acid. Variability in antocyanins, flavonols, α-tocopherol, and myrtucommulone contents is showed by ToF-SIMS PCA analysis of myrtle berries alcoholic extracts.
Non-linear principal component analysis applied to Lorenz models and to North Atlantic SLP
NASA Astrophysics Data System (ADS)
Russo, A.; Trigo, R. M.
2003-04-01
A non-linear generalisation of Principal Component Analysis (PCA), denoted Non-Linear Principal Component Analysis (NLPCA), is introduced and applied to the analysis of three data sets. Non-Linear Principal Component Analysis allows for the detection and characterisation of low-dimensional non-linear structure in multivariate data sets. This method is implemented using a 5-layer feed-forward neural network introduced originally in the chemical engineering literature (Kramer, 1991). The method is described and details of its implementation are addressed. Non-Linear Principal Component Analysis is first applied to a data set sampled from the Lorenz attractor (1963). It is found that the NLPCA approximations are more representative of the data than are the corresponding PCA approximations. The same methodology was applied to the less known Lorenz attractor (1984). However, the results obtained weren't as good as those attained with the famous 'Butterfly' attractor. Further work with this model is underway in order to assess if NLPCA techniques can be more representative of the data characteristics than are the corresponding PCA approximations. The application of NLPCA to relatively 'simple' dynamical systems, such as those proposed by Lorenz, is well understood. However, the application of NLPCA to a large climatic data set is much more challenging. Here, we have applied NLPCA to the sea level pressure (SLP) field for the entire North Atlantic area and the results show a slight imcrement of explained variance associated. Finally, directions for future work are presented.%}
A stable systemic risk ranking in China's banking sector: Based on principal component analysis
NASA Astrophysics Data System (ADS)
Fang, Libing; Xiao, Binqing; Yu, Honghai; You, Qixing
2018-02-01
In this paper, we compare five popular systemic risk rankings, and apply principal component analysis (PCA) model to provide a stable systemic risk ranking for the Chinese banking sector. Our empirical results indicate that five methods suggest vastly different systemic risk rankings for the same bank, while the combined systemic risk measure based on PCA provides a reliable ranking. Furthermore, according to factor loadings of the first component, PCA combined ranking is mainly based on fundamentals instead of market price data. We clearly find that price-based rankings are not as practical a method as fundamentals-based ones. This PCA combined ranking directly shows systemic risk contributions of each bank for banking supervision purpose and reminds banks to prevent and cope with the financial crisis in advance.
Caprihan, A; Pearlson, G D; Calhoun, V D
2008-08-15
Principal component analysis (PCA) is often used to reduce the dimension of data before applying more sophisticated data analysis methods such as non-linear classification algorithms or independent component analysis. This practice is based on selecting components corresponding to the largest eigenvalues. If the ultimate goal is separation of data in two groups, then these set of components need not have the most discriminatory power. We measured the distance between two such populations using Mahalanobis distance and chose the eigenvectors to maximize it, a modified PCA method, which we call the discriminant PCA (DPCA). DPCA was applied to diffusion tensor-based fractional anisotropy images to distinguish age-matched schizophrenia subjects from healthy controls. The performance of the proposed method was evaluated by the one-leave-out method. We show that for this fractional anisotropy data set, the classification error with 60 components was close to the minimum error and that the Mahalanobis distance was twice as large with DPCA, than with PCA. Finally, by masking the discriminant function with the white matter tracts of the Johns Hopkins University atlas, we identified left superior longitudinal fasciculus as the tract which gave the least classification error. In addition, with six optimally chosen tracts the classification error was zero.
From measurements to metrics: PCA-based indicators of cyber anomaly
NASA Astrophysics Data System (ADS)
Ahmed, Farid; Johnson, Tommy; Tsui, Sonia
2012-06-01
We present a framework of the application of Principal Component Analysis (PCA) to automatically obtain meaningful metrics from intrusion detection measurements. In particular, we report the progress made in applying PCA to analyze the behavioral measurements of malware and provide some preliminary results in selecting dominant attributes from an arbitrary number of malware attributes. The results will be useful in formulating an optimal detection threshold in the principal component space, which can both validate and augment existing malware classifiers.
NASA Astrophysics Data System (ADS)
Rojek, Barbara; Wesolowski, Marek; Suchacz, Bogdan
2013-12-01
In the paper infrared (IR) spectroscopy and multivariate exploration techniques: principal component analysis (PCA) and cluster analysis (CA) were applied as supportive methods for the detection of physicochemical incompatibilities between baclofen and excipients. In the course of research, the most useful rotational strategy in PCA proved to be varimax normalized, while in CA Ward's hierarchical agglomeration with Euclidean distance measure enabled to yield the most interpretable results. Chemometrical calculations confirmed the suitability of PCA and CA as the auxiliary methods for interpretation of infrared spectra in order to recognize whether compatibilities or incompatibilities between active substance and excipients occur. On the basis of IR spectra and the results of PCA and CA it was possible to demonstrate that the presence of lactose, β-cyclodextrin and meglumine in binary mixtures produce interactions with baclofen. The results were verified using differential scanning calorimetry, differential thermal analysis, thermogravimetry/differential thermogravimetry and X-ray powder diffraction analyses.
Ardila, Jorge Armando; Funari, Cristiano Soleo; Andrade, André Marques; Cavalheiro, Alberto José; Carneiro, Renato Lajarim
2015-01-01
Bauhinia forficata Link. is recognised by the Brazilian Health Ministry as a treatment of hypoglycemia and diabetes. Analytical methods are useful to assess the plant identity due the similarities found in plants from Bauhinia spp. HPLC-UV/PDA in combination with chemometric tools is an alternative widely used and suitable for authentication of plant material, however, the shifts of retention times for similar compounds in different samples is a problem. To perform comparisons between the authentic medicinal plant (Bauhinia forficata Link.) and samples commercially available in drugstores claiming to be "Bauhinia spp. to treat diabetes" and to evaluate the performance of multivariate curve resolution - alternating least squares (MCR-ALS) associated to principal component analysis (PCA) when compared to pure PCA. HPLC-UV/PDA data obtained from extracts of leaves were evaluated employing a combination of MCR-ALS and PCA, which allowed the use of the full chromatographic and spectrometric information without the need of peak alignment procedures. The use of MCR-ALS/PCA showed better results than the conventional PCA using only one wavelength. Only two of nine commercial samples presented characteristics similar to the authentic Bauhinia forficata spp., considering the full HPLC-UV/PDA data. The combination of MCR-ALS and PCA is very useful when applied to a group of samples where a general alignment procedure could not be applied due to the different chromatographic profiles. This work also demonstrates the need of more strict control from the health authorities regarding herbal products available on the market. Copyright © 2015 John Wiley & Sons, Ltd.
NASA Astrophysics Data System (ADS)
Tsai, Jinn-Tsong; Chou, Ping-Yi; Chou, Jyh-Horng
2015-11-01
The aim of this study is to generate vector quantisation (VQ) codebooks by integrating principle component analysis (PCA) algorithm, Linde-Buzo-Gray (LBG) algorithm, and evolutionary algorithms (EAs). The EAs include genetic algorithm (GA), particle swarm optimisation (PSO), honey bee mating optimisation (HBMO), and firefly algorithm (FF). The study is to provide performance comparisons between PCA-EA-LBG and PCA-LBG-EA approaches. The PCA-EA-LBG approaches contain PCA-GA-LBG, PCA-PSO-LBG, PCA-HBMO-LBG, and PCA-FF-LBG, while the PCA-LBG-EA approaches contain PCA-LBG, PCA-LBG-GA, PCA-LBG-PSO, PCA-LBG-HBMO, and PCA-LBG-FF. All training vectors of test images are grouped according to PCA. The PCA-EA-LBG used the vectors grouped by PCA as initial individuals, and the best solution gained by the EAs was given for LBG to discover a codebook. The PCA-LBG approach is to use the PCA to select vectors as initial individuals for LBG to find a codebook. The PCA-LBG-EA used the final result of PCA-LBG as an initial individual for EAs to find a codebook. The search schemes in PCA-EA-LBG first used global search and then applied local search skill, while in PCA-LBG-EA first used local search and then employed global search skill. The results verify that the PCA-EA-LBG indeed gain superior results compared to the PCA-LBG-EA, because the PCA-EA-LBG explores a global area to find a solution, and then exploits a better one from the local area of the solution. Furthermore the proposed PCA-EA-LBG approaches in designing VQ codebooks outperform existing approaches shown in the literature.
Lin, Yuxin; Chen, Feifei; Shen, Li; Tang, Xiaoyu; Du, Cui; Sun, Zhandong; Ding, Huijie; Chen, Jiajia; Shen, Bairong
2018-05-21
Prostate cancer (PCa) is a fatal malignant tumor among males in the world and the metastasis is a leading cause for PCa death. Biomarkers are therefore urgently needed to detect PCa metastatic signature at the early time. MicroRNAs are small non-coding RNAs with the potential to be biomarkers for disease prediction. In addition, computer-aided biomarker discovery is now becoming an attractive paradigm for precision diagnosis and prognosis of complex diseases. In this study, we identified key microRNAs as biomarkers for predicting PCa metastasis based on network vulnerability analysis. We first extracted microRNAs and mRNAs that were differentially expressed between primary PCa and metastatic PCa (MPCa) samples. Then we constructed the MPCa-specific microRNA-mRNA network and screened microRNA biomarkers by a novel bioinformatics model. The model emphasized the characterization of systems stability changes and the network vulnerability with three measurements, i.e. the structurally single-line regulation, the functional importance of microRNA targets and the percentage of transcription factor genes in microRNA unique targets. With this model, we identified five microRNAs as putative biomarkers for PCa metastasis. Among them, miR-101-3p and miR-145-5p have been previously reported as biomarkers for PCa metastasis and the remaining three, i.e. miR-204-5p, miR-198 and miR-152, were screened as novel biomarkers for PCa metastasis. The results were further confirmed by the assessment of their predictive power and biological function analysis. Five microRNAs were identified as candidate biomarkers for predicting PCa metastasis based on our network vulnerability analysis model. The prediction performance, literature exploration and functional enrichment analysis convinced our findings. This novel bioinformatics model could be applied to biomarker discovery for other complex diseases.
Principal components analysis in clinical studies.
Zhang, Zhongheng; Castelló, Adela
2017-09-01
In multivariate analysis, independent variables are usually correlated to each other which can introduce multicollinearity in the regression models. One approach to solve this problem is to apply principal components analysis (PCA) over these variables. This method uses orthogonal transformation to represent sets of potentially correlated variables with principal components (PC) that are linearly uncorrelated. PCs are ordered so that the first PC has the largest possible variance and only some components are selected to represent the correlated variables. As a result, the dimension of the variable space is reduced. This tutorial illustrates how to perform PCA in R environment, the example is a simulated dataset in which two PCs are responsible for the majority of the variance in the data. Furthermore, the visualization of PCA is highlighted.
Roopwani, Rahul; Buckner, Ira S
2011-10-14
Principal component analysis (PCA) was applied to pharmaceutical powder compaction. A solid fraction parameter (SF(c/d)) and a mechanical work parameter (W(c/d)) representing irreversible compression behavior were determined as functions of applied load. Multivariate analysis of the compression data was carried out using PCA. The first principal component (PC1) showed loadings for the solid fraction and work values that agreed with changes in the relative significance of plastic deformation to consolidation at different pressures. The PC1 scores showed the same rank order as the relative plasticity ranking derived from the literature for common pharmaceutical materials. The utility of PC1 in understanding deformation was extended to binary mixtures using a subset of the original materials. Combinations of brittle and plastic materials were characterized using the PCA method. The relationships between PC1 scores and the weight fractions of the mixtures were typically linear showing ideal mixing in their deformation behaviors. The mixture consisting of two plastic materials was the only combination to show a consistent positive deviation from ideality. The application of PCA to solid fraction and mechanical work data appears to be an effective means of predicting deformation behavior during compaction of simple powder mixtures. Copyright © 2011 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Darwish, Hany W.; Hassan, Said A.; Salem, Maissa Y.; El-Zeany, Badr A.
2014-03-01
Different chemometric models were applied for the quantitative analysis of Amlodipine (AML), Valsartan (VAL) and Hydrochlorothiazide (HCT) in ternary mixture, namely, Partial Least Squares (PLS) as traditional chemometric model and Artificial Neural Networks (ANN) as advanced model. PLS and ANN were applied with and without variable selection procedure (Genetic Algorithm GA) and data compression procedure (Principal Component Analysis PCA). The chemometric methods applied are PLS-1, GA-PLS, ANN, GA-ANN and PCA-ANN. The methods were used for the quantitative analysis of the drugs in raw materials and pharmaceutical dosage form via handling the UV spectral data. A 3-factor 5-level experimental design was established resulting in 25 mixtures containing different ratios of the drugs. Fifteen mixtures were used as a calibration set and the other ten mixtures were used as validation set to validate the prediction ability of the suggested methods. The validity of the proposed methods was assessed using the standard addition technique.
NASA Astrophysics Data System (ADS)
Fernández-Peralbo, M. A.; Gómez-Gómez, E.; Calderón-Santiago, M.; Carrasco-Valiente, J.; Ruiz-García, J.; Requena-Tapia, M. J.; Luque de Castro, M. D.; Priego-Capote, F.
2016-12-01
The existing clinical biomarkers for prostate cancer (PCa) diagnosis are far from ideal (e.g., the prostate specific antigen (PSA) serum level suffers from lack of specificity, providing frequent false positives leading to over-diagnosis). A key step in the search for minimum invasive tests to complement or replace PSA should be supported on the changes experienced by the biochemical pathways in PCa patients as compared to negative biopsy control individuals. In this research a comprehensive global analysis by LC-QTOF was applied to urine from 62 patients with a clinically significant PCa and 42 healthy individuals, both groups confirmed by biopsy. An unpaired t-test (p-value < 0.05) provided 28 significant metabolites tentatively identified in urine, used to develop a partial least squares discriminant analysis (PLS-DA) model characterized by 88.4 and 92.9% of sensitivity and specificity, respectively. Among the 28 significant metabolites 27 were present at lower concentrations in PCa patients than in control individuals, while only one reported higher concentrations in PCa patients. The connection among the biochemical pathways in which they are involved (DNA methylation, epigenetic marks on histones and RNA cap methylation) could explain the concentration changes with PCa and supports, once again, the role of metabolomics in upstream processes.
Combination of PCA and LORETA for sources analysis of ERP data: an emotional processing study
NASA Astrophysics Data System (ADS)
Hu, Jin; Tian, Jie; Yang, Lei; Pan, Xiaohong; Liu, Jiangang
2006-03-01
The purpose of this paper is to study spatiotemporal patterns of neuronal activity in emotional processing by analysis of ERP data. 108 pictures (categorized as positive, negative and neutral) were presented to 24 healthy, right-handed subjects while 128-channel EEG data were recorded. An analysis of two steps was applied to the ERP data. First, principal component analysis was performed to obtain significant ERP components. Then LORETA was applied to each component to localize their brain sources. The first six principal components were extracted, each of which showed different spatiotemporal patterns of neuronal activity. The results agree with other emotional study by fMRI or PET. The combination of PCA and LORETA can be used to analyze spatiotemporal patterns of ERP data in emotional processing.
Lycopene and Risk of Prostate Cancer
Chen, Ping; Zhang, Wenhao; Wang, Xiao; Zhao, Keke; Negi, Devendra Singh; Zhuo, Li; Qi, Mao; Wang, Xinghuan; Zhang, Xinhua
2015-01-01
Abstract Prostate cancer (PCa) is a common illness for aging males. Lycopene has been identified as an antioxidant agent with potential anticancer properties. Studies investigating the relation between lycopene and PCa risk have produced inconsistent results. This study aims to determine dietary lycopene consumption/circulating concentration and any potential dose–response associations with the risk of PCa. Eligible studies published in English up to April 10, 2014, were searched and identified from Pubmed, Sciencedirect Online, Wiley online library databases and hand searching. The STATA (version 12.0) was applied to process the dose–response meta-analysis. Random effects models were used to calculate pooled relative risks (RRs) and 95% confidence intervals (CIs) and to incorporate variation between studies. The linear and nonlinear dose–response relations were evaluated with data from categories of lycopene consumption/circulating concentrations. Twenty-six studies were included with 17,517 cases of PCa reported from 563,299 participants. Although inverse association between lycopene consumption and PCa risk was not found in all studies, there was a trend that with higher lycopene intake, there was reduced incidence of PCa (P = 0.078). Removal of one Chinese study in sensitivity analysis, or recalculation using data from only high-quality studies for subgroup analysis, indicated that higher lycopene consumption significantly lowered PCa risk. Furthermore, our dose–response meta-analysis demonstrated that higher lycopene consumption was linearly associated with a reduced risk of PCa with a threshold between 9 and 21 mg/day. Consistently, higher circulating lycopene levels significantly reduced the risk of PCa. Interestingly, the concentration of circulating lycopene between 2.17 and 85 μg/dL was linearly inversed with PCa risk whereas there was no linear association >85 μg/dL. In addition, greater efficacy for the circulating lycopene concentration on preventing PCa was found for studies with high quality, follow-up >10 years and where results were adjusted by the age or the body mass index. In conclusion, our novel data demonstrates that higher lycopene consumption/circulating concentration is associated with a lower risk of PCa. However, further studies are required to determine the mechanism by which lycopene reduces the risk of PCa and if there are other factors in tomato products that might potentially decrease PCa risk and progression. PMID:26287411
Principal Component Analysis: A Method for Determining the Essential Dynamics of Proteins
David, Charles C.; Jacobs, Donald J.
2015-01-01
It has become commonplace to employ principal component analysis to reveal the most important motions in proteins. This method is more commonly known by its acronym, PCA. While most popular molecular dynamics packages inevitably provide PCA tools to analyze protein trajectories, researchers often make inferences of their results without having insight into how to make interpretations, and they are often unaware of limitations and generalizations of such analysis. Here we review best practices for applying standard PCA, describe useful variants, discuss why one may wish to make comparison studies, and describe a set of metrics that make comparisons possible. In practice, one will be forced to make inferences about the essential dynamics of a protein without having the desired amount of samples. Therefore, considerable time is spent on describing how to judge the significance of results, highlighting pitfalls. The topic of PCA is reviewed from the perspective of many practical considerations, and useful recipes are provided. PMID:24061923
Principal component analysis: a method for determining the essential dynamics of proteins.
David, Charles C; Jacobs, Donald J
2014-01-01
It has become commonplace to employ principal component analysis to reveal the most important motions in proteins. This method is more commonly known by its acronym, PCA. While most popular molecular dynamics packages inevitably provide PCA tools to analyze protein trajectories, researchers often make inferences of their results without having insight into how to make interpretations, and they are often unaware of limitations and generalizations of such analysis. Here we review best practices for applying standard PCA, describe useful variants, discuss why one may wish to make comparison studies, and describe a set of metrics that make comparisons possible. In practice, one will be forced to make inferences about the essential dynamics of a protein without having the desired amount of samples. Therefore, considerable time is spent on describing how to judge the significance of results, highlighting pitfalls. The topic of PCA is reviewed from the perspective of many practical considerations, and useful recipes are provided.
2014-01-01
Background Measures of household socio-economic position (SEP) are widely used in health research. There exist a number of approaches to their measurement, with Principal Components Analysis (PCA) applied to a basket of household assets being one of the most common. PCA, however, carries a number of assumptions about the distribution of the data which may be untenable, and alternative, non-parametric, approaches may be preferred. Mokken scale analysis is a non-parametric, item response theory approach to scale development which appears never to have been applied to household asset data. A Mokken scale can be used to rank order items (measures of wealth) as well as households. Using data on household asset ownership from a national sample of 4,154 consenting households in the World Health Survey from Vietnam, 2003, we construct two measures of household SEP. Seventeen items asking about assets, and utility and infrastructure use were used. Mokken Scaling and PCA were applied to the data. A single item measure of total household expenditure is used as a point of contrast. Results An 11 item scale, out of the 17 items, was identified that conformed to the assumptions of a Mokken Scale. All the items in the scale were identified as strong items (Hi > .5). Two PCA measures of SEP were developed as a point of contrast. One PCA measure was developed using all 17 available asset items, the other used the reduced set of 11 items identified in the Mokken scale analaysis. The Mokken Scale measure of SEP and the 17 item PCA measure had a very high correlation (r = .98), and they both correlated moderately with total household expenditure: r = .59 and r = .57 respectively. In contrast the 11 item PCA measure correlated moderately with the Mokken scale (r = .68), and weakly with the total household expenditure (r = .18). Conclusion The Mokken scale measure of household SEP performed at least as well as PCA, and outperformed the PCA measure developed with the 11 items used in the Mokken scale. Unlike PCA, Mokken scaling carries no assumptions about the underlying shape of the distribution of the data, and can be used simultaneous to order household SEP and items. The approach, however, has not been tested with data from other countries and remains an interesting, but under researched approach. PMID:25126103
Reidpath, Daniel D; Ahmadi, Keivan
2014-01-01
Measures of household socio-economic position (SEP) are widely used in health research. There exist a number of approaches to their measurement, with Principal Components Analysis (PCA) applied to a basket of household assets being one of the most common. PCA, however, carries a number of assumptions about the distribution of the data which may be untenable, and alternative, non-parametric, approaches may be preferred. Mokken scale analysis is a non-parametric, item response theory approach to scale development which appears never to have been applied to household asset data. A Mokken scale can be used to rank order items (measures of wealth) as well as households. Using data on household asset ownership from a national sample of 4,154 consenting households in the World Health Survey from Vietnam, 2003, we construct two measures of household SEP. Seventeen items asking about assets, and utility and infrastructure use were used. Mokken Scaling and PCA were applied to the data. A single item measure of total household expenditure is used as a point of contrast. An 11 item scale, out of the 17 items, was identified that conformed to the assumptions of a Mokken Scale. All the items in the scale were identified as strong items (Hi > .5). Two PCA measures of SEP were developed as a point of contrast. One PCA measure was developed using all 17 available asset items, the other used the reduced set of 11 items identified in the Mokken scale analaysis. The Mokken Scale measure of SEP and the 17 item PCA measure had a very high correlation (r = .98), and they both correlated moderately with total household expenditure: r = .59 and r = .57 respectively. In contrast the 11 item PCA measure correlated moderately with the Mokken scale (r = .68), and weakly with the total household expenditure (r = .18). The Mokken scale measure of household SEP performed at least as well as PCA, and outperformed the PCA measure developed with the 11 items used in the Mokken scale. Unlike PCA, Mokken scaling carries no assumptions about the underlying shape of the distribution of the data, and can be used simultaneous to order household SEP and items. The approach, however, has not been tested with data from other countries and remains an interesting, but under researched approach.
Structured Sparse Principal Components Analysis With the TV-Elastic Net Penalty.
de Pierrefeu, Amicie; Lofstedt, Tommy; Hadj-Selem, Fouad; Dubois, Mathieu; Jardri, Renaud; Fovet, Thomas; Ciuciu, Philippe; Frouin, Vincent; Duchesnay, Edouard
2018-02-01
Principal component analysis (PCA) is an exploratory tool widely used in data analysis to uncover the dominant patterns of variability within a population. Despite its ability to represent a data set in a low-dimensional space, PCA's interpretability remains limited. Indeed, the components produced by PCA are often noisy or exhibit no visually meaningful patterns. Furthermore, the fact that the components are usually non-sparse may also impede interpretation, unless arbitrary thresholding is applied. However, in neuroimaging, it is essential to uncover clinically interpretable phenotypic markers that would account for the main variability in the brain images of a population. Recently, some alternatives to the standard PCA approach, such as sparse PCA (SPCA), have been proposed, their aim being to limit the density of the components. Nonetheless, sparsity alone does not entirely solve the interpretability problem in neuroimaging, since it may yield scattered and unstable components. We hypothesized that the incorporation of prior information regarding the structure of the data may lead to improved relevance and interpretability of brain patterns. We therefore present a simple extension of the popular PCA framework that adds structured sparsity penalties on the loading vectors in order to identify the few stable regions in the brain images that capture most of the variability. Such structured sparsity can be obtained by combining, e.g., and total variation (TV) penalties, where the TV regularization encodes information on the underlying structure of the data. This paper presents the structured SPCA (denoted SPCA-TV) optimization framework and its resolution. We demonstrate SPCA-TV's effectiveness and versatility on three different data sets. It can be applied to any kind of structured data, such as, e.g., -dimensional array images or meshes of cortical surfaces. The gains of SPCA-TV over unstructured approaches (such as SPCA and ElasticNet PCA) or structured approach (such as GraphNet PCA) are significant, since SPCA-TV reveals the variability within a data set in the form of intelligible brain patterns that are easier to interpret and more stable across different samples.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Koch, C.D.; Pirkle, F.L.; Schmidt, J.S.
1981-01-01
A Principal Components Analysis (PCA) has been written to aid in the interpretation of multivariate aerial radiometric data collected by the US Department of Energy (DOE) under the National Uranium Resource Evaluation (NURE) program. The variations exhibited by these data have been reduced and classified into a number of linear combinations by using the PCA program. The PCA program then generates histograms and outlier maps of the individual variates. Black and white plots can be made on a Calcomp plotter by the application of follow-up programs. All programs referred to in this guide were written for a DEC-10. From thismore » analysis a geologist may begin to interpret the data structure. Insight into geological processes underlying the data may be obtained.« less
Classification of fMRI resting-state maps using machine learning techniques: A comparative study
NASA Astrophysics Data System (ADS)
Gallos, Ioannis; Siettos, Constantinos
2017-11-01
We compare the efficiency of Principal Component Analysis (PCA) and nonlinear learning manifold algorithms (ISOMAP and Diffusion maps) for classifying brain maps between groups of schizophrenia patients and healthy from fMRI scans during a resting-state experiment. After a standard pre-processing pipeline, we applied spatial Independent component analysis (ICA) to reduce (a) noise and (b) spatial-temporal dimensionality of fMRI maps. On the cross-correlation matrix of the ICA components, we applied PCA, ISOMAP and Diffusion Maps to find an embedded low-dimensional space. Finally, support-vector-machines (SVM) and k-NN algorithms were used to evaluate the performance of the algorithms in classifying between the two groups.
Perturbational formulation of principal component analysis in molecular dynamics simulation.
Koyama, Yohei M; Kobayashi, Tetsuya J; Tomoda, Shuji; Ueda, Hiroki R
2008-10-01
Conformational fluctuations of a molecule are important to its function since such intrinsic fluctuations enable the molecule to respond to the external environmental perturbations. For extracting large conformational fluctuations, which predict the primary conformational change by the perturbation, principal component analysis (PCA) has been used in molecular dynamics simulations. However, several versions of PCA, such as Cartesian coordinate PCA and dihedral angle PCA (dPCA), are limited to use with molecules with a single dominant state or proteins where the dihedral angle represents an important internal coordinate. Other PCAs with general applicability, such as the PCA using pairwise atomic distances, do not represent the physical meaning clearly. Therefore, a formulation that provides general applicability and clearly represents the physical meaning is yet to be developed. For developing such a formulation, we consider the conformational distribution change by the perturbation with arbitrary linearly independent perturbation functions. Within the second order approximation of the Kullback-Leibler divergence by the perturbation, the PCA can be naturally interpreted as a method for (1) decomposing a given perturbation into perturbations that independently contribute to the conformational distribution change or (2) successively finding the perturbation that induces the largest conformational distribution change. In this perturbational formulation of PCA, (i) the eigenvalue measures the Kullback-Leibler divergence from the unperturbed to perturbed distributions, (ii) the eigenvector identifies the combination of the perturbation functions, and (iii) the principal component determines the probability change induced by the perturbation. Based on this formulation, we propose a PCA using potential energy terms, and we designate it as potential energy PCA (PEPCA). The PEPCA provides both general applicability and clear physical meaning. For demonstrating its power, we apply the PEPCA to an alanine dipeptide molecule in vacuum as a minimal model of a nonsingle dominant conformational biomolecule. The first and second principal components clearly characterize two stable states and the transition state between them. Positive and negative components with larger absolute values of the first and second eigenvectors identify the electrostatic interactions, which stabilize or destabilize each stable state and the transition state. Our result therefore indicates that PCA can be applied, by carefully selecting the perturbation functions, not only to identify the molecular conformational fluctuation but also to predict the conformational distribution change by the perturbation beyond the limitation of the previous methods.
Perturbational formulation of principal component analysis in molecular dynamics simulation
NASA Astrophysics Data System (ADS)
Koyama, Yohei M.; Kobayashi, Tetsuya J.; Tomoda, Shuji; Ueda, Hiroki R.
2008-10-01
Conformational fluctuations of a molecule are important to its function since such intrinsic fluctuations enable the molecule to respond to the external environmental perturbations. For extracting large conformational fluctuations, which predict the primary conformational change by the perturbation, principal component analysis (PCA) has been used in molecular dynamics simulations. However, several versions of PCA, such as Cartesian coordinate PCA and dihedral angle PCA (dPCA), are limited to use with molecules with a single dominant state or proteins where the dihedral angle represents an important internal coordinate. Other PCAs with general applicability, such as the PCA using pairwise atomic distances, do not represent the physical meaning clearly. Therefore, a formulation that provides general applicability and clearly represents the physical meaning is yet to be developed. For developing such a formulation, we consider the conformational distribution change by the perturbation with arbitrary linearly independent perturbation functions. Within the second order approximation of the Kullback-Leibler divergence by the perturbation, the PCA can be naturally interpreted as a method for (1) decomposing a given perturbation into perturbations that independently contribute to the conformational distribution change or (2) successively finding the perturbation that induces the largest conformational distribution change. In this perturbational formulation of PCA, (i) the eigenvalue measures the Kullback-Leibler divergence from the unperturbed to perturbed distributions, (ii) the eigenvector identifies the combination of the perturbation functions, and (iii) the principal component determines the probability change induced by the perturbation. Based on this formulation, we propose a PCA using potential energy terms, and we designate it as potential energy PCA (PEPCA). The PEPCA provides both general applicability and clear physical meaning. For demonstrating its power, we apply the PEPCA to an alanine dipeptide molecule in vacuum as a minimal model of a nonsingle dominant conformational biomolecule. The first and second principal components clearly characterize two stable states and the transition state between them. Positive and negative components with larger absolute values of the first and second eigenvectors identify the electrostatic interactions, which stabilize or destabilize each stable state and the transition state. Our result therefore indicates that PCA can be applied, by carefully selecting the perturbation functions, not only to identify the molecular conformational fluctuation but also to predict the conformational distribution change by the perturbation beyond the limitation of the previous methods.
Binding Isotherms and Time Courses Readily from Magnetic Resonance.
Xu, Jia; Van Doren, Steven R
2016-08-16
Evidence is presented that binding isotherms, simple or biphasic, can be extracted directly from noninterpreted, complex 2D NMR spectra using principal component analysis (PCA) to reveal the largest trend(s) across the series. This approach renders peak picking unnecessary for tracking population changes. In 1:1 binding, the first principal component captures the binding isotherm from NMR-detected titrations in fast, slow, and even intermediate and mixed exchange regimes, as illustrated for phospholigand associations with proteins. Although the sigmoidal shifts and line broadening of intermediate exchange distorts binding isotherms constructed conventionally, applying PCA directly to these spectra along with Pareto scaling overcomes the distortion. Applying PCA to time-domain NMR data also yields binding isotherms from titrations in fast or slow exchange. The algorithm readily extracts from magnetic resonance imaging movie time courses such as breathing and heart rate in chest imaging. Similarly, two-step binding processes detected by NMR are easily captured by principal components 1 and 2. PCA obviates the customary focus on specific peaks or regions of images. Applying it directly to a series of complex data will easily delineate binding isotherms, equilibrium shifts, and time courses of reactions or fluctuations.
Mapping brain activity in gradient-echo functional MRI using principal component analysis
NASA Astrophysics Data System (ADS)
Khosla, Deepak; Singh, Manbir; Don, Manuel
1997-05-01
The detection of sites of brain activation in functional MRI has been a topic of immense research interest and many technique shave been proposed to this end. Recently, principal component analysis (PCA) has been applied to extract the activated regions and their time course of activation. This method is based on the assumption that the activation is orthogonal to other signal variations such as brain motion, physiological oscillations and other uncorrelated noises. A distinct advantage of this method is that it does not require any knowledge of the time course of the true stimulus paradigm. This technique is well suited to EPI image sequences where the sampling rate is high enough to capture the effects of physiological oscillations. In this work, we propose and apply tow methods that are based on PCA to conventional gradient-echo images and investigate their usefulness as tools to extract reliable information on brain activation. The first method is a conventional technique where a single image sequence with alternating on and off stages is subject to a principal component analysis. The second method is a PCA-based approach called the common spatial factor analysis technique (CSF). As the name suggests, this method relies on common spatial factors between the above fMRI image sequence and a background fMRI. We have applied these methods to identify active brain ares during visual stimulation and motor tasks. The results from these methods are compared to those obtained by using the standard cross-correlation technique. We found good agreement in the areas identified as active across all three techniques. The results suggest that PCA and CSF methods have good potential in detecting the true stimulus correlated changes in the presence of other interfering signals.
Unsupervised analysis of small animal dynamic Cerenkov luminescence imaging
NASA Astrophysics Data System (ADS)
Spinelli, Antonello E.; Boschi, Federico
2011-12-01
Clustering analysis (CA) and principal component analysis (PCA) were applied to dynamic Cerenkov luminescence images (dCLI). In order to investigate the performances of the proposed approaches, two distinct dynamic data sets obtained by injecting mice with 32P-ATP and 18F-FDG were acquired using the IVIS 200 optical imager. The k-means clustering algorithm has been applied to dCLI and was implemented using interactive data language 8.1. We show that cluster analysis allows us to obtain good agreement between the clustered and the corresponding emission regions like the bladder, the liver, and the tumor. We also show a good correspondence between the time activity curves of the different regions obtained by using CA and manual region of interest analysis on dCLIT and PCA images. We conclude that CA provides an automatic unsupervised method for the analysis of preclinical dynamic Cerenkov luminescence image data.
Bravo, Ignacio; Mazo, Manuel; Lázaro, José L.; Gardel, Alfredo; Jiménez, Pedro; Pizarro, Daniel
2010-01-01
This paper presents a complete implementation of the Principal Component Analysis (PCA) algorithm in Field Programmable Gate Array (FPGA) devices applied to high rate background segmentation of images. The classical sequential execution of different parts of the PCA algorithm has been parallelized. This parallelization has led to the specific development and implementation in hardware of the different stages of PCA, such as computation of the correlation matrix, matrix diagonalization using the Jacobi method and subspace projections of images. On the application side, the paper presents a motion detection algorithm, also entirely implemented on the FPGA, and based on the developed PCA core. This consists of dynamically thresholding the differences between the input image and the one obtained by expressing the input image using the PCA linear subspace previously obtained as a background model. The proposal achieves a high ratio of processed images (up to 120 frames per second) and high quality segmentation results, with a completely embedded and reliable hardware architecture based on commercial CMOS sensors and FPGA devices. PMID:22163406
Bravo, Ignacio; Mazo, Manuel; Lázaro, José L; Gardel, Alfredo; Jiménez, Pedro; Pizarro, Daniel
2010-01-01
This paper presents a complete implementation of the Principal Component Analysis (PCA) algorithm in Field Programmable Gate Array (FPGA) devices applied to high rate background segmentation of images. The classical sequential execution of different parts of the PCA algorithm has been parallelized. This parallelization has led to the specific development and implementation in hardware of the different stages of PCA, such as computation of the correlation matrix, matrix diagonalization using the Jacobi method and subspace projections of images. On the application side, the paper presents a motion detection algorithm, also entirely implemented on the FPGA, and based on the developed PCA core. This consists of dynamically thresholding the differences between the input image and the one obtained by expressing the input image using the PCA linear subspace previously obtained as a background model. The proposal achieves a high ratio of processed images (up to 120 frames per second) and high quality segmentation results, with a completely embedded and reliable hardware architecture based on commercial CMOS sensors and FPGA devices.
NASA Technical Reports Server (NTRS)
Boyd, R. K.; Brumfield, J. O.; Campbell, W. J.
1984-01-01
Three feature extraction methods, canonical analysis (CA), principal component analysis (PCA), and band selection, have been applied to Thematic Mapper Simulator (TMS) data in order to evaluate the relative performance of the methods. The results obtained show that CA is capable of providing a transformation of TMS data which leads to better classification results than provided by all seven bands, by PCA, or by band selection. A second conclusion drawn from the study is that TMS bands 2, 3, 4, and 7 (thermal) are most important for landcover classification.
Gao, Lin; Zhang, Tongsheng; Wang, Jue; Stephen, Julia
2014-01-01
When connectivity analysis is carried out for event related EEG and MEG, the presence of strong spatial correlations from spontaneous activity in background may mask the local neuronal evoked activity and lead to spurious connections. In this paper, we hypothesized PCA decomposition could be used to diminish the background activity and further improve the performance of connectivity analysis in event related experiments. The idea was tested using simulation, where we found that for the 306-channel Elekta Neuromag system, the first 4 PCs represent the dominant background activity, and the source connectivity pattern after preprocessing is consistent with the true connectivity pattern designed in the simulation. Improving signal to noise of the evoked responses by discarding the first few PCs demonstrates increased coherences at major physiological frequency bands when removing the first few PCs. Furthermore, the evoked information was maintained after PCA preprocessing. In conclusion, it is demonstrated that the first few PCs represent background activity, and PCA decomposition can be employed to remove it to expose the evoked activity for the channels under investigation. Therefore, PCA can be applied as a preprocessing approach to improve neuronal connectivity analysis for event related data. PMID:22918837
Gao, Lin; Zhang, Tongsheng; Wang, Jue; Stephen, Julia
2013-04-01
When connectivity analysis is carried out for event related EEG and MEG, the presence of strong spatial correlations from spontaneous activity in background may mask the local neuronal evoked activity and lead to spurious connections. In this paper, we hypothesized PCA decomposition could be used to diminish the background activity and further improve the performance of connectivity analysis in event related experiments. The idea was tested using simulation, where we found that for the 306-channel Elekta Neuromag system, the first 4 PCs represent the dominant background activity, and the source connectivity pattern after preprocessing is consistent with the true connectivity pattern designed in the simulation. Improving signal to noise of the evoked responses by discarding the first few PCs demonstrates increased coherences at major physiological frequency bands when removing the first few PCs. Furthermore, the evoked information was maintained after PCA preprocessing. In conclusion, it is demonstrated that the first few PCs represent background activity, and PCA decomposition can be employed to remove it to expose the evoked activity for the channels under investigation. Therefore, PCA can be applied as a preprocessing approach to improve neuronal connectivity analysis for event related data.
Plazas-Nossa, Leonardo; Hofer, Thomas; Gruber, Günter; Torres, Andres
2017-02-01
This work proposes a methodology for the forecasting of online water quality data provided by UV-Vis spectrometry. Therefore, a combination of principal component analysis (PCA) to reduce the dimensionality of a data set and artificial neural networks (ANNs) for forecasting purposes was used. The results obtained were compared with those obtained by using discrete Fourier transform (DFT). The proposed methodology was applied to four absorbance time series data sets composed by a total number of 5705 UV-Vis spectra. Absolute percentage errors obtained by applying the proposed PCA/ANN methodology vary between 10% and 13% for all four study sites. In general terms, the results obtained were hardly generalizable, as they appeared to be highly dependent on specific dynamics of the water system; however, some trends can be outlined. PCA/ANN methodology gives better results than PCA/DFT forecasting procedure by using a specific spectra range for the following conditions: (i) for Salitre wastewater treatment plant (WWTP) (first hour) and Graz West R05 (first 18 min), from the last part of UV range to all visible range; (ii) for Gibraltar pumping station (first 6 min) for all UV-Vis absorbance spectra; and (iii) for San Fernando WWTP (first 24 min) for all of UV range to middle part of visible range.
Steingass, Christof Björn; Jutzi, Manfred; Müller, Jenny; Carle, Reinhold; Schmarr, Hans-Georg
2015-03-01
Ripening-dependent changes of pineapple volatiles were studied in a nontargeted profiling analysis. Volatiles were isolated via headspace solid phase microextraction and analyzed by comprehensive 2D gas chromatography and mass spectrometry (HS-SPME-GC×GC-qMS). Profile patterns presented in the contour plots were evaluated applying image processing techniques and subsequent multivariate statistical data analysis. Statistical methods comprised unsupervised hierarchical cluster analysis (HCA) and principal component analysis (PCA) to classify the samples. Supervised partial least squares discriminant analysis (PLS-DA) and partial least squares (PLS) regression were applied to discriminate different ripening stages and describe the development of volatiles during postharvest storage, respectively. Hereby, substantial chemical markers allowing for class separation were revealed. The workflow permitted the rapid distinction between premature green-ripe pineapples and postharvest-ripened sea-freighted fruits. Volatile profiles of fully ripe air-freighted pineapples were similar to those of green-ripe fruits postharvest ripened for 6 days after simulated sea freight export, after PCA with only two principal components. However, PCA considering also the third principal component allowed differentiation between air-freighted fruits and the four progressing postharvest maturity stages of sea-freighted pineapples.
Multivariate analysis for scanning tunneling spectroscopy data
NASA Astrophysics Data System (ADS)
Yamanishi, Junsuke; Iwase, Shigeru; Ishida, Nobuyuki; Fujita, Daisuke
2018-01-01
We applied principal component analysis (PCA) to two-dimensional tunneling spectroscopy (2DTS) data obtained on a Si(111)-(7 × 7) surface to explore the effectiveness of multivariate analysis for interpreting 2DTS data. We demonstrated that several components that originated mainly from specific atoms at the Si(111)-(7 × 7) surface can be extracted by PCA. Furthermore, we showed that hidden components in the tunneling spectra can be decomposed (peak separation), which is difficult to achieve with normal 2DTS analysis without the support of theoretical calculations. Our analysis showed that multivariate analysis can be an additional powerful way to analyze 2DTS data and extract hidden information from a large amount of spectroscopic data.
SESNPCA: Principal Component Analysis Applied to Stripped-Envelope Core-Collapse Supernovae
NASA Astrophysics Data System (ADS)
Williamson, Marc; Bianco, Federica; Modjaz, Maryam
2018-01-01
In the new era of time-domain astronomy, it will become increasingly important to have rigorous, data driven models for classifying transients, including supernovae (SNe). We present the first application of principal component analysis (PCA) to stripped-envelope core-collapse supernovae (SESNe). Previous studies of SNe types Ib, IIb, Ic, and broad-line Ic (Ic-BL) focus only on specific spectral features, while our PCA algorithm uses all of the information contained in each spectrum. We use one of the largest compiled datasets of SESNe, containing over 150 SNe, each with spectra taken at multiple phases. Our work focuses on 49 SNe with spectra taken 15 ± 5 days after maximum V-band light where better distinctions can be made between SNe type Ib and Ic spectra. We find that spectra of SNe type IIb and Ic-BL are separable from the other types in PCA space, indicating that PCA is a promising option for developing a purely data driven model for SESNe classification.
Health status monitoring for ICU patients based on locally weighted principal component analysis.
Ding, Yangyang; Ma, Xin; Wang, Youqing
2018-03-01
Intelligent status monitoring for critically ill patients can help medical stuff quickly discover and assess the changes of disease and then make appropriate treatment strategy. However, general-type monitoring model now widely used is difficult to adapt the changes of intensive care unit (ICU) patients' status due to its fixed pattern, and a more robust, efficient and fast monitoring model should be developed to the individual. A data-driven learning approach combining locally weighted projection regression (LWPR) and principal component analysis (PCA) is firstly proposed and applied to monitor the nonlinear process of patients' health status in ICU. LWPR is used to approximate the complex nonlinear process with local linear models, in which PCA could be further applied to status monitoring, and finally a global weighted statistic will be acquired for detecting the possible abnormalities. Moreover, some improved versions are developed, such as LWPR-MPCA and LWPR-JPCA, which also have superior performance. Eighteen subjects were selected from the Physiobank's Multi-parameter Intelligent Monitoring for Intensive Care II (MIMIC II) database, and two vital signs of each subject were chosen for online monitoring. The proposed method was compared with several existing methods including traditional PCA, Partial least squares (PLS), just in time learning combined with modified PCA (L-PCA), and Kernel PCA (KPCA). The experimental results demonstrated that the mean fault detection rate (FDR) of PCA can be improved by 41.7% after adding LWPR. The mean FDR of LWPR-MPCA was increased by 8.3%, compared with the latest reported method L-PCA. Meanwhile, LWPR spent less training time than others, especially KPCA. LWPR is first introduced into ICU patients monitoring and achieves the best monitoring performance including adaptability to changes in patient status, sensitivity for abnormality detection as well as its fast learning speed and low computational complexity. The algorithm is an excellent approach to establishing a personalized model for patients, which is the mainstream direction of modern medicine in the following development, as well as improving the global monitoring performance. Copyright © 2017 Elsevier Ireland Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Selle, B.; Schwientek, M.
2012-04-01
Water quality of ground and surface waters in catchments is typically driven by many complex and interacting processes. While small scale processes are often studied in great detail, their relevance and interplay at catchment scales remain often poorly understood. For many catchments, extensive monitoring data on water quality have been collected for different purposes. These heterogeneous data sets contain valuable information on catchment scale processes but are rarely analysed using integrated methods. Principle component analysis (PCA) has previously been applied to this kind of data sets. However, a detailed analysis of scores, which are an important result of a PCA, is often missing. Mathematically, PCA expresses measured variables on water quality, e.g. nitrate concentrations, as linear combination of independent, not directly observable key processes. These computed key processes are represented by principle components. Their scores are interpretable as process intensities which vary in space and time. Subsequently, scores can be correlated with other key variables and catchment characteristics, such as water travel times and land use that were not considered in PCA. This detailed analysis of scores represents an extension of the commonly applied PCA which could considerably improve the understanding of processes governing water quality at catchment scales. In this study, we investigated the 170 km2 Ammer catchment in SW Germany which is characterised by an above average proportion of agricultural (71%) and urban (17%) areas. The Ammer River is mainly fed by karstic springs. For PCA, we separately analysed concentrations from (a) surface waters of the Ammer River and its tributaries, (b) spring waters from the main aquifers and (c) deep groundwater from production wells. This analysis was extended by a detailed analysis of scores. We analysed measured concentrations on major ions and selected organic micropollutants. Additionally, redox-sensitive variables and environmental tracers indicating groundwater age were analysed for deep groundwater from production wells. For deep groundwater, we found that microbial turnover was stronger influenced by local availability of energy sources than by travel times of groundwater to the wells. Groundwater quality primarily reflected the input of pollutants determined by landuse, e.g. agrochemicals. We concluded that for water quality in the Ammer catchment, conservative mixing of waters with different origin is more important than reactive transport processes along the flow path.
Söhn, Matthias; Alber, Markus; Yan, Di
2007-09-01
The variability of dose-volume histogram (DVH) shapes in a patient population can be quantified using principal component analysis (PCA). We applied this to rectal DVHs of prostate cancer patients and investigated the correlation of the PCA parameters with late bleeding. PCA was applied to the rectal wall DVHs of 262 patients, who had been treated with a four-field box, conformal adaptive radiotherapy technique. The correlated changes in the DVH pattern were revealed as "eigenmodes," which were ordered by their importance to represent data set variability. Each DVH is uniquely characterized by its principal components (PCs). The correlation of the first three PCs and chronic rectal bleeding of Grade 2 or greater was investigated with uni- and multivariate logistic regression analyses. Rectal wall DVHs in four-field conformal RT can primarily be represented by the first two or three PCs, which describe approximately 94% or 96% of the DVH shape variability, respectively. The first eigenmode models the total irradiated rectal volume; thus, PC1 correlates to the mean dose. Mode 2 describes the interpatient differences of the relative rectal volume in the two- or four-field overlap region. Mode 3 reveals correlations of volumes with intermediate doses ( approximately 40-45 Gy) and volumes with doses >70 Gy; thus, PC3 is associated with the maximal dose. According to univariate logistic regression analysis, only PC2 correlated significantly with toxicity. However, multivariate logistic regression analysis with the first two or three PCs revealed an increased probability of bleeding for DVHs with more than one large PC. PCA can reveal the correlation structure of DVHs for a patient population as imposed by the treatment technique and provide information about its relationship to toxicity. It proves useful for augmenting normal tissue complication probability modeling approaches.
Moura, Felipe Arruda; Santana, Juliana Exel; Vieira, Nathália Arnosti; Santiago, Paulo Roberto Pereira; Cunha, Sergio Augusto
2015-01-01
The purpose of this study was to analyse players’ positional variability during the 2012 UEFA European Championship by applying principal component analysis (PCA) to data gathered from heat maps posted on the UEFA website. We analysed the teams that reached the finals and semi-finals of the competition. The players’ 2D coordinates from each match were obtained by applying an image-processing algorithm to the heat maps. With all the players’ 2D coordinates for each match, we applied PCA to identify the directions of greatest variability. Then, two orthogonal segments were centred on each player’s mean position for all matches. The segments’ directions were driven by the eigenvectors of the PCA, and the length of each segment was defined as one standard deviation around the mean. Finally, an ellipse was circumscribed around both segments. To represent player variability, segment lengths and elliptical areas were analysed. The results demonstrate that Portugal exhibited the lowest variability, followed by Germany, Spain and Italy. Additionally, a graphical representation of every player’s ellipse provided insight into the teams’ organisational features throughout the competition. The presented study provides important information regarding soccer teams’ tactical strategy in high-level championships that allows coaches to better control team organisation on the pitch. PMID:26557206
Liu, Changhong; Liu, Wei; Lu, Xuzhong; Chen, Wei; Yang, Jianbo; Zheng, Lei
2014-06-15
Crop-to-crop transgene flow may affect the seed purity of non-transgenic rice varieties, resulting in unwanted biosafety consequences. The feasibility of a rapid and nondestructive determination of transgenic rice seeds from its non-transgenic counterparts was examined by using multispectral imaging system combined with chemometric data analysis. Principal component analysis (PCA), partial least squares discriminant analysis (PLSDA), least squares-support vector machines (LS-SVM), and PCA-back propagation neural network (PCA-BPNN) methods were applied to classify rice seeds according to their genetic origins. The results demonstrated that clear differences between non-transgenic and transgenic rice seeds could be easily visualized with the nondestructive determination method developed through this study and an excellent classification (up to 100% with LS-SVM model) can be achieved. It is concluded that multispectral imaging together with chemometric data analysis is a promising technique to identify transgenic rice seeds with high efficiency, providing bright prospects for future applications. Copyright © 2013 Elsevier Ltd. All rights reserved.
Analysis of antique bronze coins by Laser Induced Breakdown Spectroscopy and multivariate analysis
NASA Astrophysics Data System (ADS)
Bachler, M. Orlić; Bišćan, M.; Kregar, Z.; Jelovica Badovinac, I.; Dobrinić, J.; Milošević, S.
2016-09-01
This work presents a feasibility study of applying the Principal Component Analysis (PCA) to data obtained by Laser-Induced Breakdown Spectroscopy (LIBS) with the aim of determining correlation between different samples. The samples were antique bronze coins coated in silver (follis) dated in the Roman Empire period and were made during different rulers in different mints. While raw LIBS data revealed that in the period from the year 286 to 383 CE content of silver was constantly decreasing, the PCA showed that the samples can be somewhat grouped together based on their place of origin, which could be a useful hint when analysing unknown samples. It was also found that PCA can help in discriminating spectra corresponding to ablation from the surface and from the bulk. Furthermore, Partial Least Squares method (PLS) was used to obtain, based on a set of samples with known composition, an estimation of relative copper concentration in studied ancient coins. This analysis showed that copper concentration in surface layers ranged from 83% to 90%.
NASA Astrophysics Data System (ADS)
Oh, Han Bin; Leach, Franklin E.; Arungundram, Sailaja; Al-Mafraji, Kanar; Venot, Andre; Boons, Geert-Jan; Amster, I. Jonathan
2011-03-01
The structural characterization of glycosaminoglycan (GAG) carbohydrates by mass spectrometry has been a long-standing analytical challenge due to the inherent heterogeneity of these biomolecules, specifically polydispersity, variability in sulfation, and hexuronic acid stereochemistry. Recent advances in tandem mass spectrometry methods employing threshold and electron-based ion activation have resulted in the ability to determine the location of the labile sulfate modification as well as assign the stereochemistry of hexuronic acid residues. To facilitate the analysis of complex electron detachment dissociation (EDD) spectra, principal component analysis (PCA) is employed to differentiate the hexuronic acid stereochemistry of four synthetic GAG epimers whose EDD spectra are nearly identical upon visual inspection. For comparison, PCA is also applied to infrared multiphoton dissociation spectra (IRMPD) of the examined epimers. To assess the applicability of multivariate methods in GAG mixture analysis, PCA is utilized to identify the relative content of two epimers in a binary mixture.
Niu, Yue; Zhang, Ling; Bi, Xing; Yuan, Shuai; Chen, Peng
2016-03-05
To detect the expression of vitronectin (VTN) in the tissues and blood serum of prostate cancer (PCa) patients, and evaluate its clinical significance and to evaluate the significance of the combined assay of VTN and prostate specific antigens (PSA) in PCa diagnosis. To detect the expression of VTN as a potential marker for PCa diagnosis and prognosis, immunohistochemistry was performed on the tissues of 32 patients with metastatic PCa (PCaM), 34 patients with PCa without metastasis (PCa), and 41 patients with benign prostatic hyperplasia (BPH). The sera were then subjected to Western blot analysis. All cases were subsequently examined to determine the concentrations of PSA and VTN in the sera. The collected data were collated and analyzed. The positive expression rates of VTN in the tissues of the BPH and PCa groups (including PCa and PCaM groups) were 75.61% and 45.45%, respectively (P = .005). VTN was more highly expressed in the sera of the BPH patients (0.83 ± 0.07) than in the sera of the PCa patients (0.65 ± 0.06) (P < .05). It was also more highly expressed in the sera of the PCa patients than in the sera of the PCaM patients (0.35 ± 0.08) (P < .05). In the diagnosis of BPH and PCa, the Youden indexes of PSA detection, VTN detection, and combined detection were 0.2620, 0.3468, and 0.5635; the kappa values were 0.338, 0.304, and 0.448, respectively, and the areas under the receiver operating characteristic curve were 0.625, 0.673, and 0.703 (P < .05), respectively. VTN levels in sera may be used as a potential marker of PCa for the diagnosis and assessment of disease progression and metastasis. The combined detection of VTN and PSA in sera can be clinically applied in PCa diagnosis. .
Clustering analysis strategies for electron energy loss spectroscopy (EELS).
Torruella, Pau; Estrader, Marta; López-Ortega, Alberto; Baró, Maria Dolors; Varela, Maria; Peiró, Francesca; Estradé, Sònia
2018-02-01
In this work, the use of cluster analysis algorithms, widely applied in the field of big data, is proposed to explore and analyze electron energy loss spectroscopy (EELS) data sets. Three different data clustering approaches have been tested both with simulated and experimental data from Fe 3 O 4 /Mn 3 O 4 core/shell nanoparticles. The first method consists on applying data clustering directly to the acquired spectra. A second approach is to analyze spectral variance with principal component analysis (PCA) within a given data cluster. Lastly, data clustering on PCA score maps is discussed. The advantages and requirements of each approach are studied. Results demonstrate how clustering is able to recover compositional and oxidation state information from EELS data with minimal user input, giving great prospects for its usage in EEL spectroscopy. Copyright © 2017 Elsevier B.V. All rights reserved.
Diagnostics and Active Control of Aircraft Interior Noise
NASA Technical Reports Server (NTRS)
Fuller, C. R.
1998-01-01
This project deals with developing advanced methods for investigating and controlling interior noise in aircraft. The work concentrates on developing and applying the techniques of Near Field Acoustic Holography (NAH) and Principal Component Analysis (PCA) to the aircraft interior noise dynamic problem. This involves investigating the current state of the art, developing new techniques and then applying them to the particular problem being studied. The knowledge gained under the first part of the project was then used to develop and apply new, advanced noise control techniques for reducing interior noise. A new fully active control approach based on the PCA was developed and implemented on a test cylinder. Finally an active-passive approach based on tunable vibration absorbers was to be developed and analytically applied to a range of test structures from simple plates to aircraft fuselages.
PCA based clustering for brain tumor segmentation of T1w MRI images.
Kaya, Irem Ersöz; Pehlivanlı, Ayça Çakmak; Sekizkardeş, Emine Gezmez; Ibrikci, Turgay
2017-03-01
Medical images are huge collections of information that are difficult to store and process consuming extensive computing time. Therefore, the reduction techniques are commonly used as a data pre-processing step to make the image data less complex so that a high-dimensional data can be identified by an appropriate low-dimensional representation. PCA is one of the most popular multivariate methods for data reduction. This paper is focused on T1-weighted MRI images clustering for brain tumor segmentation with dimension reduction by different common Principle Component Analysis (PCA) algorithms. Our primary aim is to present a comparison between different variations of PCA algorithms on MRIs for two cluster methods. Five most common PCA algorithms; namely the conventional PCA, Probabilistic Principal Component Analysis (PPCA), Expectation Maximization Based Principal Component Analysis (EM-PCA), Generalize Hebbian Algorithm (GHA), and Adaptive Principal Component Extraction (APEX) were applied to reduce dimensionality in advance of two clustering algorithms, K-Means and Fuzzy C-Means. In the study, the T1-weighted MRI images of the human brain with brain tumor were used for clustering. In addition to the original size of 512 lines and 512 pixels per line, three more different sizes, 256 × 256, 128 × 128 and 64 × 64, were included in the study to examine their effect on the methods. The obtained results were compared in terms of both the reconstruction errors and the Euclidean distance errors among the clustered images containing the same number of principle components. According to the findings, the PPCA obtained the best results among all others. Furthermore, the EM-PCA and the PPCA assisted K-Means algorithm to accomplish the best clustering performance in the majority as well as achieving significant results with both clustering algorithms for all size of T1w MRI images. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Low-Dimensional Feature Representation for Instrument Identification
NASA Astrophysics Data System (ADS)
Ihara, Mizuki; Maeda, Shin-Ichi; Ikeda, Kazushi; Ishii, Shin
For monophonic music instrument identification, various feature extraction and selection methods have been proposed. One of the issues toward instrument identification is that the same spectrum is not always observed even in the same instrument due to the difference of the recording condition. Therefore, it is important to find non-redundant instrument-specific features that maintain information essential for high-quality instrument identification to apply them to various instrumental music analyses. For such a dimensionality reduction method, the authors propose the utilization of linear projection methods: local Fisher discriminant analysis (LFDA) and LFDA combined with principal component analysis (PCA). After experimentally clarifying that raw power spectra are actually good for instrument classification, the authors reduced the feature dimensionality by LFDA or by PCA followed by LFDA (PCA-LFDA). The reduced features achieved reasonably high identification performance that was comparable or higher than those by the power spectra and those achieved by other existing studies. These results demonstrated that our LFDA and PCA-LFDA can successfully extract low-dimensional instrument features that maintain the characteristic information of the instruments.
Osis, Sean T; Hettinga, Blayne A; Leitch, Jessica; Ferber, Reed
2014-08-22
As 3-dimensional (3D) motion-capture for clinical gait analysis continues to evolve, new methods must be developed to improve the detection of gait cycle events based on kinematic data. Recently, the application of principal component analysis (PCA) to gait data has shown promise in detecting important biomechanical features. Therefore, the purpose of this study was to define a new foot strike detection method for a continuum of striking techniques, by applying PCA to joint angle waveforms. In accordance with Newtonian mechanics, it was hypothesized that transient features in the sagittal-plane accelerations of the lower extremity would be linked with the impulsive application of force to the foot at foot strike. Kinematic and kinetic data from treadmill running were selected for 154 subjects, from a database of gait biomechanics. Ankle, knee and hip sagittal plane angular acceleration kinematic curves were chained together to form a row input to a PCA matrix. A linear polynomial was calculated based on PCA scores, and a 10-fold cross-validation was performed to evaluate prediction accuracy against gold-standard foot strike as determined by a 10 N rise in the vertical ground reaction force. Results show 89-94% of all predicted foot strikes were within 4 frames (20 ms) of the gold standard with the largest error being 28 ms. It is concluded that this new foot strike detection is an improvement on existing methods and can be applied regardless of whether the runner exhibits a rearfoot, midfoot, or forefoot strike pattern. Copyright © 2014 Elsevier Ltd. All rights reserved.
Liu, Wei; Wang, Zhen-Zhong; Qing, Jian-Ping; Li, Hong-Juan; Xiao, Wei
2014-01-01
Background: Peach kernels which contain kinds of fatty acids play an important role in the regulation of a variety of physiological and biological functions. Objective: To establish an innovative and rapid diffuse reflectance near-infrared spectroscopy (DR-NIR) analysis method along with chemometric techniques for the qualitative and quantitative determination of a peach kernel. Materials and Methods: Peach kernel samples from nine different origins were analyzed with high-performance liquid chromatography (HPLC) as a reference method. DR-NIR is in the spectral range 1100-2300 nm. Principal component analysis (PCA) and partial least squares regression (PLSR) algorithm were applied to obtain prediction models, The Savitzky-Golay derivative and first derivative were adopted for the spectral pre-processing, PCA was applied to classify the varieties of those samples. For the quantitative calibration, the models of linoleic and oleinic acids were established with the PLSR algorithm and the optimal principal component (PC) numbers were selected with leave-one-out (LOO) cross-validation. The established models were evaluated with the root mean square error of deviation (RMSED) and corresponding correlation coefficients (R2). Results: The PCA results of DR-NIR spectra yield clear classification of the two varieties of peach kernel. PLSR had a better predictive ability. The correlation coefficients of the two calibration models were above 0.99, and the RMSED of linoleic and oleinic acids were 1.266% and 1.412%, respectively. Conclusion: The DR-NIR combined with PCA and PLSR algorithm could be used efficiently to identify and quantify peach kernels and also help to solve variety problem. PMID:25422544
Mari, Angela; Montoro, Paola; Pizza, Cosimo; Piacente, Sonia
2012-11-01
A validated analytical method for the quantitative determination of seven chemical markers occurring in a hydroalcoholic extract of Vitex agnus-castus fruits by liquid chromatography electrospray triple quadrupole tandem mass spectrometry (LC/ESI/(QqQ)MSMS) is reported. To carry out a comparative study, five commercial food supplements corresponding to hydroalcoholic extracts of V. agnus-castus fruits were analysed under the same chromatographic conditions of the crude extract. Principal component analysis (PCA), based only on the variation of the amount of the seven chemical markers, was applied in order to find similarities between the hydroalcoholic extract and the food supplements. A second PCA analysis was carried out considering the whole spectroscopic data deriving from liquid chromatography electrospray linear ion trap mass spectrometry (LC/ESI/(LIT)MS) analysis. High similarity between the two PCA was observed, showing the possibility to select one of these two approaches for future applications in the field of comparative analysis of food supplements and quality control procedures. Copyright © 2012 Elsevier B.V. All rights reserved.
Variability search in M 31 using principal component analysis and the Hubble Source Catalogue
NASA Astrophysics Data System (ADS)
Moretti, M. I.; Hatzidimitriou, D.; Karampelas, A.; Sokolovsky, K. V.; Bonanos, A. Z.; Gavras, P.; Yang, M.
2018-06-01
Principal component analysis (PCA) is being extensively used in Astronomy but not yet exhaustively exploited for variability search. The aim of this work is to investigate the effectiveness of using the PCA as a method to search for variable stars in large photometric data sets. We apply PCA to variability indices computed for light curves of 18 152 stars in three fields in M 31 extracted from the Hubble Source Catalogue. The projection of the data into the principal components is used as a stellar variability detection and classification tool, capable of distinguishing between RR Lyrae stars, long-period variables (LPVs) and non-variables. This projection recovered more than 90 per cent of the known variables and revealed 38 previously unknown variable stars (about 30 per cent more), all LPVs except for one object of uncertain variability type. We conclude that this methodology can indeed successfully identify candidate variable stars.
NASA Astrophysics Data System (ADS)
Lin, Jyh-Woei
2012-09-01
This paper uses Nonlinear Principal Component Analysis (NLPCA) and Principal Component Analysis (PCA) to determine Total Electron Content (TEC) anomalies in the ionosphere for the Nakri Typhoon on 29 May, 2008 (UTC). NLPCA, PCA and image processing are applied to the global ionospheric map (GIM) with transforms conducted for the time period 12:00-14:00 UT on 29 May 2008 when the wind was most intense. Results show that at a height of approximately 150-200 km the TEC anomaly using NLPCA is more localized; however its intensity increases with height and becomes more widespread. The TEC anomalies are not found by PCA. Potential causes of the results are discussed with emphasis given to vertical acoustic gravity waves. The approximate position of the typhoon's eye can be detected if the GIM is divided into fine enough maps with adequate spatial-resolution at GPS-TEC receivers. This implies that the trace of the typhoon in the regional GIM is caught using NLPCA.
Iorgulescu, E; Voicu, V A; Sârbu, C; Tache, F; Albu, F; Medvedovici, A
2016-08-01
The influence of the experimental variability (instrumental repeatability, instrumental intermediate precision and sample preparation variability) and data pre-processing (normalization, peak alignment, background subtraction) on the discrimination power of multivariate data analysis methods (Principal Component Analysis -PCA- and Cluster Analysis -CA-) as well as a new algorithm based on linear regression was studied. Data used in the study were obtained through positive or negative ion monitoring electrospray mass spectrometry (+/-ESI/MS) and reversed phase liquid chromatography/UV spectrometric detection (RPLC/UV) applied to green tea extracts. Extractions in ethanol and heated water infusion were used as sample preparation procedures. The multivariate methods were directly applied to mass spectra and chromatograms, involving strictly a holistic comparison of shapes, without assignment of any structural identity to compounds. An alternative data interpretation based on linear regression analysis mutually applied to data series is also discussed. Slopes, intercepts and correlation coefficients produced by the linear regression analysis applied on pairs of very large experimental data series successfully retain information resulting from high frequency instrumental acquisition rates, obviously better defining the profiles being compared. Consequently, each type of sample or comparison between samples produces in the Cartesian space an ellipsoidal volume defined by the normal variation intervals of the slope, intercept and correlation coefficient. Distances between volumes graphically illustrates (dis)similarities between compared data. The instrumental intermediate precision had the major effect on the discrimination power of the multivariate data analysis methods. Mass spectra produced through ionization from liquid state in atmospheric pressure conditions of bulk complex mixtures resulting from extracted materials of natural origins provided an excellent data basis for multivariate analysis methods, equivalent to data resulting from chromatographic separations. The alternative evaluation of very large data series based on linear regression analysis produced information equivalent to results obtained through application of PCA an CA. Copyright © 2016 Elsevier B.V. All rights reserved.
Classification of plum spirit drinks by synchronous fluorescence spectroscopy.
Sádecká, J; Jakubíková, M; Májek, P; Kleinová, A
2016-04-01
Synchronous fluorescence spectroscopy was used in combination with principal component analysis (PCA) and linear discriminant analysis (LDA) for the differentiation of plum spirits according to their geographical origin. A total of 14 Czech, 12 Hungarian and 18 Slovak plum spirit samples were used. The samples were divided in two categories: colorless (22 samples) and colored (22 samples). Synchronous fluorescence spectra (SFS) obtained at a wavelength difference of 60 nm provided the best results. Considering the PCA-LDA applied to the SFS of all samples, Czech, Hungarian and Slovak colorless samples were properly classified in both the calibration and prediction sets. 100% of correct classification was also obtained for Czech and Hungarian colored samples. However, one group of Slovak colored samples was classified as belonging to the Hungarian group in the calibration set. Thus, the total correct classifications obtained were 94% and 100% for the calibration and prediction steps, respectively. The results were compared with those obtained using near-infrared (NIR) spectroscopy. Applying PCA-LDA to NIR spectra (5500-6000 cm(-1)), the total correct classifications were 91% and 92% for the calibration and prediction steps, respectively, which were slightly lower than those obtained using SFS. Copyright © 2015 Elsevier Ltd. All rights reserved.
Decomposing the Apoptosis Pathway Into Biologically Interpretable Principal Components
Wang, Min; Kornblau, Steven M; Coombes, Kevin R
2018-01-01
Principal component analysis (PCA) is one of the most common techniques in the analysis of biological data sets, but applying PCA raises 2 challenges. First, one must determine the number of significant principal components (PCs). Second, because each PC is a linear combination of genes, it rarely has a biological interpretation. Existing methods to determine the number of PCs are either subjective or computationally extensive. We review several methods and describe a new R package, PCDimension, that implements additional methods, the most important being an algorithm that extends and automates a graphical Bayesian method. Using simulations, we compared the methods. Our newly automated procedure is competitive with the best methods when considering both accuracy and speed and is the most accurate when the number of objects is small compared with the number of attributes. We applied the method to a proteomics data set from patients with acute myeloid leukemia. Proteins in the apoptosis pathway could be explained using 6 PCs. By clustering the proteins in PC space, we were able to replace the PCs by 6 “biological components,” 3 of which could be immediately interpreted from the current literature. We expect this approach combining PCA with clustering to be widely applicable. PMID:29881252
Karasawa, N; Mitsutake, A; Takano, H
2017-12-01
Proteins implement their functionalities when folded into specific three-dimensional structures, and their functions are related to the protein structures and dynamics. Previously, we applied a relaxation mode analysis (RMA) method to protein systems; this method approximately estimates the slow relaxation modes and times via simulation and enables investigation of the dynamic properties underlying the protein structural fluctuations. Recently, two-step RMA with multiple evolution times has been proposed and applied to a slightly complex homopolymer system, i.e., a single [n]polycatenane. This method can be applied to more complex heteropolymer systems, i.e., protein systems, to estimate the relaxation modes and times more accurately. In two-step RMA, we first perform RMA and obtain rough estimates of the relaxation modes and times. Then, we apply RMA with multiple evolution times to a small number of the slowest relaxation modes obtained in the previous calculation. Herein, we apply this method to the results of principal component analysis (PCA). First, PCA is applied to a 2-μs molecular dynamics simulation of hen egg-white lysozyme in aqueous solution. Then, the two-step RMA method with multiple evolution times is applied to the obtained principal components. The slow relaxation modes and corresponding relaxation times for the principal components are much improved by the second RMA.
NASA Astrophysics Data System (ADS)
Karasawa, N.; Mitsutake, A.; Takano, H.
2017-12-01
Proteins implement their functionalities when folded into specific three-dimensional structures, and their functions are related to the protein structures and dynamics. Previously, we applied a relaxation mode analysis (RMA) method to protein systems; this method approximately estimates the slow relaxation modes and times via simulation and enables investigation of the dynamic properties underlying the protein structural fluctuations. Recently, two-step RMA with multiple evolution times has been proposed and applied to a slightly complex homopolymer system, i.e., a single [n ] polycatenane. This method can be applied to more complex heteropolymer systems, i.e., protein systems, to estimate the relaxation modes and times more accurately. In two-step RMA, we first perform RMA and obtain rough estimates of the relaxation modes and times. Then, we apply RMA with multiple evolution times to a small number of the slowest relaxation modes obtained in the previous calculation. Herein, we apply this method to the results of principal component analysis (PCA). First, PCA is applied to a 2-μ s molecular dynamics simulation of hen egg-white lysozyme in aqueous solution. Then, the two-step RMA method with multiple evolution times is applied to the obtained principal components. The slow relaxation modes and corresponding relaxation times for the principal components are much improved by the second RMA.
Application of EOF/PCA-based methods in the post-processing of GRACE derived water variations
NASA Astrophysics Data System (ADS)
Forootan, Ehsan; Kusche, Jürgen
2010-05-01
Two problems that users of monthly GRACE gravity field solutions face are 1) the presence of correlated noise in the Stokes coefficients that increases with harmonic degree and causes ‘striping', and 2) the fact that different physical signals are overlaid and difficult to separate from each other in the data. These problems are termed the signal-noise separation problem and the signal-signal separation problem. Methods that are based on principal component analysis and empirical orthogonal functions (PCA/EOF) have been frequently proposed to deal with these problems for GRACE. However, different strategies have been applied to different (spatial: global/regional, spectral: global/order-wise, geoid/equivalent water height) representations of the GRACE level 2 data products, leading to differing results and a general feeling that PCA/EOF-based methods are to be applied ‘with care'. In addition, it is known that conventional EOF/PCA methods force separated modes to be orthogonal, and that, on the other hand, to either EOFs or PCs an arbitrary orthogonal rotation can be applied. The aim of this paper is to provide a common theoretical framework and to study the application of PCA/EOF-based methods as a signal separation tool due to post-process GRACE data products. In order to investigate and illustrate the applicability of PCA/EOF-based methods, we have employed them on GRACE level 2 monthly solutions based on the Center for Space Research, University of Texas (CSR/UT) RL04 products and on the ITG-GRACE03 solutions from the University of Bonn, and on various representations of them. Our results show that EOF modes do reveal the dominating annual, semiannual and also long-periodic signals in the global water storage variations, but they also show how choosing different strategies changes the outcome and may lead to unexpected results.
Selection of solubility parameters for characterization of pharmaceutical excipients.
Adamska, Katarzyna; Voelkel, Adam; Héberger, Károly
2007-11-09
The solubility parameter (delta(2)), corrected solubility parameter (delta(T)) and its components (delta(d), delta(p), delta(h)) were determined for series of pharmaceutical excipients by using inverse gas chromatography (IGC). Principal component analysis (PCA) was applied for the selection of the solubility parameters which assure the complete characterization of examined materials. Application of PCA suggests that complete description of examined materials is achieved with four solubility parameters, i.e. delta(2) and Hansen solubility parameters (delta(d), delta(p), delta(h)). Selection of the excipients through PCA of their solubility parameters data can be used for prediction of their behavior in a multi-component system, e.g. for selection of the best materials to form stable pharmaceutical liquid mixtures or stable coating formulation.
Alizadeh Behbahani, Behrooz; Tabatabaei Yazdi, Farideh; Shahidi, Fakhri; Mortazavi, Seyed Ali; Mohebbi, Mohebbat
2017-04-01
Principle component analysis (PCA) was employed to examine the effect of the exerted treatments on the beef shelf life as well as discovering the correlations between the studied responses. Considering the variability of the dimensions of the responses, correlation coefficients were applied to form the matrix and extract the eigenvalue. Antimicrobial effect was evaluated on 10 pathogenic microorganisms through the methods of hole-plate diffusion method, disk diffusion method, pour plate method, minimum inhibitory concentration and minimum bactericidal/fungicidal concentration. Antioxidant potential and total phenolic content were examined through the method of 2,2-diphenyl-1-picrylhydrazyl (DPPH) and Folin-Ciocalteu method, respectively. The components were identified through gas chromatography and gas chromatography/mass spectrometry. Barhang seed mucilage (BSM) based edible coating containing 0, 0.5, 1 and 1.5% (w/w) Tarragon (T) essential oil mix were applied on beef slices to control the growth of pathogenic microorganisms. Microbiological (total viable count, psychrotrophic count, Escherichia coli, Staphylococcus aureus and fungi), chemical (thiobarbituric acid, peroxide value and pH) and sensory characteristics (odor, color and overall acceptability) analysis measurements were made during the storage periodically. PCA was employed to examine the effect of the exerted treatments on the beef shelf life as well as discovering the correlations between the studied responses. Considering the variability of the dimensions of the responses, correlation coefficients were applied to form the matrix and extract the eigenvalue. The PCA showed that the properties of the uncoated meat samples on the 9th, 12th, 15th and 18th days of storage are continuously changing independent of the exerted treatments on the other samples. This reveals the effect of the exerted treatments on the samples. Copyright © 2017 Elsevier Ltd. All rights reserved.
An improved PCA method with application to boiler leak detection.
Sun, Xi; Marquez, Horacio J; Chen, Tongwen; Riaz, Muhammad
2005-07-01
Principal component analysis (PCA) is a popular fault detection technique. It has been widely used in process industries, especially in the chemical industry. In industrial applications, achieving a sensitive system capable of detecting incipient faults, which maintains the false alarm rate to a minimum, is a crucial issue. Although a lot of research has been focused on these issues for PCA-based fault detection and diagnosis methods, sensitivity of the fault detection scheme versus false alarm rate continues to be an important issue. In this paper, an improved PCA method is proposed to address this problem. In this method, a new data preprocessing scheme and a new fault detection scheme designed for Hotelling's T2 as well as the squared prediction error are developed. A dynamic PCA model is also developed for boiler leak detection. This new method is applied to boiler water/steam leak detection with real data from Syncrude Canada's utility plant in Fort McMurray, Canada. Our results demonstrate that the proposed method can effectively reduce false alarm rate, provide effective and correct leak alarms, and give early warning to operators.
Gabor-based kernel PCA with fractional power polynomial models for face recognition.
Liu, Chengjun
2004-05-01
This paper presents a novel Gabor-based kernel Principal Component Analysis (PCA) method by integrating the Gabor wavelet representation of face images and the kernel PCA method for face recognition. Gabor wavelets first derive desirable facial features characterized by spatial frequency, spatial locality, and orientation selectivity to cope with the variations due to illumination and facial expression changes. The kernel PCA method is then extended to include fractional power polynomial models for enhanced face recognition performance. A fractional power polynomial, however, does not necessarily define a kernel function, as it might not define a positive semidefinite Gram matrix. Note that the sigmoid kernels, one of the three classes of widely used kernel functions (polynomial kernels, Gaussian kernels, and sigmoid kernels), do not actually define a positive semidefinite Gram matrix either. Nevertheless, the sigmoid kernels have been successfully used in practice, such as in building support vector machines. In order to derive real kernel PCA features, we apply only those kernel PCA eigenvectors that are associated with positive eigenvalues. The feasibility of the Gabor-based kernel PCA method with fractional power polynomial models has been successfully tested on both frontal and pose-angled face recognition, using two data sets from the FERET database and the CMU PIE database, respectively. The FERET data set contains 600 frontal face images of 200 subjects, while the PIE data set consists of 680 images across five poses (left and right profiles, left and right half profiles, and frontal view) with two different facial expressions (neutral and smiling) of 68 subjects. The effectiveness of the Gabor-based kernel PCA method with fractional power polynomial models is shown in terms of both absolute performance indices and comparative performance against the PCA method, the kernel PCA method with polynomial kernels, the kernel PCA method with fractional power polynomial models, the Gabor wavelet-based PCA method, and the Gabor wavelet-based kernel PCA method with polynomial kernels.
An application of principal component analysis to the clavicle and clavicle fixation devices.
Daruwalla, Zubin J; Courtis, Patrick; Fitzpatrick, Clare; Fitzpatrick, David; Mullett, Hannan
2010-03-26
Principal component analysis (PCA) enables the building of statistical shape models of bones and joints. This has been used in conjunction with computer assisted surgery in the past. However, PCA of the clavicle has not been performed. Using PCA, we present a novel method that examines the major modes of size and three-dimensional shape variation in male and female clavicles and suggests a method of grouping the clavicle into size and shape categories. Twenty-one high-resolution computerized tomography scans of the clavicle were reconstructed and analyzed using a specifically developed statistical software package. After performing statistical shape analysis, PCA was applied to study the factors that account for anatomical variation. The first principal component representing size accounted for 70.5 percent of anatomical variation. The addition of a further three principal components accounted for almost 87 percent. Using statistical shape analysis, clavicles in males have a greater lateral depth and are longer, wider and thicker than in females. However, the sternal angle in females is larger than in males. PCA confirmed these differences between genders but also noted that men exhibit greater variance and classified clavicles into five morphological groups. This unique approach is the first that standardizes a clavicular orientation. It provides information that is useful to both, the biomedical engineer and clinician. Other applications include implant design with regard to modifying current or designing future clavicle fixation devices. Our findings support the need for further development of clavicle fixation devices and the questioning of whether gender-specific devices are necessary.
Analyzing coastal environments by means of functional data analysis
NASA Astrophysics Data System (ADS)
Sierra, Carlos; Flor-Blanco, Germán; Ordoñez, Celestino; Flor, Germán; Gallego, José R.
2017-07-01
Here we used Functional Data Analysis (FDA) to examine particle-size distributions (PSDs) in a beach/shallow marine sedimentary environment in Gijón Bay (NW Spain). The work involved both Functional Principal Components Analysis (FPCA) and Functional Cluster Analysis (FCA). The grainsize of the sand samples was characterized by means of laser dispersion spectroscopy. Within this framework, FPCA was used as a dimension reduction technique to explore and uncover patterns in grain-size frequency curves. This procedure proved useful to describe variability in the structure of the data set. Moreover, an alternative approach, FCA, was applied to identify clusters and to interpret their spatial distribution. Results obtained with this latter technique were compared with those obtained by means of two vector approaches that combine PCA with CA (Cluster Analysis). The first method, the point density function (PDF), was employed after adapting a log-normal distribution to each PSD and resuming each of the density functions by its mean, sorting, skewness and kurtosis. The second applied a centered-log-ratio (clr) to the original data. PCA was then applied to the transformed data, and finally CA to the retained principal component scores. The study revealed functional data analysis, specifically FPCA and FCA, as a suitable alternative with considerable advantages over traditional vector analysis techniques in sedimentary geology studies.
Comparison of multi-subject ICA methods for analysis of fMRI data
Erhardt, Erik Barry; Rachakonda, Srinivas; Bedrick, Edward; Allen, Elena; Adali, Tülay; Calhoun, Vince D.
2010-01-01
Spatial independent component analysis (ICA) applied to functional magnetic resonance imaging (fMRI) data identifies functionally connected networks by estimating spatially independent patterns from their linearly mixed fMRI signals. Several multi-subject ICA approaches estimating subject-specific time courses (TCs) and spatial maps (SMs) have been developed, however there has not yet been a full comparison of the implications of their use. Here, we provide extensive comparisons of four multi-subject ICA approaches in combination with data reduction methods for simulated and fMRI task data. For multi-subject ICA, the data first undergo reduction at the subject and group levels using principal component analysis (PCA). Comparisons of subject-specific, spatial concatenation, and group data mean subject-level reduction strategies using PCA and probabilistic PCA (PPCA) show that computationally intensive PPCA is equivalent to PCA, and that subject-specific and group data mean subject-level PCA are preferred because of well-estimated TCs and SMs. Second, aggregate independent components are estimated using either noise free ICA or probabilistic ICA (PICA). Third, subject-specific SMs and TCs are estimated using back-reconstruction. We compare several direct group ICA (GICA) back-reconstruction approaches (GICA1-GICA3) and an indirect back-reconstruction approach, spatio-temporal regression (STR, or dual regression). Results show the earlier group ICA (GICA1) approximates STR, however STR has contradictory assumptions and may show mixed-component artifacts in estimated SMs. Our evidence-based recommendation is to use GICA3, introduced here, with subject-specific PCA and noise-free ICA, providing the most robust and accurate estimated SMs and TCs in addition to offering an intuitive interpretation. PMID:21162045
Kovács, Gábor; Somogyvári, Zsolt; Maka, Erika; Nagyjánosi, László
Peter Cerny Ambulance Service - Premature Eye Rescue Program (PCA-PERP) uses digital retinal imaging (DRI) with remote interpretation in bedside ROP screening, which has advantages over binocular indirect ophthalmoscopy (BIO) in screening of premature newborns. We aimed to demonstrate that PCA-PERP provides good value for the money and to model the cost ramifications of a similar newly launched system. As DRI was demonstrated to have high diagnostic performance, only the costs of bedside DRI-based screening were compared to those of traditional transport and BIO-based screening (cost-minimization analysis). The total costs of investment and maintenance were analyzed with micro-costing method. A ten-year analysis time-horizon and service provider's perspective were applied. From the launch of PCA-PERP up to the end of 2014, 3722 bedside examinations were performed in the PCA covered central region of Hungary. From 2009 to 2014, PCA-PERP saved 92,248km and 3633 staff working hours, with an annual nominal cost-savings ranging from 17,435 to 35,140 Euro. The net present value was 127,847 Euro at the end of 2014, with a payback period of 4.1years and an internal rate of return of 20.8%. Our model presented the NPVs of different scenarios with different initial investments, annual number of transports and average transport distances. PCA-PERP as bedside screening with remote interpretation, when compared to a transport-based screening with BIO, produced better cost-savings from the perspective of the service provider and provided a return on initial investment within five years after the project initiation. Copyright © 2017 Elsevier B.V. All rights reserved.
Willard, Melissa A Bodnar; McGuffin, Victoria L; Smith, Ruth Waddell
2012-01-01
Salvia divinorum is a hallucinogenic herb that is internationally regulated. In this study, salvinorin A, the active compound in S. divinorum, was extracted from S. divinorum plant leaves using a 5-min extraction with dichloromethane. Four additional Salvia species (Salvia officinalis, Salvia guaranitica, Salvia splendens, and Salvia nemorosa) were extracted using this procedure, and all extracts were analyzed by gas chromatography-mass spectrometry. Differentiation of S. divinorum from other Salvia species was successful based on visual assessment of the resulting chromatograms. To provide a more objective comparison, the total ion chromatograms (TICs) were subjected to principal components analysis (PCA). Prior to PCA, the TICs were subjected to a series of data pretreatment procedures to minimize non-chemical sources of variance in the data set. Successful discrimination of S. divinorum from the other four Salvia species was possible based on visual assessment of the PCA scores plot. To provide a numerical assessment of the discrimination, a series of statistical procedures such as Euclidean distance measurement, hierarchical cluster analysis, Student's t tests, Wilcoxon rank-sum tests, and Pearson product moment correlation were also applied to the PCA scores. The statistical procedures were then compared to determine the advantages and disadvantages for forensic applications.
Ciucci, Sara; Ge, Yan; Durán, Claudio; Palladini, Alessandra; Jiménez-Jiménez, Víctor; Martínez-Sánchez, Luisa María; Wang, Yuting; Sales, Susanne; Shevchenko, Andrej; Poser, Steven W.; Herbig, Maik; Otto, Oliver; Androutsellis-Theotokis, Andreas; Guck, Jochen; Gerl, Mathias J.; Cannistraci, Carlo Vittorio
2017-01-01
Omic science is rapidly growing and one of the most employed techniques to explore differential patterns in omic datasets is principal component analysis (PCA). However, a method to enlighten the network of omic features that mostly contribute to the sample separation obtained by PCA is missing. An alternative is to build correlation networks between univariately-selected significant omic features, but this neglects the multivariate unsupervised feature compression responsible for the PCA sample segregation. Biologists and medical researchers often prefer effective methods that offer an immediate interpretation to complicated algorithms that in principle promise an improvement but in practice are difficult to be applied and interpreted. Here we present PC-corr: a simple algorithm that associates to any PCA segregation a discriminative network of features. Such network can be inspected in search of functional modules useful in the definition of combinatorial and multiscale biomarkers from multifaceted omic data in systems and precision biomedicine. We offer proofs of PC-corr efficacy on lipidomic, metagenomic, developmental genomic, population genetic, cancer promoteromic and cancer stem-cell mechanomic data. Finally, PC-corr is a general functional network inference approach that can be easily adopted for big data exploration in computer science and analysis of complex systems in physics. PMID:28287094
Determination of butter adulteration with margarine using Raman spectroscopy.
Uysal, Reyhan Selin; Boyaci, Ismail Hakki; Genis, Hüseyin Efe; Tamer, Ugur
2013-12-15
In this study, adulteration of butter with margarine was analysed using Raman spectroscopy combined with chemometric methods (principal component analysis (PCA), principal component regression (PCR), partial least squares (PLS)) and artificial neural networks (ANNs). Different butter and margarine samples were mixed at various concentrations ranging from 0% to 100% w/w. PCA analysis was applied for the classification of butters, margarines and mixtures. PCR, PLS and ANN were used for the detection of adulteration ratios of butter. Models were created using a calibration data set and developed models were evaluated using a validation data set. The coefficient of determination (R(2)) values between actual and predicted values obtained for PCR, PLS and ANN for the validation data set were 0.968, 0.987 and 0.978, respectively. In conclusion, a combination of Raman spectroscopy with chemometrics and ANN methods can be applied for testing butter adulteration. Copyright © 2013 Elsevier Ltd. All rights reserved.
Zakaria, Ammar; Shakaff, Ali Yeon Md; Masnan, Maz Jamilah; Saad, Fathinul Syahir Ahmad; Adom, Abdul Hamid; Ahmad, Mohd Noor; Jaafar, Mahmad Nor; Abdullah, Abu Hassan; Kamarudin, Latifah Munirah
2012-01-01
In recent years, there have been a number of reported studies on the use of non-destructive techniques to evaluate and determine mango maturity and ripeness levels. However, most of these reported works were conducted using single-modality sensing systems, either using an electronic nose, acoustics or other non-destructive measurements. This paper presents the work on the classification of mangoes (Magnifera Indica cv. Harumanis) maturity and ripeness levels using fusion of the data of an electronic nose and an acoustic sensor. Three groups of samples each from two different harvesting times (week 7 and week 8) were evaluated by the e-nose and then followed by the acoustic sensor. Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) were able to discriminate the mango harvested at week 7 and week 8 based solely on the aroma and volatile gases released from the mangoes. However, when six different groups of different maturity and ripeness levels were combined in one classification analysis, both PCA and LDA were unable to discriminate the age difference of the Harumanis mangoes. Instead of six different groups, only four were observed using the LDA, while PCA showed only two distinct groups. By applying a low level data fusion technique on the e-nose and acoustic data, the classification for maturity and ripeness levels using LDA was improved. However, no significant improvement was observed using PCA with data fusion technique. Further work using a hybrid LDA-Competitive Learning Neural Network was performed to validate the fusion technique and classify the samples. It was found that the LDA-CLNN was also improved significantly when data fusion was applied. PMID:22778629
Zakaria, Ammar; Shakaff, Ali Yeon Md; Masnan, Maz Jamilah; Saad, Fathinul Syahir Ahmad; Adom, Abdul Hamid; Ahmad, Mohd Noor; Jaafar, Mahmad Nor; Abdullah, Abu Hassan; Kamarudin, Latifah Munirah
2012-01-01
In recent years, there have been a number of reported studies on the use of non-destructive techniques to evaluate and determine mango maturity and ripeness levels. However, most of these reported works were conducted using single-modality sensing systems, either using an electronic nose, acoustics or other non-destructive measurements. This paper presents the work on the classification of mangoes (Magnifera Indica cv. Harumanis) maturity and ripeness levels using fusion of the data of an electronic nose and an acoustic sensor. Three groups of samples each from two different harvesting times (week 7 and week 8) were evaluated by the e-nose and then followed by the acoustic sensor. Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) were able to discriminate the mango harvested at week 7 and week 8 based solely on the aroma and volatile gases released from the mangoes. However, when six different groups of different maturity and ripeness levels were combined in one classification analysis, both PCA and LDA were unable to discriminate the age difference of the Harumanis mangoes. Instead of six different groups, only four were observed using the LDA, while PCA showed only two distinct groups. By applying a low level data fusion technique on the e-nose and acoustic data, the classification for maturity and ripeness levels using LDA was improved. However, no significant improvement was observed using PCA with data fusion technique. Further work using a hybrid LDA-Competitive Learning Neural Network was performed to validate the fusion technique and classify the samples. It was found that the LDA-CLNN was also improved significantly when data fusion was applied.
Data on Support Vector Machines (SVM) model to forecast photovoltaic power.
Malvoni, M; De Giorgi, M G; Congedo, P M
2016-12-01
The data concern the photovoltaic (PV) power, forecasted by a hybrid model that considers weather variations and applies a technique to reduce the input data size, as presented in the paper entitled "Photovoltaic forecast based on hybrid pca-lssvm using dimensionality reducted data" (M. Malvoni, M.G. De Giorgi, P.M. Congedo, 2015) [1]. The quadratic Renyi entropy criteria together with the principal component analysis (PCA) are applied to the Least Squares Support Vector Machines (LS-SVM) to predict the PV power in the day-ahead time frame. The data here shared represent the proposed approach results. Hourly PV power predictions for 1,3,6,12, 24 ahead hours and for different data reduction sizes are provided in Supplementary material.
Coarse-to-fine markerless gait analysis based on PCA and Gauss-Laguerre decomposition
NASA Astrophysics Data System (ADS)
Goffredo, Michela; Schmid, Maurizio; Conforto, Silvia; Carli, Marco; Neri, Alessandro; D'Alessio, Tommaso
2005-04-01
Human movement analysis is generally performed through the utilization of marker-based systems, which allow reconstructing, with high levels of accuracy, the trajectories of markers allocated on specific points of the human body. Marker based systems, however, show some drawbacks that can be overcome by the use of video systems applying markerless techniques. In this paper, a specifically designed computer vision technique for the detection and tracking of relevant body points is presented. It is based on the Gauss-Laguerre Decomposition, and a Principal Component Analysis Technique (PCA) is used to circumscribe the region of interest. Results obtained on both synthetic and experimental tests provide significant reduction of the computational costs, with no significant reduction of the tracking accuracy.
Multiple fingerprinting analyses in quality control of Cassiae Semen polysaccharides.
Cheng, Jing; He, Siyu; Wan, Qiang; Jing, Pu
2018-03-01
Quality control issue overshadows potential health benefits of Cassiae Semen due to the analytic limitations. In this study, multiple-fingerprint analysis integrated with several chemometrics was performed to assess the polysaccharide quality of Cassiae Semen harvested from different locations. FT-IR, HPLC, and GC fingerprints of polysaccharide extracts from the authentic source were established as standard profiles, applying to assess the quality of foreign sources. Analyses of FT-IR fingerprints of polysaccharide extracts using either Pearson correlation analysis or principal component analysis (PCA), or HPLC fingerprints of partially hydrolyzed polysaccharides with PCA, distinguished the foreign sources from the authentic source. However, HPLC or GC fingerprints of completely hydrolyzed polysaccharides couldn't identify all foreign sources and the methodology using GC is quite limited in determining the monosaccharide composition. This indicates that FT-IR/HPLC fingerprints of non/partially-hydrolyzed polysaccharides, respectively, accompanied by multiple chemometrics methods, might be potentially applied in detecting and differentiating sources of Cassiae Semen. Copyright © 2018 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Pacholski, Michaeleen L.
2004-06-01
Principal component analysis (PCA) has been successfully applied to time-of-flight secondary ion mass spectrometry (TOF-SIMS) spectra, images and depth profiles. Although SIMS spectral data sets can be small (in comparison to datasets typically discussed in literature from other analytical techniques such as gas or liquid chromatography), each spectrum has thousands of ions resulting in what can be a difficult comparison of samples. Analysis of industrially-derived samples means the identity of most surface species are unknown a priori and samples must be analyzed rapidly to satisfy customer demands. PCA enables rapid assessment of spectral differences (or lack there of) between samples and identification of chemically different areas on sample surfaces for images. Depth profile analysis helps define interfaces and identify low-level components in the system.
Optimized principal component analysis on coronagraphic images of the fomalhaut system
DOE Office of Scientific and Technical Information (OSTI.GOV)
Meshkat, Tiffany; Kenworthy, Matthew A.; Quanz, Sascha P.
We present the results of a study to optimize the principal component analysis (PCA) algorithm for planet detection, a new algorithm complementing angular differential imaging and locally optimized combination of images (LOCI) for increasing the contrast achievable next to a bright star. The stellar point spread function (PSF) is constructed by removing linear combinations of principal components, allowing the flux from an extrasolar planet to shine through. The number of principal components used determines how well the stellar PSF is globally modeled. Using more principal components may decrease the number of speckles in the final image, but also increases themore » background noise. We apply PCA to Fomalhaut Very Large Telescope NaCo images acquired at 4.05 μm with an apodized phase plate. We do not detect any companions, with a model dependent upper mass limit of 13-18 M {sub Jup} from 4-10 AU. PCA achieves greater sensitivity than the LOCI algorithm for the Fomalhaut coronagraphic data by up to 1 mag. We make several adaptations to the PCA code and determine which of these prove the most effective at maximizing the signal-to-noise from a planet very close to its parent star. We demonstrate that optimizing the number of principal components used in PCA proves most effective for pulling out a planet signal.« less
Carcinogenic potential of hydrotreated petroleum aromatic extracts.
Doak, S M; Hend, R W; van der Wiel, A; Hunt, P F
1985-01-01
Five experimental petroleum extracts were produced from luboil distillates derived from Middle East paraffinic crude by solvent extraction and severe hydrotreatment. The polycyclic aromatic content (PCA) of the extracts was determined by dimethyl sulphoxide extraction and ranged from 3.7-9.2% w/w. The five extracts were evaluated for their potential to induce cutaneous and systemic neoplasia in female mice derived from Carworth Farm No 1 strain (CF1). The test substances were applied undiluted (0.2 ml per application) to the shorn dorsal skin twice weekly for up to 78 weeks, with 48 mice in each treatment group and 96 in the untreated control group; two further groups, each of 48 mice, were similarly treated either with a non-hydrotreated commercial aromatic extract (PCA content, 19.7% w/v) or with a low dose of benzo(a)pyrene (12.5 micrograms/ml acetone). The mice were housed individually in polypropylene cages in specified pathogen free conditions. The incidence of cutaneous and systemic tumours was determined from histological analysis of haematoxylin and eosin stained tissue sections. The results were correlated with the PCA content of the extracts and compared with those from female mice exposed to a non-hydrotreated commercial aromatic extract. Four of the hydrotreated extracts were carcinogenic for murine skin; the two products with the lower PCA contents were less carcinogenic than the products with the higher PCA contents and all were less carcinogenic than the commercial extract. One extract with the lowest PCA content was non-carcinogenic. Thus refining by severe hydrotreatment was an effective method of reducing the carcinogenic potential of petroleum aromatic extracts. Although other physicochemical properties may influence the biological activity of oil products, the PCA content determined by dimethyl sulphoxide extraction may be a useful indicator of the potential of oil products to induce cutaneous tumours in experimental animals. There was no evidence that the commercial or hydrotreated extracts increased the incidence of systemic neoplasms when applied twice weekly to the dorsal skin. PMID:4005190
NASA Astrophysics Data System (ADS)
Chen, Long; Wang, Yue; Liu, Nenrong; Lin, Duo; Weng, Cuncheng; Zhang, Jixue; Zhu, Lihuan; Chen, Weisheng; Chen, Rong; Feng, Shangyuan
2013-06-01
The diagnostic capability of using tissue intrinsic micro-Raman signals to obtain biochemical information from human esophageal tissue is presented in this paper. Near-infrared micro-Raman spectroscopy combined with multivariate analysis was applied for discrimination of esophageal cancer tissue from normal tissue samples. Micro-Raman spectroscopy measurements were performed on 54 esophageal cancer tissues and 55 normal tissues in the 400-1750 cm-1 range. The mean Raman spectra showed significant differences between the two groups. Tentative assignments of the Raman bands in the measured tissue spectra suggested some changes in protein structure, a decrease in the relative amount of lactose, and increases in the percentages of tryptophan, collagen and phenylalanine content in esophageal cancer tissue as compared to those of a normal subject. The diagnostic algorithms based on principal component analysis (PCA) and linear discriminate analysis (LDA) achieved a diagnostic sensitivity of 87.0% and specificity of 70.9% for separating cancer from normal esophageal tissue samples. The result demonstrated that near-infrared micro-Raman spectroscopy combined with PCA-LDA analysis could be an effective and sensitive tool for identification of esophageal cancer.
Measuring the Indonesian provinces competitiveness by using PCA technique
NASA Astrophysics Data System (ADS)
Runita, Ditha; Fajriyah, Rohmatul
2017-12-01
Indonesia is a country which has vast teritoty. It has 34 provinces. Building local competitiveness is critical to enhance the long-term national competitiveness especially for a country as diverse as Indonesia. A competitive local government can attract and maintain successful firms and increase living standards for its inhabitants, because investment and skilled workers gravitate from uncompetitive regions to more competitive ones. Altough there are other methods to measuring competitiveness, but here we have demonstrated a simple method using principal component analysis (PCA). It can directly be applied to correlated, multivariate data. The analysis on Indonesian provinces provides 3 clusters based on the competitiveness measurement and the clusters are Bad, Good and Best perform provinces.
NASA Astrophysics Data System (ADS)
LIN, JYH-WOEI
2012-08-01
Principal Component Analysis (PCA) and image processing are used to determine Total Electron Content (TEC) anomalies in the F-layer of the ionosphere relating to Typhoon Nakri for 29 May, 2008 (UTC). PCA and image processing are applied to the global ionospheric map (GIM) with transforms conducted for the time period 12:00-14:00 UT on 29 May, 2008 when the wind was most intense. Results show that at a height of approximately 150-200 km the TEC anomaly is highly localized; however, it becomes more intense and widespread with height. Potential causes of these results are discussed with emphasis given to acoustic gravity waves caused by wind force.
Recent changes of rice heat stress in Jiangxi province, southeast China.
Huang, Jin; Zhang, Fangmin; Xue, Yan; Lin, Jie
2017-04-01
Around the intensity, frequency, duration, accumulated temperature, and even extremes of high-temperature events, nine selected temperature-related indices were used to explore the space and time changes of rice heat stress in Jiangxi province, southeast China. Several statistical methods including Mann-Kendall trend test (M-K test) and principal component analysis (PCA) were used in this study, and main results were listed as follows: (1) The changes in the intensity indices for high-temperature events were more significant, it was mainly embodied in that more than 80 % of stations had positive trends. (2) R-mode PCA was applied to the multiannual average values of nine selected indices of whole stations, and the results showed that the higher hazard for rice heat stress could be mainly detected in the middle and northeast area of Jiangxi. (3) S-mode PCA was applied to the integrated heat stress index series, and the results demonstrated that Jiangxi could be divided into four sub-regions with different variability in rice heat stress. However, all the sub-regions are dominated by increasing tendencies in rice heat stress since 1990. (4) Further analysis indicated that the western north Pacific sub-tropical high (WPSH) had the significant dominant influence on the rice heat stress in Jiangxi province.
NASA Astrophysics Data System (ADS)
Dai, Xiaoqian; Tian, Jie; Chen, Zhe
2010-03-01
Parametric images can represent both spatial distribution and quantification of the biological and physiological parameters of tracer kinetics. The linear least square (LLS) method is a well-estimated linear regression method for generating parametric images by fitting compartment models with good computational efficiency. However, bias exists in LLS-based parameter estimates, owing to the noise present in tissue time activity curves (TTACs) that propagates as correlated error in the LLS linearized equations. To address this problem, a volume-wise principal component analysis (PCA) based method is proposed. In this method, firstly dynamic PET data are properly pre-transformed to standardize noise variance as PCA is a data driven technique and can not itself separate signals from noise. Secondly, the volume-wise PCA is applied on PET data. The signals can be mostly represented by the first few principle components (PC) and the noise is left in the subsequent PCs. Then the noise-reduced data are obtained using the first few PCs by applying 'inverse PCA'. It should also be transformed back according to the pre-transformation method used in the first step to maintain the scale of the original data set. Finally, the obtained new data set is used to generate parametric images using the linear least squares (LLS) estimation method. Compared with other noise-removal method, the proposed method can achieve high statistical reliability in the generated parametric images. The effectiveness of the method is demonstrated both with computer simulation and with clinical dynamic FDG PET study.
Zhou, Fei; Zhao, Yajing; Peng, Jiyu; Jiang, Yirong; Li, Maiquan; Jiang, Yuan; Lu, Baiyi
2017-07-01
Osmanthus fragrans flowers are used as folk medicine and additives for teas, beverages and foods. The metabolites of O. fragrans flowers from different geographical origins were inconsistent in some extent. Chromatography and mass spectrometry combined with multivariable analysis methods provides an approach for discriminating the origin of O. fragrans flowers. To discriminate the Osmanthus fragrans var. thunbergii flowers from different origins with the identified metabolites. GC-MS and UPLC-PDA were conducted to analyse the metabolites in O. fragrans var. thunbergii flowers (in total 150 samples). Principal component analysis (PCA), soft independent modelling of class analogy analysis (SIMCA) and random forest (RF) analysis were applied to group the GC-MS and UPLC-PDA data. GC-MS identified 32 compounds common to all samples while UPLC-PDA/QTOF-MS identified 16 common compounds. PCA of the UPLC-PDA data generated a better clustering than PCA of the GC-MS data. Ten metabolites (six from GC-MS and four from UPLC-PDA) were selected as effective compounds for discrimination by PCA loadings. SIMCA and RF analysis were used to build classification models, and the RF model, based on the four effective compounds (caffeic acid derivative, acteoside, ligustroside and compound 15), yielded better results with the classification rate of 100% in the calibration set and 97.8% in the prediction set. GC-MS and UPLC-PDA combined with multivariable analysis methods can discriminate the origin of Osmanthus fragrans var. thunbergii flowers. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.
A Dimensionally Reduced Clustering Methodology for Heterogeneous Occupational Medicine Data Mining.
Saâdaoui, Foued; Bertrand, Pierre R; Boudet, Gil; Rouffiac, Karine; Dutheil, Frédéric; Chamoux, Alain
2015-10-01
Clustering is a set of techniques of the statistical learning aimed at finding structures of heterogeneous partitions grouping homogenous data called clusters. There are several fields in which clustering was successfully applied, such as medicine, biology, finance, economics, etc. In this paper, we introduce the notion of clustering in multifactorial data analysis problems. A case study is conducted for an occupational medicine problem with the purpose of analyzing patterns in a population of 813 individuals. To reduce the data set dimensionality, we base our approach on the Principal Component Analysis (PCA), which is the statistical tool most commonly used in factorial analysis. However, the problems in nature, especially in medicine, are often based on heterogeneous-type qualitative-quantitative measurements, whereas PCA only processes quantitative ones. Besides, qualitative data are originally unobservable quantitative responses that are usually binary-coded. Hence, we propose a new set of strategies allowing to simultaneously handle quantitative and qualitative data. The principle of this approach is to perform a projection of the qualitative variables on the subspaces spanned by quantitative ones. Subsequently, an optimal model is allocated to the resulting PCA-regressed subspaces.
Mao, Zhi-Hua; Yin, Jian-Hua; Zhang, Xue-Xi; Wang, Xiao; Xia, Yang
2016-01-01
Fourier transform infrared spectroscopic imaging (FTIRI) technique can be used to obtain the quantitative information of content and spatial distribution of principal components in cartilage by combining with chemometrics methods. In this study, FTIRI combining with principal component analysis (PCA) and Fisher’s discriminant analysis (FDA) was applied to identify the healthy and osteoarthritic (OA) articular cartilage samples. Ten 10-μm thick sections of canine cartilages were imaged at 6.25μm/pixel in FTIRI. The infrared spectra extracted from the FTIR images were imported into SPSS software for PCA and FDA. Based on the PCA result of 2 principal components, the healthy and OA cartilage samples were effectively discriminated by the FDA with high accuracy of 94% for the initial samples (training set) and cross validation, as well as 86.67% for the prediction group. The study showed that cartilage degeneration became gradually weak with the increase of the depth. FTIRI combined with chemometrics may become an effective method for distinguishing healthy and OA cartilages in future. PMID:26977354
Chavez, P.S.; Kwarteng, A.Y.
1989-01-01
A challenge encountered with Landsat Thematic Mapper (TM) data, which includes data from size reflective spectral bands, is displaying as much information as possible in a three-image set for color compositing or digital analysis. Principal component analysis (PCA) applied to the six TM bands simultaneously is often used to address this problem. However, two problems that can be encountered using the PCA method are that information of interest might be mathematically mapped to one of the unused components and that a color composite can be difficult to interpret. "Selective' PCA can be used to minimize both of these problems. The spectral contrast among several spectral regions was mapped for a northern Arizona site using Landsat TM data. Field investigations determined that most of the spectral contrast seen in this area was due to one of the following: the amount of iron and hematite in the soils and rocks, vegetation differences, standing and running water, or the presence of gypsum, which has a higher moisture retention capability than do the surrounding soils and rocks. -from Authors
Nurjuliana, M; Che Man, Y B; Mat Hashim, D; Mohamed, A K S
2011-08-01
The volatile compounds of pork, other meats and meat products were studied using an electronic nose and gas chromatography mass spectrometer with headspace analyzer (GCMS-HS) for halal verification. The zNose™ was successfully employed for identification and differentiation of pork and pork sausages from beef, mutton and chicken meats and sausages which were achieved using a visual odor pattern called VaporPrint™, derived from the frequency of the surface acoustic wave (SAW) detector of the electronic nose. GCMS-HS was employed to separate and analyze the headspace gasses from samples into peaks corresponding to individual compounds for the purpose of identification. Principal component analysis (PCA) was applied for data interpretation. Analysis by PCA was able to cluster and discriminate pork from other types of meats and sausages. It was shown that PCA could provide a good separation of the samples with 67% of the total variance accounted by PC1. Copyright © 2011 Elsevier Ltd. All rights reserved.
Principal Component Analysis: Resources for an Essential Application of Linear Algebra
ERIC Educational Resources Information Center
Pankavich, Stephen; Swanson, Rebecca
2015-01-01
Principal Component Analysis (PCA) is a highly useful topic within an introductory Linear Algebra course, especially since it can be used to incorporate a number of applied projects. This method represents an essential application and extension of the Spectral Theorem and is commonly used within a variety of fields, including statistics,…
Bertani, Francesca R; Mozetic, Pamela; Fioramonti, Marco; Iuliani, Michele; Ribelli, Giulia; Pantano, Francesco; Santini, Daniele; Tonini, Giuseppe; Trombetta, Marcella; Businaro, Luca; Selci, Stefano; Rainer, Alberto
2017-08-21
The possibility of detecting and classifying living cells in a label-free and non-invasive manner holds significant theranostic potential. In this work, Hyperspectral Imaging (HSI) has been successfully applied to the analysis of macrophagic polarization, given its central role in several pathological settings, including the regulation of tumour microenvironment. Human monocyte derived macrophages have been investigated using hyperspectral reflectance confocal microscopy, and hyperspectral datasets have been analysed in terms of M1 vs. M2 polarization by Principal Components Analysis (PCA). Following PCA, Linear Discriminant Analysis has been implemented for semi-automatic classification of macrophagic polarization from HSI data. Our results confirm the possibility to perform single-cell-level in vitro classification of M1 vs. M2 macrophages in a non-invasive and label-free manner with a high accuracy (above 98% for cells deriving from the same donor), supporting the idea of applying the technique to the study of complex interacting cellular systems, such in the case of tumour-immunity in vitro models.
NASA Astrophysics Data System (ADS)
Di Anibal, Carolina V.; Marsal, Lluís F.; Callao, M. Pilar; Ruisánchez, Itziar
2012-02-01
Raman spectroscopy combined with multivariate analysis was evaluated as a tool for detecting Sudan I dye in culinary spices. Three Raman modalities were studied: normal Raman, FT-Raman and SERS. The results show that SERS is the most appropriate modality capable of providing a proper Raman signal when a complex matrix is analyzed. To get rid of the spectral noise and background, Savitzky-Golay smoothing with polynomial baseline correction and wavelet transform were applied. Finally, to check whether unadulterated samples can be differentiated from samples adulterated with Sudan I dye, an exploratory analysis such as principal component analysis (PCA) was applied to raw data and data processed with the two mentioned strategies. The results obtained by PCA show that Raman spectra need to be properly treated if useful information is to be obtained and both spectra treatments are appropriate for processing the Raman signal. The proposed methodology shows that SERS combined with appropriate spectra treatment can be used as a practical screening tool to distinguish samples suspicious to be adulterated with Sudan I dye.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chetvertkov, M; Henry Ford Health System, Detroit, MI; Siddiqui, F
2016-06-15
Purpose: To use daily cone beam CTs (CBCTs) to develop regularized principal component analysis (PCA) models of anatomical changes in head and neck (H&N) patients, to guide replanning decisions in adaptive radiation therapy (ART). Methods: Known deformations were applied to planning CT (pCT) images of 10 H&N patients to model several different systematic anatomical changes. A Pinnacle plugin was used to interpolate systematic changes over 35 fractions, generating a set of 35 synthetic CTs for each patient. Deformation vector fields (DVFs) were acquired between the pCT and synthetic CTs and random fraction-to-fraction changes were superimposed on the DVFs. Standard non-regularizedmore » and regularized patient-specific PCA models were built using the DVFs. The ability of PCA to extract the known deformations was quantified. PCA models were also generated from clinical CBCTs, for which the deformations and DVFs were not known. It was hypothesized that resulting eigenvectors/eigenfunctions with largest eigenvalues represent the major anatomical deformations during the course of treatment. Results: As demonstrated with quantitative results in the supporting document regularized PCA is more successful than standard PCA at capturing systematic changes early in the treatment. Regularized PCA is able to detect smaller systematic changes against the background of random fraction-to-fraction changes. To be successful at guiding ART, regularized PCA should be coupled with models of when anatomical changes occur: early, late or throughout the treatment course. Conclusion: The leading eigenvector/eigenfunction from the both PCA approaches can tentatively be identified as a major systematic change during radiotherapy course when systematic changes are large enough with respect to random fraction-to-fraction changes. In all cases the regularized PCA approach appears to be more reliable at capturing systematic changes, enabling dosimetric consequences to be projected once trends are established early in the treatment course. This work is supported in part by a grant from Varian Medical Systems, Palo Alto, CA.« less
Chen, Cheng; Chen, Ye; Hu, Lin-Kun; Jiang, Chang-Chuan; Xu, Ren-Fang; He, Xiao-Zhou
2018-02-27
We evaluated the prognosis of the new grade groups and American Joint Committee on Cancer (AJCC) stage groups in men with prostate cancer (PCa) who were treated conservatively. A total of 13 798 eligible men were chosen from the Surveillance Epidemiology and End Results database. The new grade and AJCC stage groups were investigated on prostate biopsy specimens. Kaplan-Meier survival analysis and multivariable hazards models were applied to estimate the association of new grade and stage groups with overall survival (OS) and PCa-specific survival (CSS). Mean follow-up was 42.65 months (95% confidence interval: 42.47-42.84) in the entire cohort. The 3-year OS and CSS rates stepped down for grade groups 1-5 and AJCC stage groups I-IVB, respectively. After adjusting for clinical and pathological characteristics, all grade groups and AJCC stage groups were associated with higher all-cause and PCa-specific mortality compared to the reference group (all P ≤ 0.003). In conclusion, we evaluated the oncological outcome of the new grade and AJCC stage groups on biopsy specimens of conservatively treated PCa. These two novel clinically relevant classifications can assist physicians to determine different therapeutic strategies for PCa patients.
Xu, Huan; Fu, Shi; Chen, Qi; Gu, Meng; Zhou, Juan; Liu, Chong; Chen, Yanbo; Wang, Zhong
2017-05-09
To measure the level of oxytocin in serum and prostate cancer (PCa) tissue and study its effect on the proliferation of PCa cells. Oxytocin level in serum was significantly increased in PCa patients compared with the no-carcinoma individuals. Additionally, the levels of oxytocin and its receptor were also elevated in the PCa tissue. However, no significant difference existed among the PCa of various Gleason grades. Western blot analysis confirmed the previous results and revealed an increased expression level of APPL1. The level of oxytocin in serum was measured by ELISA analysis. The expression of oxytocin and its receptor in prostate was analyzed by immunohistochemistry. The proliferation and apoptosis of PCa cells were assessed by the Cell Counting Kit 8 (CCK8) assay, cell cycle analysis and caspase3 activity analysis, respectively. Western blot analysis was used for the detection of PCNA, Caspase3 and APPL1 protein levels. Serum and prostatic oxytocin levels are increased in the PCa subjects. Serum oxytocin level may be a biomarker for PCa in the future. Oxytocin increases PCa growth and APPL1 expression.
NASA Astrophysics Data System (ADS)
Chatterjee, Shiladitya; Singh, Bhupinder; Diwan, Anubhav; Lee, Zheng Rong; Engelhard, Mark H.; Terry, Jeff; Tolley, H. Dennis; Gallagher, Neal B.; Linford, Matthew R.
2018-03-01
X-ray photoelectron spectroscopy (XPS) and time-of-flight secondary ion mass spectrometry (ToF-SIMS) are much used analytical techniques that provide information about the outermost atomic and molecular layers of materials. In this work, we discuss the application of multivariate spectral techniques, including principal component analysis (PCA) and multivariate curve resolution (MCR), to the analysis of XPS and ToF-SIMS depth profiles. Multivariate analyses often provide insight into data sets that is not easily obtained in a univariate fashion. Pattern recognition entropy (PRE), which has its roots in Shannon's information theory, is also introduced. This approach is not the same as the mutual information/entropy approaches sometimes used in data processing. A discussion of the theory of each technique is presented. PCA, MCR, and PRE are applied to four different data sets obtained from: a ToF-SIMS depth profile through ca. 100 nm of plasma polymerized C3F6 on Si, a ToF-SIMS depth profile through ca. 100 nm of plasma polymerized PNIPAM (poly (N-isopropylacrylamide)) on Si, an XPS depth profile through a film of SiO2 on Si, and an XPS depth profile through a film of Ta2O5 on Ta. PCA, MCR, and PRE reveal the presence of interfaces in the films, and often indicate that the first few scans in the depth profiles are different from those that follow. PRE and backward difference PRE provide this information in a straightforward fashion. Rises in the PRE signals at interfaces suggest greater complexity to the corresponding spectra. Results from PCA, especially for the higher principal components, were sometimes difficult to understand. MCR analyses were generally more interpretable.
Probability distributions of the electroencephalogram envelope of preterm infants.
Saji, Ryoya; Hirasawa, Kyoko; Ito, Masako; Kusuda, Satoshi; Konishi, Yukuo; Taga, Gentaro
2015-06-01
To determine the stationary characteristics of electroencephalogram (EEG) envelopes for prematurely born (preterm) infants and investigate the intrinsic characteristics of early brain development in preterm infants. Twenty neurologically normal sets of EEGs recorded in infants with a post-conceptional age (PCA) range of 26-44 weeks (mean 37.5 ± 5.0 weeks) were analyzed. Hilbert transform was applied to extract the envelope. We determined the suitable probability distribution of the envelope and performed a statistical analysis. It was found that (i) the probability distributions for preterm EEG envelopes were best fitted by lognormal distributions at 38 weeks PCA or less, and by gamma distributions at 44 weeks PCA; (ii) the scale parameter of the lognormal distribution had positive correlations with PCA as well as a strong negative correlation with the percentage of low-voltage activity; (iii) the shape parameter of the lognormal distribution had significant positive correlations with PCA; (iv) the statistics of mode showed significant linear relationships with PCA, and, therefore, it was considered a useful index in PCA prediction. These statistics, including the scale parameter of the lognormal distribution and the skewness and mode derived from a suitable probability distribution, may be good indexes for estimating stationary nature in developing brain activity in preterm infants. The stationary characteristics, such as discontinuity, asymmetry, and unimodality, of preterm EEGs are well indicated by the statistics estimated from the probability distribution of the preterm EEG envelopes. Copyright © 2014 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.
Water quality analysis of the Rapur area, Andhra Pradesh, South India using multivariate techniques
NASA Astrophysics Data System (ADS)
Nagaraju, A.; Sreedhar, Y.; Thejaswi, A.; Sayadi, Mohammad Hossein
2017-10-01
The groundwater samples from Rapur area were collected from different sites to evaluate the major ion chemistry. The large number of data can lead to difficulties in the integration, interpretation, and representation of the results. Two multivariate statistical methods, hierarchical cluster analysis (HCA) and factor analysis (FA), were applied to evaluate their usefulness to classify and identify geochemical processes controlling groundwater geochemistry. Four statistically significant clusters were obtained from 30 sampling stations. This has resulted two important clusters viz., cluster 1 (pH, Si, CO3, Mg, SO4, Ca, K, HCO3, alkalinity, Na, Na + K, Cl, and hardness) and cluster 2 (EC and TDS) which are released to the study area from different sources. The application of different multivariate statistical techniques, such as principal component analysis (PCA), assists in the interpretation of complex data matrices for a better understanding of water quality of a study area. From PCA, it is clear that the first factor (factor 1), accounted for 36.2% of the total variance, was high positive loading in EC, Mg, Cl, TDS, and hardness. Based on the PCA scores, four significant cluster groups of sampling locations were detected on the basis of similarity of their water quality.
A modified receptor model for source apportionment of heavy metal pollution in soil.
Huang, Ying; Deng, Meihua; Wu, Shaofu; Japenga, Jan; Li, Tingqiang; Yang, Xiaoe; He, Zhenli
2018-07-15
Source apportionment is a crucial step toward reduction of heavy metal pollution in soil. Existing methods are generally based on receptor models. However, overestimation or underestimation occurs when they are applied to heavy metal source apportionment in soil. Therefore, a modified model (PCA-MLRD) was developed, which is based on principal component analysis (PCA) and multiple linear regression with distance (MLRD). This model was applied to a case study conducted in a peri-urban area in southeast China where soils were contaminated by arsenic (As), cadmium (Cd), mercury (Hg) and lead (Pb). Compared with existing models, PCA-MLRD is able to identify specific sources and quantify the extent of influence for each emission. The zinc (Zn)-Pb mine was identified as the most important anthropogenic emission, which affected approximately half area for Pb and As accumulation, and approximately one third for Cd. Overall, the influence extent of the anthropogenic emissions decreased in the order of mine (3 km) > dyeing mill (2 km) ≈ industrial hub (2 km) > fluorescent factory (1.5 km) > road (0.5 km). Although algorithm still needs to improved, the PCA-MLRD model has the potential to become a useful tool for heavy metal source apportionment in soil. Copyright © 2018 Elsevier B.V. All rights reserved.
Tahir, Haroon Elrasheid; Xiaobo, Zou; Xiaowei, Huang; Jiyong, Shi; Mariod, Abdalbasit Adam
2016-09-01
Aroma profiles of six honey varieties of different botanical origins were investigated using colorimetric sensor array, gas chromatography-mass spectrometry (GC-MS) and descriptive sensory analysis. Fifty-eight aroma compounds were identified, including 2 norisoprenoids, 5 hydrocarbons, 4 terpenes, 6 phenols, 7 ketones, 9 acids, 12 aldehydes and 13 alcohols. Twenty abundant or active compounds were chosen as key compounds to characterize honey aroma. Discrimination of the honeys was subsequently implemented using multivariate analysis, including hierarchical clustering analysis (HCA) and principal component analysis (PCA). Honeys of the same botanical origin were grouped together in the PCA score plot and HCA dendrogram. SPME-GC/MS and colorimetric sensor array were able to discriminate the honeys effectively with the advantages of being rapid, simple and low-cost. Moreover, partial least squares regression (PLSR) was applied to indicate the relationship between sensory descriptors and aroma compounds. Copyright © 2016 Elsevier Ltd. All rights reserved.
Study of force loss due to friction comparing two ceramic brackets during sliding tooth movement.
AlSubaie, Mai; Talic, Nabeel; Khawatmi, Said; Alobeid, Ahmad; Bourauel, Christoph; El-Bialy, Tarek
2016-09-01
To compare the percentage of force loss generated during canine sliding movements in newly introduced ceramic brackets with metal brackets. Two types of ceramic brackets, namely polycrystalline alumina (PCA) ceramic brackets (Clarity Advanced) and monocrystalline alumina (MCA) ceramic brackets (Inspire Ice) were compared with stainless steel (SS) brackets (Victory Series). All bracket groups (n = 5 each) were for the maxillary canines and had a 0.018-inch slot size. The brackets were mounted on an Orthodontic Measurement and Simulation System (OMSS) to simulate the canine retraction movement into the first premolar extraction space. Using elastic ligatures, 0.016 × 0.022″ (0.40 × 0.56 mm) stainless steel archwires were ligated onto the brackets. Retraction force was applied via a nickel-titanium coil spring with a nearly constant force of approximately 1 N. The OMSS measured the percentage of force loss over the retraction path by referring to the difference between the applied retraction force and actual force acting on each bracket. Between group comparisons were done with one-way analysis of variance. The metal brackets revealed the lowest percentage of force loss due to friction, followed by the PCA and MCA ceramic bracket groups (67 ± 4, 68 ± 7, and 76 ± 3 %, respectively). There was no significant difference between SS and PCA brackets (p = 0.97), but we did observe significant differences between metal and MCA brackets (p = 0.03) and between PCA and MCA ceramic brackets (p = 0.04). PCA ceramic brackets, whose slot surface is covered with an yttria-stabilized zirconia-based coating exhibited frictional properties similar to those of metal brackets. Frictional resistance resulted in an over 60 % loss of the applied force due to the use of elastic ligatures.
NASA Astrophysics Data System (ADS)
Unglert, K.; Radić, V.; Jellinek, A. M.
2016-06-01
Variations in the spectral content of volcano seismicity related to changes in volcanic activity are commonly identified manually in spectrograms. However, long time series of monitoring data at volcano observatories require tools to facilitate automated and rapid processing. Techniques such as self-organizing maps (SOM) and principal component analysis (PCA) can help to quickly and automatically identify important patterns related to impending eruptions. For the first time, we evaluate the performance of SOM and PCA on synthetic volcano seismic spectra constructed from observations during two well-studied eruptions at Klauea Volcano, Hawai'i, that include features observed in many volcanic settings. In particular, our objective is to test which of the techniques can best retrieve a set of three spectral patterns that we used to compose a synthetic spectrogram. We find that, without a priori knowledge of the given set of patterns, neither SOM nor PCA can directly recover the spectra. We thus test hierarchical clustering, a commonly used method, to investigate whether clustering in the space of the principal components and on the SOM, respectively, can retrieve the known patterns. Our clustering method applied to the SOM fails to detect the correct number and shape of the known input spectra. In contrast, clustering of the data reconstructed by the first three PCA modes reproduces these patterns and their occurrence in time more consistently. This result suggests that PCA in combination with hierarchical clustering is a powerful practical tool for automated identification of characteristic patterns in volcano seismic spectra. Our results indicate that, in contrast to PCA, common clustering algorithms may not be ideal to group patterns on the SOM and that it is crucial to evaluate the performance of these tools on a control dataset prior to their application to real data.
Soneson, Charlotte; Lilljebjörn, Henrik; Fioretos, Thoas; Fontes, Magnus
2010-04-15
With the rapid development of new genetic measurement methods, several types of genetic alterations can be quantified in a high-throughput manner. While the initial focus has been on investigating each data set separately, there is an increasing interest in studying the correlation structure between two or more data sets. Multivariate methods based on Canonical Correlation Analysis (CCA) have been proposed for integrating paired genetic data sets. The high dimensionality of microarray data imposes computational difficulties, which have been addressed for instance by studying the covariance structure of the data, or by reducing the number of variables prior to applying the CCA. In this work, we propose a new method for analyzing high-dimensional paired genetic data sets, which mainly emphasizes the correlation structure and still permits efficient application to very large data sets. The method is implemented by translating a regularized CCA to its dual form, where the computational complexity depends mainly on the number of samples instead of the number of variables. The optimal regularization parameters are chosen by cross-validation. We apply the regularized dual CCA, as well as a classical CCA preceded by a dimension-reducing Principal Components Analysis (PCA), to a paired data set of gene expression changes and copy number alterations in leukemia. Using the correlation-maximizing methods, regularized dual CCA and PCA+CCA, we show that without pre-selection of known disease-relevant genes, and without using information about clinical class membership, an exploratory analysis singles out two patient groups, corresponding to well-known leukemia subtypes. Furthermore, the variables showing the highest relevance to the extracted features agree with previous biological knowledge concerning copy number alterations and gene expression changes in these subtypes. Finally, the correlation-maximizing methods are shown to yield results which are more biologically interpretable than those resulting from a covariance-maximizing method, and provide different insight compared to when each variable set is studied separately using PCA. We conclude that regularized dual CCA as well as PCA+CCA are useful methods for exploratory analysis of paired genetic data sets, and can be efficiently implemented also when the number of variables is very large.
Tianniam, Sukanda; Tarachiwin, Lucksanaporn; Bamba, Takeshi; Kobayashi, Akio; Fukusaki, Eiichiro
2008-06-01
Gas chromatography time-of-flight mass spectrometry was applied to elucidate the profiling of primary metabolites and to evaluate the differences between quality differences in Angelica acutiloba (or Yamato-toki) roots through the utilization of multivariate pattern recognition-principal component analysis (PCA). Twenty-two metabolites consisting of sugars, amino and organic acids were identified. PCA analysis successfully discriminated the good, the moderate and the bad quality Yamato-toki roots in accordance to their cultivation areas. The results signified two reducing sugars, fructose and glucose being the most accumulated in the bad quality, whereas higher quantity of phosphoric acid, proline, malic acid and citric acid were found in the good and the moderate quality toki roots. PCA was also effective in discriminating samples derive from different cultivars. Yamato-toki roots with the moderate quality were compared by means of PCA, and the results illustrated good discrimination which was influenced most by malic acid. Overall, this study demonstrated that metabolomics technique is accurate and efficient in determining the quality differences in Yamato-toki roots, and has a potential to be a superior and suitable method to assess the quality of this medicinal plant.
Gad, Haidy A; Bouzabata, Amel
2017-12-15
Turmeric (Curcuma longa L.) belongs to the family Zingiberaceae that is widely used as a spice in food preparations in addition to its biological activities. UV, FT-IR, 1 H NMR in addition to HPLC were applied to construct a metabolic fingerprint for Turmeric in an attempt to assess its quality. 30 samples were analyzed, and then principal component analysis (PCA) and hierarchical clustering analysis (HCA) were utilized to assess the differences and similarities between collected samples. PCA score plot based on both HPLC and UV spectroscopy showed the same discriminatory pattern, where the samples were segregated into four main groups depending on their total curcuminoids content. The results revealed that UV could be utilized as a simple and rapid alternative for HPLC. However, FT-IR failed to discriminate between the same species. By applying 1 H NMR, the metabolic variability between samples was more evident in the essential oils/fatty acid region. Copyright © 2017 Elsevier Ltd. All rights reserved.
Pérez Aparicio, Jesús; Toledano Medina, M Angeles; Lafuente Rosales, Victoria
2007-07-09
Free-choice profile (FCP), developed in the 1980s, is a sensory analysis method that can be carried out by untrained panels. The participants need only to be able to use a scale and be consumers of the product under evaluation. The data are analysed by sophisticated statistical methodologies like Generalized Procrustean Analysis (GPA) or STATIS. To facilitate a wider use of the free-choice profiling procedure, different authors have advocated simpler methods based on principal components analysis (PCA) of merged data sets. The purpose of this work was to apply another easy procedure to this type of data by means of a robust PCA. The most important characteristic of the proposed method is that quality responsible managers could use this methodology without any scale evaluation. Only the free terms generated by the assessors are necessary to apply the script, thus avoiding the error associated with scale utilization by inexpert assessors. Also, it is possible to use the application with missing data and with differences in the assessors' attendance at sessions. An example was performed to generate the descriptors from different orange juice types. The results were compared with the STATIS method and with the PCA on the merged data sets. The samples evaluated were fresh orange juices with differences in storage days and pasteurized, concentrated and orange nectar drinks from different brands. Eighteen assessors with a low-level training program were used in a six-session free-choice profile framework. The results proved that this script could be of use in marketing decisions and product quality program development.
Liu, Gui-Song; Guo, Hao-Song; Pan, Tao; Wang, Ji-Hua; Cao, Gan
2014-10-01
Based on Savitzky-Golay (SG) smoothing screening, principal component analysis (PCA) combined with separately supervised linear discriminant analysis (LDA) and unsupervised hierarchical clustering analysis (HCA) were used for non-destructive visible and near-infrared (Vis-NIR) detection for breed screening of transgenic sugarcane. A random and stability-dependent framework of calibration, prediction, and validation was proposed. A total of 456 samples of sugarcane leaves planting in the elongating stage were collected from the field, which was composed of 306 transgenic (positive) samples containing Bt and Bar gene and 150 non-transgenic (negative) samples. A total of 156 samples (negative 50 and positive 106) were randomly selected as the validation set; the remaining samples (negative 100 and positive 200, a total of 300 samples) were used as the modeling set, and then the modeling set was subdivided into calibration (negative 50 and positive 100, a total of 150 samples) and prediction sets (negative 50 and positive 100, a total of 150 samples) for 50 times. The number of SG smoothing points was ex- panded, while some modes of higher derivative were removed because of small absolute value, and a total of 264 smoothing modes were used for screening. The pairwise combinations of first three principal components were used, and then the optimal combination of principal components was selected according to the model effect. Based on all divisions of calibration and prediction sets and all SG smoothing modes, the SG-PCA-LDA and SG-PCA-HCA models were established, the model parameters were optimized based on the average prediction effect for all divisions to produce modeling stability. Finally, the model validation was performed by validation set. With SG smoothing, the modeling accuracy and stability of PCA-LDA, PCA-HCA were signif- icantly improved. For the optimal SG-PCA-LDA model, the recognition rate of positive and negative validation samples were 94.3%, 96.0%; and were 92.5%, 98.0% for the optimal SG-PCA-LDA model, respectively. Vis-NIR spectro- scopic pattern recognition combined with SG smoothing could be used for accurate recognition of transgenic sugarcane leaves, and provided a convenient screening method for transgenic sugarcane breeding.
Xia, Bing; Zhou, Yan; Liu, Xin; Xiao, Juan; Liu, Qing; Gu, Yucheng; Ding, Lisheng
2012-06-15
Carbohydrates are good source of drugs and play important roles in metabolism processes and cellular interactions in organisms. Distinguishing monosaccharide isomers in saccharide derivates is an important and elementary work in investigating saccharides. It is important to develop a fast, simple and direct method for this purpose, which is described in this study. Stock solutions of monosaccharide with a concentration of 400 μM and sodium chloride at a concentration of 10 μM were made in water/methanol (50:50, v/v). The samples were subjected to electrospray ionization ion-trap tandem mass spectrometry (ESI-MS) and the detected [2M + Na - H(2)O](+) ions were further investigated by tandem mass spectrometry (MS/MS), followed by applying principal component analysis (PCA) on the obtained MS/MS data sets. The MS/MS spectra of the [2M + Na - H(2)O](+) ions at m/z 365 for hexoses and m/z 305 for pentoses yielded unambiguous fragment patterns, while rhamnose can be directly identified by its ESI-MS [M + Na](+) ion at m/z 187. PCA showed clustering of MS/MS data of identical monosaccharide samples obtained from different experiments. By using this method, the monosaccharide in daucosterol hydrolysate was successfully identified. A new strategy was developed for differentiation of the monosaccharides using ESI-MS/MS and PCA. In MS/MS spectra, the [2M + Na - H(2)O](+) ions yielded unambiguous distinction. PCA of the archived MS/MS data sets was applied to demonstrate the spatial resolution of the studied samples. This method presented a simple and reliable way for distinguishing monosaccharides by ESI-MS/MS. Copyright © 2012 John Wiley & Sons, Ltd.
Mayer, Rulon; Simone, Charles B; Skinner, William; Turkbey, Baris; Choykey, Peter
2018-03-01
Gleason Score (GS) is a validated predictor of prostate cancer (PCa) disease progression and outcomes. GS from invasive needle biopsies suffers from significant inter-observer variability and possible sampling error, leading to underestimating disease severity ("underscoring") and can result in possible complications. A robust non-invasive image-based approach is, therefore, needed. Use spatially registered multi-parametric MRI (MP-MRI), signatures, and supervised target detection algorithms (STDA) to non-invasively GS PCa at the voxel level. This study retrospectively analyzed 26 MP-MRI from The Cancer Imaging Archive. The MP-MRI (T2, Diffusion Weighted, Dynamic Contrast Enhanced) were spatially registered to each other, combined into stacks, and stitched together to form hypercubes. Multi-parametric (or multi-spectral) signatures derived from a training set of registered MP-MRI were transformed using statistics-based Whitening-Dewhitening (WD). Transformed signatures were inserted into STDA (having conical decision surfaces) applied to registered MP-MRI determined the tumor GS. The MRI-derived GS was quantitatively compared to the pathologist's assessment of the histology of sectioned whole mount prostates from patients who underwent radical prostatectomy. In addition, a meta-analysis of 17 studies of needle biopsy determined GS with confusion matrices and was compared to the MRI-determined GS. STDA and histology determined GS are highly correlated (R = 0.86, p < 0.02). STDA more accurately determined GS and reduced GS underscoring of PCa relative to needle biopsy as summarized by meta-analysis (p < 0.05). This pilot study found registered MP-MRI, STDA, and WD transforms of signatures shows promise in non-invasively GS PCa and reducing underscoring with high spatial resolution. Copyright © 2018 Elsevier Ltd. All rights reserved.
Zhang, Wei; Ren, Shan-Cheng; Shi, Xiao-Lei; Liu, Ya-Wei; Zhu, Ya-Sheng; Jing, Tai-Le; Wang, Fu-Bo; Chen, Rui; Xu, Chuan-Liang; Wang, Hui-Qing; Wang, Hai-Feng; Wang, Yan; Liu, Bing; Li, Yao-Ming; Fang, Zi-Yu; Guo, Fei; Lu, Xin; Shen, Dan; Gao, Xu; Hou, Jian-Guo; Sun, Ying-Hao
2015-05-01
Long non-coding RNA (LncRNA) PCA3 has been a well-established urine biomarker for the detection of prostate cancer (PCa). Our previous study showed a novel LncRNA FR0348383 is up-regulated in over 70% of PCa compared with matched benign tissues. The aim of this study was to evaluate the diagnostic value of urinary FR0348383 for men undergoing prostate biopsy due to elevated PSA (PSA > 4.0 ng/ml) and/or abnormal digital rectal examination (DRE). Post-DRE first-catch urine specimens prior to prostate biopsies were prospectively collected. After the whole transcriptome amplification, quantitative real time polymerase chain reaction was applied to quantify urine FR0348383 and PSA levels. The FR0348383 score was calculated as the ratio of PSA and FR0348383 mRNA (PSA mRNA/FR0348383 mRNA × 1000). The diagnostic value of FR0348383 score was evaluated by logistic regression and decision curve analysis. 213 cases with urine samples containing sufficient mRNA were included, 94 cases had serum PSA level 4.0-10.0 ng/ml. PCa was identified in 72 cases. An increasing FR0348383 score was correlated with an increasing probability of a positive biopsy (P < 0.001). Multivariable logistic analysis indicated FR0348383 score (P < 0.001), PSA (P = 0.004), age (P = 0.007), prostate volume (P < 0.001) were independent predictors of PCa. ROC analysis demonstrated FR0348383 score outperformed PSA, %free PSA, and PSA Density in the prediction of PCa in the subgroup of patients with grey area PSA (AUC: 0.815 vs. 0.562 vs. 0.599 vs. 0.645). When using a probability threshold of 30% in the grey zone cohort, The FR0348383 score would save 52.0% of avoidable biopsies without missing any high grade cancers. FR0348383 transcript in post-DRE urine may be a novel biomarker for detection of PCa with great diagnostic value, especially in the grey zone cohort. The application of FR0348383 score in clinical practice might avoid unnecessary prostate biopsies and increase the specificity of PCa diagnosis. © 2015 Wiley Periodicals, Inc.
Exploring patterns enriched in a dataset with contrastive principal component analysis.
Abid, Abubakar; Zhang, Martin J; Bagaria, Vivek K; Zou, James
2018-05-30
Visualization and exploration of high-dimensional data is a ubiquitous challenge across disciplines. Widely used techniques such as principal component analysis (PCA) aim to identify dominant trends in one dataset. However, in many settings we have datasets collected under different conditions, e.g., a treatment and a control experiment, and we are interested in visualizing and exploring patterns that are specific to one dataset. This paper proposes a method, contrastive principal component analysis (cPCA), which identifies low-dimensional structures that are enriched in a dataset relative to comparison data. In a wide variety of experiments, we demonstrate that cPCA with a background dataset enables us to visualize dataset-specific patterns missed by PCA and other standard methods. We further provide a geometric interpretation of cPCA and strong mathematical guarantees. An implementation of cPCA is publicly available, and can be used for exploratory data analysis in many applications where PCA is currently used.
Arbogast, Luke W; Delaglio, Frank; Schiel, John E; Marino, John P
2017-11-07
Two-dimensional (2D) 1 H- 13 C methyl NMR provides a powerful tool to probe the higher order structure (HOS) of monoclonal antibodies (mAbs), since spectra can readily be acquired on intact mAbs at natural isotopic abundance, and small changes in chemical environment and structure give rise to observable changes in corresponding spectra, which can be interpreted at atomic resolution. This makes it possible to apply 2D NMR spectral fingerprinting approaches directly to drug products in order to systematically characterize structure and excipient effects. Systematic collections of NMR spectra are often analyzed in terms of the changes in specifically identified peak positions, as well as changes in peak height and line widths. A complementary approach is to apply principal component analysis (PCA) directly to the matrix of spectral data, correlating spectra according to similarities and differences in their overall shapes, rather than according to parameters of individually identified peaks. This is particularly well-suited for spectra of mAbs, where some of the individual peaks might not be well resolved. Here we demonstrate the performance of the PCA method for discriminating structural variation among systematic sets of 2D NMR fingerprint spectra using the NISTmAb and illustrate how spectral variability identified by PCA may be correlated to structure.
Quantitative analysis of NMR spectra with chemometrics
NASA Astrophysics Data System (ADS)
Winning, H.; Larsen, F. H.; Bro, R.; Engelsen, S. B.
2008-01-01
The number of applications of chemometrics to series of NMR spectra is rapidly increasing due to an emerging interest for quantitative NMR spectroscopy e.g. in the pharmaceutical and food industries. This paper gives an analysis of advantages and limitations of applying the two most common chemometric procedures, Principal Component Analysis (PCA) and Multivariate Curve Resolution (MCR), to a designed set of 231 simple alcohol mixture (propanol, butanol and pentanol) 1H 400 MHz spectra. The study clearly demonstrates that the major advantage of chemometrics is the visualisation of larger data structures which adds a new exploratory dimension to NMR research. While robustness and powerful data visualisation and exploration are the main qualities of the PCA method, the study demonstrates that the bilinear MCR method is an even more powerful method for resolving pure component NMR spectra from mixtures when certain conditions are met.
Portable XRF and principal component analysis for bill characterization in forensic science.
Appoloni, C R; Melquiades, F L
2014-02-01
Several modern techniques have been applied to prevent counterfeiting of money bills. The objective of this study was to demonstrate the potential of Portable X-ray Fluorescence (PXRF) technique and the multivariate analysis method of Principal Component Analysis (PCA) for classification of bills in order to use it in forensic science. Bills of Dollar, Euro and Real (Brazilian currency) were measured directly at different colored regions, without any previous preparation. Spectra interpretation allowed the identification of Ca, Ti, Fe, Cu, Sr, Y, Zr and Pb. PCA analysis separated the bills in three groups and subgroups among Brazilian currency. In conclusion, the samples were classified according to its origin identifying the elements responsible for differentiation and basic pigment composition. PXRF allied to multivariate discriminate methods is a promising technique for rapid and no destructive identification of false bills in forensic science. Copyright © 2013 Elsevier Ltd. All rights reserved.
Degradation trend estimation of slewing bearing based on LSSVM model
NASA Astrophysics Data System (ADS)
Lu, Chao; Chen, Jie; Hong, Rongjing; Feng, Yang; Li, Yuanyuan
2016-08-01
A novel prediction method is proposed based on least squares support vector machine (LSSVM) to estimate the slewing bearing's degradation trend with small sample data. This method chooses the vibration signal which contains rich state information as the object of the study. Principal component analysis (PCA) was applied to fuse multi-feature vectors which could reflect the health state of slewing bearing, such as root mean square, kurtosis, wavelet energy entropy, and intrinsic mode function (IMF) energy. The degradation indicator fused by PCA can reflect the degradation more comprehensively and effectively. Then the degradation trend of slewing bearing was predicted by using the LSSVM model optimized by particle swarm optimization (PSO). The proposed method was demonstrated to be more accurate and effective by the whole life experiment of slewing bearing. Therefore, it can be applied in engineering practice.
A Novel Weighted Kernel PCA-Based Method for Optimization and Uncertainty Quantification
NASA Astrophysics Data System (ADS)
Thimmisetty, C.; Talbot, C.; Chen, X.; Tong, C. H.
2016-12-01
It has been demonstrated that machine learning methods can be successfully applied to uncertainty quantification for geophysical systems through the use of the adjoint method coupled with kernel PCA-based optimization. In addition, it has been shown through weighted linear PCA how optimization with respect to both observation weights and feature space control variables can accelerate convergence of such methods. Linear machine learning methods, however, are inherently limited in their ability to represent features of non-Gaussian stochastic random fields, as they are based on only the first two statistical moments of the original data. Nonlinear spatial relationships and multipoint statistics leading to the tortuosity characteristic of channelized media, for example, are captured only to a limited extent by linear PCA. With the aim of coupling the kernel-based and weighted methods discussed, we present a novel mathematical formulation of kernel PCA, Weighted Kernel Principal Component Analysis (WKPCA), that both captures nonlinear relationships and incorporates the attribution of significance levels to different realizations of the stochastic random field of interest. We also demonstrate how new instantiations retaining defining characteristics of the random field can be generated using Bayesian methods. In particular, we present a novel WKPCA-based optimization method that minimizes a given objective function with respect to both feature space random variables and observation weights through which optimal snapshot significance levels and optimal features are learned. We showcase how WKPCA can be applied to nonlinear optimal control problems involving channelized media, and in particular demonstrate an application of the method to learning the spatial distribution of material parameter values in the context of linear elasticity, and discuss further extensions of the method to stochastic inversion.
Discrimination of transgenic soybean seeds by terahertz spectroscopy
NASA Astrophysics Data System (ADS)
Liu, Wei; Liu, Changhong; Chen, Feng; Yang, Jianbo; Zheng, Lei
2016-10-01
Discrimination of genetically modified organisms is increasingly demanded by legislation and consumers worldwide. The feasibility of a non-destructive discrimination of glyphosate-resistant and conventional soybean seeds and their hybrid descendants was examined by terahertz time-domain spectroscopy system combined with chemometrics. Principal component analysis (PCA), least squares-support vector machines (LS-SVM) and PCA-back propagation neural network (PCA-BPNN) models with the first and second derivative and standard normal variate (SNV) transformation pre-treatments were applied to classify soybean seeds based on genotype. Results demonstrated clear differences among glyphosate-resistant, hybrid descendants and conventional non-transformed soybean seeds could easily be visualized with an excellent classification (accuracy was 88.33% in validation set) using the LS-SVM and the spectra with SNV pre-treatment. The results indicated that THz spectroscopy techniques together with chemometrics would be a promising technique to distinguish transgenic soybean seeds from non-transformed seeds with high efficiency and without any major sample preparation.
NASA Technical Reports Server (NTRS)
2005-01-01
Under funding from this proposal three in situ profile measurements of stratospheric sulfate aerosol and ozone were completed from balloon-borne platforms. The measured quantities are aerosol size resolved number concentration and ozone. The one derived product is aerosol size distribution, from which aerosol moments, such as surface area, volume, and extinction can be calculated for comparison with SAGE III measurements and SAGE III derived products, such as surface area. The analysis of these profiles and comparison with SAGE III extinction measurements and SAGE III derived surface areas are provided in Yongxiao (2005), which comprised the research thesis component of Mr. Jian Yongxiao's M.S. degree in Atmospheric Science at the University of Wyoming. In addition analysis continues on using principal component analysis (PCA) to derive aerosol surface area from the 9 wavelength extinction measurements available from SAGE III. Ths paper will present PCA components to calculate surface area from SAGE III measurements and compare these derived surface areas with those available directly from in situ size distribution measurements, as well as surface areas which would be derived from PCA and Thomason's algorithm applied to the four wavelength SAGE II extinction measurements.
Principal elementary mode analysis (PEMA).
Folch-Fortuny, Abel; Marques, Rodolfo; Isidro, Inês A; Oliveira, Rui; Ferrer, Alberto
2016-03-01
Principal component analysis (PCA) has been widely applied in fluxomics to compress data into a few latent structures in order to simplify the identification of metabolic patterns. These latent structures lack a direct biological interpretation due to the intrinsic constraints associated with a PCA model. Here we introduce a new method that significantly improves the interpretability of the principal components with a direct link to metabolic pathways. This method, called principal elementary mode analysis (PEMA), establishes a bridge between a PCA-like model, aimed at explaining the maximum variance in flux data, and the set of elementary modes (EMs) of a metabolic network. It provides an easy way to identify metabolic patterns in large fluxomics datasets in terms of the simplest pathways of the organism metabolism. The results using a real metabolic model of Escherichia coli show the ability of PEMA to identify the EMs that generated the different simulated flux distributions. Actual flux data of E. coli and Pichia pastoris cultures confirm the results observed in the simulated study, providing a biologically meaningful model to explain flux data of both organisms in terms of the EM activation. The PEMA toolbox is freely available for non-commercial purposes on http://mseg.webs.upv.es.
NASA Astrophysics Data System (ADS)
Kopparla, P.; Natraj, V.; Shia, R. L.; Spurr, R. J. D.; Crisp, D.; Yung, Y. L.
2015-12-01
Radiative transfer (RT) computations form the engine of atmospheric retrieval codes. However, full treatment of RT processes is computationally expensive, prompting usage of two-stream approximations in current exoplanetary atmospheric retrieval codes [Line et al., 2013]. Natraj et al. [2005, 2010] and Spurr and Natraj [2013] demonstrated the ability of a technique using principal component analysis (PCA) to speed up RT computations. In the PCA method for RT performance enhancement, empirical orthogonal functions are developed for binned sets of inherent optical properties that possess some redundancy; costly multiple-scattering RT calculations are only done for those few optical states corresponding to the most important principal components, and correction factors are applied to approximate radiation fields. Kopparla et al. [2015, in preparation] extended the PCA method to a broadband spectral region from the ultraviolet to the shortwave infrared (0.3-3 micron), accounting for major gas absorptions in this region. Here, we apply the PCA method to a some typical (exo-)planetary retrieval problems. Comparisons between the new model, called Universal Principal Component Analysis Radiative Transfer (UPCART) model, two-stream models and line-by-line RT models are performed, for spectral radiances, spectral fluxes and broadband fluxes. Each of these are calculated at the top of the atmosphere for several scenarios with varying aerosol types, extinction and scattering optical depth profiles, and stellar and viewing geometries. We demonstrate that very accurate radiance and flux estimates can be obtained, with better than 1% accuracy in all spectral regions and better than 0.1% in most cases, as compared to a numerically exact line-by-line RT model. The accuracy is enhanced when the results are convolved to typical instrument resolutions. The operational speed and accuracy of UPCART can be further improved by optimizing binning schemes and parallelizing the codes, work on which is under way.
Jesse, Stephen; Kalinin, Sergei V
2009-02-25
An approach for the analysis of multi-dimensional, spectroscopic-imaging data based on principal component analysis (PCA) is explored. PCA selects and ranks relevant response components based on variance within the data. It is shown that for examples with small relative variations between spectra, the first few PCA components closely coincide with results obtained using model fitting, and this is achieved at rates approximately four orders of magnitude faster. For cases with strong response variations, PCA allows an effective approach to rapidly process, de-noise, and compress data. The prospects for PCA combined with correlation function analysis of component maps as a universal tool for data analysis and representation in microscopy are discussed.
Henriksson, S; Hagberg, J; Bäckström, M; Persson, I; Lindström, G
2013-09-01
Polychlorinated dibenzo-p-dioxins and polychlorinated dibenzo-p-furans (PCDD/Fs) were analysed in soil from a Swedish sawmill site where chlorophenols (CPs) had been used more than 40 years ago. The most contaminated area at the site was the preservation subarea where the PCDD/F WHO2005-TEQ level was 3450 times higher than the current Swedish guideline value of 200 ng TEQ/kg soil for land for industrial use. It was also shown that a fire which destroyed the sawmill might have affected the congener distribution at the concerned areas. To get a broader picture of the contamination both GIS (spatial interpolation analysis) and multivariate data analysis (PCA) were applied to visualize and compare PCDD/F levels as well as congener distributions at different areas at the site. It is shown that GIS and PCA are powerful tools in decisions on future investigations, risk assessments and remediation of contaminated sites. Copyright © 2013 Elsevier Ltd. All rights reserved.
Chemometric studies on potential larvicidal compounds against Aedes aegypti.
Scotti, Luciana; Scotti, Marcus Tullius; Silva, Viviane Barros; Santos, Sandra Regina Lima; Cavalcanti, Sócrates C H; Mendonça, Francisco J B
2014-03-01
The mosquito Aedes aegypti (Diptera, Culicidae) is the vector of yellow and dengue fever. In this study, chemometric tools, such as, Principal Component Analysis (PCA), Consensus PCA (CPCA), and Partial Least Squares Regression (PLS), were applied to a set of fifty five active compounds against Ae. aegypti larvae, which includes terpenes, cyclic alcohols, phenolic compounds, and their synthetic derivatives. The calculations were performed using the VolSurf+ program. CPCA analysis suggests that the higher weight blocks of descriptors were SIZE/SHAPE, DRY, and H2O. The PCA was generated with 48 descriptors selected from the previous blocks. The scores plot showed good separation between more and less potent compounds. The first two PCs accounted for over 60% of the data variance. The best model obtained in PLS, after validation leave-one-out, exhibited q(2) = 0.679 and r(2) = 0.714. External prediction model was R(2) = 0.623. The independent variables having a hydrophobic profile were strongly correlated to the biological data. The interaction maps generated with the GRID force field showed that the most active compounds exhibit more interaction with the DRY probe.
NASA Technical Reports Server (NTRS)
Cramer, K. Elliott; Winfree, William P.
2006-01-01
The Nondestructive Evaluation Sciences Branch at NASA s Langley Research Center has been actively involved in the development of thermographic inspection techniques for more than 15 years. Since the Space Shuttle Columbia accident, NASA has focused on the improvement of advanced NDE techniques for the Reinforced Carbon-Carbon (RCC) panels that comprise the orbiter s wing leading edge. Various nondestructive inspection techniques have been used in the examination of the RCC, but thermography has emerged as an effective inspection alternative to more traditional methods. Thermography is a non-contact inspection method as compared to ultrasonic techniques which typically require the use of a coupling medium between the transducer and material. Like radiographic techniques, thermography can be used to inspect large areas, but has the advantage of minimal safety concerns and the ability for single-sided measurements. Principal Component Analysis (PCA) has been shown effective for reducing thermographic NDE data. A typical implementation of PCA is when the eigenvectors are generated from the data set being analyzed. Although it is a powerful tool for enhancing the visibility of defects in thermal data, PCA can be computationally intense and time consuming when applied to the large data sets typical in thermography. Additionally, PCA can experience problems when very large defects are present (defects that dominate the field-of-view), since the calculation of the eigenvectors is now governed by the presence of the defect, not the good material. To increase the processing speed and to minimize the negative effects of large defects, an alternative method of PCA is being pursued when a fixed set of eigenvectors is used to process the thermal data from the RCC materials. These eigen vectors can be generated either from an analytic model of the thermal response of the material under examination, or from a large cross section of experimental data. This paper will provide the details of the analytic model; an overview of the PCA process; as well as a quantitative signal-to-noise comparison of the results of performing both embodiments of PCA on thermographic data from various RCC specimens. Details of a system that has been developed to allow insitu inspection of a majority of shuttle RCC components will be presented along with the acceptance test results for this system. Additionally, the results of applying this technology to the Space Shuttle Discovery after its return from flight will be presented.
ERIC Educational Resources Information Center
Rahayu, Sri; Sugiarto, Teguh; Madu, Ludiro; Holiawati; Subagyo, Ahmad
2017-01-01
This study aims to apply the model principal component analysis to reduce multicollinearity on variable currency exchange rate in eight countries in Asia against US Dollar including the Yen (Japan), Won (South Korea), Dollar (Hong Kong), Yuan (China), Bath (Thailand), Rupiah (Indonesia), Ringgit (Malaysia), Dollar (Singapore). It looks at yield…
Towards the generation of a parametric foot model using principal component analysis: A pilot study.
Scarton, Alessandra; Sawacha, Zimi; Cobelli, Claudio; Li, Xinshan
2016-06-01
There have been many recent developments in patient-specific models with their potential to provide more information on the human pathophysiology and the increase in computational power. However they are not yet successfully applied in a clinical setting. One of the main challenges is the time required for mesh creation, which is difficult to automate. The development of parametric models by means of the Principle Component Analysis (PCA) represents an appealing solution. In this study PCA has been applied to the feet of a small cohort of diabetic and healthy subjects, in order to evaluate the possibility of developing parametric foot models, and to use them to identify variations and similarities between the two populations. Both the skin and the first metatarsal bones have been examined. Besides the reduced sample of subjects considered in the analysis, results demonstrated that the method adopted herein constitutes a first step towards the realization of a parametric foot models for biomechanical analysis. Furthermore the study showed that the methodology can successfully describe features in the foot, and evaluate differences in the shape of healthy and diabetic subjects. Copyright © 2016 IPEM. Published by Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Benitez-Garcia, Gibran; Nakamura, Tomoaki; Kaneko, Masahide
2017-01-01
Darwin was the first one to assert that facial expressions are innate and universal, which are recognized across all cultures. However, recent some cross-cultural studies have questioned this assumed universality. Therefore, this paper presents an analysis of the differences between Western and East-Asian faces of the six basic expressions (anger, disgust, fear, happiness, sadness and surprise) focused on three individual facial regions of eyes-eyebrows, nose and mouth. The analysis is conducted by applying PCA for two feature extraction methods: appearance-based by using the pixel intensities of facial parts, and geometric-based by handling 125 feature points from the face. Both methods are evaluated using 4 standard databases for both racial groups and the results are compared with a cross-cultural human study applied to 20 participants. Our analysis reveals that differences between Westerns and East-Asians exist mainly on the regions of eyes-eyebrows and mouth for expressions of fear and disgust respectively. This work presents important findings for a better design of automatic facial expression recognition systems based on the difference between two racial groups.
Priority of VHS Development Based in Potential Area using Principal Component Analysis
NASA Astrophysics Data System (ADS)
Meirawan, D.; Ana, A.; Saripudin, S.
2018-02-01
The current condition of VHS is still inadequate in quality, quantity and relevance. The purpose of this research is to analyse the development of VHS based on the development of regional potential by using principal component analysis (PCA) in Bandung, Indonesia. This study used descriptive qualitative data analysis using the principle of secondary data reduction component. The method used is Principal Component Analysis (PCA) analysis with Minitab Statistics Software tool. The results of this study indicate the value of the lowest requirement is a priority of the construction of development VHS with a program of majors in accordance with the development of regional potential. Based on the PCA score found that the main priority in the development of VHS in Bandung is in Saguling, which has the lowest PCA value of 416.92 in area 1, Cihampelas with the lowest PCA value in region 2 and Padalarang with the lowest PCA value.
DOE Office of Scientific and Technical Information (OSTI.GOV)
University of Illinois at Chicago; Montana State University; Bhardwaj, Chhavi
2013-04-01
7.87 to 10.5 eV vacuum ultraviolet (VUV) photon energies were used in laser desorption postionization mass spectrometry (LDPI-MS) to analyze biofilms comprised of binary cultures of interacting microorganisms. The effect of photon energy was examined using both tunable synchrotron and laser sources of VUV radiation. Principal components analysis (PCA) was applied to the MS data to differentiate species in Escherichia coli-Saccharomyces cerevisiae coculture biofilms. PCA of LDPI-MS also differentiated individual E. coli strains in a biofilm comprised of two interacting gene deletion strains, even though these strains differed from the wild type K-12 strain by no more than four genemore » deletions each out of approximately 2000 genes. PCA treatment of 7.87 eV LDPI-MS data separated the E. coli strains into three distinct groups two ?pure? groups and a mixed region. Furthermore, the ?pure? regions of the E. coli cocultures showed greater variance by PCA when analyzed by 7.87 eV photon energies than by 10.5 eV radiation. Comparison of the 7.87 and 10.5 eV data is consistent with the expectation that the lower photon energy selects a subset of low ionization energy analytes while 10.5 eV is more inclusive, detecting a wider range of analytes. These two VUV photon energies therefore give different spreads via PCA and their respective use in LDPI-MS constitute an additional experimental parameter to differentiate strains and species.« less
NASA Astrophysics Data System (ADS)
Zhao, Yan-Ru; Yu, Ke-Qiang; Li, Xiaoli; He, Yong
2016-12-01
Infected petals are often regarded as the source for the spread of fungi Sclerotinia sclerotiorum in all growing process of rapeseed (Brassica napus L.) plants. This research aimed to detect fungal infection of rapeseed petals by applying hyperspectral imaging in the spectral region of 874-1734 nm coupled with chemometrics. Reflectance was extracted from regions of interest (ROIs) in the hyperspectral image of each sample. Firstly, principal component analysis (PCA) was applied to conduct a cluster analysis with the first several principal components (PCs). Then, two methods including X-loadings of PCA and random frog (RF) algorithm were used and compared for optimizing wavebands selection. Least squares-support vector machine (LS-SVM) methodology was employed to establish discriminative models based on the optimal and full wavebands. Finally, area under the receiver operating characteristics curve (AUC) was utilized to evaluate classification performance of these LS-SVM models. It was found that LS-SVM based on the combination of all optimal wavebands had the best performance with AUC of 0.929. These results were promising and demonstrated the potential of applying hyperspectral imaging in fungus infection detection on rapeseed petals.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hu, Wenjian; Singh, Rajiv R. P.; Scalettar, Richard T.
Here, we apply unsupervised machine learning techniques, mainly principal component analysis (PCA), to compare and contrast the phase behavior and phase transitions in several classical spin models - the square and triangular-lattice Ising models, the Blume-Capel model, a highly degenerate biquadratic-exchange spin-one Ising (BSI) model, and the 2D XY model, and examine critically what machine learning is teaching us. We find that quantified principal components from PCA not only allow exploration of different phases and symmetry-breaking, but can distinguish phase transition types and locate critical points. We show that the corresponding weight vectors have a clear physical interpretation, which ismore » particularly interesting in the frustrated models such as the triangular antiferromagnet, where they can point to incipient orders. Unlike the other well-studied models, the properties of the BSI model are less well known. Using both PCA and conventional Monte Carlo analysis, we demonstrate that the BSI model shows an absence of phase transition and macroscopic ground-state degeneracy. The failure to capture the 'charge' correlations (vorticity) in the BSI model (XY model) from raw spin configurations points to some of the limitations of PCA. Finally, we employ a nonlinear unsupervised machine learning procedure, the 'antoencoder method', and demonstrate that it too can be trained to capture phase transitions and critical points.« less
Hu, Wenjian; Singh, Rajiv R. P.; Scalettar, Richard T.
2017-06-19
Here, we apply unsupervised machine learning techniques, mainly principal component analysis (PCA), to compare and contrast the phase behavior and phase transitions in several classical spin models - the square and triangular-lattice Ising models, the Blume-Capel model, a highly degenerate biquadratic-exchange spin-one Ising (BSI) model, and the 2D XY model, and examine critically what machine learning is teaching us. We find that quantified principal components from PCA not only allow exploration of different phases and symmetry-breaking, but can distinguish phase transition types and locate critical points. We show that the corresponding weight vectors have a clear physical interpretation, which ismore » particularly interesting in the frustrated models such as the triangular antiferromagnet, where they can point to incipient orders. Unlike the other well-studied models, the properties of the BSI model are less well known. Using both PCA and conventional Monte Carlo analysis, we demonstrate that the BSI model shows an absence of phase transition and macroscopic ground-state degeneracy. The failure to capture the 'charge' correlations (vorticity) in the BSI model (XY model) from raw spin configurations points to some of the limitations of PCA. Finally, we employ a nonlinear unsupervised machine learning procedure, the 'antoencoder method', and demonstrate that it too can be trained to capture phase transitions and critical points.« less
NASA Astrophysics Data System (ADS)
Hu, Wenjian; Singh, Rajiv R. P.; Scalettar, Richard T.
2017-06-01
We apply unsupervised machine learning techniques, mainly principal component analysis (PCA), to compare and contrast the phase behavior and phase transitions in several classical spin models—the square- and triangular-lattice Ising models, the Blume-Capel model, a highly degenerate biquadratic-exchange spin-1 Ising (BSI) model, and the two-dimensional X Y model—and we examine critically what machine learning is teaching us. We find that quantified principal components from PCA not only allow the exploration of different phases and symmetry-breaking, but they can distinguish phase-transition types and locate critical points. We show that the corresponding weight vectors have a clear physical interpretation, which is particularly interesting in the frustrated models such as the triangular antiferromagnet, where they can point to incipient orders. Unlike the other well-studied models, the properties of the BSI model are less well known. Using both PCA and conventional Monte Carlo analysis, we demonstrate that the BSI model shows an absence of phase transition and macroscopic ground-state degeneracy. The failure to capture the "charge" correlations (vorticity) in the BSI model (X Y model) from raw spin configurations points to some of the limitations of PCA. Finally, we employ a nonlinear unsupervised machine learning procedure, the "autoencoder method," and we demonstrate that it too can be trained to capture phase transitions and critical points.
[Research on spectra recognition method for cabbages and weeds based on PCA and SIMCA].
Zu, Qin; Deng, Wei; Wang, Xiu; Zhao, Chun-Jiang
2013-10-01
In order to improve the accuracy and efficiency of weed identification, the difference of spectral reflectance was employed to distinguish between crops and weeds. Firstly, the different combinations of Savitzky-Golay (SG) convolutional derivation and multiplicative scattering correction (MSC) method were applied to preprocess the raw spectral data. Then the clustering analysis of various types of plants was completed by using principal component analysis (PCA) method, and the feature wavelengths which were sensitive for classifying various types of plants were extracted according to the corresponding loading plots of the optimal principal components in PCA results. Finally, setting the feature wavelengths as the input variables, the soft independent modeling of class analogy (SIMCA) classification method was used to identify the various types of plants. The experimental results of classifying cabbages and weeds showed that on the basis of the optimal pretreatment by a synthetic application of MSC and SG convolutional derivation with SG's parameters set as 1rd order derivation, 3th degree polynomial and 51 smoothing points, 23 feature wavelengths were extracted in accordance with the top three principal components in PCA results. When SIMCA method was used for classification while the previously selected 23 feature wavelengths were set as the input variables, the classification rates of the modeling set and the prediction set were respectively up to 98.6% and 100%.
Wang, Lei; Beg, Faisal; Ratnanather, Tilak; Ceritoglu, Can; Younes, Laurent; Morris, John C.; Csernansky, John G.; Miller, Michael I.
2010-01-01
In large-deformation diffeomorphic metric mapping (LDDMM), the diffeomorphic matching of images are modeled as evolution in time, or a flow, of an associated smooth velocity vector field v controlling the evolution. The initial momentum parameterizes the whole geodesic and encodes the shape and form of the target image. Thus, methods such as principal component analysis (PCA) of the initial momentum leads to analysis of anatomical shape and form in target images without being restricted to small-deformation assumption in the analysis of linear displacements. We apply this approach to a study of dementia of the Alzheimer type (DAT). The left hippocampus in the DAT group shows significant shape abnormality while the right hippocampus shows similar pattern of abnormality. Further, PCA of the initial momentum leads to correct classification of 12 out of 18 DAT subjects and 22 out of 26 control subjects. PMID:17427733
Saberi, Saeed; Farré, Pau; Cuvier, Olivier; Emberly, Eldon
2015-05-23
A variety of DNA binding proteins are involved in regulating and shaping the packing of chromatin. They aid the formation of loops in the DNA that function to isolate different structural domains. A recent experimental technique, Hi-C, provides a method for determining the frequency of such looping between all distant parts of the genome. Given that the binding locations of many chromatin associated proteins have also been measured, it has been possible to make estimates for their influence on the long-range interactions as measured by Hi-C. However, a challenge in this analysis is the predominance of non-specific contacts that mask out the specific interactions of interest. We show that transforming the Hi-C contact frequencies into free energies gives a natural method for separating out the distance dependent non-specific interactions. In particular we apply Principal Component Analysis (PCA) to the transformed free energy matrix to identify the dominant modes of interaction. PCA identifies systematic effects as well as high frequency spatial noise in the Hi-C data which can be filtered out. Thus it can be used as a data driven approach for normalizing Hi-C data. We assess this PCA based normalization approach, along with several other normalization schemes, by fitting the transformed Hi-C data using a pairwise interaction model that takes as input the known locations of bound chromatin factors. The result of fitting is a set of predictions for the coupling energies between the various chromatin factors and their effect on the energetics of looping. We show that the quality of the fit can be used as a means to determine how much PCA filtering should be applied to the Hi-C data. We find that the different normalizations of the Hi-C data vary in the quality of fit to the pairwise interaction model. PCA filtering can improve the fit, and the predicted coupling energies lead to biologically meaningful insights for how various chromatin bound factors influence the stability of DNA loops in chromatin.
Behavior of the PCA3 gene in the urine of men with high grade prostatic intraepithelial neoplasia.
Morote, Juan; Rigau, Marina; Garcia, Marta; Mir, Carmen; Ballesteros, Carlos; Planas, Jacques; Raventós, Carles X; Placer, José; de Torres, Inés M; Reventós, Jaume; Doll, Andreas
2010-12-01
An ideal marker for the early detection of prostate cancer (PCa) should also differentiate between men with isolated high grade prostatic intraepithelial neoplasia (HGPIN) and those with PCa. Prostate Cancer Gene 3 (PCA3) is a highly specific PCa gene and its score, in relation to the PSA gene in post-prostate massage urine (PMU-PCA3), seems to be useful in ruling out PCa, especially after a negative prostate biopsy. Because PCA3 is also expressed in the HGPIN lesion, the aim of this study was to determine the efficacy of PMU-PCA3 scores for ruling out PCa in men with previous HGPIN. The PMU-PCA3 score was assessed by quantitative PCR (multiplex research assay) in 244 men subjected to prostate biopsy: 64 men with an isolated HGPIN (no cancer detected after two or more repeated biopsies), 83 men with PCa and 97 men with benign pathology findings (BP: no PCa, HGPIN or ASAP). The median PMU-PCA3 score was 1.56 in men with BP, 2.01 in men with HGPIN (p = 0.128) and 9.06 in men with PCa (p = 0.008). The AUC in the ROC analysis was 0.705 in the subset of men with BP and PCa, while it decreased to 0.629 when only men with isolated HGPIN and PCa were included in the analysis. Fixing the sensitivity of the PMU-PCA3 score at 90%, its specificity was 79% in men with BP and 69% in men with isolated HGPIN. The efficacy of the PMU-PCA3 score to rule out PCa in men with HGPIN is lower than in men with BP.
Common factor analysis versus principal component analysis: choice for symptom cluster research.
Kim, Hee-Ju
2008-03-01
The purpose of this paper is to examine differences between two factor analytical methods and their relevance for symptom cluster research: common factor analysis (CFA) versus principal component analysis (PCA). Literature was critically reviewed to elucidate the differences between CFA and PCA. A secondary analysis (N = 84) was utilized to show the actual result differences from the two methods. CFA analyzes only the reliable common variance of data, while PCA analyzes all the variance of data. An underlying hypothetical process or construct is involved in CFA but not in PCA. PCA tends to increase factor loadings especially in a study with a small number of variables and/or low estimated communality. Thus, PCA is not appropriate for examining the structure of data. If the study purpose is to explain correlations among variables and to examine the structure of the data (this is usual for most cases in symptom cluster research), CFA provides a more accurate result. If the purpose of a study is to summarize data with a smaller number of variables, PCA is the choice. PCA can also be used as an initial step in CFA because it provides information regarding the maximum number and nature of factors. In using factor analysis for symptom cluster research, several issues need to be considered, including subjectivity of solution, sample size, symptom selection, and level of measure.
Transforming Graph Data for Statistical Relational Learning
2012-10-01
Jordan, 2003), PLSA (Hofmann, 1999), ? Classification via RMN (Taskar et al., 2003) or SVM (Hasan, Chaoji, Salem , & Zaki, 2006) ? Hierarchical...dimensionality reduction methods such as Principal 407 Rossi, McDowell, Aha, & Neville Component Analysis (PCA), Principal Factor Analysis ( PFA ), and...clustering algorithm. Journal of the Royal Statistical Society. Series C, Applied statistics, 28, 100–108. Hasan, M. A., Chaoji, V., Salem , S., & Zaki, M
Multivariate frequency domain analysis of protein dynamics
NASA Astrophysics Data System (ADS)
Matsunaga, Yasuhiro; Fuchigami, Sotaro; Kidera, Akinori
2009-03-01
Multivariate frequency domain analysis (MFDA) is proposed to characterize collective vibrational dynamics of protein obtained by a molecular dynamics (MD) simulation. MFDA performs principal component analysis (PCA) for a bandpass filtered multivariate time series using the multitaper method of spectral estimation. By applying MFDA to MD trajectories of bovine pancreatic trypsin inhibitor, we determined the collective vibrational modes in the frequency domain, which were identified by their vibrational frequencies and eigenvectors. At near zero temperature, the vibrational modes determined by MFDA agreed well with those calculated by normal mode analysis. At 300 K, the vibrational modes exhibited characteristic features that were considerably different from the principal modes of the static distribution given by the standard PCA. The influences of aqueous environments were discussed based on two different sets of vibrational modes, one derived from a MD simulation in water and the other from a simulation in vacuum. Using the varimax rotation, an algorithm of the multivariate statistical analysis, the representative orthogonal set of eigenmodes was determined at each vibrational frequency.
NASA Astrophysics Data System (ADS)
He, Shixuan; Xie, Wanyi; Zhang, Wei; Zhang, Liqun; Wang, Yunxia; Liu, Xiaoling; Liu, Yulong; Du, Chunlei
2015-02-01
A novel strategy which combines iteratively cubic spline fitting baseline correction method with discriminant partial least squares qualitative analysis is employed to analyze the surface enhanced Raman scattering (SERS) spectroscopy of banned food additives, such as Sudan I dye and Rhodamine B in food, Malachite green residues in aquaculture fish. Multivariate qualitative analysis methods, using the combination of spectra preprocessing iteratively cubic spline fitting (ICSF) baseline correction with principal component analysis (PCA) and discriminant partial least squares (DPLS) classification respectively, are applied to investigate the effectiveness of SERS spectroscopy for predicting the class assignments of unknown banned food additives. PCA cannot be used to predict the class assignments of unknown samples. However, the DPLS classification can discriminate the class assignment of unknown banned additives using the information of differences in relative intensities. The results demonstrate that SERS spectroscopy combined with ICSF baseline correction method and exploratory analysis methodology DPLS classification can be potentially used for distinguishing the banned food additives in field of food safety.
Analysis of the principal component algorithm in phase-shifting interferometry.
Vargas, J; Quiroga, J Antonio; Belenguer, T
2011-06-15
We recently presented a new asynchronous demodulation method for phase-sampling interferometry. The method is based in the principal component analysis (PCA) technique. In the former work, the PCA method was derived heuristically. In this work, we present an in-depth analysis of the PCA demodulation method.
Investigation of inversion polymorphisms in the human genome using principal components analysis.
Ma, Jianzhong; Amos, Christopher I
2012-01-01
Despite the significant advances made over the last few years in mapping inversions with the advent of paired-end sequencing approaches, our understanding of the prevalence and spectrum of inversions in the human genome has lagged behind other types of structural variants, mainly due to the lack of a cost-efficient method applicable to large-scale samples. We propose a novel method based on principal components analysis (PCA) to characterize inversion polymorphisms using high-density SNP genotype data. Our method applies to non-recurrent inversions for which recombination between the inverted and non-inverted segments in inversion heterozygotes is suppressed due to the loss of unbalanced gametes. Inside such an inversion region, an effect similar to population substructure is thus created: two distinct "populations" of inversion homozygotes of different orientations and their 1:1 admixture, namely the inversion heterozygotes. This kind of substructure can be readily detected by performing PCA locally in the inversion regions. Using simulations, we demonstrated that the proposed method can be used to detect and genotype inversion polymorphisms using unphased genotype data. We applied our method to the phase III HapMap data and inferred the inversion genotypes of known inversion polymorphisms at 8p23.1 and 17q21.31. These inversion genotypes were validated by comparing with literature results and by checking Mendelian consistency using the family data whenever available. Based on the PCA-approach, we also performed a preliminary genome-wide scan for inversions using the HapMap data, which resulted in 2040 candidate inversions, 169 of which overlapped with previously reported inversions. Our method can be readily applied to the abundant SNP data, and is expected to play an important role in developing human genome maps of inversions and exploring associations between inversions and susceptibility of diseases.
Cejnar, Pavel; Kuckova, Stepanka; Prochazka, Ales; Karamonova, Ludmila; Svobodova, Barbora
2018-06-15
Explorative statistical analysis of mass spectrometry data is still a time-consuming step. We analyzed critical factors for application of principal component analysis (PCA) in mass spectrometry and focused on two whole spectrum based normalization techniques and their application in the analysis of registered peak data and, in comparison, in full spectrum data analysis. We used this technique to identify different metabolic patterns in the bacterial culture of Cronobacter sakazakii, an important foodborne pathogen. Two software utilities, the ms-alone, a python-based utility for mass spectrometry data preprocessing and peak extraction, and the multiMS-toolbox, an R software tool for advanced peak registration and detailed explorative statistical analysis, were implemented. The bacterial culture of Cronobacter sakazakii was cultivated on Enterobacter sakazakii Isolation Agar, Blood Agar Base and Tryptone Soya Agar for 24 h and 48 h and applied by the smear method on an Autoflex speed MALDI-TOF mass spectrometer. For three tested cultivation media only two different metabolic patterns of Cronobacter sakazakii were identified using PCA applied on data normalized by two different normalization techniques. Results from matched peak data and subsequent detailed full spectrum analysis identified only two different metabolic patterns - a cultivation on Enterobacter sakazakii Isolation Agar showed significant differences to the cultivation on the other two tested media. The metabolic patterns for all tested cultivation media also proved the dependence on cultivation time. Both whole spectrum based normalization techniques together with the full spectrum PCA allow identification of important discriminative factors in experiments with several variable condition factors avoiding any problems with improper identification of peaks or emphasis on bellow threshold peak data. The amounts of processed data remain still manageable. Both implemented software utilities are available free of charge from http://uprt.vscht.cz/ms. Copyright © 2018 John Wiley & Sons, Ltd.
Prediction of pH of fresh chicken breast fillets by VNIR hyperspectral imaging
USDA-ARS?s Scientific Manuscript database
Visible and near-infrared (VNIR) hyperspectral imaging (400–900 nm) was used to evaluate pH of fresh chicken breast fillets (pectoralis major muscle) from the bone (dorsal) side of individual fillets. After the principal component analysis (PCA), a band threshold method was applied to the first prin...
Finger crease pattern recognition using Legendre moments and principal component analysis
NASA Astrophysics Data System (ADS)
Luo, Rongfang; Lin, Tusheng
2007-03-01
The finger joint lines defined as finger creases and its distribution can identify a person. In this paper, we propose a new finger crease pattern recognition method based on Legendre moments and principal component analysis (PCA). After obtaining the region of interest (ROI) for each finger image in the pre-processing stage, Legendre moments under Radon transform are applied to construct a moment feature matrix from the ROI, which greatly decreases the dimensionality of ROI and can represent principal components of the finger creases quite well. Then, an approach to finger crease pattern recognition is designed based on Karhunen-Loeve (K-L) transform. The method applies PCA to a moment feature matrix rather than the original image matrix to achieve the feature vector. The proposed method has been tested on a database of 824 images from 103 individuals using the nearest neighbor classifier. The accuracy up to 98.584% has been obtained when using 4 samples per class for training. The experimental results demonstrate that our proposed approach is feasible and effective in biometrics.
A novel principal component analysis for spatially misaligned multivariate air pollution data.
Jandarov, Roman A; Sheppard, Lianne A; Sampson, Paul D; Szpiro, Adam A
2017-01-01
We propose novel methods for predictive (sparse) PCA with spatially misaligned data. These methods identify principal component loading vectors that explain as much variability in the observed data as possible, while also ensuring the corresponding principal component scores can be predicted accurately by means of spatial statistics at locations where air pollution measurements are not available. This will make it possible to identify important mixtures of air pollutants and to quantify their health effects in cohort studies, where currently available methods cannot be used. We demonstrate the utility of predictive (sparse) PCA in simulated data and apply the approach to annual averages of particulate matter speciation data from national Environmental Protection Agency (EPA) regulatory monitors.
NASA Astrophysics Data System (ADS)
Lipovsky, B.; Funning, G. J.
2009-12-01
We compare several techniques for the analysis of geodetic time series with the ultimate aim to characterize the physical processes which are represented therein. We compare three methods for the analysis of these data: Principal Component Analysis (PCA), Non-Linear PCA (NLPCA), and Rotated PCA (RPCA). We evaluate each method by its ability to isolate signals which may be any combination of low amplitude (near noise level), temporally transient, unaccompanied by seismic emissions, and small scale with respect to the spatial domain. PCA is a powerful tool for extracting structure from large datasets which is traditionally realized through either the solution of an eigenvalue problem or through iterative methods. PCA is an transformation of the coordinate system of our data such that the new "principal" data axes retain maximal variance and minimal reconstruction error (Pearson, 1901; Hotelling, 1933). RPCA is achieved by an orthogonal transformation of the principal axes determined in PCA. In the analysis of meteorological data sets, RPCA has been seen to overcome domain shape dependencies, correct for sampling errors, and to determine principal axes which more closely represent physical processes (e.g., Richman, 1986). NLPCA generalizes PCA such that principal axes are replaced by principal curves (e.g., Hsieh 2004). We achieve NLPCA through an auto-associative feed-forward neural network (Scholz, 2005). We show the geophysical relevance of these techniques by application of each to a synthetic data set. Results are compared by inverting principal axes to determine deformation source parameters. Temporal variability in source parameters, estimated by each method, are also compared.
Principal Component Analysis for Normal-Distribution-Valued Symbolic Data.
Wang, Huiwen; Chen, Meiling; Shi, Xiaojun; Li, Nan
2016-02-01
This paper puts forward a new approach to principal component analysis (PCA) for normal-distribution-valued symbolic data, which has a vast potential of applications in the economic and management field. We derive a full set of numerical characteristics and variance-covariance structure for such data, which forms the foundation for our analytical PCA approach. Our approach is able to use all of the variance information in the original data than the prevailing representative-type approach in the literature which only uses centers, vertices, etc. The paper also provides an accurate approach to constructing the observations in a PC space based on the linear additivity property of normal distribution. The effectiveness of the proposed method is illustrated by simulated numerical experiments. At last, our method is applied to explain the puzzle of risk-return tradeoff in China's stock market.
Statistical analysis of aerosol species, trace gasses, and meteorology in Chicago.
Binaku, Katrina; O'Brien, Timothy; Schmeling, Martina; Fosco, Tinamarie
2013-09-01
Both canonical correlation analysis (CCA) and principal component analysis (PCA) were applied to atmospheric aerosol and trace gas concentrations and meteorological data collected in Chicago during the summer months of 2002, 2003, and 2004. Concentrations of ammonium, calcium, nitrate, sulfate, and oxalate particulate matter, as well as, meteorological parameters temperature, wind speed, wind direction, and humidity were subjected to CCA and PCA. Ozone and nitrogen oxide mixing ratios were also included in the data set. The purpose of statistical analysis was to determine the extent of existing linear relationship(s), or lack thereof, between meteorological parameters and pollutant concentrations in addition to reducing dimensionality of the original data to determine sources of pollutants. In CCA, the first three canonical variate pairs derived were statistically significant at the 0.05 level. Canonical correlation between the first canonical variate pair was 0.821, while correlations of the second and third canonical variate pairs were 0.562 and 0.461, respectively. The first canonical variate pair indicated that increasing temperatures resulted in high ozone mixing ratios, while the second canonical variate pair showed wind speed and humidity's influence on local ammonium concentrations. No new information was uncovered in the third variate pair. Canonical loadings were also interpreted for information regarding relationships between data sets. Four principal components (PCs), expressing 77.0 % of original data variance, were derived in PCA. Interpretation of PCs suggested significant production and/or transport of secondary aerosols in the region (PC1). Furthermore, photochemical production of ozone and wind speed's influence on pollutants were expressed (PC2) along with overall measure of local meteorology (PC3). In summary, CCA and PCA results combined were successful in uncovering linear relationships between meteorology and air pollutants in Chicago and aided in determining possible pollutant sources.
Levin-Schwartz, Yuri; Song, Yang; Schreier, Peter J.; Calhoun, Vince D.; Adalı, Tülay
2016-01-01
Due to their data-driven nature, multivariate methods such as canonical correlation analysis (CCA) have proven very useful for fusion of multimodal neurological data. However, being able to determine the degree of similarity between datasets and appropriate order selection are crucial to the success of such techniques. The standard methods for calculating the order of multimodal data focus only on sources with the greatest individual energy and ignore relations across datasets. Additionally, these techniques as well as the most widely-used methods for determining the degree of similarity between datasets assume sufficient sample support and are not effective in the sample-poor regime. In this paper, we propose to jointly estimate the degree of similarity between datasets and their order when few samples are present using principal component analysis and canonical correlation analysis (PCA-CCA). By considering these two problems simultaneously, we are able to minimize the assumptions placed on the data and achieve superior performance in the sample-poor regime compared to traditional techniques. We apply PCA-CCA to the pairwise combinations of functional magnetic resonance imaging (fMRI), structural magnetic resonance imaging (sMRI), and electroencephalogram (EEG) data drawn from patients with schizophrenia and healthy controls while performing an auditory oddball task. The PCA-CCA results indicate that the fMRI and sMRI datasets are the most similar, whereas the sMRI and EEG datasets share the least similarity. We also demonstrate that the degree of similarity obtained by PCA-CCA is highly predictive of the degree of significance found for components generated using CCA. PMID:27039696
Dihedral angle principal component analysis of molecular dynamics simulations.
Altis, Alexandros; Nguyen, Phuong H; Hegger, Rainer; Stock, Gerhard
2007-06-28
It has recently been suggested by Mu et al. [Proteins 58, 45 (2005)] to use backbone dihedral angles instead of Cartesian coordinates in a principal component analysis of molecular dynamics simulations. Dihedral angles may be advantageous because internal coordinates naturally provide a correct separation of internal and overall motion, which was found to be essential for the construction and interpretation of the free energy landscape of a biomolecule undergoing large structural rearrangements. To account for the circular statistics of angular variables, a transformation from the space of dihedral angles {phi(n)} to the metric coordinate space {x(n)=cos phi(n),y(n)=sin phi(n)} was employed. To study the validity and the applicability of the approach, in this work the theoretical foundations underlying the dihedral angle principal component analysis (dPCA) are discussed. It is shown that the dPCA amounts to a one-to-one representation of the original angle distribution and that its principal components can readily be characterized by the corresponding conformational changes of the peptide. Furthermore, a complex version of the dPCA is introduced, in which N angular variables naturally lead to N eigenvalues and eigenvectors. Applying the methodology to the construction of the free energy landscape of decaalanine from a 300 ns molecular dynamics simulation, a critical comparison of the various methods is given.
Dihedral angle principal component analysis of molecular dynamics simulations
NASA Astrophysics Data System (ADS)
Altis, Alexandros; Nguyen, Phuong H.; Hegger, Rainer; Stock, Gerhard
2007-06-01
It has recently been suggested by Mu et al. [Proteins 58, 45 (2005)] to use backbone dihedral angles instead of Cartesian coordinates in a principal component analysis of molecular dynamics simulations. Dihedral angles may be advantageous because internal coordinates naturally provide a correct separation of internal and overall motion, which was found to be essential for the construction and interpretation of the free energy landscape of a biomolecule undergoing large structural rearrangements. To account for the circular statistics of angular variables, a transformation from the space of dihedral angles {φn} to the metric coordinate space {xn=cosφn,yn=sinφn} was employed. To study the validity and the applicability of the approach, in this work the theoretical foundations underlying the dihedral angle principal component analysis (dPCA) are discussed. It is shown that the dPCA amounts to a one-to-one representation of the original angle distribution and that its principal components can readily be characterized by the corresponding conformational changes of the peptide. Furthermore, a complex version of the dPCA is introduced, in which N angular variables naturally lead to N eigenvalues and eigenvectors. Applying the methodology to the construction of the free energy landscape of decaalanine from a 300ns molecular dynamics simulation, a critical comparison of the various methods is given.
Metsalu, Tauno; Vilo, Jaak
2015-01-01
The Principal Component Analysis (PCA) is a widely used method of reducing the dimensionality of high-dimensional data, often followed by visualizing two of the components on the scatterplot. Although widely used, the method is lacking an easy-to-use web interface that scientists with little programming skills could use to make plots of their own data. The same applies to creating heatmaps: it is possible to add conditional formatting for Excel cells to show colored heatmaps, but for more advanced features such as clustering and experimental annotations, more sophisticated analysis tools have to be used. We present a web tool called ClustVis that aims to have an intuitive user interface. Users can upload data from a simple delimited text file that can be created in a spreadsheet program. It is possible to modify data processing methods and the final appearance of the PCA and heatmap plots by using drop-down menus, text boxes, sliders etc. Appropriate defaults are given to reduce the time needed by the user to specify input parameters. As an output, users can download PCA plot and heatmap in one of the preferred file formats. This web server is freely available at http://biit.cs.ut.ee/clustvis/. PMID:25969447
Kalegowda, Yogesh; Harmer, Sarah L
2013-01-08
Artificial neural network (ANN) and a hybrid principal component analysis-artificial neural network (PCA-ANN) classifiers have been successfully implemented for classification of static time-of-flight secondary ion mass spectrometry (ToF-SIMS) mass spectra collected from complex Cu-Fe sulphides (chalcopyrite, bornite, chalcocite and pyrite) at different flotation conditions. ANNs are very good pattern classifiers because of: their ability to learn and generalise patterns that are not linearly separable; their fault and noise tolerance capability; and high parallelism. In the first approach, fragments from the whole ToF-SIMS spectrum were used as input to the ANN, the model yielded high overall correct classification rates of 100% for feed samples, 88% for conditioned feed samples and 91% for Eh modified samples. In the second approach, the hybrid pattern classifier PCA-ANN was integrated. PCA is a very effective multivariate data analysis tool applied to enhance species features and reduce data dimensionality. Principal component (PC) scores which accounted for 95% of the raw spectral data variance, were used as input to the ANN, the model yielded high overall correct classification rates of 88% for conditioned feed samples and 95% for Eh modified samples. Copyright © 2012 Elsevier B.V. All rights reserved.
Carvajal, Roberto C; Arias, Luis E; Garces, Hugo O; Sbarbaro, Daniel G
2016-04-01
This work presents a non-parametric method based on a principal component analysis (PCA) and a parametric one based on artificial neural networks (ANN) to remove continuous baseline features from spectra. The non-parametric method estimates the baseline based on a set of sampled basis vectors obtained from PCA applied over a previously composed continuous spectra learning matrix. The parametric method, however, uses an ANN to filter out the baseline. Previous studies have demonstrated that this method is one of the most effective for baseline removal. The evaluation of both methods was carried out by using a synthetic database designed for benchmarking baseline removal algorithms, containing 100 synthetic composed spectra at different signal-to-baseline ratio (SBR), signal-to-noise ratio (SNR), and baseline slopes. In addition to deomonstrating the utility of the proposed methods and to compare them in a real application, a spectral data set measured from a flame radiation process was used. Several performance metrics such as correlation coefficient, chi-square value, and goodness-of-fit coefficient were calculated to quantify and compare both algorithms. Results demonstrate that the PCA-based method outperforms the one based on ANN both in terms of performance and simplicity. © The Author(s) 2016.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dhou, S; Cai, W; Hurwitz, M
Purpose: The goal of this study is to quantify the interfraction reproducibility of patient-specific motion models derived from 4DCBCT acquired on the day of treatment of lung cancer stereotactic body radiotherapy (SBRT) patients. Methods: Motion models are derived from patient 4DCBCT images acquired daily over 3–5 fractions of treatment by 1) applying deformable image registration between each 4DCBCT image and a reference phase from that day, resulting in a set of displacement vector fields (DVFs), and 2) performing principal component analysis (PCA) on the DVFs to derive a motion model. The motion model from the first day of treatment ismore » compared to motion models from each successive day of treatment to quantify variability in motion models generated from different days. Four SBRT patient datasets have been acquired thus far in this IRB approved study. Results: Fraction-specific motion models for each fraction and patient were derived and PCA eigenvectors and their associated eigenvalues are compared for each fraction. For the first patient dataset, the average root mean square error between the first two eigenvectors associated with the highest two eigenvalues, in four fractions was 0.1, while it was 0.25 between the last three PCA eigenvectors associated with the lowest three eigenvalues. It was found that the eigenvectors and eigenvalues of PCA motion models for each treatment fraction have variations and the first few eigenvectors are shown to be more stable across treatment fractions than others. Conclusion: Analysis of this dataset showed that the first two eigenvectors of the PCA patient-specific motion models derived from 4DCBCT were stable over the course of several treatment fractions. The third, fourth, and fifth eigenvectors had larger variations.« less
Improved medical image fusion based on cascaded PCA and shift invariant wavelet transforms.
Reena Benjamin, J; Jayasree, T
2018-02-01
In the medical field, radiologists need more informative and high-quality medical images to diagnose diseases. Image fusion plays a vital role in the field of biomedical image analysis. It aims to integrate the complementary information from multimodal images, producing a new composite image which is expected to be more informative for visual perception than any of the individual input images. The main objective of this paper is to improve the information, to preserve the edges and to enhance the quality of the fused image using cascaded principal component analysis (PCA) and shift invariant wavelet transforms. A novel image fusion technique based on cascaded PCA and shift invariant wavelet transforms is proposed in this paper. PCA in spatial domain extracts relevant information from the large dataset based on eigenvalue decomposition, and the wavelet transform operating in the complex domain with shift invariant properties brings out more directional and phase details of the image. The significance of maximum fusion rule applied in dual-tree complex wavelet transform domain enhances the average information and morphological details. The input images of the human brain of two different modalities (MRI and CT) are collected from whole brain atlas data distributed by Harvard University. Both MRI and CT images are fused using cascaded PCA and shift invariant wavelet transform method. The proposed method is evaluated based on three main key factors, namely structure preservation, edge preservation, contrast preservation. The experimental results and comparison with other existing fusion methods show the superior performance of the proposed image fusion framework in terms of visual and quantitative evaluations. In this paper, a complex wavelet-based image fusion has been discussed. The experimental results demonstrate that the proposed method enhances the directional features as well as fine edge details. Also, it reduces the redundant details, artifacts, distortions.
Rodón, N; Trías, I; Verdú, M; Román, R; Domínguez, A; Calvo, M; Banus, J M; Ballesta, A M; Maestro, M L; Puig, X
2014-04-01
Analyze the impact of the introduction of the study of PCA3 gene in post-prostatic massage urine in the clinical management of patients with PSA altered, evaluating its diagnostic ability and predictive value of tumor aggressiveness. Observational, prospective, multicenter study of patients with suspected prostate cancer (PC) candidates for biopsy. We present a series of 670 consecutive samples of urine collected post-prostatic massage for three years in which we determined the "PCA3 score" (s-PCA3). Biopsy was only indicated in cases with s-positive PCA3. The s-PCA3 was positive in 43.7% of samples. In the 124 biopsies performed, the incidence of PC or atypical small acinar proliferation was 54%, reaching 68,6% in s-PCA3≥100. Statistically significant relationship between the s-PCA3 and tumor grade was demonstrated. In cases with s-PCA3 between 35 and 50 only 23% of PC were high grade (Gleason≥7), compared to 76.7% in cases with s-PCA3 over 50. There was a statistically significant correlation between s-PCA3 and cylinders affected. Both relationships were confirmed by applying a log-linear model. The incorporation of PCA3 can avoid the need for biopsies in 54% of patients. s-PCA3 positivity increases the likelihood of a positive biopsy, especially in higher s-PCA3 100 (68.6%). s-PCA3 is also an indicator of tumor aggressiveness and provides essential information in making treatment decisions. Copyright © 2013 AEU. Published by Elsevier Espana. All rights reserved.
Does adding ketamine to morphine patient-controlled analgesia safely improve post-thoracotomy pain?
Mathews, Timothy J; Churchhouse, Antonia M D; Housden, Tessa; Dunning, Joel
2012-02-01
A best evidence topic in thoracic surgery was written according to a structured protocol. The question addressed was 'is the addition of ketamine to morphine patient-controlled analgesia (PCA) following thoracic surgery superior to morphine alone'. Altogether 201 papers were found using the reported search, of which nine represented the best evidence to answer the clinical question. The authors, journal, date and country of publication, patient group studied, study type, relevant outcomes and results of these papers are tabulated. This consisted of one systematic review of PCA morphine with ketamine (PCA-MK) trials, one meta-analysis of PCA-MK trials, four randomized controlled trials of PCA-MK, one meta-analysis of trials using a variety of peri-operative ketamine regimes and two cohort studies of PCA-MK. Main outcomes measured included pain score rated on visual analogue scale, morphine consumption and incidence of psychotomimetic side effects/hallucination. Two papers reported the measurements of respiratory function. This evidence shows that adding ketamine to morphine PCA is safe, with a reported incidence of hallucination requiring intervention of 2.9%, and a meta-analysis finding an incidence of all central nervous system side effects of 18% compared with 15% with morphine alone, P = 0.31, RR 1.27 with 95% CI (0.8-2.01). All randomized controlled trials of its use following thoracic surgery found no hallucination or psychological side effect. All five studies in thoracic surgery (n = 243) found reduced morphine requirements with PCA-MK. Pain scores were significantly lower in PCA-MK patients in thoracic surgery papers, with one paper additionally reporting increased patient satisfaction. However, no significant improvement was found in a meta-analysis of five papers studying PCA-MK in a variety of surgical settings. Both papers reporting respiratory outcomes found improved oxygen saturations and PaCO(2) levels in PCA-MK patients following thoracic surgery. We conclude that adding low-dose ketamine to morphine PCA is safe and post-thoracotomy may provide better pain control than PCA with morphine alone (PCA-MO), with reduced morphine consumption and possible improvement in respiratory function. These studies thus support the routine use of PCA-MK instead of PCA-MO to improve post-thoracotomy pain control.
Soy Consumption and the Risk of Prostate Cancer: An Updated Systematic Review and Meta-Analysis
Ranard, Katherine M.; Jeon, Sookyoung; Erdman, John W.
2018-01-01
Prostate cancer (PCa) is the second most commonly diagnosed cancer in men, accounting for 15% of all cancers in men worldwide. Asian populations consume soy foods as part of a regular diet, which may contribute to the lower PCa incidence observed in these countries. This meta-analysis provides a comprehensive updated analysis that builds on previously published meta-analyses, demonstrating that soy foods and their isoflavones (genistein and daidzein) are associated with a lower risk of prostate carcinogenesis. Thirty articles were included for analysis of the potential impacts of soy food intake, isoflavone intake, and circulating isoflavone levels, on both primary and advanced PCa. Total soy food (p < 0.001), genistein (p = 0.008), daidzein (p = 0.018), and unfermented soy food (p < 0.001) intakes were significantly associated with a reduced risk of PCa. Fermented soy food intake, total isoflavone intake, and circulating isoflavones were not associated with PCa risk. Neither soy food intake nor circulating isoflavones were associated with advanced PCa risk, although very few studies currently exist to examine potential associations. Combined, this evidence from observational studies shows a statistically significant association between soy consumption and decreased PCa risk. Further studies are required to support soy consumption as a prophylactic dietary approach to reduce PCa carcinogenesis. PMID:29300347
PCA-HOG symmetrical feature based diseased cell detection
NASA Astrophysics Data System (ADS)
Wan, Min-jie
2016-04-01
A histogram of oriented gradient (HOG) feature is applied to the field of diseased cell detection, which can detect diseased cells in high resolution tissue images rapidly, accurately and efficiently. Firstly, motivated by symmetrical cellular forms, a new HOG symmetrical feature based on the traditional HOG feature is proposed to meet the condition of cell detection. Secondly, considering the high feature dimension of traditional HOG feature leads to plenty of memory resources and long runtime in practical applications, a classical dimension reduction method called principal component analysis (PCA) is used to reduce the dimension of high-dimensional HOG descriptor. Because of that, computational speed is increased greatly, and the accuracy of detection can be controlled in a proper range at the same time. Thirdly, support vector machine (SVM) classifier is trained with PCA-HOG symmetrical features proposed above. At last, practical tissue images is detected and analyzed by SVM classifier. In order to verify the effectiveness of this new algorithm, it is practically applied to conduct diseased cell detection which takes 200 pieces of H&E (hematoxylin & eosin) high resolution staining histopathological images collected from 20 breast cancer patients as a sample. The experiment shows that the average processing rate can be 25 frames per second and the detection accuracy can be 92.1%.
Seibert, Tyler M; Fan, Chun Chieh; Wang, Yunpeng; Zuber, Verena; Karunamuni, Roshan; Parsons, J Kellogg; Eeles, Rosalind A; Easton, Douglas F; Kote-Jarai, ZSofia; Al Olama, Ali Amin; Garcia, Sara Benlloch; Muir, Kenneth; Grönberg, Henrik; Wiklund, Fredrik; Aly, Markus; Schleutker, Johanna; Sipeky, Csilla; Tammela, Teuvo Lj; Nordestgaard, Børge G; Nielsen, Sune F; Weischer, Maren; Bisbjerg, Rasmus; Røder, M Andreas; Iversen, Peter; Key, Tim J; Travis, Ruth C; Neal, David E; Donovan, Jenny L; Hamdy, Freddie C; Pharoah, Paul; Pashayan, Nora; Khaw, Kay-Tee; Maier, Christiane; Vogel, Walther; Luedeke, Manuel; Herkommer, Kathleen; Kibel, Adam S; Cybulski, Cezary; Wokolorczyk, Dominika; Kluzniak, Wojciech; Cannon-Albright, Lisa; Brenner, Hermann; Cuk, Katarina; Saum, Kai-Uwe; Park, Jong Y; Sellers, Thomas A; Slavov, Chavdar; Kaneva, Radka; Mitev, Vanio; Batra, Jyotsna; Clements, Judith A; Spurdle, Amanda; Teixeira, Manuel R; Paulo, Paula; Maia, Sofia; Pandha, Hardev; Michael, Agnieszka; Kierzek, Andrzej; Karow, David S; Mills, Ian G; Andreassen, Ole A; Dale, Anders M
2018-01-10
To develop and validate a genetic tool to predict age of onset of aggressive prostate cancer (PCa) and to guide decisions of who to screen and at what age. Analysis of genotype, PCa status, and age to select single nucleotide polymorphisms (SNPs) associated with diagnosis. These polymorphisms were incorporated into a survival analysis to estimate their effects on age at diagnosis of aggressive PCa (that is, not eligible for surveillance according to National Comprehensive Cancer Network guidelines; any of Gleason score ≥7, stage T3-T4, PSA (prostate specific antigen) concentration ≥10 ng/L, nodal metastasis, distant metastasis). The resulting polygenic hazard score is an assessment of individual genetic risk. The final model was applied to an independent dataset containing genotype and PSA screening data. The hazard score was calculated for these men to test prediction of survival free from PCa. Multiple institutions that were members of international PRACTICAL consortium. All consortium participants of European ancestry with known age, PCa status, and quality assured custom (iCOGS) array genotype data. The development dataset comprised 31 747 men; the validation dataset comprised 6411 men. Prediction with hazard score of age of onset of aggressive cancer in validation set. In the independent validation set, the hazard score calculated from 54 single nucleotide polymorphisms was a highly significant predictor of age at diagnosis of aggressive cancer (z=11.2, P<10 -16 ). When men in the validation set with high scores (>98th centile) were compared with those with average scores (30th-70th centile), the hazard ratio for aggressive cancer was 2.9 (95% confidence interval 2.4 to 3.4). Inclusion of family history in a combined model did not improve prediction of onset of aggressive PCa (P=0.59), and polygenic hazard score performance remained high when family history was accounted for. Additionally, the positive predictive value of PSA screening for aggressive PCa was increased with increasing polygenic hazard score. Polygenic hazard scores can be used for personalised genetic risk estimates that can predict for age at onset of aggressive PCa. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Yücel, Yasin; Sultanoğlu, Pınar
2013-09-01
Chemical characterisation has been carried out on 45 honey samples collected from Hatay region of Turkey. The concentrations of 17 elements were determined by inductively coupled plasma optical emission spectrometry (ICP-OES). Ca, K, Mg and Na were the most abundant elements, with mean contents of 219.38, 446.93, 49.06 and 95.91 mg kg(-1) respectively. The trace element mean contents ranged between 0.03 and 15.07 mg kg(-1). Chemometric methods such as principal component analysis (PCA) and cluster analysis (CA) techniques were applied to classify honey according to mineral content. The first most important principal component (PC) was strongly associated with the value of Al, B, Cd and Co. CA showed eight clusters corresponding to the eight botanical origins of honey. PCA explained 75.69% of the variance with the first six PC variables. Chemometric analysis of the analytical data allowed the accurate classification of the honey samples according to origin. Copyright © 2013 Elsevier Ltd. All rights reserved.
Corvucci, Francesca; Nobili, Lara; Melucci, Dora; Grillenzoni, Francesca-Vittoria
2015-02-15
Honey traceability to food quality is required by consumers and food control institutions. Melissopalynologists traditionally use percentages of nectariferous pollens to discriminate the botanical origin and the entire pollen spectrum (presence/absence, type and quantities and association of some pollen types) to determinate the geographical origin of honeys. To improve melissopalynological routine analysis, principal components analysis (PCA) was used. A remarkable and innovative result was that the most significant pollens for the traditional discrimination of the botanical and geographical origin of honeys were the same as those individuated with the chemometric model. The reliability of assignments of samples to honey classes was estimated through explained variance (85%). This confirms that the chemometric model properly describes the melissopalynological data. With the aim to improve honey discrimination, FT-microRaman spectrography and multivariate analysis were also applied. Well performing PCA models and good agreement with known classes were achieved. Encouraging results were obtained for botanical discrimination. Copyright © 2014 Elsevier Ltd. All rights reserved.
Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival.
Kaplan, Adam; Lock, Eric F
2017-01-01
Predictive modeling from high-dimensional genomic data is often preceded by a dimension reduction step, such as principal component analysis (PCA). However, the application of PCA is not straightforward for multisource data, wherein multiple sources of 'omics data measure different but related biological components. In this article, we use recent advances in the dimension reduction of multisource data for predictive modeling. In particular, we apply exploratory results from Joint and Individual Variation Explained (JIVE), an extension of PCA for multisource data, for prediction of differing response types. We conduct illustrative simulations to illustrate the practical advantages and interpretability of our approach. As an application example, we consider predicting survival for patients with glioblastoma multiforme from 3 data sources measuring messenger RNA expression, microRNA expression, and DNA methylation. We also introduce a method to estimate JIVE scores for new samples that were not used in the initial dimension reduction and study its theoretical properties; this method is implemented in the R package R.JIVE on CRAN, in the function jive.predict.
Principal component analysis on a torus: Theory and application to protein dynamics.
Sittel, Florian; Filk, Thomas; Stock, Gerhard
2017-12-28
A dimensionality reduction method for high-dimensional circular data is developed, which is based on a principal component analysis (PCA) of data points on a torus. Adopting a geometrical view of PCA, various distance measures on a torus are introduced and the associated problem of projecting data onto the principal subspaces is discussed. The main idea is that the (periodicity-induced) projection error can be minimized by transforming the data such that the maximal gap of the sampling is shifted to the periodic boundary. In a second step, the covariance matrix and its eigendecomposition can be computed in a standard manner. Adopting molecular dynamics simulations of two well-established biomolecular systems (Aib 9 and villin headpiece), the potential of the method to analyze the dynamics of backbone dihedral angles is demonstrated. The new approach allows for a robust and well-defined construction of metastable states and provides low-dimensional reaction coordinates that accurately describe the free energy landscape. Moreover, it offers a direct interpretation of covariances and principal components in terms of the angular variables. Apart from its application to PCA, the method of maximal gap shifting is general and can be applied to any other dimensionality reduction method for circular data.
Diurnal global variability of the Earth's magnetic field during geomagnetically quiet conditions
NASA Astrophysics Data System (ADS)
Klausner, V.
2012-12-01
This work proposes a methodology (or treatment) to establish a representative signal of the global magnetic diurnal variation. It is based on a spatial distribution in both longitude and latitude of a set of magnetic stations as well as their magnetic behavior on a time basis. We apply the Principal Component Analysis (PCA) technique using gapped wavelet transform and wavelet correlation. This new approach was used to describe the characteristics of the magnetic variations at Vassouras (Brazil) and 12 other magnetic stations spread around the terrestrial globe. Using magnetograms from 2007, we have investigated the global dominant pattern of the Sq variation as a function of low solar activity. This year was divided into two seasons for seasonal variation analysis: solstices (June and December) and equinoxes (March and September). We aim to reconstruct the original geomagnetic data series of the H component taking into account only the diurnal variations with periods of 24 hours on geomagnetically quiet days. We advance a proposal to reconstruct the Sq baseline using only the PCA first mode. The first interpretation of the results suggests that PCA/wavelet method could be used to the reconstruction of the Sq baseline.
Principal component analysis on a torus: Theory and application to protein dynamics
NASA Astrophysics Data System (ADS)
Sittel, Florian; Filk, Thomas; Stock, Gerhard
2017-12-01
A dimensionality reduction method for high-dimensional circular data is developed, which is based on a principal component analysis (PCA) of data points on a torus. Adopting a geometrical view of PCA, various distance measures on a torus are introduced and the associated problem of projecting data onto the principal subspaces is discussed. The main idea is that the (periodicity-induced) projection error can be minimized by transforming the data such that the maximal gap of the sampling is shifted to the periodic boundary. In a second step, the covariance matrix and its eigendecomposition can be computed in a standard manner. Adopting molecular dynamics simulations of two well-established biomolecular systems (Aib9 and villin headpiece), the potential of the method to analyze the dynamics of backbone dihedral angles is demonstrated. The new approach allows for a robust and well-defined construction of metastable states and provides low-dimensional reaction coordinates that accurately describe the free energy landscape. Moreover, it offers a direct interpretation of covariances and principal components in terms of the angular variables. Apart from its application to PCA, the method of maximal gap shifting is general and can be applied to any other dimensionality reduction method for circular data.
Wei, Zhenbo; Wang, Jun; Ye, Linshuang
2011-08-15
A voltammetric electronic tongue (VE-tongue) was developed to discriminate the difference between Chinese rice wines in this research. Three types of Chinese rice wine with different marked ages (1, 3, and 5 years) were classified by the VE-tongue by principal component analysis (PCA) and cluster analysis (CA). The VE-tongue consisted of six working electrodes (gold, silver, platinum, palladium, tungsten, and titanium) in a standard three-electrode configuration. The multi-frequency large amplitude pulse voltammetry (MLAPV), which consisted of four segments of 1 Hz, 10 Hz, 100 Hz, and 1000 Hz, was applied as the potential waveform. The three types of Chinese rice wine could be classified accurately by PCA and CA, and some interesting regularity is shown in the score plots with the help of PCA. Two regression models, partial least squares (PLS) and back-error propagation-artificial neural network (BP-ANN), were used for wine age prediction. The regression results showed that the marked ages of the three types of Chinese rice wine were successfully predicted using PLS and BP-ANN. Copyright © 2011 Elsevier B.V. All rights reserved.
PCA feature extraction for change detection in multidimensional unlabeled data.
Kuncheva, Ludmila I; Faithfull, William J
2014-01-01
When classifiers are deployed in real-world applications, it is assumed that the distribution of the incoming data matches the distribution of the data used to train the classifier. This assumption is often incorrect, which necessitates some form of change detection or adaptive classification. While there has been a lot of work on change detection based on the classification error monitored over the course of the operation of the classifier, finding changes in multidimensional unlabeled data is still a challenge. Here, we propose to apply principal component analysis (PCA) for feature extraction prior to the change detection. Supported by a theoretical example, we argue that the components with the lowest variance should be retained as the extracted features because they are more likely to be affected by a change. We chose a recently proposed semiparametric log-likelihood change detection criterion that is sensitive to changes in both mean and variance of the multidimensional distribution. An experiment with 35 datasets and an illustration with a simple video segmentation demonstrate the advantage of using extracted features compared to raw data. Further analysis shows that feature extraction through PCA is beneficial, specifically for data with multiple balanced classes.
Perdonà, Sisto; Bruzzese, Dario; Ferro, Matteo; Autorino, Riccardo; Marino, Ada; Mazzarella, Claudia; Perruolo, Giuseppe; Longo, Michele; Spinelli, Rosa; Di Lorenzo, Giuseppe; Oliva, Andrea; De Sio, Marco; Damiano, Rocco; Altieri, Vincenzo; Terracciano, Daniela
2013-02-15
Prostate health index (phi) and prostate cancer antigen 3 (PCA3) have been recently proposed as novel biomarkers for prostate cancer (PCa). We assessed the diagnostic performance of these biomarkers, alone or in combination, in men undergoing first prostate biopsy for suspicion of PCa. One hundred sixty male subjects were enrolled in this prospective observational study. PSA molecular forms, phi index (Beckman coulter immunoassay), PCA3 score (Progensa PCA3 assay), and other established biomarkers (tPSA, fPSA, and %fPSA) were assessed before patients underwent a 18-core first prostate biopsy. The discriminating ability between PCa-negative and PCa-positive biopsies of Beckman coulter phi and PCA3 score and other used biomarkers were determined. One hundred sixty patients met inclusion criteria. %p2PSA (p2PSA/fPSA × 100), phi and PCA3 were significantly higher in patients with PCa compared to PCa-negative group (median values: 1.92 vs. 1.55, 49.97 vs. 36.84, and 50 vs. 32, respectively, P ≤ 0.001). ROC curve analysis showed that %p2PSA, phi, and PCA3 are good indicator of malignancy (AUCs = 0.68, 0.71, and 0.66, respectively). A multivariable logistic regression model consisting of both the phi index and PCA3 score allowed to reach an overall diagnostic accuracy of 0.77. Decision curve analysis revealed that this "combined" marker achieved the highest net benefit over the examined range of the threshold probability. phi and PCA3 showed no significant difference in the ability to predict PCa diagnosis in men undergoing first prostate biopsy. However, diagnostic performance is significantly improved by combining phi and PCA3. Copyright © 2012 Wiley Periodicals, Inc.
LeTourneau, Melissa K; Marshall, Matthew J; Cliff, John B; Bonsall, Robert F; Dohnalkova, Alice C; Mavrodi, Dmitri V; Devi, S Indira; Mavrodi, Olga V; Harsh, James B; Weller, David M; Thomashow, Linda S
2018-04-24
Phenazine-1-carboxylic acid (PCA) is produced by rhizobacteria in dryland but not in irrigated wheat fields of the Pacific Northwest, USA. PCA promotes biofilm development in bacterial cultures and bacterial colonization of wheat rhizospheres. However, its impact upon biofilm development has not been demonstrated in the rhizosphere, where biofilms influence terrestrial carbon and nitrogen cycles with ramifications for crop and soil health. Furthermore, the relationships between soil moisture and the rates of PCA biosynthesis and degradation have not been established. In this study, expression of PCA biosynthesis genes was up-regulated relative to background transcription, and persistence of PCA was slightly decreased in dryland relative to irrigated wheat rhizospheres. Biofilms in dryland rhizospheres inoculated with the PCA-producing (PCA + ) strain Pseudomonas synxantha 2-79RN 10 were more robust than those in rhizospheres inoculated with an isogenic PCA-deficient (PCA - ) mutant strain. This trend was reversed in irrigated rhizospheres. In dryland PCA + rhizospheres, the turnover of 15 N-labelled rhizobacterial biomass was slower than in the PCA - and irrigated PCA + treatments, and incorporation of bacterial 15 N into root cell walls was observed in multiple treatments. These results indicate that PCA promotes biofilm development in dryland rhizospheres, and likely influences crop nutrition and soil health in dryland wheat fields. This article is protected by copyright. All rights reserved. © 2018 Society for Applied Microbiology and John Wiley & Sons Ltd.
NASA Astrophysics Data System (ADS)
Gualandi, Adriano; Serpelloni, Enrico; Elina Belardinelli, Maria; Bonafede, Maurizio; Pezzo, Giuseppe; Tolomei, Cristiano
2015-04-01
A critical point in the analysis of ground displacement time series, as those measured by modern space geodetic techniques (primarly continuous GPS/GNSS and InSAR) is the development of data driven methods that allow to discern and characterize the different sources that generate the observed displacements. A widely used multivariate statistical technique is the Principal Component Analysis (PCA), which allows to reduce the dimensionality of the data space maintaining most of the variance of the dataset explained. It reproduces the original data using a limited number of Principal Components, but it also shows some deficiencies, since PCA does not perform well in finding the solution to the so-called Blind Source Separation (BSS) problem. The recovering and separation of the different sources that generate the observed ground deformation is a fundamental task in order to provide a physical meaning to the possible different sources. PCA fails in the BSS problem since it looks for a new Euclidean space where the projected data are uncorrelated. Usually, the uncorrelation condition is not strong enough and it has been proven that the BSS problem can be tackled imposing on the components to be independent. The Independent Component Analysis (ICA) is, in fact, another popular technique adopted to approach this problem, and it can be used in all those fields where PCA is also applied. An ICA approach enables us to explain the displacement time series imposing a fewer number of constraints on the model, and to reveal anomalies in the data such as transient deformation signals. However, the independence condition is not easy to impose, and it is often necessary to introduce some approximations. To work around this problem, we use a variational bayesian ICA (vbICA) method, which models the probability density function (pdf) of each source signal using a mix of Gaussian distributions. This technique allows for more flexibility in the description of the pdf of the sources, giving a more reliable estimate of them. Here we introduce the vbICA technique and present its application on synthetic data that simulate a GPS network recording ground deformation in a tectonically active region, with synthetic time-series containing interseismic, coseismic, and postseismic deformation, plus seasonal deformation, and white and coloured noise. We study the ability of the algorithm to recover the original (known) sources of deformation, and then apply it to a real scenario: the Emilia seismic sequence (2012, northern Italy), which is an example of seismic sequence occurred in a slowly converging tectonic setting, characterized by several local to regional anthropogenic or natural sources of deformation, mainly subsidence due to fluid withdrawal and sediments compaction. We apply both PCA and vbICA to displacement time-series recorded by continuous GPS and InSAR (Pezzo et al., EGU2015-8950).
E-selectin ligand-1 controls circulating prostate cancer cell rolling/adhesion and metastasis
Yasmin-Karim, Sayeda; King, Michael R.; Messing, Edward M.; Lee, Yi-Fen
2014-01-01
Circulating prostate cancer (PCa) cells preferentially roll and adhere on bone marrow vascular endothelial cells, where abundant E-selectin and stromal cell-derived factor 1 (SDF-1) are expressed, subsequently initiating a cascade of activation events that eventually lead to the development of metastases. To elucidate the roles of circulating PCa cells' rolling and adhesion behaviors in cancer metastases, we applied a dynamic cylindrical flow-based microchannel device that is coated with E-selectin and SDF-1, mimicking capillary endothelium. Using this device we captured a small fraction of rolling PCa cells. These rolling cells display higher static adhesion ability, more aggressive cancer phenotypes and stem-like properties. Importantly, mice received rolling PCa cells, but not floating PCa cells, developed cancer metastases. Genes coding for E-selectin ligands and genes associated with cancer stem cells and metastasis were elevated in rolling PCa cells. Knock down of E-selectin ligand 1(ESL-1), significantly impaired PCa cells' rolling capacity and reduced cancer aggressiveness. Moreover, ESL-1 activates RAS and MAP kinase signal cascade, consequently inducing the downstream targets. In summary, circulating PCa cells' rolling capacity contributes to PCa metastasis, and that is in part controlled by ESL-1. PMID:25301730
Integration of multispectral satellite and hyperspectral field data for aquatic macrophyte studies
NASA Astrophysics Data System (ADS)
John, C. M.; Kavya, N.
2014-11-01
Aquatic macrophytes (AM) can serve as useful indicators of water pollution along the littoral zones. The spectral signatures of various AM were investigated to determine whether species could be discriminated by remote sensing. In this study the spectral readings of different AM communities identified were done using the ASD Fieldspec® Hand Held spectro-radiometer in the wavelength range of 325-1075 nm. The collected specific reflectance spectra were applied to space borne multi-spectral remote sensing data from Worldview-2, acquired on 26th March 2011. The dimensionality reduction of the spectro-radiometric data was done using the technique principal components analysis (PCA). Out of the different PCA axes generated, 93.472 % variance of the spectra was explained by the first axis. The spectral derivative analysis was done to identify the wavelength where the greatest difference in reflectance is shown. The identified wavelengths are 510, 690, 720, 756, 806, 885, 907 and 923 nm. The output of PCA and derivative analysis were applied to Worldview-2 satellite data for spectral subsetting. The unsupervised classification was used to effectively classify the AM species using the different spectral subsets. The accuracy assessment of the results of the unsupervised classification and their comparison were done. The overall accuracy of the result of unsupervised classification using the band combinations Red-Edge, Green, Coastal blue & Red-edge, Yellow, Blue is 100%. The band combinations NIR-1, Green, Coastal blue & NIR-1, Yellow, Blue yielded an accuracy of 82.35 %. The existing vegetation indices and new hyper-spectral indices for the different type of AM communities were computed. Overall, results of this study suggest that high spectral and spatial resolution images provide useful information for natural resource managers especially with regard to the location identification and distribution mapping of macrophyte species and their communities.
Visualizing Hyolaryngeal Mechanics in Swallowing Using Dynamic MRI
Pearson, William G.; Zumwalt, Ann C.
2013-01-01
Introduction Coordinates of anatomical landmarks are captured using dynamic MRI to explore whether a proposed two-sling mechanism underlies hyolaryngeal elevation in pharyngeal swallowing. A principal components analysis (PCA) is applied to coordinates to determine the covariant function of the proposed mechanism. Methods Dynamic MRI (dMRI) data were acquired from eleven healthy subjects during a repeated swallows task. Coordinates mapping the proposed mechanism are collected from each dynamic (frame) of a dynamic MRI swallowing series of a randomly selected subject in order to demonstrate shape changes in a single subject. Coordinates representing minimum and maximum hyolaryngeal elevation of all 11 subjects were also mapped to demonstrate shape changes of the system among all subjects. MophoJ software was used to perform PCA and determine vectors of shape change (eigenvectors) for elements of the two-sling mechanism of hyolaryngeal elevation. Results For both single subject and group PCAs, hyolaryngeal elevation accounted for the first principal component of variation. For the single subject PCA, the first principal component accounted for 81.5% of the variance. For the between subjects PCA, the first principal component accounted for 58.5% of the variance. Eigenvectors and shape changes associated with this first principal component are reported. Discussion Eigenvectors indicate that two-muscle slings and associated skeletal elements function as components of a covariant mechanism to elevate the hyolaryngeal complex. Morphological analysis is useful to model shape changes in the two-sling mechanism of hyolaryngeal elevation. PMID:25090608
Hagiwara, Kazuhisa; Tobisawa, Yuki; Kaya, Takatoshi; Kaneko, Tomonori; Hatakeyama, Shingo; Mori, Kazuyuki; Hashimoto, Yasuhiro; Koie, Takuya; Suda, Yoshihiko; Ohyama, Chikara; Yoneyama, Tohru
2017-01-01
Wisteria floribunda agglutinin (WFA) preferably binds to LacdiNAc glycans, and its reactivity is associated with tumor progression. The aim of this study to examine whether the serum LacdiNAc carrying prostate-specific antigen–glycosylation isomer (PSA-Gi) and WFA-reactivity of tumor tissue can be applied as a diagnostic and prognostic marker of prostate cancer (PCa). Between 2007 and 2016, serum PSA-Gi levels before prostate biopsy (Pbx) were measured in 184 biopsy-proven benign prostatic hyperplasia patients and 244 PCa patients using an automated lectin-antibody immunoassay. WFA-reactivity on tumor was analyzed in 260 radical prostatectomy (RP) patients. Diagnostic and prognostic performance of serum PSA-Gi was evaluated using area under the receiver-operator characteristic curve (AUC). Prognostic performance of WFA-reactivity on tumor was evaluated via Cox proportional hazards regression analysis and nomogram. The AUC of serum PSA-Gi detecting PCa and predicting Pbx Grade Group (GG) 3 and GG ≥ 3 after RP was much higher than those of conventional PSA. Multivariate analysis showed that WFA-reactivity on prostate tumor was an independent risk factor of PSA recurrence. The nomogram was a strong model for predicting PSA-free survival provability with a c-index ≥0.7. Serum PSA-Gi levels and WFA-reactivity on prostate tumor may be a novel diagnostic and pre- and post-operative prognostic biomarkers of PCa, respectively. PMID:28134773
Hagiwara, Kazuhisa; Tobisawa, Yuki; Kaya, Takatoshi; Kaneko, Tomonori; Hatakeyama, Shingo; Mori, Kazuyuki; Hashimoto, Yasuhiro; Koie, Takuya; Suda, Yoshihiko; Ohyama, Chikara; Yoneyama, Tohru
2017-01-26
Wisteria floribunda agglutinin (WFA) preferably binds to LacdiNAc glycans, and its reactivity is associated with tumor progression. The aim of this study to examine whether the serum LacdiNAc carrying prostate-specific antigen-glycosylation isomer (PSA-Gi) and WFA-reactivity of tumor tissue can be applied as a diagnostic and prognostic marker of prostate cancer (PCa). Between 2007 and 2016, serum PSA-Gi levels before prostate biopsy (Pbx) were measured in 184 biopsy-proven benign prostatic hyperplasia patients and 244 PCa patients using an automated lectin-antibody immunoassay. WFA-reactivity on tumor was analyzed in 260 radical prostatectomy (RP) patients. Diagnostic and prognostic performance of serum PSA-Gi was evaluated using area under the receiver-operator characteristic curve (AUC). Prognostic performance of WFA-reactivity on tumor was evaluated via Cox proportional hazards regression analysis and nomogram. The AUC of serum PSA-Gi detecting PCa and predicting Pbx Grade Group (GG) 3 and GG ≥ 3 after RP was much higher than those of conventional PSA. Multivariate analysis showed that WFA-reactivity on prostate tumor was an independent risk factor of PSA recurrence. The nomogram was a strong model for predicting PSA-free survival provability with a c -index ≥0.7. Serum PSA-Gi levels and WFA-reactivity on prostate tumor may be a novel diagnostic and pre- and post-operative prognostic biomarkers of PCa, respectively.
Nonlinear Principal Components Analysis: Introduction and Application
ERIC Educational Resources Information Center
Linting, Marielle; Meulman, Jacqueline J.; Groenen, Patrick J. F.; van der Koojj, Anita J.
2007-01-01
The authors provide a didactic treatment of nonlinear (categorical) principal components analysis (PCA). This method is the nonlinear equivalent of standard PCA and reduces the observed variables to a number of uncorrelated principal components. The most important advantages of nonlinear over linear PCA are that it incorporates nominal and ordinal…
Yang, Lei; Wei, Ran; Shen, Henggen
2017-01-01
New principal component analysis (PCA) respirator fit test panels had been developed for current American and Chinese civilian workers based on anthropometric surveys. The PCA panels used the first two principal components (PCs) obtained from a set of 10 facial dimensions. Although the PCA panels for American and Chinese subjects adopted the bivairate framework with two PCs, the number of the PCs retained in the PCA analysis was different between Chinese subjects and Americans. For the Chinese youth group, the third PC should be retained in the PCA analysis for developing new fit test panels. In this article, an additional number label (ANL) is used to explain the third PC in PCA analysis when the first two PCs are used to construct the PCA half-facepiece respirator fit test panel for Chinese group. The three-dimensional box-counting method is proposed to estimate the ANLs by calculating fractal dimensions of the facial anthropometric data of the Chinese youth. The linear regression coefficients of scale-free range R 2 are all over 0.960, which demonstrates that the facial anthropometric data of the Chinese youth has fractal characteristic. The youth subjects born in Henan province has an ANL of 2.002, which is lower than the composite facial anthropometric data of Chinese subjects born in many provinces. Hence, Henan youth subjects have the self-similar facial anthropometric characteristic and should use the particular ANL (2.002) as the important tool along with using the PCA panel. The ANL method proposed in this article not only provides a new methodology in quantifying the characteristics of facial anthropometric dimensions for any ethnic/racial group, but also extends the scope of PCA panel studies to higher dimensions.
Time-dependent analysis of dosage delivery information for patient-controlled analgesia services.
Kuo, I-Ting; Chang, Kuang-Yi; Juan, De-Fong; Hsu, Steen J; Chan, Chia-Tai; Tsou, Mei-Yung
2018-01-01
Pain relief always plays the essential part of perioperative care and an important role of medical quality improvement. Patient-controlled analgesia (PCA) is a method that allows a patient to self-administer small boluses of analgesic to relieve the subjective pain. PCA logs from the infusion pump consisted of a lot of text messages which record all events during the therapies. The dosage information can be extracted from PCA logs to provide easily understanding features. The analysis of dosage information with time has great help to figure out the variance of a patient's pain relief condition. To explore the trend of pain relief requirement, we developed a PCA dosage information generator (PCA DIG) to extract meaningful messages from PCA logs during the first 48 hours of therapies. PCA dosage information including consumption, delivery, infusion rate, and the ratio between demand and delivery is presented with corresponding values in 4 successive time frames. Time-dependent statistical analysis demonstrated the trends of analgesia requirements decreased gradually along with time. These findings are compatible with clinical observations and further provide valuable information about the strategy to customize postoperative pain management.
Zakaria, Ammar; Shakaff, Ali Yeon Md.; Adom, Abdul Hamid; Ahmad, Mohd Noor; Masnan, Maz Jamilah; Aziz, Abdul Hallis Abdul; Fikri, Nazifah Ahmad; Abdullah, Abu Hassan; Kamarudin, Latifah Munirah
2010-01-01
An improved classification of Orthosiphon stamineus using a data fusion technique is presented. Five different commercial sources along with freshly prepared samples were discriminated using an electronic nose (e-nose) and an electronic tongue (e-tongue). Samples from the different commercial brands were evaluated by the e-tongue and then followed by the e-nose. Applying Principal Component Analysis (PCA) separately on the respective e-tongue and e-nose data, only five distinct groups were projected. However, by employing a low level data fusion technique, six distinct groupings were achieved. Hence, this technique can enhance the ability of PCA to analyze the complex samples of Orthosiphon stamineus. Linear Discriminant Analysis (LDA) was then used to further validate and classify the samples. It was found that the LDA performance was also improved when the responses from the e-nose and e-tongue were fused together. PMID:22163381
Zakaria, Ammar; Shakaff, Ali Yeon Md; Adom, Abdul Hamid; Ahmad, Mohd Noor; Masnan, Maz Jamilah; Aziz, Abdul Hallis Abdul; Fikri, Nazifah Ahmad; Abdullah, Abu Hassan; Kamarudin, Latifah Munirah
2010-01-01
An improved classification of Orthosiphon stamineus using a data fusion technique is presented. Five different commercial sources along with freshly prepared samples were discriminated using an electronic nose (e-nose) and an electronic tongue (e-tongue). Samples from the different commercial brands were evaluated by the e-tongue and then followed by the e-nose. Applying Principal Component Analysis (PCA) separately on the respective e-tongue and e-nose data, only five distinct groups were projected. However, by employing a low level data fusion technique, six distinct groupings were achieved. Hence, this technique can enhance the ability of PCA to analyze the complex samples of Orthosiphon stamineus. Linear Discriminant Analysis (LDA) was then used to further validate and classify the samples. It was found that the LDA performance was also improved when the responses from the e-nose and e-tongue were fused together.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Eifler, Tim; Krause, Elisabeth; Dodelson, Scott
2014-05-28
Systematic uncertainties that have been subdominant in past large-scale structure (LSS) surveys are likely to exceed statistical uncertainties of current and future LSS data sets, potentially limiting the extraction of cosmological information. Here we present a general framework (PCA marginalization) to consistently incorporate systematic effects into a likelihood analysis. This technique naturally accounts for degeneracies between nuisance parameters and can substantially reduce the dimension of the parameter space that needs to be sampled. As a practical application, we apply PCA marginalization to account for baryonic physics as an uncertainty in cosmic shear tomography. Specifically, we use CosmoLike to run simulatedmore » likelihood analyses on three independent sets of numerical simulations, each covering a wide range of baryonic scenarios differing in cooling, star formation, and feedback mechanisms. We simulate a Stage III (Dark Energy Survey) and Stage IV (Large Synoptic Survey Telescope/Euclid) survey and find a substantial bias in cosmological constraints if baryonic physics is not accounted for. We then show that PCA marginalization (employing at most 3 to 4 nuisance parameters) removes this bias. Our study demonstrates that it is possible to obtain robust, precise constraints on the dark energy equation of state even in the presence of large levels of systematic uncertainty in astrophysical processes. We conclude that the PCA marginalization technique is a powerful, general tool for addressing many of the challenges facing the precision cosmology program.« less
Sanchez, Tino W.; Zhang, Guangyu; Li, Jitian; Dai, Liping; Mirshahidi, Saied; Wall, Nathan R.; Yates, Clayton; Wilson, Colwick; Montgomery, Susanne; Zhang, Jian-Ying; Casiano, Carlos A.
2016-01-01
African American (AA) men suffer from a disproportionately high incidence and mortality of prostate cancer (PCa) compared with other racial/ethnic groups. Despite these disparities, African American men are underrepresented in clinical trials and in studies on PCa biology and biomarker discovery. We used immunoseroproteomics to profile antitumor autoantibody responses in AA and European American (EA) men with PCa, and explored differences in these responses. This minimally invasive approach detects autoantibodies to tumor-associated antigens that could serve as clinical biomarkers and immunotherapeutic agents. Sera from AA and EA men with PCa were probed by immunoblotting against PC3 cell proteins, with AA sera showing stronger immunoreactivity. Mass spectrometry analysis of immunoreactive protein spots revealed that several AA sera contained autoantibodies to a number of proteins associated with both the glycolysis and plasminogen pathways, particularly to alpha-enolase (ENO1). The proteomic data is deposited in ProteomeXchange with identifier PXD003968. Analysis of sera from 340 racially diverse men by enzyme-linked immunosorbent assays (ELISA) showed higher frequency of anti-ENO1 autoantibodies in PCa sera compared with control sera. We observed differences between AA-PCa and EA-PCa patients in their immunoreactivity against ENO1. Although EA-PCa sera reacted with higher frequency against purified ENO1 in ELISA and recognized by immunoblotting the endogenous cellular ENO1 across a panel of prostate cell lines, AA-PCa sera reacted weakly against this protein by ELISA but recognized it by immunoblotting preferentially in metastatic cell lines. These race-related differences in immunoreactivity to ENO1 could not be accounted by differential autoantibody recognition of phosphoepitopes within this antigen. Proteomic analysis revealed differences in the posttranslational modification profiles of ENO1 variants differentially recognized by AA-PCa and EA-PCa sera. These intriguing results suggest the possibility of race-related differences in the antitumor autoantibody response in PCa, and have implications for defining novel biological determinants of PCa health disparities. PMID:27742740
Hou, Qi; Bing, Zhi-Tong; Hu, Cheng; Li, Mao-Yin; Yang, Ke-Hu; Mo, Zu; Xie, Xiang-Wei; Liao, Ji-Lin; Lu, Yan; Horie, Shigeo; Lou, Ming-Wu
2018-06-01
Prostate cancer (PCa) is the most commonly diagnosed cancer in males in the Western world. Although prostate-specific antigen (PSA) has been widely used as a biomarker for PCa diagnosis, its results can be controversial. Therefore, new biomarkers are needed to enhance the clinical management of PCa. From publicly available microarray data, differentially expressed genes (DEGs) were identified by meta-analysis with RankProd. Genetic algorithm optimized artificial neural network (GA-ANN) was introduced to establish a diagnostic prediction model and to filter candidate genes. The diagnostic and prognostic capability of the prediction model and candidate genes were investigated in both GEO and TCGA datasets. Candidate genes were further validated by qPCR, Western Blot and Tissue microarray. By RankProd meta-analyses, 2306 significantly up- and 1311 down-regulated probes were found in 133 cases and 30 controls microarray data. The overall accuracy rate of the PCa diagnostic prediction model, consisting of a 15-gene signature, reached up to 100% in both the training and test dataset. The prediction model also showed good results for the diagnosis (AUC = 0.953) and prognosis (AUC of 5 years overall survival time = 0.808) of PCa in the TCGA database. The expression levels of three genes, FABP5, C1QTNF3 and LPHN3, were validated by qPCR. C1QTNF3 high expression was further validated in PCa tissue by Western Blot and Tissue microarray. In the GEO datasets, C1QTNF3 was a good predictor for the diagnosis of PCa (GSE6956: AUC = 0.791; GSE8218: AUC = 0.868; GSE26910: AUC = 0.972). In the TCGA database, C1QTNF3 was significantly associated with PCa patient recurrence free survival (P < .001, AUC = 0.57). In this study, we have developed a diagnostic and prognostic prediction model for PCa. C1QTNF3 was revealed as a promising biomarker for PCa. This approach can be applied to other high-throughput data from different platforms for the discovery of oncogenes or biomarkers in different kinds of diseases. Copyright © 2018. Published by Elsevier B.V.
Taguchi, Y-h; Iwadate, Mitsuo; Umeyama, Hideaki
2015-04-30
Feature extraction (FE) is difficult, particularly if there are more features than samples, as small sample numbers often result in biased outcomes or overfitting. Furthermore, multiple sample classes often complicate FE because evaluating performance, which is usual in supervised FE, is generally harder than the two-class problem. Developing sample classification independent unsupervised methods would solve many of these problems. Two principal component analysis (PCA)-based FE, specifically, variational Bayes PCA (VBPCA) was extended to perform unsupervised FE, and together with conventional PCA (CPCA)-based unsupervised FE, were tested as sample classification independent unsupervised FE methods. VBPCA- and CPCA-based unsupervised FE both performed well when applied to simulated data, and a posttraumatic stress disorder (PTSD)-mediated heart disease data set that had multiple categorical class observations in mRNA/microRNA expression of stressed mouse heart. A critical set of PTSD miRNAs/mRNAs were identified that show aberrant expression between treatment and control samples, and significant, negative correlation with one another. Moreover, greater stability and biological feasibility than conventional supervised FE was also demonstrated. Based on the results obtained, in silico drug discovery was performed as translational validation of the methods. Our two proposed unsupervised FE methods (CPCA- and VBPCA-based) worked well on simulated data, and outperformed two conventional supervised FE methods on a real data set. Thus, these two methods have suggested equivalence for FE on categorical multiclass data sets, with potential translational utility for in silico drug discovery.
ERIC Educational Resources Information Center
Linting, Marielle; Meulman, Jacqueline J.; Groenen, Patrick J. F.; van der Kooij, Anita J.
2007-01-01
Principal components analysis (PCA) is used to explore the structure of data sets containing linearly related numeric variables. Alternatively, nonlinear PCA can handle possibly nonlinearly related numeric as well as nonnumeric variables. For linear PCA, the stability of its solution can be established under the assumption of multivariate…
Azadeh, Ali; Sheikhalishahi, Mohammad
2015-06-01
A unique framework for performance optimization of generation companies (GENCOs) based on health, safety, environment, and ergonomics (HSEE) indicators is presented. To rank this sector of industry, the combination of data envelopment analysis (DEA), principal component analysis (PCA), and Taguchi are used for all branches of GENCOs. These methods are applied in an integrated manner to measure the performance of GENCO. The preferred model between DEA, PCA, and Taguchi is selected based on sensitivity analysis and maximum correlation between rankings. To achieve the stated objectives, noise is introduced into input data. The results show that Taguchi outperforms other methods. Moreover, a comprehensive experiment is carried out to identify the most influential factor for ranking GENCOs. The approach developed in this study could be used for continuous assessment and improvement of GENCO's performance in supplying energy with respect to HSEE factors. The results of such studies would help managers to have better understanding of weak and strong points in terms of HSEE factors.
NASA Astrophysics Data System (ADS)
Bradde, Serena; Bialek, William
A system with many degrees of freedom can be characterized by a covariance matrix; principal components analysis (PCA) focuses on the eigenvalues of this matrix, hoping to find a lower dimensional description. But when the spectrum is nearly continuous, any distinction between components that we keep and those that we ignore becomes arbitrary; it then is natural to ask what happens as we vary this arbitrary cutoff. We argue that this problem is analogous to the momentum shell renormalization group (RG). Following this analogy, we can define relevant and irrelevant operators, where the role of dimensionality is played by properties of the eigenvalue density. These results also suggest an approach to the analysis of real data. As an example, we study neural activity in the vertebrate retina as it responds to naturalistic movies, and find evidence of behavior controlled by a nontrivial fixed point. Applied to financial data, our analysis separates modes dominated by sampling noise from a smaller but still macroscopic number of modes described by a non-Gaussian distribution.
Siegmund, Barbara; Urdl, Katharina; Jurek, Andrea; Leitner, Erich
2018-03-14
Eight monovarietal honeys from dandelion, fir tree, linden tree, chestnut tree, robinia, orange, lavender, and rape were investigated with respect to their volatile compounds and sensory properties. Analysis of the volatile compounds was performed by gas chromatographic techniques (one-dimensional GC-MS as well as comprehensive GC×GC-MS). For sensory evaluation Napping in combination with ultraflash profiling was applied using sensory experts. For dandelion honey, 34 volatile compounds are described for the first time to be present in dandelion honey. PCA and cluster analysis of the volatile compounds, respectively, show high correlation with the PCA obtained from sensory evaluation. Lavender and linden honey showed sensory characteristics that were not expected from these honey types. Analysis of the volatile compounds resulted in the identification of odor-active compounds that are very likely derived from sources other than the respective honeyflow. Contamination with essential oils used in apiculture is very likely to be the reason for the occurrence of these compounds in the investigated honeys.
Wu, Chen-Jiang; Wang, Qing; Li, Hai; Wang, Xiao-Ning; Liu, Xi-Sheng; Shi, Hai-Bin; Zhang, Yu-Dong
2015-10-01
To investigate diagnostic efficiency of DWI using entire-tumor histogram analysis in differentiating the low-grade (LG) prostate cancer (PCa) from intermediate-high-grade (HG) PCa in comparison with conventional ROI-based measurement. DW images (b of 0-1400 s/mm(2)) from 126 pathology-confirmed PCa (diameter >0.5 cm) in 110 patients were retrospectively collected and processed by mono-exponential model. The measurement of tumor apparent diffusion coefficients (ADCs) was performed with using histogram-based and ROI-based approach, respectively. The diagnostic ability of ADCs from two methods for differentiating LG-PCa (Gleason score, GS ≤ 6) from HG-PCa (GS > 6) was determined by ROC regression, and compared by McNemar's test. There were 49 LG-tumor and 77 HG-tumor at pathologic findings. Histogram-based ADCs (mean, median, 10th and 90th) and ROI-based ADCs (mean) showed dominant relationships with ordinal GS of Pca (ρ = -0.225 to -0.406, p < 0.05). All above imaging indices reflected significant difference between LG-PCa and HG-PCa (all p values <0.01). Histogram 10th ADCs had dominantly high Az (0.738), Youden index (0.415), and positive likelihood ratio (LR+, 2.45) in stratifying tumor GS against mean, median and 90th ADCs, and ROI-based ADCs. Histogram mean, median, and 10th ADCs showed higher specificity (65.3%-74.1% vs. 44.9%, p < 0.01), but lower sensitivity (57.1%-71.3% vs. 84.4%, p < 0.05) than ROI-based ADCs in differentiating LG-PCa from HG-PCa. DWI-associated histogram analysis had higher specificity, Az, Youden index, and LR+ for differentiation of PCa Gleason grade than ROI-based approach.
He, Shixuan; Xie, Wanyi; Zhang, Wei; Zhang, Liqun; Wang, Yunxia; Liu, Xiaoling; Liu, Yulong; Du, Chunlei
2015-02-25
A novel strategy which combines iteratively cubic spline fitting baseline correction method with discriminant partial least squares qualitative analysis is employed to analyze the surface enhanced Raman scattering (SERS) spectroscopy of banned food additives, such as Sudan I dye and Rhodamine B in food, Malachite green residues in aquaculture fish. Multivariate qualitative analysis methods, using the combination of spectra preprocessing iteratively cubic spline fitting (ICSF) baseline correction with principal component analysis (PCA) and discriminant partial least squares (DPLS) classification respectively, are applied to investigate the effectiveness of SERS spectroscopy for predicting the class assignments of unknown banned food additives. PCA cannot be used to predict the class assignments of unknown samples. However, the DPLS classification can discriminate the class assignment of unknown banned additives using the information of differences in relative intensities. The results demonstrate that SERS spectroscopy combined with ICSF baseline correction method and exploratory analysis methodology DPLS classification can be potentially used for distinguishing the banned food additives in field of food safety. Copyright © 2014 Elsevier B.V. All rights reserved.
Automated Classification and Analysis of Non-metallic Inclusion Data Sets
NASA Astrophysics Data System (ADS)
Abdulsalam, Mohammad; Zhang, Tongsheng; Tan, Jia; Webler, Bryan A.
2018-05-01
The aim of this study is to utilize principal component analysis (PCA), clustering methods, and correlation analysis to condense and examine large, multivariate data sets produced from automated analysis of non-metallic inclusions. Non-metallic inclusions play a major role in defining the properties of steel and their examination has been greatly aided by automated analysis in scanning electron microscopes equipped with energy dispersive X-ray spectroscopy. The methods were applied to analyze inclusions on two sets of samples: two laboratory-scale samples and four industrial samples from a near-finished 4140 alloy steel components with varying machinability. The laboratory samples had well-defined inclusions chemistries, composed of MgO-Al2O3-CaO, spinel (MgO-Al2O3), and calcium aluminate inclusions. The industrial samples contained MnS inclusions as well as (Ca,Mn)S + calcium aluminate oxide inclusions. PCA could be used to reduce inclusion chemistry variables to a 2D plot, which revealed inclusion chemistry groupings in the samples. Clustering methods were used to automatically classify inclusion chemistry measurements into groups, i.e., no user-defined rules were required.
Anomaly Detection of Electromyographic Signals.
Ijaz, Ahsan; Choi, Jongeun
2018-04-01
In this paper, we provide a robust framework to detect anomalous electromyographic (EMG) signals and identify contamination types. As a first step for feature selection, optimally selected Lawton wavelets transform is applied. Robust principal component analysis (rPCA) is then performed on these wavelet coefficients to obtain features in a lower dimension. The rPCA based features are used for constructing a self-organizing map (SOM). Finally, hierarchical clustering is applied on the SOM that separates anomalous signals residing in the smaller clusters and breaks them into logical units for contamination identification. The proposed methodology is tested using synthetic and real world EMG signals. The synthetic EMG signals are generated using a heteroscedastic process mimicking desired experimental setups. A sub-part of these synthetic signals is introduced with anomalies. These results are followed with real EMG signals introduced with synthetic anomalies. Finally, a heterogeneous real world data set is used with known quality issues under an unsupervised setting. The framework provides recall of 90% (± 3.3) and precision of 99%(±0.4).
Wang, Jing; Wu, Chen-Jiang; Bao, Mei-Ling; Zhang, Jing; Wang, Xiao-Ning; Zhang, Yu-Dong
2017-10-01
To investigate whether machine learning-based analysis of MR radiomics can help improve the performance PI-RADS v2 in clinically relevant prostate cancer (PCa). This IRB-approved study included 54 patients with PCa undergoing multi-parametric (mp) MRI before prostatectomy. Imaging analysis was performed on 54 tumours, 47 normal peripheral (PZ) and 48 normal transitional (TZ) zone based on histological-radiological correlation. Mp-MRI was scored via PI-RADS, and quantified by measuring radiomic features. Predictive model was developed using a novel support vector machine trained with: (i) radiomics, (ii) PI-RADS scores, (iii) radiomics and PI-RADS scores. Paired comparison was made via ROC analysis. For PCa versus normal TZ, the model trained with radiomics had a significantly higher area under the ROC curve (Az) (0.955 [95% CI 0.923-0.976]) than PI-RADS (Az: 0.878 [0.834-0.914], p < 0.001). The Az between them was insignificant for PCa versus PZ (0.972 [0.945-0.988] vs. 0.940 [0.905-0.965], p = 0.097). When radiomics was added, performance of PI-RADS was significantly improved for PCa versus PZ (Az: 0.983 [0.960-0.995]) and PCa versus TZ (Az: 0.968 [0.940-0.985]). Machine learning analysis of MR radiomics can help improve the performance of PI-RADS in clinically relevant PCa. • Machine-based analysis of MR radiomics outperformed in TZ cancer against PI-RADS. • Adding MR radiomics significantly improved the performance of PI-RADS. • DKI-derived Dapp and Kapp were two strong markers for the diagnosis of PCa.
Kim, Yong-June; Yoon, Hyung-Yoon; Kim, Seon-Kyu; Kim, Young-Won; Kim, Eun-Jung; Kim, Isaac Yi; Kim, Wun-Jae
2011-07-01
Abnormal DNA methylation is associated with many human cancers. The aim of the present study was to identify novel methylation markers in prostate cancer (PCa) by microarray analysis and to test whether these markers could discriminate normal and PCa cells. Microarray-based DNA methylation and gene expression profiling was carried out using a panel of PCa cell lines and a control normal prostate cell line. The methylation status of candidate genes in prostate cell lines was confirmed by real-time reverse transcriptase-PCR, bisulfite sequencing analysis, and treatment with a demethylation agent. DNA methylation and gene expression analysis in 203 human prostate specimens, including 106 PCa and 97 benign prostate hyperplasia (BPH), were carried out. Further validation using microarray gene expression data from the Gene Expression Omnibus (GEO) was carried out. Epidermal growth factor-containing fibulin-like extracellular matrix protein 1 (EFEMP1) was identified as a lead candidate methylation marker for PCa. The gene expression level of EFEMP1 was significantly higher in tissue samples from patients with BPH than in those with PCa (P < 0.001). The sensitivity and specificity of EFEMP1 methylation status in discriminating between PCa and BPH reached 95.3% (101 of 106) and 86.6% (84 of 97), respectively. From the GEO data set, we confirmed that the expression level of EFEMP1 was significantly different between PCa and BPH. Genome-wide characterization of DNA methylation profiles enabled the identification of EFEMP1 aberrant methylation patterns in PCa. EFEMP1 might be a useful indicator for the detection of PCa.
Dai, Liping; Li, Jitian; Xing, Mengtao; Sanchez, Tino W; Casiano, Carlos A; Zhang, Jian-Ying
2016-11-01
The prostate-specific antigen (PSA) testing has been widely implemented for the early detection and management of prostate cancer (PCa). However, the lack of specificity has led to overdiagnosis, resulting in many possibly unnecessary biopsies and overtreatment. Therefore, novel serological biomarkers with high sensitivity and specificity are of vital importance needed to complement PSA testing in the early diagnosis and effective management of PCa. This is particularly critical in the context of PCa health disparities, where early detection and management could help reduce the disproportionately high PCa mortality observed in African-American men. Previous studies have demonstrated that sera from patients with PCa contain autoantibodies that react with tumor-associated antigens (TAAs). The serological proteome analysis (SERPA) approach was used to identify tumor-associated antigens (TAAs) of PCa. In evaluation study, the level of anti-NPM1 antibody was examined in sera from test cohort, validation cohort, as well as European-American (EA) and African-American (AA) men with PCa by using immunoassay. Nucleophosmin 1 (NPM1) as a 33 kDa TAA in PCa was identified and characterized by SERPA approach. Anti-NPM1 antibody level in PCa was higher than in benign prostatic hyperplasia (BPH) patients and healthy individuals. Receiver operating characteristic (ROC) curve analysis showed similar high diagnostic value for PCa in the test cohort (area under the curve (AUC):0.860) and validation cohort (AUC: 0.822) to differentiate from normal individuals and BPH. Interestingly, AUC values were significantly higher for AA PCa patients. When considering concurrent serum measurements of anti-NPM1 antibody and PSA, 97.1% PCa patients at early stage were identified correctly, while 69.2% BPH patients who had elevated PSA levels were found to be anti-NPM1 negative. Additionally, anti-NPM1 antibody levels in PCa patients at early stage significantly increased after surgery treatment. This intriguing data suggested that NPM1 can elicit autoantibody response in PCa and might be a potential biomarker for the immunodiagnosis and prognosis of PCa, and for supplementing PSA testing in distinguishing PCa from BPH. Prostate 76:1375-1386, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Ma, Teng; Yang, Shaolin; Jing, Haiyan; Cong, Lin; Cao, Zhixin; Liu, Zhiling; Huang, Zhaoqin
2018-03-01
Prostate cancer (PCa) is the second most common cancer in men. The Gleason score (GS) and biomarkers play important roles in the diagnosis and treatment of patients with PCa. The purpose of this study was to investigate the relationship between the apparent diffusion coefficient (ADC) and the molecular markers Ki-67, hypoxia-inducible factor-1α (HIF-1α) and vascular endothelial growth factor (VEGF) in PCa. Thirty-nine patients with 39 lesions, who had been diagnosed with PCa, were enrolled in this study. All patients underwent diffusion-weighted magnetic resonance imaging (DW-MRI) (b = 800 s/mm 2 ). The expression of Ki-67, HIF-1α and VEGF was assessed by immunohistochemistry. Statistical analysis was applied to analyze the association between ADC and prostate-specific antigen (PSA), GS and the expression of Ki-67, HIF-1α and VEGF. The group differences in ADC among different grades of Ki-67, HIF-1α and VEGF were also analyzed. The mean ± standard deviation of ADC was (0.76 ± 0.27) × 10 -3 mm 2 /s. ADC correlated negatively with PSA and GS (p < 0.05). The Ki-67 staining index (SI), HIF-1α expression and VEGF expression in PCa were correlated inversely with ADC, controlling for age (r = -0.332, p < 0.05; r = -0.662, p < 0.0005; and r = -0.714, p < 0.0005, respectively). ADC showed a significant difference among different grades of Ki-67 (F = 9.164, p = 0.005), HIF-1α (F = 40.333, p < 0.0005) and VEGF (F = 22.048, p < 0.0005). In conclusion, ADC was correlated with PSA, GS, and Ki-67, HIF-1α and VEGF expression in patients with PCa. ADC may be used to evaluate tumor proliferation, hypoxia and angiogenesis in PCa. Copyright © 2018 John Wiley & Sons, Ltd.
NASA Astrophysics Data System (ADS)
Gualandi, A.; Serpelloni, E.; Belardinelli, M. E.
2014-12-01
A critical point in the analysis of ground displacements time series is the development of data driven methods that allow to discern and characterize the different sources that generate the observed displacements. A widely used multivariate statistical technique is the Principal Component Analysis (PCA), which allows to reduce the dimensionality of the data space maintaining most of the variance of the dataset explained. It reproduces the original data using a limited number of Principal Components, but it also shows some deficiencies. Indeed, PCA does not perform well in finding the solution to the so-called Blind Source Separation (BSS) problem, i.e. in recovering and separating the original sources that generated the observed data. This is mainly due to the assumptions on which PCA relies: it looks for a new Euclidean space where the projected data are uncorrelated. Usually, the uncorrelation condition is not strong enough and it has been proven that the BSS problem can be tackled imposing on the components to be independent. The Independent Component Analysis (ICA) is, in fact, another popular technique adopted to approach this problem, and it can be used in all those fields where PCA is also applied. An ICA approach enables us to explain the time series imposing a fewer number of constraints on the model, and to reveal anomalies in the data such as transient signals. However, the independence condition is not easy to impose, and it is often necessary to introduce some approximations. To work around this problem, we use a variational bayesian ICA (vbICA) method, which models the probability density function (pdf) of each source signal using a mix of Gaussian distributions. This technique allows for more flexibility in the description of the pdf of the sources, giving a more reliable estimate of them. Here we present the application of the vbICA technique to GPS position time series. First, we use vbICA on synthetic data that simulate a seismic cycle (interseismic + coseismic + postseismic + seasonal + noise), and study the ability of the algorithm to recover the original (known) sources of deformation. Secondly, we apply vbICA to different tectonically active scenarios, such as earthquakes in central and northern Italy, as well as the study of slow slip events in Cascadia.
NASA Astrophysics Data System (ADS)
Kong, Xianyu; Liu, Yanfang; Jian, Huimin; Su, Rongguo; Yao, Qingzhen; Shi, Xiaoyong
2017-10-01
To realize potential cost savings in coastal monitoring programs and provide timely advice for marine management, there is an urgent need for efficient evaluation tools based on easily measured variables for the rapid and timely assessment of estuarine and offshore eutrophication. In this study, using parallel factor analysis (PARAFAC), principal component analysis (PCA), and discriminant function analysis (DFA) with the trophic index (TRIX) for reference, we developed an approach for rapidly assessing the eutrophication status of coastal waters using easy-to-measure parameters, including chromophoric dissolved organic matter (CDOM), fluorescence excitation-emission matrices, CDOM UV-Vis absorbance, and other water-quality parameters (turbidity, chlorophyll a, and dissolved oxygen). First, we decomposed CDOM excitation-emission matrices (EEMs) by PARAFAC to identify three components. Then, we applied PCA to simplify the complexity of the relationships between the water-quality parameters. Finally, we used the PCA score values as independent variables in DFA to develop a eutrophication assessment model. The developed model yielded classification accuracy rates of 97.1%, 80.5%, 90.3%, and 89.1% for good, moderate, and poor water qualities, and for the overall data sets, respectively. Our results suggest that these easy-to-measure parameters could be used to develop a simple approach for rapid in-situ assessment and monitoring of the eutrophication of estuarine and offshore areas.
Zachery A. Holden; Michael A. Crimmins; Samuel A. Cushman; Jeremy S. Littell
2010-01-01
Accurate, fine spatial resolution predictions of surface air temperatures are critical for understanding many hydrologic and ecological processes. This study examines the spatial and temporal variability in nocturnal air temperatures across a mountainous region of Northern Idaho. Principal components analysis (PCA) was applied to a network of 70 Hobo temperature...
Kandadai, Venk; Yang, Haodong; Jiang, Ling; Yang, Christopher C; Fleisher, Linda; Winston, Flaura Koplin
2016-05-05
Little is known about the ability of individual stakeholder groups to achieve health information dissemination goals through Twitter. This study aimed to develop and apply methods for the systematic evaluation and optimization of health information dissemination by stakeholders through Twitter. Tweet content from 1790 followers of @SafetyMD (July-November 2012) was examined. User emphasis, a new indicator of Twitter information dissemination, was defined and applied to retweets across two levels of retweeters originating from @SafetyMD. User interest clusters were identified based on principal component analysis (PCA) and hierarchical cluster analysis (HCA) of a random sample of 170 followers. User emphasis of keywords remained across levels but decreased by 9.5 percentage points. PCA and HCA identified 12 statistically unique clusters of followers within the @SafetyMD Twitter network. This study is one of the first to develop methods for use by stakeholders to evaluate and optimize their use of Twitter to disseminate health information. Our new methods provide preliminary evidence that individual stakeholders can evaluate the effectiveness of health information dissemination and create content-specific clusters for more specific targeted messaging.
Spyropoulos, Evangelos; Kotsiris, Dimitrios; Spyropoulos, Katherine; Panagopoulos, Aggelos; Galanakis, Ioannis; Mavrikos, Stamatios
2017-02-01
We developed a mathematical "prostate cancer (PCa) conditions simulating" predictive model (PCP-SMART), from which we derived a novel PCa predictor (prostate cancer risk determinator [PCRD] index) and a PCa risk equation. We used these to estimate the probability of finding PCa on prostate biopsy, on an individual basis. A total of 371 men who had undergone transrectal ultrasound-guided prostate biopsy were enrolled in the present study. Given that PCa risk relates to the total prostate-specific antigen (tPSA) level, age, prostate volume, free PSA (fPSA), fPSA/tPSA ratio, and PSA density and that tPSA ≥ 50 ng/mL has a 98.5% positive predictive value for a PCa diagnosis, we hypothesized that correlating 2 variables composed of 3 ratios (1, tPSA/age; 2, tPSA/prostate volume; and 3, fPSA/tPSA; 1 variable including the patient's tPSA and the other, a tPSA value of 50 ng/mL) could operate as a PCa conditions imitating/simulating model. Linear regression analysis was used to derive the coefficient of determination (R 2 ), termed the PCRD index. To estimate the PCRD index's predictive validity, we used the χ 2 test, multiple logistic regression analysis with PCa risk equation formation, calculation of test performance characteristics, and area under the receiver operating characteristic curve analysis using SPSS, version 22 (P < .05). The biopsy findings were positive for PCa in 167 patients (45.1%) and negative in 164 (44.2%). The PCRD index was positively signed in 89.82% positive PCa cases and negative in 91.46% negative PCa cases (χ 2 test; P < .001; relative risk, 8.98). The sensitivity was 89.8%, specificity was 91.5%, positive predictive value was 91.5%, negative predictive value was 89.8%, positive likelihood ratio was 10.5, negative likelihood ratio was 0.11, and accuracy was 90.6%. Multiple logistic regression revealed the PCRD index as an independent PCa predictor, and the formulated risk equation was 91% accurate in predicting the probability of finding PCa. On the receiver operating characteristic analysis, the PCRD index (area under the curve, 0.926) significantly (P < .001) outperformed other, established PCa predictors. The PCRD index effectively predicted the prostate biopsy outcome, correctly identifying 9 of 10 men who were eventually diagnosed with PCa and correctly ruling out PCa for 9 of 10 men who did not have PCa. Its predictive power significantly outperformed established PCa predictors, and the formulated risk equation accurately calculated the probability of finding cancer on biopsy, on an individual patient basis. Copyright © 2016 Elsevier Inc. All rights reserved.
Li, Ziyi; Safo, Sandra E; Long, Qi
2017-07-11
Sparse principal component analysis (PCA) is a popular tool for dimensionality reduction, pattern recognition, and visualization of high dimensional data. It has been recognized that complex biological mechanisms occur through concerted relationships of multiple genes working in networks that are often represented by graphs. Recent work has shown that incorporating such biological information improves feature selection and prediction performance in regression analysis, but there has been limited work on extending this approach to PCA. In this article, we propose two new sparse PCA methods called Fused and Grouped sparse PCA that enable incorporation of prior biological information in variable selection. Our simulation studies suggest that, compared to existing sparse PCA methods, the proposed methods achieve higher sensitivity and specificity when the graph structure is correctly specified, and are fairly robust to misspecified graph structures. Application to a glioblastoma gene expression dataset identified pathways that are suggested in the literature to be related with glioblastoma. The proposed sparse PCA methods Fused and Grouped sparse PCA can effectively incorporate prior biological information in variable selection, leading to improved feature selection and more interpretable principal component loadings and potentially providing insights on molecular underpinnings of complex diseases.
Incorporating principal component analysis into air quality ...
The efficacy of standard air quality model evaluation techniques is becoming compromised as the simulation periods continue to lengthen in response to ever increasing computing capacity. Accordingly, the purpose of this paper is to demonstrate a statistical approach called Principal Component Analysis (PCA) with the intent of motivating its use by the evaluation community. One of the main objectives of PCA is to identify, through data reduction, the recurring and independent modes of variations (or signals) within a very large dataset, thereby summarizing the essential information of that dataset so that meaningful and descriptive conclusions can be made. In this demonstration, PCA is applied to a simple evaluation metric – the model bias associated with EPA's Community Multi-scale Air Quality (CMAQ) model when compared to weekly observations of sulfate (SO42−) and ammonium (NH4+) ambient air concentrations measured by the Clean Air Status and Trends Network (CASTNet). The advantages of using this technique are demonstrated as it identifies strong and systematic patterns of CMAQ model bias across a myriad of spatial and temporal scales that are neither constrained to geopolitical boundaries nor monthly/seasonal time periods (a limitation of many current studies). The technique also identifies locations (station–grid cell pairs) that are used as indicators for a more thorough diagnostic evaluation thereby hastening and facilitating understanding of the prob
Representation of Probability Density Functions from Orbit Determination using the Particle Filter
NASA Technical Reports Server (NTRS)
Mashiku, Alinda K.; Garrison, James; Carpenter, J. Russell
2012-01-01
Statistical orbit determination enables us to obtain estimates of the state and the statistical information of its region of uncertainty. In order to obtain an accurate representation of the probability density function (PDF) that incorporates higher order statistical information, we propose the use of nonlinear estimation methods such as the Particle Filter. The Particle Filter (PF) is capable of providing a PDF representation of the state estimates whose accuracy is dependent on the number of particles or samples used. For this method to be applicable to real case scenarios, we need a way of accurately representing the PDF in a compressed manner with little information loss. Hence we propose using the Independent Component Analysis (ICA) as a non-Gaussian dimensional reduction method that is capable of maintaining higher order statistical information obtained using the PF. Methods such as the Principal Component Analysis (PCA) are based on utilizing up to second order statistics, hence will not suffice in maintaining maximum information content. Both the PCA and the ICA are applied to two scenarios that involve a highly eccentric orbit with a lower apriori uncertainty covariance and a less eccentric orbit with a higher a priori uncertainty covariance, to illustrate the capability of the ICA in relation to the PCA.
NASA Astrophysics Data System (ADS)
Gao, Yang; Chen, Maomao; Wu, Junyu; Zhou, Yuan; Cai, Chuangjian; Wang, Daliang; Luo, Jianwen
2017-09-01
Fluorescence molecular imaging has been used to target tumors in mice with xenograft tumors. However, tumor imaging is largely distorted by the aggregation of fluorescent probes in the liver. A principal component analysis (PCA)-based strategy was applied on the in vivo dynamic fluorescence imaging results of three mice with xenograft tumors to facilitate tumor imaging, with the help of a tumor-specific fluorescent probe. Tumor-relevant features were extracted from the original images by PCA and represented by the principal component (PC) maps. The second principal component (PC2) map represented the tumor-related features, and the first principal component (PC1) map retained the original pharmacokinetic profiles, especially of the liver. The distribution patterns of the PC2 map of the tumor-bearing mice were in good agreement with the actual tumor location. The tumor-to-liver ratio and contrast-to-noise ratio were significantly higher on the PC2 map than on the original images, thus distinguishing the tumor from its nearby fluorescence noise of liver. The results suggest that the PC2 map could serve as a bioimaging marker to facilitate in vivo tumor localization, and dynamic fluorescence molecular imaging with PCA could be a valuable tool for future studies of in vivo tumor metabolism and progression.
Zhang, Huai-zhu; Lin, Jun; Zhang, Huai-Zhu
2014-06-01
In the present paper, the outlier detection methods for determination of oil yield in oil shale using near-infrared (NIR) diffuse reflection spectroscopy was studied. During the quantitative analysis with near-infrared spectroscopy, environmental change and operator error will both produce outliers. The presence of outliers will affect the overall distribution trend of samples and lead to the decrease in predictive capability. Thus, the detection of outliers are important for the construction of high-quality calibration models. The methods including principal component analysis-Mahalanobis distance (PCA-MD) and resampling by half-means (RHM) were applied to the discrimination and elimination of outliers in this work. The thresholds and confidences for MD and RHM were optimized using the performance of partial least squares (PLS) models constructed after the elimination of outliers, respectively. Compared with the model constructed with the data of full spectrum, the values of RMSEP of the models constructed with the application of PCA-MD with a threshold of a value equal to the sum of average and standard deviation of MD, RHM with the confidence level of 85%, and the combination of PCA-MD and RHM, were reduced by 48.3%, 27.5% and 44.8%, respectively. The predictive ability of the calibration model has been improved effectively.
NASA Astrophysics Data System (ADS)
Salman, Ahmad; Lapidot, Itshak; Pomerantz, Ami; Tsror, Leah; Shufan, Elad; Moreh, Raymond; Mordechai, Shaul; Huleihel, Mahmoud
2012-01-01
The early diagnosis of phytopathogens is of a great importance; it could save large economical losses due to crops damaged by fungal diseases, and prevent unnecessary soil fumigation or the use of fungicides and bactericides and thus prevent considerable environmental pollution. In this study, 18 isolates of three different fungi genera were investigated; six isolates of Colletotrichum coccodes, six isolates of Verticillium dahliae and six isolates of Fusarium oxysporum. Our main goal was to differentiate these fungi samples on the level of isolates, based on their infrared absorption spectra obtained using the Fourier transform infrared-attenuated total reflection (FTIR-ATR) sampling technique. Advanced statistical and mathematical methods: principal component analysis (PCA), linear discriminant analysis (LDA), and k-means were applied to the spectra after manipulation. Our results showed significant spectral differences between the various fungi genera examined. The use of k-means enabled classification between the genera with a 94.5% accuracy, whereas the use of PCA [3 principal components (PCs)] and LDA has achieved a 99.7% success rate. However, on the level of isolates, the best differentiation results were obtained using PCA (9 PCs) and LDA for the lower wavenumber region (800-1775 cm-1), with identification success rates of 87%, 85.5%, and 94.5% for Colletotrichum, Fusarium, and Verticillium strains, respectively.
Ye, Tao; Jin, Cheng; Zhou, Jian; Li, Xingfeng; Wang, Haitao; Deng, Pingye; Yang, Ying; Wu, Yanwen; Xiao, Xiaohe
2011-07-15
Musk is a precious and wide applied material in traditional Chinese medicine, also, an important material for the perfume industry all over the world. To establish a rapid, cost-effective and relatively objective assessment for the quality of musk, different musk samples, including authentic, fake and adulterate, were collected. A oxide sensor based electronic nose (E-nose) was employed to measure the musk samples, the E-nose generated data were analyzed by principal component analysis (PCA), the responses of 18 sensors of E-nose were evaluated by loading analysis. Results showed that a rapid evaluation of complex response of the samples could be obtained, in combination with PCA and the perception level of the E-nose was given better results in the recognition values of the musk aroma. The authentic, fake and adulterate musk could be distinguished by E-nose coupled with PCA, sensor 2, 3, 5, 12, 15 and 17 were found to be able to better discriminate between musk samples, confirming the potential application of an electronic instrument coupled with chemometrics for a rapid and on-line quality control for the traditional medicines. Copyright © 2011 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Lee, Kyunghoon
To evaluate the maximum likelihood estimates (MLEs) of probabilistic principal component analysis (PPCA) parameters such as a factor-loading, PPCA can invoke an expectation-maximization (EM) algorithm, yielding an EM algorithm for PPCA (EM-PCA). In order to examine the benefits of the EM-PCA for aerospace engineering applications, this thesis attempts to qualitatively and quantitatively scrutinize the EM-PCA alongside both POD and gappy POD using high-dimensional simulation data. In pursuing qualitative investigations, the theoretical relationship between POD and PPCA is transparent such that the factor-loading MLE of PPCA, evaluated by the EM-PCA, pertains to an orthogonal basis obtained by POD. By contrast, the analytical connection between gappy POD and the EM-PCA is nebulous because they distinctively approximate missing data due to their antithetical formulation perspectives: gappy POD solves a least-squares problem whereas the EM-PCA relies on the expectation of the observation probability model. To juxtapose both gappy POD and the EM-PCA, this research proposes a unifying least-squares perspective that embraces the two disparate algorithms within a generalized least-squares framework. As a result, the unifying perspective reveals that both methods address similar least-squares problems; however, their formulations contain dissimilar bases and norms. Furthermore, this research delves into the ramifications of the different bases and norms that will eventually characterize the traits of both methods. To this end, two hybrid algorithms of gappy POD and the EM-PCA are devised and compared to the original algorithms for a qualitative illustration of the different basis and norm effects. After all, a norm reflecting a curve-fitting method is found to more significantly affect estimation error reduction than a basis for two example test data sets: one is absent of data only at a single snapshot and the other misses data across all the snapshots. From a numerical performance aspect, the EM-PCA is computationally less efficient than POD for intact data since it suffers from slow convergence inherited from the EM algorithm. For incomplete data, this thesis quantitatively found that the number of data missing snapshots predetermines whether the EM-PCA or gappy POD outperforms the other because of the computational cost of a coefficient evaluation, resulting from a norm selection. For instance, gappy POD demands laborious computational effort in proportion to the number of data-missing snapshots as a consequence of the gappy norm. In contrast, the computational cost of the EM-PCA is invariant to the number of data-missing snapshots thanks to the L2 norm. In general, the higher the number of data-missing snapshots, the wider the gap between the computational cost of gappy POD and the EM-PCA. Based on the numerical experiments reported in this thesis, the following criterion is recommended regarding the selection between gappy POD and the EM-PCA for computational efficiency: gappy POD for an incomplete data set containing a few data-missing snapshots and the EM-PCA for an incomplete data set involving multiple data-missing snapshots. Last, the EM-PCA is applied to two aerospace applications in comparison to gappy POD as a proof of concept: one with an emphasis on basis extraction and the other with a focus on missing data reconstruction for a given incomplete data set with scattered missing data. The first application exploits the EM-PCA to efficiently construct reduced-order models of engine deck responses obtained by the numerical propulsion system simulation (NPSS), some of whose results are absent due to failed analyses caused by numerical instability. Model-prediction tests validate that engine performance metrics estimated by the reduced-order NPSS model exhibit considerably good agreement with those directly obtained by NPSS. Similarly, the second application illustrates that the EM-PCA is significantly more cost effective than gappy POD at repairing spurious PIV measurements obtained from acoustically-excited, bluff-body jet flow experiments. The EM-PCA reduces computational cost on factors 8 ˜ 19 compared to gappy POD while generating the same restoration results as those evaluated by gappy POD. All in all, through comprehensive theoretical and numerical investigation, this research establishes that the EM-PCA is an efficient alternative to gappy POD for an incomplete data set containing missing data over an entire data set. (Abstract shortened by UMI.)
Bermudo, R; Abia, D; Mozos, A; García-Cruz, E; Alcaraz, A; Ortiz, Á R; Thomson, T M; Fernández, P L
2011-01-01
Introduction: Currently, final diagnosis of prostate cancer (PCa) is based on histopathological analysis of needle biopsies, but this process often bears uncertainties due to small sample size, tumour focality and pathologist's subjective assessment. Methods: Prostate cancer diagnostic signatures were generated by applying linear discriminant analysis to microarray and real-time RT–PCR (qRT–PCR) data from normal and tumoural prostate tissue samples. Additionally, after removal of biopsy tissues, material washed off from transrectal biopsy needles was used for molecular profiling and discriminant analysis. Results: Linear discriminant analysis applied to microarray data for a set of 318 genes differentially expressed between non-tumoural and tumoural prostate samples produced 26 gene signatures, which classified the 84 samples used with 100% accuracy. To identify signatures potentially useful for the diagnosis of prostate biopsies, surplus material washed off from routine biopsy needles from 53 patients was used to generate qRT–PCR data for a subset of 11 genes. This analysis identified a six-gene signature that correctly assigned the biopsies as benign or tumoural in 92.6% of the cases, with 88.8% sensitivity and 96.1% specificity. Conclusion: Surplus material from prostate needle biopsies can be used for minimal-size gene signature analysis for sensitive and accurate discrimination between non-tumoural and tumoural prostates, without interference with current diagnostic procedures. This approach could be a useful adjunct to current procedures in PCa diagnosis. PMID:22009027
Tailored multivariate analysis for modulated enhanced diffraction
DOE Office of Scientific and Technical Information (OSTI.GOV)
Caliandro, Rocco; Guccione, Pietro; Nico, Giovanni
2015-10-21
Modulated enhanced diffraction (MED) is a technique allowing the dynamic structural characterization of crystalline materials subjected to an external stimulus, which is particularly suited forin situandoperandostructural investigations at synchrotron sources. Contributions from the (active) part of the crystal system that varies synchronously with the stimulus can be extracted by an offline analysis, which can only be applied in the case of periodic stimuli and linear system responses. In this paper a new decomposition approach based on multivariate analysis is proposed. The standard principal component analysis (PCA) is adapted to treat MED data: specific figures of merit based on their scoresmore » and loadings are found, and the directions of the principal components obtained by PCA are modified to maximize such figures of merit. As a result, a general method to decompose MED data, called optimum constrained components rotation (OCCR), is developed, which produces very precise results on simulated data, even in the case of nonperiodic stimuli and/or nonlinear responses. The multivariate analysis approach is able to supply in one shot both the diffraction pattern related to the active atoms (through the OCCR loadings) and the time dependence of the system response (through the OCCR scores). When applied to real data, OCCR was able to supply only the latter information, as the former was hindered by changes in abundances of different crystal phases, which occurred besides structural variations in the specific case considered. To develop a decomposition procedure able to cope with this combined effect represents the next challenge in MED analysis.« less
A feasibility study on age-related factors of wrist pulse using principal component analysis.
Jang-Han Bae; Young Ju Jeon; Sanghun Lee; Jaeuk U Kim
2016-08-01
Various analysis methods for examining wrist pulse characteristics are needed for accurate pulse diagnosis. In this feasibility study, principal component analysis (PCA) was performed to observe age-related factors of wrist pulse from various analysis parameters. Forty subjects in the age group of 20s and 40s were participated, and their wrist pulse signal and respiration signal were acquired with the pulse tonometric device. After pre-processing of the signals, twenty analysis parameters which have been regarded as values reflecting pulse characteristics were calculated and PCA was performed. As a results, we could reduce complex parameters to lower dimension and age-related factors of wrist pulse were observed by combining-new analysis parameter derived from PCA. These results demonstrate that PCA can be useful tool for analyzing wrist pulse signal.
Fraser, Graham M; Goldman, Daniel; Ellis, Christopher G
2013-11-01
We compare RMN to PCA under several simulated physiological conditions to determine how the use of different vascular geometry affects oxygen transport solutions. Three discrete networks were reconstructed from intravital video microscopy of rat skeletal muscle (84 × 168 × 342 μm, 70 × 157 × 268 μm, and 65 × 240 × 571 μm), and hemodynamic measurements were made in individual capillaries. PCAs were created based on statistical measurements from RMNs. Blood flow and O₂ transport models were applied, and the resulting solutions for RMN and PCA models were compared under four conditions (rest, exercise, ischemia, and hypoxia). Predicted tissue PO₂ was consistently lower in all RMN simulations compared to the paired PCA. PO₂ for 3D reconstructions at rest were 28.2 ± 4.8, 28.1 ± 3.5, and 33.0 ± 4.5 mmHg for networks I, II, and III compared to the PCA mean values of 31.2 ± 4.5, 30.6 ± 3.4, and 33.8 ± 4.6 mmHg. Simulated exercise yielded mean tissue PO₂ in the RMN of 10.1 ± 5.4, 12.6 ± 5.7, and 19.7 ± 5.7 mmHg compared to 15.3 ± 7.3, 18.8 ± 5.3, and 21.7 ± 6.0 in PCA. These findings suggest that volume matched PCA yield different results compared to reconstructed microvascular geometries when applied to O₂ transport modeling; the predominant characteristic of this difference being an over estimate of mean tissue PO₂. Despite this limitation, PCA models remain important for theoretical studies as they produce PO₂ distributions with similar shape and parameter dependence as RMN. © 2013 John Wiley & Sons Ltd.
Experiences of Australian men diagnosed with advanced prostate cancer: a qualitative study
Chambers, Suzanne K; Hyde, Melissa K; Laurie, Kirstyn; Legg, Melissa; Frydenberg, Mark; Davis, Ian D; Lowe, Anthony; Dunn, Jeff
2018-01-01
Objective To explore men’s lived experience of advanced prostate cancer (PCa) and preferences for support. Design Cross-sectional qualitative study applying open-ended surveys and interviews conducted between June and November 2016. Interviews audio-recorded and transcribed verbatim and analysed from an interpretive phenomenological perspective. Setting Australia, nation-wide. Participants 39 men diagnosed with advanced PCa (metastatic or castration-resistant biochemical progression) were surveyed with 28 men subsequently completing a semistructured in depth telephone interview. Results Thematic analysis of interviews identified two organising themes: lived experience and supportive care. Lived experience included six superordinate themes: regret about late diagnosis and treatment decisions, being discounted in the health system, fear/uncertainty about the future, acceptance of their situation, masculinity and treatment effects. Supportive care included five superordinate themes: communication, care coordination, accessible care, shared experience/peer support and involvement of their partner/family. Conclusions Life course and the health and social context of PCa influence men’s experiences of advanced disease. Multimodal interventions integrating peer support and specialist nurses are needed that more closely articulate with men’s expressed needs. PMID:29455168
A comparison of PCA/ICA for data preprocessing in remote sensing imagery classification
NASA Astrophysics Data System (ADS)
He, Hui; Yu, Xianchuan
2005-10-01
In this paper a performance comparison of a variety of data preprocessing algorithms in remote sensing image classification is presented. These selected algorithms are principal component analysis (PCA) and three different independent component analyses, ICA (Fast-ICA (Aapo Hyvarinen, 1999), Kernel-ICA (KCCA and KGV (Bach & Jordan, 2002), EFFICA (Aiyou Chen & Peter Bickel, 2003). These algorithms were applied to a remote sensing imagery (1600×1197), obtained from Shunyi, Beijing. For classification, a MLC method is used for the raw and preprocessed data. The results show that classification with the preprocessed data have more confident results than that with raw data and among the preprocessing algorithms, ICA algorithms improve on PCA and EFFICA performs better than the others. The convergence of these ICA algorithms (for data points more than a million) are also studied, the result shows EFFICA converges much faster than the others. Furthermore, because EFFICA is a one-step maximum likelihood estimate (MLE) which reaches asymptotic Fisher efficiency (EFFICA), it computers quite small so that its demand of memory come down greatly, which settled the "out of memory" problem occurred in the other algorithms.
Lu, Xiaonan; Webb, Molly; Talbott, Mariah; Van Eenennaam, Joel; Palumbo, Amanda; Linares-Casenave, Javier; Doroshov, Serge; Struffenegger, Peter; Rasco, Barbara
2010-04-14
Fourier transform infrared spectroscopy (FT-IR, 4000-400 cm(-1)) was applied to blood plasma of farmed white sturgeon (N = 40) to differentiate and predict the stages of ovarian maturity. Spectral features of sex steroids (approximately 3000 cm(-1)) and vitellogenin (approximately 1080 cm(-1)) were identified. Clear segregation of maturity stages (previtellogenesis, vitellogenesis, postvitellogenesis, and follicular atresia) was achieved using principal component analysis (PCA). Progression of oocyte development in the late phase of vitellogenesis was also monitored using PCA based on changes in plasma concentrations of sex steroid and lipid content. The observed oocyte polarization index (PI, a measure of nuclear migration) was correlated with changes in plasma sex steroid levels revealed by FT-IR PCA results. A partial least squares (PLS) model predicted PI values within the range 0.12-0.40 (R = 0.95, SEP = 2.18%) from differences in spectral features. These results suggest that FT-IR may be a good tool for assessing ovarian maturity in farmed sturgeon and will reduce the need for the invasive ovarian biopsy required for PI determination.
Chemical information obtained from Auger depth profiles by means of advanced factor analysis (MLCFA)
NASA Astrophysics Data System (ADS)
De Volder, P.; Hoogewijs, R.; De Gryse, R.; Fiermans, L.; Vennik, J.
1993-01-01
The advanced multivariate statistical technique "maximum likelihood common factor analysis (MLCFA)" is shown to be superior to "principal component analysis (PCA)" for decomposing overlapping peaks into their individual component spectra of which neither the number of components nor the peak shape of the component spectra is known. An examination of the maximum resolving power of both techniques, MLCFA and PCA, by means of artificially created series of multicomponent spectra confirms this finding unambiguously. Substantial progress in the use of AES as a chemical-analysis technique is accomplished through the implementation of MLCFA. Chemical information from Auger depth profiles is extracted by investigating the variation of the line shape of the Auger signal as a function of the changing chemical state of the element. In particular, MLCFA combined with Auger depth profiling has been applied to problems related to steelcord-rubber tyre adhesion. MLCFA allows one to elucidate the precise nature of the interfacial layer of reaction products between natural rubber vulcanized on a thin brass layer. This study reveals many interesting chemical aspects of the oxi-sulfidation of brass undetectable with classical AES.
The impact of moderate wine consumption on the risk of developing prostate cancer.
Vartolomei, Mihai Dorin; Kimura, Shoji; Ferro, Matteo; Foerster, Beat; Abufaraj, Mohammad; Briganti, Alberto; Karakiewicz, Pierre I; Shariat, Shahrokh F
2018-01-01
To investigate the impact of moderate wine consumption on the risk of prostate cancer (PCa). We focused on the differential effect of moderate consumption of red versus white wine. This study was a meta-analysis that includes data from case-control and cohort studies. A systematic search of Web of Science, Medline/PubMed, and Cochrane library was performed on December 1, 2017. Studies were deemed eligible if they assessed the risk of PCa due to red, white, or any wine using multivariable logistic regression analysis. We performed a formal meta-analysis for the risk of PCa according to moderate wine and wine type consumption (white or red). Heterogeneity between studies was assessed using Cochrane's Q test and I 2 statistics. Publication bias was assessed using Egger's regression test. A total of 930 abstracts and titles were initially identified. After removal of duplicates, reviews, and conference abstracts, 83 full-text original articles were screened. Seventeen studies (611,169 subjects) were included for final evaluation and fulfilled the inclusion criteria. In the case of moderate wine consumption: the pooled risk ratio (RR) for the risk of PCa was 0.98 (95% CI 0.92-1.05, p =0.57) in the multivariable analysis. Moderate white wine consumption increased the risk of PCa with a pooled RR of 1.26 (95% CI 1.10-1.43, p =0.001) in the multi-variable analysis. Meanwhile, moderate red wine consumption had a protective role reducing the risk by 12% (RR 0.88, 95% CI 0.78-0.999, p =0.047) in the multivariable analysis that comprised 222,447 subjects. In this meta-analysis, moderate wine consumption did not impact the risk of PCa. Interestingly, regarding the type of wine, moderate consumption of white wine increased the risk of PCa, whereas moderate consumption of red wine had a protective effect. Further analyses are needed to assess the differential molecular effect of white and red wine conferring their impact on PCa risk.
NASA Astrophysics Data System (ADS)
Matthews, Q.; Jirasek, A.; Lum, J. J.; Brolo, A. G.
2011-11-01
This work applies noninvasive single-cell Raman spectroscopy (RS) and principal component analysis (PCA) to analyze and correlate radiation-induced biochemical changes in a panel of human tumour cell lines that vary by tissue of origin, p53 status and intrinsic radiosensitivity. Six human tumour cell lines, derived from prostate (DU145, PC3 and LNCaP), breast (MDA-MB-231 and MCF7) and lung (H460), were irradiated in vitro with single fractions (15, 30 or 50 Gy) of 6 MV photons. Remaining live cells were harvested for RS analysis at 0, 24, 48 and 72 h post-irradiation, along with unirradiated controls. Single-cell Raman spectra were acquired from 20 cells per sample utilizing a 785 nm excitation laser. All spectra (200 per cell line) were individually post-processed using established methods and the total data set for each cell line was analyzed with PCA using standard algorithms. One radiation-induced PCA component was detected for each cell line by identification of statistically significant changes in the PCA score distributions for irradiated samples, as compared to unirradiated samples, in the first 24-72 h post-irradiation. These RS response signatures arise from radiation-induced changes in cellular concentrations of aromatic amino acids, conformational protein structures and certain nucleic acid and lipid functional groups. Correlation analysis between the radiation-induced PCA components separates the cell lines into three distinct RS response categories: R1 (H460 and MCF7), R2 (MDA-MB-231 and PC3) and R3 (DU145 and LNCaP). These RS categories partially segregate according to radiosensitivity, as the R1 and R2 cell lines are radioresistant (SF2 > 0.6) and the R3 cell lines are radiosensitive (SF2 < 0.5). The R1 and R2 cell lines further segregate according to p53 gene status, corroborated by cell cycle analysis post-irradiation. Potential radiation-induced biochemical response mechanisms underlying our RS observations are proposed, such as (1) the regulated synthesis and degradation of structured proteins and (2) the expression of anti-apoptosis factors or other survival signals. This study demonstrates the utility of RS for noninvasive radiobiological analysis of tumour cell radiation response, and indicates the potential for future RS studies designed to investigate, monitor or predict radiation response.
Investigation of Inversion Polymorphisms in the Human Genome Using Principal Components Analysis
Ma, Jianzhong; Amos, Christopher I.
2012-01-01
Despite the significant advances made over the last few years in mapping inversions with the advent of paired-end sequencing approaches, our understanding of the prevalence and spectrum of inversions in the human genome has lagged behind other types of structural variants, mainly due to the lack of a cost-efficient method applicable to large-scale samples. We propose a novel method based on principal components analysis (PCA) to characterize inversion polymorphisms using high-density SNP genotype data. Our method applies to non-recurrent inversions for which recombination between the inverted and non-inverted segments in inversion heterozygotes is suppressed due to the loss of unbalanced gametes. Inside such an inversion region, an effect similar to population substructure is thus created: two distinct “populations” of inversion homozygotes of different orientations and their 1∶1 admixture, namely the inversion heterozygotes. This kind of substructure can be readily detected by performing PCA locally in the inversion regions. Using simulations, we demonstrated that the proposed method can be used to detect and genotype inversion polymorphisms using unphased genotype data. We applied our method to the phase III HapMap data and inferred the inversion genotypes of known inversion polymorphisms at 8p23.1 and 17q21.31. These inversion genotypes were validated by comparing with literature results and by checking Mendelian consistency using the family data whenever available. Based on the PCA-approach, we also performed a preliminary genome-wide scan for inversions using the HapMap data, which resulted in 2040 candidate inversions, 169 of which overlapped with previously reported inversions. Our method can be readily applied to the abundant SNP data, and is expected to play an important role in developing human genome maps of inversions and exploring associations between inversions and susceptibility of diseases. PMID:22808122
Poniah, Prevathe; Mohd Zain, Shamsul; Abdul Razack, Azad Hassan; Kuppusamy, Shanggar; Karuppayah, Shankar; Sian Eng, Hooi; Mohamed, Zahurin
2017-09-01
Two key issues in prostate cancer (PCa) that demand attention currently are the need for a more precise and minimally invasive screening test owing to the inaccuracy of prostate-specific antigen and differential diagnosis to distinguish advanced vs. indolent cancers. This continues to pose a tremendous challenge in diagnosis and prognosis of PCa and could potentially lead to overdiagnosis and overtreatment complications. Copy number variations (CNVs) in the human genome have been linked to various carcinomas including PCa. Detection of these variants may improve clinical treatment as well as an understanding of the pathobiology underlying this complex disease. To this end, we undertook a pilot genome-wide CNV analysis approach in 36 subjects (18 patients with high-grade PCa and 18 controls that were matched by age and ethnicity) in search of more accurate biomarkers that could potentially explain susceptibility toward high-grade PCa. We conducted this study using the array comparative genomic hybridization technique. Array results were validated in 92 independent samples (46 high-grade PCa, 23 benign prostatic hyperplasia, and 23 healthy controls) using polymerase chain reaction-based copy number counting method. A total of 314 CNV regions were found to be unique to PCa subjects in this cohort (P<0.05). A log 2 ratio-based copy number analysis revealed 5 putative rare or novel CNV loci or both associated with susceptibility to PCa. The CNV gain regions were 1q21.3, 15q15, 7p12.1, and a novel CNV in PCa 12q23.1, harboring ARNT, THBS1, SLC5A8, and DDC genes that are crucial in the p53 and cancer pathways. A CNV loss and deletion event was observed at 8p11.21, which contains the SFRP1 gene from the Wnt signaling pathway. Cross-comparison analysis with genes associated to PCa revealed significant CNVs involved in biological processes that elicit cancer pathogenesis via cytokine production and endothelial cell proliferation. In conclusion, we postulated that the CNVs identified in this study could provide an insight into the development of advanced PCa. Copyright © 2017 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Vile, Douglas J.
In radiation therapy, interfraction organ motion introduces a level of geometric uncertainty into the planning process. Plans, which are typically based upon a single instance of anatomy, must be robust against daily anatomical variations. For this problem, a model of the magnitude, direction, and likelihood of deformation is useful. In this thesis, principal component analysis (PCA) is used to statistically model the 3D organ motion for 19 prostate cancer patients, each with 8-13 fractional computed tomography (CT) images. Deformable image registration and the resultant displacement vector fields (DVFs) are used to quantify the interfraction systematic and random motion. By applying the PCA technique to the random DVFs, principal modes of random tissue deformation were determined for each patient, and a method for sampling synthetic random DVFs was developed. The PCA model was then extended to describe the principal modes of systematic and random organ motion for the population of patients. A leave-one-out study tested both the systematic and random motion model's ability to represent PCA training set DVFs. The random and systematic DVF PCA models allowed the reconstruction of these data with absolute mean errors between 0.5-0.9 mm and 1-2 mm, respectively. To the best of the author's knowledge, this study is the first successful effort to build a fully 3D statistical PCA model of systematic tissue deformation in a population of patients. By sampling synthetic systematic and random errors, organ occupancy maps were created for bony and prostate-centroid patient setup processes. By thresholding these maps, PCA-based planning target volume (PTV) was created and tested against conventional margin recipes (van Herk for bony alignment and 5 mm fixed [3 mm posterior] margin for centroid alignment) in a virtual clinical trial for low-risk prostate cancer. Deformably accumulated delivered dose served as a surrogate for clinical outcome. For the bony landmark setup subtrial, the PCA PTV significantly (p<0.05) reduced D30, D20, and D5 to bladder and D50 to rectum, while increasing rectal D20 and D5. For the centroid-aligned setup, the PCA PTV significantly reduced all bladder DVH metrics and trended to lower rectal toxicity metrics. All PTVs covered the prostate with the prescription dose.
Tang, Jingyuan; Xu, Lingyan; Xu, Haoxiang; Li, Ran; Han, Peng; Yang, Haiwei
2017-01-01
Previous studies have investigated the association between NAT2 polymorphism and the risk of prostate cancer (PCa). However, the findings from these studies remained inconsistent. Hence, we performed a meta-analysis to provide a more reliable conclusion about such associations. In the present meta-analysis, 13 independent case-control studies were included with a total of 14,469 PCa patients and 10,689 controls. All relevant studies published were searched in the databates PubMed, EMBASE, and Web of Science, till March 1st, 2017. We used the pooled odds ratios (ORs) with 95% confidence intervals (CIs) to evaluate the strength of the association between NAT2*4 allele and susceptibility to PCa. Subgroup analysis was carried out by ethnicity, source of controls and genotyping method. What's more, we also performed trial sequential analysis (TSA) to reduce the risk of type I error and evaluate whether the evidence of the results was firm. Firstly, our results indicated that NAT2*4 allele was not associated with PCa susceptibility (OR = 1.00, 95% CI= 0.95–1.05; P = 0.100). However, after excluding two studies for its heterogeneity and publication bias, no significant relationship was also detected between NAT2*4 allele and the increased risk of PCa, in fixed-effect model (OR = 0.99, 95% CI= 0.94–1.04; P = 0.451). Meanwhile, no significant increased risk of PCa was found in the subgroup analyses by ethnicity, source of controls and genotyping method. Moreover, TSA demonstrated that such association was confirmed in the present study. Therefore, this meta-analysis suggested that no significant association between NAT2 polymorphism and the risk of PCa was found. PMID:28915684
Dou, MengMeng; Zhou, XueLiang; Fan, ZhiRui; Ding, XianFei; Li, LiFeng; Wang, ShuLing; Xue, Wenhua; Wang, Hui; Suo, Zhenhe; Deng, XiaoMing
2018-01-01
Retinoic acid receptor beta (RAR beta) is a retinoic acid receptor gene that has been shown to play key roles during multiple cancer processes, including cell proliferation, apoptosis, migration and invasion. Numerous studies have found that methylation of the RAR beta promoter contributed to the occurrence and development of malignant tumors. However, the connection between RAR beta promoter methylation and prostate cancer (PCa) remains unknown. This meta-analysis evaluated the clinical significance of RAR beta promoter methylation in PCa. We searched all published records relevant to RAR beta and PCa in a series of databases, including PubMed, Embase, Cochrane Library, ISI Web of Science and CNKI. The rates of RAR beta promoter methylation in the PCa and control groups (including benign prostatic hyperplasia and normal prostate tissues) were summarized. In addition, we evaluated the source region of available samples and the methods used to detect methylation. To compare the incidence and variation in RAR beta promoter methylation in PCa and non-PCa tissues, the odds ratio (OR) and 95% confidence interval (CI) were calculated accordingly. All the data were analyzed with the statistical software STATA 12.0. Based on the inclusion and exclusion criteria, 15 articles assessing 1,339 samples were further analyzed. These data showed that the RAR beta promoter methylation rates in PCa tissues were significantly higher than the rates in the non-PCa group (OR=21.65, 95% CI: 9.27-50.57). Subgroup analysis according to the source region of samples showed that heterogeneity in Asia was small (I2=0.0%, P=0.430). Additional subgroup analysis based on the method used to detect RAR beta promoter methylation showed that the heterogeneity detected by MSP (methylation-specific PCR) was relatively small (I2=11.3%, P=0.343). Although studies reported different rates for RAR beta promoter methylation in PCa tissues, the total analysis demonstrated that RAR beta promoter methylation may be correlated with PCa carcinogenesis and that the RAR beta gene is particularly susceptible. Additional studies with sufficient data are essential to further evaluate the clinical features and prognostic utility of RAR beta promoter methylation in PCa. © 2018 The Author(s). Published by S. Karger AG, Basel.
NASA Astrophysics Data System (ADS)
Yang, Jing; Wang, Cheng; Cai, Gan; Dong, Xiaona
2016-10-01
The incidence and mortality rate of the primary liver cancer are very high and its postoperative metastasis and recurrence have become important factors to the prognosis of patients. Circulating tumor cells (CTC), as a new tumor marker, play important roles in the early diagnosis and individualized treatment. This paper presents an effective method to distinguish liver cancer based on the cellular scattering spectrum, which is a non-fluorescence technique based on the fiber confocal microscopic spectrometer. Combining the principal component analysis (PCA) with back propagation (BP) neural network were utilized to establish an automatic recognition model for backscatter spectrum of the liver cancer cells from blood cell. PCA was applied to reduce the dimension of the scattering spectral data which obtained by the fiber confocal microscopic spectrometer. After dimensionality reduction by PCA, a neural network pattern recognition model with 2 input layer nodes, 11 hidden layer nodes, 3 output nodes was established. We trained the network with 66 samples and also tested it. Results showed that the recognition rate of the three types of cells is more than 90%, the relative standard deviation is only 2.36%. The experimental results showed that the fiber confocal microscopic spectrometer combining with the algorithm of PCA and BP neural network can automatically identify the liver cancer cell from the blood cells. This will provide a better tool for investigating the metastasis of liver cancers in vivo, the biology metabolic characteristics of liver cancers and drug transportation. Additionally, it is obviously referential in practical application.
NASA Astrophysics Data System (ADS)
Huang, C. L.; Hsu, N. S.
2015-12-01
This study develops a novel methodology to resolve the cause of typhoon-induced precipitation using principle component analysis (PCA) and to develop a long lead-time precipitation prediction model. The discovered spatial and temporal features of rainfall are utilized to develop a state-of-the-art descriptive statistical model which can be used to predict long lead-time precipitation during typhoons. The time series of 12-hour precipitation from different types of invasive moving track of typhoons are respectively precede the signal analytical process to qualify the causes of rainfall and to quantify affected degree of each induced cause. The causes include: (1) interaction between typhoon rain band and terrain; (2) co-movement effect induced by typhoon wind field with monsoon; (3) pressure gradient; (4) wind velocity; (5) temperature environment; (6) characteristic distance between typhoon center and surface target station; (7) distance between grade 7 storm radius and surface target station; and (8) relative humidity. The results obtained from PCA can detect the hidden pattern of the eight causes in space and time and can understand the future trends and changes of precipitation. This study applies the developed methodology in Taiwan Island which is constituted by complex diverse terrain formation and height. Results show that: (1) for the typhoon moving toward the direction of 245° to 330°, Causes (1), (2) and (6) are the primary ones to generate rainfall; and (2) for the direction of 330° to 380°, Causes (1), (4) and (6) are the primary ones. Besides, the developed precipitation prediction model by using PCA with the distributed moving track approach (PCA-DMT) is 32% more accurate by that of PCA without distributed moving track approach, and the former model can effectively achieve long lead-time precipitation prediction with an average predicted error of 13% within average 48 hours of forecasted lead-time.
Isolation of candidate genes for apomictic development in buffelgrass (Pennisetum ciliare).
Singh, Manjit; Burson, Byron L; Finlayson, Scott A
2007-08-01
Asexual reproduction through seeds, or apomixis, is a process that holds much promise for agricultural advances. However, the molecular mechanisms underlying apomixis are currently poorly understood. To identify genes related to female gametophyte development in apomictic ovaries of buffelgrass (Pennisetum ciliare (L.) Link), Suppression Subtractive Hybridization of ovary cDNA with leaf cDNA was performed. Through macroarray screening of subtracted cDNAs two genes were identified, Pca21 and Pca24, that showed differential expression between apomictic and sexual ovaries. Sequence analysis showed that both Pca21 and Pca24 are novel genes not previously characterized in plants. Pca21 shows homology to two wheat genes that are also expressed during reproductive development. Pca24 has similarity to coiled-coil-helix-coiled-coil-helix (CHCH) domain containing proteins from maize and sugarcane. Northern blot analysis revealed that both of these genes are expressed throughout female gametophyte development in apomictic ovaries. In situ hybridizations localized the transcript of these two genes to the developing embryo sacs in the apomictic ovaries. Based on the expression patterns it was concluded that Pca21 and Pca24 likely play a role during apomictic development in buffelgrass.
Cao, Zipei; Wei, Lijuan; Zhu, Weizhi; Yao, Xuping
2018-03-01
Reduction of cyclin-dependent kinase inhibitor 2A (CDKN2A) (p16 and p14) expression through DNA methylation has been reported in prostate cancer (PCa). This meta-analysis was conducted to assess the difference of p16 and p14 methylation between PCa and different histological types of nonmalignant controls and the correlation of p16 or p14 methylation with clinicopathological features of PCa. According to the preferred reporting items for systematic reviews and meta-analyses (PRISMA) statement criteria, articles were searched in PubMed, Embase, EBSCO, Wanfang, and CNKI databases. The strength of correlation was calculated by the pooled odds ratios (ORs) and their corresponding 95% confidence intervals (95% CIs). Trial sequential analysis (TSA) was used to estimate the required population information for significant results. A total of 20 studies published from 1997 to 2017 were identified in this meta-analysis, including 1140 PCa patients and 530 cases without cancer. Only p16 methylation in PCa was significantly higher than in benign prostatic lesions (OR = 4.72, P = .011), but had a similar level in PCa and adjacent tissues or high-grade prostatic intraepithelial neoplasias (HGPIN). TSA revealed that this analysis on p16 methylation is a false positive result in cancer versus benign prostatic lesions (the estimated required information size of 5116 participants). p16 methylation was not correlated with PCa in the urine and blood. Besides, p16 methylation was not linked to clinical stage, prostate-specific antigen (PSA) level, and Gleason score (GS) of patients with PCa. p14 methylation was not correlated with PCa in tissue and urine samples. No correlation was observed between p14 methylation and clinical stage or GS. CDKN2A mutation and copy number alteration were not associated with prognosis of PCa in overall survival and disease-free survival. CDKN2A expression was not correlated with the prognosis of PCa in overall survival (492 cases) (P > .1), while CDKN2A expression was significantly associated with a poor disease-free survival (P < .01). CDKN2A methylation may not be significantly associated with the development, progression of PCa. Although CDKN2A expression had an unfavorable prognosis in disease-free survival. More studies are needed to confirm our results.
Shaikhibrahim, Zaki; Lindstrot, Andreas; Ochsenfahrt, Jacqueline; Fuchs, Kerstin; Wernert, Nicolas
2013-01-01
Epigenetic changes have been suggested to drive prostate cancer (PCa) development and progression. Therefore, in this study, we aimed to identify novel epigenetics-related genes in PCa tissues, and to examine their expression in metastatic PCa cell lines. We analyzed the expression of epigenetics-related genes via a clustering analysis based on gene function in moderately and poorly differentiated PCa glands compared to normal glands of the peripheral zone (prostate proper) from PCa patients using Whole Human Genome Oligo Microarrays. Our analysis identified 12 epigenetics-related genes with a more than 2-fold increase or decrease in expression and a p-value <0.01. In modera-tely differentiated tumors compared to normal glands of the peripheral zone, we found the genes, TDRD1, IGF2, DICER1, ADARB1, HILS1, GLMN and TRIM27, to be upregulated, whereas TNRC6A and DGCR8 were found to be downregulated. In poorly differentiated tumors, we found TDRD1, ADARB and RBM3 to be upregulated, whereas DGCR8, PIWIL2 and BC069781 were downregulated. Our analysis of the expression level for each gene in the metastatic androgen-sensitive VCaP and LNCaP, and -insensitive PC3 and DU-145 PCa cell lines revealed differences in expression among the cell lines which may reflect the different biological properties of each cell line, and the potential role of each gene at different metastatic sites. The novel epigenetics-related genes that we identified in primary PCa tissues may provide further insight into the role that epigenetic changes play in PCa. Moreover, some of the genes that we identified may play important roles in primary PCa and metastasis, in primary PCa only, or in metastasis only. Follow-up studies are required to investigate the functional role and the role that the expression of these genes play in the outcome and progression of PCa using tissue microarrays.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Steenbergen, K. G., E-mail: kgsteen@gmail.com; Gaston, N.
2014-02-14
Inspired by methods of remote sensing image analysis, we analyze structural variation in cluster molecular dynamics (MD) simulations through a unique application of the principal component analysis (PCA) and Pearson Correlation Coefficient (PCC). The PCA analysis characterizes the geometric shape of the cluster structure at each time step, yielding a detailed and quantitative measure of structural stability and variation at finite temperature. Our PCC analysis captures bond structure variation in MD, which can be used to both supplement the PCA analysis as well as compare bond patterns between different cluster sizes. Relying only on atomic position data, without requirement formore » a priori structural input, PCA and PCC can be used to analyze both classical and ab initio MD simulations for any cluster composition or electronic configuration. Taken together, these statistical tools represent powerful new techniques for quantitative structural characterization and isomer identification in cluster MD.« less
Steenbergen, K G; Gaston, N
2014-02-14
Inspired by methods of remote sensing image analysis, we analyze structural variation in cluster molecular dynamics (MD) simulations through a unique application of the principal component analysis (PCA) and Pearson Correlation Coefficient (PCC). The PCA analysis characterizes the geometric shape of the cluster structure at each time step, yielding a detailed and quantitative measure of structural stability and variation at finite temperature. Our PCC analysis captures bond structure variation in MD, which can be used to both supplement the PCA analysis as well as compare bond patterns between different cluster sizes. Relying only on atomic position data, without requirement for a priori structural input, PCA and PCC can be used to analyze both classical and ab initio MD simulations for any cluster composition or electronic configuration. Taken together, these statistical tools represent powerful new techniques for quantitative structural characterization and isomer identification in cluster MD.
Bagnasco, Lucia; Zotti, Mirca; Sitta, Nicola; Oliveri, Paolo
2015-11-01
Mycophilic fungi of anamorphic genus Sepedonium (telomorphs in Hypomyces, Hypocreales, Ascomycota) infect and parasitize sporomata of boletes. The obligated hosts such as Boletus edulis and allied species (known as "porcini mushrooms") are among the most valued and prized edible wild mushrooms in the world. Sepedonium infections have a great morphological variability: at the initial state, contaminated mushrooms present a white coating covering tubes and pores; at the final state, Sepedonium forms a deep and thick hyphal layer that eventually leads to the total necrosis of the host. Up to date, Sepedonium infections in porcini mushrooms have been evaluated only through macroscopic and microscopic visual analysis. In this study, in order to implement the infection evaluation as a routine methodology for industrial purposes, the potential application of Hyperspectral Imaging (HSI) and Principal Component Analysis (PCA) for detection of Sepedonium presence on sliced and dried B. edulis and allied species was investigated. Hyperspectral images were obtained using a pushbroom line-scanning HSI instrument, operating in the wavelength range between 400 and 1000 nm with 5 nm resolution. PCA was applied on normal and contaminated samples. To reduce the spectral variability caused by factors unrelated to Sepedonium infection, such as scattering effects and differences in sample height, different spectral pre-treatments were applied. A supervised rule was then developed to assign spectra recorded on new test samples to each of the two classes, based on the PC scores. This allowed to visualize directly - within false-color images of test samples - which points of the samples were contaminated. The results achieved may lead to the development of a non-destructive monitoring system for a rapid on-line screening of contaminated mushrooms. Copyright © 2015 Elsevier B.V. All rights reserved.
Inter-comparison of receptor models for PM source apportionment: Case study in an industrial area
NASA Astrophysics Data System (ADS)
Viana, M.; Pandolfi, M.; Minguillón, M. C.; Querol, X.; Alastuey, A.; Monfort, E.; Celades, I.
2008-05-01
Receptor modelling techniques are used to identify and quantify the contributions from emission sources to the levels and major and trace components of ambient particulate matter (PM). A wide variety of receptor models are currently available, and consequently the comparability between models should be evaluated if source apportionment data are to be used as input in health effects studies or mitigation plans. Three of the most widespread receptor models (principal component analysis, PCA; positive matrix factorization, PMF; chemical mass balance, CMB) were applied to a single PM10 data set (n=328 samples, 2002-2005) obtained from an industrial area in NE Spain, dedicated to ceramic production. Sensitivity and temporal trend analyses (using the Mann-Kendall test) were applied. Results evidenced the good overall performance of the three models (r2>0.83 and α>0.91×between modelled and measured PM10 mass), with a good agreement regarding source identification and high correlations between input (CMB) and output (PCA, PMF) source profiles. Larger differences were obtained regarding the quantification of source contributions (up to a factor of 4 in some cases). The combined application of different types of receptor models would solve the limitations of each of the models, by constructing a more robust solution based on their strengths. The authors suggest the combined use of factor analysis techniques (PCA, PMF) to identify and interpret emission sources, and to obtain a first quantification of their contributions to the PM mass, and the subsequent application of CMB. Further research is needed to ensure that source apportionment methods are robust enough for application to PM health effects assessments.
Thomas-Jardin, Shayna E; Kanchwala, Mohammed S; Jacob, Joan; Merchant, Sana; Meade, Rachel K; Gahnim, Nagham M; Nawas, Afshan F; Xing, Chao; Delk, Nikki A
2018-06-01
In immunosurveillance, bone-derived immune cells infiltrate the tumor and secrete inflammatory cytokines to destroy cancer cells. However, cancer cells have evolved mechanisms to usurp inflammatory cytokines to promote tumor progression. In particular, the inflammatory cytokine, interleukin-1 (IL-1), is elevated in prostate cancer (PCa) patient tissue and serum, and promotes PCa bone metastasis. IL-1 also represses androgen receptor (AR) accumulation and activity in PCa cells, yet the cells remain viable and tumorigenic; suggesting that IL-1 may also contribute to AR-targeted therapy resistance. Furthermore, IL-1 and AR protein levels negatively correlate in PCa tumor cells. Taken together, we hypothesize that IL-1 reprograms AR positive (AR + ) PCa cells into AR negative (AR - ) PCa cells that co-opt IL-1 signaling to ensure AR-independent survival and tumor progression in the inflammatory tumor microenvironment. LNCaP and PC3 PCa cells were treated with IL-1β or HS-5 bone marrow stromal cell (BMSC) conditioned medium and analyzed by RNA sequencing and RT-QPCR. To verify genes identified by RNA sequencing, LNCaP, MDA-PCa-2b, PC3, and DU145 PCa cell lines were treated with the IL-1 family members, IL-1α or IL-1β, or exposed to HS-5 BMSC in the presence or absence of Interleukin-1 Receptor Antagonist (IL-1RA). Treated cells were analyzed by western blot and/or RT-QPCR. Comparative analysis of sequencing data from the AR + LNCaP PCa cell line versus the AR - PC3 PCa cell line reveals an IL-1-conferred gene suite in LNCaP cells that is constitutive in PC3 cells. Bioinformatics analysis of the IL-1 regulated gene suite revealed that inflammatory and immune response pathways are primarily elicited; likely facilitating PCa cell survival and tumorigenicity in an inflammatory tumor microenvironment. Our data supports that IL-1 reprograms AR + PCa cells to mimic AR - PCa gene expression patterns that favor AR-targeted treatment resistance and cell survival. © 2018 Wiley Periodicals, Inc.
PCA-LBG-based algorithms for VQ codebook generation
NASA Astrophysics Data System (ADS)
Tsai, Jinn-Tsong; Yang, Po-Yuan
2015-04-01
Vector quantisation (VQ) codebooks are generated by combining principal component analysis (PCA) algorithms with Linde-Buzo-Gray (LBG) algorithms. All training vectors are grouped according to the projected values of the principal components. The PCA-LBG-based algorithms include (1) PCA-LBG-Median, which selects the median vector of each group, (2) PCA-LBG-Centroid, which adopts the centroid vector of each group, and (3) PCA-LBG-Random, which randomly selects a vector of each group. The LBG algorithm finds a codebook based on the better vectors sent to an initial codebook by the PCA. The PCA performs an orthogonal transformation to convert a set of potentially correlated variables into a set of variables that are not linearly correlated. Because the orthogonal transformation efficiently distinguishes test image vectors, the proposed PCA-LBG-based algorithm is expected to outperform conventional algorithms in designing VQ codebooks. The experimental results confirm that the proposed PCA-LBG-based algorithms indeed obtain better results compared to existing methods reported in the literature.
NASA Astrophysics Data System (ADS)
Dan, Luo; Ohya, Jun
2010-02-01
Recognizing hand gestures from the video sequence acquired by a dynamic camera could be a useful interface between humans and mobile robots. We develop a state based approach to extract and recognize hand gestures from moving camera images. We improved Human-Following Local Coordinate (HFLC) System, a very simple and stable method for extracting hand motion trajectories, which is obtained from the located human face, body part and hand blob changing factor. Condensation algorithm and PCA-based algorithm was performed to recognize extracted hand trajectories. In last research, this Condensation Algorithm based method only applied for one person's hand gestures. In this paper, we propose a principal component analysis (PCA) based approach to improve the recognition accuracy. For further improvement, temporal changes in the observed hand area changing factor are utilized as new image features to be stored in the database after being analyzed by PCA. Every hand gesture trajectory in the database is classified into either one hand gesture categories, two hand gesture categories, or temporal changes in hand blob changes. We demonstrate the effectiveness of the proposed method by conducting experiments on 45 kinds of sign language based Japanese and American Sign Language gestures obtained from 5 people. Our experimental recognition results show better performance is obtained by PCA based approach than the Condensation algorithm based method.
Multivariate Statistical Analysis of MSL APXS Bulk Geochemical Data
NASA Astrophysics Data System (ADS)
Hamilton, V. E.; Edwards, C. S.; Thompson, L. M.; Schmidt, M. E.
2014-12-01
We apply cluster and factor analyses to bulk chemical data of 130 soil and rock samples measured by the Alpha Particle X-ray Spectrometer (APXS) on the Mars Science Laboratory (MSL) rover Curiosity through sol 650. Multivariate approaches such as principal components analysis (PCA), cluster analysis, and factor analysis compliment more traditional approaches (e.g., Harker diagrams), with the advantage of simultaneously examining the relationships between multiple variables for large numbers of samples. Principal components analysis has been applied with success to APXS, Pancam, and Mössbauer data from the Mars Exploration Rovers. Factor analysis and cluster analysis have been applied with success to thermal infrared (TIR) spectral data of Mars. Cluster analyses group the input data by similarity, where there are a number of different methods for defining similarity (hierarchical, density, distribution, etc.). For example, without any assumptions about the chemical contributions of surface dust, preliminary hierarchical and K-means cluster analyses clearly distinguish the physically adjacent rock targets Windjana and Stephen as being distinctly different than lithologies observed prior to Curiosity's arrival at The Kimberley. In addition, they are separated from each other, consistent with chemical trends observed in variation diagrams but without requiring assumptions about chemical relationships. We will discuss the variation in cluster analysis results as a function of clustering method and pre-processing (e.g., log transformation, correction for dust cover) and implications for interpreting chemical data. Factor analysis shares some similarities with PCA, and examines the variability among observed components of a dataset so as to reveal variations attributable to unobserved components. Factor analysis has been used to extract the TIR spectra of components that are typically observed in mixtures and only rarely in isolation; there is the potential for similar results with data from APXS. These techniques offer new ways to understand the chemical relationships between the materials interrogated by Curiosity, and potentially their relation to materials observed by APXS instruments on other landed missions.
3D Shape Perception in Posterior Cortical Atrophy: A Visual Neuroscience Perspective
Gillebert, Céline R.; Schaeverbeke, Jolien; Bastin, Christine; Neyens, Veerle; Bruffaerts, Rose; De Weer, An-Sofie; Seghers, Alexandra; Sunaert, Stefan; Van Laere, Koen; Versijpt, Jan; Vandenbulcke, Mathieu; Salmon, Eric; Todd, James T.; Orban, Guy A.
2015-01-01
Posterior cortical atrophy (PCA) is a rare focal neurodegenerative syndrome characterized by progressive visuoperceptual and visuospatial deficits, most often due to atypical Alzheimer's disease (AD). We applied insights from basic visual neuroscience to analyze 3D shape perception in humans affected by PCA. Thirteen PCA patients and 30 matched healthy controls participated, together with two patient control groups with diffuse Lewy body dementia (DLBD) and an amnestic-dominant phenotype of AD, respectively. The hierarchical study design consisted of 3D shape processing for 4 cues (shading, motion, texture, and binocular disparity) with corresponding 2D and elementary feature extraction control conditions. PCA and DLBD exhibited severe 3D shape-processing deficits and AD to a lesser degree. In PCA, deficient 3D shape-from-shading was associated with volume loss in the right posterior inferior temporal cortex. This region coincided with a region of functional activation during 3D shape-from-shading in healthy controls. In PCA patients who performed the same fMRI paradigm, response amplitude during 3D shape-from-shading was reduced in this region. Gray matter volume in this region also correlated with 3D shape-from-shading in AD. 3D shape-from-disparity in PCA was associated with volume loss slightly more anteriorly in posterior inferior temporal cortex as well as in ventral premotor cortex. The findings in right posterior inferior temporal cortex and right premotor cortex are consistent with neurophysiologically based models of the functional anatomy of 3D shape processing. However, in DLBD, 3D shape deficits rely on mechanisms distinct from inferior temporal structural integrity. SIGNIFICANCE STATEMENT Posterior cortical atrophy (PCA) is a neurodegenerative syndrome characterized by progressive visuoperceptual dysfunction and most often an atypical presentation of Alzheimer's disease (AD) affecting the ventral and dorsal visual streams rather than the medial temporal system. We applied insights from fundamental visual neuroscience to analyze 3D shape perception in PCA. 3D shape-processing deficits were affected beyond what could be accounted for by lower-order processing deficits. For shading and disparity, this was related to volume loss in regions previously implicated in 3D shape processing in the intact human and nonhuman primate brain. Typical amnestic-dominant AD patients also exhibited 3D shape deficits. Advanced visual neuroscience provides insight into the pathogenesis of PCA that also bears relevance for vision in typical AD. PMID:26377458
Plaque echodensity and textural features are associated with histologic carotid plaque instability.
Doonan, Robert J; Gorgui, Jessica; Veinot, Jean P; Lai, Chi; Kyriacou, Efthyvoulos; Corriveau, Marc M; Steinmetz, Oren K; Daskalopoulou, Stella S
2016-09-01
Carotid plaque echodensity and texture features predict cerebrovascular symptomatology. Our purpose was to determine the association of echodensity and textural features obtained from a digital image analysis (DIA) program with histologic features of plaque instability as well as to identify the specific morphologic characteristics of unstable plaques. Patients scheduled to undergo carotid endarterectomy were recruited and underwent carotid ultrasound imaging. DIA was performed to extract echodensity and textural features using Plaque Texture Analysis software (LifeQ Medical Ltd, Nicosia, Cyprus). Carotid plaque surgical specimens were obtained and analyzed histologically. Principal component analysis (PCA) was performed to reduce imaging variables. Logistic regression models were used to determine if PCA variables and individual imaging variables predicted histologic features of plaque instability. Image analysis data from 160 patients were analyzed. Individual imaging features of plaque echolucency and homogeneity were associated with a more unstable plaque phenotype on histology. These results were independent of age, sex, and degree of carotid stenosis. PCA reduced 39 individual imaging variables to five PCA variables. PCA1 and PCA2 were significantly associated with overall plaque instability on histology (both P = .02), whereas PCA3 did not achieve statistical significance (P = .07). DIA features of carotid plaques are associated with histologic plaque instability as assessed by multiple histologic features. Importantly, unstable plaques on histology appear more echolucent and homogeneous on ultrasound imaging. These results are independent of stenosis, suggesting that image analysis may have a role in refining the selection of patients who undergo carotid endarterectomy. Copyright © 2016 Society for Vascular Surgery. Published by Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Liu, Wen; Zhang, Yuying; Yang, Si; Han, Donghai
2018-05-01
A new technique to identify the floral resources of honeys is demanded. Terahertz time-domain attenuated total reflection spectroscopy combined with chemometrics methods was applied to discriminate different categorizes (Medlar honey, Vitex honey, and Acacia honey). Principal component analysis (PCA), cluster analysis (CA) and partial least squares-discriminant analysis (PLS-DA) have been used to find information of the botanical origins of honeys. Spectral range also was discussed to increase the precision of PLS-DA model. The accuracy of 88.46% for validation set was obtained, using PLS-DA model in 0.5-1.5 THz. This work indicated terahertz time-domain attenuated total reflection spectroscopy was an available approach to evaluate the quality of honey rapidly.
Identification of the isomers using principal component analysis (PCA) method
NASA Astrophysics Data System (ADS)
Kepceoǧlu, Abdullah; Gündoǧdu, Yasemin; Ledingham, Kenneth William David; Kilic, Hamdi Sukur
2016-03-01
In this work, we have carried out a detailed statistical analysis for experimental data of mass spectra from xylene isomers. Principle Component Analysis (PCA) was used to identify the isomers which cannot be distinguished using conventional statistical methods for interpretation of their mass spectra. Experiments have been carried out using a linear TOF-MS coupled to a femtosecond laser system as an energy source for the ionisation processes. We have performed experiments and collected data which has been analysed and interpreted using PCA as a multivariate analysis of these spectra. This demonstrates the strength of the method to get an insight for distinguishing the isomers which cannot be identified using conventional mass analysis obtained through dissociative ionisation processes on these molecules. The PCA results dependending on the laser pulse energy and the background pressure in the spectrometers have been presented in this work.
Nonlinear Peculiar-Velocity Analysis and PCA
NASA Astrophysics Data System (ADS)
Dekel, Avishai; Eldar, Amiram; Silberman, Lior; Zehavi, Idit
We allow for nonlinear effects in the likelihood analysis of peculiar velocities, and obtain ˜35%-lower values for the cosmological density parameter and for the amplitude of mass-density fluctuations. The power spectrum in the linear regime is assumed to be of the flat ΛCDM model (h = 0.65, n = 1) with only Ω_m free. Since the likelihood is driven by the nonlinear regime, we "break" the power spectrum at k_b˜ 0.2 (h^{-1}Mpc)^{-1} and fit a two-parameter power-law at k > k b . This allows for an unbiased fit in the linear regime. Tests using improved mock catalogs demonstrate a reduced bias and a better fit. We find for the Mark III and SFI data Ω_m = 0.35± 0.09 with σ_8Ω_m^{0.6} = 0.55± 0.10 (90% errors). When allowing deviations from ΛCDM, we find an indication for a wiggle in the power spectrum in the form of an excess near k ˜ 0.05 and a deficiency at k ˜ 0.1 (h^{-1}Mpc)^{-1} - a "cold flow" which may be related to a feature indicated from redshift surveys and the second peak in the CMB anisotropy. A χ^2 test applied to principal modes demonstrates that the nonlinear procedure improves the goodness of fit. The Principal Component Analysis (PCA) helps identifying spatial features of the data and fine-tuning the theoretical and error models. We address the potential for optimal data compression using PCA.
Common mode error in Antarctic GPS coordinate time series on its effect on bedrock-uplift estimates
NASA Astrophysics Data System (ADS)
Liu, Bin; King, Matt; Dai, Wujiao
2018-05-01
Spatially-correlated common mode error always exists in regional, or-larger, GPS networks. We applied independent component analysis (ICA) to GPS vertical coordinate time series in Antarctica from 2010 to 2014 and made a comparison with the principal component analysis (PCA). Using PCA/ICA, the time series can be decomposed into a set of temporal components and their spatial responses. We assume the components with common spatial responses are common mode error (CME). An average reduction of ˜40% about the RMS values was achieved in both PCA and ICA filtering. However, the common mode components obtained from the two approaches have different spatial and temporal features. ICA time series present interesting correlations with modeled atmospheric and non-tidal ocean loading displacements. A white noise (WN) plus power law noise (PL) model was adopted in the GPS velocity estimation using maximum likelihood estimation (MLE) analysis, with ˜55% reduction of the velocity uncertainties after filtering using ICA. Meanwhile, spatiotemporal filtering reduces the amplitude of PL and periodic terms in the GPS time series. Finally, we compare the GPS uplift velocities, after correction for elastic effects, with recent models of glacial isostatic adjustment (GIA). The agreements of the GPS observed velocities and four GIA models are generally improved after the spatiotemporal filtering, with a mean reduction of ˜0.9 mm/yr of the WRMS values, possibly allowing for more confident separation of various GIA model predictions.
NASA Astrophysics Data System (ADS)
Babanova, Sofia; Artyushkova, Kateryna; Ulyanova, Yevgenia; Singhal, Sameer; Atanassov, Plamen
2014-01-01
Two statistical methods, design of experiments (DOE) and principal component analysis (PCA) are employed to investigate and improve performance of air-breathing gas-diffusional enzymatic electrodes. DOE is utilized as a tool for systematic organization and evaluation of various factors affecting the performance of the composite system. Based on the results from the DOE, an improved cathode is constructed. The current density generated utilizing the improved cathode (755 ± 39 μA cm-2 at 0.3 V vs. Ag/AgCl) is 2-5 times higher than the highest current density previously achieved. Three major factors contributing to the cathode performance are identified: the amount of enzyme, the volume of phosphate buffer used to immobilize the enzyme, and the thickness of the gas-diffusion layer (GDL). PCA is applied as an independent confirmation tool to support conclusions made by DOE and to visualize the contribution of factors in individual cathode configurations.
Busetto, Gian Maria; De Berardinis, Ettore; Sciarra, Alessandro; Panebianco, Valeria; Giovannone, Riccardo; Rosato, Stefano; D'Errigo, Paola; Di Silverio, Franco; Gentile, Vincenzo; Salciccia, Stefano
2013-12-01
To overcome the well-known prostate-specific antigen limits, several new biomarkers have been proposed. Since its introduction in clinical practice, the urinary prostate cancer gene 3 (PCA3) assay has shown promising results for prostate cancer (PC) detection. Furthermore, multiparametric magnetic resonance imaging (mMRI) has the ability to better describe several aspects of PC. A prospective study of 171 patients with negative prostate biopsy findings and a persistent high prostate-specific antigen level was conducted to assess the role of mMRI and PCA3 in identifying PC. All patients underwent the PCA3 test and mMRI before a second transrectal ultrasound-guided prostate biopsy. The accuracy and reliability of PCA3 (3 different cutoff points) and mMRI were evaluated. Four multivariate logistic regression models were analyzed, in terms of discrimination and the cost benefit, to assess the clinical role of PCA3 and mMRI in predicting the biopsy outcome. A decision curve analysis was also plotted. Repeated transrectal ultrasound-guided biopsy identified 68 new cases (41.7%) of PC. The sensitivity and specificity of the PCA3 test and mMRI was 68% and 49% and 74% and 90%, respectively. Evaluating the regression models, the best discrimination (area under the curve 0.808) was obtained using the full model (base clinical model plus mMRI and PCA3). The decision curve analysis, to evaluate the cost/benefit ratio, showed good performance in predicting PC with the model that included mMRI and PCA3. mMRI increased the accuracy and sensitivity of the PCA3 test, and the use of the full model significantly improved the cost/benefit ratio, avoiding unnecessary biopsies. Copyright © 2013 Elsevier Inc. All rights reserved.
Duscharla, Divya; Bhumireddy, Sudarshana Reddy; Lakshetti, Sridhar; Pospisil, Heike; Murthy, P V L N; Walther, Reinhard; Sripadi, Prabhakar; Ummanni, Ramesh
2016-01-01
Prostate cancer (PCa) is one amongst the most common cancersin western men. Incidence rate ofPCa is on the rise worldwide. The present study deals with theserum lipidome profiling of patients diagnosed with PCa to identify potential new biomarkers. We employed ESI-MS/MS and GC-MS for identification of significantly altered lipids in cancer patient's serum compared to controls. Lipidomic data revealed 24 lipids are significantly altered in cancer patinet's serum (n = 18) compared to normal (n = 18) with no history of PCa. By using hierarchical clustering and principal component analysis (PCA) we could clearly separate cancer patients from control group. Correlation and partition analysis along with Formal Concept Analysis (FCA) have identified that PC (39:6) and FA (22:3) could classify samples with higher certainty. Both the lipids, PC (39:6) and FA (22:3) could influence the cataloging of patients with 100% sensitivity (all 18 control samples are classified correctly) and 77.7% specificity (of 18 tumor samples 4 samples are misclassified) with p-value of 1.612×10-6 in Fischer's exact test. Further, we performed GC-MS to denote fatty acids altered in PCa patients and found that alpha-linolenic acid (ALA) levels are altered in PCa. We also performed an in vitro proliferation assay to determine the effect of ALA in survival of classical human PCa cell lines LNCaP and PC3. We hereby report that the altered lipids PC (39:6) and FA (22:3) offer a new set of biomarkers in addition to the existing diagnostic tests that could significantly improve sensitivity and specificity in PCa diagnosis.
Duscharla, Divya; Bhumireddy, Sudarshana Reddy; Lakshetti, Sridhar; Pospisil, Heike; Murthy, P. V. L. N.; Walther, Reinhard; Sripadi, Prabhakar; Ummanni, Ramesh
2016-01-01
Prostate cancer (PCa) is one amongst the most common cancersin western men. Incidence rate ofPCa is on the rise worldwide. The present study deals with theserum lipidome profiling of patients diagnosed with PCa to identify potential new biomarkers. We employed ESI-MS/MS and GC-MS for identification of significantly altered lipids in cancer patient’s serum compared to controls. Lipidomic data revealed 24 lipids are significantly altered in cancer patinet’s serum (n = 18) compared to normal (n = 18) with no history of PCa. By using hierarchical clustering and principal component analysis (PCA) we could clearly separate cancer patients from control group. Correlation and partition analysis along with Formal Concept Analysis (FCA) have identified that PC (39:6) and FA (22:3) could classify samples with higher certainty. Both the lipids, PC (39:6) and FA (22:3) could influence the cataloging of patients with 100% sensitivity (all 18 control samples are classified correctly) and 77.7% specificity (of 18 tumor samples 4 samples are misclassified) with p-value of 1.612×10−6 in Fischer’s exact test. Further, we performed GC-MS to denote fatty acids altered in PCa patients and found that alpha-linolenic acid (ALA) levels are altered in PCa. We also performed an in vitro proliferation assay to determine the effect of ALA in survival of classical human PCa cell lines LNCaP and PC3. We hereby report that the altered lipids PC (39:6) and FA (22:3) offer a new set of biomarkers in addition to the existing diagnostic tests that could significantly improve sensitivity and specificity in PCa diagnosis. PMID:26958841
Párta, László; Zalai, Dénes; Borbély, Sándor; Putics, Akos
2014-02-01
The application of dielectric spectroscopy was frequently investigated as an on-line cell culture monitoring tool; however, it still requires supportive data and experience in order to become a robust technique. In this study, dielectric spectroscopy was used to predict viable cell density (VCD) at industrially relevant high levels in concentrated fed-batch culture of Chinese hamster ovary cells producing a monoclonal antibody for pharmaceutical purposes. For on-line dielectric spectroscopy measurements, capacitance was scanned within a wide range of frequency values (100-19,490 kHz) in six parallel cell cultivation batches. Prior to detailed mathematical analysis of the collected data, principal component analysis (PCA) was applied to compare dielectric behavior of the cultivations. PCA analysis resulted in detecting measurement disturbances. By using the measured spectroscopic data, partial least squares regression (PLS), Cole-Cole, and linear modeling were applied and compared in order to predict VCD. The Cole-Cole and the PLS model provided reliable prediction over the entire cultivation including both the early and decline phases of cell growth, while the linear model failed to estimate VCD in the later, declining cultivation phase. In regards to the measurement error sensitivity, remarkable differences were shown among PLS, Cole-Cole, and linear modeling. VCD prediction accuracy could be improved in the runs with measurement disturbances by first derivative pre-treatment in PLS and by parameter optimization of the Cole-Cole modeling.
Scalco, Elisa; Rancati, Tiziana; Pirovano, Ileana; Mastropietro, Alfonso; Palorini, Federica; Cicchetti, Alessandro; Messina, Antonella; Avuzzi, Barbara; Valdagni, Riccardo; Rizzo, Giovanna
2018-04-01
To investigate the potential of texture analysis applied on T2-w and postcontrast T1-w images acquired before radiotherapy for prostate cancer (PCa) and 12 months after its completion in quantitatively characterizing local radiation effect on the muscular component of internal obturators, as organs potentially involved in urinary toxicity. T2-w and postcontrast T1-w MR images were acquired at 1.5 T before treatment (MRI1) and at 12 months of follow-up (MRI2) in 13 patients treated with radiotherapy for PCa. Right and left internal obturator muscle contours were manually delineated upon MRI1 and then automatically propagated on MRI2 by an elastic registration method. Planning CT images were coregistered to both MRIs and dose maps were deformed accordingly. A high-dose region receiving >55 Gy and a low-dose region receiving <55 Gy were identified in each muscle volume. Eighteen textural features were extracted from each region of interest and differences between MRI1 and MRI2 were evaluated. A signal increase was highlighted in both T2-w and T1-w images in the portion of the obturators near the prostate, i.e., in the region receiving medium-high doses. A change in the spatial organization was identified, as an increase in homogeneity and a decrease in contrast and complexity, compatible with an inflammatory status. In particular, the region receiving medium-high doses presented more significant or, at least, stronger differences. Texture analysis applied on T1-w and T2-w MR images has demonstrated its ability in quantitative evaluating radiation-induced changes in obturator muscles after PCa radiotherapy. © 2018 American Association of Physicists in Medicine.
Zakaria, Ammar; Shakaff, Ali Yeon Md; Masnan, Maz Jamilah; Ahmad, Mohd Noor; Adom, Abdul Hamid; Jaafar, Mahmad Nor; Ghani, Supri A.; Abdullah, Abu Hassan; Aziz, Abdul Hallis Abdul; Kamarudin, Latifah Munirah; Subari, Norazian; Fikri, Nazifah Ahmad
2011-01-01
The major compounds in honey are carbohydrates such as monosaccharides and disaccharides. The same compounds are found in cane-sugar concentrates. Unfortunately when sugar concentrate is added to honey, laboratory assessments are found to be ineffective in detecting this adulteration. Unlike tracing heavy metals in honey, sugar adulterated honey is much trickier and harder to detect, and traditionally it has been very challenging to come up with a suitable method to prove the presence of adulterants in honey products. This paper proposes a combination of array sensing and multi-modality sensor fusion that can effectively discriminate the samples not only based on the compounds present in the sample but also mimic the way humans perceive flavours and aromas. Conversely, analytical instruments are based on chemical separations which may alter the properties of the volatiles or flavours of a particular honey. The present work is focused on classifying 18 samples of different honeys, sugar syrups and adulterated samples using data fusion of electronic nose (e-nose) and electronic tongue (e-tongue) measurements. Each group of samples was evaluated separately by the e-nose and e-tongue. Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) were able to separately discriminate monofloral honey from sugar syrup, and polyfloral honey from sugar and adulterated samples using the e-nose and e-tongue. The e-nose was observed to give better separation compared to e-tongue assessment, particularly when LDA was applied. However, when all samples were combined in one classification analysis, neither PCA nor LDA were able to discriminate between honeys of different floral origins, sugar syrup and adulterated samples. By applying a sensor fusion technique, the classification for the 18 different samples was improved. Significant improvement was observed using PCA, while LDA not only improved the discrimination but also gave better classification. An improvement in performance was also observed using a Probabilistic Neural Network classifier when the e-nose and e-tongue data were fused. PMID:22164046
Zakaria, Ammar; Shakaff, Ali Yeon Md; Masnan, Maz Jamilah; Ahmad, Mohd Noor; Adom, Abdul Hamid; Jaafar, Mahmad Nor; Ghani, Supri A; Abdullah, Abu Hassan; Aziz, Abdul Hallis Abdul; Kamarudin, Latifah Munirah; Subari, Norazian; Fikri, Nazifah Ahmad
2011-01-01
The major compounds in honey are carbohydrates such as monosaccharides and disaccharides. The same compounds are found in cane-sugar concentrates. Unfortunately when sugar concentrate is added to honey, laboratory assessments are found to be ineffective in detecting this adulteration. Unlike tracing heavy metals in honey, sugar adulterated honey is much trickier and harder to detect, and traditionally it has been very challenging to come up with a suitable method to prove the presence of adulterants in honey products. This paper proposes a combination of array sensing and multi-modality sensor fusion that can effectively discriminate the samples not only based on the compounds present in the sample but also mimic the way humans perceive flavours and aromas. Conversely, analytical instruments are based on chemical separations which may alter the properties of the volatiles or flavours of a particular honey. The present work is focused on classifying 18 samples of different honeys, sugar syrups and adulterated samples using data fusion of electronic nose (e-nose) and electronic tongue (e-tongue) measurements. Each group of samples was evaluated separately by the e-nose and e-tongue. Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) were able to separately discriminate monofloral honey from sugar syrup, and polyfloral honey from sugar and adulterated samples using the e-nose and e-tongue. The e-nose was observed to give better separation compared to e-tongue assessment, particularly when LDA was applied. However, when all samples were combined in one classification analysis, neither PCA nor LDA were able to discriminate between honeys of different floral origins, sugar syrup and adulterated samples. By applying a sensor fusion technique, the classification for the 18 different samples was improved. Significant improvement was observed using PCA, while LDA not only improved the discrimination but also gave better classification. An improvement in performance was also observed using a Probabilistic Neural Network classifier when the e-nose and e-tongue data were fused.
Differences in chewing sounds of dry-crisp snacks by multivariate data analysis
NASA Astrophysics Data System (ADS)
De Belie, N.; Sivertsvik, M.; De Baerdemaeker, J.
2003-09-01
Chewing sounds of different types of dry-crisp snacks (two types of potato chips, prawn crackers, cornflakes and low calorie snacks from extruded starch) were analysed to assess differences in sound emission patterns. The emitted sounds were recorded by a microphone placed over the ear canal. The first bite and the first subsequent chew were selected from the time signal and a fast Fourier transformation provided the power spectra. Different multivariate analysis techniques were used for classification of the snack groups. This included principal component analysis (PCA) and unfold partial least-squares (PLS) algorithms, as well as multi-way techniques such as three-way PLS, three-way PCA (Tucker3), and parallel factor analysis (PARAFAC) on the first bite and subsequent chew. The models were evaluated by calculating the classification errors and the root mean square error of prediction (RMSEP) for independent validation sets. It appeared that the logarithm of the power spectra obtained from the chewing sounds could be used successfully to distinguish the different snack groups. When different chewers were used, recalibration of the models was necessary. Multi-way models distinguished better between chewing sounds of different snack groups than PCA on bite or chew separately and than unfold PLS. From all three-way models applied, N-PLS with three components showed the best classification capabilities, resulting in classification errors of 14-18%. The major amount of incorrect classifications was due to one type of potato chips that had a very irregular shape, resulting in a wide variation of the emitted sounds.
Wang, Wei; Heitschmidt, Gerald W; Windham, William R; Feldner, Peggy; Ni, Xinzhi; Chu, Xuan
2015-01-01
The feasibility of using a visible/near-infrared hyperspectral imaging system with a wavelength range between 400 and 1000 nm to detect and differentiate different levels of aflatoxin B1 (AFB1 ) artificially titrated on maize kernel surface was examined. To reduce the color effects of maize kernels, image analysis was limited to a subset of original spectra (600 to 1000 nm). Residual staining from the AFB1 on the kernels surface was selected as regions of interest for analysis. Principal components analysis (PCA) was applied to reduce the dimensionality of hyperspectral image data, and then a stepwise factorial discriminant analysis (FDA) was performed on latent PCA variables. The results indicated that discriminant factors F2 can be used to separate control samples from all of the other groups of kernels with AFB1 inoculated, whereas the discriminant factors F1 can be used to identify maize kernels with levels of AFB1 as low as 10 ppb. An overall classification accuracy of 98% was achieved. Finally, the peaks of β coefficients of the discrimination factors F1 and F2 were analyzed and several key wavelengths identified for differentiating maize kernels with and without AFB1 , as well as those with differing levels of AFB1 inoculation. Results indicated that Vis/NIR hyperspectral imaging technology combined with the PCA-FDA was a practical method to detect and differentiate different levels of AFB1 artificially inoculated on the maize kernels surface. However, indicated the potential to detect and differentiate naturally occurring toxins in maize kernel. © 2014 Institute of Food Technologists®
Guo, Zhiqiang; Wang, Huaiqing; Yang, Jie; Miller, David J
2015-01-01
In this paper, we propose and implement a hybrid model combining two-directional two-dimensional principal component analysis ((2D)2PCA) and a Radial Basis Function Neural Network (RBFNN) to forecast stock market behavior. First, 36 stock market technical variables are selected as the input features, and a sliding window is used to obtain the input data of the model. Next, (2D)2PCA is utilized to reduce the dimension of the data and extract its intrinsic features. Finally, an RBFNN accepts the data processed by (2D)2PCA to forecast the next day's stock price or movement. The proposed model is used on the Shanghai stock market index, and the experiments show that the model achieves a good level of fitness. The proposed model is then compared with one that uses the traditional dimension reduction method principal component analysis (PCA) and independent component analysis (ICA). The empirical results show that the proposed model outperforms the PCA-based model, as well as alternative models based on ICA and on the multilayer perceptron.
Guo, Zhiqiang; Wang, Huaiqing; Yang, Jie; Miller, David J.
2015-01-01
In this paper, we propose and implement a hybrid model combining two-directional two-dimensional principal component analysis ((2D)2PCA) and a Radial Basis Function Neural Network (RBFNN) to forecast stock market behavior. First, 36 stock market technical variables are selected as the input features, and a sliding window is used to obtain the input data of the model. Next, (2D)2PCA is utilized to reduce the dimension of the data and extract its intrinsic features. Finally, an RBFNN accepts the data processed by (2D)2PCA to forecast the next day's stock price or movement. The proposed model is used on the Shanghai stock market index, and the experiments show that the model achieves a good level of fitness. The proposed model is then compared with one that uses the traditional dimension reduction method principal component analysis (PCA) and independent component analysis (ICA). The empirical results show that the proposed model outperforms the PCA-based model, as well as alternative models based on ICA and on the multilayer perceptron. PMID:25849483
Azadeh, Ali; Sheikhalishahi, Mohammad
2014-01-01
Background A unique framework for performance optimization of generation companies (GENCOs) based on health, safety, environment, and ergonomics (HSEE) indicators is presented. Methods To rank this sector of industry, the combination of data envelopment analysis (DEA), principal component analysis (PCA), and Taguchi are used for all branches of GENCOs. These methods are applied in an integrated manner to measure the performance of GENCO. The preferred model between DEA, PCA, and Taguchi is selected based on sensitivity analysis and maximum correlation between rankings. To achieve the stated objectives, noise is introduced into input data. Results The results show that Taguchi outperforms other methods. Moreover, a comprehensive experiment is carried out to identify the most influential factor for ranking GENCOs. Conclusion The approach developed in this study could be used for continuous assessment and improvement of GENCO's performance in supplying energy with respect to HSEE factors. The results of such studies would help managers to have better understanding of weak and strong points in terms of HSEE factors. PMID:26106505
Yin, Yihang; Liu, Fengzheng; Zhou, Xiang; Li, Quanzhong
2015-08-07
Wireless sensor networks (WSNs) have been widely used to monitor the environment, and sensors in WSNs are usually power constrained. Because inner-node communication consumes most of the power, efficient data compression schemes are needed to reduce the data transmission to prolong the lifetime of WSNs. In this paper, we propose an efficient data compression model to aggregate data, which is based on spatial clustering and principal component analysis (PCA). First, sensors with a strong temporal-spatial correlation are grouped into one cluster for further processing with a novel similarity measure metric. Next, sensor data in one cluster are aggregated in the cluster head sensor node, and an efficient adaptive strategy is proposed for the selection of the cluster head to conserve energy. Finally, the proposed model applies principal component analysis with an error bound guarantee to compress the data and retain the definite variance at the same time. Computer simulations show that the proposed model can greatly reduce communication and obtain a lower mean square error than other PCA-based algorithms.
NASA Astrophysics Data System (ADS)
Li, Shaoxin; Zhang, Yanjiao; Xu, Junfa; Li, Linfang; Zeng, Qiuyao; Lin, Lin; Guo, Zhouyi; Liu, Zhiming; Xiong, Honglian; Liu, Songhao
2014-09-01
This study aims to present a noninvasive prostate cancer screening methods using serum surface-enhanced Raman scattering (SERS) and support vector machine (SVM) techniques through peripheral blood sample. SERS measurements are performed using serum samples from 93 prostate cancer patients and 68 healthy volunteers by silver nanoparticles. Three types of kernel functions including linear, polynomial, and Gaussian radial basis function (RBF) are employed to build SVM diagnostic models for classifying measured SERS spectra. For comparably evaluating the performance of SVM classification models, the standard multivariate statistic analysis method of principal component analysis (PCA) is also applied to classify the same datasets. The study results show that for the RBF kernel SVM diagnostic model, the diagnostic accuracy of 98.1% is acquired, which is superior to the results of 91.3% obtained from PCA methods. The receiver operating characteristic curve of diagnostic models further confirm above research results. This study demonstrates that label-free serum SERS analysis technique combined with SVM diagnostic algorithm has great potential for noninvasive prostate cancer screening.
Longobardi, Francesco; Innamorato, Valentina; Di Gioia, Annalisa; Ventrella, Andrea; Lippolis, Vincenzo; Logrieco, Antonio F; Catucci, Lucia; Agostiano, Angela
2017-12-15
Lentil samples coming from two different countries, i.e. Italy and Canada, were analysed using untargeted 1 H NMR fingerprinting in combination with chemometrics in order to build models able to classify them according to their geographical origin. For such aim, Soft Independent Modelling of Class Analogy (SIMCA), k-Nearest Neighbor (k-NN), Principal Component Analysis followed by Linear Discriminant Analysis (PCA-LDA) and Partial Least Squares-Discriminant Analysis (PLS-DA) were applied to the NMR data and the results were compared. The best combination of average recognition (100%) and cross-validation prediction abilities (96.7%) was obtained for the PCA-LDA. All the statistical models were validated both by using a test set and by carrying out a Monte Carlo Cross Validation: the obtained performances were found to be satisfying for all the models, with prediction abilities higher than 95% demonstrating the suitability of the developed methods. Finally, the metabolites that mostly contributed to the lentil discrimination were indicated. Copyright © 2017 Elsevier Ltd. All rights reserved.
Takegami, Shigehiko; Ueyama, Keita; Konishi, Atsuko; Kitade, Tatsuya
2018-06-06
The lipid fluidity of various lipid nanoemulsions (LNEs) without and with flutamide (FT) and containing one of two neutral lipids, one of four phosphatidylcholines as a surfactant, and sodium palmitate as a cosurfactant was investigated by the combination of 1 H nuclear magnetic resonance (NMR) spectroscopy and principal component analysis (PCA). In the 1 H NMR spectra, the peaks from the methylene groups of the neutral lipids and surfactants for all LNE preparations showed downfield shifts with increasing temperature from 20 to 60 °C. PCA was applied to the 1 H NMR spectral data obtained for the LNEs. The PCA resulted in a model in which the first two principal components (PCs) extracted 88% of the total spectral variation; the first PC (PC-1) axis and second PC (PC-2) axis accounted for 73 and 15%, respectively, of the total spectral variation. The Score-1 values for PC-1 plotted against temperature revealed the existence of two clusters, which were defined by the neutral lipid of the LNE preparations. Meanwhile, the Score-2 values decreased with rising temperature and reflected the increase in lipid fluidity of each LNE preparation, consistent with fluorescence anisotropy measurements. In addition, the changes of Score-2 values with temperature for LNE preparations with FT were smaller than those for LNE preparations without FT. This indicates that FT encapsulated in LNE particles markedly suppressed the increase in lipid fluidity of LNE particles with rising temperature. Thus, PCA of 1 H NMR spectra will become a powerful tool to analyze the lipid fluidity of lipid nanoparticles. Graphical abstract ᅟ.
Cole, Jacqueline M; Cheng, Xie; Payne, Michael C
2016-11-07
The use of principal component analysis (PCA) to statistically infer features of local structure from experimental pair distribution function (PDF) data is assessed on a case study of rare-earth phosphate glasses (REPGs). Such glasses, codoped with two rare-earth ions (R and R') of different sizes and optical properties, are of interest to the laser industry. The determination of structure-property relationships in these materials is an important aspect of their technological development. Yet, realizing the local structure of codoped REPGs presents significant challenges relative to their singly doped counterparts; specifically, R and R' are difficult to distinguish in terms of establishing relative material compositions, identifying atomic pairwise correlation profiles in a PDF that are associated with each ion, and resolving peak overlap of such profiles in PDFs. This study demonstrates that PCA can be employed to help overcome these structural complications, by statistically inferring trends in PDFs that exist for a restricted set of experimental data on REPGs, and using these as training data to predict material compositions and PDF profiles in unknown codoped REPGs. The application of these PCA methods to resolve individual atomic pairwise correlations in t(r) signatures is also presented. The training methods developed for these structural predictions are prevalidated by testing their ability to reproduce known physical phenomena, such as the lanthanide contraction, on PDF signatures of the structurally simpler singly doped REPGs. The intrinsic limitations of applying PCA to analyze PDFs relative to the quality control of source data, data processing, and sample definition, are also considered. While this case study is limited to lanthanide-doped REPGs, this type of statistical inference may easily be extended to other inorganic solid-state materials and be exploited in large-scale data-mining efforts that probe many t(r) functions.
NASA Astrophysics Data System (ADS)
Secmen, Mustafa
2011-10-01
This paper introduces the performance of an electromagnetic target recognition method in resonance scattering region, which includes pseudo spectrum Multiple Signal Classification (MUSIC) algorithm and principal component analysis (PCA) technique. The aim of this method is to classify an "unknown" target as one of the "known" targets in an aspect-independent manner. The suggested method initially collects the late-time portion of noise-free time-scattered signals obtained from different reference aspect angles of known targets. Afterward, these signals are used to obtain MUSIC spectrums in real frequency domain having super-resolution ability and noise resistant feature. In the final step, PCA technique is applied to these spectrums in order to reduce dimensionality and obtain only one feature vector per known target. In the decision stage, noise-free or noisy scattered signal of an unknown (test) target from an unknown aspect angle is initially obtained. Subsequently, MUSIC algorithm is processed for this test signal and resulting test vector is compared with feature vectors of known targets one by one. Finally, the highest correlation gives the type of test target. The method is applied to wire models of airplane targets, and it is shown that it can tolerate considerable noise levels although it has a few different reference aspect angles. Besides, the runtime of the method for a test target is sufficiently low, which makes the method suitable for real-time applications.
Identification and classification of upper limb motions using PCA.
Veer, Karan; Vig, Renu
2018-03-28
This paper describes the utility of principal component analysis (PCA) in classifying upper limb signals. PCA is a powerful tool for analyzing data of high dimension. Here, two different input strategies were explored. The first method uses upper arm dual-position-based myoelectric signal acquisition and the other solely uses PCA for classifying surface electromyogram (SEMG) signals. SEMG data from the biceps and the triceps brachii muscles and four independent muscle activities of the upper arm were measured in seven subjects (total dataset=56). The datasets used for the analysis are rotated by class-specific principal component matrices to decorrelate the measured data prior to feature extraction.
White-Al Habeeb, Nicole M A; Ho, Linh T; Olkhov-Mitsel, Ekaterina; Kron, Ken; Pethe, Vaijayanti; Lehman, Melanie; Jovanovic, Lidija; Fleshner, Neil; van der Kwast, Theodorus; Nelson, Colleen C; Bapat, Bharati
2014-09-15
Epigenetic silencing mediated by CpG methylation is a common feature of many cancers. Characterizing aberrant DNA methylation changes associated with tumor progression may identify potential prognostic markers for prostate cancer (PCa). We treated two PCa cell lines, 22Rv1 and DU-145 with the demethylating agent 5-Aza 2'-deoxycitidine (DAC) and global methylation status was analyzed by performing methylation-sensitive restriction enzyme based differential methylation hybridization strategy followed by genome-wide CpG methylation array profiling. In addition, we examined gene expression changes using a custom microarray. Gene Set Enrichment Analysis (GSEA) identified the most significantly dysregulated pathways. In addition, we assessed methylation status of candidate genes that showed reduced CpG methylation and increased gene expression after DAC treatment, in Gleason score (GS) 8 vs. GS6 patients using three independent cohorts of patients; the publically available The Cancer Genome Atlas (TCGA) dataset, and two separate patient cohorts. Our analysis, by integrating methylation and gene expression in PCa cell lines, combined with patient tumor data, identified novel potential biomarkers for PCa patients. These markers may help elucidate the pathogenesis of PCa and represent potential prognostic markers for PCa patients.
The impact of moderate wine consumption on the risk of developing prostate cancer
Ferro, Matteo; Foerster, Beat; Abufaraj, Mohammad; Briganti, Alberto; Karakiewicz, Pierre I; Shariat, Shahrokh F
2018-01-01
Objective To investigate the impact of moderate wine consumption on the risk of prostate cancer (PCa). We focused on the differential effect of moderate consumption of red versus white wine. Design This study was a meta-analysis that includes data from case–control and cohort studies. Materials and methods A systematic search of Web of Science, Medline/PubMed, and Cochrane library was performed on December 1, 2017. Studies were deemed eligible if they assessed the risk of PCa due to red, white, or any wine using multivariable logistic regression analysis. We performed a formal meta-analysis for the risk of PCa according to moderate wine and wine type consumption (white or red). Heterogeneity between studies was assessed using Cochrane’s Q test and I2 statistics. Publication bias was assessed using Egger’s regression test. Results A total of 930 abstracts and titles were initially identified. After removal of duplicates, reviews, and conference abstracts, 83 full-text original articles were screened. Seventeen studies (611,169 subjects) were included for final evaluation and fulfilled the inclusion criteria. In the case of moderate wine consumption: the pooled risk ratio (RR) for the risk of PCa was 0.98 (95% CI 0.92–1.05, p=0.57) in the multivariable analysis. Moderate white wine consumption increased the risk of PCa with a pooled RR of 1.26 (95% CI 1.10–1.43, p=0.001) in the multi-variable analysis. Meanwhile, moderate red wine consumption had a protective role reducing the risk by 12% (RR 0.88, 95% CI 0.78–0.999, p=0.047) in the multivariable analysis that comprised 222,447 subjects. Conclusions In this meta-analysis, moderate wine consumption did not impact the risk of PCa. Interestingly, regarding the type of wine, moderate consumption of white wine increased the risk of PCa, whereas moderate consumption of red wine had a protective effect. Further analyses are needed to assess the differential molecular effect of white and red wine conferring their impact on PCa risk. PMID:29713200
Delfino, Ines; Perna, Giuseppe; Lasalvia, Maria; Capozzi, Vito; Manti, Lorenzo; Camerlingo, Carlo; Lepore, Maria
2015-03-01
A micro-Raman spectroscopy investigation has been performed in vitro on single human mammary epithelial cells after irradiation by graded x-ray doses. The analysis by principal component analysis (PCA) and interval-PCA (i-PCA) methods has allowed us to point out the small differences in the Raman spectra induced by irradiation. This experimental approach has enabled us to delineate radiation-induced changes in protein, nucleic acid, lipid, and carbohydrate content. In particular, the dose dependence of PCA and i-PCA components has been analyzed. Our results have confirmed that micro-Raman spectroscopy coupled to properly chosen data analysis methods is a very sensitive technique to detect early molecular changes at the single-cell level following exposure to ionizing radiation. This would help in developing innovative approaches to monitor radiation cancer radiotherapy outcome so as to reduce the overall radiation dose and minimize damage to the surrounding healthy cells, both aspects being of great importance in the field of radiation therapy.
Whole milk intake is associated with prostate cancer-specific mortality among U.S. male physicians.
Song, Yan; Chavarro, Jorge E; Cao, Yin; Qiu, Weiliang; Mucci, Lorelei; Sesso, Howard D; Stampfer, Meir J; Giovannucci, Edward; Pollak, Michael; Liu, Simin; Ma, Jing
2013-02-01
Previous studies have associated higher milk intake with greater prostate cancer (PCa) incidence, but little data are available concerning milk types and the relation between milk intake and risk of fatal PCa. We investigated the association between intake of dairy products and the incidence and survival of PCa during a 28-y follow-up. We conducted a cohort study in the Physicians' Health Study (n = 21,660) and a survival analysis among the incident PCa cases (n = 2806). Information on dairy product consumption was collected at baseline. PCa cases and deaths (n = 305) were confirmed during follow-up. The intake of total dairy products was associated with increased PCa incidence [HR = 1.12 (95% CI: 0.93, 1.35); >2.5 servings/d vs. ≤0.5 servings/d]. Skim/low-fat milk intake was positively associated with risk of low-grade, early stage, and screen-detected cancers, whereas whole milk intake was associated only with fatal PCa [HR = 1.49 (95% CI: 0.97, 2.28); ≥237 mL/d (1 serving/d) vs. rarely consumed]. In the survival analysis, whole milk intake remained associated with risk of progression to fatal disease after diagnosis [HR = 2.17 (95% CI: 1.34, 3.51)]. In this prospective cohort, higher intake of skim/low-fat milk was associated with a greater risk of nonaggressive PCa. Most importantly, only whole milk was consistently associated with higher incidence of fatal PCa in the entire cohort and higher PCa-specific mortality among cases. These findings add further evidence to suggest the potential role of dairy products in the development and prognosis of PCa.
Activation of Beta-Catenin Signaling in Androgen Receptor–Negative Prostate Cancer Cells
Wan, Xinhai; Liu, Jie; Lu, Jing-Fang; Tzelepi, Vassiliki; Yang, Jun; Starbuck, Michael W.; Diao, Lixia; Wang, Jing; Efstathiou, Eleni; Vazquez, Elba S.; Troncoso, Patricia; Maity, Sankar N.; Navone, Nora M.
2012-01-01
Purpose To study Wnt/beta-catenin in castrate-resistant prostate cancer (CRPC) and understand its function independently of the beta-catenin–androgen receptor (AR) interaction. Experimental Design We performed beta-catenin immunocytochemical analysis, evaluated TOP-flash reporter activity (a reporter of beta-catenin–mediated transcription), and sequenced the beta-catenin gene in MDA PCa 118a, MDA PCa 118b, MDA PCa 2b, and PC-3 prostate cancer (PCa) cells. We knocked down beta-catenin in AR-negative MDA PCa 118b cells and performed comparative gene-array analysis. We also immunohistochemically analyzed beta-catenin and AR in 27 bone metastases of human CRPCs. Results Beta-catenin nuclear accumulation and TOP-flash reporter activity were high in MDA PCa 118b but not in MDA PCa 2b or PC-3 cells. MDA PCa 118a and 118b cells carry a mutated beta-catenin at codon 32 (D32G). Ten genes were expressed differently (false discovery rate, 0.05) in MDA PCa 118b cells with downregulated beta-catenin. One such gene, hyaluronan synthase 2 (HAS2), synthesizes hyaluronan, a core component of the extracellular matrix. We confirmed HAS2 upregulation in PC-3 cells transfected with D32G-mutant beta-catenin. Finally, we found nuclear localization of beta-catenin in 10 of 27 human tissue specimens; this localization was inversely associated with AR expression (P = 0.056, Fisher’s exact test), suggesting that reduced AR expression enables Wnt/beta-catenin signaling. Conclusion We identified a previously unknown downstream target of beta-catenin, HAS2, in PCa, and found that high beta-catenin nuclear localization and low or no AR expression may define a subpopulation of men with bone-metastatic PCa. These findings may guide physicians in managing these patients. PMID:22298898
DOE Office of Scientific and Technical Information (OSTI.GOV)
Na, Man Gyun; Oh, Seungrohk
A neuro-fuzzy inference system combined with the wavelet denoising, principal component analysis (PCA), and sequential probability ratio test (SPRT) methods has been developed to monitor the relevant sensor using the information of other sensors. The parameters of the neuro-fuzzy inference system that estimates the relevant sensor signal are optimized by a genetic algorithm and a least-squares algorithm. The wavelet denoising technique was applied to remove noise components in input signals into the neuro-fuzzy system. By reducing the dimension of an input space into the neuro-fuzzy system without losing a significant amount of information, the PCA was used to reduce themore » time necessary to train the neuro-fuzzy system, simplify the structure of the neuro-fuzzy inference system, and also, make easy the selection of the input signals into the neuro-fuzzy system. By using the residual signals between the estimated signals and the measured signals, the SPRT is applied to detect whether the sensors are degraded or not. The proposed sensor-monitoring algorithm was verified through applications to the pressurizer water level, the pressurizer pressure, and the hot-leg temperature sensors in pressurized water reactors.« less
Guan, Yangbo; Wu, You; Liu, Yifei; Ni, Jian; Nong, Shaojun
2016-08-01
Despite androgen deprivation therapy (ADT) remains the mainstay therapy for advanced prostate cancer (PCa), the patients have widely variable durations of response to ADT. Unfortunately, there is limited knowledge of pre-treatment prognostic factors for response to ADT. Recently, microRNA-21 (miR-21) has been reported to play an important role in development of castration resistance of CaP. However, little is known about the expression of miR-21 in advanced PCa biopsy tissues, and data on its potential predictive value in advanced PCa are completely lacking. In this study, paraffin-embedded prostate carcinoma tissues obtained by needle biopsy from 85 advanced PCa patients were evaluated for the expression levels of miR-21 by quantitative real-time PCR (qRT-PCR). In situ hybridization (ISH) analysis was performed to further confirm the qRT-PCR results. Kaplan-Meier analysis and Cox proportional hazards regression models were performed to investigate the correlation between miR-21 expression and time to progression of advanced PCa patients. Compared with adjacent non-cancerous prostate tissues, the expression level of miR-21 was significantly increased in PCa tissues (PCa vs. non-cancerous prostate: 1.3273 ± 0.3207 vs. 0.9970 ± 0.2054, P < 0.001). By and large, in ISH analysis miR-21 was expressed at a higher level in tumor areas than in adjacent non-cancerous areas. Additionally, PCa patients with higher expression of miR-21 were significantly more likely to be of high Gleason score and high clinical stage (P < 0.05). There was no significant association between miR-21 expression and the initial prostate-specific antigen (PSA) level or age at diagnosis. Moreover, Kaplan-Meier survival analysis found that PCa patients with high miR-21 expression have shorter progression-free survival than those with low miR-21 expression. Furthermore, Multivariate Cox analysis revealed both miR-21 expression status (P = 0.040) and clinical stage (P = 0.042) were all independent predictive factor for progression-free survival for advanced PCa. These findings suggest for the first time that the up-regulation of miR-21 may serve as an independent predictor of progress-free survival in patients with advanced PCa. Prostate 76:986-993, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Feng, Sujuan; Qian, Xiaosong; Li, Han; Zhang, Xiaodong
2017-12-01
The aim of the present study was to investigate the effectiveness of the miR-17-92 cluster as a disease progression marker in prostate cancer (PCa). Reverse transcription-quantitative polymerase chain reaction analysis was used to detect the microRNA (miR)-17-92 cluster expression levels in tissues from patients with PCa or benign prostatic hyperplasia (BPH), in addition to in PCa and BPH cell lines. Spearman correlation was used for comparison and estimation of correlations between miRNA expression levels and clinicopathological characteristics such as the Gleason score and prostate-specific antigen (PSA). Receiver operating curve (ROC) analysis was performed for evaluation of specificity and sensitivity of miR-17-92 cluster expression levels for discriminating patients with PCa from patients with BPH. Kaplan-Meier analysis was plotted to investigate the predictive potential of miR-17-92 cluster for PCa biochemical recurrence. Expression of the majority of miRNAs in the miR-17-92 cluster was identified to be significantly increased in PCa tissues and cell lines. Bivariate correlation analysis indicated that the high expression of unregulated miRNAs was positively correlated with Gleason grade, but had no significant association with PSA. ROC curves demonstrated that high expression of miR-17-92 cluster predicted a higher diagnostic accuracy compared with PSA. Improved discriminating quotients were observed when combinations of unregulated miRNAs with PSA were used. Survival analysis confirmed a high combined miRNA score of miR-17-92 cluster was associated with shorter biochemical recurrence interval. miR-17-92 cluster could be a potential diagnostic and prognostic biomarker for PCa, and the combination of the miR-17-92 cluster and serum PSA may enhance the accuracy for diagnosis of PCa.
Mujica Ascencio, Saul; Choe, ChunSik; Meinke, Martina C; Müller, Rainer H; Maksimov, George V; Wigger-Alberti, Walter; Lademann, Juergen; Darvin, Maxim E
2016-07-01
Propylene glycol is one of the known substances added in cosmetic formulations as a penetration enhancer. Recently, nanocrystals have been employed also to increase the skin penetration of active components. Caffeine is a component with many applications and its penetration into the epidermis is controversially discussed in the literature. In the present study, the penetration ability of two components - caffeine nanocrystals and propylene glycol, applied topically on porcine ear skin in the form of a gel, was investigated ex vivo using two confocal Raman microscopes operated at different excitation wavelengths (785nm and 633nm). Several depth profiles were acquired in the fingerprint region and different spectral ranges, i.e., 526-600cm(-1) and 810-880cm(-1) were chosen for independent analysis of caffeine and propylene glycol penetration into the skin, respectively. Multivariate statistical methods such as principal component analysis (PCA) and linear discriminant analysis (LDA) combined with Student's t-test were employed to calculate the maximum penetration depths of each substance (caffeine and propylene glycol). The results show that propylene glycol penetrates significantly deeper than caffeine (20.7-22.0μm versus 12.3-13.0μm) without any penetration enhancement effect on caffeine. The results confirm that different substances, even if applied onto the skin as a mixture, can penetrate differently. The penetration depths of caffeine and propylene glycol obtained using two different confocal Raman microscopes are comparable showing that both types of microscopes are well suited for such investigations and that multivariate statistical PCA-LDA methods combined with Student's t-test are very useful for analyzing the penetration of different substances into the skin. Copyright © 2016 Elsevier B.V. All rights reserved.
Bougrini, Madiha; Tahri, Khalid; Haddi, Zouhair; El Bari, Nezha; Llobet, Eduard; Jaffrezic-Renault, Nicole; Bouchikhi, Benachir
2014-12-01
A combined approach based on a multisensor system to get additional chemical information from liquid samples through the analysis of the solution and its headspace is illustrated and commented. In the present work, innovative analytical techniques, such as a hybrid e-nose and a voltammetric e-tongue were elaborated to differentiate between different pasteurized milk brands and for the exact recognition of their storage days through the data fusion technique of the combined system. The Principal Component Analysis (PCA) has shown an acceptable discrimination of the pasteurized milk brands on the first day of storage, when the two instruments were used independently. Contrariwise, PCA indicated that no clear storage day's discrimination can be drawn when the two instruments are applied separately. Mid-level of abstraction data fusion approach has demonstrated that results obtained by the data fusion approach outperformed the classification results of the e-nose and e-tongue taken individually. Furthermore, the Support Vector Machine (SVM) supervised method was applied to the new subset and confirmed that all storage days were correctly identified. This study can be generalized to several beverage and food products where their quality is based on the perception of odor and flavor. Copyright © 2014 Elsevier B.V. All rights reserved.
Tailored multivariate analysis for modulated enhanced diffraction
Caliandro, Rocco; Guccione, Pietro; Nico, Giovanni; ...
2015-10-21
Modulated enhanced diffraction (MED) is a technique allowing the dynamic structural characterization of crystalline materials subjected to an external stimulus, which is particularly suited forin situandoperandostructural investigations at synchrotron sources. Contributions from the (active) part of the crystal system that varies synchronously with the stimulus can be extracted by an offline analysis, which can only be applied in the case of periodic stimuli and linear system responses. In this paper a new decomposition approach based on multivariate analysis is proposed. The standard principal component analysis (PCA) is adapted to treat MED data: specific figures of merit based on their scoresmore » and loadings are found, and the directions of the principal components obtained by PCA are modified to maximize such figures of merit. As a result, a general method to decompose MED data, called optimum constrained components rotation (OCCR), is developed, which produces very precise results on simulated data, even in the case of nonperiodic stimuli and/or nonlinear responses. Furthermore, the multivariate analysis approach is able to supply in one shot both the diffraction pattern related to the active atoms (through the OCCR loadings) and the time dependence of the system response (through the OCCR scores). Furthermore, when applied to real data, OCCR was able to supply only the latter information, as the former was hindered by changes in abundances of different crystal phases, which occurred besides structural variations in the specific case considered. In order to develop a decomposition procedure able to cope with this combined effect represents the next challenge in MED analysis.« less
Gere, Attila; Losó, Viktor; Györey, Annamária; Kovács, Sándor; Huzsvai, László; Nábrádi, András; Kókai, Zoltán; Sipos, László
2014-12-01
Traditional internal and external preference mapping methods are based on principal component analysis (PCA). However, parallel factor analysis (PARAFAC) and Tucker-3 methods could be a better choice. To evaluate the methods, preference maps of sweet corn varieties will be introduced. A preference map of eight sweet corn varieties was established using PARAFAC and Tucker-3 methods. Instrumental data were also integrated into the maps. The triplot created by the PARAFAC model explains better how odour is separated from texture or appearance, and how some varieties are separated from others. Internal and external preference maps were created using parallel factor analysis (PARAFAC) and Tucker-3 models employing both sensory (trained panel and consumers) and instrumental parameters simultaneously. Triplots of the applied three-way models have a competitive advantage compared to the traditional biplots of the PCA-based external preference maps. The solution of PARAFAC and Tucker-3 is very similar regarding the interpretation of the first and third factors. The main difference is due to the second factor as it differentiated the attributes better. Consumers who prefer 'super sweet' varieties (they place great emphasis especially on taste) are much younger and have significantly higher incomes, and buy sweet corn products rarely (once a month). Consumers who consume sweet corn products mainly because of their texture and appearance are significantly older and include a higher ratio of men. © 2014 Society of Chemical Industry.
Taguchi, Y-H
2018-05-08
Even though coexistence of multiple phenotypes sharing the same genomic background is interesting, it remains incompletely understood. Epigenomic profiles may represent key factors, with unknown contributions to the development of multiple phenotypes, and social-insect castes are a good model for elucidation of the underlying mechanisms. Nonetheless, previous studies have failed to identify genes associated with aberrant gene expression and methylation profiles because of the lack of suitable methodology that can address this problem properly. A recently proposed principal component analysis (PCA)-based and tensor decomposition (TD)-based unsupervised feature extraction (FE) can solve this problem because these two approaches can deal with gene expression and methylation profiles even when a small number of samples is available. PCA-based and TD-based unsupervised FE methods were applied to the analysis of gene expression and methylation profiles in the brains of two social insects, Polistes canadensis and Dinoponera quadriceps. Genes associated with differential expression and methylation between castes were identified, and analysis of enrichment of Gene Ontology terms confirmed reliability of the obtained sets of genes from the biological standpoint. Biologically relevant genes, shown to be associated with significant differential gene expression and methylation between castes, were identified here for the first time. The identification of these genes may help understand the mechanisms underlying epigenetic control of development of multiple phenotypes under the same genomic conditions.
NASA Technical Reports Server (NTRS)
Cramer, K. E.; Winfree, W. P.
2005-01-01
The Nondestructive Evaluation Sciences Branch at NASA s Langley Research Center has been actively involved in the development of thermographic inspection techniques for more than 15 years. Since the Space Shuttle Columbia accident, NASA has focused on the improvement of advanced NDE techniques for the Reinforced Carbon-Carbon (RCC) panels that comprise the orbiter s wing leading edge. Various nondestructive inspection techniques have been used in the examination of the RCC, but thermography has emerged as an effective inspection alternative to more traditional methods. Thermography is a non-contact inspection method as compared to ultrasonic techniques which typically require the use of a coupling medium between the transducer and material. Like radiographic techniques, thermography can be used to inspect large areas, but has the advantage of minimal safety concerns and the ability for single-sided measurements. Principal Component Analysis (PCA) has been shown effective for reducing thermographic NDE data. A typical implementation of PCA is when the eigenvectors are generated from the data set being analyzed. Although it is a powerful tool for enhancing the visibility of defects in thermal data, PCA can be computationally intense and time consuming when applied to the large data sets typical in thermography. Additionally, PCA can experience problems when very large defects are present (defects that dominate the field-of-view), since the calculation of the eigenvectors is now governed by the presence of the defect, not the "good" material. To increase the processing speed and to minimize the negative effects of large defects, an alternative method of PCA is being pursued where a fixed set of eigenvectors, generated from an analytic model of the thermal response of the material under examination, is used to process the thermal data from the RCC materials. Details of a one-dimensional analytic model and a two-dimensional finite-element model will be presented. An overview of the PCA process as well as a quantitative signal-to-noise comparison of the results of performing both embodiments of PCA on thermographic data from various RCC specimens will be shown. Finally, a number of different applications of this technology to various RCC components will be presented.
Application of multivariable statistical techniques in plant-wide WWTP control strategies analysis.
Flores, X; Comas, J; Roda, I R; Jiménez, L; Gernaey, K V
2007-01-01
The main objective of this paper is to present the application of selected multivariable statistical techniques in plant-wide wastewater treatment plant (WWTP) control strategies analysis. In this study, cluster analysis (CA), principal component analysis/factor analysis (PCA/FA) and discriminant analysis (DA) are applied to the evaluation matrix data set obtained by simulation of several control strategies applied to the plant-wide IWA Benchmark Simulation Model No 2 (BSM2). These techniques allow i) to determine natural groups or clusters of control strategies with a similar behaviour, ii) to find and interpret hidden, complex and casual relation features in the data set and iii) to identify important discriminant variables within the groups found by the cluster analysis. This study illustrates the usefulness of multivariable statistical techniques for both analysis and interpretation of the complex multicriteria data sets and allows an improved use of information for effective evaluation of control strategies.
Using recurrence plot analysis for software execution interpretation and fault detection
NASA Astrophysics Data System (ADS)
Mosdorf, M.
2015-09-01
This paper shows a method targeted at software execution interpretation and fault detection using recurrence plot analysis. In in the proposed approach recurrence plot analysis is applied to software execution trace that contains executed assembly instructions. Results of this analysis are subject to further processing with PCA (Principal Component Analysis) method that simplifies number coefficients used for software execution classification. This method was used for the analysis of five algorithms: Bubble Sort, Quick Sort, Median Filter, FIR, SHA-1. Results show that some of the collected traces could be easily assigned to particular algorithms (logs from Bubble Sort and FIR algorithms) while others are more difficult to distinguish.
Romero-Pastor, Julia; Navas, Natalia; Kuckova, Stepanka; Rodríguez-Navarro, Alejandro; Cardell, Carolina
2012-03-01
This study focuses on acquiring information on the degradation process of proteinaceous binders due to ultra violet (UV) radiation and possible interactions owing to the presence of historical mineral pigments. With this aim, three different paint model samples were prepared according to medieval recipes, using rabbit glue as proteinaceus binders. One of these model samples contained only the binder, and the other two were prepared by mixing each of the pigments (cinnabar or azurite) with the binder (glue tempera model samples). The model samples were studied by applying Principal Component Analysis (PCA) to their mass spectra obtained with Matrix-Assisted Laser Desorption/Ionization-Time of Flight Mass Spectrometry (MALDI-TOF-MS). The complementary use of Fourier Transform Infrared Spectroscopy to study conformational changes of secondary structure of the proteinaceous binder is also proposed. Ageing effects on the model samples after up to 3000 h of UV irradiation were periodically analyzed by the proposed approach. PCA on MS data proved capable of identifying significant changes in the model samples, and the results suggested different aging behavior based on the pigment present. This research represents the first attempt to use this approach (PCA on MALDI-TOF-MS data) in the field of Cultural Heritage and demonstrates the potential benefits in the study of proteinaceous artistic materials for purposes of conservation and restoration. Copyright © 2012 John Wiley & Sons, Ltd.
NASA Astrophysics Data System (ADS)
Bellemans, Aurélie; Parente, Alessandro; Magin, Thierry
2018-04-01
The present work introduces a novel approach for obtaining reduced chemistry representations of large kinetic mechanisms in strong non-equilibrium conditions. The need for accurate reduced-order models arises from compression of large ab initio quantum chemistry databases for their use in fluid codes. The method presented in this paper builds on existing physics-based strategies and proposes a new approach based on the combination of a simple coarse grain model with Principal Component Analysis (PCA). The internal energy levels of the chemical species are regrouped in distinct energy groups with a uniform lumping technique. Following the philosophy of machine learning, PCA is applied on the training data provided by the coarse grain model to find an optimally reduced representation of the full kinetic mechanism. Compared to recently published complex lumping strategies, no expert judgment is required before the application of PCA. In this work, we will demonstrate the benefits of the combined approach, stressing its simplicity, reliability, and accuracy. The technique is demonstrated by reducing the complex quantum N2(g+1Σ) -N(S4u ) database for studying molecular dissociation and excitation in strong non-equilibrium. Starting from detailed kinetics, an accurate reduced model is developed and used to study non-equilibrium properties of the N2(g+1Σ) -N(S4u ) system in shock relaxation simulations.
Peng, Shengmeng; Du, Tao; Wu, Wanhua; Chen, Xianju; Lai, Yiming; Zhu, Dingjun; Wang, Qiong; Ma, Xiaoming; Lin, Chunhao; Li, Zean; Guo, Zhenghui; Huang, Hai
2018-06-11
The aim of this study was to investigate the associations of serine proteinase inhibitor family G1 (SERPING1) down-regulation with poor prognosis in patients with prostate cancer (PCa). Furthermore, we aim to find more novel and effective PCa molecular markers to provide an early screening of PCa, distinguish patients with aggressive PCa, predict the prognosis, or reduce the economic burden of PCa. SERPING1 protein expression in both human PCa and normal prostate tissues was detected by immunohistochemical staining, which intensity was analyzed in association with clinical pathological parameters such Gleason score, pathological grade, clinical stage, tumor stage, lymph node metastasis, and distant metastasis. Moreover, we used The Cancer Genome Atlas (TCGA) Database, Taylor Database, and Oncomine dataset to validate our immunohistochemical results and investigated the value of SERPING1 in PCa at mRNA level. Kaplan-Meier analysis and Cox regression analysis were performed to evaluate the relationship between SERPING1 and prognosis of patients with PCa. The outcome showed that SERPING1 was expressed mainly in cytoplasm of grand cells of prostate tissue and was significantly expressed less in PCa (P<0.001). Furthermore, in the tissue microarray of our samples, decreasing expression of SERPING1 was correlated with the higher Gleason score (P = 0.004), the higher pathological grade (P = 0.01) and the advanced tumor stage (P = 0.005) at protein level. In TCGA dataset and Taylor Dataset, low-expressed SERPING1 was correlated with the younger patient (P = 0.02 in TCGA, P = 0.044 in Taylor) and the higher Gleason score (P = 0.019 in TCGA, P<0.001 in Taylor) at mRNA level. Kaplan-Meier analysis revealed that the lower mRNA of SERPING1 predicted lower overall survivals (P = 0.027 in TCGA), lower disease-free survival (P = 0.029) and lower biochemical recurrence-free survival (P = 0.011 in Taylor). Data from Oncomine database shown that SERPING1 low expression implying higher malignancy of prostate lesions. Using multivariate analysis, we also found that SERPING1 expression was independent prognostic marker of poor disease-free survival and biochemical recurrence-free survival. SERPING1 may play an important role in PCa and can be serve as a novel marker in diagnosis and prognostic prediction in PCa. In addition, levels of SERPING1 can help identify low-risk prostate to provide reference for patients with PCa to accept active surveillance and reduce overtreatment. Copyright © 2018 Elsevier Inc. All rights reserved.
Ferro, Matteo; Bruzzese, Dario; Perdonà, Sisto; Mazzarella, Claudia; Marino, Ada; Sorrentino, Alessandra; Di Carlo, Angelina; Autorino, Riccardo; Di Lorenzo, Giuseppe; Buonerba, Carlo; Altieri, Vincenzo; Mariano, Angela; Macchia, Vincenzo; Terracciano, Daniela
2012-08-16
Indication for prostate biopsy is presently mainly based on prostate-specific antigen (PSA) serum levels and digital-rectal examination (DRE). In view of the unsatisfactory accuracy of these two diagnostic exams, research has focused on novel markers to improve pre-biopsy prostate cancer detection, such as phi and PCA3. The purpose of this prospective study was to assess the diagnostic accuracy of phi and PCA3 for prostate cancer using biopsy as gold standard. Phi index (Beckman coulter immunoassay), PCA3 score (Progensa PCA3 assay) and other established biomarkers (tPSA, fPSA and %fPSA) were assessed before a 18-core prostate biopsy in a group of 251 subjects at their first biopsy. Values of %p2PSA and phi were significantly higher in patients with PCa compared with PCa-negative group (p<0.001) and also compared with high grade prostatic intraepithelial neoplasia (HGPIN) (p<0.001). PCA3 score values were significantly higher in PCa compared with PCa-negative subjects (p<0.001) and in HGPIN vs PCa-negative patients (p<0.001). ROC curve analysis showed that %p2PSA, phi and PCA3 are predictive of malignancy. In conclusion, %p2PSA, phi and PCA3 may predict a diagnosis of PCa in men undergoing their first prostate biopsy. PCA3 score is more useful in discriminating between HGPIN and non-cancer. Copyright © 2012 Elsevier B.V. All rights reserved.
PCA as a practical indicator of OPLS-DA model reliability.
Worley, Bradley; Powers, Robert
Principal Component Analysis (PCA) and Orthogonal Projections to Latent Structures Discriminant Analysis (OPLS-DA) are powerful statistical modeling tools that provide insights into separations between experimental groups based on high-dimensional spectral measurements from NMR, MS or other analytical instrumentation. However, when used without validation, these tools may lead investigators to statistically unreliable conclusions. This danger is especially real for Partial Least Squares (PLS) and OPLS, which aggressively force separations between experimental groups. As a result, OPLS-DA is often used as an alternative method when PCA fails to expose group separation, but this practice is highly dangerous. Without rigorous validation, OPLS-DA can easily yield statistically unreliable group separation. A Monte Carlo analysis of PCA group separations and OPLS-DA cross-validation metrics was performed on NMR datasets with statistically significant separations in scores-space. A linearly increasing amount of Gaussian noise was added to each data matrix followed by the construction and validation of PCA and OPLS-DA models. With increasing added noise, the PCA scores-space distance between groups rapidly decreased and the OPLS-DA cross-validation statistics simultaneously deteriorated. A decrease in correlation between the estimated loadings (added noise) and the true (original) loadings was also observed. While the validity of the OPLS-DA model diminished with increasing added noise, the group separation in scores-space remained basically unaffected. Supported by the results of Monte Carlo analyses of PCA group separations and OPLS-DA cross-validation metrics, we provide practical guidelines and cross-validatory recommendations for reliable inference from PCA and OPLS-DA models.
Fuller, Douglas O; Parenti, Michael S; Gad, Adel M; Beier, John C
2012-01-01
Irrigation along the Nile River has resulted in dramatic changes in the biophysical environment of Upper Egypt. In this study we used a combination of MODIS 250 m NDVI data and Landsat imagery to identify areas that changed from 2001-2008 as a result of irrigation and water-level fluctuations in the Nile River and nearby water bodies. We used two different methods of time series analysis -- principal components (PCA) and harmonic decomposition (HD), applied to the MODIS 250 m NDVI images to derive simple three-class land cover maps and then assessed their accuracy using a set of reference polygons derived from 30 m Landsat 5 and 7 imagery. We analyzed our MODIS 250 m maps against a new MODIS global land cover product (MOD12Q1 collection 5) to assess whether regionally specific mapping approaches are superior to a standard global product. Results showed that the accuracy of the PCA-based product was greater than the accuracy of either the HD or MOD12Q1 products for the years 2001, 2003, and 2008. However, the accuracy of the PCA product was only slightly better than the MOD12Q1 for 2001 and 2003. Overall, the results suggest that our PCA-based approach produces a high level of user and producer accuracies, although the MOD12Q1 product also showed consistently high accuracy. Overlay of 2001-2008 PCA-based maps showed a net increase of 12 129 ha of irrigated vegetation, with the largest increase found from 2006-2008 around the Districts of Edfu and Kom Ombo. This result was unexpected in light of ambitious government plans to develop 336 000 ha of irrigated agriculture around the Toshka Lakes.
2014-01-01
Background The levels of 19 elements (As, Be, Ca, Cd, Co, Cr, Cu, Fe, K, Mg, Mn, Na, Ni, Pb, Se, Tl, U, V, Zn) from sixteen different Argentine production sites of unifloral [eucalyptus (Eucaliptus rostrata), chilca (Baccharis salicifolia), Algarrobo (Prosopis sp.), mistol (Ziziphus mistol) and citric] and multifloral honeys were measured with the aim to test the quality of the selected samples. Typical quality parameters of honeys were also determined (pH, sugar content, moisture). Mineral elements were determined by using inductively coupled plasma mass spectrometer (ICP-MS DRC). We also evaluated the suitability of honey as a possible biomonitor of environmental pollution. Thus, the sites were classified through cluster analysis (CA) and then pattern recognition methods such as Principal Component Analysis (PCA) and discriminant analysis (DA) were applied. Results Mean values for quality parameters were: pH, 4.12 and 3.81; sugar 82.1 and 82.0 °brix; moisture, 16.90 and 17.00% for unifloral and multifloral honeys respectively. The water content showed good maturity. Likewise, the other parameters confirmed the good quality of the honeys analysed. Potassium was quantitatively the most abundant metal, accounting for 92,5% of the total metal contents with an average concentration of 832.0 and 816.2 μg g-1 for unifloral and multifloral honeys respectively. Sodium was the second most abundant major metal in honeys with a mean value of 32.16 and 33.19 μg g-1 for unifloral and multifloral honeys respectively. Mg, Ca, Fe, Mn, Zn and Cu were present at low-intermediate concentrations. For the other 11 trace elements determined in this study (As, Be, Cd, Co, Cr, Ni, Pb, Se, Tl, U and V), the mean concentrations were very low or below of the LODs. The sites were classified through CA by using elements’ and physicochemical parameters data, then DA on the PCA factors was applied. Dendrograms identified three main groups. PCA explained 52.03% of the total variability with the first two factors. Conclusions In general, there are no evidences of pollution for the analysed honeys. The analytical results obtained for the Argentine honeys indicate the products’ high quality. In fact, most of the toxic elements were below LODs. The chemometric analysis combining CA, DA and PCA showed their aptness as useful tools for honey’s classification. Eventually, this study confirms that the use of honey as biomonitor of environmental contamination is not reliable for sites with low levels of contamination. PMID:25057287
Liu, Yu; Zhang, Xufeng; Li, Ying; Wang, Haixia
2017-11-01
Geographical origin traceability is an important issue for controlling the quality of seafood and safeguarding the interest of consumers. In the present study, a new method of compound-specific isotope analysis (CSIA) of fatty acids was established to evaluate its applicability in establishing the origin traceability of Apostichopus japonicus in the coastal areas of China. Moreover, principal component analysis (PCA) and discriminant analysis (DA) were applied to distinguish between the origins of A. japonicus. The results show that the stable carbon isotope compositions of fatty acids of A. japonicus significantly differ in terms of both season and origin. They also indicate that the stable carbon isotope composition of fatty acids could effectively discriminate between the origins of A. japonicus, except for between Changhai Island and Zhangzi Island in the spring of 2016 because of geographical proximity or the similarity of food sources. The fatty acids that have the highest contribution to identifying the geographical origins of A. japonicus are C22:6n-3, C16:1n-7, C20:5n-3, C18:0 and C23:1n-9, when considering the fatty acid contents, the stable carbon isotope composition of fatty acids and the results of the PCA and DA. We conclude that CSIA of fatty acids, combined with multivariate statistical analysis such as PCA and DA, may be an effective tool for establishing the traceability of A. japonicus in the coastal areas of China. The relevant conclusions of the present study provide a new method for determining the traceability of seafood or other food products. © 2017 Society of Chemical Industry. © 2017 Society of Chemical Industry.
Evaluation of redundancy analysis to identify signatures of local adaptation.
Capblancq, Thibaut; Luu, Keurcien; Blum, Michael G B; Bazin, Eric
2018-05-26
Ordination is a common tool in ecology that aims at representing complex biological information in a reduced space. In landscape genetics, ordination methods such as principal component analysis (PCA) have been used to detect adaptive variation based on genomic data. Taking advantage of environmental data in addition to genotype data, redundancy analysis (RDA) is another ordination approach that is useful to detect adaptive variation. This paper aims at proposing a test statistic based on RDA to search for loci under selection. We compare redundancy analysis to pcadapt, which is a nonconstrained ordination method, and to a latent factor mixed model (LFMM), which is a univariate genotype-environment association method. Individual-based simulations identify evolutionary scenarios where RDA genome scans have a greater statistical power than genome scans based on PCA. By constraining the analysis with environmental variables, RDA performs better than PCA in identifying adaptive variation when selection gradients are weakly correlated with population structure. Additionally, we show that if RDA and LFMM have a similar power to identify genetic markers associated with environmental variables, the RDA-based procedure has the advantage to identify the main selective gradients as a combination of environmental variables. To give a concrete illustration of RDA in population genomics, we apply this method to the detection of outliers and selective gradients on an SNP data set of Populus trichocarpa (Geraldes et al., 2013). The RDA-based approach identifies the main selective gradient contrasting southern and coastal populations to northern and continental populations in the northwestern American coast. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Principal component analysis for the early detection of mastitis and lameness in dairy cows.
Miekley, Bettina; Traulsen, Imke; Krieter, Joachim
2013-08-01
This investigation analysed the applicability of principal component analysis (PCA), a latent variable method, for the early detection of mastitis and lameness. Data used were recorded on the Karkendamm dairy research farm between August 2008 and December 2010. For mastitis and lameness detection, data of 338 and 315 cows in their first 200 d in milk were analysed, respectively. Mastitis as well as lameness were specified according to veterinary treatments. Diseases were defined as disease blocks. The different definitions used (two for mastitis, three for lameness) varied solely in the sequence length of the blocks. Only the days before the treatment were included in the blocks. Milk electrical conductivity, milk yield and feeding patterns (feed intake, number of feeding visits and time at the trough) were used for recognition of mastitis. Pedometer activity and feeding patterns were utilised for lameness detection. To develop and verify the PCA model, the mastitis and the lameness datasets were divided into training and test datasets. PCA extracted uncorrelated principle components (PC) by linear transformations of the raw data so that the first few PCs captured most of the variations in the original dataset. For process monitoring and disease detection, these resulting PCs were applied to the Hotelling's T 2 chart and to the residual control chart. The results show that block sensitivity of mastitis detection ranged from 77·4 to 83·3%, whilst specificity was around 76·7%. The error rates were around 98·9%. For lameness detection, the block sensitivity ranged from 73·8 to 87·8% while the obtained specificities were between 54·8 and 61·9%. The error rates varied from 87·8 to 89·2%. In conclusion, PCA seems to be not yet transferable into practical usage. Results could probably be improved if different traits and more informative sensor data are included in the analysis.
Li, Xuejian; Wang, Youqing
2016-12-01
Offline general-type models are widely used for patients' monitoring in intensive care units (ICUs), which are developed by using past collected datasets consisting of thousands of patients. However, these models may fail to adapt to the changing states of ICU patients. Thus, to be more robust and effective, the monitoring models should be adaptable to individual patients. A novel combination of just-in-time learning (JITL) and principal component analysis (PCA), referred to learning-type PCA (L-PCA), was proposed for adaptive online monitoring of patients in ICUs. JITL was used to gather the most relevant data samples for adaptive modeling of complex physiological processes. PCA was used to build an online individual-type model and calculate monitoring statistics, and then to judge whether the patient's status is normal or not. The adaptability of L-PCA lies in the usage of individual data and the continuous updating of the training dataset. Twelve subjects were selected from the Physiobank's Multi-parameter Intelligent Monitoring for Intensive Care II (MIMIC II) database, and five vital signs of each subject were chosen. The proposed method was compared with the traditional PCA and fast moving-window PCA (Fast MWPCA). The experimental results demonstrated that the fault detection rates respectively increased by 20 % and 47 % compared with PCA and Fast MWPCA. L-PCA is first introduced into ICU patients monitoring and achieves the best monitoring performance in terms of adaptability to changes in patient status and sensitivity for abnormality detection.
Spatial assessment of air quality patterns in Malaysia using multivariate analysis
NASA Astrophysics Data System (ADS)
Dominick, Doreena; Juahir, Hafizan; Latif, Mohd Talib; Zain, Sharifuddin M.; Aris, Ahmad Zaharin
2012-12-01
This study aims to investigate possible sources of air pollutants and the spatial patterns within the eight selected Malaysian air monitoring stations based on a two-year database (2008-2009). The multivariate analysis was applied on the dataset. It incorporated Hierarchical Agglomerative Cluster Analysis (HACA) to access the spatial patterns, Principal Component Analysis (PCA) to determine the major sources of the air pollution and Multiple Linear Regression (MLR) to assess the percentage contribution of each air pollutant. The HACA results grouped the eight monitoring stations into three different clusters, based on the characteristics of the air pollutants and meteorological parameters. The PCA analysis showed that the major sources of air pollution were emissions from motor vehicles, aircraft, industries and areas of high population density. The MLR analysis demonstrated that the main pollutant contributing to variability in the Air Pollutant Index (API) at all stations was particulate matter with a diameter of less than 10 μm (PM10). Further MLR analysis showed that the main air pollutant influencing the high concentration of PM10 was carbon monoxide (CO). This was due to combustion processes, particularly originating from motor vehicles. Meteorological factors such as ambient temperature, wind speed and humidity were also noted to influence the concentration of PM10.
Liu, Xiao-Fang; Xue, Chang-Hu; Wang, Yu-Ming; Li, Zhao-Jie; Xue, Yong; Xu, Jie
2011-11-01
The present study is to investigate the feasibility of multi-elements analysis in determination of the geographical origin of sea cucumber Apostichopus japonicus, and to make choice of the effective tracers in sea cucumber Apostichopus japonicus geographical origin assessment. The content of the elements such as Al, V, Cr, Mn, Fe, Co, Ni, Cu, Zn, As, Se, Mo, Cd, Hg and Pb in sea cucumber Apostichopus japonicus samples from seven places of geographical origin were determined by means of ICP-MS. The results were used for the development of elements database. Cluster analysis(CA) and principal component analysis (PCA) were applied to differentiate the sea cucumber Apostichopus japonicus geographical origin. Three principal components which accounted for over 89% of the total variance were extracted from the standardized data. The results of Q-type cluster analysis showed that the 26 samples could be clustered reasonably into five groups, the classification results were significantly associated with the marine distribution of the sea cucumber Apostichopus japonicus samples. The CA and PCA were the effective methods for elements analysis of sea cucumber Apostichopus japonicus samples. The content of the mineral elements in sea cucumber Apostichopus japonicus samples was good chemical descriptors for differentiating their geographical origins.
Combined data mining/NIR spectroscopy for purity assessment of lime juice
NASA Astrophysics Data System (ADS)
Shafiee, Sahameh; Minaei, Saeid
2018-06-01
This paper reports the data mining study on the NIR spectrum of lime juice samples to determine their purity (natural or synthetic). NIR spectra for 72 pure and synthetic lime juice samples were recorded in reflectance mode. Sample outliers were removed using PCA analysis. Different data mining techniques for feature selection (Genetic Algorithm (GA)) and classification (including the radial basis function (RBF) network, Support Vector Machine (SVM), and Random Forest (RF) tree) were employed. Based on the results, SVM proved to be the most accurate classifier as it achieved the highest accuracy (97%) using the raw spectrum information. The classifier accuracy dropped to 93% when selected feature vector by GA search method was applied as classifier input. It can be concluded that some relevant features which produce good performance with the SVM classifier are removed by feature selection. Also, reduced spectra using PCA do not show acceptable performance (total accuracy of 66% by RBFNN), which indicates that dimensional reduction methods such as PCA do not always lead to more accurate results. These findings demonstrate the potential of data mining combination with near-infrared spectroscopy for monitoring lime juice quality in terms of natural or synthetic nature.
Mottese, Antonio Francesco; Naccari, Clara; Vadalà, Rossella; Bua, Giuseppe Daniel; Bartolomeo, Giovanni; Rando, Rossana; Cicero, Nicola; Dugo, Giacomo
2018-01-01
Opuntia ficus-indica L. Miller fruits, particularly 'Ficodindia dell'Etna' of Biancavilla (POD), 'Fico d'india tradizionale di Roccapalumba' with protected brand and samples from an experimental field in Pezzolo (Sicily) were analyzed by inductively coupled plasma mass spectrometry in order to determine the multi-element profile. A multivariate chemometric approach, specifically principal component analysis (PCA), was applied to individuate how mineral elements may represent a marker of geographic origin, which would be useful for traceability. PCA has allowed us to verify that the geographical origin of prickly pear fruits is significantly influenced by trace element content, and the results found in Biancavilla PDO samples were linked to the geological composition of this volcanic areas. It was observed that two principal components accounted for 72.03% of the total variance in the data and, in more detail, PC1 explains 45.51% and PC2 26.52%, respectively. This study demonstrated that PCA is an integrated tool for the traceability of food products and, at the same time, a useful method of authentication of typical local fruits such as prickly pear. © 2017 Society of Chemical Industry. © 2017 Society of Chemical Industry.
EMPCA and Cluster Analysis of Quasar Spectra: Construction and Application to Simulated Spectra
NASA Astrophysics Data System (ADS)
Marrs, Adam; Leighly, Karen; Wagner, Cassidy; Macinnis, Francis
2017-01-01
Quasars have complex spectra with emission lines influenced by many factors. Therefore, to fully describe the spectrum requires specification of a large number of parameters, such as line equivalent width, blueshift, and ratios. Principal Component Analysis (PCA) aims to construct eigenvectors-or principal components-from the data with the goal of finding a few key parameters that can be used to predict the rest of the spectrum fairly well. Analysis of simulated quasar spectra was used to verify and justify our modified application of PCA.We used a variant of PCA called Weighted Expectation Maximization PCA (EMPCA; Bailey 2012) along with k-means cluster analysis to analyze simulated quasar spectra. Our approach combines both analytical methods to address two known problems with classical PCA. EMPCA uses weights to account for uncertainty and missing points in the spectra. K-means groups similar spectra together to address the nonlinearity of quasar spectra, specifically variance in blueshifts and widths of the emission lines.In producing and analyzing simulations, we first tested the effects of varying equivalent widths and blueshifts on the derived principal components, and explored the differences between standard PCA and EMPCA. We also tested the effects of varying signal-to-noise ratio. Next we used the results of fits to composite quasar spectra (see accompanying poster by Wagner et al.) to construct a set of realistic simulated spectra, and subjected those spectra to the EMPCA /k-means analysis. We concluded that our approach was validated when we found that the mean spectra from our k-means clusters derived from PCA projection coefficients reproduced the trends observed in the composite spectra.Furthermore, our method needed only two eigenvectors to identify both sets of correlations used to construct the simulations, as well as indicating the linear and nonlinear segments. Comparing this to regular PCA, which can require a dozen or more components, or to direct spectral analysis that may need measurement of 20 fit parameters, shows why the dual application of these two techniques is such a powerful tool.
Keenan, Michael R; Smentkowski, Vincent S; Ulfig, Robert M; Oltman, Edward; Larson, David J; Kelly, Thomas F
2011-06-01
We demonstrate for the first time that multivariate statistical analysis techniques can be applied to atom probe tomography data to estimate the chemical composition of a sample at the full spatial resolution of the atom probe in three dimensions. Whereas the raw atom probe data provide the specific identity of an atom at a precise location, the multivariate results can be interpreted in terms of the probabilities that an atom representing a particular chemical phase is situated there. When aggregated to the size scale of a single atom (∼0.2 nm), atom probe spectral-image datasets are huge and extremely sparse. In fact, the average spectrum will have somewhat less than one total count per spectrum due to imperfect detection efficiency. These conditions, under which the variance in the data is completely dominated by counting noise, test the limits of multivariate analysis, and an extensive discussion of how to extract the chemical information is presented. Efficient numerical approaches to performing principal component analysis (PCA) on these datasets, which may number hundreds of millions of individual spectra, are put forward, and it is shown that PCA can be computed in a few seconds on a typical laptop computer.
Li, Hong Zhi; Tao, Wei; Gao, Ting; Li, Hui; Lu, Ying Hua; Su, Zhong Min
2011-01-01
We propose a generalized regression neural network (GRNN) approach based on grey relational analysis (GRA) and principal component analysis (PCA) (GP-GRNN) to improve the accuracy of density functional theory (DFT) calculation for homolysis bond dissociation energies (BDE) of Y-NO bond. As a demonstration, this combined quantum chemistry calculation with the GP-GRNN approach has been applied to evaluate the homolysis BDE of 92 Y-NO organic molecules. The results show that the ull-descriptor GRNN without GRA and PCA (F-GRNN) and with GRA (G-GRNN) approaches reduce the root-mean-square (RMS) of the calculated homolysis BDE of 92 organic molecules from 5.31 to 0.49 and 0.39 kcal mol(-1) for the B3LYP/6-31G (d) calculation. Then the newly developed GP-GRNN approach further reduces the RMS to 0.31 kcal mol(-1). Thus, the GP-GRNN correction on top of B3LYP/6-31G (d) can improve the accuracy of calculating the homolysis BDE in quantum chemistry and can predict homolysis BDE which cannot be obtained experimentally.
Sciutto, Giorgia; Oliveri, Paolo; Catelli, Emilio; Bonacini, Irene
2017-01-01
In the field of applied researches in heritage science, the use of multivariate approach is still quite limited and often chemometric results obtained are often underinterpreted. Within this scenario, the present paper is aimed at disseminating the use of suitable multivariate methodologies and proposes a procedural workflow applied on a representative group of case studies, of considerable importance for conservation purposes, as a sort of guideline on the processing and on the interpretation of this FTIR data. Initially, principal component analysis (PCA) is performed and the score values are converted into chemical maps. Successively, the brushing approach is applied, demonstrating its usefulness for a deep understanding of the relationships between the multivariate map and PC score space, as well as for the identification of the spectral bands mainly involved in the definition of each area localised within the score maps. PMID:29333162
Maisuradze, Gia G; Leitner, David M
2007-05-15
Dihedral principal component analysis (dPCA) has recently been developed and shown to display complex features of the free energy landscape of a biomolecule that may be absent in the free energy landscape plotted in principal component space due to mixing of internal and overall rotational motion that can occur in principal component analysis (PCA) [Mu et al., Proteins: Struct Funct Bioinfo 2005;58:45-52]. Another difficulty in the implementation of PCA is sampling convergence, which we address here for both dPCA and PCA using a tetrapeptide as an example. We find that for both methods the sampling convergence can be reached over a similar time. Minima in the free energy landscape in the space of the two largest dihedral principal components often correspond to unique structures, though we also find some distinct minima to correspond to the same structure. 2007 Wiley-Liss, Inc.
Analysis of Zinc-Exporters Expression in Prostate Cancer.
Singh, Chandra K; Malas, Kareem M; Tydrick, Caitlin; Siddiqui, Imtiaz A; Iczkowski, Kenneth A; Ahmad, Nihal
2016-11-11
Maintaining optimal intracellular zinc (Zn) concentration is crucial for critical cellular functions. Depleted Zn has been associated with prostate cancer (PCa) progression. Solute carrier family 30 (SLC30A) proteins maintain cytoplasmic Zn balance by exporting Zn out to the extracellular space or by sequestering cytoplasmic Zn into intracellular compartments. In this study, we determined the involvement of Zn-exporters, SLC30A 1-10 in PCa, in the context of racial health disparity in human PCa samples obtained from European-American (EA) and African-American (AA) populations. We also analyzed the levels of Zn-exporters in a panel of PCa cells derived from EA and AA populations. We further explored the expression profile of Zn-exporters in PCa using Oncomine database. Zn-exporters were found to be differentially expressed at the mRNA level, with a significant upregulation of SLC30A1, SLC30A9 and SLC30A10, and downregulation of SLC30A5 and SLC30A6 in PCa, compared to benign prostate. Moreover, Ingenuity Pathway analysis revealed several interactions of Zn-exporters with certain tumor suppressor and promoter proteins known to be modulated in PCa. Our study provides an insight regarding Zn-exporters in PCa, which may open new avenues for future studies aimed at enhancing the levels of Zn by modulating Zn-transporters via pharmacological means.
Analysis of Zinc-Exporters Expression in Prostate Cancer
Singh, Chandra K.; Malas, Kareem M.; Tydrick, Caitlin; Siddiqui, Imtiaz A.; Iczkowski, Kenneth A.; Ahmad, Nihal
2016-01-01
Maintaining optimal intracellular zinc (Zn) concentration is crucial for critical cellular functions. Depleted Zn has been associated with prostate cancer (PCa) progression. Solute carrier family 30 (SLC30A) proteins maintain cytoplasmic Zn balance by exporting Zn out to the extracellular space or by sequestering cytoplasmic Zn into intracellular compartments. In this study, we determined the involvement of Zn-exporters, SLC30A 1–10 in PCa, in the context of racial health disparity in human PCa samples obtained from European-American (EA) and African-American (AA) populations. We also analyzed the levels of Zn-exporters in a panel of PCa cells derived from EA and AA populations. We further explored the expression profile of Zn-exporters in PCa using Oncomine database. Zn-exporters were found to be differentially expressed at the mRNA level, with a significant upregulation of SLC30A1, SLC30A9 and SLC30A10, and downregulation of SLC30A5 and SLC30A6 in PCa, compared to benign prostate. Moreover, Ingenuity Pathway analysis revealed several interactions of Zn-exporters with certain tumor suppressor and promoter proteins known to be modulated in PCa. Our study provides an insight regarding Zn-exporters in PCa, which may open new avenues for future studies aimed at enhancing the levels of Zn by modulating Zn-transporters via pharmacological means. PMID:27833104
Chieng, Norman; Trnka, Hjalte; Boetker, Johan; Pikal, Michael; Rantanen, Jukka; Grohganz, Holger
2013-09-15
The purpose of this study is to investigate the use of multivariate data analysis for powder X-ray diffraction-pair-wise distribution function (PXRD-PDF) data to detect phase separation in freeze-dried binary amorphous systems. Polymer-polymer and polymer-sugar binary systems at various ratios were freeze-dried. All samples were analyzed by PXRD, transformed to PDF and analyzed by principal component analysis (PCA). These results were validated by differential scanning calorimetry (DSC) through characterization of glass transition of the maximally freeze-concentrate solute (Tg'). Analysis of PXRD-PDF data using PCA provides a more clear 'miscible' or 'phase separated' interpretation through the distribution pattern of samples on a score plot presentation compared to residual plot method. In a phase separated system, samples were found to be evenly distributed around the theoretical PDF profile. For systems that were miscible, a clear deviation of samples away from the theoretical PDF profile was observed. Moreover, PCA analysis allows simultaneous analysis of replicate samples. Comparatively, the phase behavior analysis from PXRD-PDF-PCA method was in agreement with the DSC results. Overall, the combined PXRD-PDF-PCA approach improves the clarity of the PXRD-PDF results and can be used as an alternative explorative data analytical tool in detecting phase separation in freeze-dried binary amorphous systems. Copyright © 2013 Elsevier B.V. All rights reserved.
24 CFR 401.451 - PAE Physical Condition Analysis (PCA).
Code of Federal Regulations, 2010 CFR
2010-04-01
... PROGRAM (MARK-TO-MARKET) Restructuring Plan § 401.451 PAE Physical Condition Analysis (PCA). (a) Review and certification of owner evaluation. (1) The PAE must independently evaluate the physical condition... 24 Housing and Urban Development 2 2010-04-01 2010-04-01 false PAE Physical Condition Analysis...
24 CFR 401.451 - PAE Physical Condition Analysis (PCA).
Code of Federal Regulations, 2013 CFR
2013-04-01
... 24 Housing and Urban Development 2 2013-04-01 2013-04-01 false PAE Physical Condition Analysis... PROGRAM (MARK-TO-MARKET) Restructuring Plan § 401.451 PAE Physical Condition Analysis (PCA). (a) Review and certification of owner evaluation. (1) The PAE must independently evaluate the physical condition...
24 CFR 401.451 - PAE Physical Condition Analysis (PCA).
Code of Federal Regulations, 2011 CFR
2011-04-01
... 24 Housing and Urban Development 2 2011-04-01 2011-04-01 false PAE Physical Condition Analysis... PROGRAM (MARK-TO-MARKET) Restructuring Plan § 401.451 PAE Physical Condition Analysis (PCA). (a) Review and certification of owner evaluation. (1) The PAE must independently evaluate the physical condition...
24 CFR 401.451 - PAE Physical Condition Analysis (PCA).
Code of Federal Regulations, 2012 CFR
2012-04-01
... 24 Housing and Urban Development 2 2012-04-01 2012-04-01 false PAE Physical Condition Analysis... PROGRAM (MARK-TO-MARKET) Restructuring Plan § 401.451 PAE Physical Condition Analysis (PCA). (a) Review and certification of owner evaluation. (1) The PAE must independently evaluate the physical condition...
24 CFR 401.451 - PAE Physical Condition Analysis (PCA).
Code of Federal Regulations, 2014 CFR
2014-04-01
... 24 Housing and Urban Development 2 2014-04-01 2014-04-01 false PAE Physical Condition Analysis... PROGRAM (MARK-TO-MARKET) Restructuring Plan § 401.451 PAE Physical Condition Analysis (PCA). (a) Review and certification of owner evaluation. (1) The PAE must independently evaluate the physical condition...
3D Shape Perception in Posterior Cortical Atrophy: A Visual Neuroscience Perspective.
Gillebert, Céline R; Schaeverbeke, Jolien; Bastin, Christine; Neyens, Veerle; Bruffaerts, Rose; De Weer, An-Sofie; Seghers, Alexandra; Sunaert, Stefan; Van Laere, Koen; Versijpt, Jan; Vandenbulcke, Mathieu; Salmon, Eric; Todd, James T; Orban, Guy A; Vandenberghe, Rik
2015-09-16
Posterior cortical atrophy (PCA) is a rare focal neurodegenerative syndrome characterized by progressive visuoperceptual and visuospatial deficits, most often due to atypical Alzheimer's disease (AD). We applied insights from basic visual neuroscience to analyze 3D shape perception in humans affected by PCA. Thirteen PCA patients and 30 matched healthy controls participated, together with two patient control groups with diffuse Lewy body dementia (DLBD) and an amnestic-dominant phenotype of AD, respectively. The hierarchical study design consisted of 3D shape processing for 4 cues (shading, motion, texture, and binocular disparity) with corresponding 2D and elementary feature extraction control conditions. PCA and DLBD exhibited severe 3D shape-processing deficits and AD to a lesser degree. In PCA, deficient 3D shape-from-shading was associated with volume loss in the right posterior inferior temporal cortex. This region coincided with a region of functional activation during 3D shape-from-shading in healthy controls. In PCA patients who performed the same fMRI paradigm, response amplitude during 3D shape-from-shading was reduced in this region. Gray matter volume in this region also correlated with 3D shape-from-shading in AD. 3D shape-from-disparity in PCA was associated with volume loss slightly more anteriorly in posterior inferior temporal cortex as well as in ventral premotor cortex. The findings in right posterior inferior temporal cortex and right premotor cortex are consistent with neurophysiologically based models of the functional anatomy of 3D shape processing. However, in DLBD, 3D shape deficits rely on mechanisms distinct from inferior temporal structural integrity. Posterior cortical atrophy (PCA) is a neurodegenerative syndrome characterized by progressive visuoperceptual dysfunction and most often an atypical presentation of Alzheimer's disease (AD) affecting the ventral and dorsal visual streams rather than the medial temporal system. We applied insights from fundamental visual neuroscience to analyze 3D shape perception in PCA. 3D shape-processing deficits were affected beyond what could be accounted for by lower-order processing deficits. For shading and disparity, this was related to volume loss in regions previously implicated in 3D shape processing in the intact human and nonhuman primate brain. Typical amnestic-dominant AD patients also exhibited 3D shape deficits. Advanced visual neuroscience provides insight into the pathogenesis of PCA that also bears relevance for vision in typical AD. Copyright © 2015 Gillebert, Schaeverbeke et al.
Physical activity in relation to risk of prostate cancer: a systematic review and meta-analysis.
Benke, I N; Leitzmann, M F; Behrens, G; Schmid, D
2018-05-01
Prostate cancer (PCa) is one of the most common cancers among men, yet little is known about its modifiable risk and protective factors. This study aims to quantitatively summarize observational studies relating physical activity (PA) to PCa incidence and mortality. Published articles pertaining to PA and PCa incidence and mortality were retrieved in July 2017 using the Medline and EMBASE databases. The literature review yielded 48 cohort studies and 24 case-control studies with a total of 151 748 PCa cases. The mean age of the study participants at baseline was 61 years. In random-effects models, comparing the highest versus the lowest level of overall PA showed a summary relative risk (RR) estimate for total PCa incidence close to the null [RR = 0.99, 95% confidence interval (CI) = 0.94-1.04]. The corresponding RRs for advanced and non-advanced PCa were 0.92 (95% CI = 0.80-1.06) and 0.95 (95% CI = 0.85-1.07), respectively. We noted a statistically significant inverse association between long-term occupational activity and total PCa (RR = 0.83, 95% CI = 0.71-0.98, n studies = 13), although that finding became statistically non-significant when individual studies were removed from the analysis. When evaluated by cancer subtype, an inverse association with long-term occupational activity was noted for non-advanced/non-aggressive PCa (RR = 0.51, 95% CI = 0.37-0.71, n studies = 2) and regular recreational activity was inversely related to advanced/aggressive PCa (RR = 0.75, 95% CI = 0.60-0.95, n studies = 2), although these observations are based on a low number of studies. Moreover, PA after diagnosis was related to reduced risk of PCa mortality among survivors of PCa (summary RR based on four studies = 0.69, 95% CI = 0.55-0.85). Whether PA protects against PCa remains elusive. Further investigation taking into account the complex clinical and pathologic nature of PCa is needed to clarify the PA and PCa incidence relation. Moreover, future studies are needed to confirm whether PA after diagnosis reduces risk of PCa mortality.
Roudier, Martine P; Winters, Brian R; Coleman, Ilsa; Lam, Hung-Ming; Zhang, Xiaotun; Coleman, Roger; Chéry, Lisly; True, Lawrence D.; Higano, Celestia S.; Montgomery, Bruce; Lange, Paul H.; Snyder, Linda A.; Srivistava, Shiv; Corey, Eva; Vessella, Robert L.; Nelson, Peter S.; Üren, Aykut; Morrissey, Colm
2017-01-01
Background The TMPRSS2-ERG gene fusion is detected in approximately half of primary prostate cancers (PCa) yet the prognostic significance remains unclear. We hypothesized that ERG promotes the expression of common genes in primary PCa and metastatic castration-resistant PCa (CRPC), with the objective of identifying ERG-associated pathways, which may promote the transition from primary PCa to CRPC. Methods We constructed tissue microarrays (TMA) from 127 radical prostatectomy specimens, 20 LuCaP patient-derived xenografts (PDX), and 152 CRPC metastases obtained immediately at time of death. Nuclear ERG was assessed by immunohistochemistry (IHC). To characterize the molecular features of ERG-expressing PCa, a subset of IHC confirmed ERG+ or ERG-specimens including 11 radical prostatectomies, 20 LuCaP PDXs, and 45 CRPC metastases underwent gene expression analysis. Genes were ranked based on expression in primary PCa and CRPC. Common genes of interest were targeted for IHC analysis and expression compared with biochemical recurrence (BCR) status. Results IHC revealed that 43% of primary PCa, 35% of the LuCaP PDXs, and 18% of the CRPC metastases were ERG+ (12 of 48 patients [25%] had at least 1 ERG+ metastasis). Based on gene expression data and previous literature, two proteins involved in calcium signaling (NCALD, CACNA1D), a protein involved in inflammation (HLA-DMB), CD3 positive immune cells, and a novel ERG-associated protein, DCLK1 were evaluated in primary PCa and CRPC metastases. In ERG+ primary PCa, a weak association was seen with NCALD and CACNA1D protein expression. HLA-DMB expression and the presence of CD3 positive immune cells were decreased in CRPC metastases compared to primary PCa. DCLK1 was upregulated at the protein level in unpaired ERG+ primary PCa and CRPC metastases (p=0.0013 and p<0.0001, respectively). In primary PCa, ERG status or expression of targeted proteins was not associated with BCR-free survival. However for primary PCa, ERG+DCLK1+ patients exhibited shorter time to BCR (p=0.06) compared with ERG+DCLK1- patients. Conclusions This study examined ERG expression in primary PCa and CRPC. We have identified altered levels of inflammatory mediators associated with ERG expression. We determined expression of DCLK1 correlates with ERG expression and may play a role in primary PCa progression to metastatic CPRC. PMID:26990456
NASA Astrophysics Data System (ADS)
Ma, Mengli; Lei, En; Meng, Hengling; Wang, Tiantao; Xie, Linyan; Shen, Dong; Xianwang, Zhou; Lu, Bingyue
2017-08-01
Amomum tsao-ko is a commercial plant that used for various purposes in medicinal and food industries. For the present investigation, 44 germplasm samples were collected from Jinping County of Yunnan Province. Clusters analysis and 2-dimensional principal component analysis (PCA) was used to represent the genetic relations among Amomum tsao-ko by using simple sequence repeat (SSR) markers. Clustering analysis clearly distinguished the samples groups. Two major clusters were formed; first (Cluster I) consisted of 34 individuals, the second (Cluster II) consisted of 10 individuals, Cluster I as the main group contained multiple sub-clusters. PCA also showed 2 groups: PCA Group 1 included 29 individuals, PCA Group 2 included 12 individuals, consistent with the results of cluster analysis. The purpose of the present investigation was to provide information on genetic relationship of Amomum tsao-ko germplasm resources in main producing areas, also provide a theoretical basis for the protection and utilization of Amomum tsao-ko resources.
Xie, Lianwu; Guo, Junfang; Zhang, Yuping; Hu, Yunchu; You, Qingping; Shi, Shuyun
2015-07-01
Improving sites accessibility can increase the binding efficiency of molecular imprinted polymers (MIPs). In this work, we firstly synthesized MIPs over magnetic mesoporous silica microspheres (Fe3O4@mSiO2@MIPs) for the selective recognition of protocatechuic acid (PCA). The resulting Fe3O4@mSiO2@MIPs were characterized by transmission electron microscopy (TEM), Fourier transform infrared spectrometer (FT-IR), thermo-gravimetric analysis (TGA), Brunauer-Emmett-Teller (BET), and vibration sample magnetometer (VSM), and evaluated by adsorption isotherms/kinetics and competitive adsorption. The maximum adsorption capacity of PCA on Fe3O4@mSiO2@MIPs was 17.2mg/g (2.3 times that on Fe3O4@SiO2@MIPs). In addition, Fe3O4@mSiO2@MIPs showed a short equilibrium time (140min), rapid magnetic separation (5s) and high stability (retained 94.4% after six cycles). Subsequently, Fe3O4@mSiO2@MIPs were successfully applied for the selective and efficient determination of PCA (29.3μg/g) from Syzygium aromaticum. Conclusively, we combined three advantages into Fe3O4@mSiO2@MIPs, namely, Fe3O4 core for quick separation, mSiO2 layer for enough accessible sites, and surface imprinting MIPs for fast binding and excellent selectivity, to extract PCA from complex systems. Copyright © 2015 Elsevier Ltd. All rights reserved.
Goodpaster, Aaron M.; Kennedy, Michael A.
2015-01-01
Currently, no standard metrics are used to quantify cluster separation in PCA or PLS-DA scores plots for metabonomics studies or to determine if cluster separation is statistically significant. Lack of such measures makes it virtually impossible to compare independent or inter-laboratory studies and can lead to confusion in the metabonomics literature when authors putatively identify metabolites distinguishing classes of samples based on visual and qualitative inspection of scores plots that exhibit marginal separation. While previous papers have addressed quantification of cluster separation in PCA scores plots, none have advocated routine use of a quantitative measure of separation that is supported by a standard and rigorous assessment of whether or not the cluster separation is statistically significant. Here quantification and statistical significance of separation of group centroids in PCA and PLS-DA scores plots are considered. The Mahalanobis distance is used to quantify the distance between group centroids, and the two-sample Hotelling's T2 test is computed for the data, related to an F-statistic, and then an F-test is applied to determine if the cluster separation is statistically significant. We demonstrate the value of this approach using four datasets containing various degrees of separation, ranging from groups that had no apparent visual cluster separation to groups that had no visual cluster overlap. Widespread adoption of such concrete metrics to quantify and evaluate the statistical significance of PCA and PLS-DA cluster separation would help standardize reporting of metabonomics data. PMID:26246647
Contact- and distance-based principal component analysis of protein dynamics.
Ernst, Matthias; Sittel, Florian; Stock, Gerhard
2015-12-28
To interpret molecular dynamics simulations of complex systems, systematic dimensionality reduction methods such as principal component analysis (PCA) represent a well-established and popular approach. Apart from Cartesian coordinates, internal coordinates, e.g., backbone dihedral angles or various kinds of distances, may be used as input data in a PCA. Adopting two well-known model problems, folding of villin headpiece and the functional dynamics of BPTI, a systematic study of PCA using distance-based measures is presented which employs distances between Cα-atoms as well as distances between inter-residue contacts including side chains. While this approach seems prohibitive for larger systems due to the quadratic scaling of the number of distances with the size of the molecule, it is shown that it is sufficient (and sometimes even better) to include only relatively few selected distances in the analysis. The quality of the PCA is assessed by considering the resolution of the resulting free energy landscape (to identify metastable conformational states and barriers) and the decay behavior of the corresponding autocorrelation functions (to test the time scale separation of the PCA). By comparing results obtained with distance-based, dihedral angle, and Cartesian coordinates, the study shows that the choice of input variables may drastically influence the outcome of a PCA.
Contact- and distance-based principal component analysis of protein dynamics
NASA Astrophysics Data System (ADS)
Ernst, Matthias; Sittel, Florian; Stock, Gerhard
2015-12-01
To interpret molecular dynamics simulations of complex systems, systematic dimensionality reduction methods such as principal component analysis (PCA) represent a well-established and popular approach. Apart from Cartesian coordinates, internal coordinates, e.g., backbone dihedral angles or various kinds of distances, may be used as input data in a PCA. Adopting two well-known model problems, folding of villin headpiece and the functional dynamics of BPTI, a systematic study of PCA using distance-based measures is presented which employs distances between Cα-atoms as well as distances between inter-residue contacts including side chains. While this approach seems prohibitive for larger systems due to the quadratic scaling of the number of distances with the size of the molecule, it is shown that it is sufficient (and sometimes even better) to include only relatively few selected distances in the analysis. The quality of the PCA is assessed by considering the resolution of the resulting free energy landscape (to identify metastable conformational states and barriers) and the decay behavior of the corresponding autocorrelation functions (to test the time scale separation of the PCA). By comparing results obtained with distance-based, dihedral angle, and Cartesian coordinates, the study shows that the choice of input variables may drastically influence the outcome of a PCA.
Balacescu, Ovidiu; Petrut, Bogdan; Tudoran, Oana; Feflea, Dragos; Balacescu, Loredana; Anghel, Andrei; Sirbu, Ioan O; Seclaman, Edward; Marian, Catalin
2017-11-01
Prostate cancer (PCa) remains one of the leading causes of cancer-related deaths in men. Despite the tremendous progress in research over the years, a suitable minimally invasive PCa biomarker is yet to be discovered. The recent advances regarding the roles of microRNAs as biomarkers has allowed for their study in PCa as well, especially as blood-based markers. However, there are several studies that used urine as biological sample to evaluate microRNAs as biomarkers for PCa diagnosis, prognosis, and treatment response, which were reviewed herein. A high degree of inconsistency among reports has been observed, which could be due to several analytical aspects, starting with different urinary fractions used for analysis and continuing with the employment of various analytical platforms and methods of statistical analysis. However, a few microRNAs were found to be dysregulated in the urine of PCa patients, which alone or together with serum prostate-specific antigen seem to improve diagnostic power even in the gray zone of PCa. These results warrant further confirmation by larger prospective studies, preferably using a standardized protocol for analysis. WIREs RNA 2017, 8:e1438. doi: 10.1002/wrna.1438 For further resources related to this article, please visit the WIREs website. © 2017 Wiley Periodicals, Inc.
Bijangi-Vishehsaraei, Khadijeh; Blum, Kevin; Zhang, Hongji; Safa, Ahmad R; Halum, Stacey L
2016-03-01
The pathophysiology of recurrent laryngeal nerve (RLN) transection injury is rare in that it is characteristically followed by a high degree of spontaneous reinnervation, with reinnervation of the laryngeal adductor complex (AC) preceding that of the abducting posterior cricoarytenoid (PCA) muscle. Here, we aim to elucidate the differentially expressed myogenic factors following RLN injury that may be at least partially responsible for the spontaneous reinnervation. F344 male rats underwent RLN injury (n = 12) or sham surgery (n = 12). One week after RLN injury, larynges were harvested following euthanasia. The mRNA was extracted from PCA and AC muscles bilaterally, and microarray analysis was performed using a full rat genome array. Microarray analysis of denervated AC and PCA muscles demonstrated dramatic differences in gene expression profiles, with 205 individual probes that were differentially expressed between the denervated AC and PCA muscles and only 14 genes with similar expression patterns. The differential expression patterns of the AC and PCA suggest different mechanisms of reinnervation. The PCA showed the gene patterns of Wallerian degeneration, while the AC expressed the gene patterns of reinnervation by adjacent axonal sprouting. This finding may reveal important therapeutic targets applicable to RLN and other peripheral nerve injuries. © The Author(s) 2015.
Inflammation: an important parameter in the search of prostate cancer biomarkers
2014-01-01
Background A more specific and early diagnostics for prostate cancer (PCa) is highly desirable. In this study, being inflammation the focus of our effort, serum protein profiles were analyzed in order to investigate if this parameter could interfere with the search of discriminating proteins between PCa and benign prostatic hyperplasia (BPH). Methods Patients with clinical suspect of PCa and candidates for trans-rectal ultrasound guided prostate biopsy (TRUS) were enrolled. Histological specimens were examined in order to grade and classify the tumor, identify BPH and detect inflammation. Surface Enhanced Laser Desorption/Ionization-Time of Flight-Mass Spectrometry (SELDI-ToF-MS) and two-dimensional gel electrophoresis (2-DE) coupled with Liquid Chromatography-MS/MS (LC-MS/MS) were used to analyze immuno-depleted serum samples from patients with PCa and BPH. Results The comparison between PCa (with and without inflammation) and BPH (with and without inflammation) serum samples by SELDI-ToF-MS analysis did not show differences in protein expression, while changes were only observed when the concomitant presence of inflammation was taken into consideration. In fact, when samples with histological sign of inflammation were excluded, 20 significantly different protein peaks were detected. Subsequent comparisons (PCa with inflammation vs PCa without inflammation, and BPH with inflammation vs BPH without inflammation) showed that 16 proteins appeared to be modified in the presence of inflammation, while 4 protein peaks were not modified. With 2-DE analysis, comparing PCa without inflammation vs PCa with inflammation, and BPH without inflammation vs the same condition in the presence of inflammation, were identified 29 and 25 differentially expressed protein spots, respectively. Excluding samples with inflammation the comparison between PCa vs BPH showed 9 unique PCa proteins, 4 of which overlapped with those previously identified in the presence of inflammation, while other 2 were new proteins, not identified in our previous comparisons. Conclusions The present study indicates that inflammation might be a confounding parameter during the proteomic research of candidate biomarkers of PCa. These results indicate that some possible biomarker-candidate proteins are strongly influenced by the presence of inflammation, hence only a well-selected protein pattern should be considered for potential marker of PCa. PMID:24944525
Xu, Ning; Wu, Yu-Peng; Chen, Dong-Ning; Ke, Zhi-Bin; Cai, Hai; Wei, Yong; Zheng, Qing-Shui; Huang, Jin-Bei; Li, Xiao-Dong; Xue, Xue-Yi
2018-05-01
To explore the value of Prostate Imaging Reporting and Data System Version 2 (PI-RADS v2) for predicting prostate biopsy results in patients with prostate specific antigen (PSA) levels of 4-10 ng/ml. We retrospectively reviewed multi-parameter magnetic resonance images from 528 patients with PSA levels of 4-10 ng/ml who underwent transrectal ultrasound-guided prostate biopsies between May 2015 and May 2017. Among them, 137 were diagnosed with prostate cancer (PCa), and we further subdivided them according to pathological results into the significant PCa (S-PCa) and insignificant significant PCa (Ins-PCa) groups (121 cases were defined by surgical pathological specimen and 16 by biopsy). Age, PSA, percent free PSA, PSA density (PSAD), prostate volume (PV), and PI-RADS score were collected. Logistic regression analysis was performed to determine predictors of pathological results. Receiver operating characteristic curves were constructed to analyze the diagnostic value of PI-RADS v2 in PCa. Multivariate analysis indicated that age, PV, percent free PSA, and PI-RADS score were independent predictors of biopsy findings, while only PI-RADS score was an independent predictor of S-PCa (P < 0.05). The areas under the receiver operating characteristic curve for diagnosing PCa with respect to age, PV, percent free PSA, and PI-RADS score were 0.570, 0.430, 0.589 and 0.836, respectively. The area under the curve for diagnosing S-PCa with respect to PI-RADS score was 0.732. A PI-RADS score of 3 was the best cutoff for predicting PCa, and 4 was the best cutoff for predicting S-PCa. Thus, 92.8% of patients with PI-RADS scores of 1-2 would have avoided biopsy, but at the cost of missing 2.2% of the potential PCa cases. Similarly, 83.82% of patients with a PI-RADS score ≤ 3 would have avoided biopsy, but at the cost of missing 3.3% of the potential S-PCa cases. PI-RADS v2 could be used to reduce unnecessary prostate biopsies in patients with PSA levels of 4-10 ng/ml.
Zhang, Yu-Dong; Wang, Qing; Wu, Chen-Jiang; Wang, Xiao-Ning; Zhang, Jing; Liu, Hui; Liu, Xi-Sheng; Shi, Hai-Bin
2015-04-01
To evaluate histogram analysis of intravoxel incoherent motion (IVIM) for discriminating the Gleason grade of prostate cancer (PCa). A total of 48 patients pathologically confirmed as having clinically significant PCa (size > 0.5 cm) underwent preoperative DW-MRI (b of 0-900 s/mm(2)). Data was post-processed by monoexponential and IVIM model for quantitation of apparent diffusion coefficients (ADCs), perfusion fraction f, diffusivity D and pseudo-diffusivity D*. Histogram analysis was performed by outlining entire-tumour regions of interest (ROIs) from histological-radiological correlation. The ability of imaging indices to differentiate low-grade (LG, Gleason score (GS) ≤6) from intermediate/high-grade (HG, GS > 6) PCa was analysed by ROC regression. Eleven patients had LG tumours (18 foci) and 37 patients had HG tumours (42 foci) on pathology examination. HG tumours had significantly lower ADCs and D in terms of mean, median, 10th and 75th percentiles, combined with higher histogram kurtosis and skewness for ADCs, D and f, than LG PCa (p < 0.05). Histogram D showed relatively higher correlations (ñ = 0.641-0.668 vs. ADCs: 0.544-0.574) with ordinal GS of PCa; and its mean, median and 10th percentile performed better than ADCs did in distinguishing LG from HG PCa. It is feasible to stratify the pathological grade of PCa by IVIM with histogram metrics. D performed better in distinguishing LG from HG tumour than conventional ADCs. • GS had relatively higher correlation with tumour D than ADCs. • Difference of histogram D among two-grade tumours was statistically significant. • D yielded better individual features in demonstrating tumour grade than ADC. • D* and f failed to determine tumour grade of PCa.
Spectral discrimination of serum from liver cancer and liver cirrhosis using Raman spectroscopy
NASA Astrophysics Data System (ADS)
Yang, Tianyue; Li, Xiaozhou; Yu, Ting; Sun, Ruomin; Li, Siqi
2011-07-01
In this paper, Raman spectra of human serum were measured using Raman spectroscopy, then the spectra was analyzed by multivariate statistical methods of principal component analysis (PCA). Then linear discriminant analysis (LDA) was utilized to differentiate the loading score of different diseases as the diagnosing algorithm. Artificial neural network (ANN) was used for cross-validation. The diagnosis sensitivity and specificity by PCA-LDA are 88% and 79%, while that of the PCA-ANN are 89% and 95%. It can be seen that modern analyzing method is a useful tool for the analysis of serum spectra for diagnosing diseases.
Subject order-independent group ICA (SOI-GICA) for functional MRI data analysis.
Zhang, Han; Zuo, Xi-Nian; Ma, Shuang-Ye; Zang, Yu-Feng; Milham, Michael P; Zhu, Chao-Zhe
2010-07-15
Independent component analysis (ICA) is a data-driven approach to study functional magnetic resonance imaging (fMRI) data. Particularly, for group analysis on multiple subjects, temporally concatenation group ICA (TC-GICA) is intensively used. However, due to the usually limited computational capability, data reduction with principal component analysis (PCA: a standard preprocessing step of ICA decomposition) is difficult to achieve for a large dataset. To overcome this, TC-GICA employs multiple-stage PCA data reduction. Such multiple-stage PCA data reduction, however, leads to variable outputs due to different subject concatenation orders. Consequently, the ICA algorithm uses the variable multiple-stage PCA outputs and generates variable decompositions. In this study, a rigorous theoretical analysis was conducted to prove the existence of such variability. Simulated and real fMRI experiments were used to demonstrate the subject-order-induced variability of TC-GICA results using multiple PCA data reductions. To solve this problem, we propose a new subject order-independent group ICA (SOI-GICA). Both simulated and real fMRI data experiments demonstrated the high robustness and accuracy of the SOI-GICA results compared to those of traditional TC-GICA. Accordingly, we recommend SOI-GICA for group ICA-based fMRI studies, especially those with large data sets. Copyright 2010 Elsevier Inc. All rights reserved.
Carleton, Neil M; Zhu, Guangjing; Gorbounov, Mikhail; Miller, M Craig; Pienta, Kenneth J; Resar, Linda M S; Veltri, Robert W
2018-05-01
There are few tissue-based biomarkers that can accurately predict prostate cancer (PCa) progression and aggressiveness. We sought to evaluate the clinical utility of prostate and breast overexpressed 1 (PBOV1) as a potential PCa biomarker. Patient tumor samples were designated by Grade Groups using the 2014 Gleason grading system. Primary radical prostatectomy tumors were obtained from 48 patients and evaluated for PBOV1 levels using Western blot analysis in matched cancer and benign cancer-adjacent regions. Immunohistochemical evaluation of PBOV1 was subsequently performed in 80 cancer and 80 benign cancer-adjacent patient samples across two tissue microarrays (TMAs) to verify protein levels in epithelial tissue and to assess correlation between PBOV1 proteins and nuclear architectural changes in PCa cells. Digital histomorphometric analysis was used to track 22 parameters that characterized nuclear changes in PBOV1-stained cells. Using a training and test set for validation, multivariate logistic regression (MLR) models were used to identify significant nuclear parameters that distinguish Grade Group 3 and above PCa from Grade Group 1 and 2 PCa regions. PBOV1 protein levels were increased in tumors from Grade Group 3 and above (GS 4 + 3 and ≥ 8) regions versus Grade Groups 1 and 2 (GS 3 + 3 and 3 + 4) regions (P = 0.005) as assessed by densitometry of immunoblots. Additionally, by immunoblotting, PBOV1 protein levels differed significantly between Grade Group 2 (GS 3 + 4) and Grade Group 3 (GS 4 + 3) PCa samples (P = 0.028). In the immunohistochemical analysis, measures of PBOV1 staining intensity strongly correlated with nuclear alterations in cancer cells. An MLR model retaining eight parameters describing PBOV1 staining intensity and nuclear architecture discriminated Grade Group 3 and above PCa from Grade Group 1 and 2 PCa and benign cancer-adjacent regions with a ROC-AUC of 0.90 and 0.80, respectively, in training and test sets. Our study demonstrates that the PBOV1 protein could be used to discriminate Grade Group 3 and above PCa. Additionally, the PBOV1 protein could be involved in modulating changes to the nuclear architecture of PCa cells. Confirmatory studies are warranted in an independent population for further validation. © 2018 Wiley Periodicals, Inc.
Physicochemical and mechanical properties of paracetamol cocrystal with 5-nitroisophthalic acid.
Hiendrawan, Stevanus; Veriansyah, Bambang; Widjojokusumo, Edward; Soewandhi, Sundani Nurono; Wikarsa, Saleh; Tjandrawinata, Raymond R
2016-01-30
We report novel pharmaceutical cocrystal of a popular antipyretic drug paracetamol (PCA) with coformer 5-nitroisophhthalic acid (5NIP) to improve its tabletability. The cocrystal (PCA-5NIP at molar ratio of 1:1) was synthesized by solvent evaporation technique using methanol as solvent. The physicochemical properties of cocrystal were characterized by powder X-ray diffraction (PXRD), differential scanning calorimetry (DSC), thermogravimetry analysis (TGA), fourier transform infrared spectroscopy (FTIR), hot stage polarized microscopy (HSPM) and scanning electron microscopy (SEM). Stability of the cocrystal was assessed by storing them at 40°C/75% RH for one month. Compared to PCA, the cocrystal displayed superior tableting performance. PCA-5NIP cocrystal showed a similar dissolution profile as compared to PCA and exhibited good stability. This study showed the utility of PCA-5NIP cocrystal for improving mechanical properties of PCA. Copyright © 2015 Elsevier B.V. All rights reserved.
PEM-PCA: a parallel expectation-maximization PCA face recognition architecture.
Rujirakul, Kanokmon; So-In, Chakchai; Arnonkijpanich, Banchar
2014-01-01
Principal component analysis or PCA has been traditionally used as one of the feature extraction techniques in face recognition systems yielding high accuracy when requiring a small number of features. However, the covariance matrix and eigenvalue decomposition stages cause high computational complexity, especially for a large database. Thus, this research presents an alternative approach utilizing an Expectation-Maximization algorithm to reduce the determinant matrix manipulation resulting in the reduction of the stages' complexity. To improve the computational time, a novel parallel architecture was employed to utilize the benefits of parallelization of matrix computation during feature extraction and classification stages including parallel preprocessing, and their combinations, so-called a Parallel Expectation-Maximization PCA architecture. Comparing to a traditional PCA and its derivatives, the results indicate lower complexity with an insignificant difference in recognition precision leading to high speed face recognition systems, that is, the speed-up over nine and three times over PCA and Parallel PCA.
NASA Astrophysics Data System (ADS)
Burgin, Laura; Ekström, Marie; Dessai, Suraje
2017-07-01
Bluetongue, an economically important animal disease, can be spread over long distances by carriage of insect vectors ( Culicoides biting midges) on the wind. The weather conditions which influence the midge's flight are controlled by synoptic scale atmospheric circulations. A method is proposed that links wind-borne dispersion of the insects to synoptic circulation through the use of a dispersion model in combination with principal component analysis (PCA) and cluster analysis. We illustrate how to identify the main synoptic situations present during times of midge incursions into the UK from the European continent. A PCA was conducted on high-pass-filtered mean sea-level pressure data for a domain centred over north-west Europe from 2005 to 2007. A clustering algorithm applied to the PCA scores indicated the data should be divided into five classes for which averages were calculated, providing a classification of the main synoptic types present. Midge incursion events were found to mainly occur in two synoptic categories; 64.8% were associated with a pattern displaying a pressure gradient over the North Atlantic leading to moderate south-westerly flow over the UK and 17.9% of the events occurred when high pressure dominated the region leading to south-easterly or easterly winds. The winds indicated by the pressure maps generally compared well against observations from a surface station and analysis charts. This technique could be used to assess frequency and timings of incursions of virus into new areas on seasonal and decadal timescales, currently not possible with other dispersion or biological modelling methods.
Wang, Chaolong; Zöllner, Sebastian; Rosenberg, Noah A.
2012-01-01
Multivariate statistical techniques such as principal components analysis (PCA) and multidimensional scaling (MDS) have been widely used to summarize the structure of human genetic variation, often in easily visualized two-dimensional maps. Many recent studies have reported similarity between geographic maps of population locations and MDS or PCA maps of genetic variation inferred from single-nucleotide polymorphisms (SNPs). However, this similarity has been evident primarily in a qualitative sense; and, because different multivariate techniques and marker sets have been used in different studies, it has not been possible to formally compare genetic variation datasets in terms of their levels of similarity with geography. In this study, using genome-wide SNP data from 128 populations worldwide, we perform a systematic analysis to quantitatively evaluate the similarity of genes and geography in different geographic regions. For each of a series of regions, we apply a Procrustes analysis approach to find an optimal transformation that maximizes the similarity between PCA maps of genetic variation and geographic maps of population locations. We consider examples in Europe, Sub-Saharan Africa, Asia, East Asia, and Central/South Asia, as well as in a worldwide sample, finding that significant similarity between genes and geography exists in general at different geographic levels. The similarity is highest in our examples for Asia and, once highly distinctive populations have been removed, Sub-Saharan Africa. Our results provide a quantitative assessment of the geographic structure of human genetic variation worldwide, supporting the view that geography plays a strong role in giving rise to human population structure. PMID:22927824
Wang, Chaolong; Zöllner, Sebastian; Rosenberg, Noah A
2012-08-01
Multivariate statistical techniques such as principal components analysis (PCA) and multidimensional scaling (MDS) have been widely used to summarize the structure of human genetic variation, often in easily visualized two-dimensional maps. Many recent studies have reported similarity between geographic maps of population locations and MDS or PCA maps of genetic variation inferred from single-nucleotide polymorphisms (SNPs). However, this similarity has been evident primarily in a qualitative sense; and, because different multivariate techniques and marker sets have been used in different studies, it has not been possible to formally compare genetic variation datasets in terms of their levels of similarity with geography. In this study, using genome-wide SNP data from 128 populations worldwide, we perform a systematic analysis to quantitatively evaluate the similarity of genes and geography in different geographic regions. For each of a series of regions, we apply a Procrustes analysis approach to find an optimal transformation that maximizes the similarity between PCA maps of genetic variation and geographic maps of population locations. We consider examples in Europe, Sub-Saharan Africa, Asia, East Asia, and Central/South Asia, as well as in a worldwide sample, finding that significant similarity between genes and geography exists in general at different geographic levels. The similarity is highest in our examples for Asia and, once highly distinctive populations have been removed, Sub-Saharan Africa. Our results provide a quantitative assessment of the geographic structure of human genetic variation worldwide, supporting the view that geography plays a strong role in giving rise to human population structure.
Rossi, Gabriela Barbosa; Valentim-Neto, Pedro Alexandre; Blank, Martina; Faria, Josias Correa de; Arisi, Ana Carolina Maisonnave
2017-08-30
Common bean (Phaseolus vulgaris L.) is a source of proteins for about one billion people worldwide. In Brazil, 'BRS Sublime', 'BRS Vereda', 'BRS Esteio', and 'BRS Estilo' cultivars were developed by Embrapa to offer high yield to farmers and excellent quality to final consumers. In this work, grain proteomes of these common bean cultivars were compared based on two-dimensional gel electrophoresis (2-DE) and tandem mass spectrometry (MS/MS). Principal component analysis (PCA) was applied to compare 349 matched spots in these cultivars proteomes, and all cultivars were clearly separated in PCA plot. Thirty-two differentially accumulated proteins were identified by MS. Storage proteins such as phaseolins, legumins, and lectins were the most abundant, and novel proteins were also identified. We have built a useful platform that could be used to analyze other Brazilian cultivars and genotypes of common beans.
Has your ancient stamp been regummed with synthetic glue? A FT-NIR and FT-Raman study.
Simonetti, Remo; Oliveri, Paolo; Henry, Adrien; Duponchel, Ludovic; Lanteri, Silvia
2016-01-01
The potential of FT-NIR and FT-Raman spectroscopies to characterise the gum applied on the backside of ancient stamps was investigated for the first time. This represents a very critical issue for the collectors' market, since gum conditions heavily influence stamp quotations, and fraudulent application of synthetic gum onto damaged stamp backsides to increase their desirability is a well-documented practice. Spectral data were processed by exploratory pattern recognition tools. In particular, application of principal component analysis (PCA) revealed that both of the spectroscopic techniques provide information useful to characterise stamp gum. Examination of PCA loadings and their chemical interpretation confirmed the robustness of the outcomes. Fusion of FT-NIR and FT-Raman spectral data was performed, following both a low-level and a mid-level procedure. The results were critically compared with those obtained separately for the two spectroscopic techniques. Copyright © 2015 Elsevier B.V. All rights reserved.
Low-contrast underwater living fish recognition using PCANet
NASA Astrophysics Data System (ADS)
Sun, Xin; Yang, Jianping; Wang, Changgang; Dong, Junyu; Wang, Xinhua
2018-04-01
Quantitative and statistical analysis of ocean creatures is critical to ecological and environmental studies. And living fish recognition is one of the most essential requirements for fishery industry. However, light attenuation and scattering phenomenon are present in the underwater environment, which makes underwater images low-contrast and blurry. This paper tries to design a robust framework for accurate fish recognition. The framework introduces a two stage PCA Network to extract abstract features from fish images. On a real-world fish recognition dataset, we use a linear SVM classifier and set penalty coefficients to conquer data unbalanced issue. Feature visualization results show that our method can avoid the feature distortion in boundary regions of underwater image. Experiments results show that the PCA Network can extract discriminate features and achieve promising recognition accuracy. The framework improves the recognition accuracy of underwater living fishes and can be easily applied to marine fishery industry.
Eigenvectors of optimal color spectra.
Flinkman, Mika; Laamanen, Hannu; Tuomela, Jukka; Vahimaa, Pasi; Hauta-Kasari, Markku
2013-09-01
Principal component analysis (PCA) and weighted PCA were applied to spectra of optimal colors belonging to the outer surface of the object-color solid or to so-called MacAdam limits. The correlation matrix formed from this data is a circulant matrix whose biggest eigenvalue is simple and the corresponding eigenvector is constant. All other eigenvalues are double, and the eigenvectors can be expressed with trigonometric functions. Found trigonometric functions can be used as a general basis to reconstruct all possible smooth reflectance spectra. When the spectral data are weighted with an appropriate weight function, the essential part of the color information is compressed to the first three components and the shapes of the first three eigenvectors correspond to one achromatic response function and to two chromatic response functions, the latter corresponding approximately to Munsell opponent-hue directions 9YR-9B and 2BG-2R.
RECENT APPLICATIONS OF SOURCE APPORTIONMENT METHODS AND RELATED NEEDS
Traditional receptor modeling studies have utilized factor analysis (like principal component analysis, PCA) and/or Chemical Mass Balance (CMB) to assess source influences. The limitations with these approaches is that PCA is qualitative and CMB requires the input of source pr...
Kernel Principal Component Analysis for dimensionality reduction in fMRI-based diagnosis of ADHD.
Sidhu, Gagan S; Asgarian, Nasimeh; Greiner, Russell; Brown, Matthew R G
2012-01-01
This study explored various feature extraction methods for use in automated diagnosis of Attention-Deficit Hyperactivity Disorder (ADHD) from functional Magnetic Resonance Image (fMRI) data. Each participant's data consisted of a resting state fMRI scan as well as phenotypic data (age, gender, handedness, IQ, and site of scanning) from the ADHD-200 dataset. We used machine learning techniques to produce support vector machine (SVM) classifiers that attempted to differentiate between (1) all ADHD patients vs. healthy controls and (2) ADHD combined (ADHD-c) type vs. ADHD inattentive (ADHD-i) type vs. controls. In different tests, we used only the phenotypic data, only the imaging data, or else both the phenotypic and imaging data. For feature extraction on fMRI data, we tested the Fast Fourier Transform (FFT), different variants of Principal Component Analysis (PCA), and combinations of FFT and PCA. PCA variants included PCA over time (PCA-t), PCA over space and time (PCA-st), and kernelized PCA (kPCA-st). Baseline chance accuracy was 64.2% produced by guessing healthy control (the majority class) for all participants. Using only phenotypic data produced 72.9% accuracy on two class diagnosis and 66.8% on three class diagnosis. Diagnosis using only imaging data did not perform as well as phenotypic-only approaches. Using both phenotypic and imaging data with combined FFT and kPCA-st feature extraction yielded accuracies of 76.0% on two class diagnosis and 68.6% on three class diagnosis-better than phenotypic-only approaches. Our results demonstrate the potential of using FFT and kPCA-st with resting-state fMRI data as well as phenotypic data for automated diagnosis of ADHD. These results are encouraging given known challenges of learning ADHD diagnostic classifiers using the ADHD-200 dataset (see Brown et al., 2012).
Liu, Chang; Liu, Shi-Liang; Wang, Zhi-Xian; Yu, Kai; Feng, Chun-Xiang; Ke, Zan; Wang, Liang; Zeng, Xiao-Yong
2018-04-13
Prostate cancer (PCa) is one of the most common cancers among men globally. The authors aimed to evaluate the ability of the Prostate Imaging Reporting and Data System version 2 (PI-RADS v2) to classify men with PCa, clinically significant PCa (CSPCa), or no PCa, especially among those with serum total prostate-specific antigen (tPSA) levels in the "gray zone" (4-10 ng ml -1 ). A total of 308 patients (355 lesions) were enrolled in this study. Diagnostic efficiency was determined. Univariate and multivariate analyses, receiver operating characteristic curve analysis, and decision curve analysis were performed to determine and compare the predictors of PCa and CSPCa. The results suggested that PI-RADS v2, tPSA, and prostate-specific antigen density (PSAD) were independent predictors of PCa and CSPCa. A PI-RADS v2 score ≥4 provided high negative predictive values (91.39% for PCa and 95.69% for CSPCa). A model of PI-RADS combined with PSA and PSAD helped to define a high-risk group (PI-RADS score = 5 and PSAD ≥0.15 ng ml -1 cm -3 , with tPSA in the gray zone, or PI-RADS score ≥4 with high tPSA level) with a detection rate of 96.1% for PCa and 93.0% for CSPCa while a low-risk group with a detection rate of 6.1% for PCa and 2.2% for CSPCa. It was concluded that the PI-RADS v2 could be used as a reliable and independent predictor of PCa and CSPCa. The combination of PI-RADS v2 score with PSA and PSAD could be helpful in the prediction and diagnosis of PCa and CSPCa and, thus, may help in preventing unnecessary invasive procedures.
Tang, Lu; Li, Xintao; Wang, Baojun; Luo, Guoxiong; Gu, Liangyou; Chen, Luyao; Liu, Kan; Gao, Yu; Zhang, Xu
2016-01-01
Increasing evidence suggests that inflammation plays an essential role in cancer development and progression. The inflammation marker neutrophil-lymphocyte ratio (NLR) is correlated with prognosis across a wide variety of tumor types, but its prognostic value in prostate cancer (PCa) remains controversial. In the present meta-analysis, the prognostic value of NLR in PCa patients is investigated. We performed a meta-analysis to determine the predictive value of NLR for overall survival (OS), recurrence-free survival (RFS), and clinical features in patients with PCa. We systematically searched PubMed, ISI Web of Science, and Embase for relevant studies published up to October 2015. A total of 9418 patients from 18 studies were included in the meta-analysis. Elevated pretreatment NLR predicted poor OS (HR 1.628, 95% CI 1.410-1.879) and RFS (HR 1.357, 95% CI 1.126-1.636) in all patients with PCa. However, NLR was insignificantly associated with OS in the subgroup of patients with localized PCa (HR 1.439, 95% CI 0.753-2.75). Increased NLR was also significantly correlated with lymph node involvement (OR 1.616, 95% CI 1.167-2.239) but not with pathological stage (OR 0.827, 95% CI 0.637-1.074) or Gleason score (OR 0.761, 95% CI 0.555-1.044). The present meta-analysis indicated that NLR could predict the prognosis for patients with locally advanced or castration-resistant PCa. Patients with higher NLR are more likely to have poorer prognosis than those with lower NLR.
The Burden of Urinary Incontinence and Urinary Bother Among Elderly Prostate Cancer Survivors
Kopp, Ryan P.; Marshall, Lynn M.; Wang, Patty Y.; Bauer, Douglas C.; Barrett-Connor, Elizabeth; Parsons, J. Kellogg
2014-01-01
Background Data describing urinary health in elderly, community-dwelling prostate cancer (PCa) survivors are limited. Objective To elucidate the prevalence of lower urinary tract symptoms, urinary bother, and incontinence in elderly PCa survivors compared with peers without PCa. Design, setting, and participants A cross-sectional analysis of 5990 participants in the Osteoporotic Fractures in Men Research Group, a cohort study of community-dwelling men ≥65 yr. Outcome measurements and statistical analysis We characterized urinary health using self-reported urinary incontinence and the American Urological Association Symptom Index (AUA-SI). We compared urinary health measures according to type of PCa treatment in men with PCa and men without PCa using multivariate log-binomial regression to generate prevalence ratios (PRs). Results and limitations At baseline, 706 men (12%) reported a history of PCa, with a median time since diagnosis of 6.3 yr. Of these men, 426 (60%) reported urinary incontinence. In adjusted analyses, observation (PR: 1.92; 95% confidence interval [CI], 1.15–3.21; p = 0.01), surgery (PR: 4.68; 95% CI, 4.11–5.32; p < 0.0001), radiation therapy (PR: 1.64; 95% CI, 1.20– 2.23; p = 0.002), and androgen-deprivation therapy (ADT) (PR: 2.01; 95% CI, 1.35–2.99; p = 0.0006) were each associated with daily incontinence. Daily incontinence risk increased with time since diagnosis independently of age. Observation (PR: 1.33; 95% CI, 1.00–1.78; p = 0.05), surgery (PR: 1.25; 95% CI, 1.10–1.42; p = 0.0008), and ADT (PR: 1.50; 95% CI, 1.26–1.79; p < 0.0001) were associated with increased AUA-SI bother scores. Cancer stage and use of adjuvant or salvage therapies were not available for analysis. Conclusions Compared with their peers without PCa, elderly PCa survivors had a two-fold to five-fold greater prevalence of urinary incontinence, which rose with increasing survivorship duration. Observation, surgery, and ADT were each associated with increased urinary bother. These data suggest a substantially greater burden of urinary health problems among elderly PCa survivors than previously recognized. PMID:23587870
Howe, Laura D; Hargreaves, James R; Huttly, Sharon RA
2008-01-01
Background Epidemiological studies often require measures of socio-economic position (SEP). The application of principal components analysis (PCA) to data on asset-ownership is one popular approach to household SEP measurement. Proponents suggest that the approach provides a rational method for weighting asset data in a single indicator, captures the most important aspect of SEP for health studies, and is based on data that are readily available and/or simple to collect. However, the use of PCA on asset data may not be the best approach to SEP measurement. There remains concern that this approach can obscure the meaning of the final index and is statistically inappropriate for use with discrete data. In addition, the choice of assets to include and the level of agreement between wealth indices and more conventional measures of SEP such as consumption expenditure remain unclear. We discuss these issues, illustrating our examples with data from the Malawi Integrated Household Survey 2004–5. Methods Wealth indices were constructed using the assets on which data are collected within Demographic and Health Surveys. Indices were constructed using five weighting methods: PCA, PCA using dichotomised versions of categorical variables, equal weights, weights equal to the inverse of the proportion of households owning the item, and Multiple Correspondence Analysis. Agreement between indices was assessed. Indices were compared with per capita consumption expenditure, and the difference in agreement assessed when different methods were used to adjust consumption expenditure for household size and composition. Results All indices demonstrated similarly modest agreement with consumption expenditure. The indices constructed using dichotomised data showed strong agreement with each other, as did the indices constructed using categorical data. Agreement was lower between indices using data coded in different ways. The level of agreement between wealth indices and consumption expenditure did not differ when different consumption equivalence scales were applied. Conclusion This study questions the appropriateness of wealth indices as proxies for consumption expenditure. The choice of data included had a greater influence on the wealth index than the method used to weight the data. Despite the limitations of PCA, alternative methods also all had disadvantages. PMID:18234082
Kolosowski, Kamil P; Sodhi, Rana N S; Kishen, Anil; Basrani, Bettina R
2014-12-01
Interaction of sodium hypochlorite (NaOCl) mixed with chlorhexidine (CHX) produces a brown precipitate containing para-chloroaniline (PCA). When QMiX is mixed with NaOCl, no precipitate forms, but color change occurs. The aim of this study was to qualitatively assess the formation of precipitate and PCA on the surface and in the tubules of dentin irrigated with NaOCl, followed either by EDTA, NaOCl, and CHX or by saline and QMiX by using time-of-flight secondary ion mass spectrometry (TOF-SIMS). Dentin blocks were obtained from human maxillary molars, embedded in resin, and cross-sectioned to expose dentin. Specimens in group 1 were immersed in 2.5% NaOCl, followed by 17% EDTA, 2.5% NaOCl, and 2% CHX. Specimens in group 2 were immersed in 2.5% NaOCl, followed by saline and QMiX. The dentin surfaces were subjected to TOF-SIMS spectra analysis. Longitudinal sections of dentin blocks were then exposed and subjected to TOF-SIMS analysis. All samples and analysis were performed in triplicate for confirmation. TOF-SIMS analysis of group 1 revealed an irregular precipitate, containing PCA and CHX breakdown products, on the dentin surfaces, occluding and extending into the tubules. In TOF-SIMS analysis of group 2, no precipitates, including PCA, were detected on the dentin surface or in the tubules. Within the limitations of this study, precipitate containing PCA was formed in the tubules of dentin irrigated with NaOCl followed by CHX. No precipitates or PCA were detected in the tubules of dentin irrigated with NaOCl followed by saline and QMiX. Copyright © 2014 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.
Evaluation of FTIR spectroscopy as diagnostic tool for colorectal cancer using spectral analysis
NASA Astrophysics Data System (ADS)
Dong, Liu; Sun, Xuejun; Chao, Zhang; Zhang, Shiyun; Zheng, Jianbao; Gurung, Rajendra; Du, Junkai; Shi, Jingsen; Xu, Yizhuang; Zhang, Yuanfu; Wu, Jinguang
2014-03-01
The aim of this study is to confirm FTIR spectroscopy as a diagnostic tool for colorectal cancer. 180 freshly removed colorectal samples were collected from 90 patients for spectrum analysis. The ratios of spectral intensity and relative intensity (/I1460) were calculated. Principal component analysis (PCA) and Fisher's discriminant analysis (FDA) were applied to distinguish the malignant from normal. The FTIR parameters of colorectal cancer and normal tissues were distinguished due to the contents or configurations of nucleic acids, proteins, lipids and carbohydrates. Related to nitrogen containing, water, protein and nucleic acid were increased significantly in the malignant group. Six parameters were selected as independent factors to perform discriminant functions. The sensitivity for FTIR in diagnosing colorectal cancer was 96.6% by discriminant analysis. Our study demonstrates that FTIR can be a useful technique for detection of colorectal cancer and may be applied in clinical colorectal cancer diagnosis.
Mjørud, Marit; Kirkevold, Marit; Røsvik, Janne; Engedal, Knut
2014-01-01
To investigate which factors the Quality of Life in Late-Stage Dementia (QUALID) scale holds when used among people with dementia (pwd) in nursing homes and to find out how the symptom load varies across the different severity levels of dementia. We included 661 pwd [mean age ± SD, 85.3 ± 8.6 years; 71.4% women]. The QUALID and the Clinical Dementia Rating (CDR) scale were applied. A principal component analysis (PCA) with varimax rotation and Kaiser normalization was applied to test the factor structure. Nonparametric analyses were applied to examine differences of symptom load across the three CDR groups. The mean QUALID score was 21.5 (±7.1), and the CDR scores of the three groups were 1 in 22.5%, 2 in 33.6% and 3 in 43.9%. The results of the statistical measures employed were the following: Crohnbach's α of QUALID, 0.74; Bartlett's test of sphericity, p <0.001; the Kaiser-Meyer-Olkin measure, 0.77. The PCA analysis resulted in three components accounting for 53% of the variance. The first component was 'tension' ('facial expression of discomfort', 'appears physically uncomfortable', 'verbalization suggests discomfort', 'being irritable and aggressive', 'appears calm', Crohnbach's α = 0.69), the second was 'well-being' ('smiles', 'enjoys eating', 'enjoys touching/being touched', 'enjoys social interaction', Crohnbach's α = 0.62) and the third was 'sadness' ('appears sad', 'cries', 'facial expression of discomfort', Crohnbach's α 0.65). The mean score on the components 'tension' and 'well-being' increased significantly with increasing severity levels of dementia. Three components of quality of life (qol) were identified. Qol decreased with increasing severity of dementia. © 2013 S. Karger AG, Basel.
ERIC Educational Resources Information Center
Su, Chung-Ho; Cheng, Ching-Hsue
2016-01-01
This study aims to explore the factors in a patient's rehabilitation achievement after a total knee replacement (TKR) patient exercises, using a PCA-ANFIS emotion model-based game rehabilitation system, which combines virtual reality (VR) and motion capture technology. The researchers combine a principal component analysis (PCA) and an adaptive…
Tsatsishvili, Valeri; Burunat, Iballa; Cong, Fengyu; Toiviainen, Petri; Alluri, Vinoo; Ristaniemi, Tapani
2018-06-01
There has been growing interest towards naturalistic neuroimaging experiments, which deepen our understanding of how human brain processes and integrates incoming streams of multifaceted sensory information, as commonly occurs in real world. Music is a good example of such complex continuous phenomenon. In a few recent fMRI studies examining neural correlates of music in continuous listening settings, multiple perceptual attributes of music stimulus were represented by a set of high-level features, produced as the linear combination of the acoustic descriptors computationally extracted from the stimulus audio. NEW METHOD: fMRI data from naturalistic music listening experiment were employed here. Kernel principal component analysis (KPCA) was applied to acoustic descriptors extracted from the stimulus audio to generate a set of nonlinear stimulus features. Subsequently, perceptual and neural correlates of the generated high-level features were examined. The generated features captured musical percepts that were hidden from the linear PCA features, namely Rhythmic Complexity and Event Synchronicity. Neural correlates of the new features revealed activations associated to processing of complex rhythms, including auditory, motor, and frontal areas. Results were compared with the findings in the previously published study, which analyzed the same fMRI data but applied linear PCA for generating stimulus features. To enable comparison of the results, methodology for finding stimulus-driven functional maps was adopted from the previous study. Exploiting nonlinear relationships among acoustic descriptors can lead to the novel high-level stimulus features, which can in turn reveal new brain structures involved in music processing. Copyright © 2018 Elsevier B.V. All rights reserved.
Na, Rong; Zheng, S. Lilly; Han, Misop; Yu, Hongjie; Jiang, Deke; Shah, Sameep; Ewing, Charles M.; Zhang, Liti; Novakovic, Kristian; Petkewicz, Jacqueline; Gulukota, Kamalakar; Helseth, Donald L.; Quinn, Margo; Humphries, Elizabeth; Wiley, Kathleen E.; Isaacs, Sarah D.; Wu, Yishuo; Liu, Xu; Zhang, Ning; Wang, Chi-Hsiung; Khandekar, Janardan; Hulick, Peter J.; Shevrin, Daniel H.; Cooney, Kathleen A.; Shen, Zhoujun; Partin, Alan W.; Carter, H. Ballentine; Carducci, Michael A.; Eisenberger, Mario A.; Denmeade, Sam R.; McGuire, Michael; Walsh, Patrick C.; Helfand, Brian T.; Brendler, Charles B.; Ding, Qiang; Xu, Jianfeng; Isaacs, William B.
2017-01-01
Background Germline mutations in BRCA1/2 and ATM have been associated with prostate cancer (PCa) risk. Objective To directly assess whether germline mutations in these three genes distinguish lethal from indolent PCa and whether they confer any effect on age at death. Design, setting, and participants A retrospective case-case study of 313 patients who died of PCa and 486 patients with low-risk localized PCa of European, African, and Chinese descent. Germline DNA of each of the 799 patients was sequenced for these three genes. Outcome measurements and statistical analysis Mutation carrier rates and their effect on lethal PCa were analyzed using the Fisher’s exact test and Cox regression analysis, respectively. Results and limitations The combined BRCA1/2 and ATM mutation carrier rate was significantly higher in lethal PCa patients (6.07%) than localized PCa patients (1.44%), p = 0.0007. The rate also differed significantly among lethal PCa patients as a function of age at death (10.00%, 9.08%, 8.33%, 4.94%, and 2.97% in patients who died ≤60 yr, 61–65 yr, 66–70 yr, 71–75 yr, and over 75 yr, respectively, p = 0.046) and time to death after diagnosis (12.26%, 4.76%, and 0.98% in patients who died ≤5 yr, 6–10 yr, and > 10 yr after a PCa diagnosis, respectively, p = 0.0006). Survival analysis in the entire cohort revealed mutation carriers remained an independent predictor of lethal PCa after adjusting for race and age, prostate-specific antigen, and Gleason score at the time of diagnosis (hazard ratio = 2.13, 95% confidence interval: 1.24–3.66, p = 0.004). A limitation of this study is that other DNA repair genes were not analyzed. Conclusions Mutation status of BRCA1/2 and ATM distinguishes risk for lethal and indolent PCa and is associated with earlier age at death and shorter survival time. Patient summary Prostate cancer patients with inherited mutations in BRCA1/2 and ATM are more likely to die of prostate cancer and do so at an earlier age. PMID:27989354
Thomas, Lynn N; Merrimen, Jennifer; Bell, David G; Rendon, Ricardo; Too, Catherine K L
2015-11-01
Carboxypeptidase-D (CPD) cleaves C-terminal arginine for conversion to nitric oxide (NO) by nitric oxide synthase (NOS). Prolactin (PRL) and androgens stimulate CPD gene transcription and expression, which increases intracellular production of NO to promote viability of prostate cancer (PCa) cells in vitro. The current study evaluated whether hormonal upregulation of CPD and NO promote PCa cell viabilty in vivo, by correlating changes in expression of CPD and nitrotyrosine residues (products of NO action) with proliferation marker Ki67 and associated proteins during PCa development and progression. Fresh prostate tissues, obtained from 40 men with benign prostatic hyperplasia (BPH) or PCa, were flash-frozen at the time of surgery and used for RT-qPCR analysis of CPD, androgen receptor (AR), PRL receptor (PRLR), eNOS, and Ki67 levels. Archival paraffin-embedded tissues from 113 men with BPH or PCa were used for immunohistochemical (IHC) analysis of CPD, nitrotyrosines, phospho-Stat5 (for activated PRLR), AR, eNOS/iNOS, and Ki67. RT-qPCR and IHC analyses showed strong AR and PRLR expression in benign and malignant prostates. CPD mRNA levels increased ∼threefold in PCa compared to BPH, which corresponded to a twofold increase in Ki67 mRNA levels. IHC analysis showed a progressive increase in CPD from 11.4 ± 2.1% in benign to 21.8 ± 3.2% in low-grade (P = 0.007), 40.7 ± 4.0% in high-grade (P < 0.0001) and 50.0 ± 9.5% in castration-recurrent PCa (P < 0.0001). Immunostaining for nitrotyrosines and Ki67 mirrored these increases during PCa progression. CPD, nitrotyrosines, and Ki67 tended to co-localize, as did phospho-Stat5. CPD, nitrotyrosine, and Ki67 levels were higher in PCa than in benign and tended to co-localize, along with phospho-Stat5. The strong correlation in expression of these proteins in benign and malignant prostate tissues, combined with abundant AR and PRLR, supports in vitro evidence that the CPD-Arg-NO pathway is involved in the regulation of PCa cell proliferation. It further highlights a role for PRL in the development and progression of PCa. © 2015 Wiley Periodicals, Inc.
NASA Astrophysics Data System (ADS)
Milev, M.; Nikolova, Kr.; Ivanova, Ir.; Dobreva, M.
2015-11-01
25 olive oils were studied- different in origin and ways of extraction, in accordance with 17 physico-chemical parameters as follows: color parameters - a and b, light, fluorescence peaks, pigments - chlorophyll and β-carotene, fatty-acid content. The goals of the current study were: Conducting correlation analysis to find the inner relation between the studied indices; By applying factor analysis with the help of the method of Principal Components (PCA), to reduce the great number of variables into a few factors, which are of main importance for distinguishing the different types of olive oil;Using K-means cluster to compare and group the tested types olive oils based on their similarity. The inner relation between the studied indices was found by applying correlation analysis. A factor analysis using PCA was applied on the basis of the found correlation matrix. Thus the number of the studied indices was reduced to 4 factors, which explained 79.3% from the entire variation. The first one unified the color parameters, β-carotene and the related with oxidative products fluorescence peak - about 520 nm. The second one was determined mainly by the chlorophyll content and related to it fluorescence peak - about 670 nm. The third and the fourth factors were determined by the fatty-acid content of the samples. The third one unified the fatty-acids, which give us the opportunity to distinguish olive oil from the other plant oils - oleic, linoleic and stearin acids. The fourth factor included fatty-acids with relatively much lower content in the studied samples. It is enquired the number of clusters to be determined preliminary in order to apply the K-Cluster analysis. The variant K = 3 was worked out because the types of the olive oil were three. The first cluster unified all salad and pomace olive oils, the second unified the samples of extra virgin oilstaken as controls from producers, which were bought from the trade network. The third cluster unified samples from pomace and extra virgin oils, which distinguish one from another in accordance with their parameters from the natural olive oils, because of presence of plant oils impurities.
An incidence model of the cost of advanced prostate cancer in Spain.
Hart, W M; Nazir, J; Baskin-Bey, E
2014-02-01
Prostate cancer (PCa) is the second leading cancer diagnosed among men. In Spain the incidence of PCa was 70.75 cases per 100,000 males. Advanced PCa has spread outside of the prostate capsule and may involve other parts of the body. The aim of this study was to estimate the lifetime costs of a cohort of advanced PCa patients diagnosed in Spain in 2012. A partitioned economic model was developed in EXCEL incorporating Spanish incidence, mortality, and cost data supplemented with data from the international literature. Progression from Stage III to Stage IV was permitted. Costs were discounted at the standard rate of 3%. Lifetime costs were presented on an individual basis and for the entire cohort of newly diagnosed Stage III and Stage IV PCa patients. Lifetime costs for advanced PCa were ∼€19,961 per patient (mean survival of 8.4 years). Using the projected incident cases for 2012 (3047), the total cost for the incident cohort of patients in 2012 would amount to €61 million. These results were more sensitive to changes in the ongoing costs (post-initial 12 months) of Stage III PCa, the rate of progression from Stage III to Stage IV, and the discount rate applied to costs. This study provides an estimate of the lifetime costs of advanced PCa in Spain and a framework for further research. The study is limited by the availability of long-term Spanish data and the need to make inferences from international studies. However, until long-term prospective or observational data do become available in Spain, based on the assumptions, the current results indicate that the burden of advanced PCa in Spain is substantial. Any treatments that could potentially reduce the economic burden of the disease should be of interest to healthcare decision makers.
Carlesi, Serena; Ricci, Marilena; Cucci, Costanza; La Nasa, Jacopo; Lofrumento, Cristiana; Picollo, Marcello; Becucci, Maurizio
2015-07-01
This work explores the application of chemometric techniques to the analysis of lipidic paint binders (i.e., drying oils) by means of Raman and near-infrared spectroscopy. These binders have been widely used by artists throughout history, both individually and in mixtures. We prepared various model samples of the pure binders (linseed, poppy seed, and walnut oils) obtained from different manufacturers. These model samples were left to dry and then characterized by Raman and reflectance near-infrared spectroscopy. Multivariate analysis was performed by applying principal component analysis (PCA) on the first derivative of the corresponding Raman spectra (1800-750 cm(-1)), near-infrared spectra (6000-3900 cm(-1)), and their combination to test whether spectral differences could enable samples to be distinguished on the basis of their composition. The vibrational bands we found most useful to discriminate between the different products we studied are the fundamental ν(C=C) stretching and methylenic stretching and bending combination bands. The results of the multivariate analysis demonstrated the potential of chemometric approaches for characterizing and identifying drying oils, and also for gaining a deeper insight into the aging process. Comparison with high-performance liquid chromatography data was conducted to check the PCA results.
Hadjisolomou, Ekaterini; Stefanidis, Konstantinos; Papatheodorou, George; Papastergiadou, Evanthia
2018-03-19
During the last decades, Mediterranean freshwater ecosystems, especially lakes, have been under severe pressure due to increasing eutrophication and water quality deterioration. In this article, we compared the effectiveness of different data analysis methods by assessing the contribution of environmental parameters to eutrophication processes. For this purpose, principal components analysis (PCA), cluster analysis, and a self-organizing map (SOM) were applied, using water quality data from two transboundary lakes of North Greece. SOM is considered as an advanced and powerful data analysis tool because of its ability to represent complex and nonlinear relationships among multivariate data sets. The results of PCA and cluster analysis agreed with the SOM results, although the latter provided more information because of the visualization abilities regarding the parameters' relationships. Besides nutrients that were found to be a key factor for controlling chlorophyll-a (Chl - a), water temperature was related positively with algal production, while the Secchi disk depth parameter was found to be highly important and negatively related toeutrophic conditions. In general, the SOM results were more specific and allowed direct associations between the water quality variables. Our work showed that SOMs can be used effectively in limnological studies to produce robust and interpretable results, aiding scientists and managers to cope with environmental problems such as eutrophication.
Free-energy landscape of RNA hairpins constructed via dihedral angle principal component analysis.
Riccardi, Laura; Nguyen, Phuong H; Stock, Gerhard
2009-12-31
To systematically construct a low-dimensional free-energy landscape of RNA systems from a classical molecular dynamics simulation, various versions of the principal component analysis (PCA) are compared: the cPCA using the Cartesian coordinates of all atoms, the dPCA using the sine/cosine-transformed six backbone dihedral angles as well as the glycosidic torsional angle chi and the pseudorotational angle P, the aPCA which ignores the circularity of the 6 + 2 dihedral angles of the RNA, and the dPCA(etatheta), which approximates the 6 backbone dihedral angles by 2 pseudotorsional angles eta and theta. As representative examples, a 10-nucleotide UUCG hairpin and the 36-nucleotide segment SL1 of the Psi site of HIV-1 are studied by classical molecular dynamics simulation, using the Amber all-atom force field and explicit solvent. It is shown that the conformational heterogeneity of the RNA hairpins can only be resolved by an angular PCA such as the dPCA but not by the cPCA using Cartesian coordinates. Apart from possible artifacts due to the coupling of overall and internal motion, this is because the details of hydrogen bonding and stacking interactions but also of global structural rearrangements of the RNA are better discriminated by dihedral angles. In line with recent experiments, it is found that the free energy landscape of RNA hairpins is quite rugged and contains various metastable conformational states which may serve as an intermediate for unfolding.
Exercise and prostate cancer: From basic science to clinical applications.
Campos, Christian; Sotomayor, Paula; Jerez, Daniel; González, Javier; Schmidt, Camila B; Schmidt, Katharina; Banzer, Winfried; Godoy, Alejandro S
2018-06-01
Prostate cancer (PCa) is a disease of increasing medical significance worldwide. In developed countries, PCa is the most common non-skin cancer in men, and one of the leading causes of cancer-related deaths. Exercise is one of the environmental factors that have been shown to influence cancer risk. Moreover, systemic reviews and meta-analysis have suggested that total physical activity is related to a decrease in the risk of developing PCa. In addition, epidemiological studies have shown that exercise, after diagnosis, has benefits regarding PCa development, and positive outcome in patients under treatment. The standard treatment for locally advanced or metastatic PCa is Androgen deprivation therapy (ADT). ADT produces diverse side effects, including loss of libido, changes in body composition (increase abdominal fat), and reduced muscle mass, and muscle tone. Analysis of numerous research publications showed that aerobic and/or resistance training improve patient's physical condition, such us, cardiorespiratory fitness, muscle strength, physical function, body composition, and fatigue. Therefore, exercise might counteract several ADT treatment-induced side effects. In addition of the aforementioned benefits, epidemiological, and in vitro studies have shown that exercise might decrease PCa development. Thus, physical activity might attenuate the risk of PCa and supervised exercise intervention might improve deleterious effects of cancer treatment, such as ADT side effects. This review article provides evidence indicating that exercise could complement, and potentiate, the current standard treatments for advanced PCa, probably by creating an unfavorable microenvironment that can negatively affect tumor development, and progression. © 2018 Wiley Periodicals, Inc.
Classification of Malaysia aromatic rice using multivariate statistical analysis
NASA Astrophysics Data System (ADS)
Abdullah, A. H.; Adom, A. H.; Shakaff, A. Y. Md; Masnan, M. J.; Zakaria, A.; Rahim, N. A.; Omar, O.
2015-05-01
Aromatic rice (Oryza sativa L.) is considered as the best quality premium rice. The varieties are preferred by consumers because of its preference criteria such as shape, colour, distinctive aroma and flavour. The price of aromatic rice is higher than ordinary rice due to its special needed growth condition for instance specific climate and soil. Presently, the aromatic rice quality is identified by using its key elements and isotopic variables. The rice can also be classified via Gas Chromatography Mass Spectrometry (GC-MS) or human sensory panels. However, the uses of human sensory panels have significant drawbacks such as lengthy training time, and prone to fatigue as the number of sample increased and inconsistent. The GC-MS analysis techniques on the other hand, require detailed procedures, lengthy analysis and quite costly. This paper presents the application of in-house developed Electronic Nose (e-nose) to classify new aromatic rice varieties. The e-nose is used to classify the variety of aromatic rice based on the samples odour. The samples were taken from the variety of rice. The instrument utilizes multivariate statistical data analysis, including Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA) and K-Nearest Neighbours (KNN) to classify the unknown rice samples. The Leave-One-Out (LOO) validation approach is applied to evaluate the ability of KNN to perform recognition and classification of the unspecified samples. The visual observation of the PCA and LDA plots of the rice proves that the instrument was able to separate the samples into different clusters accordingly. The results of LDA and KNN with low misclassification error support the above findings and we may conclude that the e-nose is successfully applied to the classification of the aromatic rice varieties.
Prognostic value of transformer 2β expression in prostate cancer.
Diao, Yan; Wu, Dong; Dai, Zhijun; Kang, Huafeng; Wang, Ziming; Wang, Xijing
2015-01-01
Deregulation of transformer 2β (Tra2β) has been implicated in several cancers. However, the role of Tra2β expression in prostate cancer (PCa) is unclear. Therefore, this study was to investigate the expression of Tra2β in PCa and evaluated its association with clinicopathological variables and prognosis. Thirty paired fresh PCa samples were analyzed for Tra2β expression by Western blot analysis. Immunohistochemistry (IHC) assay was performed in 160 PCa samples after radical prostatectomy and adjacent non-cancerous tissues. Tra2β protein expression was divided into high expression group and low expression group by IHC. We also investigated the association of Tra2β expression with clinical and pathologic parameters. Kaplan-Meier plots and Cox proportional hazards regression model were used to analyze the association between Tra2β protein expression and prognosis of PCa patients. Our results showed that Tra2β was significantly upregulated in PCa tissues by western blot and IHC. Our data indicated that high expression of Tra2β was significantly associated with lymph node metastasis (P=0.002), clinical stage (P=0.015), preoperative prostate-specific antigen (P=0.003), Gleason score (P=0.001), and biochemical recurrence (P=0.021). High Tra2β expression was a significant predictor of poor biochemical recurrence free survival and overall survival both in univariate and multivariate analysis. We show that Tra2β was significantly upregulated in PCa patients after radical prostatectomy, and multivariate analysis confirmed Tra2β as an independent prognostic factor.
Dai, Yuanqing; Li, Dongjie; Chen, Xiong; Tan, Xinji; Gu, Jie; Chen, Mingquan; Zhang, Xiaobo
2018-05-25
BACKGROUND In developed countries, prostate cancer (PCa) is a frequently diagnosed cancer with the second highest fatality rate. Circular RNAs (circRNAs) are a class of endogenous non-coding RNAs (ncRNAs) stably expressed in cells and involved in a series of carcinomas. However, few research studies have reported on the role of circRNAs in PCa. MATERIAL AND METHODS We used qRT-PCR to detect the expression of circMYLK (circRNA ID: hsa_circ_0141940) and miR-29a in PCa tissues and cell lines. MTT, colony formation, and TUNEL assays were performed to analysis the cell viability of PCa cells. Transwell and wound scratch assays were performed to investigate the cell invasion and migration of PCa cells. RESULTS In the present study, we confirmed that circMYLK expression level was significantly higher in PCa samples and PCa cells than in normal tissues and normal prostatic cells. The upregulated circRNA-MYLK promoted PCa cells proliferation, invasion, and migration; however, si-circRNA-MYLK significantly accelerated the PCa cell apoptosis. We also observed that the aforementioned function of circRNA-MYLK on PCa cells was affected through targeting miR-29a. CONCLUSIONS We confirmed circRNA-MYLK was an oncogene in PCa and revealed a novel mechanism underlying circRNA-MYLK in PC progression.
Guo, Yuehua; Qu, Shuxin; Lu, Xiong; Xie, Haodong; Zhang, Hongping; Weng, Jie
2010-07-01
The aim of this study is to investigate the interaction between dicalcium phosphate dihydrate (CaHPO(4) x 2H(2)O, DCPD) and Protocatechuic aldehyde (C(7)H(6)O(3), Pca), which is the water-soluble constituents of Chinese Medicine, Salvia Miltiorrhiza Bunge (SMB), by calculating the absorption energy through molecular dynamics simulation. Furthermore, the effects of functional groups of Pca and temperature on Pca adsorbed by DCPD are calculated respectively. DCPD/Pca and DCPD were analyzed by X-ray diffraction (XRD), Fourier transform infrared spectroscopy (FTIR) and thermogravimetric analysis (TG). The simulation results showed that Pca mostly absorbed on the (0 2 0) surface of DCPD. The aldehyde group of Pca played a moren important role on the adsorption of Pca on DCPD than hydroxyl did, while temperature had no distinct effects on the adsorption. XRD results indicated that Pca induced the preferential growth of (0 2 0) crystal surface in DCPC/Pca whereas it had no influence on the crystal structure, the crystallinity and grain size of DCPD. FTIR and TG results showed that the characteristic peak of Pca was at 1295 cm(-1) and the content of Pca in DCPD was 16%, respectively. The present results show that molecular dynamics simulation is a very effective and complementary method to study the interaction between materials and medicine.
Alves, Júnia de O; Neto, Waldomiro B; Mitsutake, Hery; Alves, Paulo S P; Augusti, Rodinei
2010-07-15
Extra virgin (EV), the finest and most expensive among all the olive oil grades, is often adulterated by the cheapest and lowest quality ordinary (ON) olive oil. A new methodology is described herein that provides a simple, rapid, and accurate way not only to detect such type of adulteration, but also to distinguish between these olive oil grades (EV and ON). This approach is based on the application of direct infusion electrospray ionization mass spectrometry in the positive ion mode, ESI(+)-MS, followed by the treatment of the MS data via exploratory statistical approaches, PCA (principal component analysis) and HCA (hierarchical clustering analysis). Ten distinct brands of each EV and ON olive oil, acquired at local stores, were analyzed by ESI(+)-MS and the results from HCA and PCA clearly indicated the formation of two distinct groups related to these two categories. For the adulteration study, one brand of each olive oil grade (EV and ON) was selected. The counterfeit samples (a total of 20) were then prepared by adding assorted proportions, from 1 to 20% w/w, with increments of 1% w/w, of the ON to the EV olive oil. The PCA and HCA methodologies, applied to the ESI(+)-MS data from the counterfeit (20) and authentic (10) EV samples, were able to readily detect adulteration, even at levels as low as 1% w/w. Copyright 2010 John Wiley & Sons, Ltd.
Roudier, Martine P; Winters, Brian R; Coleman, Ilsa; Lam, Hung-Ming; Zhang, Xiaotun; Coleman, Roger; Chéry, Lisly; True, Lawrence D; Higano, Celestia S; Montgomery, Bruce; Lange, Paul H; Snyder, Linda A; Srivastava, Shiv; Corey, Eva; Vessella, Robert L; Nelson, Peter S; Üren, Aykut; Morrissey, Colm
2016-06-01
The TMPRSS2-ERG gene fusion is detected in approximately half of primary prostate cancers (PCa) yet the prognostic significance remains unclear. We hypothesized that ERG promotes the expression of common genes in primary PCa and metastatic castration-resistant PCa (CRPC), with the objective of identifying ERG-associated pathways, which may promote the transition from primary PCa to CRPC. We constructed tissue microarrays (TMA) from 127 radical prostatectomy specimens, 20 LuCaP patient-derived xenografts (PDX), and 152 CRPC metastases obtained immediately at time of death. Nuclear ERG was assessed by immunohistochemistry (IHC). To characterize the molecular features of ERG-expressing PCa, a subset of IHC confirmed ERG+ or ERG- specimens including 11 radical prostatectomies, 20 LuCaP PDXs, and 45 CRPC metastases underwent gene expression analysis. Genes were ranked based on expression in primary PCa and CRPC. Common genes of interest were targeted for IHC analysis and expression compared with biochemical recurrence (BCR) status. IHC revealed that 43% of primary PCa, 35% of the LuCaP PDXs, and 18% of the CRPC metastases were ERG+ (12 of 48 patients [25%] had at least one ERG+ metastasis). Based on gene expression data and previous literature, two proteins involved in calcium signaling (NCALD, CACNA1D), a protein involved in inflammation (HLA-DMB), CD3 positive immune cells, and a novel ERG-associated protein, DCLK1 were evaluated in primary PCa and CRPC metastases. In ERG+ primary PCa, a weak association was seen with NCALD and CACNA1D protein expression. HLA-DMB association with ERG was decreased and CD3 cell number association with ERG was changed from positive to negative in CRPC metastases compared to primary PCa. DCLK1 was upregulated at the protein level in unpaired ERG+ primary PCa and CRPC metastases (P = 0.0013 and P < 0.0001, respectively). In primary PCa, ERG status or expression of targeted proteins was not associated with BCR-free survival. However, for primary PCa, ERG+DCLK1+ patients exhibited shorter time to BCR (P = 0.06) compared with ERG+DCLK1- patients. This study examined ERG expression in primary PCa and CRPC. We have identified altered levels of inflammatory mediators associated with ERG expression. We determined expression of DCLK1 correlates with ERG expression and may play a role in primary PCa progression to metastatic CPRC. Prostate 76:810-822, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Kukić, Predrag; Farrell, Damien; Søndergaard, Chresten R; Bjarnadottir, Una; Bradley, John; Pollastri, Gianluca; Nielsen, Jens Erik
2010-03-01
pH-induced chemical shift perturbations (CSPs) can be used to study pH-dependent conformational transitions in proteins. Recently, an elegant principal component analysis (PCA) algorithm was developed and used to study the pH-dependent structural transitions in bovine beta-lactoglobulin (betaLG) by analyzing its NMR pH-titration spectra. Here, we augment this analysis method by filtering out changes in the NMR chemical shift that stem from effects that are electrostatic in nature. Specifically, we examine how many CSPs can be explained by purely electrostatic effects arising from titrational events in betaLG. The results show that around 20% of the amide nuclei CSPs in betaLG originate exclusively from "through-space" electric field effects. A PCA of NMR data where electric field artefacts have been removed gives a different picture of the pH-dependent structural transitions in betaLG. The method implemented here is well suited to be applied on a whole range of proteins, which experience at least one pH-dependent conformational change. Proteins 2010. (c) 2009 Wiley-Liss, Inc.
Spatial and spectral analysis of corneal epithelium injury using hyperspectral images
NASA Astrophysics Data System (ADS)
Md Noor, Siti Salwa; Michael, Kaleena; Marshall, Stephen; Ren, Jinchang
2017-12-01
Eye assessment is essential in preventing blindness. Currently, the existing methods to assess corneal epithelium injury are complex and require expert knowledge. Hence, we have introduced a non-invasive technique using hyperspectral imaging (HSI) and an image analysis algorithm of corneal epithelium injury. Three groups of images were compared and analyzed, namely healthy eyes, injured eyes, and injured eyes with stain. Dimensionality reduction using principal component analysis (PCA) was applied to reduce massive data and redundancies. The first 10 principal components (PCs) were selected for further processing. The mean vector of 10 PCs with 45 pairs of all combinations was computed and sent to two classifiers. A quadratic Bayes normal classifier (QDC) and a support vector classifier (SVC) were used in this study to discriminate the eleven eyes into three groups. As a result, the combined classifier of QDC and SVC showed optimal performance with 2D PCA features (2DPCA-QDSVC) and was utilized to classify normal and abnormal tissues, using color image segmentation. The result was compared with human segmentation. The outcome showed that the proposed algorithm produced extremely promising results to assist the clinician in quantifying a cornea injury.
Support vector machine and principal component analysis for microarray data classification
NASA Astrophysics Data System (ADS)
Astuti, Widi; Adiwijaya
2018-03-01
Cancer is a leading cause of death worldwide although a significant proportion of it can be cured if it is detected early. In recent decades, technology called microarray takes an important role in the diagnosis of cancer. By using data mining technique, microarray data classification can be performed to improve the accuracy of cancer diagnosis compared to traditional techniques. The characteristic of microarray data is small sample but it has huge dimension. Since that, there is a challenge for researcher to provide solutions for microarray data classification with high performance in both accuracy and running time. This research proposed the usage of Principal Component Analysis (PCA) as a dimension reduction method along with Support Vector Method (SVM) optimized by kernel functions as a classifier for microarray data classification. The proposed scheme was applied on seven data sets using 5-fold cross validation and then evaluation and analysis conducted on term of both accuracy and running time. The result showed that the scheme can obtained 100% accuracy for Ovarian and Lung Cancer data when Linear and Cubic kernel functions are used. In term of running time, PCA greatly reduced the running time for every data sets.
Boeing, Joana Schuelter; Barizão, Erica Oliveira; E Silva, Beatriz Costa; Montanher, Paula Fernandes; de Cinque Almeida, Vitor; Visentainer, Jesuí Vergilio
2014-01-01
This study evaluated the effect of the solvent on the extraction of antioxidant compounds from black mulberry (Morus nigra), blackberry (Rubus ulmifolius) and strawberry (Fragaria x ananassa). Different extracts of each berry were evaluated from the determination of total phenolic content, anthocyanin content and antioxidant capacity, and data were applied to the principal component analysis (PCA) to gain an overview of the effect of the solvent in extraction method. For all the berries analyzed, acetone/water (70/30, v/v) solvent mixture was more efficient solvent in the extracting of phenolic compounds, and methanol/water/acetic acid (70/29.5/0.5, v/v/v) showed the best values for anthocyanin content. Mixtures of ethanol/water (50/50, v/v), acetone water/acetic acid (70/29.5/0.5, v/v/v) and acetone/water (50/50, v/v) presented the highest antioxidant capacities for black mulberries, blackberries and strawberries, respectively. Antioxidants extractions are extremely affected by the solvent combination used. In addition, the obtained extracts with the organic solvent-water mixtures were distinguished from the extracts obtained with pure organic solvents, through the PCA analysis.
Zhu, Yanzhong; Song, Yonghui; Yu, Huibin; Liu, Ruixia; Liu, Lusan; Lv, Chunjian
2017-08-08
UV-visible absorption spectroscopy coupled with principal component analysis (PCA) and hierarchical cluster analysis (HCA) was applied to characterize spectroscopic components, detect latent factors, and investigate spatial variations of dissolved organic matter (DOM) in a large-scale lake. Twelve surface water samples were collected from Dongjianghu Lake in China. DOM contained lignin and quinine moieties, carboxylic acid, microbial products, and aromatic and alkyl groups, which in the northern part of the lake was largely different from the southern part. Fifteen spectroscopic indices were deduced from the absorption spectra to indicate molecular weight or humification degree of DOM. The northern part of the lake presented the smaller molecular weight or the lower humification degree of DOM than the southern part. E 2/4 , E 3/4 , E 2/3 , and S 2 were latent factors of characterizing the molecular weight of DOM, while E 2/5 , E 3/5 , E 2/6 , E 4/5 , E 3/6 , and A 2/1 were latent factors of evaluating the humification degree of DOM. The UV-visible absorption spectroscopy combined with PCA and HCA may not only characterize DOM fractions of lakes, but may be transferred to other types of waterscape.
NASA Astrophysics Data System (ADS)
Pal, S. K.; Majumdar, T. J.; Bhattacharya, Amit K.
Fusion of optical and synthetic aperture radar data has been attempted in the present study for mapping of various lithologic units over a part of the Singhbhum Shear Zone (SSZ) and its surroundings. ERS-2 SAR data over the study area has been enhanced using Fast Fourier Transformation (FFT) based filtering approach, and also using Frost filtering technique. Both the enhanced SAR imagery have been then separately fused with histogram equalized IRS-1C LISS III image using Principal Component Analysis (PCA) technique. Later, Feature-oriented Principal Components Selection (FPCS) technique has been applied to generate False Color Composite (FCC) images, from which corresponding geological maps have been prepared. Finally, GIS techniques have been successfully used for change detection analysis in the lithological interpretation between the published geological map and the fusion based geological maps. In general, there is good agreement between these maps over a large portion of the study area. Based on the change detection studies, few areas could be identified which need attention for further detailed ground-based geological studies.
Katayama, K; Sato, T; Arai, T; Amao, H; Ohta, Y; Ozawa, T; Kenyon, P R; Hickson, R E; Tazaki, H
2013-02-01
Simple liquid chromatography-mass spectrometry (LC-MS) was applied to non-targeted metabolic analyses to discover new metabolic markers in animal plasma. Principle component analysis (PCA) and partial least squares-discriminate analysis (PLS-DA) were used to analyse LC-MS multivariate data. PCA clearly generated two separate clusters for artificially induced diabetic mice and healthy control mice. PLS-DA of time-course changes in plasma metabolites of chicks after feeding generated three clusters (pre- and immediately after feeding, 0.5-3 h after feeding and 4 h after feeding). Two separate clusters were also generated for plasma metabolites of pregnant Angus heifers with differing live-weight change profiles (gaining or losing). The accompanying PLS-DA loading plot detailed the metabolites that contribute the most to the cluster separation. In each case, the same highly hydrophilic metabolite was strongly correlated to the group separation. The metabolite was identified as betaine by LC-MS/MS. This result indicates that betaine and its metabolic precursor, choline, may be useful biomarkers to evaluate the nutritional and metabolic status of animals. © 2011 Blackwell Verlag GmbH.
Exploring space-time structure of human mobility in urban space
NASA Astrophysics Data System (ADS)
Sun, J. B.; Yuan, J.; Wang, Y.; Si, H. B.; Shan, X. M.
2011-03-01
Understanding of human mobility in urban space benefits the planning and provision of municipal facilities and services. Due to the high penetration of cell phones, mobile cellular networks provide information for urban dynamics with a large spatial extent and continuous temporal coverage in comparison with traditional approaches. The original data investigated in this paper were collected by cellular networks in a southern city of China, recording the population distribution by dividing the city into thousands of pixels. The space-time structure of urban dynamics is explored by applying Principal Component Analysis (PCA) to the original data, from temporal and spatial perspectives between which there is a dual relation. Based on the results of the analysis, we have discovered four underlying rules of urban dynamics: low intrinsic dimensionality, three categories of common patterns, dominance of periodic trends, and temporal stability. It implies that the space-time structure can be captured well by remarkably few temporal or spatial predictable periodic patterns, and the structure unearthed by PCA evolves stably over time. All these features play a critical role in the applications of forecasting and anomaly detection.
A comprehensive evaluation of CHEK2 germline mutations in men with prostate cancer.
Wu, Yishuo; Yu, Hongjie; Zheng, S Lilly; Na, Rong; Mamawala, Mufaddal; Landis, Tricia; Wiley, Kathleen; Petkewicz, Jacqueline; Shah, Sameep; Shi, Zhuqing; Novakovic, Kristian; McGuire, Michael; Brendler, Charles B; Ding, Qiang; Helfand, Brian T; Carter, H Ballentine; Cooney, Kathleen A; Isaacs, William B; Xu, Jianfeng
2018-06-01
Germline mutations in CHEK2 have been associated with prostate cancer (PCa) risk. Our objective is to examine whether germline pathogenic CHEK2 mutations can differentiate risk of lethal from indolent PCa. A case-case study of 703 lethal PCa patients and 1455 patients with low-risk localized PCa of European, African, and Chinese origin was performed. Germline DNA samples from these patients were sequenced for CHEK2. Mutation carrier rates and their association with lethal PCa were analyzed using the Fisher exact test and Kaplan-Meier survival analysis. In the entire study population, 40 (1.85%) patients were identified as carrying one of 15 different germline CHEK2 pathogenic or likely pathogenic mutations. CHEK2 mutations were detected in 16 (2.28%) of 703 lethal PCa patients compared with 24 (1.65%) of 1455 low-risk PCa patients (P = 0.31). No association was found between CHEK2 mutation status and early-diagnosis or PCa-specific survival time. However, the most common mutation in CHEK2, c.1100delC (p.T367 fs), had a significantly higher carrier rate (1.28%) in lethal PCa patients than low-risk PCa patients of European American origin (0.16%), P = 0.0038. The estimated Odds Ratio of this mutation for lethal PCa was 7.86. The carrier rate in lethal PCa was also significantly higher than that (0.46%) in 32 461 non-Finnish European subjects from the Exome Aggregation Consortium (ExAC) (P = 0.01). While overall CHEK2 mutations were not significantly more common in men with lethal compared to low-risk PCa, the specific CHEK2 mutation, c.1100delC, appears to contribute to an increased risk of lethal PCa in European American men. © 2018 Wiley Periodicals, Inc.
Lin, Yiqing; Li, Weiyong; Xu, Jin; Boulas, Pierre
2015-07-05
The aim of this study is to develop an at-line near infrared (NIR) method for the rapid and simultaneous determination of four structurally similar active pharmaceutical ingredients (APIs) in powder blends intended for the manufacturing of tablets. Two of the four APIs in the formula are present in relatively small amounts, one at 0.95% and the other at 0.57%. Such small amounts in addition to the similarity in structures add significant complexity to the blend uniformity analysis. The NIR method is developed using spectra from six laboratory-created calibration samples augmented by a small set of spectra from a large-scale blending sample. Applying the quality by design (QbD) principles, the calibration design included concentration variations of the four APIs and a main excipient, microcrystalline cellulose. A bench-top FT-NIR instrument was used to acquire the spectra. The obtained NIR spectra were analyzed by applying principal component analysis (PCA) before calibration model development. Score patterns from the PCA were analyzed to reveal relationship between latent variables and concentration variations of the APIs. In calibration model development, both PLS-1 and PLS-2 models were created and evaluated for their effectiveness in predicting API concentrations in the blending samples. The final NIR method shows satisfactory specificity and accuracy. Copyright © 2015 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Baccar, D.; Söffker, D.
2017-11-01
Acoustic Emission (AE) is a suitable method to monitor the health of composite structures in real-time. However, AE-based failure mode identification and classification are still complex to apply due to the fact that AE waves are generally released simultaneously from all AE-emitting damage sources. Hence, the use of advanced signal processing techniques in combination with pattern recognition approaches is required. In this paper, AE signals generated from laminated carbon fiber reinforced polymer (CFRP) subjected to indentation test are examined and analyzed. A new pattern recognition approach involving a number of processing steps able to be implemented in real-time is developed. Unlike common classification approaches, here only CWT coefficients are extracted as relevant features. Firstly, Continuous Wavelet Transform (CWT) is applied to the AE signals. Furthermore, dimensionality reduction process using Principal Component Analysis (PCA) is carried out on the coefficient matrices. The PCA-based feature distribution is analyzed using Kernel Density Estimation (KDE) allowing the determination of a specific pattern for each fault-specific AE signal. Moreover, waveform and frequency content of AE signals are in depth examined and compared with fundamental assumptions reported in this field. A correlation between the identified patterns and failure modes is achieved. The introduced method improves the damage classification and can be used as a non-destructive evaluation tool.
Multivariate Statistical Analysis of Water Quality data in Indian River Lagoon, Florida
NASA Astrophysics Data System (ADS)
Sayemuzzaman, M.; Ye, M.
2015-12-01
The Indian River Lagoon, is part of the longest barrier island complex in the United States, is a region of particular concern to the environmental scientist because of the rapid rate of human development throughout the region and the geographical position in between the colder temperate zone and warmer sub-tropical zone. Thus, the surface water quality analysis in this region always brings the newer information. In this present study, multivariate statistical procedures were applied to analyze the spatial and temporal water quality in the Indian River Lagoon over the period 1998-2013. Twelve parameters have been analyzed on twelve key water monitoring stations in and beside the lagoon on monthly datasets (total of 27,648 observations). The dataset was treated using cluster analysis (CA), principle component analysis (PCA) and non-parametric trend analysis. The CA was used to cluster twelve monitoring stations into four groups, with stations on the similar surrounding characteristics being in the same group. The PCA was then applied to the similar groups to find the important water quality parameters. The principal components (PCs), PC1 to PC5 was considered based on the explained cumulative variances 75% to 85% in each cluster groups. Nutrient species (phosphorus and nitrogen), salinity, specific conductivity and erosion factors (TSS, Turbidity) were major variables involved in the construction of the PCs. Statistical significant positive or negative trends and the abrupt trend shift were detected applying Mann-Kendall trend test and Sequential Mann-Kendall (SQMK), for each individual stations for the important water quality parameters. Land use land cover change pattern, local anthropogenic activities and extreme climate such as drought might be associated with these trends. This study presents the multivariate statistical assessment in order to get better information about the quality of surface water. Thus, effective pollution control/management of the surface waters can be undertaken.
Sparse principal component analysis in medical shape modeling
NASA Astrophysics Data System (ADS)
Sjöstrand, Karl; Stegmann, Mikkel B.; Larsen, Rasmus
2006-03-01
Principal component analysis (PCA) is a widely used tool in medical image analysis for data reduction, model building, and data understanding and exploration. While PCA is a holistic approach where each new variable is a linear combination of all original variables, sparse PCA (SPCA) aims at producing easily interpreted models through sparse loadings, i.e. each new variable is a linear combination of a subset of the original variables. One of the aims of using SPCA is the possible separation of the results into isolated and easily identifiable effects. This article introduces SPCA for shape analysis in medicine. Results for three different data sets are given in relation to standard PCA and sparse PCA by simple thresholding of small loadings. Focus is on a recent algorithm for computing sparse principal components, but a review of other approaches is supplied as well. The SPCA algorithm has been implemented using Matlab and is available for download. The general behavior of the algorithm is investigated, and strengths and weaknesses are discussed. The original report on the SPCA algorithm argues that the ordering of modes is not an issue. We disagree on this point and propose several approaches to establish sensible orderings. A method that orders modes by decreasing variance and maximizes the sum of variances for all modes is presented and investigated in detail.
Zheng, Lu-Lu; Niu, Shen; Hao, Pei; Feng, KaiYan; Cai, Yu-Dong; Li, Yixue
2011-01-01
Pyrrolidone carboxylic acid (PCA) is formed during a common post-translational modification (PTM) of extracellular and multi-pass membrane proteins. In this study, we developed a new predictor to predict the modification sites of PCA based on maximum relevance minimum redundancy (mRMR) and incremental feature selection (IFS). We incorporated 727 features that belonged to 7 kinds of protein properties to predict the modification sites, including sequence conservation, residual disorder, amino acid factor, secondary structure and solvent accessibility, gain/loss of amino acid during evolution, propensity of amino acid to be conserved at protein-protein interface and protein surface, and deviation of side chain carbon atom number. Among these 727 features, 244 features were selected by mRMR and IFS as the optimized features for the prediction, with which the prediction model achieved a maximum of MCC of 0.7812. Feature analysis showed that all feature types contributed to the modification process. Further site-specific feature analysis showed that the features derived from PCA's surrounding sites contributed more to the determination of PCA sites than other sites. The detailed feature analysis in this paper might provide important clues for understanding the mechanism of the PCA formation and guide relevant experimental validations. PMID:22174779
A new statistical PCA-ICA algorithm for location of R-peaks in ECG.
Chawla, M P S; Verma, H K; Kumar, Vinod
2008-09-16
The success of ICA to separate the independent components from the mixture depends on the properties of the electrocardiogram (ECG) recordings. This paper discusses some of the conditions of independent component analysis (ICA) that could affect the reliability of the separation and evaluation of issues related to the properties of the signals and number of sources. Principal component analysis (PCA) scatter plots are plotted to indicate the diagnostic features in the presence and absence of base-line wander in interpreting the ECG signals. In this analysis, a newly developed statistical algorithm by authors, based on the use of combined PCA-ICA for two correlated channels of 12-channel ECG data is proposed. ICA technique has been successfully implemented in identifying and removal of noise and artifacts from ECG signals. Cleaned ECG signals are obtained using statistical measures like kurtosis and variance of variance after ICA processing. This analysis also paper deals with the detection of QRS complexes in electrocardiograms using combined PCA-ICA algorithm. The efficacy of the combined PCA-ICA algorithm lies in the fact that the location of the R-peaks is bounded from above and below by the location of the cross-over points, hence none of the peaks are ignored or missed.
NASA Astrophysics Data System (ADS)
DiFranco, Matthew D.; Reynolds, Hayley M.; Mitchell, Catherine; Williams, Scott; Allan, Prue; Haworth, Annette
2015-03-01
Reliable automated prostate tumor detection and characterization in whole-mount histology images is sought in many applications, including post-resection tumor staging and as ground-truth data for multi-parametric MRI interpretation. In this study, an ensemble-based supervised classification algorithm for high-resolution histology images was trained on tile-based image features including histogram and gray-level co-occurrence statistics. The algorithm was assessed using different combinations of H and E prostate slides from two separate medical centers and at two different magnifications (400x and 200x), with the aim of applying tumor classification models to new data. Slides from both datasets were annotated by expert pathologists in order to identify homogeneous cancerous and non-cancerous tissue regions of interest, which were then categorized as (1) low-grade tumor (LG-PCa), including Gleason 3 and high-grade prostatic intraepithelial neoplasia (HG-PIN), (2) high-grade tumor (HG-PCa), including various Gleason 4 and 5 patterns, or (3) non-cancerous, including benign stroma and benign prostatic hyperplasia (BPH). Classification models for both LG-PCa and HG-PCa were separately trained using a support vector machine (SVM) approach, and per-tile tumor prediction maps were generated from the resulting ensembles. Results showed high sensitivity for predicting HG-PCa with an AUC up to 0.822 using training data from both medical centres, while LG-PCa showed a lower sensitivity of 0.763 with the same training data. Visual inspection of cancer probability heatmaps from 9 patients showed that 17/19 tumors were detected, and HG-PCa generally reported less false positives than LG-PCa.
Variants on 8q24 and prostate cancer risk in Chinese population: a meta-analysis.
Ren, Xiao-Qiang; Zhang, Jian-Guo; Xin, Shi-Yong; Cheng, Tao; Li, Liang; Ren, Wei-Hua
2015-01-01
Previous studies have identified 8q24 as an important region to prostate cancer (PCa) susceptibility. The aim of this study was to investigate the role of six genetic variants on 8q24 (rs1447295, A; rs6983267, G; rs6983561, C; rs7837688, T; rs10090154, T and rs16901979, A) on PCa risk in Chinese population. Online electronic databases were searched to retrieve related articles concerning the association between 8q24 variants and PCa risk in men of Chinese population published between 2000 and 2014. Odds ratio (ORs) with its 95% correspondence interval (CI) were employed to assess the strength of association. Total eleven case-control studies were screened out, including 2624 PCa patients and 2438 healthy controls. Our results showed that three risk alleles of rs1447295 A (OR=1.35, 95% CI=1.19-1.53, P<0.00001), rs6983561 C (C vs. A: OR=1.41, 95% CI=1.21-1.63, P<0.00001) and rs10090154 T (T vs. C: OR=1.48, 95% CI=1.22-1.80, P<0.00001) on8q24 were significantly associated with PCa risk in Chinese population. Furthermore, genotypes of rs1447295, AA+AC; rs6983561, CC+AC and CC; rs10090154, TT+TC; and rs16901979, AA were associated with PCa as well (P<0.01). No association was found between rs6983267, rs7837688 and PCa risk. In conclusions, variants including rs1447295, rs6983561, rs10090154 and rs16901979 on 8q24 might be associated with PCa risk in Chinese population, indicating these four variations may contribute risk to this disease. This meta-analysis was the first study to assess the role of 8q24 variants on PCa risk in Chinese population.
An Estimate of the Incidence of Prostate Cancer in Africa: A Systematic Review and Meta-Analysis
Aderemi, Adewale Victor; Iseolorunkanmi, Alexander; Oyedokun, Ayo; Ayo, Charles K.
2016-01-01
Background Prostate cancer (PCa) is rated the second most common cancer and sixth leading cause of cancer deaths among men globally. Reports show that African men suffer disproportionately from PCa compared to men from other parts of the world. It is still quite difficult to accurately describe the burden of PCa in Africa due to poor cancer registration systems. We systematically reviewed the literature on prostate cancer in Africa and provided a continent-wide incidence rate of PCa based on available data in the region. Methods A systematic literature search of Medline, EMBASE and Global Health from January 1980 to June 2015 was conducted, with additional search of Google Scholar, International Association of Cancer Registries (IACR), International Agency for Research on Cancer (IARC), and WHO African region websites, for studies that estimated incidence rate of PCa in any African location. Having assessed quality and consistency across selected studies, we extracted incidence rates of PCa and conducted a random effects meta-analysis. Results Our search returned 9766 records, with 40 studies spreading across 16 African countries meeting our selection criteria. We estimated a pooled PCa incidence rate of 22.0 (95% CI: 19.93–23.97) per 100,000 population, and also reported a median incidence rate of 19.5 per 100,000 population. We observed an increasing trend in PCa incidence with advancing age, and over the main years covered. Conclusion Effective cancer registration and extensive research are vital to appropriately quantifying PCa burden in Africa. We hope our findings may further assist at identifying relevant gaps, and contribute to improving knowledge, research, and interventions targeted at prostate cancer in Africa. PMID:27073921
Decision tree and PCA-based fault diagnosis of rotating machinery
NASA Astrophysics Data System (ADS)
Sun, Weixiang; Chen, Jin; Li, Jiaqing
2007-04-01
After analysing the flaws of conventional fault diagnosis methods, data mining technology is introduced to fault diagnosis field, and a new method based on C4.5 decision tree and principal component analysis (PCA) is proposed. In this method, PCA is used to reduce features after data collection, preprocessing and feature extraction. Then, C4.5 is trained by using the samples to generate a decision tree model with diagnosis knowledge. At last the tree model is used to make diagnosis analysis. To validate the method proposed, six kinds of running states (normal or without any defect, unbalance, rotor radial rub, oil whirl, shaft crack and a simultaneous state of unbalance and radial rub), are simulated on Bently Rotor Kit RK4 to test C4.5 and PCA-based method and back-propagation neural network (BPNN). The result shows that C4.5 and PCA-based diagnosis method has higher accuracy and needs less training time than BPNN.
On a PCA-based lung motion model
NASA Astrophysics Data System (ADS)
Li, Ruijiang; Lewis, John H.; Jia, Xun; Zhao, Tianyu; Liu, Weifeng; Wuenschel, Sara; Lamb, James; Yang, Deshan; Low, Daniel A.; Jiang, Steve B.
2011-09-01
Respiration-induced organ motion is one of the major uncertainties in lung cancer radiotherapy and is crucial to be able to accurately model the lung motion. Most work so far has focused on the study of the motion of a single point (usually the tumor center of mass), and much less work has been done to model the motion of the entire lung. Inspired by the work of Zhang et al (2007 Med. Phys. 34 4772-81), we believe that the spatiotemporal relationship of the entire lung motion can be accurately modeled based on principle component analysis (PCA) and then a sparse subset of the entire lung, such as an implanted marker, can be used to drive the motion of the entire lung (including the tumor). The goal of this work is twofold. First, we aim to understand the underlying reason why PCA is effective for modeling lung motion and find the optimal number of PCA coefficients for accurate lung motion modeling. We attempt to address the above important problems both in a theoretical framework and in the context of real clinical data. Second, we propose a new method to derive the entire lung motion using a single internal marker based on the PCA model. The main results of this work are as follows. We derived an important property which reveals the implicit regularization imposed by the PCA model. We then studied the model using two mathematical respiratory phantoms and 11 clinical 4DCT scans for eight lung cancer patients. For the mathematical phantoms with cosine and an even power (2n) of cosine motion, we proved that 2 and 2n PCA coefficients and eigenvectors will completely represent the lung motion, respectively. Moreover, for the cosine phantom, we derived the equivalence conditions for the PCA motion model and the physiological 5D lung motion model (Low et al 2005 Int. J. Radiat. Oncol. Biol. Phys. 63 921-9). For the clinical 4DCT data, we demonstrated the modeling power and generalization performance of the PCA model. The average 3D modeling error using PCA was within 1 mm (0.7 ± 0.1 mm). When a single artificial internal marker was used to derive the lung motion, the average 3D error was found to be within 2 mm (1.8 ± 0.3 mm) through comprehensive statistical analysis. The optimal number of PCA coefficients needs to be determined on a patient-by-patient basis and two PCA coefficients seem to be sufficient for accurate modeling of the lung motion for most patients. In conclusion, we have presented thorough theoretical analysis and clinical validation of the PCA lung motion model. The feasibility of deriving the entire lung motion using a single marker has also been demonstrated on clinical data using a simulation approach.
Agner, Shannon C; Xu, Jun; Madabhushi, Anant
2013-03-01
Segmentation of breast lesions on dynamic contrast enhanced (DCE) magnetic resonance imaging (MRI) is the first step in lesion diagnosis in a computer-aided diagnosis framework. Because manual segmentation of such lesions is both time consuming and highly susceptible to human error and issues of reproducibility, an automated lesion segmentation method is highly desirable. Traditional automated image segmentation methods such as boundary-based active contour (AC) models require a strong gradient at the lesion boundary. Even when region-based terms are introduced to an AC model, grayscale image intensities often do not allow for clear definition of foreground and background region statistics. Thus, there is a need to find alternative image representations that might provide (1) strong gradients at the margin of the object of interest (OOI); and (2) larger separation between intensity distributions and region statistics for the foreground and background, which are necessary to halt evolution of the AC model upon reaching the border of the OOI. In this paper, the authors introduce a spectral embedding (SE) based AC (SEAC) for lesion segmentation on breast DCE-MRI. SE, a nonlinear dimensionality reduction scheme, is applied to the DCE time series in a voxelwise fashion to reduce several time point images to a single parametric image where every voxel is characterized by the three dominant eigenvectors. This parametric eigenvector image (PrEIm) representation allows for better capture of image region statistics and stronger gradients for use with a hybrid AC model, which is driven by both boundary and region information. They compare SEAC to ACs that employ fuzzy c-means (FCM) and principal component analysis (PCA) as alternative image representations. Segmentation performance was evaluated by boundary and region metrics as well as comparing lesion classification using morphological features from SEAC, PCA+AC, and FCM+AC. On a cohort of 50 breast DCE-MRI studies, PrEIm yielded overall better region and boundary-based statistics compared to the original DCE-MR image, FCM, and PCA based image representations. Additionally, SEAC outperformed a hybrid AC applied to both PCA and FCM image representations. Mean dice similarity coefficient (DSC) for SEAC was significantly better (DSC = 0.74 ± 0.21) than FCM+AC (DSC = 0.50 ± 0.32) and similar to PCA+AC (DSC = 0.73 ± 0.22). Boundary-based metrics of mean absolute difference and Hausdorff distance followed the same trends. Of the automated segmentation methods, breast lesion classification based on morphologic features derived from SEAC segmentation using a support vector machine classifier also performed better (AUC = 0.67 ± 0.05; p < 0.05) than FCM+AC (AUC = 0.50 ± 0.07), and PCA+AC (AUC = 0.49 ± 0.07). In this work, we presented SEAC, an accurate, general purpose AC segmentation tool that could be applied to any imaging domain that employs time series data. SE allows for projection of time series data into a PrEIm representation so that every voxel is characterized by the dominant eigenvectors, capturing the global and local time-intensity curve similarities in the data. This PrEIm allows for the calculation of strong tensor gradients and better region statistics than the original image intensities or alternative image representations such as PCA and FCM. The PrEIm also allows for building a more accurate hybrid AC scheme.
The influence of stigma on the quality of life for prostate cancer survivors.
Wood, Andrew W; Barden, Sejal; Terk, Mitchell; Cesaretti, Jamie
2017-01-01
The purpose of the present study was to investigate the influence of stigma on prostate cancer (PCa) survivors' quality of life. Stigma for lung cancer survivors has been the focus of considerable research (Else-Quest & Jackson, 2014); however, gaps remain in understanding the experience of PCa stigma. A cross-sectional correlational study was designed to assess the incidence of PCa stigma and its influence on the quality of life of survivors. Eighty-five PCa survivors were administered survey packets consisting of a stigma measure, a PCa-specific quality of life measure, and a demographic survey during treatment of their disease. A linear regression analysis was conducted with the data received from PCa survivors. Results indicated that PCa stigma has a significant, negative influence on the quality of life for survivors (R 2 = 0.33, F(4, 80) = 11.53, p < 0.001). There were no statistically significant differences in PCa stigma based on demographic variables (e.g., race and age). Implications for physical and mental health practitioners and researchers are discussed.
Mudali, D; Teune, L K; Renken, R J; Leenders, K L; Roerdink, J B T M
2015-01-01
Medical imaging techniques like fluorodeoxyglucose positron emission tomography (FDG-PET) have been used to aid in the differential diagnosis of neurodegenerative brain diseases. In this study, the objective is to classify FDG-PET brain scans of subjects with Parkinsonian syndromes (Parkinson's disease, multiple system atrophy, and progressive supranuclear palsy) compared to healthy controls. The scaled subprofile model/principal component analysis (SSM/PCA) method was applied to FDG-PET brain image data to obtain covariance patterns and corresponding subject scores. The latter were used as features for supervised classification by the C4.5 decision tree method. Leave-one-out cross validation was applied to determine classifier performance. We carried out a comparison with other types of classifiers. The big advantage of decision tree classification is that the results are easy to understand by humans. A visual representation of decision trees strongly supports the interpretation process, which is very important in the context of medical diagnosis. Further improvements are suggested based on enlarging the number of the training data, enhancing the decision tree method by bagging, and adding additional features based on (f)MRI data.
NASA Astrophysics Data System (ADS)
Chaa, Mourad; Boukezzoula, Naceur-Eddine; Attia, Abdelouahab
2017-01-01
Two types of scores extracted from two-dimensional (2-D) and three-dimensional (3-D) palmprint for personal recognition systems are merged, introducing a local image descriptor for 2-D palmprint-based recognition systems, named bank of binarized statistical image features (B-BSIF). The main idea of B-BSIF is that the extracted histograms from the binarized statistical image features (BSIF) code images (the results of applying the different BSIF descriptor size with the length 12) are concatenated into one to produce a large feature vector. 3-D palmprint contains the depth information of the palm surface. The self-quotient image (SQI) algorithm is applied for reconstructing illumination-invariant 3-D palmprint images. To extract discriminative Gabor features from SQI images, Gabor wavelets are defined and used. Indeed, the dimensionality reduction methods have shown their ability in biometrics systems. Given this, a principal component analysis (PCA)+linear discriminant analysis (LDA) technique is employed. For the matching process, the cosine Mahalanobis distance is applied. Extensive experiments were conducted on a 2-D and 3-D palmprint database with 10,400 range images from 260 individuals. Then, a comparison was made between the proposed algorithm and other existing methods in the literature. Results clearly show that the proposed framework provides a higher correct recognition rate. Furthermore, the best results were obtained by merging the score of B-BSIF descriptor with the score of the SQI+Gabor wavelets+PCA+LDA method, yielding an equal error rate of 0.00% and a recognition rate of rank-1=100.00%.
Xue, Dong; Lu, Hao; Xu, Han-Yan; Zhou, Cui-Xing; He, Xiao-Zhou
2018-06-01
Our present work was aimed to study on the regulatory role of MALAT1/miR-145-5p/AKAP12 axis on docetaxel (DTX) sensitivity of prostate cancer (PCa) cells. The microarray data (GSE33455) to identify differentially expressed lncRNAs and mRNAs in DTX-resistant PCa cell lines (DU-145-DTX and PC-3-DTX) was retrieved from the Gene Expression Omnibus (GEO) database. QRT-PCR analysis was performed to measure MALAT1 expression in DTX-sensitive and DTX-resistant tissues/cells. The human DTX-resistant cell lines DU145-PTX and PC3-DTX were established as in vitro cell models, and the expression of MALAT1, miR-145-5p and AKAP12 was manipulated in DTX-sensitive and DTX-resistant cells. Cell viability was examined using MTT assay and colony formation methods. Cell apoptosis was assessed by TUNEL staining. Cell migration and invasion was determined by scratch test (wound healing) and Transwell assay, respectively. Dual-luciferase assay was applied to analyse the target relationship between lncRNA MALAT1 and miR-145-5p, as well as between miR-145-5p and AKAP12. Tumour xenograft study was undertaken to confirm the correlation of MALAT1/miR-145-5p/AKAP12 axis and DTX sensitivity of PCa cells in vivo. In this study, we firstly notified that the MALAT1 expression levels were up-regulated in clinical DTX-resistant PCa samples. Overexpressed MALAT1 promoted cell proliferation, migration and invasion but decreased cell apoptosis rate of PCa cells in spite of DTX treatment. We identified miR-145-5p as a target of MALAT1. MiR-145-5p overexpression in PC3-DTX led to inhibited cell proliferation, migration and invasion as well as reduced chemoresistance to DTX, which was attenuated by MALAT1. Moreover, we determined that AKAP12 was a target of miR-145-5p, which significantly induced chemoresistance of PCa cells to DTX. Besides, it was proved that MALAT1 promoted tumour cell proliferation and enhanced DTX-chemoresistance in vivo. There was an lncRNA MALAT1/miR-145-5p/AKAP12 axis involved in DTX resistance of PCa cells and provided a new thought for PCa therapy. © 2018 The Authors. Journal of Cellular and Molecular Medicine published by John Wiley & Sons Ltd and Foundation for Cellular and Molecular Medicine.
NASA Astrophysics Data System (ADS)
Pujiwati, Arie; Nakamura, K.; Watanabe, N.; Komai, T.
2018-02-01
Multivariate analysis is applied to investigate geochemistry of several trace elements in top soils and their relation with the contamination source as the influence of coal mines in Jorong, South Kalimantan. Total concentration of Cd, V, Co, Ni, Cr, Zn, As, Pb, Sb, Cu and Ba was determined in 20 soil samples by the bulk analysis. Pearson correlation is applied to specify the linear correlation among the elements. Principal Component Analysis (PCA) and Cluster Analysis (CA) were applied to observe the classification of trace elements and contamination sources. The results suggest that contamination loading is contributed by Cr, Cu, Ni, Zn, As, and Pb. The elemental loading mostly affects the non-coal mining area, for instances the area near settlement and agricultural land use. Moreover, the contamination source is classified into the areas that are influenced by the coal mining activity, the agricultural types, and the river mixing zone. Multivariate analysis could elucidate the elemental loading and the contamination sources of trace elements in the vicinity of coal mine area.
Comparison of water extraction methods in Tibet based on GF-1 data
NASA Astrophysics Data System (ADS)
Jia, Lingjun; Shang, Kun; Liu, Jing; Sun, Zhongqing
2018-03-01
In this study, we compared four different water extraction methods with GF-1 data according to different water types in Tibet, including Support Vector Machine (SVM), Principal Component Analysis (PCA), Decision Tree Classifier based on False Normalized Difference Water Index (FNDWI-DTC), and PCA-SVM. The results show that all of the four methods can extract large area water body, but only SVM and PCA-SVM can obtain satisfying extraction results for small size water body. The methods were evaluated by both overall accuracy (OAA) and Kappa coefficient (KC). The OAA of PCA-SVM, SVM, FNDWI-DTC, PCA are 96.68%, 94.23%, 93.99%, 93.01%, and the KCs are 0.9308, 0.8995, 0.8962, 0.8842, respectively, in consistent with visual inspection. In summary, SVM is better for narrow rivers extraction and PCA-SVM is suitable for water extraction of various types. As for dark blue lakes, the methods using PCA can extract more quickly and accurately.
External validation of urinary PCA3-based nomograms to individually predict prostate biopsy outcome.
Auprich, Marco; Haese, Alexander; Walz, Jochen; Pummer, Karl; de la Taille, Alexandre; Graefen, Markus; de Reijke, Theo; Fisch, Margit; Kil, Paul; Gontero, Paolo; Irani, Jacques; Chun, Felix K-H
2010-11-01
Prior to safely adopting risk stratification tools, their performance must be tested in an external patient cohort. To assess accuracy and generalizability of previously reported, internally validated, prebiopsy prostate cancer antigen 3 (PCA3) gene-based nomograms when applied to a large, external, European cohort of men at risk of prostate cancer (PCa). Biopsy data, including urinary PCA3 score, were available for 621 men at risk of PCa who were participating in a European multi-institutional study. All patients underwent a ≥10-core prostate biopsy. Biopsy indication was based on suspicious digital rectal examination, persistently elevated prostate-specific antigen level (2.5-10 ng/ml) and/or suspicious histology (atypical small acinar proliferation of the prostate, >/= two cores affected by high-grade prostatic intraepithelial neoplasia in first set of biopsies). PCA3 scores were assessed using the Progensa assay (Gen-Probe Inc, San Diego, CA, USA). According to the previously reported nomograms, different PCA3 score codings were used. The probability of a positive biopsy was calculated using previously published logistic regression coefficients. Predicted outcomes were compared to the actual biopsy results. Accuracy was calculated using the area under the curve as a measure of discrimination; calibration was explored graphically. Biopsy-confirmed PCa was detected in 255 (41.1%) men. Median PCA3 score of biopsy-negative versus biopsy-positive men was 20 versus 48 in the total cohort, 17 versus 47 at initial biopsy, and 37 versus 53 at repeat biopsy (all p≤0.002). External validation of all four previously reported PCA3-based nomograms demonstrated equally high accuracy (0.73-0.75) and excellent calibration. The main limitations of the study reside in its early detection setting, referral scenario, and participation of only tertiary-care centers. In accordance with the original publication, previously developed PCA3-based nomograms achieved high accuracy and sufficient calibration. These novel nomograms represent robust tools and are thus generalizable to European men at risk of harboring PCa. Consequently, in presence of a PCA3 score, these nomograms may be safely used to assist clinicians when prostate biopsy is contemplated. Copyright © 2010 European Association of Urology. Published by Elsevier B.V. All rights reserved.
Davalieva, Katarina; Kostovska, Ivana Maleva; Kiprijanovska, Sanja; Markoska, Katerina; Kubelka-Sabit, Katerina; Filipovski, Vanja; Stavridis, Sotir; Stankov, Oliver; Komina, Selim; Petrusevska, Gordana; Polenakovic, Momir
2015-10-01
The key to a more effective diagnosis, prognosis, and therapeutic management of prostate cancer (PCa) could lie in the direct analysis of cancer tissue. In this study, by comparative proteomics analysis of PCa and benign prostate hyperplasia (BPH) tissues we attempted to elucidate the proteins and regulatory pathways involved in this disease. The samples used in this study were fresh surgical tissues with clinically and histologically confirmed PCa (n = 19) and BPH (n = 33). We used two dimensional difference in gel electrophoresis (2D DIGE) coupled with mass spectrometry (MS) and bioinformatics analysis. Thirty-nine spots with statistically significant 1.8-fold variation or more in abundance, corresponding to 28 proteins were identified. The IPA analysis pointed out to 3 possible networks regulated within MAPK, ERK, TGFB1, and ubiquitin pathways. Thirteen of the identified proteins, namely, constituents of the intermediate filaments (KRT8, KRT18, DES), potential tumor suppressors (ARHGAP1, AZGP1, GSTM2, and MFAP4), transport and membrane organization proteins (FABP5, GC, and EHD2), chaperons (FKBP4 and HSPD1) and known cancer marker (NME1) have been associated with prostate and other cancers by numerous proteomics, genomics or functional studies. We evidenced for the first time the dysregulation of 9 proteins (CSNK1A1, ARID5B, LYPLA1, PSMB6, RABEP1, TALDO1, UBE2N, PPP1CB, and SERPINB1) that may have role in PCa. The UBE2N, PSMB6, and PPP1CB, involved in cell cycle regulation and progression were evaluated by Western blot analysis which confirmed significantly higher abundances of UBE2N and PSMB6 and significantly lower abundance of PPP1CB in PCa. In addition to the identification of substantial number of proteins with known association with PCa, the proteomic approach in this study revealed proteins not previously clearly related to PCa, providing a starting point for further elucidation of their function in disease initiation and progression. © 2015 Wiley Periodicals, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Abdullah, A. H.; Adom, A. H.; Shakaff, A. Y. Md
Aromatic rice (Oryza sativa L.) is considered as the best quality premium rice. The varieties are preferred by consumers because of its preference criteria such as shape, colour, distinctive aroma and flavour. The price of aromatic rice is higher than ordinary rice due to its special needed growth condition for instance specific climate and soil. Presently, the aromatic rice quality is identified by using its key elements and isotopic variables. The rice can also be classified via Gas Chromatography Mass Spectrometry (GC-MS) or human sensory panels. However, the uses of human sensory panels have significant drawbacks such as lengthy trainingmore » time, and prone to fatigue as the number of sample increased and inconsistent. The GC–MS analysis techniques on the other hand, require detailed procedures, lengthy analysis and quite costly. This paper presents the application of in-house developed Electronic Nose (e-nose) to classify new aromatic rice varieties. The e-nose is used to classify the variety of aromatic rice based on the samples odour. The samples were taken from the variety of rice. The instrument utilizes multivariate statistical data analysis, including Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA) and K-Nearest Neighbours (KNN) to classify the unknown rice samples. The Leave-One-Out (LOO) validation approach is applied to evaluate the ability of KNN to perform recognition and classification of the unspecified samples. The visual observation of the PCA and LDA plots of the rice proves that the instrument was able to separate the samples into different clusters accordingly. The results of LDA and KNN with low misclassification error support the above findings and we may conclude that the e-nose is successfully applied to the classification of the aromatic rice varieties.« less
Ntakatsane, M P; Yang, X Q; Lin, M; Liu, X M; Zhou, P
2011-11-01
Thirteen milk brands comprising 76 pasteurized and UHT milk samples of various compositions (whole, reduced fat, skimmed, low lactose, and high protein) were obtained from local supermarkets, and milk samples manufactured in various countries were discriminated using front-face fluorescence spectroscopy (FFFS) coupled with chemometric tools. The emission spectra of Maillard reaction products and riboflavin (MRP/RF; 400 to 600 nm) and tryptophan (300 to 400 nm) were recorded using FFFS, and the excitation wavelengths were set at 360 nm for MRP/RF and 290 nm for tryptophan. Principal component analysis (PCA) was applied to analyze the normalized spectra. The PCA of spectral information from MRP/RF discriminated the milk samples originating in different countries, and PCA of spectral information from tryptophan discriminated the samples according to composition. The fluorescence spectral data were compared with liquid chromatography-mass spectrometry results for the glycation extent of the milk samples, and a positive association (R(2)=0.84) was found between the degree of glycation of α-lactalbumin and the MRP/RF spectral data. This study demonstrates the ability and sensitivity of FFFS to rapidly discriminate and classify commercial milk with various compositions and processing conditions. Copyright © 2011 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Evaluation of cerebral ischemia using near-infrared spectroscopy with oxygen inhalation
NASA Astrophysics Data System (ADS)
Ebihara, Akira; Tanaka, Yuichi; Konno, Takehiko; Kawasaki, Shingo; Fujiwara, Michiyuki; Watanabe, Eiju
2012-09-01
Conventional methods presently used to evaluate cerebral hemodynamics are invasive, require physical restraint, and employ equipment that is not easily transportable. Therefore, it is difficult to take repeated measurements at the patient's bedside. An alternative method to evaluate cerebral hemodynamics was developed using near-infrared spectroscopy (NIRS) with oxygen inhalation. The bilateral fronto-temporal areas of 30 normal volunteers and 33 patients with cerebral ischemia were evaluated with the NIRS system. The subjects inhaled oxygen through a mask for 2 min at a flow rate of 8 L/min. Principal component analysis (PCA) was applied to the data, and a topogram was drawn using the calculated weights. NIRS findings were compared with those of single-photon-emission computed tomography (SPECT). In normal volunteers, no laterality of the PCA weights was observed in 25 of 30 cases (83%). In patients with cerebral ischemia, PCA weights in ischemic regions were lower than in normal regions. In 28 of 33 patients (85%) with cerebral ischemia, NIRS findings agreed with those of SPECT. The results suggest that transmission of the changes in systemic SpO2 were attenuated in ischemic regions. The method discussed here should be clinically useful because it can be used to measure cerebral ischemia easily, repeatedly, and noninvasively.
Online dimensionality reduction using competitive learning and Radial Basis Function network.
Tomenko, Vladimir
2011-06-01
The general purpose dimensionality reduction method should preserve data interrelations at all scales. Additional desired features include online projection of new data, processing nonlinearly embedded manifolds and large amounts of data. The proposed method, called RBF-NDR, combines these features. RBF-NDR is comprised of two modules. The first module learns manifolds by utilizing modified topology representing networks and geodesic distance in data space and approximates sampled or streaming data with a finite set of reference patterns, thus achieving scalability. Using input from the first module, the dimensionality reduction module constructs mappings between observation and target spaces. Introduction of specific loss function and synthesis of the training algorithm for Radial Basis Function network results in global preservation of data structures and online processing of new patterns. The RBF-NDR was applied for feature extraction and visualization and compared with Principal Component Analysis (PCA), neural network for Sammon's projection (SAMANN) and Isomap. With respect to feature extraction, the method outperformed PCA and yielded increased performance of the model describing wastewater treatment process. As for visualization, RBF-NDR produced superior results compared to PCA and SAMANN and matched Isomap. For the Topic Detection and Tracking corpus, the method successfully separated semantically different topics. Copyright © 2011 Elsevier Ltd. All rights reserved.
Performance evaluation of PCA-based spike sorting algorithms.
Adamos, Dimitrios A; Kosmidis, Efstratios K; Theophilidis, George
2008-09-01
Deciphering the electrical activity of individual neurons from multi-unit noisy recordings is critical for understanding complex neural systems. A widely used spike sorting algorithm is being evaluated for single-electrode nerve trunk recordings. The algorithm is based on principal component analysis (PCA) for spike feature extraction. In the neuroscience literature it is generally assumed that the use of the first two or most commonly three principal components is sufficient. We estimate the optimum PCA-based feature space by evaluating the algorithm's performance on simulated series of action potentials. A number of modifications are made to the open source nev2lkit software to enable systematic investigation of the parameter space. We introduce a new metric to define clustering error considering over-clustering more favorable than under-clustering as proposed by experimentalists for our data. Both the program patch and the metric are available online. Correlated and white Gaussian noise processes are superimposed to account for biological and artificial jitter in the recordings. We report that the employment of more than three principal components is in general beneficial for all noise cases considered. Finally, we apply our results to experimental data and verify that the sorting process with four principal components is in agreement with a panel of electrophysiology experts.
Shanthi, C; Pappa, N
2017-05-01
Flow pattern recognition is necessary to select design equations for finding operating details of the process and to perform computational simulations. Visual image processing can be used to automate the interpretation of patterns in two-phase flow. In this paper, an attempt has been made to improve the classification accuracy of the flow pattern of gas/ liquid two- phase flow using fuzzy logic and Support Vector Machine (SVM) with Principal Component Analysis (PCA). The videos of six different types of flow patterns namely, annular flow, bubble flow, churn flow, plug flow, slug flow and stratified flow are recorded for a period and converted to 2D images for processing. The textural and shape features extracted using image processing are applied as inputs to various classification schemes namely fuzzy logic, SVM and SVM with PCA in order to identify the type of flow pattern. The results obtained are compared and it is observed that SVM with features reduced using PCA gives the better classification accuracy and computationally less intensive than other two existing schemes. This study results cover industrial application needs including oil and gas and any other gas-liquid two-phase flows. Copyright © 2017 ISA. Published by Elsevier Ltd. All rights reserved.
A hybrid PCA-CART-MARS-based prognostic approach of the remaining useful life for aircraft engines.
Sánchez Lasheras, Fernando; García Nieto, Paulino José; de Cos Juez, Francisco Javier; Mayo Bayón, Ricardo; González Suárez, Victor Manuel
2015-03-23
Prognostics is an engineering discipline that predicts the future health of a system. In this research work, a data-driven approach for prognostics is proposed. Indeed, the present paper describes a data-driven hybrid model for the successful prediction of the remaining useful life of aircraft engines. The approach combines the multivariate adaptive regression splines (MARS) technique with the principal component analysis (PCA), dendrograms and classification and regression trees (CARTs). Elements extracted from sensor signals are used to train this hybrid model, representing different levels of health for aircraft engines. In this way, this hybrid algorithm is used to predict the trends of these elements. Based on this fitting, one can determine the future health state of a system and estimate its remaining useful life (RUL) with accuracy. To evaluate the proposed approach, a test was carried out using aircraft engine signals collected from physical sensors (temperature, pressure, speed, fuel flow, etc.). Simulation results show that the PCA-CART-MARS-based approach can forecast faults long before they occur and can predict the RUL. The proposed hybrid model presents as its main advantage the fact that it does not require information about the previous operation states of the input variables of the engine. The performance of this model was compared with those obtained by other benchmark models (multivariate linear regression and artificial neural networks) also applied in recent years for the modeling of remaining useful life. Therefore, the PCA-CART-MARS-based approach is very promising in the field of prognostics of the RUL for aircraft engines.
A Hybrid PCA-CART-MARS-Based Prognostic Approach of the Remaining Useful Life for Aircraft Engines
Lasheras, Fernando Sánchez; Nieto, Paulino José García; de Cos Juez, Francisco Javier; Bayón, Ricardo Mayo; Suárez, Victor Manuel González
2015-01-01
Prognostics is an engineering discipline that predicts the future health of a system. In this research work, a data-driven approach for prognostics is proposed. Indeed, the present paper describes a data-driven hybrid model for the successful prediction of the remaining useful life of aircraft engines. The approach combines the multivariate adaptive regression splines (MARS) technique with the principal component analysis (PCA), dendrograms and classification and regression trees (CARTs). Elements extracted from sensor signals are used to train this hybrid model, representing different levels of health for aircraft engines. In this way, this hybrid algorithm is used to predict the trends of these elements. Based on this fitting, one can determine the future health state of a system and estimate its remaining useful life (RUL) with accuracy. To evaluate the proposed approach, a test was carried out using aircraft engine signals collected from physical sensors (temperature, pressure, speed, fuel flow, etc.). Simulation results show that the PCA-CART-MARS-based approach can forecast faults long before they occur and can predict the RUL. The proposed hybrid model presents as its main advantage the fact that it does not require information about the previous operation states of the input variables of the engine. The performance of this model was compared with those obtained by other benchmark models (multivariate linear regression and artificial neural networks) also applied in recent years for the modeling of remaining useful life. Therefore, the PCA-CART-MARS-based approach is very promising in the field of prognostics of the RUL for aircraft engines. PMID:25806876
Zhang, Mo; Chen, Lizhu; Yuan, Zhengwei; Yang, Zeyu; Li, Yue; Shan, Liping; Yin, Bo; Fei, Xiang; Miao, Jianing; Song, Yongsheng
2016-11-01
Prostate cancer (PCa) is one of the most common malignant tumors and a major cause of cancer-related death for men worldwide. The aim of our study was to identify potential non-invasive serum and expressed prostatic secretion (EPS)-urine biomarkers for accurate diagnosis of PCa. Here, we performed a combined isobaric tags for relative and absolute quantification (iTRAQ) proteomic analysis to compare protein profiles using pooled serum and EPS-urine samples from 4 groups of patients: benign prostate hyperplasia (BPH), high grade prostatic intraepithelial neoplasia (HGPIN), localized PCa and metastatic PCa. The differentially expressed proteins were rigorously selected and further validated in a large and independent cohort using classical ELISA and Western blot assays. Finally, we established a multiplex biomarker panel consisting of 3 proteins (serum PF4V1, PSA, and urinary CRISP3) with an excellent diagnostic capacity to differentiate PCa from BPH [area under the receiver operating characteristic curve (AUC) of 0.941], which showed an evidently greater discriminatory ability than PSA alone (AUC, 0.757) (P<0.001). Importantly, even when PSA level was in the gray zone (4-10 ng/mL), a combination of PF4V1 and CRISP3 could achieve a relatively high diagnostic efficacy (AUC, 0.895). Furthermore, their combination also had the potential to distinguish PCa from HGPIN (AUC, 0.934). Our results demonstrated that the combined application of serum and EPS-urine biomarkers can improve the diagnosis of PCa and provide a new prospect for non-invasive PCa detection.
Flight test of a propulsion controlled aircraft system on the NASA F-15 airplane
NASA Technical Reports Server (NTRS)
Burcham, Frank W., Jr.; Maine, Trindel A.
1995-01-01
Flight tests of the propulsion controlled aircraft (PCA) system on the NASA F-15 airplane evolved as a result of a long series of simulation and flight tests. Initially, the simulation results were very optimistic. Early flight tests showed that manual throttles-only control was much more difficult than the simulation, and a flight investigation was flown to acquire data to resolve this discrepancy. The PCA system designed and developed by MDA evolved as these discrepancies were found and resolved, requiring redesign of the PCA software and modification of the flight test plan. Small throttle step inputs were flown to provide data for analysis, simulation update, and control logic modification. The PCA flight tests quickly revealed less than desired performance, but the extensive flexibility built into the flight PCA software allowed rapid evaluation of alternate gains, filters, and control logic, and within 2 weeks, the PCA system was functioning well. The initial objective of achieving adequate control for up-and-away flying and approaches was satisfied, and the option to continue to actual landings was achieved. After the PCA landings were accomplished, other PCA features were added, and additional maneuvers beyond those originally planned were flown. The PCA system was used to recover from extreme upset conditions, descend, and make approaches to landing. A heading mode was added, and a single engine plus rudder PCA mode was also added and flown. The PCA flight envelope was expanded far beyond that originally designed for. Guest pilots from the USAF, USN, NASA, and the contractor also flew the PCA system and were favorably impressed.
Thomas, John E.; Sem, Daniel S.
2009-01-01
Introduction The purpose of this in vitro study was to determine whether para-chloroaniline (PCA) is formed through the reaction of mixing sodium hypochlorite (NaOCl) and chlorhexidine (CHX). Methods Initially commercially available samples of chlorhexidine acetate (CHXa) and PCA were analyzed with 1H NMR spectroscopy. Two solutions, NaOCl and CHXa, were warmed to 37°C and when mixed they produced a brown precipitate. This precipitate was separated in half and pure PCA was added to one of the samples for comparison before they were each analyzed with 1H NMR spectroscopy. Results The peaks in the 1H NMR spectra of CHXa and PCA were assigned to specific protons of the molecules, and the location of the aromatic peaks in the PCA spectrum defined the PCA doublet region. While the spectrum of the precipitate alone resulted in a complex combination of peaks, upon magnification there were no peaks in the PCA doublet region which were intense enough to be quantified. In the spectrum of the precipitate, to which PCA was added, two peaks do appear in the PCA doublet region. Comparing this spectrum to that of precipitate alone, the peaks in the PCA doublet region are not visible prior to the addition of PCA. Conclusions Based on this in vitro study, the reaction mixture of NaOCl and CHXa does not produce PCA at any measurable quantity and further investigation is needed to determine the chemical composition of the brown precipitate. PMID:20113799
NASA Astrophysics Data System (ADS)
Fragkaki, A. G.; Angelis, Y. S.; Tsantili-Kakoulidou, A.; Koupparis, M.; Georgakopoulos, C.
2009-08-01
Anabolic androgenic steroids (AAS) are included in the List of prohibited substances of the World Anti-Doping Agency (WADA) as substances abused to enhance athletic performance. Gas chromatography coupled to mass spectrometry (GC-MS) plays an important role in doping control analyses identifying AAS as their enolized-trimethylsilyl (TMS)-derivatives using the electron ionization (EI) mode. This paper explores the suitability of complementary GC-MS mass spectra and statistical analysis (principal component analysis, PCA and partial least squares-discriminant analysis, PLS-DA) to differentiate AAS as a function of their structural and conformational features expressed by their fragment ions. The results obtained showed that the application of PCA yielded a classification among the AAS molecules which became more apparent after applying PLS-DA to the dataset. The application of PLS-DA yielded a clear separation among the AAS molecules which were, thus, classified as: 1-ene-3-keto, 3-hydroxyl with saturated A-ring, 1-ene-3-hydroxyl, 4-ene-3-keto, 1,4-diene-3-keto and 3-keto with saturated A-ring anabolic steroids. The study of this paper also presents structurally diagnostic fragment ions and dissociation routes providing evidence for the presence of unknown AAS or chemically modified molecules known as designer steroids.
Early Improper Motion Detection in Golf Swings Using Wearable Motion Sensors: The First Approach
Stančin, Sara; Tomažič, Sašo
2013-01-01
This paper presents an analysis of a golf swing to detect improper motion in the early phase of the swing. Led by the desire to achieve a consistent shot outcome, a particular golfer would (in multiple trials) prefer to perform completely identical golf swings. In reality, some deviations from the desired motion are always present due to the comprehensive nature of the swing motion. Swing motion deviations that are not detrimental to performance are acceptable. This analysis is conducted using a golfer's leading arm kinematic data, which are obtained from a golfer wearing a motion sensor that is comprised of gyroscopes and accelerometers. Applying the principal component analysis (PCA) to the reference observations of properly performed swings, the PCA components of acceptable swing motion deviations are established. Using these components, the motion deviations in the observations of other swings are examined. Any unacceptable deviations that are detected indicate an improper swing motion. Arbitrarily long observations of an individual player's swing sequences can be included in the analysis. The results obtained for the considered example show an improper swing motion in early phase of the swing, i.e., the first part of the backswing. An early detection method for improper swing motions that is conducted on an individual basis provides assistance for performance improvement. PMID:23752563
Early improper motion detection in golf swings using wearable motion sensors: the first approach.
Stančin, Sara; Tomažič, Sašo
2013-06-10
This paper presents an analysis of a golf swing to detect improper motion in the early phase of the swing. Led by the desire to achieve a consistent shot outcome, a particular golfer would (in multiple trials) prefer to perform completely identical golf swings. In reality, some deviations from the desired motion are always present due to the comprehensive nature of the swing motion. Swing motion deviations that are not detrimental to performance are acceptable. This analysis is conducted using a golfer's leading arm kinematic data, which are obtained from a golfer wearing a motion sensor that is comprised of gyroscopes and accelerometers. Applying the principal component analysis (PCA) to the reference observations of properly performed swings, the PCA components of acceptable swing motion deviations are established. Using these components, the motion deviations in the observations of other swings are examined. Any unacceptable deviations that are detected indicate an improper swing motion. Arbitrarily long observations of an individual player's swing sequences can be included in the analysis. The results obtained for the considered example show an improper swing motion in early phase of the swing, i.e., the first part of the backswing. An early detection method for improper swing motions that is conducted on an individual basis provides assistance for performance improvement.
Zeng, Shanshan; Wang, Lu; Chen, Teng; Wang, Yuefei; Mo, Huanbiao; Qu, Haibin
2012-07-06
The paper presents a novel strategy to identify analytical markers of traditional Chinese medicine preparation (TCMP) rapidly via direct analysis in real time mass spectrometry (DART-MS). A commonly used TCMP, Danshen injection, was employed as a model. The optimal analysis conditions were achieved by measuring the contribution of various experimental parameters to the mass spectra. Salvianolic acids and saccharides were simultaneously determined within a single 1-min DART-MS run. Furthermore, spectra of Danshen injections supplied by five manufacturers were processed with principal component analysis (PCA). Obvious clustering was observed in the PCA score plot, and candidate markers were recognized from the contribution plots of PCA. The suitability of potential markers was then confirmed by contrasting with the results of traditional analysis methods. Using this strategy, fructose, glucose, sucrose, protocatechuic aldehyde and salvianolic acid A were rapidly identified as the markers of Danshen injections. The combination of DART-MS with PCA provides a reliable approach to the identification of analytical markers for quality control of TCMP. Copyright © 2012 Elsevier B.V. All rights reserved.
Liu, Jie; Zhang, Fu-Dong; Teng, Fei; Li, Jun; Wang, Zhi-Hong
2014-10-01
In order to in-situ detect the oil yield of oil shale, based on portable near infrared spectroscopy analytical technology, with 66 rock core samples from No. 2 well drilling of Fuyu oil shale base in Jilin, the modeling and analyzing methods for in-situ detection were researched. By the developed portable spectrometer, 3 data formats (reflectance, absorbance and K-M function) spectra were acquired. With 4 different modeling data optimization methods: principal component-mahalanobis distance (PCA-MD) for eliminating abnormal samples, uninformative variables elimination (UVE) for wavelength selection and their combina- tions: PCA-MD + UVE and UVE + PCA-MD, 2 modeling methods: partial least square (PLS) and back propagation artificial neural network (BPANN), and the same data pre-processing, the modeling and analyzing experiment were performed to determine the optimum analysis model and method. The results show that the data format, modeling data optimization method and modeling method all affect the analysis precision of model. Results show that whether or not using the optimization method, reflectance or K-M function is the proper spectrum format of the modeling database for two modeling methods. Using two different modeling methods and four different data optimization methods, the model precisions of the same modeling database are different. For PLS modeling method, the PCA-MD and UVE + PCA-MD data optimization methods can improve the modeling precision of database using K-M function spectrum data format. For BPANN modeling method, UVE, UVE + PCA-MD and PCA- MD + UVE data optimization methods can improve the modeling precision of database using any of the 3 spectrum data formats. In addition to using the reflectance spectra and PCA-MD data optimization method, modeling precision by BPANN method is better than that by PLS method. And modeling with reflectance spectra, UVE optimization method and BPANN modeling method, the model gets the highest analysis precision, its correlation coefficient (Rp) is 0.92, and its standard error of prediction (SEP) is 0.69%.
Ruela-de-Sousa, Roberta R; Hoekstra, Elmer; Hoogland, A Marije; Queiroz, Karla C Souza; Peppelenbosch, Maikel P; Stubbs, Andrew P; Pelizzaro-Rocha, Karin; van Leenders, Geert J L H; Jenster, Guido; Aoyama, Hiroshi; Ferreira, Carmen V; Fuhler, Gwenny M
2016-04-01
Low-risk patients suffering from prostate cancer (PCa) are currently placed under active surveillance rather than undergoing radical prostatectomy. However, clear parameters for selecting the right patient for each strategy are not available, and new biomarkers and treatment modalities are needed. Low-molecular-weight protein tyrosine phosphatase (LMWPTP) could present such a target. To correlate expression levels of LMWPTP in primary PCa to clinical outcome, and determine the role of LMWPTP in prostate tumor cell biology. Acid phosphatase 1, soluble (ACP1) expression was analyzed on microarray data sets, which were subsequently used in Ingenuity Pathway Analysis. Immunohistochemistry was performed on a tissue microarray containing material of 481 PCa patients whose clinicopathologic data were recorded. PCa cell line models were used to investigate the role of LMWPTP in cell proliferation, migration, adhesion, and anoikis resistance. The association between LMWPTP expression and clinical and pathologic outcomes was calculated using chi-square correlations and multivariable Cox regression analysis. Functional consequences of LMWPTP overexpression or downregulation were determined using migration and adhesion assays, confocal microscopy, Western blotting, and proliferation assays. LMWPTP expression was significantly increased in human PCa and correlated with earlier recurrence of disease (hazard ratio [HR]:1.99; p<0.001) and reduced patient survival (HR: 1.53; p=0.04). Unbiased Ingenuity analysis comparing cancer and normal prostate suggests migratory propensities in PCa. Indeed, overexpression of LMWPTP increases PCa cell migration, anoikis resistance, and reduces activation of focal adhesion kinase/paxillin, corresponding to decreased adherence. Overexpression of LMWPTP in PCa confers a malignant phenotype with worse clinical outcome. Prospective follow-up should determine the clinical potential of LMWPTP overexpression. These findings implicate low-molecular-weight protein tyrosine phosphatase as a novel oncogene in prostate cancer and could offer the possibility of using this protein as biomarker or target for treatment of this disease. Copyright © 2015 European Association of Urology. Published by Elsevier B.V. All rights reserved.
Maxeiner, Andreas; Fischer, Thomas; Schwabe, Julia; Baur, Alexander Daniel Jacques; Stephan, Carsten; Peters, Robert; Slowinski, Torsten; von Laffert, Maximilian; Marticorena Garcia, Stephan Rodrigo; Hamm, Bernd; Jung, Ernst-Michael
2018-06-06
The aim of this study was to investigate contrast-enhanced ultrasound (CEUS) parameters acquired by software during magnetic resonance imaging (MRI) US fusion-guided biopsy for prostate cancer (PCa) detection and discrimination. From 2012 to 2015, 158 out of 165 men with suspicion for PCa and with at least 1 negative biopsy of the prostate were included and underwent a multi-parametric 3 Tesla MRI and an MRI/US fusion-guided biopsy, consecutively. CEUS was conducted during biopsy with intravenous bolus application of 2.4 mL of SonoVue ® (Bracco, Milan, Italy). In the latter CEUS clips were investigated using quantitative perfusion analysis software (VueBox, Bracco). The area of strongest enhancement within the MRI pre-located region was investigated and all available parameters from the quantification tool box were collected and analyzed for PCa and its further differentiation was based on the histopathological results. The overall detection rate was 74 (47 %) PCa cases in 158 included patients. From these 74 PCa cases, 49 (66 %) were graded Gleason ≥ 3 + 4 = 7 (ISUP ≥ 2) PCa. The best results for cancer detection over all quantitative perfusion parameters were rise time (p = 0.026) and time to peak (p = 0.037). Within the subgroup analysis (> vs ≤ 3 + 4 = 7a (ISUP 2)), peak enhancement (p = 0.012), wash-in rate (p = 0.011), wash-out rate (p = 0.007) and wash-in perfusion index (p = 0.014) also showed statistical significance. The quantification of CEUS parameters was able to discriminate PCa aggressiveness during MRI/US fusion-guided prostate biopsy. © Georg Thieme Verlag KG Stuttgart · New York.
Giri, Veda N.; Coups, Elliot J.; Ruth, Karen; Goplerud, Julia; Raysor, Susan; Kim, Taylor Y.; Bagden, Loretta; Mastalski, Kathleen; Zakrzewski, Debra; Leimkuhler, Suzanne; Watkins-Bruner, Deborah
2009-01-01
Purpose Men with a family history (FH) of prostate cancer (PCA) and African American (AA) men are at higher risk for PCA. Recruitment and retention of these high-risk men into early detection programs has been challenging. We report a comprehensive analysis on recruitment methods, show rates, and participant factors from the Prostate Cancer Risk Assessment Program (PRAP), which is a prospective, longitudinal PCA screening study. Materials and Methods Men 35–69 years are eligible if they have a FH of PCA, are AA, or have a BRCA1/2 mutation. Recruitment methods were analyzed with respect to participant demographics and show to the first PRAP appointment using standard statistical methods Results Out of 707 men recruited, 64.9% showed to the initial PRAP appointment. More individuals were recruited via radio than from referral or other methods (χ2 = 298.13, p < .0001). Men recruited via radio were more likely to be AA (p<0.001), less educated (p=0.003), not married or partnered (p=0.007), and have no FH of PCA (p<0.001). Men recruited via referrals had higher incomes (p=0.007). Men recruited via referral were more likely to attend their initial PRAP visit than those recruited by radio or other methods (χ2 = 27.08, p < .0001). Conclusions This comprehensive analysis finds that radio leads to higher recruitment of AA men with lower socioeconomic status. However, these are the high-risk men that have lower show rates for PCA screening. Targeted motivational measures need to be studied to improve show rates for PCA risk assessment for these high-risk men. PMID:19758657
Once upon Multivariate Analyses: When They Tell Several Stories about Biological Evolution.
Renaud, Sabrina; Dufour, Anne-Béatrice; Hardouin, Emilie A; Ledevin, Ronan; Auffray, Jean-Christophe
2015-01-01
Geometric morphometrics aims to characterize of the geometry of complex traits. It is therefore by essence multivariate. The most popular methods to investigate patterns of differentiation in this context are (1) the Principal Component Analysis (PCA), which is an eigenvalue decomposition of the total variance-covariance matrix among all specimens; (2) the Canonical Variate Analysis (CVA, a.k.a. linear discriminant analysis (LDA) for more than two groups), which aims at separating the groups by maximizing the between-group to within-group variance ratio; (3) the between-group PCA (bgPCA) which investigates patterns of between-group variation, without standardizing by the within-group variance. Standardizing within-group variance, as performed in the CVA, distorts the relationships among groups, an effect that is particularly strong if the variance is similarly oriented in a comparable way in all groups. Such shared direction of main morphological variance may occur and have a biological meaning, for instance corresponding to the most frequent standing genetic variation in a population. Here we undertake a case study of the evolution of house mouse molar shape across various islands, based on the real dataset and simulations. We investigated how patterns of main variance influence the depiction of among-group differentiation according to the interpretation of the PCA, bgPCA and CVA. Without arguing about a method performing 'better' than another, it rather emerges that working on the total or between-group variance (PCA and bgPCA) will tend to put the focus on the role of direction of main variance as line of least resistance to evolution. Standardizing by the within-group variance (CVA), by dampening the expression of this line of least resistance, has the potential to reveal other relevant patterns of differentiation that may otherwise be blurred.
Hectors, Stefanie J; Besa, Cecilia; Wagner, Mathilde; Jajamovich, Guido H; Haines, George K; Lewis, Sara; Tewari, Ashutosh; Rastinehad, Ardeshir; Huang, Wei; Taouli, Bachir
2017-09-01
To quantify Tofts model (TM) and shutter-speed model (SSM) perfusion parameters in prostate cancer (PCa) and noncancerous peripheral zone (PZ) and to compare the diagnostic performance of dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) to Prostate Imaging and Reporting and Data System (PI-RADS) classification for the assessment of PCa aggressiveness. Fifty PCa patients (mean age 60 years old) who underwent MRI at 3.0T followed by prostatectomy were included in this Institutional Review Board-approved retrospective study. DCE-MRI parameters (K trans , v e , k ep [TM&SSM] and intracellular water molecule lifetime τ i [SSM]) were determined in PCa and PZ. Differences in DCE-MRI parameters between PCa and PZ, and between models were assessed using Wilcoxon signed-rank tests. Receiver operating characteristic (ROC) analysis for differentiation between PCa and PZ was performed for individual and combined DCE-MRI parameters. Diagnostic performance of DCE-MRI parameters for identification of aggressive PCa (Gleason ≥8, grade group [GG] ≥3 or pathology stage pT3) was assessed using ROC analysis and compared with PI-RADSv2 scores. DCE-MRI parameters were significantly different between TM and SSM and between PZ and PCa (P < 0.037). Diagnostic performances of TM and SSM for differentiation of PCa from PZ were similar (highest AUC TM: K trans +k ep 0.76, SSM: τ i +k ep 0.80). PI-RADS outperformed TM and SSM DCE-MRI for identification of Gleason ≥8 lesions (AUC PI-RADS: 0.91, highest AUC DCE-MRI: K trans +τ i SSM 0.61, P = 0.002). The diagnostic performance of PI-RADS and DCE-MRI for identification of GG ≥3 and pT3 PCa was not significantly different (P > 0.213). SSM DCE-MRI did not increase the diagnostic performance of DCE-MRI for PCa characterization. PI-RADS outperformed both TM and SSM DCE-MRI for identification of aggressive cancer. 3 Technical Efficacy: Stage 2 J. MAGN. RESON. IMAGING 2017;46:837-849. © 2017 International Society for Magnetic Resonance in Medicine.
Tumor volume in insignificant prostate cancer: increasing threshold gains increasing risk.
Schiffmann, Jonas; Connan, Judith; Salomon, Georg; Boehm, Katharina; Beyer, Burkhard; Schlomm, Thorsten; Tennstedt, Pierre; Sauter, Guido; Karakiewicz, Pierre I; Graefen, Markus; Huland, Hartwig
2015-01-01
An increased tumor volume threshold (<2.5 ml) is suggested to define insignificant prostate cancer (iPCa). We hypothesize that an increasing tumor volume within iPCa patients increases the risk of biochemical recurrence (BCR) after radical prostatectomy (RP). We relied on RP patients treated between 1992 and 2008. Multivariable Cox regression analyses predicting BCR within patients harboring favorable pathological characteristics (≤pT2, pN0/Nx, Gleason 3 + 3). Kaplan-Meier analysis was performed for BCR-free survival within iPCa patients (≤pT2, pN0/Nx, Gleason 3 + 3, tumor volume: <0.5 vs. 0.5-2.49 ml). From 1,829 patients, 141 (7.7%) and 310 (16.9%) harbored iPCa (tumor volume: <0.5 vs. 0.5-2.49 ml), respectively. Of those, 21 (14.9%) versus 31 (10.0%) had PSA >10 ng/ml. Tumor volume achieved independent predictor status for BCR. Specifically, iPCa patients with increasing tumor volume (0.5-2.49 ml) were at higher risk of BCR after RP than those with tumor volume <0.5 ml (HR: 8.8, 95% CI: 1.2-65.9, P = 0.04). Kaplan-Meier analysis recorded superior BCR-free survival in iPCa patients with lower tumor volume (<0.5 ml) (log-rank P = 0.009). The 10-year cancer-specific death rate was 0 versus 0.5%. Contemporary iPCa definition incorporates intermediate and high-risk patients (PSA: 10-20 and >20 ng/ml). Despite most favorable pathological characteristics, iPCa patients are not devoid of BCR after RP. Moreover, iPCa patients were at higher risk of BCR, when increasing tumor volume up to 2.49 ml was at play. Taken together the contemporary concept of iPCa is suboptimal. Especially, an increased tumor volume threshold for defining iPCa cannot be recommended according to our data. Clinicians might take these considerations into account during decision-making process. © 2014 Wiley Periodicals, Inc.
Cole, Jacqueline M.; Cheng, Xie; Payne, Michael C.
2016-10-18
The use of principal component analysis (PCA) to statistically infer features of local structure from experimental pair distribution function (PDF) data is assessed on a case study of rare-earth phosphate glasses (REPGs). Such glasses, co-doped with two rare-earth ions (R and R’) of different sizes and optical properties, are of interest to the laser industry. The determination of structure-property relationships in these materials is an important aspect of their technological development. Yet, realizing the local structure of co-doped REPGs presents significant challenges relative to their singly-doped counterparts; specifically, R and R’ are difficult to distinguish in terms of establishing relativemore » material compositions, identifying atomic pairwise correlation profiles in a PDF that are associated with each ion, and resolving peak overlap of such profiles in PDFs. This study demonstrates that PCA can be employed to help overcome these structural complications, by statistically inferring trends in PDFs that exist for a restricted set of experimental data on REPGs, and using these as training data to predict material compositions and PDF profiles in unknown co-doped REPGs. The application of these PCA methods to resolve individual atomic pairwise correlations in t(r) signatures is also presented. The training methods developed for these structural predictions are pre-validated by testing their ability to reproduce known physical phenomena, such as the lanthanide contraction, on PDF signatures of the structurally simpler singly-doped REPGs. The intrinsic limitations of applying PCA to analyze PDFs relative to the quality control of source data, data processing, and sample definition, are also considered. Furthermore, while this case study is limited to lanthanide-doped REPGs, this type of statistical inference may easily be extended to other inorganic solid-state materials, and be exploited in large-scale data-mining efforts that probe many t(r) functions.« less
Picture agnosia as a characteristic of posterior cortical atrophy.
Sugimoto, Azusa; Midorikawa, Akira; Koyama, Shinichi; Futamura, Akinori; Hieda, Sotaro; Kawamura, Mitsuru
2012-01-01
Posterior cortical atrophy (PCA) is a degenerative disease characterized by progressive visual agnosia with posterior cerebral atrophy. We examine the role of the picture naming test and make a number of suggestions with regard to diagnosing PCA as atypical dementia. We investigated 3 cases of early-stage PCA with 7 control cases of Alzheimer disease (AD). The patients and controls underwent a naming test with real objects and colored photographs of familiar objects. We then compared rates of correct answers. Patients with early-stage PCA showed significant inability to recognize photographs compared to real objects (F = 196.284, p = 0.0000) as measured by analysis of variants. This difficulty was also significant to AD controls (F = 58.717, p = 0.0000). Picture agnosia is a characteristic symptom of early-stage PCA, and the picture naming test is useful for the diagnosis of PCA as atypical dementia at an early stage. Copyright © 2012 S. Karger AG, Basel.
Wenderski, Todd A; Stratton, Christopher F; Bauer, Renato A; Kopp, Felix; Tan, Derek S
2015-01-01
Principal component analysis (PCA) is a useful tool in the design and planning of chemical libraries. PCA can be used to reveal differences in structural and physicochemical parameters between various classes of compounds by displaying them in a convenient graphical format. Herein, we demonstrate the use of PCA to gain insight into structural features that differentiate natural products, synthetic drugs, natural product-like libraries, and drug-like libraries, and show how the results can be used to guide library design.
Wenderski, Todd A.; Stratton, Christopher F.; Bauer, Renato A.; Kopp, Felix; Tan, Derek S.
2015-01-01
Principal component analysis (PCA) is a useful tool in the design and planning of chemical libraries. PCA can be used to reveal differences in structural and physicochemical parameters between various classes of compounds by displaying them in a convenient graphical format. Herein, we demonstrate the use of PCA to gain insight into structural features that differentiate natural products, synthetic drugs, natural product-like libraries, and drug-like libraries, and show how the results can be used to guide library design. PMID:25618349
Zuendorf, Gerhard; Kerrouche, Nacer; Herholz, Karl; Baron, Jean-Claude
2003-01-01
Principal component analysis (PCA) is a well-known technique for reduction of dimensionality of functional imaging data. PCA can be looked at as the projection of the original images onto a new orthogonal coordinate system with lower dimensions. The new axes explain the variance in the images in decreasing order of importance, showing correlations between brain regions. We used an efficient, stable and analytical method to work out the PCA of Positron Emission Tomography (PET) images of 74 normal subjects using [(18)F]fluoro-2-deoxy-D-glucose (FDG) as a tracer. Principal components (PCs) and their relation to age effects were investigated. Correlations between the projections of the images on the new axes and the age of the subjects were carried out. The first two PCs could be identified as being the only PCs significantly correlated to age. The first principal component, which explained 10% of the data set variance, was reduced only in subjects of age 55 or older and was related to loss of signal in and adjacent to ventricles and basal cisterns, reflecting expected age-related brain atrophy with enlarging CSF spaces. The second principal component, which accounted for 8% of the total variance, had high loadings from prefrontal, posterior parietal and posterior cingulate cortices and showed the strongest correlation with age (r = -0.56), entirely consistent with previously documented age-related declines in brain glucose utilization. Thus, our method showed that the effect of aging on brain metabolism has at least two independent dimensions. This method should have widespread applications in multivariate analysis of brain functional images. Copyright 2002 Wiley-Liss, Inc.
Chen, Liangmian; Kotani, Akira; Kusu, Fumiyo; Wang, Zhimin; Zhu, Jingjing; Hakamata, Hideki
2015-01-01
For the determination of seven caffeoylquinic acids [neochlorogenic acid (NcA), cryptochlorogenic acid (CcA), chlorogenic acid (CA), caffeic acid (CfA), isochlorogenic acid A (Ic A), isochlorogenic acid B (Ic B), isochlorogenic acid C (Ic C)] and two flavonoids [luteolin 7-O-glucoside (LtG) and luteolin (Lt)], a three-channel liquid chromatography with electrochemical detection (LC-3ECD) method was established. Chromatographic peak heights were proportional to each concentration, ranging from 2.5 to 100 ng/mL for NcA, CA, CcA, and CfA, and ranging from 2.5 to 250 ng/mL for LtG, Ic B, Ic A, Ic C, and Lt, respectively. The present LC-3ECD method was applied to the quantitative analysis of caffeoylquinic acids and flavonoids in four cultivars of Chrysanthemum morifolium flowers and their sulfur-fumigated products. It was found that 60% of LtG and more than 47% of caffeoylquinic acids were lost during the sulfur fumigation processing. Sulfur fumigation showed a destructive effect on the C. morifolium flowers. In addition, principle component analyses (PCA) were performed using the results of the quantitative analysis of caffeoylquinic acids and flavonoids to compare the "sameness" and "differences" of these analytes in C. morifolium flowers and the sulfur-fumigated products. PCA score plots showed that the four cultivars of C. morifolium flowers were clearly classified into four groups, and that significant differences were also found between the non-fumigated C. morifolium flowers and the sulfur-fumigated products. Therefore, it was demonstrated that the present LC-3ECD method coupled with PCA is applicable to the variation analysis of different C. morifolium flower samples.
NASA Astrophysics Data System (ADS)
Bispo, Jeyse A. M.; Silveira, Landulfo; Vieira, Elzo E. d. S.; Fernandes, Adriana B.
2013-02-01
Diabetes mellitus and hypertension diseases are frequently found in the same patient, which if untreated predispose to atherosclerotic and kidney diseases. The objective of this study was to identify potential biomarkers in the urine of diabetic and hypertensive patients through dispersive near-infrared Raman spectroscopy. Urine samples were collected from patients with diabetes and hypertension but no complications (LG), high degree of complications (HG), and control ones: one fraction was submitted to biochemical tests and another one was stored frozen (-20°C) until spectral analysis. Samples were warmed up and placed in an aluminum sample holder for Raman spectra collection using a dispersive spectrometer (830 nm wavelength, 300 mW laser power and 20 s exposure time). Spectra were then submitted to Principal Components Analysis. The PCA loading vectors 1 and 3 revealed spectral features of urea/creatinine and glucose, respectively; the PCA scores showed that patients with diabetes/hypertension (LG and HG) had higher amount of glucose in the urine compared to the normal group (p < 0.05), which can bring serious consequences to patients. Also, the PCA scores showed that the amount of urea decreased in the groups with diabetes/hypertension (p < 0.05), which generates the same concern as it is a marker that has a strong importance in the metabolic changes induced by such diseases. These results, applied to the analysis of urine of patients with diabetes/hypertension, can lead to early diagnostic information of complications and a possible disease prognosis in the patients where no complications from diabetes and hypertension were found.
Hu, Boran; Yue, Yaqing; Zhu, Yong; Wen, Wen; Zhang, Fengmin; Hardie, Jim W
2015-01-01
Proton nuclear magnetic resonance spectroscopy coupled multivariate analysis (1H NMR-PCA/PLS-DA) is an important tool for the discrimination of wine products. Although 1H NMR has been shown to discriminate wines of different cultivars, a grape genetic component of the discrimination has been inferred only from discrimination of cultivars of undefined genetic homology and in the presence of many confounding environmental factors. We aimed to confirm the influence of grape genotypes in the absence of those factors. We applied 1H NMR-PCA/PLS-DA and hierarchical cluster analysis (HCA) to wines from five, variously genetically-related grapevine (V. vinifera) cultivars; all grown similarly on the same site and vinified similarly. We also compared the semi-quantitative profiles of the discriminant metabolites of each cultivar with previously reported chemical analyses. The cultivars were clearly distinguishable and there was a general correlation between their grouping and their genetic homology as revealed by recent genomic studies. Between cultivars, the relative amounts of several of the cultivar-related discriminant metabolites conformed closely with reported chemical analyses. Differences in grape-derived metabolites associated with genetic differences alone are a major source of 1H NMR-based discrimination of wines and 1H NMR has the capacity to discriminate between very closely related cultivars. The study confirms that genetic variation among grape cultivars alone can account for the discrimination of wine by 1H NMR-PCA/PLS and indicates that 1H NMR spectra of wine of single grape cultivars may in future be used in tandem with hierarchical cluster analysis to elucidate genetic lineages and metabolomic relations of grapevine cultivars. In the absence of genetic information, for example, where predecessor varieties are no longer extant, this may be a particularly useful approach.
Evaluation of skin melanoma in spectral range 450-950 nm using principal component analysis
NASA Astrophysics Data System (ADS)
Jakovels, D.; Lihacova, I.; Kuzmina, I.; Spigulis, J.
2013-06-01
Diagnostic potential of principal component analysis (PCA) of multi-spectral imaging data in the wavelength range 450- 950 nm for distant skin melanoma recognition is discussed. Processing of the measured clinical data by means of PCA resulted in clear separation between malignant melanomas and pigmented nevi.
AlleleCoder: a PERL script for coding codominant polymorphism data for PCA analysis
USDA-ARS?s Scientific Manuscript database
A useful biological interpretation of diploid heterozygotes is in terms of the dose of the common allele (0, 1 or 2 copies). We have developed a PERL script that converts FASTA files into coded spreadsheets suitable for Principal Component Analysis (PCA). In combination with R and R Commander, two- ...
Missing data is a common problem in the application of statistical techniques. In principal component analysis (PCA), a technique for dimensionality reduction, incomplete data points are either discarded or imputed using interpolation methods. Such approaches are less valid when ...
Huang, Ya-Qiang; Sun, Tong; Zhong, Wei-De; Wu, Chin-Lee
2014-01-01
Prostate-specific antigen (PSA) has been widely used as a serum marker for prostate cancer (PCa) screening or progression monitoring, which dramatically increased rate of early detection while significantly reduced PCa-specific mortality. However, a number of limitations of PSA have been noticed. Low specificity of PSA may lead to overtreatment in men who presenting with a total PSA (tPSA) level of < 10 ng/mL. As a type of free PSA (fPSA), [-2]proPSA is differentially expressed in peripheral zone of prostate gland and found to be elevated in serum of men with PCa. Two p2PSA-based derivatives, prostate health index (PHI) and %p2PSA, which were defined as [(p2PSA/fPSA) × √ tPSA] and [(p2PSA/fPSA) × 100] respectively, have been suggested to be increased in PCa and can better distinguish PCa from benign prostatic diseases than tPSA or fPSA. We performed a systematic review of the available scientific evidences to evaluate the potentials of %p2PSA and PHI in clinical application. Mounting evidences suggested that both %p2PSA and PHI possess higher area under the ROC curve (AUC) and better specificity at a high sensitivity for PCa detection when compare with tPSA and %fPSA. It indicated that measurements of %p2PSA and PHI significantly improved the accuracy of PCa detection and diminished unnecessary biopsies. Furthermore, elevations of %p2PSA and PHI are related to more aggressive diseases. %p2PSA and PHI might be helpful in reducing overtreatment on indolent cases or assessing the progression of PCa in men who undergo active surveillance. Further studies are needed before being applied in routine clinical practice.
A PCA-Based method for determining craniofacial relationship and sexual dimorphism of facial shapes.
Shui, Wuyang; Zhou, Mingquan; Maddock, Steve; He, Taiping; Wang, Xingce; Deng, Qingqiong
2017-11-01
Previous studies have used principal component analysis (PCA) to investigate the craniofacial relationship, as well as sex determination using facial factors. However, few studies have investigated the extent to which the choice of principal components (PCs) affects the analysis of craniofacial relationship and sexual dimorphism. In this paper, we propose a PCA-based method for visual and quantitative analysis, using 140 samples of 3D heads (70 male and 70 female), produced from computed tomography (CT) images. There are two parts to the method. First, skull and facial landmarks are manually marked to guide the model's registration so that dense corresponding vertices occupy the same relative position in every sample. Statistical shape spaces of the skull and face in dense corresponding vertices are constructed using PCA. Variations in these vertices, captured in every principal component (PC), are visualized to observe shape variability. The correlations of skull- and face-based PC scores are analysed, and linear regression is used to fit the craniofacial relationship. We compute the PC coefficients of a face based on this craniofacial relationship and the PC scores of a skull, and apply the coefficients to estimate a 3D face for the skull. To evaluate the accuracy of the computed craniofacial relationship, the mean and standard deviation of every vertex between the two models are computed, where these models are reconstructed using real PC scores and coefficients. Second, each PC in facial space is analysed for sex determination, for which support vector machines (SVMs) are used. We examined the correlation between PCs and sex, and explored the extent to which the choice of PCs affects the expression of sexual dimorphism. Our results suggest that skull- and face-based PCs can be used to describe the craniofacial relationship and that the accuracy of the method can be improved by using an increased number of face-based PCs. The results show that the accuracy of the sex classification is related to the choice of PCs. The highest sex classification rate is 91.43% using our method. Copyright © 2017 Elsevier Ltd. All rights reserved.
Sun, Li-Li; Wang, Meng; Zhang, Hui-Jie; Liu, Ya-Nan; Ren, Xiao-Liang; Deng, Yan-Ru; Qi, Ai-Di
2018-01-01
Polygoni Multiflori Radix (PMR) is increasingly being used not just as a traditional herbal medicine but also as a popular functional food. In this study, multivariate chemometric methods and mass spectrometry were combined to analyze the ultra-high-performance liquid chromatograph (UPLC) fingerprints of PMR from six different geographical origins. A chemometric strategy based on multivariate curve resolution-alternating least squares (MCR-ALS) and three classification methods is proposed to analyze the UPLC fingerprints obtained. Common chromatographic problems, including the background contribution, baseline contribution, and peak overlap, were handled by the established MCR-ALS model. A total of 22 components were resolved. Moreover, relative species concentrations were obtained from the MCR-ALS model, which was used for multivariate classification analysis. Principal component analysis (PCA) and Ward's method have been applied to classify 72 PMR samples from six different geographical regions. The PCA score plot showed that the PMR samples fell into four clusters, which related to the geographical location and climate of the source areas. The results were then corroborated by Ward's method. In addition, according to the variance-weighted distance between cluster centers obtained from Ward's method, five components were identified as the most significant variables (chemical markers) for cluster discrimination. A counter-propagation artificial neural network has been applied to confirm and predict the effects of chemical markers on different samples. Finally, the five chemical markers were identified by UPLC-quadrupole time-of-flight mass spectrometer. Components 3, 12, 16, 18, and 19 were identified as 2,3,5,4'-tetrahydroxy-stilbene-2-O-β-d-glucoside, emodin-8-O-β-d-glucopyranoside, emodin-8-O-(6'-O-acetyl)-β-d-glucopyranoside, emodin, and physcion, respectively. In conclusion, the proposed method can be applied for the comprehensive analysis of natural samples. Copyright © 2016. Published by Elsevier B.V.
On a PCA-based lung motion model
Li, Ruijiang; Lewis, John H; Jia, Xun; Zhao, Tianyu; Liu, Weifeng; Wuenschel, Sara; Lamb, James; Yang, Deshan; Low, Daniel A; Jiang, Steve B
2014-01-01
Respiration-induced organ motion is one of the major uncertainties in lung cancer radiotherapy and is crucial to be able to accurately model the lung motion. Most work so far has focused on the study of the motion of a single point (usually the tumor center of mass), and much less work has been done to model the motion of the entire lung. Inspired by the work of Zhang et al (2007 Med. Phys. 34 4772–81), we believe that the spatiotemporal relationship of the entire lung motion can be accurately modeled based on principle component analysis (PCA) and then a sparse subset of the entire lung, such as an implanted marker, can be used to drive the motion of the entire lung (including the tumor). The goal of this work is twofold. First, we aim to understand the underlying reason why PCA is effective for modeling lung motion and find the optimal number of PCA coefficients for accurate lung motion modeling. We attempt to address the above important problems both in a theoretical framework and in the context of real clinical data. Second, we propose a new method to derive the entire lung motion using a single internal marker based on the PCA model. The main results of this work are as follows. We derived an important property which reveals the implicit regularization imposed by the PCA model. We then studied the model using two mathematical respiratory phantoms and 11 clinical 4DCT scans for eight lung cancer patients. For the mathematical phantoms with cosine and an even power (2n) of cosine motion, we proved that 2 and 2n PCA coefficients and eigenvectors will completely represent the lung motion, respectively. Moreover, for the cosine phantom, we derived the equivalence conditions for the PCA motion model and the physiological 5D lung motion model (Low et al 2005 Int. J. Radiat. Oncol. Biol. Phys. 63 921–9). For the clinical 4DCT data, we demonstrated the modeling power and generalization performance of the PCA model. The average 3D modeling error using PCA was within 1 mm (0.7 ± 0.1 mm). When a single artificial internal marker was used to derive the lung motion, the average 3D error was found to be within 2 mm (1.8 ± 0.3 mm) through comprehensive statistical analysis. The optimal number of PCA coefficients needs to be determined on a patient-by-patient basis and two PCA coefficients seem to be sufficient for accurate modeling of the lung motion for most patients. In conclusion, we have presented thorough theoretical analysis and clinical validation of the PCA lung motion model. The feasibility of deriving the entire lung motion using a single marker has also been demonstrated on clinical data using a simulation approach. PMID:21865624
Population Analysis of Disabled Children by Departments in France
NASA Astrophysics Data System (ADS)
Meidatuzzahra, Diah; Kuswanto, Heri; Pech, Nicolas; Etchegaray, Amélie
2017-06-01
In this study, a statistical analysis is performed by model the variations of the disabled about 0-19 years old population among French departments. The aim is to classify the departments according to their profile determinants (socioeconomic and behavioural profiles). The analysis is focused on two types of methods: principal component analysis (PCA) and multiple correspondences factorial analysis (MCA) to review which one is the best methods for interpretation of the correlation between the determinants of disability (independent variable). The PCA is the best method for interpretation of the correlation between the determinants of disability (independent variable). The PCA reduces 14 determinants of disability to 4 axes, keeps 80% of total information, and classifies them into 7 classes. The MCA reduces the determinants to 3 axes, retains only 30% of information, and classifies them into 4 classes.
Li, Der-Chiang; Liu, Chiao-Wen; Hu, Susan C
2011-05-01
Medical data sets are usually small and have very high dimensionality. Too many attributes will make the analysis less efficient and will not necessarily increase accuracy, while too few data will decrease the modeling stability. Consequently, the main objective of this study is to extract the optimal subset of features to increase analytical performance when the data set is small. This paper proposes a fuzzy-based non-linear transformation method to extend classification related information from the original data attribute values for a small data set. Based on the new transformed data set, this study applies principal component analysis (PCA) to extract the optimal subset of features. Finally, we use the transformed data with these optimal features as the input data for a learning tool, a support vector machine (SVM). Six medical data sets: Pima Indians' diabetes, Wisconsin diagnostic breast cancer, Parkinson disease, echocardiogram, BUPA liver disorders dataset, and bladder cancer cases in Taiwan, are employed to illustrate the approach presented in this paper. This research uses the t-test to evaluate the classification accuracy for a single data set; and uses the Friedman test to show the proposed method is better than other methods over the multiple data sets. The experiment results indicate that the proposed method has better classification performance than either PCA or kernel principal component analysis (KPCA) when the data set is small, and suggest creating new purpose-related information to improve the analysis performance. This paper has shown that feature extraction is important as a function of feature selection for efficient data analysis. When the data set is small, using the fuzzy-based transformation method presented in this work to increase the information available produces better results than the PCA and KPCA approaches. Copyright © 2011 Elsevier B.V. All rights reserved.
Perdonà, Sisto; Marino, Ada; Mazzarella, Claudia; Perruolo, Giuseppe; D’Esposito, Vittoria; Cosimato, Vincenzo; Buonerba, Carlo; Di Lorenzo, Giuseppe; Musi, Gennaro; De Cobelli, Ottavio; Chun, Felix K.; Terracciano, Daniela
2013-01-01
Many efforts to reduce prostate specific antigen (PSA) overdiagnosis and overtreatment have been made. To this aim, Prostate Health Index (Phi) and Prostate Cancer Antigen 3 (PCA3) have been proposed as new more specific biomarkers. We evaluated the ability of phi and PCA3 to identify prostate cancer (PCa) at initial prostate biopsy in men with total PSA range of 2–10 ng/ml. The performance of phi and PCA3 were evaluated in 300 patients undergoing first prostate biopsy. ROC curve analyses tested the accuracy (AUC) of phi and PCA3 in predicting PCa. Decision curve analyses (DCA) were used to compare the clinical benefit of the two biomarkers. We found that the AUC value of phi (0.77) was comparable to those of %p2PSA (0.76) and PCA3 (0.73) with no significant differences in pairwise comparison (%p2PSA vs phi p = 0.673, %p2PSA vs. PCA3 p = 0.417 and phi vs. PCA3 p = 0.247). These three biomarkers significantly outperformed fPSA (AUC = 0.60), % fPSA (AUC = 0.62) and p2PSA (AUC = 0.63). At DCA, phi and PCA3 exhibited a very close net benefit profile until the threshold probability of 25%, then phi index showed higher net benefit than PCA3. Multivariable analysis showed that the addition of phi and PCA3 to the base multivariable model (age, PSA, %fPSA, DRE, prostate volume) increased predictive accuracy, whereas no model improved single biomarker performance. Finally we showed that subjects with active surveillance (AS) compatible cancer had significantly lower phi and PCA3 values (p<0.001 and p = 0.01, respectively). In conclusion, both phi and PCA3 comparably increase the accuracy in predicting the presence of PCa in total PSA range 2–10 ng/ml at initial biopsy, outperforming currently used %fPSA. PMID:23861782
Ferro, Matteo; Bruzzese, Dario; Perdonà, Sisto; Marino, Ada; Mazzarella, Claudia; Perruolo, Giuseppe; D'Esposito, Vittoria; Cosimato, Vincenzo; Buonerba, Carlo; Di Lorenzo, Giuseppe; Musi, Gennaro; De Cobelli, Ottavio; Chun, Felix K; Terracciano, Daniela
2013-01-01
Many efforts to reduce prostate specific antigen (PSA) overdiagnosis and overtreatment have been made. To this aim, Prostate Health Index (Phi) and Prostate Cancer Antigen 3 (PCA3) have been proposed as new more specific biomarkers. We evaluated the ability of phi and PCA3 to identify prostate cancer (PCa) at initial prostate biopsy in men with total PSA range of 2-10 ng/ml. The performance of phi and PCA3 were evaluated in 300 patients undergoing first prostate biopsy. ROC curve analyses tested the accuracy (AUC) of phi and PCA3 in predicting PCa. Decision curve analyses (DCA) were used to compare the clinical benefit of the two biomarkers. We found that the AUC value of phi (0.77) was comparable to those of %p2PSA (0.76) and PCA3 (0.73) with no significant differences in pairwise comparison (%p2PSA vs phi p = 0.673, %p2PSA vs. PCA3 p = 0.417 and phi vs. PCA3 p = 0.247). These three biomarkers significantly outperformed fPSA (AUC = 0.60), % fPSA (AUC = 0.62) and p2PSA (AUC = 0.63). At DCA, phi and PCA3 exhibited a very close net benefit profile until the threshold probability of 25%, then phi index showed higher net benefit than PCA3. Multivariable analysis showed that the addition of phi and PCA3 to the base multivariable model (age, PSA, %fPSA, DRE, prostate volume) increased predictive accuracy, whereas no model improved single biomarker performance. Finally we showed that subjects with active surveillance (AS) compatible cancer had significantly lower phi and PCA3 values (p<0.001 and p = 0.01, respectively). In conclusion, both phi and PCA3 comparably increase the accuracy in predicting the presence of PCa in total PSA range 2-10 ng/ml at initial biopsy, outperforming currently used %fPSA.
A Seven-Gene Locus for Synthesis of Phenazine-1-Carboxylic Acid by Pseudomonas fluorescens 2-79
Mavrodi, Dmitri V.; Ksenzenko, Vladimir N.; Bonsall, Robert F.; Cook, R. James; Boronin, Alexander M.; Thomashow, Linda S.
1998-01-01
Pseudomonas fluorescens 2-79 produces the broad-spectrum antibiotic phenazine-1-carboxylic acid (PCA), which is active against a variety of fungal root pathogens. In this study, seven genes designated phzABCDEFG that are sufficient for synthesis of PCA were localized within a 6.8-kb BglII-XbaI fragment from the phenazine biosynthesis locus of strain 2-79. Polypeptides corresponding to all phz genes were identified by analysis of recombinant plasmids in a T7 promoter/polymerase expression system. Products of the phzC, phzD, and phzE genes have similarities to enzymes of shikimic acid and chorismic acid metabolism and, together with PhzF, are absolutely necessary for PCA production. PhzG is similar to pyridoxamine-5′-phosphate oxidases and probably is a source of cofactor for the PCA-synthesizing enzyme(s). Products of the phzA and phzB genes are highly homologous to each other and may be involved in stabilization of a putative PCA-synthesizing multienzyme complex. Two new genes, phzX and phzY, that are homologous to phzA and phzB, respectively, were cloned and sequenced from P. aureofaciens 30-84, which produces PCA, 2-hydroxyphenazine-1-carboxylic acid, and 2-hydroxyphenazine. Based on functional analysis of the phz genes from strains 2-79 and 30-84, we postulate that different species of fluorescent pseudomonads have similar genetic systems that confer the ability to synthesize PCA. PMID:9573209
IMPROVED SEARCH OF PRINCIPAL COMPONENT ANALYSIS DATABASES FOR SPECTRO-POLARIMETRIC INVERSION
DOE Office of Scientific and Technical Information (OSTI.GOV)
Casini, R.; Lites, B. W.; Ramos, A. Asensio
2013-08-20
We describe a simple technique for the acceleration of spectro-polarimetric inversions based on principal component analysis (PCA) of Stokes profiles. This technique involves the indexing of the database models based on the sign of the projections (PCA coefficients) of the first few relevant orders of principal components of the four Stokes parameters. In this way, each model in the database can be attributed a distinctive binary number of 2{sup 4n} bits, where n is the number of PCA orders used for the indexing. Each of these binary numbers (indices) identifies a group of ''compatible'' models for the inversion of amore » given set of observed Stokes profiles sharing the same index. The complete set of the binary numbers so constructed evidently determines a partition of the database. The search of the database for the PCA inversion of spectro-polarimetric data can profit greatly from this indexing. In practical cases it becomes possible to approach the ideal acceleration factor of 2{sup 4n} as compared to the systematic search of a non-indexed database for a traditional PCA inversion. This indexing method relies on the existence of a physical meaning in the sign of the PCA coefficients of a model. For this reason, the presence of model ambiguities and of spectro-polarimetric noise in the observations limits in practice the number n of relevant PCA orders that can be used for the indexing.« less
Wang, Kai; Chen, Xinguang; Bird, Victoria Y; Gerke, Travis A; Manini, Todd M; Prosperi, Mattia
2017-11-01
The relationship between serum total testosterone and prostate cancer (PCa) risk is controversial. The hypothesis that faster age-related reduction in testosterone is linked with increased PCa risk remains untested. We conducted our study at a tertiary-level hospital in southeast of the USA, and derived data from the Medical Registry Database of individuals that were diagnosed of any prostate-related disease from 2001 to 2015. Cases were those diagnosed of PCa and had one or more measurements of testosterone prior to PCa diagnosis. Controls were those without PCa and had one or more testosterone measurements. Multivariable logistic regression models for PCa risk of absolute levels (one-time measure and 5-year average) and annual change in testosterone were respectively constructed. Among a total of 1,559 patients, 217 were PCa cases, and neither one-time measure nor 5-year average of testosterone was found to be significantly associated with PCa risk. Among the 379 patients with two or more testosterone measurements, 27 were PCa cases. For every 10 ng/dL increment in annual reduction of testosterone, the risk of PCa would increase by 14% [adjusted odds ratio, 1.14; 95% confidence interval (CI), 1.03-1.25]. Compared to patients with a relatively stable testosterone, patients with an annual testosterone reduction of more than 30 ng/dL had 5.03 [95% CI: 1.53, 16.55] fold increase in PCa risk. This implies a faster age-related reduction in, but not absolute level of serum total testosterone as a risk factor for PCa. Further longitudinal studies are needed to confirm this finding. © 2017 UICC.
NASA Astrophysics Data System (ADS)
Suzuki, Noriaki
Genetically engineered proteins for inorganics (GEPIs) belong to a new class of polypeptides that are designed to have specific affinities to inorganic materials. A "gold binding protein (GBP)" was chosen as a model protein for GEPIs to study the molecular origins of binding specificity to gold using Time-of-flight secondary ion mass spectrometry (TOF-SIMS) and X-ray photoelectron spectroscopy (XPS). TOF-SIMS, a surface-sensitive analytical instrument with extremely high mass resolutions, provides information on specific amino acid-surface interactions. We used "principal component analysis (PCA)" to analyze the data. We also introduced a new multivariate technique, "hierarchical cluster analysis (HCA)" to organize the data into meaningful structures by measuring a degree of "similarity" and "dissimilarity" of the data. This report discusses a combined use of PCA and HCA to elucidate the binding specificity of GBP to Au. Based on the knowledge gained from TOF-SIMS measurements, we further investigated the nature of the interaction between selected amino acids and noble metal surfaces by using X-ray photoelectron spectroscopy (XPS). We developed a unique capability to introduce water vapor during the adsorption of a single amino acid and applied this method to study the intrinsic nature of sidechain/Au interactions. To further apply this unique research protocol, we characterized another type of GEPI, "quartz binding protein (QBP)," to identify the possible binding sites. This thesis research aims to provide experimental protocols for analyzing short peptide-substrate interface from complex spectroscopic data by using multivariate analysis techniques.
Influence of apple pomace inclusion on the process of animal feed pelleting.
Maslovarić, Marijana D; Vukmirović, Đuro; Pezo, Lato; Čolović, Radmilo; Jovanović, Rade; Spasevski, Nedeljka; Tolimir, Nataša
2017-08-01
Apple pomace (AP) is the main by-product of apple juice production. Large amounts of this material disposed into landfills can cause serious environmental problems. One of the solutions is to utilise AP as animal feed. The aim of this study was to investigate the impact of dried AP inclusion into model mixtures made from conventional feedstuffs on pellet quality and pellet press performance. Three model mixtures, with different ratios of maize, sunflower meal and AP, were pelleted. Response surface methodology (RSM) was applied when designing the experiment. The simultaneous and interactive effects of apple pomace share (APS) in the mixtures, die thickness (DT) of the pellet press and initial moisture content of the mixtures (M), on pellet quality and production parameters were investigated. Principal component analysis (PCA) and standard score (SS) analysis were applied for comprehensive analysis of the experimental data. The increase in APS led to an improvement of pellet quality parameters: pellet durability index (PDI), hardness (H) and proportion of fines in pellets. The increase in DT and M resulted in pellet quality improvement. The increase in DT and APS resulted in higher energy consumption of the pellet press. APS was the most influential variable for PDI and H calculation, while APS and DT were the most influential variables in the calculation of pellet press energy consumption. PCA showed that the first two principal components could be considered sufficient for data representation. In conclusion, addition of dried AP to feed model mixtures significantly improved the quality of the pellets.
Folded concave penalized learning in identifying multimodal MRI marker for Parkinson’s disease
Liu, Hongcheng; Du, Guangwei; Zhang, Lijun; Lewis, Mechelle M.; Wang, Xue; Yao, Tao; Li, Runze; Huang, Xuemei
2016-01-01
Background Brain MRI holds promise to gauge different aspects of Parkinson’s disease (PD)-related pathological changes. Its analysis, however, is hindered by the high-dimensional nature of the data. New method This study introduces folded concave penalized (FCP) sparse logistic regression to identify biomarkers for PD from a large number of potential factors. The proposed statistical procedures target the challenges of high-dimensionality with limited data samples acquired. The maximization problem associated with the sparse logistic regression model is solved by local linear approximation. The proposed procedures then are applied to the empirical analysis of multimodal MRI data. Results From 45 features, the proposed approach identified 15 MRI markers and the UPSIT, which are known to be clinically relevant to PD. By combining the MRI and clinical markers, we can enhance substantially the specificity and sensitivity of the model, as indicated by the ROC curves. Comparison to existing methods We compare the folded concave penalized learning scheme with both the Lasso penalized scheme and the principle component analysis-based feature selection (PCA) in the Parkinson’s biomarker identification problem that takes into account both the clinical features and MRI markers. The folded concave penalty method demonstrates a substantially better clinical potential than both the Lasso and PCA in terms of specificity and sensitivity. Conclusions For the first time, we applied the FCP learning method to MRI biomarker discovery in PD. The proposed approach successfully identified MRI markers that are clinically relevant. Combining these biomarkers with clinical features can substantially enhance performance. PMID:27102045
Adam, Ahmed; Hellig, Julian C; Perera, Marlon; Bolton, Damien; Lawrentschuk, Nathan
2018-04-01
The use of mobile phone applications (Apps) has modernised the conventional practice of medicine. The diagnostic ability of the current Apps in prostate specific antigen monitoring, and its diagnostic ability within prostate cancer (PCa) risk calculators have not yet been appraised. We aimed to review, rate and assess the everyday functionality, and utility of all the currently available PCa risk calculator Apps. A systematic search on iTunes, Google Play Store, Blackberry World and Windows Apps Store, was performed on 23/11/2017, using the search term 'prostate cancer risk calculator'. After applying the exclusion criteria, each App was individually assessed and rated using pre-set criteria and grading was performed using the validated uMARS scale. In total, 83 Apps were retrieved. After applying our exclusion criteria, only 9 Apps were relevant, with 2 duplicated, and the remaining 7 were suitable for critical review. Data sizes ranged from 414 kb to 10.1 Mb. The cost of the Apps ranged from South African rand (ZAR) 0.00 to ZAR 29.99. The overall mean category uMARS scores ranged from 2.8/5 to 4.5/5. Apps such as Rotterdam Prostate Cancer Risk Calculator, Coral-Prostate Cancer Nomogram Calculator and CPC Risk Calculator, performed the best. The current PCa risk calculator mobile Apps available may be beneficial in counselling the concerned at risk patient. These Apps have potential to assist both the patient and the urologist alike. The PCa risk calculator App 'predictability' may be further enhanced by the incorporation of newly validated risk factors and predictors for PCa.
Russo, Giorgio Ivan; Regis, Federica; Castelli, Tommaso; Favilla, Vincenzo; Privitera, Salvatore; Giardina, Raimondo; Cimino, Sebastiano; Morgia, Giuseppe
2017-08-01
Markers for prostate cancer (PCa) have progressed over recent years. In particular, the prostate health index (PHI) and the 4-kallikrein (4K) panel have been demonstrated to improve the diagnosis of PCa. We aimed to review the diagnostic accuracy of PHI and the 4K panel for PCa detection. We performed a systematic literature search of PubMed, EMBASE, Cochrane, and Academic One File databases until July 2016. We included diagnostic accuracy studies that used PHI or 4K panel for the diagnosis of PCa or high-grade PCa. The methodological quality was assessed using the Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) tool. Twenty-eight studies including 16,762 patients have been included for the analysis. The pooled data showed a sensitivity of 0.89 and 0.74 for PHI and 4K panel, respectively, for PCa detection and a pooled specificity of 0.34 and 0.60 for PHI and 4K panel, respectively. The derived area under the curve (AUC) from the hierarchical summary receiver operating characteristic (HSROC) showed an accuracy of 0.76 and 0.72 for PHI and 4K panel respectively. For high-grade PCa detection, the pooled sensitivity was 0.93 and 0.87 for PHI and 4K panel, respectively, whereas the pooled specificity was 0.34 and 0.61 for PHI and 4K panel, respectively. The derived AUC from the HSROC showed an accuracy of 0.82 and 0.81 for PHI and 4K panel, respectively. Both PHI and the 4K panel provided good diagnostic accuracy in detecting overall and high-grade PCa. Copyright © 2016 Elsevier Inc. All rights reserved.
Yun, Seok Joong; Jeong, Pildu; Kang, Ho Won; Kim, Ye-Hwan; Kim, Eun-Ah; Yan, Chunri; Choi, Young-Ki; Kim, Dongho; Kim, Jung Min; Kim, Seon-Kyu; Kim, Seon-Young; Kim, Sang Tae; Kim, Won Tae; Lee, Ok-Jun; Koh, Gou-Young; Moon, Sung-Kwon; Kim, Isaac Yi; Kim, Jayoung; Choi, Yung-Hyun; Kim, Wun-Jae
2015-06-01
MicroRNAs (miRNAs) in biological fluids are potential biomarkers for the diagnosis and assessment of urological diseases such as benign prostatic hyperplasia (BPH) and prostate cancer (PCa). The aim of the study was to identify and validate urinary cell-free miRNAs that can segregate patients with PCa from those with BPH. In total, 1,052 urine, 150 serum, and 150 prostate tissue samples from patients with PCa or BPH were used in the study. A urine-based miRNA microarray analysis suggested the presence of differentially expressed urinary miRNAs in patients with PCa, and these were further validated in three independent PCa cohorts, using a quantitative reverse transcriptionpolymerase chain reaction analysis. The expression levels of hsa-miR-615-3p, hsv1-miR-H18, hsv2-miR-H9-5p, and hsa-miR-4316 were significantly higher in urine samples of patients with PCa than in those of BPH controls. In particular, herpes simplex virus (hsv)-derived hsv1-miR-H18 and hsv2-miR-H9-5p showed better diagnostic performance than did the serum prostate-specific antigen (PSA) test for patients in the PSA gray zone. Furthermore, a combination of urinary hsv2-miR-H9-5p with serum PSA showed high sensitivity and specificity, providing a potential clinical benefit by reducing unnecessary biopsies. Our findings showed that hsv-encoded hsv1-miR-H18 and hsv2-miR-H9-5p are significantly associated with PCa and can facilitate early diagnosis of PCa for patients within the serum PSA gray zone.
Yun, Seok Joong; Jeong, Pildu; Kang, Ho Won; Kim, Ye-Hwan; Kim, Eun-Ah; Yan, Chunri; Choi, Young-Ki; Kim, Dongho; Kim, Jung Min; Kim, Seon-Kyu; Kim, Seon-Young; Kim, Sang Tae; Kim, Won Tae; Lee, Ok-Jun; Koh, Gou-Young; Moon, Sung-Kwon; Kim, Isaac Yi; Kim, Jayoung; Choi, Yung-Hyun; Kim, Wun-Jae
2015-01-01
Purpose: MicroRNAs (miRNAs) in biological fluids are potential biomarkers for the diagnosis and assessment of urological diseases such as benign prostatic hyperplasia (BPH) and prostate cancer (PCa). The aim of the study was to identify and validate urinary cell-free miRNAs that can segregate patients with PCa from those with BPH. Methods: In total, 1,052 urine, 150 serum, and 150 prostate tissue samples from patients with PCa or BPH were used in the study. A urine-based miRNA microarray analysis suggested the presence of differentially expressed urinary miRNAs in patients with PCa, and these were further validated in three independent PCa cohorts, using a quantitative reverse transcriptionpolymerase chain reaction analysis. Results: The expression levels of hsa-miR-615-3p, hsv1-miR-H18, hsv2-miR-H9-5p, and hsa-miR-4316 were significantly higher in urine samples of patients with PCa than in those of BPH controls. In particular, herpes simplex virus (hsv)-derived hsv1-miR-H18 and hsv2-miR-H9-5p showed better diagnostic performance than did the serum prostate-specific antigen (PSA) test for patients in the PSA gray zone. Furthermore, a combination of urinary hsv2-miR-H9-5p with serum PSA showed high sensitivity and specificity, providing a potential clinical benefit by reducing unnecessary biopsies. Conclusions: Our findings showed that hsv-encoded hsv1-miR-H18 and hsv2-miR-H9-5p are significantly associated with PCa and can facilitate early diagnosis of PCa for patients within the serum PSA gray zone. PMID:26126436
Marita, Jane M; Hatfield, Ronald D; Rancour, David M; Frost, Kenneth E
2014-01-01
Grasses, such as Zea mays L. (maize), contain relatively high levels of p-coumarates (pCA) within their cell walls. Incorporation of pCA into cell walls is believed to be due to a hydroxycinnamyl transferase that couples pCA to monolignols. To understand the role of pCA in maize development, the p-coumaroyl CoA:hydroxycinnamyl alcohol transferase (pCAT) was isolated and purified from maize stems. Purified pCAT was subjected to partial trypsin digestion, and peptides were sequenced by tandem mass spectrometry. TBLASTN analysis of the acquired peptide sequences identified a single full-length maize cDNA clone encoding all the peptide sequences obtained from the purified enzyme. The cDNA clone was obtained and used to generate an RNAi construct for suppressing pCAT expression in maize. Here we describe the effects of suppression of pCAT in maize. Primary screening of transgenic maize seedling leaves using a new rapid analytical platform was used to identify plants with decreased amounts of pCA. Using this screening method, mature leaves from fully developed plants were analyzed, confirming reduced pCA levels throughout plant development. Complete analysis of isolated cell walls from mature transgenic stems and leaves revealed that lignin levels did not change, but pCA levels decreased and the lignin composition was altered. Transgenic plants with the lowest levels of pCA had decreased levels of syringyl units in the lignin. Thus, altering the levels of pCAT expression in maize leads to altered lignin composition, but does not appear to alter the total amount of lignin present in the cell walls. PMID:24654730
Adeola, Henry A.; Smith, Muneerah; Kaestner, Lisa; Blackburn, Jonathan M.; Zerbini, Luiz F.
2016-01-01
There is a growing need for high throughput diagnostic tools for early diagnosis and treatment monitoring of prostate cancer (PCa) in Africa. The role of cancer-testis antigens (CTAs) in PCa in men of African descent is poorly researched. Hence, we aimed to elucidate the role of 123 Tumour Associated Antigens (TAAs) using antigen microarray platform in blood samples (N = 67) from a South African PCa, Benign prostatic hyperplasia (BPH) and disease control (DC) cohort. Linear (fold-over-cutoff) and differential expression quantitation of autoantibody signal intensities were performed. Molecular signatures of candidate PCa antigen biomarkers were identified and analyzed for ethnic group variation. Potential cancer diagnostic and immunotherapeutic inferences were drawn. We identified a total of 41 potential diagnostic/therapeutic antigen biomarkers for PCa. By linear quantitation, four antigens, GAGE1, ROPN1, SPANXA1 and PRKCZ were found to have higher autoantibody titres in PCa serum as compared with BPH where MAGEB1 and PRKCZ were highly expressed. Also, p53 S15A and p53 S46A were found highly expressed in the disease control group. Statistical analysis by differential expression revealed twenty-four antigens as upregulated in PCa samples, while 11 were downregulated in comparison to BPH and DC (FDR = 0.01). FGFR2, COL6A1and CALM1 were verifiable biomarkers of PCa analysis using urinary shotgun proteomics. Functional pathway annotation of identified biomarkers revealed similar enrichment both at genomic and proteomic level and ethnic variations were observed. Cancer antigen arrays are emerging useful in potential diagnostic and immunotherapeutic antigen biomarker discovery. PMID:26885621
Marita, Jane M; Hatfield, Ronald D; Rancour, David M; Frost, Kenneth E
2014-06-01
Grasses, such as Zea mays L. (maize), contain relatively high levels of p-coumarates (pCA) within their cell walls. Incorporation of pCA into cell walls is believed to be due to a hydroxycinnamyl transferase that couples pCA to monolignols. To understand the role of pCA in maize development, the p-coumaroyl CoA:hydroxycinnamyl alcohol transferase (pCAT) was isolated and purified from maize stems. Purified pCAT was subjected to partial trypsin digestion, and peptides were sequenced by tandem mass spectrometry. TBLASTN analysis of the acquired peptide sequences identified a single full-length maize cDNA clone encoding all the peptide sequences obtained from the purified enzyme. The cDNA clone was obtained and used to generate an RNAi construct for suppressing pCAT expression in maize. Here we describe the effects of suppression of pCAT in maize. Primary screening of transgenic maize seedling leaves using a new rapid analytical platform was used to identify plants with decreased amounts of pCA. Using this screening method, mature leaves from fully developed plants were analyzed, confirming reduced pCA levels throughout plant development. Complete analysis of isolated cell walls from mature transgenic stems and leaves revealed that lignin levels did not change, but pCA levels decreased and the lignin composition was altered. Transgenic plants with the lowest levels of pCA had decreased levels of syringyl units in the lignin. Thus, altering the levels of pCAT expression in maize leads to altered lignin composition, but does not appear to alter the total amount of lignin present in the cell walls. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.
Sakhtah, Hassan; Koyama, Leslie; Zhang, Yihan; Morales, Diana K.; Fields, Blanche L.; Price-Whelan, Alexa; Hogan, Deborah A.; Shepard, Kenneth; Dietrich, Lars E. P.
2016-01-01
Redox-cycling compounds, including endogenously produced phenazine antibiotics, induce expression of the efflux pump MexGHI-OpmD in the opportunistic pathogen Pseudomonas aeruginosa. Previous studies of P. aeruginosa virulence, physiology, and biofilm development have focused on the blue phenazine pyocyanin and the yellow phenazine-1-carboxylic acid (PCA). In P. aeruginosa phenazine biosynthesis, conversion of PCA to pyocyanin is presumed to proceed through the intermediate 5-methylphenazine-1-carboxylate (5-Me-PCA), a reactive compound that has eluded detection in most laboratory samples. Here, we apply electrochemical methods to directly detect 5-Me-PCA and find that it is transported by MexGHI-OpmD in P. aeruginosa strain PA14 planktonic and biofilm cells. We also show that 5-Me-PCA is sufficient to fully induce MexGHI-OpmD expression and that it is required for wild-type colony biofilm morphogenesis. These physiological effects are consistent with the high redox potential of 5-Me-PCA, which distinguishes it from other well-studied P. aeruginosa phenazines. Our observations highlight the importance of this compound, which was previously overlooked due to the challenges associated with its detection, in the context of P. aeruginosa gene expression and multicellular behavior. This study constitutes a unique demonstration of efflux-based self-resistance, controlled by a simple circuit, in a Gram-negative pathogen. PMID:27274079
Saidemberg, Daniel M; Baptista-Saidemberg, Nicoli B; Palma, Mario S
2011-09-01
When searching for prospective novel peptides, it is difficult to determine the biological activity of a peptide based only on its sequence. The "trial and error" approach is generally laborious, expensive and time consuming due to the large number of different experimental setups required to cover a reasonable number of biological assays. To simulate a virtual model for Hymenoptera insects, 166 peptides were selected from the venoms and hemolymphs of wasps, bees and ants and applied to a mathematical model of multivariate analysis, with nine different chemometric components: GRAVY, aliphaticity index, number of disulfide bonds, total residues, net charge, pI value, Boman index, percentage of alpha helix, and flexibility prediction. Principal component analysis (PCA) with non-linear iterative projections by alternating least-squares (NIPALS) algorithm was performed, without including any information about the biological activity of the peptides. This analysis permitted the grouping of peptides in a way that strongly correlated to the biological function of the peptides. Six different groupings were observed, which seemed to correspond to the following groups: chemotactic peptides, mastoparans, tachykinins, kinins, antibiotic peptides, and a group of long peptides with one or two disulfide bonds and with biological activities that are not yet clearly defined. The partial overlap between the mastoparans group and the chemotactic peptides, tachykinins, kinins and antibiotic peptides in the PCA score plot may be used to explain the frequent reports in the literature about the multifunctionality of some of these peptides. The mathematical model used in the present investigation can be used to predict the biological activities of novel peptides in this system, and it may also be easily applied to other biological systems. Copyright © 2011 Elsevier Inc. All rights reserved.
Facilitating text reading in posterior cortical atrophy.
Yong, Keir X X; Rajdev, Kishan; Shakespeare, Timothy J; Leff, Alexander P; Crutch, Sebastian J
2015-07-28
We report (1) the quantitative investigation of text reading in posterior cortical atrophy (PCA), and (2) the effects of 2 novel software-based reading aids that result in dramatic improvements in the reading ability of patients with PCA. Reading performance, eye movements, and fixations were assessed in patients with PCA and typical Alzheimer disease and in healthy controls (experiment 1). Two reading aids (single- and double-word) were evaluated based on the notion that reducing the spatial and oculomotor demands of text reading might support reading in PCA (experiment 2). Mean reading accuracy in patients with PCA was significantly worse (57%) compared with both patients with typical Alzheimer disease (98%) and healthy controls (99%); spatial aspects of passages were the primary determinants of text reading ability in PCA. Both aids led to considerable gains in reading accuracy (PCA mean reading accuracy: single-word reading aid = 96%; individual patient improvement range: 6%-270%) and self-rated measures of reading. Data suggest a greater efficiency of fixations and eye movements under the single-word reading aid in patients with PCA. These findings demonstrate how neurologic characterization of a neurodegenerative syndrome (PCA) and detailed cognitive analysis of an important everyday skill (reading) can combine to yield aids capable of supporting important everyday functional abilities. This study provides Class III evidence that for patients with PCA, 2 software-based reading aids (single-word and double-word) improve reading accuracy. © 2015 American Academy of Neurology.
Facilitating text reading in posterior cortical atrophy
Rajdev, Kishan; Shakespeare, Timothy J.; Leff, Alexander P.; Crutch, Sebastian J.
2015-01-01
Objective: We report (1) the quantitative investigation of text reading in posterior cortical atrophy (PCA), and (2) the effects of 2 novel software-based reading aids that result in dramatic improvements in the reading ability of patients with PCA. Methods: Reading performance, eye movements, and fixations were assessed in patients with PCA and typical Alzheimer disease and in healthy controls (experiment 1). Two reading aids (single- and double-word) were evaluated based on the notion that reducing the spatial and oculomotor demands of text reading might support reading in PCA (experiment 2). Results: Mean reading accuracy in patients with PCA was significantly worse (57%) compared with both patients with typical Alzheimer disease (98%) and healthy controls (99%); spatial aspects of passages were the primary determinants of text reading ability in PCA. Both aids led to considerable gains in reading accuracy (PCA mean reading accuracy: single-word reading aid = 96%; individual patient improvement range: 6%–270%) and self-rated measures of reading. Data suggest a greater efficiency of fixations and eye movements under the single-word reading aid in patients with PCA. Conclusions: These findings demonstrate how neurologic characterization of a neurodegenerative syndrome (PCA) and detailed cognitive analysis of an important everyday skill (reading) can combine to yield aids capable of supporting important everyday functional abilities. Classification of evidence: This study provides Class III evidence that for patients with PCA, 2 software-based reading aids (single-word and double-word) improve reading accuracy. PMID:26138948
Shawky, Eman; Abou El Kheir, Rasha M
2018-02-11
Species of Apiaceae are used in folk medicine as spices and in officinal medicinal preparations of drugs. They are an excellent source of phenolics exhibiting antioxidant activity, which are of great benefit to human health. Discrimination among Apiaceae medicinal herbs remains an intricate challenge due to their morphological similarity. In this study, a combined "untargeted" and "targeted" approach to investigate different Apiaceae plants species was proposed by using the merging of high-performance thin layer chromatography (HPTLC)-image analysis and pattern recognition methods which were used for fingerprinting and classification of 42 different Apiaceae samples collected from Egypt. Software for image processing was applied for fingerprinting and data acquisition. HPTLC fingerprint assisted by principal component analysis (PCA) and hierarchical cluster analysis (HCA)-heat maps resulted in a reliable untargeted approach for discrimination and classification of different samples. The "targeted" approach was performed by developing and validating an HPTLC method allowing the quantification of eight flavonoids. The combination of quantitative data with PCA and HCA-heat-maps allowed the different samples to be discriminated from each other. The use of chemometrics tools for evaluation of fingerprints reduced expense and analysis time. The proposed method can be adopted for routine discrimination and evaluation of the phytochemical variability in different Apiaceae species extracts. Copyright © 2018 John Wiley & Sons, Ltd.
Stefanidis, Konstantinos; Papatheodorou, George
2018-01-01
During the last decades, Mediterranean freshwater ecosystems, especially lakes, have been under severe pressure due to increasing eutrophication and water quality deterioration. In this article, we compared the effectiveness of different data analysis methods by assessing the contribution of environmental parameters to eutrophication processes. For this purpose, principal components analysis (PCA), cluster analysis, and a self-organizing map (SOM) were applied, using water quality data from two transboundary lakes of North Greece. SOM is considered as an advanced and powerful data analysis tool because of its ability to represent complex and nonlinear relationships among multivariate data sets. The results of PCA and cluster analysis agreed with the SOM results, although the latter provided more information because of the visualization abilities regarding the parameters’ relationships. Besides nutrients that were found to be a key factor for controlling chlorophyll-a (Chl-a), water temperature was related positively with algal production, while the Secchi disk depth parameter was found to be highly important and negatively related toeutrophic conditions. In general, the SOM results were more specific and allowed direct associations between the water quality variables. Our work showed that SOMs can be used effectively in limnological studies to produce robust and interpretable results, aiding scientists and managers to cope with environmental problems such as eutrophication. PMID:29562675
A reduced basis method for molecular dynamics simulation
NASA Astrophysics Data System (ADS)
Vincent-Finley, Rachel Elisabeth
In this dissertation, we develop a method for molecular simulation based on principal component analysis (PCA) of a molecular dynamics trajectory and least squares approximation of a potential energy function. Molecular dynamics (MD) simulation is a computational tool used to study molecular systems as they evolve through time. With respect to protein dynamics, local motions, such as bond stretching, occur within femtoseconds, while rigid body and large-scale motions, occur within a range of nanoseconds to seconds. To capture motion at all levels, time steps on the order of a femtosecond are employed when solving the equations of motion and simulations must continue long enough to capture the desired large-scale motion. To date, simulations of solvated proteins on the order of nanoseconds have been reported. It is typically the case that simulations of a few nanoseconds do not provide adequate information for the study of large-scale motions. Thus, the development of techniques that allow longer simulation times can advance the study of protein function and dynamics. In this dissertation we use principal component analysis (PCA) to identify the dominant characteristics of an MD trajectory and to represent the coordinates with respect to these characteristics. We augment PCA with an updating scheme based on a reduced representation of a molecule and consider equations of motion with respect to the reduced representation. We apply our method to butane and BPTI and compare the results to standard MD simulations of these molecules. Our results indicate that the molecular activity with respect to our simulation method is analogous to that observed in the standard MD simulation with simulations on the order of picoseconds.
Paris, Guillaume; Ramseyer, Christophe; Enescu, Mironel
2014-05-01
The conformational dynamics of human serum albumin (HSA) was investigated by principal component analysis (PCA) applied to three molecular dynamics trajectories of 200 ns each. The overlap of the essential subspaces spanned by the first 10 principal components (PC) of different trajectories was about 0.3 showing that the PCA based on a trajectory length of 200 ns is not completely convergent for this protein. The contributions of the relative motion of subdomains and of the subdomains (internal) distortion to the first 10 PCs were found to be comparable. Based on the distribution of the first 3 PC, 10 protein conformers are identified showing relative root mean square deviations (RMSD) between 2.3 and 4.6 Å. The main PCs are found to be delocalized over the whole protein structure indicating that the motions of different protein subdomains are coupled. This coupling is considered as being related to the allosteric effects observed upon ligand binding to HSA. On the other hand, the first PC of one of the three trajectories describes a conformational transition of the protein domain I that is close to that experimentally observed upon myristate binding. This is a theoretical support for the older hypothesis stating that changes of the protein onformation favorable to binding can precede the ligand complexation. A detailed all atoms PCA performed on the primary Sites 1 and 2 confirms the multiconformational character of the HSA binding sites as well as the significant coupling of their motions. Copyright © 2013 Wiley Periodicals, Inc.
An electrophysiological index of changes in risk decision-making strategies.
Zhang, Dandan; Gu, Ruolei; Wu, Tingting; Broster, Lucas S; Luo, Yi; Jiang, Yang; Luo, Yue-jia
2013-07-01
Human decision-making is significantly modulated by previously experienced outcomes. Using event-related potentials (ERPs), we examined whether ERP components evoked by outcome feedbacks could serve as biomarkers to signal the influence of current outcome evaluation on subsequent decision-making. In this study, 18 adult volunteers participated in a simple monetary gambling task, in which they were asked to choose between two options that differed in risk. Their decisions were immediately followed by outcome presentation. Temporospatial principle component analysis (PCA) was applied to the outcome-onset locked ERPs in the 200-1000 ms time window. The PCA factors that approximated classical ERP components (P2, feedback-related negativity, P3a, and P3b) in terms of time course and scalp distribution were tested for their association with subsequent decision-making strategies. Our results revealed that a fronto-central PCA factor approximating the classical P3a was related to changes of decision-making strategies on subsequent trials. The decision to switch between high- and low-risk options resulted in a larger P3a relative to the decision to retain the same choice. According to the results, we suggest that the amplitude of the fronto-central P3a is an electrophysiological index of the influence of current outcome on subsequent risk decision-making. Furthermore, the ERP source analysis indicated that the activations of the frontopolar cortex and sensorimotor cortex were involved in subsequent changes of strategies, which enriches our understanding of the neural mechanisms of adjusting decision-making strategies based on previous experience. Copyright © 2013 Elsevier Ltd. All rights reserved.
Contrasting nitrogen fate in watersheds using agricultural and water quality information
Essaid, Hedeff I.; Baker, Nancy T.; McCarthy, Kathleen A.
2016-01-01
Surplus nitrogen (N) estimates, principal component analysis (PCA), and end-member mixing analysis (EMMA) were used in a multisite comparison contrasting the fate of N in diverse agricultural watersheds. We applied PCA-EMMA in 10 watersheds located in Indiana, Iowa, Maryland, Nebraska, Mississippi, and Washington ranging in size from 5 to 1254 km2 with four nested watersheds. Watershed Surplus N was determined by subtracting estimates of crop uptake and volatilization from estimates of N input from atmospheric deposition, plant fixation, fertilizer, and manure for the period from 1987 to 2004. Watershed average Surplus N ranged from 11 to 52 kg N ha−1 and from 9 to 32% of N input. Solute concentrations in streams, overland runoff, tile drainage, groundwater (GW), streambeds, and the unsaturated zone were used in the PCA-EMMA procedure to identify independent components contributing to observed stream concentration variability and the end-members contributing to streamflow and NO3 load. End-members included dilute runoff, agricultural runoff, benthic-processing, tile drainage, and oxic and anoxic GW. Surplus N was larger in watersheds with more permeable soils (Washington, Nebraska, and Maryland) that allowed greater infiltration, and oxic GW was the primary source of NO3 load. Subsurface transport of NO3 in these watersheds resulted in some removal of Surplus N by denitrification. In less permeable watersheds (Iowa, Indiana, and Mississippi), NO3 was rapidly transported to the stream by tile drainage and runoff with little removal. Evidence of streambed removal of NO3 by benthic diatoms was observed in the larger watersheds.
NASA Astrophysics Data System (ADS)
Panahi, Nima S.
We studied the problem of understanding and computing the essential features and dynamics of molecular motions through the development of two theories for two different systems. First, we studied the process of the Berry Pseudorotation of PF5 and the rotations it induces in the molecule through its natural and intrinsic geometric nature by setting it in the language of fiber bundles and graph theory. With these tools, we successfully extracted the essentials of the process' loops and induced rotations. The infinite number of pseudorotation loops were broken down into a small set of essential loops called "super loops", with their intrinsic properties and link to the physical movements of the molecule extensively studied. In addition, only the three "self-edge loops" generated any induced rotations, and then only a finite number of classes of them. Second, we studied applying the statistical methods of Principal Components Analysis (PCA) and Principal Coordinate Analysis (PCO) to capture only the most important changes in Argon clusters so as to reduce computational costs and graph the potential energy surface (PES) in three dimensions respectively. Both methods proved successful, but PCA was only partially successful since one will only see advantages for PES database systems much larger than those both currently being studied and those that can be computationally studied in the next few decades to come. In addition, PCA is only needed for the very rare case of a PES database that does not already include Hessian eigenvalues.
An electrophysiological index of changes in risk decision-making strategies
Zhang, Dandan; Gu, Ruolei; Wu, Tingting; Broster, Lucas S.; Luo, Yi; Jiang, Yang; Luo, Yue-jia
2014-01-01
Human decision-making is significantly modulated by previously experienced outcomes. Using event-related potentials (ERPs), we examined whether ERP components evoked by outcome feedbacks could serve as biomarkers to signal the influence of current outcome evaluation on subsequent decision-making. In this study, eighteen adult volunteers participated in a simple monetary gambling task, in which they were asked to choose between two options that differed in risk. Their decisions were immediately followed by outcome presentation. Temporospatial principle component analysis (PCA) was applied to the outcome-onset locked ERPs in the -200 – 1000 ms time window. The PCA factors that approximated classical ERP components (P2, feedback-related negativity, P3a, & P3b) in terms of time course and scalp distribution were tested for their association with subsequent decision-making strategies. Our results revealed that a fronto-central PCA factor approximating the classical P3a was related to changes of decision-making strategies on subsequent trials. The decision to switch between high- and low-risk options resulted in a larger P3a relative to the decision to retain the same choice. According to the results, we suggest the amplitude of the fronto-central P3a is an electrophysiological index of the influence of current outcome on subsequent risk decision-making. Furthermore, the ERP source analysis indicated that the activations of the frontopolar cortex and sensorimotor cortex were involved in subsequent changes of strategies, which enriches our understanding of the neural mechanisms of adjusting decision-making strategies based on previous experience. PMID:23643796
Vanderhaeghe, F; Smolders, A J P; Roelofs, J G M; Hoffmann, M
2012-03-01
Selecting an appropriate variable subset in linear multivariate methods is an important methodological issue for ecologists. Interest often exists in obtaining general predictive capacity or in finding causal inferences from predictor variables. Because of a lack of solid knowledge on a studied phenomenon, scientists explore predictor variables in order to find the most meaningful (i.e. discriminating) ones. As an example, we modelled the response of the amphibious softwater plant Eleocharis multicaulis using canonical discriminant function analysis. We asked how variables can be selected through comparison of several methods: univariate Pearson chi-square screening, principal components analysis (PCA) and step-wise analysis, as well as combinations of some methods. We expected PCA to perform best. The selected methods were evaluated through fit and stability of the resulting discriminant functions and through correlations between these functions and the predictor variables. The chi-square subset, at P < 0.05, followed by a step-wise sub-selection, gave the best results. In contrast to expectations, PCA performed poorly, as so did step-wise analysis. The different chi-square subset methods all yielded ecologically meaningful variables, while probable noise variables were also selected by PCA and step-wise analysis. We advise against the simple use of PCA or step-wise discriminant analysis to obtain an ecologically meaningful variable subset; the former because it does not take into account the response variable, the latter because noise variables are likely to be selected. We suggest that univariate screening techniques are a worthwhile alternative for variable selection in ecology. © 2011 German Botanical Society and The Royal Botanical Society of the Netherlands.
Multi-Centrality Graph Spectral Decompositions and Their Application to Cyber Intrusion Detection
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chen, Pin-Yu; Choudhury, Sutanay; Hero, Alfred
Many modern datasets can be represented as graphs and hence spectral decompositions such as graph principal component analysis (PCA) can be useful. Distinct from previous graph decomposition approaches based on subspace projection of a single topological feature, e.g., the centered graph adjacency matrix (graph Laplacian), we propose spectral decomposition approaches to graph PCA and graph dictionary learning that integrate multiple features, including graph walk statistics, centrality measures and graph distances to reference nodes. In this paper we propose a new PCA method for single graph analysis, called multi-centrality graph PCA (MC-GPCA), and a new dictionary learning method for ensembles ofmore » graphs, called multi-centrality graph dictionary learning (MC-GDL), both based on spectral decomposition of multi-centrality matrices. As an application to cyber intrusion detection, MC-GPCA can be an effective indicator of anomalous connectivity pattern and MC-GDL can provide discriminative basis for attack classification.« less
Revealing the ultrafast outflow in IRAS 13224-3809 through spectral variability
NASA Astrophysics Data System (ADS)
Parker, M. L.; Alston, W. N.; Buisson, D. J. K.; Fabian, A. C.; Jiang, J.; Kara, E.; Lohfink, A.; Pinto, C.; Reynolds, C. S.
2017-08-01
We present an analysis of the long-term X-ray variability of the extreme narrow-line Seyfert 1 galaxy IRAS 13224-3809 using principal component analysis (PCA) and fractional excess variability (Fvar) spectra to identify model-independent spectral components. We identify a series of variability peaks in both the first PCA component and Fvar spectrum which correspond to the strongest predicted absorption lines from the ultrafast outflow (UFO) discovered by Parker et al. (2017). We also find higher order PCA components, which correspond to variability of the soft excess and reflection features. The subtle differences between RMS and PCA results argue that the observed flux-dependence of the absorption is due to increased ionization of the gas, rather than changes in column density or covering fraction. This result demonstrates that we can detect outflows from variability alone and that variability studies of UFOs are an extremely promising avenue for future research.
Neblett, Enrique W; Sosoo, Effua E; Willis, Henry A; Bernard, Donte L; Bae, Jiwoon; Billingsley, Janelle T
Racism constitutes a significant risk to the healthy development of African American youth. Fortunately, however, not all youth who experience racism evidence negative developmental outcomes. In this chapter, we examine person-centered analysis (PCA)-a quantitative technique that investigates how variables combine across individuals-as a useful tool for elucidating racial and ethnic protective processes that mitigate the negative impact of racism. We review recent studies employing PCA in examinations of racial identity, racial socialization, and other race-related experiences, as well as how these constructs correlate with and impact African American youth development. We also consider challenges and limitations of PCA and conclude with a discussion of future research and how PCA might be used to promote equity and justice for African American and other racial and ethnic minority youth who experience racism. © 2016 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Chen, Gang; Metz, Margaret R.; Rizzo, David M.; Dillon, Whalen W.; Meentemeyer, Ross K.
2015-04-01
Forest ecosystems are subject to a variety of disturbances with increasing intensities and frequencies, which may permanently change the trajectories of forest recovery and disrupt the ecosystem services provided by trees. Fire and invasive species, especially exotic disease-causing pathogens and insects, are examples of disturbances that together could pose major threats to forest health. This study examines the impacts of fire and exotic disease (sudden oak death) on forests, with an emphasis on the assessment of post-fire burn severity in a forest where trees have experienced three stages of disease progression pre-fire: early-stage (trees retaining dried foliage and fine twigs), middle-stage (trees losing fine crown fuels), and late-stage (trees falling down). The research was conducted by applying Geographic Object-Based Image Analysis (GEOBIA) to MASTER airborne images that were acquired immediately following the fire for rapid assessment and contained both high-spatial (4 m) and high-spectral (50 bands) resolutions. Although GEOBIA has gradually become a standard tool for analyzing high-spatial resolution imagery, high-spectral resolution data (dozens to hundreds of bands) can dramatically reduce computation efficiency in the process of segmentation and object-based variable extraction, leading to complicated variable selection for succeeding modeling. Hence, we also assessed two widely used band reduction algorithms, PCA (principal component analysis) and MNF (minimum noise fraction), for the delineation of image objects and the subsequent performance of burn severity models using either PCA or MNF derived variables. To increase computation efficiency, only the top 5 PCA and MNF and top 10 PCA and MNF components were evaluated, which accounted for 10% and 20% of the total number of the original 50 spectral bands, respectively. Results show that if no band reduction was applied the models developed for the three stages of disease progression had relatively similar performance, where both spectral responses and texture contributed to burn assessments. However, the application of PCA and MNF introduced much greater variation among models across the three stages. For the early-stage disease progression, neither band reduction algorithms improved or retained the accuracy of burn severity modeling (except for the use of 10 MNF components). Compared to the no-band-reduction scenario, band reduction led to a greater level of overestimation of low-degree burns and underestimation of medium-degree burns, suggesting that the spectral variation removed by PCA and MNF was vital for distinguishing between the spectral reflectance from disease-induced dried crowns (still retaining high structural complexity) and fire ash. For the middle-stage, both algorithms improved the model R2 values by 2-37%, while the late-stage models had comparable or better performance to those using the original 50 spectral bands. This could be explained by the loss of tree crowns enabling better signal penetration, thus leading to reduced spectral variation from canopies. Hence, spectral bands containing a high degree of random noise were correctly removed by the band reduction algorithms. Compared to the middle-stage, the late-stage forest stands were covered by large piles of fallen trees and branches, resulting in higher variability of MASTER imagery. The ability of band reduction to improve the model performance for these late-stage forest stands was reduced, because the valuable spectral variation representing the actual late-stage forest status was partially removed by both algorithms as noise. Our results indicate that PCA and MNF are promising for balancing computation efficiency and the performance of burn severity models in forest stands subject to the middle and late stages of sudden oak death disease progression. Compared to PCA, MNF dramatically reduced image spectral variation, generating larger image objects with less complexity of object shapes. Whereas, PCA-based models delivered superior performance in most evaluated cases suggesting that some key spectral variability contributing to the accuracy of burn severity models in diseased forests may have been removed together with true spectral noise through MNF transformations.
Multiscale 3D Shape Analysis using Spherical Wavelets
Nain, Delphine; Haker, Steven; Bobick, Aaron; Tannenbaum, Allen
2013-01-01
Shape priors attempt to represent biological variations within a population. When variations are global, Principal Component Analysis (PCA) can be used to learn major modes of variation, even from a limited training set. However, when significant local variations exist, PCA typically cannot represent such variations from a small training set. To address this issue, we present a novel algorithm that learns shape variations from data at multiple scales and locations using spherical wavelets and spectral graph partitioning. Our results show that when the training set is small, our algorithm significantly improves the approximation of shapes in a testing set over PCA, which tends to oversmooth data. PMID:16685992
Multiscale 3D shape analysis using spherical wavelets.
Nain, Delphine; Haker, Steven; Bobick, Aaron; Tannenbaum, Allen R
2005-01-01
Shape priors attempt to represent biological variations within a population. When variations are global, Principal Component Analysis (PCA) can be used to learn major modes of variation, even from a limited training set. However, when significant local variations exist, PCA typically cannot represent such variations from a small training set. To address this issue, we present a novel algorithm that learns shape variations from data at multiple scales and locations using spherical wavelets and spectral graph partitioning. Our results show that when the training set is small, our algorithm significantly improves the approximation of shapes in a testing set over PCA, which tends to oversmooth data.
2012-01-01
Background PCA3 is a non-coding RNA (ncRNA) that is highly expressed in prostate cancer (PCa) cells, but its functional role is unknown. To investigate its putative function in PCa biology, we used gene expression knockdown by small interference RNA, and also analyzed its involvement in androgen receptor (AR) signaling. Methods LNCaP and PC3 cells were used as in vitro models for these functional assays, and three different siRNA sequences were specifically designed to target PCA3 exon 4. Transfected cells were analyzed by real-time qRT-PCR and cell growth, viability, and apoptosis assays. Associations between PCA3 and the androgen-receptor (AR) signaling pathway were investigated by treating LNCaP cells with 100 nM dihydrotestosterone (DHT) and with its antagonist (flutamide), and analyzing the expression of some AR-modulated genes (TMPRSS2, NDRG1, GREB1, PSA, AR, FGF8, CdK1, CdK2 and PMEPA1). PCA3 expression levels were investigated in different cell compartments by using differential centrifugation and qRT-PCR. Results LNCaP siPCA3-transfected cells significantly inhibited cell growth and viability, and increased the proportion of cells in the sub G0/G1 phase of the cell cycle and the percentage of pyknotic nuclei, compared to those transfected with scramble siRNA (siSCr)-transfected cells. DHT-treated LNCaP cells induced a significant upregulation of PCA3 expression, which was reversed by flutamide. In siPCA3/LNCaP-transfected cells, the expression of AR target genes was downregulated compared to siSCr-transfected cells. The siPCA3 transfection also counteracted DHT stimulatory effects on the AR signaling cascade, significantly downregulating expression of the AR target gene. Analysis of PCA3 expression in different cell compartments provided evidence that the main functional roles of PCA3 occur in the nuclei and microsomal cell fractions. Conclusions Our findings suggest that the ncRNA PCA3 is involved in the control of PCa cell survival, in part through modulating AR signaling, which may raise new possibilities of using PCA3 knockdown as an additional therapeutic strategy for PCa control. PMID:23130941
NASA Astrophysics Data System (ADS)
Colbo, M. H.; Cote, D.; Kendall, V.
2005-05-01
The fauna of insular Newfoundland compared to the mainland is depauperate, e.g., 35 spp of mayflies versus about 160 spp in Maine. The question is can this depauperate fauna provide good biomonitoring information? A comparative study using present /absent data from 85 sites in eastern Newfoundland was analyzed with particular reference to the Ephemeroptera, Plecoptera and Trichoptera fauna. The results indicated these data did differentiate regions sampling type and land use when run through a simple Principle Component Analysis (PCA). The PCA 1 and 2 components varied from no to a strong correlation with both Ephemeroptera and Trichoptera richness and taxa richness of the major feeding groups. However, applying previously published biotic indices of species that occur in Newfoundland did not provide satisfactory results. It appears these indices may reflect oxygen gradients rather than pollutant concentrations and in our cold well oxygenated streams yield unsatisfactory discrimination among natural and perturb streams.
A comparison between different coronagraphic data reduction techniques
NASA Astrophysics Data System (ADS)
Carolo, E.; Vassallo, D.; Farinato, J.; Bergomi, M.; Bonavita, M.; Carlotti, A.; D'Orazi, V.; Greggio, D.; Magrin, D.; Mesa, D.; Pinna, E.; Puglisi, A.; Stangalini, M.; Verinaud, C.; Viotto, V.
2016-07-01
A robust post processing technique is mandatory for analysing the coronagraphic high contrast imaging data. Angular Differential Imaging (ADI) and Principal Component Analysis (PCA) are the most used approaches to suppress the quasi-static structure presents in the Point Spread Function (PSF) for revealing planets at different separations from the host star. In this work, we present the comparison between ADI and PCA applied to System of coronagraphy with High order Adaptive optics from R to K band (SHARK-NIR), which will be implemented at Large Binocular Telescope (LBT). The comparison has been carried out by using as starting point the simulated wavefront residuals of the LBT Adaptive Optics (AO) system, in different observing conditions. Accurate tests for tuning the post processing parameters to obtain the best performance from each technique were performed in various seeing conditions (0:4"-1") for star magnitude ranging from 8 to 12, with particular care in finding the best compromise between quasi static speckle subtraction and planets detection.
Optimizing the clinical utility of PCA3 to diagnose prostate cancer in initial prostate biopsy.
Rubio-Briones, Jose; Borque, Angel; Esteban, Luis M; Casanova, Juan; Fernandez-Serra, Antonio; Rubio, Luis; Casanova-Salas, Irene; Sanz, Gerardo; Domínguez-Escrig, Jose; Collado, Argimiro; Gómez-Ferrer, Alvaro; Iborra, Inmaculada; Ramírez-Backhaus, Miguel; Martínez, Francisco; Calatrava, Ana; Lopez-Guerrero, Jose A
2015-09-11
PCA3 has been included in a nomogram outperforming previous clinical models for the prediction of any prostate cancer (PCa) and high grade PCa (HGPCa) at the initial prostate biopsy (IBx). Our objective is to validate such IBx-specific PCA3-based nomogram. We also aim to optimize the use of this nomogram in clinical practice through the definition of risk groups. Independent external validation. Clinical and biopsy data from a contemporary cohort of 401 men with the same inclusion criteria to those used to build up the reference's nomogram in IBx. The predictive value of the nomogram was assessed by means of calibration curves and discrimination ability through the area under the curve (AUC). Clinical utility of the nomogram was analyzed by choosing thresholds points that minimize the overlapping between probability density functions (PDF) in PCa and no PCa and HGPCa and no HGPCa groups, and net benefit was assessed by decision curves. We detect 28% of PCa and 11 % of HGPCa in IBx, contrasting to the 46 and 20% at the reference series. Due to this, there is an overestimation of the nomogram probabilities shown in the calibration curve for PCa. The AUC values are 0.736 for PCa (C.I.95%:0.68-0.79) and 0.786 for HGPCa (C.I.95%:0.71-0.87) showing an adequate discrimination ability. PDF show differences in the distributions of nomogram probabilities in PCa and not PCa patient groups. A minimization of the overlapping between these curves confirms the threshold probability of harboring PCa >30 % proposed by Hansen is useful to indicate a IBx, but a cut-off > 40% could be better in series of opportunistic screening like ours. Similar results appear in HGPCa analysis. The decision curve also shows a net benefit of 6.31% for the threshold probability of 40%. PCA3 is an useful tool to select patients for IBx. Patients with a calculated probability of having PCa over 40% should be counseled to undergo an IBx if opportunistic screening is required.
2L-PCA: a two-level principal component analyzer for quantitative drug design and its applications.
Du, Qi-Shi; Wang, Shu-Qing; Xie, Neng-Zhong; Wang, Qing-Yan; Huang, Ri-Bo; Chou, Kuo-Chen
2017-09-19
A two-level principal component predictor (2L-PCA) was proposed based on the principal component analysis (PCA) approach. It can be used to quantitatively analyze various compounds and peptides about their functions or potentials to become useful drugs. One level is for dealing with the physicochemical properties of drug molecules, while the other level is for dealing with their structural fragments. The predictor has the self-learning and feedback features to automatically improve its accuracy. It is anticipated that 2L-PCA will become a very useful tool for timely providing various useful clues during the process of drug development.
The burden of urinary incontinence and urinary bother among elderly prostate cancer survivors.
Kopp, Ryan P; Marshall, Lynn M; Wang, Patty Y; Bauer, Douglas C; Barrett-Connor, Elizabeth; Parsons, J Kellogg
2013-10-01
Data describing urinary health in elderly, community-dwelling prostate cancer (PCa) survivors are limited. To elucidate the prevalence of lower urinary tract symptoms, urinary bother, and incontinence in elderly PCa survivors compared with peers without PCa. A cross-sectional analysis of 5990 participants in the Osteoporotic Fractures in Men Research Group, a cohort study of community-dwelling men ≥ 65 yr. We characterized urinary health using self-reported urinary incontinence and the American Urological Association Symptom Index (AUA-SI). We compared urinary health measures according to type of PCa treatment in men with PCa and men without PCa using multivariate log-binomial regression to generate prevalence ratios (PRs). At baseline, 706 men (12%) reported a history of PCa, with a mean time since diagnosis of 6.3 yr. Of these men, 426 (60%) reported urinary incontinence. In adjusted analyses, observation (PR: 2.11; 95% confidence interval [CI], 1.22-3.65; p=0.007), surgery (PR: 4.41; 95% CI, 3.79-5.13; p<0.0001), radiation therapy (PR: 1.49; 95% CI, 1.06-2.08; p=0.02), and androgen-deprivation therapy (ADT) (PR: 2.02; 95% CI, 1.31-3.13; p=0.002) were each associated with daily incontinence. Daily incontinence risk increased with time since diagnosis independently of age. Observation (PR: 1.33; 95% CI, 1.00-1.78; p=0.05), surgery (PR: 1.25; 95% CI, 1.10-1.42; p=0.0008), and ADT (PR: 1.50; 95% CI, 1.26-1.79; p<0.0001) were associated with increased AUA-SI bother scores. Cancer stage and use of adjuvant or salvage therapies were not available for analysis. Compared with their peers without PCa, elderly PCa survivors had a two-fold to five-fold greater prevalence of urinary incontinence, which rose with increasing survivorship duration. Observation, surgery, and ADT were each associated with increased urinary bother. These data suggest a substantially greater burden of urinary health problems among elderly PCa survivors than previously recognized. Copyright © 2013 European Association of Urology. Published by Elsevier B.V. All rights reserved.
Xu, Yong; Qin, Sihua; An, Taixue; Tang, Yueting; Huang, Yiyao; Zheng, Lei
2017-07-01
Extracellular vesicles (EVs) can be detected in body fluids and may serve as disease biomarkers. Increasing evidence suggests that circulating miRNAs in serum and urine may be potential non-invasive biomarkers for prostate cancer (PCa). In the present study, we aimed to investigate whether hydrostatic filtration dialysis (HFD) is suitable for urinary EVs (UEVs) isolation and whether such reported PCa-related miRNAs can be detected in UEVs as PCa biomarkers. To analyze EVs miRNAs, we searched for an easy and economic method to enrich EVs from urine samples. We compared the efficiency of HFD method and conventional ultracentrifugation (UC) in isolating UEVs. Subsequently, UEVs were isolated from patients with PCa, patients with benign prostate hyperplasia (BPH) and healthy individuals. Differential expression of four PCa-related miRNAs (miR-572, miR-1290, miR-141, and miR-145) were measured in UEVs and paired serum EVs using SYBR Green-based quantitative reverse transcription-polymerase chain reaction (qRT-PCR). The overall performance of HFD was similar to UC. In miRNA yield, both HFD and UC can meet the needs of further analysis. The level of miR-145 in UEVs was significantly increased in patients with PCa compared with the patients with BPH (P = 0.018). In addition, significant increase was observed in miR-145 levels when patients with Gleason score ≥8 tumors compared with Gleason score ≤7 (P = 0.020). Receiver-operating characteristic curve (ROC) revealed that miR-145 in UEVs combined with serum PSA could differentiate PCa from BPH better than PSA alone (AUC 0.863 and AUC 0.805, respectively). In serum EVs, four miRNAs were significantly higher in patients with PCa than with BPH. HFD is appropriate for UEVs isolation and miRNA analysis when compared with conventional UC. miR-145 in UEVs is upregulated from PCa patients compared BPH patients and healthy controls. We suggest the potential use of UEVs miR-145 as a biomarker of PCa. © 2017 Wiley Periodicals, Inc.
A DATA-DRIVEN MODEL FOR SPECTRA: FINDING DOUBLE REDSHIFTS IN THE SLOAN DIGITAL SKY SURVEY
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tsalmantza, P.; Hogg, David W., E-mail: vivitsal@mpia.de
2012-07-10
We present a data-driven method-heteroscedastic matrix factorization, a kind of probabilistic factor analysis-for modeling or performing dimensionality reduction on observed spectra or other high-dimensional data with known but non-uniform observational uncertainties. The method uses an iterative inverse-variance-weighted least-squares minimization procedure to generate a best set of basis functions. The method is similar to principal components analysis (PCA), but with the substantial advantage that it uses measurement uncertainties in a responsible way and accounts naturally for poorly measured and missing data; it models the variance in the noise-deconvolved data space. A regularization can be applied, in the form of a smoothnessmore » prior (inspired by Gaussian processes) or a non-negative constraint, without making the method prohibitively slow. Because the method optimizes a justified scalar (related to the likelihood), the basis provides a better fit to the data in a probabilistic sense than any PCA basis. We test the method on Sloan Digital Sky Survey (SDSS) spectra, concentrating on spectra known to contain two redshift components: these are spectra of gravitational lens candidates and massive black hole binaries. We apply a hypothesis test to compare one-redshift and two-redshift models for these spectra, utilizing the data-driven model trained on a random subset of all SDSS spectra. This test confirms 129 of the 131 lens candidates in our sample and all of the known binary candidates, and turns up very few false positives.« less
Epileptic seizure detection in EEG signal with GModPCA and support vector machine.
Jaiswal, Abeg Kumar; Banka, Haider
2017-01-01
Epilepsy is one of the most common neurological disorders caused by recurrent seizures. Electroencephalograms (EEGs) record neural activity and can detect epilepsy. Visual inspection of an EEG signal for epileptic seizure detection is a time-consuming process and may lead to human error; therefore, recently, a number of automated seizure detection frameworks were proposed to replace these traditional methods. Feature extraction and classification are two important steps in these procedures. Feature extraction focuses on finding the informative features that could be used for classification and correct decision-making. Therefore, proposing effective feature extraction techniques for seizure detection is of great significance. Principal Component Analysis (PCA) is a dimensionality reduction technique used in different fields of pattern recognition including EEG signal classification. Global modular PCA (GModPCA) is a variation of PCA. In this paper, an effective framework with GModPCA and Support Vector Machine (SVM) is presented for epileptic seizure detection in EEG signals. The feature extraction is performed with GModPCA, whereas SVM trained with radial basis function kernel performed the classification between seizure and nonseizure EEG signals. Seven different experimental cases were conducted on the benchmark epilepsy EEG dataset. The system performance was evaluated using 10-fold cross-validation. In addition, we prove analytically that GModPCA has less time and space complexities as compared to PCA. The experimental results show that EEG signals have strong inter-sub-pattern correlations. GModPCA and SVM have been able to achieve 100% accuracy for the classification between normal and epileptic signals. Along with this, seven different experimental cases were tested. The classification results of the proposed approach were better than were compared the results of some of the existing methods proposed in literature. It is also found that the time and space complexities of GModPCA are less as compared to PCA. This study suggests that GModPCA and SVM could be used for automated epileptic seizure detection in EEG signal.
Li, Jian; Ma, Guowei; Ma, Lin; Bao, Xiaolin; Li, Liping; Zhao, Qian
2018-01-01
Effects of 1-methylcyclopropene (1-MCP) and vacuum precooling on quality and antioxidant properties of blackberries (Rubus spp.) were evaluated using one-way analysis of variance, principal component analysis (PCA), partial least squares (PLS), and path analysis. Results showed that the activities of antioxidant enzymes were enhanced by both 1-MCP treatment and vacuum precooling. PCA could discriminate 1-MCP treated fruit and the vacuum precooled fruit and showed that the radical-scavenging activities in vacuum precooled fruit were higher than those in 1-MCP treated fruit. The scores of PCA showed that H2O2 content was the most important variables of blackberry fruit. PLSR results showed that peroxidase (POD) activity negatively correlated with H2O2 content. The results of path coefficient analysis indicated that glutathione (GSH) also had an indirect effect on H2O2 content. PMID:29487622
Ghosh, Debasree; Chattopadhyay, Parimal
2012-06-01
The objective of the work was to use the method of quantitative descriptive analysis (QDA) to describe the sensory attributes of the fermented food products prepared with the incorporation of lactic cultures. Panellists were selected and trained to evaluate various attributes specially color and appearance, body texture, flavor, overall acceptability and acidity of the fermented food products like cow milk curd and soymilk curd, idli, sauerkraut and probiotic ice cream. Principal component analysis (PCA) identified the six significant principal components that accounted for more than 90% of the variance in the sensory attribute data. Overall product quality was modelled as a function of principal components using multiple least squares regression (R (2) = 0.8). The result from PCA was statistically analyzed by analysis of variance (ANOVA). These findings demonstrate the utility of quantitative descriptive analysis for identifying and measuring the fermented food product attributes that are important for consumer acceptability.
Fixed Eigenvector Analysis of Thermographic NDE Data
NASA Technical Reports Server (NTRS)
Cramer, K. Elliott; Winfree, William P.
2011-01-01
Principal Component Analysis (PCA) has been shown effective for reducing thermographic NDE data. This paper will discuss an alternative method of analysis that has been developed where a predetermined set of eigenvectors is used to process the thermal data from both reinforced carbon-carbon (RCC) and graphiteepoxy honeycomb materials. These eigenvectors can be generated either from an analytic model of the thermal response of the material system under examination, or from a large set of experimental data. This paper provides the details of the analytic model, an overview of the PCA process, as well as a quantitative signal-to-noise comparison of the results of performing both conventional PCA and fixed eigenvector analysis on thermographic data from two specimens, one Reinforced Carbon-Carbon with flat bottom holes and the second a sandwich construction with graphite-epoxy face sheets and aluminum honeycomb core.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lu, Bo, E-mail: luboufl@gmail.com; Park, Justin C.; Fan, Qiyong
Purpose: Accurately localizing lung tumor localization is essential for high-precision radiation therapy techniques such as stereotactic body radiation therapy (SBRT). Since direct monitoring of tumor motion is not always achievable due to the limitation of imaging modalities for treatment guidance, placement of fiducial markers on the patient’s body surface to act as a surrogate for tumor position prediction is a practical alternative for tracking lung tumor motion during SBRT treatments. In this work, the authors propose an innovative and robust model to solve the multimarker position optimization problem. The model is able to overcome the major drawbacks of the sparsemore » optimization approach (SOA) model. Methods: The principle-component-analysis (PCA) method was employed as the framework to build the authors’ statistical prediction model. The method can be divided into two stages. The first stage is to build the surrogate tumor matrix and calculate its eigenvalues and associated eigenvectors. The second stage is to determine the “best represented” columns of the eigenvector matrix obtained from stage one and subsequently acquire the optimal marker positions as well as numbers. Using 4-dimensional CT (4DCT) and breath hold CT imaging data, the PCA method was compared to the SOA method with respect to calculation time, average prediction accuracy, prediction stability, noise resistance, marker position consistency, and marker distribution. Results: The PCA and SOA methods which were both tested were on all 11 patients for a total of 130 cases including 4DCT and breath-hold CT scenarios. The maximum calculation time for the PCA method was less than 1 s with 64 752 surface points, whereas the average calculation time for the SOA method was over 12 min with 400 surface points. Overall, the tumor center position prediction errors were comparable between the two methods, and all were less than 1.5 mm. However, for the extreme scenarios (breath hold), the prediction errors for the PCA method were not only smaller, but were also more stable than for the SOA method. Results obtained by imposing a series of random noises to the surrogates indicated that the PCA method was much more noise resistant than the SOA method. The marker position consistency tests using various combinations of 4DCT phases to construct the surrogates suggested that the marker position predictions of the PCA method were more consistent than those of the SOA method, in spite of surrogate construction. Marker distribution tests indicated that greater than 80% of the calculated marker positions fell into the high cross correlation and high motion magnitude regions for both of the algorithms. Conclusions: The PCA model is an accurate, efficient, robust, and practical model for solving the multimarker position optimization problem to predict lung tumor motion during SBRT treatments. Due to its generality, PCA model can also be applied to other imaging guidance system whichever using surface motion as the surrogates.« less
A Mass Spectrometric Analysis Method Based on PPCA and SVM for Early Detection of Ovarian Cancer.
Wu, Jiang; Ji, Yanju; Zhao, Ling; Ji, Mengying; Ye, Zhuang; Li, Suyi
2016-01-01
Background. Surfaced-enhanced laser desorption-ionization-time of flight mass spectrometry (SELDI-TOF-MS) technology plays an important role in the early diagnosis of ovarian cancer. However, the raw MS data is highly dimensional and redundant. Therefore, it is necessary to study rapid and accurate detection methods from the massive MS data. Methods. The clinical data set used in the experiments for early cancer detection consisted of 216 SELDI-TOF-MS samples. An MS analysis method based on probabilistic principal components analysis (PPCA) and support vector machine (SVM) was proposed and applied to the ovarian cancer early classification in the data set. Additionally, by the same data set, we also established a traditional PCA-SVM model. Finally we compared the two models in detection accuracy, specificity, and sensitivity. Results. Using independent training and testing experiments 10 times to evaluate the ovarian cancer detection models, the average prediction accuracy, sensitivity, and specificity of the PCA-SVM model were 83.34%, 82.70%, and 83.88%, respectively. In contrast, those of the PPCA-SVM model were 90.80%, 92.98%, and 88.97%, respectively. Conclusions. The PPCA-SVM model had better detection performance. And the model combined with the SELDI-TOF-MS technology had a prospect in early clinical detection and diagnosis of ovarian cancer.
Progress Towards Improved Analysis of TES X-ray Data Using Principal Component Analysis
NASA Technical Reports Server (NTRS)
Busch, S. E.; Adams, J. S.; Bandler, S. R.; Chervenak, J. A.; Eckart, M. E.; Finkbeiner, F. M.; Fixsen, D. J.; Kelley, R. L.; Kilbourne, C. A.; Lee, S.-J.;
2015-01-01
The traditional method of applying a digital optimal filter to measure X-ray pulses from transition-edge sensor (TES) devices does not achieve the best energy resolution when the signals have a highly non-linear response to energy, or the noise is non-stationary during the pulse. We present an implementation of a method to analyze X-ray data from TESs, which is based upon principal component analysis (PCA). Our method separates the X-ray signal pulse into orthogonal components that have the largest variance. We typically recover pulse height, arrival time, differences in pulse shape, and the variation of pulse height with detector temperature. These components can then be combined to form a representation of pulse energy. An added value of this method is that by reporting information on more descriptive parameters (as opposed to a single number representing energy), we generate a much more complete picture of the pulse received. Here we report on progress in developing this technique for future implementation on X-ray telescopes. We used an 55Fe source to characterize Mo/Au TESs. On the same dataset, the PCA method recovers a spectral resolution that is better by a factor of two than achievable with digital optimal filters.
Auer, Matthias K.; Cecil, Alexander; Roepke, Yasmin; Bultynck, Charlotte; Pas, Charlotte; Fuss, Johannes; Prehn, Cornelia; Wang-Sattler, Rui; Adamski, Jerzy; Stalla, Günter K.; T’Sjoen, Guy
2016-01-01
Metabolomic analyses in epidemiological studies have demonstrated a strong sexual dimorphism for most metabolites. Cross-sex hormone treatment (CSH) in transgender individuals enables the study of metabolites in a cross-gender setting. Targeted metabolomic profiling of serum of fasting transmen and transwomen at baseline and following 12 months of CSH (N = 20/group) was performed. Changes in 186 serum metabolites and metabolite ratios were determined by targeted metabolomics analysis based on ESI-LC-MS/MS. RandomForest (RF) analysis was applied to detect metabolites of highest interest for grouping of transwomen and transmen before and after initiation of CSH. Principal component analysis (PCA) was performed to check whether group differentiation was achievable according to these variables and to see if changes in metabolite levels could be explained by a priori gender differences. PCA predicted grouping of individuals-determined by the citrulline/arginine-ratio and the amino acids lysine, alanine and asymmetric dimethylarginine - in addition to the expected grouping due to changes in sex steroids and body composition. The fact that most of the investigated metabolites did, however, not change, indicates that the majority of sex dependent differences in metabolites reported in the literature before may primarily not be attributable to sex hormones but to other gender-differences. PMID:27833161
Dosimetric treatment course simulation based on a statistical model of deformable organ motion
NASA Astrophysics Data System (ADS)
Söhn, M.; Sobotta, B.; Alber, M.
2012-06-01
We present a method of modeling dosimetric consequences of organ deformation and correlated motion of adjacent organ structures in radiotherapy. Based on a few organ geometry samples and the respective deformation fields as determined by deformable registration, principal component analysis (PCA) is used to create a low-dimensional parametric statistical organ deformation model (Söhn et al 2005 Phys. Med. Biol. 50 5893-908). PCA determines the most important geometric variability in terms of eigenmodes, which represent 3D vector fields of correlated organ deformations around the mean geometry. Weighted sums of a few dominating eigenmodes can be used to simulate synthetic geometries, which are statistically meaningful inter- and extrapolations of the input geometries, and predict their probability of occurrence. We present the use of PCA as a versatile treatment simulation tool, which allows comprehensive dosimetric assessment of the detrimental effects that deformable geometric uncertainties can have on a planned dose distribution. For this, a set of random synthetic geometries is generated by a PCA model for each simulated treatment course, and the dose of a given treatment plan is accumulated in the moving tissue elements via dose warping. This enables the calculation of average voxel doses, local dose variability, dose-volume histogram uncertainties, marginal as well as joint probability distributions of organ equivalent uniform doses and thus of TCP and NTCP, and other dosimetric and biologic endpoints. The method is applied to the example of deformable motion of prostate/bladder/rectum in prostate IMRT. Applications include dosimetric assessment of the adequacy of margin recipes, adaptation schemes, etc, as well as prospective ‘virtual’ evaluation of the possible benefits of new radiotherapy schemes.
Comparison of receptor models for source apportionment of the PM10 in Zaragoza (Spain).
Callén, M S; de la Cruz, M T; López, J M; Navarro, M V; Mastral, A M
2009-08-01
Receptor models are useful to understand the chemical and physical characteristics of air pollutants by identifying their sources and by estimating contributions of each source to receptor concentrations. In this work, three receptor models based on principal component analysis with absolute principal component scores (PCA-APCS), Unmix and positive matrix factorization (PMF) were applied to study for the first time the apportionment of the airborne particulate matter less or equal than 10microm (PM10) in Zaragoza, Spain, during 1year sampling campaign (2003-2004). The PM10 samples were characterized regarding their concentrations in inorganic components: trace elements and ions and also organic components: polycyclic aromatic hydrocarbons (PAH) not only in the solid phase but also in the gas phase. A comparison of the three receptor models was carried out in order to do a more robust characterization of the PM10. The three models predicted that the major sources of PM10 in Zaragoza were related to natural sources (60%, 75% and 47%, respectively, for PCA-APCS, Unmix and PMF) although anthropogenic sources also contributed to PM10 (28%, 25% and 39%). With regard to the anthropogenic sources, while PCA and PMF allowed high discrimination in the sources identification associated with different combustion sources such as traffic and industry, fossil fuel, biomass and fuel-oil combustion, heavy traffic and evaporative emissions, the Unmix model only allowed the identification of industry and traffic emissions, evaporative emissions and heavy-duty vehicles. The three models provided good correlations between the experimental and modelled PM10 concentrations with major precision and the closest agreement between the PMF and PCA models.