Sample records for analysis pca method

  1. Analysis of the principal component algorithm in phase-shifting interferometry.

    PubMed

    Vargas, J; Quiroga, J Antonio; Belenguer, T

    2011-06-15

    We recently presented a new asynchronous demodulation method for phase-sampling interferometry. The method is based in the principal component analysis (PCA) technique. In the former work, the PCA method was derived heuristically. In this work, we present an in-depth analysis of the PCA demodulation method.

  2. Exploring patterns enriched in a dataset with contrastive principal component analysis.

    PubMed

    Abid, Abubakar; Zhang, Martin J; Bagaria, Vivek K; Zou, James

    2018-05-30

    Visualization and exploration of high-dimensional data is a ubiquitous challenge across disciplines. Widely used techniques such as principal component analysis (PCA) aim to identify dominant trends in one dataset. However, in many settings we have datasets collected under different conditions, e.g., a treatment and a control experiment, and we are interested in visualizing and exploring patterns that are specific to one dataset. This paper proposes a method, contrastive principal component analysis (cPCA), which identifies low-dimensional structures that are enriched in a dataset relative to comparison data. In a wide variety of experiments, we demonstrate that cPCA with a background dataset enables us to visualize dataset-specific patterns missed by PCA and other standard methods. We further provide a geometric interpretation of cPCA and strong mathematical guarantees. An implementation of cPCA is publicly available, and can be used for exploratory data analysis in many applications where PCA is currently used.

  3. GO-PCA: An Unsupervised Method to Explore Gene Expression Data Using Prior Knowledge

    PubMed Central

    Wagner, Florian

    2015-01-01

    Method Genome-wide expression profiling is a widely used approach for characterizing heterogeneous populations of cells, tissues, biopsies, or other biological specimen. The exploratory analysis of such data typically relies on generic unsupervised methods, e.g. principal component analysis (PCA) or hierarchical clustering. However, generic methods fail to exploit prior knowledge about the molecular functions of genes. Here, I introduce GO-PCA, an unsupervised method that combines PCA with nonparametric GO enrichment analysis, in order to systematically search for sets of genes that are both strongly correlated and closely functionally related. These gene sets are then used to automatically generate expression signatures with functional labels, which collectively aim to provide a readily interpretable representation of biologically relevant similarities and differences. The robustness of the results obtained can be assessed by bootstrapping. Results I first applied GO-PCA to datasets containing diverse hematopoietic cell types from human and mouse, respectively. In both cases, GO-PCA generated a small number of signatures that represented the majority of lineages present, and whose labels reflected their respective biological characteristics. I then applied GO-PCA to human glioblastoma (GBM) data, and recovered signatures associated with four out of five previously defined GBM subtypes. My results demonstrate that GO-PCA is a powerful and versatile exploratory method that reduces an expression matrix containing thousands of genes to a much smaller set of interpretable signatures. In this way, GO-PCA aims to facilitate hypothesis generation, design of further analyses, and functional comparisons across datasets. PMID:26575370

  4. Strain Transient Detection Techniques: A Comparison of Source Parameter Inversions of Signals Isolated through Principal Component Analysis (PCA), Non-Linear PCA, and Rotated PCA

    NASA Astrophysics Data System (ADS)

    Lipovsky, B.; Funning, G. J.

    2009-12-01

    We compare several techniques for the analysis of geodetic time series with the ultimate aim to characterize the physical processes which are represented therein. We compare three methods for the analysis of these data: Principal Component Analysis (PCA), Non-Linear PCA (NLPCA), and Rotated PCA (RPCA). We evaluate each method by its ability to isolate signals which may be any combination of low amplitude (near noise level), temporally transient, unaccompanied by seismic emissions, and small scale with respect to the spatial domain. PCA is a powerful tool for extracting structure from large datasets which is traditionally realized through either the solution of an eigenvalue problem or through iterative methods. PCA is an transformation of the coordinate system of our data such that the new "principal" data axes retain maximal variance and minimal reconstruction error (Pearson, 1901; Hotelling, 1933). RPCA is achieved by an orthogonal transformation of the principal axes determined in PCA. In the analysis of meteorological data sets, RPCA has been seen to overcome domain shape dependencies, correct for sampling errors, and to determine principal axes which more closely represent physical processes (e.g., Richman, 1986). NLPCA generalizes PCA such that principal axes are replaced by principal curves (e.g., Hsieh 2004). We achieve NLPCA through an auto-associative feed-forward neural network (Scholz, 2005). We show the geophysical relevance of these techniques by application of each to a synthetic data set. Results are compared by inverting principal axes to determine deformation source parameters. Temporal variability in source parameters, estimated by each method, are also compared.

  5. GO-PCA: An Unsupervised Method to Explore Gene Expression Data Using Prior Knowledge.

    PubMed

    Wagner, Florian

    2015-01-01

    Genome-wide expression profiling is a widely used approach for characterizing heterogeneous populations of cells, tissues, biopsies, or other biological specimen. The exploratory analysis of such data typically relies on generic unsupervised methods, e.g. principal component analysis (PCA) or hierarchical clustering. However, generic methods fail to exploit prior knowledge about the molecular functions of genes. Here, I introduce GO-PCA, an unsupervised method that combines PCA with nonparametric GO enrichment analysis, in order to systematically search for sets of genes that are both strongly correlated and closely functionally related. These gene sets are then used to automatically generate expression signatures with functional labels, which collectively aim to provide a readily interpretable representation of biologically relevant similarities and differences. The robustness of the results obtained can be assessed by bootstrapping. I first applied GO-PCA to datasets containing diverse hematopoietic cell types from human and mouse, respectively. In both cases, GO-PCA generated a small number of signatures that represented the majority of lineages present, and whose labels reflected their respective biological characteristics. I then applied GO-PCA to human glioblastoma (GBM) data, and recovered signatures associated with four out of five previously defined GBM subtypes. My results demonstrate that GO-PCA is a powerful and versatile exploratory method that reduces an expression matrix containing thousands of genes to a much smaller set of interpretable signatures. In this way, GO-PCA aims to facilitate hypothesis generation, design of further analyses, and functional comparisons across datasets.

  6. Incorporating biological information in sparse principal component analysis with application to genomic data.

    PubMed

    Li, Ziyi; Safo, Sandra E; Long, Qi

    2017-07-11

    Sparse principal component analysis (PCA) is a popular tool for dimensionality reduction, pattern recognition, and visualization of high dimensional data. It has been recognized that complex biological mechanisms occur through concerted relationships of multiple genes working in networks that are often represented by graphs. Recent work has shown that incorporating such biological information improves feature selection and prediction performance in regression analysis, but there has been limited work on extending this approach to PCA. In this article, we propose two new sparse PCA methods called Fused and Grouped sparse PCA that enable incorporation of prior biological information in variable selection. Our simulation studies suggest that, compared to existing sparse PCA methods, the proposed methods achieve higher sensitivity and specificity when the graph structure is correctly specified, and are fairly robust to misspecified graph structures. Application to a glioblastoma gene expression dataset identified pathways that are suggested in the literature to be related with glioblastoma. The proposed sparse PCA methods Fused and Grouped sparse PCA can effectively incorporate prior biological information in variable selection, leading to improved feature selection and more interpretable principal component loadings and potentially providing insights on molecular underpinnings of complex diseases.

  7. [Analyzing and modeling methods of near infrared spectroscopy for in-situ prediction of oil yield from oil shale].

    PubMed

    Liu, Jie; Zhang, Fu-Dong; Teng, Fei; Li, Jun; Wang, Zhi-Hong

    2014-10-01

    In order to in-situ detect the oil yield of oil shale, based on portable near infrared spectroscopy analytical technology, with 66 rock core samples from No. 2 well drilling of Fuyu oil shale base in Jilin, the modeling and analyzing methods for in-situ detection were researched. By the developed portable spectrometer, 3 data formats (reflectance, absorbance and K-M function) spectra were acquired. With 4 different modeling data optimization methods: principal component-mahalanobis distance (PCA-MD) for eliminating abnormal samples, uninformative variables elimination (UVE) for wavelength selection and their combina- tions: PCA-MD + UVE and UVE + PCA-MD, 2 modeling methods: partial least square (PLS) and back propagation artificial neural network (BPANN), and the same data pre-processing, the modeling and analyzing experiment were performed to determine the optimum analysis model and method. The results show that the data format, modeling data optimization method and modeling method all affect the analysis precision of model. Results show that whether or not using the optimization method, reflectance or K-M function is the proper spectrum format of the modeling database for two modeling methods. Using two different modeling methods and four different data optimization methods, the model precisions of the same modeling database are different. For PLS modeling method, the PCA-MD and UVE + PCA-MD data optimization methods can improve the modeling precision of database using K-M function spectrum data format. For BPANN modeling method, UVE, UVE + PCA-MD and PCA- MD + UVE data optimization methods can improve the modeling precision of database using any of the 3 spectrum data formats. In addition to using the reflectance spectra and PCA-MD data optimization method, modeling precision by BPANN method is better than that by PLS method. And modeling with reflectance spectra, UVE optimization method and BPANN modeling method, the model gets the highest analysis precision, its correlation coefficient (Rp) is 0.92, and its standard error of prediction (SEP) is 0.69%.

  8. Applying robust variant of Principal Component Analysis as a damage detector in the presence of outliers

    NASA Astrophysics Data System (ADS)

    Gharibnezhad, Fahit; Mujica, Luis E.; Rodellar, José

    2015-01-01

    Using Principal Component Analysis (PCA) for Structural Health Monitoring (SHM) has received considerable attention over the past few years. PCA has been used not only as a direct method to identify, classify and localize damages but also as a significant primary step for other methods. Despite several positive specifications that PCA conveys, it is very sensitive to outliers. Outliers are anomalous observations that can affect the variance and the covariance as vital parts of PCA method. Therefore, the results based on PCA in the presence of outliers are not fully satisfactory. As a main contribution, this work suggests the use of robust variant of PCA not sensitive to outliers, as an effective way to deal with this problem in SHM field. In addition, the robust PCA is compared with the classical PCA in the sense of detecting probable damages. The comparison between the results shows that robust PCA can distinguish the damages much better than using classical one, and even in many cases allows the detection where classic PCA is not able to discern between damaged and non-damaged structures. Moreover, different types of robust PCA are compared with each other as well as with classical counterpart in the term of damage detection. All the results are obtained through experiments with an aircraft turbine blade using piezoelectric transducers as sensors and actuators and adding simulated damages.

  9. Common factor analysis versus principal component analysis: choice for symptom cluster research.

    PubMed

    Kim, Hee-Ju

    2008-03-01

    The purpose of this paper is to examine differences between two factor analytical methods and their relevance for symptom cluster research: common factor analysis (CFA) versus principal component analysis (PCA). Literature was critically reviewed to elucidate the differences between CFA and PCA. A secondary analysis (N = 84) was utilized to show the actual result differences from the two methods. CFA analyzes only the reliable common variance of data, while PCA analyzes all the variance of data. An underlying hypothetical process or construct is involved in CFA but not in PCA. PCA tends to increase factor loadings especially in a study with a small number of variables and/or low estimated communality. Thus, PCA is not appropriate for examining the structure of data. If the study purpose is to explain correlations among variables and to examine the structure of the data (this is usual for most cases in symptom cluster research), CFA provides a more accurate result. If the purpose of a study is to summarize data with a smaller number of variables, PCA is the choice. PCA can also be used as an initial step in CFA because it provides information regarding the maximum number and nature of factors. In using factor analysis for symptom cluster research, several issues need to be considered, including subjectivity of solution, sample size, symptom selection, and level of measure.

  10. Identification of the isomers using principal component analysis (PCA) method

    NASA Astrophysics Data System (ADS)

    Kepceoǧlu, Abdullah; Gündoǧdu, Yasemin; Ledingham, Kenneth William David; Kilic, Hamdi Sukur

    2016-03-01

    In this work, we have carried out a detailed statistical analysis for experimental data of mass spectra from xylene isomers. Principle Component Analysis (PCA) was used to identify the isomers which cannot be distinguished using conventional statistical methods for interpretation of their mass spectra. Experiments have been carried out using a linear TOF-MS coupled to a femtosecond laser system as an energy source for the ionisation processes. We have performed experiments and collected data which has been analysed and interpreted using PCA as a multivariate analysis of these spectra. This demonstrates the strength of the method to get an insight for distinguishing the isomers which cannot be identified using conventional mass analysis obtained through dissociative ionisation processes on these molecules. The PCA results dependending on the laser pulse energy and the background pressure in the spectrometers have been presented in this work.

  11. Decision tree and PCA-based fault diagnosis of rotating machinery

    NASA Astrophysics Data System (ADS)

    Sun, Weixiang; Chen, Jin; Li, Jiaqing

    2007-04-01

    After analysing the flaws of conventional fault diagnosis methods, data mining technology is introduced to fault diagnosis field, and a new method based on C4.5 decision tree and principal component analysis (PCA) is proposed. In this method, PCA is used to reduce features after data collection, preprocessing and feature extraction. Then, C4.5 is trained by using the samples to generate a decision tree model with diagnosis knowledge. At last the tree model is used to make diagnosis analysis. To validate the method proposed, six kinds of running states (normal or without any defect, unbalance, rotor radial rub, oil whirl, shaft crack and a simultaneous state of unbalance and radial rub), are simulated on Bently Rotor Kit RK4 to test C4.5 and PCA-based method and back-propagation neural network (BPNN). The result shows that C4.5 and PCA-based diagnosis method has higher accuracy and needs less training time than BPNN.

  12. Nonlinear Principal Components Analysis: Introduction and Application

    ERIC Educational Resources Information Center

    Linting, Marielle; Meulman, Jacqueline J.; Groenen, Patrick J. F.; van der Koojj, Anita J.

    2007-01-01

    The authors provide a didactic treatment of nonlinear (categorical) principal components analysis (PCA). This method is the nonlinear equivalent of standard PCA and reduces the observed variables to a number of uncorrelated principal components. The most important advantages of nonlinear over linear PCA are that it incorporates nominal and ordinal…

  13. Comparison of water extraction methods in Tibet based on GF-1 data

    NASA Astrophysics Data System (ADS)

    Jia, Lingjun; Shang, Kun; Liu, Jing; Sun, Zhongqing

    2018-03-01

    In this study, we compared four different water extraction methods with GF-1 data according to different water types in Tibet, including Support Vector Machine (SVM), Principal Component Analysis (PCA), Decision Tree Classifier based on False Normalized Difference Water Index (FNDWI-DTC), and PCA-SVM. The results show that all of the four methods can extract large area water body, but only SVM and PCA-SVM can obtain satisfying extraction results for small size water body. The methods were evaluated by both overall accuracy (OAA) and Kappa coefficient (KC). The OAA of PCA-SVM, SVM, FNDWI-DTC, PCA are 96.68%, 94.23%, 93.99%, 93.01%, and the KCs are 0.9308, 0.8995, 0.8962, 0.8842, respectively, in consistent with visual inspection. In summary, SVM is better for narrow rivers extraction and PCA-SVM is suitable for water extraction of various types. As for dark blue lakes, the methods using PCA can extract more quickly and accurately.

  14. Model Reduction via Principe Component Analysis and Markov Chain Monte Carlo (MCMC) Methods

    NASA Astrophysics Data System (ADS)

    Gong, R.; Chen, J.; Hoversten, M. G.; Luo, J.

    2011-12-01

    Geophysical and hydrogeological inverse problems often include a large number of unknown parameters, ranging from hundreds to millions, depending on parameterization and problems undertaking. This makes inverse estimation and uncertainty quantification very challenging, especially for those problems in two- or three-dimensional spatial domains. Model reduction technique has the potential of mitigating the curse of dimensionality by reducing total numbers of unknowns while describing the complex subsurface systems adequately. In this study, we explore the use of principal component analysis (PCA) and Markov chain Monte Carlo (MCMC) sampling methods for model reduction through the use of synthetic datasets. We compare the performances of three different but closely related model reduction approaches: (1) PCA methods with geometric sampling (referred to as 'Method 1'), (2) PCA methods with MCMC sampling (referred to as 'Method 2'), and (3) PCA methods with MCMC sampling and inclusion of random effects (referred to as 'Method 3'). We consider a simple convolution model with five unknown parameters as our goal is to understand and visualize the advantages and disadvantages of each method by comparing their inversion results with the corresponding analytical solutions. We generated synthetic data with noise added and invert them under two different situations: (1) the noised data and the covariance matrix for PCA analysis are consistent (referred to as the unbiased case), and (2) the noise data and the covariance matrix are inconsistent (referred to as biased case). In the unbiased case, comparison between the analytical solutions and the inversion results show that all three methods provide good estimates of the true values and Method 1 is computationally more efficient. In terms of uncertainty quantification, Method 1 performs poorly because of relatively small number of samples obtained, Method 2 performs best, and Method 3 overestimates uncertainty due to inclusion of random effects. However, in the biased case, only Method 3 correctly estimates all the unknown parameters, and both Methods 1 and 2 provide wrong values for the biased parameters. The synthetic case study demonstrates that if the covariance matrix for PCA analysis is inconsistent with true models, the PCA methods with geometric or MCMC sampling will provide incorrect estimates.

  15. Priority of VHS Development Based in Potential Area using Principal Component Analysis

    NASA Astrophysics Data System (ADS)

    Meirawan, D.; Ana, A.; Saripudin, S.

    2018-02-01

    The current condition of VHS is still inadequate in quality, quantity and relevance. The purpose of this research is to analyse the development of VHS based on the development of regional potential by using principal component analysis (PCA) in Bandung, Indonesia. This study used descriptive qualitative data analysis using the principle of secondary data reduction component. The method used is Principal Component Analysis (PCA) analysis with Minitab Statistics Software tool. The results of this study indicate the value of the lowest requirement is a priority of the construction of development VHS with a program of majors in accordance with the development of regional potential. Based on the PCA score found that the main priority in the development of VHS in Bandung is in Saguling, which has the lowest PCA value of 416.92 in area 1, Cihampelas with the lowest PCA value in region 2 and Padalarang with the lowest PCA value.

  16. Two worlds collide: Image analysis methods for quantifying structural variation in cluster molecular dynamics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Steenbergen, K. G., E-mail: kgsteen@gmail.com; Gaston, N.

    2014-02-14

    Inspired by methods of remote sensing image analysis, we analyze structural variation in cluster molecular dynamics (MD) simulations through a unique application of the principal component analysis (PCA) and Pearson Correlation Coefficient (PCC). The PCA analysis characterizes the geometric shape of the cluster structure at each time step, yielding a detailed and quantitative measure of structural stability and variation at finite temperature. Our PCC analysis captures bond structure variation in MD, which can be used to both supplement the PCA analysis as well as compare bond patterns between different cluster sizes. Relying only on atomic position data, without requirement formore » a priori structural input, PCA and PCC can be used to analyze both classical and ab initio MD simulations for any cluster composition or electronic configuration. Taken together, these statistical tools represent powerful new techniques for quantitative structural characterization and isomer identification in cluster MD.« less

  17. Two worlds collide: image analysis methods for quantifying structural variation in cluster molecular dynamics.

    PubMed

    Steenbergen, K G; Gaston, N

    2014-02-14

    Inspired by methods of remote sensing image analysis, we analyze structural variation in cluster molecular dynamics (MD) simulations through a unique application of the principal component analysis (PCA) and Pearson Correlation Coefficient (PCC). The PCA analysis characterizes the geometric shape of the cluster structure at each time step, yielding a detailed and quantitative measure of structural stability and variation at finite temperature. Our PCC analysis captures bond structure variation in MD, which can be used to both supplement the PCA analysis as well as compare bond patterns between different cluster sizes. Relying only on atomic position data, without requirement for a priori structural input, PCA and PCC can be used to analyze both classical and ab initio MD simulations for any cluster composition or electronic configuration. Taken together, these statistical tools represent powerful new techniques for quantitative structural characterization and isomer identification in cluster MD.

  18. Recruitment Methods and Show Rates to a Prostate Cancer Early Detection Program for High-Risk Men: A Comprehensive Analysis

    PubMed Central

    Giri, Veda N.; Coups, Elliot J.; Ruth, Karen; Goplerud, Julia; Raysor, Susan; Kim, Taylor Y.; Bagden, Loretta; Mastalski, Kathleen; Zakrzewski, Debra; Leimkuhler, Suzanne; Watkins-Bruner, Deborah

    2009-01-01

    Purpose Men with a family history (FH) of prostate cancer (PCA) and African American (AA) men are at higher risk for PCA. Recruitment and retention of these high-risk men into early detection programs has been challenging. We report a comprehensive analysis on recruitment methods, show rates, and participant factors from the Prostate Cancer Risk Assessment Program (PRAP), which is a prospective, longitudinal PCA screening study. Materials and Methods Men 35–69 years are eligible if they have a FH of PCA, are AA, or have a BRCA1/2 mutation. Recruitment methods were analyzed with respect to participant demographics and show to the first PRAP appointment using standard statistical methods Results Out of 707 men recruited, 64.9% showed to the initial PRAP appointment. More individuals were recruited via radio than from referral or other methods (χ2 = 298.13, p < .0001). Men recruited via radio were more likely to be AA (p<0.001), less educated (p=0.003), not married or partnered (p=0.007), and have no FH of PCA (p<0.001). Men recruited via referrals had higher incomes (p=0.007). Men recruited via referral were more likely to attend their initial PRAP visit than those recruited by radio or other methods (χ2 = 27.08, p < .0001). Conclusions This comprehensive analysis finds that radio leads to higher recruitment of AA men with lower socioeconomic status. However, these are the high-risk men that have lower show rates for PCA screening. Targeted motivational measures need to be studied to improve show rates for PCA risk assessment for these high-risk men. PMID:19758657

  19. Gabor-based kernel PCA with fractional power polynomial models for face recognition.

    PubMed

    Liu, Chengjun

    2004-05-01

    This paper presents a novel Gabor-based kernel Principal Component Analysis (PCA) method by integrating the Gabor wavelet representation of face images and the kernel PCA method for face recognition. Gabor wavelets first derive desirable facial features characterized by spatial frequency, spatial locality, and orientation selectivity to cope with the variations due to illumination and facial expression changes. The kernel PCA method is then extended to include fractional power polynomial models for enhanced face recognition performance. A fractional power polynomial, however, does not necessarily define a kernel function, as it might not define a positive semidefinite Gram matrix. Note that the sigmoid kernels, one of the three classes of widely used kernel functions (polynomial kernels, Gaussian kernels, and sigmoid kernels), do not actually define a positive semidefinite Gram matrix either. Nevertheless, the sigmoid kernels have been successfully used in practice, such as in building support vector machines. In order to derive real kernel PCA features, we apply only those kernel PCA eigenvectors that are associated with positive eigenvalues. The feasibility of the Gabor-based kernel PCA method with fractional power polynomial models has been successfully tested on both frontal and pose-angled face recognition, using two data sets from the FERET database and the CMU PIE database, respectively. The FERET data set contains 600 frontal face images of 200 subjects, while the PIE data set consists of 680 images across five poses (left and right profiles, left and right half profiles, and frontal view) with two different facial expressions (neutral and smiling) of 68 subjects. The effectiveness of the Gabor-based kernel PCA method with fractional power polynomial models is shown in terms of both absolute performance indices and comparative performance against the PCA method, the kernel PCA method with polynomial kernels, the kernel PCA method with fractional power polynomial models, the Gabor wavelet-based PCA method, and the Gabor wavelet-based kernel PCA method with polynomial kernels.

  20. Spectral discrimination of serum from liver cancer and liver cirrhosis using Raman spectroscopy

    NASA Astrophysics Data System (ADS)

    Yang, Tianyue; Li, Xiaozhou; Yu, Ting; Sun, Ruomin; Li, Siqi

    2011-07-01

    In this paper, Raman spectra of human serum were measured using Raman spectroscopy, then the spectra was analyzed by multivariate statistical methods of principal component analysis (PCA). Then linear discriminant analysis (LDA) was utilized to differentiate the loading score of different diseases as the diagnosing algorithm. Artificial neural network (ANN) was used for cross-validation. The diagnosis sensitivity and specificity by PCA-LDA are 88% and 79%, while that of the PCA-ANN are 89% and 95%. It can be seen that modern analyzing method is a useful tool for the analysis of serum spectra for diagnosing diseases.

  1. The fractal characteristic of facial anthropometric data for developing PCA fit test panels for youth born in central China.

    PubMed

    Yang, Lei; Wei, Ran; Shen, Henggen

    2017-01-01

    New principal component analysis (PCA) respirator fit test panels had been developed for current American and Chinese civilian workers based on anthropometric surveys. The PCA panels used the first two principal components (PCs) obtained from a set of 10 facial dimensions. Although the PCA panels for American and Chinese subjects adopted the bivairate framework with two PCs, the number of the PCs retained in the PCA analysis was different between Chinese subjects and Americans. For the Chinese youth group, the third PC should be retained in the PCA analysis for developing new fit test panels. In this article, an additional number label (ANL) is used to explain the third PC in PCA analysis when the first two PCs are used to construct the PCA half-facepiece respirator fit test panel for Chinese group. The three-dimensional box-counting method is proposed to estimate the ANLs by calculating fractal dimensions of the facial anthropometric data of the Chinese youth. The linear regression coefficients of scale-free range R 2 are all over 0.960, which demonstrates that the facial anthropometric data of the Chinese youth has fractal characteristic. The youth subjects born in Henan province has an ANL of 2.002, which is lower than the composite facial anthropometric data of Chinese subjects born in many provinces. Hence, Henan youth subjects have the self-similar facial anthropometric characteristic and should use the particular ANL (2.002) as the important tool along with using the PCA panel. The ANL method proposed in this article not only provides a new methodology in quantifying the characteristics of facial anthropometric dimensions for any ethnic/racial group, but also extends the scope of PCA panel studies to higher dimensions.

  2. A two-stage linear discriminant analysis via QR-decomposition.

    PubMed

    Ye, Jieping; Li, Qi

    2005-06-01

    Linear Discriminant Analysis (LDA) is a well-known method for feature extraction and dimension reduction. It has been used widely in many applications involving high-dimensional data, such as image and text classification. An intrinsic limitation of classical LDA is the so-called singularity problems; that is, it fails when all scatter matrices are singular. Many LDA extensions were proposed in the past to overcome the singularity problems. Among these extensions, PCA+LDA, a two-stage method, received relatively more attention. In PCA+LDA, the LDA stage is preceded by an intermediate dimension reduction stage using Principal Component Analysis (PCA). Most previous LDA extensions are computationally expensive, and not scalable, due to the use of Singular Value Decomposition or Generalized Singular Value Decomposition. In this paper, we propose a two-stage LDA method, namely LDA/QR, which aims to overcome the singularity problems of classical LDA, while achieving efficiency and scalability simultaneously. The key difference between LDA/QR and PCA+LDA lies in the first stage, where LDA/QR applies QR decomposition to a small matrix involving the class centroids, while PCA+LDA applies PCA to the total scatter matrix involving all training data points. We further justify the proposed algorithm by showing the relationship among LDA/QR and previous LDA methods. Extensive experiments on face images and text documents are presented to show the effectiveness of the proposed algorithm.

  3. Principle Component Analysis with Incomplete Data: A simulation of R pcaMethods package in Constructing an Environmental Quality Index with Missing Data

    EPA Science Inventory

    Missing data is a common problem in the application of statistical techniques. In principal component analysis (PCA), a technique for dimensionality reduction, incomplete data points are either discarded or imputed using interpolation methods. Such approaches are less valid when ...

  4. Visible micro-Raman spectroscopy of single human mammary epithelial cells exposed to x-ray radiation.

    PubMed

    Delfino, Ines; Perna, Giuseppe; Lasalvia, Maria; Capozzi, Vito; Manti, Lorenzo; Camerlingo, Carlo; Lepore, Maria

    2015-03-01

    A micro-Raman spectroscopy investigation has been performed in vitro on single human mammary epithelial cells after irradiation by graded x-ray doses. The analysis by principal component analysis (PCA) and interval-PCA (i-PCA) methods has allowed us to point out the small differences in the Raman spectra induced by irradiation. This experimental approach has enabled us to delineate radiation-induced changes in protein, nucleic acid, lipid, and carbohydrate content. In particular, the dose dependence of PCA and i-PCA components has been analyzed. Our results have confirmed that micro-Raman spectroscopy coupled to properly chosen data analysis methods is a very sensitive technique to detect early molecular changes at the single-cell level following exposure to ionizing radiation. This would help in developing innovative approaches to monitor radiation cancer radiotherapy outcome so as to reduce the overall radiation dose and minimize damage to the surrounding healthy cells, both aspects being of great importance in the field of radiation therapy.

  5. Experimental Researches on the Durability Indicators and the Physiological Comfort of Fabrics using the Principal Component Analysis (PCA) Method

    NASA Astrophysics Data System (ADS)

    Hristian, L.; Ostafe, M. M.; Manea, L. R.; Apostol, L. L.

    2017-06-01

    The work pursued the distribution of combed wool fabrics destined to manufacturing of external articles of clothing in terms of the values of durability and physiological comfort indices, using the mathematical model of Principal Component Analysis (PCA). Principal Components Analysis (PCA) applied in this study is a descriptive method of the multivariate analysis/multi-dimensional data, and aims to reduce, under control, the number of variables (columns) of the matrix data as much as possible to two or three. Therefore, based on the information about each group/assortment of fabrics, it is desired that, instead of nine inter-correlated variables, to have only two or three new variables called components. The PCA target is to extract the smallest number of components which recover the most of the total information contained in the initial data.

  6. Multilevel principal component analysis (mPCA) in shape analysis: A feasibility study in medical and dental imaging.

    PubMed

    Farnell, D J J; Popat, H; Richmond, S

    2016-06-01

    Methods used in image processing should reflect any multilevel structures inherent in the image dataset or they run the risk of functioning inadequately. We wish to test the feasibility of multilevel principal components analysis (PCA) to build active shape models (ASMs) for cases relevant to medical and dental imaging. Multilevel PCA was used to carry out model fitting to sets of landmark points and it was compared to the results of "standard" (single-level) PCA. Proof of principle was tested by applying mPCA to model basic peri-oral expressions (happy, neutral, sad) approximated to the junction between the mouth/lips. Monte Carlo simulations were used to create this data which allowed exploration of practical implementation issues such as the number of landmark points, number of images, and number of groups (i.e., "expressions" for this example). To further test the robustness of the method, mPCA was subsequently applied to a dental imaging dataset utilising landmark points (placed by different clinicians) along the boundary of mandibular cortical bone in panoramic radiographs of the face. Changes of expression that varied between groups were modelled correctly at one level of the model and changes in lip width that varied within groups at another for the Monte Carlo dataset. Extreme cases in the test dataset were modelled adequately by mPCA but not by standard PCA. Similarly, variations in the shape of the cortical bone were modelled by one level of mPCA and variations between the experts at another for the panoramic radiographs dataset. Results for mPCA were found to be comparable to those of standard PCA for point-to-point errors via miss-one-out testing for this dataset. These errors reduce with increasing number of eigenvectors/values retained, as expected. We have shown that mPCA can be used in shape models for dental and medical image processing. mPCA was found to provide more control and flexibility when compared to standard "single-level" PCA. Specifically, mPCA is preferable to "standard" PCA when multiple levels occur naturally in the dataset. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  7. Population Analysis of Disabled Children by Departments in France

    NASA Astrophysics Data System (ADS)

    Meidatuzzahra, Diah; Kuswanto, Heri; Pech, Nicolas; Etchegaray, Amélie

    2017-06-01

    In this study, a statistical analysis is performed by model the variations of the disabled about 0-19 years old population among French departments. The aim is to classify the departments according to their profile determinants (socioeconomic and behavioural profiles). The analysis is focused on two types of methods: principal component analysis (PCA) and multiple correspondences factorial analysis (MCA) to review which one is the best methods for interpretation of the correlation between the determinants of disability (independent variable). The PCA is the best method for interpretation of the correlation between the determinants of disability (independent variable). The PCA reduces 14 determinants of disability to 4 axes, keeps 80% of total information, and classifies them into 7 classes. The MCA reduces the determinants to 3 axes, retains only 30% of information, and classifies them into 4 classes.

  8. Selecting predictors for discriminant analysis of species performance: an example from an amphibious softwater plant.

    PubMed

    Vanderhaeghe, F; Smolders, A J P; Roelofs, J G M; Hoffmann, M

    2012-03-01

    Selecting an appropriate variable subset in linear multivariate methods is an important methodological issue for ecologists. Interest often exists in obtaining general predictive capacity or in finding causal inferences from predictor variables. Because of a lack of solid knowledge on a studied phenomenon, scientists explore predictor variables in order to find the most meaningful (i.e. discriminating) ones. As an example, we modelled the response of the amphibious softwater plant Eleocharis multicaulis using canonical discriminant function analysis. We asked how variables can be selected through comparison of several methods: univariate Pearson chi-square screening, principal components analysis (PCA) and step-wise analysis, as well as combinations of some methods. We expected PCA to perform best. The selected methods were evaluated through fit and stability of the resulting discriminant functions and through correlations between these functions and the predictor variables. The chi-square subset, at P < 0.05, followed by a step-wise sub-selection, gave the best results. In contrast to expectations, PCA performed poorly, as so did step-wise analysis. The different chi-square subset methods all yielded ecologically meaningful variables, while probable noise variables were also selected by PCA and step-wise analysis. We advise against the simple use of PCA or step-wise discriminant analysis to obtain an ecologically meaningful variable subset; the former because it does not take into account the response variable, the latter because noise variables are likely to be selected. We suggest that univariate screening techniques are a worthwhile alternative for variable selection in ecology. © 2011 German Botanical Society and The Royal Botanical Society of the Netherlands.

  9. Lack of association between NAT2 polymorphism and prostate cancer risk: a meta-analysis and trial sequential analysis

    PubMed Central

    Tang, Jingyuan; Xu, Lingyan; Xu, Haoxiang; Li, Ran; Han, Peng; Yang, Haiwei

    2017-01-01

    Previous studies have investigated the association between NAT2 polymorphism and the risk of prostate cancer (PCa). However, the findings from these studies remained inconsistent. Hence, we performed a meta-analysis to provide a more reliable conclusion about such associations. In the present meta-analysis, 13 independent case-control studies were included with a total of 14,469 PCa patients and 10,689 controls. All relevant studies published were searched in the databates PubMed, EMBASE, and Web of Science, till March 1st, 2017. We used the pooled odds ratios (ORs) with 95% confidence intervals (CIs) to evaluate the strength of the association between NAT2*4 allele and susceptibility to PCa. Subgroup analysis was carried out by ethnicity, source of controls and genotyping method. What's more, we also performed trial sequential analysis (TSA) to reduce the risk of type I error and evaluate whether the evidence of the results was firm. Firstly, our results indicated that NAT2*4 allele was not associated with PCa susceptibility (OR = 1.00, 95% CI= 0.95–1.05; P = 0.100). However, after excluding two studies for its heterogeneity and publication bias, no significant relationship was also detected between NAT2*4 allele and the increased risk of PCa, in fixed-effect model (OR = 0.99, 95% CI= 0.94–1.04; P = 0.451). Meanwhile, no significant increased risk of PCa was found in the subgroup analyses by ethnicity, source of controls and genotyping method. Moreover, TSA demonstrated that such association was confirmed in the present study. Therefore, this meta-analysis suggested that no significant association between NAT2 polymorphism and the risk of PCa was found. PMID:28915684

  10. Detection of compatibility between baclofen and excipients with aid of infrared spectroscopy and chemometry

    NASA Astrophysics Data System (ADS)

    Rojek, Barbara; Wesolowski, Marek; Suchacz, Bogdan

    2013-12-01

    In the paper infrared (IR) spectroscopy and multivariate exploration techniques: principal component analysis (PCA) and cluster analysis (CA) were applied as supportive methods for the detection of physicochemical incompatibilities between baclofen and excipients. In the course of research, the most useful rotational strategy in PCA proved to be varimax normalized, while in CA Ward's hierarchical agglomeration with Euclidean distance measure enabled to yield the most interpretable results. Chemometrical calculations confirmed the suitability of PCA and CA as the auxiliary methods for interpretation of infrared spectra in order to recognize whether compatibilities or incompatibilities between active substance and excipients occur. On the basis of IR spectra and the results of PCA and CA it was possible to demonstrate that the presence of lactose, β-cyclodextrin and meglumine in binary mixtures produce interactions with baclofen. The results were verified using differential scanning calorimetry, differential thermal analysis, thermogravimetry/differential thermogravimetry and X-ray powder diffraction analyses.

  11. RECENT APPLICATIONS OF SOURCE APPORTIONMENT METHODS AND RELATED NEEDS

    EPA Science Inventory

    Traditional receptor modeling studies have utilized factor analysis (like principal component analysis, PCA) and/or Chemical Mass Balance (CMB) to assess source influences. The limitations with these approaches is that PCA is qualitative and CMB requires the input of source pr...

  12. A stable systemic risk ranking in China's banking sector: Based on principal component analysis

    NASA Astrophysics Data System (ADS)

    Fang, Libing; Xiao, Binqing; Yu, Honghai; You, Qixing

    2018-02-01

    In this paper, we compare five popular systemic risk rankings, and apply principal component analysis (PCA) model to provide a stable systemic risk ranking for the Chinese banking sector. Our empirical results indicate that five methods suggest vastly different systemic risk rankings for the same bank, while the combined systemic risk measure based on PCA provides a reliable ranking. Furthermore, according to factor loadings of the first component, PCA combined ranking is mainly based on fundamentals instead of market price data. We clearly find that price-based rankings are not as practical a method as fundamentals-based ones. This PCA combined ranking directly shows systemic risk contributions of each bank for banking supervision purpose and reminds banks to prevent and cope with the financial crisis in advance.

  13. Time-oriented hierarchical method for computation of principal components using subspace learning algorithm.

    PubMed

    Jankovic, Marko; Ogawa, Hidemitsu

    2004-10-01

    Principal Component Analysis (PCA) and Principal Subspace Analysis (PSA) are classic techniques in statistical data analysis, feature extraction and data compression. Given a set of multivariate measurements, PCA and PSA provide a smaller set of "basis vectors" with less redundancy, and a subspace spanned by them, respectively. Artificial neurons and neural networks have been shown to perform PSA and PCA when gradient ascent (descent) learning rules are used, which is related to the constrained maximization (minimization) of statistical objective functions. Due to their low complexity, such algorithms and their implementation in neural networks are potentially useful in cases of tracking slow changes of correlations in the input data or in updating eigenvectors with new samples. In this paper we propose PCA learning algorithm that is fully homogeneous with respect to neurons. The algorithm is obtained by modification of one of the most famous PSA learning algorithms--Subspace Learning Algorithm (SLA). Modification of the algorithm is based on Time-Oriented Hierarchical Method (TOHM). The method uses two distinct time scales. On a faster time scale PSA algorithm is responsible for the "behavior" of all output neurons. On a slower scale, output neurons will compete for fulfillment of their "own interests". On this scale, basis vectors in the principal subspace are rotated toward the principal eigenvectors. At the end of the paper it will be briefly analyzed how (or why) time-oriented hierarchical method can be used for transformation of any of the existing neural network PSA method, into PCA method.

  14. Guided filter and principal component analysis hybrid method for hyperspectral pansharpening

    NASA Astrophysics Data System (ADS)

    Qu, Jiahui; Li, Yunsong; Dong, Wenqian

    2018-01-01

    Hyperspectral (HS) pansharpening aims to generate a fused HS image with high spectral and spatial resolution through integrating an HS image with a panchromatic (PAN) image. A guided filter (GF) and principal component analysis (PCA) hybrid HS pansharpening method is proposed. First, the HS image is interpolated and the PCA transformation is performed on the interpolated HS image. The first principal component (PC1) channel concentrates on the spatial information of the HS image. Different from the traditional PCA method, the proposed method sharpens the PAN image and utilizes the GF to obtain the spatial information difference between the HS image and the enhanced PAN image. Then, in order to reduce spectral and spatial distortion, an appropriate tradeoff parameter is defined and the spatial information difference is injected into the PC1 channel through multiplying by this tradeoff parameter. Once the new PC1 channel is obtained, the fused image is finally generated by the inverse PCA transformation. Experiments performed on both synthetic and real datasets show that the proposed method outperforms other several state-of-the-art HS pansharpening methods in both subjective and objective evaluations.

  15. Time-dependent analysis of dosage delivery information for patient-controlled analgesia services.

    PubMed

    Kuo, I-Ting; Chang, Kuang-Yi; Juan, De-Fong; Hsu, Steen J; Chan, Chia-Tai; Tsou, Mei-Yung

    2018-01-01

    Pain relief always plays the essential part of perioperative care and an important role of medical quality improvement. Patient-controlled analgesia (PCA) is a method that allows a patient to self-administer small boluses of analgesic to relieve the subjective pain. PCA logs from the infusion pump consisted of a lot of text messages which record all events during the therapies. The dosage information can be extracted from PCA logs to provide easily understanding features. The analysis of dosage information with time has great help to figure out the variance of a patient's pain relief condition. To explore the trend of pain relief requirement, we developed a PCA dosage information generator (PCA DIG) to extract meaningful messages from PCA logs during the first 48 hours of therapies. PCA dosage information including consumption, delivery, infusion rate, and the ratio between demand and delivery is presented with corresponding values in 4 successive time frames. Time-dependent statistical analysis demonstrated the trends of analgesia requirements decreased gradually along with time. These findings are compatible with clinical observations and further provide valuable information about the strategy to customize postoperative pain management.

  16. A feasibility study on age-related factors of wrist pulse using principal component analysis.

    PubMed

    Jang-Han Bae; Young Ju Jeon; Sanghun Lee; Jaeuk U Kim

    2016-08-01

    Various analysis methods for examining wrist pulse characteristics are needed for accurate pulse diagnosis. In this feasibility study, principal component analysis (PCA) was performed to observe age-related factors of wrist pulse from various analysis parameters. Forty subjects in the age group of 20s and 40s were participated, and their wrist pulse signal and respiration signal were acquired with the pulse tonometric device. After pre-processing of the signals, twenty analysis parameters which have been regarded as values reflecting pulse characteristics were calculated and PCA was performed. As a results, we could reduce complex parameters to lower dimension and age-related factors of wrist pulse were observed by combining-new analysis parameter derived from PCA. These results demonstrate that PCA can be useful tool for analyzing wrist pulse signal.

  17. Investigation of domain walls in PPLN by confocal raman microscopy and PCA analysis

    NASA Astrophysics Data System (ADS)

    Shur, Vladimir Ya.; Zelenovskiy, Pavel; Bourson, Patrice

    2017-07-01

    Confocal Raman microscopy (CRM) is a powerful tool for investigation of ferroelectric domains. Mechanical stresses and electric fields existed in the vicinity of neutral and charged domain walls modify frequency, intensity and width of spectral lines [1], thus allowing to visualize micro- and nanodomain structures both at the surface and in the bulk of the crystal [2,3]. Stresses and fields are naturally coupled in ferroelectrics due to inverse piezoelectric effect and hardly can be separated in Raman spectra. PCA is a powerful statistical method for analysis of large data matrix providing a set of orthogonal variables, called principal components (PCs). PCA is widely used for classification of experimental data, for example, in crystallization experiments, for detection of small amounts of components in solid mixtures etc. [4,5]. In Raman spectroscopy PCA was applied for analysis of phase transitions and provided critical pressure with good accuracy [6]. In the present work we for the first time applied Principal Component Analysis (PCA) method for analysis of Raman spectra measured in periodically poled lithium niobate (PPLN). We found that principal components demonstrate different sensitivity to mechanical stresses and electric fields in the vicinity of the domain walls. This allowed us to separately visualize spatial distribution of fields and electric fields at the surface and in the bulk of PPLN.

  18. Detecting phase separation of freeze-dried binary amorphous systems using pair-wise distribution function and multivariate data analysis.

    PubMed

    Chieng, Norman; Trnka, Hjalte; Boetker, Johan; Pikal, Michael; Rantanen, Jukka; Grohganz, Holger

    2013-09-15

    The purpose of this study is to investigate the use of multivariate data analysis for powder X-ray diffraction-pair-wise distribution function (PXRD-PDF) data to detect phase separation in freeze-dried binary amorphous systems. Polymer-polymer and polymer-sugar binary systems at various ratios were freeze-dried. All samples were analyzed by PXRD, transformed to PDF and analyzed by principal component analysis (PCA). These results were validated by differential scanning calorimetry (DSC) through characterization of glass transition of the maximally freeze-concentrate solute (Tg'). Analysis of PXRD-PDF data using PCA provides a more clear 'miscible' or 'phase separated' interpretation through the distribution pattern of samples on a score plot presentation compared to residual plot method. In a phase separated system, samples were found to be evenly distributed around the theoretical PDF profile. For systems that were miscible, a clear deviation of samples away from the theoretical PDF profile was observed. Moreover, PCA analysis allows simultaneous analysis of replicate samples. Comparatively, the phase behavior analysis from PXRD-PDF-PCA method was in agreement with the DSC results. Overall, the combined PXRD-PDF-PCA approach improves the clarity of the PXRD-PDF results and can be used as an alternative explorative data analytical tool in detecting phase separation in freeze-dried binary amorphous systems. Copyright © 2013 Elsevier B.V. All rights reserved.

  19. A novel method for qualitative analysis of edible oil oxidation using an electronic nose.

    PubMed

    Xu, Lirong; Yu, Xiuzhu; Liu, Lei; Zhang, Rui

    2016-07-01

    An electronic nose (E-nose) was used for rapid assessment of the degree of oxidation in edible oils. Peroxide and acid values of edible oil samples were analyzed using data obtained by the American Oil Chemists' Society (AOCS) Official Method for reference. Qualitative discrimination between non-oxidized and oxidized oils was conducted using the E-nose technique developed in combination with cluster analysis (CA), principal component analysis (PCA), and linear discriminant analysis (LDA). The results from CA, PCA and LDA indicated that the E-nose technique could be used for differentiation of non-oxidized and oxidized oils. LDA produced slightly better results than CA and PCA. The proposed approach can be used as an alternative to AOCS Official Method as an innovative tool for rapid detection of edible oil oxidation. Copyright © 2016 Elsevier Ltd. All rights reserved.

  20. Principal Component Analysis of Thermographic Data

    NASA Technical Reports Server (NTRS)

    Winfree, William P.; Cramer, K. Elliott; Zalameda, Joseph N.; Howell, Patricia A.; Burke, Eric R.

    2015-01-01

    Principal Component Analysis (PCA) has been shown effective for reducing thermographic NDE data. While a reliable technique for enhancing the visibility of defects in thermal data, PCA can be computationally intense and time consuming when applied to the large data sets typical in thermography. Additionally, PCA can experience problems when very large defects are present (defects that dominate the field-of-view), since the calculation of the eigenvectors is now governed by the presence of the defect, not the "good" material. To increase the processing speed and to minimize the negative effects of large defects, an alternative method of PCA is being pursued where a fixed set of eigenvectors, generated from an analytic model of the thermal response of the material under examination, is used to process the thermal data from composite materials. This method has been applied for characterization of flaws.

  1. Identification and classification of upper limb motions using PCA.

    PubMed

    Veer, Karan; Vig, Renu

    2018-03-28

    This paper describes the utility of principal component analysis (PCA) in classifying upper limb signals. PCA is a powerful tool for analyzing data of high dimension. Here, two different input strategies were explored. The first method uses upper arm dual-position-based myoelectric signal acquisition and the other solely uses PCA for classifying surface electromyogram (SEMG) signals. SEMG data from the biceps and the triceps brachii muscles and four independent muscle activities of the upper arm were measured in seven subjects (total dataset=56). The datasets used for the analysis are rotated by class-specific principal component matrices to decorrelate the measured data prior to feature extraction.

  2. Fast principal component analysis for stacking seismic data

    NASA Astrophysics Data System (ADS)

    Wu, Juan; Bai, Min

    2018-04-01

    Stacking seismic data plays an indispensable role in many steps of the seismic data processing and imaging workflow. Optimal stacking of seismic data can help mitigate seismic noise and enhance the principal components to a great extent. Traditional average-based seismic stacking methods cannot obtain optimal performance when the ambient noise is extremely strong. We propose a principal component analysis (PCA) algorithm for stacking seismic data without being sensitive to noise level. Considering the computational bottleneck of the classic PCA algorithm in processing massive seismic data, we propose an efficient PCA algorithm to make the proposed method readily applicable for industrial applications. Two numerically designed examples and one real seismic data are used to demonstrate the performance of the presented method.

  3. Clinical Significance of Retinoic Acid Receptor Beta Promoter Methylation in Prostate Cancer: A Meta-Analysis.

    PubMed

    Dou, MengMeng; Zhou, XueLiang; Fan, ZhiRui; Ding, XianFei; Li, LiFeng; Wang, ShuLing; Xue, Wenhua; Wang, Hui; Suo, Zhenhe; Deng, XiaoMing

    2018-01-01

    Retinoic acid receptor beta (RAR beta) is a retinoic acid receptor gene that has been shown to play key roles during multiple cancer processes, including cell proliferation, apoptosis, migration and invasion. Numerous studies have found that methylation of the RAR beta promoter contributed to the occurrence and development of malignant tumors. However, the connection between RAR beta promoter methylation and prostate cancer (PCa) remains unknown. This meta-analysis evaluated the clinical significance of RAR beta promoter methylation in PCa. We searched all published records relevant to RAR beta and PCa in a series of databases, including PubMed, Embase, Cochrane Library, ISI Web of Science and CNKI. The rates of RAR beta promoter methylation in the PCa and control groups (including benign prostatic hyperplasia and normal prostate tissues) were summarized. In addition, we evaluated the source region of available samples and the methods used to detect methylation. To compare the incidence and variation in RAR beta promoter methylation in PCa and non-PCa tissues, the odds ratio (OR) and 95% confidence interval (CI) were calculated accordingly. All the data were analyzed with the statistical software STATA 12.0. Based on the inclusion and exclusion criteria, 15 articles assessing 1,339 samples were further analyzed. These data showed that the RAR beta promoter methylation rates in PCa tissues were significantly higher than the rates in the non-PCa group (OR=21.65, 95% CI: 9.27-50.57). Subgroup analysis according to the source region of samples showed that heterogeneity in Asia was small (I2=0.0%, P=0.430). Additional subgroup analysis based on the method used to detect RAR beta promoter methylation showed that the heterogeneity detected by MSP (methylation-specific PCR) was relatively small (I2=11.3%, P=0.343). Although studies reported different rates for RAR beta promoter methylation in PCa tissues, the total analysis demonstrated that RAR beta promoter methylation may be correlated with PCa carcinogenesis and that the RAR beta gene is particularly susceptible. Additional studies with sufficient data are essential to further evaluate the clinical features and prognostic utility of RAR beta promoter methylation in PCa. © 2018 The Author(s). Published by S. Karger AG, Basel.

  4. An improved PCA method with application to boiler leak detection.

    PubMed

    Sun, Xi; Marquez, Horacio J; Chen, Tongwen; Riaz, Muhammad

    2005-07-01

    Principal component analysis (PCA) is a popular fault detection technique. It has been widely used in process industries, especially in the chemical industry. In industrial applications, achieving a sensitive system capable of detecting incipient faults, which maintains the false alarm rate to a minimum, is a crucial issue. Although a lot of research has been focused on these issues for PCA-based fault detection and diagnosis methods, sensitivity of the fault detection scheme versus false alarm rate continues to be an important issue. In this paper, an improved PCA method is proposed to address this problem. In this method, a new data preprocessing scheme and a new fault detection scheme designed for Hotelling's T2 as well as the squared prediction error are developed. A dynamic PCA model is also developed for boiler leak detection. This new method is applied to boiler water/steam leak detection with real data from Syncrude Canada's utility plant in Fort McMurray, Canada. Our results demonstrate that the proposed method can effectively reduce false alarm rate, provide effective and correct leak alarms, and give early warning to operators.

  5. Classification of alloys using laser induced breakdown spectroscopy with principle component analysis

    NASA Astrophysics Data System (ADS)

    Syuhada Mangsor, Aneez; Haider Rizvi, Zuhaib; Chaudhary, Kashif; Safwan Aziz, Muhammad

    2018-05-01

    The study of atomic spectroscopy has contributed to a wide range of scientific applications. In principle, laser induced breakdown spectroscopy (LIBS) method has been used to analyse various types of matter regardless of its physical state, either it is solid, liquid or gas because all elements emit light of characteristic frequencies when it is excited to sufficiently high energy. The aim of this work was to analyse the signature spectrums of each element contained in three different types of samples. Metal alloys of Aluminium, Titanium and Brass with the purities of 75%, 80%, 85%, 90% and 95% were used as the manipulated variable and their LIBS spectra were recorded. The characteristic emission lines of main elements were identified from the spectra as well as its corresponding contents. Principal component analysis (PCA) was carried out using the data from LIBS spectra. Three obvious clusters were observed in 3-dimensional PCA plot which corresponding to the different group of alloys. Findings from this study showed that LIBS technology with the help of principle component analysis could conduct the variety discrimination of alloys demonstrating the capability of LIBS-PCA method in field of spectro-analysis. Thus, LIBS-PCA method is believed to be an effective method for classifying alloys with different percentage of purifications, which was high-cost and time-consuming before.

  6. Application of principal component analysis to distinguish patients with schizophrenia from healthy controls based on fractional anisotropy measurements.

    PubMed

    Caprihan, A; Pearlson, G D; Calhoun, V D

    2008-08-15

    Principal component analysis (PCA) is often used to reduce the dimension of data before applying more sophisticated data analysis methods such as non-linear classification algorithms or independent component analysis. This practice is based on selecting components corresponding to the largest eigenvalues. If the ultimate goal is separation of data in two groups, then these set of components need not have the most discriminatory power. We measured the distance between two such populations using Mahalanobis distance and chose the eigenvectors to maximize it, a modified PCA method, which we call the discriminant PCA (DPCA). DPCA was applied to diffusion tensor-based fractional anisotropy images to distinguish age-matched schizophrenia subjects from healthy controls. The performance of the proposed method was evaluated by the one-leave-out method. We show that for this fractional anisotropy data set, the classification error with 60 components was close to the minimum error and that the Mahalanobis distance was twice as large with DPCA, than with PCA. Finally, by masking the discriminant function with the white matter tracts of the Johns Hopkins University atlas, we identified left superior longitudinal fasciculus as the tract which gave the least classification error. In addition, with six optimally chosen tracts the classification error was zero.

  7. Principal Component Analysis: A Method for Determining the Essential Dynamics of Proteins

    PubMed Central

    David, Charles C.; Jacobs, Donald J.

    2015-01-01

    It has become commonplace to employ principal component analysis to reveal the most important motions in proteins. This method is more commonly known by its acronym, PCA. While most popular molecular dynamics packages inevitably provide PCA tools to analyze protein trajectories, researchers often make inferences of their results without having insight into how to make interpretations, and they are often unaware of limitations and generalizations of such analysis. Here we review best practices for applying standard PCA, describe useful variants, discuss why one may wish to make comparison studies, and describe a set of metrics that make comparisons possible. In practice, one will be forced to make inferences about the essential dynamics of a protein without having the desired amount of samples. Therefore, considerable time is spent on describing how to judge the significance of results, highlighting pitfalls. The topic of PCA is reviewed from the perspective of many practical considerations, and useful recipes are provided. PMID:24061923

  8. Principal component analysis: a method for determining the essential dynamics of proteins.

    PubMed

    David, Charles C; Jacobs, Donald J

    2014-01-01

    It has become commonplace to employ principal component analysis to reveal the most important motions in proteins. This method is more commonly known by its acronym, PCA. While most popular molecular dynamics packages inevitably provide PCA tools to analyze protein trajectories, researchers often make inferences of their results without having insight into how to make interpretations, and they are often unaware of limitations and generalizations of such analysis. Here we review best practices for applying standard PCA, describe useful variants, discuss why one may wish to make comparison studies, and describe a set of metrics that make comparisons possible. In practice, one will be forced to make inferences about the essential dynamics of a protein without having the desired amount of samples. Therefore, considerable time is spent on describing how to judge the significance of results, highlighting pitfalls. The topic of PCA is reviewed from the perspective of many practical considerations, and useful recipes are provided.

  9. PCA based clustering for brain tumor segmentation of T1w MRI images.

    PubMed

    Kaya, Irem Ersöz; Pehlivanlı, Ayça Çakmak; Sekizkardeş, Emine Gezmez; Ibrikci, Turgay

    2017-03-01

    Medical images are huge collections of information that are difficult to store and process consuming extensive computing time. Therefore, the reduction techniques are commonly used as a data pre-processing step to make the image data less complex so that a high-dimensional data can be identified by an appropriate low-dimensional representation. PCA is one of the most popular multivariate methods for data reduction. This paper is focused on T1-weighted MRI images clustering for brain tumor segmentation with dimension reduction by different common Principle Component Analysis (PCA) algorithms. Our primary aim is to present a comparison between different variations of PCA algorithms on MRIs for two cluster methods. Five most common PCA algorithms; namely the conventional PCA, Probabilistic Principal Component Analysis (PPCA), Expectation Maximization Based Principal Component Analysis (EM-PCA), Generalize Hebbian Algorithm (GHA), and Adaptive Principal Component Extraction (APEX) were applied to reduce dimensionality in advance of two clustering algorithms, K-Means and Fuzzy C-Means. In the study, the T1-weighted MRI images of the human brain with brain tumor were used for clustering. In addition to the original size of 512 lines and 512 pixels per line, three more different sizes, 256 × 256, 128 × 128 and 64 × 64, were included in the study to examine their effect on the methods. The obtained results were compared in terms of both the reconstruction errors and the Euclidean distance errors among the clustered images containing the same number of principle components. According to the findings, the PPCA obtained the best results among all others. Furthermore, the EM-PCA and the PPCA assisted K-Means algorithm to accomplish the best clustering performance in the majority as well as achieving significant results with both clustering algorithms for all size of T1w MRI images. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  10. A new statistic for identifying batch effects in high-throughput genomic data that uses guided principal component analysis.

    PubMed

    Reese, Sarah E; Archer, Kellie J; Therneau, Terry M; Atkinson, Elizabeth J; Vachon, Celine M; de Andrade, Mariza; Kocher, Jean-Pierre A; Eckel-Passow, Jeanette E

    2013-11-15

    Batch effects are due to probe-specific systematic variation between groups of samples (batches) resulting from experimental features that are not of biological interest. Principal component analysis (PCA) is commonly used as a visual tool to determine whether batch effects exist after applying a global normalization method. However, PCA yields linear combinations of the variables that contribute maximum variance and thus will not necessarily detect batch effects if they are not the largest source of variability in the data. We present an extension of PCA to quantify the existence of batch effects, called guided PCA (gPCA). We describe a test statistic that uses gPCA to test whether a batch effect exists. We apply our proposed test statistic derived using gPCA to simulated data and to two copy number variation case studies: the first study consisted of 614 samples from a breast cancer family study using Illumina Human 660 bead-chip arrays, whereas the second case study consisted of 703 samples from a family blood pressure study that used Affymetrix SNP Array 6.0. We demonstrate that our statistic has good statistical properties and is able to identify significant batch effects in two copy number variation case studies. We developed a new statistic that uses gPCA to identify whether batch effects exist in high-throughput genomic data. Although our examples pertain to copy number data, gPCA is general and can be used on other data types as well. The gPCA R package (Available via CRAN) provides functionality and data to perform the methods in this article. reesese@vcu.edu

  11. Characterizing Variability of Modular Brain Connectivity with Constrained Principal Component Analysis

    PubMed Central

    Hirayama, Jun-ichiro; Hyvärinen, Aapo; Kiviniemi, Vesa; Kawanabe, Motoaki; Yamashita, Okito

    2016-01-01

    Characterizing the variability of resting-state functional brain connectivity across subjects and/or over time has recently attracted much attention. Principal component analysis (PCA) serves as a fundamental statistical technique for such analyses. However, performing PCA on high-dimensional connectivity matrices yields complicated “eigenconnectivity” patterns, for which systematic interpretation is a challenging issue. Here, we overcome this issue with a novel constrained PCA method for connectivity matrices by extending the idea of the previously proposed orthogonal connectivity factorization method. Our new method, modular connectivity factorization (MCF), explicitly introduces the modularity of brain networks as a parametric constraint on eigenconnectivity matrices. In particular, MCF analyzes the variability in both intra- and inter-module connectivities, simultaneously finding network modules in a principled, data-driven manner. The parametric constraint provides a compact module-based visualization scheme with which the result can be intuitively interpreted. We develop an optimization algorithm to solve the constrained PCA problem and validate our method in simulation studies and with a resting-state functional connectivity MRI dataset of 986 subjects. The results show that the proposed MCF method successfully reveals the underlying modular eigenconnectivity patterns in more general situations and is a promising alternative to existing methods. PMID:28002474

  12. Multi-Centrality Graph Spectral Decompositions and Their Application to Cyber Intrusion Detection

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, Pin-Yu; Choudhury, Sutanay; Hero, Alfred

    Many modern datasets can be represented as graphs and hence spectral decompositions such as graph principal component analysis (PCA) can be useful. Distinct from previous graph decomposition approaches based on subspace projection of a single topological feature, e.g., the centered graph adjacency matrix (graph Laplacian), we propose spectral decomposition approaches to graph PCA and graph dictionary learning that integrate multiple features, including graph walk statistics, centrality measures and graph distances to reference nodes. In this paper we propose a new PCA method for single graph analysis, called multi-centrality graph PCA (MC-GPCA), and a new dictionary learning method for ensembles ofmore » graphs, called multi-centrality graph dictionary learning (MC-GDL), both based on spectral decomposition of multi-centrality matrices. As an application to cyber intrusion detection, MC-GPCA can be an effective indicator of anomalous connectivity pattern and MC-GDL can provide discriminative basis for attack classification.« less

  13. PCA-LBG-based algorithms for VQ codebook generation

    NASA Astrophysics Data System (ADS)

    Tsai, Jinn-Tsong; Yang, Po-Yuan

    2015-04-01

    Vector quantisation (VQ) codebooks are generated by combining principal component analysis (PCA) algorithms with Linde-Buzo-Gray (LBG) algorithms. All training vectors are grouped according to the projected values of the principal components. The PCA-LBG-based algorithms include (1) PCA-LBG-Median, which selects the median vector of each group, (2) PCA-LBG-Centroid, which adopts the centroid vector of each group, and (3) PCA-LBG-Random, which randomly selects a vector of each group. The LBG algorithm finds a codebook based on the better vectors sent to an initial codebook by the PCA. The PCA performs an orthogonal transformation to convert a set of potentially correlated variables into a set of variables that are not linearly correlated. Because the orthogonal transformation efficiently distinguishes test image vectors, the proposed PCA-LBG-based algorithm is expected to outperform conventional algorithms in designing VQ codebooks. The experimental results confirm that the proposed PCA-LBG-based algorithms indeed obtain better results compared to existing methods reported in the literature.

  14. Nonlinear multivariate and time series analysis by neural network methods

    NASA Astrophysics Data System (ADS)

    Hsieh, William W.

    2004-03-01

    Methods in multivariate statistical analysis are essential for working with large amounts of geophysical data, data from observational arrays, from satellites, or from numerical model output. In classical multivariate statistical analysis, there is a hierarchy of methods, starting with linear regression at the base, followed by principal component analysis (PCA) and finally canonical correlation analysis (CCA). A multivariate time series method, the singular spectrum analysis (SSA), has been a fruitful extension of the PCA technique. The common drawback of these classical methods is that only linear structures can be correctly extracted from the data. Since the late 1980s, neural network methods have become popular for performing nonlinear regression and classification. More recently, neural network methods have been extended to perform nonlinear PCA (NLPCA), nonlinear CCA (NLCCA), and nonlinear SSA (NLSSA). This paper presents a unified view of the NLPCA, NLCCA, and NLSSA techniques and their applications to various data sets of the atmosphere and the ocean (especially for the El Niño-Southern Oscillation and the stratospheric quasi-biennial oscillation). These data sets reveal that the linear methods are often too simplistic to describe real-world systems, with a tendency to scatter a single oscillatory phenomenon into numerous unphysical modes or higher harmonics, which can be largely alleviated in the new nonlinear paradigm.

  15. Fluorescence, electrophoretic and chromatographic fingerprints of herbal medicines and their comparative chemometric analysis.

    PubMed

    Mazina, Jekaterina; Vaher, Merike; Kuhtinskaja, Maria; Poryvkina, Larisa; Kaljurand, Mihkel

    2015-07-01

    The aim of the present study was to compare the polyphenolic compositions of 47 medicinal herbs (HM) and four herbal tea mixtures from Central Estonia by rapid, reliable and sensitive Spectral Fluorescence Signature (SFS) method in a front face mode. The SFS method was validated for the main identified HM representatives including detection limits (0.037mgL(-1) for catechin, 0.052mgL(-1) for protocatechuic acid, 0.136mgL(-1) for chlorogenic acid, 0.058mgL(-1) for syringic acid and 0.256mgL(-1) for ferulic acid), linearity (up to 5.0-15mgL(-1)), intra-day precision (RSDs=6.6-10.6%), inter-day precision (RSDs=6.4-13.8%), matrix effect (-15.8 to +5.5) and recovery (85-107%). The phytochemical fingerprints were differentiated by parallel factor analysis (PARAFAC) combined with hierarchical cluster analysis (CA) and principal component analysis (PCA). HM were clustered into four main clusters (catechin-like, hydroxycinnamic acid-like, dihydrobenzoic acid-like derivatives containing HM and HM with low/very low content of fluorescent constituents) and 14 subclusters (rich, medium, low/very low contents). The average accuracy and precision of CA for validation HM set were 97.4% (within 85.2-100%) and 89.6%, (within 66.7-100%), respectively. PARAFAC-PCA/CA has improved the analysis of HM by the SFS method. The results were verified by two separation methods CE-DAD and HPLC-DAD-MS also combined with PARAFAC-PCA/CA. The SFS-PARAFAC-PCA/CA method has potential as a rapid and reliable tool for investigating the fingerprints and predicting the composition of HM or evaluating the quality and authenticity of different standardised formulas. Moreover, SFS-PARAFAC-PCA/CA can be implemented as a laboratory and/or an onsite method. Copyright © 2015 Elsevier B.V. All rights reserved.

  16. EMPCA and Cluster Analysis of Quasar Spectra: Construction and Application to Simulated Spectra

    NASA Astrophysics Data System (ADS)

    Marrs, Adam; Leighly, Karen; Wagner, Cassidy; Macinnis, Francis

    2017-01-01

    Quasars have complex spectra with emission lines influenced by many factors. Therefore, to fully describe the spectrum requires specification of a large number of parameters, such as line equivalent width, blueshift, and ratios. Principal Component Analysis (PCA) aims to construct eigenvectors-or principal components-from the data with the goal of finding a few key parameters that can be used to predict the rest of the spectrum fairly well. Analysis of simulated quasar spectra was used to verify and justify our modified application of PCA.We used a variant of PCA called Weighted Expectation Maximization PCA (EMPCA; Bailey 2012) along with k-means cluster analysis to analyze simulated quasar spectra. Our approach combines both analytical methods to address two known problems with classical PCA. EMPCA uses weights to account for uncertainty and missing points in the spectra. K-means groups similar spectra together to address the nonlinearity of quasar spectra, specifically variance in blueshifts and widths of the emission lines.In producing and analyzing simulations, we first tested the effects of varying equivalent widths and blueshifts on the derived principal components, and explored the differences between standard PCA and EMPCA. We also tested the effects of varying signal-to-noise ratio. Next we used the results of fits to composite quasar spectra (see accompanying poster by Wagner et al.) to construct a set of realistic simulated spectra, and subjected those spectra to the EMPCA /k-means analysis. We concluded that our approach was validated when we found that the mean spectra from our k-means clusters derived from PCA projection coefficients reproduced the trends observed in the composite spectra.Furthermore, our method needed only two eigenvectors to identify both sets of correlations used to construct the simulations, as well as indicating the linear and nonlinear segments. Comparing this to regular PCA, which can require a dozen or more components, or to direct spectral analysis that may need measurement of 20 fit parameters, shows why the dual application of these two techniques is such a powerful tool.

  17. Nondestructive determination of transgenic Bacillus thuringiensis rice seeds (Oryza sativa L.) using multispectral imaging and chemometric methods.

    PubMed

    Liu, Changhong; Liu, Wei; Lu, Xuzhong; Chen, Wei; Yang, Jianbo; Zheng, Lei

    2014-06-15

    Crop-to-crop transgene flow may affect the seed purity of non-transgenic rice varieties, resulting in unwanted biosafety consequences. The feasibility of a rapid and nondestructive determination of transgenic rice seeds from its non-transgenic counterparts was examined by using multispectral imaging system combined with chemometric data analysis. Principal component analysis (PCA), partial least squares discriminant analysis (PLSDA), least squares-support vector machines (LS-SVM), and PCA-back propagation neural network (PCA-BPNN) methods were applied to classify rice seeds according to their genetic origins. The results demonstrated that clear differences between non-transgenic and transgenic rice seeds could be easily visualized with the nondestructive determination method developed through this study and an excellent classification (up to 100% with LS-SVM model) can be achieved. It is concluded that multispectral imaging together with chemometric data analysis is a promising technique to identify transgenic rice seeds with high efficiency, providing bright prospects for future applications. Copyright © 2013 Elsevier Ltd. All rights reserved.

  18. DWI-associated entire-tumor histogram analysis for the differentiation of low-grade prostate cancer from intermediate-high-grade prostate cancer.

    PubMed

    Wu, Chen-Jiang; Wang, Qing; Li, Hai; Wang, Xiao-Ning; Liu, Xi-Sheng; Shi, Hai-Bin; Zhang, Yu-Dong

    2015-10-01

    To investigate diagnostic efficiency of DWI using entire-tumor histogram analysis in differentiating the low-grade (LG) prostate cancer (PCa) from intermediate-high-grade (HG) PCa in comparison with conventional ROI-based measurement. DW images (b of 0-1400 s/mm(2)) from 126 pathology-confirmed PCa (diameter >0.5 cm) in 110 patients were retrospectively collected and processed by mono-exponential model. The measurement of tumor apparent diffusion coefficients (ADCs) was performed with using histogram-based and ROI-based approach, respectively. The diagnostic ability of ADCs from two methods for differentiating LG-PCa (Gleason score, GS ≤ 6) from HG-PCa (GS > 6) was determined by ROC regression, and compared by McNemar's test. There were 49 LG-tumor and 77 HG-tumor at pathologic findings. Histogram-based ADCs (mean, median, 10th and 90th) and ROI-based ADCs (mean) showed dominant relationships with ordinal GS of Pca (ρ = -0.225 to -0.406, p < 0.05). All above imaging indices reflected significant difference between LG-PCa and HG-PCa (all p values <0.01). Histogram 10th ADCs had dominantly high Az (0.738), Youden index (0.415), and positive likelihood ratio (LR+, 2.45) in stratifying tumor GS against mean, median and 90th ADCs, and ROI-based ADCs. Histogram mean, median, and 10th ADCs showed higher specificity (65.3%-74.1% vs. 44.9%, p < 0.01), but lower sensitivity (57.1%-71.3% vs. 84.4%, p < 0.05) than ROI-based ADCs in differentiating LG-PCa from HG-PCa. DWI-associated histogram analysis had higher specificity, Az, Youden index, and LR+ for differentiation of PCa Gleason grade than ROI-based approach.

  19. A stock market forecasting model combining two-directional two-dimensional principal component analysis and radial basis function neural network.

    PubMed

    Guo, Zhiqiang; Wang, Huaiqing; Yang, Jie; Miller, David J

    2015-01-01

    In this paper, we propose and implement a hybrid model combining two-directional two-dimensional principal component analysis ((2D)2PCA) and a Radial Basis Function Neural Network (RBFNN) to forecast stock market behavior. First, 36 stock market technical variables are selected as the input features, and a sliding window is used to obtain the input data of the model. Next, (2D)2PCA is utilized to reduce the dimension of the data and extract its intrinsic features. Finally, an RBFNN accepts the data processed by (2D)2PCA to forecast the next day's stock price or movement. The proposed model is used on the Shanghai stock market index, and the experiments show that the model achieves a good level of fitness. The proposed model is then compared with one that uses the traditional dimension reduction method principal component analysis (PCA) and independent component analysis (ICA). The empirical results show that the proposed model outperforms the PCA-based model, as well as alternative models based on ICA and on the multilayer perceptron.

  20. A Stock Market Forecasting Model Combining Two-Directional Two-Dimensional Principal Component Analysis and Radial Basis Function Neural Network

    PubMed Central

    Guo, Zhiqiang; Wang, Huaiqing; Yang, Jie; Miller, David J.

    2015-01-01

    In this paper, we propose and implement a hybrid model combining two-directional two-dimensional principal component analysis ((2D)2PCA) and a Radial Basis Function Neural Network (RBFNN) to forecast stock market behavior. First, 36 stock market technical variables are selected as the input features, and a sliding window is used to obtain the input data of the model. Next, (2D)2PCA is utilized to reduce the dimension of the data and extract its intrinsic features. Finally, an RBFNN accepts the data processed by (2D)2PCA to forecast the next day's stock price or movement. The proposed model is used on the Shanghai stock market index, and the experiments show that the model achieves a good level of fitness. The proposed model is then compared with one that uses the traditional dimension reduction method principal component analysis (PCA) and independent component analysis (ICA). The empirical results show that the proposed model outperforms the PCA-based model, as well as alternative models based on ICA and on the multilayer perceptron. PMID:25849483

  1. ToF-SIMS PCA analysis of Myrtus communis L.

    NASA Astrophysics Data System (ADS)

    Piras, F. M.; Dettori, M. F.; Magnani, A.

    2009-06-01

    Nowadays there is a growing interest of researchers for the application of sophisticated analytical techniques in conjunction with statistical data analysis methods to the characterization of natural products to assure their authenticity and quality, and for the possibility of direct analysis of food to obtain maximum information. In this work, time-of-flight secondary ion mass spectrometry (ToF-SIMS) in conjunction with principal components analysis (PCA) are applied to study the chemical composition and variability of Sardinian myrtle ( Myrtus communis L.) through the analysis of both berries alcoholic extracts and berries epicarp. ToF-SIMS spectra of berries epicarp show that the epicuticular waxes consist mainly of carboxylic acids with chain length ranging from C20 to C30, or identical species formed from fragmentation of long-chain esters. PCA of ToF-SIMS data from myrtle berries epicarp distinguishes two groups characterized by a different surface concentration of triacontanoic acid. Variability in antocyanins, flavonols, α-tocopherol, and myrtucommulone contents is showed by ToF-SIMS PCA analysis of myrtle berries alcoholic extracts.

  2. Chemometric Methods to Quantify 1D and 2D NMR Spectral Differences Among Similar Protein Therapeutics.

    PubMed

    Chen, Kang; Park, Junyong; Li, Feng; Patil, Sharadrao M; Keire, David A

    2018-04-01

    NMR spectroscopy is an emerging analytical tool for measuring complex drug product qualities, e.g., protein higher order structure (HOS) or heparin chemical composition. Most drug NMR spectra have been visually analyzed; however, NMR spectra are inherently quantitative and multivariate and thus suitable for chemometric analysis. Therefore, quantitative measurements derived from chemometric comparisons between spectra could be a key step in establishing acceptance criteria for a new generic drug or a new batch after manufacture change. To measure the capability of chemometric methods to differentiate comparator NMR spectra, we calculated inter-spectra difference metrics on 1D/2D spectra of two insulin drugs, Humulin R® and Novolin R®, from different manufacturers. Both insulin drugs have an identical drug substance but differ in formulation. Chemometric methods (i.e., principal component analysis (PCA), 3-way Tucker3 or graph invariant (GI)) were performed to calculate Mahalanobis distance (D M ) between the two brands (inter-brand) and distance ratio (D R ) among the different lots (intra-brand). The PCA on 1D inter-brand spectral comparison yielded a D M value of 213. In comparing 2D spectra, the Tucker3 analysis yielded the highest differentiability value (D M  = 305) in the comparisons made followed by PCA (D M  = 255) then the GI method (D M  = 40). In conclusion, drug quality comparisons among different lots might benefit from PCA on 1D spectra for rapidly comparing many samples, while higher resolution but more time-consuming 2D-NMR-data-based comparisons using Tucker3 analysis or PCA provide a greater level of assurance for drug structural similarity evaluation between drug brands.

  3. Free energy landscape of a biomolecule in dihedral principal component space: sampling convergence and correspondence between structures and minima.

    PubMed

    Maisuradze, Gia G; Leitner, David M

    2007-05-15

    Dihedral principal component analysis (dPCA) has recently been developed and shown to display complex features of the free energy landscape of a biomolecule that may be absent in the free energy landscape plotted in principal component space due to mixing of internal and overall rotational motion that can occur in principal component analysis (PCA) [Mu et al., Proteins: Struct Funct Bioinfo 2005;58:45-52]. Another difficulty in the implementation of PCA is sampling convergence, which we address here for both dPCA and PCA using a tetrapeptide as an example. We find that for both methods the sampling convergence can be reached over a similar time. Minima in the free energy landscape in the space of the two largest dihedral principal components often correspond to unique structures, though we also find some distinct minima to correspond to the same structure. 2007 Wiley-Liss, Inc.

  4. Low-Dimensional Feature Representation for Instrument Identification

    NASA Astrophysics Data System (ADS)

    Ihara, Mizuki; Maeda, Shin-Ichi; Ikeda, Kazushi; Ishii, Shin

    For monophonic music instrument identification, various feature extraction and selection methods have been proposed. One of the issues toward instrument identification is that the same spectrum is not always observed even in the same instrument due to the difference of the recording condition. Therefore, it is important to find non-redundant instrument-specific features that maintain information essential for high-quality instrument identification to apply them to various instrumental music analyses. For such a dimensionality reduction method, the authors propose the utilization of linear projection methods: local Fisher discriminant analysis (LFDA) and LFDA combined with principal component analysis (PCA). After experimentally clarifying that raw power spectra are actually good for instrument classification, the authors reduced the feature dimensionality by LFDA or by PCA followed by LFDA (PCA-LFDA). The reduced features achieved reasonably high identification performance that was comparable or higher than those by the power spectra and those achieved by other existing studies. These results demonstrated that our LFDA and PCA-LFDA can successfully extract low-dimensional instrument features that maintain the characteristic information of the instruments.

  5. Comparison of common components analysis with principal components analysis and independent components analysis: Application to SPME-GC-MS volatolomic signatures.

    PubMed

    Bouhlel, Jihéne; Jouan-Rimbaud Bouveresse, Delphine; Abouelkaram, Said; Baéza, Elisabeth; Jondreville, Catherine; Travel, Angélique; Ratel, Jérémy; Engel, Erwan; Rutledge, Douglas N

    2018-02-01

    The aim of this work is to compare a novel exploratory chemometrics method, Common Components Analysis (CCA), with Principal Components Analysis (PCA) and Independent Components Analysis (ICA). CCA consists in adapting the multi-block statistical method known as Common Components and Specific Weights Analysis (CCSWA or ComDim) by applying it to a single data matrix, with one variable per block. As an application, the three methods were applied to SPME-GC-MS volatolomic signatures of livers in an attempt to reveal volatile organic compounds (VOCs) markers of chicken exposure to different types of micropollutants. An application of CCA to the initial SPME-GC-MS data revealed a drift in the sample Scores along CC2, as a function of injection order, probably resulting from time-related evolution in the instrument. This drift was eliminated by orthogonalization of the data set with respect to CC2, and the resulting data are used as the orthogonalized data input into each of the three methods. Since the first step in CCA is to norm-scale all the variables, preliminary data scaling has no effect on the results, so that CCA was applied only to orthogonalized SPME-GC-MS data, while, PCA and ICA were applied to the "orthogonalized", "orthogonalized and Pareto-scaled", and "orthogonalized and autoscaled" data. The comparison showed that PCA results were highly dependent on the scaling of variables, contrary to ICA where the data scaling did not have a strong influence. Nevertheless, for both PCA and ICA the clearest separations of exposed groups were obtained after autoscaling of variables. The main part of this work was to compare the CCA results using the orthogonalized data with those obtained with PCA and ICA applied to orthogonalized and autoscaled variables. The clearest separations of exposed chicken groups were obtained by CCA. CCA Loadings also clearly identified the variables contributing most to the Common Components giving separations. The PCA Loadings did not highlight the most influencing variables for each separation, whereas the ICA Loadings highlighted the same variables as did CCA. This study shows the potential of CCA for the extraction of pertinent information from a data matrix, using a procedure based on an original optimisation criterion, to produce results that are complementary, and in some cases may be superior, to those of PCA and ICA. Copyright © 2017 Elsevier B.V. All rights reserved.

  6. MiR-145 detection in urinary extracellular vesicles increase diagnostic efficiency of prostate cancer based on hydrostatic filtration dialysis method.

    PubMed

    Xu, Yong; Qin, Sihua; An, Taixue; Tang, Yueting; Huang, Yiyao; Zheng, Lei

    2017-07-01

    Extracellular vesicles (EVs) can be detected in body fluids and may serve as disease biomarkers. Increasing evidence suggests that circulating miRNAs in serum and urine may be potential non-invasive biomarkers for prostate cancer (PCa). In the present study, we aimed to investigate whether hydrostatic filtration dialysis (HFD) is suitable for urinary EVs (UEVs) isolation and whether such reported PCa-related miRNAs can be detected in UEVs as PCa biomarkers. To analyze EVs miRNAs, we searched for an easy and economic method to enrich EVs from urine samples. We compared the efficiency of HFD method and conventional ultracentrifugation (UC) in isolating UEVs. Subsequently, UEVs were isolated from patients with PCa, patients with benign prostate hyperplasia (BPH) and healthy individuals. Differential expression of four PCa-related miRNAs (miR-572, miR-1290, miR-141, and miR-145) were measured in UEVs and paired serum EVs using SYBR Green-based quantitative reverse transcription-polymerase chain reaction (qRT-PCR). The overall performance of HFD was similar to UC. In miRNA yield, both HFD and UC can meet the needs of further analysis. The level of miR-145 in UEVs was significantly increased in patients with PCa compared with the patients with BPH (P = 0.018). In addition, significant increase was observed in miR-145 levels when patients with Gleason score ≥8 tumors compared with Gleason score ≤7 (P = 0.020). Receiver-operating characteristic curve (ROC) revealed that miR-145 in UEVs combined with serum PSA could differentiate PCa from BPH better than PSA alone (AUC 0.863 and AUC 0.805, respectively). In serum EVs, four miRNAs were significantly higher in patients with PCa than with BPH. HFD is appropriate for UEVs isolation and miRNA analysis when compared with conventional UC. miR-145 in UEVs is upregulated from PCa patients compared BPH patients and healthy controls. We suggest the potential use of UEVs miR-145 as a biomarker of PCa. © 2017 Wiley Periodicals, Inc.

  7. Algorithms for accelerated convergence of adaptive PCA.

    PubMed

    Chatterjee, C; Kang, Z; Roychowdhury, V P

    2000-01-01

    We derive and discuss new adaptive algorithms for principal component analysis (PCA) that are shown to converge faster than the traditional PCA algorithms due to Oja, Sanger, and Xu. It is well known that traditional PCA algorithms that are derived by using gradient descent on an objective function are slow to converge. Furthermore, the convergence of these algorithms depends on appropriate choices of the gain sequences. Since online applications demand faster convergence and an automatic selection of gains, we present new adaptive algorithms to solve these problems. We first present an unconstrained objective function, which can be minimized to obtain the principal components. We derive adaptive algorithms from this objective function by using: 1) gradient descent; 2) steepest descent; 3) conjugate direction; and 4) Newton-Raphson methods. Although gradient descent produces Xu's LMSER algorithm, the steepest descent, conjugate direction, and Newton-Raphson methods produce new adaptive algorithms for PCA. We also provide a discussion on the landscape of the objective function, and present a global convergence proof of the adaptive gradient descent PCA algorithm using stochastic approximation theory. Extensive experiments with stationary and nonstationary multidimensional Gaussian sequences show faster convergence of the new algorithms over the traditional gradient descent methods.We also compare the steepest descent adaptive algorithm with state-of-the-art methods on stationary and nonstationary sequences.

  8. Complexity of free energy landscapes of peptides revealed by nonlinear principal component analysis.

    PubMed

    Nguyen, Phuong H

    2006-12-01

    Employing the recently developed hierarchical nonlinear principal component analysis (NLPCA) method of Saegusa et al. (Neurocomputing 2004;61:57-70 and IEICE Trans Inf Syst 2005;E88-D:2242-2248), the complexities of the free energy landscapes of several peptides, including triglycine, hexaalanine, and the C-terminal beta-hairpin of protein G, were studied. First, the performance of this NLPCA method was compared with the standard linear principal component analysis (PCA). In particular, we compared two methods according to (1) the ability of the dimensionality reduction and (2) the efficient representation of peptide conformations in low-dimensional spaces spanned by the first few principal components. The study revealed that NLPCA reduces the dimensionality of the considered systems much better, than did PCA. For example, in order to get the similar error, which is due to representation of the original data of beta-hairpin in low dimensional space, one needs 4 and 21 principal components of NLPCA and PCA, respectively. Second, by representing the free energy landscapes of the considered systems as a function of the first two principal components obtained from PCA, we obtained the relatively well-structured free energy landscapes. In contrast, the free energy landscapes of NLPCA are much more complicated, exhibiting many states which are hidden in the PCA maps, especially in the unfolded regions. Furthermore, the study also showed that many states in the PCA maps are mixed up by several peptide conformations, while those of the NLPCA maps are more pure. This finding suggests that the NLPCA should be used to capture the essential features of the systems. (c) 2006 Wiley-Liss, Inc.

  9. [Identification of varieties of textile fibers by using Vis/NIR infrared spectroscopy technique].

    PubMed

    Wu, Gui-Fang; He, Yong

    2010-02-01

    The aim of the present paper was to provide new insight into Vis/NIR spectroscopic analysis of textile fibers. In order to achieve rapid identification of the varieties of fibers, the authors selected 5 kinds of fibers of cotton, flax, wool, silk and tencel to do a study with Vis/NIR spectroscopy. Firstly, the spectra of each kind of fiber were scanned by spectrometer, and principal component analysis (PCA) method was used to analyze the characteristics of the pattern of Vis/NIR spectra. Principal component scores scatter plot (PC1 x PC2 x PC3) of fiber indicated the classification effect of five varieties of fibers. The former 6 principal components (PCs) were selected according to the quantity and size of PCs. The PCA classification model was optimized by using the least-squares support vector machines (LS-SVM) method. The authors used the 6 PCs extracted by PCA as the inputs of LS-SVM, and PCA-LS-SVM model was built to achieve varieties validation as well as mathematical model building and optimization analysis. Two hundred samples (40 samples for each variety of fibers) of five varieties of fibers were used for calibration of PCA-LS-SVM model, and the other 50 samples (10 samples for each variety of fibers) were used for validation. The result of validation showed that Vis/NIR spectroscopy technique based on PCA-LS-SVM had a powerful classification capability. It provides a new method for identifying varieties of fibers rapidly and real time, so it has important significance for protecting the rights of consumers, ensuring the quality of textiles, and implementing rationalization production and transaction of textile materials and its production.

  10. An algorithm for separation of mixed sparse and Gaussian sources

    PubMed Central

    Akkalkotkar, Ameya

    2017-01-01

    Independent component analysis (ICA) is a ubiquitous method for decomposing complex signal mixtures into a small set of statistically independent source signals. However, in cases in which the signal mixture consists of both nongaussian and Gaussian sources, the Gaussian sources will not be recoverable by ICA and will pollute estimates of the nongaussian sources. Therefore, it is desirable to have methods for mixed ICA/PCA which can separate mixtures of Gaussian and nongaussian sources. For mixtures of purely Gaussian sources, principal component analysis (PCA) can provide a basis for the Gaussian subspace. We introduce a new method for mixed ICA/PCA which we call Mixed ICA/PCA via Reproducibility Stability (MIPReSt). Our method uses a repeated estimations technique to rank sources by reproducibility, combined with decomposition of multiple subsamplings of the original data matrix. These multiple decompositions allow us to assess component stability as the size of the data matrix changes, which can be used to determinine the dimension of the nongaussian subspace in a mixture. We demonstrate the utility of MIPReSt for signal mixtures consisting of simulated sources and real-word (speech) sources, as well as mixture of unknown composition. PMID:28414814

  11. An algorithm for separation of mixed sparse and Gaussian sources.

    PubMed

    Akkalkotkar, Ameya; Brown, Kevin Scott

    2017-01-01

    Independent component analysis (ICA) is a ubiquitous method for decomposing complex signal mixtures into a small set of statistically independent source signals. However, in cases in which the signal mixture consists of both nongaussian and Gaussian sources, the Gaussian sources will not be recoverable by ICA and will pollute estimates of the nongaussian sources. Therefore, it is desirable to have methods for mixed ICA/PCA which can separate mixtures of Gaussian and nongaussian sources. For mixtures of purely Gaussian sources, principal component analysis (PCA) can provide a basis for the Gaussian subspace. We introduce a new method for mixed ICA/PCA which we call Mixed ICA/PCA via Reproducibility Stability (MIPReSt). Our method uses a repeated estimations technique to rank sources by reproducibility, combined with decomposition of multiple subsamplings of the original data matrix. These multiple decompositions allow us to assess component stability as the size of the data matrix changes, which can be used to determinine the dimension of the nongaussian subspace in a mixture. We demonstrate the utility of MIPReSt for signal mixtures consisting of simulated sources and real-word (speech) sources, as well as mixture of unknown composition.

  12. Mapping brain activity in gradient-echo functional MRI using principal component analysis

    NASA Astrophysics Data System (ADS)

    Khosla, Deepak; Singh, Manbir; Don, Manuel

    1997-05-01

    The detection of sites of brain activation in functional MRI has been a topic of immense research interest and many technique shave been proposed to this end. Recently, principal component analysis (PCA) has been applied to extract the activated regions and their time course of activation. This method is based on the assumption that the activation is orthogonal to other signal variations such as brain motion, physiological oscillations and other uncorrelated noises. A distinct advantage of this method is that it does not require any knowledge of the time course of the true stimulus paradigm. This technique is well suited to EPI image sequences where the sampling rate is high enough to capture the effects of physiological oscillations. In this work, we propose and apply tow methods that are based on PCA to conventional gradient-echo images and investigate their usefulness as tools to extract reliable information on brain activation. The first method is a conventional technique where a single image sequence with alternating on and off stages is subject to a principal component analysis. The second method is a PCA-based approach called the common spatial factor analysis technique (CSF). As the name suggests, this method relies on common spatial factors between the above fMRI image sequence and a background fMRI. We have applied these methods to identify active brain ares during visual stimulation and motor tasks. The results from these methods are compared to those obtained by using the standard cross-correlation technique. We found good agreement in the areas identified as active across all three techniques. The results suggest that PCA and CSF methods have good potential in detecting the true stimulus correlated changes in the presence of other interfering signals.

  13. Principal Component Analysis for pulse-shape discrimination of scintillation radiation detectors

    NASA Astrophysics Data System (ADS)

    Alharbi, T.

    2016-01-01

    In this paper, we report on the application of Principal Component analysis (PCA) for pulse-shape discrimination (PSD) of scintillation radiation detectors. The details of the method are described and the performance of the method is experimentally examined by discriminating between neutrons and gamma-rays with a liquid scintillation detector in a mixed radiation field. The performance of the method is also compared against that of the conventional charge-comparison method, demonstrating the superior performance of the method particularly at low light output range. PCA analysis has the important advantage of automatic extraction of the pulse-shape characteristics which makes the PSD method directly applicable to various scintillation detectors without the need for the adjustment of a PSD parameter.

  14. Prediction of protein-protein interactions from amino acid sequences with ensemble extreme learning machines and principal component analysis.

    PubMed

    You, Zhu-Hong; Lei, Ying-Ke; Zhu, Lin; Xia, Junfeng; Wang, Bing

    2013-01-01

    Protein-protein interactions (PPIs) play crucial roles in the execution of various cellular processes and form the basis of biological mechanisms. Although large amount of PPIs data for different species has been generated by high-throughput experimental techniques, current PPI pairs obtained with experimental methods cover only a fraction of the complete PPI networks, and further, the experimental methods for identifying PPIs are both time-consuming and expensive. Hence, it is urgent and challenging to develop automated computational methods to efficiently and accurately predict PPIs. We present here a novel hierarchical PCA-EELM (principal component analysis-ensemble extreme learning machine) model to predict protein-protein interactions only using the information of protein sequences. In the proposed method, 11188 protein pairs retrieved from the DIP database were encoded into feature vectors by using four kinds of protein sequences information. Focusing on dimension reduction, an effective feature extraction method PCA was then employed to construct the most discriminative new feature set. Finally, multiple extreme learning machines were trained and then aggregated into a consensus classifier by majority voting. The ensembling of extreme learning machine removes the dependence of results on initial random weights and improves the prediction performance. When performed on the PPI data of Saccharomyces cerevisiae, the proposed method achieved 87.00% prediction accuracy with 86.15% sensitivity at the precision of 87.59%. Extensive experiments are performed to compare our method with state-of-the-art techniques Support Vector Machine (SVM). Experimental results demonstrate that proposed PCA-EELM outperforms the SVM method by 5-fold cross-validation. Besides, PCA-EELM performs faster than PCA-SVM based method. Consequently, the proposed approach can be considered as a new promising and powerful tools for predicting PPI with excellent performance and less time.

  15. Once upon Multivariate Analyses: When They Tell Several Stories about Biological Evolution.

    PubMed

    Renaud, Sabrina; Dufour, Anne-Béatrice; Hardouin, Emilie A; Ledevin, Ronan; Auffray, Jean-Christophe

    2015-01-01

    Geometric morphometrics aims to characterize of the geometry of complex traits. It is therefore by essence multivariate. The most popular methods to investigate patterns of differentiation in this context are (1) the Principal Component Analysis (PCA), which is an eigenvalue decomposition of the total variance-covariance matrix among all specimens; (2) the Canonical Variate Analysis (CVA, a.k.a. linear discriminant analysis (LDA) for more than two groups), which aims at separating the groups by maximizing the between-group to within-group variance ratio; (3) the between-group PCA (bgPCA) which investigates patterns of between-group variation, without standardizing by the within-group variance. Standardizing within-group variance, as performed in the CVA, distorts the relationships among groups, an effect that is particularly strong if the variance is similarly oriented in a comparable way in all groups. Such shared direction of main morphological variance may occur and have a biological meaning, for instance corresponding to the most frequent standing genetic variation in a population. Here we undertake a case study of the evolution of house mouse molar shape across various islands, based on the real dataset and simulations. We investigated how patterns of main variance influence the depiction of among-group differentiation according to the interpretation of the PCA, bgPCA and CVA. Without arguing about a method performing 'better' than another, it rather emerges that working on the total or between-group variance (PCA and bgPCA) will tend to put the focus on the role of direction of main variance as line of least resistance to evolution. Standardizing by the within-group variance (CVA), by dampening the expression of this line of least resistance, has the potential to reveal other relevant patterns of differentiation that may otherwise be blurred.

  16. A comparison of the usefulness of canonical analysis, principal components analysis, and band selection for extraction of features from TMS data for landcover analysis

    NASA Technical Reports Server (NTRS)

    Boyd, R. K.; Brumfield, J. O.; Campbell, W. J.

    1984-01-01

    Three feature extraction methods, canonical analysis (CA), principal component analysis (PCA), and band selection, have been applied to Thematic Mapper Simulator (TMS) data in order to evaluate the relative performance of the methods. The results obtained show that CA is capable of providing a transformation of TMS data which leads to better classification results than provided by all seven bands, by PCA, or by band selection. A second conclusion drawn from the study is that TMS bands 2, 3, 4, and 7 (thermal) are most important for landcover classification.

  17. A Study of Wind Turbine Comprehensive Operational Assessment Model Based on EM-PCA Algorithm

    NASA Astrophysics Data System (ADS)

    Zhou, Minqiang; Xu, Bin; Zhan, Yangyan; Ren, Danyuan; Liu, Dexing

    2018-01-01

    To assess wind turbine performance accurately and provide theoretical basis for wind farm management, a hybrid assessment model based on Entropy Method and Principle Component Analysis (EM-PCA) was established, which took most factors of operational performance into consideration and reach to a comprehensive result. To verify the model, six wind turbines were chosen as the research objects, the ranking obtained by the method proposed in the paper were 4#>6#>1#>5#>2#>3#, which are completely in conformity with the theoretical ranking, which indicates that the reliability and effectiveness of the EM-PCA method are high. The method could give guidance for processing unit state comparison among different units and launching wind farm operational assessment.

  18. Power line identification of millimeter wave radar based on PCA-GS-SVM

    NASA Astrophysics Data System (ADS)

    Fang, Fang; Zhang, Guifeng; Cheng, Yansheng

    2017-12-01

    Aiming at the problem that the existing detection method can not effectively solve the security of UAV's ultra low altitude flight caused by power line, a power line recognition method based on grid search (GS) and the principal component analysis and support vector machine (PCA-SVM) is proposed. Firstly, the candidate line of Hough transform is reduced by PCA, and the main feature of candidate line is extracted. Then, upport vector machine (SVM is) optimized by grid search method (GS). Finally, using support vector machine classifier optimized parameters to classify the candidate line. MATLAB simulation results show that this method can effectively identify the power line and noise, and has high recognition accuracy and algorithm efficiency.

  19. Preliminary identification of unicellular algal genus by using combined confocal resonance Raman spectroscopy with PCA and DPLS analysis

    NASA Astrophysics Data System (ADS)

    He, Shixuan; Xie, Wanyi; Zhang, Ping; Fang, Shaoxi; Li, Zhe; Tang, Peng; Gao, Xia; Guo, Jinsong; Tlili, Chaker; Wang, Deqiang

    2018-02-01

    The analysis of algae and dominant alga plays important roles in ecological and environmental fields since it can be used to forecast water bloom and control its potential deleterious effects. Herein, we combine in vivo confocal resonance Raman spectroscopy with multivariate analysis methods to preliminary identify the three algal genera in water blooms at unicellular scale. Statistical analysis of characteristic Raman peaks demonstrates that certain shifts and different normalized intensities, resulting from composition of different carotenoids, exist in Raman spectra of three algal cells. Principal component analysis (PCA) scores and corresponding loading weights show some differences from Raman spectral characteristics which are caused by vibrations of carotenoids in unicellular algae. Then, discriminant partial least squares (DPLS) classification method is used to verify the effectiveness of algal identification with confocal resonance Raman spectroscopy. Our results show that confocal resonance Raman spectroscopy combined with PCA and DPLS could handle the preliminary identification of dominant alga for forecasting and controlling of water blooms.

  20. Performance evaluation of BPM system in SSRF using PCA method

    NASA Astrophysics Data System (ADS)

    Chen, Zhi-Chu; Leng, Yong-Bin; Yan, Ying-Bing; Yuan, Ren-Xian; Lai, Long-Wei

    2014-07-01

    The beam position monitor (BPM) system is of most importance in a light source. The capability of the BPM depends on the resolution of the system. The traditional standard deviation on the raw data method merely gives the upper limit of the resolution. Principal component analysis (PCA) had been introduced in the accelerator physics and it could be used to get rid of the actual signals. Beam related information was extracted before the evaluation of the BPM performance. A series of studies had been made in the Shanghai Synchrotron Radiation Facility (SSRF) and PCA was proved to be an effective and robust method in the performance evaluations of our BPM system.

  1. Study of support vector machine and serum surface-enhanced Raman spectroscopy for noninvasive esophageal cancer detection

    NASA Astrophysics Data System (ADS)

    Li, Shao-Xin; Zeng, Qiu-Yao; Li, Lin-Fang; Zhang, Yan-Jiao; Wan, Ming-Ming; Liu, Zhi-Ming; Xiong, Hong-Lian; Guo, Zhou-Yi; Liu, Song-Hao

    2013-02-01

    The ability of combining serum surface-enhanced Raman spectroscopy (SERS) with support vector machine (SVM) for improving classification esophageal cancer patients from normal volunteers is investigated. Two groups of serum SERS spectra based on silver nanoparticles (AgNPs) are obtained: one group from patients with pathologically confirmed esophageal cancer (n=30) and the other group from healthy volunteers (n=31). Principal components analysis (PCA), conventional SVM (C-SVM) and conventional SVM combination with PCA (PCA-SVM) methods are implemented to classify the same spectral dataset. Results show that a diagnostic accuracy of 77.0% is acquired for PCA technique, while diagnostic accuracies of 83.6% and 85.2% are obtained for C-SVM and PCA-SVM methods based on radial basis functions (RBF) models. The results prove that RBF SVM models are superior to PCA algorithm in classification serum SERS spectra. The study demonstrates that serum SERS in combination with SVM technique has great potential to provide an effective and accurate diagnostic schema for noninvasive detection of esophageal cancer.

  2. Short-term PV/T module temperature prediction based on PCA-RBF neural network

    NASA Astrophysics Data System (ADS)

    Li, Jiyong; Zhao, Zhendong; Li, Yisheng; Xiao, Jing; Tang, Yunfeng

    2018-02-01

    Aiming at the non-linearity and large inertia of temperature control in PV/T system, short-term temperature prediction of PV/T module is proposed, to make the PV/T system controller run forward according to the short-term forecasting situation to optimize control effect. Based on the analysis of the correlation between PV/T module temperature and meteorological factors, and the temperature of adjacent time series, the principal component analysis (PCA) method is used to pre-process the original input sample data. Combined with the RBF neural network theory, the simulation results show that the PCA method makes the prediction accuracy of the network model higher and the generalization performance stronger than that of the RBF neural network without the main component extraction.

  3. Protein-RNA specificity by high-throughput principal component analysis of NMR spectra.

    PubMed

    Collins, Katherine M; Oregioni, Alain; Robertson, Laura E; Kelly, Geoff; Ramos, Andres

    2015-03-31

    Defining the RNA target selectivity of the proteins regulating mRNA metabolism is a key issue in RNA biology. Here we present a novel use of principal component analysis (PCA) to extract the RNA sequence preference of RNA binding proteins. We show that PCA can be used to compare the changes in the nuclear magnetic resonance (NMR) spectrum of a protein upon binding a set of quasi-degenerate RNAs and define the nucleobase specificity. We couple this application of PCA to an automated NMR spectra recording and processing protocol and obtain an unbiased and high-throughput NMR method for the analysis of nucleobase preference in protein-RNA interactions. We test the method on the RNA binding domains of three important regulators of RNA metabolism. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. Q-mode versus R-mode principal component analysis for linear discriminant analysis (LDA)

    NASA Astrophysics Data System (ADS)

    Lee, Loong Chuen; Liong, Choong-Yeun; Jemain, Abdul Aziz

    2017-05-01

    Many literature apply Principal Component Analysis (PCA) as either preliminary visualization or variable con-struction methods or both. Focus of PCA can be on the samples (R-mode PCA) or variables (Q-mode PCA). Traditionally, R-mode PCA has been the usual approach to reduce high-dimensionality data before the application of Linear Discriminant Analysis (LDA), to solve classification problems. Output from PCA composed of two new matrices known as loadings and scores matrices. Each matrix can then be used to produce a plot, i.e. loadings plot aids identification of important variables whereas scores plot presents spatial distribution of samples on new axes that are also known as Principal Components (PCs). Fundamentally, the scores matrix always be the input variables for building classification model. A recent paper uses Q-mode PCA but the focus of analysis was not on the variables but instead on the samples. As a result, the authors have exchanged the use of both loadings and scores plots in which clustering of samples was studied using loadings plot whereas scores plot has been used to identify important manifest variables. Therefore, the aim of this study is to statistically validate the proposed practice. Evaluation is based on performance of external error obtained from LDA models according to number of PCs. On top of that, bootstrapping was also conducted to evaluate the external error of each of the LDA models. Results show that LDA models produced by PCs from R-mode PCA give logical performance and the matched external error are also unbiased whereas the ones produced with Q-mode PCA show the opposites. With that, we concluded that PCs produced from Q-mode is not statistically stable and thus should not be applied to problems of classifying samples, but variables. We hope this paper will provide some insights on the disputable issues.

  5. Method of Real-Time Principal-Component Analysis

    NASA Technical Reports Server (NTRS)

    Duong, Tuan; Duong, Vu

    2005-01-01

    Dominant-element-based gradient descent and dynamic initial learning rate (DOGEDYN) is a method of sequential principal-component analysis (PCA) that is well suited for such applications as data compression and extraction of features from sets of data. In comparison with a prior method of gradient-descent-based sequential PCA, this method offers a greater rate of learning convergence. Like the prior method, DOGEDYN can be implemented in software. However, the main advantage of DOGEDYN over the prior method lies in the facts that it requires less computation and can be implemented in simpler hardware. It should be possible to implement DOGEDYN in compact, low-power, very-large-scale integrated (VLSI) circuitry that could process data in real time.

  6. Assets as a Socioeconomic Status Index: Categorical Principal Components Analysis vs. Latent Class Analysis.

    PubMed

    Sartipi, Majid; Nedjat, Saharnaz; Mansournia, Mohammad Ali; Baigi, Vali; Fotouhi, Akbar

    2016-11-01

    Some variables like Socioeconomic Status (SES) cannot be directly measured, instead, so-called 'latent variables' are measured indirectly through calculating tangible items. There are different methods for measuring latent variables such as data reduction methods e.g. Principal Components Analysis (PCA) and Latent Class Analysis (LCA). The purpose of our study was to measure assets index- as a representative of SES- through two methods of Non-Linear PCA (NLPCA) and LCA, and to compare them for choosing the most appropriate model. This was a cross sectional study in which 1995 respondents filled the questionnaires about their assets in Tehran. The data were analyzed by SPSS 19 (CATPCA command) and SAS 9.2 (PROC LCA command) to estimate their socioeconomic status. The results were compared based on the Intra-class Correlation Coefficient (ICC). The 6 derived classes from LCA based on BIC, were highly consistent with the 6 classes from CATPCA (Categorical PCA) (ICC = 0.87, 95%CI: 0.86 - 0.88). There is no gold standard to measure SES. Therefore, it is not possible to definitely say that a specific method is better than another one. LCA is a complicated method that presents detailed information about latent variables and required one assumption (local independency), while NLPCA is a simple method, which requires more assumptions. Generally, NLPCA seems to be an acceptable method of analysis because of its simplicity and high agreement with LCA.

  7. Geographical classification of Epimedium based on HPLC fingerprint analysis combined with multi-ingredients quantitative analysis.

    PubMed

    Xu, Ning; Zhou, Guofu; Li, Xiaojuan; Lu, Heng; Meng, Fanyun; Zhai, Huaqiang

    2017-05-01

    A reliable and comprehensive method for identifying the origin and assessing the quality of Epimedium has been developed. The method is based on analysis of HPLC fingerprints, combined with similarity analysis, hierarchical cluster analysis (HCA), principal component analysis (PCA) and multi-ingredient quantitative analysis. Nineteen batches of Epimedium, collected from different areas in the western regions of China, were used to establish the fingerprints and 18 peaks were selected for the analysis. Similarity analysis, HCA and PCA all classified the 19 areas into three groups. Simultaneous quantification of the five major bioactive ingredients in the Epimedium samples was also carried out to confirm the consistency of the quality tests. These methods were successfully used to identify the geographical origin of the Epimedium samples and to evaluate their quality. Copyright © 2016 John Wiley & Sons, Ltd.

  8. Origin Discrimination of Osmanthus fragrans var. thunbergii Flowers using GC-MS and UPLC-PDA Combined with Multivariable Analysis Methods.

    PubMed

    Zhou, Fei; Zhao, Yajing; Peng, Jiyu; Jiang, Yirong; Li, Maiquan; Jiang, Yuan; Lu, Baiyi

    2017-07-01

    Osmanthus fragrans flowers are used as folk medicine and additives for teas, beverages and foods. The metabolites of O. fragrans flowers from different geographical origins were inconsistent in some extent. Chromatography and mass spectrometry combined with multivariable analysis methods provides an approach for discriminating the origin of O. fragrans flowers. To discriminate the Osmanthus fragrans var. thunbergii flowers from different origins with the identified metabolites. GC-MS and UPLC-PDA were conducted to analyse the metabolites in O. fragrans var. thunbergii flowers (in total 150 samples). Principal component analysis (PCA), soft independent modelling of class analogy analysis (SIMCA) and random forest (RF) analysis were applied to group the GC-MS and UPLC-PDA data. GC-MS identified 32 compounds common to all samples while UPLC-PDA/QTOF-MS identified 16 common compounds. PCA of the UPLC-PDA data generated a better clustering than PCA of the GC-MS data. Ten metabolites (six from GC-MS and four from UPLC-PDA) were selected as effective compounds for discrimination by PCA loadings. SIMCA and RF analysis were used to build classification models, and the RF model, based on the four effective compounds (caffeic acid derivative, acteoside, ligustroside and compound 15), yielded better results with the classification rate of 100% in the calibration set and 97.8% in the prediction set. GC-MS and UPLC-PDA combined with multivariable analysis methods can discriminate the origin of Osmanthus fragrans var. thunbergii flowers. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.

  9. Multivariate Analysis of Electron Detachment Dissociation and Infrared Multiphoton Dissociation Mass Spectra of Heparan Sulfate Tetrasaccharides Differing Only in Hexuronic acid Stereochemistry

    NASA Astrophysics Data System (ADS)

    Oh, Han Bin; Leach, Franklin E.; Arungundram, Sailaja; Al-Mafraji, Kanar; Venot, Andre; Boons, Geert-Jan; Amster, I. Jonathan

    2011-03-01

    The structural characterization of glycosaminoglycan (GAG) carbohydrates by mass spectrometry has been a long-standing analytical challenge due to the inherent heterogeneity of these biomolecules, specifically polydispersity, variability in sulfation, and hexuronic acid stereochemistry. Recent advances in tandem mass spectrometry methods employing threshold and electron-based ion activation have resulted in the ability to determine the location of the labile sulfate modification as well as assign the stereochemistry of hexuronic acid residues. To facilitate the analysis of complex electron detachment dissociation (EDD) spectra, principal component analysis (PCA) is employed to differentiate the hexuronic acid stereochemistry of four synthetic GAG epimers whose EDD spectra are nearly identical upon visual inspection. For comparison, PCA is also applied to infrared multiphoton dissociation spectra (IRMPD) of the examined epimers. To assess the applicability of multivariate methods in GAG mixture analysis, PCA is utilized to identify the relative content of two epimers in a binary mixture.

  10. Genome-wide copy number analysis reveals candidate gene loci that confer susceptibility to high-grade prostate cancer.

    PubMed

    Poniah, Prevathe; Mohd Zain, Shamsul; Abdul Razack, Azad Hassan; Kuppusamy, Shanggar; Karuppayah, Shankar; Sian Eng, Hooi; Mohamed, Zahurin

    2017-09-01

    Two key issues in prostate cancer (PCa) that demand attention currently are the need for a more precise and minimally invasive screening test owing to the inaccuracy of prostate-specific antigen and differential diagnosis to distinguish advanced vs. indolent cancers. This continues to pose a tremendous challenge in diagnosis and prognosis of PCa and could potentially lead to overdiagnosis and overtreatment complications. Copy number variations (CNVs) in the human genome have been linked to various carcinomas including PCa. Detection of these variants may improve clinical treatment as well as an understanding of the pathobiology underlying this complex disease. To this end, we undertook a pilot genome-wide CNV analysis approach in 36 subjects (18 patients with high-grade PCa and 18 controls that were matched by age and ethnicity) in search of more accurate biomarkers that could potentially explain susceptibility toward high-grade PCa. We conducted this study using the array comparative genomic hybridization technique. Array results were validated in 92 independent samples (46 high-grade PCa, 23 benign prostatic hyperplasia, and 23 healthy controls) using polymerase chain reaction-based copy number counting method. A total of 314 CNV regions were found to be unique to PCa subjects in this cohort (P<0.05). A log 2 ratio-based copy number analysis revealed 5 putative rare or novel CNV loci or both associated with susceptibility to PCa. The CNV gain regions were 1q21.3, 15q15, 7p12.1, and a novel CNV in PCa 12q23.1, harboring ARNT, THBS1, SLC5A8, and DDC genes that are crucial in the p53 and cancer pathways. A CNV loss and deletion event was observed at 8p11.21, which contains the SFRP1 gene from the Wnt signaling pathway. Cross-comparison analysis with genes associated to PCa revealed significant CNVs involved in biological processes that elicit cancer pathogenesis via cytokine production and endothelial cell proliferation. In conclusion, we postulated that the CNVs identified in this study could provide an insight into the development of advanced PCa. Copyright © 2017 Elsevier Inc. All rights reserved.

  11. Application of Principal Component Analysis to NIR Spectra of Phyllosilicates: A Technique for Identifying Phyllosilicates on Mars

    NASA Technical Reports Server (NTRS)

    Rampe, E. B.; Lanza, N. L.

    2012-01-01

    Orbital near-infrared (NIR) reflectance spectra of the martian surface from the OMEGA and CRISM instruments have identified a variety of phyllosilicates in Noachian terrains. The types of phyllosilicates present on Mars have important implications for the aqueous environments in which they formed, and, thus, for recognizing locales that may have been habitable. Current identifications of phyllosilicates from martian NIR data are based on the positions of spectral absorptions relative to laboratory data of well-characterized samples and from spectral ratios; however, some phyllosilicates can be difficult to distinguish from one another with these methods (i.e. illite vs. muscovite). Here we employ a multivariate statistical technique, principal component analysis (PCA), to differentiate between spectrally similar phyllosilicate minerals. PCA is commonly used in a variety of industries (pharmaceutical, agricultural, viticultural) to discriminate between samples. Previous work using PCA to analyze raw NIR reflectance data from mineral mixtures has shown that this is a viable technique for identifying mineral types, abundances, and particle sizes. Here, we evaluate PCA of second-derivative NIR reflectance data as a method for classifying phyllosilicates and test whether this method can be used to identify phyllosilicates on Mars.

  12. Contact- and distance-based principal component analysis of protein dynamics.

    PubMed

    Ernst, Matthias; Sittel, Florian; Stock, Gerhard

    2015-12-28

    To interpret molecular dynamics simulations of complex systems, systematic dimensionality reduction methods such as principal component analysis (PCA) represent a well-established and popular approach. Apart from Cartesian coordinates, internal coordinates, e.g., backbone dihedral angles or various kinds of distances, may be used as input data in a PCA. Adopting two well-known model problems, folding of villin headpiece and the functional dynamics of BPTI, a systematic study of PCA using distance-based measures is presented which employs distances between Cα-atoms as well as distances between inter-residue contacts including side chains. While this approach seems prohibitive for larger systems due to the quadratic scaling of the number of distances with the size of the molecule, it is shown that it is sufficient (and sometimes even better) to include only relatively few selected distances in the analysis. The quality of the PCA is assessed by considering the resolution of the resulting free energy landscape (to identify metastable conformational states and barriers) and the decay behavior of the corresponding autocorrelation functions (to test the time scale separation of the PCA). By comparing results obtained with distance-based, dihedral angle, and Cartesian coordinates, the study shows that the choice of input variables may drastically influence the outcome of a PCA.

  13. Contact- and distance-based principal component analysis of protein dynamics

    NASA Astrophysics Data System (ADS)

    Ernst, Matthias; Sittel, Florian; Stock, Gerhard

    2015-12-01

    To interpret molecular dynamics simulations of complex systems, systematic dimensionality reduction methods such as principal component analysis (PCA) represent a well-established and popular approach. Apart from Cartesian coordinates, internal coordinates, e.g., backbone dihedral angles or various kinds of distances, may be used as input data in a PCA. Adopting two well-known model problems, folding of villin headpiece and the functional dynamics of BPTI, a systematic study of PCA using distance-based measures is presented which employs distances between Cα-atoms as well as distances between inter-residue contacts including side chains. While this approach seems prohibitive for larger systems due to the quadratic scaling of the number of distances with the size of the molecule, it is shown that it is sufficient (and sometimes even better) to include only relatively few selected distances in the analysis. The quality of the PCA is assessed by considering the resolution of the resulting free energy landscape (to identify metastable conformational states and barriers) and the decay behavior of the corresponding autocorrelation functions (to test the time scale separation of the PCA). By comparing results obtained with distance-based, dihedral angle, and Cartesian coordinates, the study shows that the choice of input variables may drastically influence the outcome of a PCA.

  14. Urinary microRNAs for prostate cancer diagnosis, prognosis, and treatment response: are we there yet?

    PubMed

    Balacescu, Ovidiu; Petrut, Bogdan; Tudoran, Oana; Feflea, Dragos; Balacescu, Loredana; Anghel, Andrei; Sirbu, Ioan O; Seclaman, Edward; Marian, Catalin

    2017-11-01

    Prostate cancer (PCa) remains one of the leading causes of cancer-related deaths in men. Despite the tremendous progress in research over the years, a suitable minimally invasive PCa biomarker is yet to be discovered. The recent advances regarding the roles of microRNAs as biomarkers has allowed for their study in PCa as well, especially as blood-based markers. However, there are several studies that used urine as biological sample to evaluate microRNAs as biomarkers for PCa diagnosis, prognosis, and treatment response, which were reviewed herein. A high degree of inconsistency among reports has been observed, which could be due to several analytical aspects, starting with different urinary fractions used for analysis and continuing with the employment of various analytical platforms and methods of statistical analysis. However, a few microRNAs were found to be dysregulated in the urine of PCa patients, which alone or together with serum prostate-specific antigen seem to improve diagnostic power even in the gray zone of PCa. These results warrant further confirmation by larger prospective studies, preferably using a standardized protocol for analysis. WIREs RNA 2017, 8:e1438. doi: 10.1002/wrna.1438 For further resources related to this article, please visit the WIREs website. © 2017 Wiley Periodicals, Inc.

  15. Adaptive online monitoring for ICU patients by combining just-in-time learning and principal component analysis.

    PubMed

    Li, Xuejian; Wang, Youqing

    2016-12-01

    Offline general-type models are widely used for patients' monitoring in intensive care units (ICUs), which are developed by using past collected datasets consisting of thousands of patients. However, these models may fail to adapt to the changing states of ICU patients. Thus, to be more robust and effective, the monitoring models should be adaptable to individual patients. A novel combination of just-in-time learning (JITL) and principal component analysis (PCA), referred to learning-type PCA (L-PCA), was proposed for adaptive online monitoring of patients in ICUs. JITL was used to gather the most relevant data samples for adaptive modeling of complex physiological processes. PCA was used to build an online individual-type model and calculate monitoring statistics, and then to judge whether the patient's status is normal or not. The adaptability of L-PCA lies in the usage of individual data and the continuous updating of the training dataset. Twelve subjects were selected from the Physiobank's Multi-parameter Intelligent Monitoring for Intensive Care II (MIMIC II) database, and five vital signs of each subject were chosen. The proposed method was compared with the traditional PCA and fast moving-window PCA (Fast MWPCA). The experimental results demonstrated that the fault detection rates respectively increased by 20 % and 47 % compared with PCA and Fast MWPCA. L-PCA is first introduced into ICU patients monitoring and achieves the best monitoring performance in terms of adaptability to changes in patient status and sensitivity for abnormality detection.

  16. Principal Component Analysis in the Spectral Analysis of the Dynamic Laser Speckle Patterns

    NASA Astrophysics Data System (ADS)

    Ribeiro, K. M.; Braga, R. A., Jr.; Horgan, G. W.; Ferreira, D. D.; Safadi, T.

    2014-02-01

    Dynamic laser speckle is a phenomenon that interprets an optical patterns formed by illuminating a surface under changes with coherent light. Therefore, the dynamic change of the speckle patterns caused by biological material is known as biospeckle. Usually, these patterns of optical interference evolving in time are analyzed by graphical or numerical methods, and the analysis in frequency domain has also been an option, however involving large computational requirements which demands new approaches to filter the images in time. Principal component analysis (PCA) works with the statistical decorrelation of data and it can be used as a data filtering. In this context, the present work evaluated the PCA technique to filter in time the data from the biospeckle images aiming the reduction of time computer consuming and improving the robustness of the filtering. It was used 64 images of biospeckle in time observed in a maize seed. The images were arranged in a data matrix and statistically uncorrelated by PCA technique, and the reconstructed signals were analyzed using the routine graphical and numerical methods to analyze the biospeckle. Results showed the potential of the PCA tool in filtering the dynamic laser speckle data, with the definition of markers of principal components related to the biological phenomena and with the advantage of fast computational processing.

  17. Stationary Wavelet-based Two-directional Two-dimensional Principal Component Analysis for EMG Signal Classification

    NASA Astrophysics Data System (ADS)

    Ji, Yi; Sun, Shanlin; Xie, Hong-Bo

    2017-06-01

    Discrete wavelet transform (WT) followed by principal component analysis (PCA) has been a powerful approach for the analysis of biomedical signals. Wavelet coefficients at various scales and channels were usually transformed into a one-dimensional array, causing issues such as the curse of dimensionality dilemma and small sample size problem. In addition, lack of time-shift invariance of WT coefficients can be modeled as noise and degrades the classifier performance. In this study, we present a stationary wavelet-based two-directional two-dimensional principal component analysis (SW2D2PCA) method for the efficient and effective extraction of essential feature information from signals. Time-invariant multi-scale matrices are constructed in the first step. The two-directional two-dimensional principal component analysis then operates on the multi-scale matrices to reduce the dimension, rather than vectors in conventional PCA. Results are presented from an experiment to classify eight hand motions using 4-channel electromyographic (EMG) signals recorded in healthy subjects and amputees, which illustrates the efficiency and effectiveness of the proposed method for biomedical signal analysis.

  18. Fuji apple storage time rapid determination method using Vis/NIR spectroscopy.

    PubMed

    Liu, Fuqi; Tang, Xuxiang

    2015-01-01

    Fuji apple storage time rapid determination method using visible/near-infrared (Vis/NIR) spectroscopy was studied in this paper. Vis/NIR diffuse reflection spectroscopy responses to samples were measured for 6 days. Spectroscopy data were processed by stochastic resonance (SR). Principal component analysis (PCA) was utilized to analyze original spectroscopy data and SNR eigen value. Results demonstrated that PCA could not totally discriminate Fuji apples using original spectroscopy data. Signal-to-noise ratio (SNR) spectrum clearly classified all apple samples. PCA using SNR spectrum successfully discriminated apple samples. Therefore, Vis/NIR spectroscopy was effective for Fuji apple storage time rapid discrimination. The proposed method is also promising in condition safety control and management for food and environmental laboratories.

  19. Fuji apple storage time rapid determination method using Vis/NIR spectroscopy

    PubMed Central

    Liu, Fuqi; Tang, Xuxiang

    2015-01-01

    Fuji apple storage time rapid determination method using visible/near-infrared (Vis/NIR) spectroscopy was studied in this paper. Vis/NIR diffuse reflection spectroscopy responses to samples were measured for 6 days. Spectroscopy data were processed by stochastic resonance (SR). Principal component analysis (PCA) was utilized to analyze original spectroscopy data and SNR eigen value. Results demonstrated that PCA could not totally discriminate Fuji apples using original spectroscopy data. Signal-to-noise ratio (SNR) spectrum clearly classified all apple samples. PCA using SNR spectrum successfully discriminated apple samples. Therefore, Vis/NIR spectroscopy was effective for Fuji apple storage time rapid discrimination. The proposed method is also promising in condition safety control and management for food and environmental laboratories. PMID:25874818

  20. Comparative Analysis of a Principal Component Analysis-Based and an Artificial Neural Network-Based Method for Baseline Removal.

    PubMed

    Carvajal, Roberto C; Arias, Luis E; Garces, Hugo O; Sbarbaro, Daniel G

    2016-04-01

    This work presents a non-parametric method based on a principal component analysis (PCA) and a parametric one based on artificial neural networks (ANN) to remove continuous baseline features from spectra. The non-parametric method estimates the baseline based on a set of sampled basis vectors obtained from PCA applied over a previously composed continuous spectra learning matrix. The parametric method, however, uses an ANN to filter out the baseline. Previous studies have demonstrated that this method is one of the most effective for baseline removal. The evaluation of both methods was carried out by using a synthetic database designed for benchmarking baseline removal algorithms, containing 100 synthetic composed spectra at different signal-to-baseline ratio (SBR), signal-to-noise ratio (SNR), and baseline slopes. In addition to deomonstrating the utility of the proposed methods and to compare them in a real application, a spectral data set measured from a flame radiation process was used. Several performance metrics such as correlation coefficient, chi-square value, and goodness-of-fit coefficient were calculated to quantify and compare both algorithms. Results demonstrate that the PCA-based method outperforms the one based on ANN both in terms of performance and simplicity. © The Author(s) 2016.

  1. PCA as a practical indicator of OPLS-DA model reliability.

    PubMed

    Worley, Bradley; Powers, Robert

    Principal Component Analysis (PCA) and Orthogonal Projections to Latent Structures Discriminant Analysis (OPLS-DA) are powerful statistical modeling tools that provide insights into separations between experimental groups based on high-dimensional spectral measurements from NMR, MS or other analytical instrumentation. However, when used without validation, these tools may lead investigators to statistically unreliable conclusions. This danger is especially real for Partial Least Squares (PLS) and OPLS, which aggressively force separations between experimental groups. As a result, OPLS-DA is often used as an alternative method when PCA fails to expose group separation, but this practice is highly dangerous. Without rigorous validation, OPLS-DA can easily yield statistically unreliable group separation. A Monte Carlo analysis of PCA group separations and OPLS-DA cross-validation metrics was performed on NMR datasets with statistically significant separations in scores-space. A linearly increasing amount of Gaussian noise was added to each data matrix followed by the construction and validation of PCA and OPLS-DA models. With increasing added noise, the PCA scores-space distance between groups rapidly decreased and the OPLS-DA cross-validation statistics simultaneously deteriorated. A decrease in correlation between the estimated loadings (added noise) and the true (original) loadings was also observed. While the validity of the OPLS-DA model diminished with increasing added noise, the group separation in scores-space remained basically unaffected. Supported by the results of Monte Carlo analyses of PCA group separations and OPLS-DA cross-validation metrics, we provide practical guidelines and cross-validatory recommendations for reliable inference from PCA and OPLS-DA models.

  2. Principal components analysis in clinical studies.

    PubMed

    Zhang, Zhongheng; Castelló, Adela

    2017-09-01

    In multivariate analysis, independent variables are usually correlated to each other which can introduce multicollinearity in the regression models. One approach to solve this problem is to apply principal components analysis (PCA) over these variables. This method uses orthogonal transformation to represent sets of potentially correlated variables with principal components (PC) that are linearly uncorrelated. PCs are ordered so that the first PC has the largest possible variance and only some components are selected to represent the correlated variables. As a result, the dimension of the variable space is reduced. This tutorial illustrates how to perform PCA in R environment, the example is a simulated dataset in which two PCs are responsible for the majority of the variance in the data. Furthermore, the visualization of PCA is highlighted.

  3. Epileptic seizure detection in EEG signal with GModPCA and support vector machine.

    PubMed

    Jaiswal, Abeg Kumar; Banka, Haider

    2017-01-01

    Epilepsy is one of the most common neurological disorders caused by recurrent seizures. Electroencephalograms (EEGs) record neural activity and can detect epilepsy. Visual inspection of an EEG signal for epileptic seizure detection is a time-consuming process and may lead to human error; therefore, recently, a number of automated seizure detection frameworks were proposed to replace these traditional methods. Feature extraction and classification are two important steps in these procedures. Feature extraction focuses on finding the informative features that could be used for classification and correct decision-making. Therefore, proposing effective feature extraction techniques for seizure detection is of great significance. Principal Component Analysis (PCA) is a dimensionality reduction technique used in different fields of pattern recognition including EEG signal classification. Global modular PCA (GModPCA) is a variation of PCA. In this paper, an effective framework with GModPCA and Support Vector Machine (SVM) is presented for epileptic seizure detection in EEG signals. The feature extraction is performed with GModPCA, whereas SVM trained with radial basis function kernel performed the classification between seizure and nonseizure EEG signals. Seven different experimental cases were conducted on the benchmark epilepsy EEG dataset. The system performance was evaluated using 10-fold cross-validation. In addition, we prove analytically that GModPCA has less time and space complexities as compared to PCA. The experimental results show that EEG signals have strong inter-sub-pattern correlations. GModPCA and SVM have been able to achieve 100% accuracy for the classification between normal and epileptic signals. Along with this, seven different experimental cases were tested. The classification results of the proposed approach were better than were compared the results of some of the existing methods proposed in literature. It is also found that the time and space complexities of GModPCA are less as compared to PCA. This study suggests that GModPCA and SVM could be used for automated epileptic seizure detection in EEG signal.

  4. [Research on spectra recognition method for cabbages and weeds based on PCA and SIMCA].

    PubMed

    Zu, Qin; Deng, Wei; Wang, Xiu; Zhao, Chun-Jiang

    2013-10-01

    In order to improve the accuracy and efficiency of weed identification, the difference of spectral reflectance was employed to distinguish between crops and weeds. Firstly, the different combinations of Savitzky-Golay (SG) convolutional derivation and multiplicative scattering correction (MSC) method were applied to preprocess the raw spectral data. Then the clustering analysis of various types of plants was completed by using principal component analysis (PCA) method, and the feature wavelengths which were sensitive for classifying various types of plants were extracted according to the corresponding loading plots of the optimal principal components in PCA results. Finally, setting the feature wavelengths as the input variables, the soft independent modeling of class analogy (SIMCA) classification method was used to identify the various types of plants. The experimental results of classifying cabbages and weeds showed that on the basis of the optimal pretreatment by a synthetic application of MSC and SG convolutional derivation with SG's parameters set as 1rd order derivation, 3th degree polynomial and 51 smoothing points, 23 feature wavelengths were extracted in accordance with the top three principal components in PCA results. When SIMCA method was used for classification while the previously selected 23 feature wavelengths were set as the input variables, the classification rates of the modeling set and the prediction set were respectively up to 98.6% and 100%.

  5. Prostate Cancer Predictive Simulation Modelling, Assessing the Risk Technique (PCP-SMART): Introduction and Initial Clinical Efficacy Evaluation Data Presentation of a Simple Novel Mathematical Simulation Modelling Method, Devised to Predict the Outcome of Prostate Biopsy on an Individual Basis.

    PubMed

    Spyropoulos, Evangelos; Kotsiris, Dimitrios; Spyropoulos, Katherine; Panagopoulos, Aggelos; Galanakis, Ioannis; Mavrikos, Stamatios

    2017-02-01

    We developed a mathematical "prostate cancer (PCa) conditions simulating" predictive model (PCP-SMART), from which we derived a novel PCa predictor (prostate cancer risk determinator [PCRD] index) and a PCa risk equation. We used these to estimate the probability of finding PCa on prostate biopsy, on an individual basis. A total of 371 men who had undergone transrectal ultrasound-guided prostate biopsy were enrolled in the present study. Given that PCa risk relates to the total prostate-specific antigen (tPSA) level, age, prostate volume, free PSA (fPSA), fPSA/tPSA ratio, and PSA density and that tPSA ≥ 50 ng/mL has a 98.5% positive predictive value for a PCa diagnosis, we hypothesized that correlating 2 variables composed of 3 ratios (1, tPSA/age; 2, tPSA/prostate volume; and 3, fPSA/tPSA; 1 variable including the patient's tPSA and the other, a tPSA value of 50 ng/mL) could operate as a PCa conditions imitating/simulating model. Linear regression analysis was used to derive the coefficient of determination (R 2 ), termed the PCRD index. To estimate the PCRD index's predictive validity, we used the χ 2 test, multiple logistic regression analysis with PCa risk equation formation, calculation of test performance characteristics, and area under the receiver operating characteristic curve analysis using SPSS, version 22 (P < .05). The biopsy findings were positive for PCa in 167 patients (45.1%) and negative in 164 (44.2%). The PCRD index was positively signed in 89.82% positive PCa cases and negative in 91.46% negative PCa cases (χ 2 test; P < .001; relative risk, 8.98). The sensitivity was 89.8%, specificity was 91.5%, positive predictive value was 91.5%, negative predictive value was 89.8%, positive likelihood ratio was 10.5, negative likelihood ratio was 0.11, and accuracy was 90.6%. Multiple logistic regression revealed the PCRD index as an independent PCa predictor, and the formulated risk equation was 91% accurate in predicting the probability of finding PCa. On the receiver operating characteristic analysis, the PCRD index (area under the curve, 0.926) significantly (P < .001) outperformed other, established PCa predictors. The PCRD index effectively predicted the prostate biopsy outcome, correctly identifying 9 of 10 men who were eventually diagnosed with PCa and correctly ruling out PCa for 9 of 10 men who did not have PCa. Its predictive power significantly outperformed established PCa predictors, and the formulated risk equation accurately calculated the probability of finding cancer on biopsy, on an individual patient basis. Copyright © 2016 Elsevier Inc. All rights reserved.

  6. Item response theory and factor analysis as a mean to characterize occurrence of response shift in a longitudinal quality of life study in breast cancer patients

    PubMed Central

    2014-01-01

    Background The occurrence of response shift (RS) in longitudinal health-related quality of life (HRQoL) studies, reflecting patient adaptation to disease, has already been demonstrated. Several methods have been developed to detect the three different types of response shift (RS), i.e. recalibration RS, 2) reprioritization RS, and 3) reconceptualization RS. We investigated two complementary methods that characterize the occurrence of RS: factor analysis, comprising Principal Component Analysis (PCA) and Multiple Correspondence Analysis (MCA), and a method of Item Response Theory (IRT). Methods Breast cancer patients (n = 381) completed the EORTC QLQ-C30 and EORTC QLQ-BR23 questionnaires at baseline, immediately following surgery, and three and six months after surgery, according to the “then-test/post-test” design. Recalibration was explored using MCA and a model of IRT, called the Linear Logistic Model with Relaxed Assumptions (LLRA) using the then-test method. Principal Component Analysis (PCA) was used to explore reconceptualization and reprioritization. Results MCA highlighted the main profiles of recalibration: patients with high HRQoL level report a slightly worse HRQoL level retrospectively and vice versa. The LLRA model indicated a downward or upward recalibration for each dimension. At six months, the recalibration effect was statistically significant for 11/22 dimensions of the QLQ-C30 and BR23 according to the LLRA model (p ≤ 0.001). Regarding the QLQ-C30, PCA indicated a reprioritization of symptom scales and reconceptualization via an increased correlation between functional scales. Conclusions Our findings demonstrate the usefulness of these analyses in characterizing the occurrence of RS. MCA and IRT model had convergent results with then-test method to characterize recalibration component of RS. PCA is an indirect method in investigating the reprioritization and reconceptualization components of RS. PMID:24606836

  7. An application of principal component analysis to the clavicle and clavicle fixation devices.

    PubMed

    Daruwalla, Zubin J; Courtis, Patrick; Fitzpatrick, Clare; Fitzpatrick, David; Mullett, Hannan

    2010-03-26

    Principal component analysis (PCA) enables the building of statistical shape models of bones and joints. This has been used in conjunction with computer assisted surgery in the past. However, PCA of the clavicle has not been performed. Using PCA, we present a novel method that examines the major modes of size and three-dimensional shape variation in male and female clavicles and suggests a method of grouping the clavicle into size and shape categories. Twenty-one high-resolution computerized tomography scans of the clavicle were reconstructed and analyzed using a specifically developed statistical software package. After performing statistical shape analysis, PCA was applied to study the factors that account for anatomical variation. The first principal component representing size accounted for 70.5 percent of anatomical variation. The addition of a further three principal components accounted for almost 87 percent. Using statistical shape analysis, clavicles in males have a greater lateral depth and are longer, wider and thicker than in females. However, the sternal angle in females is larger than in males. PCA confirmed these differences between genders but also noted that men exhibit greater variance and classified clavicles into five morphological groups. This unique approach is the first that standardizes a clavicular orientation. It provides information that is useful to both, the biomedical engineer and clinician. Other applications include implant design with regard to modifying current or designing future clavicle fixation devices. Our findings support the need for further development of clavicle fixation devices and the questioning of whether gender-specific devices are necessary.

  8. Comparison of multi-subject ICA methods for analysis of fMRI data

    PubMed Central

    Erhardt, Erik Barry; Rachakonda, Srinivas; Bedrick, Edward; Allen, Elena; Adali, Tülay; Calhoun, Vince D.

    2010-01-01

    Spatial independent component analysis (ICA) applied to functional magnetic resonance imaging (fMRI) data identifies functionally connected networks by estimating spatially independent patterns from their linearly mixed fMRI signals. Several multi-subject ICA approaches estimating subject-specific time courses (TCs) and spatial maps (SMs) have been developed, however there has not yet been a full comparison of the implications of their use. Here, we provide extensive comparisons of four multi-subject ICA approaches in combination with data reduction methods for simulated and fMRI task data. For multi-subject ICA, the data first undergo reduction at the subject and group levels using principal component analysis (PCA). Comparisons of subject-specific, spatial concatenation, and group data mean subject-level reduction strategies using PCA and probabilistic PCA (PPCA) show that computationally intensive PPCA is equivalent to PCA, and that subject-specific and group data mean subject-level PCA are preferred because of well-estimated TCs and SMs. Second, aggregate independent components are estimated using either noise free ICA or probabilistic ICA (PICA). Third, subject-specific SMs and TCs are estimated using back-reconstruction. We compare several direct group ICA (GICA) back-reconstruction approaches (GICA1-GICA3) and an indirect back-reconstruction approach, spatio-temporal regression (STR, or dual regression). Results show the earlier group ICA (GICA1) approximates STR, however STR has contradictory assumptions and may show mixed-component artifacts in estimated SMs. Our evidence-based recommendation is to use GICA3, introduced here, with subject-specific PCA and noise-free ICA, providing the most robust and accurate estimated SMs and TCs in addition to offering an intuitive interpretation. PMID:21162045

  9. Prediction of protein-protein interactions from amino acid sequences with ensemble extreme learning machines and principal component analysis

    PubMed Central

    2013-01-01

    Background Protein-protein interactions (PPIs) play crucial roles in the execution of various cellular processes and form the basis of biological mechanisms. Although large amount of PPIs data for different species has been generated by high-throughput experimental techniques, current PPI pairs obtained with experimental methods cover only a fraction of the complete PPI networks, and further, the experimental methods for identifying PPIs are both time-consuming and expensive. Hence, it is urgent and challenging to develop automated computational methods to efficiently and accurately predict PPIs. Results We present here a novel hierarchical PCA-EELM (principal component analysis-ensemble extreme learning machine) model to predict protein-protein interactions only using the information of protein sequences. In the proposed method, 11188 protein pairs retrieved from the DIP database were encoded into feature vectors by using four kinds of protein sequences information. Focusing on dimension reduction, an effective feature extraction method PCA was then employed to construct the most discriminative new feature set. Finally, multiple extreme learning machines were trained and then aggregated into a consensus classifier by majority voting. The ensembling of extreme learning machine removes the dependence of results on initial random weights and improves the prediction performance. Conclusions When performed on the PPI data of Saccharomyces cerevisiae, the proposed method achieved 87.00% prediction accuracy with 86.15% sensitivity at the precision of 87.59%. Extensive experiments are performed to compare our method with state-of-the-art techniques Support Vector Machine (SVM). Experimental results demonstrate that proposed PCA-EELM outperforms the SVM method by 5-fold cross-validation. Besides, PCA-EELM performs faster than PCA-SVM based method. Consequently, the proposed approach can be considered as a new promising and powerful tools for predicting PPI with excellent performance and less time. PMID:23815620

  10. Sparse principal component analysis in medical shape modeling

    NASA Astrophysics Data System (ADS)

    Sjöstrand, Karl; Stegmann, Mikkel B.; Larsen, Rasmus

    2006-03-01

    Principal component analysis (PCA) is a widely used tool in medical image analysis for data reduction, model building, and data understanding and exploration. While PCA is a holistic approach where each new variable is a linear combination of all original variables, sparse PCA (SPCA) aims at producing easily interpreted models through sparse loadings, i.e. each new variable is a linear combination of a subset of the original variables. One of the aims of using SPCA is the possible separation of the results into isolated and easily identifiable effects. This article introduces SPCA for shape analysis in medicine. Results for three different data sets are given in relation to standard PCA and sparse PCA by simple thresholding of small loadings. Focus is on a recent algorithm for computing sparse principal components, but a review of other approaches is supplied as well. The SPCA algorithm has been implemented using Matlab and is available for download. The general behavior of the algorithm is investigated, and strengths and weaknesses are discussed. The original report on the SPCA algorithm argues that the ordering of modes is not an issue. We disagree on this point and propose several approaches to establish sensible orderings. A method that orders modes by decreasing variance and maximizes the sum of variances for all modes is presented and investigated in detail.

  11. The Influence Function of Principal Component Analysis by Self-Organizing Rule.

    PubMed

    Higuchi; Eguchi

    1998-07-28

    This article is concerned with a neural network approach to principal component analysis (PCA). An algorithm for PCA by the self-organizing rule has been proposed and its robustness observed through the simulation study by Xu and Yuille (1995). In this article, the robustness of the algorithm against outliers is investigated by using the theory of influence function. The influence function of the principal component vector is given in an explicit form. Through this expression, the method is shown to be robust against any directions orthogonal to the principal component vector. In addition, a statistic generated by the self-organizing rule is proposed to assess the influence of data in PCA.

  12. The impact of moderate wine consumption on the risk of developing prostate cancer

    PubMed Central

    Ferro, Matteo; Foerster, Beat; Abufaraj, Mohammad; Briganti, Alberto; Karakiewicz, Pierre I; Shariat, Shahrokh F

    2018-01-01

    Objective To investigate the impact of moderate wine consumption on the risk of prostate cancer (PCa). We focused on the differential effect of moderate consumption of red versus white wine. Design This study was a meta-analysis that includes data from case–control and cohort studies. Materials and methods A systematic search of Web of Science, Medline/PubMed, and Cochrane library was performed on December 1, 2017. Studies were deemed eligible if they assessed the risk of PCa due to red, white, or any wine using multivariable logistic regression analysis. We performed a formal meta-analysis for the risk of PCa according to moderate wine and wine type consumption (white or red). Heterogeneity between studies was assessed using Cochrane’s Q test and I2 statistics. Publication bias was assessed using Egger’s regression test. Results A total of 930 abstracts and titles were initially identified. After removal of duplicates, reviews, and conference abstracts, 83 full-text original articles were screened. Seventeen studies (611,169 subjects) were included for final evaluation and fulfilled the inclusion criteria. In the case of moderate wine consumption: the pooled risk ratio (RR) for the risk of PCa was 0.98 (95% CI 0.92–1.05, p=0.57) in the multivariable analysis. Moderate white wine consumption increased the risk of PCa with a pooled RR of 1.26 (95% CI 1.10–1.43, p=0.001) in the multi-variable analysis. Meanwhile, moderate red wine consumption had a protective role reducing the risk by 12% (RR 0.88, 95% CI 0.78–0.999, p=0.047) in the multivariable analysis that comprised 222,447 subjects. Conclusions In this meta-analysis, moderate wine consumption did not impact the risk of PCa. Interestingly, regarding the type of wine, moderate consumption of white wine increased the risk of PCa, whereas moderate consumption of red wine had a protective effect. Further analyses are needed to assess the differential molecular effect of white and red wine conferring their impact on PCa risk. PMID:29713200

  13. Improved estimation of parametric images of cerebral glucose metabolic rate from dynamic FDG-PET using volume-wise principle component analysis

    NASA Astrophysics Data System (ADS)

    Dai, Xiaoqian; Tian, Jie; Chen, Zhe

    2010-03-01

    Parametric images can represent both spatial distribution and quantification of the biological and physiological parameters of tracer kinetics. The linear least square (LLS) method is a well-estimated linear regression method for generating parametric images by fitting compartment models with good computational efficiency. However, bias exists in LLS-based parameter estimates, owing to the noise present in tissue time activity curves (TTACs) that propagates as correlated error in the LLS linearized equations. To address this problem, a volume-wise principal component analysis (PCA) based method is proposed. In this method, firstly dynamic PET data are properly pre-transformed to standardize noise variance as PCA is a data driven technique and can not itself separate signals from noise. Secondly, the volume-wise PCA is applied on PET data. The signals can be mostly represented by the first few principle components (PC) and the noise is left in the subsequent PCs. Then the noise-reduced data are obtained using the first few PCs by applying 'inverse PCA'. It should also be transformed back according to the pre-transformation method used in the first step to maintain the scale of the original data set. Finally, the obtained new data set is used to generate parametric images using the linear least squares (LLS) estimation method. Compared with other noise-removal method, the proposed method can achieve high statistical reliability in the generated parametric images. The effectiveness of the method is demonstrated both with computer simulation and with clinical dynamic FDG PET study.

  14. Fixed Eigenvector Analysis of Thermographic NDE Data

    NASA Technical Reports Server (NTRS)

    Cramer, K. Elliott; Winfree, William P.

    2011-01-01

    Principal Component Analysis (PCA) has been shown effective for reducing thermographic NDE data. This paper will discuss an alternative method of analysis that has been developed where a predetermined set of eigenvectors is used to process the thermal data from both reinforced carbon-carbon (RCC) and graphiteepoxy honeycomb materials. These eigenvectors can be generated either from an analytic model of the thermal response of the material system under examination, or from a large set of experimental data. This paper provides the details of the analytic model, an overview of the PCA process, as well as a quantitative signal-to-noise comparison of the results of performing both conventional PCA and fixed eigenvector analysis on thermographic data from two specimens, one Reinforced Carbon-Carbon with flat bottom holes and the second a sandwich construction with graphite-epoxy face sheets and aluminum honeycomb core.

  15. Health status monitoring for ICU patients based on locally weighted principal component analysis.

    PubMed

    Ding, Yangyang; Ma, Xin; Wang, Youqing

    2018-03-01

    Intelligent status monitoring for critically ill patients can help medical stuff quickly discover and assess the changes of disease and then make appropriate treatment strategy. However, general-type monitoring model now widely used is difficult to adapt the changes of intensive care unit (ICU) patients' status due to its fixed pattern, and a more robust, efficient and fast monitoring model should be developed to the individual. A data-driven learning approach combining locally weighted projection regression (LWPR) and principal component analysis (PCA) is firstly proposed and applied to monitor the nonlinear process of patients' health status in ICU. LWPR is used to approximate the complex nonlinear process with local linear models, in which PCA could be further applied to status monitoring, and finally a global weighted statistic will be acquired for detecting the possible abnormalities. Moreover, some improved versions are developed, such as LWPR-MPCA and LWPR-JPCA, which also have superior performance. Eighteen subjects were selected from the Physiobank's Multi-parameter Intelligent Monitoring for Intensive Care II (MIMIC II) database, and two vital signs of each subject were chosen for online monitoring. The proposed method was compared with several existing methods including traditional PCA, Partial least squares (PLS), just in time learning combined with modified PCA (L-PCA), and Kernel PCA (KPCA). The experimental results demonstrated that the mean fault detection rate (FDR) of PCA can be improved by 41.7% after adding LWPR. The mean FDR of LWPR-MPCA was increased by 8.3%, compared with the latest reported method L-PCA. Meanwhile, LWPR spent less training time than others, especially KPCA. LWPR is first introduced into ICU patients monitoring and achieves the best monitoring performance including adaptability to changes in patient status, sensitivity for abnormality detection as well as its fast learning speed and low computational complexity. The algorithm is an excellent approach to establishing a personalized model for patients, which is the mainstream direction of modern medicine in the following development, as well as improving the global monitoring performance. Copyright © 2017 Elsevier Ireland Ltd. All rights reserved.

  16. Circular RNA Myosin Light Chain Kinase (MYLK) Promotes Prostate Cancer Progression through Modulating Mir-29a Expression.

    PubMed

    Dai, Yuanqing; Li, Dongjie; Chen, Xiong; Tan, Xinji; Gu, Jie; Chen, Mingquan; Zhang, Xiaobo

    2018-05-25

    BACKGROUND In developed countries, prostate cancer (PCa) is a frequently diagnosed cancer with the second highest fatality rate. Circular RNAs (circRNAs) are a class of endogenous non-coding RNAs (ncRNAs) stably expressed in cells and involved in a series of carcinomas. However, few research studies have reported on the role of circRNAs in PCa. MATERIAL AND METHODS We used qRT-PCR to detect the expression of circMYLK (circRNA ID: hsa_circ_0141940) and miR-29a in PCa tissues and cell lines. MTT, colony formation, and TUNEL assays were performed to analysis the cell viability of PCa cells. Transwell and wound scratch assays were performed to investigate the cell invasion and migration of PCa cells. RESULTS In the present study, we confirmed that circMYLK expression level was significantly higher in PCa samples and PCa cells than in normal tissues and normal prostatic cells. The upregulated circRNA-MYLK promoted PCa cells proliferation, invasion, and migration; however, si-circRNA-MYLK significantly accelerated the PCa cell apoptosis. We also observed that the aforementioned function of circRNA-MYLK on PCa cells was affected through targeting miR-29a. CONCLUSIONS We confirmed circRNA-MYLK was an oncogene in PCa and revealed a novel mechanism underlying circRNA-MYLK in PC progression.

  17. The theoretical and experimental study on dicalcium phosphate dehydrate loading with protocatechuic aldehyde.

    PubMed

    Guo, Yuehua; Qu, Shuxin; Lu, Xiong; Xie, Haodong; Zhang, Hongping; Weng, Jie

    2010-07-01

    The aim of this study is to investigate the interaction between dicalcium phosphate dihydrate (CaHPO(4) x 2H(2)O, DCPD) and Protocatechuic aldehyde (C(7)H(6)O(3), Pca), which is the water-soluble constituents of Chinese Medicine, Salvia Miltiorrhiza Bunge (SMB), by calculating the absorption energy through molecular dynamics simulation. Furthermore, the effects of functional groups of Pca and temperature on Pca adsorbed by DCPD are calculated respectively. DCPD/Pca and DCPD were analyzed by X-ray diffraction (XRD), Fourier transform infrared spectroscopy (FTIR) and thermogravimetric analysis (TG). The simulation results showed that Pca mostly absorbed on the (0 2 0) surface of DCPD. The aldehyde group of Pca played a moren important role on the adsorption of Pca on DCPD than hydroxyl did, while temperature had no distinct effects on the adsorption. XRD results indicated that Pca induced the preferential growth of (0 2 0) crystal surface in DCPC/Pca whereas it had no influence on the crystal structure, the crystallinity and grain size of DCPD. FTIR and TG results showed that the characteristic peak of Pca was at 1295 cm(-1) and the content of Pca in DCPD was 16%, respectively. The present results show that molecular dynamics simulation is a very effective and complementary method to study the interaction between materials and medicine.

  18. Decomposing the Apoptosis Pathway Into Biologically Interpretable Principal Components

    PubMed Central

    Wang, Min; Kornblau, Steven M; Coombes, Kevin R

    2018-01-01

    Principal component analysis (PCA) is one of the most common techniques in the analysis of biological data sets, but applying PCA raises 2 challenges. First, one must determine the number of significant principal components (PCs). Second, because each PC is a linear combination of genes, it rarely has a biological interpretation. Existing methods to determine the number of PCs are either subjective or computationally extensive. We review several methods and describe a new R package, PCDimension, that implements additional methods, the most important being an algorithm that extends and automates a graphical Bayesian method. Using simulations, we compared the methods. Our newly automated procedure is competitive with the best methods when considering both accuracy and speed and is the most accurate when the number of objects is small compared with the number of attributes. We applied the method to a proteomics data set from patients with acute myeloid leukemia. Proteins in the apoptosis pathway could be explained using 6 PCs. By clustering the proteins in PC space, we were able to replace the PCs by 6 “biological components,” 3 of which could be immediately interpreted from the current literature. We expect this approach combining PCA with clustering to be widely applicable. PMID:29881252

  19. Direct analysis in real time mass spectrometry and multivariate data analysis: a novel approach to rapid identification of analytical markers for quality control of traditional Chinese medicine preparation.

    PubMed

    Zeng, Shanshan; Wang, Lu; Chen, Teng; Wang, Yuefei; Mo, Huanbiao; Qu, Haibin

    2012-07-06

    The paper presents a novel strategy to identify analytical markers of traditional Chinese medicine preparation (TCMP) rapidly via direct analysis in real time mass spectrometry (DART-MS). A commonly used TCMP, Danshen injection, was employed as a model. The optimal analysis conditions were achieved by measuring the contribution of various experimental parameters to the mass spectra. Salvianolic acids and saccharides were simultaneously determined within a single 1-min DART-MS run. Furthermore, spectra of Danshen injections supplied by five manufacturers were processed with principal component analysis (PCA). Obvious clustering was observed in the PCA score plot, and candidate markers were recognized from the contribution plots of PCA. The suitability of potential markers was then confirmed by contrasting with the results of traditional analysis methods. Using this strategy, fructose, glucose, sucrose, protocatechuic aldehyde and salvianolic acid A were rapidly identified as the markers of Danshen injections. The combination of DART-MS with PCA provides a reliable approach to the identification of analytical markers for quality control of TCMP. Copyright © 2012 Elsevier B.V. All rights reserved.

  20. A Genealogical Interpretation of Principal Components Analysis

    PubMed Central

    McVean, Gil

    2009-01-01

    Principal components analysis, PCA, is a statistical method commonly used in population genetics to identify structure in the distribution of genetic variation across geographical location and ethnic background. However, while the method is often used to inform about historical demographic processes, little is known about the relationship between fundamental demographic parameters and the projection of samples onto the primary axes. Here I show that for SNP data the projection of samples onto the principal components can be obtained directly from considering the average coalescent times between pairs of haploid genomes. The result provides a framework for interpreting PCA projections in terms of underlying processes, including migration, geographical isolation, and admixture. I also demonstrate a link between PCA and Wright's fst and show that SNP ascertainment has a largely simple and predictable effect on the projection of samples. Using examples from human genetics, I discuss the application of these results to empirical data and the implications for inference. PMID:19834557

  1. Application of principal component analysis (PCA) as a sensory assessment tool for fermented food products.

    PubMed

    Ghosh, Debasree; Chattopadhyay, Parimal

    2012-06-01

    The objective of the work was to use the method of quantitative descriptive analysis (QDA) to describe the sensory attributes of the fermented food products prepared with the incorporation of lactic cultures. Panellists were selected and trained to evaluate various attributes specially color and appearance, body texture, flavor, overall acceptability and acidity of the fermented food products like cow milk curd and soymilk curd, idli, sauerkraut and probiotic ice cream. Principal component analysis (PCA) identified the six significant principal components that accounted for more than 90% of the variance in the sensory attribute data. Overall product quality was modelled as a function of principal components using multiple least squares regression (R (2) = 0.8). The result from PCA was statistically analyzed by analysis of variance (ANOVA). These findings demonstrate the utility of quantitative descriptive analysis for identifying and measuring the fermented food product attributes that are important for consumer acceptability.

  2. An Intelligent Architecture Based on Field Programmable Gate Arrays Designed to Detect Moving Objects by Using Principal Component Analysis

    PubMed Central

    Bravo, Ignacio; Mazo, Manuel; Lázaro, José L.; Gardel, Alfredo; Jiménez, Pedro; Pizarro, Daniel

    2010-01-01

    This paper presents a complete implementation of the Principal Component Analysis (PCA) algorithm in Field Programmable Gate Array (FPGA) devices applied to high rate background segmentation of images. The classical sequential execution of different parts of the PCA algorithm has been parallelized. This parallelization has led to the specific development and implementation in hardware of the different stages of PCA, such as computation of the correlation matrix, matrix diagonalization using the Jacobi method and subspace projections of images. On the application side, the paper presents a motion detection algorithm, also entirely implemented on the FPGA, and based on the developed PCA core. This consists of dynamically thresholding the differences between the input image and the one obtained by expressing the input image using the PCA linear subspace previously obtained as a background model. The proposal achieves a high ratio of processed images (up to 120 frames per second) and high quality segmentation results, with a completely embedded and reliable hardware architecture based on commercial CMOS sensors and FPGA devices. PMID:22163406

  3. An intelligent architecture based on Field Programmable Gate Arrays designed to detect moving objects by using Principal Component Analysis.

    PubMed

    Bravo, Ignacio; Mazo, Manuel; Lázaro, José L; Gardel, Alfredo; Jiménez, Pedro; Pizarro, Daniel

    2010-01-01

    This paper presents a complete implementation of the Principal Component Analysis (PCA) algorithm in Field Programmable Gate Array (FPGA) devices applied to high rate background segmentation of images. The classical sequential execution of different parts of the PCA algorithm has been parallelized. This parallelization has led to the specific development and implementation in hardware of the different stages of PCA, such as computation of the correlation matrix, matrix diagonalization using the Jacobi method and subspace projections of images. On the application side, the paper presents a motion detection algorithm, also entirely implemented on the FPGA, and based on the developed PCA core. This consists of dynamically thresholding the differences between the input image and the one obtained by expressing the input image using the PCA linear subspace previously obtained as a background model. The proposal achieves a high ratio of processed images (up to 120 frames per second) and high quality segmentation results, with a completely embedded and reliable hardware architecture based on commercial CMOS sensors and FPGA devices.

  4. Accurate Structural Correlations from Maximum Likelihood Superpositions

    PubMed Central

    Theobald, Douglas L; Wuttke, Deborah S

    2008-01-01

    The cores of globular proteins are densely packed, resulting in complicated networks of structural interactions. These interactions in turn give rise to dynamic structural correlations over a wide range of time scales. Accurate analysis of these complex correlations is crucial for understanding biomolecular mechanisms and for relating structure to function. Here we report a highly accurate technique for inferring the major modes of structural correlation in macromolecules using likelihood-based statistical analysis of sets of structures. This method is generally applicable to any ensemble of related molecules, including families of nuclear magnetic resonance (NMR) models, different crystal forms of a protein, and structural alignments of homologous proteins, as well as molecular dynamics trajectories. Dominant modes of structural correlation are determined using principal components analysis (PCA) of the maximum likelihood estimate of the correlation matrix. The correlations we identify are inherently independent of the statistical uncertainty and dynamic heterogeneity associated with the structural coordinates. We additionally present an easily interpretable method (“PCA plots”) for displaying these positional correlations by color-coding them onto a macromolecular structure. Maximum likelihood PCA of structural superpositions, and the structural PCA plots that illustrate the results, will facilitate the accurate determination of dynamic structural correlations analyzed in diverse fields of structural biology. PMID:18282091

  5. Enlightening discriminative network functional modules behind Principal Component Analysis separation in differential-omic science studies

    PubMed Central

    Ciucci, Sara; Ge, Yan; Durán, Claudio; Palladini, Alessandra; Jiménez-Jiménez, Víctor; Martínez-Sánchez, Luisa María; Wang, Yuting; Sales, Susanne; Shevchenko, Andrej; Poser, Steven W.; Herbig, Maik; Otto, Oliver; Androutsellis-Theotokis, Andreas; Guck, Jochen; Gerl, Mathias J.; Cannistraci, Carlo Vittorio

    2017-01-01

    Omic science is rapidly growing and one of the most employed techniques to explore differential patterns in omic datasets is principal component analysis (PCA). However, a method to enlighten the network of omic features that mostly contribute to the sample separation obtained by PCA is missing. An alternative is to build correlation networks between univariately-selected significant omic features, but this neglects the multivariate unsupervised feature compression responsible for the PCA sample segregation. Biologists and medical researchers often prefer effective methods that offer an immediate interpretation to complicated algorithms that in principle promise an improvement but in practice are difficult to be applied and interpreted. Here we present PC-corr: a simple algorithm that associates to any PCA segregation a discriminative network of features. Such network can be inspected in search of functional modules useful in the definition of combinatorial and multiscale biomarkers from multifaceted omic data in systems and precision biomedicine. We offer proofs of PC-corr efficacy on lipidomic, metagenomic, developmental genomic, population genetic, cancer promoteromic and cancer stem-cell mechanomic data. Finally, PC-corr is a general functional network inference approach that can be easily adopted for big data exploration in computer science and analysis of complex systems in physics. PMID:28287094

  6. Markerless gating for lung cancer radiotherapy based on machine learning techniques

    NASA Astrophysics Data System (ADS)

    Lin, Tong; Li, Ruijiang; Tang, Xiaoli; Dy, Jennifer G.; Jiang, Steve B.

    2009-03-01

    In lung cancer radiotherapy, radiation to a mobile target can be delivered by respiratory gating, for which we need to know whether the target is inside or outside a predefined gating window at any time point during the treatment. This can be achieved by tracking one or more fiducial markers implanted inside or near the target, either fluoroscopically or electromagnetically. However, the clinical implementation of marker tracking is limited for lung cancer radiotherapy mainly due to the risk of pneumothorax. Therefore, gating without implanted fiducial markers is a promising clinical direction. We have developed several template-matching methods for fluoroscopic marker-less gating. Recently, we have modeled the gating problem as a binary pattern classification problem, in which principal component analysis (PCA) and support vector machine (SVM) are combined to perform the classification task. Following the same framework, we investigated different combinations of dimensionality reduction techniques (PCA and four nonlinear manifold learning methods) and two machine learning classification methods (artificial neural networks—ANN and SVM). Performance was evaluated on ten fluoroscopic image sequences of nine lung cancer patients. We found that among all combinations of dimensionality reduction techniques and classification methods, PCA combined with either ANN or SVM achieved a better performance than the other nonlinear manifold learning methods. ANN when combined with PCA achieves a better performance than SVM in terms of classification accuracy and recall rate, although the target coverage is similar for the two classification methods. Furthermore, the running time for both ANN and SVM with PCA is within tolerance for real-time applications. Overall, ANN combined with PCA is a better candidate than other combinations we investigated in this work for real-time gated radiotherapy.

  7. Perturbational formulation of principal component analysis in molecular dynamics simulation.

    PubMed

    Koyama, Yohei M; Kobayashi, Tetsuya J; Tomoda, Shuji; Ueda, Hiroki R

    2008-10-01

    Conformational fluctuations of a molecule are important to its function since such intrinsic fluctuations enable the molecule to respond to the external environmental perturbations. For extracting large conformational fluctuations, which predict the primary conformational change by the perturbation, principal component analysis (PCA) has been used in molecular dynamics simulations. However, several versions of PCA, such as Cartesian coordinate PCA and dihedral angle PCA (dPCA), are limited to use with molecules with a single dominant state or proteins where the dihedral angle represents an important internal coordinate. Other PCAs with general applicability, such as the PCA using pairwise atomic distances, do not represent the physical meaning clearly. Therefore, a formulation that provides general applicability and clearly represents the physical meaning is yet to be developed. For developing such a formulation, we consider the conformational distribution change by the perturbation with arbitrary linearly independent perturbation functions. Within the second order approximation of the Kullback-Leibler divergence by the perturbation, the PCA can be naturally interpreted as a method for (1) decomposing a given perturbation into perturbations that independently contribute to the conformational distribution change or (2) successively finding the perturbation that induces the largest conformational distribution change. In this perturbational formulation of PCA, (i) the eigenvalue measures the Kullback-Leibler divergence from the unperturbed to perturbed distributions, (ii) the eigenvector identifies the combination of the perturbation functions, and (iii) the principal component determines the probability change induced by the perturbation. Based on this formulation, we propose a PCA using potential energy terms, and we designate it as potential energy PCA (PEPCA). The PEPCA provides both general applicability and clear physical meaning. For demonstrating its power, we apply the PEPCA to an alanine dipeptide molecule in vacuum as a minimal model of a nonsingle dominant conformational biomolecule. The first and second principal components clearly characterize two stable states and the transition state between them. Positive and negative components with larger absolute values of the first and second eigenvectors identify the electrostatic interactions, which stabilize or destabilize each stable state and the transition state. Our result therefore indicates that PCA can be applied, by carefully selecting the perturbation functions, not only to identify the molecular conformational fluctuation but also to predict the conformational distribution change by the perturbation beyond the limitation of the previous methods.

  8. Perturbational formulation of principal component analysis in molecular dynamics simulation

    NASA Astrophysics Data System (ADS)

    Koyama, Yohei M.; Kobayashi, Tetsuya J.; Tomoda, Shuji; Ueda, Hiroki R.

    2008-10-01

    Conformational fluctuations of a molecule are important to its function since such intrinsic fluctuations enable the molecule to respond to the external environmental perturbations. For extracting large conformational fluctuations, which predict the primary conformational change by the perturbation, principal component analysis (PCA) has been used in molecular dynamics simulations. However, several versions of PCA, such as Cartesian coordinate PCA and dihedral angle PCA (dPCA), are limited to use with molecules with a single dominant state or proteins where the dihedral angle represents an important internal coordinate. Other PCAs with general applicability, such as the PCA using pairwise atomic distances, do not represent the physical meaning clearly. Therefore, a formulation that provides general applicability and clearly represents the physical meaning is yet to be developed. For developing such a formulation, we consider the conformational distribution change by the perturbation with arbitrary linearly independent perturbation functions. Within the second order approximation of the Kullback-Leibler divergence by the perturbation, the PCA can be naturally interpreted as a method for (1) decomposing a given perturbation into perturbations that independently contribute to the conformational distribution change or (2) successively finding the perturbation that induces the largest conformational distribution change. In this perturbational formulation of PCA, (i) the eigenvalue measures the Kullback-Leibler divergence from the unperturbed to perturbed distributions, (ii) the eigenvector identifies the combination of the perturbation functions, and (iii) the principal component determines the probability change induced by the perturbation. Based on this formulation, we propose a PCA using potential energy terms, and we designate it as potential energy PCA (PEPCA). The PEPCA provides both general applicability and clear physical meaning. For demonstrating its power, we apply the PEPCA to an alanine dipeptide molecule in vacuum as a minimal model of a nonsingle dominant conformational biomolecule. The first and second principal components clearly characterize two stable states and the transition state between them. Positive and negative components with larger absolute values of the first and second eigenvectors identify the electrostatic interactions, which stabilize or destabilize each stable state and the transition state. Our result therefore indicates that PCA can be applied, by carefully selecting the perturbation functions, not only to identify the molecular conformational fluctuation but also to predict the conformational distribution change by the perturbation beyond the limitation of the previous methods.

  9. Quantitative structure-activity relationship study of P2X7 receptor inhibitors using combination of principal component analysis and artificial intelligence methods.

    PubMed

    Ahmadi, Mehdi; Shahlaei, Mohsen

    2015-01-01

    P2X7 antagonist activity for a set of 49 molecules of the P2X7 receptor antagonists, derivatives of purine, was modeled with the aid of chemometric and artificial intelligence techniques. The activity of these compounds was estimated by means of combination of principal component analysis (PCA), as a well-known data reduction method, genetic algorithm (GA), as a variable selection technique, and artificial neural network (ANN), as a non-linear modeling method. First, a linear regression, combined with PCA, (principal component regression) was operated to model the structure-activity relationships, and afterwards a combination of PCA and ANN algorithm was employed to accurately predict the biological activity of the P2X7 antagonist. PCA preserves as much of the information as possible contained in the original data set. Seven most important PC's to the studied activity were selected as the inputs of ANN box by an efficient variable selection method, GA. The best computational neural network model was a fully-connected, feed-forward model with 7-7-1 architecture. The developed ANN model was fully evaluated by different validation techniques, including internal and external validation, and chemical applicability domain. All validations showed that the constructed quantitative structure-activity relationship model suggested is robust and satisfactory.

  10. Quantitative structure–activity relationship study of P2X7 receptor inhibitors using combination of principal component analysis and artificial intelligence methods

    PubMed Central

    Ahmadi, Mehdi; Shahlaei, Mohsen

    2015-01-01

    P2X7 antagonist activity for a set of 49 molecules of the P2X7 receptor antagonists, derivatives of purine, was modeled with the aid of chemometric and artificial intelligence techniques. The activity of these compounds was estimated by means of combination of principal component analysis (PCA), as a well-known data reduction method, genetic algorithm (GA), as a variable selection technique, and artificial neural network (ANN), as a non-linear modeling method. First, a linear regression, combined with PCA, (principal component regression) was operated to model the structure–activity relationships, and afterwards a combination of PCA and ANN algorithm was employed to accurately predict the biological activity of the P2X7 antagonist. PCA preserves as much of the information as possible contained in the original data set. Seven most important PC's to the studied activity were selected as the inputs of ANN box by an efficient variable selection method, GA. The best computational neural network model was a fully-connected, feed-forward model with 7−7−1 architecture. The developed ANN model was fully evaluated by different validation techniques, including internal and external validation, and chemical applicability domain. All validations showed that the constructed quantitative structure–activity relationship model suggested is robust and satisfactory. PMID:26600858

  11. Integrative analysis of gene expression and copy number alterations using canonical correlation analysis.

    PubMed

    Soneson, Charlotte; Lilljebjörn, Henrik; Fioretos, Thoas; Fontes, Magnus

    2010-04-15

    With the rapid development of new genetic measurement methods, several types of genetic alterations can be quantified in a high-throughput manner. While the initial focus has been on investigating each data set separately, there is an increasing interest in studying the correlation structure between two or more data sets. Multivariate methods based on Canonical Correlation Analysis (CCA) have been proposed for integrating paired genetic data sets. The high dimensionality of microarray data imposes computational difficulties, which have been addressed for instance by studying the covariance structure of the data, or by reducing the number of variables prior to applying the CCA. In this work, we propose a new method for analyzing high-dimensional paired genetic data sets, which mainly emphasizes the correlation structure and still permits efficient application to very large data sets. The method is implemented by translating a regularized CCA to its dual form, where the computational complexity depends mainly on the number of samples instead of the number of variables. The optimal regularization parameters are chosen by cross-validation. We apply the regularized dual CCA, as well as a classical CCA preceded by a dimension-reducing Principal Components Analysis (PCA), to a paired data set of gene expression changes and copy number alterations in leukemia. Using the correlation-maximizing methods, regularized dual CCA and PCA+CCA, we show that without pre-selection of known disease-relevant genes, and without using information about clinical class membership, an exploratory analysis singles out two patient groups, corresponding to well-known leukemia subtypes. Furthermore, the variables showing the highest relevance to the extracted features agree with previous biological knowledge concerning copy number alterations and gene expression changes in these subtypes. Finally, the correlation-maximizing methods are shown to yield results which are more biologically interpretable than those resulting from a covariance-maximizing method, and provide different insight compared to when each variable set is studied separately using PCA. We conclude that regularized dual CCA as well as PCA+CCA are useful methods for exploratory analysis of paired genetic data sets, and can be efficiently implemented also when the number of variables is very large.

  12. Identification of genetic risk associated with prostate cancer using ancestry informative markers

    PubMed Central

    Ricks-Santi, LJ; Apprey, V; Mason, T; Wilson, B; Abbas, M; Hernandez, W; Hooker, S; Doura, M; Bonney, G; Dunston, G; Kittles, R; Ahaghotu, C

    2014-01-01

    BACKGROUND Prostate cancer (PCa) is a common malignancy and a leading cause of cancer death among men in the United States with African-American (AA) men having the highest incidence and mortality rates. Given recent results from admixture mapping and genome-wide association studies for PCa in AA men, it is clear that many risk alleles are enriched in men with West African genetic ancestry. METHODS A total of 77 ancestry informative markers (AIMs) within surrounding candidate gene regions were genotyped and haplotyped using Pyrosequencing in 358 unrelated men enrolled in a PCa genetic association study at the Howard University Hospital between 2000 and 2004. Sequence analysis of promoter region single-nucleotide polymorphisms (SNPs) to evaluate disruption of transcription factor-binding sites was conducted using in silico methods. RESULTS Eight AIMs were significantly associated with PCa risk after adjusting for age and West African ancestry. SNP rs1993973 (intervening sequences) had the strongest association with PCa using the log-additive genetic model (P = 0.002). SNPs rs1561131 (genotypic, P = 0.007), rs1963562 (dominant, P = 0.01) and rs615382 (recessive, P = 0.009) remained highly significant after adjusting for both age and ancestry. We also tested the independent effect of each significantly associated SNP and rs1561131 (P = 0.04) and rs1963562 (P = 0.04) remained significantly associated with PCa development. After multiple comparisons testing using the false discovery rate, rs1993973 remained significant. Analysis of the rs156113–, rs1963562–rs615382l and rs1993973–rs585224 haplotypes revealed that the least frequently found haplotypes in this population were significantly associated with a decreased risk of PCa (P = 0.032 and 0.0017, respectively). CONCLUSIONS The approach for SNP selection utilized herein showed that AIMs may not only leverage increased linkage disequilibrium in populations to identify risk and protective alleles, but may also be informative in dissecting the biology of PCa and other health disparities. PMID:22801071

  13. Hyperspectral Image Denoising Using a Nonlocal Spectral Spatial Principal Component Analysis

    NASA Astrophysics Data System (ADS)

    Li, D.; Xu, L.; Peng, J.; Ma, J.

    2018-04-01

    Hyperspectral images (HSIs) denoising is a critical research area in image processing duo to its importance in improving the quality of HSIs, which has a negative impact on object detection and classification and so on. In this paper, we develop a noise reduction method based on principal component analysis (PCA) for hyperspectral imagery, which is dependent on the assumption that the noise can be removed by selecting the leading principal components. The main contribution of paper is to introduce the spectral spatial structure and nonlocal similarity of the HSIs into the PCA denoising model. PCA with spectral spatial structure can exploit spectral correlation and spatial correlation of HSI by using 3D blocks instead of 2D patches. Nonlocal similarity means the similarity between the referenced pixel and other pixels in nonlocal area, where Mahalanobis distance algorithm is used to estimate the spatial spectral similarity by calculating the distance in 3D blocks. The proposed method is tested on both simulated and real hyperspectral images, the results demonstrate that the proposed method is superior to several other popular methods in HSI denoising.

  14. Identification of rice field using Multi-Temporal NDVI and PCA method on Landsat 8 (Case Study: Demak, Central Java)

    NASA Astrophysics Data System (ADS)

    Sukmono, Abdi; Ardiansyah

    2017-01-01

    Paddy is one of the most important agricultural crop in Indonesia. Indonesia’s consumption of rice per capita in 2013 amounted to 78,82 kg/capita/year. In 2017, the Indonesian government has the mission of realizing Indonesia became self-sufficient in food. Therefore, the Indonesian government should be able to seek the stability of the fulfillment of basic needs for food, such as rice field mapping. The accurate mapping for rice field can use a quick and easy method such as Remote Sensing. In this study, multi-temporal Landsat 8 are used for identification of rice field based on Rice Planting Time. It was combined with other method for extract information from the imagery. The methods which was used Normalized Difference Vegetation Index (NDVI), Principal Component Analysis (PCA) and band combination. Image classification is processed by using nine classes, those are water, settlements, mangrove, gardens, fields, rice fields 1st, rice fields 2nd, rice fields 3rd and rice fields 4th. The results showed the rice fields area obtained from the PCA method was 50,009 ha, combination bands was 51,016 ha and NDVI method was 45,893 ha. The accuracy level was obtained PCA method (84.848%), band combination (81.818%), and NDVI method (75.758%).

  15. Improved medical image fusion based on cascaded PCA and shift invariant wavelet transforms.

    PubMed

    Reena Benjamin, J; Jayasree, T

    2018-02-01

    In the medical field, radiologists need more informative and high-quality medical images to diagnose diseases. Image fusion plays a vital role in the field of biomedical image analysis. It aims to integrate the complementary information from multimodal images, producing a new composite image which is expected to be more informative for visual perception than any of the individual input images. The main objective of this paper is to improve the information, to preserve the edges and to enhance the quality of the fused image using cascaded principal component analysis (PCA) and shift invariant wavelet transforms. A novel image fusion technique based on cascaded PCA and shift invariant wavelet transforms is proposed in this paper. PCA in spatial domain extracts relevant information from the large dataset based on eigenvalue decomposition, and the wavelet transform operating in the complex domain with shift invariant properties brings out more directional and phase details of the image. The significance of maximum fusion rule applied in dual-tree complex wavelet transform domain enhances the average information and morphological details. The input images of the human brain of two different modalities (MRI and CT) are collected from whole brain atlas data distributed by Harvard University. Both MRI and CT images are fused using cascaded PCA and shift invariant wavelet transform method. The proposed method is evaluated based on three main key factors, namely structure preservation, edge preservation, contrast preservation. The experimental results and comparison with other existing fusion methods show the superior performance of the proposed image fusion framework in terms of visual and quantitative evaluations. In this paper, a complex wavelet-based image fusion has been discussed. The experimental results demonstrate that the proposed method enhances the directional features as well as fine edge details. Also, it reduces the redundant details, artifacts, distortions.

  16. Independent components analysis to increase efficiency of discriminant analysis methods (FDA and LDA): Application to NMR fingerprinting of wine.

    PubMed

    Monakhova, Yulia B; Godelmann, Rolf; Kuballa, Thomas; Mushtakova, Svetlana P; Rutledge, Douglas N

    2015-08-15

    Discriminant analysis (DA) methods, such as linear discriminant analysis (LDA) or factorial discriminant analysis (FDA), are well-known chemometric approaches for solving classification problems in chemistry. In most applications, principle components analysis (PCA) is used as the first step to generate orthogonal eigenvectors and the corresponding sample scores are utilized to generate discriminant features for the discrimination. Independent components analysis (ICA) based on the minimization of mutual information can be used as an alternative to PCA as a preprocessing tool for LDA and FDA classification. To illustrate the performance of this ICA/DA methodology, four representative nuclear magnetic resonance (NMR) data sets of wine samples were used. The classification was performed regarding grape variety, year of vintage and geographical origin. The average increase for ICA/DA in comparison with PCA/DA in the percentage of correct classification varied between 6±1% and 8±2%. The maximum increase in classification efficiency of 11±2% was observed for discrimination of the year of vintage (ICA/FDA) and geographical origin (ICA/LDA). The procedure to determine the number of extracted features (PCs, ICs) for the optimum DA models was discussed. The use of independent components (ICs) instead of principle components (PCs) resulted in improved classification performance of DA methods. The ICA/LDA method is preferable to ICA/FDA for recognition tasks based on NMR spectroscopic measurements. Copyright © 2015 Elsevier B.V. All rights reserved.

  17. Kernel Principal Component Analysis for dimensionality reduction in fMRI-based diagnosis of ADHD.

    PubMed

    Sidhu, Gagan S; Asgarian, Nasimeh; Greiner, Russell; Brown, Matthew R G

    2012-01-01

    This study explored various feature extraction methods for use in automated diagnosis of Attention-Deficit Hyperactivity Disorder (ADHD) from functional Magnetic Resonance Image (fMRI) data. Each participant's data consisted of a resting state fMRI scan as well as phenotypic data (age, gender, handedness, IQ, and site of scanning) from the ADHD-200 dataset. We used machine learning techniques to produce support vector machine (SVM) classifiers that attempted to differentiate between (1) all ADHD patients vs. healthy controls and (2) ADHD combined (ADHD-c) type vs. ADHD inattentive (ADHD-i) type vs. controls. In different tests, we used only the phenotypic data, only the imaging data, or else both the phenotypic and imaging data. For feature extraction on fMRI data, we tested the Fast Fourier Transform (FFT), different variants of Principal Component Analysis (PCA), and combinations of FFT and PCA. PCA variants included PCA over time (PCA-t), PCA over space and time (PCA-st), and kernelized PCA (kPCA-st). Baseline chance accuracy was 64.2% produced by guessing healthy control (the majority class) for all participants. Using only phenotypic data produced 72.9% accuracy on two class diagnosis and 66.8% on three class diagnosis. Diagnosis using only imaging data did not perform as well as phenotypic-only approaches. Using both phenotypic and imaging data with combined FFT and kPCA-st feature extraction yielded accuracies of 76.0% on two class diagnosis and 68.6% on three class diagnosis-better than phenotypic-only approaches. Our results demonstrate the potential of using FFT and kPCA-st with resting-state fMRI data as well as phenotypic data for automated diagnosis of ADHD. These results are encouraging given known challenges of learning ADHD diagnostic classifiers using the ADHD-200 dataset (see Brown et al., 2012).

  18. Fourier Transform Infrared Spectroscopy (FTIR) and Multivariate Analysis for Identification of Different Vegetable Oils Used in Biodiesel Production

    PubMed Central

    Mueller, Daniela; Ferrão, Marco Flôres; Marder, Luciano; da Costa, Adilson Ben; de Cássia de Souza Schneider, Rosana

    2013-01-01

    The main objective of this study was to use infrared spectroscopy to identify vegetable oils used as raw material for biodiesel production and apply multivariate analysis to the data. Six different vegetable oil sources—canola, cotton, corn, palm, sunflower and soybeans—were used to produce biodiesel batches. The spectra were acquired by Fourier transform infrared spectroscopy using a universal attenuated total reflectance sensor (FTIR-UATR). For the multivariate analysis principal component analysis (PCA), hierarchical cluster analysis (HCA), interval principal component analysis (iPCA) and soft independent modeling of class analogy (SIMCA) were used. The results indicate that is possible to develop a methodology to identify vegetable oils used as raw material in the production of biodiesel by FTIR-UATR applying multivariate analysis. It was also observed that the iPCA found the best spectral range for separation of biodiesel batches using FTIR-UATR data, and with this result, the SIMCA method classified 100% of the soybean biodiesel samples. PMID:23539030

  19. Analysis of Lard in Lipstick Formulation Using FTIR Spectroscopy and Multivariate Calibration: A Comparison of Three Extraction Methods.

    PubMed

    Waskitho, Dri; Lukitaningsih, Endang; Sudjadi; Rohman, Abdul

    2016-01-01

    Analysis of lard extracted from lipstick formulation containing castor oil has been performed using FTIR spectroscopic method combined with multivariate calibration. Three different extraction methods were compared, namely saponification method followed by liquid/liquid extraction with hexane/dichlorometane/ethanol/water, saponification method followed by liquid/liquid extraction with dichloromethane/ethanol/water, and Bligh & Dyer method using chloroform/methanol/water as extracting solvent. Qualitative and quantitative analysis of lard were performed using principle component (PCA) and partial least square (PLS) analysis, respectively. The results showed that, in all samples prepared by the three extraction methods, PCA was capable of identifying lard at wavelength region of 1200-800 cm -1 with the best result was obtained by Bligh & Dyer method. Furthermore, PLS analysis at the same wavelength region used for qualification showed that Bligh and Dyer was the most suitable extraction method with the highest determination coefficient (R 2 ) and the lowest root mean square error of calibration (RMSEC) as well as root mean square error of prediction (RMSEP) values.

  20. The function of oxytocin: a potential biomarker for prostate cancer diagnosis and promoter of prostate cancer.

    PubMed

    Xu, Huan; Fu, Shi; Chen, Qi; Gu, Meng; Zhou, Juan; Liu, Chong; Chen, Yanbo; Wang, Zhong

    2017-05-09

    To measure the level of oxytocin in serum and prostate cancer (PCa) tissue and study its effect on the proliferation of PCa cells. Oxytocin level in serum was significantly increased in PCa patients compared with the no-carcinoma individuals. Additionally, the levels of oxytocin and its receptor were also elevated in the PCa tissue. However, no significant difference existed among the PCa of various Gleason grades. Western blot analysis confirmed the previous results and revealed an increased expression level of APPL1. The level of oxytocin in serum was measured by ELISA analysis. The expression of oxytocin and its receptor in prostate was analyzed by immunohistochemistry. The proliferation and apoptosis of PCa cells were assessed by the Cell Counting Kit 8 (CCK8) assay, cell cycle analysis and caspase3 activity analysis, respectively. Western blot analysis was used for the detection of PCNA, Caspase3 and APPL1 protein levels. Serum and prostatic oxytocin levels are increased in the PCa subjects. Serum oxytocin level may be a biomarker for PCa in the future. Oxytocin increases PCa growth and APPL1 expression.

  1. Inflammation: an important parameter in the search of prostate cancer biomarkers

    PubMed Central

    2014-01-01

    Background A more specific and early diagnostics for prostate cancer (PCa) is highly desirable. In this study, being inflammation the focus of our effort, serum protein profiles were analyzed in order to investigate if this parameter could interfere with the search of discriminating proteins between PCa and benign prostatic hyperplasia (BPH). Methods Patients with clinical suspect of PCa and candidates for trans-rectal ultrasound guided prostate biopsy (TRUS) were enrolled. Histological specimens were examined in order to grade and classify the tumor, identify BPH and detect inflammation. Surface Enhanced Laser Desorption/Ionization-Time of Flight-Mass Spectrometry (SELDI-ToF-MS) and two-dimensional gel electrophoresis (2-DE) coupled with Liquid Chromatography-MS/MS (LC-MS/MS) were used to analyze immuno-depleted serum samples from patients with PCa and BPH. Results The comparison between PCa (with and without inflammation) and BPH (with and without inflammation) serum samples by SELDI-ToF-MS analysis did not show differences in protein expression, while changes were only observed when the concomitant presence of inflammation was taken into consideration. In fact, when samples with histological sign of inflammation were excluded, 20 significantly different protein peaks were detected. Subsequent comparisons (PCa with inflammation vs PCa without inflammation, and BPH with inflammation vs BPH without inflammation) showed that 16 proteins appeared to be modified in the presence of inflammation, while 4 protein peaks were not modified. With 2-DE analysis, comparing PCa without inflammation vs PCa with inflammation, and BPH without inflammation vs the same condition in the presence of inflammation, were identified 29 and 25 differentially expressed protein spots, respectively. Excluding samples with inflammation the comparison between PCa vs BPH showed 9 unique PCa proteins, 4 of which overlapped with those previously identified in the presence of inflammation, while other 2 were new proteins, not identified in our previous comparisons. Conclusions The present study indicates that inflammation might be a confounding parameter during the proteomic research of candidate biomarkers of PCa. These results indicate that some possible biomarker-candidate proteins are strongly influenced by the presence of inflammation, hence only a well-selected protein pattern should be considered for potential marker of PCa. PMID:24944525

  2. An Estimate of the Incidence of Prostate Cancer in Africa: A Systematic Review and Meta-Analysis

    PubMed Central

    Aderemi, Adewale Victor; Iseolorunkanmi, Alexander; Oyedokun, Ayo; Ayo, Charles K.

    2016-01-01

    Background Prostate cancer (PCa) is rated the second most common cancer and sixth leading cause of cancer deaths among men globally. Reports show that African men suffer disproportionately from PCa compared to men from other parts of the world. It is still quite difficult to accurately describe the burden of PCa in Africa due to poor cancer registration systems. We systematically reviewed the literature on prostate cancer in Africa and provided a continent-wide incidence rate of PCa based on available data in the region. Methods A systematic literature search of Medline, EMBASE and Global Health from January 1980 to June 2015 was conducted, with additional search of Google Scholar, International Association of Cancer Registries (IACR), International Agency for Research on Cancer (IARC), and WHO African region websites, for studies that estimated incidence rate of PCa in any African location. Having assessed quality and consistency across selected studies, we extracted incidence rates of PCa and conducted a random effects meta-analysis. Results Our search returned 9766 records, with 40 studies spreading across 16 African countries meeting our selection criteria. We estimated a pooled PCa incidence rate of 22.0 (95% CI: 19.93–23.97) per 100,000 population, and also reported a median incidence rate of 19.5 per 100,000 population. We observed an increasing trend in PCa incidence with advancing age, and over the main years covered. Conclusion Effective cancer registration and extensive research are vital to appropriately quantifying PCa burden in Africa. We hope our findings may further assist at identifying relevant gaps, and contribute to improving knowledge, research, and interventions targeted at prostate cancer in Africa. PMID:27073921

  3. [Identification of varieties of cashmere by Vis/NIR spectroscopy technology based on PCA-SVM].

    PubMed

    Wu, Gui-Fang; He, Yong

    2009-06-01

    One mixed algorithm was presented to discriminate cashmere varieties with principal component analysis (PCA) and support vector machine (SVM). Cashmere fiber has such characteristics as threadlike, softness, glossiness and high tensile strength. The quality characters and economic value of each breed of cashmere are very different. In order to safeguard the consumer's rights and guarantee the quality of cashmere product, quickly, efficiently and correctly identifying cashmere has significant meaning to the production and transaction of cashmere material. The present research adopts Vis/NIRS spectroscopy diffuse techniques to collect the spectral data of cashmere. The near infrared fingerprint of cashmere was acquired by principal component analysis (PCA), and support vector machine (SVM) methods were used to further identify the cashmere material. The result of PCA indicated that the score map made by the scores of PC1, PC2 and PC3 was used, and 10 principal components (PCs) were selected as the input of support vector machine (SVM) based on the reliabilities of PCs of 99.99%. One hundred cashmere samples were used for calibration and the remaining 75 cashmere samples were used for validation. A one-against-all multi-class SVM model was built, the capabilities of SVM with different kernel function were comparatively analyzed, and the result showed that SVM possessing with the Gaussian kernel function has the best identification capabilities with the accuracy of 100%. This research indicated that the data mining method of PCA-SVM has a good identification effect, and can work as a new method for rapid identification of cashmere material varieties.

  4. Searching for prostate cancer by fully automated magnetic resonance imaging classification: deep learning versus non-deep learning.

    PubMed

    Wang, Xinggang; Yang, Wei; Weinreb, Jeffrey; Han, Juan; Li, Qiubai; Kong, Xiangchuang; Yan, Yongluan; Ke, Zan; Luo, Bo; Liu, Tao; Wang, Liang

    2017-11-13

    Prostate cancer (PCa) is a major cause of death since ancient time documented in Egyptian Ptolemaic mummy imaging. PCa detection is critical to personalized medicine and varies considerably under an MRI scan. 172 patients with 2,602 morphologic images (axial 2D T2-weighted imaging) of the prostate were obtained. A deep learning with deep convolutional neural network (DCNN) and a non-deep learning with SIFT image feature and bag-of-word (BoW), a representative method for image recognition and analysis, were used to distinguish pathologically confirmed PCa patients from prostate benign conditions (BCs) patients with prostatitis or prostate benign hyperplasia (BPH). In fully automated detection of PCa patients, deep learning had a statistically higher area under the receiver operating characteristics curve (AUC) than non-deep learning (P = 0.0007 < 0.001). The AUCs were 0.84 (95% CI 0.78-0.89) for deep learning method and 0.70 (95% CI 0.63-0.77) for non-deep learning method, respectively. Our results suggest that deep learning with DCNN is superior to non-deep learning with SIFT image feature and BoW model for fully automated PCa patients differentiation from prostate BCs patients. Our deep learning method is extensible to image modalities such as MR imaging, CT and PET of other organs.

  5. Characterizing the molecular features of ERG-positive tumors in primary and castration resistant prostate cancer

    PubMed Central

    Roudier, Martine P; Winters, Brian R; Coleman, Ilsa; Lam, Hung-Ming; Zhang, Xiaotun; Coleman, Roger; Chéry, Lisly; True, Lawrence D.; Higano, Celestia S.; Montgomery, Bruce; Lange, Paul H.; Snyder, Linda A.; Srivistava, Shiv; Corey, Eva; Vessella, Robert L.; Nelson, Peter S.; Üren, Aykut; Morrissey, Colm

    2017-01-01

    Background The TMPRSS2-ERG gene fusion is detected in approximately half of primary prostate cancers (PCa) yet the prognostic significance remains unclear. We hypothesized that ERG promotes the expression of common genes in primary PCa and metastatic castration-resistant PCa (CRPC), with the objective of identifying ERG-associated pathways, which may promote the transition from primary PCa to CRPC. Methods We constructed tissue microarrays (TMA) from 127 radical prostatectomy specimens, 20 LuCaP patient-derived xenografts (PDX), and 152 CRPC metastases obtained immediately at time of death. Nuclear ERG was assessed by immunohistochemistry (IHC). To characterize the molecular features of ERG-expressing PCa, a subset of IHC confirmed ERG+ or ERG-specimens including 11 radical prostatectomies, 20 LuCaP PDXs, and 45 CRPC metastases underwent gene expression analysis. Genes were ranked based on expression in primary PCa and CRPC. Common genes of interest were targeted for IHC analysis and expression compared with biochemical recurrence (BCR) status. Results IHC revealed that 43% of primary PCa, 35% of the LuCaP PDXs, and 18% of the CRPC metastases were ERG+ (12 of 48 patients [25%] had at least 1 ERG+ metastasis). Based on gene expression data and previous literature, two proteins involved in calcium signaling (NCALD, CACNA1D), a protein involved in inflammation (HLA-DMB), CD3 positive immune cells, and a novel ERG-associated protein, DCLK1 were evaluated in primary PCa and CRPC metastases. In ERG+ primary PCa, a weak association was seen with NCALD and CACNA1D protein expression. HLA-DMB expression and the presence of CD3 positive immune cells were decreased in CRPC metastases compared to primary PCa. DCLK1 was upregulated at the protein level in unpaired ERG+ primary PCa and CRPC metastases (p=0.0013 and p<0.0001, respectively). In primary PCa, ERG status or expression of targeted proteins was not associated with BCR-free survival. However for primary PCa, ERG+DCLK1+ patients exhibited shorter time to BCR (p=0.06) compared with ERG+DCLK1- patients. Conclusions This study examined ERG expression in primary PCa and CRPC. We have identified altered levels of inflammatory mediators associated with ERG expression. We determined expression of DCLK1 correlates with ERG expression and may play a role in primary PCa progression to metastatic CPRC. PMID:26990456

  6. Blind source separation problem in GPS time series

    NASA Astrophysics Data System (ADS)

    Gualandi, A.; Serpelloni, E.; Belardinelli, M. E.

    2016-04-01

    A critical point in the analysis of ground displacement time series, as those recorded by space geodetic techniques, is the development of data-driven methods that allow the different sources of deformation to be discerned and characterized in the space and time domains. Multivariate statistic includes several approaches that can be considered as a part of data-driven methods. A widely used technique is the principal component analysis (PCA), which allows us to reduce the dimensionality of the data space while maintaining most of the variance of the dataset explained. However, PCA does not perform well in finding the solution to the so-called blind source separation (BSS) problem, i.e., in recovering and separating the original sources that generate the observed data. This is mainly due to the fact that PCA minimizes the misfit calculated using an L2 norm (χ 2), looking for a new Euclidean space where the projected data are uncorrelated. The independent component analysis (ICA) is a popular technique adopted to approach the BSS problem. However, the independence condition is not easy to impose, and it is often necessary to introduce some approximations. To work around this problem, we test the use of a modified variational Bayesian ICA (vbICA) method to recover the multiple sources of ground deformation even in the presence of missing data. The vbICA method models the probability density function (pdf) of each source signal using a mix of Gaussian distributions, allowing for more flexibility in the description of the pdf of the sources with respect to standard ICA, and giving a more reliable estimate of them. Here we present its application to synthetic global positioning system (GPS) position time series, generated by simulating deformation near an active fault, including inter-seismic, co-seismic, and post-seismic signals, plus seasonal signals and noise, and an additional time-dependent volcanic source. We evaluate the ability of the PCA and ICA decomposition techniques in explaining the data and in recovering the original (known) sources. Using the same number of components, we find that the vbICA method fits the data almost as well as a PCA method, since the χ 2 increase is less than 10 % the value calculated using a PCA decomposition. Unlike PCA, the vbICA algorithm is found to correctly separate the sources if the correlation of the dataset is low (<0.67) and the geodetic network is sufficiently dense (ten continuous GPS stations within a box of side equal to two times the locking depth of a fault where an earthquake of Mw >6 occurred). We also provide a cookbook for the use of the vbICA algorithm in analyses of position time series for tectonic and non-tectonic applications.

  7. A Novel Weighted Kernel PCA-Based Method for Optimization and Uncertainty Quantification

    NASA Astrophysics Data System (ADS)

    Thimmisetty, C.; Talbot, C.; Chen, X.; Tong, C. H.

    2016-12-01

    It has been demonstrated that machine learning methods can be successfully applied to uncertainty quantification for geophysical systems through the use of the adjoint method coupled with kernel PCA-based optimization. In addition, it has been shown through weighted linear PCA how optimization with respect to both observation weights and feature space control variables can accelerate convergence of such methods. Linear machine learning methods, however, are inherently limited in their ability to represent features of non-Gaussian stochastic random fields, as they are based on only the first two statistical moments of the original data. Nonlinear spatial relationships and multipoint statistics leading to the tortuosity characteristic of channelized media, for example, are captured only to a limited extent by linear PCA. With the aim of coupling the kernel-based and weighted methods discussed, we present a novel mathematical formulation of kernel PCA, Weighted Kernel Principal Component Analysis (WKPCA), that both captures nonlinear relationships and incorporates the attribution of significance levels to different realizations of the stochastic random field of interest. We also demonstrate how new instantiations retaining defining characteristics of the random field can be generated using Bayesian methods. In particular, we present a novel WKPCA-based optimization method that minimizes a given objective function with respect to both feature space random variables and observation weights through which optimal snapshot significance levels and optimal features are learned. We showcase how WKPCA can be applied to nonlinear optimal control problems involving channelized media, and in particular demonstrate an application of the method to learning the spatial distribution of material parameter values in the context of linear elasticity, and discuss further extensions of the method to stochastic inversion.

  8. Sample-space-based feature extraction and class preserving projection for gene expression data.

    PubMed

    Wang, Wenjun

    2013-01-01

    In order to overcome the problems of high computational complexity and serious matrix singularity for feature extraction using Principal Component Analysis (PCA) and Fisher's Linear Discrinimant Analysis (LDA) in high-dimensional data, sample-space-based feature extraction is presented, which transforms the computation procedure of feature extraction from gene space to sample space by representing the optimal transformation vector with the weighted sum of samples. The technique is used in the implementation of PCA, LDA, Class Preserving Projection (CPP) which is a new method for discriminant feature extraction proposed, and the experimental results on gene expression data demonstrate the effectiveness of the method.

  9. [Near infrared reflectance spectroscopy (NIRS): a novel approach to reconstructing historical changes of primary productivity in Antarctic lake].

    PubMed

    Chen, Qian-Qian; Liu, Xiao-Dong; Liu, Wen-Qi; Jiang, Shan

    2011-10-01

    Compared with traditional chemical analysis methods, reflectance spectroscopy has the advantages of speed, minimal or no sample preparation, non-destruction, and low cost. In order to explore the potential application of spectroscopy technology in the paleolimnological study on Antarctic lakes, we took a lake sediment core in Mochou Lake at Zhongshan Station of Antarctic, and analyzed the near infrared reflectance spectroscopy (NIRS) data in the sedimentary samples. The results showed that the factor loadings of principal component analysis (PCA) displayed very similar depth-profile change pattern with the S2 index, a reliable proxy for the change in historical lake primary productivity. The correlation analysis showed that the values of PCA factor loading and S2 were correlated significantly, suggesting that it is feasible to infer paleoproductivity changes recorded in Antarctic lakes using NIRS technology. Compared to the traditional method of the trough area between 650 and 700 nm, the authors found that the PCA statistical approach was more accurate for reconstructing the change in historical lake primary productivity. The results reported here demonstrate that reflectance spectroscopy can provide a rapid method for the reconstruction of lake palaeoenviro nmental change in the remote Antarctic regions.

  10. An In Vitro Spectroscopic Analysis to Determine Whether para-Chloroaniline is Produced from Mixing Sodium Hypochlorite and Chlorhexidine

    PubMed Central

    Thomas, John E.; Sem, Daniel S.

    2009-01-01

    Introduction The purpose of this in vitro study was to determine whether para-chloroaniline (PCA) is formed through the reaction of mixing sodium hypochlorite (NaOCl) and chlorhexidine (CHX). Methods Initially commercially available samples of chlorhexidine acetate (CHXa) and PCA were analyzed with 1H NMR spectroscopy. Two solutions, NaOCl and CHXa, were warmed to 37°C and when mixed they produced a brown precipitate. This precipitate was separated in half and pure PCA was added to one of the samples for comparison before they were each analyzed with 1H NMR spectroscopy. Results The peaks in the 1H NMR spectra of CHXa and PCA were assigned to specific protons of the molecules, and the location of the aromatic peaks in the PCA spectrum defined the PCA doublet region. While the spectrum of the precipitate alone resulted in a complex combination of peaks, upon magnification there were no peaks in the PCA doublet region which were intense enough to be quantified. In the spectrum of the precipitate, to which PCA was added, two peaks do appear in the PCA doublet region. Comparing this spectrum to that of precipitate alone, the peaks in the PCA doublet region are not visible prior to the addition of PCA. Conclusions Based on this in vitro study, the reaction mixture of NaOCl and CHXa does not produce PCA at any measurable quantity and further investigation is needed to determine the chemical composition of the brown precipitate. PMID:20113799

  11. An Exploratory Study on Using Principal-Component Analysis and Confirmatory Factor Analysis to Identify Bolt-On Dimensions: The EQ-5D Case Study.

    PubMed

    Finch, Aureliano Paolo; Brazier, John Edward; Mukuria, Clara; Bjorner, Jakob Bue

    2017-12-01

    Generic preference-based measures such as the EuroQol five-dimensional questionnaire (EQ-5D) are used in economic evaluation, but may not be appropriate for all conditions. When this happens, a possible solution is adding bolt-ons to expand their descriptive systems. Using review-based methods, studies published to date claimed the relevance of bolt-ons in the presence of poor psychometric results. This approach does not identify the specific dimensions missing from the Generic preference-based measure core descriptive system, and is inappropriate for identifying dimensions that might improve the measure generically. This study explores the use of principal-component analysis (PCA) and confirmatory factor analysis (CFA) for bolt-on identification in the EQ-5D. Data were drawn from the international Multi-Instrument Comparison study, which is an online survey on health and well-being measures in five countries. Analysis was based on a pool of 92 items from nine instruments. Initial content analysis provided a theoretical framework for PCA results interpretation and CFA model development. PCA was used to investigate the underlining dimensional structure and whether EQ-5D items were represented in the identified constructs. CFA was used to confirm the structure. CFA was cross-validated in random halves of the sample. PCA suggested a nine-component solution, which was confirmed by CFA. This included psychological symptoms, physical functioning, and pain, which were covered by the EQ-5D, and satisfaction, speech/cognition,relationships, hearing, vision, and energy/sleep which were not. These latter factors may represent relevant candidate bolt-ons. PCA and CFA appear useful methods for identifying potential bolt-ons dimensions for an instrument such as the EQ-5D. Copyright © 2017 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.

  12. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lu, Bo, E-mail: luboufl@gmail.com; Park, Justin C.; Fan, Qiyong

    Purpose: Accurately localizing lung tumor localization is essential for high-precision radiation therapy techniques such as stereotactic body radiation therapy (SBRT). Since direct monitoring of tumor motion is not always achievable due to the limitation of imaging modalities for treatment guidance, placement of fiducial markers on the patient’s body surface to act as a surrogate for tumor position prediction is a practical alternative for tracking lung tumor motion during SBRT treatments. In this work, the authors propose an innovative and robust model to solve the multimarker position optimization problem. The model is able to overcome the major drawbacks of the sparsemore » optimization approach (SOA) model. Methods: The principle-component-analysis (PCA) method was employed as the framework to build the authors’ statistical prediction model. The method can be divided into two stages. The first stage is to build the surrogate tumor matrix and calculate its eigenvalues and associated eigenvectors. The second stage is to determine the “best represented” columns of the eigenvector matrix obtained from stage one and subsequently acquire the optimal marker positions as well as numbers. Using 4-dimensional CT (4DCT) and breath hold CT imaging data, the PCA method was compared to the SOA method with respect to calculation time, average prediction accuracy, prediction stability, noise resistance, marker position consistency, and marker distribution. Results: The PCA and SOA methods which were both tested were on all 11 patients for a total of 130 cases including 4DCT and breath-hold CT scenarios. The maximum calculation time for the PCA method was less than 1 s with 64 752 surface points, whereas the average calculation time for the SOA method was over 12 min with 400 surface points. Overall, the tumor center position prediction errors were comparable between the two methods, and all were less than 1.5 mm. However, for the extreme scenarios (breath hold), the prediction errors for the PCA method were not only smaller, but were also more stable than for the SOA method. Results obtained by imposing a series of random noises to the surrogates indicated that the PCA method was much more noise resistant than the SOA method. The marker position consistency tests using various combinations of 4DCT phases to construct the surrogates suggested that the marker position predictions of the PCA method were more consistent than those of the SOA method, in spite of surrogate construction. Marker distribution tests indicated that greater than 80% of the calculated marker positions fell into the high cross correlation and high motion magnitude regions for both of the algorithms. Conclusions: The PCA model is an accurate, efficient, robust, and practical model for solving the multimarker position optimization problem to predict lung tumor motion during SBRT treatments. Due to its generality, PCA model can also be applied to other imaging guidance system whichever using surface motion as the surrogates.« less

  13. Investigation of probabilistic principal component analysis compared to proper orthogonal decomposition methods for basis extraction and missing data estimation

    NASA Astrophysics Data System (ADS)

    Lee, Kyunghoon

    To evaluate the maximum likelihood estimates (MLEs) of probabilistic principal component analysis (PPCA) parameters such as a factor-loading, PPCA can invoke an expectation-maximization (EM) algorithm, yielding an EM algorithm for PPCA (EM-PCA). In order to examine the benefits of the EM-PCA for aerospace engineering applications, this thesis attempts to qualitatively and quantitatively scrutinize the EM-PCA alongside both POD and gappy POD using high-dimensional simulation data. In pursuing qualitative investigations, the theoretical relationship between POD and PPCA is transparent such that the factor-loading MLE of PPCA, evaluated by the EM-PCA, pertains to an orthogonal basis obtained by POD. By contrast, the analytical connection between gappy POD and the EM-PCA is nebulous because they distinctively approximate missing data due to their antithetical formulation perspectives: gappy POD solves a least-squares problem whereas the EM-PCA relies on the expectation of the observation probability model. To juxtapose both gappy POD and the EM-PCA, this research proposes a unifying least-squares perspective that embraces the two disparate algorithms within a generalized least-squares framework. As a result, the unifying perspective reveals that both methods address similar least-squares problems; however, their formulations contain dissimilar bases and norms. Furthermore, this research delves into the ramifications of the different bases and norms that will eventually characterize the traits of both methods. To this end, two hybrid algorithms of gappy POD and the EM-PCA are devised and compared to the original algorithms for a qualitative illustration of the different basis and norm effects. After all, a norm reflecting a curve-fitting method is found to more significantly affect estimation error reduction than a basis for two example test data sets: one is absent of data only at a single snapshot and the other misses data across all the snapshots. From a numerical performance aspect, the EM-PCA is computationally less efficient than POD for intact data since it suffers from slow convergence inherited from the EM algorithm. For incomplete data, this thesis quantitatively found that the number of data missing snapshots predetermines whether the EM-PCA or gappy POD outperforms the other because of the computational cost of a coefficient evaluation, resulting from a norm selection. For instance, gappy POD demands laborious computational effort in proportion to the number of data-missing snapshots as a consequence of the gappy norm. In contrast, the computational cost of the EM-PCA is invariant to the number of data-missing snapshots thanks to the L2 norm. In general, the higher the number of data-missing snapshots, the wider the gap between the computational cost of gappy POD and the EM-PCA. Based on the numerical experiments reported in this thesis, the following criterion is recommended regarding the selection between gappy POD and the EM-PCA for computational efficiency: gappy POD for an incomplete data set containing a few data-missing snapshots and the EM-PCA for an incomplete data set involving multiple data-missing snapshots. Last, the EM-PCA is applied to two aerospace applications in comparison to gappy POD as a proof of concept: one with an emphasis on basis extraction and the other with a focus on missing data reconstruction for a given incomplete data set with scattered missing data. The first application exploits the EM-PCA to efficiently construct reduced-order models of engine deck responses obtained by the numerical propulsion system simulation (NPSS), some of whose results are absent due to failed analyses caused by numerical instability. Model-prediction tests validate that engine performance metrics estimated by the reduced-order NPSS model exhibit considerably good agreement with those directly obtained by NPSS. Similarly, the second application illustrates that the EM-PCA is significantly more cost effective than gappy POD at repairing spurious PIV measurements obtained from acoustically-excited, bluff-body jet flow experiments. The EM-PCA reduces computational cost on factors 8 ˜ 19 compared to gappy POD while generating the same restoration results as those evaluated by gappy POD. All in all, through comprehensive theoretical and numerical investigation, this research establishes that the EM-PCA is an efficient alternative to gappy POD for an incomplete data set containing missing data over an entire data set. (Abstract shortened by UMI.)

  14. IMPROVED SEARCH OF PRINCIPAL COMPONENT ANALYSIS DATABASES FOR SPECTRO-POLARIMETRIC INVERSION

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Casini, R.; Lites, B. W.; Ramos, A. Asensio

    2013-08-20

    We describe a simple technique for the acceleration of spectro-polarimetric inversions based on principal component analysis (PCA) of Stokes profiles. This technique involves the indexing of the database models based on the sign of the projections (PCA coefficients) of the first few relevant orders of principal components of the four Stokes parameters. In this way, each model in the database can be attributed a distinctive binary number of 2{sup 4n} bits, where n is the number of PCA orders used for the indexing. Each of these binary numbers (indices) identifies a group of ''compatible'' models for the inversion of amore » given set of observed Stokes profiles sharing the same index. The complete set of the binary numbers so constructed evidently determines a partition of the database. The search of the database for the PCA inversion of spectro-polarimetric data can profit greatly from this indexing. In practical cases it becomes possible to approach the ideal acceleration factor of 2{sup 4n} as compared to the systematic search of a non-indexed database for a traditional PCA inversion. This indexing method relies on the existence of a physical meaning in the sign of the PCA coefficients of a model. For this reason, the presence of model ambiguities and of spectro-polarimetric noise in the observations limits in practice the number n of relevant PCA orders that can be used for the indexing.« less

  15. Discrimination of healthy and osteoarthritic articular cartilage by Fourier transform infrared imaging and Fisher’s discriminant analysis

    PubMed Central

    Mao, Zhi-Hua; Yin, Jian-Hua; Zhang, Xue-Xi; Wang, Xiao; Xia, Yang

    2016-01-01

    Fourier transform infrared spectroscopic imaging (FTIRI) technique can be used to obtain the quantitative information of content and spatial distribution of principal components in cartilage by combining with chemometrics methods. In this study, FTIRI combining with principal component analysis (PCA) and Fisher’s discriminant analysis (FDA) was applied to identify the healthy and osteoarthritic (OA) articular cartilage samples. Ten 10-μm thick sections of canine cartilages were imaged at 6.25μm/pixel in FTIRI. The infrared spectra extracted from the FTIR images were imported into SPSS software for PCA and FDA. Based on the PCA result of 2 principal components, the healthy and OA cartilage samples were effectively discriminated by the FDA with high accuracy of 94% for the initial samples (training set) and cross validation, as well as 86.67% for the prediction group. The study showed that cartilage degeneration became gradually weak with the increase of the depth. FTIRI combined with chemometrics may become an effective method for distinguishing healthy and OA cartilages in future. PMID:26977354

  16. Detection of l-Cysteine in wheat flour by Raman microspectroscopy combined chemometrics of HCA and PCA.

    PubMed

    Cebi, Nur; Dogan, Canan Ekinci; Develioglu, Ayşen; Yayla, Mediha Esra Altuntop; Sagdic, Osman

    2017-08-01

    l-Cysteine is deliberately added to various flour types since l-Cysteine has enabled favorable baking conditions such as low viscosity, increased elasticity and rise during baking. In Turkey, usage of l-Cysteine as a food additive isn't allowed in wheat flour according to the Turkish Food Codex Regulation on food additives. There is an urgent need for effective methods to detect l-Cysteine in wheat flour. In this study, for the first time, a new, rapid, effective, non-destructive and cost-effective method was developed for detection of l-Cysteine in wheat flour using Raman microscopy. Detection of l-Cysteine in wheat flour was accomplished successfully using Raman microscopy combined chemometrics of PCA (Principal Component Analysis) and HCA (Hierarchical Cluster Analysis). In this work, 500-2000cm -1 spectral range (fingerprint region) was determined to perform PCA and HCA analysis. l-Cysteine and l-Cystine were determined with detection limit of 0.125% (w/w) in different wheat flour samples. Copyright © 2017 Elsevier Ltd. All rights reserved.

  17. PCA leverage: outlier detection for high-dimensional functional magnetic resonance imaging data.

    PubMed

    Mejia, Amanda F; Nebel, Mary Beth; Eloyan, Ani; Caffo, Brian; Lindquist, Martin A

    2017-07-01

    Outlier detection for high-dimensional (HD) data is a popular topic in modern statistical research. However, one source of HD data that has received relatively little attention is functional magnetic resonance images (fMRI), which consists of hundreds of thousands of measurements sampled at hundreds of time points. At a time when the availability of fMRI data is rapidly growing-primarily through large, publicly available grassroots datasets-automated quality control and outlier detection methods are greatly needed. We propose principal components analysis (PCA) leverage and demonstrate how it can be used to identify outlying time points in an fMRI run. Furthermore, PCA leverage is a measure of the influence of each observation on the estimation of principal components, which are often of interest in fMRI data. We also propose an alternative measure, PCA robust distance, which is less sensitive to outliers and has controllable statistical properties. The proposed methods are validated through simulation studies and are shown to be highly accurate. We also conduct a reliability study using resting-state fMRI data from the Autism Brain Imaging Data Exchange and find that removal of outliers using the proposed methods results in more reliable estimation of subject-level resting-state networks using independent components analysis. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  18. The 20th Annual Prostate Cancer Foundation Scientific Retreat report.

    PubMed

    Miyahira, Andrea K; Simons, Jonathan W; Soule, Howard R

    2014-06-01

    The 20th Annual Prostate Cancer Foundation (PCF) Scientific Retreat was held from October 24 to 26, 2013, in National Harbor, Maryland. This event is held annually for the purpose of convening a diverse group of leading experimental and clinical researchers from academia, industry, and government to present and discuss critical and emerging topics relevant to prostate cancer (PCa) biology, and the diagnosis, prognosis, and treatment of PCa patients, with a focus on results that will lend to treatments for the most life-threatening stages of this disease. The themes that were highlighted at this year's event included: (i) mechanisms of PCa initiation and progression: cellular origins, neurons and neuroendocrine PCa, long non-coding RNAs, epigenetics, tumor cell metabolism, tumor-immune interactions, and novel molecular mechanisms; (ii) advancements in precision medicine strategies and predictive biomarkers of progression, survival, and drug sensitivities, including the analysis of circulating tumor cells and cell-free tumor DNA-new methods for liquid biopsies; (iii) new treatments including epigenomic therapy and immunotherapy, discovery of new treatment targets, and defining and targeting mechanisms of resistance to androgen-axis therapeutics; and (iv) new experimental and clinical epidemiology methods and techniques, including PCa population studies using patho-epidemiology. © 2014 Wiley Periodicals, Inc.

  19. Facilitating text reading in posterior cortical atrophy

    PubMed Central

    Rajdev, Kishan; Shakespeare, Timothy J.; Leff, Alexander P.; Crutch, Sebastian J.

    2015-01-01

    Objective: We report (1) the quantitative investigation of text reading in posterior cortical atrophy (PCA), and (2) the effects of 2 novel software-based reading aids that result in dramatic improvements in the reading ability of patients with PCA. Methods: Reading performance, eye movements, and fixations were assessed in patients with PCA and typical Alzheimer disease and in healthy controls (experiment 1). Two reading aids (single- and double-word) were evaluated based on the notion that reducing the spatial and oculomotor demands of text reading might support reading in PCA (experiment 2). Results: Mean reading accuracy in patients with PCA was significantly worse (57%) compared with both patients with typical Alzheimer disease (98%) and healthy controls (99%); spatial aspects of passages were the primary determinants of text reading ability in PCA. Both aids led to considerable gains in reading accuracy (PCA mean reading accuracy: single-word reading aid = 96%; individual patient improvement range: 6%–270%) and self-rated measures of reading. Data suggest a greater efficiency of fixations and eye movements under the single-word reading aid in patients with PCA. Conclusions: These findings demonstrate how neurologic characterization of a neurodegenerative syndrome (PCA) and detailed cognitive analysis of an important everyday skill (reading) can combine to yield aids capable of supporting important everyday functional abilities. Classification of evidence: This study provides Class III evidence that for patients with PCA, 2 software-based reading aids (single-word and double-word) improve reading accuracy. PMID:26138948

  20. Principal component analysis vs. self-organizing maps combined with hierarchical clustering for pattern recognition in volcano seismic spectra

    NASA Astrophysics Data System (ADS)

    Unglert, K.; Radić, V.; Jellinek, A. M.

    2016-06-01

    Variations in the spectral content of volcano seismicity related to changes in volcanic activity are commonly identified manually in spectrograms. However, long time series of monitoring data at volcano observatories require tools to facilitate automated and rapid processing. Techniques such as self-organizing maps (SOM) and principal component analysis (PCA) can help to quickly and automatically identify important patterns related to impending eruptions. For the first time, we evaluate the performance of SOM and PCA on synthetic volcano seismic spectra constructed from observations during two well-studied eruptions at Klauea Volcano, Hawai'i, that include features observed in many volcanic settings. In particular, our objective is to test which of the techniques can best retrieve a set of three spectral patterns that we used to compose a synthetic spectrogram. We find that, without a priori knowledge of the given set of patterns, neither SOM nor PCA can directly recover the spectra. We thus test hierarchical clustering, a commonly used method, to investigate whether clustering in the space of the principal components and on the SOM, respectively, can retrieve the known patterns. Our clustering method applied to the SOM fails to detect the correct number and shape of the known input spectra. In contrast, clustering of the data reconstructed by the first three PCA modes reproduces these patterns and their occurrence in time more consistently. This result suggests that PCA in combination with hierarchical clustering is a powerful practical tool for automated identification of characteristic patterns in volcano seismic spectra. Our results indicate that, in contrast to PCA, common clustering algorithms may not be ideal to group patterns on the SOM and that it is crucial to evaluate the performance of these tools on a control dataset prior to their application to real data.

  1. Rotation of EOFs by the Independent Component Analysis: Towards A Solution of the Mixing Problem in the Decomposition of Geophysical Time Series

    NASA Technical Reports Server (NTRS)

    Aires, Filipe; Rossow, William B.; Chedin, Alain; Hansen, James E. (Technical Monitor)

    2001-01-01

    The Independent Component Analysis is a recently developed technique for component extraction. This new method requires the statistical independence of the extracted components, a stronger constraint that uses higher-order statistics, instead of the classical decorrelation, a weaker constraint that uses only second-order statistics. This technique has been used recently for the analysis of geophysical time series with the goal of investigating the causes of variability in observed data (i.e. exploratory approach). We demonstrate with a data simulation experiment that, if initialized with a Principal Component Analysis, the Independent Component Analysis performs a rotation of the classical PCA (or EOF) solution. This rotation uses no localization criterion like other Rotation Techniques (RT), only the global generalization of decorrelation by statistical independence is used. This rotation of the PCA solution seems to be able to solve the tendency of PCA to mix several physical phenomena, even when the signal is just their linear sum.

  2. A SPECTRAL GRAPH APPROACH TO DISCOVERING GENETIC ANCESTRY1

    PubMed Central

    Lee, Ann B.; Luca, Diana; Roeder, Kathryn

    2010-01-01

    Mapping human genetic variation is fundamentally interesting in fields such as anthropology and forensic inference. At the same time, patterns of genetic diversity confound efforts to determine the genetic basis of complex disease. Due to technological advances, it is now possible to measure hundreds of thousands of genetic variants per individual across the genome. Principal component analysis (PCA) is routinely used to summarize the genetic similarity between subjects. The eigenvectors are interpreted as dimensions of ancestry. We build on this idea using a spectral graph approach. In the process we draw on connections between multidimensional scaling and spectral kernel methods. Our approach, based on a spectral embedding derived from the normalized Laplacian of a graph, can produce more meaningful delineation of ancestry than by using PCA. The method is stable to outliers and can more easily incorporate different similarity measures of genetic data than PCA. We illustrate a new algorithm for genetic clustering and association analysis on a large, genetically heterogeneous sample. PMID:20689656

  3. Comparison of 3 Methods for Identifying Dietary Patterns Associated With Risk of Disease

    PubMed Central

    DiBello, Julia R.; Kraft, Peter; McGarvey, Stephen T.; Goldberg, Robert; Campos, Hannia

    2008-01-01

    Reduced rank regression and partial least-squares regression (PLS) are proposed alternatives to principal component analysis (PCA). Using all 3 methods, the authors derived dietary patterns in Costa Rican data collected on 3,574 cases and controls in 1994–2004 and related the resulting patterns to risk of first incident myocardial infarction. Four dietary patterns associated with myocardial infarction were identified. Factor 1, characterized by high intakes of lean chicken, vegetables, fruit, and polyunsaturated oil, was generated by all 3 dietary pattern methods and was associated with a significantly decreased adjusted risk of myocardial infarction (28%–46%, depending on the method used). PCA and PLS also each yielded a pattern associated with a significantly decreased risk of myocardial infarction (31% and 23%, respectively); this pattern was characterized by moderate intake of alcohol and polyunsaturated oil and low intake of high-fat dairy products. The fourth factor derived from PCA was significantly associated with a 38% increased risk of myocardial infarction and was characterized by high intakes of coffee and palm oil. Contrary to previous studies, the authors found PCA and PLS to produce more patterns associated with cardiovascular disease than reduced rank regression. The most effective method for deriving dietary patterns related to disease may vary depending on the study goals. PMID:18945692

  4. Quantitative analysis of NMR spectra with chemometrics

    NASA Astrophysics Data System (ADS)

    Winning, H.; Larsen, F. H.; Bro, R.; Engelsen, S. B.

    2008-01-01

    The number of applications of chemometrics to series of NMR spectra is rapidly increasing due to an emerging interest for quantitative NMR spectroscopy e.g. in the pharmaceutical and food industries. This paper gives an analysis of advantages and limitations of applying the two most common chemometric procedures, Principal Component Analysis (PCA) and Multivariate Curve Resolution (MCR), to a designed set of 231 simple alcohol mixture (propanol, butanol and pentanol) 1H 400 MHz spectra. The study clearly demonstrates that the major advantage of chemometrics is the visualisation of larger data structures which adds a new exploratory dimension to NMR research. While robustness and powerful data visualisation and exploration are the main qualities of the PCA method, the study demonstrates that the bilinear MCR method is an even more powerful method for resolving pure component NMR spectra from mixtures when certain conditions are met.

  5. Principal component analysis on a torus: Theory and application to protein dynamics.

    PubMed

    Sittel, Florian; Filk, Thomas; Stock, Gerhard

    2017-12-28

    A dimensionality reduction method for high-dimensional circular data is developed, which is based on a principal component analysis (PCA) of data points on a torus. Adopting a geometrical view of PCA, various distance measures on a torus are introduced and the associated problem of projecting data onto the principal subspaces is discussed. The main idea is that the (periodicity-induced) projection error can be minimized by transforming the data such that the maximal gap of the sampling is shifted to the periodic boundary. In a second step, the covariance matrix and its eigendecomposition can be computed in a standard manner. Adopting molecular dynamics simulations of two well-established biomolecular systems (Aib 9 and villin headpiece), the potential of the method to analyze the dynamics of backbone dihedral angles is demonstrated. The new approach allows for a robust and well-defined construction of metastable states and provides low-dimensional reaction coordinates that accurately describe the free energy landscape. Moreover, it offers a direct interpretation of covariances and principal components in terms of the angular variables. Apart from its application to PCA, the method of maximal gap shifting is general and can be applied to any other dimensionality reduction method for circular data.

  6. Principal component analysis on a torus: Theory and application to protein dynamics

    NASA Astrophysics Data System (ADS)

    Sittel, Florian; Filk, Thomas; Stock, Gerhard

    2017-12-01

    A dimensionality reduction method for high-dimensional circular data is developed, which is based on a principal component analysis (PCA) of data points on a torus. Adopting a geometrical view of PCA, various distance measures on a torus are introduced and the associated problem of projecting data onto the principal subspaces is discussed. The main idea is that the (periodicity-induced) projection error can be minimized by transforming the data such that the maximal gap of the sampling is shifted to the periodic boundary. In a second step, the covariance matrix and its eigendecomposition can be computed in a standard manner. Adopting molecular dynamics simulations of two well-established biomolecular systems (Aib9 and villin headpiece), the potential of the method to analyze the dynamics of backbone dihedral angles is demonstrated. The new approach allows for a robust and well-defined construction of metastable states and provides low-dimensional reaction coordinates that accurately describe the free energy landscape. Moreover, it offers a direct interpretation of covariances and principal components in terms of the angular variables. Apart from its application to PCA, the method of maximal gap shifting is general and can be applied to any other dimensionality reduction method for circular data.

  7. Fluorescence Intrinsic Characterization of Excitation-Emission Matrix Using Multi-Dimensional Ensemble Empirical Mode Decomposition

    PubMed Central

    Chang, Chi-Ying; Chang, Chia-Chi; Hsiao, Tzu-Chien

    2013-01-01

    Excitation-emission matrix (EEM) fluorescence spectroscopy is a noninvasive method for tissue diagnosis and has become important in clinical use. However, the intrinsic characterization of EEM fluorescence remains unclear. Photobleaching and the complexity of the chemical compounds make it difficult to distinguish individual compounds due to overlapping features. Conventional studies use principal component analysis (PCA) for EEM fluorescence analysis, and the relationship between the EEM features extracted by PCA and diseases has been examined. The spectral features of different tissue constituents are not fully separable or clearly defined. Recently, a non-stationary method called multi-dimensional ensemble empirical mode decomposition (MEEMD) was introduced; this method can extract the intrinsic oscillations on multiple spatial scales without loss of information. The aim of this study was to propose a fluorescence spectroscopy system for EEM measurements and to describe a method for extracting the intrinsic characteristics of EEM by MEEMD. The results indicate that, although PCA provides the principal factor for the spectral features associated with chemical compounds, MEEMD can provide additional intrinsic features with more reliable mapping of the chemical compounds. MEEMD has the potential to extract intrinsic fluorescence features and improve the detection of biochemical changes. PMID:24240806

  8. EVALUATION OF THE I-STAT PORTABLE CLINICAL ANALYZER FOR MEASUREMENT OF IONIZED CALCIUM AND SELECTED BLOOD CHEMISTRY VALUES IN ASIAN ELEPHANTS (ELEPHAS MAXIMUS).

    PubMed

    Tarbert, Danielle K; Behling-Kelly, Erica; Priest, Heather; Childs-Sanford, Sara

    2017-06-01

    Thei-STAT® portable clinical analyzer (PCA) provides patient-side results for hematologic, biochemical, and blood gas values when immediate results are desired. This analyzer is commonly used in nondomestic animals; however, validation of this method in comparison with traditional benchtop methods should be performed for each species. In this study, the i-STAT PCA was compared with the Radiometer ABL 800 Flex benchtop analyzer using 24 heparinized whole blood samples obtained from healthy E. maximus . In addition, the effect of sample storage was evaluated on the i-STAT PCA. Analytes evaluated were hydrogen ion concentration (pH), glucose, potassium (K + ), sodium (Na + ), bicarbonate (HCO 3 - ), total carbon dioxide (TCO 2 ), partial pressure of carbon dioxide (PCO 2 ), and ionized calcium (iCa 2+ ). Statistical analysis using correlation coefficients, Passing-Bablok regression analysis, and Bland-Altman plots found good agreement between results from samples run immediately after phlebotomy and 4 hr postsampling on the i-STAT PCA with the exception of K + , which is known to change with sample storage. Comparison of the results from the two analyzers at 4 hr postsampling found very strong or strong correlation in all values except K + , with statistically significant bias in all values except glucose and PCO 2 . Despite bias, mean differences assessed via Bland-Altman plots were clinically acceptable for all analytes excluding K + . Within the reference range for iCa 2+ , the iCa 2+ values obtained by the i-STAT PCA and Radiometer ABL 800 Flex were close in value, however in light of the constant and proportionate biases detected, overestimation at higher values and underestimation at lower values of iCa 2+ by the i-STAT PCA would be of potential concern. This study supports the use of the i-STAT PCA for the evaluation of these analytes, with the exception of K + , in the Asian elephant.

  9. The Pattern of Brain Amyloid Load in Posterior Cortical Atrophy Using 18F-AV45: Is Amyloid the Principal Actor in the Disease?

    PubMed Central

    Beaufils, Emilie; Ribeiro, Maria Joao; Vierron, Emilie; Vercouillie, Johnny; Dufour-Rainfray, Diane; Cottier, Jean-Philippe; Camus, Vincent; Mondon, Karl; Guilloteau, Denis; Hommet, Caroline

    2014-01-01

    Background Posterior cortical atrophy (PCA) is characterized by progressive higher-order visuoperceptual dysfunction and praxis declines. This syndrome is related to a number of underlying diseases, including, in most cases, Alzheimer's disease (AD). The aim of this study was to compare the amyloid load with 18F-AV45 positron emission tomography (PET) between PCA and AD subjects. Methods We performed 18F-AV45 PET, cerebrospinal fluid (CSF) biomarker analysis and a neuropsychological assessment in 11 PCA patients and 12 AD patients. Results The global and regional 18F-AV45 uptake was similar in the PCA and AD groups. No significant correlation was observed between global 18F-AV45 uptake and CSF biomarkers or between regional 18F-AV45 uptake and cognitive and affective symptoms. Conclusion This 18F-AV45 PET amyloid imaging study showed no specific regional pattern of cortical 18F-AV45 binding in PCA patients. These results confirm that a distinct clinical phenotype in amnestic AD and PCA is not related to amyloid distribution. PMID:25538727

  10. Germline BRCA Mutations Are Associated With Higher Risk of Nodal Involvement, Distant Metastasis, and Poor Survival Outcomes in Prostate Cancer

    PubMed Central

    Castro, Elena; Goh, Chee; Olmos, David; Saunders, Ed; Leongamornlert, Daniel; Tymrakiewicz, Malgorzata; Mahmud, Nadiya; Dadaev, Tokhir; Govindasami, Koveela; Guy, Michelle; Sawyer, Emma; Wilkinson, Rosemary; Ardern-Jones, Audrey; Ellis, Steve; Frost, Debra; Peock, Susan; Evans, D. Gareth; Tischkowitz, Marc; Cole, Trevor; Davidson, Rosemarie; Eccles, Diana; Brewer, Carole; Douglas, Fiona; Porteous, Mary E.; Donaldson, Alan; Dorkins, Huw; Izatt, Louise; Cook, Jackie; Hodgson, Shirley; Kennedy, M. John; Side, Lucy E.; Eason, Jacqueline; Murray, Alex; Antoniou, Antonis C.; Easton, Douglas F.; Kote-Jarai, Zsofia; Eeles, Rosalind

    2013-01-01

    Purpose To analyze the baseline clinicopathologic characteristics of prostate tumors with germline BRCA1 and BRCA2 (BRCA1/2) mutations and the prognostic value of those mutations on prostate cancer (PCa) outcomes. Patients and Methods This study analyzed the tumor features and outcomes of 2,019 patients with PCa (18 BRCA1 carriers, 61 BRCA2 carriers, and 1,940 noncarriers). The Kaplan-Meier method and Cox regression analysis were used to evaluate the associations between BRCA1/2 status and other PCa prognostic factors with overall survival (OS), cause-specific OS (CSS), CSS in localized PCa (CSS_M0), metastasis-free survival (MFS), and CSS from metastasis (CSS_M1). Results PCa with germline BRCA1/2 mutations were more frequently associated with Gleason ≥ 8 (P = .00003), T3/T4 stage (P = .003), nodal involvement (P = .00005), and metastases at diagnosis (P = .005) than PCa in noncarriers. CSS was significantly longer in noncarriers than in carriers (15.7 v 8.6 years, multivariable analyses [MVA] P = .015; hazard ratio [HR] = 1.8). For localized PCa, 5-year CSS and MFS were significantly higher in noncarriers (96% v 82%; MVA P = .01; HR = 2.6%; and 93% v 77%; MVA P = .009; HR = 2.7, respectively). Subgroup analyses confirmed the poor outcomes in BRCA2 patients, whereas the role of BRCA1 was not well defined due to the limited size and follow-up in this subgroup. Conclusion Our results confirm that BRCA1/2 mutations confer a more aggressive PCa phenotype with a higher probability of nodal involvement and distant metastasis. BRCA mutations are associated with poor survival outcomes and this should be considered for tailoring clinical management of these patients. PMID:23569316

  11. Predicting timing of foot strike during running, independent of striking technique, using principal component analysis of joint angles.

    PubMed

    Osis, Sean T; Hettinga, Blayne A; Leitch, Jessica; Ferber, Reed

    2014-08-22

    As 3-dimensional (3D) motion-capture for clinical gait analysis continues to evolve, new methods must be developed to improve the detection of gait cycle events based on kinematic data. Recently, the application of principal component analysis (PCA) to gait data has shown promise in detecting important biomechanical features. Therefore, the purpose of this study was to define a new foot strike detection method for a continuum of striking techniques, by applying PCA to joint angle waveforms. In accordance with Newtonian mechanics, it was hypothesized that transient features in the sagittal-plane accelerations of the lower extremity would be linked with the impulsive application of force to the foot at foot strike. Kinematic and kinetic data from treadmill running were selected for 154 subjects, from a database of gait biomechanics. Ankle, knee and hip sagittal plane angular acceleration kinematic curves were chained together to form a row input to a PCA matrix. A linear polynomial was calculated based on PCA scores, and a 10-fold cross-validation was performed to evaluate prediction accuracy against gold-standard foot strike as determined by a 10 N rise in the vertical ground reaction force. Results show 89-94% of all predicted foot strikes were within 4 frames (20 ms) of the gold standard with the largest error being 28 ms. It is concluded that this new foot strike detection is an improvement on existing methods and can be applied regardless of whether the runner exhibits a rearfoot, midfoot, or forefoot strike pattern. Copyright © 2014 Elsevier Ltd. All rights reserved.

  12. Sample-Poor Estimation of Order and Common Signal Subspace with Application to Fusion of Medical Imaging Data

    PubMed Central

    Levin-Schwartz, Yuri; Song, Yang; Schreier, Peter J.; Calhoun, Vince D.; Adalı, Tülay

    2016-01-01

    Due to their data-driven nature, multivariate methods such as canonical correlation analysis (CCA) have proven very useful for fusion of multimodal neurological data. However, being able to determine the degree of similarity between datasets and appropriate order selection are crucial to the success of such techniques. The standard methods for calculating the order of multimodal data focus only on sources with the greatest individual energy and ignore relations across datasets. Additionally, these techniques as well as the most widely-used methods for determining the degree of similarity between datasets assume sufficient sample support and are not effective in the sample-poor regime. In this paper, we propose to jointly estimate the degree of similarity between datasets and their order when few samples are present using principal component analysis and canonical correlation analysis (PCA-CCA). By considering these two problems simultaneously, we are able to minimize the assumptions placed on the data and achieve superior performance in the sample-poor regime compared to traditional techniques. We apply PCA-CCA to the pairwise combinations of functional magnetic resonance imaging (fMRI), structural magnetic resonance imaging (sMRI), and electroencephalogram (EEG) data drawn from patients with schizophrenia and healthy controls while performing an auditory oddball task. The PCA-CCA results indicate that the fMRI and sMRI datasets are the most similar, whereas the sMRI and EEG datasets share the least similarity. We also demonstrate that the degree of similarity obtained by PCA-CCA is highly predictive of the degree of significance found for components generated using CCA. PMID:27039696

  13. Classification and quantification analysis of peach kernel from different origins with near-infrared diffuse reflection spectroscopy

    PubMed Central

    Liu, Wei; Wang, Zhen-Zhong; Qing, Jian-Ping; Li, Hong-Juan; Xiao, Wei

    2014-01-01

    Background: Peach kernels which contain kinds of fatty acids play an important role in the regulation of a variety of physiological and biological functions. Objective: To establish an innovative and rapid diffuse reflectance near-infrared spectroscopy (DR-NIR) analysis method along with chemometric techniques for the qualitative and quantitative determination of a peach kernel. Materials and Methods: Peach kernel samples from nine different origins were analyzed with high-performance liquid chromatography (HPLC) as a reference method. DR-NIR is in the spectral range 1100-2300 nm. Principal component analysis (PCA) and partial least squares regression (PLSR) algorithm were applied to obtain prediction models, The Savitzky-Golay derivative and first derivative were adopted for the spectral pre-processing, PCA was applied to classify the varieties of those samples. For the quantitative calibration, the models of linoleic and oleinic acids were established with the PLSR algorithm and the optimal principal component (PC) numbers were selected with leave-one-out (LOO) cross-validation. The established models were evaluated with the root mean square error of deviation (RMSED) and corresponding correlation coefficients (R2). Results: The PCA results of DR-NIR spectra yield clear classification of the two varieties of peach kernel. PLSR had a better predictive ability. The correlation coefficients of the two calibration models were above 0.99, and the RMSED of linoleic and oleinic acids were 1.266% and 1.412%, respectively. Conclusion: The DR-NIR combined with PCA and PLSR algorithm could be used efficiently to identify and quantify peach kernels and also help to solve variety problem. PMID:25422544

  14. Standardized processing of MALDI imaging raw data for enhancement of weak analyte signals in mouse models of gastric cancer and Alzheimer's disease.

    PubMed

    Schwartz, Matthias; Meyer, Björn; Wirnitzer, Bernhard; Hopf, Carsten

    2015-03-01

    Conventional mass spectrometry image preprocessing methods used for denoising, such as the Savitzky-Golay smoothing or discrete wavelet transformation, typically do not only remove noise but also weak signals. Recently, memory-efficient principal component analysis (PCA) in conjunction with random projections (RP) has been proposed for reversible compression and analysis of large mass spectrometry imaging datasets. It considers single-pixel spectra in their local context and consequently offers the prospect of using information from the spectra of adjacent pixels for denoising or signal enhancement. However, little systematic analysis of key RP-PCA parameters has been reported so far, and the utility and validity of this method for context-dependent enhancement of known medically or pharmacologically relevant weak analyte signals in linear-mode matrix-assisted laser desorption/ionization (MALDI) mass spectra has not been explored yet. Here, we investigate MALDI imaging datasets from mouse models of Alzheimer's disease and gastric cancer to systematically assess the importance of selecting the right number of random projections k and of principal components (PCs) L for reconstructing reproducibly denoised images after compression. We provide detailed quantitative data for comparison of RP-PCA-denoising with the Savitzky-Golay and wavelet-based denoising in these mouse models as a resource for the mass spectrometry imaging community. Most importantly, we demonstrate that RP-PCA preprocessing can enhance signals of low-intensity amyloid-β peptide isoforms such as Aβ1-26 even in sparsely distributed Alzheimer's β-amyloid plaques and that it enables enhanced imaging of multiply acetylated histone H4 isoforms in response to pharmacological histone deacetylase inhibition in vivo. We conclude that RP-PCA denoising may be a useful preprocessing step in biomarker discovery workflows.

  15. Urinary MicroRNAs of Prostate Cancer: Virus-Encoded hsv1-miRH18 and hsv2-miR-H9-5p Could Be Valuable Diagnostic Markers

    PubMed Central

    Yun, Seok Joong; Jeong, Pildu; Kang, Ho Won; Kim, Ye-Hwan; Kim, Eun-Ah; Yan, Chunri; Choi, Young-Ki; Kim, Dongho; Kim, Jung Min; Kim, Seon-Kyu; Kim, Seon-Young; Kim, Sang Tae; Kim, Won Tae; Lee, Ok-Jun; Koh, Gou-Young; Moon, Sung-Kwon; Kim, Isaac Yi; Kim, Jayoung; Choi, Yung-Hyun; Kim, Wun-Jae

    2015-01-01

    Purpose: MicroRNAs (miRNAs) in biological fluids are potential biomarkers for the diagnosis and assessment of urological diseases such as benign prostatic hyperplasia (BPH) and prostate cancer (PCa). The aim of the study was to identify and validate urinary cell-free miRNAs that can segregate patients with PCa from those with BPH. Methods: In total, 1,052 urine, 150 serum, and 150 prostate tissue samples from patients with PCa or BPH were used in the study. A urine-based miRNA microarray analysis suggested the presence of differentially expressed urinary miRNAs in patients with PCa, and these were further validated in three independent PCa cohorts, using a quantitative reverse transcriptionpolymerase chain reaction analysis. Results: The expression levels of hsa-miR-615-3p, hsv1-miR-H18, hsv2-miR-H9-5p, and hsa-miR-4316 were significantly higher in urine samples of patients with PCa than in those of BPH controls. In particular, herpes simplex virus (hsv)-derived hsv1-miR-H18 and hsv2-miR-H9-5p showed better diagnostic performance than did the serum prostate-specific antigen (PSA) test for patients in the PSA gray zone. Furthermore, a combination of urinary hsv2-miR-H9-5p with serum PSA showed high sensitivity and specificity, providing a potential clinical benefit by reducing unnecessary biopsies. Conclusions: Our findings showed that hsv-encoded hsv1-miR-H18 and hsv2-miR-H9-5p are significantly associated with PCa and can facilitate early diagnosis of PCa for patients within the serum PSA gray zone. PMID:26126436

  16. Identification and suppression of the p-coumaroyl CoA:hydroxycinnamyl alcohol transferase in Zea mays L.

    PubMed Central

    Marita, Jane M; Hatfield, Ronald D; Rancour, David M; Frost, Kenneth E

    2014-01-01

    Grasses, such as Zea mays L. (maize), contain relatively high levels of p-coumarates (pCA) within their cell walls. Incorporation of pCA into cell walls is believed to be due to a hydroxycinnamyl transferase that couples pCA to monolignols. To understand the role of pCA in maize development, the p-coumaroyl CoA:hydroxycinnamyl alcohol transferase (pCAT) was isolated and purified from maize stems. Purified pCAT was subjected to partial trypsin digestion, and peptides were sequenced by tandem mass spectrometry. TBLASTN analysis of the acquired peptide sequences identified a single full-length maize cDNA clone encoding all the peptide sequences obtained from the purified enzyme. The cDNA clone was obtained and used to generate an RNAi construct for suppressing pCAT expression in maize. Here we describe the effects of suppression of pCAT in maize. Primary screening of transgenic maize seedling leaves using a new rapid analytical platform was used to identify plants with decreased amounts of pCA. Using this screening method, mature leaves from fully developed plants were analyzed, confirming reduced pCA levels throughout plant development. Complete analysis of isolated cell walls from mature transgenic stems and leaves revealed that lignin levels did not change, but pCA levels decreased and the lignin composition was altered. Transgenic plants with the lowest levels of pCA had decreased levels of syringyl units in the lignin. Thus, altering the levels of pCAT expression in maize leads to altered lignin composition, but does not appear to alter the total amount of lignin present in the cell walls. PMID:24654730

  17. Identification and suppression of the p-coumaroyl CoA:hydroxycinnamyl alcohol transferase in Zea mays L.

    PubMed

    Marita, Jane M; Hatfield, Ronald D; Rancour, David M; Frost, Kenneth E

    2014-06-01

    Grasses, such as Zea mays L. (maize), contain relatively high levels of p-coumarates (pCA) within their cell walls. Incorporation of pCA into cell walls is believed to be due to a hydroxycinnamyl transferase that couples pCA to monolignols. To understand the role of pCA in maize development, the p-coumaroyl CoA:hydroxycinnamyl alcohol transferase (pCAT) was isolated and purified from maize stems. Purified pCAT was subjected to partial trypsin digestion, and peptides were sequenced by tandem mass spectrometry. TBLASTN analysis of the acquired peptide sequences identified a single full-length maize cDNA clone encoding all the peptide sequences obtained from the purified enzyme. The cDNA clone was obtained and used to generate an RNAi construct for suppressing pCAT expression in maize. Here we describe the effects of suppression of pCAT in maize. Primary screening of transgenic maize seedling leaves using a new rapid analytical platform was used to identify plants with decreased amounts of pCA. Using this screening method, mature leaves from fully developed plants were analyzed, confirming reduced pCA levels throughout plant development. Complete analysis of isolated cell walls from mature transgenic stems and leaves revealed that lignin levels did not change, but pCA levels decreased and the lignin composition was altered. Transgenic plants with the lowest levels of pCA had decreased levels of syringyl units in the lignin. Thus, altering the levels of pCAT expression in maize leads to altered lignin composition, but does not appear to alter the total amount of lignin present in the cell walls. © 2014 The Authors The Plant Journal © 2014 John Wiley & Sons Ltd.

  18. Application of EOF/PCA-based methods in the post-processing of GRACE derived water variations

    NASA Astrophysics Data System (ADS)

    Forootan, Ehsan; Kusche, Jürgen

    2010-05-01

    Two problems that users of monthly GRACE gravity field solutions face are 1) the presence of correlated noise in the Stokes coefficients that increases with harmonic degree and causes ‘striping', and 2) the fact that different physical signals are overlaid and difficult to separate from each other in the data. These problems are termed the signal-noise separation problem and the signal-signal separation problem. Methods that are based on principal component analysis and empirical orthogonal functions (PCA/EOF) have been frequently proposed to deal with these problems for GRACE. However, different strategies have been applied to different (spatial: global/regional, spectral: global/order-wise, geoid/equivalent water height) representations of the GRACE level 2 data products, leading to differing results and a general feeling that PCA/EOF-based methods are to be applied ‘with care'. In addition, it is known that conventional EOF/PCA methods force separated modes to be orthogonal, and that, on the other hand, to either EOFs or PCs an arbitrary orthogonal rotation can be applied. The aim of this paper is to provide a common theoretical framework and to study the application of PCA/EOF-based methods as a signal separation tool due to post-process GRACE data products. In order to investigate and illustrate the applicability of PCA/EOF-based methods, we have employed them on GRACE level 2 monthly solutions based on the Center for Space Research, University of Texas (CSR/UT) RL04 products and on the ITG-GRACE03 solutions from the University of Bonn, and on various representations of them. Our results show that EOF modes do reveal the dominating annual, semiannual and also long-periodic signals in the global water storage variations, but they also show how choosing different strategies changes the outcome and may lead to unexpected results.

  19. Measuring the Indonesian provinces competitiveness by using PCA technique

    NASA Astrophysics Data System (ADS)

    Runita, Ditha; Fajriyah, Rohmatul

    2017-12-01

    Indonesia is a country which has vast teritoty. It has 34 provinces. Building local competitiveness is critical to enhance the long-term national competitiveness especially for a country as diverse as Indonesia. A competitive local government can attract and maintain successful firms and increase living standards for its inhabitants, because investment and skilled workers gravitate from uncompetitive regions to more competitive ones. Altough there are other methods to measuring competitiveness, but here we have demonstrated a simple method using principal component analysis (PCA). It can directly be applied to correlated, multivariate data. The analysis on Indonesian provinces provides 3 clusters based on the competitiveness measurement and the clusters are Bad, Good and Best perform provinces.

  20. Contrast-Enhanced Ultrasound Angiogenesis Imaging by Mutual Information Analysis for Prostate Cancer Localization.

    PubMed

    Schalk, Stefan G; Demi, Libertario; Bouhouch, Nabil; Kuenen, Maarten P J; Postema, Arnoud W; de la Rosette, Jean J M C H; Wijkstra, Hessel; Tjalkens, Tjalling J; Mischi, Massimo

    2017-03-01

    The role of angiogenesis in cancer growth has stimulated research aimed at noninvasive cancer detection by blood perfusion imaging. Recently, contrast ultrasound dispersion imaging was proposed as an alternative method for angiogenesis imaging. After the intravenous injection of an ultrasound-contrast-agent bolus, dispersion can be indirectly estimated from the local similarity between neighboring time-intensity curves (TICs) measured by ultrasound imaging. Up until now, only linear similarity measures have been investigated. Motivated by the promising results of this approach in prostate cancer (PCa), we developed a novel dispersion estimation method based on mutual information, thus including nonlinear similarity, to further improve its ability to localize PCa. First, a simulation study was performed to establish the theoretical link between dispersion and mutual information. Next, the method's ability to localize PCa was validated in vivo in 23 patients (58 datasets) referred for radical prostatectomy by comparison with histology. A monotonic relationship between dispersion and mutual information was demonstrated. The in vivo study resulted in a receiver operating characteristic (ROC) curve area equal to 0.77, which was superior (p = 0.21-0.24) to that obtained by linear similarity measures (0.74-0.75) and (p <; 0.05) to that by conventional perfusion parameters (≤0.70). Mutual information between neighboring time-intensity curves can be used to indirectly estimate contrast dispersion and can lead to more accurate PCa localization. An improved PCa localization method can possibly lead to better grading and staging of tumors, and support focal-treatment guidance. Moreover, future employment of the method in other types of angiogenic cancer can be considered.

  1. INTEGRATED ENVIRONMENTAL ASSESSMENT OF THE MID-ATLANTIC REGION WITH ANALYTICAL NETWORK PROCESS

    EPA Science Inventory

    A decision analysis method for integrating environmental indicators was developed. This was a combination of Principal Component Analysis (PCA) and the Analytic Network Process (ANP). Being able to take into account interdependency among variables, the method was capable of ran...

  2. Different approaches in Partial Least Squares and Artificial Neural Network models applied for the analysis of a ternary mixture of Amlodipine, Valsartan and Hydrochlorothiazide

    NASA Astrophysics Data System (ADS)

    Darwish, Hany W.; Hassan, Said A.; Salem, Maissa Y.; El-Zeany, Badr A.

    2014-03-01

    Different chemometric models were applied for the quantitative analysis of Amlodipine (AML), Valsartan (VAL) and Hydrochlorothiazide (HCT) in ternary mixture, namely, Partial Least Squares (PLS) as traditional chemometric model and Artificial Neural Networks (ANN) as advanced model. PLS and ANN were applied with and without variable selection procedure (Genetic Algorithm GA) and data compression procedure (Principal Component Analysis PCA). The chemometric methods applied are PLS-1, GA-PLS, ANN, GA-ANN and PCA-ANN. The methods were used for the quantitative analysis of the drugs in raw materials and pharmaceutical dosage form via handling the UV spectral data. A 3-factor 5-level experimental design was established resulting in 25 mixtures containing different ratios of the drugs. Fifteen mixtures were used as a calibration set and the other ten mixtures were used as validation set to validate the prediction ability of the suggested methods. The validity of the proposed methods was assessed using the standard addition technique.

  3. An analytical approach based on ESI-MS, LC-MS and PCA for the quali-quantitative analysis of cycloartane derivatives in Astragalus spp.

    PubMed

    Napolitano, Assunta; Akay, Seref; Mari, Angela; Bedir, Erdal; Pizza, Cosimo; Piacente, Sonia

    2013-11-01

    Astragalus species are widely used as health foods and dietary supplements, as well as drugs in traditional medicine. To rapidly evaluate metabolite similarities and differences among the EtOH extracts of the roots of eight commercial Astragalus spp., an approach based on direct analyses by ESI-MS followed by PCA of ESI-MS data, was carried out. Successively, quali-quantitative analyses of cycloartane derivatives in the eight Astragalus spp. by LC-ESI-MS(n) and PCA of LC-ESI-MS data were performed. This approach allowed to promptly highlighting metabolite similarities and differences among the various Astragalus spp. PCA results from LC-ESI-MS data of Astragalus samples were in reasonable agreement with both PCA results of ESI-MS data and quantitative results. This study affords an analytical method for the quali-quantitative determination of cycloartane derivatives in herbal preparations used as health and food supplements. Copyright © 2013 Elsevier B.V. All rights reserved.

  4. Principal component and spatial correlation analysis of spectroscopic-imaging data in scanning probe microscopy.

    PubMed

    Jesse, Stephen; Kalinin, Sergei V

    2009-02-25

    An approach for the analysis of multi-dimensional, spectroscopic-imaging data based on principal component analysis (PCA) is explored. PCA selects and ranks relevant response components based on variance within the data. It is shown that for examples with small relative variations between spectra, the first few PCA components closely coincide with results obtained using model fitting, and this is achieved at rates approximately four orders of magnitude faster. For cases with strong response variations, PCA allows an effective approach to rapidly process, de-noise, and compress data. The prospects for PCA combined with correlation function analysis of component maps as a universal tool for data analysis and representation in microscopy are discussed.

  5. Principle component analysis (PCA) for investigation of relationship between population dynamics of microbial pathogenesis, chemical and sensory characteristics in beef slices containing Tarragon essential oil.

    PubMed

    Alizadeh Behbahani, Behrooz; Tabatabaei Yazdi, Farideh; Shahidi, Fakhri; Mortazavi, Seyed Ali; Mohebbi, Mohebbat

    2017-04-01

    Principle component analysis (PCA) was employed to examine the effect of the exerted treatments on the beef shelf life as well as discovering the correlations between the studied responses. Considering the variability of the dimensions of the responses, correlation coefficients were applied to form the matrix and extract the eigenvalue. Antimicrobial effect was evaluated on 10 pathogenic microorganisms through the methods of hole-plate diffusion method, disk diffusion method, pour plate method, minimum inhibitory concentration and minimum bactericidal/fungicidal concentration. Antioxidant potential and total phenolic content were examined through the method of 2,2-diphenyl-1-picrylhydrazyl (DPPH) and Folin-Ciocalteu method, respectively. The components were identified through gas chromatography and gas chromatography/mass spectrometry. Barhang seed mucilage (BSM) based edible coating containing 0, 0.5, 1 and 1.5% (w/w) Tarragon (T) essential oil mix were applied on beef slices to control the growth of pathogenic microorganisms. Microbiological (total viable count, psychrotrophic count, Escherichia coli, Staphylococcus aureus and fungi), chemical (thiobarbituric acid, peroxide value and pH) and sensory characteristics (odor, color and overall acceptability) analysis measurements were made during the storage periodically. PCA was employed to examine the effect of the exerted treatments on the beef shelf life as well as discovering the correlations between the studied responses. Considering the variability of the dimensions of the responses, correlation coefficients were applied to form the matrix and extract the eigenvalue. The PCA showed that the properties of the uncoated meat samples on the 9th, 12th, 15th and 18th days of storage are continuously changing independent of the exerted treatments on the other samples. This reveals the effect of the exerted treatments on the samples. Copyright © 2017 Elsevier Ltd. All rights reserved.

  6. Source apportionment of PAH in Hamilton Harbour suspended sediments: comparison of two factor analysis methods.

    PubMed

    Sofowote, Uwayemi M; McCarry, Brian E; Marvin, Christopher H

    2008-08-15

    A total of 26 suspended sediment samples collected over a 5-year period in Hamilton Harbour, Ontario, Canada and surrounding creeks were analyzed for a suite of polycyclic aromatic hydrocarbons and sulfur heterocycles. Hamilton Harbour sediments contain relatively high levels of polycyclic aromatic compounds and heavy metals due to emissions from industrial and mobile sources. Two receptor modeling methods using factor analyses were compared to determine the profiles and relative contributions of pollution sources to the harbor; these methods are principal component analyses (PCA) with multiple linear regression analysis (MLR) and positive matrix factorization (PMF). Both methods identified four factors and gave excellent correlation coefficients between predicted and measured levels of 25 aromatic compounds; both methods predicted similar contributions from coal tar/coal combustion sources to the harbor (19 and 26%, respectively). One PCA factor was identified as contributions from vehicular emissions (61%); PMF was able to differentiate vehicular emissions into two factors, one attributed to gasoline emissions sources (28%) and the other to diesel emissions sources (24%). Overall, PMF afforded better source identification than PCA with MLR. This work constitutes one of the few examples of the application of PMF to the source apportionment of sediments; the addition of sulfur heterocycles to the analyte list greatly aided in the source identification process.

  7. Metabolic fingerprint of Brazilian maize landraces silk (stigma/styles) using NMR spectroscopy and chemometric methods.

    PubMed

    Kuhnen, Shirley; Bernardi Ogliari, Juliana; Dias, Paulo Fernando; da Silva Santos, Maiara; Ferreira, Antônio Gilberto; Bonham, Connie C; Wood, Karl Vernon; Maraschin, Marcelo

    2010-02-24

    Aqueous extract from maize silks is used by traditional medicine for the treatment of several ailments, mainly related to the urinary system. This work focuses on the application of NMR spectroscopy and chemometric analysis for the determination of metabolic fingerprint and pattern recognition of silk extracts from seven maize landraces cultivated in southern Brazil. Principal component analysis (PCA) of the (1)H NMR data set showed clear discrimination among the maize varieties by PC1 and PC2, pointing out three distinct metabolic profiles. Target compounds analysis showed significant differences (p < 0.05) in the contents of protocatechuic acid, gallic acid, t-cinnamic acid, and anthocyanins, corroborating the discrimination of the genotypes in this study as revealed by PCA analysis. Thus the combination of (1)H NMR and PCA is a useful tool for the discrimination of maize silks in respect to their chemical composition, including rapid authentication of the raw material of current pharmacological interest.

  8. Analysis of antique bronze coins by Laser Induced Breakdown Spectroscopy and multivariate analysis

    NASA Astrophysics Data System (ADS)

    Bachler, M. Orlić; Bišćan, M.; Kregar, Z.; Jelovica Badovinac, I.; Dobrinić, J.; Milošević, S.

    2016-09-01

    This work presents a feasibility study of applying the Principal Component Analysis (PCA) to data obtained by Laser-Induced Breakdown Spectroscopy (LIBS) with the aim of determining correlation between different samples. The samples were antique bronze coins coated in silver (follis) dated in the Roman Empire period and were made during different rulers in different mints. While raw LIBS data revealed that in the period from the year 286 to 383 CE content of silver was constantly decreasing, the PCA showed that the samples can be somewhat grouped together based on their place of origin, which could be a useful hint when analysing unknown samples. It was also found that PCA can help in discriminating spectra corresponding to ablation from the surface and from the bulk. Furthermore, Partial Least Squares method (PLS) was used to obtain, based on a set of samples with known composition, an estimation of relative copper concentration in studied ancient coins. This analysis showed that copper concentration in surface layers ranged from 83% to 90%.

  9. Analyzing brain networks with PCA and conditional Granger causality.

    PubMed

    Zhou, Zhenyu; Chen, Yonghong; Ding, Mingzhou; Wright, Paul; Lu, Zuhong; Liu, Yijun

    2009-07-01

    Identifying directional influences in anatomical and functional circuits presents one of the greatest challenges for understanding neural computations in the brain. Granger causality mapping (GCM) derived from vector autoregressive models of data has been employed for this purpose, revealing complex temporal and spatial dynamics underlying cognitive processes. However, the traditional GCM methods are computationally expensive, as signals from thousands of voxels within selected regions of interest (ROIs) are individually processed, and being based on pairwise Granger causality, they lack the ability to distinguish direct from indirect connectivity among brain regions. In this work a new algorithm called PCA based conditional GCM is proposed to overcome these problems. The algorithm implements the following two procedures: (i) dimensionality reduction in ROIs of interest with principle component analysis (PCA), and (ii) estimation of the direct causal influences in local brain networks, using conditional Granger causality. Our results show that the proposed method achieves greater accuracy in detecting network connectivity than the commonly used pairwise Granger causality method. Furthermore, the use of PCA components in conjunction with conditional GCM greatly reduces the computational cost relative to the use of individual voxel time series. Copyright 2009 Wiley-Liss, Inc

  10. Cluster analysis of commercial samples of Bauhinia spp. using HPLC-UV/PDA and MCR-ALS/PCA without peak alignment procedure.

    PubMed

    Ardila, Jorge Armando; Funari, Cristiano Soleo; Andrade, André Marques; Cavalheiro, Alberto José; Carneiro, Renato Lajarim

    2015-01-01

    Bauhinia forficata Link. is recognised by the Brazilian Health Ministry as a treatment of hypoglycemia and diabetes. Analytical methods are useful to assess the plant identity due the similarities found in plants from Bauhinia spp. HPLC-UV/PDA in combination with chemometric tools is an alternative widely used and suitable for authentication of plant material, however, the shifts of retention times for similar compounds in different samples is a problem. To perform comparisons between the authentic medicinal plant (Bauhinia forficata Link.) and samples commercially available in drugstores claiming to be "Bauhinia spp. to treat diabetes" and to evaluate the performance of multivariate curve resolution - alternating least squares (MCR-ALS) associated to principal component analysis (PCA) when compared to pure PCA. HPLC-UV/PDA data obtained from extracts of leaves were evaluated employing a combination of MCR-ALS and PCA, which allowed the use of the full chromatographic and spectrometric information without the need of peak alignment procedures. The use of MCR-ALS/PCA showed better results than the conventional PCA using only one wavelength. Only two of nine commercial samples presented characteristics similar to the authentic Bauhinia forficata spp., considering the full HPLC-UV/PDA data. The combination of MCR-ALS and PCA is very useful when applied to a group of samples where a general alignment procedure could not be applied due to the different chromatographic profiles. This work also demonstrates the need of more strict control from the health authorities regarding herbal products available on the market. Copyright © 2015 John Wiley & Sons, Ltd.

  11. Quality Evaluation of Potentilla fruticosa L. by High Performance Liquid Chromatography Fingerprinting Associated with Chemometric Methods.

    PubMed

    Liu, Wei; Wang, Dongmei; Liu, Jianjun; Li, Dengwu; Yin, Dongxue

    2016-01-01

    The present study was performed to assess the quality of Potentilla fruticosa L. sampled from distinct regions of China using high performance liquid chromatography (HPLC) fingerprinting coupled with a suite of chemometric methods. For this quantitative analysis, the main active phytochemical compositions and the antioxidant activity in P. fruticosa were also investigated. Considering the high percentages and antioxidant activities of phytochemicals, P. fruticosa samples from Kangding, Sichuan were selected as the most valuable raw materials. Similarity analysis (SA) of HPLC fingerprints, hierarchical cluster analysis (HCA), principle component analysis (PCA), and discriminant analysis (DA) were further employed to provide accurate classification and quality estimates of P. fruticosa. Two principal components (PCs) were collected by PCA. PC1 separated samples from Kangding, Sichuan, capturing 57.64% of the variance, whereas PC2 contributed to further separation, capturing 18.97% of the variance. Two kinds of discriminant functions with a 100% discrimination ratio were constructed. The results strongly supported the conclusion that the eight samples from different regions were clustered into three major groups, corresponding with their morphological classification, for which HPLC analysis confirmed the considerable variation in phytochemical compositions and that P. fruticosa samples from Kangding, Sichuan were of high quality. The results of SA, HCA, PCA, and DA were in agreement and performed well for the quality assessment of P. fruticosa. Consequently, HPLC fingerprinting coupled with chemometric techniques provides a highly flexible and reliable method for the quality evaluation of traditional Chinese medicines.

  12. Quality Evaluation of Potentilla fruticosa L. by High Performance Liquid Chromatography Fingerprinting Associated with Chemometric Methods

    PubMed Central

    Liu, Wei; Wang, Dongmei; Liu, Jianjun; Li, Dengwu; Yin, Dongxue

    2016-01-01

    The present study was performed to assess the quality of Potentilla fruticosa L. sampled from distinct regions of China using high performance liquid chromatography (HPLC) fingerprinting coupled with a suite of chemometric methods. For this quantitative analysis, the main active phytochemical compositions and the antioxidant activity in P. fruticosa were also investigated. Considering the high percentages and antioxidant activities of phytochemicals, P. fruticosa samples from Kangding, Sichuan were selected as the most valuable raw materials. Similarity analysis (SA) of HPLC fingerprints, hierarchical cluster analysis (HCA), principle component analysis (PCA), and discriminant analysis (DA) were further employed to provide accurate classification and quality estimates of P. fruticosa. Two principal components (PCs) were collected by PCA. PC1 separated samples from Kangding, Sichuan, capturing 57.64% of the variance, whereas PC2 contributed to further separation, capturing 18.97% of the variance. Two kinds of discriminant functions with a 100% discrimination ratio were constructed. The results strongly supported the conclusion that the eight samples from different regions were clustered into three major groups, corresponding with their morphological classification, for which HPLC analysis confirmed the considerable variation in phytochemical compositions and that P. fruticosa samples from Kangding, Sichuan were of high quality. The results of SA, HCA, PCA, and DA were in agreement and performed well for the quality assessment of P. fruticosa. Consequently, HPLC fingerprinting coupled with chemometric techniques provides a highly flexible and reliable method for the quality evaluation of traditional Chinese medicines. PMID:26890416

  13. Study on 1H-NMR fingerprinting of Rhodiolae Crenulatae Radix et Rhizoma.

    PubMed

    Wen, Shi-yuan; Zhou, Jiang-tao; Chen, Yan-yan; Ding, Li-qin; Jiang, Miao-miao

    2015-07-01

    Nuclear magnetic resonance (1H-NMR) fingerprint of Rhodiola rosea medicinal materials was established, and used to distinguish the quality of raw materials from different sources. Pulse sequence for water peak inhibition was employed to acquire 1H-NMR spectra with the temperature at 298 K and spectrometer frequency of 400.13 MHz. Through subsection integral method, the obtained NMR data was subjected to similarity analysis and principal component analysis (PCA). 10 batches raw materials of Rhodiola rosea from different origins were successfully distinguished by PCA. The statistical results indicated that rhodiola glucoside, butyl alcohol, maleic acid and alanine were the main differential ingredients. This method provides an auxiliary method of Chinese quality approach to evaluate the quality of Rhodiola crenulata without using natural reference substances.

  14. Learning binary code via PCA of angle projection for image retrieval

    NASA Astrophysics Data System (ADS)

    Yang, Fumeng; Ye, Zhiqiang; Wei, Xueqi; Wu, Congzhong

    2018-01-01

    With benefits of low storage costs and high query speeds, binary code representation methods are widely researched for efficiently retrieving large-scale data. In image hashing method, learning hashing function to embed highdimensions feature to Hamming space is a key step for accuracy retrieval. Principal component analysis (PCA) technical is widely used in compact hashing methods, and most these hashing methods adopt PCA projection functions to project the original data into several dimensions of real values, and then each of these projected dimensions is quantized into one bit by thresholding. The variances of different projected dimensions are different, and with real-valued projection produced more quantization error. To avoid the real-valued projection with large quantization error, in this paper we proposed to use Cosine similarity projection for each dimensions, the angle projection can keep the original structure and more compact with the Cosine-valued. We used our method combined the ITQ hashing algorithm, and the extensive experiments on the public CIFAR-10 and Caltech-256 datasets validate the effectiveness of the proposed method.

  15. ClustVis: a web tool for visualizing clustering of multivariate data using Principal Component Analysis and heatmap

    PubMed Central

    Metsalu, Tauno; Vilo, Jaak

    2015-01-01

    The Principal Component Analysis (PCA) is a widely used method of reducing the dimensionality of high-dimensional data, often followed by visualizing two of the components on the scatterplot. Although widely used, the method is lacking an easy-to-use web interface that scientists with little programming skills could use to make plots of their own data. The same applies to creating heatmaps: it is possible to add conditional formatting for Excel cells to show colored heatmaps, but for more advanced features such as clustering and experimental annotations, more sophisticated analysis tools have to be used. We present a web tool called ClustVis that aims to have an intuitive user interface. Users can upload data from a simple delimited text file that can be created in a spreadsheet program. It is possible to modify data processing methods and the final appearance of the PCA and heatmap plots by using drop-down menus, text boxes, sliders etc. Appropriate defaults are given to reduce the time needed by the user to specify input parameters. As an output, users can download PCA plot and heatmap in one of the preferred file formats. This web server is freely available at http://biit.cs.ut.ee/clustvis/. PMID:25969447

  16. Multivariate qualitative analysis of banned additives in food safety using surface enhanced Raman scattering spectroscopy

    NASA Astrophysics Data System (ADS)

    He, Shixuan; Xie, Wanyi; Zhang, Wei; Zhang, Liqun; Wang, Yunxia; Liu, Xiaoling; Liu, Yulong; Du, Chunlei

    2015-02-01

    A novel strategy which combines iteratively cubic spline fitting baseline correction method with discriminant partial least squares qualitative analysis is employed to analyze the surface enhanced Raman scattering (SERS) spectroscopy of banned food additives, such as Sudan I dye and Rhodamine B in food, Malachite green residues in aquaculture fish. Multivariate qualitative analysis methods, using the combination of spectra preprocessing iteratively cubic spline fitting (ICSF) baseline correction with principal component analysis (PCA) and discriminant partial least squares (DPLS) classification respectively, are applied to investigate the effectiveness of SERS spectroscopy for predicting the class assignments of unknown banned food additives. PCA cannot be used to predict the class assignments of unknown samples. However, the DPLS classification can discriminate the class assignment of unknown banned additives using the information of differences in relative intensities. The results demonstrate that SERS spectroscopy combined with ICSF baseline correction method and exploratory analysis methodology DPLS classification can be potentially used for distinguishing the banned food additives in field of food safety.

  17. An improved principal component analysis based region matching method for fringe direction estimation

    NASA Astrophysics Data System (ADS)

    He, A.; Quan, C.

    2018-04-01

    The principal component analysis (PCA) and region matching combined method is effective for fringe direction estimation. However, its mask construction algorithm for region matching fails in some circumstances, and the algorithm for conversion of orientation to direction in mask areas is computationally-heavy and non-optimized. We propose an improved PCA based region matching method for the fringe direction estimation, which includes an improved and robust mask construction scheme, and a fast and optimized orientation-direction conversion algorithm for the mask areas. Along with the estimated fringe direction map, filtered fringe pattern by automatic selective reconstruction modification and enhanced fast empirical mode decomposition (ASRm-EFEMD) is used for Hilbert spiral transform (HST) to demodulate the phase. Subsequently, windowed Fourier ridge (WFR) method is used for the refinement of the phase. The robustness and effectiveness of proposed method are demonstrated by both simulated and experimental fringe patterns.

  18. Docking and multivariate methods to explore HIV-1 drug-resistance: a comparative analysis

    NASA Astrophysics Data System (ADS)

    Almerico, Anna Maria; Tutone, Marco; Lauria, Antonino

    2008-05-01

    In this paper we describe a comparative analysis between multivariate and docking methods in the study of the drug resistance to the reverse transcriptase and the protease inhibitors. In our early papers we developed a simple but efficient method to evaluate the features of compounds that are less likely to trigger resistance or are effective against mutant HIV strains, using the multivariate statistical procedures PCA and DA. In the attempt to create a more solid background for the prediction of susceptibility or resistance, we carried out a comparative analysis between our previous multivariate approach and molecular docking study. The intent of this paper is not only to find further support to the results obtained by the combined use of PCA and DA, but also to evidence the structural features, in terms of molecular descriptors, similarity, and energetic contributions, derived from docking, which can account for the arising of drug-resistance against mutant strains.

  19. On a PCA-based lung motion model

    NASA Astrophysics Data System (ADS)

    Li, Ruijiang; Lewis, John H.; Jia, Xun; Zhao, Tianyu; Liu, Weifeng; Wuenschel, Sara; Lamb, James; Yang, Deshan; Low, Daniel A.; Jiang, Steve B.

    2011-09-01

    Respiration-induced organ motion is one of the major uncertainties in lung cancer radiotherapy and is crucial to be able to accurately model the lung motion. Most work so far has focused on the study of the motion of a single point (usually the tumor center of mass), and much less work has been done to model the motion of the entire lung. Inspired by the work of Zhang et al (2007 Med. Phys. 34 4772-81), we believe that the spatiotemporal relationship of the entire lung motion can be accurately modeled based on principle component analysis (PCA) and then a sparse subset of the entire lung, such as an implanted marker, can be used to drive the motion of the entire lung (including the tumor). The goal of this work is twofold. First, we aim to understand the underlying reason why PCA is effective for modeling lung motion and find the optimal number of PCA coefficients for accurate lung motion modeling. We attempt to address the above important problems both in a theoretical framework and in the context of real clinical data. Second, we propose a new method to derive the entire lung motion using a single internal marker based on the PCA model. The main results of this work are as follows. We derived an important property which reveals the implicit regularization imposed by the PCA model. We then studied the model using two mathematical respiratory phantoms and 11 clinical 4DCT scans for eight lung cancer patients. For the mathematical phantoms with cosine and an even power (2n) of cosine motion, we proved that 2 and 2n PCA coefficients and eigenvectors will completely represent the lung motion, respectively. Moreover, for the cosine phantom, we derived the equivalence conditions for the PCA motion model and the physiological 5D lung motion model (Low et al 2005 Int. J. Radiat. Oncol. Biol. Phys. 63 921-9). For the clinical 4DCT data, we demonstrated the modeling power and generalization performance of the PCA model. The average 3D modeling error using PCA was within 1 mm (0.7 ± 0.1 mm). When a single artificial internal marker was used to derive the lung motion, the average 3D error was found to be within 2 mm (1.8 ± 0.3 mm) through comprehensive statistical analysis. The optimal number of PCA coefficients needs to be determined on a patient-by-patient basis and two PCA coefficients seem to be sufficient for accurate modeling of the lung motion for most patients. In conclusion, we have presented thorough theoretical analysis and clinical validation of the PCA lung motion model. The feasibility of deriving the entire lung motion using a single marker has also been demonstrated on clinical data using a simulation approach.

  20. Generation of Boundary Manikin Anthropometry

    NASA Technical Reports Server (NTRS)

    Young, Karen S.; Margerum, Sarah; Barr, Abbe; Ferrer, Mike A.; Rajulu, Sudhakar

    2008-01-01

    The purpose of this study was to develop 3D digital boundary manikins that are representative of the anthropometry of a unique population. These digital manikins can be used by designers to verify and validate that the components of the spacesuit design satisfy the requirements specified in the Human Systems Integration Requirements (HSIR) document. Currently, the HSIR requires the suit to accommodate the 1st percentile American female to the 99th percentile American male. The manikin anthropometry was derived using two methods: Principal Component Analysis (PCA) and Whole Body Posture Based Analysis (WBPBA). PCA is a statistical method for reducing a multidimensional data set by using eigenvectors and eigenvalues. The goal is to create a reduced data set that encapsulates the majority of the variation in the population. WBPBA is a multivariate analytical approach that was developed by the Anthropometry and Biomechanics Facility (ABF) to identify the extremes of the population for a given body posture. WBPBA is a simulation-based method that finds extremes in a population based on anthropometry and posture whereas PCA is based solely on anthropometry. Both methods yield a list of subjects and their anthropometry from the target population; PCA resulted in 20 female and 22 male subjects anthropometry and WBPBA resulted in 7 subjects' anthropometry representing the extreme subjects in the target population. The subjects anthropometry is then used to 'morph' a baseline digital scan of a person with the same body type to create a 3D digital model that can be used as a tool for designers, the details of which will be discussed in subsequent papers.

  1. Quorum sensing systems differentially regulate the production of phenazine-1-carboxylic acid in the rhizobacterium Pseudomonas aeruginosa PA1201

    PubMed Central

    Sun, Shuang; Zhou, Lian; Jin, Kaiming; Jiang, Haixia; He, Ya-Wen

    2016-01-01

    Pseudomonas aeruginosa strain PA1201 is a newly identified rhizobacterium that produces high levels of the secondary metabolite phenazine-1-carboxylic acid (PCA), the newly registered biopesticide Shenqinmycin. PCA production in liquid batch cultures utilizing a specialized PCA-promoting medium (PPM) typically occurs after the period of most rapid growth, and production is regulated in a quorum sensing (QS)-dependent manner. PA1201 contains two PCA biosynthetic gene clusters phz1 and phz2; both clusters contribute to PCA production, with phz2 making a greater contribution. PA1201 also contains a complete set of genes for four QS systems (LasI/LasR, RhlI/RhlR, PQS/MvfR, and IQS). By using several methods including gene deletion, the construction of promoter-lacZ fusion reporter strains, and RNA-Seq analysis, this study investigated the effects of the four QS systems on bacterial growth, QS signal production, the expression of phz1 and phz2, and PCA production. The possible mechanisms for the strain- and condition-dependent expression of phz1 and phz2 were discussed, and a schematic model was proposed. These findings provide a basis for further genetic engineering of the QS systems to improve PCA production. PMID:27456813

  2. The pre-image problem in kernel methods.

    PubMed

    Kwok, James Tin-yau; Tsang, Ivor Wai-hung

    2004-11-01

    In this paper, we address the problem of finding the pre-image of a feature vector in the feature space induced by a kernel. This is of central importance in some kernel applications, such as on using kernel principal component analysis (PCA) for image denoising. Unlike the traditional method which relies on nonlinear optimization, our proposed method directly finds the location of the pre-image based on distance constraints in the feature space. It is noniterative, involves only linear algebra and does not suffer from numerical instability or local minimum problems. Evaluations on performing kernel PCA and kernel clustering on the USPS data set show much improved performance.

  3. Spatial Mapping of Pyocyanin in Pseudomonas aeruginosa Bacterial Communities by Surface Enhanced Raman Scattering

    PubMed Central

    Polisetti, Sneha; Baig, Nameera F.; Morales-Soto, Nydia; Shrout, Joshua D.; Bohn, Paul W.

    2017-01-01

    Surface Enhanced Raman Spectroscopy (SERS) imaging was used in conjunction with Principal Component Analysis (PCA) for the in situ spatiotemporal mapping of the virulence factor pyocyanin, in communities of the pathogenic bacterium Pseudomonas aeruginosa. The combination of SERS imaging and PCA analysis provides a robust method for characterization of heterogeneous biological systems while circumventing issues associated with interference from sample autofluorescence and low reproducibility of SERS signals. The production of pyocyanin is found to depend both on the growth carbon source and on the specific strain of P. aeruginosa studied. A cystic fibrosis lung isolate strain of P. aeruginosa synthesizes and secretes pyocyanin when grown with glucose and glutamate, while the laboratory strain exhibits detectable production of pyocyanin only when grown with glutamate as the source of carbon. Pyocyanin production in the laboratory strain grown with glucose was below the limit of detection of SERS. In addition, the combination of SERS imaging and PCA can elucidate subtle differences in the molecular composition of biofilms. PCA loading plots from the clinical isolate exhibit features corresponding to vibrational bands of carbohydrates, which represent the mucoid biofilm matrix specific to that isolate, features that are not seen in the PCA loading plots of the laboratory strain. PMID:27354400

  4. Scalable Robust Principal Component Analysis Using Grassmann Averages.

    PubMed

    Hauberg, Sren; Feragen, Aasa; Enficiaud, Raffi; Black, Michael J

    2016-11-01

    In large datasets, manual data verification is impossible, and we must expect the number of outliers to increase with data size. While principal component analysis (PCA) can reduce data size, and scalable solutions exist, it is well-known that outliers can arbitrarily corrupt the results. Unfortunately, state-of-the-art approaches for robust PCA are not scalable. We note that in a zero-mean dataset, each observation spans a one-dimensional subspace, giving a point on the Grassmann manifold. We show that the average subspace corresponds to the leading principal component for Gaussian data. We provide a simple algorithm for computing this Grassmann Average ( GA), and show that the subspace estimate is less sensitive to outliers than PCA for general distributions. Because averages can be efficiently computed, we immediately gain scalability. We exploit robust averaging to formulate the Robust Grassmann Average (RGA) as a form of robust PCA. The resulting Trimmed Grassmann Average ( TGA) is appropriate for computer vision because it is robust to pixel outliers. The algorithm has linear computational complexity and minimal memory requirements. We demonstrate TGA for background modeling, video restoration, and shadow removal. We show scalability by performing robust PCA on the entire Star Wars IV movie; a task beyond any current method. Source code is available online.

  5. A measure for objects clustering in principal component analysis biplot: A case study in inter-city buses maintenance cost data

    NASA Astrophysics Data System (ADS)

    Ginanjar, Irlandia; Pasaribu, Udjianna S.; Indratno, Sapto W.

    2017-03-01

    This article presents the application of the principal component analysis (PCA) biplot for the needs of data mining. This article aims to simplify and objectify the methods for objects clustering in PCA biplot. The novelty of this paper is to get a measure that can be used to objectify the objects clustering in PCA biplot. Orthonormal eigenvectors, which are the coefficients of a principal component model representing an association between principal components and initial variables. The existence of the association is a valid ground to objects clustering based on principal axes value, thus if m principal axes used in the PCA, then the objects can be classified into 2m clusters. The inter-city buses are clustered based on maintenance costs data by using two principal axes PCA biplot. The buses are clustered into four groups. The first group is the buses with high maintenance costs, especially for lube, and brake canvass. The second group is the buses with high maintenance costs, especially for tire, and filter. The third group is the buses with low maintenance costs, especially for lube, and brake canvass. The fourth group is buses with low maintenance costs, especially for tire, and filter.

  6. Performance analysis of robust road sign identification

    NASA Astrophysics Data System (ADS)

    Ali, Nursabillilah M.; Mustafah, Y. M.; Rashid, N. K. A. M.

    2013-12-01

    This study describes performance analysis of a robust system for road sign identification that incorporated two stages of different algorithms. The proposed algorithms consist of HSV color filtering and PCA techniques respectively in detection and recognition stages. The proposed algorithms are able to detect the three standard types of colored images namely Red, Yellow and Blue. The hypothesis of the study is that road sign images can be used to detect and identify signs that are involved with the existence of occlusions and rotational changes. PCA is known as feature extraction technique that reduces dimensional size. The sign image can be easily recognized and identified by the PCA method as is has been used in many application areas. Based on the experimental result, it shows that the HSV is robust in road sign detection with minimum of 88% and 77% successful rate for non-partial and partial occlusions images. For successful recognition rates using PCA can be achieved in the range of 94-98%. The occurrences of all classes are recognized successfully is between 5% and 10% level of occlusions.

  7. Reconstructing the free-energy landscape of Met-enkephalin using dihedral principal component analysis and well-tempered metadynamics

    NASA Astrophysics Data System (ADS)

    Sicard, François; Senet, Patrick

    2013-06-01

    Well-Tempered Metadynamics (WTmetaD) is an efficient method to enhance the reconstruction of the free-energy surface of proteins. WTmetaD guarantees a faster convergence in the long time limit in comparison with the standard metadynamics. It still suffers, however, from the same limitation, i.e., the non-trivial choice of pertinent collective variables (CVs). To circumvent this problem, we couple WTmetaD with a set of CVs generated from a dihedral Principal Component Analysis (dPCA) on the Ramachandran dihedral angles describing the backbone structure of the protein. The dPCA provides a generic method to extract relevant CVs built from internal coordinates, and does not depend on the alignment to an arbitrarily chosen reference structure as usual in Cartesian PCA. We illustrate the robustness of this method in the case of a reference model protein, the small and very diffusive Met-enkephalin pentapeptide. We propose a justification a posteriori of the considered number of CVs necessary to bias the metadynamics simulation in terms of the one-dimensional free-energy profiles associated with Ramachandran dihedral angles along the amino-acid sequence.

  8. Reconstructing the free-energy landscape of Met-enkephalin using dihedral principal component analysis and well-tempered metadynamics.

    PubMed

    Sicard, François; Senet, Patrick

    2013-06-21

    Well-Tempered Metadynamics (WTmetaD) is an efficient method to enhance the reconstruction of the free-energy surface of proteins. WTmetaD guarantees a faster convergence in the long time limit in comparison with the standard metadynamics. It still suffers, however, from the same limitation, i.e., the non-trivial choice of pertinent collective variables (CVs). To circumvent this problem, we couple WTmetaD with a set of CVs generated from a dihedral Principal Component Analysis (dPCA) on the Ramachandran dihedral angles describing the backbone structure of the protein. The dPCA provides a generic method to extract relevant CVs built from internal coordinates, and does not depend on the alignment to an arbitrarily chosen reference structure as usual in Cartesian PCA. We illustrate the robustness of this method in the case of a reference model protein, the small and very diffusive Met-enkephalin pentapeptide. We propose a justification a posteriori of the considered number of CVs necessary to bias the metadynamics simulation in terms of the one-dimensional free-energy profiles associated with Ramachandran dihedral angles along the amino-acid sequence.

  9. Issues in the construction of wealth indices for the measurement of socio-economic position in low-income countries

    PubMed Central

    Howe, Laura D; Hargreaves, James R; Huttly, Sharon RA

    2008-01-01

    Background Epidemiological studies often require measures of socio-economic position (SEP). The application of principal components analysis (PCA) to data on asset-ownership is one popular approach to household SEP measurement. Proponents suggest that the approach provides a rational method for weighting asset data in a single indicator, captures the most important aspect of SEP for health studies, and is based on data that are readily available and/or simple to collect. However, the use of PCA on asset data may not be the best approach to SEP measurement. There remains concern that this approach can obscure the meaning of the final index and is statistically inappropriate for use with discrete data. In addition, the choice of assets to include and the level of agreement between wealth indices and more conventional measures of SEP such as consumption expenditure remain unclear. We discuss these issues, illustrating our examples with data from the Malawi Integrated Household Survey 2004–5. Methods Wealth indices were constructed using the assets on which data are collected within Demographic and Health Surveys. Indices were constructed using five weighting methods: PCA, PCA using dichotomised versions of categorical variables, equal weights, weights equal to the inverse of the proportion of households owning the item, and Multiple Correspondence Analysis. Agreement between indices was assessed. Indices were compared with per capita consumption expenditure, and the difference in agreement assessed when different methods were used to adjust consumption expenditure for household size and composition. Results All indices demonstrated similarly modest agreement with consumption expenditure. The indices constructed using dichotomised data showed strong agreement with each other, as did the indices constructed using categorical data. Agreement was lower between indices using data coded in different ways. The level of agreement between wealth indices and consumption expenditure did not differ when different consumption equivalence scales were applied. Conclusion This study questions the appropriateness of wealth indices as proxies for consumption expenditure. The choice of data included had a greater influence on the wealth index than the method used to weight the data. Despite the limitations of PCA, alternative methods also all had disadvantages. PMID:18234082

  10. Principal elementary mode analysis (PEMA).

    PubMed

    Folch-Fortuny, Abel; Marques, Rodolfo; Isidro, Inês A; Oliveira, Rui; Ferrer, Alberto

    2016-03-01

    Principal component analysis (PCA) has been widely applied in fluxomics to compress data into a few latent structures in order to simplify the identification of metabolic patterns. These latent structures lack a direct biological interpretation due to the intrinsic constraints associated with a PCA model. Here we introduce a new method that significantly improves the interpretability of the principal components with a direct link to metabolic pathways. This method, called principal elementary mode analysis (PEMA), establishes a bridge between a PCA-like model, aimed at explaining the maximum variance in flux data, and the set of elementary modes (EMs) of a metabolic network. It provides an easy way to identify metabolic patterns in large fluxomics datasets in terms of the simplest pathways of the organism metabolism. The results using a real metabolic model of Escherichia coli show the ability of PEMA to identify the EMs that generated the different simulated flux distributions. Actual flux data of E. coli and Pichia pastoris cultures confirm the results observed in the simulated study, providing a biologically meaningful model to explain flux data of both organisms in terms of the EM activation. The PEMA toolbox is freely available for non-commercial purposes on http://mseg.webs.upv.es.

  11. Descriptive sensory analysis in different classes of orange juice by a robust free-choice profile method.

    PubMed

    Pérez Aparicio, Jesús; Toledano Medina, M Angeles; Lafuente Rosales, Victoria

    2007-07-09

    Free-choice profile (FCP), developed in the 1980s, is a sensory analysis method that can be carried out by untrained panels. The participants need only to be able to use a scale and be consumers of the product under evaluation. The data are analysed by sophisticated statistical methodologies like Generalized Procrustean Analysis (GPA) or STATIS. To facilitate a wider use of the free-choice profiling procedure, different authors have advocated simpler methods based on principal components analysis (PCA) of merged data sets. The purpose of this work was to apply another easy procedure to this type of data by means of a robust PCA. The most important characteristic of the proposed method is that quality responsible managers could use this methodology without any scale evaluation. Only the free terms generated by the assessors are necessary to apply the script, thus avoiding the error associated with scale utilization by inexpert assessors. Also, it is possible to use the application with missing data and with differences in the assessors' attendance at sessions. An example was performed to generate the descriptors from different orange juice types. The results were compared with the STATIS method and with the PCA on the merged data sets. The samples evaluated were fresh orange juices with differences in storage days and pasteurized, concentrated and orange nectar drinks from different brands. Eighteen assessors with a low-level training program were used in a six-session free-choice profile framework. The results proved that this script could be of use in marketing decisions and product quality program development.

  12. Classification of Hyperspectral Data Based on Guided Filtering and Random Forest

    NASA Astrophysics Data System (ADS)

    Ma, H.; Feng, W.; Cao, X.; Wang, L.

    2017-09-01

    Hyperspectral images usually consist of more than one hundred spectral bands, which have potentials to provide rich spatial and spectral information. However, the application of hyperspectral data is still challengeable due to "the curse of dimensionality". In this context, many techniques, which aim to make full use of both the spatial and spectral information, are investigated. In order to preserve the geometrical information, meanwhile, with less spectral bands, we propose a novel method, which combines principal components analysis (PCA), guided image filtering and the random forest classifier (RF). In detail, PCA is firstly employed to reduce the dimension of spectral bands. Secondly, the guided image filtering technique is introduced to smooth land object, meanwhile preserving the edge of objects. Finally, the features are fed into RF classifier. To illustrate the effectiveness of the method, we carry out experiments over the popular Indian Pines data set, which is collected by Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) sensor. By comparing the proposed method with the method of only using PCA or guided image filter, we find that effect of the proposed method is better.

  13. [Research on Rapid Discrimination of Edible Oil by ATR Infrared Spectroscopy].

    PubMed

    Ma, Xiao; Yuan, Hong-fu; Song, Chun-feng; Hu, Ai-qin; Li, Xiao-yu; Zhao, Zhong; Li, Xiu-qin; Guo Zhen; Zhu, Zhi-qiang

    2015-07-01

    A rapid discrimination method of edible oils, KL-BP model, was proposed by attenuated total reflectance infrared spectroscopy. The model extracts the characteristic of classification from source data by KL and reduces data dimension at the same time. Then the neural network model is constructed by the new data which as the input of the model. 84 edible oil samples which include sesame oil, corn oil, canola oil, blend oil, sunflower oil, peanut oil, olive oil, soybean oil and tea seed oil, were collected and their infrared spectra determined using an ATR FT-IR spectrometer. In order to compare the method performance, principal component analysis (PCA) direct-classification model, KL direct-classification model, PLS-DA model, PCA-BP model and KL-BP model are constructed in this paper. The results show that the recognition rates of PCA, PCA-BP, KL, PLS-DA and KL-BP are 59.1%, 68.2%, 77.3%, 77.3% and 90.9% for discriminating the 9 kinds of edible oils, respectively. KL extracts the eigenvector which make the distance between different class and distance of every class ratio is the largest. So the method can get much more classify information than PCA. BP neural network can effectively enhance the classification ability and accuracy. Taking full of the advantages of KL in extracting more category information in dimension reducing and the features of BP neural network in self-learning, adaptive, nonlinear, the KL-BP method has the best classification ability and recognition accuracy and great importance for rapidly recognizing edible oil in practice.

  14. Evaluating motion processing algorithms for use with functional near-infrared spectroscopy data from young children.

    PubMed

    Delgado Reyes, Lourdes M; Bohache, Kevin; Wijeakumar, Sobanawartiny; Spencer, John P

    2018-04-01

    Motion artifacts are often a significant component of the measured signal in functional near-infrared spectroscopy (fNIRS) experiments. A variety of methods have been proposed to address this issue, including principal components analysis (PCA), correlation-based signal improvement (CBSI), wavelet filtering, and spline interpolation. The efficacy of these techniques has been compared using simulated data; however, our understanding of how these techniques fare when dealing with task-based cognitive data is limited. Brigadoi et al. compared motion correction techniques in a sample of adult data measured during a simple cognitive task. Wavelet filtering showed the most promise as an optimal technique for motion correction. Given that fNIRS is often used with infants and young children, it is critical to evaluate the effectiveness of motion correction techniques directly with data from these age groups. This study addresses that problem by evaluating motion correction algorithms implemented in HomER2. The efficacy of each technique was compared quantitatively using objective metrics related to the physiological properties of the hemodynamic response. Results showed that targeted PCA (tPCA), spline, and CBSI retained a higher number of trials. These techniques also performed well in direct head-to-head comparisons with the other approaches using quantitative metrics. The CBSI method corrected many of the artifacts present in our data; however, this approach produced sometimes unstable HRFs. The targeted PCA and spline methods proved to be the most robust, performing well across all comparison metrics. When compared head to head, tPCA consistently outperformed spline. We conclude, therefore, that tPCA is an effective technique for correcting motion artifacts in fNIRS data from young children.

  15. Liquid chromatography tandem mass spectrometry determination of chemical markers and principal component analysis of Vitex agnus-castus L. fruits (Verbenaceae) and derived food supplements.

    PubMed

    Mari, Angela; Montoro, Paola; Pizza, Cosimo; Piacente, Sonia

    2012-11-01

    A validated analytical method for the quantitative determination of seven chemical markers occurring in a hydroalcoholic extract of Vitex agnus-castus fruits by liquid chromatography electrospray triple quadrupole tandem mass spectrometry (LC/ESI/(QqQ)MSMS) is reported. To carry out a comparative study, five commercial food supplements corresponding to hydroalcoholic extracts of V. agnus-castus fruits were analysed under the same chromatographic conditions of the crude extract. Principal component analysis (PCA), based only on the variation of the amount of the seven chemical markers, was applied in order to find similarities between the hydroalcoholic extract and the food supplements. A second PCA analysis was carried out considering the whole spectroscopic data deriving from liquid chromatography electrospray linear ion trap mass spectrometry (LC/ESI/(LIT)MS) analysis. High similarity between the two PCA was observed, showing the possibility to select one of these two approaches for future applications in the field of comparative analysis of food supplements and quality control procedures. Copyright © 2012 Elsevier B.V. All rights reserved.

  16. Energy resolution improvement of CdTe detectors by using the principal component analysis technique

    NASA Astrophysics Data System (ADS)

    Alharbi, T.

    2018-02-01

    In this paper, we report on the application of the Principal Component Analysis (PCA) technique for the improvement of the γ-ray energy resolution of CdTe detectors. The PCA technique is used to estimate the amount of charge-trapping effect which is reflected in the shape of each detector pulse, thereby correcting for the charge-trapping effect. The details of the method are described and the results obtained with a CdTe detector are shown. We have achieved an energy resolution of 1.8 % (FWHM) at 662 keV with full detection efficiency from a 1 mm thick CdTe detector which gives an energy resolution of 4.5 % (FWHM) by using the standard pulse processing method.

  17. Representation of Probability Density Functions from Orbit Determination using the Particle Filter

    NASA Technical Reports Server (NTRS)

    Mashiku, Alinda K.; Garrison, James; Carpenter, J. Russell

    2012-01-01

    Statistical orbit determination enables us to obtain estimates of the state and the statistical information of its region of uncertainty. In order to obtain an accurate representation of the probability density function (PDF) that incorporates higher order statistical information, we propose the use of nonlinear estimation methods such as the Particle Filter. The Particle Filter (PF) is capable of providing a PDF representation of the state estimates whose accuracy is dependent on the number of particles or samples used. For this method to be applicable to real case scenarios, we need a way of accurately representing the PDF in a compressed manner with little information loss. Hence we propose using the Independent Component Analysis (ICA) as a non-Gaussian dimensional reduction method that is capable of maintaining higher order statistical information obtained using the PF. Methods such as the Principal Component Analysis (PCA) are based on utilizing up to second order statistics, hence will not suffice in maintaining maximum information content. Both the PCA and the ICA are applied to two scenarios that involve a highly eccentric orbit with a lower apriori uncertainty covariance and a less eccentric orbit with a higher a priori uncertainty covariance, to illustrate the capability of the ICA in relation to the PCA.

  18. Behavior of the PCA3 gene in the urine of men with high grade prostatic intraepithelial neoplasia.

    PubMed

    Morote, Juan; Rigau, Marina; Garcia, Marta; Mir, Carmen; Ballesteros, Carlos; Planas, Jacques; Raventós, Carles X; Placer, José; de Torres, Inés M; Reventós, Jaume; Doll, Andreas

    2010-12-01

    An ideal marker for the early detection of prostate cancer (PCa) should also differentiate between men with isolated high grade prostatic intraepithelial neoplasia (HGPIN) and those with PCa. Prostate Cancer Gene 3 (PCA3) is a highly specific PCa gene and its score, in relation to the PSA gene in post-prostate massage urine (PMU-PCA3), seems to be useful in ruling out PCa, especially after a negative prostate biopsy. Because PCA3 is also expressed in the HGPIN lesion, the aim of this study was to determine the efficacy of PMU-PCA3 scores for ruling out PCa in men with previous HGPIN. The PMU-PCA3 score was assessed by quantitative PCR (multiplex research assay) in 244 men subjected to prostate biopsy: 64 men with an isolated HGPIN (no cancer detected after two or more repeated biopsies), 83 men with PCa and 97 men with benign pathology findings (BP: no PCa, HGPIN or ASAP). The median PMU-PCA3 score was 1.56 in men with BP, 2.01 in men with HGPIN (p = 0.128) and 9.06 in men with PCa (p = 0.008). The AUC in the ROC analysis was 0.705 in the subset of men with BP and PCa, while it decreased to 0.629 when only men with isolated HGPIN and PCa were included in the analysis. Fixing the sensitivity of the PMU-PCA3 score at 90%, its specificity was 79% in men with BP and 69% in men with isolated HGPIN. The efficacy of the PMU-PCA3 score to rule out PCa in men with HGPIN is lower than in men with BP.

  19. TARGETED PRINCIPLE COMPONENT ANALYSIS: A NEW MOTION ARTIFACT CORRECTION APPROACH FOR NEAR-INFRARED SPECTROSCOPY

    PubMed Central

    YÜCEL, MERYEM A.; SELB, JULIETTE; COOPER, ROBERT J.; BOAS, DAVID A.

    2014-01-01

    As near-infrared spectroscopy (NIRS) broadens its application area to different age and disease groups, motion artifacts in the NIRS signal due to subject movement is becoming an important challenge. Motion artifacts generally produce signal fluctuations that are larger than physiological NIRS signals, thus it is crucial to correct for them before obtaining an estimate of stimulus evoked hemodynamic responses. There are various methods for correction such as principle component analysis (PCA), wavelet-based filtering and spline interpolation. Here, we introduce a new approach to motion artifact correction, targeted principle component analysis (tPCA), which incorporates a PCA filter only on the segments of data identified as motion artifacts. It is expected that this will overcome the issues of filtering desired signals that plagues standard PCA filtering of entire data sets. We compared the new approach with the most effective motion artifact correction algorithms on a set of data acquired simultaneously with a collodion-fixed probe (low motion artifact content) and a standard Velcro probe (high motion artifact content). Our results show that tPCA gives statistically better results in recovering hemodynamic response function (HRF) as compared to wavelet-based filtering and spline interpolation for the Velcro probe. It results in a significant reduction in mean-squared error (MSE) and significant enhancement in Pearson’s correlation coefficient to the true HRF. The collodion-fixed fiber probe with no motion correction performed better than the Velcro probe corrected for motion artifacts in terms of MSE and Pearson’s correlation coefficient. Thus, if the experimental study permits, the use of a collodion-fixed fiber probe may be desirable. If the use of a collodion-fixed probe is not feasible, then we suggest the use of tPCA in the processing of motion artifact contaminated data. PMID:25360181

  20. Bedside ROP screening and telemedicine interpretation integrated to a neonatal transport system: Economic aspects and return on investment analysis.

    PubMed

    Kovács, Gábor; Somogyvári, Zsolt; Maka, Erika; Nagyjánosi, László

    Peter Cerny Ambulance Service - Premature Eye Rescue Program (PCA-PERP) uses digital retinal imaging (DRI) with remote interpretation in bedside ROP screening, which has advantages over binocular indirect ophthalmoscopy (BIO) in screening of premature newborns. We aimed to demonstrate that PCA-PERP provides good value for the money and to model the cost ramifications of a similar newly launched system. As DRI was demonstrated to have high diagnostic performance, only the costs of bedside DRI-based screening were compared to those of traditional transport and BIO-based screening (cost-minimization analysis). The total costs of investment and maintenance were analyzed with micro-costing method. A ten-year analysis time-horizon and service provider's perspective were applied. From the launch of PCA-PERP up to the end of 2014, 3722 bedside examinations were performed in the PCA covered central region of Hungary. From 2009 to 2014, PCA-PERP saved 92,248km and 3633 staff working hours, with an annual nominal cost-savings ranging from 17,435 to 35,140 Euro. The net present value was 127,847 Euro at the end of 2014, with a payback period of 4.1years and an internal rate of return of 20.8%. Our model presented the NPVs of different scenarios with different initial investments, annual number of transports and average transport distances. PCA-PERP as bedside screening with remote interpretation, when compared to a transport-based screening with BIO, produced better cost-savings from the perspective of the service provider and provided a return on initial investment within five years after the project initiation. Copyright © 2017 Elsevier B.V. All rights reserved.

  1. Principal component analysis of the CT density histogram to generate parametric response maps of COPD

    NASA Astrophysics Data System (ADS)

    Zha, N.; Capaldi, D. P. I.; Pike, D.; McCormack, D. G.; Cunningham, I. A.; Parraga, G.

    2015-03-01

    Pulmonary x-ray computed tomography (CT) may be used to characterize emphysema and airways disease in patients with chronic obstructive pulmonary disease (COPD). One analysis approach - parametric response mapping (PMR) utilizes registered inspiratory and expiratory CT image volumes and CT-density-histogram thresholds, but there is no consensus regarding the threshold values used, or their clinical meaning. Principal-component-analysis (PCA) of the CT density histogram can be exploited to quantify emphysema using data-driven CT-density-histogram thresholds. Thus, the objective of this proof-of-concept demonstration was to develop a PRM approach using PCA-derived thresholds in COPD patients and ex-smokers without airflow limitation. Methods: Fifteen COPD ex-smokers and 5 normal ex-smokers were evaluated. Thoracic CT images were also acquired at full inspiration and full expiration and these images were non-rigidly co-registered. PCA was performed for the CT density histograms, from which the components with the highest eigenvalues greater than one were summed. Since the values of the principal component curve correlate directly with the variability in the sample, the maximum and minimum points on the curve were used as threshold values for the PCA-adjusted PRM technique. Results: A significant correlation was determined between conventional and PCA-adjusted PRM with 3He MRI apparent diffusion coefficient (p<0.001), with CT RA950 (p<0.0001), as well as with 3He MRI ventilation defect percent, a measurement of both small airways disease (p=0.049 and p=0.06, respectively) and emphysema (p=0.02). Conclusions: PRM generated using PCA thresholds of the CT density histogram showed significant correlations with CT and 3He MRI measurements of emphysema, but not airways disease.

  2. Origin of fecal contamination in waters from contrasted areas: stanols as Microbial Source Tracking markers.

    PubMed

    Derrien, M; Jardé, E; Gruau, G; Pourcher, A M; Gourmelon, M; Jadas-Hécart, A; Pierson Wickmann, A C

    2012-09-01

    Improving the microbiological quality of coastal and river waters relies on the development of reliable markers that are capable of determining sources of fecal pollution. Recently, a principal component analysis (PCA) method based on six stanol compounds (i.e. 5β-cholestan-3β-ol (coprostanol), 5β-cholestan-3α-ol (epicoprostanol), 24-methyl-5α-cholestan-3β-ol (campestanol), 24-ethyl-5α-cholestan-3β-ol (sitostanol), 24-ethyl-5β-cholestan-3β-ol (24-ethylcoprostanol) and 24-ethyl-5β-cholestan-3α-ol (24-ethylepicoprostanol)) was shown to be suitable for distinguishing between porcine and bovine feces. In this study, we tested if this PCA method, using the above six stanols, could be used as a tool in "Microbial Source Tracking (MST)" methods in water from areas of intensive agriculture where diffuse fecal contamination is often marked by the co-existence of human and animal sources. In particular, well-defined and stable clusters were found in PCA score plots clustering samples of "pure" human, bovine and porcine feces along with runoff and diluted waters in which the source of contamination is known. A good consistency was also observed between the source assignments made by the 6-stanol-based PCA method and the microbial markers for river waters contaminated by fecal matter of unknown origin. More generally, the tests conducted in this study argue for the addition of the PCA method based on six stanols in the MST toolbox to help identify fecal contamination sources. The data presented in this study show that this addition would improve the determination of fecal contamination sources when the contamination levels are low to moderate. Copyright © 2012 Elsevier Ltd. All rights reserved.

  3. Image restoration for three-dimensional fluorescence microscopy using an orthonormal basis for efficient representation of depth-variant point-spread functions

    PubMed Central

    Patwary, Nurmohammed; Preza, Chrysanthe

    2015-01-01

    A depth-variant (DV) image restoration algorithm for wide field fluorescence microscopy, using an orthonormal basis decomposition of DV point-spread functions (PSFs), is investigated in this study. The efficient PSF representation is based on a previously developed principal component analysis (PCA), which is computationally intensive. We present an approach developed to reduce the number of DV PSFs required for the PCA computation, thereby making the PCA-based approach computationally tractable for thick samples. Restoration results from both synthetic and experimental images show consistency and that the proposed algorithm addresses efficiently depth-induced aberration using a small number of principal components. Comparison of the PCA-based algorithm with a previously-developed strata-based DV restoration algorithm demonstrates that the proposed method improves performance by 50% in terms of accuracy and simultaneously reduces the processing time by 64% using comparable computational resources. PMID:26504634

  4. Discrimination of premalignant lesions and cancer tissues from normal gastric tissues using Raman spectroscopy

    NASA Astrophysics Data System (ADS)

    Luo, Shuwen; Chen, Changshui; Mao, Hua; Jin, Shaoqin

    2013-06-01

    The feasibility of early detection of gastric cancer using near-infrared (NIR) Raman spectroscopy (RS) by distinguishing premalignant lesions (adenomatous polyp, n=27) and cancer tissues (adenocarcinoma, n=33) from normal gastric tissues (n=45) is evaluated. Significant differences in Raman spectra are observed among the normal, adenomatous polyp, and adenocarcinoma gastric tissues at 936, 1003, 1032, 1174, 1208, 1323, 1335, 1450, and 1655 cm-1. Diverse statistical methods are employed to develop effective diagnostic algorithms for classifying the Raman spectra of different types of ex vivo gastric tissues, including principal component analysis (PCA), linear discriminant analysis (LDA), and naive Bayesian classifier (NBC) techniques. Compared with PCA-LDA algorithms, PCA-NBC techniques together with leave-one-out, cross-validation method provide better discriminative results of normal, adenomatous polyp, and adenocarcinoma gastric tissues, resulting in superior sensitivities of 96.3%, 96.9%, and 96.9%, and specificities of 93%, 100%, and 95.2%, respectively. Therefore, NIR RS associated with multivariate statistical algorithms has the potential for early diagnosis of gastric premalignant lesions and cancer tissues in molecular level.

  5. Performance analysis of a Principal Component Analysis ensemble classifier for Emotiv headset P300 spellers.

    PubMed

    Elsawy, Amr S; Eldawlatly, Seif; Taher, Mohamed; Aly, Gamal M

    2014-01-01

    The current trend to use Brain-Computer Interfaces (BCIs) with mobile devices mandates the development of efficient EEG data processing methods. In this paper, we demonstrate the performance of a Principal Component Analysis (PCA) ensemble classifier for P300-based spellers. We recorded EEG data from multiple subjects using the Emotiv neuroheadset in the context of a classical oddball P300 speller paradigm. We compare the performance of the proposed ensemble classifier to the performance of traditional feature extraction and classifier methods. Our results demonstrate the capability of the PCA ensemble classifier to classify P300 data recorded using the Emotiv neuroheadset with an average accuracy of 86.29% on cross-validation data. In addition, offline testing of the recorded data reveals an average classification accuracy of 73.3% that is significantly higher than that achieved using traditional methods. Finally, we demonstrate the effect of the parameters of the P300 speller paradigm on the performance of the method.

  6. Computer aided detection in prostate cancer diagnostics: A promising alternative to biopsy? A retrospective study from 104 lesions with histological ground truth

    PubMed Central

    Thon, Anika; Teichgräber, Ulf; Tennstedt-Schenk, Cornelia; Hadjidemetriou, Stathis; Winzler, Sven; Malich, Ansgar

    2017-01-01

    Background Prostate cancer (PCa) diagnosis by means of multiparametric magnetic resonance imaging (mpMRI) is a current challenge for the development of computer-aided detection (CAD) tools. An innovative CAD-software (Watson Elementary™) was proposed to achieve high sensitivity and specificity, as well as to allege a correlate to Gleason grade. Aim/Objective To assess the performance of Watson Elementary™ in automated PCa diagnosis in our hospital´s database of MRI-guided prostate biopsies. Methods The evaluation was retrospective for 104 lesions (47 PCa, 57 benign) from 79, 64.61±6.64 year old patients using 3T T2-weighted imaging, Apparent Diffusion Coefficient (ADC) maps and dynamic contrast enhancement series. Watson Elementary™ utilizes signal intensity, diffusion properties and kinetic profile to compute a proportional Gleason grade predictor, termed Malignancy Attention Index (MAI). The analysis focused on (i) the CAD sensitivity and specificity to classify suspect lesions and (ii) the MAI correlation with the histopathological ground truth. Results The software revealed a sensitivity of 46.80% for PCa classification. The specificity for PCa was found to be 75.43% with a positive predictive value of 61.11%, a negative predictive value of 63.23% and a false discovery rate of 38.89%. CAD classified PCa and benign lesions with equal probability (P 0.06, χ2 test). Accordingly, receiver operating characteristic analysis suggests a poor predictive value for MAI with an area under curve of 0.65 (P 0.02), which is not superior to the performance of board certified observers. Moreover, MAI revealed no significant correlation with Gleason grade (P 0.60, Pearson´s correlation). Conclusion The tested CAD software for mpMRI analysis was a weak PCa biomarker in this dataset. Targeted prostate biopsy and histology remains the gold standard for prostate cancer diagnosis. PMID:29023572

  7. Portable XRF and principal component analysis for bill characterization in forensic science.

    PubMed

    Appoloni, C R; Melquiades, F L

    2014-02-01

    Several modern techniques have been applied to prevent counterfeiting of money bills. The objective of this study was to demonstrate the potential of Portable X-ray Fluorescence (PXRF) technique and the multivariate analysis method of Principal Component Analysis (PCA) for classification of bills in order to use it in forensic science. Bills of Dollar, Euro and Real (Brazilian currency) were measured directly at different colored regions, without any previous preparation. Spectra interpretation allowed the identification of Ca, Ti, Fe, Cu, Sr, Y, Zr and Pb. PCA analysis separated the bills in three groups and subgroups among Brazilian currency. In conclusion, the samples were classified according to its origin identifying the elements responsible for differentiation and basic pigment composition. PXRF allied to multivariate discriminate methods is a promising technique for rapid and no destructive identification of false bills in forensic science. Copyright © 2013 Elsevier Ltd. All rights reserved.

  8. Towards the identification of plant and animal binders on Australian stone knives.

    PubMed

    Blee, Alisa J; Walshe, Keryn; Pring, Allan; Quinton, Jamie S; Lenehan, Claire E

    2010-07-15

    There is limited information regarding the nature of plant and animal residues used as adhesives, fixatives and pigments found on Australian Aboriginal artefacts. This paper reports the use of FTIR in combination with the chemometric tools principal component analysis (PCA) and hierarchical clustering (HC) for the analysis and identification of Australian plant and animal fixatives on Australian stone artefacts. Ten different plant and animal residues were able to be discriminated from each other at a species level by combining FTIR spectroscopy with the chemometric data analysis methods, principal component analysis (PCA) and hierarchical clustering (HC). Application of this method to residues from three broken stone knives from the collections of the South Australian Museum indicated that two of the handles of knives were likely to have contained beeswax as the fixative whilst Spinifex resin was the probable binder on the third. Copyright 2010 Elsevier B.V. All rights reserved.

  9. A case-control study of lower urinary-tract infections, associated antibiotics and the risk of developing prostate cancer using PCBaSe 3.0

    PubMed Central

    Garmo, Hans; Beckmann, Kerri; Stattin, Pär; Adolfsson, Jan; Van Hemelrijck, Mieke

    2018-01-01

    Objectives To investigate the association between lower urinary-tract infections, their associated antibiotics and the subsequent risk of developing PCa. Subjects/Patients (or materials) and methods Using data from the Swedish PCBaSe 3.0, we performed a matched case-control study (8762 cases and 43806 controls). Conditional logistic regression analysis was used to assess the association between lower urinary-tract infections, related antibiotics and PCa, whilst adjusting for civil status, education, Charlson Comorbidity Index and time between lower urinary-tract infection and PCa diagnosis. Results It was found that lower urinary-tract infections did not affect PCa risk, however, having a lower urinary-tract infection or a first antibiotic prescription 6–12 months before PCa were both associated with an increased risk of PCa (OR: 1.50, 95% CI: 1.23–1.82 and 1.96, 1.71–2.25, respectively), as compared to men without lower urinary-tract infections. Compared to men with no prescriptions for antibiotics, men who were prescribed ≥10 antibiotics, were 15% less likely to develop PCa (OR: 0.85, 95% CI: 0.78–0.91). Conclusion PCa was not found to be associated with diagnosis of a urinary-tract infection or frequency, but was positively associated with short time since diagnoses of lower urinary-tract infection or receiving prescriptions for antibiotics. These observations can likely be explained by detection bias, which highlights the importance of data on the diagnostic work-up when studying potential risk factors for PCa. PMID:29649268

  10. The language profile of Posterior Cortical Atrophy

    PubMed Central

    Crutch, Sebastian J.; Lehmann, Manja; Warren, Jason D.; Rohrer, Jonathan D.

    2015-01-01

    Background Posterior Cortical Atrophy (PCA) is typically considered to be a visual syndrome, primarily characterised by progressive impairment of visuoperceptual and visuospatial skills. However patients commonly describe early difficulties with word retrieval. This paper details the first systematic analysis of linguistic function in PCA. Characterising and quantifying the aphasia associated with PCA is important for clarifying diagnostic and selection criteria for clinical and research studies. Methods Fifteen patients with PCA, 7 patients with logopenic/phonological aphasia (LPA) and 18 age-matched healthy participants completed a detailed battery of linguistic tests evaluating auditory input processing, repetition and working memory, lexical and grammatical comprehension, single word retrieval and fluency, and spontaneous speech. Results Relative to healthy controls, PCA patients exhibited language impairments across all the domains examined, but with anomia, reduced phonemic fluency and slowed speech rate the most prominent deficits. PCA performance most closely resembled that of LPA patients on tests of auditory input processing, repetition and digit span, but was relatively stronger on tasks of comprehension and spontaneous speech. Conclusions The study demonstrates that in addition to the well-reported degradation of vision, literacy and numeracy, PCA is characterised by a progressive oral language dysfunction with prominent word retrieval difficulties. Overlap in the linguistic profiles of PCA and LPA, which are both most commonly caused by Alzheimer’s disease, further emphasises the notion of a phenotypic continuum between typical and atypical manifestations of the disease. Clarifying the boundaries between AD phenotypes has important implications for diagnosis, clinical trial recruitment and investigations into biological factors driving phenotypic heterogeneity in AD. Rehabilitation strategies to ameliorate the phonological deficit in PCA are required. PMID:23138762

  11. Prostate extracellular vesicles in patient plasma as a liquid biopsy platform for prostate cancer using nanoscale flow cytometry

    PubMed Central

    Al-Zahrani, Ali A.; Pardhan, Siddika; Brett, Sabine I.; Guo, Qiu Q.; Yang, Jun; Wolf, Philipp; Power, Nicholas E.; Durfee, Paul N.; MacMillan, Connor D.; Townson, Jason L.; Brinker, Jeffrey C.; Fleshner, Neil E.; Izawa, Jonathan I.; Chambers, Ann F.; Chin, Joseph L.; Leong, Hon S.

    2016-01-01

    Background Extracellular vesicles released by prostate cancer present in seminal fluid, urine, and blood may represent a non-invasive means to identify and prioritize patients with intermediate risk and high risk of prostate cancer. We hypothesize that enumeration of circulating prostate microparticles (PMPs), a type of extracellular vesicle (EV), can identify patients with Gleason Score≥4+4 prostate cancer (PCa) in a manner independent of PSA. Patients and Methods Plasmas from healthy volunteers, benign prostatic hyperplasia patients, and PCa patients with various Gleason score patterns were analyzed for PMPs. We used nanoscale flow cytometry to enumerate PMPs which were defined as submicron events (100-1000nm) immunoreactive to anti-PSMA mAb when compared to isotype control labeled samples. Levels of PMPs (counts/μL of plasma) were also compared to CellSearch CTC Subclasses in various PCa metastatic disease subtypes (treatment naïve, castration resistant prostate cancer) and in serially collected plasma sets from patients undergoing radical prostatectomy. Results PMP levels in plasma as enumerated by nanoscale flow cytometry are effective in distinguishing PCa patients with Gleason Score≥8 disease, a high-risk prognostic factor, from patients with Gleason Score≤7 PCa, which carries an intermediate risk of PCa recurrence. PMP levels were independent of PSA and significantly decreased after surgical resection of the prostate, demonstrating its prognostic potential for clinical follow-up. CTC subclasses did not decrease after prostatectomy and were not effective in distinguishing localized PCa patients from metastatic PCa patients. Conclusions PMP enumeration was able to identify patients with Gleason Score ≥8 PCa but not patients with Gleason Score 4+3 PCa, but offers greater confidence than CTC counts in identifying patients with metastatic prostate cancer. CTC Subclass analysis was also not effective for post-prostatectomy follow up and for distinguishing metastatic PCa and localized PCa patients. Nanoscale flow cytometry of PMPs presents an emerging biomarker platform for various stages of prostate cancer. PMID:26814433

  12. PCA3 noncoding RNA is involved in the control of prostate-cancer cell survival and modulates androgen receptor signaling

    PubMed Central

    2012-01-01

    Background PCA3 is a non-coding RNA (ncRNA) that is highly expressed in prostate cancer (PCa) cells, but its functional role is unknown. To investigate its putative function in PCa biology, we used gene expression knockdown by small interference RNA, and also analyzed its involvement in androgen receptor (AR) signaling. Methods LNCaP and PC3 cells were used as in vitro models for these functional assays, and three different siRNA sequences were specifically designed to target PCA3 exon 4. Transfected cells were analyzed by real-time qRT-PCR and cell growth, viability, and apoptosis assays. Associations between PCA3 and the androgen-receptor (AR) signaling pathway were investigated by treating LNCaP cells with 100 nM dihydrotestosterone (DHT) and with its antagonist (flutamide), and analyzing the expression of some AR-modulated genes (TMPRSS2, NDRG1, GREB1, PSA, AR, FGF8, CdK1, CdK2 and PMEPA1). PCA3 expression levels were investigated in different cell compartments by using differential centrifugation and qRT-PCR. Results LNCaP siPCA3-transfected cells significantly inhibited cell growth and viability, and increased the proportion of cells in the sub G0/G1 phase of the cell cycle and the percentage of pyknotic nuclei, compared to those transfected with scramble siRNA (siSCr)-transfected cells. DHT-treated LNCaP cells induced a significant upregulation of PCA3 expression, which was reversed by flutamide. In siPCA3/LNCaP-transfected cells, the expression of AR target genes was downregulated compared to siSCr-transfected cells. The siPCA3 transfection also counteracted DHT stimulatory effects on the AR signaling cascade, significantly downregulating expression of the AR target gene. Analysis of PCA3 expression in different cell compartments provided evidence that the main functional roles of PCA3 occur in the nuclei and microsomal cell fractions. Conclusions Our findings suggest that the ncRNA PCA3 is involved in the control of PCa cell survival, in part through modulating AR signaling, which may raise new possibilities of using PCA3 knockdown as an additional therapeutic strategy for PCa control. PMID:23130941

  13. Variability search in M 31 using principal component analysis and the Hubble Source Catalogue

    NASA Astrophysics Data System (ADS)

    Moretti, M. I.; Hatzidimitriou, D.; Karampelas, A.; Sokolovsky, K. V.; Bonanos, A. Z.; Gavras, P.; Yang, M.

    2018-06-01

    Principal component analysis (PCA) is being extensively used in Astronomy but not yet exhaustively exploited for variability search. The aim of this work is to investigate the effectiveness of using the PCA as a method to search for variable stars in large photometric data sets. We apply PCA to variability indices computed for light curves of 18 152 stars in three fields in M 31 extracted from the Hubble Source Catalogue. The projection of the data into the principal components is used as a stellar variability detection and classification tool, capable of distinguishing between RR Lyrae stars, long-period variables (LPVs) and non-variables. This projection recovered more than 90 per cent of the known variables and revealed 38 previously unknown variable stars (about 30 per cent more), all LPVs except for one object of uncertain variability type. We conclude that this methodology can indeed successfully identify candidate variable stars.

  14. Demixed principal component analysis of neural population data.

    PubMed

    Kobak, Dmitry; Brendel, Wieland; Constantinidis, Christos; Feierstein, Claudia E; Kepecs, Adam; Mainen, Zachary F; Qi, Xue-Lian; Romo, Ranulfo; Uchida, Naoshige; Machens, Christian K

    2016-04-12

    Neurons in higher cortical areas, such as the prefrontal cortex, are often tuned to a variety of sensory and motor variables, and are therefore said to display mixed selectivity. This complexity of single neuron responses can obscure what information these areas represent and how it is represented. Here we demonstrate the advantages of a new dimensionality reduction technique, demixed principal component analysis (dPCA), that decomposes population activity into a few components. In addition to systematically capturing the majority of the variance of the data, dPCA also exposes the dependence of the neural representation on task parameters such as stimuli, decisions, or rewards. To illustrate our method we reanalyze population data from four datasets comprising different species, different cortical areas and different experimental tasks. In each case, dPCA provides a concise way of visualizing the data that summarizes the task-dependent features of the population response in a single figure.

  15. Study of recognizing multiple persons' complicated hand gestures from the video sequence acquired by a moving camera

    NASA Astrophysics Data System (ADS)

    Dan, Luo; Ohya, Jun

    2010-02-01

    Recognizing hand gestures from the video sequence acquired by a dynamic camera could be a useful interface between humans and mobile robots. We develop a state based approach to extract and recognize hand gestures from moving camera images. We improved Human-Following Local Coordinate (HFLC) System, a very simple and stable method for extracting hand motion trajectories, which is obtained from the located human face, body part and hand blob changing factor. Condensation algorithm and PCA-based algorithm was performed to recognize extracted hand trajectories. In last research, this Condensation Algorithm based method only applied for one person's hand gestures. In this paper, we propose a principal component analysis (PCA) based approach to improve the recognition accuracy. For further improvement, temporal changes in the observed hand area changing factor are utilized as new image features to be stored in the database after being analyzed by PCA. Every hand gesture trajectory in the database is classified into either one hand gesture categories, two hand gesture categories, or temporal changes in hand blob changes. We demonstrate the effectiveness of the proposed method by conducting experiments on 45 kinds of sign language based Japanese and American Sign Language gestures obtained from 5 people. Our experimental recognition results show better performance is obtained by PCA based approach than the Condensation algorithm based method.

  16. Inquiring the Most Critical Teacher's Technology Education Competences in the Highest Efficient Technology Education Learning Organization

    ERIC Educational Resources Information Center

    Yung-Kuan, Chan; Hsieh, Ming-Yuan; Lee, Chin-Feng; Huang, Chih-Cheng; Ho, Li-Chih

    2017-01-01

    Under the hyper-dynamic education situation, this research, in order to comprehensively explore the interplays between Teacher Competence Demands (TCD) and Learning Organization Requests (LOR), cross-employs the data refined method of Descriptive Statistics (DS) method and Analysis of Variance (ANOVA) and Principal Components Analysis (PCA)…

  17. PTEN genomic deletion predicts prostate cancer recurrence and is associated with low AR expression and transcriptional activity

    PubMed Central

    2012-01-01

    Background Prostate cancer (PCa), a leading cause of cancer death in North American men, displays a broad range of clinical outcome from relatively indolent to lethal metastatic disease. Several genomic alterations have been identified in PCa which may serve as predictors of progression. PTEN, (10q23.3), is a negative regulator of the phosphatidylinositol 3-kinase (PIK3)/AKT survival pathway and a tumor suppressor frequently deleted in PCa. The androgen receptor (AR) signalling pathway is known to play an important role in PCa and its blockade constitutes a commonly used treatment modality. In this study, we assessed the deletion status of PTEN along with AR expression levels in 43 primary PCa specimens with clinical follow-up. Methods Fluorescence In Situ Hybridization (FISH) was done on formalin fixed paraffin embedded (FFPE) PCa samples to examine the deletion status of PTEN. AR expression levels were determined using immunohistochemistry (IHC). Results Using FISH, we found 18 cases of PTEN deletion. Kaplan-Meier analysis showed an association with disease recurrence (P=0.03). Concurrently, IHC staining for AR found significantly lower levels of AR expression within those tumors deleted for PTEN (P<0.05). To validate these observations we interrogated a copy number alteration and gene expression profiling dataset of 64 PCa samples, 17 of which were PTEN deleted. We confirmed the predictive value of PTEN deletion in disease recurrence (P=0.03). PTEN deletion was also linked to diminished expression of PTEN (P<0.01) and AR (P=0.02). Furthermore, gene set enrichment analysis revealed a diminished expression of genes downstream of AR signalling in PTEN deleted tumors. Conclusions Altogether, our data suggest that PTEN deleted tumors expressing low levels of AR may represent a worse prognostic subset of PCa establishing a challenge for therapeutic management. PMID:23171135

  18. Understanding deformation mechanisms during powder compaction using principal component analysis of compression data.

    PubMed

    Roopwani, Rahul; Buckner, Ira S

    2011-10-14

    Principal component analysis (PCA) was applied to pharmaceutical powder compaction. A solid fraction parameter (SF(c/d)) and a mechanical work parameter (W(c/d)) representing irreversible compression behavior were determined as functions of applied load. Multivariate analysis of the compression data was carried out using PCA. The first principal component (PC1) showed loadings for the solid fraction and work values that agreed with changes in the relative significance of plastic deformation to consolidation at different pressures. The PC1 scores showed the same rank order as the relative plasticity ranking derived from the literature for common pharmaceutical materials. The utility of PC1 in understanding deformation was extended to binary mixtures using a subset of the original materials. Combinations of brittle and plastic materials were characterized using the PCA method. The relationships between PC1 scores and the weight fractions of the mixtures were typically linear showing ideal mixing in their deformation behaviors. The mixture consisting of two plastic materials was the only combination to show a consistent positive deviation from ideality. The application of PCA to solid fraction and mechanical work data appears to be an effective means of predicting deformation behavior during compaction of simple powder mixtures. Copyright © 2011 Elsevier B.V. All rights reserved.

  19. On a PCA-based lung motion model

    PubMed Central

    Li, Ruijiang; Lewis, John H; Jia, Xun; Zhao, Tianyu; Liu, Weifeng; Wuenschel, Sara; Lamb, James; Yang, Deshan; Low, Daniel A; Jiang, Steve B

    2014-01-01

    Respiration-induced organ motion is one of the major uncertainties in lung cancer radiotherapy and is crucial to be able to accurately model the lung motion. Most work so far has focused on the study of the motion of a single point (usually the tumor center of mass), and much less work has been done to model the motion of the entire lung. Inspired by the work of Zhang et al (2007 Med. Phys. 34 4772–81), we believe that the spatiotemporal relationship of the entire lung motion can be accurately modeled based on principle component analysis (PCA) and then a sparse subset of the entire lung, such as an implanted marker, can be used to drive the motion of the entire lung (including the tumor). The goal of this work is twofold. First, we aim to understand the underlying reason why PCA is effective for modeling lung motion and find the optimal number of PCA coefficients for accurate lung motion modeling. We attempt to address the above important problems both in a theoretical framework and in the context of real clinical data. Second, we propose a new method to derive the entire lung motion using a single internal marker based on the PCA model. The main results of this work are as follows. We derived an important property which reveals the implicit regularization imposed by the PCA model. We then studied the model using two mathematical respiratory phantoms and 11 clinical 4DCT scans for eight lung cancer patients. For the mathematical phantoms with cosine and an even power (2n) of cosine motion, we proved that 2 and 2n PCA coefficients and eigenvectors will completely represent the lung motion, respectively. Moreover, for the cosine phantom, we derived the equivalence conditions for the PCA motion model and the physiological 5D lung motion model (Low et al 2005 Int. J. Radiat. Oncol. Biol. Phys. 63 921–9). For the clinical 4DCT data, we demonstrated the modeling power and generalization performance of the PCA model. The average 3D modeling error using PCA was within 1 mm (0.7 ± 0.1 mm). When a single artificial internal marker was used to derive the lung motion, the average 3D error was found to be within 2 mm (1.8 ± 0.3 mm) through comprehensive statistical analysis. The optimal number of PCA coefficients needs to be determined on a patient-by-patient basis and two PCA coefficients seem to be sufficient for accurate modeling of the lung motion for most patients. In conclusion, we have presented thorough theoretical analysis and clinical validation of the PCA lung motion model. The feasibility of deriving the entire lung motion using a single marker has also been demonstrated on clinical data using a simulation approach. PMID:21865624

  20. A novel principal component analysis for spatially misaligned multivariate air pollution data.

    PubMed

    Jandarov, Roman A; Sheppard, Lianne A; Sampson, Paul D; Szpiro, Adam A

    2017-01-01

    We propose novel methods for predictive (sparse) PCA with spatially misaligned data. These methods identify principal component loading vectors that explain as much variability in the observed data as possible, while also ensuring the corresponding principal component scores can be predicted accurately by means of spatial statistics at locations where air pollution measurements are not available. This will make it possible to identify important mixtures of air pollutants and to quantify their health effects in cohort studies, where currently available methods cannot be used. We demonstrate the utility of predictive (sparse) PCA in simulated data and apply the approach to annual averages of particulate matter speciation data from national Environmental Protection Agency (EPA) regulatory monitors.

  1. The role of CD147 expression in prostate cancer: a systematic review and meta-analysis

    PubMed Central

    Ye, Yun; Li, Su-Liang; Wang, Yao; Yao, Yang; Wang, Juan; Ma, Yue-Yun; Hao, Xiao-Ke

    2016-01-01

    Background There are a number of studies which show that expression of CD147 is increased significantly in prostate cancer (PCa). However, conflicting conclusions have also been reported by other researchers lately. In order to arrive at a clear conclusion, a meta-analysis of eligible studies was conducted. Materials and methods We searched PubMed, MEDLINE, Cochrane Library, and the China National Knowledge Infrastructure databases to identify all the published case–control studies on the relationship between the expression of CD147 and PCa until February 2016. In the end, a total of 930 patients in eight studies were included in the meta-analysis. Results CD147 expression in the PCa patients increased significantly (odds ratio [OR], 4.65; 95% confidence interval [CI], 3.52–6.14; Z=10.79; P<0.05), but there was obvious heterogeneity between studies (I2=92.9%, P<0.05). Subgroup analysis showed that positive expression of CD147 was associated with PCa among the Asian population (OR, 21.01; 95% CI, 12.88–34.28; Z=12.19; P<0.05). Furthermore, it was significantly related to TNM stage (OR, 0.24; 95% CI, 0.17–0.35; Z=7.74; P<0.05), Gleason score (OR, 0.41; 95% CI, 0.31–0.56; Z=5.62; P<0.05), differentiation grade (OR, 0.27; 95% CI, 0.13–0.56; Z=3.47; P<0.05), and pretreatment serum prostate-specific antigen level (OR, 0.07; 95% CI, 0.03–0.16; Z=6.47; P<0.05). Conclusion Positive expression of CD147 was related to PCa, significant heterogeneity was not found between Asian studies, and the result became more significant. The positive expression of CD147 was significantly related to the clinicopathological characteristics of PCa. This suggests that CD147 plays an essential role in poor prognosis and recurrence prediction. PMID:27536064

  2. Does adding ketamine to morphine patient-controlled analgesia safely improve post-thoracotomy pain?

    PubMed

    Mathews, Timothy J; Churchhouse, Antonia M D; Housden, Tessa; Dunning, Joel

    2012-02-01

    A best evidence topic in thoracic surgery was written according to a structured protocol. The question addressed was 'is the addition of ketamine to morphine patient-controlled analgesia (PCA) following thoracic surgery superior to morphine alone'. Altogether 201 papers were found using the reported search, of which nine represented the best evidence to answer the clinical question. The authors, journal, date and country of publication, patient group studied, study type, relevant outcomes and results of these papers are tabulated. This consisted of one systematic review of PCA morphine with ketamine (PCA-MK) trials, one meta-analysis of PCA-MK trials, four randomized controlled trials of PCA-MK, one meta-analysis of trials using a variety of peri-operative ketamine regimes and two cohort studies of PCA-MK. Main outcomes measured included pain score rated on visual analogue scale, morphine consumption and incidence of psychotomimetic side effects/hallucination. Two papers reported the measurements of respiratory function. This evidence shows that adding ketamine to morphine PCA is safe, with a reported incidence of hallucination requiring intervention of 2.9%, and a meta-analysis finding an incidence of all central nervous system side effects of 18% compared with 15% with morphine alone, P = 0.31, RR 1.27 with 95% CI (0.8-2.01). All randomized controlled trials of its use following thoracic surgery found no hallucination or psychological side effect. All five studies in thoracic surgery (n = 243) found reduced morphine requirements with PCA-MK. Pain scores were significantly lower in PCA-MK patients in thoracic surgery papers, with one paper additionally reporting increased patient satisfaction. However, no significant improvement was found in a meta-analysis of five papers studying PCA-MK in a variety of surgical settings. Both papers reporting respiratory outcomes found improved oxygen saturations and PaCO(2) levels in PCA-MK patients following thoracic surgery. We conclude that adding low-dose ketamine to morphine PCA is safe and post-thoracotomy may provide better pain control than PCA with morphine alone (PCA-MO), with reduced morphine consumption and possible improvement in respiratory function. These studies thus support the routine use of PCA-MK instead of PCA-MO to improve post-thoracotomy pain control.

  3. A multifaceted independent performance analysis of facial subspace recognition algorithms.

    PubMed

    Bajwa, Usama Ijaz; Taj, Imtiaz Ahmad; Anwar, Muhammad Waqas; Wang, Xuan

    2013-01-01

    Face recognition has emerged as the fastest growing biometric technology and has expanded a lot in the last few years. Many new algorithms and commercial systems have been proposed and developed. Most of them use Principal Component Analysis (PCA) as a base for their techniques. Different and even conflicting results have been reported by researchers comparing these algorithms. The purpose of this study is to have an independent comparative analysis considering both performance and computational complexity of six appearance based face recognition algorithms namely PCA, 2DPCA, A2DPCA, (2D)(2)PCA, LPP and 2DLPP under equal working conditions. This study was motivated due to the lack of unbiased comprehensive comparative analysis of some recent subspace methods with diverse distance metric combinations. For comparison with other studies, FERET, ORL and YALE databases have been used with evaluation criteria as of FERET evaluations which closely simulate real life scenarios. A comparison of results with previous studies is performed and anomalies are reported. An important contribution of this study is that it presents the suitable performance conditions for each of the algorithms under consideration.

  4. Soy Consumption and the Risk of Prostate Cancer: An Updated Systematic Review and Meta-Analysis

    PubMed Central

    Ranard, Katherine M.; Jeon, Sookyoung; Erdman, John W.

    2018-01-01

    Prostate cancer (PCa) is the second most commonly diagnosed cancer in men, accounting for 15% of all cancers in men worldwide. Asian populations consume soy foods as part of a regular diet, which may contribute to the lower PCa incidence observed in these countries. This meta-analysis provides a comprehensive updated analysis that builds on previously published meta-analyses, demonstrating that soy foods and their isoflavones (genistein and daidzein) are associated with a lower risk of prostate carcinogenesis. Thirty articles were included for analysis of the potential impacts of soy food intake, isoflavone intake, and circulating isoflavone levels, on both primary and advanced PCa. Total soy food (p < 0.001), genistein (p = 0.008), daidzein (p = 0.018), and unfermented soy food (p < 0.001) intakes were significantly associated with a reduced risk of PCa. Fermented soy food intake, total isoflavone intake, and circulating isoflavones were not associated with PCa risk. Neither soy food intake nor circulating isoflavones were associated with advanced PCa risk, although very few studies currently exist to examine potential associations. Combined, this evidence from observational studies shows a statistically significant association between soy consumption and decreased PCa risk. Further studies are required to support soy consumption as a prophylactic dietary approach to reduce PCa carcinogenesis. PMID:29300347

  5. Strategies for reducing large fMRI data sets for independent component analysis.

    PubMed

    Wang, Ze; Wang, Jiongjiong; Calhoun, Vince; Rao, Hengyi; Detre, John A; Childress, Anna R

    2006-06-01

    In independent component analysis (ICA), principal component analysis (PCA) is generally used to reduce the raw data to a few principal components (PCs) through eigenvector decomposition (EVD) on the data covariance matrix. Although this works for spatial ICA (sICA) on moderately sized fMRI data, it is intractable for temporal ICA (tICA), since typical fMRI data have a high spatial dimension, resulting in an unmanageable data covariance matrix. To solve this problem, two practical data reduction methods are presented in this paper. The first solution is to calculate the PCs of tICA from the PCs of sICA. This approach works well for moderately sized fMRI data; however, it is highly computationally intensive, even intractable, when the number of scans increases. The second solution proposed is to perform PCA decomposition via a cascade recursive least squared (CRLS) network, which provides a uniform data reduction solution for both sICA and tICA. Without the need to calculate the covariance matrix, CRLS extracts PCs directly from the raw data, and the PC extraction can be terminated after computing an arbitrary number of PCs without the need to estimate the whole set of PCs. Moreover, when the whole data set becomes too large to be loaded into the machine memory, CRLS-PCA can save data retrieval time by reading the data once, while the conventional PCA requires numerous data retrieval steps for both covariance matrix calculation and PC extractions. Real fMRI data were used to evaluate the PC extraction precision, computational expense, and memory usage of the presented methods.

  6. Prostate health index (phi) and prostate cancer antigen 3 (PCA3) significantly improve diagnostic accuracy in patients undergoing prostate biopsy.

    PubMed

    Perdonà, Sisto; Bruzzese, Dario; Ferro, Matteo; Autorino, Riccardo; Marino, Ada; Mazzarella, Claudia; Perruolo, Giuseppe; Longo, Michele; Spinelli, Rosa; Di Lorenzo, Giuseppe; Oliva, Andrea; De Sio, Marco; Damiano, Rocco; Altieri, Vincenzo; Terracciano, Daniela

    2013-02-15

    Prostate health index (phi) and prostate cancer antigen 3 (PCA3) have been recently proposed as novel biomarkers for prostate cancer (PCa). We assessed the diagnostic performance of these biomarkers, alone or in combination, in men undergoing first prostate biopsy for suspicion of PCa. One hundred sixty male subjects were enrolled in this prospective observational study. PSA molecular forms, phi index (Beckman coulter immunoassay), PCA3 score (Progensa PCA3 assay), and other established biomarkers (tPSA, fPSA, and %fPSA) were assessed before patients underwent a 18-core first prostate biopsy. The discriminating ability between PCa-negative and PCa-positive biopsies of Beckman coulter phi and PCA3 score and other used biomarkers were determined. One hundred sixty patients met inclusion criteria. %p2PSA (p2PSA/fPSA × 100), phi and PCA3 were significantly higher in patients with PCa compared to PCa-negative group (median values: 1.92 vs. 1.55, 49.97 vs. 36.84, and 50 vs. 32, respectively, P ≤ 0.001). ROC curve analysis showed that %p2PSA, phi, and PCA3 are good indicator of malignancy (AUCs = 0.68, 0.71, and 0.66, respectively). A multivariable logistic regression model consisting of both the phi index and PCA3 score allowed to reach an overall diagnostic accuracy of 0.77. Decision curve analysis revealed that this "combined" marker achieved the highest net benefit over the examined range of the threshold probability. phi and PCA3 showed no significant difference in the ability to predict PCa diagnosis in men undergoing first prostate biopsy. However, diagnostic performance is significantly improved by combining phi and PCA3. Copyright © 2012 Wiley Periodicals, Inc.

  7. An improved geographically weighted regression model for PM2.5 concentration estimation in large areas

    NASA Astrophysics Data System (ADS)

    Zhai, Liang; Li, Shuang; Zou, Bin; Sang, Huiyong; Fang, Xin; Xu, Shan

    2018-05-01

    Considering the spatial non-stationary contributions of environment variables to PM2.5 variations, the geographically weighted regression (GWR) modeling method has been using to estimate PM2.5 concentrations widely. However, most of the GWR models in reported studies so far were established based on the screened predictors through pretreatment correlation analysis, and this process might cause the omissions of factors really driving PM2.5 variations. This study therefore developed a best subsets regression (BSR) enhanced principal component analysis-GWR (PCA-GWR) modeling approach to estimate PM2.5 concentration by fully considering all the potential variables' contributions simultaneously. The performance comparison experiment between PCA-GWR and regular GWR was conducted in the Beijing-Tianjin-Hebei (BTH) region over a one-year-period. Results indicated that the PCA-GWR modeling outperforms the regular GWR modeling with obvious higher model fitting- and cross-validation based adjusted R2 and lower RMSE. Meanwhile, the distribution map of PM2.5 concentration from PCA-GWR modeling also clearly depicts more spatial variation details in contrast to the one from regular GWR modeling. It can be concluded that the BSR enhanced PCA-GWR modeling could be a reliable way for effective air pollution concentration estimation in the coming future by involving all the potential predictor variables' contributions to PM2.5 variations.

  8. A propensity score approach to correction for bias due to population stratification using genetic and non-genetic factors.

    PubMed

    Zhao, Huaqing; Rebbeck, Timothy R; Mitra, Nandita

    2009-12-01

    Confounding due to population stratification (PS) arises when differences in both allele and disease frequencies exist in a population of mixed racial/ethnic subpopulations. Genomic control, structured association, principal components analysis (PCA), and multidimensional scaling (MDS) approaches have been proposed to address this bias using genetic markers. However, confounding due to PS can also be due to non-genetic factors. Propensity scores are widely used to address confounding in observational studies but have not been adapted to deal with PS in genetic association studies. We propose a genomic propensity score (GPS) approach to correct for bias due to PS that considers both genetic and non-genetic factors. We compare the GPS method with PCA and MDS using simulation studies. Our results show that GPS can adequately adjust and consistently correct for bias due to PS. Under no/mild, moderate, and severe PS, GPS yielded estimated with bias close to 0 (mean=-0.0044, standard error=0.0087). Under moderate or severe PS, the GPS method consistently outperforms the PCA method in terms of bias, coverage probability (CP), and type I error. Under moderate PS, the GPS method consistently outperforms the MDS method in terms of CP. PCA maintains relatively high power compared to both MDS and GPS methods under the simulated situations. GPS and MDS are comparable in terms of statistical properties such as bias, type I error, and power. The GPS method provides a novel and robust tool for obtaining less-biased estimates of genetic associations that can consider both genetic and non-genetic factors. 2009 Wiley-Liss, Inc.

  9. Epidural Analgesia Versus Patient-Controlled Analgesia for Pain Relief in Uterine Artery Embolization for Uterine Fibroids: A Decision Analysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kooij, Sanne M. van der, E-mail: s.m.vanderkooij@amc.uva.nl; Moolenaar, Lobke M.; Ankum, Willem M.

    Purpose: This study was designed to compare the costs and effects of epidural analgesia (EDA) to those of patient-controlled intravenous analgesia (PCA) for postintervention pain relief in women having uterine artery embolization (UAE) for systematic uterine fibroids. Methods: Cost-effectiveness analysis (CEA) based on data from the literature by constructing a decision tree to model the clinical pathways for estimating the effects and costs of treatment with EDA and PCA. Literature on EDA for pain-relief after UAE was missing, and therefore, data on EDA for abdominal surgery were used. Outcome measures were compared costs to reduce one point in visual analoguemore » score (VAS) or numeric rating scale (NRS) for pain 6 and 24 h after UAE and risk for complications. Results: Six hours after the intervention, the VAS was 3.56 when using PCA and 2.0 when using EDA. The costs for pain relief in women undergoing UAE with PCA and EDA were Euro-Sign 191 and Euro-Sign 355, respectively. The costs for EDA to reduce the VAS score 6 h after the intervention with one point compared with PCA were Euro-Sign 105 and Euro-Sign 179 after 24 h. The risk of having a complication was 2.45 times higher when using EDA. Conclusions: The results of this indirect comparison of EDA for abdominal surgery with PCA for UAE show that EDA would provide superior analgesia for post UAE pain at 6 and 24 h but with higher costs and an increased risk of complications.« less

  10. A graph-Laplacian-based feature extraction algorithm for neural spike sorting.

    PubMed

    Ghanbari, Yasser; Spence, Larry; Papamichalis, Panos

    2009-01-01

    Analysis of extracellular neural spike recordings is highly dependent upon the accuracy of neural waveform classification, commonly referred to as spike sorting. Feature extraction is an important stage of this process because it can limit the quality of clustering which is performed in the feature space. This paper proposes a new feature extraction method (which we call Graph Laplacian Features, GLF) based on minimizing the graph Laplacian and maximizing the weighted variance. The algorithm is compared with Principal Components Analysis (PCA, the most commonly-used feature extraction method) using simulated neural data. The results show that the proposed algorithm produces more compact and well-separated clusters compared to PCA. As an added benefit, tentative cluster centers are output which can be used to initialize a subsequent clustering stage.

  11. Evaluation of redundancy analysis to identify signatures of local adaptation.

    PubMed

    Capblancq, Thibaut; Luu, Keurcien; Blum, Michael G B; Bazin, Eric

    2018-05-26

    Ordination is a common tool in ecology that aims at representing complex biological information in a reduced space. In landscape genetics, ordination methods such as principal component analysis (PCA) have been used to detect adaptive variation based on genomic data. Taking advantage of environmental data in addition to genotype data, redundancy analysis (RDA) is another ordination approach that is useful to detect adaptive variation. This paper aims at proposing a test statistic based on RDA to search for loci under selection. We compare redundancy analysis to pcadapt, which is a nonconstrained ordination method, and to a latent factor mixed model (LFMM), which is a univariate genotype-environment association method. Individual-based simulations identify evolutionary scenarios where RDA genome scans have a greater statistical power than genome scans based on PCA. By constraining the analysis with environmental variables, RDA performs better than PCA in identifying adaptive variation when selection gradients are weakly correlated with population structure. Additionally, we show that if RDA and LFMM have a similar power to identify genetic markers associated with environmental variables, the RDA-based procedure has the advantage to identify the main selective gradients as a combination of environmental variables. To give a concrete illustration of RDA in population genomics, we apply this method to the detection of outliers and selective gradients on an SNP data set of Populus trichocarpa (Geraldes et al., 2013). The RDA-based approach identifies the main selective gradient contrasting southern and coastal populations to northern and continental populations in the northwestern American coast. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.

  12. Multivariate qualitative analysis of banned additives in food safety using surface enhanced Raman scattering spectroscopy.

    PubMed

    He, Shixuan; Xie, Wanyi; Zhang, Wei; Zhang, Liqun; Wang, Yunxia; Liu, Xiaoling; Liu, Yulong; Du, Chunlei

    2015-02-25

    A novel strategy which combines iteratively cubic spline fitting baseline correction method with discriminant partial least squares qualitative analysis is employed to analyze the surface enhanced Raman scattering (SERS) spectroscopy of banned food additives, such as Sudan I dye and Rhodamine B in food, Malachite green residues in aquaculture fish. Multivariate qualitative analysis methods, using the combination of spectra preprocessing iteratively cubic spline fitting (ICSF) baseline correction with principal component analysis (PCA) and discriminant partial least squares (DPLS) classification respectively, are applied to investigate the effectiveness of SERS spectroscopy for predicting the class assignments of unknown banned food additives. PCA cannot be used to predict the class assignments of unknown samples. However, the DPLS classification can discriminate the class assignment of unknown banned additives using the information of differences in relative intensities. The results demonstrate that SERS spectroscopy combined with ICSF baseline correction method and exploratory analysis methodology DPLS classification can be potentially used for distinguishing the banned food additives in field of food safety. Copyright © 2014 Elsevier B.V. All rights reserved.

  13. [Quality evaluation of American ginseng using UPLC coupled with multivariate analysis].

    PubMed

    Tang, Yan; Yan, Shu-Mo; Wang, Jing-Jing; Yuan, Yuan; Yang, Bin

    2016-05-01

    An ultra performance liquid chromatography (UPLC)method combined with multivariate data analysis was developed to evaluate the quality of American ginseng by simultaneously determining the concentrations of six ginsenosides (Rg₁, Re, Rb₁, Rc, Ro and Rd)in the samples. For UPLC, acetonitrile with 0.01% formic acid and water with 0.01% formic acid were used as the mobile phase with gradient elution. Under the established chromatographic conditions, the six ginsenosides could be well separated and the results of linearity, stability, precision, repeatability, and recovery rate all reached the requirement of quantification analysis, respectively. The total contents of Rg₁, Re, and Rb₁ in 57 samples all reached the requirement of the 2015 edition of Chinese Pharmacopoeia. At the same time, the experimental data were analyzed by principle component analysis (PCA) and partial least squares discriminant analysis (PLS-DA). The crude drugs and the decoction pieces can be discriminated by a PCA method and the samples with different age can be distinguished by a PLS-DA method. Copyright© by the Chinese Pharmaceutical Association.

  14. Automated Classification and Analysis of Non-metallic Inclusion Data Sets

    NASA Astrophysics Data System (ADS)

    Abdulsalam, Mohammad; Zhang, Tongsheng; Tan, Jia; Webler, Bryan A.

    2018-05-01

    The aim of this study is to utilize principal component analysis (PCA), clustering methods, and correlation analysis to condense and examine large, multivariate data sets produced from automated analysis of non-metallic inclusions. Non-metallic inclusions play a major role in defining the properties of steel and their examination has been greatly aided by automated analysis in scanning electron microscopes equipped with energy dispersive X-ray spectroscopy. The methods were applied to analyze inclusions on two sets of samples: two laboratory-scale samples and four industrial samples from a near-finished 4140 alloy steel components with varying machinability. The laboratory samples had well-defined inclusions chemistries, composed of MgO-Al2O3-CaO, spinel (MgO-Al2O3), and calcium aluminate inclusions. The industrial samples contained MnS inclusions as well as (Ca,Mn)S + calcium aluminate oxide inclusions. PCA could be used to reduce inclusion chemistry variables to a 2D plot, which revealed inclusion chemistry groupings in the samples. Clustering methods were used to automatically classify inclusion chemistry measurements into groups, i.e., no user-defined rules were required.

  15. SU-F-R-41: Regularized PCA Can Model Treatment-Related Changes in Head and Neck Patients Using Daily CBCTs

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chetvertkov, M; Henry Ford Health System, Detroit, MI; Siddiqui, F

    2016-06-15

    Purpose: To use daily cone beam CTs (CBCTs) to develop regularized principal component analysis (PCA) models of anatomical changes in head and neck (H&N) patients, to guide replanning decisions in adaptive radiation therapy (ART). Methods: Known deformations were applied to planning CT (pCT) images of 10 H&N patients to model several different systematic anatomical changes. A Pinnacle plugin was used to interpolate systematic changes over 35 fractions, generating a set of 35 synthetic CTs for each patient. Deformation vector fields (DVFs) were acquired between the pCT and synthetic CTs and random fraction-to-fraction changes were superimposed on the DVFs. Standard non-regularizedmore » and regularized patient-specific PCA models were built using the DVFs. The ability of PCA to extract the known deformations was quantified. PCA models were also generated from clinical CBCTs, for which the deformations and DVFs were not known. It was hypothesized that resulting eigenvectors/eigenfunctions with largest eigenvalues represent the major anatomical deformations during the course of treatment. Results: As demonstrated with quantitative results in the supporting document regularized PCA is more successful than standard PCA at capturing systematic changes early in the treatment. Regularized PCA is able to detect smaller systematic changes against the background of random fraction-to-fraction changes. To be successful at guiding ART, regularized PCA should be coupled with models of when anatomical changes occur: early, late or throughout the treatment course. Conclusion: The leading eigenvector/eigenfunction from the both PCA approaches can tentatively be identified as a major systematic change during radiotherapy course when systematic changes are large enough with respect to random fraction-to-fraction changes. In all cases the regularized PCA approach appears to be more reliable at capturing systematic changes, enabling dosimetric consequences to be projected once trends are established early in the treatment course. This work is supported in part by a grant from Varian Medical Systems, Palo Alto, CA.« less

  16. National economic and development indicators and international variation in prostate cancer incidence and mortality: an ecological analysis.

    PubMed

    Neupane, Subas; Bray, Freddie; Auvinen, Anssi

    2017-06-01

    Macroeconomic indicators are likely associated with prostate cancer (PCa) incidence and mortality globally, but have rarely been assessed. Data on PCa incidence in 2003-2007 for 49 countries with either nationwide cancer registry or at least two regional registries were obtained from Cancer Incidence in Five Continents Vol X and national PCa mortality for 2012 from GLOBOCAN 2012. We compared PCa incidence and mortality rates with various population-level indicators of health, economy and development in 2000. Poisson and linear regression methods were used to quantify the associations. PCa incidence varied more than 15-fold, being highest in high-income countries. PCa mortality exhibited less variation, with higher rates in many low- and middle-income countries. Healthcare expenditure (rate ratio, RR 1.46, 95 % CI 1.45-1.47) and population growth (RR 1.15, 95 % CI 1.14-1.16), as well as computer and mobile phone density, were associated with a higher PCa incidence, while gross domestic product, GDP (RR 0.94, 95 % CI 0.93-0.95) and overall mortality (RR 0.72, 95 % CI 0.71-0.73) were associated with a low incidence. GDP (RR 0.55, 95 % CI 0.46-0.66) was also associated with a low PCa mortality, while life expectancy (RR 3.93, 95 % CI 3.22-4.79) and healthcare expenditure (RR 1.20, 95 % CI 1.09-1.32) were associated with an elevated mortality. Our results show that healthcare expenditure and, thus, the availability of medical resources are an important contributor to the patterns of international variation in PCa incidence. This suggests that there is an iatrogenic component in the current global epidemic of PCa. On the other hand, higher healthcare expenditure is associated with lower PCa death rates.

  17. Robust prediction of protein subcellular localization combining PCA and WSVMs.

    PubMed

    Tian, Jiang; Gu, Hong; Liu, Wenqi; Gao, Chiyang

    2011-08-01

    Automated prediction of protein subcellular localization is an important tool for genome annotation and drug discovery, and Support Vector Machines (SVMs) can effectively solve this problem in a supervised manner. However, the datasets obtained from real experiments are likely to contain outliers or noises, which can lead to poor generalization ability and classification accuracy. To explore this problem, we adopt strategies to lower the effect of outliers. First we design a method based on Weighted SVMs, different weights are assigned to different data points, so the training algorithm will learn the decision boundary according to the relative importance of the data points. Second we analyse the influence of Principal Component Analysis (PCA) on WSVM classification, propose a hybrid classifier combining merits of both PCA and WSVM. After performing dimension reduction operations on the datasets, kernel-based possibilistic c-means algorithm can generate more suitable weights for the training, as PCA transforms the data into a new coordinate system with largest variances affected greatly by the outliers. Experiments on benchmark datasets show promising results, which confirms the effectiveness of the proposed method in terms of prediction accuracy. Copyright © 2011 Elsevier Ltd. All rights reserved.

  18. Using Paradigm Case Analysis To Foster Instructor Development.

    ERIC Educational Resources Information Center

    Peregrym, Jill; And Others

    Paradigm Case Analysis (PCA) is a method of increasing instructor effectiveness through the gathering of narratives of critical teaching incidents and experiences from proficient instructors and their analysis in group discussions. Critical Incidents (CI's) may include those in which the instructor's intervention made a significant difference in…

  19. Associations of tea and coffee consumption with prostate cancer risk

    PubMed Central

    Geybels, Milan S.; Neuhouser, Marian L.; Stanford, Janet L.

    2013-01-01

    Purpose: Tea and coffee contain bioactive compounds and both beverages have recently been associated with a reduced risk of prostate cancer (PCa). Methods: We studied associations of tea and coffee consumption with PCa risk in a population-based case-control study from King County, Washington, US. Prostate cancer cases were diagnosed in 2002-2005 and matched to controls by five-year age groups. Logistic regression was used to generate odds ratios (ORs) and 95% confidence intervals (CIs). Results: Among controls, 19% and 58% consumed at least one cup per day of tea and coffee, respectively. The analysis of tea included 892 cases and 863 controls and tea consumption was associated with a reduced overall PCa risk with an adjusted OR of 0.63 (95% CI: 0.45, 0.90; P for trend = 0.02) for men in the highest compared to lowest category of tea intake (≥2 cups/day versus ≤1 cup/week). Risk estimates did not vary substantially by Gleason grade or disease stage. Coffee consumption was not associated with risk of overall PCa or PCa in subgroups defined by tumor grade or stage. Conclusions: Our results contribute further evidence that tea consumption may be a modifiable exposure that reduces PCa risk. PMID:23412806

  20. Linearized radiative transfer models for retrieval of cloud parameters from EPIC/DSCOVR measurements

    NASA Astrophysics Data System (ADS)

    Molina García, Víctor; Sasi, Sruthy; Efremenko, Dmitry S.; Doicu, Adrian; Loyola, Diego

    2018-07-01

    In this paper, we describe several linearized radiative transfer models which can be used for the retrieval of cloud parameters from EPIC (Earth Polychromatic Imaging Camera) measurements. The approaches under examination are (1) the linearized forward approach, represented in this paper by the linearized discrete ordinate and matrix operator methods with matrix exponential, and (2) the forward-adjoint approach based on the discrete ordinate method with matrix exponential. To enhance the performance of the radiative transfer computations, the correlated k-distribution method and the Principal Component Analysis (PCA) technique are used. We provide a compact description of the proposed methods, as well as a numerical analysis of their accuracy and efficiency when simulating EPIC measurements in the oxygen A-band channel at 764 nm. We found that the computation time of the forward-adjoint approach using the correlated k-distribution method in conjunction with PCA is approximately 13 s for simultaneously computing the derivatives with respect to cloud optical thickness and cloud top height.

  1. Comparison of discrete Fourier transform (DFT) and principal component analysis/DFT as forecasting tools for absorbance time series received by UV-visible probes installed in urban sewer systems.

    PubMed

    Plazas-Nossa, Leonardo; Torres, Andrés

    2014-01-01

    The objective of this work is to introduce a forecasting method for UV-Vis spectrometry time series that combines principal component analysis (PCA) and discrete Fourier transform (DFT), and to compare the results obtained with those obtained by using DFT. Three time series for three different study sites were used: (i) Salitre wastewater treatment plant (WWTP) in Bogotá; (ii) Gibraltar pumping station in Bogotá; and (iii) San Fernando WWTP in Itagüí (in the south part of Medellín). Each of these time series had an equal number of samples (1051). In general terms, the results obtained are hardly generalizable, as they seem to be highly dependent on specific water system dynamics; however, some trends can be outlined: (i) for UV range, DFT and PCA/DFT forecasting accuracy were almost the same; (ii) for visible range, the PCA/DFT forecasting procedure proposed gives systematically lower forecasting errors and variability than those obtained with the DFT procedure; and (iii) for short forecasting times the PCA/DFT procedure proposed is more suitable than the DFT procedure, according to processing times obtained.

  2. Evaluation of prostate cancer antigen 3 for detecting prostate cancer: a systematic review and meta-analysis

    NASA Astrophysics Data System (ADS)

    Cui, Yong; Cao, Wenzhou; Li, Quan; Shen, Hua; Liu, Chao; Deng, Junpeng; Xu, Jiangfeng; Shao, Qiang

    2016-05-01

    Previous studies indicate that prostate cancer antigen 3 (PCA3) is highly expressed in prostatic tumors. However, its clinical value has not been characterized. The aim of this study was to investigate the clinical value of the urine PCA3 test in the diagnosis of prostate cancer by pooling the published data. Clinical trials utilizing the urine PCA3 test for diagnosing prostate cancer were retrieved from PubMed and Embase. A total of 46 clinical trials including 12,295 subjects were included in this meta-analysis. The pooled sensitivity, specificity, positive likelihood ratio (+LR), negative likelihood ratio (-LR), diagnostic odds ratio (DOR) and area under the curve (AUC) were 0.65 (95% confidence interval [CI]: 0.63-0.66), 0.73 (95% CI: 0.72-0.74), 2.23 (95% CI: 1.91-2.62), 0.48 (95% CI: 0.44-0.52), 5.31 (95% CI: 4.19-6.73) and 0.75 (95% CI: 0.74-0.77), respectively. In conclusion, the urine PCA3 test has acceptable sensitivity and specificity for the diagnosis of prostate cancer and can be used as a non-invasive method for that purpose.

  3. An unsupervised MVA method to compare specific regions in human breast tumor tissue samples using ToF-SIMS.

    PubMed

    Bluestein, Blake M; Morrish, Fionnuala; Graham, Daniel J; Guenthoer, Jamie; Hockenbery, David; Porter, Peggy L; Gamble, Lara J

    2016-03-21

    Imaging time-of-flight secondary ion mass spectrometry (ToF-SIMS) and principal component analysis (PCA) were used to investigate two sets of pre- and post-chemotherapy human breast tumor tissue sections to characterize lipids associated with tumor metabolic flexibility and response to treatment. The micron spatial resolution imaging capability of ToF-SIMS provides a powerful approach to attain spatially-resolved molecular and cellular data from cancerous tissues not available with conventional imaging techniques. Three ca. 1 mm(2) areas per tissue section were analyzed by stitching together 200 μm × 200 μm raster area scans. A method to isolate and analyze specific tissue regions of interest by utilizing PCA of ToF-SIMS images is presented, which allowed separation of cellularized areas from stromal areas. These PCA-generated regions of interest were then used as masks to reconstruct representative spectra from specifically stromal or cellular regions. The advantage of this unsupervised selection method is a reduction in scatter in the spectral PCA results when compared to analyzing all tissue areas or analyzing areas highlighted by a pathologist. Utilizing this method, stromal and cellular regions of breast tissue biopsies taken pre- versus post-chemotherapy demonstrate chemical separation using negatively-charged ion species. In this sample set, the cellular regions were predominantly all cancer cells. Fatty acids (i.e. palmitic, oleic, and stearic), monoacylglycerols, diacylglycerols and vitamin E profiles were distinctively different between the pre- and post-therapy tissues. These results validate a new unsupervised method to isolate and interpret biochemically distinct regions in cancer tissues using imaging ToF-SIMS data. In addition, the method developed here can provide a framework to compare a variety of tissue samples using imaging ToF-SIMS, especially where there is section-to-section variability that makes it difficult to use a serial hematoxylin and eosin (H&E) stained section to direct the SIMS analysis.

  4. Estimation of human emotions using thermal facial information

    NASA Astrophysics Data System (ADS)

    Nguyen, Hung; Kotani, Kazunori; Chen, Fan; Le, Bac

    2014-01-01

    In recent years, research on human emotion estimation using thermal infrared (IR) imagery has appealed to many researchers due to its invariance to visible illumination changes. Although infrared imagery is superior to visible imagery in its invariance to illumination changes and appearance differences, it has difficulties in handling transparent glasses in the thermal infrared spectrum. As a result, when using infrared imagery for the analysis of human facial information, the regions of eyeglasses are dark and eyes' thermal information is not given. We propose a temperature space method to correct eyeglasses' effect using the thermal facial information in the neighboring facial regions, and then use Principal Component Analysis (PCA), Eigen-space Method based on class-features (EMC), and PCA-EMC method to classify human emotions from the corrected thermal images. We collected the Kotani Thermal Facial Emotion (KTFE) database and performed the experiments, which show the improved accuracy rate in estimating human emotions.

  5. [Research on outlier detection methods for determination of oil yield in oil shales using near-infrared spectroscopy].

    PubMed

    Zhang, Huai-zhu; Lin, Jun; Zhang, Huai-Zhu

    2014-06-01

    In the present paper, the outlier detection methods for determination of oil yield in oil shale using near-infrared (NIR) diffuse reflection spectroscopy was studied. During the quantitative analysis with near-infrared spectroscopy, environmental change and operator error will both produce outliers. The presence of outliers will affect the overall distribution trend of samples and lead to the decrease in predictive capability. Thus, the detection of outliers are important for the construction of high-quality calibration models. The methods including principal component analysis-Mahalanobis distance (PCA-MD) and resampling by half-means (RHM) were applied to the discrimination and elimination of outliers in this work. The thresholds and confidences for MD and RHM were optimized using the performance of partial least squares (PLS) models constructed after the elimination of outliers, respectively. Compared with the model constructed with the data of full spectrum, the values of RMSEP of the models constructed with the application of PCA-MD with a threshold of a value equal to the sum of average and standard deviation of MD, RHM with the confidence level of 85%, and the combination of PCA-MD and RHM, were reduced by 48.3%, 27.5% and 44.8%, respectively. The predictive ability of the calibration model has been improved effectively.

  6. Discriminating the Mineralogical Composition in Drill Cuttings Based on Absorption Spectra in the Terahertz Range.

    PubMed

    Miao, Xinyang; Li, Hao; Bao, Rima; Feng, Chengjing; Wu, Hang; Zhan, Honglei; Li, Yizhang; Zhao, Kun

    2017-02-01

    Understanding the geological units of a reservoir is essential to the development and management of the resource. In this paper, drill cuttings from several depths from an oilfield were studied using terahertz time domain spectroscopy (THz-TDS). Cluster analysis (CA) and principal component analysis (PCA) were employed to classify and analyze the cuttings. The cuttings were clearly classified based on CA and PCA methods, and the results were in agreement with the lithology. Moreover, calcite and dolomite have stronger absorption of a THz pulse than any other minerals, based on an analysis of the PC1 scores. Quantitative analyses of minor minerals were also realized by building a series of linear and non-linear models between contents and PC2 scores. The results prove THz technology to be a promising means for determining reservoir lithology as well as other properties, which will be a significant supplementary method in oil fields.

  7. Score-moment combined linear discrimination analysis (SMC-LDA) as an improved discrimination method.

    PubMed

    Han, Jintae; Chung, Hoeil; Han, Sung-Hwan; Yoon, Moon-Young

    2007-01-01

    A new discrimination method called the score-moment combined linear discrimination analysis (SMC-LDA) has been developed and its performance has been evaluated using three practical spectroscopic datasets. The key concept of SMC-LDA was to use not only the score from principal component analysis (PCA), but also the moment of the spectrum, as inputs for LDA to improve discrimination. Along with conventional score, moment is used in spectroscopic fields as an effective alternative for spectral feature representation. Three different approaches were considered. Initially, the score generated from PCA was projected onto a two-dimensional feature space by maximizing Fisher's criterion function (conventional PCA-LDA). Next, the same procedure was performed using only moment. Finally, both score and moment were utilized simultaneously for LDA. To evaluate discrimination performances, three different spectroscopic datasets were employed: (1) infrared (IR) spectra of normal and malignant stomach tissue, (2) near-infrared (NIR) spectra of diesel and light gas oil (LGO) and (3) Raman spectra of Chinese and Korean ginseng. For each case, the best discrimination results were achieved when both score and moment were used for LDA (SMC-LDA). Since the spectral representation character of moment was different from that of score, inclusion of both score and moment for LDA provided more diversified and descriptive information.

  8. Non-linear principal component analysis applied to Lorenz models and to North Atlantic SLP

    NASA Astrophysics Data System (ADS)

    Russo, A.; Trigo, R. M.

    2003-04-01

    A non-linear generalisation of Principal Component Analysis (PCA), denoted Non-Linear Principal Component Analysis (NLPCA), is introduced and applied to the analysis of three data sets. Non-Linear Principal Component Analysis allows for the detection and characterisation of low-dimensional non-linear structure in multivariate data sets. This method is implemented using a 5-layer feed-forward neural network introduced originally in the chemical engineering literature (Kramer, 1991). The method is described and details of its implementation are addressed. Non-Linear Principal Component Analysis is first applied to a data set sampled from the Lorenz attractor (1963). It is found that the NLPCA approximations are more representative of the data than are the corresponding PCA approximations. The same methodology was applied to the less known Lorenz attractor (1984). However, the results obtained weren't as good as those attained with the famous 'Butterfly' attractor. Further work with this model is underway in order to assess if NLPCA techniques can be more representative of the data characteristics than are the corresponding PCA approximations. The application of NLPCA to relatively 'simple' dynamical systems, such as those proposed by Lorenz, is well understood. However, the application of NLPCA to a large climatic data set is much more challenging. Here, we have applied NLPCA to the sea level pressure (SLP) field for the entire North Atlantic area and the results show a slight imcrement of explained variance associated. Finally, directions for future work are presented.%}

  9. An extended data mining method for identifying differentially expressed assay-specific signatures in functional genomic studies.

    PubMed

    Rollins, Derrick K; Teh, Ailing

    2010-12-17

    Microarray data sets provide relative expression levels for thousands of genes for a small number, in comparison, of different experimental conditions called assays. Data mining techniques are used to extract specific information of genes as they relate to the assays. The multivariate statistical technique of principal component analysis (PCA) has proven useful in providing effective data mining methods. This article extends the PCA approach of Rollins et al. to the development of ranking genes of microarray data sets that express most differently between two biologically different grouping of assays. This method is evaluated on real and simulated data and compared to a current approach on the basis of false discovery rate (FDR) and statistical power (SP) which is the ability to correctly identify important genes. This work developed and evaluated two new test statistics based on PCA and compared them to a popular method that is not PCA based. Both test statistics were found to be effective as evaluated in three case studies: (i) exposing E. coli cells to two different ethanol levels; (ii) application of myostatin to two groups of mice; and (iii) a simulated data study derived from the properties of (ii). The proposed method (PM) effectively identified critical genes in these studies based on comparison with the current method (CM). The simulation study supports higher identification accuracy for PM over CM for both proposed test statistics when the gene variance is constant and for one of the test statistics when the gene variance is non-constant. PM compares quite favorably to CM in terms of lower FDR and much higher SP. Thus, PM can be quite effective in producing accurate signatures from large microarray data sets for differential expression between assays groups identified in a preliminary step of the PCA procedure and is, therefore, recommended for use in these applications.

  10. Differentiation of the two major species of Echinacea (E. augustifolia and E. purpurea) using a flow injection mass spectrometric (FIMS) fingerprinting method and chemometric analysis

    USDA-ARS?s Scientific Manuscript database

    A rapid, simple, and reliable flow-injection mass spectrometric (FIMS) method was developed to discriminate two major Echinacea species (E. purpurea and E. angustifolia) samples. Fifty-eight Echinacea samples collected from United States were analyzed using FIMS. Principle component analysis (PCA) a...

  11. A better understanding of long-range temporal dependence of traffic flow time series

    NASA Astrophysics Data System (ADS)

    Feng, Shuo; Wang, Xingmin; Sun, Haowei; Zhang, Yi; Li, Li

    2018-02-01

    Long-range temporal dependence is an important research perspective for modelling of traffic flow time series. Various methods have been proposed to depict the long-range temporal dependence, including autocorrelation function analysis, spectral analysis and fractal analysis. However, few researches have studied the daily temporal dependence (i.e. the similarity between different daily traffic flow time series), which can help us better understand the long-range temporal dependence, such as the origin of crossover phenomenon. Moreover, considering both types of dependence contributes to establishing more accurate model and depicting the properties of traffic flow time series. In this paper, we study the properties of daily temporal dependence by simple average method and Principal Component Analysis (PCA) based method. Meanwhile, we also study the long-range temporal dependence by Detrended Fluctuation Analysis (DFA) and Multifractal Detrended Fluctuation Analysis (MFDFA). The results show that both the daily and long-range temporal dependence exert considerable influence on the traffic flow series. The DFA results reveal that the daily temporal dependence creates crossover phenomenon when estimating the Hurst exponent which depicts the long-range temporal dependence. Furthermore, through the comparison of the DFA test, PCA-based method turns out to be a better method to extract the daily temporal dependence especially when the difference between days is significant.

  12. [Determination of the Plant Origin of Licorice Oil Extract, a Natural Food Additive, by Principal Component Analysis Based on Chemical Components].

    PubMed

    Tada, Atsuko; Ishizuki, Kyoko; Sugimoto, Naoki; Yoshimatsu, Kayo; Kawahara, Nobuo; Suematsu, Takako; Arifuku, Kazunori; Fukai, Toshio; Tamura, Yukiyoshi; Ohtsuki, Takashi; Tahara, Maiko; Yamazaki, Takeshi; Akiyama, Hiroshi

    2015-01-01

    "Licorice oil extract" (LOE) (antioxidant agent) is described in the notice of Japanese food additive regulations as a material obtained from the roots and/or rhizomes of Glycyrrhiza uralensis, G. inflata or G. glabra. In this study, we aimed to identify the original Glycyrrhiza species of eight food additive products using LC/MS. Glabridin, a characteristic compound in G. glabra, was specifically detected in seven products, and licochalcone A, a characteristic compound in G. inflata, was detected in one product. In addition, Principal Component Analysis (PCA) (a kind of multivariate analysis) using the data of LC/MS or (1)H-NMR analysis was performed. The data of thirty-one samples, including LOE products used as food additives, ethanol extracts of various Glycyrrhiza species and commercially available Glycyrrhiza species-derived products were assessed. Based on the PCA results, the majority of LOE products was confirmed to be derived from G. glabra. This study suggests that PCA using (1)H-NMR analysis data is a simple and useful method to identify the plant species of origin of natural food additive products.

  13. Discrimination among Panax species using spectral fingerprinting

    USDA-ARS?s Scientific Manuscript database

    Spectral fingerprints of samples of three Panax species (P. quinquefolius L., P. ginseng, and P. notoginseng) were acquired using UV, NIR, and MS spectrometry. With principal components analysis (PCA), all three methods allowed visual discrimination between all three species. All three methods wer...

  14. Elastic Versus Rigid Image Registration in Magnetic Resonance Imaging-transrectal Ultrasound Fusion Prostate Biopsy: A Systematic Review and Meta-analysis.

    PubMed

    Venderink, Wulphert; de Rooij, Maarten; Sedelaar, J P Michiel; Huisman, Henkjan J; Fütterer, Jurgen J

    2016-07-29

    The main difference between the available magnetic resonance imaging-transrectal ultrasound (MRI-TRUS) fusion platforms for prostate biopsy is the method of image registration being either rigid or elastic. As elastic registration compensates for possible deformation caused by the introduction of an ultrasound probe for example, it is expected that it would perform better than rigid registration. The aim of this meta-analysis is to compare rigid with elastic registration by calculating the detection odds ratio (OR) for both subgroups. The detection OR is defined as the ratio of the odds of detecting clinically significant prostate cancer (csPCa) by MRI-TRUS fusion biopsy compared with systematic TRUS biopsy. Secondary objectives were the OR for any PCa and the OR after pooling both registration techniques. The electronic databases PubMed, Embase, and Cochrane were systematically searched for relevant studies according to the Preferred Reporting Items for Systematic Review and Meta-analysis Statement. Studies comparing MRI-TRUS fusion and systematic TRUS-guided biopsies in the same patient were included. The quality assessment of included studies was performed using the Quality Assessment of Diagnostic Accuracy Studies version 2. Eleven papers describing elastic and 10 describing rigid registration were included. Meta-analysis showed an OR of csPCa for elastic and rigid registration of 1.45 (95% confidence interval [CI]: 1.21-1.73, p<0.0001) and 1.40 (95% CI: 1.13-1.75, p=0.002), respectively. No significant difference was seen between the subgroups (p=0.83). Pooling subgroups resulted in an OR of 1.43 (95% CI: 1.25-1.63, p<0.00001). No significant difference was identified between rigid and elastic registration for MRI-TRUS fusion-guided biopsy in the detection of csPCa; however, both techniques detected more csPCa than TRUS-guided biopsy alone. We did not identify any significant differences in prostate cancer detection between two distinct magnetic resonance imaging-transrectal ultrasound fusion systems which vary in their method of compensating for prostate deformation. Copyright © 2016 European Association of Urology. Published by Elsevier B.V. All rights reserved.

  15. Chemometrics-based Approach in Analysis of Arnicae flos

    PubMed Central

    Zheleva-Dimitrova, Dimitrina Zh.; Balabanova, Vessela; Gevrenova, Reneta; Doichinova, Irini; Vitkova, Antonina

    2015-01-01

    Introduction: Arnica montana flowers have a long history as herbal medicines for external use on injuries and rheumatic complaints. Objective: To investigate Arnicae flos of cultivated accessions from Bulgaria, Poland, Germany, Finland, and Pharmacy store for phenolic derivatives and sesquiterpene lactones (STLs). Materials and Methods: Samples of Arnica from nine origins were prepared by ultrasound-assisted extraction with 80% methanol for phenolic compounds analysis. Subsequent reverse-phase high-performance liquid chromatography (HPLC) separation of the analytes was performed using gradient elution and ultraviolet detection at 280 and 310 nm (phenolic acids), and 360 nm (flavonoids). Total STLs were determined in chloroform extracts by solid-phase extraction-HPLC at 225 nm. The HPLC generated chromatographic data were analyzed using principal component analysis (PCA) and hierarchical clustering (HC). Results: The highest total amount of phenolic acids was found in the sample from Botanical Garden at Joensuu University, Finland (2.36 mg/g dw). Astragalin, isoquercitrin, and isorhamnetin 3-glucoside were the main flavonol glycosides being present up to 3.37 mg/g (astragalin). Three well-defined clusters were distinguished by PCA and HC. Cluster C1 comprised of the German and Finnish accessions characterized by the highest content of flavonols. Cluster C2 included the Bulgarian and Polish samples presenting a low content of flavonoids. Cluster C3 consisted only of one sample from a pharmacy store. Conclusion: A validated HPLC method for simultaneous determination of phenolic acids, flavonoid glycosides, and aglycones in A. montana flowers was developed. The PCA loading plot showed that quercetin, kaempferol, and isorhamnetin can be used to distinguish different Arnica accessions. SUMMARY A principal component analysis (PCA) on 13 phenolic compounds and total amount of sesquiterpene lactones in Arnicae flos collection tended to cluster the studied 9 accessions into three main groups. The profiles obtained demonstrated that the samples from Germany and Finland are characterized by greater amounts of phenolic derivatives than the Bulgarian and Polish ones. The PCA loading plot showed that quercetin, kaemferol and isorhamnetin can be used to distinguish different arnica accessions. PMID:27013791

  16. [Patient-controlled Analgesia (PCA): an Overview About Methods, Handling and New Modalities].

    PubMed

    Abrolat, Marie; Eberhart, Leopold H J; Kalmus, Gerald; Koch, Tilo; Nardi-Hiebl, Stefan

    2018-04-01

    Patient-controlled analgesia (PCA) is one of the well established methods for the treatment of postoperative pain. A cochrane-review concluded that PCA is associated with better postoperative pain ratings and improved patient-satifaction compared to traditional way of administering opioids. Some prerequisites concerning patient selection, education of the patient and the medical staff, and supervision during PCA therapy are mandatory for a safe use of PCA. Current PCA modalities (intravenous and epidural routes of application) are expanded by newer, less invasive routes of drug administration, e.g. by the iontophoretic transdermal and the sublingual route. Their role in improving safety and the quality of pain therapy on the one hand side, and costs on the other hand side are discussion. Georg Thieme Verlag KG Stuttgart · New York.

  17. Evaluation of Parallel Analysis Methods for Determining the Number of Factors

    ERIC Educational Resources Information Center

    Crawford, Aaron V.; Green, Samuel B.; Levy, Roy; Lo, Wen-Juo; Scott, Lietta; Svetina, Dubravka; Thompson, Marilyn S.

    2010-01-01

    Population and sample simulation approaches were used to compare the performance of parallel analysis using principal component analysis (PA-PCA) and parallel analysis using principal axis factoring (PA-PAF) to identify the number of underlying factors. Additionally, the accuracies of the mean eigenvalue and the 95th percentile eigenvalue criteria…

  18. Removal of BCG artefact from concurrent fMRI-EEG recordings based on EMD and PCA.

    PubMed

    Javed, Ehtasham; Faye, Ibrahima; Malik, Aamir Saeed; Abdullah, Jafri Malin

    2017-11-01

    Simultaneous electroencephalography (EEG) and functional magnetic resonance image (fMRI) acquisitions provide better insight into brain dynamics. Some artefacts due to simultaneous acquisition pose a threat to the quality of the data. One such problematic artefact is the ballistocardiogram (BCG) artefact. We developed a hybrid algorithm that combines features of empirical mode decomposition (EMD) with principal component analysis (PCA) to reduce the BCG artefact. The algorithm does not require extra electrocardiogram (ECG) or electrooculogram (EOG) recordings to extract the BCG artefact. The method was tested with both simulated and real EEG data of 11 participants. From the simulated data, the similarity index between the extracted BCG and the simulated BCG showed the effectiveness of the proposed method in BCG removal. On the other hand, real data were recorded with two conditions, i.e. resting state (eyes closed dataset) and task influenced (event-related potentials (ERPs) dataset). Using qualitative (visual inspection) and quantitative (similarity index, improved normalized power spectrum (INPS) ratio, power spectrum, sample entropy (SE)) evaluation parameters, the assessment results showed that the proposed method can efficiently reduce the BCG artefact while preserving the neuronal signals. Compared with conventional methods, namely, average artefact subtraction (AAS), optimal basis set (OBS) and combined independent component analysis and principal component analysis (ICA-PCA), the statistical analyses of the results showed that the proposed method has better performance, and the differences were significant for all quantitative parameters except for the power and sample entropy. The proposed method does not require any reference signal, prior information or assumption to extract the BCG artefact. It will be very useful in circumstances where the reference signal is not available. Copyright © 2017 Elsevier B.V. All rights reserved.

  19. Application of FT-IR spectroscopy on breast cancer serum analysis

    NASA Astrophysics Data System (ADS)

    Elmi, Fatemeh; Movaghar, Afshin Fayyaz; Elmi, Maryam Mitra; Alinezhad, Heshmatollah; Nikbakhsh, Novin

    2017-12-01

    Breast cancer is regarded as the most malignant tumor among women throughout the world. Therefore, early detection and proper diagnostic methods have been known to help save women's lives. Fourier Transform Infrared (FT-IR) spectroscopy, coupled with PCA-LDA analysis, is a new technique to investigate the characteristics of serum in breast cancer. In this study, 43 breast cancer and 43 healthy serum samples were collected, and the FT-IR spectra were recorded for each one. Then, PCA analysis and linear discriminant analysis (LDA) were used to analyze the spectral data. The results showed that there were differences between the spectra of the two groups. Discriminating wavenumbers were associated with several spectral differences over the 950-1200 cm- 1(sugar), 1190-1350 cm- 1 (collagen), 1475-1710 cm- 1 (protein), 1710-1760 cm- 1 (ester), 2800-3000 cm- 1 (stretching motions of -CH2 & -CH3), and 3090-3700 cm- 1 (NH stretching) regions. PCA-LDA performance on serum IR could recognize changes between the control and the breast cancer cases. The diagnostic accuracy, sensitivity, and specificity of PCA-LDA analysis for 3000-3600 cm- 1 (NH stretching) were found to be 83%, 84%, 74% for the control and 80%, 76%, 72% for the breast cancer cases, respectively. The results showed that the major spectral differences between the two groups were related to the differences in protein conformation in serum samples. It can be concluded that FT-IR spectroscopy, together with multivariate data analysis, is able to discriminate between breast cancer and healthy serum samples.

  20. Using principal component analysis and annual seasonal trend analysis to assess karst rocky desertification in southwestern China.

    PubMed

    Zhang, Zhiming; Ouyang, Zhiyun; Xiao, Yi; Xiao, Yang; Xu, Weihua

    2017-06-01

    Increasing exploitation of karst resources is causing severe environmental degradation because of the fragility and vulnerability of karst areas. By integrating principal component analysis (PCA) with annual seasonal trend analysis (ASTA), this study assessed karst rocky desertification (KRD) within a spatial context. We first produced fractional vegetation cover (FVC) data from a moderate-resolution imaging spectroradiometer normalized difference vegetation index using a dimidiate pixel model. Then, we generated three main components of the annual FVC data using PCA. Subsequently, we generated the slope image of the annual seasonal trends of FVC using median trend analysis. Finally, we combined the three PCA components and annual seasonal trends of FVC with the incidence of KRD for each type of carbonate rock to classify KRD into one of four categories based on K-means cluster analysis: high, moderate, low, and none. The results of accuracy assessments indicated that this combination approach produced greater accuracy and more reasonable KRD mapping than the average FVC based on the vegetation coverage standard. The KRD map for 2010 indicated that the total area of KRD was 78.76 × 10 3  km 2 , which constitutes about 4.06% of the eight southwest provinces of China. The largest KRD areas were found in Yunnan province. The combined PCA and ASTA approach was demonstrated to be an easily implemented, robust, and flexible method for the mapping and assessment of KRD, which can be used to enhance regional KRD management schemes or to address assessment of other environmental issues.

  1. Optimization of sol-gel technique for coating of metallic substrates by hydroxyapatite using the Taguchi method

    NASA Astrophysics Data System (ADS)

    Pourbaghi-Masouleh, M.; Asgharzadeh, H.

    2013-08-01

    In this study, the Taguchi method of design of experiment (DOE) was used to optimize the hydroxyapatite (HA) coatings on various metallic substrates deposited by sol-gel dip-coating technique. The experimental design consisted of five factors including substrate material (A), surface preparation of substrate (B), dipping/withdrawal speed (C), number of layers (D), and calcination temperature (E) with three levels of each factor. An orthogonal array of L18 type with mixed levels of the control factors was utilized. The image processing of the micrographs of the coatings was conducted to determine the percentage of coated area ( PCA). Chemical and phase composition of HA coatings were studied by XRD, FT-IR, SEM, and EDS techniques. The analysis of variance (ANOVA) indicated that the PCA of HA coatings was significantly affected by the calcination temperature. The optimum conditions from signal-to-noise ( S/N) ratio analysis were A: pure Ti, B: polishing and etching for 24 h, C: 50 cm min-1, D: 1, and E: 300 °C. In the confirmation experiment using the optimum conditions, the HA coating with high PCA of 98.5 % was obtained.

  2. Impact of PCA Strategies on Pain Intensity and Functional Assessment Measures in Adults with Sickle Cell Disease during Hospitalized Vaso-Occlusive Episodes

    PubMed Central

    Dampier, Carlton D.; Wager, Carrie G.; Harrison, Ryan; Hsu, Lewis L.; Minniti, Caterina P.; Smith, Wally R.

    2012-01-01

    Clinical trials of sickle cell disease (SCD) pain treatment usually observe only small decrements in pain intensity during the course of hospitalization. Sub-optimal analgesic management and inadequate pain assessment methods are possible explanations for these findings. In a search for better methods for assessing inpatient SCD pain in adults, we examined several pain intensity and interference measures in both arms of a randomized controlled trial comparing two different opioid PCA therapies. Based upon longitudinal analysis of pain episodes, we found that scores from daily average Visual Analogue Scales (VAS) and several other measures, especially the Brief Pain Inventory (BPI), were sensitive to change in daily improvements in pain intensity associated with resolution of vaso-occlusive pain. In this preliminary trial, the low demand, high basal infusion (LDHI) strategy demonstrated faster, larger improvements in various measures of pain than the high demand, low basal infusion (HDLI) strategy for opioid PCA dosing, however, verification in larger studies is required. The measures and statistical approaches used in this analysis may facilitate design, reduce sample size, and improve analyses of treatment response in future SCD clinical trials of vaso-occlusive episodes. PMID:22886853

  3. The Application of Principal Component Analysis Using Fixed Eigenvectors to the Infrared Thermographic Inspection of the Space Shuttle Thermal Protection System

    NASA Technical Reports Server (NTRS)

    Cramer, K. Elliott; Winfree, William P.

    2006-01-01

    The Nondestructive Evaluation Sciences Branch at NASA s Langley Research Center has been actively involved in the development of thermographic inspection techniques for more than 15 years. Since the Space Shuttle Columbia accident, NASA has focused on the improvement of advanced NDE techniques for the Reinforced Carbon-Carbon (RCC) panels that comprise the orbiter s wing leading edge. Various nondestructive inspection techniques have been used in the examination of the RCC, but thermography has emerged as an effective inspection alternative to more traditional methods. Thermography is a non-contact inspection method as compared to ultrasonic techniques which typically require the use of a coupling medium between the transducer and material. Like radiographic techniques, thermography can be used to inspect large areas, but has the advantage of minimal safety concerns and the ability for single-sided measurements. Principal Component Analysis (PCA) has been shown effective for reducing thermographic NDE data. A typical implementation of PCA is when the eigenvectors are generated from the data set being analyzed. Although it is a powerful tool for enhancing the visibility of defects in thermal data, PCA can be computationally intense and time consuming when applied to the large data sets typical in thermography. Additionally, PCA can experience problems when very large defects are present (defects that dominate the field-of-view), since the calculation of the eigenvectors is now governed by the presence of the defect, not the good material. To increase the processing speed and to minimize the negative effects of large defects, an alternative method of PCA is being pursued when a fixed set of eigenvectors is used to process the thermal data from the RCC materials. These eigen vectors can be generated either from an analytic model of the thermal response of the material under examination, or from a large cross section of experimental data. This paper will provide the details of the analytic model; an overview of the PCA process; as well as a quantitative signal-to-noise comparison of the results of performing both embodiments of PCA on thermographic data from various RCC specimens. Details of a system that has been developed to allow insitu inspection of a majority of shuttle RCC components will be presented along with the acceptance test results for this system. Additionally, the results of applying this technology to the Space Shuttle Discovery after its return from flight will be presented.

  4. Biomarker microRNAs for prostate cancer metastasis: screened with a network vulnerability analysis model.

    PubMed

    Lin, Yuxin; Chen, Feifei; Shen, Li; Tang, Xiaoyu; Du, Cui; Sun, Zhandong; Ding, Huijie; Chen, Jiajia; Shen, Bairong

    2018-05-21

    Prostate cancer (PCa) is a fatal malignant tumor among males in the world and the metastasis is a leading cause for PCa death. Biomarkers are therefore urgently needed to detect PCa metastatic signature at the early time. MicroRNAs are small non-coding RNAs with the potential to be biomarkers for disease prediction. In addition, computer-aided biomarker discovery is now becoming an attractive paradigm for precision diagnosis and prognosis of complex diseases. In this study, we identified key microRNAs as biomarkers for predicting PCa metastasis based on network vulnerability analysis. We first extracted microRNAs and mRNAs that were differentially expressed between primary PCa and metastatic PCa (MPCa) samples. Then we constructed the MPCa-specific microRNA-mRNA network and screened microRNA biomarkers by a novel bioinformatics model. The model emphasized the characterization of systems stability changes and the network vulnerability with three measurements, i.e. the structurally single-line regulation, the functional importance of microRNA targets and the percentage of transcription factor genes in microRNA unique targets. With this model, we identified five microRNAs as putative biomarkers for PCa metastasis. Among them, miR-101-3p and miR-145-5p have been previously reported as biomarkers for PCa metastasis and the remaining three, i.e. miR-204-5p, miR-198 and miR-152, were screened as novel biomarkers for PCa metastasis. The results were further confirmed by the assessment of their predictive power and biological function analysis. Five microRNAs were identified as candidate biomarkers for predicting PCa metastasis based on our network vulnerability analysis model. The prediction performance, literature exploration and functional enrichment analysis convinced our findings. This novel bioinformatics model could be applied to biomarker discovery for other complex diseases.

  5. Immunoseroproteomic Profiling in African American Men with Prostate Cancer: Evidence for an Autoantibody Response to Glycolysis and Plasminogen-Associated Proteins*

    PubMed Central

    Sanchez, Tino W.; Zhang, Guangyu; Li, Jitian; Dai, Liping; Mirshahidi, Saied; Wall, Nathan R.; Yates, Clayton; Wilson, Colwick; Montgomery, Susanne; Zhang, Jian-Ying; Casiano, Carlos A.

    2016-01-01

    African American (AA) men suffer from a disproportionately high incidence and mortality of prostate cancer (PCa) compared with other racial/ethnic groups. Despite these disparities, African American men are underrepresented in clinical trials and in studies on PCa biology and biomarker discovery. We used immunoseroproteomics to profile antitumor autoantibody responses in AA and European American (EA) men with PCa, and explored differences in these responses. This minimally invasive approach detects autoantibodies to tumor-associated antigens that could serve as clinical biomarkers and immunotherapeutic agents. Sera from AA and EA men with PCa were probed by immunoblotting against PC3 cell proteins, with AA sera showing stronger immunoreactivity. Mass spectrometry analysis of immunoreactive protein spots revealed that several AA sera contained autoantibodies to a number of proteins associated with both the glycolysis and plasminogen pathways, particularly to alpha-enolase (ENO1). The proteomic data is deposited in ProteomeXchange with identifier PXD003968. Analysis of sera from 340 racially diverse men by enzyme-linked immunosorbent assays (ELISA) showed higher frequency of anti-ENO1 autoantibodies in PCa sera compared with control sera. We observed differences between AA-PCa and EA-PCa patients in their immunoreactivity against ENO1. Although EA-PCa sera reacted with higher frequency against purified ENO1 in ELISA and recognized by immunoblotting the endogenous cellular ENO1 across a panel of prostate cell lines, AA-PCa sera reacted weakly against this protein by ELISA but recognized it by immunoblotting preferentially in metastatic cell lines. These race-related differences in immunoreactivity to ENO1 could not be accounted by differential autoantibody recognition of phosphoepitopes within this antigen. Proteomic analysis revealed differences in the posttranslational modification profiles of ENO1 variants differentially recognized by AA-PCa and EA-PCa sera. These intriguing results suggest the possibility of race-related differences in the antitumor autoantibody response in PCa, and have implications for defining novel biological determinants of PCa health disparities. PMID:27742740

  6. Lycopene and Risk of Prostate Cancer

    PubMed Central

    Chen, Ping; Zhang, Wenhao; Wang, Xiao; Zhao, Keke; Negi, Devendra Singh; Zhuo, Li; Qi, Mao; Wang, Xinghuan; Zhang, Xinhua

    2015-01-01

    Abstract Prostate cancer (PCa) is a common illness for aging males. Lycopene has been identified as an antioxidant agent with potential anticancer properties. Studies investigating the relation between lycopene and PCa risk have produced inconsistent results. This study aims to determine dietary lycopene consumption/circulating concentration and any potential dose–response associations with the risk of PCa. Eligible studies published in English up to April 10, 2014, were searched and identified from Pubmed, Sciencedirect Online, Wiley online library databases and hand searching. The STATA (version 12.0) was applied to process the dose–response meta-analysis. Random effects models were used to calculate pooled relative risks (RRs) and 95% confidence intervals (CIs) and to incorporate variation between studies. The linear and nonlinear dose–response relations were evaluated with data from categories of lycopene consumption/circulating concentrations. Twenty-six studies were included with 17,517 cases of PCa reported from 563,299 participants. Although inverse association between lycopene consumption and PCa risk was not found in all studies, there was a trend that with higher lycopene intake, there was reduced incidence of PCa (P = 0.078). Removal of one Chinese study in sensitivity analysis, or recalculation using data from only high-quality studies for subgroup analysis, indicated that higher lycopene consumption significantly lowered PCa risk. Furthermore, our dose–response meta-analysis demonstrated that higher lycopene consumption was linearly associated with a reduced risk of PCa with a threshold between 9 and 21 mg/day. Consistently, higher circulating lycopene levels significantly reduced the risk of PCa. Interestingly, the concentration of circulating lycopene between 2.17 and 85 μg/dL was linearly inversed with PCa risk whereas there was no linear association >85 μg/dL. In addition, greater efficacy for the circulating lycopene concentration on preventing PCa was found for studies with high quality, follow-up >10 years and where results were adjusted by the age or the body mass index. In conclusion, our novel data demonstrates that higher lycopene consumption/circulating concentration is associated with a lower risk of PCa. However, further studies are required to determine the mechanism by which lycopene reduces the risk of PCa and if there are other factors in tomato products that might potentially decrease PCa risk and progression. PMID:26287411

  7. Multiresolution generalized N dimension PCA for ultrasound image denoising

    PubMed Central

    2014-01-01

    Background Ultrasound images are usually affected by speckle noise, which is a type of random multiplicative noise. Thus, reducing speckle and improving image visual quality are vital to obtaining better diagnosis. Method In this paper, a novel noise reduction method for medical ultrasound images, called multiresolution generalized N dimension PCA (MR-GND-PCA), is presented. In this method, the Gaussian pyramid and multiscale image stacks on each level are built first. GND-PCA as a multilinear subspace learning method is used for denoising. Each level is combined to achieve the final denoised image based on Laplacian pyramids. Results The proposed method is tested with synthetically speckled and real ultrasound images, and quality evaluation metrics, including MSE, SNR and PSNR, are used to evaluate its performance. Conclusion Experimental results show that the proposed method achieved the lowest noise interference and improved image quality by reducing noise and preserving the structure. Our method is also robust for the image with a much higher level of speckle noise. For clinical images, the results show that MR-GND-PCA can reduce speckle and preserve resolvable details. PMID:25096917

  8. The Application of Infrared Thermographic Inspection Techniques to the Space Shuttle Thermal Protection System

    NASA Technical Reports Server (NTRS)

    Cramer, K. E.; Winfree, W. P.

    2005-01-01

    The Nondestructive Evaluation Sciences Branch at NASA s Langley Research Center has been actively involved in the development of thermographic inspection techniques for more than 15 years. Since the Space Shuttle Columbia accident, NASA has focused on the improvement of advanced NDE techniques for the Reinforced Carbon-Carbon (RCC) panels that comprise the orbiter s wing leading edge. Various nondestructive inspection techniques have been used in the examination of the RCC, but thermography has emerged as an effective inspection alternative to more traditional methods. Thermography is a non-contact inspection method as compared to ultrasonic techniques which typically require the use of a coupling medium between the transducer and material. Like radiographic techniques, thermography can be used to inspect large areas, but has the advantage of minimal safety concerns and the ability for single-sided measurements. Principal Component Analysis (PCA) has been shown effective for reducing thermographic NDE data. A typical implementation of PCA is when the eigenvectors are generated from the data set being analyzed. Although it is a powerful tool for enhancing the visibility of defects in thermal data, PCA can be computationally intense and time consuming when applied to the large data sets typical in thermography. Additionally, PCA can experience problems when very large defects are present (defects that dominate the field-of-view), since the calculation of the eigenvectors is now governed by the presence of the defect, not the "good" material. To increase the processing speed and to minimize the negative effects of large defects, an alternative method of PCA is being pursued where a fixed set of eigenvectors, generated from an analytic model of the thermal response of the material under examination, is used to process the thermal data from the RCC materials. Details of a one-dimensional analytic model and a two-dimensional finite-element model will be presented. An overview of the PCA process as well as a quantitative signal-to-noise comparison of the results of performing both embodiments of PCA on thermographic data from various RCC specimens will be shown. Finally, a number of different applications of this technology to various RCC components will be presented.

  9. [An improved low spectral distortion PCA fusion method].

    PubMed

    Peng, Shi; Zhang, Ai-Wu; Li, Han-Lun; Hu, Shao-Xing; Meng, Xian-Gang; Sun, Wei-Dong

    2013-10-01

    Aiming at the spectral distortion produced in PCA fusion process, the present paper proposes an improved low spectral distortion PCA fusion method. This method uses NCUT (normalized cut) image segmentation algorithm to make a complex hyperspectral remote sensing image into multiple sub-images for increasing the separability of samples, which can weaken the spectral distortions of traditional PCA fusion; Pixels similarity weighting matrix and masks were produced by using graph theory and clustering theory. These masks are used to cut the hyperspectral image and high-resolution image into some sub-region objects. All corresponding sub-region objects between the hyperspectral image and high-resolution image are fused by using PCA method, and all sub-regional integration results are spliced together to produce a new image. In the experiment, Hyperion hyperspectral data and Rapid Eye data were used. And the experiment result shows that the proposed method has the same ability to enhance spatial resolution and greater ability to improve spectral fidelity performance.

  10. Extracting spectral contrast in Landsat Thematic Mapper image data using selective principal component analysis

    USGS Publications Warehouse

    Chavez, P.S.; Kwarteng, A.Y.

    1989-01-01

    A challenge encountered with Landsat Thematic Mapper (TM) data, which includes data from size reflective spectral bands, is displaying as much information as possible in a three-image set for color compositing or digital analysis. Principal component analysis (PCA) applied to the six TM bands simultaneously is often used to address this problem. However, two problems that can be encountered using the PCA method are that information of interest might be mathematically mapped to one of the unused components and that a color composite can be difficult to interpret. "Selective' PCA can be used to minimize both of these problems. The spectral contrast among several spectral regions was mapped for a northern Arizona site using Landsat TM data. Field investigations determined that most of the spectral contrast seen in this area was due to one of the following: the amount of iron and hematite in the soils and rocks, vegetation differences, standing and running water, or the presence of gypsum, which has a higher moisture retention capability than do the surrounding soils and rocks. -from Authors

  11. Stability of Nonlinear Principal Components Analysis: An Empirical Study Using the Balanced Bootstrap

    ERIC Educational Resources Information Center

    Linting, Marielle; Meulman, Jacqueline J.; Groenen, Patrick J. F.; van der Kooij, Anita J.

    2007-01-01

    Principal components analysis (PCA) is used to explore the structure of data sets containing linearly related numeric variables. Alternatively, nonlinear PCA can handle possibly nonlinearly related numeric as well as nonnumeric variables. For linear PCA, the stability of its solution can be established under the assumption of multivariate…

  12. A PCA-Based method for determining craniofacial relationship and sexual dimorphism of facial shapes.

    PubMed

    Shui, Wuyang; Zhou, Mingquan; Maddock, Steve; He, Taiping; Wang, Xingce; Deng, Qingqiong

    2017-11-01

    Previous studies have used principal component analysis (PCA) to investigate the craniofacial relationship, as well as sex determination using facial factors. However, few studies have investigated the extent to which the choice of principal components (PCs) affects the analysis of craniofacial relationship and sexual dimorphism. In this paper, we propose a PCA-based method for visual and quantitative analysis, using 140 samples of 3D heads (70 male and 70 female), produced from computed tomography (CT) images. There are two parts to the method. First, skull and facial landmarks are manually marked to guide the model's registration so that dense corresponding vertices occupy the same relative position in every sample. Statistical shape spaces of the skull and face in dense corresponding vertices are constructed using PCA. Variations in these vertices, captured in every principal component (PC), are visualized to observe shape variability. The correlations of skull- and face-based PC scores are analysed, and linear regression is used to fit the craniofacial relationship. We compute the PC coefficients of a face based on this craniofacial relationship and the PC scores of a skull, and apply the coefficients to estimate a 3D face for the skull. To evaluate the accuracy of the computed craniofacial relationship, the mean and standard deviation of every vertex between the two models are computed, where these models are reconstructed using real PC scores and coefficients. Second, each PC in facial space is analysed for sex determination, for which support vector machines (SVMs) are used. We examined the correlation between PCs and sex, and explored the extent to which the choice of PCs affects the expression of sexual dimorphism. Our results suggest that skull- and face-based PCs can be used to describe the craniofacial relationship and that the accuracy of the method can be improved by using an increased number of face-based PCs. The results show that the accuracy of the sex classification is related to the choice of PCs. The highest sex classification rate is 91.43% using our method. Copyright © 2017 Elsevier Ltd. All rights reserved.

  13. [Vis-NIR spectroscopic pattern recognition combined with SG smoothing applied to breed screening of transgenic sugarcane].

    PubMed

    Liu, Gui-Song; Guo, Hao-Song; Pan, Tao; Wang, Ji-Hua; Cao, Gan

    2014-10-01

    Based on Savitzky-Golay (SG) smoothing screening, principal component analysis (PCA) combined with separately supervised linear discriminant analysis (LDA) and unsupervised hierarchical clustering analysis (HCA) were used for non-destructive visible and near-infrared (Vis-NIR) detection for breed screening of transgenic sugarcane. A random and stability-dependent framework of calibration, prediction, and validation was proposed. A total of 456 samples of sugarcane leaves planting in the elongating stage were collected from the field, which was composed of 306 transgenic (positive) samples containing Bt and Bar gene and 150 non-transgenic (negative) samples. A total of 156 samples (negative 50 and positive 106) were randomly selected as the validation set; the remaining samples (negative 100 and positive 200, a total of 300 samples) were used as the modeling set, and then the modeling set was subdivided into calibration (negative 50 and positive 100, a total of 150 samples) and prediction sets (negative 50 and positive 100, a total of 150 samples) for 50 times. The number of SG smoothing points was ex- panded, while some modes of higher derivative were removed because of small absolute value, and a total of 264 smoothing modes were used for screening. The pairwise combinations of first three principal components were used, and then the optimal combination of principal components was selected according to the model effect. Based on all divisions of calibration and prediction sets and all SG smoothing modes, the SG-PCA-LDA and SG-PCA-HCA models were established, the model parameters were optimized based on the average prediction effect for all divisions to produce modeling stability. Finally, the model validation was performed by validation set. With SG smoothing, the modeling accuracy and stability of PCA-LDA, PCA-HCA were signif- icantly improved. For the optimal SG-PCA-LDA model, the recognition rate of positive and negative validation samples were 94.3%, 96.0%; and were 92.5%, 98.0% for the optimal SG-PCA-LDA model, respectively. Vis-NIR spectro- scopic pattern recognition combined with SG smoothing could be used for accurate recognition of transgenic sugarcane leaves, and provided a convenient screening method for transgenic sugarcane breeding.

  14. Reaction Kinetics for the Biocatalytic Conversion of Phenazine-1-Carboxylic Acid to 2-Hydroxyphenazine

    PubMed Central

    Chen, Mingmin; Cao, Hongxia; Peng, Huasong; Hu, Hongbo; Wang, Wei; Zhang, Xuehong

    2014-01-01

    The phenazine derivative 2-hydroxyphenazine (2-OH-PHZ) plays an important role in the biocontrol of plant diseases, and exhibits stronger bacteriostatic and fungistatic activity than phenazine-1-carboxylic acid (PCA) toward some pathogens. PhzO has been shown to be responsible for the conversion of PCA to 2-OH-PHZ, however the kinetics of the reaction have not been systematically studied. Further, the yield of 2-OH-PHZ in fermentation culture is quite low and enhancement in our understanding of the reaction kinetics may contribute to improvements in large-scale, high-yield production of 2-OH-PHZ for biological control and other applications. In this study we confirmed previous reports that free PCA is converted to 2-hydroxy-phenazine-1-carboxylic acid (2-OH-PCA) by the action of a single enzyme PhzO, and particularly demonstrate that this reaction is dependent on NADP(H) and Fe3+. Fe3+ enhanced the conversion from PCA to 2-OH-PHZ and 28°C was a optimum temperature for the conversion. However, PCA added in excess to the culture inhibited the production of 2-OH-PHZ. 2-OH-PCA was extracted and purified from the broth, and it was confirmed that the decarboxylation of 2-OH-PCA could occur without the involvement of any enzyme. A kinetic analysis of the conversion of 2-OH-PCA to 2-OH-PHZ in the absence of enzyme and under different temperatures and pHs in vitro, revealed that the conversion followed first-order reaction kinetics. In the fermentation, the concentration of 2-OH-PCA increased to about 90 mg/L within a red precipitate fraction, as compared to 37 mg/L within the supernatant. The results of this study elucidate the reaction kinetics involved in the biosynthesis of 2-OH-PHZ and provide insights into in vitro methods to enhance yields of 2-OH-PHZ. PMID:24905009

  15. Potential of cancer screening with serum surface-enhanced Raman spectroscopy and a support vector machine

    NASA Astrophysics Data System (ADS)

    Li, S. X.; Zhang, Y. J.; Zeng, Q. Y.; Li, L. F.; Guo, Z. Y.; Liu, Z. M.; Xiong, H. L.; Liu, S. H.

    2014-06-01

    Cancer is the most common disease to threaten human health. The ability to screen individuals with malignant tumours with only a blood sample would be greatly advantageous to early diagnosis and intervention. This study explores the possibility of discriminating between cancer patients and normal subjects with serum surface-enhanced Raman spectroscopy (SERS) and a support vector machine (SVM) through a peripheral blood sample. A total of 130 blood samples were obtained from patients with liver cancer, colonic cancer, esophageal cancer, nasopharyngeal cancer, gastric cancer, as well as 113 blood samples from normal volunteers. Several diagnostic models were built with the serum SERS spectra using SVM and principal component analysis (PCA) techniques. The results show that a diagnostic accuracy of 85.5% is acquired with a PCA algorithm, while a diagnostic accuracy of 95.8% is obtained using radial basis function (RBF), PCA-SVM methods. The results prove that a RBF kernel PCA-SVM technique is superior to PCA and conventional SVM (C-SVM) algorithms in classification serum SERS spectra. The study demonstrates that serum SERS, in combination with SVM techniques, has great potential for screening cancerous patients with any solid malignant tumour through a peripheral blood sample.

  16. Multi-ingredients determination and fingerprint analysis of leaves from Ilex latifolia using ultra-performance liquid chromatography coupled with quadrupole time-of-flight mass spectrometry.

    PubMed

    Fan, Chunlin; Deng, Jiewei; Yang, Yunyun; Liu, Junshan; Wang, Ying; Zhang, Xiaoqi; Fai, Kuokchiu; Zhang, Qingwen; Ye, Wencai

    2013-10-01

    An ultra-performance liquid chromatography coupled with quadrupole time-of-flight mass spectrometry (UPLC-QTOF-MS) method integrating multi-ingredients determination and fingerprint analysis has been established for quality assessment and control of leaves from Ilex latifolia. The method possesses the advantages of speediness, efficiency, accuracy, and allows the multi-ingredients determination and fingerprint analysis in one chromatographic run within 13min. Multi-ingredients determination was performed based on the extracted ion chromatograms of the exact pseudo-molecular ions (with a 0.01Da window), and fingerprint analysis was performed based on the base peak chromatograms, obtained by negative-ion electrospray ionization QTOF-MS. The method validation results demonstrated our developed method possessing desirable specificity, linearity, precision and accuracy. The method was utilized to analyze 22 I. latifolia samples from different origins. The quality assessment was achieved by using both similarity analysis (SA) and principal component analysis (PCA), and the results from SA were consistent with those from PCA. Our experimental results demonstrate that the strategy integrated multi-ingredients determination and fingerprint analysis using UPLC-QTOF-MS technique is a useful approach for rapid pharmaceutical analysis, with promising prospects for the differentiation of origin, the determination of authenticity, and the overall quality assessment of herbal medicines. Copyright © 2013 Elsevier B.V. All rights reserved.

  17. Use of electrospray ionization ion-trap tandem mass spectrometry and principal component analysis to directly distinguish monosaccharides.

    PubMed

    Xia, Bing; Zhou, Yan; Liu, Xin; Xiao, Juan; Liu, Qing; Gu, Yucheng; Ding, Lisheng

    2012-06-15

    Carbohydrates are good source of drugs and play important roles in metabolism processes and cellular interactions in organisms. Distinguishing monosaccharide isomers in saccharide derivates is an important and elementary work in investigating saccharides. It is important to develop a fast, simple and direct method for this purpose, which is described in this study. Stock solutions of monosaccharide with a concentration of 400 μM and sodium chloride at a concentration of 10 μM were made in water/methanol (50:50, v/v). The samples were subjected to electrospray ionization ion-trap tandem mass spectrometry (ESI-MS) and the detected [2M + Na - H(2)O](+) ions were further investigated by tandem mass spectrometry (MS/MS), followed by applying principal component analysis (PCA) on the obtained MS/MS data sets. The MS/MS spectra of the [2M + Na - H(2)O](+) ions at m/z 365 for hexoses and m/z 305 for pentoses yielded unambiguous fragment patterns, while rhamnose can be directly identified by its ESI-MS [M + Na](+) ion at m/z 187. PCA showed clustering of MS/MS data of identical monosaccharide samples obtained from different experiments. By using this method, the monosaccharide in daucosterol hydrolysate was successfully identified. A new strategy was developed for differentiation of the monosaccharides using ESI-MS/MS and PCA. In MS/MS spectra, the [2M + Na - H(2)O](+) ions yielded unambiguous distinction. PCA of the archived MS/MS data sets was applied to demonstrate the spatial resolution of the studied samples. This method presented a simple and reliable way for distinguishing monosaccharides by ESI-MS/MS. Copyright © 2012 John Wiley & Sons, Ltd.

  18. Performance comparisons between PCA-EA-LBG and PCA-LBG-EA approaches in VQ codebook generation for image compression

    NASA Astrophysics Data System (ADS)

    Tsai, Jinn-Tsong; Chou, Ping-Yi; Chou, Jyh-Horng

    2015-11-01

    The aim of this study is to generate vector quantisation (VQ) codebooks by integrating principle component analysis (PCA) algorithm, Linde-Buzo-Gray (LBG) algorithm, and evolutionary algorithms (EAs). The EAs include genetic algorithm (GA), particle swarm optimisation (PSO), honey bee mating optimisation (HBMO), and firefly algorithm (FF). The study is to provide performance comparisons between PCA-EA-LBG and PCA-LBG-EA approaches. The PCA-EA-LBG approaches contain PCA-GA-LBG, PCA-PSO-LBG, PCA-HBMO-LBG, and PCA-FF-LBG, while the PCA-LBG-EA approaches contain PCA-LBG, PCA-LBG-GA, PCA-LBG-PSO, PCA-LBG-HBMO, and PCA-LBG-FF. All training vectors of test images are grouped according to PCA. The PCA-EA-LBG used the vectors grouped by PCA as initial individuals, and the best solution gained by the EAs was given for LBG to discover a codebook. The PCA-LBG approach is to use the PCA to select vectors as initial individuals for LBG to find a codebook. The PCA-LBG-EA used the final result of PCA-LBG as an initial individual for EAs to find a codebook. The search schemes in PCA-EA-LBG first used global search and then applied local search skill, while in PCA-LBG-EA first used local search and then employed global search skill. The results verify that the PCA-EA-LBG indeed gain superior results compared to the PCA-LBG-EA, because the PCA-EA-LBG explores a global area to find a solution, and then exploits a better one from the local area of the solution. Furthermore the proposed PCA-EA-LBG approaches in designing VQ codebooks outperform existing approaches shown in the literature.

  19. Permeability Estimation of Rock Reservoir Based on PCA and Elman Neural Networks

    NASA Astrophysics Data System (ADS)

    Shi, Ying; Jian, Shaoyong

    2018-03-01

    an intelligent method which based on fuzzy neural networks with PCA algorithm, is proposed to estimate the permeability of rock reservoir. First, the dimensionality reduction process is utilized for these parameters by principal component analysis method. Further, the mapping relationship between rock slice characteristic parameters and permeability had been found through fuzzy neural networks. The estimation validity and reliability for this method were tested with practical data from Yan’an region in Ordos Basin. The result showed that the average relative errors of permeability estimation for this method is 6.25%, and this method had the better convergence speed and more accuracy than other. Therefore, by using the cheap rock slice related information, the permeability of rock reservoir can be estimated efficiently and accurately, and it is of high reliability, practicability and application prospect.

  20. Principal component analysis of socioeconomic factors and their association with malaria and arbovirus risk in Tanzania: a sensitivity analysis.

    PubMed

    Homenauth, Esha; Kajeguka, Debora; Kulkarni, Manisha A

    2017-11-01

    Principal component analysis (PCA) is frequently adopted for creating socioeconomic proxies in order to investigate the independent effects of wealth on disease status. The guidelines and methods for the creation of these proxies are well described and validated. The Demographic and Health Survey, World Health Survey and the Living Standards Measurement Survey are examples of large data sets that use PCA to create wealth indices particularly in low and middle-income countries (LMIC), where quantifying wealth-disease associations is problematic due to the unavailability of reliable income and expenditure data. However, the application of this method to smaller survey data sets, especially in rural LMIC settings, is less rigorously studied.In this paper, we aimed to highlight some of these issues by investigating the association of derived wealth indices using PCA on risk of vector-borne disease infection in Tanzania focusing on malaria and key arboviruses (ie, dengue and chikungunya). We demonstrated that indices consisting of subsets of socioeconomic indicators provided the least methodologically flawed representations of household wealth compared with an index that combined all socioeconomic variables. These results suggest that the choice of the socioeconomic indicators included in a wealth proxy can influence the relative position of households in the overall wealth hierarchy, and subsequently the strength of disease associations. This can, therefore, influence future resource planning activities and should be considered among investigators who use a PCA-derived wealth index based on community-level survey data to influence programme or policy decisions in rural LMIC settings. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  1. Evaluation of Soil Contamination Indices in a Mining Area of Jiangxi, China

    PubMed Central

    Wu, Jin; Teng, Yanguo; Lu, Sijin; Wang, Yeyao; Jiao, Xudong

    2014-01-01

    There is currently a wide variety of methods used to evaluate soil contamination. We present a discussion of the advantages and limitations of different soil contamination assessment methods. In this study, we analyzed seven trace elements (As, Cd, Cr, Cu, Hg, Pb, and Zn) that are indicators of soil contamination in Dexing, a city in China that is famous for its vast nonferrous mineral resources in China, using enrichment factor (EF), geoaccumulation index (Igeo), pollution index (PI), and principal component analysis (PCA). The three contamination indices and PCA were then mapped to understand the status and trends of soil contamination in this region. The entire study area is strongly enriched in Cd, Cu, Pb, and Zn, especially in areas near mine sites. As and Hg were also present in high concentrations in urban areas. Results indicated that Cr in this area originated from both anthropogenic and natural sources. PCA combined with Geographic Information System (GIS) was successfully used to discriminate between natural and anthropogenic trace metals. PMID:25397401

  2. Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival.

    PubMed

    Kaplan, Adam; Lock, Eric F

    2017-01-01

    Predictive modeling from high-dimensional genomic data is often preceded by a dimension reduction step, such as principal component analysis (PCA). However, the application of PCA is not straightforward for multisource data, wherein multiple sources of 'omics data measure different but related biological components. In this article, we use recent advances in the dimension reduction of multisource data for predictive modeling. In particular, we apply exploratory results from Joint and Individual Variation Explained (JIVE), an extension of PCA for multisource data, for prediction of differing response types. We conduct illustrative simulations to illustrate the practical advantages and interpretability of our approach. As an application example, we consider predicting survival for patients with glioblastoma multiforme from 3 data sources measuring messenger RNA expression, microRNA expression, and DNA methylation. We also introduce a method to estimate JIVE scores for new samples that were not used in the initial dimension reduction and study its theoretical properties; this method is implemented in the R package R.JIVE on CRAN, in the function jive.predict.

  3. Principal Component Analysis for Normal-Distribution-Valued Symbolic Data.

    PubMed

    Wang, Huiwen; Chen, Meiling; Shi, Xiaojun; Li, Nan

    2016-02-01

    This paper puts forward a new approach to principal component analysis (PCA) for normal-distribution-valued symbolic data, which has a vast potential of applications in the economic and management field. We derive a full set of numerical characteristics and variance-covariance structure for such data, which forms the foundation for our analytical PCA approach. Our approach is able to use all of the variance information in the original data than the prevailing representative-type approach in the literature which only uses centers, vertices, etc. The paper also provides an accurate approach to constructing the observations in a PC space based on the linear additivity property of normal distribution. The effectiveness of the proposed method is illustrated by simulated numerical experiments. At last, our method is applied to explain the puzzle of risk-return tradeoff in China's stock market.

  4. Machine learning-based analysis of MR radiomics can help to improve the diagnostic performance of PI-RADS v2 in clinically relevant prostate cancer.

    PubMed

    Wang, Jing; Wu, Chen-Jiang; Bao, Mei-Ling; Zhang, Jing; Wang, Xiao-Ning; Zhang, Yu-Dong

    2017-10-01

    To investigate whether machine learning-based analysis of MR radiomics can help improve the performance PI-RADS v2 in clinically relevant prostate cancer (PCa). This IRB-approved study included 54 patients with PCa undergoing multi-parametric (mp) MRI before prostatectomy. Imaging analysis was performed on 54 tumours, 47 normal peripheral (PZ) and 48 normal transitional (TZ) zone based on histological-radiological correlation. Mp-MRI was scored via PI-RADS, and quantified by measuring radiomic features. Predictive model was developed using a novel support vector machine trained with: (i) radiomics, (ii) PI-RADS scores, (iii) radiomics and PI-RADS scores. Paired comparison was made via ROC analysis. For PCa versus normal TZ, the model trained with radiomics had a significantly higher area under the ROC curve (Az) (0.955 [95% CI 0.923-0.976]) than PI-RADS (Az: 0.878 [0.834-0.914], p < 0.001). The Az between them was insignificant for PCa versus PZ (0.972 [0.945-0.988] vs. 0.940 [0.905-0.965], p = 0.097). When radiomics was added, performance of PI-RADS was significantly improved for PCa versus PZ (Az: 0.983 [0.960-0.995]) and PCa versus TZ (Az: 0.968 [0.940-0.985]). Machine learning analysis of MR radiomics can help improve the performance of PI-RADS in clinically relevant PCa. • Machine-based analysis of MR radiomics outperformed in TZ cancer against PI-RADS. • Adding MR radiomics significantly improved the performance of PI-RADS. • DKI-derived Dapp and Kapp were two strong markers for the diagnosis of PCa.

  5. Prostate cancer mortality reduction by prostate-specific antigen-based screening adjusted for nonattendance and contamination in the European Randomised Study of Screening for Prostate Cancer (ERSPC).

    PubMed

    Roobol, Monique J; Kerkhof, Melissa; Schröder, Fritz H; Cuzick, Jack; Sasieni, Peter; Hakama, Matti; Stenman, Ulf Hakan; Ciatto, Stefano; Nelen, Vera; Kwiatkowski, Maciej; Lujan, Marcos; Lilja, Hans; Zappa, Marco; Denis, Louis; Recker, Franz; Berenguer, Antonio; Ruutu, Mirja; Kujala, Paula; Bangma, Chris H; Aus, Gunnar; Tammela, Teuvo L J; Villers, Arnauld; Rebillard, Xavier; Moss, Sue M; de Koning, Harry J; Hugosson, Jonas; Auvinen, Anssi

    2009-10-01

    Prostate-specific antigen (PSA) based screening for prostate cancer (PCa) has been shown to reduce prostate specific mortality by 20% in an intention to screen (ITS) analysis in a randomised trial (European Randomised Study of Screening for Prostate Cancer [ERSPC]). This effect may be diluted by nonattendance in men randomised to the screening arm and contamination in men randomised to the control arm. To assess the magnitude of the PCa-specific mortality reduction after adjustment for nonattendance and contamination. We analysed the occurrence of PCa deaths during an average follow-up of 9 yr in 162,243 men 55-69 yr of age randomised in seven participating centres of the ERSPC. Centres were also grouped according to the type of randomisation (ie, before or after informed written consent). Nonattendance was defined as nonattending the initial screening round in ERSPC. The estimate of contamination was based on PSA use in controls in ERSPC Rotterdam. Relative risks (RRs) with 95% confidence intervals (CIs) were compared between an ITS analysis and analyses adjusting for nonattendance and contamination using a statistical method developed for this purpose. In the ITS analysis, the RR of PCa death in men allocated to the intervention arm relative to the control arm was 0.80 (95% CI, 0.68-0.96). Adjustment for nonattendance resulted in a RR of 0.73 (95% CI, 0.58-0.93), and additional adjustment for contamination using two different estimates led to estimated reductions of 0.69 (95% CI, 0.51-0.92) to 0.71 (95% CI, 0.55-0.93), respectively. Contamination data were obtained through extrapolation of single-centre data. No heterogeneity was found between the groups of centres. PSA screening reduces the risk of dying of PCa by up to 31% in men actually screened. This benefit should be weighed against a degree of overdiagnosis and overtreatment inherent in PCa screening.

  6. Using both principal component analysis and reduced rank regression to study dietary patterns and diabetes in Chinese adults.

    PubMed

    Batis, Carolina; Mendez, Michelle A; Gordon-Larsen, Penny; Sotres-Alvarez, Daniela; Adair, Linda; Popkin, Barry

    2016-02-01

    We examined the association between dietary patterns and diabetes using the strengths of two methods: principal component analysis (PCA) to identify the eating patterns of the population and reduced rank regression (RRR) to derive a pattern that explains the variation in glycated Hb (HbA1c), homeostasis model assessment of insulin resistance (HOMA-IR) and fasting glucose. We measured diet over a 3 d period with 24 h recalls and a household food inventory in 2006 and used it to derive PCA and RRR dietary patterns. The outcomes were measured in 2009. Adults (n 4316) from the China Health and Nutrition Survey. The adjusted odds ratio for diabetes prevalence (HbA1c≥6·5 %), comparing the highest dietary pattern score quartile with the lowest, was 1·26 (95 % CI 0·76, 2·08) for a modern high-wheat pattern (PCA; wheat products, fruits, eggs, milk, instant noodles and frozen dumplings), 0·76 (95 % CI 0·49, 1·17) for a traditional southern pattern (PCA; rice, meat, poultry and fish) and 2·37 (95 % CI 1·56, 3·60) for the pattern derived with RRR. By comparing the dietary pattern structures of RRR and PCA, we found that the RRR pattern was also behaviourally meaningful. It combined the deleterious effects of the modern high-wheat pattern (high intakes of wheat buns and breads, deep-fried wheat and soya milk) with the deleterious effects of consuming the opposite of the traditional southern pattern (low intakes of rice, poultry and game, fish and seafood). Our findings suggest that using both PCA and RRR provided useful insights when studying the association of dietary patterns with diabetes.

  7. Using both Principal Component Analysis and Reduced Rank Regression to Study Dietary Patterns and Diabetes in Chinese Adults

    PubMed Central

    Batis, Carolina; Mendez, Michelle A.; Gordon-Larsen, Penny; Sotres-Alvarez, Daniela; Adair, Linda; Popkin, Barry

    2014-01-01

    Objective We examined the association between dietary patterns and diabetes using the strengths of two methods: principal component analysis (PCA) to identify the eating patterns of the population and reduced rank regression (RRR) to derive a pattern that explains the variation in hemoglobin A1c (HbA1c), homeostasis model of insulin resistance (HOMA-IR), and fasting glucose. Design We measured diet over a 3-day period with 24-hour recalls and a household food inventory in 2006 and used it to derive PCA and RRR dietary patterns. The outcomes were measured in 2009. Setting Adults (n = 4,316) from the China Health and Nutrition Survey. Results The adjusted odds ratio for diabetes prevalence (HbA1c ≥ 6.5%), comparing the highest dietary pattern score quartile to the lowest, was 1.26 (0.76, 2.08) for a modern high-wheat pattern (PCA; wheat products, fruits, eggs, milk, instant noodles and frozen dumplings), 0.76 (0.49, 1.17) for a traditional southern pattern (PCA; rice, meat, poultry, and fish), and 2.37 (1.56, 3.60) for the pattern derived with RRR. By comparing the dietary pattern structures of RRR and PCA, we found that the RRR pattern was also behaviorally meaningful. It combined the deleterious effects of the modern high-wheat (high intake of wheat buns and breads, deep-fried wheat, and soy milk) with the deleterious effects of consuming the opposite of the traditional southern (low intake of rice, poultry and game, fish and seafood). Conclusions Our findings suggest that using both PCA and RRR provided useful insights when studying the association of dietary patterns with diabetes. PMID:26784586

  8. A diffusion-matched principal component analysis (DM-PCA) based two-channel denoising procedure for high-resolution diffusion-weighted MRI

    PubMed Central

    Chang, Hing-Chiu; Bilgin, Ali; Bernstein, Adam; Trouard, Theodore P.

    2018-01-01

    Over the past several years, significant efforts have been made to improve the spatial resolution of diffusion-weighted imaging (DWI), aiming at better detecting subtle lesions and more reliably resolving white-matter fiber tracts. A major concern with high-resolution DWI is the limited signal-to-noise ratio (SNR), which may significantly offset the advantages of high spatial resolution. Although the SNR of DWI data can be improved by denoising in post-processing, existing denoising procedures may potentially reduce the anatomic resolvability of high-resolution imaging data. Additionally, non-Gaussian noise induced signal bias in low-SNR DWI data may not always be corrected with existing denoising approaches. Here we report an improved denoising procedure, termed diffusion-matched principal component analysis (DM-PCA), which comprises 1) identifying a group of (not necessarily neighboring) voxels that demonstrate very similar magnitude signal variation patterns along the diffusion dimension, 2) correcting low-frequency phase variations in complex-valued DWI data, 3) performing PCA along the diffusion dimension for real- and imaginary-components (in two separate channels) of phase-corrected DWI voxels with matched diffusion properties, 4) suppressing the noisy PCA components in real- and imaginary-components, separately, of phase-corrected DWI data, and 5) combining real- and imaginary-components of denoised DWI data. Our data show that the new two-channel (i.e., for real- and imaginary-components) DM-PCA denoising procedure performs reliably without noticeably compromising anatomic resolvability. Non-Gaussian noise induced signal bias could also be reduced with the new denoising method. The DM-PCA based denoising procedure should prove highly valuable for high-resolution DWI studies in research and clinical uses. PMID:29694400

  9. Effect of Statins and Anticoagulants on Prostate Cancer Aggressiveness

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Alizadeh, Moein; Sylvestre, Marie-Pierre; Zilli, Thomas

    2012-07-15

    Purpose: Statins and anticoagulants (ACs) have both been associated with a less-aggressive prostate cancer (PCa) and a better outcome after treatment of localized PCa. The results of these studies might have been confounded because patients might often take both medications. We examined their respective influence on PCa aggressiveness at initial diagnosis. Materials and Methods: We analyzed 381 patients treated with either external beam radiotherapy or brachytherapy for low-risk (n = 152), intermediate-risk (n = 142), or high-risk (n = 87) localized PCa. Univariate and multivariate logistic regression analyses were used to investigate an association between these drug classes and prostatemore » cancer aggressiveness. We tested whether the concomitant use of statins and ACs had a different effect than that of either AC or statin use alone. Results: Of the 381 patients, 172 (45.1%) were taking statins and 141 (37.0%) ACs; 105 patients (27.6%) used both. On univariate analysis, the statin and AC users were associated with the prostate-specific antigen (PSA) level (p = .017) and National Comprehensive Cancer Network risk group (p = .0022). On multivariate analysis, statin use was associated with a PSA level <10 ng/mL (odds ratio, 2.9; 95% confidence interval, 1.3-6.8; p = .012) and a PSA level >20 ng/mL (odds ratio, 0.29; 95% confidence interval, 0.08-0.83; p = .03). The use of ACs was associated with a PSA level >20 ng/mL (odds ratio, 0.13; 95% confidence interval, 0.02-0.59, p = .02). Conclusion: Both AC and statins have an effect on PCa aggressiveness, with statins having a more stringent relationship with the PSA level, highlighting the importance of considering statin use in studies of PCa aggressiveness.« less

  10. Demixed principal component analysis of neural population data

    PubMed Central

    Kobak, Dmitry; Brendel, Wieland; Constantinidis, Christos; Feierstein, Claudia E; Kepecs, Adam; Mainen, Zachary F; Qi, Xue-Lian; Romo, Ranulfo; Uchida, Naoshige; Machens, Christian K

    2016-01-01

    Neurons in higher cortical areas, such as the prefrontal cortex, are often tuned to a variety of sensory and motor variables, and are therefore said to display mixed selectivity. This complexity of single neuron responses can obscure what information these areas represent and how it is represented. Here we demonstrate the advantages of a new dimensionality reduction technique, demixed principal component analysis (dPCA), that decomposes population activity into a few components. In addition to systematically capturing the majority of the variance of the data, dPCA also exposes the dependence of the neural representation on task parameters such as stimuli, decisions, or rewards. To illustrate our method we reanalyze population data from four datasets comprising different species, different cortical areas and different experimental tasks. In each case, dPCA provides a concise way of visualizing the data that summarizes the task-dependent features of the population response in a single figure. DOI: http://dx.doi.org/10.7554/eLife.10989.001 PMID:27067378

  11. Rapid classification of hairtail fish and pork freshness using an electronic nose based on the PCA method.

    PubMed

    Tian, Xiu-Ying; Cai, Qiang; Zhang, Yong-Ming

    2012-01-01

    We report a method for building a simple and reproducible electronic nose based on commercially available metal oxide sensors (MOS) to monitor the freshness of hairtail fish and pork stored at 15, 10, and 5 °C. After assembly in the laboratory, the proposed product was tested by a manufacturer. Sample delivery was based on the dynamic headspace method, and two features were extracted from the transient response of each sensor using an unsupervised principal component analysis (PCA) method. The compensation method and pattern recognition based on PCA are discussed in the current paper. PCA compensation can be used for all storage temperatures, however, pattern recognition differs according to storage conditions. Total volatile basic nitrogen (TVBN) and aerobic bacterial counts of the samples were measured simultaneously with the standard indicators of hairtail fish and pork freshness. The PCA models based on TVBN and aerobic bacterial counts were used to classify hairtail fish samples as "fresh" (TVBN ≤ 25 g and microbial counts ≤ 10(6) cfu/g) or "spoiled" (TVBN ≥ 25 g and microbial counts ≥ 10(6) cfu/g) and pork samples also as "fresh" (TVBN ≤ 15 g and microbial counts ≤ 10(6) cfu/g) or "spoiled" (TVBN ≥ 15 g and microbial counts ≥ 10(6) cfu/g). Good correlation coefficients between the responses of the electronic nose and the TVBN and aerobic bacterial counts of the samples were obtained. For hairtail fish, correlation coefficients were 0.97 and 0.91, and for pork, correlation coefficients were 0.81 and 0.88, respectively. Through laboratory simulation and field application, we were able to determine that the electronic nose could help ensure the shelf life of hairtail fish and pork, especially when an instrument is needed to take measurements rapidly. The results also showed that the electronic nose could analyze the process and level of spoilage for hairtail fish and pork.

  12. Choline Kinase Alpha as an Androgen Receptor Chaperone and Prostate Cancer Therapeutic Target

    PubMed Central

    Asim, Mohammad; Massie, Charles E.; Orafidiya, Folake; Pértega-Gomes, Nelma; Warren, Anne Y.; Esmaeili, Mohsen; Selth, Luke A.; Zecchini, Heather I.; Luko, Katarina; Qureshi, Arham; Baridi, Ajoeb; Menon, Suraj; Madhu, Basetti; Escriu, Carlos; Lyons, Scott; Vowler, Sarah L.; Zecchini, Vincent R.; Shaw, Greg; Hessenkemper, Wiebke; Russell, Roslin; Mohammed, Hisham; Stefanos, Niki; Lynch, Andy G.; Grigorenko, Elena; D’Santos, Clive; Taylor, Chris; Lamb, Alastair; Sriranjan, Rouchelle; Yang, Jiali; Stark, Rory; Dehm, Scott M.; Rennie, Paul S.; Carroll, Jason S.; Griffiths, John R.; Tavaré, Simon; Mills, Ian G.; McEwan, Iain J.; Baniahmad, Aria; Tilley, Wayne D.; Neal, David E.

    2016-01-01

    Background: The androgen receptor (AR) is a major drug target in prostate cancer (PCa). We profiled the AR-regulated kinome to identify clinically relevant and druggable effectors of AR signaling. Methods: Using genome-wide approaches, we interrogated all AR regulated kinases. Among these, choline kinase alpha (CHKA) expression was evaluated in benign (n = 195), prostatic intraepithelial neoplasia (PIN) (n = 153) and prostate cancer (PCa) lesions (n = 359). We interrogated how CHKA regulates AR signaling using biochemical assays and investigated androgen regulation of CHKA expression in men with PCa, both untreated (n = 20) and treated with an androgen biosynthesis inhibitor degarelix (n = 27). We studied the effect of CHKA inhibition on the PCa transcriptome using RNA sequencing and tested the effect of CHKA inhibition on cell growth, clonogenic survival and invasion. Tumor xenografts (n = 6 per group) were generated in mice using genetically engineered prostate cancer cells with inducible CHKA knockdown. Data were analyzed with χ2 tests, Cox regression analysis, and Kaplan-Meier methods. All statistical tests were two-sided. Results: CHKA expression was shown to be androgen regulated in cell lines, xenografts, and human tissue (log fold change from 6.75 to 6.59, P = .002) and was positively associated with tumor stage. CHKA binds directly to the ligand-binding domain (LBD) of AR, enhancing its stability. As such, CHKA is the first kinase identified as an AR chaperone. Inhibition of CHKA repressed the AR transcriptional program including pathways enriched for regulation of protein folding, decreased AR protein levels, and inhibited the growth of PCa cell lines, human PCa explants, and tumor xenografts. Conclusions: CHKA can act as an AR chaperone, providing, to our knowledge, the first evidence for kinases as molecular chaperones, making CHKA both a marker of tumor progression and a potential therapeutic target for PCa. PMID:26657335

  13. EFEMP1 as a novel DNA methylation marker for prostate cancer: array-based DNA methylation and expression profiling.

    PubMed

    Kim, Yong-June; Yoon, Hyung-Yoon; Kim, Seon-Kyu; Kim, Young-Won; Kim, Eun-Jung; Kim, Isaac Yi; Kim, Wun-Jae

    2011-07-01

    Abnormal DNA methylation is associated with many human cancers. The aim of the present study was to identify novel methylation markers in prostate cancer (PCa) by microarray analysis and to test whether these markers could discriminate normal and PCa cells. Microarray-based DNA methylation and gene expression profiling was carried out using a panel of PCa cell lines and a control normal prostate cell line. The methylation status of candidate genes in prostate cell lines was confirmed by real-time reverse transcriptase-PCR, bisulfite sequencing analysis, and treatment with a demethylation agent. DNA methylation and gene expression analysis in 203 human prostate specimens, including 106 PCa and 97 benign prostate hyperplasia (BPH), were carried out. Further validation using microarray gene expression data from the Gene Expression Omnibus (GEO) was carried out. Epidermal growth factor-containing fibulin-like extracellular matrix protein 1 (EFEMP1) was identified as a lead candidate methylation marker for PCa. The gene expression level of EFEMP1 was significantly higher in tissue samples from patients with BPH than in those with PCa (P < 0.001). The sensitivity and specificity of EFEMP1 methylation status in discriminating between PCa and BPH reached 95.3% (101 of 106) and 86.6% (84 of 97), respectively. From the GEO data set, we confirmed that the expression level of EFEMP1 was significantly different between PCa and BPH. Genome-wide characterization of DNA methylation profiles enabled the identification of EFEMP1 aberrant methylation patterns in PCa. EFEMP1 might be a useful indicator for the detection of PCa.

  14. Using Serological Proteome Analysis to Identify Serum Anti-Nucleophosmin 1 Autoantibody as a Potential Biomarker in European-American and African-American Patients With Prostate Cancer.

    PubMed

    Dai, Liping; Li, Jitian; Xing, Mengtao; Sanchez, Tino W; Casiano, Carlos A; Zhang, Jian-Ying

    2016-11-01

    The prostate-specific antigen (PSA) testing has been widely implemented for the early detection and management of prostate cancer (PCa). However, the lack of specificity has led to overdiagnosis, resulting in many possibly unnecessary biopsies and overtreatment. Therefore, novel serological biomarkers with high sensitivity and specificity are of vital importance needed to complement PSA testing in the early diagnosis and effective management of PCa. This is particularly critical in the context of PCa health disparities, where early detection and management could help reduce the disproportionately high PCa mortality observed in African-American men. Previous studies have demonstrated that sera from patients with PCa contain autoantibodies that react with tumor-associated antigens (TAAs). The serological proteome analysis (SERPA) approach was used to identify tumor-associated antigens (TAAs) of PCa. In evaluation study, the level of anti-NPM1 antibody was examined in sera from test cohort, validation cohort, as well as European-American (EA) and African-American (AA) men with PCa by using immunoassay. Nucleophosmin 1 (NPM1) as a 33 kDa TAA in PCa was identified and characterized by SERPA approach. Anti-NPM1 antibody level in PCa was higher than in benign prostatic hyperplasia (BPH) patients and healthy individuals. Receiver operating characteristic (ROC) curve analysis showed similar high diagnostic value for PCa in the test cohort (area under the curve (AUC):0.860) and validation cohort (AUC: 0.822) to differentiate from normal individuals and BPH. Interestingly, AUC values were significantly higher for AA PCa patients. When considering concurrent serum measurements of anti-NPM1 antibody and PSA, 97.1% PCa patients at early stage were identified correctly, while 69.2% BPH patients who had elevated PSA levels were found to be anti-NPM1 negative. Additionally, anti-NPM1 antibody levels in PCa patients at early stage significantly increased after surgery treatment. This intriguing data suggested that NPM1 can elicit autoantibody response in PCa and might be a potential biomarker for the immunodiagnosis and prognosis of PCa, and for supplementing PSA testing in distinguishing PCa from BPH. Prostate 76:1375-1386, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  15. [Evaluation of the effectiveness of patient-controlled analgesia in children with sickle cell anemia from the perspective of healthcare professionals and parents].

    PubMed

    Turaç, Ayşegül; Rumeli Atıcı, Şebnem

    2016-07-01

    This study evaluated the efficacy of patient-controlled analgesia (PCA) used by children with sickle cell anemia (SCA) based on the attitudes of parents and healthcare professionals. A total of 86 individuals were involved in the study: 54 parents of children with SCA who were receiving treatment and 32 healthcare providers (doctors, nurses). To evaluate the effectiveness of the PCA method, a questionnaire was prepared to determine the level of knowledge of the participants about the PCA method and their perception of its advantages and disadvantages. According to 65.6% (n=21) of the healthcare providers, PCA should be used during acute phase of pain. The great majority of the participants (93%; n=80) thought that pain was effectively controlled both during the day and at night. PCA reduced the fear of unavailability of analgesic drugs in 83.3% (n=45) of parents and in 87.5% (n=28) of healthcare providers. More parents (37%) reported a reduction in the fear of return of pain than healthcare providers (9.4%) (p<0.05). Most parents (87%; n=47) reported that they preferred to wait until their child complained of severe pain to use on-demand doses of analgesic due to concerns about overdose and addiction. Resolving machine alarms (48%; n=26) and the length of time required to refill the machine (48%; n=26) were reported as disadvantages of PCA method. In this study, parents and healthcare professionals found PCA to be effective in relieving pain in children with SCA; however, fears and biased knowledge of users about the analgesic drug are thought to inhibit reaching sufficient dosage. Educational courses for users about PCA and the drugs used may increase the effectiveness of PCA method.

  16. Fast and Accurate Radiative Transfer Calculations Using Principal Component Analysis for (Exo-)Planetary Retrieval Models

    NASA Astrophysics Data System (ADS)

    Kopparla, P.; Natraj, V.; Shia, R. L.; Spurr, R. J. D.; Crisp, D.; Yung, Y. L.

    2015-12-01

    Radiative transfer (RT) computations form the engine of atmospheric retrieval codes. However, full treatment of RT processes is computationally expensive, prompting usage of two-stream approximations in current exoplanetary atmospheric retrieval codes [Line et al., 2013]. Natraj et al. [2005, 2010] and Spurr and Natraj [2013] demonstrated the ability of a technique using principal component analysis (PCA) to speed up RT computations. In the PCA method for RT performance enhancement, empirical orthogonal functions are developed for binned sets of inherent optical properties that possess some redundancy; costly multiple-scattering RT calculations are only done for those few optical states corresponding to the most important principal components, and correction factors are applied to approximate radiation fields. Kopparla et al. [2015, in preparation] extended the PCA method to a broadband spectral region from the ultraviolet to the shortwave infrared (0.3-3 micron), accounting for major gas absorptions in this region. Here, we apply the PCA method to a some typical (exo-)planetary retrieval problems. Comparisons between the new model, called Universal Principal Component Analysis Radiative Transfer (UPCART) model, two-stream models and line-by-line RT models are performed, for spectral radiances, spectral fluxes and broadband fluxes. Each of these are calculated at the top of the atmosphere for several scenarios with varying aerosol types, extinction and scattering optical depth profiles, and stellar and viewing geometries. We demonstrate that very accurate radiance and flux estimates can be obtained, with better than 1% accuracy in all spectral regions and better than 0.1% in most cases, as compared to a numerically exact line-by-line RT model. The accuracy is enhanced when the results are convolved to typical instrument resolutions. The operational speed and accuracy of UPCART can be further improved by optimizing binning schemes and parallelizing the codes, work on which is under way.

  17. Sparse PCA corrects for cell type heterogeneity in epigenome-wide association studies.

    PubMed

    Rahmani, Elior; Zaitlen, Noah; Baran, Yael; Eng, Celeste; Hu, Donglei; Galanter, Joshua; Oh, Sam; Burchard, Esteban G; Eskin, Eleazar; Zou, James; Halperin, Eran

    2016-05-01

    In epigenome-wide association studies (EWAS), different methylation profiles of distinct cell types may lead to false discoveries. We introduce ReFACTor, a method based on principal component analysis (PCA) and designed for the correction of cell type heterogeneity in EWAS. ReFACTor does not require knowledge of cell counts, and it provides improved estimates of cell type composition, resulting in improved power and control for false positives in EWAS. Corresponding software is available at http://www.cs.tau.ac.il/~heran/cozygene/software/refactor.html.

  18. Method for factor analysis of GC/MS data

    DOEpatents

    Van Benthem, Mark H; Kotula, Paul G; Keenan, Michael R

    2012-09-11

    The method of the present invention provides a fast, robust, and automated multivariate statistical analysis of gas chromatography/mass spectroscopy (GC/MS) data sets. The method can involve systematic elimination of undesired, saturated peak masses to yield data that follow a linear, additive model. The cleaned data can then be subjected to a combination of PCA and orthogonal factor rotation followed by refinement with MCR-ALS to yield highly interpretable results.

  19. Prebiotic Low Sugar Chocolate Dairy Desserts: Physical and Optical Characteristics and Performance of PARAFAC and PCA Preference Map.

    PubMed

    Morais, E C; Esmerino, E A; Monteiro, R A; Pinheiro, C M; Nunes, C A; Cruz, A G; Bolini, Helena M A

    2016-01-01

    The addition of prebiotic and sweeteners in chocolate dairy desserts opens up new opportunities to develop dairy desserts that besides having a lower calorie intake still has functional properties. In this study, prebiotic low sugar dairy desserts were evaluated by 120 consumers using a 9-point hedonic scale, in relation to the attributes of appearance, aroma, flavor, texture, and overall liking. Internal preference map using parallel factor analysis (PARAFAC) and principal component analysis (PCA) was performed using the consumer data. In addition, physical (texture profile) and optical (instrumental color) analyses were also performed. Prebiotic dairy desserts containing sucrose and sucralose were equally liked by the consumers. These samples were characterized by firmness and gumminess, which can be considered drivers of liking by the consumers. Optimization of the prebiotic low sugar dessert formulation should take in account the choice of ingredients that contribute in a positive manner for these parameters. PARAFAC allowed the extraction of more relevant information in relation to PCA, demonstrating that consumer acceptance analysis can be evaluated by simultaneously considering several attributes. Multiple factor analysis reported Rv value of 0.964, suggesting excellent concordance for both methods. © 2015 Institute of Food Technologists®

  20. Water quality analysis of the Rapur area, Andhra Pradesh, South India using multivariate techniques

    NASA Astrophysics Data System (ADS)

    Nagaraju, A.; Sreedhar, Y.; Thejaswi, A.; Sayadi, Mohammad Hossein

    2017-10-01

    The groundwater samples from Rapur area were collected from different sites to evaluate the major ion chemistry. The large number of data can lead to difficulties in the integration, interpretation, and representation of the results. Two multivariate statistical methods, hierarchical cluster analysis (HCA) and factor analysis (FA), were applied to evaluate their usefulness to classify and identify geochemical processes controlling groundwater geochemistry. Four statistically significant clusters were obtained from 30 sampling stations. This has resulted two important clusters viz., cluster 1 (pH, Si, CO3, Mg, SO4, Ca, K, HCO3, alkalinity, Na, Na + K, Cl, and hardness) and cluster 2 (EC and TDS) which are released to the study area from different sources. The application of different multivariate statistical techniques, such as principal component analysis (PCA), assists in the interpretation of complex data matrices for a better understanding of water quality of a study area. From PCA, it is clear that the first factor (factor 1), accounted for 36.2% of the total variance, was high positive loading in EC, Mg, Cl, TDS, and hardness. Based on the PCA scores, four significant cluster groups of sampling locations were detected on the basis of similarity of their water quality.

  1. Occipital-posterior cerebral artery bypass via the occipital interhemispheric approach

    PubMed Central

    Kazumata, Ken; Yokoyama, Yuka; Sugiyama, Taku; Asaoka, Katsuyuki

    2013-01-01

    Background: The unavailability of the superficial temporal artery (STA) and the location of lesions pose a more technically demanding challenge when compared with conventional STA-superior cerebellar or posterior cerebral artery (PCA) bypass in vascular reconstruction procedures. To describe a case series of patients with cerebrovascular lesions who were treated using an occipital artery (OA) to PCA bypass via the occipital interhemispheric approach. Methods: We retrospectively reviewed three consecutive cases of patients with cerebrovascular lesions who were treated using OA-PCA bypass. Results: OA-PCA bypass was performed via the occipital interhemispheric approach. This procedure included: (1) OA-PCA bypass (n = 1), and combined OA-posterior inferior cerebellar artery and OA-PCA saphenous vein interposition graft bypass (n = 1) in patients with vertebrobasilar ischemia; (2) OA-PCA radial artery interposition graft bypass in one patient with residual PCA aneurysm. Conclusions: OA-PCA bypass represents a useful alternative to conventional STA-SCA or PCA bypass. PMID:23956933

  2. A fuzzy-based data transformation for feature extraction to increase classification performance with small medical data sets.

    PubMed

    Li, Der-Chiang; Liu, Chiao-Wen; Hu, Susan C

    2011-05-01

    Medical data sets are usually small and have very high dimensionality. Too many attributes will make the analysis less efficient and will not necessarily increase accuracy, while too few data will decrease the modeling stability. Consequently, the main objective of this study is to extract the optimal subset of features to increase analytical performance when the data set is small. This paper proposes a fuzzy-based non-linear transformation method to extend classification related information from the original data attribute values for a small data set. Based on the new transformed data set, this study applies principal component analysis (PCA) to extract the optimal subset of features. Finally, we use the transformed data with these optimal features as the input data for a learning tool, a support vector machine (SVM). Six medical data sets: Pima Indians' diabetes, Wisconsin diagnostic breast cancer, Parkinson disease, echocardiogram, BUPA liver disorders dataset, and bladder cancer cases in Taiwan, are employed to illustrate the approach presented in this paper. This research uses the t-test to evaluate the classification accuracy for a single data set; and uses the Friedman test to show the proposed method is better than other methods over the multiple data sets. The experiment results indicate that the proposed method has better classification performance than either PCA or kernel principal component analysis (KPCA) when the data set is small, and suggest creating new purpose-related information to improve the analysis performance. This paper has shown that feature extraction is important as a function of feature selection for efficient data analysis. When the data set is small, using the fuzzy-based transformation method presented in this work to increase the information available produces better results than the PCA and KPCA approaches. Copyright © 2011 Elsevier B.V. All rights reserved.

  3. Low-rank plus sparse decomposition for exoplanet detection in direct-imaging ADI sequences. The LLSG algorithm

    NASA Astrophysics Data System (ADS)

    Gomez Gonzalez, C. A.; Absil, O.; Absil, P.-A.; Van Droogenbroeck, M.; Mawet, D.; Surdej, J.

    2016-05-01

    Context. Data processing constitutes a critical component of high-contrast exoplanet imaging. Its role is almost as important as the choice of a coronagraph or a wavefront control system, and it is intertwined with the chosen observing strategy. Among the data processing techniques for angular differential imaging (ADI), the most recent is the family of principal component analysis (PCA) based algorithms. It is a widely used statistical tool developed during the first half of the past century. PCA serves, in this case, as a subspace projection technique for constructing a reference point spread function (PSF) that can be subtracted from the science data for boosting the detectability of potential companions present in the data. Unfortunately, when building this reference PSF from the science data itself, PCA comes with certain limitations such as the sensitivity of the lower dimensional orthogonal subspace to non-Gaussian noise. Aims: Inspired by recent advances in machine learning algorithms such as robust PCA, we aim to propose a localized subspace projection technique that surpasses current PCA-based post-processing algorithms in terms of the detectability of companions at near real-time speed, a quality that will be useful for future direct imaging surveys. Methods: We used randomized low-rank approximation methods recently proposed in the machine learning literature, coupled with entry-wise thresholding to decompose an ADI image sequence locally into low-rank, sparse, and Gaussian noise components (LLSG). This local three-term decomposition separates the starlight and the associated speckle noise from the planetary signal, which mostly remains in the sparse term. We tested the performance of our new algorithm on a long ADI sequence obtained on β Pictoris with VLT/NACO. Results: Compared to a standard PCA approach, LLSG decomposition reaches a higher signal-to-noise ratio and has an overall better performance in the receiver operating characteristic space. This three-term decomposition brings a detectability boost compared to the full-frame standard PCA approach, especially in the small inner working angle region where complex speckle noise prevents PCA from discerning true companions from noise.

  4. Influence of Posterior Corneal Astigmatism on Total Corneal Astigmatism in Eyes With Keratoconus.

    PubMed

    Savini, Giacomo; Næser, Kristian; Schiano-Lomoriello, Domenico; Mularoni, Alessandro

    2016-11-01

    To measure posterior corneal astigmatism (PCA) and investigate its influence on total corneal astigmatism (TCA) in eyes with keratoconus. Keratometric astigmatism (KA), PCA, and TCA were investigated by means of a dual Scheimpflug analyzer in patients with keratoconus. Vector analysis was carried out with the Næser polar value method. We enrolled 119 eyes. PCA magnitude averaged 0.77 ± 0.43 diopters (D) and exceeded 0.50, 1.00, and 2.00 D in 73.9%, 21.8%, and 16.8% of eyes, respectively. PCA averaged 0.95 ± 0.48, 0.55 ± 0.28, and 0.70 ± 0.35 D in eyes with with-the-rule (WTR), against-the-rule (ATR), and oblique astigmatism. The steepest posterior meridian was oriented vertically (between 61 and 119 degrees) in 55.5% of eyes, thus generating ATR astigmatism. The difference between the location of the steepest meridian of KA and that of TCA was >10 degrees in 8.4% of eyes. On average, KA overestimated TCA in eyes with WTR astigmatism by 0.16 D and underestimated TCA in eyes with ATR astigmatism by 0.22 D. The PCA power oriented along the steeper anterior corneal meridian averaged -0.83 ± 0.40, -0.40 ± 0.37, and -0.53 ± 0.43 D for WTR, ATR, and obliquely astigmatic eyes, respectively. Linear regression disclosed a statistically significant correlation (P < 0.0001, r = 0.16) between the meridional powers of TCA and PCA. In eyes with keratoconus, PCA displays large, variable values and is correlated to TCA. The influence of PCA on TCA cannot be disregarded when planning astigmatism correction by toric intraocular lenses.

  5. Germline BRCA mutations are associated with higher risk of nodal involvement, distant metastasis, and poor survival outcomes in prostate cancer.

    PubMed

    Castro, Elena; Goh, Chee; Olmos, David; Saunders, Ed; Leongamornlert, Daniel; Tymrakiewicz, Malgorzata; Mahmud, Nadiya; Dadaev, Tokhir; Govindasami, Koveela; Guy, Michelle; Sawyer, Emma; Wilkinson, Rosemary; Ardern-Jones, Audrey; Ellis, Steve; Frost, Debra; Peock, Susan; Evans, D Gareth; Tischkowitz, Marc; Cole, Trevor; Davidson, Rosemarie; Eccles, Diana; Brewer, Carole; Douglas, Fiona; Porteous, Mary E; Donaldson, Alan; Dorkins, Huw; Izatt, Louise; Cook, Jackie; Hodgson, Shirley; Kennedy, M John; Side, Lucy E; Eason, Jacqueline; Murray, Alex; Antoniou, Antonis C; Easton, Douglas F; Kote-Jarai, Zsofia; Eeles, Rosalind

    2013-05-10

    To analyze the baseline clinicopathologic characteristics of prostate tumors with germline BRCA1 and BRCA2 (BRCA1/2) mutations and the prognostic value of those mutations on prostate cancer (PCa) outcomes. This study analyzed the tumor features and outcomes of 2,019 patients with PCa (18 BRCA1 carriers, 61 BRCA2 carriers, and 1,940 noncarriers). The Kaplan-Meier method and Cox regression analysis were used to evaluate the associations between BRCA1/2 status and other PCa prognostic factors with overall survival (OS), cause-specific OS (CSS), CSS in localized PCa (CSS_M0), metastasis-free survival (MFS), and CSS from metastasis (CSS_M1). PCa with germline BRCA1/2 mutations were more frequently associated with Gleason ≥ 8 (P = .00003), T3/T4 stage (P = .003), nodal involvement (P = .00005), and metastases at diagnosis (P = .005) than PCa in noncarriers. CSS was significantly longer in noncarriers than in carriers (15.7 v 8.6 years, multivariable analyses [MVA] P = .015; hazard ratio [HR] = 1.8). For localized PCa, 5-year CSS and MFS were significantly higher in noncarriers (96% v 82%; MVA P = .01; HR = 2.6%; and 93% v 77%; MVA P = .009; HR = 2.7, respectively). Subgroup analyses confirmed the poor outcomes in BRCA2 patients, whereas the role of BRCA1 was not well defined due to the limited size and follow-up in this subgroup. Our results confirm that BRCA1/2 mutations confer a more aggressive PCa phenotype with a higher probability of nodal involvement and distant metastasis. BRCA mutations are associated with poor survival outcomes and this should be considered for tailoring clinical management of these patients.

  6. Genetic variants in RNA-induced silencing complex genes and prostate cancer.

    PubMed

    Nikolić, Z; Savić Pavićević, D; Vučić, N; Cerović, S; Vukotić, V; Brajušković, G

    2017-04-01

    The purpose of this study is to evaluate the potential association between genetic variants in genes encoding the components of RNA-induced silencing complex and prostate cancer (PCa) risk. Genetic variants chosen for this study are rs3742330 in DICER1, rs4961280 in AGO2, rs784567 in TARBP2, rs7813 in GEMIN4 and rs197414 in GEMIN3. The study involved 355 PCa patients, 360 patients with benign prostatic hyperplasia and 318 healthy controls. For individuals diagnosed with PCa, clinicopathological characteristics including serum prostate-specific antigen level at diagnosis, Gleason score (GS) and clinical stage were determined. Genotyping was performed using high-resolution melting analysis, PCR-RFLP, TaqMan SNP Genotyping Assay and real-time PCR-based genotyping assay using specific probes. Allelic and genotypic associations were evaluated by unconditional linear and logistic regression methods. The study provided no evidence of association between the analyzed genetic variants and PCa risk. Nevertheless, allele A of rs784567 was found to confer the reduced risk of higher serum PSA level at diagnosis (P = 0.046; Difference = -66.64, 95 % CI -131.93 to 1.35, for log-additive model). Furthermore, rs4961280, as well as rs3742330, were shown to be associated with GS. These variants, together with rs7813, were found to be associated with the lower clinical stage of PCa. Also, rs3742330 minor allele G was found to be associated with lower PCa aggressiveness (P = 0.036; OR 0.14, 95 % CI 0.023-1.22, for recessive model). According to our data, rs3742330, rs4961280 and rs7813 qualify for potentially protective genetic variants against PCa progression. These variants were not shown to be associated with PCa risk.

  7. Research on distributed heterogeneous data PCA algorithm based on cloud platform

    NASA Astrophysics Data System (ADS)

    Zhang, Jin; Huang, Gang

    2018-05-01

    Principal component analysis (PCA) of heterogeneous data sets can solve the problem that centralized data scalability is limited. In order to reduce the generation of intermediate data and error components of distributed heterogeneous data sets, a principal component analysis algorithm based on heterogeneous data sets under cloud platform is proposed. The algorithm performs eigenvalue processing by using Householder tridiagonalization and QR factorization to calculate the error component of the heterogeneous database associated with the public key to obtain the intermediate data set and the lost information. Experiments on distributed DBM heterogeneous datasets show that the model method has the feasibility and reliability in terms of execution time and accuracy.

  8. Risk Stratification Among Men With Prostate Imaging Reporting and Data System Version 2 Category 3 Transition Zone Lesions: Is Biopsy Always Necessary?

    PubMed Central

    Felker, Ely R.; Raman, Steven S.; Margolis, Daniel J.; Lu, David S. K.; Shaheen, Nicholas; Natarajan, Shyam; Sharma, Devi; Huang, Jiaoti; Dorey, Fred; Marks, Leonard S.

    2017-01-01

    OBJECTIVE The objective of our study was to determine the clinical and MRI characteristics of clinically significant prostate cancer (PCA) (Gleason score ≥ 3 + 4) in men with Prostate Imaging Reporting and Data System version 2 (PI-RADSv2) category 3 transition zone (TZ) lesions. MATERIALS AND METHODS From 2014 to 2016, 865 men underwent prostate MRI and MRI/ultrasound (US) fusion biopsy (FB). A subset of 90 FB-naïve men with 96 PI-RADSv2 category 3 TZ lesions was identified. Patients were imaged at 3 T using a body coil. Images were assigned a PI-RADSv2 category by an experienced radiologist. Using clinical data and imaging features, we performed univariate and multivariate analyses to identify predictors of clinically significant PCA. RESULTS The mean patient age was 66 years, and the mean prostate-specific antigen density (PSAD) was 0.13 ng/mL2. PCA was detected in 34 of 96 (35%) lesions, 14 of which (15%) harbored clinically significant PCA. In univariate analysis, DWI score, prostate volume, and PSAD were significant predictors (p < 0.05) of clinically significant PCA with a suggested significance for apparent diffusion coefficient (ADC) and prostate-specific antigen value (p < 0.10). On multivariate analysis, PSAD and lesion ADC were the most important covariates. The combination of both PSAD of 0.15 ng/mL2 or greater and an ADC value of less than 1000 mm2/s yielded an AUC of 0.91 for clinically significant PCA (p < 0.001). If FB had been restricted to these criteria, only 10 of 90 men would have undergone biopsy, resulting in diagnosis of clinically significant PCA in 60% with eight men (9%) misdiagnosed (false-negative). CONCLUSION The yield of FB in men with PI-RADSv2 category 3 TZ lesions for clinically significant PCA is 15% but significantly improves to 60% (AUC > 0.9) among men with PSAD of 0.15 ng/mL2 or greater and lesion ADC value of less than 1000 mm2/s. PMID:28858541

  9. Principal component and normal mode analysis of proteins; a quantitative comparison using the GroEL subunit.

    PubMed

    Skjaerven, Lars; Martinez, Aurora; Reuter, Nathalie

    2011-01-01

    Principal component analysis (PCA) and normal mode analysis (NMA) have emerged as two invaluable tools for studying conformational changes in proteins. To compare these approaches for studying protein dynamics, we have used a subunit of the GroEL chaperone, whose dynamics is well characterized. We first show that both PCA on trajectories from molecular dynamics (MD) simulations and NMA reveal a general dynamical behavior in agreement with what has previously been described for GroEL. We thus compare the reproducibility of PCA on independent MD runs and subsequently investigate the influence of the length of the MD simulations. We show that there is a relatively poor one-to-one correspondence between eigenvectors obtained from two independent runs and conclude that caution should be taken when analyzing principal components individually. We also observe that increasing the simulation length does not improve the agreement with the experimental structural difference. In fact, relatively short MD simulations are sufficient for this purpose. We observe a rapid convergence of the eigenvectors (after ca. 6 ns). Although there is not always a clear one-to-one correspondence, there is a qualitatively good agreement between the movements described by the first five modes obtained with the three different approaches; PCA, all-atoms NMA, and coarse-grained NMA. It is particularly interesting to relate this to the computational cost of the three methods. The results we obtain on the GroEL subunit contribute to the generalization of robust and reproducible strategies for the study of protein dynamics, using either NMA or PCA of trajectories from MD simulations. © 2010 Wiley-Liss, Inc.

  10. The Use of the Visualisation of Multidimensional Data Using PCA to Evaluate Possibilities of the Division of Coal Samples Space Due to their Suitability for Fluidised Gasification

    NASA Astrophysics Data System (ADS)

    Jamróz, Dariusz; Niedoba, Tomasz; Surowiak, Agnieszka; Tumidajski, Tadeusz

    2016-09-01

    Methods serving to visualise multidimensional data through the transformation of multidimensional space into two-dimensional space, enable to present the multidimensional data on the computer screen. Thanks to this, qualitative analysis of this data can be performed in the most natural way for humans, through the sense of sight. An example of such a method of multidimensional data visualisation is PCA (principal component analysis) method. This method was used in this work to present and analyse a set of seven-dimensional data (selected seven properties) describing coal samples obtained from Janina and Wieczorek coal mines. Coal from these mines was previously subjected to separation by means of a laboratory ring jig, consisting of ten rings. With 5 layers of both types of coal (with 2 rings each) were obtained in this way. It was decided to check if the method of multidimensional data visualisation enables to divide the space of such divided samples into areas with different suitability for the fluidised gasification process. To that end, the card of technological suitability of coal was used (Sobolewski et al., 2012; 2013), in which key, relevant and additional parameters, having effect on the gasification process, were described. As a result of analyses, it was stated that effective determination of coal samples suitability for the on-surface gasification process in a fluidised reactor is possible. The PCA method enables the visualisation of the optimal subspace containing the set requirements concerning the properties of coals intended for this process.

  11. Principal component analysis-based unsupervised feature extraction applied to in silico drug discovery for posttraumatic stress disorder-mediated heart disease.

    PubMed

    Taguchi, Y-h; Iwadate, Mitsuo; Umeyama, Hideaki

    2015-04-30

    Feature extraction (FE) is difficult, particularly if there are more features than samples, as small sample numbers often result in biased outcomes or overfitting. Furthermore, multiple sample classes often complicate FE because evaluating performance, which is usual in supervised FE, is generally harder than the two-class problem. Developing sample classification independent unsupervised methods would solve many of these problems. Two principal component analysis (PCA)-based FE, specifically, variational Bayes PCA (VBPCA) was extended to perform unsupervised FE, and together with conventional PCA (CPCA)-based unsupervised FE, were tested as sample classification independent unsupervised FE methods. VBPCA- and CPCA-based unsupervised FE both performed well when applied to simulated data, and a posttraumatic stress disorder (PTSD)-mediated heart disease data set that had multiple categorical class observations in mRNA/microRNA expression of stressed mouse heart. A critical set of PTSD miRNAs/mRNAs were identified that show aberrant expression between treatment and control samples, and significant, negative correlation with one another. Moreover, greater stability and biological feasibility than conventional supervised FE was also demonstrated. Based on the results obtained, in silico drug discovery was performed as translational validation of the methods. Our two proposed unsupervised FE methods (CPCA- and VBPCA-based) worked well on simulated data, and outperformed two conventional supervised FE methods on a real data set. Thus, these two methods have suggested equivalence for FE on categorical multiclass data sets, with potential translational utility for in silico drug discovery.

  12. Unsupervised analysis of small animal dynamic Cerenkov luminescence imaging

    NASA Astrophysics Data System (ADS)

    Spinelli, Antonello E.; Boschi, Federico

    2011-12-01

    Clustering analysis (CA) and principal component analysis (PCA) were applied to dynamic Cerenkov luminescence images (dCLI). In order to investigate the performances of the proposed approaches, two distinct dynamic data sets obtained by injecting mice with 32P-ATP and 18F-FDG were acquired using the IVIS 200 optical imager. The k-means clustering algorithm has been applied to dCLI and was implemented using interactive data language 8.1. We show that cluster analysis allows us to obtain good agreement between the clustered and the corresponding emission regions like the bladder, the liver, and the tumor. We also show a good correspondence between the time activity curves of the different regions obtained by using CA and manual region of interest analysis on dCLIT and PCA images. We conclude that CA provides an automatic unsupervised method for the analysis of preclinical dynamic Cerenkov luminescence image data.

  13. Extended principle component analysis - a useful tool to understand processes governing water quality at catchment scales

    NASA Astrophysics Data System (ADS)

    Selle, B.; Schwientek, M.

    2012-04-01

    Water quality of ground and surface waters in catchments is typically driven by many complex and interacting processes. While small scale processes are often studied in great detail, their relevance and interplay at catchment scales remain often poorly understood. For many catchments, extensive monitoring data on water quality have been collected for different purposes. These heterogeneous data sets contain valuable information on catchment scale processes but are rarely analysed using integrated methods. Principle component analysis (PCA) has previously been applied to this kind of data sets. However, a detailed analysis of scores, which are an important result of a PCA, is often missing. Mathematically, PCA expresses measured variables on water quality, e.g. nitrate concentrations, as linear combination of independent, not directly observable key processes. These computed key processes are represented by principle components. Their scores are interpretable as process intensities which vary in space and time. Subsequently, scores can be correlated with other key variables and catchment characteristics, such as water travel times and land use that were not considered in PCA. This detailed analysis of scores represents an extension of the commonly applied PCA which could considerably improve the understanding of processes governing water quality at catchment scales. In this study, we investigated the 170 km2 Ammer catchment in SW Germany which is characterised by an above average proportion of agricultural (71%) and urban (17%) areas. The Ammer River is mainly fed by karstic springs. For PCA, we separately analysed concentrations from (a) surface waters of the Ammer River and its tributaries, (b) spring waters from the main aquifers and (c) deep groundwater from production wells. This analysis was extended by a detailed analysis of scores. We analysed measured concentrations on major ions and selected organic micropollutants. Additionally, redox-sensitive variables and environmental tracers indicating groundwater age were analysed for deep groundwater from production wells. For deep groundwater, we found that microbial turnover was stronger influenced by local availability of energy sources than by travel times of groundwater to the wells. Groundwater quality primarily reflected the input of pollutants determined by landuse, e.g. agrochemicals. We concluded that for water quality in the Ammer catchment, conservative mixing of waters with different origin is more important than reactive transport processes along the flow path.

  14. Metabolic profiling of Angelica acutiloba roots utilizing gas chromatography-time-of-flight-mass spectrometry for quality assessment based on cultivation area and cultivar via multivariate pattern recognition.

    PubMed

    Tianniam, Sukanda; Tarachiwin, Lucksanaporn; Bamba, Takeshi; Kobayashi, Akio; Fukusaki, Eiichiro

    2008-06-01

    Gas chromatography time-of-flight mass spectrometry was applied to elucidate the profiling of primary metabolites and to evaluate the differences between quality differences in Angelica acutiloba (or Yamato-toki) roots through the utilization of multivariate pattern recognition-principal component analysis (PCA). Twenty-two metabolites consisting of sugars, amino and organic acids were identified. PCA analysis successfully discriminated the good, the moderate and the bad quality Yamato-toki roots in accordance to their cultivation areas. The results signified two reducing sugars, fructose and glucose being the most accumulated in the bad quality, whereas higher quantity of phosphoric acid, proline, malic acid and citric acid were found in the good and the moderate quality toki roots. PCA was also effective in discriminating samples derive from different cultivars. Yamato-toki roots with the moderate quality were compared by means of PCA, and the results illustrated good discrimination which was influenced most by malic acid. Overall, this study demonstrated that metabolomics technique is accurate and efficient in determining the quality differences in Yamato-toki roots, and has a potential to be a superior and suitable method to assess the quality of this medicinal plant.

  15. Differential resistances to anthracnose in Capsicum baccatum as responding to two Colletotrichum pathotypes and inoculation methods.

    PubMed

    Mahasuk, Pitchayapa; Chinthaisong, Jittima; Mongkolporn, Orarat

    2013-09-01

    Chili anthracnose, caused by Colletotrichum spp., is one of the major diseases to chili production in the tropics and subtropics worldwide. Breeding for durable anthracnose resistance requires a good understanding of the resistance mechanisms to different pathotypes and inoculation methods. This study aimed to investigate the inheritances of differential resistances as responding to two different Colletotrichum pathotypes, PCa2 and PCa3 and as by two different inoculation methods, microinjection (MI) and high pressure spray (HP). Detached ripe fruit of Capsicum baccatum 'PBC80' derived F2 and BC1s populations was assessed for anthracnose resistance. Two dominant genes were identified responsible for the differential resistance to anthracnose. One was responsible for the resistance to PCa2 and PCa3 by MI and the other was responsible for the resistance to PCa3 by HP. The two genes were linked with 16.7 cM distance.

  16. Differential resistances to anthracnose in Capsicum baccatum as responding to two Colletotrichum pathotypes and inoculation methods

    PubMed Central

    Mahasuk, Pitchayapa; Chinthaisong, Jittima; Mongkolporn, Orarat

    2013-01-01

    Chili anthracnose, caused by Colletotrichum spp., is one of the major diseases to chili production in the tropics and subtropics worldwide. Breeding for durable anthracnose resistance requires a good understanding of the resistance mechanisms to different pathotypes and inoculation methods. This study aimed to investigate the inheritances of differential resistances as responding to two different Colletotrichum pathotypes, PCa2 and PCa3 and as by two different inoculation methods, microinjection (MI) and high pressure spray (HP). Detached ripe fruit of Capsicum baccatum ‘PBC80’ derived F2 and BC1s populations was assessed for anthracnose resistance. Two dominant genes were identified responsible for the differential resistance to anthracnose. One was responsible for the resistance to PCa2 and PCa3 by MI and the other was responsible for the resistance to PCa3 by HP. The two genes were linked with 16.7 cM distance. PMID:24273429

  17. The impact of moderate wine consumption on the risk of developing prostate cancer.

    PubMed

    Vartolomei, Mihai Dorin; Kimura, Shoji; Ferro, Matteo; Foerster, Beat; Abufaraj, Mohammad; Briganti, Alberto; Karakiewicz, Pierre I; Shariat, Shahrokh F

    2018-01-01

    To investigate the impact of moderate wine consumption on the risk of prostate cancer (PCa). We focused on the differential effect of moderate consumption of red versus white wine. This study was a meta-analysis that includes data from case-control and cohort studies. A systematic search of Web of Science, Medline/PubMed, and Cochrane library was performed on December 1, 2017. Studies were deemed eligible if they assessed the risk of PCa due to red, white, or any wine using multivariable logistic regression analysis. We performed a formal meta-analysis for the risk of PCa according to moderate wine and wine type consumption (white or red). Heterogeneity between studies was assessed using Cochrane's Q test and I 2 statistics. Publication bias was assessed using Egger's regression test. A total of 930 abstracts and titles were initially identified. After removal of duplicates, reviews, and conference abstracts, 83 full-text original articles were screened. Seventeen studies (611,169 subjects) were included for final evaluation and fulfilled the inclusion criteria. In the case of moderate wine consumption: the pooled risk ratio (RR) for the risk of PCa was 0.98 (95% CI 0.92-1.05, p =0.57) in the multivariable analysis. Moderate white wine consumption increased the risk of PCa with a pooled RR of 1.26 (95% CI 1.10-1.43, p =0.001) in the multi-variable analysis. Meanwhile, moderate red wine consumption had a protective role reducing the risk by 12% (RR 0.88, 95% CI 0.78-0.999, p =0.047) in the multivariable analysis that comprised 222,447 subjects. In this meta-analysis, moderate wine consumption did not impact the risk of PCa. Interestingly, regarding the type of wine, moderate consumption of white wine increased the risk of PCa, whereas moderate consumption of red wine had a protective effect. Further analyses are needed to assess the differential molecular effect of white and red wine conferring their impact on PCa risk.

  18. SELF-ORGANIZING MAPS FOR INTEGRATED ASSESSMENT OF THE MID-ATLANTIC REGION

    EPA Science Inventory

    A. new method was developed to perform an environmental assessment for the
    Mid-Atlantic Region (MAR). This was a combination of the self-organizing map (SOM) neural network and principal component analysis (PCA). The method is capable of clustering ecosystems in terms of envi...

  19. ARLTS1 and Prostate Cancer Risk - Analysis of Expression and Regulation

    PubMed Central

    Siltanen, Sanna; Fischer, Daniel; Rantapero, Tommi; Laitinen, Virpi; Mpindi, John Patrick; Kallioniemi, Olli; Wahlfors, Tiina; Schleutker, Johanna

    2013-01-01

    Prostate cancer (PCa) is a heterogeneous trait for which several susceptibility loci have been implicated by genome-wide linkage and association studies. The genomic region 13q14 is frequently deleted in tumour tissues of both sporadic and familial PCa patients and is consequently recognised as a possible locus of tumour suppressor gene(s). Deletions of this region have been found in many other cancers. Recently, we showed that homozygous carriers for the T442C variant of the ARLTS1 gene (ADP-ribosylation factor-like tumour suppressor protein 1 or ARL11, located at 13q14) are associated with an increased risk for both unselected and familial PCa. Furthermore, the variant T442C was observed in greater frequency among malignant tissue samples, PCa cell lines and xenografts, supporting its role in PCa tumourigenesis. In this study, 84 PCa cases and 15 controls were analysed for ARLTS1 expression status in blood-derived RNA. A statistically significant (p = 0.0037) decrease of ARLTS1 expression in PCa cases was detected. Regulation of ARLTS1 expression was analysed with eQTL (expression quantitative trait loci) methods. Altogether fourteen significant cis-eQTLs affecting the ARLTS1 expression level were found. In addition, epistatic interactions of ARLTS1 genomic variants with genes involved in immune system processes were predicted with the MDR program. In conclusion, this study further supports the role of ARLTS1 as a tumour suppressor gene and reveals that the expression is regulated through variants localised in regulatory regions. PMID:23940804

  20. Temporal Processing of Dynamic Positron Emission Tomography via Principal Component Analysis in the Sinogram Domain

    NASA Astrophysics Data System (ADS)

    Chen, Zhe; Parker, B. J.; Feng, D. D.; Fulton, R.

    2004-10-01

    In this paper, we compare various temporal analysis schemes applied to dynamic PET for improved quantification, image quality and temporal compression purposes. We compare an optimal sampling schedule (OSS) design, principal component analysis (PCA) applied in the image domain, and principal component analysis applied in the sinogram domain; for region-of-interest quantification, sinogram-domain PCA is combined with the Huesman algorithm to quantify from the sinograms directly without requiring reconstruction of all PCA channels. Using a simulated phantom FDG brain study and three clinical studies, we evaluate the fidelity of the compressed data for estimation of local cerebral metabolic rate of glucose by a four-compartment model. Our results show that using a noise-normalized PCA in the sinogram domain gives similar compression ratio and quantitative accuracy to OSS, but with substantially better precision. These results indicate that sinogram-domain PCA for dynamic PET can be a useful preprocessing stage for PET compression and quantification applications.

  1. Investigation of cell wall composition related to stem lodging resistance in wheat (Triticum aestivum L.) by FTIR spectroscopy.

    PubMed

    Wang, Jian; Zhu, Jinmao; Huang, RuZhu; Yang, YuSheng

    2012-07-01

    We explored the rapid qualitative analysis of wheat cultivars with good lodging resistances by Fourier transform infrared resonance (FTIR) spectroscopy and multivariate statistical analysis. FTIR imaging showing that wheat stem cell walls were mainly composed of cellulose, pectin, protein, and lignin. Principal components analysis (PCA) was used to eliminate multicollinearity among multiple peak absorptions. PCA revealed the developmental internodes of wheat stems could be distributed from low to high along the load of the second principal component, which was consistent with the corresponding bands of cellulose in the FTIR spectra of the cell walls. Furthermore, four distinct stem populations could also be identified by spectral features related to their corresponding mechanical properties via PCA and cluster analysis. Histochemical staining of four types of wheat stems with various abilities to resist lodging revealed that cellulose contributed more than lignin to the ability to resist lodging. These results strongly suggested that the main cell wall component responsible for these differences was cellulose. Therefore, the combination of multivariate analysis and FTIR could rapidly screen wheat cultivars with good lodging resistance. Furthermore, the application of these methods to a much wider range of cultivars of unknown mechanical properties promises to be of interest.

  2. Design, analysis and control of large transports so that control of engine thrust can be used as a back-up of the primary flight controls. Ph.D. Thesis

    NASA Technical Reports Server (NTRS)

    Roskam, Jan; Ackers, Deane E.; Gerren, Donna S.

    1995-01-01

    A propulsion controlled aircraft (PCA) system has been developed at NASA Dryden Flight Research Center at Edwards Air Force Base, California, to provide safe, emergency landing capability should the primary flight control system of the aircraft fail. As a result of the successful PCA work being done at NASA Dryden, this project investigated the possibility of incorporating the PCA system as a backup flight control system in the design of a large, ultra-high capacity megatransport in such a way that flight path control using only the engines is not only possible, but meets MIL-Spec Level 1 or Level 2 handling quality requirements. An 800 passenger megatransport aircraft was designed and programmed into the NASA Dryden simulator. Many different analysis methods were used to evaluate the flying qualities of the megatransport while using engine thrust for flight path control, including: (1) Bode and root locus plot analysis to evaluate the frequency and damping ratio response of the megatransport; (2) analysis of actual simulator strip chart recordings to evaluate the time history response of the megatransport; and (3) analysis of Cooper-Harper pilot ratings by two NaSA test pilots.

  3. Identification and apportionment of hazardous elements in the sediments in the Yangtze River estuary.

    PubMed

    Wang, Jiawei; Liu, Ruimin; Wang, Haotian; Yu, Wenwen; Xu, Fei; Shen, Zhenyao

    2015-12-01

    In this study, positive matrix factorization (PMF) and principal components analysis (PCA) were combined to identify and apportion pollution-based sources of hazardous elements in the surface sediments in the Yangtze River estuary (YRE). Source identification analysis indicated that PC1, including Al, Fe, Mn, Cr, Ni, As, Cu, and Zn, can be defined as a sewage component; PC2, including Pb and Sb, can be considered as an atmospheric deposition component; and PC3, containing Cd and Hg, can be considered as an agricultural nonpoint component. To better identify the sources and quantitatively apportion the concentrations to their sources, eight sources were identified with PMF: agricultural/industrial sewage mixed (18.6 %), mining wastewater (15.9 %), agricultural fertilizer (14.5 %), atmospheric deposition (12.8 %), agricultural nonpoint (10.6 %), industrial wastewater (9.8 %), marine activity (9.0 %), and nickel plating industry (8.8 %). Overall, the hazardous element content seems to be more connected to anthropogenic activity instead of natural sources. The PCA results laid the foundation for the PMF analysis by providing a general classification of sources. PMF resolves more factors with a higher explained variance than PCA; PMF provided both the internal analysis and the quantitative analysis. The combination of the two methods can provide more reasonable and reliable results.

  4. Ripening-dependent metabolic changes in the volatiles of pineapple (Ananas comosus (L.) Merr.) fruit: II. Multivariate statistical profiling of pineapple aroma compounds based on comprehensive two-dimensional gas chromatography-mass spectrometry.

    PubMed

    Steingass, Christof Björn; Jutzi, Manfred; Müller, Jenny; Carle, Reinhold; Schmarr, Hans-Georg

    2015-03-01

    Ripening-dependent changes of pineapple volatiles were studied in a nontargeted profiling analysis. Volatiles were isolated via headspace solid phase microextraction and analyzed by comprehensive 2D gas chromatography and mass spectrometry (HS-SPME-GC×GC-qMS). Profile patterns presented in the contour plots were evaluated applying image processing techniques and subsequent multivariate statistical data analysis. Statistical methods comprised unsupervised hierarchical cluster analysis (HCA) and principal component analysis (PCA) to classify the samples. Supervised partial least squares discriminant analysis (PLS-DA) and partial least squares (PLS) regression were applied to discriminate different ripening stages and describe the development of volatiles during postharvest storage, respectively. Hereby, substantial chemical markers allowing for class separation were revealed. The workflow permitted the rapid distinction between premature green-ripe pineapples and postharvest-ripened sea-freighted fruits. Volatile profiles of fully ripe air-freighted pineapples were similar to those of green-ripe fruits postharvest ripened for 6 days after simulated sea freight export, after PCA with only two principal components. However, PCA considering also the third principal component allowed differentiation between air-freighted fruits and the four progressing postharvest maturity stages of sea-freighted pineapples.

  5. Homogeneity study of a corn flour laboratory reference material candidate for inorganic analysis.

    PubMed

    Dos Santos, Ana Maria Pinto; Dos Santos, Liz Oliveira; Brandao, Geovani Cardoso; Leao, Danilo Junqueira; Bernedo, Alfredo Victor Bellido; Lopes, Ricardo Tadeu; Lemos, Valfredo Azevedo

    2015-07-01

    In this work, a homogeneity study of a corn flour reference material candidate for inorganic analysis is presented. Seven kilograms of corn flour were used to prepare the material, which was distributed among 100 bottles. The elements Ca, K, Mg, P, Zn, Cu, Fe, Mn and Mo were quantified by inductively coupled plasma optical emission spectrometry (ICP OES) after acid digestion procedure. The method accuracy was confirmed by analyzing the rice flour certified reference material, NIST 1568a. All results were evaluated by analysis of variance (ANOVA) and principal component analysis (PCA). In the study, a sample mass of 400mg was established as the minimum mass required for analysis, according to the PCA. The between-bottle test was performed by analyzing 9 bottles of the material. Subsamples of a single bottle were analyzed for the within-bottle test. No significant differences were observed for the results obtained through the application of both statistical methods. This fact demonstrates that the material is homogeneous for use as a laboratory reference material. Copyright © 2015 Elsevier Ltd. All rights reserved.

  6. An Efficient Taguchi Approach for the Performance Optimization of Health, Safety, Environment and Ergonomics in Generation Companies

    PubMed Central

    Azadeh, Ali; Sheikhalishahi, Mohammad

    2014-01-01

    Background A unique framework for performance optimization of generation companies (GENCOs) based on health, safety, environment, and ergonomics (HSEE) indicators is presented. Methods To rank this sector of industry, the combination of data envelopment analysis (DEA), principal component analysis (PCA), and Taguchi are used for all branches of GENCOs. These methods are applied in an integrated manner to measure the performance of GENCO. The preferred model between DEA, PCA, and Taguchi is selected based on sensitivity analysis and maximum correlation between rankings. To achieve the stated objectives, noise is introduced into input data. Results The results show that Taguchi outperforms other methods. Moreover, a comprehensive experiment is carried out to identify the most influential factor for ranking GENCOs. Conclusion The approach developed in this study could be used for continuous assessment and improvement of GENCO's performance in supplying energy with respect to HSEE factors. The results of such studies would help managers to have better understanding of weak and strong points in terms of HSEE factors. PMID:26106505

  7. Noninvasive prostate cancer screening based on serum surface-enhanced Raman spectroscopy and support vector machine

    NASA Astrophysics Data System (ADS)

    Li, Shaoxin; Zhang, Yanjiao; Xu, Junfa; Li, Linfang; Zeng, Qiuyao; Lin, Lin; Guo, Zhouyi; Liu, Zhiming; Xiong, Honglian; Liu, Songhao

    2014-09-01

    This study aims to present a noninvasive prostate cancer screening methods using serum surface-enhanced Raman scattering (SERS) and support vector machine (SVM) techniques through peripheral blood sample. SERS measurements are performed using serum samples from 93 prostate cancer patients and 68 healthy volunteers by silver nanoparticles. Three types of kernel functions including linear, polynomial, and Gaussian radial basis function (RBF) are employed to build SVM diagnostic models for classifying measured SERS spectra. For comparably evaluating the performance of SVM classification models, the standard multivariate statistic analysis method of principal component analysis (PCA) is also applied to classify the same datasets. The study results show that for the RBF kernel SVM diagnostic model, the diagnostic accuracy of 98.1% is acquired, which is superior to the results of 91.3% obtained from PCA methods. The receiver operating characteristic curve of diagnostic models further confirm above research results. This study demonstrates that label-free serum SERS analysis technique combined with SVM diagnostic algorithm has great potential for noninvasive prostate cancer screening.

  8. Expression of SLCO transport genes in castration resistant prostate cancer and impact of genetic variation in SCLO1B3 and SLCO2B1 on prostate cancer outcomes

    PubMed Central

    Wright, Jonathan L; Kwon, Erika M; Ostrander, Elaine A; Montgomery, R Bruce; Lin, Daniel W; Vessella, Robert; Stanford, Janet L; Mostaghel, Elahe A

    2011-01-01

    Background Metastases from men with castration resistant prostate cancer (CRPC) harbor increased tumoral androgens vs. untreated prostate cancers (PCa). This may reflect steroid uptake by OATP/SLCO transporters. We evaluated SLCO gene expression in CRPC metastases and determined whether PCa outcomes are associated with single nucleotide polymorphisms (SNPs) in SLCO2B1 and SLCO1B3, transporters previously demonstrated to mediate androgen uptake. Methods Transcripts encoding 11 SLCO genes were analyzed in untreated PCa, and in metastatic CRPC tumors obtained by rapid autopsy. SNPs in SLCO2B1 and SLCO1B3 were genotyped in a population-based cohort of 1,309 Caucasian PCa patients. Median survival follow-up was 7.0 years (0.77–16.4). The risk of PCa recurrence/progression and PCa-specific mortality (PCSM) was estimated with Cox proportional hazards analysis. Results Six SLCO genes were highly expressed in CRPC metastases vs. untreated PCa, including SLCO1B3 (3.6 fold, p=0.0517) and SLCO2B1 (5.5 fold, p=0.0034). Carriers of the variant alleles SLCO2B1 SNP rs12422149 (HR 1.99, 95% CI 1.11 – 3.55) or SLCO1B3 SNP rs4149117 (HR 1.76, 95% CI 1.00 – 3.08) had an increased risk of PCSM. Conclusions CRPC metastases demonstrate increased expression of SLCO genes vs. primary PCa. Genetic variants of SLCO1B3 and SLCO2B1 are associated with PCSM. Expression and genetic variation of SLCO genes which alter androgen uptake may be important in PCa outcomes. Impact OATP/SLCO genes may be potential biomarkers for assessing risk of prostate cancer-specific mortality. Expression and genetic variation in these genes may allow stratification of patients to more aggressive hormonal therapy or earlier incorporation of non-hormonal based treatment strategies. PMID:21266523

  9. Migration of styrene and ethylbenzene from virgin and recycled expanded polystyrene containers and discrimination of these two kinds of polystyrene by principal component analysis.

    PubMed

    Lin, Qin-Bao; Song, Xue-Chao; Fang, Hong; Wu, Yu-Mei; Wang, Zhi-Wei

    2017-01-01

    The migration of styrene and ethylbenzene from virgin and recycled expanded polystyrene (EPS) containers into isooctane was investigated using gas chromatography-mass spectrometry (GC-MS). EPS containers were in two-sided contact with isooctane at temperatures of 25 and 40°C. It was shown that recycled EPS gave greater migration ratios compared with virgin EPS, which indicated that styrene and ethylbenzene migrated more easily from recycled EPS. In addition, an analytical method to distinguish between virgin and recycled EPS containers was established by GC-MS followed by principal component analysis (PCA). The relative peak area of the identified compounds was used as input data for PCA. Distinct separation between virgin and recycled EPS was achieved on a score plot. Extension of this method to other plastics may be of great interest for recycled plastics identification.

  10. Classification of narcotics in solid mixtures using principal component analysis and Raman spectroscopy.

    PubMed

    Ryder, Alan G

    2002-03-01

    Eighty-five solid samples consisting of illegal narcotics diluted with several different materials were analyzed by near-infrared (785 nm excitation) Raman spectroscopy. Principal Component Analysis (PCA) was employed to classify the samples according to narcotic type. The best sample discrimination was obtained by using the first derivative of the Raman spectra. Furthermore, restricting the spectral variables for PCA to 2 or 3% of the original spectral data according to the most intense peaks in the Raman spectrum of the pure narcotic resulted in a rapid discrimination method for classifying samples according to narcotic type. This method allows for the easy discrimination between cocaine, heroin, and MDMA mixtures even when the Raman spectra are complex or very similar. This approach of restricting the spectral variables also decreases the computational time by a factor of 30 (compared to the complete spectrum), making the methodology attractive for rapid automatic classification and identification of suspect materials.

  11. Determination of butter adulteration with margarine using Raman spectroscopy.

    PubMed

    Uysal, Reyhan Selin; Boyaci, Ismail Hakki; Genis, Hüseyin Efe; Tamer, Ugur

    2013-12-15

    In this study, adulteration of butter with margarine was analysed using Raman spectroscopy combined with chemometric methods (principal component analysis (PCA), principal component regression (PCR), partial least squares (PLS)) and artificial neural networks (ANNs). Different butter and margarine samples were mixed at various concentrations ranging from 0% to 100% w/w. PCA analysis was applied for the classification of butters, margarines and mixtures. PCR, PLS and ANN were used for the detection of adulteration ratios of butter. Models were created using a calibration data set and developed models were evaluated using a validation data set. The coefficient of determination (R(2)) values between actual and predicted values obtained for PCR, PLS and ANN for the validation data set were 0.968, 0.987 and 0.978, respectively. In conclusion, a combination of Raman spectroscopy with chemometrics and ANN methods can be applied for testing butter adulteration. Copyright © 2013 Elsevier Ltd. All rights reserved.

  12. A Molecular Dynamic Modeling of Hemoglobin-Hemoglobin Interactions

    NASA Astrophysics Data System (ADS)

    Wu, Tao; Yang, Ye; Sheldon Wang, X.; Cohen, Barry; Ge, Hongya

    2010-05-01

    In this paper, we present a study of hemoglobin-hemoglobin interaction with model reduction methods. We begin with a simple spring-mass system with given parameters (mass and stiffness). With this known system, we compare the mode superposition method with Singular Value Decomposition (SVD) based Principal Component Analysis (PCA). Through PCA we are able to recover the principal direction of this system, namely the model direction. This model direction will be matched with the eigenvector derived from mode superposition analysis. The same technique will be implemented in a much more complicated hemoglobin-hemoglobin molecule interaction model, in which thousands of atoms in hemoglobin molecules are coupled with tens of thousands of T3 water molecule models. In this model, complex inter-atomic and inter-molecular potentials are replaced by nonlinear springs. We employ the same method to get the most significant modes and their frequencies of this complex dynamical system. More complex physical phenomena can then be further studied by these coarse grained models.

  13. Sensor Failure Detection of FASSIP System using Principal Component Analysis

    NASA Astrophysics Data System (ADS)

    Sudarno; Juarsa, Mulya; Santosa, Kussigit; Deswandri; Sunaryo, Geni Rina

    2018-02-01

    In the nuclear reactor accident of Fukushima Daiichi in Japan, the damages of core and pressure vessel were caused by the failure of its active cooling system (diesel generator was inundated by tsunami). Thus researches on passive cooling system for Nuclear Power Plant are performed to improve the safety aspects of nuclear reactors. The FASSIP system (Passive System Simulation Facility) is an installation used to study the characteristics of passive cooling systems at nuclear power plants. The accuracy of sensor measurement of FASSIP system is essential, because as the basis for determining the characteristics of a passive cooling system. In this research, a sensor failure detection method for FASSIP system is developed, so the indication of sensor failures can be detected early. The method used is Principal Component Analysis (PCA) to reduce the dimension of the sensor, with the Squarred Prediction Error (SPE) and statistic Hotteling criteria for detecting sensor failure indication. The results shows that PCA method is capable to detect the occurrence of a failure at any sensor.

  14. An Efficient Taguchi Approach for the Performance Optimization of Health, Safety, Environment and Ergonomics in Generation Companies.

    PubMed

    Azadeh, Ali; Sheikhalishahi, Mohammad

    2015-06-01

    A unique framework for performance optimization of generation companies (GENCOs) based on health, safety, environment, and ergonomics (HSEE) indicators is presented. To rank this sector of industry, the combination of data envelopment analysis (DEA), principal component analysis (PCA), and Taguchi are used for all branches of GENCOs. These methods are applied in an integrated manner to measure the performance of GENCO. The preferred model between DEA, PCA, and Taguchi is selected based on sensitivity analysis and maximum correlation between rankings. To achieve the stated objectives, noise is introduced into input data. The results show that Taguchi outperforms other methods. Moreover, a comprehensive experiment is carried out to identify the most influential factor for ranking GENCOs. The approach developed in this study could be used for continuous assessment and improvement of GENCO's performance in supplying energy with respect to HSEE factors. The results of such studies would help managers to have better understanding of weak and strong points in terms of HSEE factors.

  15. Screening of the key volatile organic compounds of Tuber melanosporum fermentation by aroma sensory evaluation combination with principle component analysis

    PubMed Central

    Liu, Rui-Sang; Jin, Guang-Huai; Xiao, Deng-Rong; Li, Hong-Mei; Bai, Feng-Wu; Tang, Ya-Jie

    2015-01-01

    Aroma results from the interplay of volatile organic compounds (VOCs) and the attributes of microbial-producing aromas are significantly affected by fermentation conditions. Among the VOCs, only a few of them contribute to aroma. Thus, screening and identification of the key VOCs is critical for microbial-producing aroma. The traditional method is based on gas chromatography-olfactometry (GC-O), which is time-consuming and laborious. Considering the Tuber melanosporum fermentation system as an example, a new method to screen and identify the key VOCs by combining the aroma evaluation method with principle component analysis (PCA) was developed in this work. First, an aroma sensory evaluation method was developed to screen 34 potential favorite aroma samples from 504 fermentation samples. Second, PCA was employed to screen nine common key VOCs from these 34 samples. Third, seven key VOCs were identified by the traditional method. Finally, all of the seven key VOCs identified by the traditional method were also identified, along with four others, by the new strategy. These results indicate the reliability of the new method and demonstrate it to be a viable alternative to the traditional method. PMID:26655663

  16. Substantial Family History of Prostate Cancer in Black Men Recruited for Prostate Cancer Screening

    PubMed Central

    Mastalski, Kathleen; Coups, Elliot J.; Ruth, Karen; Raysor, Susan; Giri, Veda N.

    2008-01-01

    Background Black men are at increased risk for prostate cancer (PCA), particularly with a family history (FH) of the disease. Previous reports have raised concern for suboptimal screening of Black men with a FH of PCA. We report on the extent of FH of PCA from a prospective, longitudinal PCA screening program for high-risk men. Methods Black men ages 35-69 are eligible for PCA screening through the Prostate Cancer Risk Assessment Program (PRAP) regardless of FH. Rates of self-reported FH of PCA, breast, and colon cancer at baseline were compared with an age-matched sample of Black men from the 2005 National Health Interview Survey (NHIS) using standard statistical methods. Results As of January 2007, 332 Black men with pedigree information were enrolled in PRAP and FH of PCA was compared to 838 Black men from the 2005 NHIS. Black men in PRAP reported significantly more first-degree relatives with PCA compared to Black men in the 2005 NHIS (34.3%, 95% CI 29.2-39.7 vs. 5.7%, 95% CI 3.9-7.4). Black men in PRAP also had more FH of breast cancer compared to the 2005 NHIS (11.5%, 95% CI 8.2-15.4 vs 6.3%, 95% CI 4.6-8.0). Conclusions FH of PCA appears to be a motivating factor for Black men seeking PCA screening. Targeted recruitment and education among Black families should improve PCA screening rates. Efforts to recruit Black men without a FH of PCA are also needed. Condensed Abstract Black men seeking prostate cancer screening have a substantial burden of family history of prostate cancer. Targeted education and enhancing discussion in Black families should increase prostate cancer screening and adherence. PMID:18816608

  17. Isolation of candidate genes for apomictic development in buffelgrass (Pennisetum ciliare).

    PubMed

    Singh, Manjit; Burson, Byron L; Finlayson, Scott A

    2007-08-01

    Asexual reproduction through seeds, or apomixis, is a process that holds much promise for agricultural advances. However, the molecular mechanisms underlying apomixis are currently poorly understood. To identify genes related to female gametophyte development in apomictic ovaries of buffelgrass (Pennisetum ciliare (L.) Link), Suppression Subtractive Hybridization of ovary cDNA with leaf cDNA was performed. Through macroarray screening of subtracted cDNAs two genes were identified, Pca21 and Pca24, that showed differential expression between apomictic and sexual ovaries. Sequence analysis showed that both Pca21 and Pca24 are novel genes not previously characterized in plants. Pca21 shows homology to two wheat genes that are also expressed during reproductive development. Pca24 has similarity to coiled-coil-helix-coiled-coil-helix (CHCH) domain containing proteins from maize and sugarcane. Northern blot analysis revealed that both of these genes are expressed throughout female gametophyte development in apomictic ovaries. In situ hybridizations localized the transcript of these two genes to the developing embryo sacs in the apomictic ovaries. Based on the expression patterns it was concluded that Pca21 and Pca24 likely play a role during apomictic development in buffelgrass.

  18. Meta-analysis of CDKN2A methylation to find its role in prostate cancer development and progression, and also to find the effect of CDKN2A expression on disease-free survival (PRISMA).

    PubMed

    Cao, Zipei; Wei, Lijuan; Zhu, Weizhi; Yao, Xuping

    2018-03-01

    Reduction of cyclin-dependent kinase inhibitor 2A (CDKN2A) (p16 and p14) expression through DNA methylation has been reported in prostate cancer (PCa). This meta-analysis was conducted to assess the difference of p16 and p14 methylation between PCa and different histological types of nonmalignant controls and the correlation of p16 or p14 methylation with clinicopathological features of PCa. According to the preferred reporting items for systematic reviews and meta-analyses (PRISMA) statement criteria, articles were searched in PubMed, Embase, EBSCO, Wanfang, and CNKI databases. The strength of correlation was calculated by the pooled odds ratios (ORs) and their corresponding 95% confidence intervals (95% CIs). Trial sequential analysis (TSA) was used to estimate the required population information for significant results. A total of 20 studies published from 1997 to 2017 were identified in this meta-analysis, including 1140 PCa patients and 530 cases without cancer. Only p16 methylation in PCa was significantly higher than in benign prostatic lesions (OR = 4.72, P = .011), but had a similar level in PCa and adjacent tissues or high-grade prostatic intraepithelial neoplasias (HGPIN). TSA revealed that this analysis on p16 methylation is a false positive result in cancer versus benign prostatic lesions (the estimated required information size of 5116 participants). p16 methylation was not correlated with PCa in the urine and blood. Besides, p16 methylation was not linked to clinical stage, prostate-specific antigen (PSA) level, and Gleason score (GS) of patients with PCa. p14 methylation was not correlated with PCa in tissue and urine samples. No correlation was observed between p14 methylation and clinical stage or GS. CDKN2A mutation and copy number alteration were not associated with prognosis of PCa in overall survival and disease-free survival. CDKN2A expression was not correlated with the prognosis of PCa in overall survival (492 cases) (P > .1), while CDKN2A expression was significantly associated with a poor disease-free survival (P < .01). CDKN2A methylation may not be significantly associated with the development, progression of PCa. Although CDKN2A expression had an unfavorable prognosis in disease-free survival. More studies are needed to confirm our results.

  19. MindEdit: A P300-based text editor for mobile devices.

    PubMed

    Elsawy, Amr S; Eldawlatly, Seif; Taher, Mohamed; Aly, Gamal M

    2017-01-01

    Practical application of Brain-Computer Interfaces (BCIs) requires that the whole BCI system be portable. The mobility of BCI systems involves two aspects: making the electroencephalography (EEG) recording devices portable, and developing software applications with low computational complexity to be able to run on low computational-power devices such as tablets and smartphones. This paper addresses the development of MindEdit; a P300-based text editor for Android-based devices. Given the limited resources of mobile devices and their limited computational power, a novel ensemble classifier is utilized that uses Principal Component Analysis (PCA) features to identify P300 evoked potentials from EEG recordings. PCA computations in the proposed method are channel-based as opposed to concatenating all channels as in traditional feature extraction methods; thus, this method has less computational complexity compared to traditional P300 detection methods. The performance of the method is demonstrated on data recorded from MindEdit on an Android tablet using the Emotiv wireless neuroheadset. Results demonstrate the capability of the introduced PCA ensemble classifier to classify P300 data with maximum average accuracy of 78.37±16.09% for cross-validation data and 77.5±19.69% for online test data using only 10 trials per symbol and a 33-character training dataset. Our analysis indicates that the introduced method outperforms traditional feature extraction methods. For a faster operation of MindEdit, a variable number of trials scheme is introduced that resulted in an online average accuracy of 64.17±19.6% and a maximum bitrate of 6.25bit/min. These results demonstrate the efficacy of using the developed BCI application with mobile devices. Copyright © 2016 Elsevier Ltd. All rights reserved.

  20. Epigenetics-related genes in prostate cancer: expression profile in prostate cancer tissues, androgen-sensitive and -insensitive cell lines.

    PubMed

    Shaikhibrahim, Zaki; Lindstrot, Andreas; Ochsenfahrt, Jacqueline; Fuchs, Kerstin; Wernert, Nicolas

    2013-01-01

    Epigenetic changes have been suggested to drive prostate cancer (PCa) development and progression. Therefore, in this study, we aimed to identify novel epigenetics-related genes in PCa tissues, and to examine their expression in metastatic PCa cell lines. We analyzed the expression of epigenetics-related genes via a clustering analysis based on gene function in moderately and poorly differentiated PCa glands compared to normal glands of the peripheral zone (prostate proper) from PCa patients using Whole Human Genome Oligo Microarrays. Our analysis identified 12 epigenetics-related genes with a more than 2-fold increase or decrease in expression and a p-value <0.01. In modera-tely differentiated tumors compared to normal glands of the peripheral zone, we found the genes, TDRD1, IGF2, DICER1, ADARB1, HILS1, GLMN and TRIM27, to be upregulated, whereas TNRC6A and DGCR8 were found to be downregulated. In poorly differentiated tumors, we found TDRD1, ADARB and RBM3 to be upregulated, whereas DGCR8, PIWIL2 and BC069781 were downregulated. Our analysis of the expression level for each gene in the metastatic androgen-sensitive VCaP and LNCaP, and -insensitive PC3 and DU-145 PCa cell lines revealed differences in expression among the cell lines which may reflect the different biological properties of each cell line, and the potential role of each gene at different metastatic sites. The novel epigenetics-related genes that we identified in primary PCa tissues may provide further insight into the role that epigenetic changes play in PCa. Moreover, some of the genes that we identified may play important roles in primary PCa and metastasis, in primary PCa only, or in metastasis only. Follow-up studies are required to investigate the functional role and the role that the expression of these genes play in the outcome and progression of PCa using tissue microarrays.

  1. Intelligence, Surveillance, and Reconnaissance Fusion for Coalition Operations

    DTIC Science & Technology

    2008-07-01

    classification of the targets of interest. The MMI features extracted in this manner have two properties that provide a sound justification for...are generalizations of well- known feature extraction methods such as Principal Components Analysis (PCA) and Independent Component Analysis (ICA...augment (without degrading performance) a large class of generic fusion processes. Ontologies Classifications Feature extraction Feature analysis

  2. Spectral data compression using weighted principal component analysis with consideration of human visual system and light sources

    NASA Astrophysics Data System (ADS)

    Cao, Qian; Wan, Xiaoxia; Li, Junfeng; Liu, Qiang; Liang, Jingxing; Li, Chan

    2016-10-01

    This paper proposed two weight functions based on principal component analysis (PCA) to reserve more colorimetric information in spectral data compression process. One weight function consisted of the CIE XYZ color-matching functions representing the characteristic of the human visual system, while another was made up of the CIE XYZ color-matching functions of human visual system and relative spectral power distribution of the CIE standard illuminant D65. The improvement obtained from the proposed two methods were tested to compress and reconstruct the reflectance spectra of 1600 glossy Munsell color chips and 1950 Natural Color System color chips as well as six multispectral images. The performance was evaluated by the mean values of color difference under the CIE 1931 standard colorimetric observer and the CIE standard illuminant D65 and A. The mean values of root mean square errors between the original and reconstructed spectra were also calculated. The experimental results show that the proposed two methods significantly outperform the standard PCA and another two weighted PCA in the aspects of colorimetric reconstruction accuracy with very slight degradation in spectral reconstruction accuracy. In addition, weight functions with the CIE standard illuminant D65 can improve the colorimetric reconstruction accuracy compared to weight functions without the CIE standard illuminant D65.

  3. Simultaneous and Continuous Estimation of Shoulder and Elbow Kinematics from Surface EMG Signals

    PubMed Central

    Zhang, Qin; Liu, Runfeng; Chen, Wenbin; Xiong, Caihua

    2017-01-01

    In this paper, we present a simultaneous and continuous kinematics estimation method for multiple DoFs across shoulder and elbow joint. Although simultaneous and continuous kinematics estimation from surface electromyography (EMG) is a feasible way to achieve natural and intuitive human-machine interaction, few works investigated multi-DoF estimation across the significant joints of upper limb, shoulder and elbow joints. This paper evaluates the feasibility to estimate 4-DoF kinematics at shoulder and elbow during coordinated arm movements. Considering the potential applications of this method in exoskeleton, prosthetics and other arm rehabilitation techniques, the estimation performance is presented with different muscle activity decomposition and learning strategies. Principle component analysis (PCA) and independent component analysis (ICA) are respectively employed for EMG mode decomposition with artificial neural network (ANN) for learning the electromechanical association. Four joint angles across shoulder and elbow are simultaneously and continuously estimated from EMG in four coordinated arm movements. By using ICA (PCA) and single ANN, the average estimation accuracy 91.12% (90.23%) is obtained in 70-s intra-cross validation and 87.00% (86.30%) is obtained in 2-min inter-cross validation. This result suggests it is feasible and effective to use ICA (PCA) with single ANN for multi-joint kinematics estimation in variant application conditions. PMID:28611573

  4. Identification of fungal phytopathogens using Fourier transform infrared-attenuated total reflection spectroscopy and advanced statistical methods

    NASA Astrophysics Data System (ADS)

    Salman, Ahmad; Lapidot, Itshak; Pomerantz, Ami; Tsror, Leah; Shufan, Elad; Moreh, Raymond; Mordechai, Shaul; Huleihel, Mahmoud

    2012-01-01

    The early diagnosis of phytopathogens is of a great importance; it could save large economical losses due to crops damaged by fungal diseases, and prevent unnecessary soil fumigation or the use of fungicides and bactericides and thus prevent considerable environmental pollution. In this study, 18 isolates of three different fungi genera were investigated; six isolates of Colletotrichum coccodes, six isolates of Verticillium dahliae and six isolates of Fusarium oxysporum. Our main goal was to differentiate these fungi samples on the level of isolates, based on their infrared absorption spectra obtained using the Fourier transform infrared-attenuated total reflection (FTIR-ATR) sampling technique. Advanced statistical and mathematical methods: principal component analysis (PCA), linear discriminant analysis (LDA), and k-means were applied to the spectra after manipulation. Our results showed significant spectral differences between the various fungi genera examined. The use of k-means enabled classification between the genera with a 94.5% accuracy, whereas the use of PCA [3 principal components (PCs)] and LDA has achieved a 99.7% success rate. However, on the level of isolates, the best differentiation results were obtained using PCA (9 PCs) and LDA for the lower wavenumber region (800-1775 cm-1), with identification success rates of 87%, 85.5%, and 94.5% for Colletotrichum, Fusarium, and Verticillium strains, respectively.

  5. Discrimination of geographical origin and detection of adulteration of kudzu root by fluorescence spectroscopy coupled with multi-way pattern recognition

    NASA Astrophysics Data System (ADS)

    Hu, Leqian; Ma, Shuai; Yin, Chunling

    2018-03-01

    In this work, fluorescence spectroscopy combined with multi-way pattern recognition techniques were developed for determining the geographical origin of kudzu root and detection and quantification of adulterants in kudzu root. Excitation-emission (EEM) spectra were obtained for 150 pure kudzu root samples of different geographical origins and 150 fake kudzu roots with different adulteration proportions by recording emission from 330 to 570 nm with excitation in the range of 320-480 nm, respectively. Multi-way principal components analysis (M-PCA) and multilinear partial least squares discriminant analysis (N-PLS-DA) methods were used to decompose the excitation-emission matrices datasets. 150 pure kudzu root samples could be differentiated exactly from each other according to their geographical origins by M-PCA and N-PLS-DA models. For the adulteration kudzu root samples, N-PLS-DA got better and more reliable classification result comparing with the M-PCA model. The results obtained in this study indicated that EEM spectroscopy coupling with multi-way pattern recognition could be used as an easy, rapid and novel tool to distinguish the geographical origin of kudzu root and detect adulterated kudzu root. Besides, this method was also suitable for determining the geographic origin and detection the adulteration of the other foodstuffs which can produce fluorescence.

  6. Prostate cancer molecular detection in plasma samples by glutathione S-transferase P1 (GSTP1) methylation analysis.

    PubMed

    Dumache, Raluca; Puiu, Maria; Motoc, Marilena; Vernic, Corina; Dumitrascu, Victor

    2014-01-01

    Prostate cancer (PCa) represents the most commonly diagnosed type of malignancy among men in Western European countries and the second cause of cancer-related deaths among men worldwide. Methylation of the CpG island has an important role in prostate carcinogenesis and progression. The purpose of the study was to analyse the diagnostic value of aberrant promoter hypermethylation of the gene for glutathione S-transferase P1 (GSTP1) in plasma DNA to discriminate between prostate cancer (PCa) and benign prostatic hyperplasia (BPH) patients by minimally invasive methods. Aberrant promoter hypermethylation was investigated in DNA isolated from plasma samples of 31 patients with diagnostic of PCa and 44 cancer-free males (control subjects). Extracted genomic DNA was bisulfite treated and analyzed using methylation-specific polymerase chain reaction (MS-PCR) technique. Hypermethylation of the GSTP1 gene was detected in plasma samples from 27 of 31 (92.86%) patients with PCa. Genomic DNA from plasma samples from the 44 controls without genitourinary cancer revealed promoter hypermethylation of GSTP1 gene in 3 (10.6%) of the 44 patients. Receiver operating curve (ROC) included clinico-pathological parameters such as: serum PSA levels, pathological stage, Gleason score, hypermethylation status of GSTP1 gene, and it gave a predictive accuracy of 93% with a sensitivity and specificity of 95% and 87%, respectively. In this study, we have evaluated the ability of GSTP1 gene to discriminate between PCa and BPH patients in genomic DNA from plasma samples by non-invasive methods.

  7. Researches of fruit quality prediction model based on near infrared spectrum

    NASA Astrophysics Data System (ADS)

    Shen, Yulin; Li, Lian

    2018-04-01

    With the improvement in standards for food quality and safety, people pay more attention to the internal quality of fruits, therefore the measurement of fruit internal quality is increasingly imperative. In general, nondestructive soluble solid content (SSC) and total acid content (TAC) analysis of fruits is vital and effective for quality measurement in global fresh produce markets, so in this paper, we aim at establishing a novel fruit internal quality prediction model based on SSC and TAC for Near Infrared Spectrum. Firstly, the model of fruit quality prediction based on PCA + BP neural network, PCA + GRNN network, PCA + BP adaboost strong classifier, PCA + ELM and PCA + LS_SVM classifier are designed and implemented respectively; then, in the NSCT domain, the median filter and the SavitzkyGolay filter are used to preprocess the spectral signal, Kennard-Stone algorithm is used to automatically select the training samples and test samples; thirdly, we achieve the optimal models by comparing 15 kinds of prediction model based on the theory of multi-classifier competition mechanism, specifically, the non-parametric estimation is introduced to measure the effectiveness of proposed model, the reliability and variance of nonparametric estimation evaluation of each prediction model to evaluate the prediction result, while the estimated value and confidence interval regard as a reference, the experimental results demonstrate that this model can better achieve the optimal evaluation of the internal quality of fruit; finally, we employ cat swarm optimization to optimize two optimal models above obtained from nonparametric estimation, empirical testing indicates that the proposed method can provide more accurate and effective results than other forecasting methods.

  8. The Effect of Phenazine-1-Carboxylic Acid on Mycelial Growth of Botrytis cinerea Produced by Pseudomonas aeruginosa LV Strain.

    PubMed

    Simionato, Ane S; Navarro, Miguel O P; de Jesus, Maria L A; Barazetti, André R; da Silva, Caroline S; Simões, Glenda C; Balbi-Peña, Maria I; de Mello, João C P; Panagio, Luciano A; de Almeida, Ricardo S C; Andrade, Galdino; de Oliveira, Admilton G

    2017-01-01

    One of the most important postharvest plant pathogens that affect strawberries, grapes and tomatoes is Botrytis cinerea , known as gray mold. The fungus remains in latent form until spore germination conditions are good, making infection control difficult, causing great losses in the whole production chain. This study aimed to purify and identify phenazine-1-carboxylic acid (PCA) produced by the Pseudomonas aeruginosa LV strain and to determine its antifungal activity against B. cinerea . The compounds produced were extracted with dichloromethane and passed through a chromatographic process. The purity level of PCA was determined by reversed-phase high-performance liquid chromatography semi-preparative. The structure of PCA was confirmed by nuclear magnetic resonance and electrospray ionization mass spectrometry. Antifungal activity was determined by the dry paper disk and minimum inhibitory concentration (MIC) methods and identified by scanning electron microscopy and confocal microscopy. The results showed that PCA inhibited mycelial growth, where MIC was 25 μg mL -1 . Microscopic analysis revealed a reduction in exopolysaccharide (EPS) formation, showing distorted and damaged hyphae of B. cinerea . The results suggested that PCA has a high potential in the control of B. cinerea and inhibition of EPS (important virulence factor). This natural compound is a potential alternative to postharvest control of gray mold disease.

  9. Prostate Cancer Radiation Therapy and Risk of Thromboembolic Events

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bosco, Cecilia, E-mail: Cecilia.t.bosco@kcl.ac.uk; Garmo, Hans; Regional Cancer Centre, Uppsala, Akademiska Sjukhuset, Uppsala

    Purpose: To investigate the risk of thromboembolic disease (TED) after radiation therapy (RT) with curative intent for prostate cancer (PCa). Patients and Methods: We identified all men who received RT as curative treatment (n=9410) and grouped according to external beam RT (EBRT) or brachytherapy (BT). By comparing with an age- and county-matched comparison cohort of PCa-free men (n=46,826), we investigated risk of TED after RT using Cox proportional hazard regression models. The model was adjusted for tumor characteristics, demographics, comorbidities, PCa treatments, and known risk factors of TED, such as recent surgery and disease progression. Results: Between 2006 and 2013, 6232more » men with PCa received EBRT, and 3178 underwent BT. A statistically significant association was found between EBRT and BT and risk of pulmonary embolism in the crude analysis. However, upon adjusting for known TED risk factors these associations disappeared. No significant associations were found between BT or EBRT and deep venous thrombosis. Conclusion: Curative RT for prostate cancer using contemporary methodologies was not associated with an increased risk of TED.« less

  10. Identification of an IL-1-induced gene expression pattern in AR+ PCa cells that mimics the molecular phenotype of AR- PCa cells.

    PubMed

    Thomas-Jardin, Shayna E; Kanchwala, Mohammed S; Jacob, Joan; Merchant, Sana; Meade, Rachel K; Gahnim, Nagham M; Nawas, Afshan F; Xing, Chao; Delk, Nikki A

    2018-06-01

    In immunosurveillance, bone-derived immune cells infiltrate the tumor and secrete inflammatory cytokines to destroy cancer cells. However, cancer cells have evolved mechanisms to usurp inflammatory cytokines to promote tumor progression. In particular, the inflammatory cytokine, interleukin-1 (IL-1), is elevated in prostate cancer (PCa) patient tissue and serum, and promotes PCa bone metastasis. IL-1 also represses androgen receptor (AR) accumulation and activity in PCa cells, yet the cells remain viable and tumorigenic; suggesting that IL-1 may also contribute to AR-targeted therapy resistance. Furthermore, IL-1 and AR protein levels negatively correlate in PCa tumor cells. Taken together, we hypothesize that IL-1 reprograms AR positive (AR + ) PCa cells into AR negative (AR - ) PCa cells that co-opt IL-1 signaling to ensure AR-independent survival and tumor progression in the inflammatory tumor microenvironment. LNCaP and PC3 PCa cells were treated with IL-1β or HS-5 bone marrow stromal cell (BMSC) conditioned medium and analyzed by RNA sequencing and RT-QPCR. To verify genes identified by RNA sequencing, LNCaP, MDA-PCa-2b, PC3, and DU145 PCa cell lines were treated with the IL-1 family members, IL-1α or IL-1β, or exposed to HS-5 BMSC in the presence or absence of Interleukin-1 Receptor Antagonist (IL-1RA). Treated cells were analyzed by western blot and/or RT-QPCR. Comparative analysis of sequencing data from the AR + LNCaP PCa cell line versus the AR - PC3 PCa cell line reveals an IL-1-conferred gene suite in LNCaP cells that is constitutive in PC3 cells. Bioinformatics analysis of the IL-1 regulated gene suite revealed that inflammatory and immune response pathways are primarily elicited; likely facilitating PCa cell survival and tumorigenicity in an inflammatory tumor microenvironment. Our data supports that IL-1 reprograms AR + PCa cells to mimic AR - PCa gene expression patterns that favor AR-targeted treatment resistance and cell survival. © 2018 Wiley Periodicals, Inc.

  11. Analysis of environmental variation in a Great Plains reservoir using principal components analysis and geographic information systems

    USGS Publications Warehouse

    Long, J.M.; Fisher, W.L.

    2006-01-01

    We present a method for spatial interpretation of environmental variation in a reservoir that integrates principal components analysis (PCA) of environmental data with geographic information systems (GIS). To illustrate our method, we used data from a Great Plains reservoir (Skiatook Lake, Oklahoma) with longitudinal variation in physicochemical conditions. We measured 18 physicochemical features, mapped them using GIS, and then calculated and interpreted four principal components. Principal component 1 (PC1) was readily interpreted as longitudinal variation in water chemistry, but the other principal components (PC2-4) were difficult to interpret. Site scores for PC1-4 were calculated in GIS by summing weighted overlays of the 18 measured environmental variables, with the factor loadings from the PCA as the weights. PC1-4 were then ordered into a landscape hierarchy, an emergent property of this technique, which enabled their interpretation. PC1 was interpreted as a reservoir scale change in water chemistry, PC2 was a microhabitat variable of rip-rap substrate, PC3 identified coves/embayments and PC4 consisted of shoreline microhabitats related to slope. The use of GIS improved our ability to interpret the more obscure principal components (PC2-4), which made the spatial variability of the reservoir environment more apparent. This method is applicable to a variety of aquatic systems, can be accomplished using commercially available software programs, and allows for improved interpretation of the geographic environmental variability of a system compared to using typical PCA plots. ?? Copyright by the North American Lake Management Society 2006.

  12. The impact of metformin use on survival in prostate cancer: a systematic review and meta-analysis

    PubMed Central

    Xiao, Yao; Zheng, Lei; Mei, Zubing; Xu, Changbao; Liu, Changwei; Chu, Xiaohan; Hao, Bin

    2017-01-01

    Background Metformin has been implicated to reduce the risk of prostate cancer (PCa) beyond its glucose-lowering effect. However, the influence of metformin on prognosis of PCa is often controversial. Results A total of 13 cohort studies encompassing 177,490 individuals were included in the meta-analysis. Data on overall survival (OS) and cancer-specific survival (CSS) was extracted from 8 and six studies, respectively. Comparing metformin users with non-metformin users, the pooled hazard ratios (HRs) for OS and CSS were 0.79 (95% confidence interval [CI] 0.63–0.98) and 0.76 (95% CI 0.57–1.02), respectively. Subgroup analyses stratified by baseline charcteristics indicated significant CSS benefits were noted in studies conducted in USA/Canada with prospective, large sample size, multiple-centered study design. Five studies reported the PCa prognosis for recurrence-free survival (RFS) and metformin use was significantly associated with patient RFS (HR 0.74, 95% CI, 0.58–0.95). Methods Relevant studies were searched and identified using PubMed, Embase and Cochrane databases from inception through January 2017, which investigated associations between the use of metformin and PCa prognosis. Combined HRs with 95% CI were pooled using a random-effects model. The primary outcomes of interest were OS and CSS. Conclusions Our findings provide indication that metformin therapy has a trend to improve survival for patients with PCa. Further prospective, multi-centered, large sample size cohort studies are warranted to determine the true relationship. PMID:29245991

  13. Comprehensive Profiling and Quantification of Ginsenosides in the Root, Stem, Leaf, and Berry of Panax ginseng by UPLC-QTOF/MS.

    PubMed

    Lee, Jae Won; Choi, Bo-Ram; Kim, Young-Chang; Choi, Doo Jin; Lee, Young-Seob; Kim, Geum-Soog; Baek, Nam-In; Kim, Seung-Yu; Lee, Dae Young

    2017-12-04

    The effective production and usage of ginsenosides, given their distinct pharmacological effects, are receiving increasing amounts of attention. As the ginsenosides content differs in different parts of Panax ginseng, we wanted to assess and compare the ginsenosides content in the ginseng roots, leave, stems, and berries. To extract the ginsenosides, 70% (v/v) methanol was used. The optimal ultra-performance liquid chromatography-quadrupole time of flight mass spectrometry (UPLC-QTOF/MS) method was used to profile various ginsenosides from the different parts of P. ginseng. The datasets were then subjected to multivariate analysis including principal component analysis (PCA) and hierarchical clustering analysis (HCA). A UPLC-QTOF/MS method with an in-house library was constructed to profile 58 ginsenosides. With this method, a total of 39 ginsenosides were successfully identified and quantified in the ginseng roots, leave, stem, and berries. PCA and HCA characterized the different ginsenosides compositions from the different parts. The quantitative ginsenoside contents were also characterized from each plant part. The results of this study indicate that the UPLC-QTOF/MS method can be an effective tool to characterize various ginsenosides from the different parts of P. ginseng.

  14. Discrimination of Geographical Origin of Asian Garlic Using Isotopic and Chemical Datasets under Stepwise Principal Component Analysis.

    PubMed

    Liu, Tsang-Sen; Lin, Jhen-Nan; Peng, Tsung-Ren

    2018-01-16

    Isotopic compositions of δ 2 H, δ 18 O, δ 13 C, and δ 15 N and concentrations of 22 trace elements from garlic samples were analyzed and processed with stepwise principal component analysis (PCA) to discriminate garlic's country of origin among Asian regions including South Korea, Vietnam, Taiwan, and China. Results indicate that there is no single trace-element concentration or isotopic composition that can accomplish the study's purpose and the stepwise PCA approach proposed does allow for discrimination between countries on a regional basis. Sequentially, Step-1 PCA distinguishes garlic's country of origin among Taiwanese, South Korean, and Vietnamese samples; Step-2 PCA discriminates Chinese garlic from South Korean garlic; and Step-3 and Step-4 PCA, Chinese garlic from Vietnamese garlic. In model tests, countries of origin of all audit samples were correctly discriminated by stepwise PCA. Consequently, this study demonstrates that stepwise PCA as applied is a simple and effective approach to discriminating country of origin among Asian garlics. © 2018 American Academy of Forensic Sciences.

  15. Targeted and non-targeted detection of lemon juice adulteration by LC-MS and chemometrics.

    PubMed

    Wang, Zhengfang; Jablonski, Joseph E

    2016-01-01

    Economically motivated adulteration (EMA) of lemon juice was detected by LC-MS and principal component analysis (PCA). Twenty-two batches of freshly squeezed lemon juice were adulterated by adding an aqueous solution containing 5% citric acid and 6% sucrose to pure lemon juice to obtain 30%, 60% and 100% lemon juice samples. Their total titratable acidities, °Brix and pH values were measured, and then all the lemon juice samples were subject to LC-MS analysis. Concentrations of hesperidin and eriocitrin, major phenolic components of lemon juice, were quantified. The PCA score plots for LC-MS datasets were used to preview the classification of pure and adulterated lemon juice samples. Results showed a large inherent variability in the chemical properties among 22 batches of 100% lemon juice samples. Measurement or quantitation of one or several chemical properties (targeted detection) was not effective in detecting lemon juice adulteration. However, by using the LC-MS datasets, including both chromatographic and mass spectrometric information, 100% lemon juice samples were successfully differentiated from adulterated samples containing 30% lemon juice in the PCA score plot. LC-MS coupled with chemometric analysis can be a complement to existing methods for detecting juice adulteration.

  16. Plaque echodensity and textural features are associated with histologic carotid plaque instability.

    PubMed

    Doonan, Robert J; Gorgui, Jessica; Veinot, Jean P; Lai, Chi; Kyriacou, Efthyvoulos; Corriveau, Marc M; Steinmetz, Oren K; Daskalopoulou, Stella S

    2016-09-01

    Carotid plaque echodensity and texture features predict cerebrovascular symptomatology. Our purpose was to determine the association of echodensity and textural features obtained from a digital image analysis (DIA) program with histologic features of plaque instability as well as to identify the specific morphologic characteristics of unstable plaques. Patients scheduled to undergo carotid endarterectomy were recruited and underwent carotid ultrasound imaging. DIA was performed to extract echodensity and textural features using Plaque Texture Analysis software (LifeQ Medical Ltd, Nicosia, Cyprus). Carotid plaque surgical specimens were obtained and analyzed histologically. Principal component analysis (PCA) was performed to reduce imaging variables. Logistic regression models were used to determine if PCA variables and individual imaging variables predicted histologic features of plaque instability. Image analysis data from 160 patients were analyzed. Individual imaging features of plaque echolucency and homogeneity were associated with a more unstable plaque phenotype on histology. These results were independent of age, sex, and degree of carotid stenosis. PCA reduced 39 individual imaging variables to five PCA variables. PCA1 and PCA2 were significantly associated with overall plaque instability on histology (both P = .02), whereas PCA3 did not achieve statistical significance (P = .07). DIA features of carotid plaques are associated with histologic plaque instability as assessed by multiple histologic features. Importantly, unstable plaques on histology appear more echolucent and homogeneous on ultrasound imaging. These results are independent of stenosis, suggesting that image analysis may have a role in refining the selection of patients who undergo carotid endarterectomy. Copyright © 2016 Society for Vascular Surgery. Published by Elsevier Inc. All rights reserved.

  17. Quantitative assessment of the mechanical properties of prostate tissue with optical coherence elastography

    NASA Astrophysics Data System (ADS)

    Ling, Yuting; Li, Chunhui; Zhou, Kanheng; Guan, Guangying; Lang, Stephen; McGloin, David; Nabi, Ghulam; Huang, Zhihong

    2018-02-01

    Prostate cancer (PCa) is a heterogeneous disease with multifocal origin. In current clinical care, the Gleason scoring system is the well-established diagnosis by microscopic evaluation of the tissue from trans-rectal ultrasound (TRUS) guided biopsies. Nevertheless, the sensitivity and specificity in detecting PCa can range from 40 to 50% for conventional TRUS B-mode imaging. Tissue elasticity is associated with the disease progression and elastography technique has recently shown promise in aiding PCa diagnosis. However, many cancer foci in the prostate gland has very small size less than 1 mm and those detected by medical elastography were larger than 2 mm. Hereby, we introduce optical coherence elastography (OCE) to quantify the prostate stiffness with high resolution in the magnitude of 10 µm. Following our feasibility study of 10 patients reported previously, we recruited 60 more patients undergoing 12-core TRUS guided biopsies for suspected PCa with a total of 720 biopsies. The stiffness of cancer tissue was approximately 57.63% higher than that of benign ones. Using histology as reference standard and cut-off threshold of 600kPa, the data analysis showed sensitivity and specificity of 89.6% and 99.8% respectively. The method also demonstrated potential in characterising different grades of PCa based on the change of tissue morphology and quantitative mechanical properties. In conclusion, quantitative OCE can be a reliable technique to identify PCa lesion and differentiate indolent from aggressive cancer.

  18. Online dimensionality reduction using competitive learning and Radial Basis Function network.

    PubMed

    Tomenko, Vladimir

    2011-06-01

    The general purpose dimensionality reduction method should preserve data interrelations at all scales. Additional desired features include online projection of new data, processing nonlinearly embedded manifolds and large amounts of data. The proposed method, called RBF-NDR, combines these features. RBF-NDR is comprised of two modules. The first module learns manifolds by utilizing modified topology representing networks and geodesic distance in data space and approximates sampled or streaming data with a finite set of reference patterns, thus achieving scalability. Using input from the first module, the dimensionality reduction module constructs mappings between observation and target spaces. Introduction of specific loss function and synthesis of the training algorithm for Radial Basis Function network results in global preservation of data structures and online processing of new patterns. The RBF-NDR was applied for feature extraction and visualization and compared with Principal Component Analysis (PCA), neural network for Sammon's projection (SAMANN) and Isomap. With respect to feature extraction, the method outperformed PCA and yielded increased performance of the model describing wastewater treatment process. As for visualization, RBF-NDR produced superior results compared to PCA and SAMANN and matched Isomap. For the Topic Detection and Tracking corpus, the method successfully separated semantically different topics. Copyright © 2011 Elsevier Ltd. All rights reserved.

  19. Differentiation of live and dead salmonella cells using fourier transform infrared (FTIR) spectroscopy and principle component analysis (PCA) technique

    USDA-ARS?s Scientific Manuscript database

    Various technologies have been developed for pathogen detection using optical, electrochemical, biochemical and physical properties. Conventional microbiological methods need time from days to week to get the result. Though this method is very sensitive and accurate, a rapid detection of pathogens i...

  20. Respiratory motion compensation algorithm of ultrasound hepatic perfusion data acquired in free-breathing

    NASA Astrophysics Data System (ADS)

    Wu, Kaizhi; Zhang, Xuming; Chen, Guangxie; Weng, Fei; Ding, Mingyue

    2013-10-01

    Images acquired in free breathing using contrast enhanced ultrasound exhibit a periodic motion that needs to be compensated for if a further accurate quantification of the hepatic perfusion analysis is to be executed. In this work, we present an algorithm to compensate the respiratory motion by effectively combining the PCA (Principal Component Analysis) method and block matching method. The respiratory kinetics of the ultrasound hepatic perfusion image sequences was firstly extracted using the PCA method. Then, the optimal phase of the obtained respiratory kinetics was detected after normalizing the motion amplitude and determining the image subsequences of the original image sequences. The image subsequences were registered by the block matching method using cross-correlation as the similarity. Finally, the motion-compensated contrast images can be acquired by using the position mapping and the algorithm was evaluated by comparing the TICs extracted from the original image sequences and compensated image subsequences. Quantitative comparisons demonstrated that the average fitting error estimated of ROIs (region of interest) was reduced from 10.9278 +/- 6.2756 to 5.1644 +/- 3.3431 after compensating.

  1. Satellite image fusion based on principal component analysis and high-pass filtering.

    PubMed

    Metwalli, Mohamed R; Nasr, Ayman H; Allah, Osama S Farag; El-Rabaie, S; Abd El-Samie, Fathi E

    2010-06-01

    This paper presents an integrated method for the fusion of satellite images. Several commercial earth observation satellites carry dual-resolution sensors, which provide high spatial resolution or simply high-resolution (HR) panchromatic (pan) images and low-resolution (LR) multi-spectral (MS) images. Image fusion methods are therefore required to integrate a high-spectral-resolution MS image with a high-spatial-resolution pan image to produce a pan-sharpened image with high spectral and spatial resolutions. Some image fusion methods such as the intensity, hue, and saturation (IHS) method, the principal component analysis (PCA) method, and the Brovey transform (BT) method provide HR MS images, but with low spectral quality. Another family of image fusion methods, such as the high-pass-filtering (HPF) method, operates on the basis of the injection of high frequency components from the HR pan image into the MS image. This family of methods provides less spectral distortion. In this paper, we propose the integration of the PCA method and the HPF method to provide a pan-sharpened MS image with superior spatial resolution and less spectral distortion. The experimental results show that the proposed fusion method retains the spectral characteristics of the MS image and, at the same time, improves the spatial resolution of the pan-sharpened image.

  2. Investigation of inversion polymorphisms in the human genome using principal components analysis.

    PubMed

    Ma, Jianzhong; Amos, Christopher I

    2012-01-01

    Despite the significant advances made over the last few years in mapping inversions with the advent of paired-end sequencing approaches, our understanding of the prevalence and spectrum of inversions in the human genome has lagged behind other types of structural variants, mainly due to the lack of a cost-efficient method applicable to large-scale samples. We propose a novel method based on principal components analysis (PCA) to characterize inversion polymorphisms using high-density SNP genotype data. Our method applies to non-recurrent inversions for which recombination between the inverted and non-inverted segments in inversion heterozygotes is suppressed due to the loss of unbalanced gametes. Inside such an inversion region, an effect similar to population substructure is thus created: two distinct "populations" of inversion homozygotes of different orientations and their 1:1 admixture, namely the inversion heterozygotes. This kind of substructure can be readily detected by performing PCA locally in the inversion regions. Using simulations, we demonstrated that the proposed method can be used to detect and genotype inversion polymorphisms using unphased genotype data. We applied our method to the phase III HapMap data and inferred the inversion genotypes of known inversion polymorphisms at 8p23.1 and 17q21.31. These inversion genotypes were validated by comparing with literature results and by checking Mendelian consistency using the family data whenever available. Based on the PCA-approach, we also performed a preliminary genome-wide scan for inversions using the HapMap data, which resulted in 2040 candidate inversions, 169 of which overlapped with previously reported inversions. Our method can be readily applied to the abundant SNP data, and is expected to play an important role in developing human genome maps of inversions and exploring associations between inversions and susceptibility of diseases.

  3. Principal Component Analysis: Resources for an Essential Application of Linear Algebra

    ERIC Educational Resources Information Center

    Pankavich, Stephen; Swanson, Rebecca

    2015-01-01

    Principal Component Analysis (PCA) is a highly useful topic within an introductory Linear Algebra course, especially since it can be used to incorporate a number of applied projects. This method represents an essential application and extension of the Spectral Theorem and is commonly used within a variety of fields, including statistics,…

  4. Dynamic analysis environment for nuclear forensic analyses

    NASA Astrophysics Data System (ADS)

    Stork, C. L.; Ummel, C. C.; Stuart, D. S.; Bodily, S.; Goldblum, B. L.

    2017-01-01

    A Dynamic Analysis Environment (DAE) software package is introduced to facilitate group inclusion/exclusion method testing, evaluation and comparison for pre-detonation nuclear forensics applications. Employing DAE, the multivariate signatures of a questioned material can be compared to the signatures for different, known groups, enabling the linking of the questioned material to its potential process, location, or fabrication facility. Advantages of using DAE for group inclusion/exclusion include built-in query tools for retrieving data of interest from a database, the recording and documentation of all analysis steps, a clear visualization of the analysis steps intelligible to a non-expert, and the ability to integrate analysis tools developed in different programming languages. Two group inclusion/exclusion methods are implemented in DAE: principal component analysis, a parametric feature extraction method, and k nearest neighbors, a nonparametric pattern recognition method. Spent Fuel Isotopic Composition (SFCOMPO), an open source international database of isotopic compositions for spent nuclear fuels (SNF) from 14 reactors, is used to construct PCA and KNN models for known reactor groups, and 20 simulated SNF samples are utilized in evaluating the performance of these group inclusion/exclusion models. For all 20 simulated samples, PCA in conjunction with the Q statistic correctly excludes a large percentage of reactor groups and correctly includes the true reactor of origination. Employing KNN, 14 of the 20 simulated samples are classified to their true reactor of origination.

  5. Comparison between target magnetic resonance imaging (MRI) in-gantry and cognitively directed transperineal or transrectal-guided prostate biopsies for Prostate Imaging-Reporting and Data System (PI-RADS) 3-5 MRI lesions.

    PubMed

    Yaxley, Anna J; Yaxley, John W; Thangasamy, Isaac A; Ballard, Emma; Pokorny, Morgan R

    2017-11-01

    To compare the detection rates of prostate cancer (PCa) in men with Prostate Imaging-Reporting and Data System (PI-RADS) 3-5 abnormalities on 3-Tesla multiparametric (mp) magnetic resonance imaging (MRI) using in-bore MRI-guided biopsy compared with cognitively directed transperineal (cTP) biopsy and transrectal ultrasonography (cTRUS) biopsy. This was a retrospective single-centre study of consecutive men attending the private practice clinic of an experienced urologist performing MRI-guided biopsy and an experienced urologist performing cTP and cTRUS biopsy techniques for PI-RADS 3-5 lesions identified on 3-Tesla mpMRI. There were 595 target mpMRI lesions from 482 men with PI-RADS 3-5 regions of interest during 483 episodes of biopsy. The abnormal mpMRI target lesion was biopsied using the MRI-guided method for 298 biopsies, the cTP method for 248 biopsies and the cTRUS method for 49 biopsies. There were no significant differences in PCa detection among the three biopsy methods in PI-RADS 3 (48.9%, 40.0% and 44.4%, respectively), PI-RADS 4 (73.2%, 81.0% and 85.0%, respectively) or PI-RADS 5 (95.2, 92.0% and 95.0%, respectively) lesions, and there was no significant difference in detection of significant PCa among the biopsy methods in PI-RADS 3 (42.2%, 30.0% and 33.3%, respectively), PI-RADS 4 (66.8%, 66.0% and 80.0%, respectively) or PI-RADS 5 (90.5%, 89.8% and 90.0%, respectively) lesions. There were also no differences in PCa or significant PCa detection based on lesion location or size among the methods. We found no significant difference in the ability to detect PCa or significant PCa using targeted MRI-guided, cTP or cTRUS biopsy methods. Identification of an abnormal area on mpMRI appears to be more important in increasing the detection of PCa than the technique used to biopsy an MRI abnormality. © 2017 The Authors BJU International © 2017 BJU International Published by John Wiley & Sons Ltd.

  6. Experimental variability and data pre-processing as factors affecting the discrimination power of some chemometric approaches (PCA, CA and a new algorithm based on linear regression) applied to (+/-)ESI/MS and RPLC/UV data: Application on green tea extracts.

    PubMed

    Iorgulescu, E; Voicu, V A; Sârbu, C; Tache, F; Albu, F; Medvedovici, A

    2016-08-01

    The influence of the experimental variability (instrumental repeatability, instrumental intermediate precision and sample preparation variability) and data pre-processing (normalization, peak alignment, background subtraction) on the discrimination power of multivariate data analysis methods (Principal Component Analysis -PCA- and Cluster Analysis -CA-) as well as a new algorithm based on linear regression was studied. Data used in the study were obtained through positive or negative ion monitoring electrospray mass spectrometry (+/-ESI/MS) and reversed phase liquid chromatography/UV spectrometric detection (RPLC/UV) applied to green tea extracts. Extractions in ethanol and heated water infusion were used as sample preparation procedures. The multivariate methods were directly applied to mass spectra and chromatograms, involving strictly a holistic comparison of shapes, without assignment of any structural identity to compounds. An alternative data interpretation based on linear regression analysis mutually applied to data series is also discussed. Slopes, intercepts and correlation coefficients produced by the linear regression analysis applied on pairs of very large experimental data series successfully retain information resulting from high frequency instrumental acquisition rates, obviously better defining the profiles being compared. Consequently, each type of sample or comparison between samples produces in the Cartesian space an ellipsoidal volume defined by the normal variation intervals of the slope, intercept and correlation coefficient. Distances between volumes graphically illustrates (dis)similarities between compared data. The instrumental intermediate precision had the major effect on the discrimination power of the multivariate data analysis methods. Mass spectra produced through ionization from liquid state in atmospheric pressure conditions of bulk complex mixtures resulting from extracted materials of natural origins provided an excellent data basis for multivariate analysis methods, equivalent to data resulting from chromatographic separations. The alternative evaluation of very large data series based on linear regression analysis produced information equivalent to results obtained through application of PCA an CA. Copyright © 2016 Elsevier B.V. All rights reserved.

  7. Rapid analysis of sugars in honey by processing Raman spectrum using chemometric methods and artificial neural networks.

    PubMed

    Özbalci, Beril; Boyaci, İsmail Hakkı; Topcu, Ali; Kadılar, Cem; Tamer, Uğur

    2013-02-15

    The aim of this study was to quantify glucose, fructose, sucrose and maltose contents of honey samples using Raman spectroscopy as a rapid method. By performing a single measurement, quantifications of sugar contents have been said to be unaffordable according to the molecular similarities between sugar molecules in honey matrix. This bottleneck was overcome by coupling Raman spectroscopy with chemometric methods (principal component analysis (PCA) and partial least squares (PLS)) and an artificial neural network (ANN). Model solutions of four sugars were processed with PCA and significant separation was observed. This operation, done with the spectral features by using PLS and ANN methods, led to the discriminant analysis of sugar contents. Models/trained networks were created using a calibration data set and evaluated using a validation data set. The correlation coefficient values between actual and predicted values of glucose, fructose, sucrose and maltose were determined as 0.964, 0.965, 0.968 and 0.949 for PLS and 0.965, 0.965, 0.978 and 0.956 for ANN, respectively. The requirement of rapid analysis of sugar contents of commercial honeys has been met by the data processed within this article. Copyright © 2012 Elsevier Ltd. All rights reserved.

  8. Functional Data Analysis in NTCP Modeling: A New Method to Explore the Radiation Dose-Volume Effects

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Benadjaoud, Mohamed Amine, E-mail: mohamedamine.benadjaoud@gustaveroussy.fr; Université Paris sud, Le Kremlin-Bicêtre; Institut Gustave Roussy, Villejuif

    2014-11-01

    Purpose/Objective(s): To describe a novel method to explore radiation dose-volume effects. Functional data analysis is used to investigate the information contained in differential dose-volume histograms. The method is applied to the normal tissue complication probability modeling of rectal bleeding (RB) for patients irradiated in the prostatic bed by 3-dimensional conformal radiation therapy. Methods and Materials: Kernel density estimation was used to estimate the individual probability density functions from each of the 141 rectum differential dose-volume histograms. Functional principal component analysis was performed on the estimated probability density functions to explore the variation modes in the dose distribution. The functional principalmore » components were then tested for association with RB using logistic regression adapted to functional covariates (FLR). For comparison, 3 other normal tissue complication probability models were considered: the Lyman-Kutcher-Burman model, logistic model based on standard dosimetric parameters (LM), and logistic model based on multivariate principal component analysis (PCA). Results: The incidence rate of grade ≥2 RB was 14%. V{sub 65Gy} was the most predictive factor for the LM (P=.058). The best fit for the Lyman-Kutcher-Burman model was obtained with n=0.12, m = 0.17, and TD50 = 72.6 Gy. In PCA and FLR, the components that describe the interdependence between the relative volumes exposed at intermediate and high doses were the most correlated to the complication. The FLR parameter function leads to a better understanding of the volume effect by including the treatment specificity in the delivered mechanistic information. For RB grade ≥2, patients with advanced age are significantly at risk (odds ratio, 1.123; 95% confidence interval, 1.03-1.22), and the fits of the LM, PCA, and functional principal component analysis models are significantly improved by including this clinical factor. Conclusion: Functional data analysis provides an attractive method for flexibly estimating the dose-volume effect for normal tissues in external radiation therapy.« less

  9. Degradation trend estimation of slewing bearing based on LSSVM model

    NASA Astrophysics Data System (ADS)

    Lu, Chao; Chen, Jie; Hong, Rongjing; Feng, Yang; Li, Yuanyuan

    2016-08-01

    A novel prediction method is proposed based on least squares support vector machine (LSSVM) to estimate the slewing bearing's degradation trend with small sample data. This method chooses the vibration signal which contains rich state information as the object of the study. Principal component analysis (PCA) was applied to fuse multi-feature vectors which could reflect the health state of slewing bearing, such as root mean square, kurtosis, wavelet energy entropy, and intrinsic mode function (IMF) energy. The degradation indicator fused by PCA can reflect the degradation more comprehensively and effectively. Then the degradation trend of slewing bearing was predicted by using the LSSVM model optimized by particle swarm optimization (PSO). The proposed method was demonstrated to be more accurate and effective by the whole life experiment of slewing bearing. Therefore, it can be applied in engineering practice.

  10. Authentication of monofloral Yemeni Sidr honey using ultraviolet spectroscopy and chemometric analysis.

    PubMed

    Roshan, Abdul-Rahman A; Gad, Haidy A; El-Ahmady, Sherweit H; Khanbash, Mohamed S; Abou-Shoer, Mohamed I; Al-Azizi, Mohamed M

    2013-08-14

    This work describes a simple model developed for the authentication of monofloral Yemeni Sidr honey using UV spectroscopy together with chemometric techniques of hierarchical cluster analysis (HCA), principal component analysis (PCA), and soft independent modeling of class analogy (SIMCA). The model was constructed using 13 genuine Sidr honey samples and challenged with 25 honey samples of different botanical origins. HCA and PCA were successfully able to present a preliminary clustering pattern to segregate the genuine Sidr samples from the lower priced local polyfloral and non-Sidr samples. The SIMCA model presented a clear demarcation of the samples and was used to identify genuine Sidr honey samples as well as detect admixture with lower priced polyfloral honey by detection limits >10%. The constructed model presents a simple and efficient method of analysis and may serve as a basis for the authentication of other honey types worldwide.

  11. Prostate cancer gene 3 and multiparametric magnetic resonance can reduce unnecessary biopsies: decision curve analysis to evaluate predictive models.

    PubMed

    Busetto, Gian Maria; De Berardinis, Ettore; Sciarra, Alessandro; Panebianco, Valeria; Giovannone, Riccardo; Rosato, Stefano; D'Errigo, Paola; Di Silverio, Franco; Gentile, Vincenzo; Salciccia, Stefano

    2013-12-01

    To overcome the well-known prostate-specific antigen limits, several new biomarkers have been proposed. Since its introduction in clinical practice, the urinary prostate cancer gene 3 (PCA3) assay has shown promising results for prostate cancer (PC) detection. Furthermore, multiparametric magnetic resonance imaging (mMRI) has the ability to better describe several aspects of PC. A prospective study of 171 patients with negative prostate biopsy findings and a persistent high prostate-specific antigen level was conducted to assess the role of mMRI and PCA3 in identifying PC. All patients underwent the PCA3 test and mMRI before a second transrectal ultrasound-guided prostate biopsy. The accuracy and reliability of PCA3 (3 different cutoff points) and mMRI were evaluated. Four multivariate logistic regression models were analyzed, in terms of discrimination and the cost benefit, to assess the clinical role of PCA3 and mMRI in predicting the biopsy outcome. A decision curve analysis was also plotted. Repeated transrectal ultrasound-guided biopsy identified 68 new cases (41.7%) of PC. The sensitivity and specificity of the PCA3 test and mMRI was 68% and 49% and 74% and 90%, respectively. Evaluating the regression models, the best discrimination (area under the curve 0.808) was obtained using the full model (base clinical model plus mMRI and PCA3). The decision curve analysis, to evaluate the cost/benefit ratio, showed good performance in predicting PC with the model that included mMRI and PCA3. mMRI increased the accuracy and sensitivity of the PCA3 test, and the use of the full model significantly improved the cost/benefit ratio, avoiding unnecessary biopsies. Copyright © 2013 Elsevier Inc. All rights reserved.

  12. Prostate Cancer Associated Lipid Signatures in Serum Studied by ESI-Tandem Mass Spectrometryas Potential New Biomarkers.

    PubMed

    Duscharla, Divya; Bhumireddy, Sudarshana Reddy; Lakshetti, Sridhar; Pospisil, Heike; Murthy, P V L N; Walther, Reinhard; Sripadi, Prabhakar; Ummanni, Ramesh

    2016-01-01

    Prostate cancer (PCa) is one amongst the most common cancersin western men. Incidence rate ofPCa is on the rise worldwide. The present study deals with theserum lipidome profiling of patients diagnosed with PCa to identify potential new biomarkers. We employed ESI-MS/MS and GC-MS for identification of significantly altered lipids in cancer patient's serum compared to controls. Lipidomic data revealed 24 lipids are significantly altered in cancer patinet's serum (n = 18) compared to normal (n = 18) with no history of PCa. By using hierarchical clustering and principal component analysis (PCA) we could clearly separate cancer patients from control group. Correlation and partition analysis along with Formal Concept Analysis (FCA) have identified that PC (39:6) and FA (22:3) could classify samples with higher certainty. Both the lipids, PC (39:6) and FA (22:3) could influence the cataloging of patients with 100% sensitivity (all 18 control samples are classified correctly) and 77.7% specificity (of 18 tumor samples 4 samples are misclassified) with p-value of 1.612×10-6 in Fischer's exact test. Further, we performed GC-MS to denote fatty acids altered in PCa patients and found that alpha-linolenic acid (ALA) levels are altered in PCa. We also performed an in vitro proliferation assay to determine the effect of ALA in survival of classical human PCa cell lines LNCaP and PC3. We hereby report that the altered lipids PC (39:6) and FA (22:3) offer a new set of biomarkers in addition to the existing diagnostic tests that could significantly improve sensitivity and specificity in PCa diagnosis.

  13. Prostate Cancer Associated Lipid Signatures in Serum Studied by ESI-Tandem Mass Spectrometryas Potential New Biomarkers

    PubMed Central

    Duscharla, Divya; Bhumireddy, Sudarshana Reddy; Lakshetti, Sridhar; Pospisil, Heike; Murthy, P. V. L. N.; Walther, Reinhard; Sripadi, Prabhakar; Ummanni, Ramesh

    2016-01-01

    Prostate cancer (PCa) is one amongst the most common cancersin western men. Incidence rate ofPCa is on the rise worldwide. The present study deals with theserum lipidome profiling of patients diagnosed with PCa to identify potential new biomarkers. We employed ESI-MS/MS and GC-MS for identification of significantly altered lipids in cancer patient’s serum compared to controls. Lipidomic data revealed 24 lipids are significantly altered in cancer patinet’s serum (n = 18) compared to normal (n = 18) with no history of PCa. By using hierarchical clustering and principal component analysis (PCA) we could clearly separate cancer patients from control group. Correlation and partition analysis along with Formal Concept Analysis (FCA) have identified that PC (39:6) and FA (22:3) could classify samples with higher certainty. Both the lipids, PC (39:6) and FA (22:3) could influence the cataloging of patients with 100% sensitivity (all 18 control samples are classified correctly) and 77.7% specificity (of 18 tumor samples 4 samples are misclassified) with p-value of 1.612×10−6 in Fischer’s exact test. Further, we performed GC-MS to denote fatty acids altered in PCa patients and found that alpha-linolenic acid (ALA) levels are altered in PCa. We also performed an in vitro proliferation assay to determine the effect of ALA in survival of classical human PCa cell lines LNCaP and PC3. We hereby report that the altered lipids PC (39:6) and FA (22:3) offer a new set of biomarkers in addition to the existing diagnostic tests that could significantly improve sensitivity and specificity in PCa diagnosis. PMID:26958841

  14. FFT-enhanced IHS transform method for fusing high-resolution satellite images

    USGS Publications Warehouse

    Ling, Y.; Ehlers, M.; Usery, E.L.; Madden, M.

    2007-01-01

    Existing image fusion techniques such as the intensity-hue-saturation (IHS) transform and principal components analysis (PCA) methods may not be optimal for fusing the new generation commercial high-resolution satellite images such as Ikonos and QuickBird. One problem is color distortion in the fused image, which causes visual changes as well as spectral differences between the original and fused images. In this paper, a fast Fourier transform (FFT)-enhanced IHS method is developed for fusing new generation high-resolution satellite images. This method combines a standard IHS transform with FFT filtering of both the panchromatic image and the intensity component of the original multispectral image. Ikonos and QuickBird data are used to assess the FFT-enhanced IHS transform method. Experimental results indicate that the FFT-enhanced IHS transform method may improve upon the standard IHS transform and the PCA methods in preserving spectral and spatial information. ?? 2006 International Society for Photogrammetry and Remote Sensing, Inc. (ISPRS).

  15. Associations of Plasma Concentrations of Dichlorodiphenyldichloroethylene and Polychlorinated Biphenyls with Prostate Cancer: A Case–Control Study in Guadeloupe (French West Indies)

    PubMed Central

    Emeville, Elise; Giusti, Arnaud; Coumoul, Xavier; Thomé, Jean-Pierre; Blanchet, Pascal

    2014-01-01

    Background: Long-term exposure to persistent pollutants with hormonal properties (endocrine-disrupting chemicals; EDCs) may contribute to the risk of prostate cancer (PCa). However, epidemiological evidence remains limited. Objectives: We investigated the relationship between PCa and plasma concentrations of universally widespread pollutants, in particular p,p´-dichlorodiphenyl dichloroethene (DDE) and the non-dioxin-like polychlorinated biphenyl congener 153 (PCB-153). Methods: We evaluated 576 men with newly diagnosed PCa (before treatment) and 655 controls in Guadeloupe (French West Indies). Exposure was analyzed according to case–control status. Associations were assessed by unconditional logistic regression analysis, controlling for confounding factors. Missing data were handled by multiple imputation. Results: We estimated a significant positive association between DDE and PCa [adjusted odds ratio (OR) = 1.53; 95% CI: 1.02, 2.30 for the highest vs. lowest quintile of exposure; ptrend = 0.01]. PCB-153 was inversely associated with PCa (OR = 0.30; 95% CI: 0.19, 0.47 for the highest vs. lowest quintile of exposure values; ptrend < 0.001). Also, PCB-153 was more strongly associated with low-grade than with high-grade PCa. Conclusions: Associations of PCa with DDE and PCB-153 were in opposite directions. This may reflect differences in the mechanisms of action of these EDCs; and although our findings need to be replicated in other populations, they are consistent with complex effects of EDCs on human health. Citation: Emeville E, Giusti A, Coumoul X, Thomé JP, Blanchet P, Multigner L. 2015. Associations of plasma concentrations of dichlorodiphenyldichloroethylene and polychlorinated biphenyls with prostate cancer: a case–control study in Guadeloupe (French West Indies). Environ Health Perspect 123:317–323; http://dx.doi.org/10.1289/ehp.1408407 PMID:25493337

  16. Pretreatment tables predicting pathologic stage of locally advanced prostate cancer.

    PubMed

    Joniau, Steven; Spahn, Martin; Briganti, Alberto; Gandaglia, Giorgio; Tombal, Bertrand; Tosco, Lorenzo; Marchioro, Giansilvio; Hsu, Chao-Yu; Walz, Jochen; Kneitz, Burkhard; Bader, Pia; Frohneberg, Detlef; Tizzani, Alessandro; Graefen, Markus; van Cangh, Paul; Karnes, R Jeffrey; Montorsi, Francesco; van Poppel, Hein; Gontero, Paolo

    2015-02-01

    Pretreatment tables for the prediction of pathologic stage have been published and validated for localized prostate cancer (PCa). No such tables are available for locally advanced (cT3a) PCa. To construct tables predicting pathologic outcome after radical prostatectomy (RP) for patients with cT3a PCa with the aim to help guide treatment decisions in clinical practice. This was a multicenter retrospective cohort study including 759 consecutive patients with cT3a PCa treated with RP between 1987 and 2010. Retropubic RP and pelvic lymphadenectomy. Patients were divided into pretreatment prostate-specific antigen (PSA) and biopsy Gleason score (GS) subgroups. These parameters were used to construct tables predicting pathologic outcome and the presence of positive lymph nodes (LNs) after RP for cT3a PCa using ordinal logistic regression. In the model predicting pathologic outcome, the main effects of biopsy GS and pretreatment PSA were significant. A higher GS and/or higher PSA level was associated with a more unfavorable pathologic outcome. The validation procedure, using a repeated split-sample method, showed good predictive ability. Regression analysis also showed an increasing probability of positive LNs with increasing PSA levels and/or higher GS. Limitations of the study are the retrospective design and the long study period. These novel tables predict pathologic stage after RP for patients with cT3a PCa based on pretreatment PSA level and biopsy GS. They can be used to guide decision making in men with locally advanced PCa. Our study might provide physicians with a useful tool to predict pathologic stage in locally advanced prostate cancer that might help select patients who may need multimodal treatment. Copyright © 2014 European Association of Urology. Published by Elsevier B.V. All rights reserved.

  17. Raman spectroscopy for the characterization of different fractions of hemp essential oil extracted at 130 °C using steam distillation method

    NASA Astrophysics Data System (ADS)

    Hanif, Muhammad Asif; Nawaz, Haq; Naz, Saima; Mukhtar, Rubina; Rashid, Nosheen; Bhatti, Ijaz Ahmad; Saleem, Muhammad

    2017-07-01

    In this study, Raman spectroscopy along with Principal Component Analysis (PCA) is used for the characterization of pure essential oil (pure EO) isolated from the leaves of the Hemp (Cannabis sativa L.,) as well as its different fractions obtained by fractional distillation process. Raman spectra of pure Hemp essential oil and its different fractions show characteristic key bands of main volatile terpenes and terpenoids, which significantly differentiate them from each other. These bands provide information about the chemical composition of sample under investigation and hence can be used as Raman spectral markers for the qualitative monitoring of the pure EO and different fractions containing different active compounds. PCA differentiates the Raman spectral data into different clusters and loadings of the PCA further confirm the biological origin of the different fractions of the essential oil.

  18. Evaluation of three pumpkin species: correlation with physicochemical, antioxidant properties and classification using SPME-GC-MS and E-nose methods.

    PubMed

    Zhou, Chun-Li; Mi, Li; Hu, Xue-Yan; Zhu, Bi-Hua

    2017-09-01

    To ascertain the most discriminant variables for three pumpkin species principal component analysis (PCA) was performed. Twenty-four parameters (pH, conductivity, sucrose, glucose, total soluble solids, L* , a* , b* , individual weight, edible rate, firmness, citric acid, fumaric acid, l-ascorbic acid, malic acid, PPO activity, POD activity, total flavonoids, vitamin E, total phenolics, DPPH, FRAP, β-carotene, and aroma) were considered. The studied pumpkin species were Cucurbita maxima , Cucurbita moschata , and Cucurbita pepo . Three pumpkin species were classified by PCA based on aroma, physicochemical and antioxidant properties because the sum of PC1 and PC2 were both greater than 85% (85.06 and 93.64% respectively). Results were validated by the PCA and showed that PPO activity, total flavonoid, sucrose, glucose, TSS, a* , pH, malic acid, vitamin E, DPPH, FRAP and β-carotene, and aroma are highly useful parameters to classify pumpkin species.

  19. Decoupled ARX and RBF Neural Network Modeling Using PCA and GA Optimization for Nonlinear Distributed Parameter Systems.

    PubMed

    Zhang, Ridong; Tao, Jili; Lu, Renquan; Jin, Qibing

    2018-02-01

    Modeling of distributed parameter systems is difficult because of their nonlinearity and infinite-dimensional characteristics. Based on principal component analysis (PCA), a hybrid modeling strategy that consists of a decoupled linear autoregressive exogenous (ARX) model and a nonlinear radial basis function (RBF) neural network model are proposed. The spatial-temporal output is first divided into a few dominant spatial basis functions and finite-dimensional temporal series by PCA. Then, a decoupled ARX model is designed to model the linear dynamics of the dominant modes of the time series. The nonlinear residual part is subsequently parameterized by RBFs, where genetic algorithm is utilized to optimize their hidden layer structure and the parameters. Finally, the nonlinear spatial-temporal dynamic system is obtained after the time/space reconstruction. Simulation results of a catalytic rod and a heat conduction equation demonstrate the effectiveness of the proposed strategy compared to several other methods.

  20. SU-G-BRA-03: PCA Based Imaging Angle Optimization for 2D Cine MRI Based Radiotherapy Guidance

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, T; Yue, N; Jabbour, S

    2016-06-15

    Purpose: To develop an imaging angle optimization methodology for orthogonal 2D cine MRI based radiotherapy guidance using Principal Component Analysis (PCA) of target motion retrieved from 4DCT. Methods: We retrospectively analyzed 4DCT of 6 patients with lung tumor. A radiation oncologist manually contoured the target volume at the maximal inhalation phase of the respiratory cycle. An object constrained deformable image registration (DIR) method has been developed to track the target motion along the respiration at ten phases. The motion of the center of the target mass has been analyzed using the PCA to find out the principal motion components thatmore » were uncorrelated with each other. Two orthogonal image planes for cineMRI have been determined using this method to minimize the through plane motion during MRI based radiotherapy guidance. Results: 3D target respiratory motion for all 6 patients has been efficiently retrieved from 4DCT. In this process, the object constrained DIR demonstrated satisfactory accuracy and efficiency to enable the automatic motion tracking for clinical application. The average motion amplitude in the AP, lateral, and longitudinal directions were 3.6mm (min: 1.6mm, max: 5.6mm), 1.7mm (min: 0.6mm, max: 2.7mm), and 5.6mm (min: 1.8mm, max: 16.1mm), respectively. Based on PCA, the optimal orthogonal imaging planes were determined for cineMRI. The average angular difference between the PCA determined imaging planes and the traditional AP and lateral imaging planes were 47 and 31 degrees, respectively. After optimization, the average amplitude of through plane motion reduced from 3.6mm in AP images to 2.5mm (min:1.3mm, max:3.9mm); and from 1.7mm in lateral images to 0.6mm (min: 0.2mm, max:1.5mm), while the principal in plane motion amplitude increased from 5.6mm to 6.5mm (min: 2.8mm, max: 17mm). Conclusion: DIR and PCA can be used to optimize the orthogonal image planes of cineMRI to minimize the through plane motion during radiotherapy guidance.« less

  1. Benchmarking of data fusion algorithms in support of earth observation based Antarctic wildlife monitoring

    NASA Astrophysics Data System (ADS)

    Witharana, Chandi; LaRue, Michelle A.; Lynch, Heather J.

    2016-03-01

    Remote sensing is a rapidly developing tool for mapping the abundance and distribution of Antarctic wildlife. While both panchromatic and multispectral imagery have been used in this context, image fusion techniques have received little attention. We tasked seven widely-used fusion algorithms: Ehlers fusion, hyperspherical color space fusion, high-pass fusion, principal component analysis (PCA) fusion, University of New Brunswick fusion, and wavelet-PCA fusion to resolution enhance a series of single-date QuickBird-2 and Worldview-2 image scenes comprising penguin guano, seals, and vegetation. Fused images were assessed for spectral and spatial fidelity using a variety of quantitative quality indicators and visual inspection methods. Our visual evaluation elected the high-pass fusion algorithm and the University of New Brunswick fusion algorithm as best for manual wildlife detection while the quantitative assessment suggested the Gram-Schmidt fusion algorithm and the University of New Brunswick fusion algorithm as best for automated classification. The hyperspherical color space fusion algorithm exhibited mediocre results in terms of spectral and spatial fidelities. The PCA fusion algorithm showed spatial superiority at the expense of spectral inconsistencies. The Ehlers fusion algorithm and the wavelet-PCA algorithm showed the weakest performances. As remote sensing becomes a more routine method of surveying Antarctic wildlife, these benchmarks will provide guidance for image fusion and pave the way for more standardized products for specific types of wildlife surveys.

  2. Stream-based Hebbian eigenfilter for real-time neuronal spike discrimination

    PubMed Central

    2012-01-01

    Background Principal component analysis (PCA) has been widely employed for automatic neuronal spike sorting. Calculating principal components (PCs) is computationally expensive, and requires complex numerical operations and large memory resources. Substantial hardware resources are therefore needed for hardware implementations of PCA. General Hebbian algorithm (GHA) has been proposed for calculating PCs of neuronal spikes in our previous work, which eliminates the needs of computationally expensive covariance analysis and eigenvalue decomposition in conventional PCA algorithms. However, large memory resources are still inherently required for storing a large volume of aligned spikes for training PCs. The large size memory will consume large hardware resources and contribute significant power dissipation, which make GHA difficult to be implemented in portable or implantable multi-channel recording micro-systems. Method In this paper, we present a new algorithm for PCA-based spike sorting based on GHA, namely stream-based Hebbian eigenfilter, which eliminates the inherent memory requirements of GHA while keeping the accuracy of spike sorting by utilizing the pseudo-stationarity of neuronal spikes. Because of the reduction of large hardware storage requirements, the proposed algorithm can lead to ultra-low hardware resources and power consumption of hardware implementations, which is critical for the future multi-channel micro-systems. Both clinical and synthetic neural recording data sets were employed for evaluating the accuracy of the stream-based Hebbian eigenfilter. The performance of spike sorting using stream-based eigenfilter and the computational complexity of the eigenfilter were rigorously evaluated and compared with conventional PCA algorithms. Field programmable logic arrays (FPGAs) were employed to implement the proposed algorithm, evaluate the hardware implementations and demonstrate the reduction in both power consumption and hardware memories achieved by the streaming computing Results and discussion Results demonstrate that the stream-based eigenfilter can achieve the same accuracy and is 10 times more computationally efficient when compared with conventional PCA algorithms. Hardware evaluations show that 90.3% logic resources, 95.1% power consumption and 86.8% computing latency can be reduced by the stream-based eigenfilter when compared with PCA hardware. By utilizing the streaming method, 92% memory resources and 67% power consumption can be saved when compared with the direct implementation of GHA. Conclusion Stream-based Hebbian eigenfilter presents a novel approach to enable real-time spike sorting with reduced computational complexity and hardware costs. This new design can be further utilized for multi-channel neuro-physiological experiments or chronic implants. PMID:22490725

  3. Potential of non-invasive esophagus cancer detection based on urine surface-enhanced Raman spectroscopy

    NASA Astrophysics Data System (ADS)

    Huang, Shaohua; Wang, Lan; Chen, Weisheng; Feng, Shangyuan; Lin, Juqiang; Huang, Zufang; Chen, Guannan; Li, Buhong; Chen, Rong

    2014-11-01

    Non-invasive esophagus cancer detection based on urine surface-enhanced Raman spectroscopy (SERS) analysis was presented. Urine SERS spectra were measured on esophagus cancer patients (n = 56) and healthy volunteers (n = 36) for control analysis. Tentative assignments of the urine SERS spectra indicated some interesting esophagus cancer-specific biomolecular changes, including a decrease in the relative content of urea and an increase in the percentage of uric acid in the urine of esophagus cancer patients compared to that of healthy subjects. Principal component analysis (PCA) combined with linear discriminant analysis (LDA) was employed to analyze and differentiate the SERS spectra between normal and esophagus cancer urine. The diagnostic algorithms utilizing a multivariate analysis method achieved a diagnostic sensitivity of 89.3% and specificity of 83.3% for separating esophagus cancer samples from normal urine samples. These results from the explorative work suggested that silver nano particle-based urine SERS analysis coupled with PCA-LDA multivariate analysis has potential for non-invasive detection of esophagus cancer.

  4. Discovering phases, phase transitions, and crossovers through unsupervised machine learning: A critical examination

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hu, Wenjian; Singh, Rajiv R. P.; Scalettar, Richard T.

    Here, we apply unsupervised machine learning techniques, mainly principal component analysis (PCA), to compare and contrast the phase behavior and phase transitions in several classical spin models - the square and triangular-lattice Ising models, the Blume-Capel model, a highly degenerate biquadratic-exchange spin-one Ising (BSI) model, and the 2D XY model, and examine critically what machine learning is teaching us. We find that quantified principal components from PCA not only allow exploration of different phases and symmetry-breaking, but can distinguish phase transition types and locate critical points. We show that the corresponding weight vectors have a clear physical interpretation, which ismore » particularly interesting in the frustrated models such as the triangular antiferromagnet, where they can point to incipient orders. Unlike the other well-studied models, the properties of the BSI model are less well known. Using both PCA and conventional Monte Carlo analysis, we demonstrate that the BSI model shows an absence of phase transition and macroscopic ground-state degeneracy. The failure to capture the 'charge' correlations (vorticity) in the BSI model (XY model) from raw spin configurations points to some of the limitations of PCA. Finally, we employ a nonlinear unsupervised machine learning procedure, the 'antoencoder method', and demonstrate that it too can be trained to capture phase transitions and critical points.« less

  5. Discovering phases, phase transitions, and crossovers through unsupervised machine learning: A critical examination

    DOE PAGES

    Hu, Wenjian; Singh, Rajiv R. P.; Scalettar, Richard T.

    2017-06-19

    Here, we apply unsupervised machine learning techniques, mainly principal component analysis (PCA), to compare and contrast the phase behavior and phase transitions in several classical spin models - the square and triangular-lattice Ising models, the Blume-Capel model, a highly degenerate biquadratic-exchange spin-one Ising (BSI) model, and the 2D XY model, and examine critically what machine learning is teaching us. We find that quantified principal components from PCA not only allow exploration of different phases and symmetry-breaking, but can distinguish phase transition types and locate critical points. We show that the corresponding weight vectors have a clear physical interpretation, which ismore » particularly interesting in the frustrated models such as the triangular antiferromagnet, where they can point to incipient orders. Unlike the other well-studied models, the properties of the BSI model are less well known. Using both PCA and conventional Monte Carlo analysis, we demonstrate that the BSI model shows an absence of phase transition and macroscopic ground-state degeneracy. The failure to capture the 'charge' correlations (vorticity) in the BSI model (XY model) from raw spin configurations points to some of the limitations of PCA. Finally, we employ a nonlinear unsupervised machine learning procedure, the 'antoencoder method', and demonstrate that it too can be trained to capture phase transitions and critical points.« less

  6. Discovering phases, phase transitions, and crossovers through unsupervised machine learning: A critical examination

    NASA Astrophysics Data System (ADS)

    Hu, Wenjian; Singh, Rajiv R. P.; Scalettar, Richard T.

    2017-06-01

    We apply unsupervised machine learning techniques, mainly principal component analysis (PCA), to compare and contrast the phase behavior and phase transitions in several classical spin models—the square- and triangular-lattice Ising models, the Blume-Capel model, a highly degenerate biquadratic-exchange spin-1 Ising (BSI) model, and the two-dimensional X Y model—and we examine critically what machine learning is teaching us. We find that quantified principal components from PCA not only allow the exploration of different phases and symmetry-breaking, but they can distinguish phase-transition types and locate critical points. We show that the corresponding weight vectors have a clear physical interpretation, which is particularly interesting in the frustrated models such as the triangular antiferromagnet, where they can point to incipient orders. Unlike the other well-studied models, the properties of the BSI model are less well known. Using both PCA and conventional Monte Carlo analysis, we demonstrate that the BSI model shows an absence of phase transition and macroscopic ground-state degeneracy. The failure to capture the "charge" correlations (vorticity) in the BSI model (X Y model) from raw spin configurations points to some of the limitations of PCA. Finally, we employ a nonlinear unsupervised machine learning procedure, the "autoencoder method," and we demonstrate that it too can be trained to capture phase transitions and critical points.

  7. Computer-aided diagnosis of prostate cancer using a deep convolutional neural network from multiparametric MRI.

    PubMed

    Song, Yang; Zhang, Yu-Dong; Yan, Xu; Liu, Hui; Zhou, Minxiong; Hu, Bingwen; Yang, Guang

    2018-04-16

    Deep learning is the most promising methodology for automatic computer-aided diagnosis of prostate cancer (PCa) with multiparametric MRI (mp-MRI). To develop an automatic approach based on deep convolutional neural network (DCNN) to classify PCa and noncancerous tissues (NC) with mp-MRI. Retrospective. In all, 195 patients with localized PCa were collected from a PROSTATEx database. In total, 159/17/19 patients with 444/48/55 observations (215/23/23 PCas and 229/25/32 NCs) were randomly selected for training/validation/testing, respectively. T 2 -weighted, diffusion-weighted, and apparent diffusion coefficient images. A radiologist manually labeled the regions of interest of PCas and NCs and estimated the Prostate Imaging Reporting and Data System (PI-RADS) scores for each region. Inspired by VGG-Net, we designed a patch-based DCNN model to distinguish between PCa and NCs based on a combination of mp-MRI data. Additionally, an enhanced prediction method was used to improve the prediction accuracy. The performance of DCNN prediction was tested using a receiver operating characteristic (ROC) curve, and the area under the ROC curve (AUC), sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated. Moreover, the predicted result was compared with the PI-RADS score to evaluate its clinical value using decision curve analysis. Two-sided Wilcoxon signed-rank test with statistical significance set at 0.05. The DCNN produced excellent diagnostic performance in distinguishing between PCa and NC for testing datasets with an AUC of 0.944 (95% confidence interval: 0.876-0.994), sensitivity of 87.0%, specificity of 90.6%, PPV of 87.0%, and NPV of 90.6%. The decision curve analysis revealed that the joint model of PI-RADS and DCNN provided additional net benefits compared with the DCNN model and the PI-RADS scheme. The proposed DCNN-based model with enhanced prediction yielded high performance in statistical analysis, suggesting that DCNN could be used in computer-aided diagnosis (CAD) for PCa classification. 3 Technical Efficacy: Stage 2 J. Magn. Reson. Imaging 2018. © 2018 International Society for Magnetic Resonance in Medicine.

  8. Integrated analysis of epigenomic and genomic changes by DNA methylation dependent mechanisms provides potential novel biomarkers for prostate cancer.

    PubMed

    White-Al Habeeb, Nicole M A; Ho, Linh T; Olkhov-Mitsel, Ekaterina; Kron, Ken; Pethe, Vaijayanti; Lehman, Melanie; Jovanovic, Lidija; Fleshner, Neil; van der Kwast, Theodorus; Nelson, Colleen C; Bapat, Bharati

    2014-09-15

    Epigenetic silencing mediated by CpG methylation is a common feature of many cancers. Characterizing aberrant DNA methylation changes associated with tumor progression may identify potential prognostic markers for prostate cancer (PCa). We treated two PCa cell lines, 22Rv1 and DU-145 with the demethylating agent 5-Aza 2'-deoxycitidine (DAC) and global methylation status was analyzed by performing methylation-sensitive restriction enzyme based differential methylation hybridization strategy followed by genome-wide CpG methylation array profiling. In addition, we examined gene expression changes using a custom microarray. Gene Set Enrichment Analysis (GSEA) identified the most significantly dysregulated pathways. In addition, we assessed methylation status of candidate genes that showed reduced CpG methylation and increased gene expression after DAC treatment, in Gleason score (GS) 8 vs. GS6 patients using three independent cohorts of patients; the publically available The Cancer Genome Atlas (TCGA) dataset, and two separate patient cohorts. Our analysis, by integrating methylation and gene expression in PCa cell lines, combined with patient tumor data, identified novel potential biomarkers for PCa patients. These markers may help elucidate the pathogenesis of PCa and represent potential prognostic markers for PCa patients.

  9. Whole milk intake is associated with prostate cancer-specific mortality among U.S. male physicians.

    PubMed

    Song, Yan; Chavarro, Jorge E; Cao, Yin; Qiu, Weiliang; Mucci, Lorelei; Sesso, Howard D; Stampfer, Meir J; Giovannucci, Edward; Pollak, Michael; Liu, Simin; Ma, Jing

    2013-02-01

    Previous studies have associated higher milk intake with greater prostate cancer (PCa) incidence, but little data are available concerning milk types and the relation between milk intake and risk of fatal PCa. We investigated the association between intake of dairy products and the incidence and survival of PCa during a 28-y follow-up. We conducted a cohort study in the Physicians' Health Study (n = 21,660) and a survival analysis among the incident PCa cases (n = 2806). Information on dairy product consumption was collected at baseline. PCa cases and deaths (n = 305) were confirmed during follow-up. The intake of total dairy products was associated with increased PCa incidence [HR = 1.12 (95% CI: 0.93, 1.35); >2.5 servings/d vs. ≤0.5 servings/d]. Skim/low-fat milk intake was positively associated with risk of low-grade, early stage, and screen-detected cancers, whereas whole milk intake was associated only with fatal PCa [HR = 1.49 (95% CI: 0.97, 2.28); ≥237 mL/d (1 serving/d) vs. rarely consumed]. In the survival analysis, whole milk intake remained associated with risk of progression to fatal disease after diagnosis [HR = 2.17 (95% CI: 1.34, 3.51)]. In this prospective cohort, higher intake of skim/low-fat milk was associated with a greater risk of nonaggressive PCa. Most importantly, only whole milk was consistently associated with higher incidence of fatal PCa in the entire cohort and higher PCa-specific mortality among cases. These findings add further evidence to suggest the potential role of dairy products in the development and prognosis of PCa.

  10. Activation of Beta-Catenin Signaling in Androgen Receptor–Negative Prostate Cancer Cells

    PubMed Central

    Wan, Xinhai; Liu, Jie; Lu, Jing-Fang; Tzelepi, Vassiliki; Yang, Jun; Starbuck, Michael W.; Diao, Lixia; Wang, Jing; Efstathiou, Eleni; Vazquez, Elba S.; Troncoso, Patricia; Maity, Sankar N.; Navone, Nora M.

    2012-01-01

    Purpose To study Wnt/beta-catenin in castrate-resistant prostate cancer (CRPC) and understand its function independently of the beta-catenin–androgen receptor (AR) interaction. Experimental Design We performed beta-catenin immunocytochemical analysis, evaluated TOP-flash reporter activity (a reporter of beta-catenin–mediated transcription), and sequenced the beta-catenin gene in MDA PCa 118a, MDA PCa 118b, MDA PCa 2b, and PC-3 prostate cancer (PCa) cells. We knocked down beta-catenin in AR-negative MDA PCa 118b cells and performed comparative gene-array analysis. We also immunohistochemically analyzed beta-catenin and AR in 27 bone metastases of human CRPCs. Results Beta-catenin nuclear accumulation and TOP-flash reporter activity were high in MDA PCa 118b but not in MDA PCa 2b or PC-3 cells. MDA PCa 118a and 118b cells carry a mutated beta-catenin at codon 32 (D32G). Ten genes were expressed differently (false discovery rate, 0.05) in MDA PCa 118b cells with downregulated beta-catenin. One such gene, hyaluronan synthase 2 (HAS2), synthesizes hyaluronan, a core component of the extracellular matrix. We confirmed HAS2 upregulation in PC-3 cells transfected with D32G-mutant beta-catenin. Finally, we found nuclear localization of beta-catenin in 10 of 27 human tissue specimens; this localization was inversely associated with AR expression (P = 0.056, Fisher’s exact test), suggesting that reduced AR expression enables Wnt/beta-catenin signaling. Conclusion We identified a previously unknown downstream target of beta-catenin, HAS2, in PCa, and found that high beta-catenin nuclear localization and low or no AR expression may define a subpopulation of men with bone-metastatic PCa. These findings may guide physicians in managing these patients. PMID:22298898

  11. Association of microRNA-21 expression with clinicopathological characteristics and the risk of progression in advanced prostate cancer patients receiving androgen deprivation therapy.

    PubMed

    Guan, Yangbo; Wu, You; Liu, Yifei; Ni, Jian; Nong, Shaojun

    2016-08-01

    Despite androgen deprivation therapy (ADT) remains the mainstay therapy for advanced prostate cancer (PCa), the patients have widely variable durations of response to ADT. Unfortunately, there is limited knowledge of pre-treatment prognostic factors for response to ADT. Recently, microRNA-21 (miR-21) has been reported to play an important role in development of castration resistance of CaP. However, little is known about the expression of miR-21 in advanced PCa biopsy tissues, and data on its potential predictive value in advanced PCa are completely lacking. In this study, paraffin-embedded prostate carcinoma tissues obtained by needle biopsy from 85 advanced PCa patients were evaluated for the expression levels of miR-21 by quantitative real-time PCR (qRT-PCR). In situ hybridization (ISH) analysis was performed to further confirm the qRT-PCR results. Kaplan-Meier analysis and Cox proportional hazards regression models were performed to investigate the correlation between miR-21 expression and time to progression of advanced PCa patients. Compared with adjacent non-cancerous prostate tissues, the expression level of miR-21 was significantly increased in PCa tissues (PCa vs. non-cancerous prostate: 1.3273 ± 0.3207 vs. 0.9970 ± 0.2054, P < 0.001). By and large, in ISH analysis miR-21 was expressed at a higher level in tumor areas than in adjacent non-cancerous areas. Additionally, PCa patients with higher expression of miR-21 were significantly more likely to be of high Gleason score and high clinical stage (P < 0.05). There was no significant association between miR-21 expression and the initial prostate-specific antigen (PSA) level or age at diagnosis. Moreover, Kaplan-Meier survival analysis found that PCa patients with high miR-21 expression have shorter progression-free survival than those with low miR-21 expression. Furthermore, Multivariate Cox analysis revealed both miR-21 expression status (P = 0.040) and clinical stage (P = 0.042) were all independent predictive factor for progression-free survival for advanced PCa. These findings suggest for the first time that the up-regulation of miR-21 may serve as an independent predictor of progress-free survival in patients with advanced PCa. Prostate 76:986-993, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  12. Combinations of elevated tissue miRNA-17-92 cluster expression and serum prostate-specific antigen as potential diagnostic biomarkers for prostate cancer.

    PubMed

    Feng, Sujuan; Qian, Xiaosong; Li, Han; Zhang, Xiaodong

    2017-12-01

    The aim of the present study was to investigate the effectiveness of the miR-17-92 cluster as a disease progression marker in prostate cancer (PCa). Reverse transcription-quantitative polymerase chain reaction analysis was used to detect the microRNA (miR)-17-92 cluster expression levels in tissues from patients with PCa or benign prostatic hyperplasia (BPH), in addition to in PCa and BPH cell lines. Spearman correlation was used for comparison and estimation of correlations between miRNA expression levels and clinicopathological characteristics such as the Gleason score and prostate-specific antigen (PSA). Receiver operating curve (ROC) analysis was performed for evaluation of specificity and sensitivity of miR-17-92 cluster expression levels for discriminating patients with PCa from patients with BPH. Kaplan-Meier analysis was plotted to investigate the predictive potential of miR-17-92 cluster for PCa biochemical recurrence. Expression of the majority of miRNAs in the miR-17-92 cluster was identified to be significantly increased in PCa tissues and cell lines. Bivariate correlation analysis indicated that the high expression of unregulated miRNAs was positively correlated with Gleason grade, but had no significant association with PSA. ROC curves demonstrated that high expression of miR-17-92 cluster predicted a higher diagnostic accuracy compared with PSA. Improved discriminating quotients were observed when combinations of unregulated miRNAs with PSA were used. Survival analysis confirmed a high combined miRNA score of miR-17-92 cluster was associated with shorter biochemical recurrence interval. miR-17-92 cluster could be a potential diagnostic and prognostic biomarker for PCa, and the combination of the miR-17-92 cluster and serum PSA may enhance the accuracy for diagnosis of PCa.

  13. The Northern Norway Mother-and-Child Contaminant Cohort (MISA) Study: PCA analyses of environmental contaminants in maternal sera and dietary intake in early pregnancy.

    PubMed

    Veyhe, Anna Sofía; Hofoss, Dag; Hansen, Solrunn; Thomassen, Yngvar; Sandanger, Torkjel M; Odland, Jon Øyvind; Nieboer, Evert

    2015-03-01

    Although predictors of contaminants in serum or whole blood are usually examined by chemical groups (e.g., POPs, toxic and/or essential elements; dietary sources), principal component analysis (PCA) permits consideration of both individual substances and combined variables. Our study had two primary objectives: (i) Characterize the sources and predictors of a suite of eight PCBs, four organochlorine (OC) pesticides, five essential and five toxic elements in serum and/or whole blood of pregnant women recruited as part of the Mother-and-Child Contaminant Cohort Study conducted in Northern Norway (The MISA study); and (ii) determine the influence of personal and social characteristics on both dietary and contaminant factors. Recruitment and sampling started in May 2007 and continued for the next 31 months until December 2009. Blood/serum samples were collected during the 2nd trimester (mean: 18.2 weeks, range 9.0-36.0). A validated questionnaire was administered to obtain personal information. The samples were analysed by established laboratories employing verified methods and reference standards. PCA involved Varimax rotation, and significant predictors (p≤0.05) in linear regression models were included in the multivariable linear regression analysis. When considering all the contaminants, three prominent PCA axes stood out with prominent loadings of: all POPs; arsenic, selenium and mercury; and cadmium and lead. Respectively, in the multivariate models the following were predictors: maternal age, parity and consumption of freshwater fish and land-based wild animals; marine fish; cigarette smoking, dietary PCA axes reflecting consumption of grains and cereals, and food items involving hunting. PCA of only the POPs separated them into two axes that, in terms of recently published findings, could be understood to reflect longitudinal trends and their relative contributions to summed POPs. The linear combinations of variables generated by PCA identified prominent dietary sources of OC groups and of prominent toxic elements and highlighted the importance of maternal characteristics. Copyright © 2014 Elsevier GmbH. All rights reserved.

  14. A reduced basis method for molecular dynamics simulation

    NASA Astrophysics Data System (ADS)

    Vincent-Finley, Rachel Elisabeth

    In this dissertation, we develop a method for molecular simulation based on principal component analysis (PCA) of a molecular dynamics trajectory and least squares approximation of a potential energy function. Molecular dynamics (MD) simulation is a computational tool used to study molecular systems as they evolve through time. With respect to protein dynamics, local motions, such as bond stretching, occur within femtoseconds, while rigid body and large-scale motions, occur within a range of nanoseconds to seconds. To capture motion at all levels, time steps on the order of a femtosecond are employed when solving the equations of motion and simulations must continue long enough to capture the desired large-scale motion. To date, simulations of solvated proteins on the order of nanoseconds have been reported. It is typically the case that simulations of a few nanoseconds do not provide adequate information for the study of large-scale motions. Thus, the development of techniques that allow longer simulation times can advance the study of protein function and dynamics. In this dissertation we use principal component analysis (PCA) to identify the dominant characteristics of an MD trajectory and to represent the coordinates with respect to these characteristics. We augment PCA with an updating scheme based on a reduced representation of a molecule and consider equations of motion with respect to the reduced representation. We apply our method to butane and BPTI and compare the results to standard MD simulations of these molecules. Our results indicate that the molecular activity with respect to our simulation method is analogous to that observed in the standard MD simulation with simulations on the order of picoseconds.

  15. A theoretical study in extracting the essential features and dynamics of molecular motions: Intrinsic geometry methods for PF(5) pseudorotations and statistical methods for argon clusters

    NASA Astrophysics Data System (ADS)

    Panahi, Nima S.

    We studied the problem of understanding and computing the essential features and dynamics of molecular motions through the development of two theories for two different systems. First, we studied the process of the Berry Pseudorotation of PF5 and the rotations it induces in the molecule through its natural and intrinsic geometric nature by setting it in the language of fiber bundles and graph theory. With these tools, we successfully extracted the essentials of the process' loops and induced rotations. The infinite number of pseudorotation loops were broken down into a small set of essential loops called "super loops", with their intrinsic properties and link to the physical movements of the molecule extensively studied. In addition, only the three "self-edge loops" generated any induced rotations, and then only a finite number of classes of them. Second, we studied applying the statistical methods of Principal Components Analysis (PCA) and Principal Coordinate Analysis (PCO) to capture only the most important changes in Argon clusters so as to reduce computational costs and graph the potential energy surface (PES) in three dimensions respectively. Both methods proved successful, but PCA was only partially successful since one will only see advantages for PES database systems much larger than those both currently being studied and those that can be computationally studied in the next few decades to come. In addition, PCA is only needed for the very rare case of a PES database that does not already include Hessian eigenvalues.

  16. Principal component analysis of chemical shift perturbation data of a multiple-ligand-binding system for elucidation of respective binding mechanism.

    PubMed

    Konuma, Tsuyoshi; Lee, Young-Ho; Goto, Yuji; Sakurai, Kazumasa

    2013-01-01

    Chemical shift perturbations (CSPs) in NMR spectra provide useful information about the interaction of a protein with its ligands. However, in a multiple-ligand-binding system, determining quantitative parameters such as a dissociation constant (K(d) ) is difficult. Here, we used a method we named CS-PCA, a principal component analysis (PCA) of chemical shift (CS) data, to analyze the interaction between bovine β-lactoglobulin (βLG) and 1-anilinonaphthalene-8-sulfonate (ANS), which is a multiple-ligand-binding system. The CSP on the binding of ANS involved contributions from two distinct binding sites. PCA of the titration data successfully separated the CSP pattern into contributions from each site. Docking simulations based on the separated CSP patterns provided the structures of βLG-ANS complexes for each binding site. In addition, we determined the K(d) values as 3.42 × 10⁻⁴ M² and 2.51 × 10⁻³ M for Sites 1 and 2, respectively. In contrast, it was difficult to obtain reliable K(d) values for respective sites from the isothermal titration calorimetry experiments. Two ANS molecules were found to bind at Site 1 simultaneously, suggesting that the binding occurs cooperatively with a partial unfolding of the βLG structure. On the other hand, the binding of ANS to Site 2 was a simple attachment without a significant conformational change. From the present results, CS-PCA was confirmed to provide not only the positions and the K(d) values of binding sites but also information about the binding mechanism. Thus, it is anticipated to be a general method to investigate protein-ligand interactions. Copyright © 2012 Wiley Periodicals, Inc.

  17. Modeling Pair Distribution Functions of Rare-Earth Phosphate Glasses Using Principal Component Analysis.

    PubMed

    Cole, Jacqueline M; Cheng, Xie; Payne, Michael C

    2016-11-07

    The use of principal component analysis (PCA) to statistically infer features of local structure from experimental pair distribution function (PDF) data is assessed on a case study of rare-earth phosphate glasses (REPGs). Such glasses, codoped with two rare-earth ions (R and R') of different sizes and optical properties, are of interest to the laser industry. The determination of structure-property relationships in these materials is an important aspect of their technological development. Yet, realizing the local structure of codoped REPGs presents significant challenges relative to their singly doped counterparts; specifically, R and R' are difficult to distinguish in terms of establishing relative material compositions, identifying atomic pairwise correlation profiles in a PDF that are associated with each ion, and resolving peak overlap of such profiles in PDFs. This study demonstrates that PCA can be employed to help overcome these structural complications, by statistically inferring trends in PDFs that exist for a restricted set of experimental data on REPGs, and using these as training data to predict material compositions and PDF profiles in unknown codoped REPGs. The application of these PCA methods to resolve individual atomic pairwise correlations in t(r) signatures is also presented. The training methods developed for these structural predictions are prevalidated by testing their ability to reproduce known physical phenomena, such as the lanthanide contraction, on PDF signatures of the structurally simpler singly doped REPGs. The intrinsic limitations of applying PCA to analyze PDFs relative to the quality control of source data, data processing, and sample definition, are also considered. While this case study is limited to lanthanide-doped REPGs, this type of statistical inference may easily be extended to other inorganic solid-state materials and be exploited in large-scale data-mining efforts that probe many t(r) functions.

  18. Combining ANOVA-PCA with POCHEMON to analyse micro-organism development in a polymicrobial environment.

    PubMed

    Geurts, Brigitte P; Neerincx, Anne H; Bertrand, Samuel; Leemans, Manja A A P; Postma, Geert J; Wolfender, Jean-Luc; Cristescu, Simona M; Buydens, Lutgarde M C; Jansen, Jeroen J

    2017-04-22

    Revealing the biochemistry associated to micro-organismal interspecies interactions is highly relevant for many purposes. Each pathogen has a characteristic metabolic fingerprint that allows identification based on their unique multivariate biochemistry. When pathogen species come into mutual contact, their co-culture will display a chemistry that may be attributed both to mixing of the characteristic chemistries of the mono-cultures and to competition between the pathogens. Therefore, investigating pathogen development in a polymicrobial environment requires dedicated chemometric methods to untangle and focus upon these sources of variation. The multivariate data analysis method Projected Orthogonalised Chemical Encounter Monitoring (POCHEMON) is dedicated to highlight metabolites characteristic for the interaction of two micro-organisms in co-culture. However, this approach is currently limited to a single time-point, while development of polymicrobial interactions may be highly dynamic. A well-known multivariate implementation of Analysis of Variance (ANOVA) uses Principal Component Analysis (ANOVA-PCA). This allows the overall dynamics to be separated from the pathogen-specific chemistry to analyse the contributions of both aspects separately. For this reason, we propose to integrate ANOVA-PCA with the POCHEMON approach to disentangle the pathogen dynamics and the specific biochemistry in interspecies interactions. Two complementary case studies show great potential for both liquid and gas chromatography - mass spectrometry to reveal novel information on chemistry specific to interspecies interaction during pathogen development. Copyright © 2017 The Author(s). Published by Elsevier B.V. All rights reserved.

  19. Progress Towards Improved Analysis of TES X-ray Data Using Principal Component Analysis

    NASA Technical Reports Server (NTRS)

    Busch, S. E.; Adams, J. S.; Bandler, S. R.; Chervenak, J. A.; Eckart, M. E.; Finkbeiner, F. M.; Fixsen, D. J.; Kelley, R. L.; Kilbourne, C. A.; Lee, S.-J.; hide

    2015-01-01

    The traditional method of applying a digital optimal filter to measure X-ray pulses from transition-edge sensor (TES) devices does not achieve the best energy resolution when the signals have a highly non-linear response to energy, or the noise is non-stationary during the pulse. We present an implementation of a method to analyze X-ray data from TESs, which is based upon principal component analysis (PCA). Our method separates the X-ray signal pulse into orthogonal components that have the largest variance. We typically recover pulse height, arrival time, differences in pulse shape, and the variation of pulse height with detector temperature. These components can then be combined to form a representation of pulse energy. An added value of this method is that by reporting information on more descriptive parameters (as opposed to a single number representing energy), we generate a much more complete picture of the pulse received. Here we report on progress in developing this technique for future implementation on X-ray telescopes. We used an 55Fe source to characterize Mo/Au TESs. On the same dataset, the PCA method recovers a spectral resolution that is better by a factor of two than achievable with digital optimal filters.

  20. Apo adenylate kinase encodes its holo form: a principal component and varimax analysis.

    PubMed

    Cukier, Robert I

    2009-02-12

    Adenylate kinase undergoes large-scale motions of its LID and AMP-binding (AMPbd) domains when its apo, open form closes over its substrates, AMP and Mg2+-ATP. It may be an example of an enzyme that provides an ensemble of conformations in its apo state from which its substrates can select and bind to produce catalytically competent conformations. In this work, the fluctuations of the enzyme apo Escherichia coli adenylate kinase (AKE) are obtained with molecular dynamics. The resulting trajectory is analyzed with principal component analysis (PCA) that decomposes the atom motions into orthogonal modes ordered by their decreasing contributions to the total protein fluctuation. In apo AKE, a small set of the PCA modes describes the bulk of the fluctuations. Identification of the atom motions that are important contributors to these modes is improved with the use of a varimax rotation method that rotates the PCA modes to a new mode set that concentrates the atom contributions to a smaller set of atoms in these new modes. In this way, the nature of the important motions of the LID and AMPbd domains are clarified. The dominant PCA modes are used to investigate if apo AKE can fluctuate to conformations that are holo-like, even though the apo trajectory is mainly confined to a region around the initial apo structure. This is accomplished by expressing the difference between the protein coordinates, obtained from the holo and apo crystal structures, using as a basis the PCA modes from the apo AKE trajectory. The coherent motion described by a small set of the apo PCA modes is shown to be able to produce protein conformations that are quite similar to the holo conformation of the protein. In this sense, apo AKE does encode in its fluctuations information about holo-like conformations.

  1. Decreased expression of serine protease inhibitor family G1 (SERPING1) in prostate cancer can help distinguish high-risk prostate cancer and predicts malignant progression.

    PubMed

    Peng, Shengmeng; Du, Tao; Wu, Wanhua; Chen, Xianju; Lai, Yiming; Zhu, Dingjun; Wang, Qiong; Ma, Xiaoming; Lin, Chunhao; Li, Zean; Guo, Zhenghui; Huang, Hai

    2018-06-11

    The aim of this study was to investigate the associations of serine proteinase inhibitor family G1 (SERPING1) down-regulation with poor prognosis in patients with prostate cancer (PCa). Furthermore, we aim to find more novel and effective PCa molecular markers to provide an early screening of PCa, distinguish patients with aggressive PCa, predict the prognosis, or reduce the economic burden of PCa. SERPING1 protein expression in both human PCa and normal prostate tissues was detected by immunohistochemical staining, which intensity was analyzed in association with clinical pathological parameters such Gleason score, pathological grade, clinical stage, tumor stage, lymph node metastasis, and distant metastasis. Moreover, we used The Cancer Genome Atlas (TCGA) Database, Taylor Database, and Oncomine dataset to validate our immunohistochemical results and investigated the value of SERPING1 in PCa at mRNA level. Kaplan-Meier analysis and Cox regression analysis were performed to evaluate the relationship between SERPING1 and prognosis of patients with PCa. The outcome showed that SERPING1 was expressed mainly in cytoplasm of grand cells of prostate tissue and was significantly expressed less in PCa (P<0.001). Furthermore, in the tissue microarray of our samples, decreasing expression of SERPING1 was correlated with the higher Gleason score (P = 0.004), the higher pathological grade (P = 0.01) and the advanced tumor stage (P = 0.005) at protein level. In TCGA dataset and Taylor Dataset, low-expressed SERPING1 was correlated with the younger patient (P = 0.02 in TCGA, P = 0.044 in Taylor) and the higher Gleason score (P = 0.019 in TCGA, P<0.001 in Taylor) at mRNA level. Kaplan-Meier analysis revealed that the lower mRNA of SERPING1 predicted lower overall survivals (P = 0.027 in TCGA), lower disease-free survival (P = 0.029) and lower biochemical recurrence-free survival (P = 0.011 in Taylor). Data from Oncomine database shown that SERPING1 low expression implying higher malignancy of prostate lesions. Using multivariate analysis, we also found that SERPING1 expression was independent prognostic marker of poor disease-free survival and biochemical recurrence-free survival. SERPING1 may play an important role in PCa and can be serve as a novel marker in diagnosis and prognostic prediction in PCa. In addition, levels of SERPING1 can help identify low-risk prostate to provide reference for patients with PCa to accept active surveillance and reduce overtreatment. Copyright © 2018 Elsevier Inc. All rights reserved.

  2. Predicting prostate biopsy outcome: prostate health index (phi) and prostate cancer antigen 3 (PCA3) are useful biomarkers.

    PubMed

    Ferro, Matteo; Bruzzese, Dario; Perdonà, Sisto; Mazzarella, Claudia; Marino, Ada; Sorrentino, Alessandra; Di Carlo, Angelina; Autorino, Riccardo; Di Lorenzo, Giuseppe; Buonerba, Carlo; Altieri, Vincenzo; Mariano, Angela; Macchia, Vincenzo; Terracciano, Daniela

    2012-08-16

    Indication for prostate biopsy is presently mainly based on prostate-specific antigen (PSA) serum levels and digital-rectal examination (DRE). In view of the unsatisfactory accuracy of these two diagnostic exams, research has focused on novel markers to improve pre-biopsy prostate cancer detection, such as phi and PCA3. The purpose of this prospective study was to assess the diagnostic accuracy of phi and PCA3 for prostate cancer using biopsy as gold standard. Phi index (Beckman coulter immunoassay), PCA3 score (Progensa PCA3 assay) and other established biomarkers (tPSA, fPSA and %fPSA) were assessed before a 18-core prostate biopsy in a group of 251 subjects at their first biopsy. Values of %p2PSA and phi were significantly higher in patients with PCa compared with PCa-negative group (p<0.001) and also compared with high grade prostatic intraepithelial neoplasia (HGPIN) (p<0.001). PCA3 score values were significantly higher in PCa compared with PCa-negative subjects (p<0.001) and in HGPIN vs PCa-negative patients (p<0.001). ROC curve analysis showed that %p2PSA, phi and PCA3 are predictive of malignancy. In conclusion, %p2PSA, phi and PCA3 may predict a diagnosis of PCa in men undergoing their first prostate biopsy. PCA3 score is more useful in discriminating between HGPIN and non-cancer. Copyright © 2012 Elsevier B.V. All rights reserved.

  3. A pilot study assessing the association between paraoxonase 1 gene polymorphism and prostate cancer

    PubMed Central

    Uluocak, Nihat; Atılgan, Doğan; Parlaktaş, Bekir Süha; Erdemir, Fikret; Ateş, Ömer

    2017-01-01

    Objective We aimed to show the relationship between paraoxonase 1 (PON1) gene polymorphism and the development of prostate cancer (PCa). Material and methods We investigated the association of single nuclotide polymorphisms of PON1 enzyme with the development of PCa risk. A total of 147 male patients were divided into PCa, and control groups. The control group was also divided into two subgroups according to serum prostate specific antigen (PSA) levels as non PCa-high PSA (>4 ng/mL) and non PCa-low PSA (≤4 ng/mL) groups. Results The mean ages of the patients were 64.81 years, 63.27 years and 64.22 years in PCa group, non PCa-low PSA and non PCa –high PSA groups, respectively. The mean PSA levels were 10.9 ng/mL, 1.16 ng/mL and 6.63 ng/mL for PCa group, non PCa –low PSA and non PCa –high PSA groups, respectively. In terms of PON1 polymorphisms and allele frequencies, there were no statistically significant differences between PCa and control groups. There was not a statistically significant difference between PCa and non PCa-high PSA groups as for genotypic and allelic frequencies. As a result of this small sample sized hypothetical study of polymorphism, a relationship could not be detected between PCa development and PON1 gene polymorphism. Conclusion According to the results of this preliminary study, it is thought that more comprehensive future studies are necessary to clarify the possible role of PON1 gene polymorphism in the etiology of PCa. PMID:28861298

  4. Selected questions on biomechanical exposures for surveillance of upper-limb work-related musculoskeletal disorders

    PubMed Central

    Descatha, Alexis; Roquelaure, Yves; Evanoff, Bradley; Niedhammer, Isabelle; Chastang, Jean François; Mariot, Camille; Ha, Catherine; Imbernon, Ellen; Goldberg, Marcel; Leclerc, Annette

    2007-01-01

    Objective Questionnaires for assessment of biomechanical exposure are frequently used in surveillance programs, though few studies have evaluated which key questions are needed. We sought to reduce the number of variables on a surveillance questionnaire by identifying which variables best summarized biomechanical exposure in a survey of the French working population. Methods We used data from the 2002–2003 French experimental network of Upper-limb work-related musculoskeletal disorders (UWMSD), performed on 2685 subjects in which 37 variables assessing biomechanical exposures were available (divided into four ordinal categories, according to the task frequency or duration). Principal Component Analysis (PCA) with orthogonal rotation was performed on these variables. Variables closely associated with factors issued from PCA were retained, except those highly correlated to another variable (rho>0.70). In order to study the relevance of the final list of variables, correlations between a score based on retained variables (PCA score) and the exposure score suggested by the SALTSA group were calculated. The associations between the PCA score and the prevalence of UWMSD were also studied. In a final step, we added back to the list a few variables not retained by PCA, because of their established recognition as risk factors. Results According to the results of the PCA, seven interpretable factors were identified: posture exposures, repetitiveness, handling of heavy loads, distal biomechanical exposures, computer use, forklift operator specific task, and recovery time. Twenty variables strongly correlated with the factors obtained from PCA were retained. The PCA score was strongly correlated both with the SALTSA score and with UWMSD prevalence (p<0.0001). In the final step, six variables were reintegrated. Conclusion Twenty-six variables out of 37 were efficiently selected according to their ability to summarize major biomechanical constraints in a working population, with an approach combining statistical analyses and existing knowledge. PMID:17476519

  5. Recursive approach of EEG-segment-based principal component analysis substantially reduces cryogenic pump artifacts in simultaneous EEG-fMRI data.

    PubMed

    Kim, Hyun-Chul; Yoo, Seung-Schik; Lee, Jong-Hwan

    2015-01-01

    Electroencephalography (EEG) data simultaneously acquired with functional magnetic resonance imaging (fMRI) data are preprocessed to remove gradient artifacts (GAs) and ballistocardiographic artifacts (BCAs). Nonetheless, these data, especially in the gamma frequency range, can be contaminated by residual artifacts produced by mechanical vibrations in the MRI system, in particular the cryogenic pump that compresses and transports the helium that chills the magnet (the helium-pump). However, few options are available for the removal of helium-pump artifacts. In this study, we propose a recursive approach of EEG-segment-based principal component analysis (rsPCA) that enables the removal of these helium-pump artifacts. Using the rsPCA method, feature vectors representing helium-pump artifacts were successfully extracted as eigenvectors, and the reconstructed signals of the feature vectors were subsequently removed. A test using simultaneous EEG-fMRI data acquired from left-hand (LH) and right-hand (RH) clenching tasks performed by volunteers found that the proposed rsPCA method substantially reduced helium-pump artifacts in the EEG data and significantly enhanced task-related gamma band activity levels (p=0.0038 and 0.0363 for LH and RH tasks, respectively) in EEG data that have had GAs and BCAs removed. The spatial patterns of the fMRI data were estimated using a hemodynamic response function (HRF) modeled from the estimated gamma band activity in a general linear model (GLM) framework. Active voxel clusters were identified in the post-/pre-central gyri of motor area, only from the rsPCA method (uncorrected p<0.001 for both LH/RH tasks). In addition, the superior temporal pole areas were consistently observed (uncorrected p<0.001 for the LH task and uncorrected p<0.05 for the RH task) in the spatial patterns of the HRF model for gamma band activity when the task paradigm and movement were also included in the GLM. Copyright © 2014 Elsevier Inc. All rights reserved.

  6. Chemometric and multivariate statistical analysis of time-of-flight secondary ion mass spectrometry spectra from complex Cu-Fe sulfides.

    PubMed

    Kalegowda, Yogesh; Harmer, Sarah L

    2012-03-20

    Time-of-flight secondary ion mass spectrometry (TOF-SIMS) spectra of mineral samples are complex, comprised of large mass ranges and many peaks. Consequently, characterization and classification analysis of these systems is challenging. In this study, different chemometric and statistical data evaluation methods, based on monolayer sensitive TOF-SIMS data, have been tested for the characterization and classification of copper-iron sulfide minerals (chalcopyrite, chalcocite, bornite, and pyrite) at different flotation pulp conditions (feed, conditioned feed, and Eh modified). The complex mass spectral data sets were analyzed using the following chemometric and statistical techniques: principal component analysis (PCA); principal component-discriminant functional analysis (PC-DFA); soft independent modeling of class analogy (SIMCA); and k-Nearest Neighbor (k-NN) classification. PCA was found to be an important first step in multivariate analysis, providing insight into both the relative grouping of samples and the elemental/molecular basis for those groupings. For samples exposed to oxidative conditions (at Eh ~430 mV), each technique (PCA, PC-DFA, SIMCA, and k-NN) was found to produce excellent classification. For samples at reductive conditions (at Eh ~ -200 mV SHE), k-NN and SIMCA produced the most accurate classification. Phase identification of particles that contain the same elements but a different crystal structure in a mixed multimetal mineral system has been achieved.

  7. Intraindividual Comparison of 18F-PSMA-1007 PET/CT, Multiparametric MRI, and Radical Prostatectomy Specimens in Patients with Primary Prostate Cancer: A Retrospective, Proof-of-Concept Study.

    PubMed

    Kesch, Claudia; Vinsensia, Maria; Radtke, Jan P; Schlemmer, Heinz P; Heller, Martina; Ellert, Elena; Holland-Letz, Tim; Duensing, Stefan; Grabe, Nils; Afshar-Oromieh, Ali; Wieczorek, Kathrin; Schäfer, Martin; Neels, Oliver C; Cardinale, Jens; Kratochwil, Clemens; Hohenfellner, Markus; Kopka, Klaus; Haberkorn, Uwe; Hadaschik, Boris A; Giesel, Frederik L

    2017-11-01

    68 Ga-prostate-specific membrane antigen (PSMA)-11 PET/CT represents an advanced method for the staging of primary prostate cancer (PCa) and diagnosis of recurrent or metastatic PCa. However, because of the narrow availability of 68 Ga the development of alternative tracers is of high interest. The objective of this study was to examine the value of the new PET tracer 18 F-PSMA-1007 for the staging of local disease by comparing it with multiparametric MRI (mpMRI) and radical prostatectomy (RP) histopathology. Methods: In 2016, 18 F-PSMA-1007 PET/CT was performed in 10 men with biopsy-confirmed high-risk PCa. Nine patients underwent mpMRI in the process of primary diagnosis. Consecutively, RP was performed in all 10 men. Agreement analysis was performed retrospectively. PSMA staining was added for representative sections in RP specimen slices. Localization and agreement analysis of 18 F-PSMA-1007 PET/CT, mpMRI, and RP specimens was performed by dividing the prostate into 38 sections as described in the prostate imaging reporting and data system (PI-RADS) (version 2). Sensitivity, specificity, positive predictive values, negative predictive values (NPVs), and accuracy were calculated for total and near-total agreement. Results: 18 F-PSMA-1007 PET/CT had an NPV of 68% and an accuracy of 75%, and mpMRI had an NPV of 88% and an accuracy of 73% for total agreement. Near-total agreement analysis resulted in an NPV of 91% and an accuracy of 93% for 18 F-PSMA-1007 PET/CT and 91% and 87% for mpMRI, respectively. Retrospective combination of mpMRI and PET/CT had an accuracy of 81% for total and 93% for near-total agreement. Conclusion: Comparison with RP histopathology demonstrates that 18 F-PSMA-1007 PET/CT is promising for accurate local staging of PCa. © 2017 by the Society of Nuclear Medicine and Molecular Imaging.

  8. Radar target classification method with high accuracy and decision speed performance using MUSIC spectrum vectors and PCA projection

    NASA Astrophysics Data System (ADS)

    Secmen, Mustafa

    2011-10-01

    This paper introduces the performance of an electromagnetic target recognition method in resonance scattering region, which includes pseudo spectrum Multiple Signal Classification (MUSIC) algorithm and principal component analysis (PCA) technique. The aim of this method is to classify an "unknown" target as one of the "known" targets in an aspect-independent manner. The suggested method initially collects the late-time portion of noise-free time-scattered signals obtained from different reference aspect angles of known targets. Afterward, these signals are used to obtain MUSIC spectrums in real frequency domain having super-resolution ability and noise resistant feature. In the final step, PCA technique is applied to these spectrums in order to reduce dimensionality and obtain only one feature vector per known target. In the decision stage, noise-free or noisy scattered signal of an unknown (test) target from an unknown aspect angle is initially obtained. Subsequently, MUSIC algorithm is processed for this test signal and resulting test vector is compared with feature vectors of known targets one by one. Finally, the highest correlation gives the type of test target. The method is applied to wire models of airplane targets, and it is shown that it can tolerate considerable noise levels although it has a few different reference aspect angles. Besides, the runtime of the method for a test target is sufficiently low, which makes the method suitable for real-time applications.

  9. Improved Statistical Fault Detection Technique and Application to Biological Phenomena Modeled by S-Systems.

    PubMed

    Mansouri, Majdi; Nounou, Mohamed N; Nounou, Hazem N

    2017-09-01

    In our previous work, we have demonstrated the effectiveness of the linear multiscale principal component analysis (PCA)-based moving window (MW)-generalized likelihood ratio test (GLRT) technique over the classical PCA and multiscale principal component analysis (MSPCA)-based GLRT methods. The developed fault detection algorithm provided optimal properties by maximizing the detection probability for a particular false alarm rate (FAR) with different values of windows, and however, most real systems are nonlinear, which make the linear PCA method not able to tackle the issue of non-linearity to a great extent. Thus, in this paper, first, we apply a nonlinear PCA to obtain an accurate principal component of a set of data and handle a wide range of nonlinearities using the kernel principal component analysis (KPCA) model. The KPCA is among the most popular nonlinear statistical methods. Second, we extend the MW-GLRT technique to one that utilizes exponential weights to residuals in the moving window (instead of equal weightage) as it might be able to further improve fault detection performance by reducing the FAR using exponentially weighed moving average (EWMA). The developed detection method, which is called EWMA-GLRT, provides improved properties, such as smaller missed detection and FARs and smaller average run length. The idea behind the developed EWMA-GLRT is to compute a new GLRT statistic that integrates current and previous data information in a decreasing exponential fashion giving more weight to the more recent data. This provides a more accurate estimation of the GLRT statistic and provides a stronger memory that will enable better decision making with respect to fault detection. Therefore, in this paper, a KPCA-based EWMA-GLRT method is developed and utilized in practice to improve fault detection in biological phenomena modeled by S-systems and to enhance monitoring process mean. The idea behind a KPCA-based EWMA-GLRT fault detection algorithm is to combine the advantages brought forward by the proposed EWMA-GLRT fault detection chart with the KPCA model. Thus, it is used to enhance fault detection of the Cad System in E. coli model through monitoring some of the key variables involved in this model such as enzymes, transport proteins, regulatory proteins, lysine, and cadaverine. The results demonstrate the effectiveness of the proposed KPCA-based EWMA-GLRT method over Q , GLRT, EWMA, Shewhart, and moving window-GLRT methods. The detection performance is assessed and evaluated in terms of FAR, missed detection rates, and average run length (ARL 1 ) values.

  10. [Discrimination of varieties of borneol using terahertz spectra based on principal component analysis and support vector machine].

    PubMed

    Li, Wu; Hu, Bing; Wang, Ming-wei

    2014-12-01

    In the present paper, the terahertz time-domain spectroscopy (THz-TDS) identification model of borneol based on principal component analysis (PCA) and support vector machine (SVM) was established. As one Chinese common agent, borneol needs a rapid, simple and accurate detection and identification method for its different source and being easily confused in the pharmaceutical and trade links. In order to assure the quality of borneol product and guard the consumer's right, quickly, efficiently and correctly identifying borneol has significant meaning to the production and transaction of borneol. Terahertz time-domain spectroscopy is a new spectroscopy approach to characterize material using terahertz pulse. The absorption terahertz spectra of blumea camphor, borneol camphor and synthetic borneol were measured in the range of 0.2 to 2 THz with the transmission THz-TDS. The PCA scores of 2D plots (PC1 X PC2) and 3D plots (PC1 X PC2 X PC3) of three kinds of borneol samples were obtained through PCA analysis, and both of them have good clustering effect on the 3 different kinds of borneol. The value matrix of the first 10 principal components (PCs) was used to replace the original spectrum data, and the 60 samples of the three kinds of borneol were trained and then the unknown 60 samples were identified. Four kinds of support vector machine model of different kernel functions were set up in this way. Results show that the accuracy of identification and classification of SVM RBF kernel function for three kinds of borneol is 100%, and we selected the SVM with the radial basis kernel function to establish the borneol identification model, in addition, in the noisy case, the classification accuracy rates of four SVM kernel function are above 85%, and this indicates that SVM has strong generalization ability. This study shows that PCA with SVM method of borneol terahertz spectroscopy has good classification and identification effects, and provides a new method for species identification of borneol in Chinese medicine.

  11. Modelling the habitat suitability of cetaceans: Example of the sperm whale in the northwestern Mediterranean Sea

    NASA Astrophysics Data System (ADS)

    Praca, Emilie; Gannier, Alexandre; Das, Krishna; Laran, Sophie

    2009-04-01

    Cetaceans are mobile and spend long periods underwater. Because of this, modelling their habitat could be subject to a serious problem of false absence. Furthermore, extensive surveys at sea are time and money consuming, and presence-absence data are difficult to apply. This study compares the ability of two presence-absence and two presence-only habitat modelling methods and uses the example of the sperm whale ( Physeter macrocephalus) in the northwestern Mediterranean Sea. The data consist of summer visual and acoustical detections of sperm whales, compiled between 1998 and 2005. Habitat maps were computed using topographical and hydrological eco-geographical variables. Four methods were compared: principal component analysis (PCA), ecological niche factor analysis (ENFA), generalized linear model (GLM) and multivariate adaptive regression splines (MARS). The evaluation of the models was achieved by calculating the receiver operating characteristic (ROC) of the models and their respective area under the curve (AUC). Presence-absence methods (GLM, AUC=0.70, and MARS, AUC=0.79) presented better AUC than presence-only methods (PCA, AUC=0.58, and ENFA, AUC=0.66), but this difference was not statistically significant, except between the MARS and the PCA models. The four models showed an influence of both topographical and hydrological factors, but the resulting habitat suitability maps differed. The core habitat on the continental slope was well highlighted by the four models, while GLM and MARS maps also showed a suitable habitat in the offshore waters. Presence-absence methods are therefore recommended for modelling the habitat suitability of cetaceans, as they seem more accurate to highlight complex habitat. However, the use of presence-only techniques, in particular ENFA, could be very useful for a first model of the habitat range or when important surveys at sea are not possible.

  12. Differential distribution of sperm subpopulations and incidence of pleiomorphisms in ejaculates of captive howling monkeys ( Alouatta caraya)

    NASA Astrophysics Data System (ADS)

    Valle, R. R.; Carvalho, F. M.; Muniz, J. A. P. C.; Leal, C. L. V.; García-Herreros, M.

    2013-10-01

    The aim of this study was to develop an objective method to determine the incidence of pleiomorphisms and its influence on the distribution of sperm morphometric subpopulations in ejaculates of howling monkeys ( Alouatta caraya) by using a combination of computerized analysis system (ASMA) and principal component analysis (PCA) methods. Ejaculates were collected by electroejaculation methods on a regular basis from five individuals maintained under identical captive environmental, nutritional, and management conditions. Each sperm head was measured for dimensional parameters (Area [ A, (square micrometers)], Perimeter [ P, (micrometers)], Length [ L, (micrometers)], and Width [ W, (micrometers)]) and shape-derived parameters (Ellipticity [( L/ W)], Elongation [( L - W)/( L + W)], and Rugosity [(4л A/ P 2)]). PCA revealed two principal components explaining more than the 96 % of the variance. Clustering methods and discriminant analyzes were performed and seven separate subpopulations were identified. There were differences ( P < 0.001) in the distribution of the seven subpopulations as well as in the incidence of abnormal pleiomorphisms (58.6 %, 49.8 %, 35.1 %, 66.4 %, and 55.1 %, P < 0.05) among the five donors tested. Our results indicated that differences among individuals related to the incidence of pleiomorphisms, and sperm subpopulational structure was not related to the captivity conditions or the sperm collection method, since all individuals were studied under identical conditions. In conclusion, the combination of ASMA and PCA is a useful clinical diagnostic resource for detecting deficiencies in sperm morphology and sperm subpopulations in A. caraya ejaculates that could be used in ex situ conservation programs of threatened species in Alouatta genus or even other endangered neotropical primate species.

  13. Imaging Prostate Cancer (Pca) Phenotype and Evolution

    DTIC Science & Technology

    2014-10-01

    Extracellular flux analysis experiments with the Seahorse system showed a marked decrease in OCR after inhibition of ATP synthase by oligomycin...measured in each well 34 h after seeding the cells, using the Seahorse extracellular flux analyzer, as also described in Methods section. OCR

  14. The application of compound-specific isotope analysis of fatty acids for traceability of sea cucumber (Apostichopus japonicus) in the coastal areas of China.

    PubMed

    Liu, Yu; Zhang, Xufeng; Li, Ying; Wang, Haixia

    2017-11-01

    Geographical origin traceability is an important issue for controlling the quality of seafood and safeguarding the interest of consumers. In the present study, a new method of compound-specific isotope analysis (CSIA) of fatty acids was established to evaluate its applicability in establishing the origin traceability of Apostichopus japonicus in the coastal areas of China. Moreover, principal component analysis (PCA) and discriminant analysis (DA) were applied to distinguish between the origins of A. japonicus. The results show that the stable carbon isotope compositions of fatty acids of A. japonicus significantly differ in terms of both season and origin. They also indicate that the stable carbon isotope composition of fatty acids could effectively discriminate between the origins of A. japonicus, except for between Changhai Island and Zhangzi Island in the spring of 2016 because of geographical proximity or the similarity of food sources. The fatty acids that have the highest contribution to identifying the geographical origins of A. japonicus are C22:6n-3, C16:1n-7, C20:5n-3, C18:0 and C23:1n-9, when considering the fatty acid contents, the stable carbon isotope composition of fatty acids and the results of the PCA and DA. We conclude that CSIA of fatty acids, combined with multivariate statistical analysis such as PCA and DA, may be an effective tool for establishing the traceability of A. japonicus in the coastal areas of China. The relevant conclusions of the present study provide a new method for determining the traceability of seafood or other food products. © 2017 Society of Chemical Industry. © 2017 Society of Chemical Industry.

  15. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Dhou, S; Cai, W; Hurwitz, M

    Purpose: The goal of this study is to quantify the interfraction reproducibility of patient-specific motion models derived from 4DCBCT acquired on the day of treatment of lung cancer stereotactic body radiotherapy (SBRT) patients. Methods: Motion models are derived from patient 4DCBCT images acquired daily over 3–5 fractions of treatment by 1) applying deformable image registration between each 4DCBCT image and a reference phase from that day, resulting in a set of displacement vector fields (DVFs), and 2) performing principal component analysis (PCA) on the DVFs to derive a motion model. The motion model from the first day of treatment ismore » compared to motion models from each successive day of treatment to quantify variability in motion models generated from different days. Four SBRT patient datasets have been acquired thus far in this IRB approved study. Results: Fraction-specific motion models for each fraction and patient were derived and PCA eigenvectors and their associated eigenvalues are compared for each fraction. For the first patient dataset, the average root mean square error between the first two eigenvectors associated with the highest two eigenvalues, in four fractions was 0.1, while it was 0.25 between the last three PCA eigenvectors associated with the lowest three eigenvalues. It was found that the eigenvectors and eigenvalues of PCA motion models for each treatment fraction have variations and the first few eigenvectors are shown to be more stable across treatment fractions than others. Conclusion: Analysis of this dataset showed that the first two eigenvectors of the PCA patient-specific motion models derived from 4DCBCT were stable over the course of several treatment fractions. The third, fourth, and fifth eigenvectors had larger variations.« less

  16. Quantitative thickness prediction of tectonically deformed coal using Extreme Learning Machine and Principal Component Analysis: a case study

    NASA Astrophysics Data System (ADS)

    Wang, Xin; Li, Yan; Chen, Tongjun; Yan, Qiuyan; Ma, Li

    2017-04-01

    The thickness of tectonically deformed coal (TDC) has positive correlation associations with gas outbursts. In order to predict the TDC thickness of coal beds, we propose a new quantitative predicting method using an extreme learning machine (ELM) algorithm, a principal component analysis (PCA) algorithm, and seismic attributes. At first, we build an ELM prediction model using the PCA attributes of a synthetic seismic section. The results suggest that the ELM model can produce a reliable and accurate prediction of the TDC thickness for synthetic data, preferring Sigmoid activation function and 20 hidden nodes. Then, we analyze the applicability of the ELM model on the thickness prediction of the TDC with real application data. Through the cross validation of near-well traces, the results suggest that the ELM model can produce a reliable and accurate prediction of the TDC. After that, we use 250 near-well traces from 10 wells to build an ELM predicting model and use the model to forecast the TDC thickness of the No. 15 coal in the study area using the PCA attributes as the inputs. Comparing the predicted results, it is noted that the trained ELM model with two selected PCA attributes yields better predication results than those from the other combinations of the attributes. Finally, the trained ELM model with real seismic data have a different number of hidden nodes (10) than the trained ELM model with synthetic seismic data. In summary, it is feasible to use an ELM model to predict the TDC thickness using the calculated PCA attributes as the inputs. However, the input attributes, the activation function and the number of hidden nodes in the ELM model should be selected and tested carefully based on individual application.

  17. The application of near infrared (NIR) spectroscopy to inorganic preservative-treated wood

    Treesearch

    Chi-Leung So; Stan T. Lebow; Leslie H. Groom; Timothy G. Rials

    2004-01-01

    There is a growing need to find a rapid, inexpensive, and reliable method to distinguish between treated and untreated waste wood. This paper evaluates the ability of near infrared (NIR) spectroscopy with multivariate analysis (MVA) to distinguish preservative types and retentions. It is demonstrated that principal component analysis (PCA) can differentiate lumber...

  18. Characterization and forensic analysis of soil samples using laser-induced breakdown spectroscopy (LIBS).

    PubMed

    Jantzi, Sarah C; Almirall, José R

    2011-07-01

    A method for the quantitative elemental analysis of surface soil samples using laser-induced breakdown spectroscopy (LIBS) was developed and applied to the analysis of bulk soil samples for discrimination between specimens. The use of a 266 nm laser for LIBS analysis is reported for the first time in forensic soil analysis. Optimization of the LIBS method is discussed, and the results compared favorably to a laser ablation inductively coupled plasma mass spectrometry (LA-ICP-MS) method previously developed. Precision for both methods was <10% for most elements. LIBS limits of detection were <33 ppm and bias <40% for most elements. In a proof of principle study, the LIBS method successfully discriminated samples from two different sites in Dade County, FL. Analysis of variance, Tukey's post hoc test and Student's t test resulted in 100% discrimination with no type I or type II errors. Principal components analysis (PCA) resulted in clear groupings of the two sites. A correct classification rate of 99.4% was obtained with linear discriminant analysis using leave-one-out validation. Similar results were obtained when the same samples were analyzed by LA-ICP-MS, showing that LIBS can provide similar information to LA-ICP-MS. In a forensic sampling/spatial heterogeneity study, the variation between sites, between sub-plots, between samples and within samples was examined on three similar Dade sites. The closer the sampling locations, the closer the grouping on a PCA plot and the higher the misclassification rate. These results underscore the importance of careful sampling for geographic site characterization.

  19. Using recurrence plot analysis for software execution interpretation and fault detection

    NASA Astrophysics Data System (ADS)

    Mosdorf, M.

    2015-09-01

    This paper shows a method targeted at software execution interpretation and fault detection using recurrence plot analysis. In in the proposed approach recurrence plot analysis is applied to software execution trace that contains executed assembly instructions. Results of this analysis are subject to further processing with PCA (Principal Component Analysis) method that simplifies number coefficients used for software execution classification. This method was used for the analysis of five algorithms: Bubble Sort, Quick Sort, Median Filter, FIR, SHA-1. Results show that some of the collected traces could be easily assigned to particular algorithms (logs from Bubble Sort and FIR algorithms) while others are more difficult to distinguish.

  20. The Effect of Phenazine-1-Carboxylic Acid on Mycelial Growth of Botrytis cinerea Produced by Pseudomonas aeruginosa LV Strain

    PubMed Central

    Simionato, Ane S.; Navarro, Miguel O. P.; de Jesus, Maria L. A.; Barazetti, André R.; da Silva, Caroline S.; Simões, Glenda C.; Balbi-Peña, Maria I.; de Mello, João C. P.; Panagio, Luciano A.; de Almeida, Ricardo S. C.; Andrade, Galdino; de Oliveira, Admilton G.

    2017-01-01

    One of the most important postharvest plant pathogens that affect strawberries, grapes and tomatoes is Botrytis cinerea, known as gray mold. The fungus remains in latent form until spore germination conditions are good, making infection control difficult, causing great losses in the whole production chain. This study aimed to purify and identify phenazine-1-carboxylic acid (PCA) produced by the Pseudomonas aeruginosa LV strain and to determine its antifungal activity against B. cinerea. The compounds produced were extracted with dichloromethane and passed through a chromatographic process. The purity level of PCA was determined by reversed-phase high-performance liquid chromatography semi-preparative. The structure of PCA was confirmed by nuclear magnetic resonance and electrospray ionization mass spectrometry. Antifungal activity was determined by the dry paper disk and minimum inhibitory concentration (MIC) methods and identified by scanning electron microscopy and confocal microscopy. The results showed that PCA inhibited mycelial growth, where MIC was 25 μg mL-1. Microscopic analysis revealed a reduction in exopolysaccharide (EPS) formation, showing distorted and damaged hyphae of B. cinerea. The results suggested that PCA has a high potential in the control of B. cinerea and inhibition of EPS (important virulence factor). This natural compound is a potential alternative to postharvest control of gray mold disease. PMID:28659907

  1. High Performance Parallel Architectures

    NASA Technical Reports Server (NTRS)

    El-Ghazawi, Tarek; Kaewpijit, Sinthop

    1998-01-01

    Traditional remote sensing instruments are multispectral, where observations are collected at a few different spectral bands. Recently, many hyperspectral instruments, that can collect observations at hundreds of bands, have been operational. Furthermore, there have been ongoing research efforts on ultraspectral instruments that can produce observations at thousands of spectral bands. While these remote sensing technology developments hold great promise for new findings in the area of Earth and space science, they present many challenges. These include the need for faster processing of such increased data volumes, and methods for data reduction. Dimension Reduction is a spectral transformation, aimed at concentrating the vital information and discarding redundant data. One such transformation, which is widely used in remote sensing, is the Principal Components Analysis (PCA). This report summarizes our progress on the development of a parallel PCA and its implementation on two Beowulf cluster configuration; one with fast Ethernet switch and the other with a Myrinet interconnection. Details of the implementation and performance results, for typical sets of multispectral and hyperspectral NASA remote sensing data, are presented and analyzed based on the algorithm requirements and the underlying machine configuration. It will be shown that the PCA application is quite challenging and hard to scale on Ethernet-based clusters. However, the measurements also show that a high- performance interconnection network, such as Myrinet, better matches the high communication demand of PCA and can lead to a more efficient PCA execution.

  2. Analysis of serum from type II diabetes mellitus and diabetic complication using surface-enhanced Raman spectra (SERS)

    NASA Astrophysics Data System (ADS)

    Han, H. W.; Yan, X. L.; Dong, R. X.; Ban, G.; Li, K.

    2009-03-01

    In this paper, we show surface-enhanced Raman spectra (SERS) of serums from type II diabetes mellitus and diabetic complication (coronary disease, glaucoma and cerebral infarction), and analyze the SERS through the multivariate statistical methods of principal component analysis (PCA). In particular, we find that there exist many adenines in these serums, which maybe come from DNA (RNA) damage. The relative intensity of the band at 725±2 cm-1 assigned to adenine is higher for patients than for the healthy volunteers; therefore, it can be used as an important ‘fingerprint’ in order to diagnose these diseases. It is also shown that serums from type II diabetes mellitus group, diabetic complication group and healthy volunteers group can be discriminated by PCA.

  3. Discrimination of chicken seasonings and beef seasonings using electronic nose and sensory evaluation.

    PubMed

    Tian, Huaixiang; Li, Fenghua; Qin, Lan; Yu, Haiyan; Ma, Xia

    2014-11-01

    This study examines the feasibility of electronic nose as a method to discriminate chicken and beef seasonings and to predict sensory attributes. Sensory evaluation showed that 8 chicken seasonings and 4 beef seasonings could be well discriminated and classified based on 8 sensory attributes. The sensory attributes including chicken/beef, gamey, garlic, spicy, onion, soy sauce, retention, and overall aroma intensity were generated by a trained evaluation panel. Principal component analysis (PCA), discriminant factor analysis (DFA), and cluster analysis (CA) combined with electronic nose were used to discriminate seasoning samples based on the difference of the sensor response signals of chicken and beef seasonings. The correlation between sensory attributes and electronic nose sensors signal was established using partial least squares regression (PLSR) method. The results showed that the seasoning samples were all correctly classified by the electronic nose combined with PCA, DFA, and CA. The electronic nose gave good prediction results for all the sensory attributes with correlation coefficient (r) higher than 0.8. The work indicated that electronic nose is an effective method for discriminating different seasonings and predicting sensory attributes. © 2014 Institute of Food Technologists®

  4. Fourier transform infrared spectroscopy combined with chemometrics for discrimination of Curcuma longa, Curcuma xanthorrhiza and Zingiber cassumunar

    NASA Astrophysics Data System (ADS)

    Rohaeti, Eti; Rafi, Mohamad; Syafitri, Utami Dyah; Heryanto, Rudi

    2015-02-01

    Turmeric (Curcuma longa), java turmeric (Curcuma xanthorrhiza) and cassumunar ginger (Zingiber cassumunar) are widely used in traditional Indonesian medicines (jamu). They have similar color for their rhizome and possess some similar uses, so it is possible to substitute one for the other. The identification and discrimination of these closely-related plants is a crucial task to ensure the quality of the raw materials. Therefore, an analytical method which is rapid, simple and accurate for discriminating these species using Fourier transform infrared spectroscopy (FTIR) combined with some chemometrics methods was developed. FTIR spectra were acquired in the mid-IR region (4000-400 cm-1). Standard normal variate, first and second order derivative spectra were compared for the spectral data. Principal component analysis (PCA) and canonical variate analysis (CVA) were used for the classification of the three species. Samples could be discriminated by visual analysis of the FTIR spectra by using their marker bands. Discrimination of the three species was also possible through the combination of the pre-processed FTIR spectra with PCA and CVA, in which CVA gave clearer discrimination. Subsequently, the developed method could be used for the identification and discrimination of the three closely-related plant species.

  5. Efficient principal component analysis for multivariate 3D voxel-based mapping of brain functional imaging data sets as applied to FDG-PET and normal aging.

    PubMed

    Zuendorf, Gerhard; Kerrouche, Nacer; Herholz, Karl; Baron, Jean-Claude

    2003-01-01

    Principal component analysis (PCA) is a well-known technique for reduction of dimensionality of functional imaging data. PCA can be looked at as the projection of the original images onto a new orthogonal coordinate system with lower dimensions. The new axes explain the variance in the images in decreasing order of importance, showing correlations between brain regions. We used an efficient, stable and analytical method to work out the PCA of Positron Emission Tomography (PET) images of 74 normal subjects using [(18)F]fluoro-2-deoxy-D-glucose (FDG) as a tracer. Principal components (PCs) and their relation to age effects were investigated. Correlations between the projections of the images on the new axes and the age of the subjects were carried out. The first two PCs could be identified as being the only PCs significantly correlated to age. The first principal component, which explained 10% of the data set variance, was reduced only in subjects of age 55 or older and was related to loss of signal in and adjacent to ventricles and basal cisterns, reflecting expected age-related brain atrophy with enlarging CSF spaces. The second principal component, which accounted for 8% of the total variance, had high loadings from prefrontal, posterior parietal and posterior cingulate cortices and showed the strongest correlation with age (r = -0.56), entirely consistent with previously documented age-related declines in brain glucose utilization. Thus, our method showed that the effect of aging on brain metabolism has at least two independent dimensions. This method should have widespread applications in multivariate analysis of brain functional images. Copyright 2002 Wiley-Liss, Inc.

  6. Quantitative comparison of caffeoylquinic acids and flavonoids in Chrysanthemum morifolium flowers and their sulfur-fumigated products by three-channel liquid chromatography with electrochemical detection.

    PubMed

    Chen, Liangmian; Kotani, Akira; Kusu, Fumiyo; Wang, Zhimin; Zhu, Jingjing; Hakamata, Hideki

    2015-01-01

    For the determination of seven caffeoylquinic acids [neochlorogenic acid (NcA), cryptochlorogenic acid (CcA), chlorogenic acid (CA), caffeic acid (CfA), isochlorogenic acid A (Ic A), isochlorogenic acid B (Ic B), isochlorogenic acid C (Ic C)] and two flavonoids [luteolin 7-O-glucoside (LtG) and luteolin (Lt)], a three-channel liquid chromatography with electrochemical detection (LC-3ECD) method was established. Chromatographic peak heights were proportional to each concentration, ranging from 2.5 to 100 ng/mL for NcA, CA, CcA, and CfA, and ranging from 2.5 to 250 ng/mL for LtG, Ic B, Ic A, Ic C, and Lt, respectively. The present LC-3ECD method was applied to the quantitative analysis of caffeoylquinic acids and flavonoids in four cultivars of Chrysanthemum morifolium flowers and their sulfur-fumigated products. It was found that 60% of LtG and more than 47% of caffeoylquinic acids were lost during the sulfur fumigation processing. Sulfur fumigation showed a destructive effect on the C. morifolium flowers. In addition, principle component analyses (PCA) were performed using the results of the quantitative analysis of caffeoylquinic acids and flavonoids to compare the "sameness" and "differences" of these analytes in C. morifolium flowers and the sulfur-fumigated products. PCA score plots showed that the four cultivars of C. morifolium flowers were clearly classified into four groups, and that significant differences were also found between the non-fumigated C. morifolium flowers and the sulfur-fumigated products. Therefore, it was demonstrated that the present LC-3ECD method coupled with PCA is applicable to the variation analysis of different C. morifolium flower samples.

  7. Biological Evaluation and Molecular Docking of Protocatechuic Acid from Hibiscus sabdariffa L. as a Potent Urease Inhibitor by an ESI-MS Based Method.

    PubMed

    Hassan, Sherif T S; Švajdlenka, Emil

    2017-10-11

    Studies on enzyme inhibition remain a crucial area in drug discovery since these studies have led to the discoveries of new lead compounds useful in the treatment of several diseases. In this study, protocatechuic acid (PCA), an active compound from Hibiscus sabdariffa L. has been evaluated for its inhibitory properties against jack bean urease (JBU) as well as its possible toxic effect on human gastric epithelial cells (GES-1). Anti-urease activity was evaluated by an Electrospray Ionization-Mass Spectrometry (ESI-MS) based method, while cytotoxicity was assayed by the MTT method. PCA exerted notable anti-JBU activity compared with that of acetohydroxamic acid (AHA), with IC 50 values of 1.7 and 3.2 µM, respectively. PCA did not show any significant cytotoxic effect on (GES-1) cells at concentrations ranging from 1.12 to 3.12 µM. Molecular docking study revealed high spontaneous binding ability of PCA to the active site of urease. Additionally, the anti-urease activity was found to be related to the presence of hydroxyl moieties of PCA. This study presents PCA as a natural urease inhibitor, which could be used safely in the treatment of diseases caused by urease-producing bacteria.

  8. Expression of spermidine/spermine N(1) -acetyl transferase (SSAT) in human prostate tissues is related to prostate cancer progression and metastasis.

    PubMed

    Huang, Wei; Eickhoff, Jens C; Mehraein-Ghomi, Farideh; Church, Dawn R; Wilding, George; Basu, Hirak S

    2015-08-01

    Prostate cancer (PCa) in many patients remains indolent for the rest of their lives, but in some patients, it progresses to lethal metastatic disease. Gleason score is the current clinical method for PCa prognosis. It cannot reliably identify aggressive PCa, when GS is ≤ 7. It is shown that oxidative stress plays a key role in PCa progression. We have shown that in cultured human PCa cells, an activation of spermidine/spermine N(1) -acetyl transferase (SSAT; EC 2.3.1.57) enzyme initiates a polyamine oxidation pathway and generates copious amounts of reactive oxygen species in polyamine-rich PCa cells. We used RNA in situ hybridization and immunohistochemistry methods to detect SSAT mRNA and protein expression in two tissue microarrays (TMA) created from patient's prostate tissues. We analyzed 423 patient's prostate tissues in the two TMAs. Our data show that there is a significant increase in both SSAT mRNA and the enzyme protein in the PCa cells as compared to their benign counterpart. This increase is even more pronounced in metastatic PCa tissues as compared to the PCa localized in the prostate. In the prostatectomy tissues from early-stage patients, the SSAT protein level is also high in the tissues obtained from the patients who ultimately progress to advanced metastatic disease. Based on these results combined with published data from our and other laboratories, we propose an activation of an autocrine feed-forward loop of PCa cell proliferation in the absence of androgen as a possible mechanism of castrate-resistant prostate cancer growth. © 2015 Wiley Periodicals, Inc.

  9. E-nose based rapid prediction of early mouldy grain using probabilistic neural networks

    PubMed Central

    Ying, Xiaoguo; Liu, Wei; Hui, Guohua; Fu, Jun

    2015-01-01

    In this paper, early mouldy grain rapid prediction method using probabilistic neural network (PNN) and electronic nose (e-nose) was studied. E-nose responses to rice, red bean, and oat samples with different qualities were measured and recorded. E-nose data was analyzed using principal component analysis (PCA), back propagation (BP) network, and PNN, respectively. Results indicated that PCA and BP network could not clearly discriminate grain samples with different mouldy status and showed poor predicting accuracy. PNN showed satisfying discriminating abilities to grain samples with an accuracy of 93.75%. E-nose combined with PNN is effective for early mouldy grain prediction. PMID:25714125

  10. Characterization of Leaf Extracts of Schinus terebinthifolius Raddi by GC-MS and Chemometric Analysis

    PubMed Central

    Carneiro, Fabíola B.; Lopes, Pablo Q.; Ramalho, Ricardo C.; Scotti, Marcus T.; Santos, Sócrates G.; Soares, Luiz A. L.

    2017-01-01

    Background: Schinus terebinthifolius Raddi belongs to Anacardiacea family and is widely known as “aroeira.” This species originates from South America, and its extracts are used in folk medicine due to its therapeutic properties, which include antimicrobial, anti-inflammatory, and antipyretic effects. The complexity and variability of the chemical constitution of the herbal raw material establishes the quality of the respective herbal medicine products. Objective: Thus, the purpose of this study was to investigate the variability of the volatile compounds from leaves of S. terebinthifolius. Materials and Methods: The samples were collected from different states of the Northeast region of Brazil and analyzed with a gas chromatograph coupled to a mass spectrometer (GC-MS). The collected data were analyzed using multivariate data analysis. Results: The samples’ chromatograms, obtained by GC-MS, showed similar chemical profiles in a number of peaks, but some differences were observed in the intensity of these analytical markers. The chromatographic fingerprints obtained by GC-MS were suitable for discrimination of the samples; these results along with a statistical treatment (principal component analysis [PCA]) were used as a tool for comparative analysis between the different samples of S. terebinthifolius. Conclusion: The experimental data show that the PCA used in this study clustered the samples into groups with similar chemical profiles, which builds an appropriate approach to evaluate the similarity in the phytochemical pattern found in the different leaf samples. SUMMARY The leave extracts of Schinus terebinthifolius were obtained by turbo-extractionThe extracts were partitioned with hexane and analyzed by GC-MSThe chromatographic data were analyzed using the principal component analysis (PCA)The PCA plots showed the main compounds (phellandrene, limonene, and carene), which were used to group the samples from a different geographical location in accordance to their chemical similarity. Abbreviations used: AL: Alagoas, BA: Bahia, CE: Ceará, CPETEC: Center for Weather Forecasting and Climate Studies, GC-MS: Gas chromatograph coupled to a mass spectrometer, MA: Maranhão, MVA: Multivariate data analysis, PB: Paraíba, PC1: Direction that describes the maximum variance of the original data, PC2: Maximum direction variance of the data in the subspace orthogonal to PC1, PCA: Principal component analysis, PE: Pernambuco, PI: Piauí, RN: Rio Grande do Norte, SE: Sergipe. PMID:29142431

  11. The Network Structure of Human Personality According to the NEO-PI-R: Matching Network Community Structure to Factor Structure

    PubMed Central

    Goekoop, Rutger; Goekoop, Jaap G.; Scholte, H. Steven

    2012-01-01

    Introduction Human personality is described preferentially in terms of factors (dimensions) found using factor analysis. An alternative and highly related method is network analysis, which may have several advantages over factor analytic methods. Aim To directly compare the ability of network community detection (NCD) and principal component factor analysis (PCA) to examine modularity in multidimensional datasets such as the neuroticism-extraversion-openness personality inventory revised (NEO-PI-R). Methods 434 healthy subjects were tested on the NEO-PI-R. PCA was performed to extract factor structures (FS) of the current dataset using both item scores and facet scores. Correlational network graphs were constructed from univariate correlation matrices of interactions between both items and facets. These networks were pruned in a link-by-link fashion while calculating the network community structure (NCS) of each resulting network using the Wakita Tsurumi clustering algorithm. NCSs were matched against FS and networks of best matches were kept for further analysis. Results At facet level, NCS showed a best match (96.2%) with a ‘confirmatory’ 5-FS. At item level, NCS showed a best match (80%) with the standard 5-FS and involved a total of 6 network clusters. Lesser matches were found with ‘confirmatory’ 5-FS and ‘exploratory’ 6-FS of the current dataset. Network analysis did not identify facets as a separate level of organization in between items and clusters. A small-world network structure was found in both item- and facet level networks. Conclusion We present the first optimized network graph of personality traits according to the NEO-PI-R: a ‘Personality Web’. Such a web may represent the possible routes that subjects can take during personality development. NCD outperforms PCA by producing plausible modularity at item level in non-standard datasets, and can identify the key roles of individual items and clusters in the network. PMID:23284713

  12. Descriptive Characteristics of Surface Water Quality in Hong Kong by a Self-Organising Map

    PubMed Central

    An, Yan; Zou, Zhihong; Li, Ranran

    2016-01-01

    In this study, principal component analysis (PCA) and a self-organising map (SOM) were used to analyse a complex dataset obtained from the river water monitoring stations in the Tolo Harbor and Channel Water Control Zone (Hong Kong), covering the period of 2009–2011. PCA was initially applied to identify the principal components (PCs) among the nonlinear and complex surface water quality parameters. SOM followed PCA, and was implemented to analyze the complex relationships and behaviors of the parameters. The results reveal that PCA reduced the multidimensional parameters to four significant PCs which are combinations of the original ones. The positive and inverse relationships of the parameters were shown explicitly by pattern analysis in the component planes. It was found that PCA and SOM are efficient tools to capture and analyze the behavior of multivariable, complex, and nonlinear related surface water quality data. PMID:26761018

  13. Descriptive Characteristics of Surface Water Quality in Hong Kong by a Self-Organising Map.

    PubMed

    An, Yan; Zou, Zhihong; Li, Ranran

    2016-01-08

    In this study, principal component analysis (PCA) and a self-organising map (SOM) were used to analyse a complex dataset obtained from the river water monitoring stations in the Tolo Harbor and Channel Water Control Zone (Hong Kong), covering the period of 2009-2011. PCA was initially applied to identify the principal components (PCs) among the nonlinear and complex surface water quality parameters. SOM followed PCA, and was implemented to analyze the complex relationships and behaviors of the parameters. The results reveal that PCA reduced the multidimensional parameters to four significant PCs which are combinations of the original ones. The positive and inverse relationships of the parameters were shown explicitly by pattern analysis in the component planes. It was found that PCA and SOM are efficient tools to capture and analyze the behavior of multivariable, complex, and nonlinear related surface water quality data.

  14. Analysis of Zinc-Exporters Expression in Prostate Cancer.

    PubMed

    Singh, Chandra K; Malas, Kareem M; Tydrick, Caitlin; Siddiqui, Imtiaz A; Iczkowski, Kenneth A; Ahmad, Nihal

    2016-11-11

    Maintaining optimal intracellular zinc (Zn) concentration is crucial for critical cellular functions. Depleted Zn has been associated with prostate cancer (PCa) progression. Solute carrier family 30 (SLC30A) proteins maintain cytoplasmic Zn balance by exporting Zn out to the extracellular space or by sequestering cytoplasmic Zn into intracellular compartments. In this study, we determined the involvement of Zn-exporters, SLC30A 1-10 in PCa, in the context of racial health disparity in human PCa samples obtained from European-American (EA) and African-American (AA) populations. We also analyzed the levels of Zn-exporters in a panel of PCa cells derived from EA and AA populations. We further explored the expression profile of Zn-exporters in PCa using Oncomine database. Zn-exporters were found to be differentially expressed at the mRNA level, with a significant upregulation of SLC30A1, SLC30A9 and SLC30A10, and downregulation of SLC30A5 and SLC30A6 in PCa, compared to benign prostate. Moreover, Ingenuity Pathway analysis revealed several interactions of Zn-exporters with certain tumor suppressor and promoter proteins known to be modulated in PCa. Our study provides an insight regarding Zn-exporters in PCa, which may open new avenues for future studies aimed at enhancing the levels of Zn by modulating Zn-transporters via pharmacological means.

  15. Analysis of Zinc-Exporters Expression in Prostate Cancer

    PubMed Central

    Singh, Chandra K.; Malas, Kareem M.; Tydrick, Caitlin; Siddiqui, Imtiaz A.; Iczkowski, Kenneth A.; Ahmad, Nihal

    2016-01-01

    Maintaining optimal intracellular zinc (Zn) concentration is crucial for critical cellular functions. Depleted Zn has been associated with prostate cancer (PCa) progression. Solute carrier family 30 (SLC30A) proteins maintain cytoplasmic Zn balance by exporting Zn out to the extracellular space or by sequestering cytoplasmic Zn into intracellular compartments. In this study, we determined the involvement of Zn-exporters, SLC30A 1–10 in PCa, in the context of racial health disparity in human PCa samples obtained from European-American (EA) and African-American (AA) populations. We also analyzed the levels of Zn-exporters in a panel of PCa cells derived from EA and AA populations. We further explored the expression profile of Zn-exporters in PCa using Oncomine database. Zn-exporters were found to be differentially expressed at the mRNA level, with a significant upregulation of SLC30A1, SLC30A9 and SLC30A10, and downregulation of SLC30A5 and SLC30A6 in PCa, compared to benign prostate. Moreover, Ingenuity Pathway analysis revealed several interactions of Zn-exporters with certain tumor suppressor and promoter proteins known to be modulated in PCa. Our study provides an insight regarding Zn-exporters in PCa, which may open new avenues for future studies aimed at enhancing the levels of Zn by modulating Zn-transporters via pharmacological means. PMID:27833104

  16. Visual tracking based on the sparse representation of the PCA subspace

    NASA Astrophysics Data System (ADS)

    Chen, Dian-bing; Zhu, Ming; Wang, Hui-li

    2017-09-01

    We construct a collaborative model of the sparse representation and the subspace representation. First, we represent the tracking target in the principle component analysis (PCA) subspace, and then we employ an L 1 regularization to restrict the sparsity of the residual term, an L 2 regularization term to restrict the sparsity of the representation coefficients, and an L 2 norm to restrict the distance between the reconstruction and the target. Then we implement the algorithm in the particle filter framework. Furthermore, an iterative method is presented to get the global minimum of the residual and the coefficients. Finally, an alternative template update scheme is adopted to avoid the tracking drift which is caused by the inaccurate update. In the experiment, we test the algorithm on 9 sequences, and compare the results with 5 state-of-art methods. According to the results, we can conclude that our algorithm is more robust than the other methods.

  17. Radiative transfer models for retrieval of cloud parameters from EPIC/DSCOVR measurements

    NASA Astrophysics Data System (ADS)

    Molina García, Víctor; Sasi, Sruthy; Efremenko, Dmitry S.; Doicu, Adrian; Loyola, Diego

    2018-07-01

    In this paper we analyze the accuracy and efficiency of several radiative transfer models for inferring cloud parameters from radiances measured by the Earth Polychromatic Imaging Camera (EPIC) on board the Deep Space Climate Observatory (DSCOVR). The radiative transfer models are the exact discrete ordinate and matrix operator methods with matrix exponential, and the approximate asymptotic and equivalent Lambertian cloud models. To deal with the computationally expensive radiative transfer calculations, several acceleration techniques such as, for example, the telescoping technique, the method of false discrete ordinate, the correlated k-distribution method and the principal component analysis (PCA) are used. We found that, for the EPIC oxygen A-band absorption channel at 764 nm, the exact models using the correlated k-distribution in conjunction with PCA yield an accuracy better than 1.5% and a computation time of 18 s for radiance calculations at 5 viewing zenith angles.

  18. Neuropsychiatric Symptoms in Posterior Cortical Atrophy and Alzheimer Disease

    PubMed Central

    Crutch, Sebastian J.; Franco-Macías, Emilio; Gil-Néciga, Eulogio

    2016-01-01

    Background: Posterior cortical atrophy (PCA) is a rare neurodegenerative syndrome characterized by early progressive visual dysfunction in the context of relative preservation of memory and a pattern of atrophy mainly involving the posterior cortex. The aim of the present study is to characterize the neuropsychiatric profile of PCA. Methods: The Neuropsychiatric Inventory was used to assess 12 neuropsychiatric symptoms (NPS) in 28 patients with PCA and 34 patients with typical Alzheimer disease (AD) matched by age, disease duration, and illness severity. Results: The most commonly reported NPS in both groups were depression, anxiety, apathy, and irritability. However, aside from a trend toward lower rates of apathy in patients with PCA, there were no differences in the percentage of NPS presented in each group. All those patients presenting visual hallucinations in the PCA group also met diagnostic criteria for dementia with Lewy bodies (DLB). Auditory hallucinations were only present in patients meeting diagnosis criteria for DLB. Conclusion: Prevalence of the 12 NPS examined was similar between patients with PCA and AD. Hallucinations in PCA may be helpful in the differential diagnosis between PCA-AD and PCA-DLB. PMID:26404166

  19. Consensus classification of posterior cortical atrophy

    PubMed Central

    Crutch, Sebastian J.; Schott, Jonathan M.; Rabinovici, Gil D.; Murray, Melissa; Snowden, Julie S.; van der Flier, Wiesje M.; Dickerson, Bradford C.; Vandenberghe, Rik; Ahmed, Samrah; Bak, Thomas H.; Boeve, Bradley F.; Butler, Christopher; Cappa, Stefano F.; Ceccaldi, Mathieu; de Souza, Leonardo Cruz; Dubois, Bruno; Felician, Olivier; Galasko, Douglas; Graff-Radford, Jonathan; Graff-Radford, Neill R.; Hof, Patrick R.; Krolak-Salmon, Pierre; Lehmann, Manja; Magnin, Eloi; Mendez, Mario F.; Nestor, Peter J.; Onyike, Chiadi U.; Pelak, Victoria S.; Pijnenburg, Yolande; Primativo, Silvia; Rossor, Martin N.; Ryan, Natalie S.; Scheltens, Philip; Shakespeare, Timothy J.; González, Aida Suárez; Tang-Wai, David F.; Yong, Keir X. X.; Carrillo, Maria; Fox, Nick C.

    2017-01-01

    Introduction A classification framework for posterior cortical atrophy (PCA) is proposed to improve the uniformity of definition of the syndrome in a variety of research settings. Methods Consensus statements about PCA were developed through a detailed literature review, the formation of an international multidisciplinary working party which convened on four occasions, and a Web-based quantitative survey regarding symptom frequency and the conceptualization of PCA. Results A three-level classification framework for PCA is described comprising both syndrome- and disease-level descriptions. Classification level 1 (PCA) defines the core clinical, cognitive, and neuroimaging features and exclusion criteria of the clinico-radiological syndrome. Classification level 2 (PCA-pure, PCA-plus) establishes whether, in addition to the core PCA syndrome, the core features of any other neurodegenerative syndromes are present. Classification level 3 (PCA attributable to AD [PCA-AD], Lewy body disease [PCA-LBD], corticobasal degeneration [PCA-CBD], prion disease [PCA-prion]) provides a more formal determination of the underlying cause of the PCA syndrome, based on available pathophysiological biomarker evidence. The issue of additional syndrome-level descriptors is discussed in relation to the challenges of defining stages of syndrome severity and characterizing phenotypic heterogeneity within the PCA spectrum. Discussion There was strong agreement regarding the definition of the core clinico-radiological syndrome, meaning that the current consensus statement should be regarded as a refinement, development, and extension of previous single-center PCA criteria rather than any wholesale alteration or redescription of the syndrome. The framework and terminology may facilitate the interpretation of research data across studies, be applicable across a broad range of research scenarios (e.g., behavioral interventions, pharmacological trials), and provide a foundation for future collaborative work. PMID:28259709

  20. 24 CFR 401.451 - PAE Physical Condition Analysis (PCA).

    Code of Federal Regulations, 2010 CFR

    2010-04-01

    ... PROGRAM (MARK-TO-MARKET) Restructuring Plan § 401.451 PAE Physical Condition Analysis (PCA). (a) Review and certification of owner evaluation. (1) The PAE must independently evaluate the physical condition... 24 Housing and Urban Development 2 2010-04-01 2010-04-01 false PAE Physical Condition Analysis...

  1. 24 CFR 401.451 - PAE Physical Condition Analysis (PCA).

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ... 24 Housing and Urban Development 2 2013-04-01 2013-04-01 false PAE Physical Condition Analysis... PROGRAM (MARK-TO-MARKET) Restructuring Plan § 401.451 PAE Physical Condition Analysis (PCA). (a) Review and certification of owner evaluation. (1) The PAE must independently evaluate the physical condition...

  2. 24 CFR 401.451 - PAE Physical Condition Analysis (PCA).

    Code of Federal Regulations, 2011 CFR

    2011-04-01

    ... 24 Housing and Urban Development 2 2011-04-01 2011-04-01 false PAE Physical Condition Analysis... PROGRAM (MARK-TO-MARKET) Restructuring Plan § 401.451 PAE Physical Condition Analysis (PCA). (a) Review and certification of owner evaluation. (1) The PAE must independently evaluate the physical condition...

  3. 24 CFR 401.451 - PAE Physical Condition Analysis (PCA).

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ... 24 Housing and Urban Development 2 2012-04-01 2012-04-01 false PAE Physical Condition Analysis... PROGRAM (MARK-TO-MARKET) Restructuring Plan § 401.451 PAE Physical Condition Analysis (PCA). (a) Review and certification of owner evaluation. (1) The PAE must independently evaluate the physical condition...

  4. 24 CFR 401.451 - PAE Physical Condition Analysis (PCA).

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ... 24 Housing and Urban Development 2 2014-04-01 2014-04-01 false PAE Physical Condition Analysis... PROGRAM (MARK-TO-MARKET) Restructuring Plan § 401.451 PAE Physical Condition Analysis (PCA). (a) Review and certification of owner evaluation. (1) The PAE must independently evaluate the physical condition...

  5. Physical activity in relation to risk of prostate cancer: a systematic review and meta-analysis.

    PubMed

    Benke, I N; Leitzmann, M F; Behrens, G; Schmid, D

    2018-05-01

    Prostate cancer (PCa) is one of the most common cancers among men, yet little is known about its modifiable risk and protective factors. This study aims to quantitatively summarize observational studies relating physical activity (PA) to PCa incidence and mortality. Published articles pertaining to PA and PCa incidence and mortality were retrieved in July 2017 using the Medline and EMBASE databases. The literature review yielded 48 cohort studies and 24 case-control studies with a total of 151 748 PCa cases. The mean age of the study participants at baseline was 61 years. In random-effects models, comparing the highest versus the lowest level of overall PA showed a summary relative risk (RR) estimate for total PCa incidence close to the null [RR = 0.99, 95% confidence interval (CI) = 0.94-1.04]. The corresponding RRs for advanced and non-advanced PCa were 0.92 (95% CI = 0.80-1.06) and 0.95 (95% CI = 0.85-1.07), respectively. We noted a statistically significant inverse association between long-term occupational activity and total PCa (RR = 0.83, 95% CI = 0.71-0.98, n studies = 13), although that finding became statistically non-significant when individual studies were removed from the analysis. When evaluated by cancer subtype, an inverse association with long-term occupational activity was noted for non-advanced/non-aggressive PCa (RR = 0.51, 95% CI = 0.37-0.71, n studies = 2) and regular recreational activity was inversely related to advanced/aggressive PCa (RR = 0.75, 95% CI = 0.60-0.95, n studies = 2), although these observations are based on a low number of studies. Moreover, PA after diagnosis was related to reduced risk of PCa mortality among survivors of PCa (summary RR based on four studies = 0.69, 95% CI = 0.55-0.85). Whether PA protects against PCa remains elusive. Further investigation taking into account the complex clinical and pathologic nature of PCa is needed to clarify the PA and PCa incidence relation. Moreover, future studies are needed to confirm whether PA after diagnosis reduces risk of PCa mortality.

  6. Cluster and principal component analysis based on SSR markers of Amomum tsao-ko in Jinping County of Yunnan Province

    NASA Astrophysics Data System (ADS)

    Ma, Mengli; Lei, En; Meng, Hengling; Wang, Tiantao; Xie, Linyan; Shen, Dong; Xianwang, Zhou; Lu, Bingyue

    2017-08-01

    Amomum tsao-ko is a commercial plant that used for various purposes in medicinal and food industries. For the present investigation, 44 germplasm samples were collected from Jinping County of Yunnan Province. Clusters analysis and 2-dimensional principal component analysis (PCA) was used to represent the genetic relations among Amomum tsao-ko by using simple sequence repeat (SSR) markers. Clustering analysis clearly distinguished the samples groups. Two major clusters were formed; first (Cluster I) consisted of 34 individuals, the second (Cluster II) consisted of 10 individuals, Cluster I as the main group contained multiple sub-clusters. PCA also showed 2 groups: PCA Group 1 included 29 individuals, PCA Group 2 included 12 individuals, consistent with the results of cluster analysis. The purpose of the present investigation was to provide information on genetic relationship of Amomum tsao-ko germplasm resources in main producing areas, also provide a theoretical basis for the protection and utilization of Amomum tsao-ko resources.

  7. Receptor modeling for source apportionment of polycyclic aromatic hydrocarbons in urban atmosphere.

    PubMed

    Singh, Kunwar P; Malik, Amrita; Kumar, Ranjan; Saxena, Puneet; Sinha, Sarita

    2008-01-01

    This study reports source apportionment of polycyclic aromatic hydrocarbons (PAHs) in particulate depositions on vegetation foliages near highway in the urban environment of Lucknow city (India) using the principal components analysis/absolute principal components scores (PCA/APCS) receptor modeling approach. The multivariate method enables identification of major PAHs sources along with their quantitative contributions with respect to individual PAH. The PCA identified three major sources of PAHs viz. combustion, vehicular emissions, and diesel based activities. The PCA/APCS receptor modeling approach revealed that the combustion sources (natural gas, wood, coal/coke, biomass) contributed 19-97% of various PAHs, vehicular emissions 0-70%, diesel based sources 0-81% and other miscellaneous sources 0-20% of different PAHs. The contributions of major pyrolytic and petrogenic sources to the total PAHs were 56 and 42%, respectively. Further, the combustion related sources contribute major fraction of the carcinogenic PAHs in the study area. High correlation coefficient (R2 > 0.75 for most PAHs) between the measured and predicted concentrations of PAHs suggests for the applicability of the PCA/APCS receptor modeling approach for estimation of source contribution to the PAHs in particulates.

  8. Finger crease pattern recognition using Legendre moments and principal component analysis

    NASA Astrophysics Data System (ADS)

    Luo, Rongfang; Lin, Tusheng

    2007-03-01

    The finger joint lines defined as finger creases and its distribution can identify a person. In this paper, we propose a new finger crease pattern recognition method based on Legendre moments and principal component analysis (PCA). After obtaining the region of interest (ROI) for each finger image in the pre-processing stage, Legendre moments under Radon transform are applied to construct a moment feature matrix from the ROI, which greatly decreases the dimensionality of ROI and can represent principal components of the finger creases quite well. Then, an approach to finger crease pattern recognition is designed based on Karhunen-Loeve (K-L) transform. The method applies PCA to a moment feature matrix rather than the original image matrix to achieve the feature vector. The proposed method has been tested on a database of 824 images from 103 individuals using the nearest neighbor classifier. The accuracy up to 98.584% has been obtained when using 4 samples per class for training. The experimental results demonstrate that our proposed approach is feasible and effective in biometrics.

  9. Clustering analysis strategies for electron energy loss spectroscopy (EELS).

    PubMed

    Torruella, Pau; Estrader, Marta; López-Ortega, Alberto; Baró, Maria Dolors; Varela, Maria; Peiró, Francesca; Estradé, Sònia

    2018-02-01

    In this work, the use of cluster analysis algorithms, widely applied in the field of big data, is proposed to explore and analyze electron energy loss spectroscopy (EELS) data sets. Three different data clustering approaches have been tested both with simulated and experimental data from Fe 3 O 4 /Mn 3 O 4 core/shell nanoparticles. The first method consists on applying data clustering directly to the acquired spectra. A second approach is to analyze spectral variance with principal component analysis (PCA) within a given data cluster. Lastly, data clustering on PCA score maps is discussed. The advantages and requirements of each approach are studied. Results demonstrate how clustering is able to recover compositional and oxidation state information from EELS data with minimal user input, giving great prospects for its usage in EEL spectroscopy. Copyright © 2017 Elsevier B.V. All rights reserved.

  10. Surface-enhanced Raman spectra of hemoglobin for esophageal cancer diagnosis

    NASA Astrophysics Data System (ADS)

    Zhou, Xue; Diao, Zhenqi; Fan, Chunzhen; Guo, Huiqiang; Xiong, Yang; Tang, Weiyue

    2014-03-01

    Surface-enhanced Raman scattering (SERS) spectra of hemoglobin from 30 esophageal cancer patients and 30 healthy persons have been detected and analyzed. The results indicate that, there are more iron ions in low spin state and less in high for the hemoglobin of esophageal cancer patients than normal persons, which is consistent with the fact that it is easier to hemolyze for the blood of cancer patients. By using principal component analysis (PCA) and discriminate analysis, we can get a three-dimensional scatter plot of PC scores from the SERS spectra of healthy persons and cancer patients, from which the two groups can be discriminated. The total accuracy of this method is 90%, while the diagnostic specificity is 93.3% and sensitivity is 86.7%. Thus SERS spectra of hemoglobin analysis combined with PCA may be a new technique for the early diagnose of esophageal cancer.

  11. An analytics of electricity consumption characteristics based on principal component analysis

    NASA Astrophysics Data System (ADS)

    Feng, Junshu

    2018-02-01

    Abstract . More detailed analysis of the electricity consumption characteristics can make demand side management (DSM) much more targeted. In this paper, an analytics of electricity consumption characteristics based on principal component analysis (PCA) is given, which the PCA method can be used in to extract the main typical characteristics of electricity consumers. Then, electricity consumption characteristics matrix is designed, which can make a comparison of different typical electricity consumption characteristics between different types of consumers, such as industrial consumers, commercial consumers and residents. In our case study, the electricity consumption has been mainly divided into four characteristics: extreme peak using, peak using, peak-shifting using and others. Moreover, it has been found that industrial consumers shift their peak load often, meanwhile commercial and residential consumers have more peak-time consumption. The conclusions can provide decision support of DSM for the government and power providers.

  12. Large Deformation Diffeomorphism and Momentum Based Hippocampal Shape Discrimination in Dementia of the Alzheimer type

    PubMed Central

    Wang, Lei; Beg, Faisal; Ratnanather, Tilak; Ceritoglu, Can; Younes, Laurent; Morris, John C.; Csernansky, John G.; Miller, Michael I.

    2010-01-01

    In large-deformation diffeomorphic metric mapping (LDDMM), the diffeomorphic matching of images are modeled as evolution in time, or a flow, of an associated smooth velocity vector field v controlling the evolution. The initial momentum parameterizes the whole geodesic and encodes the shape and form of the target image. Thus, methods such as principal component analysis (PCA) of the initial momentum leads to analysis of anatomical shape and form in target images without being restricted to small-deformation assumption in the analysis of linear displacements. We apply this approach to a study of dementia of the Alzheimer type (DAT). The left hippocampus in the DAT group shows significant shape abnormality while the right hippocampus shows similar pattern of abnormality. Further, PCA of the initial momentum leads to correct classification of 12 out of 18 DAT subjects and 22 out of 26 control subjects. PMID:17427733

  13. Evaluation of cerebral ischemia using near-infrared spectroscopy with oxygen inhalation

    NASA Astrophysics Data System (ADS)

    Ebihara, Akira; Tanaka, Yuichi; Konno, Takehiko; Kawasaki, Shingo; Fujiwara, Michiyuki; Watanabe, Eiju

    2012-09-01

    Conventional methods presently used to evaluate cerebral hemodynamics are invasive, require physical restraint, and employ equipment that is not easily transportable. Therefore, it is difficult to take repeated measurements at the patient's bedside. An alternative method to evaluate cerebral hemodynamics was developed using near-infrared spectroscopy (NIRS) with oxygen inhalation. The bilateral fronto-temporal areas of 30 normal volunteers and 33 patients with cerebral ischemia were evaluated with the NIRS system. The subjects inhaled oxygen through a mask for 2 min at a flow rate of 8 L/min. Principal component analysis (PCA) was applied to the data, and a topogram was drawn using the calculated weights. NIRS findings were compared with those of single-photon-emission computed tomography (SPECT). In normal volunteers, no laterality of the PCA weights was observed in 25 of 30 cases (83%). In patients with cerebral ischemia, PCA weights in ischemic regions were lower than in normal regions. In 28 of 33 patients (85%) with cerebral ischemia, NIRS findings agreed with those of SPECT. The results suggest that transmission of the changes in systemic SpO2 were attenuated in ischemic regions. The method discussed here should be clinically useful because it can be used to measure cerebral ischemia easily, repeatedly, and noninvasively.

  14. The Effect of Temperature on Pressurised Hot Water Extraction of Pharmacologically Important Metabolites as Analysed by UPLC-qTOF-MS and PCA

    PubMed Central

    Khoza, B. S.; Chimuka, L.; Mukwevho, E.; Steenkamp, P. A.; Madala, N. E.

    2014-01-01

    Metabolite extraction methods have been shown to be a critical consideration for pharmacometabolomics studies and, as such, optimization and development of new extraction methods are crucial. In the current study, an organic solvent-free method, namely, pressurised hot water extraction (PHWE), was used to extract pharmacologically important metabolites from dried Moringa oleifera leaves. Here, the temperature of the extraction solvent (pure water) was altered while keeping other factors constant using a homemade PHWE system. Samples extracted at different temperatures (50, 100, and 150°C) were assayed for antioxidant activities and the effect of the temperature on the extraction process was evaluated. The samples were further analysed by mass spectrometry to elucidate their metabolite compositions. Principal component analysis (PCA) evaluation of the UPLC-MS data showed distinctive differential metabolite patterns. Here, temperature changes during PHWE were shown to affect the levels of metabolites with known pharmacological activities, such as chlorogenic acids and flavonoids. Our overall findings suggest that, if not well optimised, the extraction temperature could compromise the “pharmacological potency” of the extracts. The use of MS in combination with PCA was furthermore shown to be an excellent approach to evaluate the quality and content of pharmacologically important extracts. PMID:25371697

  15. Plant microRNA-Target Interaction Identification Model Based on the Integration of Prediction Tools and Support Vector Machine

    PubMed Central

    Meng, Jun; Shi, Lin; Luan, Yushi

    2014-01-01

    Background Confident identification of microRNA-target interactions is significant for studying the function of microRNA (miRNA). Although some computational miRNA target prediction methods have been proposed for plants, results of various methods tend to be inconsistent and usually lead to more false positive. To address these issues, we developed an integrated model for identifying plant miRNA–target interactions. Results Three online miRNA target prediction toolkits and machine learning algorithms were integrated to identify and analyze Arabidopsis thaliana miRNA-target interactions. Principle component analysis (PCA) feature extraction and self-training technology were introduced to improve the performance. Results showed that the proposed model outperformed the previously existing methods. The results were validated by using degradome sequencing supported Arabidopsis thaliana miRNA-target interactions. The proposed model constructed on Arabidopsis thaliana was run over Oryza sativa and Vitis vinifera to demonstrate that our model is effective for other plant species. Conclusions The integrated model of online predictors and local PCA-SVM classifier gained credible and high quality miRNA-target interactions. The supervised learning algorithm of PCA-SVM classifier was employed in plant miRNA target identification for the first time. Its performance can be substantially improved if more experimentally proved training samples are provided. PMID:25051153

  16. Application of principal component analysis for improvement of X-ray fluorescence images obtained by polycapillary-based micro-XRF technique

    NASA Astrophysics Data System (ADS)

    Aida, S.; Matsuno, T.; Hasegawa, T.; Tsuji, K.

    2017-07-01

    Micro X-ray fluorescence (micro-XRF) analysis is repeated as a means of producing elemental maps. In some cases, however, the XRF images of trace elements that are obtained are not clear due to high background intensity. To solve this problem, we applied principal component analysis (PCA) to XRF spectra. We focused on improving the quality of XRF images by applying PCA. XRF images of the dried residue of standard solution on the glass substrate were taken. The XRF intensities for the dried residue were analyzed before and after PCA. Standard deviations of XRF intensities in the PCA-filtered images were improved, leading to clear contrast of the images. This improvement of the XRF images was effective in cases where the XRF intensity was weak.

  17. Multivariate frequency domain analysis of protein dynamics

    NASA Astrophysics Data System (ADS)

    Matsunaga, Yasuhiro; Fuchigami, Sotaro; Kidera, Akinori

    2009-03-01

    Multivariate frequency domain analysis (MFDA) is proposed to characterize collective vibrational dynamics of protein obtained by a molecular dynamics (MD) simulation. MFDA performs principal component analysis (PCA) for a bandpass filtered multivariate time series using the multitaper method of spectral estimation. By applying MFDA to MD trajectories of bovine pancreatic trypsin inhibitor, we determined the collective vibrational modes in the frequency domain, which were identified by their vibrational frequencies and eigenvectors. At near zero temperature, the vibrational modes determined by MFDA agreed well with those calculated by normal mode analysis. At 300 K, the vibrational modes exhibited characteristic features that were considerably different from the principal modes of the static distribution given by the standard PCA. The influences of aqueous environments were discussed based on two different sets of vibrational modes, one derived from a MD simulation in water and the other from a simulation in vacuum. Using the varimax rotation, an algorithm of the multivariate statistical analysis, the representative orthogonal set of eigenmodes was determined at each vibrational frequency.

  18. Noninvasive detection of nasopharyngeal carcinoma based on saliva proteins using surface-enhanced Raman spectroscopy

    NASA Astrophysics Data System (ADS)

    Lin, Xueliang; Lin, Duo; Ge, Xiaosong; Qiu, Sufang; Feng, Shangyuan; Chen, Rong

    2017-10-01

    The present study evaluated the capability of saliva analysis combining membrane protein purification with surface-enhanced Raman spectroscopy (SERS) for noninvasive detection of nasopharyngeal carcinoma (NPC). A rapid and convenient protein purification method based on cellulose acetate membrane was developed. A total of 659 high-quality SERS spectra were acquired from purified proteins extracted from the saliva samples of 170 patients with pathologically confirmed NPC and 71 healthy volunteers. Spectral analysis of those saliva protein SERS spectra revealed specific changes in some biochemical compositions, which were possibly associated with NPC transformation. Furthermore, principal component analysis combined with linear discriminant analysis (PCA-LDA) was utilized to analyze and classify the saliva protein SERS spectra from NPC and healthy subjects. Diagnostic sensitivity of 70.7%, specificity of 70.3%, and diagnostic accuracy of 70.5% could be achieved by PCA-LDA for NPC identification. These results show that this assay based on saliva protein SERS analysis holds promising potential for developing a rapid, noninvasive, and convenient clinical tool for NPC screening.

  19. Microarray Analysis Gene Expression Profiles in Laryngeal Muscle After Recurrent Laryngeal Nerve Injury.

    PubMed

    Bijangi-Vishehsaraei, Khadijeh; Blum, Kevin; Zhang, Hongji; Safa, Ahmad R; Halum, Stacey L

    2016-03-01

    The pathophysiology of recurrent laryngeal nerve (RLN) transection injury is rare in that it is characteristically followed by a high degree of spontaneous reinnervation, with reinnervation of the laryngeal adductor complex (AC) preceding that of the abducting posterior cricoarytenoid (PCA) muscle. Here, we aim to elucidate the differentially expressed myogenic factors following RLN injury that may be at least partially responsible for the spontaneous reinnervation. F344 male rats underwent RLN injury (n = 12) or sham surgery (n = 12). One week after RLN injury, larynges were harvested following euthanasia. The mRNA was extracted from PCA and AC muscles bilaterally, and microarray analysis was performed using a full rat genome array. Microarray analysis of denervated AC and PCA muscles demonstrated dramatic differences in gene expression profiles, with 205 individual probes that were differentially expressed between the denervated AC and PCA muscles and only 14 genes with similar expression patterns. The differential expression patterns of the AC and PCA suggest different mechanisms of reinnervation. The PCA showed the gene patterns of Wallerian degeneration, while the AC expressed the gene patterns of reinnervation by adjacent axonal sprouting. This finding may reveal important therapeutic targets applicable to RLN and other peripheral nerve injuries. © The Author(s) 2015.

  20. Prostate Cancer Patients-Negative Biopsy Controls Discrimination by Untargeted Metabolomics Analysis of Urine by LC-QTOF: Upstream Information on Other Omics

    NASA Astrophysics Data System (ADS)

    Fernández-Peralbo, M. A.; Gómez-Gómez, E.; Calderón-Santiago, M.; Carrasco-Valiente, J.; Ruiz-García, J.; Requena-Tapia, M. J.; Luque de Castro, M. D.; Priego-Capote, F.

    2016-12-01

    The existing clinical biomarkers for prostate cancer (PCa) diagnosis are far from ideal (e.g., the prostate specific antigen (PSA) serum level suffers from lack of specificity, providing frequent false positives leading to over-diagnosis). A key step in the search for minimum invasive tests to complement or replace PSA should be supported on the changes experienced by the biochemical pathways in PCa patients as compared to negative biopsy control individuals. In this research a comprehensive global analysis by LC-QTOF was applied to urine from 62 patients with a clinically significant PCa and 42 healthy individuals, both groups confirmed by biopsy. An unpaired t-test (p-value < 0.05) provided 28 significant metabolites tentatively identified in urine, used to develop a partial least squares discriminant analysis (PLS-DA) model characterized by 88.4 and 92.9% of sensitivity and specificity, respectively. Among the 28 significant metabolites 27 were present at lower concentrations in PCa patients than in control individuals, while only one reported higher concentrations in PCa patients. The connection among the biochemical pathways in which they are involved (DNA methylation, epigenetic marks on histones and RNA cap methylation) could explain the concentration changes with PCa and supports, once again, the role of metabolomics in upstream processes.

  1. A Parallel Product-Convolution approach for representing the depth varying Point Spread Functions in 3D widefield microscopy based on principal component analysis.

    PubMed

    Arigovindan, Muthuvel; Shaevitz, Joshua; McGowan, John; Sedat, John W; Agard, David A

    2010-03-29

    We address the problem of computational representation of image formation in 3D widefield fluorescence microscopy with depth varying spherical aberrations. We first represent 3D depth-dependent point spread functions (PSFs) as a weighted sum of basis functions that are obtained by principal component analysis (PCA) of experimental data. This representation is then used to derive an approximating structure that compactly expresses the depth variant response as a sum of few depth invariant convolutions pre-multiplied by a set of 1D depth functions, where the convolving functions are the PCA-derived basis functions. The model offers an efficient and convenient trade-off between complexity and accuracy. For a given number of approximating PSFs, the proposed method results in a much better accuracy than the strata based approximation scheme that is currently used in the literature. In addition to yielding better accuracy, the proposed methods automatically eliminate the noise in the measured PSFs.

  2. A Mass Spectrometric Analysis Method Based on PPCA and SVM for Early Detection of Ovarian Cancer.

    PubMed

    Wu, Jiang; Ji, Yanju; Zhao, Ling; Ji, Mengying; Ye, Zhuang; Li, Suyi

    2016-01-01

    Background. Surfaced-enhanced laser desorption-ionization-time of flight mass spectrometry (SELDI-TOF-MS) technology plays an important role in the early diagnosis of ovarian cancer. However, the raw MS data is highly dimensional and redundant. Therefore, it is necessary to study rapid and accurate detection methods from the massive MS data. Methods. The clinical data set used in the experiments for early cancer detection consisted of 216 SELDI-TOF-MS samples. An MS analysis method based on probabilistic principal components analysis (PPCA) and support vector machine (SVM) was proposed and applied to the ovarian cancer early classification in the data set. Additionally, by the same data set, we also established a traditional PCA-SVM model. Finally we compared the two models in detection accuracy, specificity, and sensitivity. Results. Using independent training and testing experiments 10 times to evaluate the ovarian cancer detection models, the average prediction accuracy, sensitivity, and specificity of the PCA-SVM model were 83.34%, 82.70%, and 83.88%, respectively. In contrast, those of the PPCA-SVM model were 90.80%, 92.98%, and 88.97%, respectively. Conclusions. The PPCA-SVM model had better detection performance. And the model combined with the SELDI-TOF-MS technology had a prospect in early clinical detection and diagnosis of ovarian cancer.

  3. Can Prostate Imaging Reporting and Data System Version 2 reduce unnecessary prostate biopsies in men with PSA levels of 4-10 ng/ml?

    PubMed

    Xu, Ning; Wu, Yu-Peng; Chen, Dong-Ning; Ke, Zhi-Bin; Cai, Hai; Wei, Yong; Zheng, Qing-Shui; Huang, Jin-Bei; Li, Xiao-Dong; Xue, Xue-Yi

    2018-05-01

    To explore the value of Prostate Imaging Reporting and Data System Version 2 (PI-RADS v2) for predicting prostate biopsy results in patients with prostate specific antigen (PSA) levels of 4-10 ng/ml. We retrospectively reviewed multi-parameter magnetic resonance images from 528 patients with PSA levels of 4-10 ng/ml who underwent transrectal ultrasound-guided prostate biopsies between May 2015 and May 2017. Among them, 137 were diagnosed with prostate cancer (PCa), and we further subdivided them according to pathological results into the significant PCa (S-PCa) and insignificant significant PCa (Ins-PCa) groups (121 cases were defined by surgical pathological specimen and 16 by biopsy). Age, PSA, percent free PSA, PSA density (PSAD), prostate volume (PV), and PI-RADS score were collected. Logistic regression analysis was performed to determine predictors of pathological results. Receiver operating characteristic curves were constructed to analyze the diagnostic value of PI-RADS v2 in PCa. Multivariate analysis indicated that age, PV, percent free PSA, and PI-RADS score were independent predictors of biopsy findings, while only PI-RADS score was an independent predictor of S-PCa (P < 0.05). The areas under the receiver operating characteristic curve for diagnosing PCa with respect to age, PV, percent free PSA, and PI-RADS score were 0.570, 0.430, 0.589 and 0.836, respectively. The area under the curve for diagnosing S-PCa with respect to PI-RADS score was 0.732. A PI-RADS score of 3 was the best cutoff for predicting PCa, and 4 was the best cutoff for predicting S-PCa. Thus, 92.8% of patients with PI-RADS scores of 1-2 would have avoided biopsy, but at the cost of missing 2.2% of the potential PCa cases. Similarly, 83.82% of patients with a PI-RADS score ≤ 3 would have avoided biopsy, but at the cost of missing 3.3% of the potential S-PCa cases. PI-RADS v2 could be used to reduce unnecessary prostate biopsies in patients with PSA levels of 4-10 ng/ml.

  4. Dihedral angle principal component analysis of molecular dynamics simulations.

    PubMed

    Altis, Alexandros; Nguyen, Phuong H; Hegger, Rainer; Stock, Gerhard

    2007-06-28

    It has recently been suggested by Mu et al. [Proteins 58, 45 (2005)] to use backbone dihedral angles instead of Cartesian coordinates in a principal component analysis of molecular dynamics simulations. Dihedral angles may be advantageous because internal coordinates naturally provide a correct separation of internal and overall motion, which was found to be essential for the construction and interpretation of the free energy landscape of a biomolecule undergoing large structural rearrangements. To account for the circular statistics of angular variables, a transformation from the space of dihedral angles {phi(n)} to the metric coordinate space {x(n)=cos phi(n),y(n)=sin phi(n)} was employed. To study the validity and the applicability of the approach, in this work the theoretical foundations underlying the dihedral angle principal component analysis (dPCA) are discussed. It is shown that the dPCA amounts to a one-to-one representation of the original angle distribution and that its principal components can readily be characterized by the corresponding conformational changes of the peptide. Furthermore, a complex version of the dPCA is introduced, in which N angular variables naturally lead to N eigenvalues and eigenvectors. Applying the methodology to the construction of the free energy landscape of decaalanine from a 300 ns molecular dynamics simulation, a critical comparison of the various methods is given.

  5. Dihedral angle principal component analysis of molecular dynamics simulations

    NASA Astrophysics Data System (ADS)

    Altis, Alexandros; Nguyen, Phuong H.; Hegger, Rainer; Stock, Gerhard

    2007-06-01

    It has recently been suggested by Mu et al. [Proteins 58, 45 (2005)] to use backbone dihedral angles instead of Cartesian coordinates in a principal component analysis of molecular dynamics simulations. Dihedral angles may be advantageous because internal coordinates naturally provide a correct separation of internal and overall motion, which was found to be essential for the construction and interpretation of the free energy landscape of a biomolecule undergoing large structural rearrangements. To account for the circular statistics of angular variables, a transformation from the space of dihedral angles {φn} to the metric coordinate space {xn=cosφn,yn=sinφn} was employed. To study the validity and the applicability of the approach, in this work the theoretical foundations underlying the dihedral angle principal component analysis (dPCA) are discussed. It is shown that the dPCA amounts to a one-to-one representation of the original angle distribution and that its principal components can readily be characterized by the corresponding conformational changes of the peptide. Furthermore, a complex version of the dPCA is introduced, in which N angular variables naturally lead to N eigenvalues and eigenvectors. Applying the methodology to the construction of the free energy landscape of decaalanine from a 300ns molecular dynamics simulation, a critical comparison of the various methods is given.

  6. Multi-segmental movements as a function of experience in karate.

    PubMed

    Zago, Matteo; Codari, Marina; Iaia, F Marcello; Sforza, Chiarella

    2017-08-01

    Karate is a martial art that partly depends on subjective scoring of complex movements. Principal component analysis (PCA)-based methods can identify the fundamental synergies (principal movements) of motor system, providing a quantitative global analysis of technique. In this study, we aimed at describing the fundamental multi-joint synergies of a karate performance, under the hypothesis that the latter are skilldependent; estimate karateka's experience level, expressed as years of practice. A motion capture system recorded traditional karate techniques of 10 professional and amateur karateka. At any time point, the 3D-coordinates of body markers produced posture vectors that were normalised, concatenated from all karateka and submitted to a first PCA. Five principal movements described both gross movement synergies and individual differences. A second PCA followed by linear regression estimated the years of practice using principal movements (eigenpostures and weighting curves) and centre of mass kinematics (error: 3.71 years; R2 = 0.91, P ≪ 0.001). Principal movements and eigenpostures varied among different karateka and as functions of experience. This approach provides a framework to develop visual tools for the analysis of motor synergies in karate, allowing to detect the multi-joint motor patterns that should be restored after an injury, or to be specifically trained to increase performance.

  7. Application of Hyperspectral Imaging and Chemometric Calibrations for Variety Discrimination of Maize Seeds

    PubMed Central

    Zhang, Xiaolei; Liu, Fei; He, Yong; Li, Xiaoli

    2012-01-01

    Hyperspectral imaging in the visible and near infrared (VIS-NIR) region was used to develop a novel method for discriminating different varieties of commodity maize seeds. Firstly, hyperspectral images of 330 samples of six varieties of maize seeds were acquired using a hyperspectral imaging system in the 380–1,030 nm wavelength range. Secondly, principal component analysis (PCA) and kernel principal component analysis (KPCA) were used to explore the internal structure of the spectral data. Thirdly, three optimal wavelengths (523, 579 and 863 nm) were selected by implementing PCA directly on each image. Then four textural variables including contrast, homogeneity, energy and correlation were extracted from gray level co-occurrence matrix (GLCM) of each monochromatic image based on the optimal wavelengths. Finally, several models for maize seeds identification were established by least squares-support vector machine (LS-SVM) and back propagation neural network (BPNN) using four different combinations of principal components (PCs), kernel principal components (KPCs) and textural features as input variables, respectively. The recognition accuracy achieved in the PCA-GLCM-LS-SVM model (98.89%) was the most satisfactory one. We conclude that hyperspectral imaging combined with texture analysis can be implemented for fast classification of different varieties of maize seeds. PMID:23235456

  8. The histogram analysis of diffusion-weighted intravoxel incoherent motion (IVIM) imaging for differentiating the gleason grade of prostate cancer.

    PubMed

    Zhang, Yu-Dong; Wang, Qing; Wu, Chen-Jiang; Wang, Xiao-Ning; Zhang, Jing; Liu, Hui; Liu, Xi-Sheng; Shi, Hai-Bin

    2015-04-01

    To evaluate histogram analysis of intravoxel incoherent motion (IVIM) for discriminating the Gleason grade of prostate cancer (PCa). A total of 48 patients pathologically confirmed as having clinically significant PCa (size > 0.5 cm) underwent preoperative DW-MRI (b of 0-900 s/mm(2)). Data was post-processed by monoexponential and IVIM model for quantitation of apparent diffusion coefficients (ADCs), perfusion fraction f, diffusivity D and pseudo-diffusivity D*. Histogram analysis was performed by outlining entire-tumour regions of interest (ROIs) from histological-radiological correlation. The ability of imaging indices to differentiate low-grade (LG, Gleason score (GS) ≤6) from intermediate/high-grade (HG, GS > 6) PCa was analysed by ROC regression. Eleven patients had LG tumours (18 foci) and 37 patients had HG tumours (42 foci) on pathology examination. HG tumours had significantly lower ADCs and D in terms of mean, median, 10th and 75th percentiles, combined with higher histogram kurtosis and skewness for ADCs, D and f, than LG PCa (p < 0.05). Histogram D showed relatively higher correlations (ñ = 0.641-0.668 vs. ADCs: 0.544-0.574) with ordinal GS of PCa; and its mean, median and 10th percentile performed better than ADCs did in distinguishing LG from HG PCa. It is feasible to stratify the pathological grade of PCa by IVIM with histogram metrics. D performed better in distinguishing LG from HG tumour than conventional ADCs. • GS had relatively higher correlation with tumour D than ADCs. • Difference of histogram D among two-grade tumours was statistically significant. • D yielded better individual features in demonstrating tumour grade than ADC. • D* and f failed to determine tumour grade of PCa.

  9. Detecting most influencing courses on students grades using block PCA

    NASA Astrophysics Data System (ADS)

    Othman, Osama H.; Gebril, Rami Salah

    2014-12-01

    One of the modern solutions adopted in dealing with the problem of large number of variables in statistical analyses is the Block Principal Component Analysis (Block PCA). This modified technique can be used to reduce the vertical dimension (variables) of the data matrix Xn×p by selecting a smaller number of variables, (say m) containing most of the statistical information. These selected variables can then be employed in further investigations and analyses. Block PCA is an adapted multistage technique of the original PCA. It involves the application of Cluster Analysis (CA) and variable selection throughout sub principal components scores (PC's). The application of Block PCA in this paper is a modified version of the original work of Liu et al (2002). The main objective was to apply PCA on each group of variables, (established using cluster analysis), instead of involving the whole large pack of variables which was proved to be unreliable. In this work, the Block PCA is used to reduce the size of a huge data matrix ((n = 41) × (p = 251)) consisting of Grade Point Average (GPA) of the students in 251 courses (variables) in the faculty of science in Benghazi University. In other words, we are constructing a smaller analytical data matrix of the GPA's of the students with less variables containing most variation (statistical information) in the original database. By applying the Block PCA, (12) courses were found to `absorb' most of the variation or influence from the original data matrix, and hence worth to be keep for future statistical exploring and analytical studies. In addition, the course Independent Study (Math.) was found to be the most influencing course on students GPA among the 12 selected courses.

  10. Subject order-independent group ICA (SOI-GICA) for functional MRI data analysis.

    PubMed

    Zhang, Han; Zuo, Xi-Nian; Ma, Shuang-Ye; Zang, Yu-Feng; Milham, Michael P; Zhu, Chao-Zhe

    2010-07-15

    Independent component analysis (ICA) is a data-driven approach to study functional magnetic resonance imaging (fMRI) data. Particularly, for group analysis on multiple subjects, temporally concatenation group ICA (TC-GICA) is intensively used. However, due to the usually limited computational capability, data reduction with principal component analysis (PCA: a standard preprocessing step of ICA decomposition) is difficult to achieve for a large dataset. To overcome this, TC-GICA employs multiple-stage PCA data reduction. Such multiple-stage PCA data reduction, however, leads to variable outputs due to different subject concatenation orders. Consequently, the ICA algorithm uses the variable multiple-stage PCA outputs and generates variable decompositions. In this study, a rigorous theoretical analysis was conducted to prove the existence of such variability. Simulated and real fMRI experiments were used to demonstrate the subject-order-induced variability of TC-GICA results using multiple PCA data reductions. To solve this problem, we propose a new subject order-independent group ICA (SOI-GICA). Both simulated and real fMRI data experiments demonstrated the high robustness and accuracy of the SOI-GICA results compared to those of traditional TC-GICA. Accordingly, we recommend SOI-GICA for group ICA-based fMRI studies, especially those with large data sets. Copyright 2010 Elsevier Inc. All rights reserved.

  11. Quantitative Ultrasound Using Texture Analysis of Myofascial Pain Syndrome in the Trapezius.

    PubMed

    Kumbhare, Dinesh A; Ahmed, Sara; Behr, Michael G; Noseworthy, Michael D

    2018-01-01

    Objective-The objective of this study is to assess the discriminative ability of textural analyses to assist in the differentiation of the myofascial trigger point (MTrP) region from normal regions of skeletal muscle. Also, to measure the ability to reliably differentiate between three clinically relevant groups: healthy asymptomatic, latent MTrPs, and active MTrP. Methods-18 and 19 patients were identified with having active and latent MTrPs in the trapezius muscle, respectively. We included 24 healthy volunteers. Images were obtained by research personnel, who were blinded with respect to the clinical status of the study participant. Histograms provided first-order parameters associated with image grayscale. Haralick, Galloway, and histogram-related features were used in texture analysis. Blob analysis was conducted on the regions of interest (ROIs). Principal component analysis (PCA) was performed followed by multivariate analysis of variance (MANOVA) to determine the statistical significance of the features. Results-92 texture features were analyzed for factorability using Bartlett's test of sphericity, which was significant. The Kaiser-Meyer-Olkin measure of sampling adequacy was 0.94. PCA demonstrated rotated eigenvalues of the first eight components (each comprised of multiple texture features) explained 94.92% of the cumulative variance in the ultrasound image characteristics. The 24 features identified by PCA were included in the MANOVA as dependent variables, and the presence of a latent or active MTrP or healthy muscle were independent variables. Conclusion-Texture analysis techniques can discriminate between the three clinically relevant groups.

  12. PBOV1 as a potential biomarker for more advanced prostate cancer based on protein and digital histomorphometric analysis.

    PubMed

    Carleton, Neil M; Zhu, Guangjing; Gorbounov, Mikhail; Miller, M Craig; Pienta, Kenneth J; Resar, Linda M S; Veltri, Robert W

    2018-05-01

    There are few tissue-based biomarkers that can accurately predict prostate cancer (PCa) progression and aggressiveness. We sought to evaluate the clinical utility of prostate and breast overexpressed 1 (PBOV1) as a potential PCa biomarker. Patient tumor samples were designated by Grade Groups using the 2014 Gleason grading system. Primary radical prostatectomy tumors were obtained from 48 patients and evaluated for PBOV1 levels using Western blot analysis in matched cancer and benign cancer-adjacent regions. Immunohistochemical evaluation of PBOV1 was subsequently performed in 80 cancer and 80 benign cancer-adjacent patient samples across two tissue microarrays (TMAs) to verify protein levels in epithelial tissue and to assess correlation between PBOV1 proteins and nuclear architectural changes in PCa cells. Digital histomorphometric analysis was used to track 22 parameters that characterized nuclear changes in PBOV1-stained cells. Using a training and test set for validation, multivariate logistic regression (MLR) models were used to identify significant nuclear parameters that distinguish Grade Group 3 and above PCa from Grade Group 1 and 2 PCa regions. PBOV1 protein levels were increased in tumors from Grade Group 3 and above (GS 4 + 3 and ≥ 8) regions versus Grade Groups 1 and 2 (GS 3 + 3 and 3 + 4) regions (P = 0.005) as assessed by densitometry of immunoblots. Additionally, by immunoblotting, PBOV1 protein levels differed significantly between Grade Group 2 (GS 3 + 4) and Grade Group 3 (GS 4 + 3) PCa samples (P = 0.028). In the immunohistochemical analysis, measures of PBOV1 staining intensity strongly correlated with nuclear alterations in cancer cells. An MLR model retaining eight parameters describing PBOV1 staining intensity and nuclear architecture discriminated Grade Group 3 and above PCa from Grade Group 1 and 2 PCa and benign cancer-adjacent regions with a ROC-AUC of 0.90 and 0.80, respectively, in training and test sets. Our study demonstrates that the PBOV1 protein could be used to discriminate Grade Group 3 and above PCa. Additionally, the PBOV1 protein could be involved in modulating changes to the nuclear architecture of PCa cells. Confirmatory studies are warranted in an independent population for further validation. © 2018 Wiley Periodicals, Inc.

  13. A Positive Family History as risk factor for Prostate Cancer in a Population-based Study with organized PSA-Screening: Results of the Swiss ERSPC (Aarau)

    PubMed Central

    Randazzo, Marco; Müller, Alexander; Carlsson, Sigrid; Eberli, Daniel; Huber, Andreas; Grobholz, Rainer; Manka, Lukas; Mortezavi, Ashkan; Sulser, Tullio; Recker, Franz; Kwiatkowski, Maciej

    2016-01-01

    Objective To assess the value of positive family history (FH) as a risk factor for prostate cancer (PCa) incidence and grade among men undergoing organized PSA-screening in a population-based study. Patients and Methods The study cohort comprised all attendees of the Swiss arm of the European Randomized Study of Screening for Prostate Cancer (ERSPC) with systematic PSA-tests every 4 years. Men reporting first-degree relative(s) diagnosed with PCa were considered to have a positive FH. Biopsy was exclusively PSA-triggered with a threshold of 3 ng/ml. Primary endpoint was PCa diagnosis. Kaplan-Meier and Cox regression analyses were used. Results Of 4,932 attendees with a median age of 60.9 (IQR 57.6–65.1) years, 334 (6.8%) reported a positive FH. Median follow-up duration was 11.6 years (IQR 10.3–13.3). Cumulative PCa incidence was 60/334 (18%, positive FH) and 550/4,598 (12%, negative FH) (OR 1.6, 95% CI 1.2–2.2, p=0.001), respectively. In both groups, most PCa diagnosed had a low grade. There were no significant differences in PSA at diagnosis, biopsy Gleason score or Gleason score on pathologic specimen among men who underwent radical prostatectomy between both groups, respectively. On multivariable analysis, age (HR 1.04, 95% CI 1.02–1.06), baseline PSA (HR 1.13 95% CI 1.12–1.14), and FH (HR 1.6, CI 1.24–2.14) were independent predictors for overall PCa incidence (p<0.0001 each). Only baseline PSA (HR 1.14, 95% CI 1.12–1.16, p<0.0001) was an independent predictor of Gleason score ≥7 PCa on prostate biopsy. The proportion of interval PCa diagnosed in between the screening rounds was non-significantly different. Conclusion Irrespective of the FH status, the current PSA-based screening setting detects the majority of aggressive PCa and missed only a minority of interval cancers with a 4-year screening algorithm. Our results suggest that men with a positive FH are at increased risk for low grade but not aggressive PCa. PMID:26332304

  14. Association between PSA kinetics and cancer-specific mortality in patients with localised prostate cancer: analysis of the placebo arm of the SPCG-6 study.

    PubMed

    Thomsen, F B; Brasso, K; Berg, K D; Gerds, T A; Johansson, J-E; Angelsen, A; Tammela, T L J; Iversen, P

    2016-03-01

    The prognostic value of prostate-specific antigen (PSA) kinetics in untreated prostate cancer (PCa) patients is debatable. We investigated the association between PSA doubling time (PSAdt), PSA velocity (PSAvel) and PSAvel risk count (PSAvRC) and PCa mortality in a cohort of patients with localised PCa managed on watchful waiting. Patients with clinically localised PCa managed observationally, who were randomised to and remained on placebo for minimum 18 months in the SPCG-6 study, were included. All patients survived at least 2 years and had a minimum of three PSA determinations available. The prognostic value of PSA kinetics was analysed and patients were stratified according to their PSA at consent: ≤10, 10.1-25, and >25 ng/ml. Cumulative incidences of PCa-specific mortality were estimated with the Aalen-Johansen method. Two hundred and sixty-three patients were included of which 116, 76 and 71 had a PSA at consent ≤10, 10.1-25, and >25 ng/ml, respectively. Median follow-up was 13.6 years. For patients with PSA at consent between 10.1 and 25 ng/ml, the 13-year risks of PCa mortality were associated with PSA kinetics: PSAdt ≤3 years: 62.0% versus PSAdt >3 years: 16.3% (Gray's test: P < 0.0001), PSAvel ≥2 ng/ml/year: 48.0% versus PSAvel <2 ng/ml/year: 11.0% (Gray's test: P = 0.0008), and PSAvRC 2: 45.0% versus 0-1: 3.8% (Gray's test: P = 0.001). In contrast, none of the PSA kinetics were significantly associated with changes of 13-year risks of PCa mortality in patients with PSA at consent ≤10 or >25 ng/ml. We found that magnitude changes in 13-year risks of PCa mortality that can be indicated by PSA kinetics depend on PSA level in patients with localised PCa who were managed observationally. Our results question PSA kinetics as surrogate marker for PCa mortality in patients with low and high PSA values. NCT00672282. © The Author 2015. Published by Oxford University Press on behalf of the European Society for Medical Oncology. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  15. Ultra-sensitive high performance liquid chromatography-laser-induced fluorescence based proteomics for clinical applications.

    PubMed

    Patil, Ajeetkumar; Bhat, Sujatha; Pai, Keerthilatha M; Rai, Lavanya; Kartha, V B; Chidangil, Santhosh

    2015-09-08

    An ultra-sensitive high performance liquid chromatography-laser induced fluorescence (HPLC-LIF) based technique has been developed by our group at Manipal, for screening, early detection, and staging for various cancers, using protein profiling of clinical samples like, body fluids, cellular specimens, and biopsy-tissue. More than 300 protein profiles of different clinical samples (serum, saliva, cellular samples and tissue homogenates) from volunteers (normal, and different pre-malignant/malignant conditions) were recorded using this set-up. The protein profiles were analyzed using principal component analysis (PCA) to achieve objective detection and classification of malignant, premalignant and healthy conditions with high sensitivity and specificity. The HPLC-LIF protein profiling combined with PCA, as a routine method for screening, diagnosis, and staging of cervical cancer and oral cancer, is discussed in this paper. In recent years, proteomics techniques have advanced tremendously in life sciences and medical sciences for the detection and identification of proteins in body fluids, tissue homogenates and cellular samples to understand biochemical mechanisms leading to different diseases. Some of the methods include techniques like high performance liquid chromatography, 2D-gel electrophoresis, MALDI-TOF-MS, SELDI-TOF-MS, CE-MS and LC-MS techniques. We have developed an ultra-sensitive high performance liquid chromatography-laser induced fluorescence (HPLC-LIF) based technique, for screening, early detection, and staging for various cancers, using protein profiling of clinical samples like, body fluids, cellular specimens, and biopsy-tissue. More than 300 protein profiles of different clinical samples (serum, saliva, cellular samples and tissue homogenates) from healthy and volunteers with different malignant conditions were recorded by using this set-up. The protein profile data were analyzed using principal component analysis (PCA) for objective classification and detection of malignant, premalignant and healthy conditions. The method is extremely sensitive to detect proteins with limit of detection of the order of femto-moles. The HPLC-LIF combined with PCA as a potential proteomic method for the diagnosis of oral cancer and cervical cancer has been discussed in this paper. This article is part of a Special Issue entitled: Proteomics in India. Copyright © 2015 Elsevier B.V. All rights reserved.

  16. Probing long-range interactions by extracting free energies from genome-wide chromosome conformation capture data.

    PubMed

    Saberi, Saeed; Farré, Pau; Cuvier, Olivier; Emberly, Eldon

    2015-05-23

    A variety of DNA binding proteins are involved in regulating and shaping the packing of chromatin. They aid the formation of loops in the DNA that function to isolate different structural domains. A recent experimental technique, Hi-C, provides a method for determining the frequency of such looping between all distant parts of the genome. Given that the binding locations of many chromatin associated proteins have also been measured, it has been possible to make estimates for their influence on the long-range interactions as measured by Hi-C. However, a challenge in this analysis is the predominance of non-specific contacts that mask out the specific interactions of interest. We show that transforming the Hi-C contact frequencies into free energies gives a natural method for separating out the distance dependent non-specific interactions. In particular we apply Principal Component Analysis (PCA) to the transformed free energy matrix to identify the dominant modes of interaction. PCA identifies systematic effects as well as high frequency spatial noise in the Hi-C data which can be filtered out. Thus it can be used as a data driven approach for normalizing Hi-C data. We assess this PCA based normalization approach, along with several other normalization schemes, by fitting the transformed Hi-C data using a pairwise interaction model that takes as input the known locations of bound chromatin factors. The result of fitting is a set of predictions for the coupling energies between the various chromatin factors and their effect on the energetics of looping. We show that the quality of the fit can be used as a means to determine how much PCA filtering should be applied to the Hi-C data. We find that the different normalizations of the Hi-C data vary in the quality of fit to the pairwise interaction model. PCA filtering can improve the fit, and the predicted coupling energies lead to biologically meaningful insights for how various chromatin bound factors influence the stability of DNA loops in chromatin.

  17. Predictive spectroscopy and chemical imaging based on novel optical systems

    NASA Astrophysics Data System (ADS)

    Nelson, Matthew Paul

    1998-10-01

    This thesis describes two futuristic optical systems designed to surpass contemporary spectroscopic methods for predictive spectroscopy and chemical imaging. These systems are advantageous to current techniques in a number of ways including lower cost, enhanced portability, shorter analysis time, and improved S/N. First, a novel optical approach to predicting chemical and physical properties based on principal component analysis (PCA) is proposed and evaluated. A regression vector produced by PCA is designed into the structure of a set of paired optical filters. Light passing through the paired filters produces an analog detector signal directly proportional to the chemical/physical property for which the regression vector was designed. Second, a novel optical system is described which takes a single-shot approach to chemical imaging with high spectroscopic resolution using a dimension-reduction fiber-optic array. Images are focused onto a two- dimensional matrix of optical fibers which are drawn into a linear distal array with specific ordering. The distal end is imaged with a spectrograph equipped with an ICCD camera for spectral analysis. Software is used to extract the spatial/spectral information contained in the ICCD images and deconvolute them into wave length-specific reconstructed images or position-specific spectra which span a multi-wavelength space. This thesis includes a description of the fabrication of two dimension-reduction arrays as well as an evaluation of the system for spatial and spectral resolution, throughput, image brightness, resolving power, depth of focus, and channel cross-talk. PCA is performed on the images by treating rows of the ICCD images as spectra and plotting the scores of each PC as a function of reconstruction position. In addition, iterative target transformation factor analysis (ITTFA) is performed on the spectroscopic images to generate ``true'' chemical maps of samples. Univariate zero-order images, univariate first-order spectroscopic images, bivariate first-order spectroscopic images, and multivariate first-order spectroscopic images of the temporal development of laser-induced plumes are presented and interpreted. Reconstructed chemical images generated using bivariate and trivariate wavelength techniques, bimodal and trimodal PCA methods, and bimodal and trimodal ITTFA approaches are also included.

  18. Physicochemical and mechanical properties of paracetamol cocrystal with 5-nitroisophthalic acid.

    PubMed

    Hiendrawan, Stevanus; Veriansyah, Bambang; Widjojokusumo, Edward; Soewandhi, Sundani Nurono; Wikarsa, Saleh; Tjandrawinata, Raymond R

    2016-01-30

    We report novel pharmaceutical cocrystal of a popular antipyretic drug paracetamol (PCA) with coformer 5-nitroisophhthalic acid (5NIP) to improve its tabletability. The cocrystal (PCA-5NIP at molar ratio of 1:1) was synthesized by solvent evaporation technique using methanol as solvent. The physicochemical properties of cocrystal were characterized by powder X-ray diffraction (PXRD), differential scanning calorimetry (DSC), thermogravimetry analysis (TGA), fourier transform infrared spectroscopy (FTIR), hot stage polarized microscopy (HSPM) and scanning electron microscopy (SEM). Stability of the cocrystal was assessed by storing them at 40°C/75% RH for one month. Compared to PCA, the cocrystal displayed superior tableting performance. PCA-5NIP cocrystal showed a similar dissolution profile as compared to PCA and exhibited good stability. This study showed the utility of PCA-5NIP cocrystal for improving mechanical properties of PCA. Copyright © 2015 Elsevier B.V. All rights reserved.

  19. PEM-PCA: a parallel expectation-maximization PCA face recognition architecture.

    PubMed

    Rujirakul, Kanokmon; So-In, Chakchai; Arnonkijpanich, Banchar

    2014-01-01

    Principal component analysis or PCA has been traditionally used as one of the feature extraction techniques in face recognition systems yielding high accuracy when requiring a small number of features. However, the covariance matrix and eigenvalue decomposition stages cause high computational complexity, especially for a large database. Thus, this research presents an alternative approach utilizing an Expectation-Maximization algorithm to reduce the determinant matrix manipulation resulting in the reduction of the stages' complexity. To improve the computational time, a novel parallel architecture was employed to utilize the benefits of parallelization of matrix computation during feature extraction and classification stages including parallel preprocessing, and their combinations, so-called a Parallel Expectation-Maximization PCA architecture. Comparing to a traditional PCA and its derivatives, the results indicate lower complexity with an insignificant difference in recognition precision leading to high speed face recognition systems, that is, the speed-up over nine and three times over PCA and Parallel PCA.

  20. Innovations in diagnostic imaging of localized prostate cancer.

    PubMed

    Pummer, Karl; Rieken, Malte; Augustin, Herbert; Gutschi, Thomas; Shariat, Shahrokh F

    2014-08-01

    In recent years, various imaging modalities have been developed to improve diagnosis, staging, and localization of early-stage prostate cancer (PCa). A MEDLINE literature search of the time frame between 01/2007 and 06/2013 was performed on imaging of localized PCa. Conventional transrectal ultrasound (TRUS) is mainly used to guide prostate biopsy. Contrast-enhanced ultrasound is based on the assumption that PCa tissue is hypervascularized and might be better identified after intravenous injection of a microbubble contrast agent. However, results on its additional value for cancer detection are controversial. Computer-based analysis of the transrectal ultrasound signal (C-TRUS) appears to detect cancer in a high rate of patients with previous biopsies. Real-time elastography seems to have higher sensitivity, specificity, and positive predictive value than conventional TRUS. However, the method still awaits prospective validation. The same is true for prostate histoscanning, an ultrasound-based method for tissue characterization. Currently, multiparametric MRI provides improved tissue visualization of the prostate, which may be helpful in the diagnosis and targeting of prostate lesions. However, most published series are small and suffer from variations in indication, methodology, quality, interpretation, and reporting. Among ultrasound-based techniques, real-time elastography and C-TRUS seem the most promising techniques. Multiparametric MRI appears to have advantages over conventional T2-weighted MRI in the detection of PCa. Despite these promising results, currently, no recommendation for the routine use of these novel imaging techniques can be made. Prospective studies defining the value of various imaging modalities are urgently needed.

Top