Sample records for components analyses pca

  1. Priority of VHS Development Based in Potential Area using Principal Component Analysis

    NASA Astrophysics Data System (ADS)

    Meirawan, D.; Ana, A.; Saripudin, S.

    2018-02-01

    The current condition of VHS is still inadequate in quality, quantity and relevance. The purpose of this research is to analyse the development of VHS based on the development of regional potential by using principal component analysis (PCA) in Bandung, Indonesia. This study used descriptive qualitative data analysis using the principle of secondary data reduction component. The method used is Principal Component Analysis (PCA) analysis with Minitab Statistics Software tool. The results of this study indicate the value of the lowest requirement is a priority of the construction of development VHS with a program of majors in accordance with the development of regional potential. Based on the PCA score found that the main priority in the development of VHS in Bandung is in Saguling, which has the lowest PCA value of 416.92 in area 1, Cihampelas with the lowest PCA value in region 2 and Padalarang with the lowest PCA value.

  2. Descriptive Characteristics of Surface Water Quality in Hong Kong by a Self-Organising Map

    PubMed Central

    An, Yan; Zou, Zhihong; Li, Ranran

    2016-01-01

    In this study, principal component analysis (PCA) and a self-organising map (SOM) were used to analyse a complex dataset obtained from the river water monitoring stations in the Tolo Harbor and Channel Water Control Zone (Hong Kong), covering the period of 2009–2011. PCA was initially applied to identify the principal components (PCs) among the nonlinear and complex surface water quality parameters. SOM followed PCA, and was implemented to analyze the complex relationships and behaviors of the parameters. The results reveal that PCA reduced the multidimensional parameters to four significant PCs which are combinations of the original ones. The positive and inverse relationships of the parameters were shown explicitly by pattern analysis in the component planes. It was found that PCA and SOM are efficient tools to capture and analyze the behavior of multivariable, complex, and nonlinear related surface water quality data. PMID:26761018

  3. Descriptive Characteristics of Surface Water Quality in Hong Kong by a Self-Organising Map.

    PubMed

    An, Yan; Zou, Zhihong; Li, Ranran

    2016-01-08

    In this study, principal component analysis (PCA) and a self-organising map (SOM) were used to analyse a complex dataset obtained from the river water monitoring stations in the Tolo Harbor and Channel Water Control Zone (Hong Kong), covering the period of 2009-2011. PCA was initially applied to identify the principal components (PCs) among the nonlinear and complex surface water quality parameters. SOM followed PCA, and was implemented to analyze the complex relationships and behaviors of the parameters. The results reveal that PCA reduced the multidimensional parameters to four significant PCs which are combinations of the original ones. The positive and inverse relationships of the parameters were shown explicitly by pattern analysis in the component planes. It was found that PCA and SOM are efficient tools to capture and analyze the behavior of multivariable, complex, and nonlinear related surface water quality data.

  4. Exploring functional data analysis and wavelet principal component analysis on ecstasy (MDMA) wastewater data.

    PubMed

    Salvatore, Stefania; Bramness, Jørgen G; Røislien, Jo

    2016-07-12

    Wastewater-based epidemiology (WBE) is a novel approach in drug use epidemiology which aims to monitor the extent of use of various drugs in a community. In this study, we investigate functional principal component analysis (FPCA) as a tool for analysing WBE data and compare it to traditional principal component analysis (PCA) and to wavelet principal component analysis (WPCA) which is more flexible temporally. We analysed temporal wastewater data from 42 European cities collected daily over one week in March 2013. The main temporal features of ecstasy (MDMA) were extracted using FPCA using both Fourier and B-spline basis functions with three different smoothing parameters, along with PCA and WPCA with different mother wavelets and shrinkage rules. The stability of FPCA was explored through bootstrapping and analysis of sensitivity to missing data. The first three principal components (PCs), functional principal components (FPCs) and wavelet principal components (WPCs) explained 87.5-99.6 % of the temporal variation between cities, depending on the choice of basis and smoothing. The extracted temporal features from PCA, FPCA and WPCA were consistent. FPCA using Fourier basis and common-optimal smoothing was the most stable and least sensitive to missing data. FPCA is a flexible and analytically tractable method for analysing temporal changes in wastewater data, and is robust to missing data. WPCA did not reveal any rapid temporal changes in the data not captured by FPCA. Overall the results suggest FPCA with Fourier basis functions and common-optimal smoothing parameter as the most accurate approach when analysing WBE data.

  5. Classification of alloys using laser induced breakdown spectroscopy with principle component analysis

    NASA Astrophysics Data System (ADS)

    Syuhada Mangsor, Aneez; Haider Rizvi, Zuhaib; Chaudhary, Kashif; Safwan Aziz, Muhammad

    2018-05-01

    The study of atomic spectroscopy has contributed to a wide range of scientific applications. In principle, laser induced breakdown spectroscopy (LIBS) method has been used to analyse various types of matter regardless of its physical state, either it is solid, liquid or gas because all elements emit light of characteristic frequencies when it is excited to sufficiently high energy. The aim of this work was to analyse the signature spectrums of each element contained in three different types of samples. Metal alloys of Aluminium, Titanium and Brass with the purities of 75%, 80%, 85%, 90% and 95% were used as the manipulated variable and their LIBS spectra were recorded. The characteristic emission lines of main elements were identified from the spectra as well as its corresponding contents. Principal component analysis (PCA) was carried out using the data from LIBS spectra. Three obvious clusters were observed in 3-dimensional PCA plot which corresponding to the different group of alloys. Findings from this study showed that LIBS technology with the help of principle component analysis could conduct the variety discrimination of alloys demonstrating the capability of LIBS-PCA method in field of spectro-analysis. Thus, LIBS-PCA method is believed to be an effective method for classifying alloys with different percentage of purifications, which was high-cost and time-consuming before.

  6. Detecting most influencing courses on students grades using block PCA

    NASA Astrophysics Data System (ADS)

    Othman, Osama H.; Gebril, Rami Salah

    2014-12-01

    One of the modern solutions adopted in dealing with the problem of large number of variables in statistical analyses is the Block Principal Component Analysis (Block PCA). This modified technique can be used to reduce the vertical dimension (variables) of the data matrix Xn×p by selecting a smaller number of variables, (say m) containing most of the statistical information. These selected variables can then be employed in further investigations and analyses. Block PCA is an adapted multistage technique of the original PCA. It involves the application of Cluster Analysis (CA) and variable selection throughout sub principal components scores (PC's). The application of Block PCA in this paper is a modified version of the original work of Liu et al (2002). The main objective was to apply PCA on each group of variables, (established using cluster analysis), instead of involving the whole large pack of variables which was proved to be unreliable. In this work, the Block PCA is used to reduce the size of a huge data matrix ((n = 41) × (p = 251)) consisting of Grade Point Average (GPA) of the students in 251 courses (variables) in the faculty of science in Benghazi University. In other words, we are constructing a smaller analytical data matrix of the GPA's of the students with less variables containing most variation (statistical information) in the original database. By applying the Block PCA, (12) courses were found to `absorb' most of the variation or influence from the original data matrix, and hence worth to be keep for future statistical exploring and analytical studies. In addition, the course Independent Study (Math.) was found to be the most influencing course on students GPA among the 12 selected courses.

  7. Differential principal component analysis of ChIP-seq.

    PubMed

    Ji, Hongkai; Li, Xia; Wang, Qian-fei; Ning, Yang

    2013-04-23

    We propose differential principal component analysis (dPCA) for analyzing multiple ChIP-sequencing datasets to identify differential protein-DNA interactions between two biological conditions. dPCA integrates unsupervised pattern discovery, dimension reduction, and statistical inference into a single framework. It uses a small number of principal components to summarize concisely the major multiprotein synergistic differential patterns between the two conditions. For each pattern, it detects and prioritizes differential genomic loci by comparing the between-condition differences with the within-condition variation among replicate samples. dPCA provides a unique tool for efficiently analyzing large amounts of ChIP-sequencing data to study dynamic changes of gene regulation across different biological conditions. We demonstrate this approach through analyses of differential chromatin patterns at transcription factor binding sites and promoters as well as allele-specific protein-DNA interactions.

  8. Identification of the isomers using principal component analysis (PCA) method

    NASA Astrophysics Data System (ADS)

    Kepceoǧlu, Abdullah; Gündoǧdu, Yasemin; Ledingham, Kenneth William David; Kilic, Hamdi Sukur

    2016-03-01

    In this work, we have carried out a detailed statistical analysis for experimental data of mass spectra from xylene isomers. Principle Component Analysis (PCA) was used to identify the isomers which cannot be distinguished using conventional statistical methods for interpretation of their mass spectra. Experiments have been carried out using a linear TOF-MS coupled to a femtosecond laser system as an energy source for the ionisation processes. We have performed experiments and collected data which has been analysed and interpreted using PCA as a multivariate analysis of these spectra. This demonstrates the strength of the method to get an insight for distinguishing the isomers which cannot be identified using conventional mass analysis obtained through dissociative ionisation processes on these molecules. The PCA results dependending on the laser pulse energy and the background pressure in the spectrometers have been presented in this work.

  9. Extended principle component analysis - a useful tool to understand processes governing water quality at catchment scales

    NASA Astrophysics Data System (ADS)

    Selle, B.; Schwientek, M.

    2012-04-01

    Water quality of ground and surface waters in catchments is typically driven by many complex and interacting processes. While small scale processes are often studied in great detail, their relevance and interplay at catchment scales remain often poorly understood. For many catchments, extensive monitoring data on water quality have been collected for different purposes. These heterogeneous data sets contain valuable information on catchment scale processes but are rarely analysed using integrated methods. Principle component analysis (PCA) has previously been applied to this kind of data sets. However, a detailed analysis of scores, which are an important result of a PCA, is often missing. Mathematically, PCA expresses measured variables on water quality, e.g. nitrate concentrations, as linear combination of independent, not directly observable key processes. These computed key processes are represented by principle components. Their scores are interpretable as process intensities which vary in space and time. Subsequently, scores can be correlated with other key variables and catchment characteristics, such as water travel times and land use that were not considered in PCA. This detailed analysis of scores represents an extension of the commonly applied PCA which could considerably improve the understanding of processes governing water quality at catchment scales. In this study, we investigated the 170 km2 Ammer catchment in SW Germany which is characterised by an above average proportion of agricultural (71%) and urban (17%) areas. The Ammer River is mainly fed by karstic springs. For PCA, we separately analysed concentrations from (a) surface waters of the Ammer River and its tributaries, (b) spring waters from the main aquifers and (c) deep groundwater from production wells. This analysis was extended by a detailed analysis of scores. We analysed measured concentrations on major ions and selected organic micropollutants. Additionally, redox-sensitive variables and environmental tracers indicating groundwater age were analysed for deep groundwater from production wells. For deep groundwater, we found that microbial turnover was stronger influenced by local availability of energy sources than by travel times of groundwater to the wells. Groundwater quality primarily reflected the input of pollutants determined by landuse, e.g. agrochemicals. We concluded that for water quality in the Ammer catchment, conservative mixing of waters with different origin is more important than reactive transport processes along the flow path.

  10. Burst and Principal Components Analyses of MEA Data Separates Chemicals by Class

    EPA Science Inventory

    Microelectrode arrays (MEAs) detect drug and chemical induced changes in action potential "spikes" in neuronal networks and can be used to screen chemicals for neurotoxicity. Analytical "fingerprinting," using Principal Components Analysis (PCA) on spike trains recorded from prim...

  11. [Research on fast classification based on LIBS technology and principle component analyses].

    PubMed

    Yu, Qi; Ma, Xiao-Hong; Wang, Rui; Zhao, Hua-Feng

    2014-11-01

    Laser-induced breakdown spectroscopy (LIBS) and the principle component analysis (PCA) were combined to study aluminum alloy classification in the present article. Classification experiments were done on thirteen different kinds of standard samples of aluminum alloy which belong to 4 different types, and the results suggested that the LIBS-PCA method can be used to aluminum alloy fast classification. PCA was used to analyze the spectrum data from LIBS experiments, three principle components were figured out that contribute the most, the principle component scores of the spectrums were calculated, and the scores of the spectrums data in three-dimensional coordinates were plotted. It was found that the spectrum sample points show clear convergence phenomenon according to the type of aluminum alloy they belong to. This result ensured the three principle components and the preliminary aluminum alloy type zoning. In order to verify its accuracy, 20 different aluminum alloy samples were used to do the same experiments to verify the aluminum alloy type zoning. The experimental result showed that the spectrum sample points all located in their corresponding area of the aluminum alloy type, and this proved the correctness of the earlier aluminum alloy standard sample type zoning method. Based on this, the identification of unknown type of aluminum alloy can be done. All the experimental results showed that the accuracy of principle component analyses method based on laser-induced breakdown spectroscopy is more than 97.14%, and it can classify the different type effectively. Compared to commonly used chemical methods, laser-induced breakdown spectroscopy can do the detection of the sample in situ and fast with little sample preparation, therefore, using the method of the combination of LIBS and PCA in the areas such as quality testing and on-line industrial controlling can save a lot of time and cost, and improve the efficiency of detection greatly.

  12. Principal component analysis for protein folding dynamics.

    PubMed

    Maisuradze, Gia G; Liwo, Adam; Scheraga, Harold A

    2009-01-09

    Protein folding is considered here by studying the dynamics of the folding of the triple beta-strand WW domain from the Formin-binding protein 28. Starting from the unfolded state and ending either in the native or nonnative conformational states, trajectories are generated with the coarse-grained united residue (UNRES) force field. The effectiveness of principal components analysis (PCA), an already established mathematical technique for finding global, correlated motions in atomic simulations of proteins, is evaluated here for coarse-grained trajectories. The problems related to PCA and their solutions are discussed. The folding and nonfolding of proteins are examined with free-energy landscapes. Detailed analyses of many folding and nonfolding trajectories at different temperatures show that PCA is very efficient for characterizing the general folding and nonfolding features of proteins. It is shown that the first principal component captures and describes in detail the dynamics of a system. Anomalous diffusion in the folding/nonfolding dynamics is examined by the mean-square displacement (MSD) and the fractional diffusion and fractional kinetic equations. The collisionless (or ballistic) behavior of a polypeptide undergoing Brownian motion along the first few principal components is accounted for.

  13. Characterization and source identification of pollutants in runoff from a mixed land use watershed using ordination analyses.

    PubMed

    Lee, Dong Hoon; Kim, Jin Hwi; Mendoza, Joseph A; Lee, Chang Hee; Kang, Joo-Hyon

    2016-05-01

    While identification of critical pollutant sources is the key initial step for cost-effective runoff management, it is challenging due to the highly uncertain nature of runoff pollution, especially during a storm event. To identify critical sources and their quantitative contributions to runoff pollution (especially focusing on phosphorous), two ordination methods were used in this study: principal component analysis (PCA) and positive matrix factorization (PMF). For the ordination analyses, we used runoff quality data for 14 storm events, including data for phosphorus, 11 heavy metal species, and eight ionic species measured at the outlets of subcatchments with different land use compositions in a mixed land use watershed. Five factors as sources of runoff pollutants were identified by PCA: agrochemicals, groundwater, native soils, domestic sewage, and urban sources (building materials and automotive activities). PMF identified similar factors to those identified by PCA, with more detailed source mechanisms for groundwater (i.e., nitrate leaching and cation exchange) and urban sources (vehicle components/motor oils/building materials and vehicle exhausts), confirming the sources identified by PCA. PMF was further used to quantify contributions of the identified sources to the water quality. Based on the results, agrochemicals and automotive activities were the two dominant and ubiquitous phosphorus sources (39-61 and 16-47 %, respectively) in the study area, regardless of land use types.

  14. The relationship between nutrition and prostate cancer: is more always better?

    PubMed

    Masko, Elizabeth M; Allott, Emma H; Freedland, Stephen J

    2013-05-01

    Prostate cancer (PCa) remains one of the most diagnosed malignancies in the world, correlating with regions where men consume more of a so-called Western-style diet. As such, there is much interest in understanding the role of lifestyle and diet on the incidence and progression of PCa. To provide a summary of published literature with regard to dietary macro- and micronutrients and PCa incidence and progression. A literature search was completed using the PubMed database for all studies published on diet and PCa in June 2012 or earlier. Primary literature and meta-analyses were given preference over other review articles when possible. The literature was reviewed on seven dietary components: carbohydrates, protein, fat and cholesterol, vegetables, vitamins and minerals, and phytochemicals. Current literature linking these nutrients to PCa is limited at best, but trends in the published data suggest consumption of carbohydrates, saturated and ω-6 fats, and certain vitamin supplements may promote PCa risk and progression. Conversely, consumption of many plant phytochemicals and ω-3 fatty acids seem to slow the risk and progression of the disease. All other nutrients seem to have no effect or data are inconclusive. A brief summary about the clinical implications of dietary interventions with respect to PCa prevention, treatment, and survivorship is provided. Due to the number and heterogeneity of published studies investigating diet and PCa, it is difficult to determine what nutrients make up the perfect diet for the primary and secondary prevention of PCa. Because diets are made of multiple macro- and micronutrients, further prospective studies are warranted, particularly those investigating the relationship between whole foods instead of a single nutritional component. Copyright © 2012 European Association of Urology. Published by Elsevier B.V. All rights reserved.

  15. Decision tree and PCA-based fault diagnosis of rotating machinery

    NASA Astrophysics Data System (ADS)

    Sun, Weixiang; Chen, Jin; Li, Jiaqing

    2007-04-01

    After analysing the flaws of conventional fault diagnosis methods, data mining technology is introduced to fault diagnosis field, and a new method based on C4.5 decision tree and principal component analysis (PCA) is proposed. In this method, PCA is used to reduce features after data collection, preprocessing and feature extraction. Then, C4.5 is trained by using the samples to generate a decision tree model with diagnosis knowledge. At last the tree model is used to make diagnosis analysis. To validate the method proposed, six kinds of running states (normal or without any defect, unbalance, rotor radial rub, oil whirl, shaft crack and a simultaneous state of unbalance and radial rub), are simulated on Bently Rotor Kit RK4 to test C4.5 and PCA-based method and back-propagation neural network (BPNN). The result shows that C4.5 and PCA-based diagnosis method has higher accuracy and needs less training time than BPNN.

  16. Detection of compatibility between baclofen and excipients with aid of infrared spectroscopy and chemometry

    NASA Astrophysics Data System (ADS)

    Rojek, Barbara; Wesolowski, Marek; Suchacz, Bogdan

    2013-12-01

    In the paper infrared (IR) spectroscopy and multivariate exploration techniques: principal component analysis (PCA) and cluster analysis (CA) were applied as supportive methods for the detection of physicochemical incompatibilities between baclofen and excipients. In the course of research, the most useful rotational strategy in PCA proved to be varimax normalized, while in CA Ward's hierarchical agglomeration with Euclidean distance measure enabled to yield the most interpretable results. Chemometrical calculations confirmed the suitability of PCA and CA as the auxiliary methods for interpretation of infrared spectra in order to recognize whether compatibilities or incompatibilities between active substance and excipients occur. On the basis of IR spectra and the results of PCA and CA it was possible to demonstrate that the presence of lactose, β-cyclodextrin and meglumine in binary mixtures produce interactions with baclofen. The results were verified using differential scanning calorimetry, differential thermal analysis, thermogravimetry/differential thermogravimetry and X-ray powder diffraction analyses.

  17. Structural Ecosystems Therapy for HIV+ African-American women and drug abuse relapse.

    PubMed

    Feaster, Daniel J; Burns, Myron J; Brincks, Ahnalee M; Prado, Guillermo; Mitrani, Victoria B; Mauer, Megaly H; Szapocznik, Jose

    2010-06-01

    This report examines the effect of Structural Ecosystems Therapy (SET) for (n=143) HIV+ African-American women on rate of relapse to substance use relative to both a person-centered approach (PCA) to therapy and a community control (CC) group. A prior report has shown SET to decrease psychological distress and family hassles relative to these 2 comparison groups. In new analyses, SET and CC had a significant protective effect against relapse as compared with PCA. There is evidence that SET's protective effect on relapse was related to reductions in family hassles, whereas there was not a direct impact of change in psychological distress on rates of relapse. Lower retention in PCA, perhaps caused by the lack of a directive component to PCA, may have put these women at greater risk for relapse. Whereas SET did not specifically address substance abuse, SET indirectly protected at-risk women from relapse through reductions in family hassles.

  18. Characterizing Variability of Modular Brain Connectivity with Constrained Principal Component Analysis

    PubMed Central

    Hirayama, Jun-ichiro; Hyvärinen, Aapo; Kiviniemi, Vesa; Kawanabe, Motoaki; Yamashita, Okito

    2016-01-01

    Characterizing the variability of resting-state functional brain connectivity across subjects and/or over time has recently attracted much attention. Principal component analysis (PCA) serves as a fundamental statistical technique for such analyses. However, performing PCA on high-dimensional connectivity matrices yields complicated “eigenconnectivity” patterns, for which systematic interpretation is a challenging issue. Here, we overcome this issue with a novel constrained PCA method for connectivity matrices by extending the idea of the previously proposed orthogonal connectivity factorization method. Our new method, modular connectivity factorization (MCF), explicitly introduces the modularity of brain networks as a parametric constraint on eigenconnectivity matrices. In particular, MCF analyzes the variability in both intra- and inter-module connectivities, simultaneously finding network modules in a principled, data-driven manner. The parametric constraint provides a compact module-based visualization scheme with which the result can be intuitively interpreted. We develop an optimization algorithm to solve the constrained PCA problem and validate our method in simulation studies and with a resting-state functional connectivity MRI dataset of 986 subjects. The results show that the proposed MCF method successfully reveals the underlying modular eigenconnectivity patterns in more general situations and is a promising alternative to existing methods. PMID:28002474

  19. Evaluation of Staining-Dependent Colour Changes in Resin Composites Using Principal Component Analysis

    PubMed Central

    Manojlovic, D.; Lenhardt, L.; Milićević, B.; Antonov, M.; Miletic, V.; Dramićanin, M. D.

    2015-01-01

    Colour changes in Gradia Direct™ composite after immersion in tea, coffee, red wine, Coca-Cola, Colgate mouthwash, and distilled water were evaluated using principal component analysis (PCA) and the CIELAB colour coordinates. The reflection spectra of the composites were used as input data for the PCA. The output data (scores and loadings) provided information about the magnitude and origin of the surface reflection changes after exposure to the staining solutions. The reflection spectra of the stained samples generally exhibited lower reflection in the blue spectral range, which was manifested in the lower content of the blue shade for the samples. Both analyses demonstrated the high staining abilities of tea, coffee, and red wine, which produced total colour changes of 4.31, 6.61, and 6.22, respectively, according to the CIELAB analysis. PCA revealed subtle changes in the reflection spectra of composites immersed in Coca-Cola, demonstrating Coca-Cola’s ability to stain the composite to a small degree. PMID:26450008

  20. Evaluation of Staining-Dependent Colour Changes in Resin Composites Using Principal Component Analysis.

    PubMed

    Manojlovic, D; Lenhardt, L; Milićević, B; Antonov, M; Miletic, V; Dramićanin, M D

    2015-10-09

    Colour changes in Gradia Direct™ composite after immersion in tea, coffee, red wine, Coca-Cola, Colgate mouthwash, and distilled water were evaluated using principal component analysis (PCA) and the CIELAB colour coordinates. The reflection spectra of the composites were used as input data for the PCA. The output data (scores and loadings) provided information about the magnitude and origin of the surface reflection changes after exposure to the staining solutions. The reflection spectra of the stained samples generally exhibited lower reflection in the blue spectral range, which was manifested in the lower content of the blue shade for the samples. Both analyses demonstrated the high staining abilities of tea, coffee, and red wine, which produced total colour changes of 4.31, 6.61, and 6.22, respectively, according to the CIELAB analysis. PCA revealed subtle changes in the reflection spectra of composites immersed in Coca-Cola, demonstrating Coca-Cola's ability to stain the composite to a small degree.

  1. Principal component and spatial correlation analysis of spectroscopic-imaging data in scanning probe microscopy.

    PubMed

    Jesse, Stephen; Kalinin, Sergei V

    2009-02-25

    An approach for the analysis of multi-dimensional, spectroscopic-imaging data based on principal component analysis (PCA) is explored. PCA selects and ranks relevant response components based on variance within the data. It is shown that for examples with small relative variations between spectra, the first few PCA components closely coincide with results obtained using model fitting, and this is achieved at rates approximately four orders of magnitude faster. For cases with strong response variations, PCA allows an effective approach to rapidly process, de-noise, and compress data. The prospects for PCA combined with correlation function analysis of component maps as a universal tool for data analysis and representation in microscopy are discussed.

  2. Rolling Bearing Fault Diagnosis Based on an Improved HTT Transform

    PubMed Central

    Tang, Guiji; Tian, Tian; Zhou, Chong

    2018-01-01

    When rolling bearing failure occurs, vibration signals generally contain different signal components, such as impulsive fault feature signals, background noise and harmonic interference signals. One of the most challenging aspects of rolling bearing fault diagnosis is how to inhibit noise and harmonic interference signals, while enhancing impulsive fault feature signals. This paper presents a novel bearing fault diagnosis method, namely an improved Hilbert time–time (IHTT) transform, by combining a Hilbert time–time (HTT) transform with principal component analysis (PCA). Firstly, the HTT transform was performed on vibration signals to derive a HTT transform matrix. Then, PCA was employed to de-noise the HTT transform matrix in order to improve the robustness of the HTT transform. Finally, the diagonal time series of the de-noised HTT transform matrix was extracted as the enhanced impulsive fault feature signal and the contained fault characteristic information was identified through further analyses of amplitude and envelope spectrums. Both simulated and experimental analyses validated the superiority of the presented method for detecting bearing failures. PMID:29662013

  3. Individual differences in voluntary alcohol intake in rats: relationship with impulsivity, decision making and Pavlovian conditioned approach.

    PubMed

    Spoelder, Marcia; Flores Dourojeanni, Jacques P; de Git, Kathy C G; Baars, Annemarie M; Lesscher, Heidi M B; Vanderschuren, Louk J M J

    2017-07-01

    Alcohol use disorder (AUD) has been associated with suboptimal decision making, exaggerated impulsivity, and aberrant responses to reward-paired cues, but the relationship between AUD and these behaviors is incompletely understood. This study aims to assess decision making, impulsivity, and Pavlovian-conditioned approach in rats that voluntarily consume low (LD) or high (HD) amounts of alcohol. LD and HD were tested in the rat gambling task (rGT) or the delayed reward task (DRT). Next, the effect of alcohol (0-1.0 g/kg) was tested in these tasks. Pavlovian-conditioned approach (PCA) was assessed both prior to and after intermittent alcohol access (IAA). Principal component analyses were performed to identify relationships between the most important behavioral parameters. HD showed more optimal decision making in the rGT. In the DRT, HD transiently showed reduced impulsive choice. In both LD and HD, alcohol treatment increased optimal decision making in the rGT and increased impulsive choice in the DRT. PCA prior to and after IAA was comparable for LD and HD. When PCA was tested after IAA only, HD showed a more sign-tracking behavior. The principal component analyses indicated dimensional relationships between alcohol intake, impulsivity, and sign-tracking behavior in the PCA task after IAA. HD showed a more efficient performance in the rGT and DRT. Moreover, alcohol consumption enhanced approach behavior to reward-predictive cues, but sign-tracking did not predict the level of alcohol consumption. Taken together, these findings suggest that high levels of voluntary alcohol intake are associated with enhanced cue- and reward-driven behavior.

  4. Nonlinear Principal Components Analysis: Introduction and Application

    ERIC Educational Resources Information Center

    Linting, Marielle; Meulman, Jacqueline J.; Groenen, Patrick J. F.; van der Koojj, Anita J.

    2007-01-01

    The authors provide a didactic treatment of nonlinear (categorical) principal components analysis (PCA). This method is the nonlinear equivalent of standard PCA and reduces the observed variables to a number of uncorrelated principal components. The most important advantages of nonlinear over linear PCA are that it incorporates nominal and ordinal…

  5. GO-PCA: An Unsupervised Method to Explore Gene Expression Data Using Prior Knowledge

    PubMed Central

    Wagner, Florian

    2015-01-01

    Method Genome-wide expression profiling is a widely used approach for characterizing heterogeneous populations of cells, tissues, biopsies, or other biological specimen. The exploratory analysis of such data typically relies on generic unsupervised methods, e.g. principal component analysis (PCA) or hierarchical clustering. However, generic methods fail to exploit prior knowledge about the molecular functions of genes. Here, I introduce GO-PCA, an unsupervised method that combines PCA with nonparametric GO enrichment analysis, in order to systematically search for sets of genes that are both strongly correlated and closely functionally related. These gene sets are then used to automatically generate expression signatures with functional labels, which collectively aim to provide a readily interpretable representation of biologically relevant similarities and differences. The robustness of the results obtained can be assessed by bootstrapping. Results I first applied GO-PCA to datasets containing diverse hematopoietic cell types from human and mouse, respectively. In both cases, GO-PCA generated a small number of signatures that represented the majority of lineages present, and whose labels reflected their respective biological characteristics. I then applied GO-PCA to human glioblastoma (GBM) data, and recovered signatures associated with four out of five previously defined GBM subtypes. My results demonstrate that GO-PCA is a powerful and versatile exploratory method that reduces an expression matrix containing thousands of genes to a much smaller set of interpretable signatures. In this way, GO-PCA aims to facilitate hypothesis generation, design of further analyses, and functional comparisons across datasets. PMID:26575370

  6. GO-PCA: An Unsupervised Method to Explore Gene Expression Data Using Prior Knowledge.

    PubMed

    Wagner, Florian

    2015-01-01

    Genome-wide expression profiling is a widely used approach for characterizing heterogeneous populations of cells, tissues, biopsies, or other biological specimen. The exploratory analysis of such data typically relies on generic unsupervised methods, e.g. principal component analysis (PCA) or hierarchical clustering. However, generic methods fail to exploit prior knowledge about the molecular functions of genes. Here, I introduce GO-PCA, an unsupervised method that combines PCA with nonparametric GO enrichment analysis, in order to systematically search for sets of genes that are both strongly correlated and closely functionally related. These gene sets are then used to automatically generate expression signatures with functional labels, which collectively aim to provide a readily interpretable representation of biologically relevant similarities and differences. The robustness of the results obtained can be assessed by bootstrapping. I first applied GO-PCA to datasets containing diverse hematopoietic cell types from human and mouse, respectively. In both cases, GO-PCA generated a small number of signatures that represented the majority of lineages present, and whose labels reflected their respective biological characteristics. I then applied GO-PCA to human glioblastoma (GBM) data, and recovered signatures associated with four out of five previously defined GBM subtypes. My results demonstrate that GO-PCA is a powerful and versatile exploratory method that reduces an expression matrix containing thousands of genes to a much smaller set of interpretable signatures. In this way, GO-PCA aims to facilitate hypothesis generation, design of further analyses, and functional comparisons across datasets.

  7. Tracing and separating plasma components causing matrix effects in hydrophilic interaction chromatography-electrospray ionization mass spectrometry.

    PubMed

    Ekdahl, Anja; Johansson, Maria C; Ahnoff, Martin

    2013-04-01

    Matrix effects on electrospray ionization were investigated for plasma samples analysed by hydrophilic interaction chromatography (HILIC) in gradient elution mode, and HILIC columns of different chemistries were tested for separation of plasma components and model analytes. By combining mass spectral data with post-column infusion traces, the following components of protein-precipitated plasma were identified and found to have significant effect on ionization: urea, creatinine, phosphocholine, lysophosphocholine, sphingomyelin, sodium ion, chloride ion, choline and proline betaine. The observed effect on ionization was both matrix-component and analyte dependent. The separation of identified plasma components and model analytes on eight columns was compared, using pair-wise linear correlation analysis and principal component analysis (PCA). Large changes in selectivity could be obtained by change of column, while smaller changes were seen when the mobile phase buffer was changed from ammonium formate pH 3.0 to ammonium acetate pH 4.5. While results from PCA and linear correlation analysis were largely in accord, linear correlation analysis was judged to be more straight-forward in terms of conduction and interpretation.

  8. Principal component analysis of PiB distribution in Parkinson and Alzheimer diseases

    PubMed Central

    Markham, Joanne; Flores, Hubert; Hartlein, Johanna M.; Goate, Alison M.; Cairns, Nigel J.; Videen, Tom O.; Perlmutter, Joel S.

    2013-01-01

    Objective: To use principal component analyses (PCA) of Pittsburgh compound B (PiB) PET imaging to determine whether the pattern of in vivo β-amyloid (Aβ) in Parkinson disease (PD) with cognitive impairment is similar to the pattern found in symptomatic Alzheimer disease (AD). Methods: PiB PET scans were obtained from participants with PD with cognitive impairment (n = 53), participants with symptomatic AD (n = 35), and age-matched controls (n = 67). All were assessed using the Clinical Dementia Rating and APOE genotype was determined in 137 participants. PCA was used to 1) determine the PiB binding pattern in AD, 2) determine a possible unique PD pattern, and 3) directly compare the PiB binding patterns in PD and AD groups. Results: The first 2 principal components (PC1 and PC2) significantly separated the AD and control participants (p < 0.001). Participants with PD with cognitive impairment also were significantly different from participants with symptomatic AD on both components (p < 0.001). However, there was no difference between PD and controls on either component. Even those participants with PD with elevated mean cortical binding potentials were significantly different from participants with AD on both components. Conclusion: Using PCA, we demonstrated that participants with PD with cognitive impairment do not exhibit the same PiB binding pattern as participants with AD. These data suggest that Aβ deposition may play a different pathophysiologic role in the cognitive impairment of PD compared to that in AD. PMID:23825179

  9. Principal component analysis for the early detection of mastitis and lameness in dairy cows.

    PubMed

    Miekley, Bettina; Traulsen, Imke; Krieter, Joachim

    2013-08-01

    This investigation analysed the applicability of principal component analysis (PCA), a latent variable method, for the early detection of mastitis and lameness. Data used were recorded on the Karkendamm dairy research farm between August 2008 and December 2010. For mastitis and lameness detection, data of 338 and 315 cows in their first 200 d in milk were analysed, respectively. Mastitis as well as lameness were specified according to veterinary treatments. Diseases were defined as disease blocks. The different definitions used (two for mastitis, three for lameness) varied solely in the sequence length of the blocks. Only the days before the treatment were included in the blocks. Milk electrical conductivity, milk yield and feeding patterns (feed intake, number of feeding visits and time at the trough) were used for recognition of mastitis. Pedometer activity and feeding patterns were utilised for lameness detection. To develop and verify the PCA model, the mastitis and the lameness datasets were divided into training and test datasets. PCA extracted uncorrelated principle components (PC) by linear transformations of the raw data so that the first few PCs captured most of the variations in the original dataset. For process monitoring and disease detection, these resulting PCs were applied to the Hotelling's T 2 chart and to the residual control chart. The results show that block sensitivity of mastitis detection ranged from 77·4 to 83·3%, whilst specificity was around 76·7%. The error rates were around 98·9%. For lameness detection, the block sensitivity ranged from 73·8 to 87·8% while the obtained specificities were between 54·8 and 61·9%. The error rates varied from 87·8 to 89·2%. In conclusion, PCA seems to be not yet transferable into practical usage. Results could probably be improved if different traits and more informative sensor data are included in the analysis.

  10. A perspective on two chemometrics tools: PCA and MCR, and introduction of a new one: Pattern recognition entropy (PRE), as applied to XPS and ToF-SIMS depth profiles of organic and inorganic materials

    NASA Astrophysics Data System (ADS)

    Chatterjee, Shiladitya; Singh, Bhupinder; Diwan, Anubhav; Lee, Zheng Rong; Engelhard, Mark H.; Terry, Jeff; Tolley, H. Dennis; Gallagher, Neal B.; Linford, Matthew R.

    2018-03-01

    X-ray photoelectron spectroscopy (XPS) and time-of-flight secondary ion mass spectrometry (ToF-SIMS) are much used analytical techniques that provide information about the outermost atomic and molecular layers of materials. In this work, we discuss the application of multivariate spectral techniques, including principal component analysis (PCA) and multivariate curve resolution (MCR), to the analysis of XPS and ToF-SIMS depth profiles. Multivariate analyses often provide insight into data sets that is not easily obtained in a univariate fashion. Pattern recognition entropy (PRE), which has its roots in Shannon's information theory, is also introduced. This approach is not the same as the mutual information/entropy approaches sometimes used in data processing. A discussion of the theory of each technique is presented. PCA, MCR, and PRE are applied to four different data sets obtained from: a ToF-SIMS depth profile through ca. 100 nm of plasma polymerized C3F6 on Si, a ToF-SIMS depth profile through ca. 100 nm of plasma polymerized PNIPAM (poly (N-isopropylacrylamide)) on Si, an XPS depth profile through a film of SiO2 on Si, and an XPS depth profile through a film of Ta2O5 on Ta. PCA, MCR, and PRE reveal the presence of interfaces in the films, and often indicate that the first few scans in the depth profiles are different from those that follow. PRE and backward difference PRE provide this information in a straightforward fashion. Rises in the PRE signals at interfaces suggest greater complexity to the corresponding spectra. Results from PCA, especially for the higher principal components, were sometimes difficult to understand. MCR analyses were generally more interpretable.

  11. Genome-wide divergence and linkage disequilibrium analyses for Capsicum baccatum revealed by genome-anchored single nucleotide polymorphisms

    USDA-ARS?s Scientific Manuscript database

    Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to show the distribution of these 2 important incompatible cultivated pepper species. Estimated mean nucleotide...

  12. Morningness-eveningness and depressive symptoms: Test on the components level with CES-D in Polish students.

    PubMed

    Jankowski, Konrad S

    2016-05-15

    The study aimed to elucidate previously observed associations between morningness-eveningness and depressive symptomatology in university students. Relations between components of depressive symptomatology and morningness-eveningness were analysed. Nine hundred and seventy-four university students completed Polish versions of the Centre for Epidemiological Studies - Depression scale (CES-D; Polish translation appended to this paper) and the Composite Scale of Morningness. Principal component analysis (PCA) was used to test the structure of depressive symptoms. Pearson and partial correlations (with age and sex controlled), along with regression analyses with morning affect (MA) and circadian preference as predictors, were used. PCA revealed three components of depressive symptoms: depressed/somatic affect, positive affect, interpersonal relations. Greater MA was related to less depressive symptoms in three components. Morning circadian preference was related to less depressive symptoms in depressed/somatic and positive affects and unrelated to interpersonal relations. Both morningness-eveningness components exhibited stronger links with depressed/somatic and positive affects than with interpersonal relations. Three CES-D components exhibited stronger links with MA than with circadian preference. In regression analyses only MA was statistically significant for positive affect and better interpersonal relations, whereas more depressed/somatic affect was predicted by lower MA and morning circadian preference (relationship reversed compared to correlations). Self-report assessment. There are three groups of depressive symptoms in Polish university students. Associations of MA with depressed/somatic and positive affects are primarily responsible for the observed links between morningness-eveningness and depressive symptoms in university students. People with evening circadian preference whose MA is not lowered have less depressed/somatic affect. Copyright © 2016 Elsevier B.V. All rights reserved.

  13. Principal component analysis of dietary and lifestyle patterns in relation to risk of subtypes of esophageal and gastric cancer

    PubMed Central

    Silvera, Stephanie A. Navarro; Mayne, Susan T; Risch, Harvey A.; Gammon, Marilie D; Vaughan, Thomas; Chow, Wong-Ho; Dubin, Joel A; Dubrow, Robert; Schoenberg, Janet; Stanford, Janet L; West, A. Brian; Rotterdam, Heidrun; Blot, William J

    2011-01-01

    Purpose To perform pattern analyses of dietary and lifestyle factors in relation to risk of esophageal and gastric cancers. Methods We evaluated risk factors for esophageal adenocarcinoma (EA), esophageal squamous cell carcinoma (ESCC), gastric cardia adenocarcinoma (GCA), and other gastric cancers (OGA) using data from a population-based case-control study conducted in Connecticut, New Jersey, and western Washington state. Dietary/lifestyle patterns were created using principal component analysis (PCA). Impact of the resultant scores on cancer risk was estimated through logistic regression. Results PCA identified six patterns: meat/nitrite, fruit/vegetable, smoking/alcohol, legume/meat alternate, GERD/BMI, and fish/vitamin C. Risk of each cancer under study increased with rising meat/nitrite score. Risk of EA increased with increasing GERD/BMI score, and risk of ESCC rose with increasing smoking/alcohol score and decreasing GERD/BMI score. Fruit/vegetable scores were inversely associated with EA, ESCC, and GCA. Conclusions PCA may provide a useful approach for summarizing extensive dietary/lifestyle data into fewer interpretable combinations that discriminate between cancer cases and controls. The analyses suggest that meat/nitrite intake is associated with elevated risk of each cancer under study, while fruit/vegetable intake reduces risk of EA, ESCC, and GCA. GERD/obesity were confirmed as risk factors for EA and smoking/alcohol as risk factors for ESCC. PMID:21435900

  14. Liquid chromatography tandem mass spectrometry determination of chemical markers and principal component analysis of Vitex agnus-castus L. fruits (Verbenaceae) and derived food supplements.

    PubMed

    Mari, Angela; Montoro, Paola; Pizza, Cosimo; Piacente, Sonia

    2012-11-01

    A validated analytical method for the quantitative determination of seven chemical markers occurring in a hydroalcoholic extract of Vitex agnus-castus fruits by liquid chromatography electrospray triple quadrupole tandem mass spectrometry (LC/ESI/(QqQ)MSMS) is reported. To carry out a comparative study, five commercial food supplements corresponding to hydroalcoholic extracts of V. agnus-castus fruits were analysed under the same chromatographic conditions of the crude extract. Principal component analysis (PCA), based only on the variation of the amount of the seven chemical markers, was applied in order to find similarities between the hydroalcoholic extract and the food supplements. A second PCA analysis was carried out considering the whole spectroscopic data deriving from liquid chromatography electrospray linear ion trap mass spectrometry (LC/ESI/(LIT)MS) analysis. High similarity between the two PCA was observed, showing the possibility to select one of these two approaches for future applications in the field of comparative analysis of food supplements and quality control procedures. Copyright © 2012 Elsevier B.V. All rights reserved.

  15. A comparison of PCA/ICA for data preprocessing in remote sensing imagery classification

    NASA Astrophysics Data System (ADS)

    He, Hui; Yu, Xianchuan

    2005-10-01

    In this paper a performance comparison of a variety of data preprocessing algorithms in remote sensing image classification is presented. These selected algorithms are principal component analysis (PCA) and three different independent component analyses, ICA (Fast-ICA (Aapo Hyvarinen, 1999), Kernel-ICA (KCCA and KGV (Bach & Jordan, 2002), EFFICA (Aiyou Chen & Peter Bickel, 2003). These algorithms were applied to a remote sensing imagery (1600×1197), obtained from Shunyi, Beijing. For classification, a MLC method is used for the raw and preprocessed data. The results show that classification with the preprocessed data have more confident results than that with raw data and among the preprocessing algorithms, ICA algorithms improve on PCA and EFFICA performs better than the others. The convergence of these ICA algorithms (for data points more than a million) are also studied, the result shows EFFICA converges much faster than the others. Furthermore, because EFFICA is a one-step maximum likelihood estimate (MLE) which reaches asymptotic Fisher efficiency (EFFICA), it computers quite small so that its demand of memory come down greatly, which settled the "out of memory" problem occurred in the other algorithms.

  16. Local Geographic Variation of Public Services Inequality: Does the Neighborhood Scale Matter?

    PubMed Central

    Wei, Chunzhu; Cabrera-Barona, Pablo; Blaschke, Thomas

    2016-01-01

    This study aims to explore the effect of the neighborhood scale when estimating public services inequality based on the aggregation of social, environmental, and health-related indicators. Inequality analyses were carried out at three neighborhood scales: the original census blocks and two aggregated neighborhood units generated by the spatial “k”luster analysis by the tree edge removal (SKATER) algorithm and the self-organizing map (SOM) algorithm. Then, we combined a set of health-related public services indicators with the geographically weighted principal components analyses (GWPCA) and the principal components analyses (PCA) to measure the public services inequality across all multi-scale neighborhood units. Finally, a statistical test was applied to evaluate the scale effects in inequality measurements by combining all available field survey data. We chose Quito as the case study area. All of the aggregated neighborhood units performed better than the original census blocks in terms of the social indicators extracted from a field survey. The SKATER and SOM algorithms can help to define the neighborhoods in inequality analyses. Moreover, GWPCA performs better than PCA in multivariate spatial inequality estimation. Understanding the scale effects is essential to sustain a social neighborhood organization, which, in turn, positively affects social determinants of public health and public quality of life. PMID:27706072

  17. Exploring patterns enriched in a dataset with contrastive principal component analysis.

    PubMed

    Abid, Abubakar; Zhang, Martin J; Bagaria, Vivek K; Zou, James

    2018-05-30

    Visualization and exploration of high-dimensional data is a ubiquitous challenge across disciplines. Widely used techniques such as principal component analysis (PCA) aim to identify dominant trends in one dataset. However, in many settings we have datasets collected under different conditions, e.g., a treatment and a control experiment, and we are interested in visualizing and exploring patterns that are specific to one dataset. This paper proposes a method, contrastive principal component analysis (cPCA), which identifies low-dimensional structures that are enriched in a dataset relative to comparison data. In a wide variety of experiments, we demonstrate that cPCA with a background dataset enables us to visualize dataset-specific patterns missed by PCA and other standard methods. We further provide a geometric interpretation of cPCA and strong mathematical guarantees. An implementation of cPCA is publicly available, and can be used for exploratory data analysis in many applications where PCA is currently used.

  18. Free energy landscape of a biomolecule in dihedral principal component space: sampling convergence and correspondence between structures and minima.

    PubMed

    Maisuradze, Gia G; Leitner, David M

    2007-05-15

    Dihedral principal component analysis (dPCA) has recently been developed and shown to display complex features of the free energy landscape of a biomolecule that may be absent in the free energy landscape plotted in principal component space due to mixing of internal and overall rotational motion that can occur in principal component analysis (PCA) [Mu et al., Proteins: Struct Funct Bioinfo 2005;58:45-52]. Another difficulty in the implementation of PCA is sampling convergence, which we address here for both dPCA and PCA using a tetrapeptide as an example. We find that for both methods the sampling convergence can be reached over a similar time. Minima in the free energy landscape in the space of the two largest dihedral principal components often correspond to unique structures, though we also find some distinct minima to correspond to the same structure. 2007 Wiley-Liss, Inc.

  19. Spectral discrimination of bleached and healthy submerged corals based on principal components analysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Holden, H.; LeDrew, E.

    1997-06-01

    Remote discrimination of substrate types in relatively shallow coastal waters has been limited by the spatial and spectral resolution of available sensors. An additional limiting factor is the strong attenuating influence of the water column over the substrate. As a result, there have been limited attempts to map submerged ecosystems such as coral reefs based on spectral characteristics. Both healthy and bleached corals were measured at depth with a hand-held spectroradiometer, and their spectra compared. Two separate principal components analyses (PCA) were performed on two sets of spectral data. The PCA revealed that there is indeed a spectral difference basedmore » on health. In the first data set, the first component (healthy coral) explains 46.82%, while the second component (bleached coral) explains 46.35% of the variance. In the second data set, the first component (bleached coral) explained 46.99%; the second component (healthy coral) explained 36.55%; and the third component (healthy coral) explained 15.44 % of the total variance in the original data. These results are encouraging with respect to using an airborne spectroradiometer to identify areas of bleached corals thus enabling accurate monitoring over time.« less

  20. A structural investigation into the compaction behavior of pharmaceutical composites using powder X-ray diffraction and total scattering analysis.

    PubMed

    Moore, Michael D; Steinbach, Alison M; Buckner, Ira S; Wildfong, Peter L D

    2009-11-01

    To use advanced powder X-ray diffraction (PXRD) to characterize the structure of anhydrous theophylline following compaction, alone, and as part of a binary mixture with either alpha-lactose monohydrate or microcrystalline cellulose. Compacts formed from (1) pure theophylline and (2) each type of binary mixture were analyzed intact using PXRD. A novel mathematical technique was used to accurately separate multi-component diffraction patterns. The pair distribution function (PDF) of isolated theophylline diffraction data was employed to assess structural differences induced by consolidation and evaluated by principal components analysis (PCA). Changes induced in PXRD patterns by increasing compaction pressure were amplified by the PDF. Simulated data suggest PDF dampening is attributable to molecular deviations from average crystalline position. Samples compacted at different pressures were identified and differentiated using PCA. Samples compacted at common pressures exhibited similar inter-atomic correlations, where excipient concentration factored in the analyses involving lactose. Practical real-space structural analysis of PXRD data by PDF was accomplished for intact, compacted crystalline drug with and without excipient. PCA was used to compare multiple PDFs and successfully differentiated pattern changes consistent with compaction-induced disordering of theophylline as a single component and in the presence of another material.

  1. Interpretation of data on the aggregate composition of typical chernozems under different land use by cluster and principal component analyses

    NASA Astrophysics Data System (ADS)

    Kholodov, V. A.; Yaroslavtseva, N. V.; Lazarev, V. I.; Frid, A. S.

    2016-09-01

    Cluster analysis and principal component analysis (PCA) have been used for the interpretation of dry sieving data. Chernozems from the treatments of long-term field experiments with different land-use patterns— annually mowed steppe, continuous potato culture, permanent black fallow, and untilled fallow since 1998 after permanent black fallow—have been used. Analysis of dry sieving data by PCA has shown that the treatments of untilled fallow after black fallow and annually mowed steppe differ most in the series considered; the content of dry aggregates of 10-7 mm makes the largest contribution to the distribution of objects along the first principal component. This fraction has been sieved in water and analyzed by PCA. In contrast to dry sieving data, the wet sieving data showed the closest mathematical distance between the treatment of untilled fallow after black fallow and the undisturbed treatment of annually mowed steppe, while the untilled fallow after black fallow and the permanent black fallow were the most distant treatments. Thus, it may be suggested that the water stability of structure is first restored after the removal of destructive anthropogenic load. However, the restoration of the distribution of structural separates to the parameters characteristic of native soils is a significantly longer process.

  2. Neuronal substrates of Corsi Block span: Lesion symptom mapping analyses in relation to attentional competition and spatial bias.

    PubMed

    Chechlacz, Magdalena; Rotshtein, Pia; Humphreys, Glyn W

    2014-11-01

    Spatial working memory problems are frequently reported following brain damage within both left and right hemispheres but with the severity often being grater in individuals with right hemisphere lesions. Clinically, deficits in spatial working memory have also been noted in patients with visuospatial disorders such as unilateral neglect. Here, we examined neural substrates of short-term memory for spatial locations based on the Corsi Block tapping task and the relationship with the visuospatial deficits of neglect and extinction in a group of chronic neuropsychological patients. Principal Component Analysis (PCA) was used to distinguish shared and dissociate functional components. The neural substrates of spatial short-term memory deficits and the components identified by PCA were examined using whole brain voxel-based morphometry and tract-wise lesion deficits analyses. We found that bilateral lesions within occipital cortex (middle occipital gyrus) and right posterior parietal cortex, along with disconnection of the right parieto-temporal segment of arcuate fasciculus, were associated with low spatial memory span. A single component revealed by PCA accounted for over half of the variance and was linked to damage to right posterior brain regions (temporo-parietal junction, the inferior parietal lobule and middle temporal gyrus extending into middle occipital gyrus). We also found link to disconnections within several association pathways including the superior longitudinal fasciculus, arcuate fasciculus, inferior fronto-occipital fasciculus and inferior longitudinal fasciculus. These results indicate that different visuospatial deficits converge into a single component mapped within posterior parietal areas and fronto-parietal white matter pathways. Furthermore, the data presented here fit with the role of posterior parietal cortex/temporo-parietal junction in maintaining a map of salient locations in space, with Corsi Block performance being impaired when the spatial map is damaged. Copyright © 2014 Elsevier Ltd. All rights reserved.

  3. Empirical evaluation of grouping of lower urinary tract symptoms: principal component analysis of Tampere Ageing Male Urological Study data.

    PubMed

    Pöyhönen, Antti; Häkkinen, Jukka T; Koskimäki, Juha; Hakama, Matti; Tammela, Teuvo L J; Auvinen, Anssi

    2013-03-01

    WHAT'S KNOWN ON THE SUBJECT? AND WHAT DOES THE STUDY ADD?: The ICS has divided LUTS into three groups: storage, voiding and post-micturition symptoms. The classification is based on anatomical, physiological and urodynamic considerations of a theoretical nature. We used principal component analysis (PCA) to determine the inter-correlations of various LUTS, which is a novel approach to research and can strengthen existing knowledge of the phenomenology of LUTS. After we had completed our analyses, another study was published that used a similar approach and results were very similar to those of the present study. We evaluated the constellation of LUTS using PCA of the data from a population-based study that included >4000 men. In our analysis, three components emerged from the 12 LUTS: voiding, storage and incontinence components. Our results indicated that incontinence may be separate from the other storage symptoms and post-micturition symptoms should perhaps be regarded as voiding symptoms. To determine how lower urinary tract symptoms (LUTS) relate to each other and assess if the classification proposed by the International Continence Society (ICS) is consistent with empirical findings. The information on urinary symptoms for this population-based study was collected using a self-administered postal questionnaire in 2004. The questionnaire was sent to 7470 men, aged 30-80 years, from Pirkanmaa County (Finland), of whom 4384 (58.7%) returned the questionnaire. The Danish Prostatic Symptom Score-1 questionnaire was used to evaluate urinary symptoms. Principal component analysis (PCA) was used to evaluate the inter-correlations among various urinary symptoms. The PCA produced a grouping of 12 LUTS into three categories consisting of voiding, storage and incontinence symptoms. Post-micturition symptoms were related to voiding symptoms, but incontinence symptoms were separate from storage symptoms. In the analyses by age group, similar categorization was found at ages 40, 50, 60 and 80 years, but only two groups of symptoms emerged among men aged 70 years. The prevalence among men aged 30 was too low for meaningful analysis. This population-based study suggests that LUTS can be divided into three subgroups consisting of voiding, storage and incontinence symptoms based on their inter-correlations. Our empirical findings suggest an alternative grouping of LUTS. The potential utility of such an approach requires careful consideration. © 2012 BJU International.

  4. Motivation for HPV Vaccination Among Young Adult Men: Validation of TTM Decisional Balance and Self-Efficacy Constructs.

    PubMed

    Fernandez, Anne C; Amoyal, Nicole R; Paiva, Andrea L; Prochaska, James O

    2016-01-01

    In the United States, 36% of human papillomavirus (HPV)-related cancers occur among men. HPV vaccination can substantially reduce the risk of HPV infection; however, the vast majority of men are unvaccinated. This study developed and validated transtheoretical model-based measures for HPV vaccination in young adult men. Cross-sectional measurement development. Online survey of young adult men. Three hundred twenty-nine mostly college-attending men, ages 18 to 26. Stage of change, decisional balance (pros/cons), and self-efficacy. The sample was randomly split into halves for exploratory principal components analysis (PCA), followed by confirmatory factor analyses (CFA) to test measurement models. Multivariate analyses examined relationships between scales. For decisional balance, PCA revealed two uncorrelated five-item factors (pros α = .78; cons α = .83). For the self-efficacy scale, PCA revealed a single-factor solution (α = .83). CFA confirmed that the two-factor uncorrelated model for decisional balance and a single-factor model for self-efficacy. Follow-up analyses of variance supported the theoretically predicted relationships between stage of change, pros, and self-efficacy. This study resulted in reliable and valid measures of pros and self-efficacy for HPV vaccination that can be used in future clinical research.

  5. Analysis of Soccer Players’ Positional Variability During the 2012 UEFA European Championship: A Case Study

    PubMed Central

    Moura, Felipe Arruda; Santana, Juliana Exel; Vieira, Nathália Arnosti; Santiago, Paulo Roberto Pereira; Cunha, Sergio Augusto

    2015-01-01

    The purpose of this study was to analyse players’ positional variability during the 2012 UEFA European Championship by applying principal component analysis (PCA) to data gathered from heat maps posted on the UEFA website. We analysed the teams that reached the finals and semi-finals of the competition. The players’ 2D coordinates from each match were obtained by applying an image-processing algorithm to the heat maps. With all the players’ 2D coordinates for each match, we applied PCA to identify the directions of greatest variability. Then, two orthogonal segments were centred on each player’s mean position for all matches. The segments’ directions were driven by the eigenvectors of the PCA, and the length of each segment was defined as one standard deviation around the mean. Finally, an ellipse was circumscribed around both segments. To represent player variability, segment lengths and elliptical areas were analysed. The results demonstrate that Portugal exhibited the lowest variability, followed by Germany, Spain and Italy. Additionally, a graphical representation of every player’s ellipse provided insight into the teams’ organisational features throughout the competition. The presented study provides important information regarding soccer teams’ tactical strategy in high-level championships that allows coaches to better control team organisation on the pitch. PMID:26557206

  6. Analysis of the principal component algorithm in phase-shifting interferometry.

    PubMed

    Vargas, J; Quiroga, J Antonio; Belenguer, T

    2011-06-15

    We recently presented a new asynchronous demodulation method for phase-sampling interferometry. The method is based in the principal component analysis (PCA) technique. In the former work, the PCA method was derived heuristically. In this work, we present an in-depth analysis of the PCA demodulation method.

  7. Y-chromosome phylogeographic analysis of the Greek-Cypriot population reveals elements consistent with Neolithic and Bronze Age settlements.

    PubMed

    Voskarides, Konstantinos; Mazières, Stéphane; Hadjipanagi, Despina; Di Cristofaro, Julie; Ignatiou, Anastasia; Stefanou, Charalambos; King, Roy J; Underhill, Peter A; Chiaroni, Jacques; Deltas, Constantinos

    2016-01-01

    The archeological record indicates that the permanent settlement of Cyprus began with pioneering agriculturalists circa 11,000 years before present, (ca. 11,000 y BP). Subsequent colonization events followed, some recognized regionally. Here, we assess the Y-chromosome structure of Cyprus in context to regional populations and correlate it to phases of prehistoric colonization. Analysis of haplotypes from 574 samples showed that island-wide substructure was barely significant in a spatial analysis of molecular variance (SAMOVA). However, analyses of molecular variance (AMOVA) of haplogroups using 92 binary markers genotyped in 629 Cypriots revealed that the proportion of variance among the districts was irregularly distributed. Principal component analysis (PCA) revealed potential genetic associations of Greek-Cypriots with neighbor populations. Contrasting haplogroups in the PCA were used as surrogates of parental populations. Admixture analyses suggested that the majority of G2a-P15 and R1b-M269 components were contributed by Anatolia and Levant sources, respectively, while Greece Balkans supplied the majority of E-V13 and J2a-M67. Haplotype-based expansion times were at historical levels suggestive of recent demography. Analyses of Cypriot haplogroup data are consistent with two stages of prehistoric settlement. E-V13 and E-M34 are widespread, and PCA suggests sourcing them to the Balkans and Levant/Anatolia, respectively. The persistent pre-Greek component is represented by elements of G2-U5(xL30) haplogroups: U5*, PF3147, and L293. J2b-M205 may contribute also to the pre-Greek strata. The majority of R1b-Z2105 lineages occur in both the westernmost and easternmost districts. Distinctively, sub-haplogroup R1b- M589 occurs only in the east. The absence of R1b- M589 lineages in Crete and the Balkans and the presence in Asia Minor are compatible with Late Bronze Age influences from Anatolia rather than from Mycenaean Greeks.

  8. Multivariate statistical analysis of the hydrogeochemical and isotopic composition of the groundwater resources in northeastern Peloponnesus (Greece).

    PubMed

    Matiatos, Ioannis; Alexopoulos, Apostolos; Godelitsas, Athanasios

    2014-04-01

    The present study involves an integration of the hydrogeological, hydrochemical and isotopic (both stable and radiogenic) data of the groundwater samples taken from aquifers occurring in the region of northeastern Peloponnesus. Special emphasis has been given to health-related ions and isotopes in relation to the WHO and USEPA guidelines, to highlight the concentrations of compounds (e.g., As and Ba) exceeding the drinking water thresholds. Multivariate statistical analyses, i.e. two principal component analyses (PCA) and one discriminant analysis (DA), combined with conventional hydrochemical methodologies, were applied, with the aim to interpret the spatial variations in the groundwater quality and to identify the main hydrogeochemical factors and human activities responsible for the high ion concentrations and isotopic content in the groundwater analysed. The first PCA resulted in a three component model, which explained approximately 82% of the total variance of the data sets and enabled the identification of the hydrogeological processes responsible for the isotopic content i.e., δ(18)Ο, tritium and (222)Rn. The second PCA, involving the trace element presence in the water samples, revealed a four component model, which explained approximately 89% of the total variance of the data sets, giving more insight into the geochemical and anthropogenic controls on the groundwater composition (e.g., water-rock interaction, hydrothermal activity and agricultural activities). Using discriminant analysis, a four parameter (δ(18)O, (Ca+Mg)/(HCO3+SO4), EC and Cl) discriminant function concerning the (222)Rn content was derived, which favoured a classification of the samples according to the concentration of (222)Rn as (222)Rn-safe (<11 Bq·L(-1)) and (222)Rn-contaminated (>11 Bq·L(-1)). The selection of radon builds on the fact that this radiogenic isotope has been generally related to increased health risk when consumed. Copyright © 2014 Elsevier B.V. All rights reserved.

  9. PCA-LBG-based algorithms for VQ codebook generation

    NASA Astrophysics Data System (ADS)

    Tsai, Jinn-Tsong; Yang, Po-Yuan

    2015-04-01

    Vector quantisation (VQ) codebooks are generated by combining principal component analysis (PCA) algorithms with Linde-Buzo-Gray (LBG) algorithms. All training vectors are grouped according to the projected values of the principal components. The PCA-LBG-based algorithms include (1) PCA-LBG-Median, which selects the median vector of each group, (2) PCA-LBG-Centroid, which adopts the centroid vector of each group, and (3) PCA-LBG-Random, which randomly selects a vector of each group. The LBG algorithm finds a codebook based on the better vectors sent to an initial codebook by the PCA. The PCA performs an orthogonal transformation to convert a set of potentially correlated variables into a set of variables that are not linearly correlated. Because the orthogonal transformation efficiently distinguishes test image vectors, the proposed PCA-LBG-based algorithm is expected to outperform conventional algorithms in designing VQ codebooks. The experimental results confirm that the proposed PCA-LBG-based algorithms indeed obtain better results compared to existing methods reported in the literature.

  10. Use of principal components analysis and protein microarray to explore the association of HIV-1-specific IgG responses with disease progression.

    PubMed

    Gerns Storey, Helen L; Richardson, Barbra A; Singa, Benson; Naulikha, Jackie; Prindle, Vivian C; Diaz-Ochoa, Vladimir E; Felgner, Phil L; Camerini, David; Horton, Helen; John-Stewart, Grace; Walson, Judd L

    2014-01-01

    The role of HIV-1-specific antibody responses in HIV disease progression is complex and would benefit from analysis techniques that examine clusterings of responses. Protein microarray platforms facilitate the simultaneous evaluation of numerous protein-specific antibody responses, though excessive data are cumbersome in analyses. Principal components analysis (PCA) reduces data dimensionality by generating fewer composite variables that maximally account for variance in a dataset. To identify clusters of antibody responses involved in disease control, we investigated the association of HIV-1-specific antibody responses by protein microarray, and assessed their association with disease progression using PCA in a nested cohort design. Associations observed among collections of antibody responses paralleled protein-specific responses. At baseline, greater antibody responses to the transmembrane glycoprotein (TM) and reverse transcriptase (RT) were associated with higher viral loads, while responses to the surface glycoprotein (SU), capsid (CA), matrix (MA), and integrase (IN) proteins were associated with lower viral loads. Over 12 months greater antibody responses were associated with smaller decreases in CD4 count (CA, MA, IN), and reduced likelihood of disease progression (CA, IN). PCA and protein microarray analyses highlighted a collection of HIV-specific antibody responses that together were associated with reduced disease progression, and may not have been identified by examining individual antibody responses. This technique may be useful to explore multifaceted host-disease interactions, such as HIV coinfections.

  11. PCA as a practical indicator of OPLS-DA model reliability.

    PubMed

    Worley, Bradley; Powers, Robert

    Principal Component Analysis (PCA) and Orthogonal Projections to Latent Structures Discriminant Analysis (OPLS-DA) are powerful statistical modeling tools that provide insights into separations between experimental groups based on high-dimensional spectral measurements from NMR, MS or other analytical instrumentation. However, when used without validation, these tools may lead investigators to statistically unreliable conclusions. This danger is especially real for Partial Least Squares (PLS) and OPLS, which aggressively force separations between experimental groups. As a result, OPLS-DA is often used as an alternative method when PCA fails to expose group separation, but this practice is highly dangerous. Without rigorous validation, OPLS-DA can easily yield statistically unreliable group separation. A Monte Carlo analysis of PCA group separations and OPLS-DA cross-validation metrics was performed on NMR datasets with statistically significant separations in scores-space. A linearly increasing amount of Gaussian noise was added to each data matrix followed by the construction and validation of PCA and OPLS-DA models. With increasing added noise, the PCA scores-space distance between groups rapidly decreased and the OPLS-DA cross-validation statistics simultaneously deteriorated. A decrease in correlation between the estimated loadings (added noise) and the true (original) loadings was also observed. While the validity of the OPLS-DA model diminished with increasing added noise, the group separation in scores-space remained basically unaffected. Supported by the results of Monte Carlo analyses of PCA group separations and OPLS-DA cross-validation metrics, we provide practical guidelines and cross-validatory recommendations for reliable inference from PCA and OPLS-DA models.

  12. Stability of Nonlinear Principal Components Analysis: An Empirical Study Using the Balanced Bootstrap

    ERIC Educational Resources Information Center

    Linting, Marielle; Meulman, Jacqueline J.; Groenen, Patrick J. F.; van der Kooij, Anita J.

    2007-01-01

    Principal components analysis (PCA) is used to explore the structure of data sets containing linearly related numeric variables. Alternatively, nonlinear PCA can handle possibly nonlinearly related numeric as well as nonnumeric variables. For linear PCA, the stability of its solution can be established under the assumption of multivariate…

  13. Low-Dimensional Feature Representation for Instrument Identification

    NASA Astrophysics Data System (ADS)

    Ihara, Mizuki; Maeda, Shin-Ichi; Ikeda, Kazushi; Ishii, Shin

    For monophonic music instrument identification, various feature extraction and selection methods have been proposed. One of the issues toward instrument identification is that the same spectrum is not always observed even in the same instrument due to the difference of the recording condition. Therefore, it is important to find non-redundant instrument-specific features that maintain information essential for high-quality instrument identification to apply them to various instrumental music analyses. For such a dimensionality reduction method, the authors propose the utilization of linear projection methods: local Fisher discriminant analysis (LFDA) and LFDA combined with principal component analysis (PCA). After experimentally clarifying that raw power spectra are actually good for instrument classification, the authors reduced the feature dimensionality by LFDA or by PCA followed by LFDA (PCA-LFDA). The reduced features achieved reasonably high identification performance that was comparable or higher than those by the power spectra and those achieved by other existing studies. These results demonstrated that our LFDA and PCA-LFDA can successfully extract low-dimensional instrument features that maintain the characteristic information of the instruments.

  14. Free-energy landscape, principal component analysis, and structural clustering to identify representative conformations from molecular dynamics simulations: the myoglobin case.

    PubMed

    Papaleo, Elena; Mereghetti, Paolo; Fantucci, Piercarlo; Grandori, Rita; De Gioia, Luca

    2009-01-01

    Several molecular dynamics (MD) simulations were used to sample conformations in the neighborhood of the native structure of holo-myoglobin (holo-Mb), collecting trajectories spanning 0.22 micros at 300 K. Principal component (PCA) and free-energy landscape (FEL) analyses, integrated by cluster analysis, which was performed considering the position and structures of the individual helices of the globin fold, were carried out. The coherence between the different structural clusters and the basins of the FEL, together with the convergence of parameters derived by PCA indicates that an accurate description of the Mb conformational space around the native state was achieved by multiple MD trajectories spanning at least 0.14 micros. The integration of FEL, PCA, and structural clustering was shown to be a very useful approach to gain an overall view of the conformational landscape accessible to a protein and to identify representative protein substates. This method could be also used to investigate the conformational and dynamical properties of Mb apo-, mutant, or delete versions, in which greater conformational variability is expected and, therefore identification of representative substates from the simulations is relevant to disclose structure-function relationship.

  15. Determination of butter adulteration with margarine using Raman spectroscopy.

    PubMed

    Uysal, Reyhan Selin; Boyaci, Ismail Hakki; Genis, Hüseyin Efe; Tamer, Ugur

    2013-12-15

    In this study, adulteration of butter with margarine was analysed using Raman spectroscopy combined with chemometric methods (principal component analysis (PCA), principal component regression (PCR), partial least squares (PLS)) and artificial neural networks (ANNs). Different butter and margarine samples were mixed at various concentrations ranging from 0% to 100% w/w. PCA analysis was applied for the classification of butters, margarines and mixtures. PCR, PLS and ANN were used for the detection of adulteration ratios of butter. Models were created using a calibration data set and developed models were evaluated using a validation data set. The coefficient of determination (R(2)) values between actual and predicted values obtained for PCR, PLS and ANN for the validation data set were 0.968, 0.987 and 0.978, respectively. In conclusion, a combination of Raman spectroscopy with chemometrics and ANN methods can be applied for testing butter adulteration. Copyright © 2013 Elsevier Ltd. All rights reserved.

  16. 2L-PCA: a two-level principal component analyzer for quantitative drug design and its applications.

    PubMed

    Du, Qi-Shi; Wang, Shu-Qing; Xie, Neng-Zhong; Wang, Qing-Yan; Huang, Ri-Bo; Chou, Kuo-Chen

    2017-09-19

    A two-level principal component predictor (2L-PCA) was proposed based on the principal component analysis (PCA) approach. It can be used to quantitatively analyze various compounds and peptides about their functions or potentials to become useful drugs. One level is for dealing with the physicochemical properties of drug molecules, while the other level is for dealing with their structural fragments. The predictor has the self-learning and feedback features to automatically improve its accuracy. It is anticipated that 2L-PCA will become a very useful tool for timely providing various useful clues during the process of drug development.

  17. Molecular Clustering Interrelationships and Carbohydrate Conformation in Hull and Seeds Among Barley Cultivars

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    N Liu; P Yu

    2011-12-31

    The objective of this study was to use molecular spectral analyses with the diffuse reflectance Fourier transform infrared spectroscopy (DRIFT) bioanlytical technique to study carbohydrate conformation features, molecular clustering and interrelationships in hull and seed among six barley cultivars (AC Metcalfe, CDC Dolly, McLeod, CDC Helgason, CDC Trey, CDC Cowboy), which had different degradation kinetics in rumen. The molecular structure spectral analyses in both hull and seed involved the fingerprint regions of ca. 1536-1484 cm{sup -1} (attributed mainly to aromatic lignin semicircle ring stretch), ca. 1293-1212 cm{sup -1} (attributed mainly to cellulosic compounds in the hull), ca. 1269-1217 cm{sup -1}more » (attributed mainly to cellulosic compound in the seeds), and ca. 1180-800 cm{sup -1} (attributed mainly to total CHO C-O stretching vibrations) together with an agglomerative hierarchical cluster (AHCA) and principal component spectral analyses (PCA). The results showed that the DRIFT technique plus AHCA and PCA molecular analyses were able to reveal carbohydrate conformation features and identify carbohydrate molecular structure differences in both hull and seeds among the barley varieties. The carbohydrate molecular spectral analyses at the region of ca. 1185-800 cm{sup -1} together with the AHCA and PCA were able to show that the barley seed inherent structures exhibited distinguishable differences among the barley varieties. CDC Helgason had differences from AC Metcalfe, MeLeod, CDC Cowboy and CDC Dolly in carbohydrate conformation in the seed. Clear molecular cluster classes could be distinguished and identified in AHCA analysis and the separate ellipses could be grouped in PCA analysis. But CDC Helgason had no distinguished differences from CDC Trey in carbohydrate conformation. These carbohydrate conformation/structure difference could partially explain why the varieties were different in digestive behaviors in animals. The molecular spectroscopy technique used in this study could also be used for other plant-based feed and food structure studies.« less

  18. Quantication and analysis of respiratory motion from 4D MRI

    NASA Astrophysics Data System (ADS)

    Aizzuddin Abd Rahni, Ashrani; Lewis, Emma; Wells, Kevin

    2014-11-01

    It is well known that respiratory motion affects image acquisition and also external beam radiotherapy (EBRT) treatment planning and delivery. However often the existing approaches for respiratory motion management are based on a generic view of respiratory motion such as the general movement of organ, tissue or fiducials. This paper thus aims to present a more in depth analysis of respiratory motion based on 4D MRI for further integration into motion correction in image acquisition or image based EBRT. Internal and external motion was first analysed separately, on a per-organ basis for internal motion. Principal component analysis (PCA) was then performed on the internal and external motion vectors separately and the relationship between the two PCA spaces was analysed. The motion extracted from 4D MRI on general was found to be consistent with what has been reported in literature.

  19. A PCA-Based method for determining craniofacial relationship and sexual dimorphism of facial shapes.

    PubMed

    Shui, Wuyang; Zhou, Mingquan; Maddock, Steve; He, Taiping; Wang, Xingce; Deng, Qingqiong

    2017-11-01

    Previous studies have used principal component analysis (PCA) to investigate the craniofacial relationship, as well as sex determination using facial factors. However, few studies have investigated the extent to which the choice of principal components (PCs) affects the analysis of craniofacial relationship and sexual dimorphism. In this paper, we propose a PCA-based method for visual and quantitative analysis, using 140 samples of 3D heads (70 male and 70 female), produced from computed tomography (CT) images. There are two parts to the method. First, skull and facial landmarks are manually marked to guide the model's registration so that dense corresponding vertices occupy the same relative position in every sample. Statistical shape spaces of the skull and face in dense corresponding vertices are constructed using PCA. Variations in these vertices, captured in every principal component (PC), are visualized to observe shape variability. The correlations of skull- and face-based PC scores are analysed, and linear regression is used to fit the craniofacial relationship. We compute the PC coefficients of a face based on this craniofacial relationship and the PC scores of a skull, and apply the coefficients to estimate a 3D face for the skull. To evaluate the accuracy of the computed craniofacial relationship, the mean and standard deviation of every vertex between the two models are computed, where these models are reconstructed using real PC scores and coefficients. Second, each PC in facial space is analysed for sex determination, for which support vector machines (SVMs) are used. We examined the correlation between PCs and sex, and explored the extent to which the choice of PCs affects the expression of sexual dimorphism. Our results suggest that skull- and face-based PCs can be used to describe the craniofacial relationship and that the accuracy of the method can be improved by using an increased number of face-based PCs. The results show that the accuracy of the sex classification is related to the choice of PCs. The highest sex classification rate is 91.43% using our method. Copyright © 2017 Elsevier Ltd. All rights reserved.

  20. Morphological analysis of Trichomycterus areolatus Valenciennes, 1846 from southern Chilean rivers using a truss-based system (Siluriformes, Trichomycteridae).

    PubMed

    Colihueque, Nelson; Corrales, Olga; Yáñez, Miguel

    2017-01-01

    Trichomycterus areolatus Valenciennes, 1846 is a small endemic catfish inhabiting the Andean river basins of Chile. In this study, the morphological variability of three T. areolatus populations, collected in two river basins from southern Chile, was assessed with multivariate analyses, including principal component analysis (PCA) and discriminant function analysis (DFA). It is hypothesized that populations must segregate morphologically from each other based on the river basin that they were sampled from, since each basin presents relatively particular hydrological characteristics. Significant morphological differences among the three populations were found with PCA (ANOSIM test, r = 0.552, p < 0.0001) and DFA (Wilks's λ = 0.036, p < 0.01). PCA accounted for a total variation of 56.16% by the first two principal components. The first Principal Component (PC1) and PC2 explained 34.72 and 21.44% of the total variation, respectively. The scatter-plot of the first two discriminant functions (DF1 on DF2) also validated the existence of three different populations. In group classification using DFA, 93.3% of the specimens were correctly-classified into their original populations. Of the total of 22 transformed truss measurements, 17 exhibited highly significant ( p < 0.01) differences among populations. The data support the existence of T. areolatus morphological variation across different rivers in southern Chile, likely reflecting the geographic isolation underlying population structure of the species.

  1. In Vitro Assessment of Nanoparticle Effects on Blood Coagulation.

    PubMed

    Potter, Timothy M; Rodriguez, Jamie C; Neun, Barry W; Ilinskaya, Anna N; Cedrone, Edward; Dobrovolskaia, Marina A

    2018-01-01

    Blood clotting is a complex process which involves both cellular and biochemical components. The key cellular players in the blood clotting process are thrombocytes or platelets. Other cells, including leukocytes and endothelial cells, contribute to clotting by expressing the so-called pro-coagulant activity (PCA) complex on their surface. The biochemical component of blood clotting is represented by the plasma coagulation cascade, which includes plasma proteins also known as coagulation factors. The coordinated interaction between platelets, leukocytes, endothelial cells, and plasma coagulation factors is necessary for maintaining hemostasis and for preventing excessive bleeding. Undesirable activation of all or some of these components may lead to pathological blood coagulation and life-threatening conditions such as consumptive coagulopathy or disseminated intravascular coagulation (DIC). In contrast, unintended inhibition of the coagulation pathways may lead to hemorrhage. Thrombogenicity is the property of a test material to induce blood coagulation by affecting one or more elements of the clotting process. Anticoagulant activity refers to the property of a test material to inhibit coagulation. The tendency to cause platelet aggregation, perturb plasma coagulation, and induce leukocyte PCA can serve as an in vitro measure of a nanomaterial's likelihood to be pro- or anticoagulant in vivo. This chapter describes three procedures for in vitro analyses of platelet aggregation, plasma coagulation time, and activation of leukocyte PCA. Platelet aggregation and plasma coagulation procedures have been described earlier. The revision here includes updated details about nanoparticle sample preparation, selection of nanoparticle concentration for the in vitro study, and updated details about assay controls. The chapter is expanded to describe a method for the leukocyte PCA analysis and case studies demonstrating the performance of these in vitro assays.

  2. Principal component analysis-based pattern analysis of dose-volume histograms and influence on rectal toxicity.

    PubMed

    Söhn, Matthias; Alber, Markus; Yan, Di

    2007-09-01

    The variability of dose-volume histogram (DVH) shapes in a patient population can be quantified using principal component analysis (PCA). We applied this to rectal DVHs of prostate cancer patients and investigated the correlation of the PCA parameters with late bleeding. PCA was applied to the rectal wall DVHs of 262 patients, who had been treated with a four-field box, conformal adaptive radiotherapy technique. The correlated changes in the DVH pattern were revealed as "eigenmodes," which were ordered by their importance to represent data set variability. Each DVH is uniquely characterized by its principal components (PCs). The correlation of the first three PCs and chronic rectal bleeding of Grade 2 or greater was investigated with uni- and multivariate logistic regression analyses. Rectal wall DVHs in four-field conformal RT can primarily be represented by the first two or three PCs, which describe approximately 94% or 96% of the DVH shape variability, respectively. The first eigenmode models the total irradiated rectal volume; thus, PC1 correlates to the mean dose. Mode 2 describes the interpatient differences of the relative rectal volume in the two- or four-field overlap region. Mode 3 reveals correlations of volumes with intermediate doses ( approximately 40-45 Gy) and volumes with doses >70 Gy; thus, PC3 is associated with the maximal dose. According to univariate logistic regression analysis, only PC2 correlated significantly with toxicity. However, multivariate logistic regression analysis with the first two or three PCs revealed an increased probability of bleeding for DVHs with more than one large PC. PCA can reveal the correlation structure of DVHs for a patient population as imposed by the treatment technique and provide information about its relationship to toxicity. It proves useful for augmenting normal tissue complication probability modeling approaches.

  3. Experimental Researches on the Durability Indicators and the Physiological Comfort of Fabrics using the Principal Component Analysis (PCA) Method

    NASA Astrophysics Data System (ADS)

    Hristian, L.; Ostafe, M. M.; Manea, L. R.; Apostol, L. L.

    2017-06-01

    The work pursued the distribution of combed wool fabrics destined to manufacturing of external articles of clothing in terms of the values of durability and physiological comfort indices, using the mathematical model of Principal Component Analysis (PCA). Principal Components Analysis (PCA) applied in this study is a descriptive method of the multivariate analysis/multi-dimensional data, and aims to reduce, under control, the number of variables (columns) of the matrix data as much as possible to two or three. Therefore, based on the information about each group/assortment of fabrics, it is desired that, instead of nine inter-correlated variables, to have only two or three new variables called components. The PCA target is to extract the smallest number of components which recover the most of the total information contained in the initial data.

  4. Incorporating biological information in sparse principal component analysis with application to genomic data.

    PubMed

    Li, Ziyi; Safo, Sandra E; Long, Qi

    2017-07-11

    Sparse principal component analysis (PCA) is a popular tool for dimensionality reduction, pattern recognition, and visualization of high dimensional data. It has been recognized that complex biological mechanisms occur through concerted relationships of multiple genes working in networks that are often represented by graphs. Recent work has shown that incorporating such biological information improves feature selection and prediction performance in regression analysis, but there has been limited work on extending this approach to PCA. In this article, we propose two new sparse PCA methods called Fused and Grouped sparse PCA that enable incorporation of prior biological information in variable selection. Our simulation studies suggest that, compared to existing sparse PCA methods, the proposed methods achieve higher sensitivity and specificity when the graph structure is correctly specified, and are fairly robust to misspecified graph structures. Application to a glioblastoma gene expression dataset identified pathways that are suggested in the literature to be related with glioblastoma. The proposed sparse PCA methods Fused and Grouped sparse PCA can effectively incorporate prior biological information in variable selection, leading to improved feature selection and more interpretable principal component loadings and potentially providing insights on molecular underpinnings of complex diseases.

  5. Item response theory and factor analysis as a mean to characterize occurrence of response shift in a longitudinal quality of life study in breast cancer patients

    PubMed Central

    2014-01-01

    Background The occurrence of response shift (RS) in longitudinal health-related quality of life (HRQoL) studies, reflecting patient adaptation to disease, has already been demonstrated. Several methods have been developed to detect the three different types of response shift (RS), i.e. recalibration RS, 2) reprioritization RS, and 3) reconceptualization RS. We investigated two complementary methods that characterize the occurrence of RS: factor analysis, comprising Principal Component Analysis (PCA) and Multiple Correspondence Analysis (MCA), and a method of Item Response Theory (IRT). Methods Breast cancer patients (n = 381) completed the EORTC QLQ-C30 and EORTC QLQ-BR23 questionnaires at baseline, immediately following surgery, and three and six months after surgery, according to the “then-test/post-test” design. Recalibration was explored using MCA and a model of IRT, called the Linear Logistic Model with Relaxed Assumptions (LLRA) using the then-test method. Principal Component Analysis (PCA) was used to explore reconceptualization and reprioritization. Results MCA highlighted the main profiles of recalibration: patients with high HRQoL level report a slightly worse HRQoL level retrospectively and vice versa. The LLRA model indicated a downward or upward recalibration for each dimension. At six months, the recalibration effect was statistically significant for 11/22 dimensions of the QLQ-C30 and BR23 according to the LLRA model (p ≤ 0.001). Regarding the QLQ-C30, PCA indicated a reprioritization of symptom scales and reconceptualization via an increased correlation between functional scales. Conclusions Our findings demonstrate the usefulness of these analyses in characterizing the occurrence of RS. MCA and IRT model had convergent results with then-test method to characterize recalibration component of RS. PCA is an indirect method in investigating the reprioritization and reconceptualization components of RS. PMID:24606836

  6. An evaluation of independent component analyses with an application to resting-state fMRI

    PubMed Central

    Matteson, David S.; Ruppert, David; Eloyan, Ani; Caffo, Brian S.

    2013-01-01

    Summary We examine differences between independent component analyses (ICAs) arising from different as-sumptions, measures of dependence, and starting points of the algorithms. ICA is a popular method with diverse applications including artifact removal in electrophysiology data, feature extraction in microarray data, and identifying brain networks in functional magnetic resonance imaging (fMRI). ICA can be viewed as a generalization of principal component analysis (PCA) that takes into account higher-order cross-correlations. Whereas the PCA solution is unique, there are many ICA methods–whose solutions may differ. Infomax, FastICA, and JADE are commonly applied to fMRI studies, with FastICA being arguably the most popular. Hastie and Tibshirani (2003) demonstrated that ProDenICA outperformed FastICA in simulations with two components. We introduce the application of ProDenICA to simulations with more components and to fMRI data. ProDenICA was more accurate in simulations, and we identified differences between biologically meaningful ICs from ProDenICA versus other methods in the fMRI analysis. ICA methods require nonconvex optimization, yet current practices do not recognize the importance of, nor adequately address sensitivity to, initial values. We found that local optima led to dramatically different estimates in both simulations and group ICA of fMRI, and we provide evidence that the global optimum from ProDenICA is the best estimate. We applied a modification of the Hungarian (Kuhn-Munkres) algorithm to match ICs from multiple estimates, thereby gaining novel insights into how brain networks vary in their sensitivity to initial values and ICA method. PMID:24350655

  7. The Northern Norway Mother-and-Child Contaminant Cohort (MISA) Study: PCA analyses of environmental contaminants in maternal sera and dietary intake in early pregnancy.

    PubMed

    Veyhe, Anna Sofía; Hofoss, Dag; Hansen, Solrunn; Thomassen, Yngvar; Sandanger, Torkjel M; Odland, Jon Øyvind; Nieboer, Evert

    2015-03-01

    Although predictors of contaminants in serum or whole blood are usually examined by chemical groups (e.g., POPs, toxic and/or essential elements; dietary sources), principal component analysis (PCA) permits consideration of both individual substances and combined variables. Our study had two primary objectives: (i) Characterize the sources and predictors of a suite of eight PCBs, four organochlorine (OC) pesticides, five essential and five toxic elements in serum and/or whole blood of pregnant women recruited as part of the Mother-and-Child Contaminant Cohort Study conducted in Northern Norway (The MISA study); and (ii) determine the influence of personal and social characteristics on both dietary and contaminant factors. Recruitment and sampling started in May 2007 and continued for the next 31 months until December 2009. Blood/serum samples were collected during the 2nd trimester (mean: 18.2 weeks, range 9.0-36.0). A validated questionnaire was administered to obtain personal information. The samples were analysed by established laboratories employing verified methods and reference standards. PCA involved Varimax rotation, and significant predictors (p≤0.05) in linear regression models were included in the multivariable linear regression analysis. When considering all the contaminants, three prominent PCA axes stood out with prominent loadings of: all POPs; arsenic, selenium and mercury; and cadmium and lead. Respectively, in the multivariate models the following were predictors: maternal age, parity and consumption of freshwater fish and land-based wild animals; marine fish; cigarette smoking, dietary PCA axes reflecting consumption of grains and cereals, and food items involving hunting. PCA of only the POPs separated them into two axes that, in terms of recently published findings, could be understood to reflect longitudinal trends and their relative contributions to summed POPs. The linear combinations of variables generated by PCA identified prominent dietary sources of OC groups and of prominent toxic elements and highlighted the importance of maternal characteristics. Copyright © 2014 Elsevier GmbH. All rights reserved.

  8. Principal component analysis of Raman spectra for TiO2 nanoparticle characterization

    NASA Astrophysics Data System (ADS)

    Ilie, Alina Georgiana; Scarisoareanu, Monica; Morjan, Ion; Dutu, Elena; Badiceanu, Maria; Mihailescu, Ion

    2017-09-01

    The Raman spectra of anatase/rutile mixed phases of Sn doped TiO2 nanoparticles and undoped TiO2 nanoparticles, synthesised by laser pyrolysis, with nanocrystallite dimensions varying from 8 to 28 nm, was simultaneously processed with a self-written software that applies Principal Component Analysis (PCA) on the measured spectrum to verify the possibility of objective auto-characterization of nanoparticles from their vibrational modes. The photo-excited process of Raman scattering is very sensible to the material characteristics, especially in the case of nanomaterials, where more properties become relevant for the vibrational behaviour. We used PCA, a statistical procedure that performs eigenvalue decomposition of descriptive data covariance, to automatically analyse the sample's measured Raman spectrum, and to interfere the correlation between nanoparticle dimensions, tin and carbon concentration, and their Principal Component values (PCs). This type of application can allow an approximation of the crystallite size, or tin concentration, only by measuring the Raman spectrum of the sample. The study of loadings of the principal components provides information of the way the vibrational modes are affected by the nanoparticle features and the spectral area relevant for the classification.

  9. Univariate and multivariate molecular spectral analyses of lipid related molecular structural components in relation to nutrient profile in feed and food mixtures

    NASA Astrophysics Data System (ADS)

    Abeysekara, Saman; Damiran, Daalkhaijav; Yu, Peiqiang

    2013-02-01

    The objectives of this study were (i) to determine lipid related molecular structures components (functional groups) in feed combination of cereal grain (barley, Hordeum vulgare) and wheat (Triticum aestivum) based dried distillers grain solubles (wheat DDGSs) from bioethanol processing at five different combination ratios using univariate and multivariate molecular spectral analyses with infrared Fourier transform molecular spectroscopy, and (ii) to correlate lipid-related molecular-functional structure spectral profile to nutrient profiles. The spectral intensity of (i) CH3 asymmetric, CH2 asymmetric, CH3 symmetric and CH2 symmetric groups, (ii) unsaturation (Cdbnd C) group, and (iii) carbonyl ester (Cdbnd O) group were determined. Spectral differences of functional groups were detected by hierarchical cluster analysis (HCA) and principal components analysis (PCA). The results showed that the combination treatments significantly inflicted modifications (P < 0.05) in nutrient profile and lipid related molecular spectral intensity (CH2 asymmetric stretching peak height, CH2 symmetric stretching peak height, ratio of CH2 to CH3 symmetric stretching peak intensity, and carbonyl peak area). Ratio of CH2 to CH3 symmetric stretching peak intensity, and carbonyl peak significantly correlated with nutrient profiles. Both PCA and HCA differentiated lipid-related spectrum. In conclusion, the changes of lipid molecular structure spectral profiles through feed combination could be detected using molecular spectroscopy. These changes were associated with nutrient profiles and functionality.

  10. Robust prediction of protein subcellular localization combining PCA and WSVMs.

    PubMed

    Tian, Jiang; Gu, Hong; Liu, Wenqi; Gao, Chiyang

    2011-08-01

    Automated prediction of protein subcellular localization is an important tool for genome annotation and drug discovery, and Support Vector Machines (SVMs) can effectively solve this problem in a supervised manner. However, the datasets obtained from real experiments are likely to contain outliers or noises, which can lead to poor generalization ability and classification accuracy. To explore this problem, we adopt strategies to lower the effect of outliers. First we design a method based on Weighted SVMs, different weights are assigned to different data points, so the training algorithm will learn the decision boundary according to the relative importance of the data points. Second we analyse the influence of Principal Component Analysis (PCA) on WSVM classification, propose a hybrid classifier combining merits of both PCA and WSVM. After performing dimension reduction operations on the datasets, kernel-based possibilistic c-means algorithm can generate more suitable weights for the training, as PCA transforms the data into a new coordinate system with largest variances affected greatly by the outliers. Experiments on benchmark datasets show promising results, which confirms the effectiveness of the proposed method in terms of prediction accuracy. Copyright © 2011 Elsevier Ltd. All rights reserved.

  11. A stock market forecasting model combining two-directional two-dimensional principal component analysis and radial basis function neural network.

    PubMed

    Guo, Zhiqiang; Wang, Huaiqing; Yang, Jie; Miller, David J

    2015-01-01

    In this paper, we propose and implement a hybrid model combining two-directional two-dimensional principal component analysis ((2D)2PCA) and a Radial Basis Function Neural Network (RBFNN) to forecast stock market behavior. First, 36 stock market technical variables are selected as the input features, and a sliding window is used to obtain the input data of the model. Next, (2D)2PCA is utilized to reduce the dimension of the data and extract its intrinsic features. Finally, an RBFNN accepts the data processed by (2D)2PCA to forecast the next day's stock price or movement. The proposed model is used on the Shanghai stock market index, and the experiments show that the model achieves a good level of fitness. The proposed model is then compared with one that uses the traditional dimension reduction method principal component analysis (PCA) and independent component analysis (ICA). The empirical results show that the proposed model outperforms the PCA-based model, as well as alternative models based on ICA and on the multilayer perceptron.

  12. A Stock Market Forecasting Model Combining Two-Directional Two-Dimensional Principal Component Analysis and Radial Basis Function Neural Network

    PubMed Central

    Guo, Zhiqiang; Wang, Huaiqing; Yang, Jie; Miller, David J.

    2015-01-01

    In this paper, we propose and implement a hybrid model combining two-directional two-dimensional principal component analysis ((2D)2PCA) and a Radial Basis Function Neural Network (RBFNN) to forecast stock market behavior. First, 36 stock market technical variables are selected as the input features, and a sliding window is used to obtain the input data of the model. Next, (2D)2PCA is utilized to reduce the dimension of the data and extract its intrinsic features. Finally, an RBFNN accepts the data processed by (2D)2PCA to forecast the next day's stock price or movement. The proposed model is used on the Shanghai stock market index, and the experiments show that the model achieves a good level of fitness. The proposed model is then compared with one that uses the traditional dimension reduction method principal component analysis (PCA) and independent component analysis (ICA). The empirical results show that the proposed model outperforms the PCA-based model, as well as alternative models based on ICA and on the multilayer perceptron. PMID:25849483

  13. Perturbational formulation of principal component analysis in molecular dynamics simulation.

    PubMed

    Koyama, Yohei M; Kobayashi, Tetsuya J; Tomoda, Shuji; Ueda, Hiroki R

    2008-10-01

    Conformational fluctuations of a molecule are important to its function since such intrinsic fluctuations enable the molecule to respond to the external environmental perturbations. For extracting large conformational fluctuations, which predict the primary conformational change by the perturbation, principal component analysis (PCA) has been used in molecular dynamics simulations. However, several versions of PCA, such as Cartesian coordinate PCA and dihedral angle PCA (dPCA), are limited to use with molecules with a single dominant state or proteins where the dihedral angle represents an important internal coordinate. Other PCAs with general applicability, such as the PCA using pairwise atomic distances, do not represent the physical meaning clearly. Therefore, a formulation that provides general applicability and clearly represents the physical meaning is yet to be developed. For developing such a formulation, we consider the conformational distribution change by the perturbation with arbitrary linearly independent perturbation functions. Within the second order approximation of the Kullback-Leibler divergence by the perturbation, the PCA can be naturally interpreted as a method for (1) decomposing a given perturbation into perturbations that independently contribute to the conformational distribution change or (2) successively finding the perturbation that induces the largest conformational distribution change. In this perturbational formulation of PCA, (i) the eigenvalue measures the Kullback-Leibler divergence from the unperturbed to perturbed distributions, (ii) the eigenvector identifies the combination of the perturbation functions, and (iii) the principal component determines the probability change induced by the perturbation. Based on this formulation, we propose a PCA using potential energy terms, and we designate it as potential energy PCA (PEPCA). The PEPCA provides both general applicability and clear physical meaning. For demonstrating its power, we apply the PEPCA to an alanine dipeptide molecule in vacuum as a minimal model of a nonsingle dominant conformational biomolecule. The first and second principal components clearly characterize two stable states and the transition state between them. Positive and negative components with larger absolute values of the first and second eigenvectors identify the electrostatic interactions, which stabilize or destabilize each stable state and the transition state. Our result therefore indicates that PCA can be applied, by carefully selecting the perturbation functions, not only to identify the molecular conformational fluctuation but also to predict the conformational distribution change by the perturbation beyond the limitation of the previous methods.

  14. Perturbational formulation of principal component analysis in molecular dynamics simulation

    NASA Astrophysics Data System (ADS)

    Koyama, Yohei M.; Kobayashi, Tetsuya J.; Tomoda, Shuji; Ueda, Hiroki R.

    2008-10-01

    Conformational fluctuations of a molecule are important to its function since such intrinsic fluctuations enable the molecule to respond to the external environmental perturbations. For extracting large conformational fluctuations, which predict the primary conformational change by the perturbation, principal component analysis (PCA) has been used in molecular dynamics simulations. However, several versions of PCA, such as Cartesian coordinate PCA and dihedral angle PCA (dPCA), are limited to use with molecules with a single dominant state or proteins where the dihedral angle represents an important internal coordinate. Other PCAs with general applicability, such as the PCA using pairwise atomic distances, do not represent the physical meaning clearly. Therefore, a formulation that provides general applicability and clearly represents the physical meaning is yet to be developed. For developing such a formulation, we consider the conformational distribution change by the perturbation with arbitrary linearly independent perturbation functions. Within the second order approximation of the Kullback-Leibler divergence by the perturbation, the PCA can be naturally interpreted as a method for (1) decomposing a given perturbation into perturbations that independently contribute to the conformational distribution change or (2) successively finding the perturbation that induces the largest conformational distribution change. In this perturbational formulation of PCA, (i) the eigenvalue measures the Kullback-Leibler divergence from the unperturbed to perturbed distributions, (ii) the eigenvector identifies the combination of the perturbation functions, and (iii) the principal component determines the probability change induced by the perturbation. Based on this formulation, we propose a PCA using potential energy terms, and we designate it as potential energy PCA (PEPCA). The PEPCA provides both general applicability and clear physical meaning. For demonstrating its power, we apply the PEPCA to an alanine dipeptide molecule in vacuum as a minimal model of a nonsingle dominant conformational biomolecule. The first and second principal components clearly characterize two stable states and the transition state between them. Positive and negative components with larger absolute values of the first and second eigenvectors identify the electrostatic interactions, which stabilize or destabilize each stable state and the transition state. Our result therefore indicates that PCA can be applied, by carefully selecting the perturbation functions, not only to identify the molecular conformational fluctuation but also to predict the conformational distribution change by the perturbation beyond the limitation of the previous methods.

  15. Analysis of antique bronze coins by Laser Induced Breakdown Spectroscopy and multivariate analysis

    NASA Astrophysics Data System (ADS)

    Bachler, M. Orlić; Bišćan, M.; Kregar, Z.; Jelovica Badovinac, I.; Dobrinić, J.; Milošević, S.

    2016-09-01

    This work presents a feasibility study of applying the Principal Component Analysis (PCA) to data obtained by Laser-Induced Breakdown Spectroscopy (LIBS) with the aim of determining correlation between different samples. The samples were antique bronze coins coated in silver (follis) dated in the Roman Empire period and were made during different rulers in different mints. While raw LIBS data revealed that in the period from the year 286 to 383 CE content of silver was constantly decreasing, the PCA showed that the samples can be somewhat grouped together based on their place of origin, which could be a useful hint when analysing unknown samples. It was also found that PCA can help in discriminating spectra corresponding to ablation from the surface and from the bulk. Furthermore, Partial Least Squares method (PLS) was used to obtain, based on a set of samples with known composition, an estimation of relative copper concentration in studied ancient coins. This analysis showed that copper concentration in surface layers ranged from 83% to 90%.

  16. Non-targeted analyses of animal plasma: betaine and choline represent the nutritional and metabolic status.

    PubMed

    Katayama, K; Sato, T; Arai, T; Amao, H; Ohta, Y; Ozawa, T; Kenyon, P R; Hickson, R E; Tazaki, H

    2013-02-01

    Simple liquid chromatography-mass spectrometry (LC-MS) was applied to non-targeted metabolic analyses to discover new metabolic markers in animal plasma. Principle component analysis (PCA) and partial least squares-discriminate analysis (PLS-DA) were used to analyse LC-MS multivariate data. PCA clearly generated two separate clusters for artificially induced diabetic mice and healthy control mice. PLS-DA of time-course changes in plasma metabolites of chicks after feeding generated three clusters (pre- and immediately after feeding, 0.5-3 h after feeding and 4 h after feeding). Two separate clusters were also generated for plasma metabolites of pregnant Angus heifers with differing live-weight change profiles (gaining or losing). The accompanying PLS-DA loading plot detailed the metabolites that contribute the most to the cluster separation. In each case, the same highly hydrophilic metabolite was strongly correlated to the group separation. The metabolite was identified as betaine by LC-MS/MS. This result indicates that betaine and its metabolic precursor, choline, may be useful biomarkers to evaluate the nutritional and metabolic status of animals. © 2011 Blackwell Verlag GmbH.

  17. Revealing the ultrafast outflow in IRAS 13224-3809 through spectral variability

    NASA Astrophysics Data System (ADS)

    Parker, M. L.; Alston, W. N.; Buisson, D. J. K.; Fabian, A. C.; Jiang, J.; Kara, E.; Lohfink, A.; Pinto, C.; Reynolds, C. S.

    2017-08-01

    We present an analysis of the long-term X-ray variability of the extreme narrow-line Seyfert 1 galaxy IRAS 13224-3809 using principal component analysis (PCA) and fractional excess variability (Fvar) spectra to identify model-independent spectral components. We identify a series of variability peaks in both the first PCA component and Fvar spectrum which correspond to the strongest predicted absorption lines from the ultrafast outflow (UFO) discovered by Parker et al. (2017). We also find higher order PCA components, which correspond to variability of the soft excess and reflection features. The subtle differences between RMS and PCA results argue that the observed flux-dependence of the absorption is due to increased ionization of the gas, rather than changes in column density or covering fraction. This result demonstrates that we can detect outflows from variability alone and that variability studies of UFOs are an extremely promising avenue for future research.

  18. Source apportionment of PAH in Hamilton Harbour suspended sediments: comparison of two factor analysis methods.

    PubMed

    Sofowote, Uwayemi M; McCarry, Brian E; Marvin, Christopher H

    2008-08-15

    A total of 26 suspended sediment samples collected over a 5-year period in Hamilton Harbour, Ontario, Canada and surrounding creeks were analyzed for a suite of polycyclic aromatic hydrocarbons and sulfur heterocycles. Hamilton Harbour sediments contain relatively high levels of polycyclic aromatic compounds and heavy metals due to emissions from industrial and mobile sources. Two receptor modeling methods using factor analyses were compared to determine the profiles and relative contributions of pollution sources to the harbor; these methods are principal component analyses (PCA) with multiple linear regression analysis (MLR) and positive matrix factorization (PMF). Both methods identified four factors and gave excellent correlation coefficients between predicted and measured levels of 25 aromatic compounds; both methods predicted similar contributions from coal tar/coal combustion sources to the harbor (19 and 26%, respectively). One PCA factor was identified as contributions from vehicular emissions (61%); PMF was able to differentiate vehicular emissions into two factors, one attributed to gasoline emissions sources (28%) and the other to diesel emissions sources (24%). Overall, PMF afforded better source identification than PCA with MLR. This work constitutes one of the few examples of the application of PMF to the source apportionment of sediments; the addition of sulfur heterocycles to the analyte list greatly aided in the source identification process.

  19. The Effect of Temperature on Pressurised Hot Water Extraction of Pharmacologically Important Metabolites as Analysed by UPLC-qTOF-MS and PCA

    PubMed Central

    Khoza, B. S.; Chimuka, L.; Mukwevho, E.; Steenkamp, P. A.; Madala, N. E.

    2014-01-01

    Metabolite extraction methods have been shown to be a critical consideration for pharmacometabolomics studies and, as such, optimization and development of new extraction methods are crucial. In the current study, an organic solvent-free method, namely, pressurised hot water extraction (PHWE), was used to extract pharmacologically important metabolites from dried Moringa oleifera leaves. Here, the temperature of the extraction solvent (pure water) was altered while keeping other factors constant using a homemade PHWE system. Samples extracted at different temperatures (50, 100, and 150°C) were assayed for antioxidant activities and the effect of the temperature on the extraction process was evaluated. The samples were further analysed by mass spectrometry to elucidate their metabolite compositions. Principal component analysis (PCA) evaluation of the UPLC-MS data showed distinctive differential metabolite patterns. Here, temperature changes during PHWE were shown to affect the levels of metabolites with known pharmacological activities, such as chlorogenic acids and flavonoids. Our overall findings suggest that, if not well optimised, the extraction temperature could compromise the “pharmacological potency” of the extracts. The use of MS in combination with PCA was furthermore shown to be an excellent approach to evaluate the quality and content of pharmacologically important extracts. PMID:25371697

  20. Androgen Receptor Expression in Epithelial and Stromal Cells of Prostatic Carcinoma and Benign Prostatic Hyperplasia.

    PubMed

    Filipovski, Vanja; Kubelka-Sabit, Katerina; Jasar, Dzengis; Janevska, Vesna

    2017-08-15

    Prostatic carcinoma (PCa) derives from prostatic epithelial cells. However stromal microenvironment, associated with malignant epithelium, also plays a role in prostatic carcinogenesis. Alterations in prostatic stromal cells contribute to the loss of growth control in epithelial cells that lead to progression of PCa. To analyse the differences between Androgen Receptor (AR) expression in both epithelial and stromal cells in PCa and the surrounding benign prostatic hyperplasia (BPH) and to compare the results with tumour grade. Samples from 70 cases of radical prostatectomy specimens were used. The expression and intensity of the signal for AR was analysed in the epithelial and stromal cells of PCa and BPH, and the data was quantified using histological score (H-score). AR showed significantly lower expression in both epithelial and stromal cells of PCa compared to BPH. In PCa a significant positive correlation of AR expression was found between stromal and epithelial cells of PCa. AR expression showed a correlation between the stromal cells of PCa and tumour grade. AR expression is reduced in epithelial and stromal cells of PCa. Expression of AR in stromal cells of PCa significantly correlates with tumour grade.

  1. Morphological analysis of Trichomycterus areolatus Valenciennes, 1846 from southern Chilean rivers using a truss-based system (Siluriformes, Trichomycteridae)

    PubMed Central

    Colihueque, Nelson; Corrales, Olga; Yáñez, Miguel

    2017-01-01

    Abstract Trichomycterus areolatus Valenciennes, 1846 is a small endemic catfish inhabiting the Andean river basins of Chile. In this study, the morphological variability of three T. areolatus populations, collected in two river basins from southern Chile, was assessed with multivariate analyses, including principal component analysis (PCA) and discriminant function analysis (DFA). It is hypothesized that populations must segregate morphologically from each other based on the river basin that they were sampled from, since each basin presents relatively particular hydrological characteristics. Significant morphological differences among the three populations were found with PCA (ANOSIM test, r = 0.552, p < 0.0001) and DFA (Wilks’s λ = 0.036, p < 0.01). PCA accounted for a total variation of 56.16% by the first two principal components. The first Principal Component (PC1) and PC2 explained 34.72 and 21.44% of the total variation, respectively. The scatter-plot of the first two discriminant functions (DF1 on DF2) also validated the existence of three different populations. In group classification using DFA, 93.3% of the specimens were correctly-classified into their original populations. Of the total of 22 transformed truss measurements, 17 exhibited highly significant (p < 0.01) differences among populations. The data support the existence of T. areolatus morphological variation across different rivers in southern Chile, likely reflecting the geographic isolation underlying population structure of the species. PMID:29134012

  2. Source Determination of Red Gel Pen Inks using Raman Spectroscopy and Attenuated Total Reflectance Fourier Transform Infrared Spectroscopy combined with Pearson's Product Moment Correlation Coefficients and Principal Component Analysis.

    PubMed

    Mohamad Asri, Muhammad Naeim; Mat Desa, Wan Nur Syuhaila; Ismail, Dzulkiflee

    2018-01-01

    The potential combination of two nondestructive techniques, that is, Raman spectroscopy (RS) and attenuated total reflectance-fourier transform infrared (ATR-FTIR) spectroscopy with Pearson's product moment correlation (PPMC) coefficient (r) and principal component analysis (PCA) to determine the actual source of red gel pen ink used to write a simulated threatening note, was examined. Eighteen (18) red gel pens purchased from Japan and Malaysia from November to December 2014 where one of the pens was used to write a simulated threatening note were analyzed using RS and ATR-FTIR spectroscopy, respectively. The spectra of all the red gel pen inks including the ink deposited on the simulated threatening note gathered from the RS and ATR-FTIR analyses were subjected to PPMC coefficient (r) calculation and principal component analysis (PCA). The coefficients r = 0.9985 and r = 0.9912 for pairwise combination of RS and ATR-FTIR spectra respectively and similarities in terms of PC1 and PC2 scores of one of the inks to the ink deposited on the simulated threatening note substantiated the feasibility of combining RS and ATR-FTIR spectroscopy with PPMC coefficient (r) and PCA for successful source determination of red gel pen inks. The development of pigment spectral library had allowed the ink deposited on the threatening note to be identified as XSL Poppy Red (CI Pigment Red 112). © 2017 American Academy of Forensic Sciences.

  3. A diffusion-matched principal component analysis (DM-PCA) based two-channel denoising procedure for high-resolution diffusion-weighted MRI

    PubMed Central

    Chang, Hing-Chiu; Bilgin, Ali; Bernstein, Adam; Trouard, Theodore P.

    2018-01-01

    Over the past several years, significant efforts have been made to improve the spatial resolution of diffusion-weighted imaging (DWI), aiming at better detecting subtle lesions and more reliably resolving white-matter fiber tracts. A major concern with high-resolution DWI is the limited signal-to-noise ratio (SNR), which may significantly offset the advantages of high spatial resolution. Although the SNR of DWI data can be improved by denoising in post-processing, existing denoising procedures may potentially reduce the anatomic resolvability of high-resolution imaging data. Additionally, non-Gaussian noise induced signal bias in low-SNR DWI data may not always be corrected with existing denoising approaches. Here we report an improved denoising procedure, termed diffusion-matched principal component analysis (DM-PCA), which comprises 1) identifying a group of (not necessarily neighboring) voxels that demonstrate very similar magnitude signal variation patterns along the diffusion dimension, 2) correcting low-frequency phase variations in complex-valued DWI data, 3) performing PCA along the diffusion dimension for real- and imaginary-components (in two separate channels) of phase-corrected DWI voxels with matched diffusion properties, 4) suppressing the noisy PCA components in real- and imaginary-components, separately, of phase-corrected DWI data, and 5) combining real- and imaginary-components of denoised DWI data. Our data show that the new two-channel (i.e., for real- and imaginary-components) DM-PCA denoising procedure performs reliably without noticeably compromising anatomic resolvability. Non-Gaussian noise induced signal bias could also be reduced with the new denoising method. The DM-PCA based denoising procedure should prove highly valuable for high-resolution DWI studies in research and clinical uses. PMID:29694400

  4. A stable systemic risk ranking in China's banking sector: Based on principal component analysis

    NASA Astrophysics Data System (ADS)

    Fang, Libing; Xiao, Binqing; Yu, Honghai; You, Qixing

    2018-02-01

    In this paper, we compare five popular systemic risk rankings, and apply principal component analysis (PCA) model to provide a stable systemic risk ranking for the Chinese banking sector. Our empirical results indicate that five methods suggest vastly different systemic risk rankings for the same bank, while the combined systemic risk measure based on PCA provides a reliable ranking. Furthermore, according to factor loadings of the first component, PCA combined ranking is mainly based on fundamentals instead of market price data. We clearly find that price-based rankings are not as practical a method as fundamentals-based ones. This PCA combined ranking directly shows systemic risk contributions of each bank for banking supervision purpose and reminds banks to prevent and cope with the financial crisis in advance.

  5. Weak acid extractable metals in Bramble Bay, Queensland, Australia: temporal behaviour, enrichment and source apportionment.

    PubMed

    Brady, James P; Ayoko, Godwin A; Martens, Wayde N; Goonetilleke, Ashantha

    2015-02-15

    Sediment samples were taken from six sampling sites in Bramble Bay, Queensland, Australia between February and November in 2012. They were analysed for a range of heavy metals including Al, Fe, Mn, Ti, Ce, Th, U, V, Cr, Co, Ni, Cu, Zn, As, Cd, Sb, Te, Hg, Tl and Pb. Fraction analysis, Enrichment Factors and Principal Component Analysis-Absolute Principal Component Scores (PCA-APCS) were carried out in order to assess metal pollution, potential bioavailability and source apportionment. Cr and Ni exceeded the Australian Interim Sediment Quality Guidelines at some sampling sites, while Hg was found to be the most enriched metal. Fraction analysis identified increased weak acid soluble Hg and Cd during the sampling period. Source apportionment via PCA-APCS found four sources of metals pollution, namely, marine sediments, shipping, antifouling coatings and a mixed source. These sources need to be considered in any metal pollution control measure within Bramble Bay. Copyright © 2014 Elsevier Ltd. All rights reserved.

  6. PCA based clustering for brain tumor segmentation of T1w MRI images.

    PubMed

    Kaya, Irem Ersöz; Pehlivanlı, Ayça Çakmak; Sekizkardeş, Emine Gezmez; Ibrikci, Turgay

    2017-03-01

    Medical images are huge collections of information that are difficult to store and process consuming extensive computing time. Therefore, the reduction techniques are commonly used as a data pre-processing step to make the image data less complex so that a high-dimensional data can be identified by an appropriate low-dimensional representation. PCA is one of the most popular multivariate methods for data reduction. This paper is focused on T1-weighted MRI images clustering for brain tumor segmentation with dimension reduction by different common Principle Component Analysis (PCA) algorithms. Our primary aim is to present a comparison between different variations of PCA algorithms on MRIs for two cluster methods. Five most common PCA algorithms; namely the conventional PCA, Probabilistic Principal Component Analysis (PPCA), Expectation Maximization Based Principal Component Analysis (EM-PCA), Generalize Hebbian Algorithm (GHA), and Adaptive Principal Component Extraction (APEX) were applied to reduce dimensionality in advance of two clustering algorithms, K-Means and Fuzzy C-Means. In the study, the T1-weighted MRI images of the human brain with brain tumor were used for clustering. In addition to the original size of 512 lines and 512 pixels per line, three more different sizes, 256 × 256, 128 × 128 and 64 × 64, were included in the study to examine their effect on the methods. The obtained results were compared in terms of both the reconstruction errors and the Euclidean distance errors among the clustered images containing the same number of principle components. According to the findings, the PPCA obtained the best results among all others. Furthermore, the EM-PCA and the PPCA assisted K-Means algorithm to accomplish the best clustering performance in the majority as well as achieving significant results with both clustering algorithms for all size of T1w MRI images. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  7. Temporal Processing of Dynamic Positron Emission Tomography via Principal Component Analysis in the Sinogram Domain

    NASA Astrophysics Data System (ADS)

    Chen, Zhe; Parker, B. J.; Feng, D. D.; Fulton, R.

    2004-10-01

    In this paper, we compare various temporal analysis schemes applied to dynamic PET for improved quantification, image quality and temporal compression purposes. We compare an optimal sampling schedule (OSS) design, principal component analysis (PCA) applied in the image domain, and principal component analysis applied in the sinogram domain; for region-of-interest quantification, sinogram-domain PCA is combined with the Huesman algorithm to quantify from the sinograms directly without requiring reconstruction of all PCA channels. Using a simulated phantom FDG brain study and three clinical studies, we evaluate the fidelity of the compressed data for estimation of local cerebral metabolic rate of glucose by a four-compartment model. Our results show that using a noise-normalized PCA in the sinogram domain gives similar compression ratio and quantitative accuracy to OSS, but with substantially better precision. These results indicate that sinogram-domain PCA for dynamic PET can be a useful preprocessing stage for PET compression and quantification applications.

  8. Applying robust variant of Principal Component Analysis as a damage detector in the presence of outliers

    NASA Astrophysics Data System (ADS)

    Gharibnezhad, Fahit; Mujica, Luis E.; Rodellar, José

    2015-01-01

    Using Principal Component Analysis (PCA) for Structural Health Monitoring (SHM) has received considerable attention over the past few years. PCA has been used not only as a direct method to identify, classify and localize damages but also as a significant primary step for other methods. Despite several positive specifications that PCA conveys, it is very sensitive to outliers. Outliers are anomalous observations that can affect the variance and the covariance as vital parts of PCA method. Therefore, the results based on PCA in the presence of outliers are not fully satisfactory. As a main contribution, this work suggests the use of robust variant of PCA not sensitive to outliers, as an effective way to deal with this problem in SHM field. In addition, the robust PCA is compared with the classical PCA in the sense of detecting probable damages. The comparison between the results shows that robust PCA can distinguish the damages much better than using classical one, and even in many cases allows the detection where classic PCA is not able to discern between damaged and non-damaged structures. Moreover, different types of robust PCA are compared with each other as well as with classical counterpart in the term of damage detection. All the results are obtained through experiments with an aircraft turbine blade using piezoelectric transducers as sensors and actuators and adding simulated damages.

  9. Once upon Multivariate Analyses: When They Tell Several Stories about Biological Evolution.

    PubMed

    Renaud, Sabrina; Dufour, Anne-Béatrice; Hardouin, Emilie A; Ledevin, Ronan; Auffray, Jean-Christophe

    2015-01-01

    Geometric morphometrics aims to characterize of the geometry of complex traits. It is therefore by essence multivariate. The most popular methods to investigate patterns of differentiation in this context are (1) the Principal Component Analysis (PCA), which is an eigenvalue decomposition of the total variance-covariance matrix among all specimens; (2) the Canonical Variate Analysis (CVA, a.k.a. linear discriminant analysis (LDA) for more than two groups), which aims at separating the groups by maximizing the between-group to within-group variance ratio; (3) the between-group PCA (bgPCA) which investigates patterns of between-group variation, without standardizing by the within-group variance. Standardizing within-group variance, as performed in the CVA, distorts the relationships among groups, an effect that is particularly strong if the variance is similarly oriented in a comparable way in all groups. Such shared direction of main morphological variance may occur and have a biological meaning, for instance corresponding to the most frequent standing genetic variation in a population. Here we undertake a case study of the evolution of house mouse molar shape across various islands, based on the real dataset and simulations. We investigated how patterns of main variance influence the depiction of among-group differentiation according to the interpretation of the PCA, bgPCA and CVA. Without arguing about a method performing 'better' than another, it rather emerges that working on the total or between-group variance (PCA and bgPCA) will tend to put the focus on the role of direction of main variance as line of least resistance to evolution. Standardizing by the within-group variance (CVA), by dampening the expression of this line of least resistance, has the potential to reveal other relevant patterns of differentiation that may otherwise be blurred.

  10. Multiple fingerprinting analyses in quality control of Cassiae Semen polysaccharides.

    PubMed

    Cheng, Jing; He, Siyu; Wan, Qiang; Jing, Pu

    2018-03-01

    Quality control issue overshadows potential health benefits of Cassiae Semen due to the analytic limitations. In this study, multiple-fingerprint analysis integrated with several chemometrics was performed to assess the polysaccharide quality of Cassiae Semen harvested from different locations. FT-IR, HPLC, and GC fingerprints of polysaccharide extracts from the authentic source were established as standard profiles, applying to assess the quality of foreign sources. Analyses of FT-IR fingerprints of polysaccharide extracts using either Pearson correlation analysis or principal component analysis (PCA), or HPLC fingerprints of partially hydrolyzed polysaccharides with PCA, distinguished the foreign sources from the authentic source. However, HPLC or GC fingerprints of completely hydrolyzed polysaccharides couldn't identify all foreign sources and the methodology using GC is quite limited in determining the monosaccharide composition. This indicates that FT-IR/HPLC fingerprints of non/partially-hydrolyzed polysaccharides, respectively, accompanied by multiple chemometrics methods, might be potentially applied in detecting and differentiating sources of Cassiae Semen. Copyright © 2018 Elsevier B.V. All rights reserved.

  11. Apple variety and maturity profiling of base ciders using UV spectroscopy.

    PubMed

    Girschik, Lachlan; Jones, Joanna E; Kerslake, Fiona L; Robertson, Mark; Dambergs, Robert G; Swarts, Nigel D

    2017-08-01

    Varietal base ciders were produced from three varieties of dessert apples ('Pink Lady®', 'Royal Gala' and 'Red Delicious') at pre-commercial, commercial and post-commercial harvest timings. Rapid analytical methods were used to categorise the base ciders, and data analysed using principal component analysis (PCA). The titratable acidity of apple must was significantly higher for the pre-commercial harvest fruit for both the 'Royal Gala' and 'Red Delicious' varieties. The base cider phenolic content was highest in the pre-commercial harvest fruit for all varieties. 'Red Delicious' had the highest total phenolics as determined by spectral analysis and supported by the classification provided by the PCA analysis. The spectral fingerprints of the ciders showed two main peaks at approximately 280nm and 320nm indicating phenolic concentrations. Studies analysing characteristics of dessert apple varieties with relevance for cider production will allow for informed decision making for both apple producers and cider makers. Copyright © 2017 Elsevier Ltd. All rights reserved.

  12. Optimized principal component analysis on coronagraphic images of the fomalhaut system

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Meshkat, Tiffany; Kenworthy, Matthew A.; Quanz, Sascha P.

    We present the results of a study to optimize the principal component analysis (PCA) algorithm for planet detection, a new algorithm complementing angular differential imaging and locally optimized combination of images (LOCI) for increasing the contrast achievable next to a bright star. The stellar point spread function (PSF) is constructed by removing linear combinations of principal components, allowing the flux from an extrasolar planet to shine through. The number of principal components used determines how well the stellar PSF is globally modeled. Using more principal components may decrease the number of speckles in the final image, but also increases themore » background noise. We apply PCA to Fomalhaut Very Large Telescope NaCo images acquired at 4.05 μm with an apodized phase plate. We do not detect any companions, with a model dependent upper mass limit of 13-18 M {sub Jup} from 4-10 AU. PCA achieves greater sensitivity than the LOCI algorithm for the Fomalhaut coronagraphic data by up to 1 mag. We make several adaptations to the PCA code and determine which of these prove the most effective at maximizing the signal-to-noise from a planet very close to its parent star. We demonstrate that optimizing the number of principal components used in PCA proves most effective for pulling out a planet signal.« less

  13. Structured Sparse Principal Components Analysis With the TV-Elastic Net Penalty.

    PubMed

    de Pierrefeu, Amicie; Lofstedt, Tommy; Hadj-Selem, Fouad; Dubois, Mathieu; Jardri, Renaud; Fovet, Thomas; Ciuciu, Philippe; Frouin, Vincent; Duchesnay, Edouard

    2018-02-01

    Principal component analysis (PCA) is an exploratory tool widely used in data analysis to uncover the dominant patterns of variability within a population. Despite its ability to represent a data set in a low-dimensional space, PCA's interpretability remains limited. Indeed, the components produced by PCA are often noisy or exhibit no visually meaningful patterns. Furthermore, the fact that the components are usually non-sparse may also impede interpretation, unless arbitrary thresholding is applied. However, in neuroimaging, it is essential to uncover clinically interpretable phenotypic markers that would account for the main variability in the brain images of a population. Recently, some alternatives to the standard PCA approach, such as sparse PCA (SPCA), have been proposed, their aim being to limit the density of the components. Nonetheless, sparsity alone does not entirely solve the interpretability problem in neuroimaging, since it may yield scattered and unstable components. We hypothesized that the incorporation of prior information regarding the structure of the data may lead to improved relevance and interpretability of brain patterns. We therefore present a simple extension of the popular PCA framework that adds structured sparsity penalties on the loading vectors in order to identify the few stable regions in the brain images that capture most of the variability. Such structured sparsity can be obtained by combining, e.g., and total variation (TV) penalties, where the TV regularization encodes information on the underlying structure of the data. This paper presents the structured SPCA (denoted SPCA-TV) optimization framework and its resolution. We demonstrate SPCA-TV's effectiveness and versatility on three different data sets. It can be applied to any kind of structured data, such as, e.g., -dimensional array images or meshes of cortical surfaces. The gains of SPCA-TV over unstructured approaches (such as SPCA and ElasticNet PCA) or structured approach (such as GraphNet PCA) are significant, since SPCA-TV reveals the variability within a data set in the form of intelligible brain patterns that are easier to interpret and more stable across different samples.

  14. The Influence Function of Principal Component Analysis by Self-Organizing Rule.

    PubMed

    Higuchi; Eguchi

    1998-07-28

    This article is concerned with a neural network approach to principal component analysis (PCA). An algorithm for PCA by the self-organizing rule has been proposed and its robustness observed through the simulation study by Xu and Yuille (1995). In this article, the robustness of the algorithm against outliers is investigated by using the theory of influence function. The influence function of the principal component vector is given in an explicit form. Through this expression, the method is shown to be robust against any directions orthogonal to the principal component vector. In addition, a statistic generated by the self-organizing rule is proposed to assess the influence of data in PCA.

  15. Identification and classification of upper limb motions using PCA.

    PubMed

    Veer, Karan; Vig, Renu

    2018-03-28

    This paper describes the utility of principal component analysis (PCA) in classifying upper limb signals. PCA is a powerful tool for analyzing data of high dimension. Here, two different input strategies were explored. The first method uses upper arm dual-position-based myoelectric signal acquisition and the other solely uses PCA for classifying surface electromyogram (SEMG) signals. SEMG data from the biceps and the triceps brachii muscles and four independent muscle activities of the upper arm were measured in seven subjects (total dataset=56). The datasets used for the analysis are rotated by class-specific principal component matrices to decorrelate the measured data prior to feature extraction.

  16. Multivariate Statistical Analysis of MSL APXS Bulk Geochemical Data

    NASA Astrophysics Data System (ADS)

    Hamilton, V. E.; Edwards, C. S.; Thompson, L. M.; Schmidt, M. E.

    2014-12-01

    We apply cluster and factor analyses to bulk chemical data of 130 soil and rock samples measured by the Alpha Particle X-ray Spectrometer (APXS) on the Mars Science Laboratory (MSL) rover Curiosity through sol 650. Multivariate approaches such as principal components analysis (PCA), cluster analysis, and factor analysis compliment more traditional approaches (e.g., Harker diagrams), with the advantage of simultaneously examining the relationships between multiple variables for large numbers of samples. Principal components analysis has been applied with success to APXS, Pancam, and Mössbauer data from the Mars Exploration Rovers. Factor analysis and cluster analysis have been applied with success to thermal infrared (TIR) spectral data of Mars. Cluster analyses group the input data by similarity, where there are a number of different methods for defining similarity (hierarchical, density, distribution, etc.). For example, without any assumptions about the chemical contributions of surface dust, preliminary hierarchical and K-means cluster analyses clearly distinguish the physically adjacent rock targets Windjana and Stephen as being distinctly different than lithologies observed prior to Curiosity's arrival at The Kimberley. In addition, they are separated from each other, consistent with chemical trends observed in variation diagrams but without requiring assumptions about chemical relationships. We will discuss the variation in cluster analysis results as a function of clustering method and pre-processing (e.g., log transformation, correction for dust cover) and implications for interpreting chemical data. Factor analysis shares some similarities with PCA, and examines the variability among observed components of a dataset so as to reveal variations attributable to unobserved components. Factor analysis has been used to extract the TIR spectra of components that are typically observed in mixtures and only rarely in isolation; there is the potential for similar results with data from APXS. These techniques offer new ways to understand the chemical relationships between the materials interrogated by Curiosity, and potentially their relation to materials observed by APXS instruments on other landed missions.

  17. Biochemical and nutritional components of selected honey samples.

    PubMed

    Chua, Lee Suan; Adnan, Nur Ardawati

    2014-01-01

    The purpose of this study was to investigate the relationship of biochemical (enzymes) and nutritional components in the selected honey samples from Malaysia. The relationship is important to estimate the quality of honey based on the concentration of these nutritious components. Such a study is limited for honey samples from tropical countries with heavy rainfall throughout the year. A number of six honey samples that commonly consumed by local people were collected for the study. Both the biochemical and nutritional components were analysed by using standard methods from Association of Official Analytical Chemists (AOAC). Individual monosaccharides, disaccharides and 17 amino acids in honey were determined by using liquid chromatographic method. The results showed that the peroxide activity was positively correlated with moisture content (r = 0.8264), but negatively correlated with carbohydrate content (r = 0.7755) in honey. The chromatographic sugar and free amino acid profiles showed that the honey samples could be clustered based on the type and maturity of honey. Proline explained for 64.9% of the total variance in principle component analysis (PCA). The correlation between honey components and honey quality has been established for the selected honey samples based on their biochemical and nutritional concentrations. PCA results revealed that the ratio of sucrose to maltose could be used to measure honey maturity, whereas proline was the marker compound used to distinguish honey either as floral or honeydew.

  18. Complexity of free energy landscapes of peptides revealed by nonlinear principal component analysis.

    PubMed

    Nguyen, Phuong H

    2006-12-01

    Employing the recently developed hierarchical nonlinear principal component analysis (NLPCA) method of Saegusa et al. (Neurocomputing 2004;61:57-70 and IEICE Trans Inf Syst 2005;E88-D:2242-2248), the complexities of the free energy landscapes of several peptides, including triglycine, hexaalanine, and the C-terminal beta-hairpin of protein G, were studied. First, the performance of this NLPCA method was compared with the standard linear principal component analysis (PCA). In particular, we compared two methods according to (1) the ability of the dimensionality reduction and (2) the efficient representation of peptide conformations in low-dimensional spaces spanned by the first few principal components. The study revealed that NLPCA reduces the dimensionality of the considered systems much better, than did PCA. For example, in order to get the similar error, which is due to representation of the original data of beta-hairpin in low dimensional space, one needs 4 and 21 principal components of NLPCA and PCA, respectively. Second, by representing the free energy landscapes of the considered systems as a function of the first two principal components obtained from PCA, we obtained the relatively well-structured free energy landscapes. In contrast, the free energy landscapes of NLPCA are much more complicated, exhibiting many states which are hidden in the PCA maps, especially in the unfolded regions. Furthermore, the study also showed that many states in the PCA maps are mixed up by several peptide conformations, while those of the NLPCA maps are more pure. This finding suggests that the NLPCA should be used to capture the essential features of the systems. (c) 2006 Wiley-Liss, Inc.

  19. Causes of Death in Men With Prevalent Diabetes and Newly Diagnosed High- Versus Favorable-Risk Prostate Cancer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    D'Amico, Anthony V., E-mail: adamico@partners.or; Braccioforte, Michelle H.; Moran, Brian J.

    2010-08-01

    Purpose: To determine whether prevalent diabetes mellitus (pDM) affects the presentation, extent of radiotherapy, or prostate cancer (PCa)-specific mortality (PCSM) and whether PCa aggressiveness affects the risk of non-PCSM, DM-related mortality, and all-cause mortality in men with pDM. Methods: Between October 1997 and July 2907, 5,279 men treated at the Chicago Prostate Cancer Center with radiotherapy for PCa were included in the study. Logistic and competing risk regression analyses were performed to assess whether pDM was associated with high-grade PCa, less aggressive radiotherapy, and an increased risk of PCSM. Competing risks and Cox regression analyses were performed to assess whethermore » PCa aggressiveness described by risk group in men with pDM was associated with the risk of non-PCSM, DM-related mortality, and all-cause mortality. Analyses were adjusted for predictors of high-grade PCa and factors that could affect treatment extent and mortality. Results: Men with pDM were more likely (adjusted hazard ratio [AHR], 1.9; 95% confidence interval [CI], 1.3-2.7; p = .002) to present with high-grade PCa but were not treated less aggressively (p = .33) and did not have an increased risk of PCSM (p = .58) compared to men without pDM. Among the men with pDM, high-risk PCa was associated with a greater risk of non-PCSM (AHR, 2.2; 95% CI, 1.1-4.5; p = .035), DM-related mortality (AHR, 5.2; 95% CI, 2.0-14.0; p = .001), and all-cause mortality (AHR, 2.4; 95% CI, 1.2-4.7; p = .01) compared to favorable-risk PCa. Conclusion: Aggressive management of pDM is warranted in men with high-risk PCa.« less

  20. Assessing signal-to-noise in quantitative proteomics: multivariate statistical analysis in DIGE experiments.

    PubMed

    Friedman, David B

    2012-01-01

    All quantitative proteomics experiments measure variation between samples. When performing large-scale experiments that involve multiple conditions or treatments, the experimental design should include the appropriate number of individual biological replicates from each condition to enable the distinction between a relevant biological signal from technical noise. Multivariate statistical analyses, such as principal component analysis (PCA), provide a global perspective on experimental variation, thereby enabling the assessment of whether the variation describes the expected biological signal or the unanticipated technical/biological noise inherent in the system. Examples will be shown from high-resolution multivariable DIGE experiments where PCA was instrumental in demonstrating biologically significant variation as well as sample outliers, fouled samples, and overriding technical variation that would not be readily observed using standard univariate tests.

  1. Articulation handicap index: an instrument for quantifying psychosocial consequences of impaired articulation.

    PubMed

    Keilmann, Annerose; Konerding, Uwe; Oberherr, Constantin; Nawka, Tadeus

    2016-12-01

    Structural, neurological and muscular diseases can lead to impairments of articulation. These impairments can severely impact social life. To judge health status comprehensively, this impact must be adequately quantified. For this purpose, the articulation handicap index (AHI) has been developed. Psychometric analyses referring to this index are presented here. The AHI was completed by 113 patients who had undergone treatment of tumours of the head or neck. The patients also gave a general self-assessment of their impairments due to articulation problems. Furthermore, tumour size, tumour location and kind of therapy were recorded. Missing data were analysed and replaced by multiple imputation. Internal structure was investigated using principal component analysis (PCA); reliability using Cronbach's alpha. Validity was investigated by analysing the relationship between AHI and general self-assessment of impairments. Moreover, the relationships with tumour size, tumour location and kind of therapy were analysed. Only 0.12 % of the answers to the AHI were missing. The Scree test performed with the PCA results suggested one-dimensionality with the first component explaining 49.6 % of the item variance. Cronbach's alpha was 0.96. Kendall's tau between the AHI sum score and the general self-assessment was 0.69. The intervals of AHI sum scores for the self-assessment categories were determined with 0-13 for no, 14-44 for mild, 46-76 for moderate, and 77-120 for severe impairment. The AHI sum score did not systematically relate to tumour size, tumour location or kind of therapy. The results are evidence for high acceptance, reliability and validity.

  2. Discrimination of Geographical Origin of Asian Garlic Using Isotopic and Chemical Datasets under Stepwise Principal Component Analysis.

    PubMed

    Liu, Tsang-Sen; Lin, Jhen-Nan; Peng, Tsung-Ren

    2018-01-16

    Isotopic compositions of δ 2 H, δ 18 O, δ 13 C, and δ 15 N and concentrations of 22 trace elements from garlic samples were analyzed and processed with stepwise principal component analysis (PCA) to discriminate garlic's country of origin among Asian regions including South Korea, Vietnam, Taiwan, and China. Results indicate that there is no single trace-element concentration or isotopic composition that can accomplish the study's purpose and the stepwise PCA approach proposed does allow for discrimination between countries on a regional basis. Sequentially, Step-1 PCA distinguishes garlic's country of origin among Taiwanese, South Korean, and Vietnamese samples; Step-2 PCA discriminates Chinese garlic from South Korean garlic; and Step-3 and Step-4 PCA, Chinese garlic from Vietnamese garlic. In model tests, countries of origin of all audit samples were correctly discriminated by stepwise PCA. Consequently, this study demonstrates that stepwise PCA as applied is a simple and effective approach to discriminating country of origin among Asian garlics. © 2018 American Academy of Forensic Sciences.

  3. From measurements to metrics: PCA-based indicators of cyber anomaly

    NASA Astrophysics Data System (ADS)

    Ahmed, Farid; Johnson, Tommy; Tsui, Sonia

    2012-06-01

    We present a framework of the application of Principal Component Analysis (PCA) to automatically obtain meaningful metrics from intrusion detection measurements. In particular, we report the progress made in applying PCA to analyze the behavioral measurements of malware and provide some preliminary results in selecting dominant attributes from an arbitrary number of malware attributes. The results will be useful in formulating an optimal detection threshold in the principal component space, which can both validate and augment existing malware classifiers.

  4. Selected questions on biomechanical exposures for surveillance of upper-limb work-related musculoskeletal disorders

    PubMed Central

    Descatha, Alexis; Roquelaure, Yves; Evanoff, Bradley; Niedhammer, Isabelle; Chastang, Jean François; Mariot, Camille; Ha, Catherine; Imbernon, Ellen; Goldberg, Marcel; Leclerc, Annette

    2007-01-01

    Objective Questionnaires for assessment of biomechanical exposure are frequently used in surveillance programs, though few studies have evaluated which key questions are needed. We sought to reduce the number of variables on a surveillance questionnaire by identifying which variables best summarized biomechanical exposure in a survey of the French working population. Methods We used data from the 2002–2003 French experimental network of Upper-limb work-related musculoskeletal disorders (UWMSD), performed on 2685 subjects in which 37 variables assessing biomechanical exposures were available (divided into four ordinal categories, according to the task frequency or duration). Principal Component Analysis (PCA) with orthogonal rotation was performed on these variables. Variables closely associated with factors issued from PCA were retained, except those highly correlated to another variable (rho>0.70). In order to study the relevance of the final list of variables, correlations between a score based on retained variables (PCA score) and the exposure score suggested by the SALTSA group were calculated. The associations between the PCA score and the prevalence of UWMSD were also studied. In a final step, we added back to the list a few variables not retained by PCA, because of their established recognition as risk factors. Results According to the results of the PCA, seven interpretable factors were identified: posture exposures, repetitiveness, handling of heavy loads, distal biomechanical exposures, computer use, forklift operator specific task, and recovery time. Twenty variables strongly correlated with the factors obtained from PCA were retained. The PCA score was strongly correlated both with the SALTSA score and with UWMSD prevalence (p<0.0001). In the final step, six variables were reintegrated. Conclusion Twenty-six variables out of 37 were efficiently selected according to their ability to summarize major biomechanical constraints in a working population, with an approach combining statistical analyses and existing knowledge. PMID:17476519

  5. Principal components analysis in clinical studies.

    PubMed

    Zhang, Zhongheng; Castelló, Adela

    2017-09-01

    In multivariate analysis, independent variables are usually correlated to each other which can introduce multicollinearity in the regression models. One approach to solve this problem is to apply principal components analysis (PCA) over these variables. This method uses orthogonal transformation to represent sets of potentially correlated variables with principal components (PC) that are linearly uncorrelated. PCs are ordered so that the first PC has the largest possible variance and only some components are selected to represent the correlated variables. As a result, the dimension of the variable space is reduced. This tutorial illustrates how to perform PCA in R environment, the example is a simulated dataset in which two PCs are responsible for the majority of the variance in the data. Furthermore, the visualization of PCA is highlighted.

  6. Univariate and multivariate molecular spectral analyses of lipid related molecular structural components in relation to nutrient profile in feed and food mixtures.

    PubMed

    Abeysekara, Saman; Damiran, Daalkhaijav; Yu, Peiqiang

    2013-02-01

    The objectives of this study were (i) to determine lipid related molecular structures components (functional groups) in feed combination of cereal grain (barley, Hordeum vulgare) and wheat (Triticum aestivum) based dried distillers grain solubles (wheat DDGSs) from bioethanol processing at five different combination ratios using univariate and multivariate molecular spectral analyses with infrared Fourier transform molecular spectroscopy, and (ii) to correlate lipid-related molecular-functional structure spectral profile to nutrient profiles. The spectral intensity of (i) CH(3) asymmetric, CH(2) asymmetric, CH(3) symmetric and CH(2) symmetric groups, (ii) unsaturation (CC) group, and (iii) carbonyl ester (CO) group were determined. Spectral differences of functional groups were detected by hierarchical cluster analysis (HCA) and principal components analysis (PCA). The results showed that the combination treatments significantly inflicted modifications (P<0.05) in nutrient profile and lipid related molecular spectral intensity (CH(2) asymmetric stretching peak height, CH(2) symmetric stretching peak height, ratio of CH(2) to CH(3) symmetric stretching peak intensity, and carbonyl peak area). Ratio of CH(2) to CH(3) symmetric stretching peak intensity, and carbonyl peak significantly correlated with nutrient profiles. Both PCA and HCA differentiated lipid-related spectrum. In conclusion, the changes of lipid molecular structure spectral profiles through feed combination could be detected using molecular spectroscopy. These changes were associated with nutrient profiles and functionality. Copyright © 2012 Elsevier B.V. All rights reserved.

  7. Phenotypic Characterization and Multivariate Analysis to Explain Body Conformation in Lesser Known Buffalo (Bubalus bubalis) from North India

    PubMed Central

    Vohra, V.; Niranjan, S. K.; Mishra, A. K.; Jamuna, V.; Chopra, A.; Sharma, Neelesh; Jeong, Dong Kee

    2015-01-01

    Phenotypic characterization and body biometric in 13 traits (height at withers, body length, chest girth, paunch girth, ear length, tail length, length of tail up to switch, face length, face width, horn length, circumference of horn at base, distances between pin bone and hip bone) were recorded in 233 adult Gojri buffaloes from Punjab and Himachal Pradesh states of India. Traits were analysed by using varimax rotated principal component analysis (PCA) with Kaiser Normalization to explain body conformation. PCA revealed four components which explained about 70.9% of the total variation. First component described the general body conformation and explained 31.5% of total variation. It was represented by significant positive high loading of height at wither, body length, heart girth, face length and face width. The communality ranged from 0.83 (hip bone distance) to 0.45 (horn length) and unique factors ranged from 0.16 to 0.55 for all these 13 different biometric traits. Present study suggests that first principal component can be used in the evaluation and comparison of body conformation in buffaloes and thus provides an opportunity to distinguish between early and late maturing to adult, based on a small group of biometric traits to explain body conformation in adult buffaloes. PMID:25656215

  8. Dysfunctional error-related processing in incarcerated youth with elevated psychopathic traits

    PubMed Central

    Maurer, J. Michael; Steele, Vaughn R.; Cope, Lora M.; Vincent, Gina M.; Stephen, Julia M.; Calhoun, Vince D.; Kiehl, Kent A.

    2016-01-01

    Adult psychopathic offenders show an increased propensity towards violence, impulsivity, and recidivism. A subsample of youth with elevated psychopathic traits represent a particularly severe subgroup characterized by extreme behavioral problems and comparable neurocognitive deficits as their adult counterparts, including perseveration deficits. Here, we investigate response-locked event-related potential (ERP) components (the error-related negativity [ERN/Ne] related to early error-monitoring processing and the error-related positivity [Pe] involved in later error-related processing) in a sample of incarcerated juvenile male offenders (n = 100) who performed a response inhibition Go/NoGo task. Psychopathic traits were assessed using the Hare Psychopathy Checklist: Youth Version (PCL:YV). The ERN/Ne and Pe were analyzed with classic windowed ERP components and principal component analysis (PCA). Using linear regression analyses, PCL:YV scores were unrelated to the ERN/Ne, but were negatively related to Pe mean amplitude. Specifically, the PCL:YV Facet 4 subscale reflecting antisocial traits emerged as a significant predictor of reduced amplitude of a subcomponent underlying the Pe identified with PCA. This is the first evidence to suggest a negative relationship between adolescent psychopathy scores and Pe mean amplitude. PMID:26930170

  9. Silibinin inhibits fibronectin induced motility, invasiveness and survival in human prostate carcinoma PC3 cells via targeting integrin signaling

    PubMed Central

    Deep, Gagan; Kumar, Rahul; Jain, Anil K; Agarwal, Chapla; Agarwal, Rajesh

    2014-01-01

    Prostate cancer (PCA) is the 2nd leading cause of cancer-related deaths among men in the United States. Preventing or inhibiting metastasis-related events through non-toxic agents could be a useful approach for lowering high mortality among PCA patients. We have earlier reported that natural flavonoid silibinin possesses strong anti-metastatic efficacy against PCA however, mechanism/s of its action still remains largely unknown. One of the major events during metastasis is the replacement of cell-cell interaction with integrins-based cell-matrix interaction that controls motility, invasiveness and survival of cancer cells. Accordingly, here we examined silibinin effect on advanced human PCA PC3 cells' interaction with extracellular matrix component fibronectin. Silibinin (50-200 μM) treatment significantly decreased the fibronectin (5 μg/ml)-induced motile morphology via targeting actin cytoskeleton organization in PC3 cells. Silibinin also decreased the fibronectin-induced cell proliferation and motility but significantly increased cell death in PC3 cells. Silibinin also inhibited the PC3 cells invasiveness in Transwell invasion assays with fibronectin or cancer associated fibroblasts (CAFs) serving as chemoattractant. Importantly, PC3-luc cells cultured on fibronectin showed rapid dissemination and localized in lungs following tail vein injection in athymic male nude mice; however, in silibinin-treated PC3-luc cells, dissemination and lung localization was largely compromised. Molecular analyses revealed that silibinin treatment modulated the fibronectin-induced expression of integrins (α5, αV, β1 and β3), actin-remodeling (FAK, Src, GTPases, ARP2 and cortactin), apoptosis (cPARP and cleaved caspase 3), EMT (E-cadherin and β-catenin), and cell survival (survivin and Akt) related signaling molecules in PC3 cells. Furthermore, PC3-xenograft tissue analyses confirmed the inhibitory effect of silibinin on fibronectin and integrins expression. Together, these results showed that silibinin targets PCA cells' interaction with fibronectin and inhibits their motility, invasiveness and survival; thus further supporting silibinin use in PCA intervention including its metastatic progression. PMID:25285031

  10. Silibinin inhibits fibronectin induced motility, invasiveness and survival in human prostate carcinoma PC3 cells via targeting integrin signaling.

    PubMed

    Deep, Gagan; Kumar, Rahul; Jain, Anil K; Agarwal, Chapla; Agarwal, Rajesh

    2014-10-01

    Prostate cancer (PCA) is the 2nd leading cause of cancer-related deaths among men in the United States. Preventing or inhibiting metastasis-related events through non-toxic agents could be a useful approach for lowering high mortality among PCA patients. We have earlier reported that natural flavonoid silibinin possesses strong anti-metastatic efficacy against PCA however, mechanism/s of its action still remains largely unknown. One of the major events during metastasis is the replacement of cell-cell interaction with integrins-based cell-matrix interaction that controls motility, invasiveness and survival of cancer cells. Accordingly, here we examined silibinin effect on advanced human PCA PC3 cells' interaction with extracellular matrix component fibronectin. Silibinin (50-200 μM) treatment significantly decreased the fibronectin (5 μg/ml)-induced motile morphology via targeting actin cytoskeleton organization in PC3 cells. Silibinin also decreased the fibronectin-induced cell proliferation and motility but significantly increased cell death in PC3 cells. Silibinin also inhibited the PC3 cells invasiveness in Transwell invasion assays with fibronectin or cancer associated fibroblasts (CAFs) serving as chemoattractant. Importantly, PC3-luc cells cultured on fibronectin showed rapid dissemination and localized in lungs following tail vein injection in athymic male nude mice; however, in silibinin-treated PC3-luc cells, dissemination and lung localization was largely compromised. Molecular analyses revealed that silibinin treatment modulated the fibronectin-induced expression of integrins (α5, αV, β1 and β3), actin-remodeling (FAK, Src, GTPases, ARP2 and cortactin), apoptosis (cPARP and cleaved caspase 3), EMT (E-cadherin and β-catenin), and cell survival (survivin and Akt) related signaling molecules in PC3 cells. Furthermore, PC3-xenograft tissue analyses confirmed the inhibitory effect of silibinin on fibronectin and integrins expression. Together, these results showed that silibinin targets PCA cells' interaction with fibronectin and inhibits their motility, invasiveness and survival; thus further supporting silibinin use in PCA intervention including its metastatic progression.

  11. Distribution of a low dose compound within pharmaceutical tablet by using multivariate curve resolution on Raman hyperspectral images.

    PubMed

    Boiret, Mathieu; de Juan, Anna; Gorretta, Nathalie; Ginot, Yves-Michel; Roger, Jean-Michel

    2015-01-25

    In this work, Raman hyperspectral images and multivariate curve resolution-alternating least squares (MCR-ALS) are used to study the distribution of actives and excipients within a pharmaceutical drug product. This article is mainly focused on the distribution of a low dose constituent. Different approaches are compared, using initially filtered or non-filtered data, or using a column-wise augmented dataset before starting the MCR-ALS iterative process including appended information on the low dose component. In the studied formulation, magnesium stearate is used as a lubricant to improve powder flowability. With a theoretical concentration of 0.5% (w/w) in the drug product, the spectral variance contained in the data is weak. By using a principal component analysis (PCA) filtered dataset as a first step of the MCR-ALS approach, the lubricant information is lost in the non-explained variance and its associated distribution in the tablet cannot be highlighted. A sufficient number of components to generate the PCA noise-filtered matrix has to be used in order to keep the lubricant variability within the data set analyzed or, otherwise, work with the raw non-filtered data. Different models are built using an increasing number of components to perform the PCA reduction. It is shown that the magnesium stearate information can be extracted from a PCA model using a minimum of 20 components. In the last part, a column-wise augmented matrix, including a reference spectrum of the lubricant, is used before starting MCR-ALS process. PCA reduction is performed on the augmented matrix, so the magnesium stearate contribution is included within the MCR-ALS calculations. By using an appropriate PCA reduction, with a sufficient number of components, or by using an augmented dataset including appended information on the low dose component, the distribution of the two actives, the two main excipients and the low dose lubricant are correctly recovered. Copyright © 2014 Elsevier B.V. All rights reserved.

  12. Untargeted Metabolomic Analysis of Capsicum spp. by GC-MS.

    PubMed

    Aranha, Bianca Camargo; Hoffmann, Jessica Fernanda; Barbieri, Rosa Lia; Rombaldi, Cesar Valmor; Chaves, Fábio Clasen

    2017-09-01

    In order to conserve the biodiversity of Capsicum species and find genotypes with potential to be utilised commercially, Embrapa Clima Temperado maintains an active germplasm collection (AGC) that requires characterisation, enabling genotype selection and support for breeding programmes. The objective of this study was to characterise pepper accessions from the Embrapa Clima Temperado AGC and differentiate species based on their metabolic profile using an untargeted metabolomics approach. Cold (-20°C) methanol extraction residue of freeze-dried fruit samples was partitioned into water/methanol (A) and chloroform (B) fractions. The polar fraction (A) was derivatised and both fractions (A and B) were analysed by gas chromatography coupled to mass spectrometry (GC-MS). Data from each fraction was analysed using a multivariate principal component analysis (PCA) with XCMS software. Amino acids, sugars, organic acids, capsaicinoids, and hydrocarbons were identified. Outlying accessions including P116 (C. chinense), P46, and P76 (C. annuum) were observed in a PCA plot mainly due to their high sucrose and fructose contents. PCA also indicated a separation of P221 (C. annuum) and P200 (C. chinense), because of their high dihydrocapsaicin content. Although the metabolic profiling did not allow for grouping by species, it permitted the simultaneous identification and quantification of several compounds complementing and expanding the metabolic database of the studied Capsicum spp. in the AGC. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.

  13. Larger men have larger prostates: Detection bias in epidemiologic studies of obesity and prostate cancer risk

    PubMed Central

    Rundle, Andrew; Wang, Yun; Sadasivan, Sudha; Chitale, Dhananjay A.; Gupta, Nilesh S.; Tang, Deliang; Rybicki, Benjamin A.

    2017-01-01

    BACKGROUND Obesity is associated with risk of aggressive prostate cancer (PCa), but not with over-all PCa risk. However, obese men have larger prostates which may lower biopsy accuracy and cause a systematic bias towards the null in epidemiologic studies of over-all risk. METHODS Within a cohort of 6,692 men followed-up after a biopsy or transurethral resection of the prostate (TURP) with benign findings, a nested case-control study was conducted of 495 prostate cancer cases and controls matched on age, race, follow-up duration, biopsy versus TURP and procedure date. Data on body mass index and prostate volume at the time of the initial procedure were abstracted from medical records. RESULTS Prior to consideration of differences in prostate volume, overweight (OR = 1.41; 95% CI 1.01, 1.97) and obese status (OR = 1.59; 95% CI 1.09, 2.33) at the time of the original benign biopsy or TURP were associated with PCa incidence during follow-up. Prostate volume did not significantly moderate the association between body-size and PCa, however it did act as an inverse confounder; adjustment for prostate volume increased the effect size for overweight by 22% (adjusted OR = 1.52; 95% CI 1.08, 2.14) and for obese status by 23% (adjusted OR = 1.77; 95% CI 1.20, 2.62). Larger prostate volume at the time of the original benign biopsy or TURP was inversely associated with PCa incidence during follow-up (OR = 0.92 per 10 cc difference in volume; 95% CI 0.88, 0.97). In analyses that stratified case-control pairs by tumor aggressiveness of the case, prostate volume acted as an inverse confounder in analyses of non-aggressive PCa but not in analyses of aggressive PCa. CONCLUSIONS In studies of obesity and PCa, differences in prostate volume cause a bias towards the null, particularly in analyses of non-aggressive PCa. A pervasive underestimation of the association between obesity and overall PCa risk may exist in the literature. PMID:28349547

  14. Larger men have larger prostates: Detection bias in epidemiologic studies of obesity and prostate cancer risk.

    PubMed

    Rundle, Andrew; Wang, Yun; Sadasivan, Sudha; Chitale, Dhananjay A; Gupta, Nilesh S; Tang, Deliang; Rybicki, Benjamin A

    2017-06-01

    Obesity is associated with risk of aggressive prostate cancer (PCa), but not with over-all PCa risk. However, obese men have larger prostates which may lower biopsy accuracy and cause a systematic bias toward the null in epidemiologic studies of over-all risk. Within a cohort of 6692 men followed-up after a biopsy or transurethral resection of the prostate (TURP) with benign findings, a nested case-control study was conducted of 495 prostate cancer cases and controls matched on age, race, follow-up duration, biopsy versus TURP, and procedure date. Data on body mass index and prostate volume at the time of the initial procedure were abstracted from medical records. Prior to consideration of differences in prostate volume, overweight (OR = 1.41; 95%CI 1.01, 1.97), and obese status (OR = 1.59; 95%CI 1.09, 2.33) at the time of the original benign biopsy or TURP were associated with PCa incidence during follow-up. Prostate volume did not significantly moderate the association between body-size and PCa, however it did act as an inverse confounder; adjustment for prostate volume increased the effect size for overweight by 22% (adjusted OR = 1.52; 95%CI 1.08, 2.14) and for obese status by 23% (adjusted OR = 1.77; 95%CI 1.20, 2.62). Larger prostate volume at the time of the original benign biopsy or TURP was inversely associated with PCa incidence during follow-up (OR = 0.92 per 10 cc difference in volume; 95%CI 0.88, 0.97). In analyses that stratified case-control pairs by tumor aggressiveness of the case, prostate volume acted as an inverse confounder in analyses of non-aggressive PCa but not in analyses of aggressive PCa. In studies of obesity and PCa, differences in prostate volume cause a bias toward the null, particularly in analyses of non-aggressive PCa. A pervasive underestimation of the association between obesity and overall PCa risk may exist in the literature. © 2017 Wiley Periodicals, Inc.

  15. Investigation of domain walls in PPLN by confocal raman microscopy and PCA analysis

    NASA Astrophysics Data System (ADS)

    Shur, Vladimir Ya.; Zelenovskiy, Pavel; Bourson, Patrice

    2017-07-01

    Confocal Raman microscopy (CRM) is a powerful tool for investigation of ferroelectric domains. Mechanical stresses and electric fields existed in the vicinity of neutral and charged domain walls modify frequency, intensity and width of spectral lines [1], thus allowing to visualize micro- and nanodomain structures both at the surface and in the bulk of the crystal [2,3]. Stresses and fields are naturally coupled in ferroelectrics due to inverse piezoelectric effect and hardly can be separated in Raman spectra. PCA is a powerful statistical method for analysis of large data matrix providing a set of orthogonal variables, called principal components (PCs). PCA is widely used for classification of experimental data, for example, in crystallization experiments, for detection of small amounts of components in solid mixtures etc. [4,5]. In Raman spectroscopy PCA was applied for analysis of phase transitions and provided critical pressure with good accuracy [6]. In the present work we for the first time applied Principal Component Analysis (PCA) method for analysis of Raman spectra measured in periodically poled lithium niobate (PPLN). We found that principal components demonstrate different sensitivity to mechanical stresses and electric fields in the vicinity of the domain walls. This allowed us to separately visualize spatial distribution of fields and electric fields at the surface and in the bulk of PPLN.

  16. A measure for objects clustering in principal component analysis biplot: A case study in inter-city buses maintenance cost data

    NASA Astrophysics Data System (ADS)

    Ginanjar, Irlandia; Pasaribu, Udjianna S.; Indratno, Sapto W.

    2017-03-01

    This article presents the application of the principal component analysis (PCA) biplot for the needs of data mining. This article aims to simplify and objectify the methods for objects clustering in PCA biplot. The novelty of this paper is to get a measure that can be used to objectify the objects clustering in PCA biplot. Orthonormal eigenvectors, which are the coefficients of a principal component model representing an association between principal components and initial variables. The existence of the association is a valid ground to objects clustering based on principal axes value, thus if m principal axes used in the PCA, then the objects can be classified into 2m clusters. The inter-city buses are clustered based on maintenance costs data by using two principal axes PCA biplot. The buses are clustered into four groups. The first group is the buses with high maintenance costs, especially for lube, and brake canvass. The second group is the buses with high maintenance costs, especially for tire, and filter. The third group is the buses with low maintenance costs, especially for lube, and brake canvass. The fourth group is buses with low maintenance costs, especially for tire, and filter.

  17. Blind source separation problem in GPS time series

    NASA Astrophysics Data System (ADS)

    Gualandi, A.; Serpelloni, E.; Belardinelli, M. E.

    2016-04-01

    A critical point in the analysis of ground displacement time series, as those recorded by space geodetic techniques, is the development of data-driven methods that allow the different sources of deformation to be discerned and characterized in the space and time domains. Multivariate statistic includes several approaches that can be considered as a part of data-driven methods. A widely used technique is the principal component analysis (PCA), which allows us to reduce the dimensionality of the data space while maintaining most of the variance of the dataset explained. However, PCA does not perform well in finding the solution to the so-called blind source separation (BSS) problem, i.e., in recovering and separating the original sources that generate the observed data. This is mainly due to the fact that PCA minimizes the misfit calculated using an L2 norm (χ 2), looking for a new Euclidean space where the projected data are uncorrelated. The independent component analysis (ICA) is a popular technique adopted to approach the BSS problem. However, the independence condition is not easy to impose, and it is often necessary to introduce some approximations. To work around this problem, we test the use of a modified variational Bayesian ICA (vbICA) method to recover the multiple sources of ground deformation even in the presence of missing data. The vbICA method models the probability density function (pdf) of each source signal using a mix of Gaussian distributions, allowing for more flexibility in the description of the pdf of the sources with respect to standard ICA, and giving a more reliable estimate of them. Here we present its application to synthetic global positioning system (GPS) position time series, generated by simulating deformation near an active fault, including inter-seismic, co-seismic, and post-seismic signals, plus seasonal signals and noise, and an additional time-dependent volcanic source. We evaluate the ability of the PCA and ICA decomposition techniques in explaining the data and in recovering the original (known) sources. Using the same number of components, we find that the vbICA method fits the data almost as well as a PCA method, since the χ 2 increase is less than 10 % the value calculated using a PCA decomposition. Unlike PCA, the vbICA algorithm is found to correctly separate the sources if the correlation of the dataset is low (<0.67) and the geodetic network is sufficiently dense (ten continuous GPS stations within a box of side equal to two times the locking depth of a fault where an earthquake of Mw >6 occurred). We also provide a cookbook for the use of the vbICA algorithm in analyses of position time series for tectonic and non-tectonic applications.

  18. IMPROVED SEARCH OF PRINCIPAL COMPONENT ANALYSIS DATABASES FOR SPECTRO-POLARIMETRIC INVERSION

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Casini, R.; Lites, B. W.; Ramos, A. Asensio

    2013-08-20

    We describe a simple technique for the acceleration of spectro-polarimetric inversions based on principal component analysis (PCA) of Stokes profiles. This technique involves the indexing of the database models based on the sign of the projections (PCA coefficients) of the first few relevant orders of principal components of the four Stokes parameters. In this way, each model in the database can be attributed a distinctive binary number of 2{sup 4n} bits, where n is the number of PCA orders used for the indexing. Each of these binary numbers (indices) identifies a group of ''compatible'' models for the inversion of amore » given set of observed Stokes profiles sharing the same index. The complete set of the binary numbers so constructed evidently determines a partition of the database. The search of the database for the PCA inversion of spectro-polarimetric data can profit greatly from this indexing. In practical cases it becomes possible to approach the ideal acceleration factor of 2{sup 4n} as compared to the systematic search of a non-indexed database for a traditional PCA inversion. This indexing method relies on the existence of a physical meaning in the sign of the PCA coefficients of a model. For this reason, the presence of model ambiguities and of spectro-polarimetric noise in the observations limits in practice the number n of relevant PCA orders that can be used for the indexing.« less

  19. Morphological analyses suggest a new taxonomic circumscription for Hymenaea courbaril L. (Leguminosae, Caesalpinioideae)

    PubMed Central

    Souza, Isys Mascarenhas; Funch, Ligia Silveira; de Queiroz, Luciano Paganucci

    2014-01-01

    Abstract Hymenaea is a genus of the Resin-producing Clade of the tribe Detarieae (Leguminosae: Caesalpinioideae) with 14 species. Hymenaea courbaril is the most widespread species of the genus, ranging from southern Mexico to southeastern Brazil. As currently circumscribed, Hymenaea courbaril is a polytypic species with six varieties: var. altissima, var. courbaril, var. longifolia, var. stilbocarpa, var. subsessilis, and var. villosa. These varieties are distinguishable mostly by traits related to leaflet shape and indumentation, and calyx indumentation. We carried out morphometric analyses of 14 quantitative (continuous) leaf characters in order to assess the taxonomy of Hymenaea courbaril under the Unified Species Concept framework. Cluster analysis used the Unweighted Pair Group Method with Arithmetic Mean (UPGMA) based on Bray-Curtis dissimilarity matrices. Principal Component Analyses (PCA) were carried out based on the same morphometric matrix. Two sets of Analyses of Similarity and Non Parametric Multivariate Analysis of Variance were carried out to evaluate statistical support (1) for the major groups recovered using UPGMA and PCA, and (2) for the varieties. All analyses recovered three major groups coincident with (1) var. altissima, (2) var. longifolia, and (3) all other varieties. These results, together with geographical and habitat information, were taken as evidence of three separate metapopulation lineages recognized here as three distinct species. Nomenclatural adjustments, including reclassifying formerly misapplied types, are proposed. PMID:25009440

  20. Morphological analyses suggest a new taxonomic circumscription for Hymenaea courbaril L. (Leguminosae, Caesalpinioideae).

    PubMed

    Souza, Isys Mascarenhas; Funch, Ligia Silveira; de Queiroz, Luciano Paganucci

    2014-01-01

    Hymenaea is a genus of the Resin-producing Clade of the tribe Detarieae (Leguminosae: Caesalpinioideae) with 14 species. Hymenaea courbaril is the most widespread species of the genus, ranging from southern Mexico to southeastern Brazil. As currently circumscribed, Hymenaea courbaril is a polytypic species with six varieties: var. altissima, var. courbaril, var. longifolia, var. stilbocarpa, var. subsessilis, and var. villosa. These varieties are distinguishable mostly by traits related to leaflet shape and indumentation, and calyx indumentation. We carried out morphometric analyses of 14 quantitative (continuous) leaf characters in order to assess the taxonomy of Hymenaea courbaril under the Unified Species Concept framework. Cluster analysis used the Unweighted Pair Group Method with Arithmetic Mean (UPGMA) based on Bray-Curtis dissimilarity matrices. Principal Component Analyses (PCA) were carried out based on the same morphometric matrix. Two sets of Analyses of Similarity and Non Parametric Multivariate Analysis of Variance were carried out to evaluate statistical support (1) for the major groups recovered using UPGMA and PCA, and (2) for the varieties. All analyses recovered three major groups coincident with (1) var. altissima, (2) var. longifolia, and (3) all other varieties. These results, together with geographical and habitat information, were taken as evidence of three separate metapopulation lineages recognized here as three distinct species. Nomenclatural adjustments, including reclassifying formerly misapplied types, are proposed.

  1. Prostatitis, other genitourinary infections and prostate cancer: results from a population-based case-control study.

    PubMed

    Boehm, Katharina; Valdivieso, Roger; Meskawi, Malek; Larcher, Alessandro; Schiffmann, Jonas; Sun, Maxine; Graefen, Markus; Saad, Fred; Parent, Marie-Élise; Karakiewicz, Pierre I

    2016-03-01

    We relied on a population-based case-control study (PROtEuS) to examine a potential association between the presence of histologically confirmed prostate cancer (PCa) and history of genitourinary infections, e.g., prostatitis, urethritis, orchitis and epididymitis. Cases were 1933 men with incident PCa, diagnosed across Montreal hospitals between 2005 and 2009. Population controls were 1994 men from the same residential area and age distribution. In-person interviews collected information about socio-demographic characteristics, lifestyle and medical history, e.g., self-reported history of several genitourinary infections, as well as on PCa screening. Logistic regression analyses tested overall and grade-specific associations, including subgroup analyses with frequent PSA testing. After multivariable adjustment, prostatitis was associated with an increased risk of any PCa (OR 1.81 [1.44-2.27]), but not urethritis (OR 1.05 [0.84-1.30]), orchitis (OR 1.28 [0.92-1.78]) or epididymitis (OR 0.98 [0.57-1.68]). The association between prostatitis and PCa was more pronounced for low-grade PCa (Gleason ≤ 6: OR 2.11 [1.61-2.77]; Gleason ≥ 7: OR 1.59 [1.22-2.07]). Adjusting for frequency of physician visits, PSA testing frequency or restricting analyses to frequently screened subjects did not affect these results. Prostatitis was associated with an increased probability for detecting PCa even after adjustment for frequency of PSA testing and physician visits, but not urethritis, orchitis or epididymitis. These considerations may be helpful in clinical risk stratification of individuals in whom the risk of PCa is pertinent.

  2. Evaluation of skin melanoma in spectral range 450-950 nm using principal component analysis

    NASA Astrophysics Data System (ADS)

    Jakovels, D.; Lihacova, I.; Kuzmina, I.; Spigulis, J.

    2013-06-01

    Diagnostic potential of principal component analysis (PCA) of multi-spectral imaging data in the wavelength range 450- 950 nm for distant skin melanoma recognition is discussed. Processing of the measured clinical data by means of PCA resulted in clear separation between malignant melanomas and pigmented nevi.

  3. Principle Component Analysis with Incomplete Data: A simulation of R pcaMethods package in Constructing an Environmental Quality Index with Missing Data

    EPA Science Inventory

    Missing data is a common problem in the application of statistical techniques. In principal component analysis (PCA), a technique for dimensionality reduction, incomplete data points are either discarded or imputed using interpolation methods. Such approaches are less valid when ...

  4. Metabolic syndrome and low high-density lipoprotein cholesterol are associated with adverse pathological features in patients with prostate cancer treated by radical prostatectomy.

    PubMed

    Lebdai, Souhil; Mathieu, Romain; Leger, Julie; Haillot, Olivier; Vincendeau, Sébastien; Rioux-Leclercq, Nathalie; Fournier, Georges; Perrouin-Verbe, Marie-Aimée; Doucet, Laurent; Azzouzi, Abdel Rahmene; Rigaud, Jérome; Renaudin, Karine; Charles, Thomas; Bruyere, Franck; Fromont, Gaelle

    2018-02-01

    Previous studies have suggested a link between metabolic syndrome (MetS) and prostate cancer (PCa). In the present study, we aimed to assess the association between MetS and markers of PCa aggressiveness on radical prostatectomy (RP). All patients consecutively treated for PCa by RP in 6 academic institutions between August 2013 and July 2016 were included. MetS was defined as at least 3 of 5 components (obesity, elevated blood pressure, diabetes, low high-density lipoprotein (HDL)-cholesterol, and hypertriglyceridemia). Demographic, biological, and clinical parameters were prospectively collected, including: age, biopsy results, preoperative serum prostate-specific antigen, surgical procedure, and pathological data of RP specimen. Locally advanced disease was defined as a pT-stage ≥3. International Society of Urological Pathology (ISUP) groups were used for pathological grading. Qualitative and quantitative variables were compared using chi-square and Wilcoxon tests; logistic regression analyses assessed the association of MetS and its components with pathological data. Statistical significance was defined as a P<0.05. Among 567 men, 249 (44%) had MetS. In a multivariate model including preoperative prostate-specific antigen, biopsy ISUP-score, clinical T-stage, age, and ethnicity: we found that MetS was an independent risk factor for positive margins, and ISUP group ≥4 on the RP specimen (odds ratio [OR] = 1.5; 95% CI: 1.1-2.3; P = 0.035; OR = 2.0; 95% CI: 1.1-4.0; P = 0.044, respectively). In addition, low HDL-cholesterol level was associated with locally advanced PCa (OR = 1.6; 95% CI: 1.1-2.4; P = 0.024). Risks of adverse pathological features increased with the number of MetS components: having ≥ 4 MetS components was significantly associated with higher risk of ISUP group ≥ 4 and higher risk of positive margins (OR = 1.9; 95% CI: 1.1-3.3; P = 0.017; OR = 1.8; 95% CI: 1.1-2.8; P = 0.007, respectively). MetS was an independent predictive factor for higher ISUP group and positive margins at RP. Low HDL-cholesterol alone, and having 4 and more MetS components were also associated with higher risk of adverse pathological features. Copyright © 2018. Published by Elsevier Inc.

  5. Application of principal component analysis to distinguish patients with schizophrenia from healthy controls based on fractional anisotropy measurements.

    PubMed

    Caprihan, A; Pearlson, G D; Calhoun, V D

    2008-08-15

    Principal component analysis (PCA) is often used to reduce the dimension of data before applying more sophisticated data analysis methods such as non-linear classification algorithms or independent component analysis. This practice is based on selecting components corresponding to the largest eigenvalues. If the ultimate goal is separation of data in two groups, then these set of components need not have the most discriminatory power. We measured the distance between two such populations using Mahalanobis distance and chose the eigenvectors to maximize it, a modified PCA method, which we call the discriminant PCA (DPCA). DPCA was applied to diffusion tensor-based fractional anisotropy images to distinguish age-matched schizophrenia subjects from healthy controls. The performance of the proposed method was evaluated by the one-leave-out method. We show that for this fractional anisotropy data set, the classification error with 60 components was close to the minimum error and that the Mahalanobis distance was twice as large with DPCA, than with PCA. Finally, by masking the discriminant function with the white matter tracts of the Johns Hopkins University atlas, we identified left superior longitudinal fasciculus as the tract which gave the least classification error. In addition, with six optimally chosen tracts the classification error was zero.

  6. Understanding the pattern of the BSE Sensex

    NASA Astrophysics Data System (ADS)

    Mukherjee, I.; Chatterjee, Soumya; Giri, A.; Barat, P.

    2017-09-01

    An attempt is made to understand the pattern of behaviour of the BSE Sensex by analysing the tick-by-tick Sensex data for the years 2006 to 2012 on yearly as well as cumulative basis using Principal Component Analysis (PCA) and its nonlinear variant Kernel Principal Component Analysis (KPCA). The latter technique ensures that the nonlinear character of the interactions present in the system gets captured in the analysis. The analysis is carried out by constructing vector spaces of varying dimensions. The size of the data set ranges from a minimum of 360,000 for one year to a maximum of 2,520,000 for seven years. In all cases the prices appear to be highly correlated and restricted to a very low dimensional subspace of the original vector space. An external perturbation is added to the system in the form of noise. It is observed that while standard PCA is unable to distinguish the behaviour of the noise-mixed data from that of the original, KPCA clearly identifies the effect of the noise. The exercise is extended in case of daily data of other stock markets and similar results are obtained.

  7. BPH: a tell-tale sign of prostate cancer? Results from the Prostate Cancer and Environment Study (PROtEuS).

    PubMed

    Boehm, Katharina; Valdivieso, Roger; Meskawi, Malek; Larcher, Alessandro; Sun, Maxine; Sosa, José; Blanc-Lapierre, Audrey; Weiss, Deborah; Graefen, Markus; Saad, Fred; Parent, Marie-Élise; Karakiewicz, Pierre I

    2015-12-01

    In a population-based case-control study (PROtEuS), we examined the association between prostate cancer (PCa) and (1) benign prostatic hypertrophy (BPH) history at any time prior to PCa diagnosis, (2) BPH-history reported at least 1 year prior to interview/diagnosis (index date) and (3) exposure to BPH-medications. Cases were 1933 men with incident prostate cancer diagnosed across Montreal French hospitals between 2005 and 2009. Population controls were 1994 men from the same age distribution and residential area. In-person interviews collected socio-demographic characteristics and medical history, e.g., BPH diagnosis, duration and treatment, as well as on PCa screening. Logistic regression analyses tested overall and grade-specific associations, including subgroup analyses with frequent PSA testing. A BPH-history was associated with an increased risk of PCa (OR 1.37 [95 % CI 1.16-2.61]), more pronounced for low-grade PCa (Gleason ≤6: OR 1.54 [1.26-1.87]; Gleason ≥7: OR 1.05 [0.86-1.27]). The association was not significant when BPH-history diagnosis was more than 1 year prior to index date, except for low-grade PCa (OR 1.29 [1.05-1.60]). Exposure to 5α reductase inhibitors (5α-RI) resulted in a decreased risk of overall PCa (OR 0.62 [0.42-0.92]), particularly for intermediate- to high-grade PCa (Gleason ≤6: OR 0.70 [0.43-1.14]; Gleason ≥7: OR 0.43 [0.26-0.72]). Adjusting for PSA testing frequency or restricting analyses to frequently screened subjects did not affect these results. BPH-history was associated with an increased PCa risk, which disappeared, when BPH-history did not include BPH diagnosis within the previous year. Our results also suggest that 5α-RI exposure exerts a protective effect on intermediate and high-grade PCa.

  8. Multilevel principal component analysis (mPCA) in shape analysis: A feasibility study in medical and dental imaging.

    PubMed

    Farnell, D J J; Popat, H; Richmond, S

    2016-06-01

    Methods used in image processing should reflect any multilevel structures inherent in the image dataset or they run the risk of functioning inadequately. We wish to test the feasibility of multilevel principal components analysis (PCA) to build active shape models (ASMs) for cases relevant to medical and dental imaging. Multilevel PCA was used to carry out model fitting to sets of landmark points and it was compared to the results of "standard" (single-level) PCA. Proof of principle was tested by applying mPCA to model basic peri-oral expressions (happy, neutral, sad) approximated to the junction between the mouth/lips. Monte Carlo simulations were used to create this data which allowed exploration of practical implementation issues such as the number of landmark points, number of images, and number of groups (i.e., "expressions" for this example). To further test the robustness of the method, mPCA was subsequently applied to a dental imaging dataset utilising landmark points (placed by different clinicians) along the boundary of mandibular cortical bone in panoramic radiographs of the face. Changes of expression that varied between groups were modelled correctly at one level of the model and changes in lip width that varied within groups at another for the Monte Carlo dataset. Extreme cases in the test dataset were modelled adequately by mPCA but not by standard PCA. Similarly, variations in the shape of the cortical bone were modelled by one level of mPCA and variations between the experts at another for the panoramic radiographs dataset. Results for mPCA were found to be comparable to those of standard PCA for point-to-point errors via miss-one-out testing for this dataset. These errors reduce with increasing number of eigenvectors/values retained, as expected. We have shown that mPCA can be used in shape models for dental and medical image processing. mPCA was found to provide more control and flexibility when compared to standard "single-level" PCA. Specifically, mPCA is preferable to "standard" PCA when multiple levels occur naturally in the dataset. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  9. Geographical origin discrimination of lentils (Lens culinaris Medik.) using 1H NMR fingerprinting and multivariate statistical analyses.

    PubMed

    Longobardi, Francesco; Innamorato, Valentina; Di Gioia, Annalisa; Ventrella, Andrea; Lippolis, Vincenzo; Logrieco, Antonio F; Catucci, Lucia; Agostiano, Angela

    2017-12-15

    Lentil samples coming from two different countries, i.e. Italy and Canada, were analysed using untargeted 1 H NMR fingerprinting in combination with chemometrics in order to build models able to classify them according to their geographical origin. For such aim, Soft Independent Modelling of Class Analogy (SIMCA), k-Nearest Neighbor (k-NN), Principal Component Analysis followed by Linear Discriminant Analysis (PCA-LDA) and Partial Least Squares-Discriminant Analysis (PLS-DA) were applied to the NMR data and the results were compared. The best combination of average recognition (100%) and cross-validation prediction abilities (96.7%) was obtained for the PCA-LDA. All the statistical models were validated both by using a test set and by carrying out a Monte Carlo Cross Validation: the obtained performances were found to be satisfying for all the models, with prediction abilities higher than 95% demonstrating the suitability of the developed methods. Finally, the metabolites that mostly contributed to the lentil discrimination were indicated. Copyright © 2017 Elsevier Ltd. All rights reserved.

  10. Psychological Analyses of Courageous Performance in Military Personnel

    DTIC Science & Technology

    1986-11-01

    schedule HR heart rate IBI inter- beat interval N number of subjects NS not statistically significant P probability PCA principal components analysis RAQ...tones in the range of 400 to 600 Hz, set at a level of 60 dB, transmitted for 1 sec binaurally through earphones from a commercial oscillator. The...because of interference on the recording trace. Cardiac activity was measured in terms of heart rate (HR). The number of beats /minute was estimared by

  11. Q-mode versus R-mode principal component analysis for linear discriminant analysis (LDA)

    NASA Astrophysics Data System (ADS)

    Lee, Loong Chuen; Liong, Choong-Yeun; Jemain, Abdul Aziz

    2017-05-01

    Many literature apply Principal Component Analysis (PCA) as either preliminary visualization or variable con-struction methods or both. Focus of PCA can be on the samples (R-mode PCA) or variables (Q-mode PCA). Traditionally, R-mode PCA has been the usual approach to reduce high-dimensionality data before the application of Linear Discriminant Analysis (LDA), to solve classification problems. Output from PCA composed of two new matrices known as loadings and scores matrices. Each matrix can then be used to produce a plot, i.e. loadings plot aids identification of important variables whereas scores plot presents spatial distribution of samples on new axes that are also known as Principal Components (PCs). Fundamentally, the scores matrix always be the input variables for building classification model. A recent paper uses Q-mode PCA but the focus of analysis was not on the variables but instead on the samples. As a result, the authors have exchanged the use of both loadings and scores plots in which clustering of samples was studied using loadings plot whereas scores plot has been used to identify important manifest variables. Therefore, the aim of this study is to statistically validate the proposed practice. Evaluation is based on performance of external error obtained from LDA models according to number of PCs. On top of that, bootstrapping was also conducted to evaluate the external error of each of the LDA models. Results show that LDA models produced by PCs from R-mode PCA give logical performance and the matched external error are also unbiased whereas the ones produced with Q-mode PCA show the opposites. With that, we concluded that PCs produced from Q-mode is not statistically stable and thus should not be applied to problems of classifying samples, but variables. We hope this paper will provide some insights on the disputable issues.

  12. Adaptive online monitoring for ICU patients by combining just-in-time learning and principal component analysis.

    PubMed

    Li, Xuejian; Wang, Youqing

    2016-12-01

    Offline general-type models are widely used for patients' monitoring in intensive care units (ICUs), which are developed by using past collected datasets consisting of thousands of patients. However, these models may fail to adapt to the changing states of ICU patients. Thus, to be more robust and effective, the monitoring models should be adaptable to individual patients. A novel combination of just-in-time learning (JITL) and principal component analysis (PCA), referred to learning-type PCA (L-PCA), was proposed for adaptive online monitoring of patients in ICUs. JITL was used to gather the most relevant data samples for adaptive modeling of complex physiological processes. PCA was used to build an online individual-type model and calculate monitoring statistics, and then to judge whether the patient's status is normal or not. The adaptability of L-PCA lies in the usage of individual data and the continuous updating of the training dataset. Twelve subjects were selected from the Physiobank's Multi-parameter Intelligent Monitoring for Intensive Care II (MIMIC II) database, and five vital signs of each subject were chosen. The proposed method was compared with the traditional PCA and fast moving-window PCA (Fast MWPCA). The experimental results demonstrated that the fault detection rates respectively increased by 20 % and 47 % compared with PCA and Fast MWPCA. L-PCA is first introduced into ICU patients monitoring and achieves the best monitoring performance in terms of adaptability to changes in patient status and sensitivity for abnormality detection.

  13. Breast Shape Analysis With Curvature Estimates and Principal Component Analysis for Cosmetic and Reconstructive Breast Surgery.

    PubMed

    Catanuto, Giuseppe; Taher, Wafa; Rocco, Nicola; Catalano, Francesca; Allegra, Dario; Milotta, Filippo Luigi Maria; Stanco, Filippo; Gallo, Giovanni; Nava, Maurizio Bruno

    2018-03-20

    Breast shape is defined utilizing mainly qualitative assessment (full, flat, ptotic) or estimates, such as volume or distances between reference points, that cannot describe it reliably. We will quantitatively describe breast shape with two parameters derived from a statistical methodology denominated principal component analysis (PCA). We created a heterogeneous dataset of breast shapes acquired with a commercial infrared 3-dimensional scanner on which PCA was performed. We plotted on a Cartesian plane the two highest values of PCA for each breast (principal components 1 and 2). Testing of the methodology on a preoperative and postoperative surgical case and test-retest was performed by two operators. The first two principal components derived from PCA are able to characterize the shape of the breast included in the dataset. The test-retest demonstrated that different operators are able to obtain very similar values of PCA. The system is also able to identify major changes in the preoperative and postoperative stages of a two-stage reconstruction. Even minor changes were correctly detected by the system. This methodology can reliably describe the shape of a breast. An expert operator and a newly trained operator can reach similar results in a test/re-testing validation. Once developed and after further validation, this methodology could be employed as a good tool for outcome evaluation, auditing, and benchmarking.

  14. Rotation of EOFs by the Independent Component Analysis: Towards A Solution of the Mixing Problem in the Decomposition of Geophysical Time Series

    NASA Technical Reports Server (NTRS)

    Aires, Filipe; Rossow, William B.; Chedin, Alain; Hansen, James E. (Technical Monitor)

    2001-01-01

    The Independent Component Analysis is a recently developed technique for component extraction. This new method requires the statistical independence of the extracted components, a stronger constraint that uses higher-order statistics, instead of the classical decorrelation, a weaker constraint that uses only second-order statistics. This technique has been used recently for the analysis of geophysical time series with the goal of investigating the causes of variability in observed data (i.e. exploratory approach). We demonstrate with a data simulation experiment that, if initialized with a Principal Component Analysis, the Independent Component Analysis performs a rotation of the classical PCA (or EOF) solution. This rotation uses no localization criterion like other Rotation Techniques (RT), only the global generalization of decorrelation by statistical independence is used. This rotation of the PCA solution seems to be able to solve the tendency of PCA to mix several physical phenomena, even when the signal is just their linear sum.

  15. Use of Geochemistry Data Collected by the Mars Exploration Rover Spirit in Gusev Crater to Teach Geomorphic Zonation through Principal Components Analysis

    ERIC Educational Resources Information Center

    Rodrigue, Christine M.

    2011-01-01

    This paper presents a laboratory exercise used to teach principal components analysis (PCA) as a means of surface zonation. The lab was built around abundance data for 16 oxides and elements collected by the Mars Exploration Rover Spirit in Gusev Crater between Sol 14 and Sol 470. Students used PCA to reduce 15 of these into 3 components, which,…

  16. Fast principal component analysis for stacking seismic data

    NASA Astrophysics Data System (ADS)

    Wu, Juan; Bai, Min

    2018-04-01

    Stacking seismic data plays an indispensable role in many steps of the seismic data processing and imaging workflow. Optimal stacking of seismic data can help mitigate seismic noise and enhance the principal components to a great extent. Traditional average-based seismic stacking methods cannot obtain optimal performance when the ambient noise is extremely strong. We propose a principal component analysis (PCA) algorithm for stacking seismic data without being sensitive to noise level. Considering the computational bottleneck of the classic PCA algorithm in processing massive seismic data, we propose an efficient PCA algorithm to make the proposed method readily applicable for industrial applications. Two numerically designed examples and one real seismic data are used to demonstrate the performance of the presented method.

  17. Selection of solubility parameters for characterization of pharmaceutical excipients.

    PubMed

    Adamska, Katarzyna; Voelkel, Adam; Héberger, Károly

    2007-11-09

    The solubility parameter (delta(2)), corrected solubility parameter (delta(T)) and its components (delta(d), delta(p), delta(h)) were determined for series of pharmaceutical excipients by using inverse gas chromatography (IGC). Principal component analysis (PCA) was applied for the selection of the solubility parameters which assure the complete characterization of examined materials. Application of PCA suggests that complete description of examined materials is achieved with four solubility parameters, i.e. delta(2) and Hansen solubility parameters (delta(d), delta(p), delta(h)). Selection of the excipients through PCA of their solubility parameters data can be used for prediction of their behavior in a multi-component system, e.g. for selection of the best materials to form stable pharmaceutical liquid mixtures or stable coating formulation.

  18. Common factor analysis versus principal component analysis: choice for symptom cluster research.

    PubMed

    Kim, Hee-Ju

    2008-03-01

    The purpose of this paper is to examine differences between two factor analytical methods and their relevance for symptom cluster research: common factor analysis (CFA) versus principal component analysis (PCA). Literature was critically reviewed to elucidate the differences between CFA and PCA. A secondary analysis (N = 84) was utilized to show the actual result differences from the two methods. CFA analyzes only the reliable common variance of data, while PCA analyzes all the variance of data. An underlying hypothetical process or construct is involved in CFA but not in PCA. PCA tends to increase factor loadings especially in a study with a small number of variables and/or low estimated communality. Thus, PCA is not appropriate for examining the structure of data. If the study purpose is to explain correlations among variables and to examine the structure of the data (this is usual for most cases in symptom cluster research), CFA provides a more accurate result. If the purpose of a study is to summarize data with a smaller number of variables, PCA is the choice. PCA can also be used as an initial step in CFA because it provides information regarding the maximum number and nature of factors. In using factor analysis for symptom cluster research, several issues need to be considered, including subjectivity of solution, sample size, symptom selection, and level of measure.

  19. Principal component analysis as a tool for library design: a case study investigating natural products, brand-name drugs, natural product-like libraries, and drug-like libraries.

    PubMed

    Wenderski, Todd A; Stratton, Christopher F; Bauer, Renato A; Kopp, Felix; Tan, Derek S

    2015-01-01

    Principal component analysis (PCA) is a useful tool in the design and planning of chemical libraries. PCA can be used to reveal differences in structural and physicochemical parameters between various classes of compounds by displaying them in a convenient graphical format. Herein, we demonstrate the use of PCA to gain insight into structural features that differentiate natural products, synthetic drugs, natural product-like libraries, and drug-like libraries, and show how the results can be used to guide library design.

  20. Principal Component Analysis as a Tool for Library Design: A Case Study Investigating Natural Products, Brand-Name Drugs, Natural Product-Like Libraries, and Drug-Like Libraries

    PubMed Central

    Wenderski, Todd A.; Stratton, Christopher F.; Bauer, Renato A.; Kopp, Felix; Tan, Derek S.

    2015-01-01

    Principal component analysis (PCA) is a useful tool in the design and planning of chemical libraries. PCA can be used to reveal differences in structural and physicochemical parameters between various classes of compounds by displaying them in a convenient graphical format. Herein, we demonstrate the use of PCA to gain insight into structural features that differentiate natural products, synthetic drugs, natural product-like libraries, and drug-like libraries, and show how the results can be used to guide library design. PMID:25618349

  1. Performance comparisons between PCA-EA-LBG and PCA-LBG-EA approaches in VQ codebook generation for image compression

    NASA Astrophysics Data System (ADS)

    Tsai, Jinn-Tsong; Chou, Ping-Yi; Chou, Jyh-Horng

    2015-11-01

    The aim of this study is to generate vector quantisation (VQ) codebooks by integrating principle component analysis (PCA) algorithm, Linde-Buzo-Gray (LBG) algorithm, and evolutionary algorithms (EAs). The EAs include genetic algorithm (GA), particle swarm optimisation (PSO), honey bee mating optimisation (HBMO), and firefly algorithm (FF). The study is to provide performance comparisons between PCA-EA-LBG and PCA-LBG-EA approaches. The PCA-EA-LBG approaches contain PCA-GA-LBG, PCA-PSO-LBG, PCA-HBMO-LBG, and PCA-FF-LBG, while the PCA-LBG-EA approaches contain PCA-LBG, PCA-LBG-GA, PCA-LBG-PSO, PCA-LBG-HBMO, and PCA-LBG-FF. All training vectors of test images are grouped according to PCA. The PCA-EA-LBG used the vectors grouped by PCA as initial individuals, and the best solution gained by the EAs was given for LBG to discover a codebook. The PCA-LBG approach is to use the PCA to select vectors as initial individuals for LBG to find a codebook. The PCA-LBG-EA used the final result of PCA-LBG as an initial individual for EAs to find a codebook. The search schemes in PCA-EA-LBG first used global search and then applied local search skill, while in PCA-LBG-EA first used local search and then employed global search skill. The results verify that the PCA-EA-LBG indeed gain superior results compared to the PCA-LBG-EA, because the PCA-EA-LBG explores a global area to find a solution, and then exploits a better one from the local area of the solution. Furthermore the proposed PCA-EA-LBG approaches in designing VQ codebooks outperform existing approaches shown in the literature.

  2. Removal of BCG artefact from concurrent fMRI-EEG recordings based on EMD and PCA.

    PubMed

    Javed, Ehtasham; Faye, Ibrahima; Malik, Aamir Saeed; Abdullah, Jafri Malin

    2017-11-01

    Simultaneous electroencephalography (EEG) and functional magnetic resonance image (fMRI) acquisitions provide better insight into brain dynamics. Some artefacts due to simultaneous acquisition pose a threat to the quality of the data. One such problematic artefact is the ballistocardiogram (BCG) artefact. We developed a hybrid algorithm that combines features of empirical mode decomposition (EMD) with principal component analysis (PCA) to reduce the BCG artefact. The algorithm does not require extra electrocardiogram (ECG) or electrooculogram (EOG) recordings to extract the BCG artefact. The method was tested with both simulated and real EEG data of 11 participants. From the simulated data, the similarity index between the extracted BCG and the simulated BCG showed the effectiveness of the proposed method in BCG removal. On the other hand, real data were recorded with two conditions, i.e. resting state (eyes closed dataset) and task influenced (event-related potentials (ERPs) dataset). Using qualitative (visual inspection) and quantitative (similarity index, improved normalized power spectrum (INPS) ratio, power spectrum, sample entropy (SE)) evaluation parameters, the assessment results showed that the proposed method can efficiently reduce the BCG artefact while preserving the neuronal signals. Compared with conventional methods, namely, average artefact subtraction (AAS), optimal basis set (OBS) and combined independent component analysis and principal component analysis (ICA-PCA), the statistical analyses of the results showed that the proposed method has better performance, and the differences were significant for all quantitative parameters except for the power and sample entropy. The proposed method does not require any reference signal, prior information or assumption to extract the BCG artefact. It will be very useful in circumstances where the reference signal is not available. Copyright © 2017 Elsevier B.V. All rights reserved.

  3. Principal components of hand kinematics and neurophysiological signals in motor cortex during reach to grasp movements

    PubMed Central

    Aggarwal, Vikram; Thakor, Nitish V.; Schieber, Marc H.

    2014-01-01

    A few kinematic synergies identified by principal component analysis (PCA) account for most of the variance in the coordinated joint rotations of the fingers and wrist used for a wide variety of hand movements. To examine the possibility that motor cortex might control the hand through such synergies, we collected simultaneous kinematic and neurophysiological data from monkeys performing a reach-to-grasp task. We used PCA, jPCA and isomap to extract kinematic synergies from 18 joint angles in the fingers and wrist and analyzed the relationships of both single-unit and multiunit spike recordings, as well as local field potentials (LFPs), to these synergies. For most spike recordings, the maximal absolute cross-correlations of firing rates were somewhat stronger with an individual joint angle than with any principal component (PC), any jPC or any isomap dimension. In decoding analyses, where spikes and LFP power in the 100- to 170-Hz band each provided better decoding than other LFP-based signals, the first PC was decoded as well as the best decoded joint angle. But the remaining PCs and jPCs were predicted with lower accuracy than individual joint angles. Although PCs, jPCs or isomap dimensions might provide a more parsimonious description of kinematics, our findings indicate that the kinematic synergies identified with these techniques are not represented in motor cortex more strongly than the original joint angles. We suggest that the motor cortex might act to sculpt the synergies generated by subcortical centers, superimposing an ability to individuate finger movements and adapt the hand to grasp a wide variety of objects. PMID:24990564

  4. Developing and Evaluating Creativity Gamification Rehabilitation System: The Application of PCA-ANFIS Based Emotions Model

    ERIC Educational Resources Information Center

    Su, Chung-Ho; Cheng, Ching-Hsue

    2016-01-01

    This study aims to explore the factors in a patient's rehabilitation achievement after a total knee replacement (TKR) patient exercises, using a PCA-ANFIS emotion model-based game rehabilitation system, which combines virtual reality (VR) and motion capture technology. The researchers combine a principal component analysis (PCA) and an adaptive…

  5. Comparison of common components analysis with principal components analysis and independent components analysis: Application to SPME-GC-MS volatolomic signatures.

    PubMed

    Bouhlel, Jihéne; Jouan-Rimbaud Bouveresse, Delphine; Abouelkaram, Said; Baéza, Elisabeth; Jondreville, Catherine; Travel, Angélique; Ratel, Jérémy; Engel, Erwan; Rutledge, Douglas N

    2018-02-01

    The aim of this work is to compare a novel exploratory chemometrics method, Common Components Analysis (CCA), with Principal Components Analysis (PCA) and Independent Components Analysis (ICA). CCA consists in adapting the multi-block statistical method known as Common Components and Specific Weights Analysis (CCSWA or ComDim) by applying it to a single data matrix, with one variable per block. As an application, the three methods were applied to SPME-GC-MS volatolomic signatures of livers in an attempt to reveal volatile organic compounds (VOCs) markers of chicken exposure to different types of micropollutants. An application of CCA to the initial SPME-GC-MS data revealed a drift in the sample Scores along CC2, as a function of injection order, probably resulting from time-related evolution in the instrument. This drift was eliminated by orthogonalization of the data set with respect to CC2, and the resulting data are used as the orthogonalized data input into each of the three methods. Since the first step in CCA is to norm-scale all the variables, preliminary data scaling has no effect on the results, so that CCA was applied only to orthogonalized SPME-GC-MS data, while, PCA and ICA were applied to the "orthogonalized", "orthogonalized and Pareto-scaled", and "orthogonalized and autoscaled" data. The comparison showed that PCA results were highly dependent on the scaling of variables, contrary to ICA where the data scaling did not have a strong influence. Nevertheless, for both PCA and ICA the clearest separations of exposed groups were obtained after autoscaling of variables. The main part of this work was to compare the CCA results using the orthogonalized data with those obtained with PCA and ICA applied to orthogonalized and autoscaled variables. The clearest separations of exposed chicken groups were obtained by CCA. CCA Loadings also clearly identified the variables contributing most to the Common Components giving separations. The PCA Loadings did not highlight the most influencing variables for each separation, whereas the ICA Loadings highlighted the same variables as did CCA. This study shows the potential of CCA for the extraction of pertinent information from a data matrix, using a procedure based on an original optimisation criterion, to produce results that are complementary, and in some cases may be superior, to those of PCA and ICA. Copyright © 2017 Elsevier B.V. All rights reserved.

  6. Resolution of coi-dominant phytoplankton species in a eutrophiclake using synchrotron-based Fourier transform infraredspectroscopy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Dean, A.P.; Martin, Michael C.; Sigee, D.C.

    2006-10-09

    Synchrotron-based Fourier-transform infrared (FTIR)microspectroscopy was used to distinguish micropopulations of thecodominant algae Microcystis aeruginosa (Cyanophyceae) and Ceratiumhirundinella (Dinophyceae) in mixed phytoplankton samples taken from thewater column of a stratified eutrophic lake (Rostherne Mere, UK). FTIRspectra of the two algae showed a closely similar sequence of 10 bandsover the wave-number range 4000-900 cm-1. These were assigned to a rangeof vibrationally active chemical groups using published band assignmentsand on the basis of correlation and factor analysis. In both algae,intracellular concentrations of macromolecular components (determined asband intensity) varied considerably within the same population,indicating substantial intraspecific heterogeneity. Interspecificdifferences were separately analysed in relation tomore » discrete bands and bymultivariate analysis of the entire spectral region 1750-900 cm-1. Interms of discrete bands, comparison of individual intensities (normalisedto amide 1) demonstrated significant (99 percent probability level)differences in relation to six bands between the two algal species. Keyinterspecific differences were also noted in relation to the positions ofbands 2, 10 (carbohydrate) and 7 (protein) and in the 3-D plots derivedby principal component analysis (PCA) of the sequence of bandintensities. PCA of entire spectral regions showed clear resolutionofspecies in the PCA plot, with indication of separation on the basis ofprotein (region 1700-1500 cm1) and carbohydrate (region 1150-900 cm1)composition in the loading plot. Hierarchical cluster analysis (Wardalgorithm) of entire spectral regions also showed clear discrimination ofthe two species within the resulting dendrogram.« less

  7. Quantitative Ultrasound Using Texture Analysis of Myofascial Pain Syndrome in the Trapezius.

    PubMed

    Kumbhare, Dinesh A; Ahmed, Sara; Behr, Michael G; Noseworthy, Michael D

    2018-01-01

    Objective-The objective of this study is to assess the discriminative ability of textural analyses to assist in the differentiation of the myofascial trigger point (MTrP) region from normal regions of skeletal muscle. Also, to measure the ability to reliably differentiate between three clinically relevant groups: healthy asymptomatic, latent MTrPs, and active MTrP. Methods-18 and 19 patients were identified with having active and latent MTrPs in the trapezius muscle, respectively. We included 24 healthy volunteers. Images were obtained by research personnel, who were blinded with respect to the clinical status of the study participant. Histograms provided first-order parameters associated with image grayscale. Haralick, Galloway, and histogram-related features were used in texture analysis. Blob analysis was conducted on the regions of interest (ROIs). Principal component analysis (PCA) was performed followed by multivariate analysis of variance (MANOVA) to determine the statistical significance of the features. Results-92 texture features were analyzed for factorability using Bartlett's test of sphericity, which was significant. The Kaiser-Meyer-Olkin measure of sampling adequacy was 0.94. PCA demonstrated rotated eigenvalues of the first eight components (each comprised of multiple texture features) explained 94.92% of the cumulative variance in the ultrasound image characteristics. The 24 features identified by PCA were included in the MANOVA as dependent variables, and the presence of a latent or active MTrP or healthy muscle were independent variables. Conclusion-Texture analysis techniques can discriminate between the three clinically relevant groups.

  8. Prostate Health Index (Phi) and Prostate Cancer Antigen 3 (PCA3) Significantly Improve Prostate Cancer Detection at Initial Biopsy in a Total PSA Range of 2–10 ng/ml

    PubMed Central

    Perdonà, Sisto; Marino, Ada; Mazzarella, Claudia; Perruolo, Giuseppe; D’Esposito, Vittoria; Cosimato, Vincenzo; Buonerba, Carlo; Di Lorenzo, Giuseppe; Musi, Gennaro; De Cobelli, Ottavio; Chun, Felix K.; Terracciano, Daniela

    2013-01-01

    Many efforts to reduce prostate specific antigen (PSA) overdiagnosis and overtreatment have been made. To this aim, Prostate Health Index (Phi) and Prostate Cancer Antigen 3 (PCA3) have been proposed as new more specific biomarkers. We evaluated the ability of phi and PCA3 to identify prostate cancer (PCa) at initial prostate biopsy in men with total PSA range of 2–10 ng/ml. The performance of phi and PCA3 were evaluated in 300 patients undergoing first prostate biopsy. ROC curve analyses tested the accuracy (AUC) of phi and PCA3 in predicting PCa. Decision curve analyses (DCA) were used to compare the clinical benefit of the two biomarkers. We found that the AUC value of phi (0.77) was comparable to those of %p2PSA (0.76) and PCA3 (0.73) with no significant differences in pairwise comparison (%p2PSA vs phi p = 0.673, %p2PSA vs. PCA3 p = 0.417 and phi vs. PCA3 p = 0.247). These three biomarkers significantly outperformed fPSA (AUC = 0.60), % fPSA (AUC = 0.62) and p2PSA (AUC = 0.63). At DCA, phi and PCA3 exhibited a very close net benefit profile until the threshold probability of 25%, then phi index showed higher net benefit than PCA3. Multivariable analysis showed that the addition of phi and PCA3 to the base multivariable model (age, PSA, %fPSA, DRE, prostate volume) increased predictive accuracy, whereas no model improved single biomarker performance. Finally we showed that subjects with active surveillance (AS) compatible cancer had significantly lower phi and PCA3 values (p<0.001 and p = 0.01, respectively). In conclusion, both phi and PCA3 comparably increase the accuracy in predicting the presence of PCa in total PSA range 2–10 ng/ml at initial biopsy, outperforming currently used %fPSA. PMID:23861782

  9. Prostate Health Index (Phi) and Prostate Cancer Antigen 3 (PCA3) significantly improve prostate cancer detection at initial biopsy in a total PSA range of 2-10 ng/ml.

    PubMed

    Ferro, Matteo; Bruzzese, Dario; Perdonà, Sisto; Marino, Ada; Mazzarella, Claudia; Perruolo, Giuseppe; D'Esposito, Vittoria; Cosimato, Vincenzo; Buonerba, Carlo; Di Lorenzo, Giuseppe; Musi, Gennaro; De Cobelli, Ottavio; Chun, Felix K; Terracciano, Daniela

    2013-01-01

    Many efforts to reduce prostate specific antigen (PSA) overdiagnosis and overtreatment have been made. To this aim, Prostate Health Index (Phi) and Prostate Cancer Antigen 3 (PCA3) have been proposed as new more specific biomarkers. We evaluated the ability of phi and PCA3 to identify prostate cancer (PCa) at initial prostate biopsy in men with total PSA range of 2-10 ng/ml. The performance of phi and PCA3 were evaluated in 300 patients undergoing first prostate biopsy. ROC curve analyses tested the accuracy (AUC) of phi and PCA3 in predicting PCa. Decision curve analyses (DCA) were used to compare the clinical benefit of the two biomarkers. We found that the AUC value of phi (0.77) was comparable to those of %p2PSA (0.76) and PCA3 (0.73) with no significant differences in pairwise comparison (%p2PSA vs phi p = 0.673, %p2PSA vs. PCA3 p = 0.417 and phi vs. PCA3 p = 0.247). These three biomarkers significantly outperformed fPSA (AUC = 0.60), % fPSA (AUC = 0.62) and p2PSA (AUC = 0.63). At DCA, phi and PCA3 exhibited a very close net benefit profile until the threshold probability of 25%, then phi index showed higher net benefit than PCA3. Multivariable analysis showed that the addition of phi and PCA3 to the base multivariable model (age, PSA, %fPSA, DRE, prostate volume) increased predictive accuracy, whereas no model improved single biomarker performance. Finally we showed that subjects with active surveillance (AS) compatible cancer had significantly lower phi and PCA3 values (p<0.001 and p = 0.01, respectively). In conclusion, both phi and PCA3 comparably increase the accuracy in predicting the presence of PCa in total PSA range 2-10 ng/ml at initial biopsy, outperforming currently used %fPSA.

  10. Visible micro-Raman spectroscopy of single human mammary epithelial cells exposed to x-ray radiation.

    PubMed

    Delfino, Ines; Perna, Giuseppe; Lasalvia, Maria; Capozzi, Vito; Manti, Lorenzo; Camerlingo, Carlo; Lepore, Maria

    2015-03-01

    A micro-Raman spectroscopy investigation has been performed in vitro on single human mammary epithelial cells after irradiation by graded x-ray doses. The analysis by principal component analysis (PCA) and interval-PCA (i-PCA) methods has allowed us to point out the small differences in the Raman spectra induced by irradiation. This experimental approach has enabled us to delineate radiation-induced changes in protein, nucleic acid, lipid, and carbohydrate content. In particular, the dose dependence of PCA and i-PCA components has been analyzed. Our results have confirmed that micro-Raman spectroscopy coupled to properly chosen data analysis methods is a very sensitive technique to detect early molecular changes at the single-cell level following exposure to ionizing radiation. This would help in developing innovative approaches to monitor radiation cancer radiotherapy outcome so as to reduce the overall radiation dose and minimize damage to the surrounding healthy cells, both aspects being of great importance in the field of radiation therapy.

  11. EMPCA and Cluster Analysis of Quasar Spectra: Construction and Application to Simulated Spectra

    NASA Astrophysics Data System (ADS)

    Marrs, Adam; Leighly, Karen; Wagner, Cassidy; Macinnis, Francis

    2017-01-01

    Quasars have complex spectra with emission lines influenced by many factors. Therefore, to fully describe the spectrum requires specification of a large number of parameters, such as line equivalent width, blueshift, and ratios. Principal Component Analysis (PCA) aims to construct eigenvectors-or principal components-from the data with the goal of finding a few key parameters that can be used to predict the rest of the spectrum fairly well. Analysis of simulated quasar spectra was used to verify and justify our modified application of PCA.We used a variant of PCA called Weighted Expectation Maximization PCA (EMPCA; Bailey 2012) along with k-means cluster analysis to analyze simulated quasar spectra. Our approach combines both analytical methods to address two known problems with classical PCA. EMPCA uses weights to account for uncertainty and missing points in the spectra. K-means groups similar spectra together to address the nonlinearity of quasar spectra, specifically variance in blueshifts and widths of the emission lines.In producing and analyzing simulations, we first tested the effects of varying equivalent widths and blueshifts on the derived principal components, and explored the differences between standard PCA and EMPCA. We also tested the effects of varying signal-to-noise ratio. Next we used the results of fits to composite quasar spectra (see accompanying poster by Wagner et al.) to construct a set of realistic simulated spectra, and subjected those spectra to the EMPCA /k-means analysis. We concluded that our approach was validated when we found that the mean spectra from our k-means clusters derived from PCA projection coefficients reproduced the trends observed in the composite spectra.Furthermore, our method needed only two eigenvectors to identify both sets of correlations used to construct the simulations, as well as indicating the linear and nonlinear segments. Comparing this to regular PCA, which can require a dozen or more components, or to direct spectral analysis that may need measurement of 20 fit parameters, shows why the dual application of these two techniques is such a powerful tool.

  12. Investigation of probabilistic principal component analysis compared to proper orthogonal decomposition methods for basis extraction and missing data estimation

    NASA Astrophysics Data System (ADS)

    Lee, Kyunghoon

    To evaluate the maximum likelihood estimates (MLEs) of probabilistic principal component analysis (PPCA) parameters such as a factor-loading, PPCA can invoke an expectation-maximization (EM) algorithm, yielding an EM algorithm for PPCA (EM-PCA). In order to examine the benefits of the EM-PCA for aerospace engineering applications, this thesis attempts to qualitatively and quantitatively scrutinize the EM-PCA alongside both POD and gappy POD using high-dimensional simulation data. In pursuing qualitative investigations, the theoretical relationship between POD and PPCA is transparent such that the factor-loading MLE of PPCA, evaluated by the EM-PCA, pertains to an orthogonal basis obtained by POD. By contrast, the analytical connection between gappy POD and the EM-PCA is nebulous because they distinctively approximate missing data due to their antithetical formulation perspectives: gappy POD solves a least-squares problem whereas the EM-PCA relies on the expectation of the observation probability model. To juxtapose both gappy POD and the EM-PCA, this research proposes a unifying least-squares perspective that embraces the two disparate algorithms within a generalized least-squares framework. As a result, the unifying perspective reveals that both methods address similar least-squares problems; however, their formulations contain dissimilar bases and norms. Furthermore, this research delves into the ramifications of the different bases and norms that will eventually characterize the traits of both methods. To this end, two hybrid algorithms of gappy POD and the EM-PCA are devised and compared to the original algorithms for a qualitative illustration of the different basis and norm effects. After all, a norm reflecting a curve-fitting method is found to more significantly affect estimation error reduction than a basis for two example test data sets: one is absent of data only at a single snapshot and the other misses data across all the snapshots. From a numerical performance aspect, the EM-PCA is computationally less efficient than POD for intact data since it suffers from slow convergence inherited from the EM algorithm. For incomplete data, this thesis quantitatively found that the number of data missing snapshots predetermines whether the EM-PCA or gappy POD outperforms the other because of the computational cost of a coefficient evaluation, resulting from a norm selection. For instance, gappy POD demands laborious computational effort in proportion to the number of data-missing snapshots as a consequence of the gappy norm. In contrast, the computational cost of the EM-PCA is invariant to the number of data-missing snapshots thanks to the L2 norm. In general, the higher the number of data-missing snapshots, the wider the gap between the computational cost of gappy POD and the EM-PCA. Based on the numerical experiments reported in this thesis, the following criterion is recommended regarding the selection between gappy POD and the EM-PCA for computational efficiency: gappy POD for an incomplete data set containing a few data-missing snapshots and the EM-PCA for an incomplete data set involving multiple data-missing snapshots. Last, the EM-PCA is applied to two aerospace applications in comparison to gappy POD as a proof of concept: one with an emphasis on basis extraction and the other with a focus on missing data reconstruction for a given incomplete data set with scattered missing data. The first application exploits the EM-PCA to efficiently construct reduced-order models of engine deck responses obtained by the numerical propulsion system simulation (NPSS), some of whose results are absent due to failed analyses caused by numerical instability. Model-prediction tests validate that engine performance metrics estimated by the reduced-order NPSS model exhibit considerably good agreement with those directly obtained by NPSS. Similarly, the second application illustrates that the EM-PCA is significantly more cost effective than gappy POD at repairing spurious PIV measurements obtained from acoustically-excited, bluff-body jet flow experiments. The EM-PCA reduces computational cost on factors 8 ˜ 19 compared to gappy POD while generating the same restoration results as those evaluated by gappy POD. All in all, through comprehensive theoretical and numerical investigation, this research establishes that the EM-PCA is an efficient alternative to gappy POD for an incomplete data set containing missing data over an entire data set. (Abstract shortened by UMI.)

  13. Combining ANOVA-PCA with POCHEMON to analyse micro-organism development in a polymicrobial environment.

    PubMed

    Geurts, Brigitte P; Neerincx, Anne H; Bertrand, Samuel; Leemans, Manja A A P; Postma, Geert J; Wolfender, Jean-Luc; Cristescu, Simona M; Buydens, Lutgarde M C; Jansen, Jeroen J

    2017-04-22

    Revealing the biochemistry associated to micro-organismal interspecies interactions is highly relevant for many purposes. Each pathogen has a characteristic metabolic fingerprint that allows identification based on their unique multivariate biochemistry. When pathogen species come into mutual contact, their co-culture will display a chemistry that may be attributed both to mixing of the characteristic chemistries of the mono-cultures and to competition between the pathogens. Therefore, investigating pathogen development in a polymicrobial environment requires dedicated chemometric methods to untangle and focus upon these sources of variation. The multivariate data analysis method Projected Orthogonalised Chemical Encounter Monitoring (POCHEMON) is dedicated to highlight metabolites characteristic for the interaction of two micro-organisms in co-culture. However, this approach is currently limited to a single time-point, while development of polymicrobial interactions may be highly dynamic. A well-known multivariate implementation of Analysis of Variance (ANOVA) uses Principal Component Analysis (ANOVA-PCA). This allows the overall dynamics to be separated from the pathogen-specific chemistry to analyse the contributions of both aspects separately. For this reason, we propose to integrate ANOVA-PCA with the POCHEMON approach to disentangle the pathogen dynamics and the specific biochemistry in interspecies interactions. Two complementary case studies show great potential for both liquid and gas chromatography - mass spectrometry to reveal novel information on chemistry specific to interspecies interaction during pathogen development. Copyright © 2017 The Author(s). Published by Elsevier B.V. All rights reserved.

  14. Comparison of multi-subject ICA methods for analysis of fMRI data

    PubMed Central

    Erhardt, Erik Barry; Rachakonda, Srinivas; Bedrick, Edward; Allen, Elena; Adali, Tülay; Calhoun, Vince D.

    2010-01-01

    Spatial independent component analysis (ICA) applied to functional magnetic resonance imaging (fMRI) data identifies functionally connected networks by estimating spatially independent patterns from their linearly mixed fMRI signals. Several multi-subject ICA approaches estimating subject-specific time courses (TCs) and spatial maps (SMs) have been developed, however there has not yet been a full comparison of the implications of their use. Here, we provide extensive comparisons of four multi-subject ICA approaches in combination with data reduction methods for simulated and fMRI task data. For multi-subject ICA, the data first undergo reduction at the subject and group levels using principal component analysis (PCA). Comparisons of subject-specific, spatial concatenation, and group data mean subject-level reduction strategies using PCA and probabilistic PCA (PPCA) show that computationally intensive PPCA is equivalent to PCA, and that subject-specific and group data mean subject-level PCA are preferred because of well-estimated TCs and SMs. Second, aggregate independent components are estimated using either noise free ICA or probabilistic ICA (PICA). Third, subject-specific SMs and TCs are estimated using back-reconstruction. We compare several direct group ICA (GICA) back-reconstruction approaches (GICA1-GICA3) and an indirect back-reconstruction approach, spatio-temporal regression (STR, or dual regression). Results show the earlier group ICA (GICA1) approximates STR, however STR has contradictory assumptions and may show mixed-component artifacts in estimated SMs. Our evidence-based recommendation is to use GICA3, introduced here, with subject-specific PCA and noise-free ICA, providing the most robust and accurate estimated SMs and TCs in addition to offering an intuitive interpretation. PMID:21162045

  15. Principal component analysis of the CT density histogram to generate parametric response maps of COPD

    NASA Astrophysics Data System (ADS)

    Zha, N.; Capaldi, D. P. I.; Pike, D.; McCormack, D. G.; Cunningham, I. A.; Parraga, G.

    2015-03-01

    Pulmonary x-ray computed tomography (CT) may be used to characterize emphysema and airways disease in patients with chronic obstructive pulmonary disease (COPD). One analysis approach - parametric response mapping (PMR) utilizes registered inspiratory and expiratory CT image volumes and CT-density-histogram thresholds, but there is no consensus regarding the threshold values used, or their clinical meaning. Principal-component-analysis (PCA) of the CT density histogram can be exploited to quantify emphysema using data-driven CT-density-histogram thresholds. Thus, the objective of this proof-of-concept demonstration was to develop a PRM approach using PCA-derived thresholds in COPD patients and ex-smokers without airflow limitation. Methods: Fifteen COPD ex-smokers and 5 normal ex-smokers were evaluated. Thoracic CT images were also acquired at full inspiration and full expiration and these images were non-rigidly co-registered. PCA was performed for the CT density histograms, from which the components with the highest eigenvalues greater than one were summed. Since the values of the principal component curve correlate directly with the variability in the sample, the maximum and minimum points on the curve were used as threshold values for the PCA-adjusted PRM technique. Results: A significant correlation was determined between conventional and PCA-adjusted PRM with 3He MRI apparent diffusion coefficient (p<0.001), with CT RA950 (p<0.0001), as well as with 3He MRI ventilation defect percent, a measurement of both small airways disease (p=0.049 and p=0.06, respectively) and emphysema (p=0.02). Conclusions: PRM generated using PCA thresholds of the CT density histogram showed significant correlations with CT and 3He MRI measurements of emphysema, but not airways disease.

  16. Application of principal component analysis (PCA) as a sensory assessment tool for fermented food products.

    PubMed

    Ghosh, Debasree; Chattopadhyay, Parimal

    2012-06-01

    The objective of the work was to use the method of quantitative descriptive analysis (QDA) to describe the sensory attributes of the fermented food products prepared with the incorporation of lactic cultures. Panellists were selected and trained to evaluate various attributes specially color and appearance, body texture, flavor, overall acceptability and acidity of the fermented food products like cow milk curd and soymilk curd, idli, sauerkraut and probiotic ice cream. Principal component analysis (PCA) identified the six significant principal components that accounted for more than 90% of the variance in the sensory attribute data. Overall product quality was modelled as a function of principal components using multiple least squares regression (R (2) = 0.8). The result from PCA was statistically analyzed by analysis of variance (ANOVA). These findings demonstrate the utility of quantitative descriptive analysis for identifying and measuring the fermented food product attributes that are important for consumer acceptability.

  17. The fractal characteristic of facial anthropometric data for developing PCA fit test panels for youth born in central China.

    PubMed

    Yang, Lei; Wei, Ran; Shen, Henggen

    2017-01-01

    New principal component analysis (PCA) respirator fit test panels had been developed for current American and Chinese civilian workers based on anthropometric surveys. The PCA panels used the first two principal components (PCs) obtained from a set of 10 facial dimensions. Although the PCA panels for American and Chinese subjects adopted the bivairate framework with two PCs, the number of the PCs retained in the PCA analysis was different between Chinese subjects and Americans. For the Chinese youth group, the third PC should be retained in the PCA analysis for developing new fit test panels. In this article, an additional number label (ANL) is used to explain the third PC in PCA analysis when the first two PCs are used to construct the PCA half-facepiece respirator fit test panel for Chinese group. The three-dimensional box-counting method is proposed to estimate the ANLs by calculating fractal dimensions of the facial anthropometric data of the Chinese youth. The linear regression coefficients of scale-free range R 2 are all over 0.960, which demonstrates that the facial anthropometric data of the Chinese youth has fractal characteristic. The youth subjects born in Henan province has an ANL of 2.002, which is lower than the composite facial anthropometric data of Chinese subjects born in many provinces. Hence, Henan youth subjects have the self-similar facial anthropometric characteristic and should use the particular ANL (2.002) as the important tool along with using the PCA panel. The ANL method proposed in this article not only provides a new methodology in quantifying the characteristics of facial anthropometric dimensions for any ethnic/racial group, but also extends the scope of PCA panel studies to higher dimensions.

  18. A new statistic for identifying batch effects in high-throughput genomic data that uses guided principal component analysis.

    PubMed

    Reese, Sarah E; Archer, Kellie J; Therneau, Terry M; Atkinson, Elizabeth J; Vachon, Celine M; de Andrade, Mariza; Kocher, Jean-Pierre A; Eckel-Passow, Jeanette E

    2013-11-15

    Batch effects are due to probe-specific systematic variation between groups of samples (batches) resulting from experimental features that are not of biological interest. Principal component analysis (PCA) is commonly used as a visual tool to determine whether batch effects exist after applying a global normalization method. However, PCA yields linear combinations of the variables that contribute maximum variance and thus will not necessarily detect batch effects if they are not the largest source of variability in the data. We present an extension of PCA to quantify the existence of batch effects, called guided PCA (gPCA). We describe a test statistic that uses gPCA to test whether a batch effect exists. We apply our proposed test statistic derived using gPCA to simulated data and to two copy number variation case studies: the first study consisted of 614 samples from a breast cancer family study using Illumina Human 660 bead-chip arrays, whereas the second case study consisted of 703 samples from a family blood pressure study that used Affymetrix SNP Array 6.0. We demonstrate that our statistic has good statistical properties and is able to identify significant batch effects in two copy number variation case studies. We developed a new statistic that uses gPCA to identify whether batch effects exist in high-throughput genomic data. Although our examples pertain to copy number data, gPCA is general and can be used on other data types as well. The gPCA R package (Available via CRAN) provides functionality and data to perform the methods in this article. reesese@vcu.edu

  19. Strain Transient Detection Techniques: A Comparison of Source Parameter Inversions of Signals Isolated through Principal Component Analysis (PCA), Non-Linear PCA, and Rotated PCA

    NASA Astrophysics Data System (ADS)

    Lipovsky, B.; Funning, G. J.

    2009-12-01

    We compare several techniques for the analysis of geodetic time series with the ultimate aim to characterize the physical processes which are represented therein. We compare three methods for the analysis of these data: Principal Component Analysis (PCA), Non-Linear PCA (NLPCA), and Rotated PCA (RPCA). We evaluate each method by its ability to isolate signals which may be any combination of low amplitude (near noise level), temporally transient, unaccompanied by seismic emissions, and small scale with respect to the spatial domain. PCA is a powerful tool for extracting structure from large datasets which is traditionally realized through either the solution of an eigenvalue problem or through iterative methods. PCA is an transformation of the coordinate system of our data such that the new "principal" data axes retain maximal variance and minimal reconstruction error (Pearson, 1901; Hotelling, 1933). RPCA is achieved by an orthogonal transformation of the principal axes determined in PCA. In the analysis of meteorological data sets, RPCA has been seen to overcome domain shape dependencies, correct for sampling errors, and to determine principal axes which more closely represent physical processes (e.g., Richman, 1986). NLPCA generalizes PCA such that principal axes are replaced by principal curves (e.g., Hsieh 2004). We achieve NLPCA through an auto-associative feed-forward neural network (Scholz, 2005). We show the geophysical relevance of these techniques by application of each to a synthetic data set. Results are compared by inverting principal axes to determine deformation source parameters. Temporal variability in source parameters, estimated by each method, are also compared.

  20. Discriminating the Mineralogical Composition in Drill Cuttings Based on Absorption Spectra in the Terahertz Range.

    PubMed

    Miao, Xinyang; Li, Hao; Bao, Rima; Feng, Chengjing; Wu, Hang; Zhan, Honglei; Li, Yizhang; Zhao, Kun

    2017-02-01

    Understanding the geological units of a reservoir is essential to the development and management of the resource. In this paper, drill cuttings from several depths from an oilfield were studied using terahertz time domain spectroscopy (THz-TDS). Cluster analysis (CA) and principal component analysis (PCA) were employed to classify and analyze the cuttings. The cuttings were clearly classified based on CA and PCA methods, and the results were in agreement with the lithology. Moreover, calcite and dolomite have stronger absorption of a THz pulse than any other minerals, based on an analysis of the PC1 scores. Quantitative analyses of minor minerals were also realized by building a series of linear and non-linear models between contents and PC2 scores. The results prove THz technology to be a promising means for determining reservoir lithology as well as other properties, which will be a significant supplementary method in oil fields.

  1. Evaluation of different approaches for identifying optimal sites to predict mean hillslope soil moisture content

    NASA Astrophysics Data System (ADS)

    Liao, Kaihua; Zhou, Zhiwen; Lai, Xiaoming; Zhu, Qing; Feng, Huihui

    2017-04-01

    The identification of representative soil moisture sampling sites is important for the validation of remotely sensed mean soil moisture in a certain area and ground-based soil moisture measurements in catchment or hillslope hydrological studies. Numerous approaches have been developed to identify optimal sites for predicting mean soil moisture. Each method has certain advantages and disadvantages, but they have rarely been evaluated and compared. In our study, surface (0-20 cm) soil moisture data from January 2013 to March 2016 (a total of 43 sampling days) were collected at 77 sampling sites on a mixed land-use (tea and bamboo) hillslope in the hilly area of Taihu Lake Basin, China. A total of 10 methods (temporal stability (TS) analyses based on 2 indices, K-means clustering based on 6 kinds of inputs and 2 random sampling strategies) were evaluated for determining optimal sampling sites for mean soil moisture estimation. They were TS analyses based on the smallest index of temporal stability (ITS, a combination of the mean relative difference and standard deviation of relative difference (SDRD)) and based on the smallest SDRD, K-means clustering based on soil properties and terrain indices (EFs), repeated soil moisture measurements (Theta), EFs plus one-time soil moisture data (EFsTheta), and the principal components derived from EFs (EFs-PCA), Theta (Theta-PCA), and EFsTheta (EFsTheta-PCA), and global and stratified random sampling strategies. Results showed that the TS based on the smallest ITS was better (RMSE = 0.023 m3 m-3) than that based on the smallest SDRD (RMSE = 0.034 m3 m-3). The K-means clustering based on EFsTheta (-PCA) was better (RMSE <0.020 m3 m-3) than these based on EFs (-PCA) and Theta (-PCA). The sampling design stratified by the land use was more efficient than the global random method. Forty and 60 sampling sites are needed for stratified sampling and global sampling respectively to make their performances comparable to the best K-means method (EFsTheta-PCA). Overall, TS required only one site, but its accuracy was limited. The best K-means method required <8 sites and yielded high accuracy, but extra soil and terrain information is necessary when using this method. The stratified sampling strategy can only be used if no pre-knowledge about soil moisture variation is available. This information will help in selecting the optimal methods for estimation the area mean soil moisture.

  2. Principal Component Analysis of Thermographic Data

    NASA Technical Reports Server (NTRS)

    Winfree, William P.; Cramer, K. Elliott; Zalameda, Joseph N.; Howell, Patricia A.; Burke, Eric R.

    2015-01-01

    Principal Component Analysis (PCA) has been shown effective for reducing thermographic NDE data. While a reliable technique for enhancing the visibility of defects in thermal data, PCA can be computationally intense and time consuming when applied to the large data sets typical in thermography. Additionally, PCA can experience problems when very large defects are present (defects that dominate the field-of-view), since the calculation of the eigenvectors is now governed by the presence of the defect, not the "good" material. To increase the processing speed and to minimize the negative effects of large defects, an alternative method of PCA is being pursued where a fixed set of eigenvectors, generated from an analytic model of the thermal response of the material under examination, is used to process the thermal data from composite materials. This method has been applied for characterization of flaws.

  3. A feasibility study on age-related factors of wrist pulse using principal component analysis.

    PubMed

    Jang-Han Bae; Young Ju Jeon; Sanghun Lee; Jaeuk U Kim

    2016-08-01

    Various analysis methods for examining wrist pulse characteristics are needed for accurate pulse diagnosis. In this feasibility study, principal component analysis (PCA) was performed to observe age-related factors of wrist pulse from various analysis parameters. Forty subjects in the age group of 20s and 40s were participated, and their wrist pulse signal and respiration signal were acquired with the pulse tonometric device. After pre-processing of the signals, twenty analysis parameters which have been regarded as values reflecting pulse characteristics were calculated and PCA was performed. As a results, we could reduce complex parameters to lower dimension and age-related factors of wrist pulse were observed by combining-new analysis parameter derived from PCA. These results demonstrate that PCA can be useful tool for analyzing wrist pulse signal.

  4. Application of principal component analysis for improvement of X-ray fluorescence images obtained by polycapillary-based micro-XRF technique

    NASA Astrophysics Data System (ADS)

    Aida, S.; Matsuno, T.; Hasegawa, T.; Tsuji, K.

    2017-07-01

    Micro X-ray fluorescence (micro-XRF) analysis is repeated as a means of producing elemental maps. In some cases, however, the XRF images of trace elements that are obtained are not clear due to high background intensity. To solve this problem, we applied principal component analysis (PCA) to XRF spectra. We focused on improving the quality of XRF images by applying PCA. XRF images of the dried residue of standard solution on the glass substrate were taken. The XRF intensities for the dried residue were analyzed before and after PCA. Standard deviations of XRF intensities in the PCA-filtered images were improved, leading to clear contrast of the images. This improvement of the XRF images was effective in cases where the XRF intensity was weak.

  5. Near infrared diffuse reflection and laser-induced fluorescence spectroscopy for myocardial tissue characterisation

    NASA Astrophysics Data System (ADS)

    Nilsson, A. M. K.; Heinrich, D.; Olajos, J.; Andersson-Engels, S.

    1997-10-01

    In order to evaluate the potential of cardiovascular tissue characterisation using near-infrared (NIR) spectroscopy, spectra in a previously unexplored wavelength region 0.8-2.3 μm were recorded from various pig heart tissue samples in vitro: normal myocardium (with and without endo/epicardium), aorta, fatty and fibrous heart tissue. The spectra were analysed with principal component analysis (PCA), revealing several spectroscopically characteristic features enabling tissue classification. Several of the identified spectral features could be attributed to specific tissue constituents by comparing the tissue signals with spectra obtained from water, elastin, collagen and cholesterol as well as with published data. The results obtained with the NIR spectroscopy technique in terms of its potential to classify different tissue types were compared with those from laser-induced fluorescence (LIF) using 337 nm excitation. LIF and NIR spectroscopy can in combination with PCA be used to discriminate between all previously mentioned tissue groups, apart from fatty versus fibrous tissue (LIF) and aorta versus fibrous tissue (NIR), respectively. The NIR analysis was improved by focusing the PCA to the wavelength segment 2.0-2.3 μm, resulting in successful spectral characterisation of all cardiovascular tissue groups.

  6. Prebiotic Low Sugar Chocolate Dairy Desserts: Physical and Optical Characteristics and Performance of PARAFAC and PCA Preference Map.

    PubMed

    Morais, E C; Esmerino, E A; Monteiro, R A; Pinheiro, C M; Nunes, C A; Cruz, A G; Bolini, Helena M A

    2016-01-01

    The addition of prebiotic and sweeteners in chocolate dairy desserts opens up new opportunities to develop dairy desserts that besides having a lower calorie intake still has functional properties. In this study, prebiotic low sugar dairy desserts were evaluated by 120 consumers using a 9-point hedonic scale, in relation to the attributes of appearance, aroma, flavor, texture, and overall liking. Internal preference map using parallel factor analysis (PARAFAC) and principal component analysis (PCA) was performed using the consumer data. In addition, physical (texture profile) and optical (instrumental color) analyses were also performed. Prebiotic dairy desserts containing sucrose and sucralose were equally liked by the consumers. These samples were characterized by firmness and gumminess, which can be considered drivers of liking by the consumers. Optimization of the prebiotic low sugar dessert formulation should take in account the choice of ingredients that contribute in a positive manner for these parameters. PARAFAC allowed the extraction of more relevant information in relation to PCA, demonstrating that consumer acceptance analysis can be evaluated by simultaneously considering several attributes. Multiple factor analysis reported Rv value of 0.964, suggesting excellent concordance for both methods. © 2015 Institute of Food Technologists®

  7. Exploring high-affinity binding properties of octamer peptides by principal component analysis of tetramer peptides.

    PubMed

    Kume, Akiko; Kawai, Shun; Kato, Ryuji; Iwata, Shinmei; Shimizu, Kazunori; Honda, Hiroyuki

    2017-02-01

    To investigate the binding properties of a peptide sequence, we conducted principal component analysis (PCA) of the physicochemical features of a tetramer peptide library comprised of 512 peptides, and the variables were reduced to two principal components. We selected IL-2 and IgG as model proteins and the binding affinity to these proteins was assayed using the 512 peptides mentioned above. PCA of binding affinity data showed that 16 and 18 variables were suitable for localizing IL-2 and IgG high-affinity binding peptides, respectively, into a restricted region of the PCA plot. We then investigated whether the binding affinity of octamer peptide libraries could be predicted using the identified region in the tetramer PCA. The results show that octamer high-affinity binding peptides were also concentrated in the tetramer high-affinity binding region of both IL-2 and IgG. The average fluorescence intensity of high-affinity binding peptides was 3.3- and 2.1-fold higher than that of low-affinity binding peptides for IL-2 and IgG, respectively. We conclude that PCA may be used to identify octamer peptides with high- or low-affinity binding properties from data from a tetramer peptide library. Copyright © 2016 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.

  8. Sparse principal component analysis in medical shape modeling

    NASA Astrophysics Data System (ADS)

    Sjöstrand, Karl; Stegmann, Mikkel B.; Larsen, Rasmus

    2006-03-01

    Principal component analysis (PCA) is a widely used tool in medical image analysis for data reduction, model building, and data understanding and exploration. While PCA is a holistic approach where each new variable is a linear combination of all original variables, sparse PCA (SPCA) aims at producing easily interpreted models through sparse loadings, i.e. each new variable is a linear combination of a subset of the original variables. One of the aims of using SPCA is the possible separation of the results into isolated and easily identifiable effects. This article introduces SPCA for shape analysis in medicine. Results for three different data sets are given in relation to standard PCA and sparse PCA by simple thresholding of small loadings. Focus is on a recent algorithm for computing sparse principal components, but a review of other approaches is supplied as well. The SPCA algorithm has been implemented using Matlab and is available for download. The general behavior of the algorithm is investigated, and strengths and weaknesses are discussed. The original report on the SPCA algorithm argues that the ordering of modes is not an issue. We disagree on this point and propose several approaches to establish sensible orderings. A method that orders modes by decreasing variance and maximizes the sum of variances for all modes is presented and investigated in detail.

  9. Scalable Robust Principal Component Analysis Using Grassmann Averages.

    PubMed

    Hauberg, Sren; Feragen, Aasa; Enficiaud, Raffi; Black, Michael J

    2016-11-01

    In large datasets, manual data verification is impossible, and we must expect the number of outliers to increase with data size. While principal component analysis (PCA) can reduce data size, and scalable solutions exist, it is well-known that outliers can arbitrarily corrupt the results. Unfortunately, state-of-the-art approaches for robust PCA are not scalable. We note that in a zero-mean dataset, each observation spans a one-dimensional subspace, giving a point on the Grassmann manifold. We show that the average subspace corresponds to the leading principal component for Gaussian data. We provide a simple algorithm for computing this Grassmann Average ( GA), and show that the subspace estimate is less sensitive to outliers than PCA for general distributions. Because averages can be efficiently computed, we immediately gain scalability. We exploit robust averaging to formulate the Robust Grassmann Average (RGA) as a form of robust PCA. The resulting Trimmed Grassmann Average ( TGA) is appropriate for computer vision because it is robust to pixel outliers. The algorithm has linear computational complexity and minimal memory requirements. We demonstrate TGA for background modeling, video restoration, and shadow removal. We show scalability by performing robust PCA on the entire Star Wars IV movie; a task beyond any current method. Source code is available online.

  10. A new statistical PCA-ICA algorithm for location of R-peaks in ECG.

    PubMed

    Chawla, M P S; Verma, H K; Kumar, Vinod

    2008-09-16

    The success of ICA to separate the independent components from the mixture depends on the properties of the electrocardiogram (ECG) recordings. This paper discusses some of the conditions of independent component analysis (ICA) that could affect the reliability of the separation and evaluation of issues related to the properties of the signals and number of sources. Principal component analysis (PCA) scatter plots are plotted to indicate the diagnostic features in the presence and absence of base-line wander in interpreting the ECG signals. In this analysis, a newly developed statistical algorithm by authors, based on the use of combined PCA-ICA for two correlated channels of 12-channel ECG data is proposed. ICA technique has been successfully implemented in identifying and removal of noise and artifacts from ECG signals. Cleaned ECG signals are obtained using statistical measures like kurtosis and variance of variance after ICA processing. This analysis also paper deals with the detection of QRS complexes in electrocardiograms using combined PCA-ICA algorithm. The efficacy of the combined PCA-ICA algorithm lies in the fact that the location of the R-peaks is bounded from above and below by the location of the cross-over points, hence none of the peaks are ignored or missed.

  11. Jovian Chromophore Characteristics from Multispectral HST Images

    NASA Technical Reports Server (NTRS)

    Strycker, Paul D.; Chanover, Nancy J.; Simon-Miller, Amy A.; Banfield, Don; Gierasch, Peter J.

    2011-01-01

    The chromophores responsible for coloring the jovian atmosphere are embedded within Jupiter's vertical aerosol structure. Sunlight propagates through this vertical distribution of aerosol particles, whose colors are defined by omega-bar (sub 0)(lambda), and we remotely observe the culmination of the radiative transfer as I/F(lambda). In this study, we employed a radiative transfer code to retrieve omega-bar (sub 0)(lambda) for particles in Jupiter's tropospheric haze at seven wavelengths in the near-UV and visible regimes. The data consisted of images of the 2008 passage of Oval BA to the south of the Great Red Spot obtained by the Wide Field Planetary Camera 2 on-board the Hubble Space Telescope. We present derived particle colors for locations that were selected from 14 weather regions, which spanned a large range of observed colors. All omega-bar (sub 0)(lambda) curves were absorbing in the blue, and omega-bar (sub 0)(lambda) increased monotonically to approximately unity as wavelength increased. We found accurate fits to all omega-bar (sub 0)(lambda) curves using an empirically derived functional form: omega-bar (sub 0)(lambda) = 1 A exp(-B lambda). The best-fit parameters for the mean omega-bar (sub 0)(lambda) curve were A = 25.4 and B = 0.0149 for lambda in units of nm. We performed a principal component analysis (PCA) on our omega-bar (sub 0)(lambda) results and found that one or two independent chromophores were sufficient to produce the variations in omega-bar (sub 0)(lambda). A PCA of I/F(lambda) for the same jovian locations resulted in principal components (PCs) with roughly the same variances as the omega-bar (sub 0)(lambda) PCA, but they did not result in a one-to-one mapping of PC amplitudes between the omega-bar (sub 0)(lambda) PCA and I/F(lambda) PCA. We suggest that statistical analyses performed on I/ F(lambda) image cubes have limited applicability to the characterization of chromophores in the jovian atmosphere due to the sensitivity of 1/ F(lambda) to horizontal variations in the vertical aerosol distribution.

  12. Free-energy landscape of RNA hairpins constructed via dihedral angle principal component analysis.

    PubMed

    Riccardi, Laura; Nguyen, Phuong H; Stock, Gerhard

    2009-12-31

    To systematically construct a low-dimensional free-energy landscape of RNA systems from a classical molecular dynamics simulation, various versions of the principal component analysis (PCA) are compared: the cPCA using the Cartesian coordinates of all atoms, the dPCA using the sine/cosine-transformed six backbone dihedral angles as well as the glycosidic torsional angle chi and the pseudorotational angle P, the aPCA which ignores the circularity of the 6 + 2 dihedral angles of the RNA, and the dPCA(etatheta), which approximates the 6 backbone dihedral angles by 2 pseudotorsional angles eta and theta. As representative examples, a 10-nucleotide UUCG hairpin and the 36-nucleotide segment SL1 of the Psi site of HIV-1 are studied by classical molecular dynamics simulation, using the Amber all-atom force field and explicit solvent. It is shown that the conformational heterogeneity of the RNA hairpins can only be resolved by an angular PCA such as the dPCA but not by the cPCA using Cartesian coordinates. Apart from possible artifacts due to the coupling of overall and internal motion, this is because the details of hydrogen bonding and stacking interactions but also of global structural rearrangements of the RNA are better discriminated by dihedral angles. In line with recent experiments, it is found that the free energy landscape of RNA hairpins is quite rugged and contains various metastable conformational states which may serve as an intermediate for unfolding.

  13. ERP Go/NoGo condition effects are better detected with separate PCAs.

    PubMed

    Barry, Robert J; De Blasio, Frances M; Fogarty, Jack S; Karamacoska, Diana

    2016-08-01

    We explored the separation of Go and NoGo effects in the ERP components elicited in an equiprobable Go/NoGo task, using different forms of temporal Principal Components Analysis (PCA). Following exploratory simulation studies assessing the PCA impact of latency jitter and between-condition latency differences in the P3 latency range, an empirical study compared results of a Combined PCA carried out using both Go and NoGo ERPs together as input, with those from two Separate PCAs carried out on the Go and NoGo ERPs separately. The simulation studies indicated that Separate PCAs provide adequate component recovery in the presence of P3 latency jitter, and that Combined PCAs provide good separation of components only when systematic condition-related latency differences are sufficiently large (here ~110ms). In the empirical data, broadly-similar components were obtained from the Combined and Separate PCAs, supporting previous findings from Combined PCA investigations, and the consequent interpretations of the sequential processing involved. However, the Separate PCAs generated latency differences for components in the Go and NoGo processing chains that better matched the late Go/NoGo ERP peaks, and produced better-defined and larger components that fitted the stages in a hypothetical processing schema developed for this paradigm. Overall, the Separate PCAs yielded a better partitioning of the ERP variance associated with the Go and NoGo conditions, and should be considered as the first choice in future investigations if systematic component or subcomponent latency differences are present or suspected. Copyright © 2016 Elsevier B.V. All rights reserved.

  14. An analytical approach based on ESI-MS, LC-MS and PCA for the quali-quantitative analysis of cycloartane derivatives in Astragalus spp.

    PubMed

    Napolitano, Assunta; Akay, Seref; Mari, Angela; Bedir, Erdal; Pizza, Cosimo; Piacente, Sonia

    2013-11-01

    Astragalus species are widely used as health foods and dietary supplements, as well as drugs in traditional medicine. To rapidly evaluate metabolite similarities and differences among the EtOH extracts of the roots of eight commercial Astragalus spp., an approach based on direct analyses by ESI-MS followed by PCA of ESI-MS data, was carried out. Successively, quali-quantitative analyses of cycloartane derivatives in the eight Astragalus spp. by LC-ESI-MS(n) and PCA of LC-ESI-MS data were performed. This approach allowed to promptly highlighting metabolite similarities and differences among the various Astragalus spp. PCA results from LC-ESI-MS data of Astragalus samples were in reasonable agreement with both PCA results of ESI-MS data and quantitative results. This study affords an analytical method for the quali-quantitative determination of cycloartane derivatives in herbal preparations used as health and food supplements. Copyright © 2013 Elsevier B.V. All rights reserved.

  15. Improving the prediction of pathologic outcomes in patients undergoing radical prostatectomy: the value of prostate cancer antigen 3 (PCA3), prostate health index (phi) and sarcosine.

    PubMed

    Ferro, Matteo; Lucarelli, Giuseppe; Bruzzese, Dario; Perdonà, Sisto; Mazzarella, Claudia; Perruolo, Giuseppe; Marino, Ada; Cosimato, Vincenzo; Giorgio, Emilia; Tagliamonte, Virginia; Bottero, Danilo; De Cobelli, Ottavio; Terracciano, Daniela

    2015-02-01

    Several efforts have been made to find biomarkers that could help clinicians to preoperatively determine prostate cancer (PCa) pathological characteristics and choose the best therapeutic approach, avoiding over-treatment. On this effort, prostate cancer antigen 3 (PCA3), prostate health index (phi) and sarcosine have been presented as promising tools. We evaluated the ability of these biomarkers to predict the pathologic PCa characteristics within a prospectively collected contemporary cohort of patients who underwent radical prostatectomy (RP) for clinically localized PCa at a single high-volume Institution. The prognostic performance of PCA3, phi and sarcosine were evaluated in 78 patients undergoing RP for biopsy-proven PCa. Receiver operating characteristic (ROC) curve analyses tested the accuracy (area under the curve (AUC)) in predicting PCa pathological characteristics. Decision curve analyses (DCA) were used to assess the clinical benefit of the three biomarkers. We found that PCA3, phi and sarcosine levels were significantly higher in patients with tumor volume (TV)≥0.5 ml, pathologic Gleason sum (GS)≥7 and pT3 disease (all p-values≤0.01). ROC curve analysis showed that phi is an accurate predictor of high-stage (AUC 0.85 [0.77-0.93]), high-grade (AUC 0.83 [0.73-0.93]) and high-volume disease (AUC 0.94 [0.88-0.99]). Sarcosine showed a comparable AUC (0.85 [0.76-0.94]) only for T3 stage prediction, whereas PCA3 score showed lower AUCs, ranging from 0.74 (for GS) to 0.86 (for TV). PCA3, phi and sarcosine are predictors of PCa characteristics at final pathology. Successful clinical translation of these findings would reduce the frequency of surveillance biopsies and may enhance acceptance of active surveillance (AS). Copyright© 2015 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.

  16. Guided filter and principal component analysis hybrid method for hyperspectral pansharpening

    NASA Astrophysics Data System (ADS)

    Qu, Jiahui; Li, Yunsong; Dong, Wenqian

    2018-01-01

    Hyperspectral (HS) pansharpening aims to generate a fused HS image with high spectral and spatial resolution through integrating an HS image with a panchromatic (PAN) image. A guided filter (GF) and principal component analysis (PCA) hybrid HS pansharpening method is proposed. First, the HS image is interpolated and the PCA transformation is performed on the interpolated HS image. The first principal component (PC1) channel concentrates on the spatial information of the HS image. Different from the traditional PCA method, the proposed method sharpens the PAN image and utilizes the GF to obtain the spatial information difference between the HS image and the enhanced PAN image. Then, in order to reduce spectral and spatial distortion, an appropriate tradeoff parameter is defined and the spatial information difference is injected into the PC1 channel through multiplying by this tradeoff parameter. Once the new PC1 channel is obtained, the fused image is finally generated by the inverse PCA transformation. Experiments performed on both synthetic and real datasets show that the proposed method outperforms other several state-of-the-art HS pansharpening methods in both subjective and objective evaluations.

  17. Age-related differences in early novelty processing: Using PCA to parse the overlapping anterior P2 and N2 components

    PubMed Central

    Daffner, Kirk R.; Alperin, Brittany R.; Mott, Katherine K.; Tusch, Erich; Holcomb, Phillip J.

    2015-01-01

    Previous work demonstrated age-associated increases in the anterior P2 and age-related decreases in the anterior N2 in response to novel stimuli. Principal component analysis (PCA) was used to determine if the inverse relationship between these components was due to their temporal and spatial overlap. PCA revealed an early anterior P2, sensitive to task relevance, and a late anterior P2, responsive to novelty, both exhibiting age-related amplitude increases. A PCA factor representing the anterior N2, sensitive to novelty, exhibited age-related amplitude decreases. The late P2 and N2 to novels inversely correlated. Larger late P2 amplitude to novels was associated with better behavioral performance. Age-related differences in the anterior P2 and N2 to novel stimuli likely represent age-associated changes in independent cognitive operations. Enhanced anterior P2 activity (indexing augmentation in motivational salience) may be a compensatory mechanism for diminished anterior N2 activity (indexing reduced ability of older adults to process ambiguous representations). PMID:25596483

  18. PHI and PCA3 improve the prognostic performance of PRIAS and Epstein criteria in predicting insignificant prostate cancer in men eligible for active surveillance.

    PubMed

    Cantiello, Francesco; Russo, Giorgio Ivan; Cicione, Antonio; Ferro, Matteo; Cimino, Sebastiano; Favilla, Vincenzo; Perdonà, Sisto; De Cobelli, Ottavio; Magno, Carlo; Morgia, Giuseppe; Damiano, Rocco

    2016-04-01

    To assess the performance of prostate health index (PHI) and prostate cancer antigen 3 (PCA3) when added to the PRIAS or Epstein criteria in predicting the presence of pathologically insignificant prostate cancer (IPCa) in patients who underwent radical prostatectomy (RP) but eligible for active surveillance (AS). An observational retrospective study was performed in 188 PCa patients treated with laparoscopic or robot-assisted RP but eligible for AS according to Epstein or PRIAS criteria. Blood and urinary specimens were collected before initial prostate biopsy for PHI and PCA3 measurements. Multivariate logistic regression analyses and decision curve analysis were carried out to identify predictors of IPCa using the updated ERSPC definition. At the multivariate analyses, the inclusion of both PCA3 and PHI significantly increased the accuracy of the Epstein multivariate model in predicting IPCa with an increase of 17 % (AUC = 0.77) and of 32 % (AUC = 0.92), respectively. The inclusion of both PCA3 and PHI also increased the predictive accuracy of the PRIAS multivariate model with an increase of 29 % (AUC = 0.87) and of 39 % (AUC = 0.97), respectively. DCA revealed that the multivariable models with the addition of PHI or PCA3 showed a greater net benefit and performed better than the reference models. In a direct comparison, PHI outperformed PCA3 performance resulting in higher net benefit. In a same cohort of patients eligible for AS, the addition of PHI and PCA3 to Epstein or PRIAS models improved their prognostic performance. PHI resulted in greater net benefit in predicting IPCa compared to PCA3.

  19. Kernel Principal Component Analysis for dimensionality reduction in fMRI-based diagnosis of ADHD.

    PubMed

    Sidhu, Gagan S; Asgarian, Nasimeh; Greiner, Russell; Brown, Matthew R G

    2012-01-01

    This study explored various feature extraction methods for use in automated diagnosis of Attention-Deficit Hyperactivity Disorder (ADHD) from functional Magnetic Resonance Image (fMRI) data. Each participant's data consisted of a resting state fMRI scan as well as phenotypic data (age, gender, handedness, IQ, and site of scanning) from the ADHD-200 dataset. We used machine learning techniques to produce support vector machine (SVM) classifiers that attempted to differentiate between (1) all ADHD patients vs. healthy controls and (2) ADHD combined (ADHD-c) type vs. ADHD inattentive (ADHD-i) type vs. controls. In different tests, we used only the phenotypic data, only the imaging data, or else both the phenotypic and imaging data. For feature extraction on fMRI data, we tested the Fast Fourier Transform (FFT), different variants of Principal Component Analysis (PCA), and combinations of FFT and PCA. PCA variants included PCA over time (PCA-t), PCA over space and time (PCA-st), and kernelized PCA (kPCA-st). Baseline chance accuracy was 64.2% produced by guessing healthy control (the majority class) for all participants. Using only phenotypic data produced 72.9% accuracy on two class diagnosis and 66.8% on three class diagnosis. Diagnosis using only imaging data did not perform as well as phenotypic-only approaches. Using both phenotypic and imaging data with combined FFT and kPCA-st feature extraction yielded accuracies of 76.0% on two class diagnosis and 68.6% on three class diagnosis-better than phenotypic-only approaches. Our results demonstrate the potential of using FFT and kPCA-st with resting-state fMRI data as well as phenotypic data for automated diagnosis of ADHD. These results are encouraging given known challenges of learning ADHD diagnostic classifiers using the ADHD-200 dataset (see Brown et al., 2012).

  20. Analysis and Application of European Genetic Substructure Using 300 K SNP Information

    PubMed Central

    Tian, Chao; Plenge, Robert M; Ransom, Michael; Lee, Annette; Villoslada, Pablo; Selmi, Carlo; Klareskog, Lars; Pulver, Ann E; Qi, Lihong; Gregersen, Peter K; Seldin, Michael F

    2008-01-01

    European population genetic substructure was examined in a diverse set of >1,000 individuals of European descent, each genotyped with >300 K SNPs. Both STRUCTURE and principal component analyses (PCA) showed the largest division/principal component (PC) differentiated northern from southern European ancestry. A second PC further separated Italian, Spanish, and Greek individuals from those of Ashkenazi Jewish ancestry as well as distinguishing among northern European populations. In separate analyses of northern European participants other substructure relationships were discerned showing a west to east gradient. Application of this substructure information was critical in examining a real dataset in whole genome association (WGA) analyses for rheumatoid arthritis in European Americans to reduce false positive signals. In addition, two sets of European substructure ancestry informative markers (ESAIMs) were identified that provide substantial substructure information. The results provide further insight into European population genetic substructure and show that this information can be used for improving error rates in association testing of candidate genes and in replication studies of WGA scans. PMID:18208329

  1. Image restoration for three-dimensional fluorescence microscopy using an orthonormal basis for efficient representation of depth-variant point-spread functions

    PubMed Central

    Patwary, Nurmohammed; Preza, Chrysanthe

    2015-01-01

    A depth-variant (DV) image restoration algorithm for wide field fluorescence microscopy, using an orthonormal basis decomposition of DV point-spread functions (PSFs), is investigated in this study. The efficient PSF representation is based on a previously developed principal component analysis (PCA), which is computationally intensive. We present an approach developed to reduce the number of DV PSFs required for the PCA computation, thereby making the PCA-based approach computationally tractable for thick samples. Restoration results from both synthetic and experimental images show consistency and that the proposed algorithm addresses efficiently depth-induced aberration using a small number of principal components. Comparison of the PCA-based algorithm with a previously-developed strata-based DV restoration algorithm demonstrates that the proposed method improves performance by 50% in terms of accuracy and simultaneously reduces the processing time by 64% using comparable computational resources. PMID:26504634

  2. Comparative evaluation of urinary PCA3 and TMPRSS2: ERG scores and serum PHI in predicting prostate cancer aggressiveness.

    PubMed

    Tallon, Lucile; Luangphakdy, Devillier; Ruffion, Alain; Colombel, Marc; Devonec, Marian; Champetier, Denis; Paparel, Philippe; Decaussin-Petrucci, Myriam; Perrin, Paul; Vlaeminck-Guillem, Virginie

    2014-07-30

    It has been suggested that urinary PCA3 and TMPRSS2:ERG fusion tests and serum PHI correlate to cancer aggressiveness-related pathological criteria at prostatectomy. To evaluate and compare their ability in predicting prostate cancer aggressiveness, PHI and urinary PCA3 and TMPRSS2:ERG (T2) scores were assessed in 154 patients who underwent radical prostatectomy for biopsy-proven prostate cancer. Univariate and multivariate analyses using logistic regression and decision curve analyses were performed. All three markers were predictors of a tumor volume≥0.5 mL. Only PHI predicted Gleason score≥7. T2 score and PHI were both independent predictors of extracapsular extension(≥pT3), while multifocality was only predicted by PCA3 score. Moreover, when compared to a base model (age, digital rectal examination, serum PSA, and Gleason sum at biopsy), the addition of both PCA3 score and PHI to the base model induced a significant increase (+12%) when predicting tumor volume>0.5 mL. PHI and urinary PCA3 and T2 scores can be considered as complementary predictors of cancer aggressiveness at prostatectomy.

  3. Visualizing Hyolaryngeal Mechanics in Swallowing Using Dynamic MRI

    PubMed Central

    Pearson, William G.; Zumwalt, Ann C.

    2013-01-01

    Introduction Coordinates of anatomical landmarks are captured using dynamic MRI to explore whether a proposed two-sling mechanism underlies hyolaryngeal elevation in pharyngeal swallowing. A principal components analysis (PCA) is applied to coordinates to determine the covariant function of the proposed mechanism. Methods Dynamic MRI (dMRI) data were acquired from eleven healthy subjects during a repeated swallows task. Coordinates mapping the proposed mechanism are collected from each dynamic (frame) of a dynamic MRI swallowing series of a randomly selected subject in order to demonstrate shape changes in a single subject. Coordinates representing minimum and maximum hyolaryngeal elevation of all 11 subjects were also mapped to demonstrate shape changes of the system among all subjects. MophoJ software was used to perform PCA and determine vectors of shape change (eigenvectors) for elements of the two-sling mechanism of hyolaryngeal elevation. Results For both single subject and group PCAs, hyolaryngeal elevation accounted for the first principal component of variation. For the single subject PCA, the first principal component accounted for 81.5% of the variance. For the between subjects PCA, the first principal component accounted for 58.5% of the variance. Eigenvectors and shape changes associated with this first principal component are reported. Discussion Eigenvectors indicate that two-muscle slings and associated skeletal elements function as components of a covariant mechanism to elevate the hyolaryngeal complex. Morphological analysis is useful to model shape changes in the two-sling mechanism of hyolaryngeal elevation. PMID:25090608

  4. Construction and analysis of protein-protein interaction networks based on proteomics data of prostate cancer

    PubMed Central

    CHEN, CHEN; SHEN, HONG; ZHANG, LI-GUO; LIU, JIAN; CAO, XIAO-GE; YAO, AN-LIANG; KANG, SHAO-SAN; GAO, WEI-XING; HAN, HUI; CAO, FENG-HONG; LI, ZHI-GUO

    2016-01-01

    Currently, using human prostate cancer (PCa) tissue samples to conduct proteomics research has generated a large amount of data; however, only a very small amount has been thoroughly investigated. In this study, we manually carried out the mining of the full text of proteomics literature that involved comparisons between PCa and normal or benign tissue and identified 41 differentially expressed proteins verified or reported more than 2 times from different research studies. We regarded these proteins as seed proteins to construct a protein-protein interaction (PPI) network. The extended network included one giant network, which consisted of 1,264 nodes connected via 1,744 edges, and 3 small separate components. The backbone network was then constructed, which was derived from key nodes and the subnetwork consisting of the shortest path between seed proteins. Topological analyses of these networks were conducted to identify proteins essential for the genesis of PCa. Solute carrier family 2 (facilitated glucose transporter), member 4 (SLC2A4) had the highest closeness centrality located in the center of each network, and the highest betweenness centrality and largest degree in the backbone network. Tubulin, beta 2C (TUBB2C) had the largest degree in the giant network and subnetwork. In addition, using module analysis of the whole PPI network, we obtained a densely connected region. Functional annotation indicated that the Ras protein signal transduction biological process, mitogen-activated protein kinase (MAPK), neurotrophin and the gonadotropin-releasing hormone (GnRH) signaling pathway may play an important role in the genesis and development of PCa. Further investigation of the SLC2A4, TUBB2C proteins, and these biological processes and pathways may therefore provide a potential target for the diagnosis and treatment of PCa. PMID:27121963

  5. An Intelligent Architecture Based on Field Programmable Gate Arrays Designed to Detect Moving Objects by Using Principal Component Analysis

    PubMed Central

    Bravo, Ignacio; Mazo, Manuel; Lázaro, José L.; Gardel, Alfredo; Jiménez, Pedro; Pizarro, Daniel

    2010-01-01

    This paper presents a complete implementation of the Principal Component Analysis (PCA) algorithm in Field Programmable Gate Array (FPGA) devices applied to high rate background segmentation of images. The classical sequential execution of different parts of the PCA algorithm has been parallelized. This parallelization has led to the specific development and implementation in hardware of the different stages of PCA, such as computation of the correlation matrix, matrix diagonalization using the Jacobi method and subspace projections of images. On the application side, the paper presents a motion detection algorithm, also entirely implemented on the FPGA, and based on the developed PCA core. This consists of dynamically thresholding the differences between the input image and the one obtained by expressing the input image using the PCA linear subspace previously obtained as a background model. The proposal achieves a high ratio of processed images (up to 120 frames per second) and high quality segmentation results, with a completely embedded and reliable hardware architecture based on commercial CMOS sensors and FPGA devices. PMID:22163406

  6. An intelligent architecture based on Field Programmable Gate Arrays designed to detect moving objects by using Principal Component Analysis.

    PubMed

    Bravo, Ignacio; Mazo, Manuel; Lázaro, José L; Gardel, Alfredo; Jiménez, Pedro; Pizarro, Daniel

    2010-01-01

    This paper presents a complete implementation of the Principal Component Analysis (PCA) algorithm in Field Programmable Gate Array (FPGA) devices applied to high rate background segmentation of images. The classical sequential execution of different parts of the PCA algorithm has been parallelized. This parallelization has led to the specific development and implementation in hardware of the different stages of PCA, such as computation of the correlation matrix, matrix diagonalization using the Jacobi method and subspace projections of images. On the application side, the paper presents a motion detection algorithm, also entirely implemented on the FPGA, and based on the developed PCA core. This consists of dynamically thresholding the differences between the input image and the one obtained by expressing the input image using the PCA linear subspace previously obtained as a background model. The proposal achieves a high ratio of processed images (up to 120 frames per second) and high quality segmentation results, with a completely embedded and reliable hardware architecture based on commercial CMOS sensors and FPGA devices.

  7. Genetic variability of Brazilian isolates of Alternaria alternata detected by AFLP and RAPD techniques

    PubMed Central

    Dini-Andreote, Francisco; Pietrobon, Vivian Cristina; Andreote, Fernando Dini; Romão, Aline Silva; Spósito, Marcel Bellato; Araújo, Welington Luiz

    2009-01-01

    The Alternaria brown spot (ABS) is a disease caused in tangerine plants and its hybrids by the fungus Alternaria alternata f. sp. citri which has been found in Brazil since 2001. Due to the recent occurrence in Brazilian orchards, the epidemiology and genetic variability of this pathogen is still an issue to be addressed. Here it is presented a survey about the genetic variability of this fungus by the characterization of twenty four pathogenic isolates of A. alternata f. sp. citri from citrus plants and four endophytic isolates from mango (one Alternaria tenuissima and three Alternaria arborescens). The application of two molecular markers Random Amplified Polymorphic DNA (RAPD) and Amplified Fragment Length Polymorphism (AFLP) had revealed the isolates clustering in distinct groups when fingerprintings were analyzed by Principal Components Analysis (PCA). Despite the better assessment of the genetic variability through the AFLP, significant modifications in clusters components were not observed, and only slight shifts in the positioning of isolates LRS 39/3 and 25M were observed in PCA plots. Furthermore, in both analyses, only the isolates from lemon plants revealed to be clustered, differently from the absence of clustering for other hosts or plant tissues. Summarizing, both RAPD and AFLP analyses were both efficient to detect the genetic variability within the population of the pathogenic fungus Alternaria spp., supplying information on the genetic variability of this species as a basis for further studies aiming the disease control. PMID:24031413

  8. Identifying the regional-scale groundwater-surface water interaction on the Sanjiang Plain, Northeast China.

    PubMed

    Wang, Xihua; Zhang, Guangxin; Xu, Y Jun; Sun, Guangzhi

    2015-11-01

    Assessment on the interaction between groundwater and surface water (GW-SW) can generate information that is critical to regional water resource management, especially for regions that are highly dependent on groundwater resources for irrigation. This study investigated such interaction on China's Sanjiang Plain (10.9 × 10(4) km(2)) and produced results to assist sustainable regional water management for intensive agricultural activities. Methods of hierarchical cluster analysis (HCA), principal component analysis (PCA), and statistical analysis were used in this study. One hundred two water samplings (60 from shallow groundwater, 7 from deep groundwater, and 35 from surface water) were collected and grouped into three clusters and seven sub-clusters during the analyses. The PCA analysis identified four principal components of the interaction, which explained 85.9% variance of total database, attributed to the dissolution and evolution of gypsum, feldspar, and other natural minerals in the region that was affected by anthropic and geological (sedimentary rock mineral) activities. The analyses showed that surface water in the upper region of the Sanjiang Plain gained water from local shallow groundwater, indicating that the surface water in the upper region was relatively more resilient to withdrawal for usage, whereas in the middle region, there was only a weak interaction between shallow groundwater and surface water. In the lower region of the Sanjiang Plain, surface water lost water to shallow groundwater, indicating that the groundwater was vulnerable to pollution by pesticides and fertilizers from terrestrial sources.

  9. PEM-PCA: a parallel expectation-maximization PCA face recognition architecture.

    PubMed

    Rujirakul, Kanokmon; So-In, Chakchai; Arnonkijpanich, Banchar

    2014-01-01

    Principal component analysis or PCA has been traditionally used as one of the feature extraction techniques in face recognition systems yielding high accuracy when requiring a small number of features. However, the covariance matrix and eigenvalue decomposition stages cause high computational complexity, especially for a large database. Thus, this research presents an alternative approach utilizing an Expectation-Maximization algorithm to reduce the determinant matrix manipulation resulting in the reduction of the stages' complexity. To improve the computational time, a novel parallel architecture was employed to utilize the benefits of parallelization of matrix computation during feature extraction and classification stages including parallel preprocessing, and their combinations, so-called a Parallel Expectation-Maximization PCA architecture. Comparing to a traditional PCA and its derivatives, the results indicate lower complexity with an insignificant difference in recognition precision leading to high speed face recognition systems, that is, the speed-up over nine and three times over PCA and Parallel PCA.

  10. Health behaviours, body weight and self-esteem among grade five students in Canada.

    PubMed

    Wu, Xiuyun; Kirk, Sara F L; Ohinmaa, Arto; Veugelers, Paul

    2016-01-01

    This study sought to identify the principal components of self-esteem and the health behavioural determinants of these components among grade five students. We analysed data from a population-based survey among 4918 grade five students, who are primarily 10 and 11 years of age, and their parents in the Canadian province of Nova Scotia. The survey comprised the Harvard Youth and Adolescent Questionnaire, parental reporting of students' physical activity (PA) and time spent watching television or using computer/video games. Students heights and weights were objectively measured. We applied principal component analysis (PCA) to derive the components of self-esteem, and multilevel, multivariable logistic regression to quantify associations of diet quality, PA, sedentary behaviour and body weight with these components of self-esteem. PCA identified four components for self-esteem: self-perception, externalizing problems, internalizing problems, social-perception. Influences of health behaviours and body weight on self-esteem varied across the components. Better diet quality was associated with higher self-perception and fewer externalizing problems. Less PA and more use of computer/video games were related to lower self-perception and social-perception. Excessive TV watching was associated with more internalizing problems. Students classified as obese were more likely to report low self- and social-perception, and to experience fewer externalizing problems relative to students classified as normal weight. This study demonstrates independent influences of diet quality, physical activity, sedentary behaviour and body weight on four aspects of self-esteem among children. These findings suggest that school programs and health promotion strategies that target health behaviours may benefit self-esteem in childhood, and mental health and quality of life later in life.

  11. Predictors affecting personal health information management skills.

    PubMed

    Kim, Sujin; Abner, Erin

    2016-01-01

    This study investigated major factors affecting personal health records (PHRs) management skills associated with survey respondents' health information management related activities. A self-report survey was used to assess individuals' personal characteristics, health knowledge, PHR skills, and activities. Factors underlying respondents' current PHR-related activities were derived using principal component analysis (PCA). Scale scores were calculated based on the results of the PCA, and hierarchical linear regression analyses were used to identify respondent characteristics associated with the scale scores. Internal consistency of the derived scale scores was assessed with Cronbach's α. Among personal health information activities surveyed (N = 578 respondents), the four extracted factors were subsequently grouped and labeled as: collecting skills (Cronbach's α = 0.906), searching skills (Cronbach's α = 0.837), sharing skills (Cronbach's α = 0.763), and implementing skills (Cronbach's α = 0.908). In the hierarchical regression analyses, education and computer knowledge significantly increased the explanatory power of the models. Health knowledge (β = 0.25, p < 0.001) emerged as a positive predictor of PHR collecting skills. This study confirmed that PHR training and learning should consider a full spectrum of information management skills including collection, utilization and distribution to support patients' care and prevention continua.

  12. Principal components analysis of Jupiter VIMS spectra

    USGS Publications Warehouse

    Bellucci, G.; Formisano, V.; D'Aversa, E.; Brown, R.H.; Baines, K.H.; Bibring, J.-P.; Buratti, B.J.; Capaccioni, F.; Cerroni, P.; Clark, R.N.; Coradini, A.; Cruikshank, D.P.; Drossart, P.; Jaumann, R.; Langevin, Y.; Matson, D.L.; McCord, T.B.; Mennella, V.; Nelson, R.M.; Nicholson, P.D.; Sicardy, B.; Sotin, Christophe; Chamberlain, M.C.; Hansen, G.; Hibbits, K.; Showalter, M.; Filacchione, G.

    2004-01-01

    During Cassini - Jupiter flyby occurred in December 2000, Visual-Infrared mapping spectrometer (VIMS) instrument took several image cubes of Jupiter at different phase angles and distances. We have analysed the spectral images acquired by the VIMS visual channel by means of a principal component analysis technique (PCA). The original data set consists of 96 spectral images in the 0.35-1.05 ??m wavelength range. The product of the analysis are new PC bands, which contain all the spectral variance of the original data. These new components have been used to produce a map of Jupiter made of seven coherent spectral classes. The map confirms previously published work done on the Great Red Spot by using NIMS data. Some other new findings, presently under investigation, are presented. ?? 2004 Published by Elsevier Ltd on behalf of COSPAR.

  13. Contact- and distance-based principal component analysis of protein dynamics.

    PubMed

    Ernst, Matthias; Sittel, Florian; Stock, Gerhard

    2015-12-28

    To interpret molecular dynamics simulations of complex systems, systematic dimensionality reduction methods such as principal component analysis (PCA) represent a well-established and popular approach. Apart from Cartesian coordinates, internal coordinates, e.g., backbone dihedral angles or various kinds of distances, may be used as input data in a PCA. Adopting two well-known model problems, folding of villin headpiece and the functional dynamics of BPTI, a systematic study of PCA using distance-based measures is presented which employs distances between Cα-atoms as well as distances between inter-residue contacts including side chains. While this approach seems prohibitive for larger systems due to the quadratic scaling of the number of distances with the size of the molecule, it is shown that it is sufficient (and sometimes even better) to include only relatively few selected distances in the analysis. The quality of the PCA is assessed by considering the resolution of the resulting free energy landscape (to identify metastable conformational states and barriers) and the decay behavior of the corresponding autocorrelation functions (to test the time scale separation of the PCA). By comparing results obtained with distance-based, dihedral angle, and Cartesian coordinates, the study shows that the choice of input variables may drastically influence the outcome of a PCA.

  14. Contact- and distance-based principal component analysis of protein dynamics

    NASA Astrophysics Data System (ADS)

    Ernst, Matthias; Sittel, Florian; Stock, Gerhard

    2015-12-01

    To interpret molecular dynamics simulations of complex systems, systematic dimensionality reduction methods such as principal component analysis (PCA) represent a well-established and popular approach. Apart from Cartesian coordinates, internal coordinates, e.g., backbone dihedral angles or various kinds of distances, may be used as input data in a PCA. Adopting two well-known model problems, folding of villin headpiece and the functional dynamics of BPTI, a systematic study of PCA using distance-based measures is presented which employs distances between Cα-atoms as well as distances between inter-residue contacts including side chains. While this approach seems prohibitive for larger systems due to the quadratic scaling of the number of distances with the size of the molecule, it is shown that it is sufficient (and sometimes even better) to include only relatively few selected distances in the analysis. The quality of the PCA is assessed by considering the resolution of the resulting free energy landscape (to identify metastable conformational states and barriers) and the decay behavior of the corresponding autocorrelation functions (to test the time scale separation of the PCA). By comparing results obtained with distance-based, dihedral angle, and Cartesian coordinates, the study shows that the choice of input variables may drastically influence the outcome of a PCA.

  15. Algorithms for accelerated convergence of adaptive PCA.

    PubMed

    Chatterjee, C; Kang, Z; Roychowdhury, V P

    2000-01-01

    We derive and discuss new adaptive algorithms for principal component analysis (PCA) that are shown to converge faster than the traditional PCA algorithms due to Oja, Sanger, and Xu. It is well known that traditional PCA algorithms that are derived by using gradient descent on an objective function are slow to converge. Furthermore, the convergence of these algorithms depends on appropriate choices of the gain sequences. Since online applications demand faster convergence and an automatic selection of gains, we present new adaptive algorithms to solve these problems. We first present an unconstrained objective function, which can be minimized to obtain the principal components. We derive adaptive algorithms from this objective function by using: 1) gradient descent; 2) steepest descent; 3) conjugate direction; and 4) Newton-Raphson methods. Although gradient descent produces Xu's LMSER algorithm, the steepest descent, conjugate direction, and Newton-Raphson methods produce new adaptive algorithms for PCA. We also provide a discussion on the landscape of the objective function, and present a global convergence proof of the adaptive gradient descent PCA algorithm using stochastic approximation theory. Extensive experiments with stationary and nonstationary multidimensional Gaussian sequences show faster convergence of the new algorithms over the traditional gradient descent methods.We also compare the steepest descent adaptive algorithm with state-of-the-art methods on stationary and nonstationary sequences.

  16. An application of principal component analysis to the clavicle and clavicle fixation devices.

    PubMed

    Daruwalla, Zubin J; Courtis, Patrick; Fitzpatrick, Clare; Fitzpatrick, David; Mullett, Hannan

    2010-03-26

    Principal component analysis (PCA) enables the building of statistical shape models of bones and joints. This has been used in conjunction with computer assisted surgery in the past. However, PCA of the clavicle has not been performed. Using PCA, we present a novel method that examines the major modes of size and three-dimensional shape variation in male and female clavicles and suggests a method of grouping the clavicle into size and shape categories. Twenty-one high-resolution computerized tomography scans of the clavicle were reconstructed and analyzed using a specifically developed statistical software package. After performing statistical shape analysis, PCA was applied to study the factors that account for anatomical variation. The first principal component representing size accounted for 70.5 percent of anatomical variation. The addition of a further three principal components accounted for almost 87 percent. Using statistical shape analysis, clavicles in males have a greater lateral depth and are longer, wider and thicker than in females. However, the sternal angle in females is larger than in males. PCA confirmed these differences between genders but also noted that men exhibit greater variance and classified clavicles into five morphological groups. This unique approach is the first that standardizes a clavicular orientation. It provides information that is useful to both, the biomedical engineer and clinician. Other applications include implant design with regard to modifying current or designing future clavicle fixation devices. Our findings support the need for further development of clavicle fixation devices and the questioning of whether gender-specific devices are necessary.

  17. [Soil and forest structure in the Colombian Amazon].

    PubMed

    Calle-Rendón, Bayron R; Moreno, Flavio; Cárdenas López, Dairon

    2011-09-01

    Forests structural differences could result of environmental variations at different scales. Because soils are an important component of plant's environment, it is possible that edaphic and structural variables are associated and that, in consequence, spatial autocorrelation occurs. This paper aims to answer two questions: (1) are structural and edaphic variables associated at local scale in a terra firme forest of Colombian Amazonia? and (2) are these variables regionalized at the scale of work? To answer these questions we analyzed the data of a 6ha plot established in a terra firme forest of the Amacayacu National Park. Structural variables included basal area and density of large trees (diameter > or = 10cm) (Gdos and Ndos), basal area and density of understory individuals (diameter < 10cm) (Gsot and Nsot) and number of species of large trees (sp). Edaphic variables included were pH, organic matter, P, Mg, Ca, K, Al, sand, silt and clay. Structural and edaphic variables were reduced through a principal component analysis (PCA); then, the association between edaphic and structural components from PCA was evaluated by multiple regressions. The existence of regionalization of these variables was studied through isotropic variograms, and autocorrelated variables were spatially mapped. PCA found two significant components for structure, corresponding to the structure of large trees (G, Gdos, Ndos and sp) and of small trees (N, Nsot and Gsot), which explained 43.9% and 36.2% of total variance, respectively. Four components were identified for edaphic variables, which globally explained 81.9% of total variance and basically represent drainage and soil fertility. Regression analyses were significant (p < 0.05) and showed that the structure of both large and small trees is associated with greater sand contents and low soil fertility, though they explained a low proportion of total variability (R2 was 4.9% and 16.5% for the structure of large trees and small tress, respectively). Variables with spatial autocorrelation were the structure of small trees, Al, silt, and sand. Among them, Nsot and sand content showed similar patterns of spatial distribution inside the plot.

  18. Differences in chewing sounds of dry-crisp snacks by multivariate data analysis

    NASA Astrophysics Data System (ADS)

    De Belie, N.; Sivertsvik, M.; De Baerdemaeker, J.

    2003-09-01

    Chewing sounds of different types of dry-crisp snacks (two types of potato chips, prawn crackers, cornflakes and low calorie snacks from extruded starch) were analysed to assess differences in sound emission patterns. The emitted sounds were recorded by a microphone placed over the ear canal. The first bite and the first subsequent chew were selected from the time signal and a fast Fourier transformation provided the power spectra. Different multivariate analysis techniques were used for classification of the snack groups. This included principal component analysis (PCA) and unfold partial least-squares (PLS) algorithms, as well as multi-way techniques such as three-way PLS, three-way PCA (Tucker3), and parallel factor analysis (PARAFAC) on the first bite and subsequent chew. The models were evaluated by calculating the classification errors and the root mean square error of prediction (RMSEP) for independent validation sets. It appeared that the logarithm of the power spectra obtained from the chewing sounds could be used successfully to distinguish the different snack groups. When different chewers were used, recalibration of the models was necessary. Multi-way models distinguished better between chewing sounds of different snack groups than PCA on bite or chew separately and than unfold PLS. From all three-way models applied, N-PLS with three components showed the best classification capabilities, resulting in classification errors of 14-18%. The major amount of incorrect classifications was due to one type of potato chips that had a very irregular shape, resulting in a wide variation of the emitted sounds.

  19. Principal component analysis of indocyanine green fluorescence dynamics for diagnosis of vascular diseases

    NASA Astrophysics Data System (ADS)

    Seo, Jihye; An, Yuri; Lee, Jungsul; Choi, Chulhee

    2015-03-01

    Indocyanine green (ICG), a near-infrared fluorophore, has been used in visualization of vascular structure and non-invasive diagnosis of vascular disease. Although many imaging techniques have been developed, there are still limitations in diagnosis of vascular diseases. We have recently developed a minimally invasive diagnostics system based on ICG fluorescence imaging for sensitive detection of vascular insufficiency. In this study, we used principal component analysis (PCA) to examine ICG spatiotemporal profile and to obtain pathophysiological information from ICG dynamics. Here we demonstrated that principal components of ICG dynamics in both feet showed significant differences between normal control and diabetic patients with vascula complications. We extracted the PCA time courses of the first three components and found distinct pattern in diabetic patient. We propose that PCA of ICG dynamics reveal better classification performance compared to fluorescence intensity analysis. We anticipate that specific feature of spatiotemporal ICG dynamics can be useful in diagnosis of various vascular diseases.

  20. Biotic and abiotic degradation of 1,1,2,2-tetrachloroethane in wetland sediments: Geochemical and microbial community analyses

    USGS Publications Warehouse

    Lorah, M.M.; Voytek, M.A.; Kirshtein, J.

    2000-01-01

    Additional microcosm experiments with the wetland sediment and groundwater at the Aberdeen Proving Ground, MD, site was presented to assist in elucidating the conditions under which these potentially competing biotic and abiotic degradation reactions for 1,1,2,2-tetrachloroethane (PCA) occur in the environment and to evaluate potential seasonal changes in degradation reactions. PCA concentration decreased to below detection within 21 days in the March 1999 experiment, while PCA was still present at day 35 in the July 1999 experiment. Compared to March 1999 experiment, peak concentrations of all daughter products except trichloroethylene (TCE) were delayed in the July 1999 experiment. The relative intensity of the peaks was directly related to the biomass present for each fragment length (bp, base pair). The relative intensities were lower in sediment collected in August 1999 than in March 1999, especially in the bp size range of ??? 160??-240??. These microbial community analyses, along with the geochemical analyses of the microcosms, provide evidence that abiotic production of TCE from PCA degradation is more significant under conditions of low bacterial biomass in the wetland sediments.

  1. The Use of the Visualisation of Multidimensional Data Using PCA to Evaluate Possibilities of the Division of Coal Samples Space Due to their Suitability for Fluidised Gasification

    NASA Astrophysics Data System (ADS)

    Jamróz, Dariusz; Niedoba, Tomasz; Surowiak, Agnieszka; Tumidajski, Tadeusz

    2016-09-01

    Methods serving to visualise multidimensional data through the transformation of multidimensional space into two-dimensional space, enable to present the multidimensional data on the computer screen. Thanks to this, qualitative analysis of this data can be performed in the most natural way for humans, through the sense of sight. An example of such a method of multidimensional data visualisation is PCA (principal component analysis) method. This method was used in this work to present and analyse a set of seven-dimensional data (selected seven properties) describing coal samples obtained from Janina and Wieczorek coal mines. Coal from these mines was previously subjected to separation by means of a laboratory ring jig, consisting of ten rings. With 5 layers of both types of coal (with 2 rings each) were obtained in this way. It was decided to check if the method of multidimensional data visualisation enables to divide the space of such divided samples into areas with different suitability for the fluidised gasification process. To that end, the card of technological suitability of coal was used (Sobolewski et al., 2012; 2013), in which key, relevant and additional parameters, having effect on the gasification process, were described. As a result of analyses, it was stated that effective determination of coal samples suitability for the on-surface gasification process in a fluidised reactor is possible. The PCA method enables the visualisation of the optimal subspace containing the set requirements concerning the properties of coals intended for this process.

  2. Side Effect Perceptions and Their Impact on Treatment Decisions in Women.

    PubMed

    Waters, Erika A; Pachur, Thorsten; Colditz, Graham A

    2017-04-01

    Side effects prompt some patients to forego otherwise-beneficial therapies. This study explored which characteristics make side effects particularly aversive. We used a psychometric approach, originating from research on risk perception, to identify the factors (or components) underlying side effect perceptions. Women ( N = 149) aged 40 to 74 years were recruited from a patient registry to complete an online experiment. Participants were presented with hypothetical scenarios in which an effective and necessary medication conferred a small risk of a single side effect (e.g., nausea, dizziness). They rated a broad range of side effects on several characteristics (e.g., embarrassing, treatable). In addition, we collected 4 measures of aversiveness for each side effect: choosing to take the medication, willingness to pay to avoid the side effect (WTP), negative affective attitude associated with the side effect, and how each side effect ranks among others in terms of undesirability. A principal components analysis (PCA) was used to identify the components underlying side effect perceptions. Then, for each aversiveness measure separately, regression analyses were used to determine which components predicted differences in aversiveness among the side effects. The PCA revealed 4 components underlying side effect perceptions: affective challenge (e.g., frightening), social challenge (e.g., disfiguring), physical challenge (e.g., painful), and familiarity (e.g., common). Side effects perceived as affectively and physically challenging elicited the highest levels of aversiveness across all 4 measures. Understanding what side effect characteristics are most aversive may inform interventions to improve medical decisions and facilitate the translation of novel biomedical therapies into clinical practice.

  3. Side Effect Perceptions and their Impact on Treatment Decisions in Women

    PubMed Central

    Waters, Erika A.; Pachur, Thorsten; Colditz, Graham A.

    2016-01-01

    Background Side effects prompt some patients to forego otherwise-beneficial therapies. This study explored which characteristics make side effects particularly aversive. Methods We used a psychometric approach, originating from research on risk perception, to identify the factors (or components) underlying side effect perceptions. Women (N=149) aged 40–74 were recruited from a patient registry to complete an online experiment. Participants were presented with hypothetical scenarios in which an effective and necessary medication conferred a small risk of a single side effect (e.g., nausea, dizziness). They rated a broad range of side effects on several characteristics (e.g., embarrassing, treatable). In addition, we collected four measures of aversiveness for each side effect: choosing to take the medication, willingness to pay to avoid the side effect (WTP), negative affective attitude associated with the side effect, and how each side effect ranks among others in terms of undesirability. A principle-components analysis (PCA) was used to identify the components underlying side effect perceptions. Then, for each aversiveness measure separately, regression analyses were used to determine which components predicted differences in aversiveness among the side effects. Results The PCA revealed four components underlying side effect perceptions: affective challenge (e.g., frightening), social challenge (e.g., disfiguring), physical challenge (e.g., painful), and familiarity (e.g., common). Side effects perceived as affectively and physically challenging elicited the highest levels of aversiveness across all four measures. Conclusions Understanding what side effect characteristics are most aversive may inform interventions to improve medical decisions and facilitate the translation of novel biomedical therapies into clinical practice. PMID:27216581

  4. Analysis and Evaluation of the Characteristic Taste Components in Portobello Mushroom.

    PubMed

    Wang, Jinbin; Li, Wen; Li, Zhengpeng; Wu, Wenhui; Tang, Xueming

    2018-05-10

    To identify the characteristic taste components of the common cultivated mushroom (brown; Portobello), Agaricus bisporus, taste components in the stipe and pileus of Portobello mushroom harvested at different growth stages were extracted and identified, and principal component analysis (PCA) and taste active value (TAV) were used to reveal the characteristic taste components during the each of the growth stages of Portobello mushroom. In the stipe and pileus, 20 and 14 different principal taste components were identified, respectively, and they were considered as the principal taste components of Portobello mushroom fruit bodies, which included most amino acids and 5'-nucleotides. Some taste components that were found at high levels, such as lactic acid and citric acid, were not detected as Portobello mushroom principal taste components through PCA. However, due to their high content, Portobello mushroom could be used as a source of organic acids. The PCA and TAV results revealed that 5'-GMP, glutamic acid, malic acid, alanine, proline, leucine, and aspartic acid were the characteristic taste components of Portobello mushroom fruit bodies. Portobello mushroom was also found to be rich in protein and amino acids, so it might also be useful in the formulation of nutraceuticals and functional food. The results in this article could provide a theoretical basis for understanding and regulating the characteristic flavor components synthesis process of Portobello mushroom. © 2018 Institute of Food Technologists®.

  5. Essential-Oil Variability in Natural Populations of Pinus mugo Turra from the Julian Alps.

    PubMed

    Bojović, Srdjan; Jurc, Maja; Ristić, Mihailo; Popović, Zorica; Matić, Rada; Vidaković, Vera; Stefanović, Milena; Jurc, Dušan

    2016-02-01

    The composition and variability of the terpenes and their derivatives isolated from the needles of a representative pool of 114 adult trees originating from four natural populations of dwarf mountain pine (Pinus mugo Turra) from the Julian Alps were investigated by GC-FID and GC/MS analyses. In total, 54 of the 57 detected essential-oil components were identified. Among the different compound classes present in the essential oils, the chief constituents belonged to the monoterpenes, comprising an average content of 79.67% of the total oil composition (74.80% of monoterpene hydrocarbons and 4.87% of oxygenated monoterpenes). Sesquiterpenes were present in smaller amounts (average content of 19.02%), out of which 16.39% were sesquiterpene hydrocarbons and 2.62% oxygenated sesquiterpenes. The most abundant components in the needle essential oils were the monoterpenes δ-car-3-ene, β-phellandrene, α-pinene, β-myrcene, and β-pinene and the sesquiterpene β-caryophyllene. From the total data set of 57 detected compounds, 40 were selected for principal-component analysis (PCA), discriminant analysis (DA), and cluster analysis (CA). The overlap tendency of the four populations suggested by PCA, was as well observed by DA. CA also demonstrated similarity among the populations, which was the highest between Populations I and II. Copyright © 2016 Verlag Helvetica Chimica Acta AG, Zürich.

  6. Variability search in M 31 using principal component analysis and the Hubble Source Catalogue

    NASA Astrophysics Data System (ADS)

    Moretti, M. I.; Hatzidimitriou, D.; Karampelas, A.; Sokolovsky, K. V.; Bonanos, A. Z.; Gavras, P.; Yang, M.

    2018-06-01

    Principal component analysis (PCA) is being extensively used in Astronomy but not yet exhaustively exploited for variability search. The aim of this work is to investigate the effectiveness of using the PCA as a method to search for variable stars in large photometric data sets. We apply PCA to variability indices computed for light curves of 18 152 stars in three fields in M 31 extracted from the Hubble Source Catalogue. The projection of the data into the principal components is used as a stellar variability detection and classification tool, capable of distinguishing between RR Lyrae stars, long-period variables (LPVs) and non-variables. This projection recovered more than 90 per cent of the known variables and revealed 38 previously unknown variable stars (about 30 per cent more), all LPVs except for one object of uncertain variability type. We conclude that this methodology can indeed successfully identify candidate variable stars.

  7. Demixed principal component analysis of neural population data.

    PubMed

    Kobak, Dmitry; Brendel, Wieland; Constantinidis, Christos; Feierstein, Claudia E; Kepecs, Adam; Mainen, Zachary F; Qi, Xue-Lian; Romo, Ranulfo; Uchida, Naoshige; Machens, Christian K

    2016-04-12

    Neurons in higher cortical areas, such as the prefrontal cortex, are often tuned to a variety of sensory and motor variables, and are therefore said to display mixed selectivity. This complexity of single neuron responses can obscure what information these areas represent and how it is represented. Here we demonstrate the advantages of a new dimensionality reduction technique, demixed principal component analysis (dPCA), that decomposes population activity into a few components. In addition to systematically capturing the majority of the variance of the data, dPCA also exposes the dependence of the neural representation on task parameters such as stimuli, decisions, or rewards. To illustrate our method we reanalyze population data from four datasets comprising different species, different cortical areas and different experimental tasks. In each case, dPCA provides a concise way of visualizing the data that summarizes the task-dependent features of the population response in a single figure.

  8. A Genealogical Interpretation of Principal Components Analysis

    PubMed Central

    McVean, Gil

    2009-01-01

    Principal components analysis, PCA, is a statistical method commonly used in population genetics to identify structure in the distribution of genetic variation across geographical location and ethnic background. However, while the method is often used to inform about historical demographic processes, little is known about the relationship between fundamental demographic parameters and the projection of samples onto the primary axes. Here I show that for SNP data the projection of samples onto the principal components can be obtained directly from considering the average coalescent times between pairs of haploid genomes. The result provides a framework for interpreting PCA projections in terms of underlying processes, including migration, geographical isolation, and admixture. I also demonstrate a link between PCA and Wright's fst and show that SNP ascertainment has a largely simple and predictable effect on the projection of samples. Using examples from human genetics, I discuss the application of these results to empirical data and the implications for inference. PMID:19834557

  9. Classification using NMR-based metabolomics of Sophora flavescens grown in Japan and China.

    PubMed

    Suzuki, Ryuichiro; Ikeda, Yuriko; Yamamoto, Akari; Saima, Toyoe; Fujita, Tatsuya; Fukuda, Tatsuo; Fukuda, Eriko; Baba, Masaki; Okada, Yoshihito; Shirataki, Yoshiaki

    2012-11-01

    We demonstrate that NMR-based metabolomics can be used to identify the country of growth (Japan or China) of Sophora flavescens plants. Principle Component Analysis (PCA) conducted on extracts of S. flavescens grown in China provided data distinct from that of extracts of plants grown in Japan. Loading plot analysis showed signals characteristic of Japanese S. flavescens. NMR analyses showed these signals to be due to kurarinol (1) and kushenol H (2). These compounds were confirmed by HPLC analysis to be distinctive markers for Japanese S. flavescens.

  10. Derivation of Boundary Manikins: A Principal Component Analysis

    NASA Technical Reports Server (NTRS)

    Young, Karen; Margerum, Sarah; Barr, Abbe; Ferrer, Mike A.; Rajulu, Sudhakar

    2008-01-01

    When designing any human-system interface, it is critical to provide realistic anthropometry to properly represent how a person fits within a given space. This study aimed to identify a minimum number of boundary manikins or representative models of subjects anthropometry from a target population, which would realistically represent the population. The boundary manikin anthropometry was derived using, Principal Component Analysis (PCA). PCA is a statistical approach to reduce a multi-dimensional dataset using eigenvectors and eigenvalues. The measurements used in the PCA were identified as those measurements critical for suit and cockpit design. The PCA yielded a total of 26 manikins per gender, as well as their anthropometry from the target population. Reduction techniques were implemented to reduce this number further with a final result of 20 female and 22 male subjects. The anthropometry of the boundary manikins was then be used to create 3D digital models (to be discussed in subsequent papers) intended for use by designers to test components of their space suit design, to verify that the requirements specified in the Human Systems Integration Requirements (HSIR) document are met. The end-goal is to allow for designers to generate suits which accommodate the diverse anthropometry of the user population.

  11. Binding Isotherms and Time Courses Readily from Magnetic Resonance.

    PubMed

    Xu, Jia; Van Doren, Steven R

    2016-08-16

    Evidence is presented that binding isotherms, simple or biphasic, can be extracted directly from noninterpreted, complex 2D NMR spectra using principal component analysis (PCA) to reveal the largest trend(s) across the series. This approach renders peak picking unnecessary for tracking population changes. In 1:1 binding, the first principal component captures the binding isotherm from NMR-detected titrations in fast, slow, and even intermediate and mixed exchange regimes, as illustrated for phospholigand associations with proteins. Although the sigmoidal shifts and line broadening of intermediate exchange distorts binding isotherms constructed conventionally, applying PCA directly to these spectra along with Pareto scaling overcomes the distortion. Applying PCA to time-domain NMR data also yields binding isotherms from titrations in fast or slow exchange. The algorithm readily extracts from magnetic resonance imaging movie time courses such as breathing and heart rate in chest imaging. Similarly, two-step binding processes detected by NMR are easily captured by principal components 1 and 2. PCA obviates the customary focus on specific peaks or regions of images. Applying it directly to a series of complex data will easily delineate binding isotherms, equilibrium shifts, and time courses of reactions or fluctuations.

  12. An algorithm for separation of mixed sparse and Gaussian sources

    PubMed Central

    Akkalkotkar, Ameya

    2017-01-01

    Independent component analysis (ICA) is a ubiquitous method for decomposing complex signal mixtures into a small set of statistically independent source signals. However, in cases in which the signal mixture consists of both nongaussian and Gaussian sources, the Gaussian sources will not be recoverable by ICA and will pollute estimates of the nongaussian sources. Therefore, it is desirable to have methods for mixed ICA/PCA which can separate mixtures of Gaussian and nongaussian sources. For mixtures of purely Gaussian sources, principal component analysis (PCA) can provide a basis for the Gaussian subspace. We introduce a new method for mixed ICA/PCA which we call Mixed ICA/PCA via Reproducibility Stability (MIPReSt). Our method uses a repeated estimations technique to rank sources by reproducibility, combined with decomposition of multiple subsamplings of the original data matrix. These multiple decompositions allow us to assess component stability as the size of the data matrix changes, which can be used to determinine the dimension of the nongaussian subspace in a mixture. We demonstrate the utility of MIPReSt for signal mixtures consisting of simulated sources and real-word (speech) sources, as well as mixture of unknown composition. PMID:28414814

  13. An algorithm for separation of mixed sparse and Gaussian sources.

    PubMed

    Akkalkotkar, Ameya; Brown, Kevin Scott

    2017-01-01

    Independent component analysis (ICA) is a ubiquitous method for decomposing complex signal mixtures into a small set of statistically independent source signals. However, in cases in which the signal mixture consists of both nongaussian and Gaussian sources, the Gaussian sources will not be recoverable by ICA and will pollute estimates of the nongaussian sources. Therefore, it is desirable to have methods for mixed ICA/PCA which can separate mixtures of Gaussian and nongaussian sources. For mixtures of purely Gaussian sources, principal component analysis (PCA) can provide a basis for the Gaussian subspace. We introduce a new method for mixed ICA/PCA which we call Mixed ICA/PCA via Reproducibility Stability (MIPReSt). Our method uses a repeated estimations technique to rank sources by reproducibility, combined with decomposition of multiple subsamplings of the original data matrix. These multiple decompositions allow us to assess component stability as the size of the data matrix changes, which can be used to determinine the dimension of the nongaussian subspace in a mixture. We demonstrate the utility of MIPReSt for signal mixtures consisting of simulated sources and real-word (speech) sources, as well as mixture of unknown composition.

  14. RECENT APPLICATIONS OF SOURCE APPORTIONMENT METHODS AND RELATED NEEDS

    EPA Science Inventory

    Traditional receptor modeling studies have utilized factor analysis (like principal component analysis, PCA) and/or Chemical Mass Balance (CMB) to assess source influences. The limitations with these approaches is that PCA is qualitative and CMB requires the input of source pr...

  15. Food patterns and dietary quality associated with organic food consumption during pregnancy; data from a large cohort of pregnant women in Norway.

    PubMed

    Torjusen, Hanne; Lieblein, Geir; Næs, Tormod; Haugen, Margaretha; Meltzer, Helle Margrete; Brantsæter, Anne Lise

    2012-08-06

    Little is known about the consumption of organic food during pregnancy. The aim of this study was to describe dietary characteristics associated with frequent consumption of organic food among pregnant women participating in the Norwegian Mother and Child Cohort Study (MoBa). The present study includes 63 808 women who during the years 2002-2007 answered two questionnaires, a general health questionnaire at gestational weeks 15 and a food frequency questionnaire at weeks 17-22. The exploration of food patterns by Principal component analyses (PCA) was followed by ANOVA analyses investigating how these food patterns as well as intake of selected food groups were associated with consumption of organic food. The first principal component (PC1) identified by PCA, accounting for 12% of the variation, was interpreted as a 'health and sustainability component', with high positive loadings for vegetables, fruit and berries, cooking oil, whole grain bread and cereal products and negative loadings for meat, including processed meat, white bread, and cakes and sweets. Frequent consumption of organic food, which was reported among 9.1% of participants (n = 5786), was associated with increased scores on the 'health and sustainability component' (p < 0.001). The increase in score represented approximately 1/10 of the total variation and was independent of sociodemographic and lifestyle characteristics. Participants with frequent consumption of organic food had a diet with higher density of fiber and most nutrients such as folate, beta-carotene and vitamin C, and lower density of sodium compared to participants with no or low organic consumption. The present study showed that pregnant Norwegian women reporting frequent consumption of organically produced food had dietary pattern and quality more in line with public advice for healthy and sustainable diets. A methodological implication is that the overall diet needs to be included in future studies of potential health outcomes related to consumption of organic food during pregnancy.

  16. Principal Components Analysis Studies of Martian Clouds

    NASA Astrophysics Data System (ADS)

    Klassen, D. R.; Bell, J. F., III

    2001-11-01

    We present the principal components analysis (PCA) of absolutely calibrated multi-spectral images of Mars as a function of Martian season. The PCA technique is a mathematical rotation and translation of the data from a brightness/wavelength space to a vector space of principal ``traits'' that lie along the directions of maximal variance. The first of these traits, accounting for over 90% of the data variance, is overall brightness and represented by an average Mars spectrum. Interpretation of the remaining traits, which account for the remaining ~10% of the variance, is not always the same and depends upon what other components are in the scene and thus, varies with Martian season. For example, during seasons with large amounts of water ice in the scene, the second trait correlates with the ice and anti-corrlates with temperature. We will investigate the interpretation of the second, and successive important PCA traits. Although these PCA traits are orthogonal in their own vector space, it is unlikely that any one trait represents a singular, mineralogic, spectral end-member. It is more likely that there are many spectral endmembers that vary identically to within the noise level, that the PCA technique will not be able to distinguish them. Another possibility is that similar absorption features among spectral endmembers may be tied to one PCA trait, for example ''amount of 2 \\micron\\ absorption''. We thus attempt to extract spectral endmembers by matching linear combinations of the PCA traits to USGS, JHU, and JPL spectral libraries as aquired through the JPL Aster project. The recovered spectral endmembers are then linearly combined to model the multi-spectral image set. We present here the spectral abundance maps of the water ice/frost endmember which allow us to track Martian clouds and ground frosts. This work supported in part through NASA Planetary Astronomy Grant NAG5-6776. All data gathered at the NASA Infrared Telescope Facility in collaboration with the telescope operators and with thanks to the support staff and day crew.

  17. Principal component analysis of the Norwegian version of the quality of life in late-stage dementia scale.

    PubMed

    Mjørud, Marit; Kirkevold, Marit; Røsvik, Janne; Engedal, Knut

    2014-01-01

    To investigate which factors the Quality of Life in Late-Stage Dementia (QUALID) scale holds when used among people with dementia (pwd) in nursing homes and to find out how the symptom load varies across the different severity levels of dementia. We included 661 pwd [mean age ± SD, 85.3 ± 8.6 years; 71.4% women]. The QUALID and the Clinical Dementia Rating (CDR) scale were applied. A principal component analysis (PCA) with varimax rotation and Kaiser normalization was applied to test the factor structure. Nonparametric analyses were applied to examine differences of symptom load across the three CDR groups. The mean QUALID score was 21.5 (±7.1), and the CDR scores of the three groups were 1 in 22.5%, 2 in 33.6% and 3 in 43.9%. The results of the statistical measures employed were the following: Crohnbach's α of QUALID, 0.74; Bartlett's test of sphericity, p <0.001; the Kaiser-Meyer-Olkin measure, 0.77. The PCA analysis resulted in three components accounting for 53% of the variance. The first component was 'tension' ('facial expression of discomfort', 'appears physically uncomfortable', 'verbalization suggests discomfort', 'being irritable and aggressive', 'appears calm', Crohnbach's α = 0.69), the second was 'well-being' ('smiles', 'enjoys eating', 'enjoys touching/being touched', 'enjoys social interaction', Crohnbach's α = 0.62) and the third was 'sadness' ('appears sad', 'cries', 'facial expression of discomfort', Crohnbach's α 0.65). The mean score on the components 'tension' and 'well-being' increased significantly with increasing severity levels of dementia. Three components of quality of life (qol) were identified. Qol decreased with increasing severity of dementia. © 2013 S. Karger AG, Basel.

  18. Low-rank plus sparse decomposition for exoplanet detection in direct-imaging ADI sequences. The LLSG algorithm

    NASA Astrophysics Data System (ADS)

    Gomez Gonzalez, C. A.; Absil, O.; Absil, P.-A.; Van Droogenbroeck, M.; Mawet, D.; Surdej, J.

    2016-05-01

    Context. Data processing constitutes a critical component of high-contrast exoplanet imaging. Its role is almost as important as the choice of a coronagraph or a wavefront control system, and it is intertwined with the chosen observing strategy. Among the data processing techniques for angular differential imaging (ADI), the most recent is the family of principal component analysis (PCA) based algorithms. It is a widely used statistical tool developed during the first half of the past century. PCA serves, in this case, as a subspace projection technique for constructing a reference point spread function (PSF) that can be subtracted from the science data for boosting the detectability of potential companions present in the data. Unfortunately, when building this reference PSF from the science data itself, PCA comes with certain limitations such as the sensitivity of the lower dimensional orthogonal subspace to non-Gaussian noise. Aims: Inspired by recent advances in machine learning algorithms such as robust PCA, we aim to propose a localized subspace projection technique that surpasses current PCA-based post-processing algorithms in terms of the detectability of companions at near real-time speed, a quality that will be useful for future direct imaging surveys. Methods: We used randomized low-rank approximation methods recently proposed in the machine learning literature, coupled with entry-wise thresholding to decompose an ADI image sequence locally into low-rank, sparse, and Gaussian noise components (LLSG). This local three-term decomposition separates the starlight and the associated speckle noise from the planetary signal, which mostly remains in the sparse term. We tested the performance of our new algorithm on a long ADI sequence obtained on β Pictoris with VLT/NACO. Results: Compared to a standard PCA approach, LLSG decomposition reaches a higher signal-to-noise ratio and has an overall better performance in the receiver operating characteristic space. This three-term decomposition brings a detectability boost compared to the full-frame standard PCA approach, especially in the small inner working angle region where complex speckle noise prevents PCA from discerning true companions from noise.

  19. [Identification of varieties of textile fibers by using Vis/NIR infrared spectroscopy technique].

    PubMed

    Wu, Gui-Fang; He, Yong

    2010-02-01

    The aim of the present paper was to provide new insight into Vis/NIR spectroscopic analysis of textile fibers. In order to achieve rapid identification of the varieties of fibers, the authors selected 5 kinds of fibers of cotton, flax, wool, silk and tencel to do a study with Vis/NIR spectroscopy. Firstly, the spectra of each kind of fiber were scanned by spectrometer, and principal component analysis (PCA) method was used to analyze the characteristics of the pattern of Vis/NIR spectra. Principal component scores scatter plot (PC1 x PC2 x PC3) of fiber indicated the classification effect of five varieties of fibers. The former 6 principal components (PCs) were selected according to the quantity and size of PCs. The PCA classification model was optimized by using the least-squares support vector machines (LS-SVM) method. The authors used the 6 PCs extracted by PCA as the inputs of LS-SVM, and PCA-LS-SVM model was built to achieve varieties validation as well as mathematical model building and optimization analysis. Two hundred samples (40 samples for each variety of fibers) of five varieties of fibers were used for calibration of PCA-LS-SVM model, and the other 50 samples (10 samples for each variety of fibers) were used for validation. The result of validation showed that Vis/NIR spectroscopy technique based on PCA-LS-SVM had a powerful classification capability. It provides a new method for identifying varieties of fibers rapidly and real time, so it has important significance for protecting the rights of consumers, ensuring the quality of textiles, and implementing rationalization production and transaction of textile materials and its production.

  20. TARGETED PRINCIPLE COMPONENT ANALYSIS: A NEW MOTION ARTIFACT CORRECTION APPROACH FOR NEAR-INFRARED SPECTROSCOPY

    PubMed Central

    YÜCEL, MERYEM A.; SELB, JULIETTE; COOPER, ROBERT J.; BOAS, DAVID A.

    2014-01-01

    As near-infrared spectroscopy (NIRS) broadens its application area to different age and disease groups, motion artifacts in the NIRS signal due to subject movement is becoming an important challenge. Motion artifacts generally produce signal fluctuations that are larger than physiological NIRS signals, thus it is crucial to correct for them before obtaining an estimate of stimulus evoked hemodynamic responses. There are various methods for correction such as principle component analysis (PCA), wavelet-based filtering and spline interpolation. Here, we introduce a new approach to motion artifact correction, targeted principle component analysis (tPCA), which incorporates a PCA filter only on the segments of data identified as motion artifacts. It is expected that this will overcome the issues of filtering desired signals that plagues standard PCA filtering of entire data sets. We compared the new approach with the most effective motion artifact correction algorithms on a set of data acquired simultaneously with a collodion-fixed probe (low motion artifact content) and a standard Velcro probe (high motion artifact content). Our results show that tPCA gives statistically better results in recovering hemodynamic response function (HRF) as compared to wavelet-based filtering and spline interpolation for the Velcro probe. It results in a significant reduction in mean-squared error (MSE) and significant enhancement in Pearson’s correlation coefficient to the true HRF. The collodion-fixed fiber probe with no motion correction performed better than the Velcro probe corrected for motion artifacts in terms of MSE and Pearson’s correlation coefficient. Thus, if the experimental study permits, the use of a collodion-fixed fiber probe may be desirable. If the use of a collodion-fixed probe is not feasible, then we suggest the use of tPCA in the processing of motion artifact contaminated data. PMID:25360181

  1. Principal component analysis vs. self-organizing maps combined with hierarchical clustering for pattern recognition in volcano seismic spectra

    NASA Astrophysics Data System (ADS)

    Unglert, K.; Radić, V.; Jellinek, A. M.

    2016-06-01

    Variations in the spectral content of volcano seismicity related to changes in volcanic activity are commonly identified manually in spectrograms. However, long time series of monitoring data at volcano observatories require tools to facilitate automated and rapid processing. Techniques such as self-organizing maps (SOM) and principal component analysis (PCA) can help to quickly and automatically identify important patterns related to impending eruptions. For the first time, we evaluate the performance of SOM and PCA on synthetic volcano seismic spectra constructed from observations during two well-studied eruptions at Klauea Volcano, Hawai'i, that include features observed in many volcanic settings. In particular, our objective is to test which of the techniques can best retrieve a set of three spectral patterns that we used to compose a synthetic spectrogram. We find that, without a priori knowledge of the given set of patterns, neither SOM nor PCA can directly recover the spectra. We thus test hierarchical clustering, a commonly used method, to investigate whether clustering in the space of the principal components and on the SOM, respectively, can retrieve the known patterns. Our clustering method applied to the SOM fails to detect the correct number and shape of the known input spectra. In contrast, clustering of the data reconstructed by the first three PCA modes reproduces these patterns and their occurrence in time more consistently. This result suggests that PCA in combination with hierarchical clustering is a powerful practical tool for automated identification of characteristic patterns in volcano seismic spectra. Our results indicate that, in contrast to PCA, common clustering algorithms may not be ideal to group patterns on the SOM and that it is crucial to evaluate the performance of these tools on a control dataset prior to their application to real data.

  2. Development and initial validation of an instrument to assess stressors among South African sports coaches.

    PubMed

    Kubayi, Alliance; Toriola, Abel; Didymus, Faye

    2018-06-01

    The aim of this series of studies was to develop and initially validate an instrument to assess stressors among South African sports coaches. In study one, a preliminary pool of 45 items was developed based on existing literature and an expert panel was employed to assess the content validity and applicability of these items. In study two, the 32 items that were retained after study one were analysed using principal component analysis (PCA). The resultant factorial structure comprised four components: environmental stressors, performance stressors, task-related stressors, and athlete stressors. These four components were made up of 26 items and, together, the components and items comprised the provisional Stressors in Sports Coaching Questionnaire (SSCQ). The results show that the SSCQ demonstrates acceptable internal consistency (.73-.89). The findings provide preliminary evidence that SSCQ is a valid tool to assess stressors among South African sports coaches.

  3. Inhibitory effect of eupatilin and jaceosidin isolated from Artemisia princeps in IgE-induced hypersensitivity.

    PubMed

    Lee, Seung Hoon; Bae, Eun-Ah; Park, Eun-Kyung; Shin, Yong-Wook; Baek, Nam-In; Han, Eun-Joo; Chung, Hae-Gon; Kim, Dong-Hyun

    2007-12-15

    To understand the antiallergic effect of Artemisia princeps (AP), which has been found to show inhibitory activity against degranulation and a passive cutaneous anaphylaxis (PCA) reaction, eupatilin and jaceosidin, as the active components, were isolated by degranulation-inhibitory activity-guided fractionation, with their antiallergic activity investigated. These isolated components potently inhibited the release of beta-hexosaminidase from RBL-2H3 cells induced by the IgE-antigen complex, with IC(50) values of 3.4 and 4.5muM, respectively. Eupatilin and jaceosidin potently inhibited the PCA reaction and scratching behaviors induced by IgE- antigen complex and compound 48/80, respectively. Orally administered jaceosidin more potently inhibited the PCA reaction than that of eupatilin, although the PCA reaction-inhibitory activity of intraperitoneally administered jaceosidin was nearly the same as that of eupatilin. Eupatilin and jaceosidin inhibited the gene expressions of TNF-alpha and IL-4 in RBL-2H3 cells stimulated by IgE-antigen complex. Eupatilin and jaceosidin inhibited the activation of NF-kB. Based on these findings, eupatilin and jaceosidin may be useful for protection from the PCA and itching reactions, which are IgE-mediated representative skin allergic diseases.

  4. Comparison of water extraction methods in Tibet based on GF-1 data

    NASA Astrophysics Data System (ADS)

    Jia, Lingjun; Shang, Kun; Liu, Jing; Sun, Zhongqing

    2018-03-01

    In this study, we compared four different water extraction methods with GF-1 data according to different water types in Tibet, including Support Vector Machine (SVM), Principal Component Analysis (PCA), Decision Tree Classifier based on False Normalized Difference Water Index (FNDWI-DTC), and PCA-SVM. The results show that all of the four methods can extract large area water body, but only SVM and PCA-SVM can obtain satisfying extraction results for small size water body. The methods were evaluated by both overall accuracy (OAA) and Kappa coefficient (KC). The OAA of PCA-SVM, SVM, FNDWI-DTC, PCA are 96.68%, 94.23%, 93.99%, 93.01%, and the KCs are 0.9308, 0.8995, 0.8962, 0.8842, respectively, in consistent with visual inspection. In summary, SVM is better for narrow rivers extraction and PCA-SVM is suitable for water extraction of various types. As for dark blue lakes, the methods using PCA can extract more quickly and accurately.

  5. Principal Component Analysis: A Method for Determining the Essential Dynamics of Proteins

    PubMed Central

    David, Charles C.; Jacobs, Donald J.

    2015-01-01

    It has become commonplace to employ principal component analysis to reveal the most important motions in proteins. This method is more commonly known by its acronym, PCA. While most popular molecular dynamics packages inevitably provide PCA tools to analyze protein trajectories, researchers often make inferences of their results without having insight into how to make interpretations, and they are often unaware of limitations and generalizations of such analysis. Here we review best practices for applying standard PCA, describe useful variants, discuss why one may wish to make comparison studies, and describe a set of metrics that make comparisons possible. In practice, one will be forced to make inferences about the essential dynamics of a protein without having the desired amount of samples. Therefore, considerable time is spent on describing how to judge the significance of results, highlighting pitfalls. The topic of PCA is reviewed from the perspective of many practical considerations, and useful recipes are provided. PMID:24061923

  6. Principal component analysis: a method for determining the essential dynamics of proteins.

    PubMed

    David, Charles C; Jacobs, Donald J

    2014-01-01

    It has become commonplace to employ principal component analysis to reveal the most important motions in proteins. This method is more commonly known by its acronym, PCA. While most popular molecular dynamics packages inevitably provide PCA tools to analyze protein trajectories, researchers often make inferences of their results without having insight into how to make interpretations, and they are often unaware of limitations and generalizations of such analysis. Here we review best practices for applying standard PCA, describe useful variants, discuss why one may wish to make comparison studies, and describe a set of metrics that make comparisons possible. In practice, one will be forced to make inferences about the essential dynamics of a protein without having the desired amount of samples. Therefore, considerable time is spent on describing how to judge the significance of results, highlighting pitfalls. The topic of PCA is reviewed from the perspective of many practical considerations, and useful recipes are provided.

  7. Cluster and principal component analysis based on SSR markers of Amomum tsao-ko in Jinping County of Yunnan Province

    NASA Astrophysics Data System (ADS)

    Ma, Mengli; Lei, En; Meng, Hengling; Wang, Tiantao; Xie, Linyan; Shen, Dong; Xianwang, Zhou; Lu, Bingyue

    2017-08-01

    Amomum tsao-ko is a commercial plant that used for various purposes in medicinal and food industries. For the present investigation, 44 germplasm samples were collected from Jinping County of Yunnan Province. Clusters analysis and 2-dimensional principal component analysis (PCA) was used to represent the genetic relations among Amomum tsao-ko by using simple sequence repeat (SSR) markers. Clustering analysis clearly distinguished the samples groups. Two major clusters were formed; first (Cluster I) consisted of 34 individuals, the second (Cluster II) consisted of 10 individuals, Cluster I as the main group contained multiple sub-clusters. PCA also showed 2 groups: PCA Group 1 included 29 individuals, PCA Group 2 included 12 individuals, consistent with the results of cluster analysis. The purpose of the present investigation was to provide information on genetic relationship of Amomum tsao-ko germplasm resources in main producing areas, also provide a theoretical basis for the protection and utilization of Amomum tsao-ko resources.

  8. Total Electron Content forecast model over Australia

    NASA Astrophysics Data System (ADS)

    Bouya, Zahra; Terkildsen, Michael; Francis, Matthew

    Ionospheric perturbations can cause serious propagation errors in modern radio systems such as Global Navigation Satellite Systems (GNSS). Forecasting ionospheric parameters is helpful to estimate potential degradation of the performance of these systems. Our purpose is to establish an Australian Regional Total Electron Content (TEC) forecast model at IPS. In this work we present an approach based on the combined use of the Principal Component Analysis (PCA) and Artificial Neural Network (ANN) to predict future TEC values. PCA is used to reduce the dimensionality of the original TEC data by mapping it into its eigen-space. In this process the top- 5 eigenvectors are chosen to reflect the directions of the maximum variability. An ANN approach was then used for the multicomponent prediction. We outline the design of the ANN model with its parameters. A number of activation functions along with different spectral ranges and different numbers of Principal Components (PCs) were tested to find the PCA-ANN models reaching the best results. Keywords: GNSS, Space Weather, Regional, Forecast, PCA, ANN.

  9. SESNPCA: Principal Component Analysis Applied to Stripped-Envelope Core-Collapse Supernovae

    NASA Astrophysics Data System (ADS)

    Williamson, Marc; Bianco, Federica; Modjaz, Maryam

    2018-01-01

    In the new era of time-domain astronomy, it will become increasingly important to have rigorous, data driven models for classifying transients, including supernovae (SNe). We present the first application of principal component analysis (PCA) to stripped-envelope core-collapse supernovae (SESNe). Previous studies of SNe types Ib, IIb, Ic, and broad-line Ic (Ic-BL) focus only on specific spectral features, while our PCA algorithm uses all of the information contained in each spectrum. We use one of the largest compiled datasets of SESNe, containing over 150 SNe, each with spectra taken at multiple phases. Our work focuses on 49 SNe with spectra taken 15 ± 5 days after maximum V-band light where better distinctions can be made between SNe type Ib and Ic spectra. We find that spectra of SNe type IIb and Ic-BL are separable from the other types in PCA space, indicating that PCA is a promising option for developing a purely data driven model for SESNe classification.

  10. Identification of regional activation by factorization of high-density surface EMG signals: A comparison of Principal Component Analysis and Non-negative Matrix factorization.

    PubMed

    Gallina, Alessio; Garland, S Jayne; Wakeling, James M

    2018-05-22

    In this study, we investigated whether principal component analysis (PCA) and non-negative matrix factorization (NMF) perform similarly for the identification of regional activation within the human vastus medialis. EMG signals from 64 locations over the VM were collected from twelve participants while performing a low-force isometric knee extension. The envelope of the EMG signal of each channel was calculated by low-pass filtering (8 Hz) the monopolar EMG signal after rectification. The data matrix was factorized using PCA and NMF, and up to 5 factors were considered for each algorithm. Association between explained variance, spatial weights and temporal scores between the two algorithms were compared using Pearson correlation. For both PCA and NMF, a single factor explained approximately 70% of the variance of the signal, while two and three factors explained just over 85% or 90%. The variance explained by PCA and NMF was highly comparable (R > 0.99). Spatial weights and temporal scores extracted with non-negative reconstruction of PCA and NMF were highly associated (all p < 0.001, mean R > 0.97). Regional VM activation can be identified using high-density surface EMG and factorization algorithms. Regional activation explains up to 30% of the variance of the signal, as identified through both PCA and NMF. Copyright © 2018 Elsevier Ltd. All rights reserved.

  11. Profiling contents of water-soluble metabolites and mineral nutrients to evaluate the effects of pesticides and organic and chemical fertilizers on tomato fruit quality.

    PubMed

    Watanabe, Masami; Ohta, Yuko; Licang, Sun; Motoyama, Naoki; Kikuchi, Jun

    2015-02-15

    In this study, the contents of water-soluble metabolites and mineral nutrients were measured in tomatoes cultured using organic and chemical fertilizers, with or without pesticides. Mineral nutrients and water-soluble metabolites were determined by inductively coupled plasma-atomic emission spectrometry and (1)H nuclear magnetic resonance spectrometry, respectively, and results were analysed by principal components analysis (PCA). The mineral nutrient and water-soluble metabolite profiles differed between organic and chemical fertilizer applications, which accounted for 88.0% and 55.4%, respectively, of the variation. (1)H-(13)C-hetero-nuclear single quantum coherence experiments identified aliphatic protons that contributed to the discrimination of PCA. Pesticide application had little effect on mineral nutrient content (except Fe and P), but affected the correlation between mineral nutrients and metabolites. Differences in the content of mineral nutrients and water-soluble metabolites resulting from different fertilizer and pesticide applications probably affect tomato quality. Copyright © 2014 Elsevier Ltd. All rights reserved.

  12. Determination of authenticity, regional origin, and vintage of Slovenian wines using a combination of IRMS and SNIF-NMR analyses.

    PubMed

    Ogrinc, N; Kosir, I J; Kocjancic, M; Kidric, J

    2001-03-01

    The authenticity and geographical origin of wines produced in Slovenia were investigated by a combination of IRMS and SNIF-NMR methods. A total of 102 grape samples of selected wines were carefully collected in three different wine-growing regions of Slovenia in 1996, 1997, and 1998. The stable isotope data were evaluated using principal component analysis (PCA) and linear discriminant analysis (LDA). The isotopic ratios to discriminate between coastal and continental regions are the deuterium/hydrogen isotopic ratio of the methylene site in the ethanol molecule (D/H)(II) and delta(13)C values; including also delta(18)O values in the PCA and LDA made possible separation between the two continental regions Drava and Sava. It was found that delta(18)O values are modified by the meteorological events during grape ripening and harvest. The usefulness of isotopic parameters for detecting adulteration or watering and to assess the geographical origin of wines is improved only when they are used concurrently.

  13. Factors affecting the abundance of selected fishes near oil and gas platforms in the northern Gulf of Mexico

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Stanley, D.R.; Wilson, C.A.

    1991-01-01

    A logbook program was initiated to determine the relative abundance of selected fish species around oil and gas platforms off the Louisiana coast. Logbooks were maintained by 55 anglers and 10 charterboat operators from March 1987 to March 1988. A total of 36,839 fish were caught representing over 46 different species. Principal component analysis (PCA) grouped the seventeen most abundant species into reef fish, pelagic fish, bluefish-red drum, Atlantic croaker-silver/sand seatrout, and cobia-shark-blue runner associations. Multiple regression analyses were used to compare PCA groupings to physical platform, temporal, geological, and angler characteristic variables and their interactions. Reef fish, Atlantic croaker,more » and silver/sand seatrout abundances were highest near large, structurally complex platforms in relatively deep water. High spotted seatrout abundances were correlated with small, unmanned oil and gas platforms in shallow water. Pelagic fish, bluefish, red drum, cobia, and shark abundances were not related to the physical parameters of the platforms.« less

  14. Antioxidant capacity of cornelian cherry (Cornus mas L.) - comparison between permanganate reducing antioxidant capacity and other antioxidant methods.

    PubMed

    Popović, Boris M; Stajner, Dubravka; Slavko, Kevrešan; Sandra, Bijelić

    2012-09-15

    Ethanol extracts (80% in water) of 10 cornelian cherry (Cornus mas L.) genotypes were studied for antioxidant properties, using methods including DPPH(), ()NO, O(2)(-) and ()OH antiradical powers, FRAP, total phenolic and anthocyanin content (TPC and ACC) and also one relatively new, permanganate method (permanganate reducing antioxidant capacity-PRAC). Lipid peroxidation (LP) was also determined as an indicator of oxidative stress. The data from different procedures were compared and analysed by multivariate techniques (correlation matrix calculation and principal component analysis (PCA)). Significant positive correlations were obtained between TPC, ACC and DPPH(), ()NO, O(2)(-), and ()OH antiradical powers, and also between PRAC and TPC, ACC and FRAP. PCA found two major clusters of cornelian cherry, based on antiradical power, FRAP and PRAC and also on chemical composition. Chemometric evaluation showed close interdependence between PRAC method and FRAP and ACC. There was a huge variation between C. mas genotypes in terms of antioxidant activity. Copyright © 2012 Elsevier Ltd. All rights reserved.

  15. A comparison between different coronagraphic data reduction techniques

    NASA Astrophysics Data System (ADS)

    Carolo, E.; Vassallo, D.; Farinato, J.; Bergomi, M.; Bonavita, M.; Carlotti, A.; D'Orazi, V.; Greggio, D.; Magrin, D.; Mesa, D.; Pinna, E.; Puglisi, A.; Stangalini, M.; Verinaud, C.; Viotto, V.

    2016-07-01

    A robust post processing technique is mandatory for analysing the coronagraphic high contrast imaging data. Angular Differential Imaging (ADI) and Principal Component Analysis (PCA) are the most used approaches to suppress the quasi-static structure presents in the Point Spread Function (PSF) for revealing planets at different separations from the host star. In this work, we present the comparison between ADI and PCA applied to System of coronagraphy with High order Adaptive optics from R to K band (SHARK-NIR), which will be implemented at Large Binocular Telescope (LBT). The comparison has been carried out by using as starting point the simulated wavefront residuals of the LBT Adaptive Optics (AO) system, in different observing conditions. Accurate tests for tuning the post processing parameters to obtain the best performance from each technique were performed in various seeing conditions (0:4"-1") for star magnitude ranging from 8 to 12, with particular care in finding the best compromise between quasi static speckle subtraction and planets detection.

  16. Analysis of seven salad rocket (Eruca sativa) accessions: The relationships between sensory attributes and volatile and non-volatile compounds.

    PubMed

    Bell, Luke; Methven, Lisa; Signore, Angelo; Oruna-Concha, Maria Jose; Wagstaff, Carol

    2017-03-01

    Sensory and chemical analyses were performed on accessions of rocket (Eruca sativa) to determine phytochemical influences on sensory attributes. A trained panel was used to evaluate leaves, and chemical data were obtained for polyatomic ions, amino acids, sugars and organic acids. These chemical data (and data of glucosinolates, flavonols and headspace volatiles previously reported) were used in Principal Component Analysis (PCA) to determine variables statistically important to sensory traits. Significant differences were observed between samples for polyatomic ion and amino acid concentrations. PCA revealed strong, positive correlations between glucosinolates, isothiocyanates and sulfur compounds with bitterness, mustard, peppery, warming and initial heat mouthfeel traits. The ratio between glucosinolates and sugars inferred reduced perception of bitter aftereffects. We highlight the diversity of E. sativa accessions from a sensory and phytochemical standpoint, and the potential for breeders to create varieties that are nutritionally and sensorially superior to existing ones. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.

  17. Libs-PCA based discrimination of Malaysian coins

    NASA Astrophysics Data System (ADS)

    Mustapha Imam, Auwal; Safwan Aziz, M.; Chaudhary, Kashif; Rizvi, Zuhaib; Ali, Jalil

    2018-05-01

    The investigations of currency coins dated back to many centuries. Many researchers developed an interest in the investigation of the coin’s weight, size, physical feature and elemental composition. Laser-induced breakdown spectroscopy (LIBS) has the novelty of analytical analyses of various samples. It has the ability for the elemental composition determination of samples of solid (including metals), liquid and/or gases. Malaysia as a country uses Ringgit as a currency, among which are coins of 10, 20 and 50 cents. These coins are in series of release from the Malaysian Central Bank from time to time. There are currently in circulation old and new coins of 5, 10, 20 and 50 cents coins. These coins differ in their physical features and are may be different also in their elemental composition. This paper presents the investigation of the differences in elemental composition between the old and new Malaysian coins of 10, 20 and 50 cents. Principal component analysis (PCA) was used to perform the discrimination between the coins from the LIBS spectra.

  18. Origin Discrimination of Osmanthus fragrans var. thunbergii Flowers using GC-MS and UPLC-PDA Combined with Multivariable Analysis Methods.

    PubMed

    Zhou, Fei; Zhao, Yajing; Peng, Jiyu; Jiang, Yirong; Li, Maiquan; Jiang, Yuan; Lu, Baiyi

    2017-07-01

    Osmanthus fragrans flowers are used as folk medicine and additives for teas, beverages and foods. The metabolites of O. fragrans flowers from different geographical origins were inconsistent in some extent. Chromatography and mass spectrometry combined with multivariable analysis methods provides an approach for discriminating the origin of O. fragrans flowers. To discriminate the Osmanthus fragrans var. thunbergii flowers from different origins with the identified metabolites. GC-MS and UPLC-PDA were conducted to analyse the metabolites in O. fragrans var. thunbergii flowers (in total 150 samples). Principal component analysis (PCA), soft independent modelling of class analogy analysis (SIMCA) and random forest (RF) analysis were applied to group the GC-MS and UPLC-PDA data. GC-MS identified 32 compounds common to all samples while UPLC-PDA/QTOF-MS identified 16 common compounds. PCA of the UPLC-PDA data generated a better clustering than PCA of the GC-MS data. Ten metabolites (six from GC-MS and four from UPLC-PDA) were selected as effective compounds for discrimination by PCA loadings. SIMCA and RF analysis were used to build classification models, and the RF model, based on the four effective compounds (caffeic acid derivative, acteoside, ligustroside and compound 15), yielded better results with the classification rate of 100% in the calibration set and 97.8% in the prediction set. GC-MS and UPLC-PDA combined with multivariable analysis methods can discriminate the origin of Osmanthus fragrans var. thunbergii flowers. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.

  19. Principal components derived from CSF inflammatory profiles predict outcome in survivors after severe traumatic brain injury.

    PubMed

    Kumar, Raj G; Rubin, Jonathan E; Berger, Rachel P; Kochanek, Patrick M; Wagner, Amy K

    2016-03-01

    Studies have characterized absolute levels of multiple inflammatory markers as significant risk factors for poor outcomes after traumatic brain injury (TBI). However, inflammatory marker concentrations are highly inter-related, and production of one may result in the production or regulation of another. Therefore, a more comprehensive characterization of the inflammatory response post-TBI should consider relative levels of markers in the inflammatory pathway. We used principal component analysis (PCA) as a dimension-reduction technique to characterize the sets of markers that contribute independently to variability in cerebrospinal (CSF) inflammatory profiles after TBI. Using PCA results, we defined groups (or clusters) of individuals (n=111) with similar patterns of acute CSF inflammation that were then evaluated in the context of outcome and other relevant CSF and serum biomarkers collected days 0-3 and 4-5 post-injury. We identified four significant principal components (PC1-PC4) for CSF inflammation from days 0-3, and PC1 accounted for the greatest (31%) percentage of variance. PC1 was characterized by relatively higher CSF sICAM-1, sFAS, IL-10, IL-6, sVCAM-1, IL-5, and IL-8 levels. Cluster analysis then defined two distinct clusters, such that individuals in cluster 1 had highly positive PC1 scores and relatively higher levels of CSF cortisol, progesterone, estradiol, testosterone, brain derived neurotrophic factor (BDNF), and S100b; this group also had higher serum cortisol and lower serum BDNF. Multinomial logistic regression analyses showed that individuals in cluster 1 had a 10.9 times increased likelihood of GOS scores of 2/3 vs. 4/5 at 6 months compared to cluster 2, after controlling for covariates. Cluster group did not discriminate between mortality compared to GOS scores of 4/5 after controlling for age and other covariates. Cluster groupings also did not discriminate mortality or 12 month outcomes in multivariate models. PCA and cluster analysis establish that a subset of CSF inflammatory markers measured in days 0-3 post-TBI may distinguish individuals with poor 6-month outcome, and future studies should prospectively validate these findings. PCA of inflammatory mediators after TBI could aid in prognostication and in identifying patient subgroups for therapeutic interventions. Copyright © 2015 Elsevier Inc. All rights reserved.

  20. Research on distributed heterogeneous data PCA algorithm based on cloud platform

    NASA Astrophysics Data System (ADS)

    Zhang, Jin; Huang, Gang

    2018-05-01

    Principal component analysis (PCA) of heterogeneous data sets can solve the problem that centralized data scalability is limited. In order to reduce the generation of intermediate data and error components of distributed heterogeneous data sets, a principal component analysis algorithm based on heterogeneous data sets under cloud platform is proposed. The algorithm performs eigenvalue processing by using Householder tridiagonalization and QR factorization to calculate the error component of the heterogeneous database associated with the public key to obtain the intermediate data set and the lost information. Experiments on distributed DBM heterogeneous datasets show that the model method has the feasibility and reliability in terms of execution time and accuracy.

  1. Stationary Wavelet-based Two-directional Two-dimensional Principal Component Analysis for EMG Signal Classification

    NASA Astrophysics Data System (ADS)

    Ji, Yi; Sun, Shanlin; Xie, Hong-Bo

    2017-06-01

    Discrete wavelet transform (WT) followed by principal component analysis (PCA) has been a powerful approach for the analysis of biomedical signals. Wavelet coefficients at various scales and channels were usually transformed into a one-dimensional array, causing issues such as the curse of dimensionality dilemma and small sample size problem. In addition, lack of time-shift invariance of WT coefficients can be modeled as noise and degrades the classifier performance. In this study, we present a stationary wavelet-based two-directional two-dimensional principal component analysis (SW2D2PCA) method for the efficient and effective extraction of essential feature information from signals. Time-invariant multi-scale matrices are constructed in the first step. The two-directional two-dimensional principal component analysis then operates on the multi-scale matrices to reduce the dimension, rather than vectors in conventional PCA. Results are presented from an experiment to classify eight hand motions using 4-channel electromyographic (EMG) signals recorded in healthy subjects and amputees, which illustrates the efficiency and effectiveness of the proposed method for biomedical signal analysis.

  2. Motor features in posterior cortical atrophy and their imaging correlates☆

    PubMed Central

    Ryan, Natalie S.; Shakespeare, Timothy J.; Lehmann, Manja; Keihaninejad, Shiva; Nicholas, Jennifer M.; Leung, Kelvin K.; Fox, Nick C.; Crutch, Sebastian J.

    2014-01-01

    Posterior cortical atrophy (PCA) is a neurodegenerative syndrome characterized by impaired higher visual processing skills; however, motor features more commonly associated with corticobasal syndrome may also occur. We investigated the frequency and clinical characteristics of motor features in 44 PCA patients and, with 30 controls, conducted voxel-based morphometry, cortical thickness, and subcortical volumetric analyses of their magnetic resonance imaging. Prominent limb rigidity was used to define a PCA-motor subgroup. A total of 30% (13) had PCA-motor; all demonstrating asymmetrical left upper limb rigidity. Limb apraxia was more frequent and asymmetrical in PCA-motor, as was myoclonus. Tremor and alien limb phenomena only occurred in this subgroup. The subgroups did not differ in neuropsychological test performance or apolipoprotein E4 allele frequency. Greater asymmetry of atrophy occurred in PCA-motor, particularly involving right frontoparietal and peri-rolandic cortices, putamen, and thalamus. The 9 patients (including 4 PCA-motor) with pathology or cerebrospinal fluid all showed evidence of Alzheimer's disease. Our data suggest that PCA patients with motor features have greater atrophy of contralateral sensorimotor areas but are still likely to have underlying Alzheimer's disease. PMID:25086839

  3. Germline BRCA mutations are associated with higher risk of nodal involvement, distant metastasis, and poor survival outcomes in prostate cancer.

    PubMed

    Castro, Elena; Goh, Chee; Olmos, David; Saunders, Ed; Leongamornlert, Daniel; Tymrakiewicz, Malgorzata; Mahmud, Nadiya; Dadaev, Tokhir; Govindasami, Koveela; Guy, Michelle; Sawyer, Emma; Wilkinson, Rosemary; Ardern-Jones, Audrey; Ellis, Steve; Frost, Debra; Peock, Susan; Evans, D Gareth; Tischkowitz, Marc; Cole, Trevor; Davidson, Rosemarie; Eccles, Diana; Brewer, Carole; Douglas, Fiona; Porteous, Mary E; Donaldson, Alan; Dorkins, Huw; Izatt, Louise; Cook, Jackie; Hodgson, Shirley; Kennedy, M John; Side, Lucy E; Eason, Jacqueline; Murray, Alex; Antoniou, Antonis C; Easton, Douglas F; Kote-Jarai, Zsofia; Eeles, Rosalind

    2013-05-10

    To analyze the baseline clinicopathologic characteristics of prostate tumors with germline BRCA1 and BRCA2 (BRCA1/2) mutations and the prognostic value of those mutations on prostate cancer (PCa) outcomes. This study analyzed the tumor features and outcomes of 2,019 patients with PCa (18 BRCA1 carriers, 61 BRCA2 carriers, and 1,940 noncarriers). The Kaplan-Meier method and Cox regression analysis were used to evaluate the associations between BRCA1/2 status and other PCa prognostic factors with overall survival (OS), cause-specific OS (CSS), CSS in localized PCa (CSS_M0), metastasis-free survival (MFS), and CSS from metastasis (CSS_M1). PCa with germline BRCA1/2 mutations were more frequently associated with Gleason ≥ 8 (P = .00003), T3/T4 stage (P = .003), nodal involvement (P = .00005), and metastases at diagnosis (P = .005) than PCa in noncarriers. CSS was significantly longer in noncarriers than in carriers (15.7 v 8.6 years, multivariable analyses [MVA] P = .015; hazard ratio [HR] = 1.8). For localized PCa, 5-year CSS and MFS were significantly higher in noncarriers (96% v 82%; MVA P = .01; HR = 2.6%; and 93% v 77%; MVA P = .009; HR = 2.7, respectively). Subgroup analyses confirmed the poor outcomes in BRCA2 patients, whereas the role of BRCA1 was not well defined due to the limited size and follow-up in this subgroup. Our results confirm that BRCA1/2 mutations confer a more aggressive PCa phenotype with a higher probability of nodal involvement and distant metastasis. BRCA mutations are associated with poor survival outcomes and this should be considered for tailoring clinical management of these patients.

  4. Germline BRCA Mutations Are Associated With Higher Risk of Nodal Involvement, Distant Metastasis, and Poor Survival Outcomes in Prostate Cancer

    PubMed Central

    Castro, Elena; Goh, Chee; Olmos, David; Saunders, Ed; Leongamornlert, Daniel; Tymrakiewicz, Malgorzata; Mahmud, Nadiya; Dadaev, Tokhir; Govindasami, Koveela; Guy, Michelle; Sawyer, Emma; Wilkinson, Rosemary; Ardern-Jones, Audrey; Ellis, Steve; Frost, Debra; Peock, Susan; Evans, D. Gareth; Tischkowitz, Marc; Cole, Trevor; Davidson, Rosemarie; Eccles, Diana; Brewer, Carole; Douglas, Fiona; Porteous, Mary E.; Donaldson, Alan; Dorkins, Huw; Izatt, Louise; Cook, Jackie; Hodgson, Shirley; Kennedy, M. John; Side, Lucy E.; Eason, Jacqueline; Murray, Alex; Antoniou, Antonis C.; Easton, Douglas F.; Kote-Jarai, Zsofia; Eeles, Rosalind

    2013-01-01

    Purpose To analyze the baseline clinicopathologic characteristics of prostate tumors with germline BRCA1 and BRCA2 (BRCA1/2) mutations and the prognostic value of those mutations on prostate cancer (PCa) outcomes. Patients and Methods This study analyzed the tumor features and outcomes of 2,019 patients with PCa (18 BRCA1 carriers, 61 BRCA2 carriers, and 1,940 noncarriers). The Kaplan-Meier method and Cox regression analysis were used to evaluate the associations between BRCA1/2 status and other PCa prognostic factors with overall survival (OS), cause-specific OS (CSS), CSS in localized PCa (CSS_M0), metastasis-free survival (MFS), and CSS from metastasis (CSS_M1). Results PCa with germline BRCA1/2 mutations were more frequently associated with Gleason ≥ 8 (P = .00003), T3/T4 stage (P = .003), nodal involvement (P = .00005), and metastases at diagnosis (P = .005) than PCa in noncarriers. CSS was significantly longer in noncarriers than in carriers (15.7 v 8.6 years, multivariable analyses [MVA] P = .015; hazard ratio [HR] = 1.8). For localized PCa, 5-year CSS and MFS were significantly higher in noncarriers (96% v 82%; MVA P = .01; HR = 2.6%; and 93% v 77%; MVA P = .009; HR = 2.7, respectively). Subgroup analyses confirmed the poor outcomes in BRCA2 patients, whereas the role of BRCA1 was not well defined due to the limited size and follow-up in this subgroup. Conclusion Our results confirm that BRCA1/2 mutations confer a more aggressive PCa phenotype with a higher probability of nodal involvement and distant metastasis. BRCA mutations are associated with poor survival outcomes and this should be considered for tailoring clinical management of these patients. PMID:23569316

  5. Application of principal component analysis to multispectral imaging data for evaluation of pigmented skin lesions

    NASA Astrophysics Data System (ADS)

    Jakovels, Dainis; Lihacova, Ilze; Kuzmina, Ilona; Spigulis, Janis

    2013-11-01

    Non-invasive and fast primary diagnostics of pigmented skin lesions is required due to frequent incidence of skin cancer - melanoma. Diagnostic potential of principal component analysis (PCA) for distant skin melanoma recognition is discussed. Processing of the measured clinical multi-spectral images (31 melanomas and 94 nonmalignant pigmented lesions) in the wavelength range of 450-950 nm by means of PCA resulted in 87 % sensitivity and 78 % specificity for separation between malignant melanomas and pigmented nevi.

  6. A new Integrated Negative Symptom structure of the Positive and Negative Syndrome Scale (PANSS) in schizophrenia using item response analysis.

    PubMed

    Khan, Anzalee; Lindenmayer, Jean-Pierre; Opler, Mark; Yavorsky, Christian; Rothman, Brian; Lucic, Luka

    2013-10-01

    Debate persists with regard to how best to categorize the syndromal dimension of negative symptoms in schizophrenia. The aim was to first review published Principle Components Analysis (PCA) of the PANSS, and extract items most frequently included in the negative domain, and secondly, to examine the quality of items using Item Response Theory (IRT) to select items that best represent a measurable dimension (or dimensions) of negative symptoms. First, 22 factor analyses and PCA met were included. Second, using a large dataset (n=7187) of participants in clinical trials with chronic schizophrenia, we extracted items loading on one or more PCA. Third, items not loading with a value of ≥ 0.5, or loading on more than one component with values of ≥ 0.5 were discarded. Fourth, resulting items were included in a non-parametric IRT and retained based on Option Characteristic Curves (OCCs) and Item Characteristic Curves (ICCs). 15 items loaded on a negative domain in at least one study, with Emotional Withdrawal loading on all studies. Non-parametric IRT retained nine items as an Integrated Negative Factor: Emotional Withdrawal, Blunted Affect, Passive/Apathetic Social Withdrawal, Poor Rapport, Lack of Spontaneity/Conversation Flow, Active Social Avoidance, Disturbance of Volition, Stereotyped Thinking and Difficulty in Abstract Thinking. This is the first study to use a psychometric IRT process to arrive at a set of negative symptom items. Future steps will include further examination of these nine items in terms of their stability, sensitivity to change, and correlations with functional and cognitive outcomes. © 2013 Elsevier B.V. All rights reserved.

  7. A principal component analysis of the relationship between the external body shape and internal skeleton for the upper body.

    PubMed

    Nerot, A; Skalli, W; Wang, X

    2016-10-03

    Recent progress in 3D scanning technologies allows easy access to 3D human body envelope. To create personalized human models with an articulated linkage for realistic re-posturing and motion analyses, an accurate estimation of internal skeleton points, including joint centers, from the external envelope is required. For this research project, 3D reconstructions of both internal skeleton and external envelope from low dose biplanar X-rays of 40 male adults were obtained. Using principal component analysis technique (PCA), a low-dimensional dataset was used to predict internal points of the upper body from the trunk envelope. A least squares method was used to find PC scores that fit the PCA-based model to the envelope of a new subject. To validate the proposed approach, estimated internal points were evaluated using a leave-one-out (LOO) procedure, i.e. successively considering each individual from our dataset as an extra-subject. In addition, different methods were proposed to reduce the variability in data and improve the performance of the PCA-based prediction. The best method was considered as the one providing the smallest errors between estimated and reference internal points with an average error of 8.3mm anterior-posteriorly, 6.7mm laterally and 6.5mm vertically. As the proposed approach relies on few or no bony landmarks, it could be easily applicable and generalizable to surface scans from any devices. Combined with automatic body scanning techniques, this study could potentially constitute a new step towards automatic generation of external/internal subject-specific manikins. Copyright © 2016 Elsevier Ltd. All rights reserved.

  8. Inter-comparison of receptor models for PM source apportionment: Case study in an industrial area

    NASA Astrophysics Data System (ADS)

    Viana, M.; Pandolfi, M.; Minguillón, M. C.; Querol, X.; Alastuey, A.; Monfort, E.; Celades, I.

    2008-05-01

    Receptor modelling techniques are used to identify and quantify the contributions from emission sources to the levels and major and trace components of ambient particulate matter (PM). A wide variety of receptor models are currently available, and consequently the comparability between models should be evaluated if source apportionment data are to be used as input in health effects studies or mitigation plans. Three of the most widespread receptor models (principal component analysis, PCA; positive matrix factorization, PMF; chemical mass balance, CMB) were applied to a single PM10 data set (n=328 samples, 2002-2005) obtained from an industrial area in NE Spain, dedicated to ceramic production. Sensitivity and temporal trend analyses (using the Mann-Kendall test) were applied. Results evidenced the good overall performance of the three models (r2>0.83 and α>0.91×between modelled and measured PM10 mass), with a good agreement regarding source identification and high correlations between input (CMB) and output (PCA, PMF) source profiles. Larger differences were obtained regarding the quantification of source contributions (up to a factor of 4 in some cases). The combined application of different types of receptor models would solve the limitations of each of the models, by constructing a more robust solution based on their strengths. The authors suggest the combined use of factor analysis techniques (PCA, PMF) to identify and interpret emission sources, and to obtain a first quantification of their contributions to the PM mass, and the subsequent application of CMB. Further research is needed to ensure that source apportionment methods are robust enough for application to PM health effects assessments.

  9. Statistical techniques applied to aerial radiometric surveys (STAARS): principal components analysis user's manual. [NURE program

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Koch, C.D.; Pirkle, F.L.; Schmidt, J.S.

    1981-01-01

    A Principal Components Analysis (PCA) has been written to aid in the interpretation of multivariate aerial radiometric data collected by the US Department of Energy (DOE) under the National Uranium Resource Evaluation (NURE) program. The variations exhibited by these data have been reduced and classified into a number of linear combinations by using the PCA program. The PCA program then generates histograms and outlier maps of the individual variates. Black and white plots can be made on a Calcomp plotter by the application of follow-up programs. All programs referred to in this guide were written for a DEC-10. From thismore » analysis a geologist may begin to interpret the data structure. Insight into geological processes underlying the data may be obtained.« less

  10. Short-term PV/T module temperature prediction based on PCA-RBF neural network

    NASA Astrophysics Data System (ADS)

    Li, Jiyong; Zhao, Zhendong; Li, Yisheng; Xiao, Jing; Tang, Yunfeng

    2018-02-01

    Aiming at the non-linearity and large inertia of temperature control in PV/T system, short-term temperature prediction of PV/T module is proposed, to make the PV/T system controller run forward according to the short-term forecasting situation to optimize control effect. Based on the analysis of the correlation between PV/T module temperature and meteorological factors, and the temperature of adjacent time series, the principal component analysis (PCA) method is used to pre-process the original input sample data. Combined with the RBF neural network theory, the simulation results show that the PCA method makes the prediction accuracy of the network model higher and the generalization performance stronger than that of the RBF neural network without the main component extraction.

  11. Structure-seeking multilinear methods for the analysis of fMRI data.

    PubMed

    Andersen, Anders H; Rayens, William S

    2004-06-01

    In comprehensive fMRI studies of brain function, the data structures often contain higher-order ways such as trial, task condition, subject, and group in addition to the intrinsic dimensions of time and space. While multivariate bilinear methods such as principal component analysis (PCA) have been used successfully for extracting information about spatial and temporal features in data from a single fMRI run, the need to unfold higher-order data sets into bilinear arrays has led to decompositions that are nonunique and to the loss of multiway linkages and interactions present in the data. These additional dimensions or ways can be retained in multilinear models to produce structures that are unique and which admit interpretations that are neurophysiologically meaningful. Multiway analysis of fMRI data from multiple runs of a bilateral finger-tapping paradigm was performed using the parallel factor (PARAFAC) model. A trilinear model was fitted to a data cube of dimensions voxels by time by run. Similarly, a quadrilinear model was fitted to a higher-way structure of dimensions voxels by time by trial by run. The spatial and temporal response components were extracted and validated by comparison to results from traditional SVD/PCA analyses based on scenarios of unfolding into lower-order bilinear structures.

  12. The Laschamp geomagnetic excursion featured in nitrate record from EPICA-Dome C ice core

    PubMed Central

    Traversi, R.; Becagli, S.; Poluianov, S.; Severi, M.; Solanki, S. K.; Usoskin, I. G.; Udisti, R.

    2016-01-01

    Here we present the first direct comparison of cosmogenic 10Be and chemical species in the period of 38–45.5 kyr BP spanning the Laschamp geomagnetic excursion from the EPICA-Dome C ice core. A principal component analysis (PCA) allowed to group different components as a function of the main sources, transport and deposition processes affecting the atmospheric aerosol at Dome C. Moreover, a wavelet analysis highlighted the high coherence and in-phase relationship between 10Be and nitrate at this time. The evident preferential association of 10Be with nitrate rather than with other chemical species was ascribed to the presence of a distinct source, here labelled as “cosmogenic”. Both the PCA and wavelet analyses ruled out a significant role of calcium in driving the 10Be and nitrate relationship, which is particularly relevant for a plateau site such as Dome C, especially in the glacial period during which the Laschamp excursion took place. The evidence that the nitrate record from the EDC ice core is able to capture the Laschamp event hints toward the possibility of using this marker for studying galactic cosmic ray flux variations and thus also major geomagnetic field excursions at pluri-centennial-millennial time scales, thus opening up new perspectives in paleoclimatic studies. PMID:26819064

  13. Comparison of source apportionment of PM2.5 using receptor models in the main hub port city of East Asia: Busan

    NASA Astrophysics Data System (ADS)

    Jeong, Ju-Hee; Shon, Zang-Ho; Kang, Minsung; Song, Sang-Keun; Kim, Yoo-Keun; Park, Jinsoo; Kim, Hyunjae

    2017-01-01

    The contributions of various PM2.5 emission sources to ambient PM2.5 levels during 2013 in the main hub port city (Busan, South Korea) of East Asia was quantified using several receptor modeling techniques. Three receptor models of principal component analysis/absolute principal component score (PCA/APCS), positive matrix factorization (PMF), and chemical mass balance (CMB) were used to apportion the source of PM2.5 obtained from the target city. The results of the receptor models indicated that the secondary formation of PM2.5 was the dominant (45-60%) contributor to PM2.5 levels in the port city of Busan. The PMF and PCA/APCS suggested that ship emission was a non-negligible contributor of PM2.5 (up to about 10%) in the study area, whereas it was a negligible contributor based on CMB. The magnitude of source contribution estimates to PM2.5 levels differed significantly among these three models due to their limitations (e.g., PM2.5 emission source profiles and restrictions of the models). Potential source contribution function and concentration-weighted trajectory analyses indicated that long-range transport from sources in the eastern China and Yellow Sea contributed significantly to the level of PM2.5 in Busan.

  14. The added value of percentage of free to total prostate-specific antigen, PCA3, and a kallikrein panel to the ERSPC risk calculator for prostate cancer in prescreened men.

    PubMed

    Vedder, Moniek M; de Bekker-Grob, Esther W; Lilja, Hans G; Vickers, Andrew J; van Leenders, Geert J L H; Steyerberg, Ewout W; Roobol, Monique J

    2014-12-01

    Prostate-specific antigen (PSA) testing has limited accuracy for the early detection of prostate cancer (PCa). To assess the value added by percentage of free to total PSA (%fPSA), prostate cancer antigen 3 (PCA3), and a kallikrein panel (4k-panel) to the European Randomised Study of Screening for Prostate Cancer (ERSPC) multivariable prediction models: risk calculator (RC) 4, including transrectal ultrasound, and RC 4 plus digital rectal examination (4+DRE) for prescreened men. Participants were invited for rescreening between October 2007 and February 2009 within the Dutch part of the ERSPC study. Biopsies were taken in men with a PSA level ≥3.0 ng/ml or a PCA3 score ≥10. Additional analyses of the 4k-panel were done on serum samples. Outcome was defined as PCa detectable by sextant biopsy. Receiver operating characteristic curve and decision curve analyses were performed to compare the predictive capabilities of %fPSA, PCA3, 4k-panel, the ERSPC RCs, and their combinations in logistic regression models. PCa was detected in 119 of 708 men. The %fPSA did not perform better univariately or added to the RCs compared with the RCs alone. In 202 men with an elevated PSA, the 4k-panel discriminated better than PCA3 when modelled univariately (area under the curve [AUC]: 0.78 vs. 0.62; p=0.01). The multivariable models with PCA3 or the 4k-panel were equivalent (AUC: 0.80 for RC 4+DRE). In the total population, PCA3 discriminated better than the 4k-panel (univariate AUC: 0.63 vs. 0.56; p=0.05). There was no statistically significant difference between the multivariable model with PCA3 (AUC: 0.73) versus the model with the 4k-panel (AUC: 0.71; p=0.18). The multivariable model with PCA3 performed better than the reference model (0.73 vs. 0.70; p=0.02). Decision curves confirmed these patterns, although numbers were small. Both PCA3 and, to a lesser extent, a 4k-panel have added value to the DRE-based ERSPC RC in detecting PCa in prescreened men. We studied the added value of novel biomarkers to previously developed risk prediction models for prostate cancer. We found that inclusion of these biomarkers resulted in an increase in predictive ability. Copyright © 2014. Published by Elsevier B.V.

  15. Understanding deformation mechanisms during powder compaction using principal component analysis of compression data.

    PubMed

    Roopwani, Rahul; Buckner, Ira S

    2011-10-14

    Principal component analysis (PCA) was applied to pharmaceutical powder compaction. A solid fraction parameter (SF(c/d)) and a mechanical work parameter (W(c/d)) representing irreversible compression behavior were determined as functions of applied load. Multivariate analysis of the compression data was carried out using PCA. The first principal component (PC1) showed loadings for the solid fraction and work values that agreed with changes in the relative significance of plastic deformation to consolidation at different pressures. The PC1 scores showed the same rank order as the relative plasticity ranking derived from the literature for common pharmaceutical materials. The utility of PC1 in understanding deformation was extended to binary mixtures using a subset of the original materials. Combinations of brittle and plastic materials were characterized using the PCA method. The relationships between PC1 scores and the weight fractions of the mixtures were typically linear showing ideal mixing in their deformation behaviors. The mixture consisting of two plastic materials was the only combination to show a consistent positive deviation from ideality. The application of PCA to solid fraction and mechanical work data appears to be an effective means of predicting deformation behavior during compaction of simple powder mixtures. Copyright © 2011 Elsevier B.V. All rights reserved.

  16. Principal Component Analysis in the Spectral Analysis of the Dynamic Laser Speckle Patterns

    NASA Astrophysics Data System (ADS)

    Ribeiro, K. M.; Braga, R. A., Jr.; Horgan, G. W.; Ferreira, D. D.; Safadi, T.

    2014-02-01

    Dynamic laser speckle is a phenomenon that interprets an optical patterns formed by illuminating a surface under changes with coherent light. Therefore, the dynamic change of the speckle patterns caused by biological material is known as biospeckle. Usually, these patterns of optical interference evolving in time are analyzed by graphical or numerical methods, and the analysis in frequency domain has also been an option, however involving large computational requirements which demands new approaches to filter the images in time. Principal component analysis (PCA) works with the statistical decorrelation of data and it can be used as a data filtering. In this context, the present work evaluated the PCA technique to filter in time the data from the biospeckle images aiming the reduction of time computer consuming and improving the robustness of the filtering. It was used 64 images of biospeckle in time observed in a maize seed. The images were arranged in a data matrix and statistically uncorrelated by PCA technique, and the reconstructed signals were analyzed using the routine graphical and numerical methods to analyze the biospeckle. Results showed the potential of the PCA tool in filtering the dynamic laser speckle data, with the definition of markers of principal components related to the biological phenomena and with the advantage of fast computational processing.

  17. Proton Nuclear Magnetic Resonance-Spectroscopic Discrimination of Wines Reflects Genetic Homology of Several Different Grape (V. vinifera L.) Cultivars.

    PubMed

    Hu, Boran; Yue, Yaqing; Zhu, Yong; Wen, Wen; Zhang, Fengmin; Hardie, Jim W

    2015-01-01

    Proton nuclear magnetic resonance spectroscopy coupled multivariate analysis (1H NMR-PCA/PLS-DA) is an important tool for the discrimination of wine products. Although 1H NMR has been shown to discriminate wines of different cultivars, a grape genetic component of the discrimination has been inferred only from discrimination of cultivars of undefined genetic homology and in the presence of many confounding environmental factors. We aimed to confirm the influence of grape genotypes in the absence of those factors. We applied 1H NMR-PCA/PLS-DA and hierarchical cluster analysis (HCA) to wines from five, variously genetically-related grapevine (V. vinifera) cultivars; all grown similarly on the same site and vinified similarly. We also compared the semi-quantitative profiles of the discriminant metabolites of each cultivar with previously reported chemical analyses. The cultivars were clearly distinguishable and there was a general correlation between their grouping and their genetic homology as revealed by recent genomic studies. Between cultivars, the relative amounts of several of the cultivar-related discriminant metabolites conformed closely with reported chemical analyses. Differences in grape-derived metabolites associated with genetic differences alone are a major source of 1H NMR-based discrimination of wines and 1H NMR has the capacity to discriminate between very closely related cultivars. The study confirms that genetic variation among grape cultivars alone can account for the discrimination of wine by 1H NMR-PCA/PLS and indicates that 1H NMR spectra of wine of single grape cultivars may in future be used in tandem with hierarchical cluster analysis to elucidate genetic lineages and metabolomic relations of grapevine cultivars. In the absence of genetic information, for example, where predecessor varieties are no longer extant, this may be a particularly useful approach.

  18. SU-F-J-138: An Extension of PCA-Based Respiratory Deformation Modeling Via Multi-Linear Decomposition

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Iliopoulos, AS; Sun, X; Pitsianis, N

    Purpose: To address and lift the limited degree of freedom (DoF) of globally bilinear motion components such as those based on principal components analysis (PCA), for encoding and modeling volumetric deformation motion. Methods: We provide a systematic approach to obtaining a multi-linear decomposition (MLD) and associated motion model from deformation vector field (DVF) data. We had previously introduced MLD for capturing multi-way relationships between DVF variables, without being restricted by the bilinear component format of PCA-based models. PCA-based modeling is commonly used for encoding patient-specific deformation as per planning 4D-CT images, and aiding on-board motion estimation during radiotherapy. However, themore » bilinear space-time decomposition inherently limits the DoF of such models by the small number of respiratory phases. While this limit is not reached in model studies using analytical or digital phantoms with low-rank motion, it compromises modeling power in the presence of relative motion, asymmetries and hysteresis, etc, which are often observed in patient data. Specifically, a low-DoF model will spuriously couple incoherent motion components, compromising its adaptability to on-board deformation changes. By the multi-linear format of extracted motion components, MLD-based models can encode higher-DoF deformation structure. Results: We conduct mathematical and experimental comparisons between PCA- and MLD-based models. A set of temporally-sampled analytical trajectories provides a synthetic, high-rank DVF; trajectories correspond to respiratory and cardiac motion factors, including different relative frequencies and spatial variations. Additionally, a digital XCAT phantom is used to simulate a lung lesion deforming incoherently with respect to the body, which adheres to a simple respiratory trend. In both cases, coupling of incoherent motion components due to a low model DoF is clearly demonstrated. Conclusion: Multi-linear decomposition can enable decoupling of distinct motion factors in high-rank DVF measurements. This may improve motion model expressiveness and adaptability to on-board deformation, aiding model-based image reconstruction for target verification. NIH Grant No. R01-184173.« less

  19. Comparing sugar components of cereal and pseudocereal flour by GC-MS analysis.

    PubMed

    Ačanski, Marijana M; Vujić, Djura N

    2014-02-15

    Gas chromatography with mass spectrometry was used for carrying out a qualitative analysis of the ethanol soluble flour extract of different types of cereals bread wheat and spelt and pseudocereals (amaranth and buckwheat). TMSI (trimethylsilylimidazole) was used as a reagent for the derivatisation of carbohydrates into trimethylsilyl ethers. All samples were first defatted with hexane. (In our earlier investigations, hexane extracts were used for the analysis of fatty acid of lipid components.) Many components of pentoses, hexoses and disaccharides were identified using 73 and 217 Da mass and the Wiley Online Library search. The aim of this paper is not to identify and find new components, but to compare sugar components of tested samples of flour of cereals bread wheat and spelt and pseudocereals (amarnath and buckwheat). Results were analysed using descriptive statistics (dendrograms and PCA). The results show that this method can be used for making a distinction among different types of flour. Copyright © 2013 Elsevier Ltd. All rights reserved.

  20. European Population Genetic Substructure: Further Definition of Ancestry Informative Markers for Distinguishing Among Diverse European Ethnic Groups

    PubMed Central

    Tian, Chao; Kosoy, Roman; Nassir, Rami; Lee, Annette; Villoslada, Pablo; Klareskog, Lars; Hammarström, Lennart; Garchon, Henri-Jean; Pulver, Ann E.; Ransom, Michael; Gregersen, Peter K.; Seldin, Michael F.

    2009-01-01

    The definition of European population genetic substructure and its application to understanding complex phenotypes is becoming increasingly important. In the current study using over 4000 subjects genotyped for 300 thousand SNPs we provide further insight into relationships among European population groups and identify sets of SNP ancestry informative markers (AIMs) for application in genetic studies. In general, the graphical description of these principal components analyses (PCA) of diverse European subjects showed a strong correspondence to the geographical relationships of specific countries or regions of origin. Clearer separation of different ethnic and regional populations was observed when northern and southern European groups were considered separately and the PCA results were influenced by the inclusion or exclusion of different self-identified population groups including Ashkenazi Jewish, Sardinian and Orcadian ethnic groups. SNP AIM sets were identified that could distinguish the regional and ethnic population groups. Moreover, the studies demonstrated that most allele frequency differences between different European groups could be effectively controlled in analyses using these AIM sets. The European substructure AIMs should be widely applicable to ongoing studies to confirm and delineate specific disease susceptibility candidate regions without the necessity to perform additional genome-wide SNP studies in additional subject sets. PMID:19707526

  1. European population genetic substructure: further definition of ancestry informative markers for distinguishing among diverse European ethnic groups.

    PubMed

    Tian, Chao; Kosoy, Roman; Nassir, Rami; Lee, Annette; Villoslada, Pablo; Klareskog, Lars; Hammarström, Lennart; Garchon, Henri-Jean; Pulver, Ann E; Ransom, Michael; Gregersen, Peter K; Seldin, Michael F

    2009-01-01

    The definition of European population genetic substructure and its application to understanding complex phenotypes is becoming increasingly important. In the current study using over 4,000 subjects genotyped for 300,000 single-nucleotide polymorphisms (SNPs), we provide further insight into relationships among European population groups and identify sets of SNP ancestry informative markers (AIMs) for application in genetic studies. In general, the graphical description of these principal components analyses (PCA) of diverse European subjects showed a strong correspondence to the geographical relationships of specific countries or regions of origin. Clearer separation of different ethnic and regional populations was observed when northern and southern European groups were considered separately and the PCA results were influenced by the inclusion or exclusion of different self-identified population groups including Ashkenazi Jewish, Sardinian, and Orcadian ethnic groups. SNP AIM sets were identified that could distinguish the regional and ethnic population groups. Moreover, the studies demonstrated that most allele frequency differences between different European groups could be controlled effectively in analyses using these AIM sets. The European substructure AIMs should be widely applicable to ongoing studies to confirm and delineate specific disease susceptibility candidate regions without the necessity of performing additional genome-wide SNP studies in additional subject sets.

  2. Validity of a New Patient Engagement Measure: The Altarum Consumer Engagement (ACE) Measure.

    PubMed

    Duke, Christopher C; Lynch, Wendy D; Smith, Brad; Winstanley, Julie

    2015-12-01

    The objective of this study was to report on the validation of new scales [called the Altarum Consumer Engagement (ACE) Measure] that are indicative of an individual's engagement in health and healthcare decisions. The instrument was created to broaden the scope of how engagement is measured and understood, and to update the concept of engagement to include modern information sources, such as online health resources and ratings of providers and patient health. Data were collected through an online survey with a US population of 2079 participants. A combination of Principal Component Analysis (PCA) and detailed Rasch analyses were conducted to identify specific subscales of engagement. Results were compared to another commonly used survey instrument, and outcomes were compared for construct validity. The PCA identified a four-factor structure composed of 21 items. The factors were named Commitment, Informed Choice, Navigation, and Ownership. Rasch analyses confirmed scale stability. Relevant outcomes were correlated in the expected direction, such as health status, lifestyle behaviors, medication adherence, and observed expected group differences. This study confirmed the validity of the new ACE Measure and its utility in screening for and finding group differences in activities related to patient engagement and health consumerism, such as using provider comparison tools and asking about medical costs.

  3. Evaluation of Food Freshness and Locality by Odor Sensor

    NASA Astrophysics Data System (ADS)

    Koike, Takayuki; Shimada, Koji; Kamimura, Hironobu; Kaneki, Noriaki

    The aim of this study was to investigate whether food freshness and locality can be classified using a food evaluation system consisting four SnO2-semiconductor gas sensors and a solid phase column, into which collecting aroma materials. The temperature of sensors was periodically changed to be in unsteady state and thus, the sensor information was increased. The parameters (in quefrency band) were extracted from sensor information using cepstrum analysis that enable to separate superimposed information on sinusoidal wave. The quefrency was used as parameters for principal component and discriminant analyses (PCA and DCA) to detect food freshness and food localities. We used three kinds of strawberries, people can perceive its odors, passed from one to three days after harvest, and kelps and Ceylon tea, people are hardly to perceive its odor, corrected from five areas as sample. Then, the deterioration of strawberries and localities of kelps and Ceylon teas were visually evaluated using the numerical analyses. While the deteriorations were classified using PCA or DCA, the localities were classified only by DCA. The findings indicate that, although odorant intensity influenced the method detecting food quality, the quefrency obtained from odorant information using cepstrum analysis were available to detect the difference in the freshness and the localities of foods.

  4. Demixed principal component analysis of neural population data

    PubMed Central

    Kobak, Dmitry; Brendel, Wieland; Constantinidis, Christos; Feierstein, Claudia E; Kepecs, Adam; Mainen, Zachary F; Qi, Xue-Lian; Romo, Ranulfo; Uchida, Naoshige; Machens, Christian K

    2016-01-01

    Neurons in higher cortical areas, such as the prefrontal cortex, are often tuned to a variety of sensory and motor variables, and are therefore said to display mixed selectivity. This complexity of single neuron responses can obscure what information these areas represent and how it is represented. Here we demonstrate the advantages of a new dimensionality reduction technique, demixed principal component analysis (dPCA), that decomposes population activity into a few components. In addition to systematically capturing the majority of the variance of the data, dPCA also exposes the dependence of the neural representation on task parameters such as stimuli, decisions, or rewards. To illustrate our method we reanalyze population data from four datasets comprising different species, different cortical areas and different experimental tasks. In each case, dPCA provides a concise way of visualizing the data that summarizes the task-dependent features of the population response in a single figure. DOI: http://dx.doi.org/10.7554/eLife.10989.001 PMID:27067378

  5. Ionospheric total electron content anomalies due to Typhoon Nakri on 29 May 2008: A nonlinear principal component analysis

    NASA Astrophysics Data System (ADS)

    Lin, Jyh-Woei

    2012-09-01

    This paper uses Nonlinear Principal Component Analysis (NLPCA) and Principal Component Analysis (PCA) to determine Total Electron Content (TEC) anomalies in the ionosphere for the Nakri Typhoon on 29 May, 2008 (UTC). NLPCA, PCA and image processing are applied to the global ionospheric map (GIM) with transforms conducted for the time period 12:00-14:00 UT on 29 May 2008 when the wind was most intense. Results show that at a height of approximately 150-200 km the TEC anomaly using NLPCA is more localized; however its intensity increases with height and becomes more widespread. The TEC anomalies are not found by PCA. Potential causes of the results are discussed with emphasis given to vertical acoustic gravity waves. The approximate position of the typhoon's eye can be detected if the GIM is divided into fine enough maps with adequate spatial-resolution at GPS-TEC receivers. This implies that the trace of the typhoon in the regional GIM is caught using NLPCA.

  6. ARLTS1 and Prostate Cancer Risk - Analysis of Expression and Regulation

    PubMed Central

    Siltanen, Sanna; Fischer, Daniel; Rantapero, Tommi; Laitinen, Virpi; Mpindi, John Patrick; Kallioniemi, Olli; Wahlfors, Tiina; Schleutker, Johanna

    2013-01-01

    Prostate cancer (PCa) is a heterogeneous trait for which several susceptibility loci have been implicated by genome-wide linkage and association studies. The genomic region 13q14 is frequently deleted in tumour tissues of both sporadic and familial PCa patients and is consequently recognised as a possible locus of tumour suppressor gene(s). Deletions of this region have been found in many other cancers. Recently, we showed that homozygous carriers for the T442C variant of the ARLTS1 gene (ADP-ribosylation factor-like tumour suppressor protein 1 or ARL11, located at 13q14) are associated with an increased risk for both unselected and familial PCa. Furthermore, the variant T442C was observed in greater frequency among malignant tissue samples, PCa cell lines and xenografts, supporting its role in PCa tumourigenesis. In this study, 84 PCa cases and 15 controls were analysed for ARLTS1 expression status in blood-derived RNA. A statistically significant (p = 0.0037) decrease of ARLTS1 expression in PCa cases was detected. Regulation of ARLTS1 expression was analysed with eQTL (expression quantitative trait loci) methods. Altogether fourteen significant cis-eQTLs affecting the ARLTS1 expression level were found. In addition, epistatic interactions of ARLTS1 genomic variants with genes involved in immune system processes were predicted with the MDR program. In conclusion, this study further supports the role of ARLTS1 as a tumour suppressor gene and reveals that the expression is regulated through variants localised in regulatory regions. PMID:23940804

  7. Subject order-independent group ICA (SOI-GICA) for functional MRI data analysis.

    PubMed

    Zhang, Han; Zuo, Xi-Nian; Ma, Shuang-Ye; Zang, Yu-Feng; Milham, Michael P; Zhu, Chao-Zhe

    2010-07-15

    Independent component analysis (ICA) is a data-driven approach to study functional magnetic resonance imaging (fMRI) data. Particularly, for group analysis on multiple subjects, temporally concatenation group ICA (TC-GICA) is intensively used. However, due to the usually limited computational capability, data reduction with principal component analysis (PCA: a standard preprocessing step of ICA decomposition) is difficult to achieve for a large dataset. To overcome this, TC-GICA employs multiple-stage PCA data reduction. Such multiple-stage PCA data reduction, however, leads to variable outputs due to different subject concatenation orders. Consequently, the ICA algorithm uses the variable multiple-stage PCA outputs and generates variable decompositions. In this study, a rigorous theoretical analysis was conducted to prove the existence of such variability. Simulated and real fMRI experiments were used to demonstrate the subject-order-induced variability of TC-GICA results using multiple PCA data reductions. To solve this problem, we propose a new subject order-independent group ICA (SOI-GICA). Both simulated and real fMRI data experiments demonstrated the high robustness and accuracy of the SOI-GICA results compared to those of traditional TC-GICA. Accordingly, we recommend SOI-GICA for group ICA-based fMRI studies, especially those with large data sets. Copyright 2010 Elsevier Inc. All rights reserved.

  8. A study of anatomy of distal femur pertaining to total knee replacement: an analysis, conclusions and recommendations.

    PubMed

    Kumar, K; Sharma, D

    2018-04-01

    Multiple landmarks including the transepicondylar axis (TEA), posterior condylar axis (PCA) and anterior trochlear line (TL) have been used to set up the femoral component rotation, but each is faced with its own practical obstacle that limits its usage. Also a common practice is to set the femoral component rotation at 3° external rotation to PCA and valgus resection angle at 5°-7° to anatomical axis of femur. For the reason that the anatomy of each knee is different, it may not be justified to practice such a set protocol in all cases. The aim of the study was to compare the anatomical landmarks used to set up the femoral component rotation and to study the variability in the different anatomical relationships relevant to total knee replacement. The study had 52 patients (94 knees) with grade IV osteoarthritis. Full-length lower limb scanogram and 1 mm cross-sectional cuts of distal femur were taken. aTEA, sTEA, PCL, TL, CTA, PCA, TLA and valgus angles were taken for all knees. aTEA is identifiable in all cases but sTEA in only 59 knees (62.77%). Correspondingly, CTA is calculable in all knees and PCA in 62.77% cases. Mean CTA and mean PCA were 5.4° ± 1.88° SD and 0.71° ± 1.95° SD, respectively. Mean angle between aTEA and sTEA was 4.88. TL is a line difficult to draw because of high incidence of anterior osteophytes, making CTA a more reliable parameter than TLA. Mean TLA was 10.31° ± 3.52° SD. Mean valgus resection angle was 4.86° ± 2.53° SD. Gender- or side-based differences in any of these values were not statistically different. Using aTEA or sTEA can make a big difference in femoral component rotation; therefore, whether aTEA or sTEA should be used needs to be further investigated. CTA, PCA and valgus resection angle need to be individually calculated for each knee. Use of TLA is not recommended.

  9. [Identification of varieties of cashmere by Vis/NIR spectroscopy technology based on PCA-SVM].

    PubMed

    Wu, Gui-Fang; He, Yong

    2009-06-01

    One mixed algorithm was presented to discriminate cashmere varieties with principal component analysis (PCA) and support vector machine (SVM). Cashmere fiber has such characteristics as threadlike, softness, glossiness and high tensile strength. The quality characters and economic value of each breed of cashmere are very different. In order to safeguard the consumer's rights and guarantee the quality of cashmere product, quickly, efficiently and correctly identifying cashmere has significant meaning to the production and transaction of cashmere material. The present research adopts Vis/NIRS spectroscopy diffuse techniques to collect the spectral data of cashmere. The near infrared fingerprint of cashmere was acquired by principal component analysis (PCA), and support vector machine (SVM) methods were used to further identify the cashmere material. The result of PCA indicated that the score map made by the scores of PC1, PC2 and PC3 was used, and 10 principal components (PCs) were selected as the input of support vector machine (SVM) based on the reliabilities of PCs of 99.99%. One hundred cashmere samples were used for calibration and the remaining 75 cashmere samples were used for validation. A one-against-all multi-class SVM model was built, the capabilities of SVM with different kernel function were comparatively analyzed, and the result showed that SVM possessing with the Gaussian kernel function has the best identification capabilities with the accuracy of 100%. This research indicated that the data mining method of PCA-SVM has a good identification effect, and can work as a new method for rapid identification of cashmere material varieties.

  10. Validation of decisional balance and self-efficacy measures for HPV vaccination in college women.

    PubMed

    Lipschitz, Jessica M; Fernandez, Anne C; Larson, H Elsa; Blaney, Cerissa L; Meier, Kathy S; Redding, Colleen A; Prochaska, James O; Paiva, Andrea L

    2013-01-01

    Women younger than 25 years are at greatest risk for human papillomavirus (HPV) infection, including high-risk strains associated with 70% of cervical cancers. Effective model-based measures that can lead to intervention development to increase HPV vaccination rates are necessary. This study validated Transtheoretical Model measures of Decisional Balance and Self-Efficacy for seeking the HPV vaccine in a sample of female college students. Cross-sectional measurement development. Setting. Online survey of undergraduate college students. A total of 340 female students ages 18 to 26 years. Stage of Change, Decisional Balance, and Self-Efficacy. The sample was randomly split into halves for exploratory principal components analyses (PCAs), followed by confirmatory factor analyses (CFAs) to test measurement models. Multivariate analyses examined relationships between constructs. For Decisional Balance, PCA indicated two 4-item factors (Pros -α = .90; and Cons -α = .66). CFA supported a two-factor correlated model, χ(2)(19) = 39.33; p < .01; comparative fit index (CFI) = .97; and average absolute standardized residual statistic (AASR) = .03; with Pros α = .90 and Cons α = .67. For Self-Efficacy, PCA indicated one 6-item factor (α = .84). CFA supported this structure, χ(2)(9) = 50.87; p < .05; CFI = .94; AASR = .03; and α = .90. Multivariate analyses indicated significant cross-stage differences on Pros, Cons, and Self-Efficacy in expected directions. Findings support the internal and external validity of these measures and their use in Transtheoretical Model-tailored interventions. Stage-construct relationships suggest that reducing the Cons of vaccination may be more important for HPV than for behaviors with a true Maintenance stage.

  11. Distinguishing Nonpareil marketing group almond cultivars through multivariate analyses.

    PubMed

    Ledbetter, Craig A; Sisterson, Mark S

    2013-09-01

    More than 80% of the world's almonds are grown in California with several dozen almond cultivars available commercially. To facilitate promotion and sale, almond cultivars are categorized into marketing groups based on kernel shape and appearance. Several marketing groups are recognized, with the Nonpareil Marketing Group (NMG) demanding the highest prices. Placement of cultivars into the NMG is historical and no objective standards exist for deciding whether newly developed cultivars belong in the NMG. Principal component analyses (PCA) were used to identify nut and kernel characteristics best separating the 4 NMG cultivars (Nonpareil, Jeffries, Kapareil, and Milow) from a representative of the California Marketing Group (cultivar Carmel) and the Mission Marketing Group (cultivar Padre). In addition, discriminant analyses were used to determine cultivar misclassification rates between and within the marketing groups. All 19 evaluated carpological characters differed significantly among the 6 cultivars and during 2 harvest seasons. A clear distinction of NMG cultivars from representatives of the California and Mission Marketing Groups was evident from a PCA involving the 6 cultivars. Further, NMG kernels were successfully discriminated from kernels representing the California and Mission Marketing Groups with overall kernel misclassification of only 2% using 16 of the 19 evaluated characters. Pellicle luminosity was the most discriminating character, regardless of the character set used in analyses. Results provide an objective classification of NMG almond kernels, clearly distinguishing them from kernels of cultivars representing the California and Mission Marketing Groups. Journal of Food Science © 2013 Institute of Food Technologists® No claim to original US government works.

  12. CscoreTool: fast Hi-C compartment analysis at high resolution.

    PubMed

    Zheng, Xiaobin; Zheng, Yixian

    2018-05-01

    The genome-wide chromosome conformation capture (Hi-C) has revealed that the eukaryotic genome can be partitioned into A and B compartments that have distinctive chromatin and transcription features. Current Principle Component Analyses (PCA)-based method for the A/B compartment prediction based on Hi-C data requires substantial CPU time and memory. We report the development of a method, CscoreTool, which enables fast and memory-efficient determination of A/B compartments at high resolution even in datasets with low sequencing depth. https://github.com/scoutzxb/CscoreTool. xzheng@carnegiescience.edu. Supplementary data are available at Bioinformatics online.

  13. [Vis-NIR spectroscopic pattern recognition combined with SG smoothing applied to breed screening of transgenic sugarcane].

    PubMed

    Liu, Gui-Song; Guo, Hao-Song; Pan, Tao; Wang, Ji-Hua; Cao, Gan

    2014-10-01

    Based on Savitzky-Golay (SG) smoothing screening, principal component analysis (PCA) combined with separately supervised linear discriminant analysis (LDA) and unsupervised hierarchical clustering analysis (HCA) were used for non-destructive visible and near-infrared (Vis-NIR) detection for breed screening of transgenic sugarcane. A random and stability-dependent framework of calibration, prediction, and validation was proposed. A total of 456 samples of sugarcane leaves planting in the elongating stage were collected from the field, which was composed of 306 transgenic (positive) samples containing Bt and Bar gene and 150 non-transgenic (negative) samples. A total of 156 samples (negative 50 and positive 106) were randomly selected as the validation set; the remaining samples (negative 100 and positive 200, a total of 300 samples) were used as the modeling set, and then the modeling set was subdivided into calibration (negative 50 and positive 100, a total of 150 samples) and prediction sets (negative 50 and positive 100, a total of 150 samples) for 50 times. The number of SG smoothing points was ex- panded, while some modes of higher derivative were removed because of small absolute value, and a total of 264 smoothing modes were used for screening. The pairwise combinations of first three principal components were used, and then the optimal combination of principal components was selected according to the model effect. Based on all divisions of calibration and prediction sets and all SG smoothing modes, the SG-PCA-LDA and SG-PCA-HCA models were established, the model parameters were optimized based on the average prediction effect for all divisions to produce modeling stability. Finally, the model validation was performed by validation set. With SG smoothing, the modeling accuracy and stability of PCA-LDA, PCA-HCA were signif- icantly improved. For the optimal SG-PCA-LDA model, the recognition rate of positive and negative validation samples were 94.3%, 96.0%; and were 92.5%, 98.0% for the optimal SG-PCA-LDA model, respectively. Vis-NIR spectro- scopic pattern recognition combined with SG smoothing could be used for accurate recognition of transgenic sugarcane leaves, and provided a convenient screening method for transgenic sugarcane breeding.

  14. AlleleCoder: a PERL script for coding codominant polymorphism data for PCA analysis

    USDA-ARS?s Scientific Manuscript database

    A useful biological interpretation of diploid heterozygotes is in terms of the dose of the common allele (0, 1 or 2 copies). We have developed a PERL script that converts FASTA files into coded spreadsheets suitable for Principal Component Analysis (PCA). In combination with R and R Commander, two- ...

  15. Is Eotaxin-1 a serum and urinary biomarker for prostate cancer detection and recurrence?

    PubMed

    Heidegger, Isabel; Höfer, Julia; Luger, Markus; Pichler, Renate; Klocker, Helmut; Horninger, Wolfgang; Steiner, Eberhard; Jochberger, Stefan; Culig, Zoran

    2015-12-01

    Eotaxin-1 (CCL11) is a protein expressed in various tissues influencing immunoregulatory processes by acting as selective eosinophil chemo-attractant. In prostate cancer (PCa), the expression and functional role of CCL11 have not been intensively investigated so far. Therefore, the aim of the present study was to investigate the diagnostic or prognostic potential of Eotaxin-1 in PCa patients. We analyzed serum from 140 patients who have undergone prostate biopsy due to elevated prostate-specific antigen (PSA) levels as well as serum of 20 individuals with PSA levels < 1ng/ml (healthy control group). Moreover, 40 urine samples were analyzed. A custom-made Q-Plex array ELISA (Quansys Biosciences) for the detection of Eotaxin-1 was performed and Q-View Software used for quantification. In addition, clinical courses of patients documented in our Prostate Biobank database were analyzed. ROC and survival analyses were used to determine the diagnostic and prognostic power of Eotaxin-1 levels. Serum Eotaxin-1 levels were significantly decreased in PCa (P = 0.006) as well as in benign prostate hyperplasia (P = 0.0006) compared to the control group. ROC analysis revealed that Eotaxin-1 is a significant marker to distinguish PCa from disease-free prostate. Moreover, we found that Eotaxin-1 expression is significantly decreased in Gleason score (GS) 6 (P = 0.0135) and GS 8 (P = 0.0057) patients compared to samples of healthy men, respectively. However, PCa aggressiveness was not predictable by Eotaxin-1 levels. In line with serum analyses, urine Eotaxin-1 was significantly decreased in patients with PCa compared to cancer-free individuals (P = 0.0185) but was not different between cancers of different GS. Patientś follow-up analyses showed no significant correlation between serum Eotaxin-1 levels and time to biochemical recurrence. Survival analyses also revealed no significant changes in progression-free survival among low (≤ 112.2 pg/ml) and high (> 112.2 pg/ml) Eotaxin-1 serum levels. Although this study has not established a prognostic role of Eotaxin-1 in PCa patients, this chemokine may serve as a diagnostic marker to distinguish between disease-free prostate and cancer. © 2015 Wiley Periodicals, Inc.

  16. The factor structure and reliability of the Illness Attitude Scales in a student and a patient sample

    PubMed Central

    Crössmann, Alexander; Pauli, Paul

    2006-01-01

    Background The Illness Attitude Scales (IAS), designed by Kellner in 1986, assesses fears, beliefs, and attitudes associated with hypochondriasis and abnormal illness behaviour. However, its factor structure is, especially for translations of the IAS, not sufficiently explored. Thus, the present Study aimed to analyse the factor structure of the IAS in a German student and a patient population using exploratory factor analysis. Methods A mixed student (N = 296) and a mixed patient (N = 130) sample completed the IAS. The data was submitted to principal components analyses (PCA) with subsequent oblique rotations. From identified factor structures, scales were derived and submitted to reliability analyses as well as to a preliminary validity analysis. Results The PCA revealed a four-factor solution in the student sample: (1) fear of illness and death; (2) treatment experience; (3) hypochondriacal beliefs; and (4) effect of symptoms. In the patient sample, the data was best explained by a two-factor solution: (1) health related anxiety and (2) effect of symptoms and treatment experience. All scales reached good to acceptable reliability coefficients. The scales derived from the student sample and those derived from the patient sample were able to distinguish between pain patients and a matched group of normal controls. Conclusion Our data suggests that the IAS is in student samples best represented by a four factor-solution and in patient samples by a two-factor-solution. PMID:17067384

  17. Testing of a simplified LED based vis/NIR system for rapid ripeness evaluation of white grape (Vitis vinifera L.) for Franciacorta wine.

    PubMed

    Giovenzana, Valentina; Civelli, Raffaele; Beghi, Roberto; Oberti, Roberto; Guidetti, Riccardo

    2015-11-01

    The aim of this work was to test a simplified optical prototype for a rapid estimation of the ripening parameters of white grape for Franciacorta wine directly in field. Spectral acquisition based on reflectance at four wavelengths (630, 690, 750 and 850 nm) was proposed. The integration of a simple processing algorithm in the microcontroller software would allow to visualize real time values of spectral reflectance. Non-destructive analyses were carried out on 95 grape bunches for a total of 475 berries. Samplings were performed weekly during the last ripening stages. Optical measurements were carried out both using the simplified system and a portable commercial vis/NIR spectrophotometer, as reference instrument for performance comparison. Chemometric analyses were performed in order to extract the maximum useful information from optical data. Principal component analysis (PCA) was performed for a preliminary evaluation of the data. Correlations between the optical data matrix and ripening parameters (total soluble solids content, SSC; titratable acidity, TA) were carried out using partial least square (PLS) regression for spectra and using multiple linear regression (MLR) for data from the simplified device. Classification analysis were also performed with the aim of discriminate ripe and unripe samples. PCA, MLR and classification analyses show the effectiveness of the simplified system in separating samples among different sampling dates and in discriminating ripe from unripe samples. Finally, simple equations for SSC and TA prediction were calculated. Copyright © 2015 Elsevier B.V. All rights reserved.

  18. Application of multivariate statistical analysis in the pollution and health risk of traffic-related heavy metals.

    PubMed

    Ebqa'ai, Mohammad; Ibrahim, Bashar

    2017-12-01

    This study aims to analyse the heavy metal pollutants in Jeddah, the second largest city in the Gulf Cooperation Council with a population exceeding 3.5 million, and many vehicles. Ninety-eight street dust samples were collected seasonally from the six major roads as well as the Jeddah Beach, and subsequently digested using modified Leeds Public Analyst method. The heavy metals (Fe, Zn, Mn, Cu, Cd, and Pb) were extracted from the ash using methyl isobutyl ketone as solvent extraction and eventually analysed by atomic absorption spectroscopy. Multivariate statistical techniques, principal component analysis (PCA), and hierarchical cluster analysis were applied to these data. Heavy metal concentrations were ranked according to the following descending order: Fe > Zn > Mn > Cu > Pb > Cd. In order to study the pollution and health risk from these heavy metals as well as estimating their effect on the environment, pollution indices, integrated pollution index, enrichment factor, daily dose average, hazard quotient, and hazard index were all analysed. The PCA showed high levels of Zn, Fe, and Cd in Al Kurnish road, while these elements were consistently detected on King Abdulaziz and Al Madina roads. The study indicates that high levels of Zn and Pb pollution were recorded for major roads in Jeddah. Six out of seven roads had high pollution indices. This study is the first step towards further investigations into current health problems in Jeddah, such as anaemia and asthma.

  19. Fast Steerable Principal Component Analysis

    PubMed Central

    Zhao, Zhizhen; Shkolnisky, Yoel; Singer, Amit

    2016-01-01

    Cryo-electron microscopy nowadays often requires the analysis of hundreds of thousands of 2-D images as large as a few hundred pixels in each direction. Here, we introduce an algorithm that efficiently and accurately performs principal component analysis (PCA) for a large set of 2-D images, and, for each image, the set of its uniform rotations in the plane and their reflections. For a dataset consisting of n images of size L × L pixels, the computational complexity of our algorithm is O(nL3 + L4), while existing algorithms take O(nL4). The new algorithm computes the expansion coefficients of the images in a Fourier–Bessel basis efficiently using the nonuniform fast Fourier transform. We compare the accuracy and efficiency of the new algorithm with traditional PCA and existing algorithms for steerable PCA. PMID:27570801

  20. Raman signatures of ferroic domain walls captured by principal component analysis.

    PubMed

    Nataf, G F; Barrett, N; Kreisel, J; Guennou, M

    2018-01-24

    Ferroic domain walls are currently investigated by several state-of-the art techniques in order to get a better understanding of their distinct, functional properties. Here, principal component analysis (PCA) of Raman maps is used to study ferroelectric domain walls (DWs) in LiNbO 3 and ferroelastic DWs in NdGaO 3 . It is shown that PCA allows us to quickly and reliably identify small Raman peak variations at ferroelectric DWs and that the value of a peak shift can be deduced-accurately and without a priori-from a first order Taylor expansion of the spectra. The ability of PCA to separate the contribution of ferroelastic domains and DWs to Raman spectra is emphasized. More generally, our results provide a novel route for the statistical analysis of any property mapped across a DW.

  1. Chemotypes of Pistacia atlantica leaf essential oils from Algeria.

    PubMed

    Gourine, Nadhir; Bombarda, Isabelle; Yousfi, Mohamed; Gaydou, Emile M

    2010-01-01

    The essential oils obtained by hydrodistillation of Pistacia atlantica Desf. leaves collected from different regions of Algeria were analyzed by GC and GC-MS. The essential oil was rich in monoterpenes and oxygenated sesquiterpenes. The major components were alpha-pinene (0.0-67%), delta-3-carene (0.0-56%), spathulenol (0.5-22%), camphene (0.0-21%), terpinen-4-ol (0.0-16%) and beta-pinene (0.0-13%). Among the various components identified, twenty were used for statistical analyses. The result of principal component analysis (PCA) showed the occurrence of three chemotypes: a delta-3-carene chemotype (16.4-56.2%), a terpinen-4-ol chemotype (10.8-16.0%) and an alpha-pinene/camphene chemotype (10.9-66.6%/3.8-20.9%). It was found that the essential oil from female plants (delta-3-carene chemotype) could be easily differentiated from the two other chemotypes corresponding to male trees.

  2. Motor features in posterior cortical atrophy and their imaging correlates.

    PubMed

    Ryan, Natalie S; Shakespeare, Timothy J; Lehmann, Manja; Keihaninejad, Shiva; Nicholas, Jennifer M; Leung, Kelvin K; Fox, Nick C; Crutch, Sebastian J

    2014-12-01

    Posterior cortical atrophy (PCA) is a neurodegenerative syndrome characterized by impaired higher visual processing skills; however, motor features more commonly associated with corticobasal syndrome may also occur. We investigated the frequency and clinical characteristics of motor features in 44 PCA patients and, with 30 controls, conducted voxel-based morphometry, cortical thickness, and subcortical volumetric analyses of their magnetic resonance imaging. Prominent limb rigidity was used to define a PCA-motor subgroup. A total of 30% (13) had PCA-motor; all demonstrating asymmetrical left upper limb rigidity. Limb apraxia was more frequent and asymmetrical in PCA-motor, as was myoclonus. Tremor and alien limb phenomena only occurred in this subgroup. The subgroups did not differ in neuropsychological test performance or apolipoprotein E4 allele frequency. Greater asymmetry of atrophy occurred in PCA-motor, particularly involving right frontoparietal and peri-rolandic cortices, putamen, and thalamus. The 9 patients (including 4 PCA-motor) with pathology or cerebrospinal fluid all showed evidence of Alzheimer's disease. Our data suggest that PCA patients with motor features have greater atrophy of contralateral sensorimotor areas but are still likely to have underlying Alzheimer's disease. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

  3. Self-aggregation in scaled principal component space

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ding, Chris H.Q.; He, Xiaofeng; Zha, Hongyuan

    2001-10-05

    Automatic grouping of voluminous data into meaningful structures is a challenging task frequently encountered in broad areas of science, engineering and information processing. These data clustering tasks are frequently performed in Euclidean space or a subspace chosen from principal component analysis (PCA). Here we describe a space obtained by a nonlinear scaling of PCA in which data objects self-aggregate automatically into clusters. Projection into this space gives sharp distinctions among clusters. Gene expression profiles of cancer tissue subtypes, Web hyperlink structure and Internet newsgroups are analyzed to illustrate interesting properties of the space.

  4. Novel Three-Component Phenazine-1-Carboxylic Acid 1,2-Dioxygenase in Sphingomonas wittichii DP58

    PubMed Central

    Zhao, Qiang; Wang, Wei; Huang, Xian-Qing; Zhang, Xue-Hong

    2017-01-01

    ABSTRACT Phenazine-1-carboxylic acid, the main component of shenqinmycin, is widely used in southern China for the prevention of rice sheath blight. However, the fate of phenazine-1-carboxylic acid in soil remains uncertain. Sphingomonas wittichii DP58 can use phenazine-1-carboxylic acid as its sole carbon and nitrogen sources for growth. In this study, dioxygenase-encoding genes, pcaA1A2, were found using transcriptome analysis to be highly upregulated upon phenazine-1-carboxylic acid biodegradation. PcaA1 shares 68% amino acid sequence identity with the large oxygenase subunit of anthranilate 1,2-dioxygenase from Rhodococcus maanshanensis DSM 44675. The dioxygenase was coexpressed in Escherichia coli with its adjacent reductase-encoding gene, pcaA3, and ferredoxin-encoding gene, pcaA4, and showed phenazine-1-carboxylic acid consumption. The dioxygenase-, ferredoxin-, and reductase-encoding genes were expressed in Pseudomonas putida KT2440 or E. coli BL21, and the three recombinant proteins were purified. A phenazine-1-carboxylic acid conversion capability occurred in vitro only when all three components were present. However, P. putida KT2440 transformed with pcaA1A2 obtained phenazine-1-carboxylic acid degradation ability, suggesting that phenazine-1-carboxylic acid 1,2-dioxygenase has low specificities for its ferredoxin and reductase. This was verified by replacing PcaA3 with RedA2 in the in vitro enzyme assay. High-performance liquid chromatography–mass spectrometry (HPLC-MS) and nuclear magnetic resonance (NMR) analysis showed that phenazine-1-carboxylic acid was converted to 1,2-dihydroxyphenazine through decarboxylation and hydroxylation, indicating that PcaA1A2A3A4 constitutes the initial phenazine-1-carboxylic acid 1,2-dioxygenase. This study fills a gap in our understanding of the biodegradation of phenazine-1-carboxylic acid and illustrates a new dioxygenase for decarboxylation. IMPORTANCE Phenazine-1-carboxylic acid is widely used in southern China as a key fungicide to prevent rice sheath blight. However, the degradation characteristics of phenazine-1-carboxylic acid and the environmental consequences of the long-term application are not clear. S. wittichii DP58 can use phenazine-1-carboxylic acid as its sole carbon and nitrogen sources. In this study, a three-component dioxygenase, PcaA1A2A3A4, was determined to be the initial dioxygenase for phenazine-1-carboxylic acid degradation in S. wittichii DP58. Phenazine-1-carboxylic acid was converted to 1,2-dihydroxyphenazine through decarboxylation and hydroxylation. This finding may help us discover the pathway for phenazine-1-carboxylic acid degradation. PMID:28188209

  5. Facilitating in vivo tumor localization by principal component analysis based on dynamic fluorescence molecular imaging

    NASA Astrophysics Data System (ADS)

    Gao, Yang; Chen, Maomao; Wu, Junyu; Zhou, Yuan; Cai, Chuangjian; Wang, Daliang; Luo, Jianwen

    2017-09-01

    Fluorescence molecular imaging has been used to target tumors in mice with xenograft tumors. However, tumor imaging is largely distorted by the aggregation of fluorescent probes in the liver. A principal component analysis (PCA)-based strategy was applied on the in vivo dynamic fluorescence imaging results of three mice with xenograft tumors to facilitate tumor imaging, with the help of a tumor-specific fluorescent probe. Tumor-relevant features were extracted from the original images by PCA and represented by the principal component (PC) maps. The second principal component (PC2) map represented the tumor-related features, and the first principal component (PC1) map retained the original pharmacokinetic profiles, especially of the liver. The distribution patterns of the PC2 map of the tumor-bearing mice were in good agreement with the actual tumor location. The tumor-to-liver ratio and contrast-to-noise ratio were significantly higher on the PC2 map than on the original images, thus distinguishing the tumor from its nearby fluorescence noise of liver. The results suggest that the PC2 map could serve as a bioimaging marker to facilitate in vivo tumor localization, and dynamic fluorescence molecular imaging with PCA could be a valuable tool for future studies of in vivo tumor metabolism and progression.

  6. The hierarchical structure of self-reported impulsivity

    PubMed Central

    Kirby, Kris N.; Finch, Julia C.

    2010-01-01

    The hierarchical structure of 95 self-reported impulsivity items, along with delay-discount rates for money, was examined. A large sample of college students participated in the study (N = 407). Items represented every previously proposed dimension of self-reported impulsivity. Exploratory PCA yielded at least 7 interpretable components: Prepared/Careful, Impetuous, Divertible, Thrill and Risk Seeking, Happy-Go-Lucky, Impatiently Pleasure Seeking, and Reserved. Discount rates loaded on Impatiently Pleasure Seeking, and correlated with the impulsiveness and venturesomeness scales from the I7 (Eysenck, Pearson, Easting, & Allsopp, 1985). The hierarchical emergence of the components was explored, and we show how this hierarchical structure may help organize conflicting dimensions found in previous analyses. Finally, we argue that the discounting model (Ainslie, 1975) provides a qualitative framework for understanding the dimensions of impulsivity. PMID:20224803

  7. Decomposing the Apoptosis Pathway Into Biologically Interpretable Principal Components

    PubMed Central

    Wang, Min; Kornblau, Steven M; Coombes, Kevin R

    2018-01-01

    Principal component analysis (PCA) is one of the most common techniques in the analysis of biological data sets, but applying PCA raises 2 challenges. First, one must determine the number of significant principal components (PCs). Second, because each PC is a linear combination of genes, it rarely has a biological interpretation. Existing methods to determine the number of PCs are either subjective or computationally extensive. We review several methods and describe a new R package, PCDimension, that implements additional methods, the most important being an algorithm that extends and automates a graphical Bayesian method. Using simulations, we compared the methods. Our newly automated procedure is competitive with the best methods when considering both accuracy and speed and is the most accurate when the number of objects is small compared with the number of attributes. We applied the method to a proteomics data set from patients with acute myeloid leukemia. Proteins in the apoptosis pathway could be explained using 6 PCs. By clustering the proteins in PC space, we were able to replace the PCs by 6 “biological components,” 3 of which could be immediately interpreted from the current literature. We expect this approach combining PCA with clustering to be widely applicable. PMID:29881252

  8. Assessing the heterogeneity of aggressive behavior traits: exploratory and confirmatory analyses of the reactive and instrumental aggression Personality Assessment Inventory (PAI) scales.

    PubMed

    Antonius, Daniel; Sinclair, Samuel Justin; Shiva, Andrew A; Messinger, Julie W; Maile, Jordan; Siefert, Caleb J; Belfi, Brian; Malaspina, Dolores; Blais, Mark A

    2013-01-01

    The heterogeneity of violent behavior is often overlooked in risk assessment despite its importance in the management and treatment of psychiatric and forensic patients. In this study, items from the Personality Assessment Inventory (PAI) were first evaluated and rated by experts in terms of how well they assessed personality features associated with reactive and instrumental aggression. Exploratory principal component analyses (PCA) were then conducted on select items using a sample of psychiatric and forensic inpatients (n = 479) to examine the latent structure and construct validity of these reactive and instrumental aggression factors. Finally, a confirmatory factor analysis (CFA) was conducted on a separate sample of psychiatric inpatients (n = 503) to evaluate whether these factors yielded acceptable model fit. Overall, the exploratory and confirmatory analyses supported the existence of two latent PAI factor structures, which delineate personality traits related to reactive and instrumental aggression.

  9. ADC texture—An imaging biomarker for high-grade glioma?

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Brynolfsson, Patrik; Hauksson, Jón; Karlsson, Mikael

    2014-10-15

    Purpose: Survival for high-grade gliomas is poor, at least partly explained by intratumoral heterogeneity contributing to treatment resistance. Radiological evaluation of treatment response is in most cases limited to assessment of tumor size months after the initiation of therapy. Diffusion-weighted magnetic resonance imaging (MRI) and its estimate of the apparent diffusion coefficient (ADC) has been widely investigated, as it reflects tumor cellularity and proliferation. The aim of this study was to investigate texture analysis of ADC images in conjunction with multivariate image analysis as a means for identification of pretreatment imaging biomarkers. Methods: Twenty-three consecutive high-grade glioma patients were treatedmore » with radiotherapy (2 Gy/60 Gy) with concomitant and adjuvant temozolomide. ADC maps and T1-weighted anatomical images with and without contrast enhancement were collected prior to treatment, and (residual) tumor contrast enhancement was delineated. A gray-level co-occurrence matrix analysis was performed on the ADC maps in a cuboid encapsulating the tumor in coronal, sagittal, and transversal planes, giving a total of 60 textural descriptors for each tumor. In addition, similar examinations and analyses were performed at day 1, week 2, and week 6 into treatment. Principal component analysis (PCA) was applied to reduce dimensionality of the data, and the five largest components (scores) were used in subsequent analyses. MRI assessment three months after completion of radiochemotherapy was used for classifying tumor progression or regression. Results: The score scatter plots revealed that the first, third, and fifth components of the pretreatment examinations exhibited a pattern that strongly correlated to survival. Two groups could be identified: one with a median survival after diagnosis of 1099 days and one with 345 days, p = 0.0001. Conclusions: By combining PCA and texture analysis, ADC texture characteristics were identified, which seems to hold pretreatment prognostic information, independent of known prognostic factors such as age, stage, and surgical procedure. These findings encourage further studies with a larger patient cohort.« less

  10. Genetic and Morphological Analyses Demonstrate That Schizolecis guntheri (Siluriformes: Loricariidae) Is Likely to Be a Species Complex

    PubMed Central

    Souza, Camila S.; Costa-Silva, Guilherme J.; Roxo, Fábio F.; Foresti, Fausto; Oliveira, Claudio

    2018-01-01

    Schizolecis is a monotypic genus of Siluriformes widely distributed throughout isolated coastal drainages of southeastern Brazil. Previous studies have shown that fish groups found in isolated river basins tend to differentiate over time in the absence of gene flow, resulting in allopatric speciation. In this study, we used partial sequences of the mitochondrial gene COI with the analysis of the General Mixed Yule Coalescent model (GMYC) and the Automatic Barcode Gap Discovery (ABGD) for single locus species delimitation, and a Principal Component Analysis (PCA) of external morphology to test the hypothesis that Schizolecis guntheri is a complex of species. We analyzed 94 samples of S. guntheri for GMYC and ABGD, and 82 samples for PCA from 22 coastal rivers draining to the Atlantic in southeastern Brazil from the Paraná State to the north of the Rio de Janeiro State. As a result, the GMYC model and the ABGD delimited five operational taxonomy units (OTUs – a nomenclature referred to in the present study of the possible new species delimited for the genetic analysis), a much higher number compared to the traditional alfa taxonomy that only recognizes S. guntheri across the isolated coastal rivers of Brazil. Furthermore, the PCA analysis suggests that S. guntheri is highly variable in aspects of external body proportions, including dorsal-fin spine length, pectoral-fin spine length, pelvic-fin spine length, lower caudal-fin spine length, caudal peduncle depth, anal width and mandibular ramus length. However, no exclusive character was found among the isolated populations that could be used to describe a new species of Schizolecis. Therefore, we can conclude, based on our results of PCA contrasting with the results of GMYC and ABGD, that S. guntheri represents a complex of species. PMID:29552028

  11. Genetic and Morphological Analyses Demonstrate That Schizolecis guntheri (Siluriformes: Loricariidae) Is Likely to Be a Species Complex.

    PubMed

    Souza, Camila S; Costa-Silva, Guilherme J; Roxo, Fábio F; Foresti, Fausto; Oliveira, Claudio

    2018-01-01

    Schizolecis is a monotypic genus of Siluriformes widely distributed throughout isolated coastal drainages of southeastern Brazil. Previous studies have shown that fish groups found in isolated river basins tend to differentiate over time in the absence of gene flow, resulting in allopatric speciation. In this study, we used partial sequences of the mitochondrial gene COI with the analysis of the General Mixed Yule Coalescent model (GMYC) and the Automatic Barcode Gap Discovery (ABGD) for single locus species delimitation, and a Principal Component Analysis (PCA) of external morphology to test the hypothesis that Schizolecis guntheri is a complex of species. We analyzed 94 samples of S. guntheri for GMYC and ABGD, and 82 samples for PCA from 22 coastal rivers draining to the Atlantic in southeastern Brazil from the Paraná State to the north of the Rio de Janeiro State. As a result, the GMYC model and the ABGD delimited five operational taxonomy units (OTUs - a nomenclature referred to in the present study of the possible new species delimited for the genetic analysis), a much higher number compared to the traditional alfa taxonomy that only recognizes S. guntheri across the isolated coastal rivers of Brazil. Furthermore, the PCA analysis suggests that S. guntheri is highly variable in aspects of external body proportions, including dorsal-fin spine length, pectoral-fin spine length, pelvic-fin spine length, lower caudal-fin spine length, caudal peduncle depth, anal width and mandibular ramus length. However, no exclusive character was found among the isolated populations that could be used to describe a new species of Schizolecis . Therefore, we can conclude, based on our results of PCA contrasting with the results of GMYC and ABGD, that S. guntheri represents a complex of species.

  12. Evaluating the ability of artificial neural network and PCA-M5P models in predicting leachate COD load in landfills.

    PubMed

    Azadi, Sama; Amiri, Hamid; Rakhshandehroo, G Reza

    2016-09-01

    Waste burial in uncontrolled landfills can cause serious environmental damages and unpleasant consequences. Leachates produced in landfills have the potential to contaminate soil and groundwater resources. Leachate management is one of the major issues with respect to landfills environmental impacts. Improper design of landfills can lead to leachate spread in the environment, and hence, engineered landfills are required to have leachate monitoring programs. The high cost of such programs may be greatly reduced and cost efficiency of the program may be optimized if one can predict leachate contamination level and foresee management and treatment strategies. The aim of this study is to develop two expert systems consisting of Artificial Neural Network (ANN) and Principal Component Analysis-M5P (PCA-M5P) models to predict Chemical Oxygen Demand (COD) load in leachates produced in lab-scale landfills. Measured data from three landfill lysimeters, including rainfall depth, number of days after waste deposition, thickness of top and bottom Compacted Clay Liners (CCLs), and thickness of top cover over the lysimeter, were utilized to develop, train, validate, and test the expert systems and predict the leachate COD load. Statistical analysis of the prediction results showed that both models possess good prediction ability with a slight superiority for ANN over PCA-M5P. Based on test datasets, the mean absolute percentage error for ANN and PCA-M5P models were 4% and 12%, respectively, and the correlation coefficient for both models was greater than 0.98. Developed models may be used as a rough estimate for leachate COD load prediction in primary landfill designs, where the effect of a top and/or bottom liner is disputed. Copyright © 2016 Elsevier Ltd. All rights reserved.

  13. Principal component analysis to assess the efficiency and mechanism for enhanced coagulation of natural algae-laden water using a novel dual coagulant system.

    PubMed

    Ou, Hua-Se; Wei, Chao-Hai; Deng, Yang; Gao, Nai-Yun; Ren, Yuan; Hu, Yun

    2014-02-01

    A novel dual coagulant system of polyaluminum chloride sulfate (PACS) and polydiallyldimethylammonium chloride (PDADMAC) was used to treat natural algae-laden water from Meiliang Gulf, Lake Taihu. PACS (Aln(OH)mCl3n-m-2k(SO4)k) has a mass ratio of 10 %, a SO4 (2-)/Al3 (+) mole ratio of 0.0664, and an OH/Al mole ratio of 2. The PDADMAC ([C8H16NCl]m) has a MW which ranges from 5 × 10(5) to 20 × 10(5) Da. The variations of contaminants in water samples during treatments were estimated in the form of principal component analysis (PCA) factor scores and conventional variables (turbidity, DOC, etc.). Parallel factor analysis determined four chromophoric dissolved organic matters (CDOM) components, and PCA identified four integrated principle factors. PCA factor 1 had significant correlations with chlorophyll-a (r=0.718), protein-like CDOM C1 (0.689), and C2 (0.756). Factor 2 correlated with UV254 (0.672), humic-like CDOM component C3 (0.716), and C4 (0.758). Factors 3 and 4 had correlations with NH3-N (0.748) and T-P (0.769), respectively. The variations of PCA factors scores revealed that PACS contributed less aluminum dissolution than PAC to obtain equivalent removal efficiency of contaminants. This might be due to the high cationic charge and pre-hydrolyzation of PACS. Compared with PACS coagulation (20 mg L(-1)), the removal of PCA factors 1, 2, and 4 increased 45, 33, and 12 %, respectively, in combined PACS-PDADMAC treatment (0.8 mg L(-1) +20 mg L(-1)). Since PAC contained more Al (0.053 g/1 g) than PACS (0.028 g/1 g), the results indicated that PACS contributed less Al dissolution into the water to obtain equivalent removal efficiency.

  14. Dynamic Responses in Brain Networks to Social Feedback: A Dual EEG Acquisition Study in Adolescent Couples

    PubMed Central

    Kuo, Ching-Chang; Ha, Thao; Ebbert, Ashley M.; Tucker, Don M.; Dishion, Thomas J.

    2017-01-01

    Adolescence is a sensitive period for the development of romantic relationships. During this period the maturation of frontolimbic networks is particularly important for the capacity to regulate emotional experiences. In previous research, both functional magnetic resonance imaging (fMRI) and dense array electroencephalography (dEEG) measures have suggested that responses in limbic regions are enhanced in adolescents experiencing social rejection. In the present research, we examined social acceptance and rejection from romantic partners as they engaged in a Chatroom Interact Task. Dual 128-channel dEEG systems were used to record neural responses to acceptance and rejection from both adolescent romantic partners and unfamiliar peers (N = 75). We employed a two-step temporal principal component analysis (PCA) and spatial independent component analysis (ICA) approach to statistically identify the neural components related to social feedback. Results revealed that the early (288 ms) discrimination between acceptance and rejection reflected by the P3a component was significant for the romantic partner but not the unfamiliar peer. In contrast, the later (364 ms) P3b component discriminated between acceptance and rejection for both partners and peers. The two-step approach (PCA then ICA) was better able than either PCA or ICA alone in separating these components of the brain's electrical activity that reflected both temporal and spatial phases of the brain's processing of social feedback. PMID:28620292

  15. Quantitative structure-activity relationship study of P2X7 receptor inhibitors using combination of principal component analysis and artificial intelligence methods.

    PubMed

    Ahmadi, Mehdi; Shahlaei, Mohsen

    2015-01-01

    P2X7 antagonist activity for a set of 49 molecules of the P2X7 receptor antagonists, derivatives of purine, was modeled with the aid of chemometric and artificial intelligence techniques. The activity of these compounds was estimated by means of combination of principal component analysis (PCA), as a well-known data reduction method, genetic algorithm (GA), as a variable selection technique, and artificial neural network (ANN), as a non-linear modeling method. First, a linear regression, combined with PCA, (principal component regression) was operated to model the structure-activity relationships, and afterwards a combination of PCA and ANN algorithm was employed to accurately predict the biological activity of the P2X7 antagonist. PCA preserves as much of the information as possible contained in the original data set. Seven most important PC's to the studied activity were selected as the inputs of ANN box by an efficient variable selection method, GA. The best computational neural network model was a fully-connected, feed-forward model with 7-7-1 architecture. The developed ANN model was fully evaluated by different validation techniques, including internal and external validation, and chemical applicability domain. All validations showed that the constructed quantitative structure-activity relationship model suggested is robust and satisfactory.

  16. Quantitative structure–activity relationship study of P2X7 receptor inhibitors using combination of principal component analysis and artificial intelligence methods

    PubMed Central

    Ahmadi, Mehdi; Shahlaei, Mohsen

    2015-01-01

    P2X7 antagonist activity for a set of 49 molecules of the P2X7 receptor antagonists, derivatives of purine, was modeled with the aid of chemometric and artificial intelligence techniques. The activity of these compounds was estimated by means of combination of principal component analysis (PCA), as a well-known data reduction method, genetic algorithm (GA), as a variable selection technique, and artificial neural network (ANN), as a non-linear modeling method. First, a linear regression, combined with PCA, (principal component regression) was operated to model the structure–activity relationships, and afterwards a combination of PCA and ANN algorithm was employed to accurately predict the biological activity of the P2X7 antagonist. PCA preserves as much of the information as possible contained in the original data set. Seven most important PC's to the studied activity were selected as the inputs of ANN box by an efficient variable selection method, GA. The best computational neural network model was a fully-connected, feed-forward model with 7−7−1 architecture. The developed ANN model was fully evaluated by different validation techniques, including internal and external validation, and chemical applicability domain. All validations showed that the constructed quantitative structure–activity relationship model suggested is robust and satisfactory. PMID:26600858

  17. PCA leverage: outlier detection for high-dimensional functional magnetic resonance imaging data.

    PubMed

    Mejia, Amanda F; Nebel, Mary Beth; Eloyan, Ani; Caffo, Brian; Lindquist, Martin A

    2017-07-01

    Outlier detection for high-dimensional (HD) data is a popular topic in modern statistical research. However, one source of HD data that has received relatively little attention is functional magnetic resonance images (fMRI), which consists of hundreds of thousands of measurements sampled at hundreds of time points. At a time when the availability of fMRI data is rapidly growing-primarily through large, publicly available grassroots datasets-automated quality control and outlier detection methods are greatly needed. We propose principal components analysis (PCA) leverage and demonstrate how it can be used to identify outlying time points in an fMRI run. Furthermore, PCA leverage is a measure of the influence of each observation on the estimation of principal components, which are often of interest in fMRI data. We also propose an alternative measure, PCA robust distance, which is less sensitive to outliers and has controllable statistical properties. The proposed methods are validated through simulation studies and are shown to be highly accurate. We also conduct a reliability study using resting-state fMRI data from the Autism Brain Imaging Data Exchange and find that removal of outliers using the proposed methods results in more reliable estimation of subject-level resting-state networks using independent components analysis. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  18. [Research on spectra recognition method for cabbages and weeds based on PCA and SIMCA].

    PubMed

    Zu, Qin; Deng, Wei; Wang, Xiu; Zhao, Chun-Jiang

    2013-10-01

    In order to improve the accuracy and efficiency of weed identification, the difference of spectral reflectance was employed to distinguish between crops and weeds. Firstly, the different combinations of Savitzky-Golay (SG) convolutional derivation and multiplicative scattering correction (MSC) method were applied to preprocess the raw spectral data. Then the clustering analysis of various types of plants was completed by using principal component analysis (PCA) method, and the feature wavelengths which were sensitive for classifying various types of plants were extracted according to the corresponding loading plots of the optimal principal components in PCA results. Finally, setting the feature wavelengths as the input variables, the soft independent modeling of class analogy (SIMCA) classification method was used to identify the various types of plants. The experimental results of classifying cabbages and weeds showed that on the basis of the optimal pretreatment by a synthetic application of MSC and SG convolutional derivation with SG's parameters set as 1rd order derivation, 3th degree polynomial and 51 smoothing points, 23 feature wavelengths were extracted in accordance with the top three principal components in PCA results. When SIMCA method was used for classification while the previously selected 23 feature wavelengths were set as the input variables, the classification rates of the modeling set and the prediction set were respectively up to 98.6% and 100%.

  19. Classification of fMRI resting-state maps using machine learning techniques: A comparative study

    NASA Astrophysics Data System (ADS)

    Gallos, Ioannis; Siettos, Constantinos

    2017-11-01

    We compare the efficiency of Principal Component Analysis (PCA) and nonlinear learning manifold algorithms (ISOMAP and Diffusion maps) for classifying brain maps between groups of schizophrenia patients and healthy from fMRI scans during a resting-state experiment. After a standard pre-processing pipeline, we applied spatial Independent component analysis (ICA) to reduce (a) noise and (b) spatial-temporal dimensionality of fMRI maps. On the cross-correlation matrix of the ICA components, we applied PCA, ISOMAP and Diffusion Maps to find an embedded low-dimensional space. Finally, support-vector-machines (SVM) and k-NN algorithms were used to evaluate the performance of the algorithms in classifying between the two groups.

  20. A novel principal component analysis for spatially misaligned multivariate air pollution data.

    PubMed

    Jandarov, Roman A; Sheppard, Lianne A; Sampson, Paul D; Szpiro, Adam A

    2017-01-01

    We propose novel methods for predictive (sparse) PCA with spatially misaligned data. These methods identify principal component loading vectors that explain as much variability in the observed data as possible, while also ensuring the corresponding principal component scores can be predicted accurately by means of spatial statistics at locations where air pollution measurements are not available. This will make it possible to identify important mixtures of air pollutants and to quantify their health effects in cohort studies, where currently available methods cannot be used. We demonstrate the utility of predictive (sparse) PCA in simulated data and apply the approach to annual averages of particulate matter speciation data from national Environmental Protection Agency (EPA) regulatory monitors.

  1. Multivariate analysis for scanning tunneling spectroscopy data

    NASA Astrophysics Data System (ADS)

    Yamanishi, Junsuke; Iwase, Shigeru; Ishida, Nobuyuki; Fujita, Daisuke

    2018-01-01

    We applied principal component analysis (PCA) to two-dimensional tunneling spectroscopy (2DTS) data obtained on a Si(111)-(7 × 7) surface to explore the effectiveness of multivariate analysis for interpreting 2DTS data. We demonstrated that several components that originated mainly from specific atoms at the Si(111)-(7 × 7) surface can be extracted by PCA. Furthermore, we showed that hidden components in the tunneling spectra can be decomposed (peak separation), which is difficult to achieve with normal 2DTS analysis without the support of theoretical calculations. Our analysis showed that multivariate analysis can be an additional powerful way to analyze 2DTS data and extract hidden information from a large amount of spectroscopic data.

  2. Exploring the Saccharomyces cerevisiae Volatile Metabolome: Indigenous versus Commercial Strains

    PubMed Central

    Alves, Zélia; Melo, André; Figueiredo, Ana Raquel; Coimbra, Manuel A.; Gomes, Ana C.; Rocha, Sílvia M.

    2015-01-01

    Winemaking is a highly industrialized process and a number of commercial Saccharomyces cerevisiae strains are used around the world, neglecting the diversity of native yeast strains that are responsible for the production of wines peculiar flavours. The aim of this study was to in-depth establish the S. cerevisiae volatile metabolome and to assess inter-strains variability. To fulfill this objective, two indigenous strains (BT2652 and BT2453 isolated from spontaneous fermentation of grapes collected in Bairrada Appellation, Portugal) and two commercial strains (CSc1 and CSc2) S. cerevisiae were analysed using a methodology based on advanced multidimensional gas chromatography (HS-SPME/GC×GC-ToFMS) tandem with multivariate analysis. A total of 257 volatile metabolites were identified, distributed over the chemical families of acetals, acids, alcohols, aldehydes, ketones, terpenic compounds, esters, ethers, furan-type compounds, hydrocarbons, pyrans, pyrazines and S-compounds. Some of these families are related with metabolic pathways of amino acid, carbohydrate and fatty acid metabolism as well as mono and sesquiterpenic biosynthesis. Principal Component Analysis (PCA) was used with a dataset comprising all variables (257 volatile components), and a distinction was observed between commercial and indigenous strains, which suggests inter-strains variability. In a second step, a subset containing esters and terpenic compounds (C10 and C15), metabolites of particular relevance to wine aroma, was also analysed using PCA. The terpenic and ester profiles express the strains variability and their potential contribution to the wine aromas, specially the BT2453, which produced the higher terpenic content. This research contributes to understand the metabolic diversity of indigenous wine microflora versus commercial strains and achieved knowledge that may be further exploited to produce wines with peculiar aroma properties. PMID:26600152

  3. Development, validation and psychometric properties of a diagnostic/prognostic tool for breakthrough pain in mixed chronic-pain patients.

    PubMed

    Samolsky Dekel, Boaz Gedaliahu; Remondini, Francesca; Gori, Alberto; Vasarri, Alessio; Di Nino, GianFranco; Melotti, Rita Maria

    2016-02-01

    Breakthrough pain (BTP) shows variable prevalence in different clinical contexts of cancer and non-cancer patients. BTP diagnostic tools with demonstrated reliability, validation and prognostic capability are lacking. We report the development, psychometric and validation properties of a diagnostic/prognostic tool, the IQ-BTP, for BTP recognition, its likelihood and clinical features among chronic-pain (CP) patients. n=120 consecutive mixed cancer/non-cancer CP in/outpatients. Development, psychometric analyses and formal validation included: Face/Content validity (by 'experts' opinion and assessing the relationship between the IQ-BTP classes and criteria derived from BTP operational-case-definition); Construct validity, by Principle Component Analysis (PCA); and the strength of Spearman correlation between IQ-BTP classes and the Brief Pain Inventory (BPI) items; Reliability, by Cronbach's alpha statistics. Associations with clinical/demographic moderators were assessed applying χ(2) analysis. Potential-BTP was found in 36.7% of patients (38.4% of non-cancer and 32.4% of cancer patients). Among these the likelihood for BTP diagnosis was 'high' in 25%, 'intermediate' in 41% and, 'low' 34% of patients. Analyses showed significant differences between IQ-BTP classes and between the latter BPI pain-item scores. Correlation between IQ-BTP classes and BPI items was moderate. PCA and scree test identified 3 components accounting for 62.3% of the variance. Cronbach's alpha was 0.71. The IQ-BTP showed satisfactory psychometric and validation properties. With adequate feasibility it enabled the allocating of cancer/non-cancer CP patients in three prognostic classes. Results are sufficient to warrant a subsequent impact study of the IQ-BTP as prognostic model and screening tool for BTP in both CP populations. Copyright © 2016 Elsevier B.V. All rights reserved.

  4. Principal component and normal mode analysis of proteins; a quantitative comparison using the GroEL subunit.

    PubMed

    Skjaerven, Lars; Martinez, Aurora; Reuter, Nathalie

    2011-01-01

    Principal component analysis (PCA) and normal mode analysis (NMA) have emerged as two invaluable tools for studying conformational changes in proteins. To compare these approaches for studying protein dynamics, we have used a subunit of the GroEL chaperone, whose dynamics is well characterized. We first show that both PCA on trajectories from molecular dynamics (MD) simulations and NMA reveal a general dynamical behavior in agreement with what has previously been described for GroEL. We thus compare the reproducibility of PCA on independent MD runs and subsequently investigate the influence of the length of the MD simulations. We show that there is a relatively poor one-to-one correspondence between eigenvectors obtained from two independent runs and conclude that caution should be taken when analyzing principal components individually. We also observe that increasing the simulation length does not improve the agreement with the experimental structural difference. In fact, relatively short MD simulations are sufficient for this purpose. We observe a rapid convergence of the eigenvectors (after ca. 6 ns). Although there is not always a clear one-to-one correspondence, there is a qualitatively good agreement between the movements described by the first five modes obtained with the three different approaches; PCA, all-atoms NMA, and coarse-grained NMA. It is particularly interesting to relate this to the computational cost of the three methods. The results we obtain on the GroEL subunit contribute to the generalization of robust and reproducible strategies for the study of protein dynamics, using either NMA or PCA of trajectories from MD simulations. © 2010 Wiley-Liss, Inc.

  5. Strategies for reducing large fMRI data sets for independent component analysis.

    PubMed

    Wang, Ze; Wang, Jiongjiong; Calhoun, Vince; Rao, Hengyi; Detre, John A; Childress, Anna R

    2006-06-01

    In independent component analysis (ICA), principal component analysis (PCA) is generally used to reduce the raw data to a few principal components (PCs) through eigenvector decomposition (EVD) on the data covariance matrix. Although this works for spatial ICA (sICA) on moderately sized fMRI data, it is intractable for temporal ICA (tICA), since typical fMRI data have a high spatial dimension, resulting in an unmanageable data covariance matrix. To solve this problem, two practical data reduction methods are presented in this paper. The first solution is to calculate the PCs of tICA from the PCs of sICA. This approach works well for moderately sized fMRI data; however, it is highly computationally intensive, even intractable, when the number of scans increases. The second solution proposed is to perform PCA decomposition via a cascade recursive least squared (CRLS) network, which provides a uniform data reduction solution for both sICA and tICA. Without the need to calculate the covariance matrix, CRLS extracts PCs directly from the raw data, and the PC extraction can be terminated after computing an arbitrary number of PCs without the need to estimate the whole set of PCs. Moreover, when the whole data set becomes too large to be loaded into the machine memory, CRLS-PCA can save data retrieval time by reading the data once, while the conventional PCA requires numerous data retrieval steps for both covariance matrix calculation and PC extractions. Real fMRI data were used to evaluate the PC extraction precision, computational expense, and memory usage of the presented methods.

  6. Using principal component analysis and annual seasonal trend analysis to assess karst rocky desertification in southwestern China.

    PubMed

    Zhang, Zhiming; Ouyang, Zhiyun; Xiao, Yi; Xiao, Yang; Xu, Weihua

    2017-06-01

    Increasing exploitation of karst resources is causing severe environmental degradation because of the fragility and vulnerability of karst areas. By integrating principal component analysis (PCA) with annual seasonal trend analysis (ASTA), this study assessed karst rocky desertification (KRD) within a spatial context. We first produced fractional vegetation cover (FVC) data from a moderate-resolution imaging spectroradiometer normalized difference vegetation index using a dimidiate pixel model. Then, we generated three main components of the annual FVC data using PCA. Subsequently, we generated the slope image of the annual seasonal trends of FVC using median trend analysis. Finally, we combined the three PCA components and annual seasonal trends of FVC with the incidence of KRD for each type of carbonate rock to classify KRD into one of four categories based on K-means cluster analysis: high, moderate, low, and none. The results of accuracy assessments indicated that this combination approach produced greater accuracy and more reasonable KRD mapping than the average FVC based on the vegetation coverage standard. The KRD map for 2010 indicated that the total area of KRD was 78.76 × 10 3  km 2 , which constitutes about 4.06% of the eight southwest provinces of China. The largest KRD areas were found in Yunnan province. The combined PCA and ASTA approach was demonstrated to be an easily implemented, robust, and flexible method for the mapping and assessment of KRD, which can be used to enhance regional KRD management schemes or to address assessment of other environmental issues.

  7. Competing-risks mortality after radiotherapy vs. observation for localized prostate cancer: a population-based study.

    PubMed

    Abdollah, Firas; Sun, Maxine; Schmitges, Jan; Thuret, Rodolphe; Tian, Zhe; Shariat, Shahrokh F; Briganti, Alberto; Jeldres, Claudio; Perrotte, Paul; Montorsi, Francesco; Karakiewicz, Pierre I

    2012-09-01

    Contemporary patients with localized prostate cancer (PCa) are more frequently treated with radiotherapy. However, there are limited data on the effect of this treatment on cancer-specific mortality (CSM). Our objective was to test the relationship between radiotherapy and survival in men with localized PCa and compare it with those treated with observation. A population-based cohort identified 68,797 men with cT1-T2 PCa treated with radiotherapy or observation between the years 1992 and 2005. Propensity-score matching was used to minimize potential bias related to treatment assignment. Competing-risks analyses tested the effect of treatment type (radiotherapy vs. observation) on CSM, after accounting to other-cause mortality. All analyses were carried out within PCa risk, baseline comorbidity status, and age groups. Radiotherapy was associated with more favorable 10-year CSM rates than observation in patients with high-risk PCa (8.8 vs. 14.4%, hazard ratio [HR]: 0.59, 95% confidence interval [CI]: 0.50-0.68). Conversely, the beneficial effect of radiotherapy on CSM was not evident in patients with low-intermediate risk PCa (3.7 vs. 4.1%, HR: 0.91, 95% CI: 0.80-1.04). Radiotherapy was beneficial in elderly patients (5.6 vs. 7.3%, HR: 0.70, 95% CI: 0.59-0.80). Moreover, it was associated with improved CSM rates among patients with no comorbidities (5.7 vs. 6.5%, HR: 0.81, 95% CI: 0.67-0.98), one comorbidity (4.6 vs. 6.0%, HR: 0.87, 95% CI: 0.75-0.99), and more than two comorbidities (4.2 vs. 5.0%, HR: 0.79, 95% CI: 0.65-0.96). Radiotherapy substantially improves CSM in patients with high-risk PCa, with little or no benefit in patients with low-/intermediate-risk PCa relative to observation. These findings must be interpreted within the context of the limitations of observational data. Copyright © 2012 Elsevier Inc. All rights reserved.

  8. Transcriptome alteration in Phytophthora infestans in response to phenazine-1-carboxylic acid production by Pseudomonas fluorescens strain LBUM223.

    PubMed

    Roquigny, Roxane; Novinscak, Amy; Arseneault, Tanya; Joly, David L; Filion, Martin

    2018-06-19

    Phytophthora infestans is responsible for late blight, one of the most important potato diseases. Phenazine-1-carboxylic acid (PCA)-producing Pseudomonas fluorescens strain LBUM223 isolated in our laboratory shows biocontrol potential against various plant pathogens. To characterize the effect of LBUM223 on the transcriptome of P. infestans, we conducted an in vitro time-course study. Confrontational assay was performed using P. infestans inoculated alone (control) or with LBUM223, its phzC- isogenic mutant (not producing PCA), or exogenically applied PCA. Destructive sampling was performed at 6, 9 and 12 days and the transcriptome of P. infestans was analysed using RNA-Seq. The expression of a subset of differentially expressed genes was validated by RT-qPCR. Both LBUM223 and exogenically applied PCA significantly repressed P. infestans' growth at all times. Compared to the control treatment, transcriptomic analyses showed that the percentages of all P. infestans' genes significantly altered by LBUM223 and exogenically applied PCA increased as time progressed, from 50 to 61% and from to 32 to 46%, respectively. When applying an absolute cut-off value of 3 fold change or more for all three harvesting times, 207 genes were found significantly differentially expressed by PCA, either produced by LBUM223 or exogenically applied. Gene ontology analysis revealed that both treatments altered the expression of key functional genes involved in major functions like phosphorylation mechanisms, transmembrane transport and oxidoreduction activities. Interestingly, even though no host plant tissue was present in the in vitro system, PCA also led to the overexpression of several genes encoding effectors. The mutant only slightly repressed P. infestans' growth and barely altered its transcriptome. Our study suggests that PCA is involved in P. infestans' growth repression and led to important transcriptomic changes by both up- and down-regulating gene expression in P. infestans over time. Different metabolic functions were altered and many effectors were found to be upregulated, suggesting their implication in biocontrol.

  9. 68Ga-HBED-CC-PSMA PET/CT Versus Histopathology in Primary Localized Prostate Cancer: A Voxel-Wise Comparison

    PubMed Central

    Zamboglou, Constantinos; Schiller, Florian; Fechter, Tobias; Wieser, Gesche; Jilg, Cordula Annette; Chirindel, Alin; Salman, Nasr; Drendel, Vanessa; Werner, Martin; Mix, Michael; Meyer, Philipp Tobias; Grosu, Anca Ligia

    2016-01-01

    Purpose: We performed a voxel-wise comparison of 68Ga-HBED-CC-PSMA PET/CT with prostate histopathology to evaluate the performance of 68Ga-HBED-CC-PSMA for the detection and delineation of primary prostate cancer (PCa). Methodology: Nine patients with histopathological proven primary PCa underwent 68Ga-HBED-CC-PSMA PET/CT followed by radical prostatectomy. Resected prostates were scanned by ex-vivo CT in a special localizer and histopathologically prepared. Histopathological information was matched to ex-vivo CT. PCa volume (PCa-histo) and non-PCa tissue in the prostate (NPCa-histo) were processed to obtain a PCa-model, which was adjusted to PET-resolution (histo-PET). Each histo-PET was coregistered to in-vivo PSMA-PET/CT data. Results: Analysis of spatial overlap between histo-PET and PSMA PET revealed highly significant correlations (p < 10-5) in nine patients and moderate to high coefficients of determination (R²) from 42 to 82 % with an average of 60 ± 14 % in eight patients (in one patient R2 = 7 %). Mean SUVmean in PCa-histo and NPCa-histo was 5.6 ± 6.1 and 3.3 ± 2.5 (p = 0.012). Voxel-wise receiver-operating characteristic (ROC) analyses comparing the prediction by PSMA-PET with the non-smoothed tumor distribution from histopathology yielded an average area under the curve of 0.83 ± 0.12. Absolute and relative SUV (normalized to SUVmax) thresholds for achieving at least 90 % sensitivity were 3.19 ± 3.35 and 0.28 ± 0.09, respectively. Conclusions: Voxel-wise analyses revealed good correlations of 68Ga-HBED-CC-PSMA PET/CT and histopathology in eight out of nine patients. Thus, PSMA-PET allows a reliable detection and delineation of PCa as basis for PET-guided focal therapies. PMID:27446496

  10. Combination of PCA and LORETA for sources analysis of ERP data: an emotional processing study

    NASA Astrophysics Data System (ADS)

    Hu, Jin; Tian, Jie; Yang, Lei; Pan, Xiaohong; Liu, Jiangang

    2006-03-01

    The purpose of this paper is to study spatiotemporal patterns of neuronal activity in emotional processing by analysis of ERP data. 108 pictures (categorized as positive, negative and neutral) were presented to 24 healthy, right-handed subjects while 128-channel EEG data were recorded. An analysis of two steps was applied to the ERP data. First, principal component analysis was performed to obtain significant ERP components. Then LORETA was applied to each component to localize their brain sources. The first six principal components were extracted, each of which showed different spatiotemporal patterns of neuronal activity. The results agree with other emotional study by fMRI or PET. The combination of PCA and LORETA can be used to analyze spatiotemporal patterns of ERP data in emotional processing.

  11. Comparison of prostate cancer gene 3 score, prostate health index and percentage free prostate-specific antigen for differentiating histological inflammation from prostate cancer and other non-neoplastic alterations of the prostate at initial biopsy.

    PubMed

    De Luca, Stefano; Passera, Roberto; Bollito, Enrico; Manfredi, Matteo; Scarpa, Roberto Mario; Sottile, Antonino; Randone, Donato Franco; Porpiglia, Francesco

    2014-12-01

    To determine if prostate cancer gene 3 (PCA3) score, Prostate Health Index (PHI), and percent free prostate-specific antigen (%fPSA) may be used to differentiate prostatitis from prostate cancer (PCa), benign prostatic hyperplasia (BPH) and high-grade prostate intraepithelial neoplasia (HG-PIN) in patients with elevated PSA and negative digital rectal examination (DRE). in the present prospective study, 274 patients, undergoing PCA3 score, PHI and %fPSA assessments before initial biopsy, were enrolled. Three multivariate logistic regression models were used to test PCA3 score, PHI and %fPSA as risk factors for prostatitis vs. PCa, vs. BPH, and vs. HG-PIN. All the analyses were performed for the whole patient cohort and for the 'gray zone' of PSA (4-10 ng/ml) cohort (188 individuals). The determinants for prostatitis vs. PCa were PCA3 score, PHI and %fPSA (Odds Ratio [OR]=0.97, 0.96 and 0.94, respectively). Unit increase of PHI was the only risk factor for prostatitis vs. BPH (OR=1.06), and unit increase of PCA3 score for HG-PIN vs. prostatitis (OR=0.98). In the 'gray zone' PSA cohort, the determinants for prostatitis vs. PCa were PCA3 score, PHI and %fPSA (OR=0.96, 0.94 and 0.92, respectively), PCA3 score and PHI for prostatitis vs. BPH (OR=0.96 and 1.08, respectively), and PCA3 score for prostatitis vs. HG-PIN (OR=0.97). The clinical benefit of using PCA3 score and PHI to estimate prostatitis vs. PCa was comparable; even %fPSA had good diagnostic performance, being a faster and cheaper marker. PHI was the only determinant for prostatitis vs. BPH, while PCA3 score for prostatitis vs. HG-PIN. Copyright© 2014 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.

  12. Distal radius: anatomical morphometric gender characteristics. Do anatomical pre-shaped plates pay attention on it?

    PubMed

    Oppermann, Johannes; Bredow, Jan; Beyer, Frank; Neiss, Wolfram Friedrich; Spies, Christian K; Eysel, Peer; Dargel, Jens; Wacker, Max

    2015-01-01

    The purpose of the study was to investigate differences in the osseous structure anatomy of male and female distal radii. Morphometric data were obtained of 49 distal human cadaveric radii. An imprint of the distal edge was attained using silicone mass and the palmar cortical angle (PCA) of the lateral and intermediate column, here declared as medial, according to the concept of Rikli and Rigazzoni. The lateral and medial length and five widths were digitally measured by three observers. In order to compare the measurements an unpaired t test was used. To prove the reliability of the measurements an intraclass correlation analyses was done. Overall mean medial PCA was 148.25° (SD ± 6.83) and mean lateral PCA 156.07° (SD ± 7.00). In male specimens, the mean medial PCA was 147.38° (SD ± 6.01) and mean lateral PCA was 153.6° (SD ± 6.20) whereas in female specimens, the mean medial PCA was 149.41° (SD ± 7.79) and the mean lateral PCA 159.37° (SD ± 6.78), with statistical significance for the female lateral PCA. No gender significant difference for the medial PCA and no significant side difference for the PCA's could be found. The ICC of the observers was r = 0.936 and 0.976 for the medial and for lateral PCA 0.957-0.984. The palmar cortical length of the distal radius was significantly longer in male specimens. For all widths, larger values for male radii were measured, being statistically significant in all cases. Male dimensions concerning the wide were significantly larger when compared with females. Regarding the PCA at the medial and lateral column, we found significant difference for lateral PCA concerning the gender. Overall, study results demonstrated an angle of 148.25° ± 6.83 for the medial PCA and 156.07° ± 7.00 for the lateral PCA.

  13. Protein-RNA specificity by high-throughput principal component analysis of NMR spectra.

    PubMed

    Collins, Katherine M; Oregioni, Alain; Robertson, Laura E; Kelly, Geoff; Ramos, Andres

    2015-03-31

    Defining the RNA target selectivity of the proteins regulating mRNA metabolism is a key issue in RNA biology. Here we present a novel use of principal component analysis (PCA) to extract the RNA sequence preference of RNA binding proteins. We show that PCA can be used to compare the changes in the nuclear magnetic resonance (NMR) spectrum of a protein upon binding a set of quasi-degenerate RNAs and define the nucleobase specificity. We couple this application of PCA to an automated NMR spectra recording and processing protocol and obtain an unbiased and high-throughput NMR method for the analysis of nucleobase preference in protein-RNA interactions. We test the method on the RNA binding domains of three important regulators of RNA metabolism. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  14. Hyperspectral Image Denoising Using a Nonlocal Spectral Spatial Principal Component Analysis

    NASA Astrophysics Data System (ADS)

    Li, D.; Xu, L.; Peng, J.; Ma, J.

    2018-04-01

    Hyperspectral images (HSIs) denoising is a critical research area in image processing duo to its importance in improving the quality of HSIs, which has a negative impact on object detection and classification and so on. In this paper, we develop a noise reduction method based on principal component analysis (PCA) for hyperspectral imagery, which is dependent on the assumption that the noise can be removed by selecting the leading principal components. The main contribution of paper is to introduce the spectral spatial structure and nonlocal similarity of the HSIs into the PCA denoising model. PCA with spectral spatial structure can exploit spectral correlation and spatial correlation of HSI by using 3D blocks instead of 2D patches. Nonlocal similarity means the similarity between the referenced pixel and other pixels in nonlocal area, where Mahalanobis distance algorithm is used to estimate the spatial spectral similarity by calculating the distance in 3D blocks. The proposed method is tested on both simulated and real hyperspectral images, the results demonstrate that the proposed method is superior to several other popular methods in HSI denoising.

  15. Discrimination of healthy and osteoarthritic articular cartilage by Fourier transform infrared imaging and Fisher’s discriminant analysis

    PubMed Central

    Mao, Zhi-Hua; Yin, Jian-Hua; Zhang, Xue-Xi; Wang, Xiao; Xia, Yang

    2016-01-01

    Fourier transform infrared spectroscopic imaging (FTIRI) technique can be used to obtain the quantitative information of content and spatial distribution of principal components in cartilage by combining with chemometrics methods. In this study, FTIRI combining with principal component analysis (PCA) and Fisher’s discriminant analysis (FDA) was applied to identify the healthy and osteoarthritic (OA) articular cartilage samples. Ten 10-μm thick sections of canine cartilages were imaged at 6.25μm/pixel in FTIRI. The infrared spectra extracted from the FTIR images were imported into SPSS software for PCA and FDA. Based on the PCA result of 2 principal components, the healthy and OA cartilage samples were effectively discriminated by the FDA with high accuracy of 94% for the initial samples (training set) and cross validation, as well as 86.67% for the prediction group. The study showed that cartilage degeneration became gradually weak with the increase of the depth. FTIRI combined with chemometrics may become an effective method for distinguishing healthy and OA cartilages in future. PMID:26977354

  16. Extracting spectral contrast in Landsat Thematic Mapper image data using selective principal component analysis

    USGS Publications Warehouse

    Chavez, P.S.; Kwarteng, A.Y.

    1989-01-01

    A challenge encountered with Landsat Thematic Mapper (TM) data, which includes data from size reflective spectral bands, is displaying as much information as possible in a three-image set for color compositing or digital analysis. Principal component analysis (PCA) applied to the six TM bands simultaneously is often used to address this problem. However, two problems that can be encountered using the PCA method are that information of interest might be mathematically mapped to one of the unused components and that a color composite can be difficult to interpret. "Selective' PCA can be used to minimize both of these problems. The spectral contrast among several spectral regions was mapped for a northern Arizona site using Landsat TM data. Field investigations determined that most of the spectral contrast seen in this area was due to one of the following: the amount of iron and hematite in the soils and rocks, vegetation differences, standing and running water, or the presence of gypsum, which has a higher moisture retention capability than do the surrounding soils and rocks. -from Authors

  17. Research of seafloor topographic analyses for a staged mineral exploration

    NASA Astrophysics Data System (ADS)

    Ikeda, M.; Kadoshima, K.; Koizumi, Y.; Yamakawa, T.; Asakawa, E.; Sumi, T.; Kose, M.

    2016-12-01

    J-MARES (Research and Development Partnership for Next Generation Technology of Marine Resources Survey, JAPAN) has been designing a low-cost and high-efficiency exploration system for seafloor hydrothermal massive sulfide (SMS) deposits in "Cross-ministerial Strategic Innovation Promotion Program (SIP)" granted by the Cabinet Office, Government of Japan since 2014. We proposed the multi-stage approach, which is designed from the regional scaled to the detail scaled survey stages through semi-detail scaled, focusing a prospective area by seafloor topographic analyses. We applied this method to the area of more than 100km x 100km around Okinawa Trough, including some well-known mineralized deposits. In the regional scale survey, we assume survey areas are more than 100 km x 100km. Then the spatial resolution of topography data should be bigger than 100m. The 500 m resolution data which is interpolated into 250 m resolution was used for extracting depression and performing principal component analysis (PCA) by the wavelength obtained from frequency analysis. As the result, we have successfully extracted the areas having the topographic features quite similar to well-known mineralized deposits. In the semi-local survey stage, we use the topography data obtained by bathymetric survey using multi-narrow beam echo-sounder. The 30m-resolution data was used for extracting depression, relative-large mounds, hills, lineaments as fault, and also for performing frequency analysis. As the result, wavelength as principal component constituting in the target area was extracted by PCA of wavelength obtained from frequency analysis. Therefore, color image was composited by using the second principal component (PC2) to the forth principal component (PC4) in which the continuity of specific wavelength was observed, and consistent with extracted lineaments. In addition, well-known mineralized deposits were discriminated in the same clusters by using clustering from PC2 to PC4.We applied the results described above to a new area, and successfully extract the quite similar area in vicinity to one of the well-known mineralized deposits. So we are going to verify the extracted areas by using geophysical methods, such as vertical cable seismic and time-domain EM survey, developed in this SIP project.

  18. Ethnobotanical knowledge in rural communities of Cordoba (Argentina): the importance of cultural and biogeographical factors

    PubMed Central

    2009-01-01

    Background The possibility to better understand the relationships within the men, the nature and their culture has extreme importance because allows the characterisation of social systems through their particular environmental perception, and provides useful tools for the development of conservation policies. Methods The present study was planned to disentangle environmental and cultural factors that are influencing the perception, knowledge and uses of edible and medicinal plants in rural communities of Cordoba (Argentina). Interviews an participant observation were conducted in nine rural communities located in three different biogeographical areas. Data about knowledge of medicinal and edible plants and sociocultural variables were obtained. Data were analysed by Principal Components Analysis (PCA). Results The analysis of data confirmed that medicinal species are widely used whereas the knowledge on edible plants is eroding. The PCA showed four groups of communities, defined by several particular combinations of sociocultural and/or natural variables. Conclusion This comprehensive approach suggests that in general terms the cultural environment has a stronger influence than the natural environment on the use of medicinal and edible plants in rural communities of Cordoba (Argentina). PMID:20003502

  19. Classification of M1/M2-polarized human macrophages by label-free hyperspectral reflectance confocal microscopy and multivariate analysis.

    PubMed

    Bertani, Francesca R; Mozetic, Pamela; Fioramonti, Marco; Iuliani, Michele; Ribelli, Giulia; Pantano, Francesco; Santini, Daniele; Tonini, Giuseppe; Trombetta, Marcella; Businaro, Luca; Selci, Stefano; Rainer, Alberto

    2017-08-21

    The possibility of detecting and classifying living cells in a label-free and non-invasive manner holds significant theranostic potential. In this work, Hyperspectral Imaging (HSI) has been successfully applied to the analysis of macrophagic polarization, given its central role in several pathological settings, including the regulation of tumour microenvironment. Human monocyte derived macrophages have been investigated using hyperspectral reflectance confocal microscopy, and hyperspectral datasets have been analysed in terms of M1 vs. M2 polarization by Principal Components Analysis (PCA). Following PCA, Linear Discriminant Analysis has been implemented for semi-automatic classification of macrophagic polarization from HSI data. Our results confirm the possibility to perform single-cell-level in vitro classification of M1 vs. M2 macrophages in a non-invasive and label-free manner with a high accuracy (above 98% for cells deriving from the same donor), supporting the idea of applying the technique to the study of complex interacting cellular systems, such in the case of tumour-immunity in vitro models.

  20. Fully optimized discrimination of physiological responses to auditory stimuli

    PubMed Central

    Kruglikov, Stepan Y; Chari, Sharmila; Rapp, Paul E; Weinstein, Steven L; Given, Barbara K; Schiff, Steven J

    2008-01-01

    The use of multivariate measurements to characterize brain activity (electrical, magnetic, optical) is widespread. The most common approaches to reduce the complexity of such observations include principal and independent component analyses (PCA and ICA), which are not well suited for discrimination tasks. We addressed two questions: first, how do the neurophysiological responses to elongated phonemes relate to tone and phoneme responses in normal children, and, second, how discriminable are these responses. We employed fully optimized linear discrimination analysis to maximally separate the multi-electrode responses to tones and phonemes, and classified the response to elongated phonemes. We find that discrimination between tones and phonemes is dependent upon responses from associative regions of the brain apparently distinct from the primary sensory cortices typically emphasized by PCA or ICA, and that the neuronal correlates corresponding to elongated phonemes are highly variable in normal children (about half respond with neural correlates of tones and half as phonemes). Our approach is made feasible by the increase in computational power of ordinary personal computers and has significant advantages for a wide range of neuronal imaging modalities. PMID:18430975

  1. Variability in chemical composition of Vitis vinifera cv Mencía from different geographic areas and vintages in Ribeira Sacra (NW Spain).

    PubMed

    Vilanova, M; Rodríguez, I; Canosa, P; Otero, I; Gamero, E; Moreno, D; Talaverano, I; Valdés, E

    2015-02-15

    A chemical study was conducted from 2009 to 2012 to examine spatial and seasonal variability of red Vitis vinifera Mencía located in different geographic areas (Amandi, Chantada, Quiroga-Bibei, Ribeiras do Sil and Ribeiras do Miño) from NW Spain. Mencía samples were analysed for phenolic, (flavan-3-ols, flavonols, anthocyanins, acids and resveratrol), nitrogen (TAC, TAN, YAN and TAS) and volatiles compounds (alcohols, C6 compounds, ethyl esters, terpenes, aldehydes, acids, lactones, volatile phenols and carbonyl compounds) by GC-MS and HPLC. Results showed that the composition of Mencía cultivar was more affected by the vintage than the geographic area. The amino acid composition was less affected by both geographic origin and vintage, showing more varietal stability. Application of Principal Component Analysis (PCA) to experimental data showed a good separation of Mencía grape according to geographical origin and vintages. PCA also showed high correlations between the ripening ratio and C6 compounds, resveratrol and carbonyl compounds. Copyright © 2014 Elsevier Ltd. All rights reserved.

  2. Non-targeted metabolomic approach reveals urinary metabolites linked to steroid biosynthesis pathway after ingestion of citrus juice.

    PubMed

    Medina, S; Ferreres, F; García-Viguera, C; Horcajada, M N; Orduna, J; Savirón, M; Zurek, G; Martínez-Sanz, J M; Gil, J I; Gil-Izquierdo, A

    2013-01-15

    Citrus juice intake has been highlighted because of its health-promoting effects. LC-MS based metabolomics approaches are applied to obtain a better knowledge on changes in the concentration of metabolites due to its dietary intake and allow a better understanding of involved metabolic pathways. Eight volunteers daily consumed 400 mL of juice for four consecutive days and urine samples were collected before intake and 24h after each citrus juice intake. Urine samples were analysed by nanoHPLC-q-TOF, followed by principal component analysis (PCA) and Student's t-test (p<0.05). PCA showed a separation between two groups (before and after citrus juice consumption). This approach allowed the identification of four endocrine compounds (tetrahydroaldosterone-3-glucuronide, cortolone-3-glucuronide, testosterone-glucuronide and 17-hydroxyprogesterone), which belonged to the steroid biosynthesis pathway as significant metabolites upregulated by citrus juice intake. Additionally, these results confirmed the importance of using the non-targeted metabolomics technique to identify new endogenous metabolites, up- or down-regulated as a consequence of food intake. Copyright © 2012 Elsevier Ltd. All rights reserved.

  3. Controls on the chemistry of runoff from an upland peat catchment

    NASA Astrophysics Data System (ADS)

    Worrall, Fred; Burt, Tim; Adamson, John

    2003-07-01

    This study uses 2 years of data from a detailed weekly water sampling programme in a 11·4 km2 upland peat catchment in the Northern Pennines, UK. The sampling comprised precipitation, soil-water samples and a number of streams, including the basin outlet. Samples were analysed for: pH, conductivity, alkalinity, Na, K, Ca, Mg, Fe, Al, Total N, SO4, Cl and colour. Principal component analysis (PCA) was used to identify end-members and compositional trends in order to identify controls on the development of water composition. The study showed that the direct use of PCA had several advantages over the use of end-member mixing analysis (EMMA) as it combines an analysis of mixing and evolving waters without the assumption of having to know the compositional sources of the water. In its application to an upland peat catchment, the study supports the view that shallow throughflow at the catotelm/acrotelm boundary is responsible for storm runoff generation and shows that baseflow is controlled by cation exchange in the catotelm and mixing with a base-rich groundwater.

  4. Spike sorting based upon machine learning algorithms (SOMA).

    PubMed

    Horton, P M; Nicol, A U; Kendrick, K M; Feng, J F

    2007-02-15

    We have developed a spike sorting method, using a combination of various machine learning algorithms, to analyse electrophysiological data and automatically determine the number of sampled neurons from an individual electrode, and discriminate their activities. We discuss extensions to a standard unsupervised learning algorithm (Kohonen), as using a simple application of this technique would only identify a known number of clusters. Our extra techniques automatically identify the number of clusters within the dataset, and their sizes, thereby reducing the chance of misclassification. We also discuss a new pre-processing technique, which transforms the data into a higher dimensional feature space revealing separable clusters. Using principal component analysis (PCA) alone may not achieve this. Our new approach appends the features acquired using PCA with features describing the geometric shapes that constitute a spike waveform. To validate our new spike sorting approach, we have applied it to multi-electrode array datasets acquired from the rat olfactory bulb, and from the sheep infero-temporal cortex, and using simulated data. The SOMA sofware is available at http://www.sussex.ac.uk/Users/pmh20/spikes.

  5. Environmental effects on the shape variation of male ultraviolet patterns in the Brimstone butterfly ( Gonepteryx rhamni, Pieridae, Lepidoptera)

    NASA Astrophysics Data System (ADS)

    Pecháček, Pavel; Stella, David; Keil, Petr; Kleisner, Karel

    2014-12-01

    The males of the Brimstone butterfly ( Gonepteryx rhamni) have ultraviolet pattern on the dorsal surfaces of their wings. Using geometric morphometrics, we have analysed correlations between environmental variables (climate, productivity) and shape variability of the ultraviolet pattern and the forewing in 110 male specimens of G. rhamni collected in the Palaearctic zone. To start with, we subjected the environmental variables to principal component analysis (PCA). The first PCA axis (precipitation, temperature, latitude) significantly correlated with shape variation of the ultraviolet patterns across the Palaearctic. Additionally, we have performed two-block partial least squares (PLS) analysis to assess co-variation between intraspecific shape variation and the variation of 11 environmental variables. The first PLS axis explained 93 % of variability and represented the effect of precipitation, temperature and latitude. Along this axis, we observed a systematic increase in the relative area of ultraviolet colouration with increasing temperature and precipitation and decreasing latitude. We conclude that the shape variation of ultraviolet patterns on the forewings of male Brimstones is correlated with large-scale environmental factors.

  6. Assessment of resuspended matter and redistribution of macronutrient elements produced by boat disturbance in a eutrophic lagoon.

    PubMed

    Lenzi, Mauro; Finoia, Maria Grazia; Gennaro, Paola; Mercatali, Isabel; Persia, Emma; Solari, Jacopo; Porrello, Salvatore

    2013-07-15

    Harvesting of macroalgae by specially equipped boats in a shallow eutrophic lagoon produces evident sediment resuspension. To outline the environmental effects of this disturbance, we examined the quantity of fall-out and the distances travelled by sediment and macronutrients from the source of boat disturbance. Resuspended sediment fall-out (RSFO) was trapped at different distances from the boat path to determine total dry weight, total nitrogen (TN), total carbon (TC), total organic carbon (TOC), total sulphur (TS) and total phosphorus (TP). The data was analysed by principal components analysis (PCA) and linear discriminant analysis (LDA) on PCA factors. Fall-out of C, N, S and P from the plume of resuspended sediment indicated significant re-arrangement of these nutrients: RSFO dry weight and S content decreased with distance from the boat path, whereas TP increased and was the variable responsible for most discrimination at 100 m. The mass of resuspended matter was relatively large, indicating that the boats considerably reshuffle lagoon sediment. Copyright © 2013 Elsevier Ltd. All rights reserved.

  7. Improved accuracy of quantitative parameter estimates in dynamic contrast-enhanced CT study with low temporal resolution

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kim, Sun Mo, E-mail: Sunmo.Kim@rmp.uhn.on.ca; Haider, Masoom A.; Jaffray, David A.

    Purpose: A previously proposed method to reduce radiation dose to patient in dynamic contrast-enhanced (DCE) CT is enhanced by principal component analysis (PCA) filtering which improves the signal-to-noise ratio (SNR) of time-concentration curves in the DCE-CT study. The efficacy of the combined method to maintain the accuracy of kinetic parameter estimates at low temporal resolution is investigated with pixel-by-pixel kinetic analysis of DCE-CT data. Methods: The method is based on DCE-CT scanning performed with low temporal resolution to reduce the radiation dose to the patient. The arterial input function (AIF) with high temporal resolution can be generated with a coarselymore » sampled AIF through a previously published method of AIF estimation. To increase the SNR of time-concentration curves (tissue curves), first, a region-of-interest is segmented into squares composed of 3 × 3 pixels in size. Subsequently, the PCA filtering combined with a fraction of residual information criterion is applied to all the segmented squares for further improvement of their SNRs. The proposed method was applied to each DCE-CT data set of a cohort of 14 patients at varying levels of down-sampling. The kinetic analyses using the modified Tofts’ model and singular value decomposition method, then, were carried out for each of the down-sampling schemes between the intervals from 2 to 15 s. The results were compared with analyses done with the measured data in high temporal resolution (i.e., original scanning frequency) as the reference. Results: The patients’ AIFs were estimated to high accuracy based on the 11 orthonormal bases of arterial impulse responses established in the previous paper. In addition, noise in the images was effectively reduced by using five principal components of the tissue curves for filtering. Kinetic analyses using the proposed method showed superior results compared to those with down-sampling alone; they were able to maintain the accuracy in the quantitative histogram parameters of volume transfer constant [standard deviation (SD), 98th percentile, and range], rate constant (SD), blood volume fraction (mean, SD, 98th percentile, and range), and blood flow (mean, SD, median, 98th percentile, and range) for sampling intervals between 10 and 15 s. Conclusions: The proposed method of PCA filtering combined with the AIF estimation technique allows low frequency scanning for DCE-CT study to reduce patient radiation dose. The results indicate that the method is useful in pixel-by-pixel kinetic analysis of DCE-CT data for patients with cervical cancer.« less

  8. Soy Consumption and the Risk of Prostate Cancer: An Updated Systematic Review and Meta-Analysis

    PubMed Central

    Ranard, Katherine M.; Jeon, Sookyoung; Erdman, John W.

    2018-01-01

    Prostate cancer (PCa) is the second most commonly diagnosed cancer in men, accounting for 15% of all cancers in men worldwide. Asian populations consume soy foods as part of a regular diet, which may contribute to the lower PCa incidence observed in these countries. This meta-analysis provides a comprehensive updated analysis that builds on previously published meta-analyses, demonstrating that soy foods and their isoflavones (genistein and daidzein) are associated with a lower risk of prostate carcinogenesis. Thirty articles were included for analysis of the potential impacts of soy food intake, isoflavone intake, and circulating isoflavone levels, on both primary and advanced PCa. Total soy food (p < 0.001), genistein (p = 0.008), daidzein (p = 0.018), and unfermented soy food (p < 0.001) intakes were significantly associated with a reduced risk of PCa. Fermented soy food intake, total isoflavone intake, and circulating isoflavones were not associated with PCa risk. Neither soy food intake nor circulating isoflavones were associated with advanced PCa risk, although very few studies currently exist to examine potential associations. Combined, this evidence from observational studies shows a statistically significant association between soy consumption and decreased PCa risk. Further studies are required to support soy consumption as a prophylactic dietary approach to reduce PCa carcinogenesis. PMID:29300347

  9. Tumour-derived alkaline phosphatase regulates tumour growth, epithelial plasticity and disease-free survival in metastatic prostate cancer

    PubMed Central

    Rao, S R; Snaith, A E; Marino, D; Cheng, X; Lwin, S T; Orriss, I R; Hamdy, F C; Edwards, C M

    2017-01-01

    Background: Recent evidence suggests that bone-related parameters are the main prognostic factors for overall survival in advanced prostate cancer (PCa), with elevated circulating levels of alkaline phosphatase (ALP) thought to reflect the dysregulated bone formation accompanying distant metastases. We have identified that PCa cells express ALPL, the gene that encodes for tissue nonspecific ALP, and hypothesised that tumour-derived ALPL may contribute to disease progression. Methods: Functional effects of ALPL inhibition were investigated in metastatic PCa cell lines. ALPL gene expression was analysed from published PCa data sets, and correlated with disease-free survival and metastasis. Results: ALPL expression was increased in PCa cells from metastatic sites. A reduction in tumour-derived ALPL expression or ALP activity increased cell death, mesenchymal-to-epithelial transition and reduced migration. Alkaline phosphatase activity was decreased by the EMT repressor Snail. In men with PCa, tumour-derived ALPL correlated with EMT markers, and high ALPL expression was associated with a significant reduction in disease-free survival. Conclusions: Our studies reveal the function of tumour-derived ALPL in regulating cell death and epithelial plasticity, and demonstrate a strong association between ALPL expression in PCa cells and metastasis or disease-free survival, thus identifying tumour-derived ALPL as a major contributor to the pathogenesis of PCa progression. PMID:28006818

  10. Study of support vector machine and serum surface-enhanced Raman spectroscopy for noninvasive esophageal cancer detection

    NASA Astrophysics Data System (ADS)

    Li, Shao-Xin; Zeng, Qiu-Yao; Li, Lin-Fang; Zhang, Yan-Jiao; Wan, Ming-Ming; Liu, Zhi-Ming; Xiong, Hong-Lian; Guo, Zhou-Yi; Liu, Song-Hao

    2013-02-01

    The ability of combining serum surface-enhanced Raman spectroscopy (SERS) with support vector machine (SVM) for improving classification esophageal cancer patients from normal volunteers is investigated. Two groups of serum SERS spectra based on silver nanoparticles (AgNPs) are obtained: one group from patients with pathologically confirmed esophageal cancer (n=30) and the other group from healthy volunteers (n=31). Principal components analysis (PCA), conventional SVM (C-SVM) and conventional SVM combination with PCA (PCA-SVM) methods are implemented to classify the same spectral dataset. Results show that a diagnostic accuracy of 77.0% is acquired for PCA technique, while diagnostic accuracies of 83.6% and 85.2% are obtained for C-SVM and PCA-SVM methods based on radial basis functions (RBF) models. The results prove that RBF SVM models are superior to PCA algorithm in classification serum SERS spectra. The study demonstrates that serum SERS in combination with SVM technique has great potential to provide an effective and accurate diagnostic schema for noninvasive detection of esophageal cancer.

  11. Type 2 diabetes mellitus: distribution of genetic markers in Kazakh population.

    PubMed

    Sikhayeva, Nurgul; Talzhanov, Yerkebulan; Iskakova, Aisha; Dzharmukhanov, Jarkyn; Nugmanova, Raushan; Zholdybaeva, Elena; Ramanculov, Erlan

    2018-01-01

    Ethnic differences exist in the frequencies of genetic variations that contribute to the risk of common disease. This study aimed to analyse the distribution of several genes, previously associated with susceptibility to type 2 diabetes and obesity-related phenotypes, in a Kazakh population. A total of 966 individuals belonging to the Kazakh ethnicity were recruited from an outpatient clinic. We genotyped 41 common single nucleotide polymorphisms (SNPs) previously associated with type 2 diabetes in other ethnic groups and 31 of these were in Hardy-Weinberg equilibrium. The obtained allele frequencies were further compared to publicly available data from other ethnic populations. Allele frequencies for other (compared) populations were pooled from the haplotype map (HapMap) database. Principal component analysis (PCA), cluster analysis, and multidimensional scaling (MDS) were used for the analysis of genetic relationship between the populations. Comparative analysis of allele frequencies of the studied SNPs showed significant differentiation among the studied populations. The Kazakh population was grouped with Asian populations according to the cluster analysis and with the Caucasian populations according to PCA. According to MDS, results of the current study show that the Kazakh population holds an intermediate position between Caucasian and Asian populations. A high percentage of population differentiation was observed between Kazakh and world populations. The Kazakh population was clustered with Caucasian populations, and this result may indicate a significant Caucasian component in the Kazakh gene pool.

  12. Multivariate analysis of chromatographic retention data as a supplementary means for grouping structurally related compounds.

    PubMed

    Fasoula, S; Zisi, Ch; Sampsonidis, I; Virgiliou, Ch; Theodoridis, G; Gika, H; Nikitas, P; Pappa-Louisi, A

    2015-03-27

    In the present study a series of 45 metabolite standards belonging to four chemically similar metabolite classes (sugars, amino acids, nucleosides and nucleobases, and amines) was subjected to LC analysis on three HILIC columns under 21 different gradient conditions with the aim to explore whether the retention properties of these analytes are determined from the chemical group they belong. Two multivariate techniques, principal component analysis (PCA) and discriminant analysis (DA), were used for statistical evaluation of the chromatographic data and extraction similarities between chemically related compounds. The total variance explained by the first two principal components of PCA was found to be about 98%, whereas both statistical analyses indicated that all analytes are successfully grouped in four clusters of chemical structure based on the retention obtained in four or at least three chromatographic runs, which, however should be performed on two different HILIC columns. Moreover, leave-one-out cross-validation of the above retention data set showed that the chemical group in which an analyte belongs can be 95.6% correctly predicted when the analyte is subjected to LC analysis under the same four or three experimental conditions as the all set of analytes was run beforehand. That, in turn, may assist with disambiguation of analyte identification in complex biological extracts. Copyright © 2015 Elsevier B.V. All rights reserved.

  13. Coordination pattern of baseball pitching among young pitchers of various ages and velocity levels.

    PubMed

    Chen, Hsiu-Hui; Liu, Chiang; Yang, Wen-Wen

    2016-09-01

    This study compared the whole-body movement coordination of pitching among 72 baseball players of various ages and velocity levels. Participants were classified as senior, junior, and little according to their age, with each group comprising 24 players. The velocity levels of the high-velocity (the top eight) and low-velocity (the lowest eight) groups were classified according to their pitching velocity. During pitching, the coordinates of 15 markers attached to the major joints of the whole-body movement system were collected for analysis. Sixteen kinematic parameters were calculated to compare the groups and velocity levels. Principal component analysis (PCA) was conducted to quantify the coordination pattern of pitching movement. The results were as follows: (1) five position and two velocity parameters significantly differed among the age groups, and two position and one velocity parameters significantly differed between the high- and low-velocity groups. (2) The coordination patterns of pitching movement could be described using three components, of which the eigenvalues and contents varied according to age and velocity level. In conclusion, the senior and junior players showed greater elbow angular velocity, whereas the little players exhibited a wider shoulder angle only at the beginning of pitching. The players with high velocity exhibited higher trunk and shoulder rotation velocity. The variations among groups found using PCA and kinematics parameter analyses were consistent.

  14. Mini-DIAL system measurements coupled with multivariate data analysis to identify TIC and TIM simulants: preliminary absorption database analysis.

    NASA Astrophysics Data System (ADS)

    Gaudio, P.; Malizia, A.; Gelfusa, M.; Martinelli, E.; Di Natale, C.; Poggi, L. A.; Bellecci, C.

    2017-01-01

    Nowadays Toxic Industrial Components (TICs) and Toxic Industrial Materials (TIMs) are one of the most dangerous and diffuse vehicle of contamination in urban and industrial areas. The academic world together with the industrial and military one are working on innovative solutions to monitor the diffusion in atmosphere of such pollutants. In this phase the most common commercial sensors are based on “point detection” technology but it is clear that such instruments cannot satisfy the needs of the smart cities. The new challenge is developing stand-off systems to continuously monitor the atmosphere. Quantum Electronics and Plasma Physics (QEP) research group has a long experience in laser system development and has built two demonstrators based on DIAL (Differential Absorption of Light) technology could be able to identify chemical agents in atmosphere. In this work the authors will present one of those DIAL system, the miniaturized one, together with the preliminary results of an experimental campaign conducted on TICs and TIMs simulants in cell with aim of use the absorption database for the further atmospheric an analysis using the same DIAL system. The experimental results are analysed with standard multivariate data analysis technique as Principal Component Analysis (PCA) to develop a classification model aimed at identifying organic chemical compound in atmosphere. The preliminary results of absorption coefficients of some chemical compound are shown together pre PCA analysis.

  15. Classification and source determination of medium petroleum distillates by chemometric and artificial neural networks: a self organizing feature approach.

    PubMed

    Mat-Desa, Wan N S; Ismail, Dzulkiflee; NicDaeid, Niamh

    2011-10-15

    Three different medium petroleum distillate (MPD) products (white spirit, paint brush cleaner, and lamp oil) were purchased from commercial stores in Glasgow, Scotland. Samples of 10, 25, 50, 75, 90, and 95% evaporated product were prepared, resulting in 56 samples in total which were analyzed using gas chromatography-mass spectrometry. Data sets from the chromatographic patterns were examined and preprocessed for unsupervised multivariate analyses using principal component analysis (PCA), hierarchical cluster analysis (HCA), and a self organizing feature map (SOFM) artificial neural network. It was revealed that data sets comprised of higher boiling point hydrocarbon compounds provided a good means for the classification of the samples and successfully linked highly weathered samples back to their unevaporated counterpart in every case. The classification abilities of SOFM were further tested and validated for their predictive abilities where one set of weather data in each case was withdrawn from the sample set and used as a test set of the retrained network. This revealed SOFM to be an outstanding mechanism for sample discrimination and linkage over the more conventional PCA and HCA methods often suggested for such data analysis. SOFM also has the advantage of providing additional information through the evaluation of component planes facilitating the investigation of underlying variables that account for the classification. © 2011 American Chemical Society

  16. Identification of More Feasible MicroRNA-mRNA Interactions within Multiple Cancers Using Principal Component Analysis Based Unsupervised Feature Extraction.

    PubMed

    Taguchi, Y-H

    2016-05-10

    MicroRNA(miRNA)-mRNA interactions are important for understanding many biological processes, including development, differentiation and disease progression, but their identification is highly context-dependent. When computationally derived from sequence information alone, the identification should be verified by integrated analyses of mRNA and miRNA expression. The drawback of this strategy is the vast number of identified interactions, which prevents an experimental or detailed investigation of each pair. In this paper, we overcome this difficulty by the recently proposed principal component analysis (PCA)-based unsupervised feature extraction (FE), which reduces the number of identified miRNA-mRNA interactions that properly discriminate between patients and healthy controls without losing biological feasibility. The approach is applied to six cancers: hepatocellular carcinoma, non-small cell lung cancer, esophageal squamous cell carcinoma, prostate cancer, colorectal/colon cancer and breast cancer. In PCA-based unsupervised FE, the significance does not depend on the number of samples (as in the standard case) but on the number of features, which approximates the number of miRNAs/mRNAs. To our knowledge, we have newly identified miRNA-mRNA interactions in multiple cancers based on a single common (universal) criterion. Moreover, the number of identified interactions was sufficiently small to be sequentially curated by literature searches.

  17. Spectral discrimination of serum from liver cancer and liver cirrhosis using Raman spectroscopy

    NASA Astrophysics Data System (ADS)

    Yang, Tianyue; Li, Xiaozhou; Yu, Ting; Sun, Ruomin; Li, Siqi

    2011-07-01

    In this paper, Raman spectra of human serum were measured using Raman spectroscopy, then the spectra was analyzed by multivariate statistical methods of principal component analysis (PCA). Then linear discriminant analysis (LDA) was utilized to differentiate the loading score of different diseases as the diagnosing algorithm. Artificial neural network (ANN) was used for cross-validation. The diagnosis sensitivity and specificity by PCA-LDA are 88% and 79%, while that of the PCA-ANN are 89% and 95%. It can be seen that modern analyzing method is a useful tool for the analysis of serum spectra for diagnosing diseases.

  18. Accuracy of the prostate health index versus the urinary prostate cancer antigen 3 score to predict overall and significant prostate cancer at initial biopsy.

    PubMed

    Seisen, Thomas; Rouprêt, Morgan; Brault, Didier; Léon, Priscilla; Cancel-Tassin, Géraldine; Compérat, Eva; Renard-Penna, Raphaële; Mozer, Pierre; Guechot, Jérome; Cussenot, Olivier

    2015-01-01

    It remains unclear whether the Prostate Health Index (PHI) or the urinary Prostate-Cancer Antigen 3 (PCA-3) score is more accurate at screening for prostate cancer (PCa). The aim of this study was to prospectively compare the accuracy of PHI and PCA-3 scores to predict overall and significant PCa in men undergoing an initial prostate biopsy. Double-blind assessments of PHI and PCA-3 were conducted by referent physicians in 138 patients who subsequently underwent trans-rectal ultrasound-guided prostate biopsy according to a 12-core scheme. Predictive accuracies of PHI and PCA-3 were assessed using AUC and compared according to the DeLong method. Diagnostic performances with usual cut-off values for positivity (i.e., PHI >40 and PCA-3 >35) were calculated, and odds ratios associated with predicting PCa overall and significant PCa as defined by pathological updated Epstein criteria (i.e., Gleason score ≥7, more than three positive cores, or >50% cancer involvement in any core) were estimated using logistic regression. Prevalences of overall and significant PCa were 44.9% and 28.3%, respectively. PCA-3 (AUC = 0.71) was the most accurate predictor of PCa overall, and significantly outperformed PHI (AUC = 0.65; P = 0.03). However, PHI (AUC = 0.80) remained the most accurate predictor when screening exclusively for significant PCa and significantly outperformed PCA-3 (AUC = 0.55; P = 0.03). Furthermore, PCA-3 >35 had the best accuracy, and positive or negative predictive values when screening for PCa overall whereas these diagnostic performances were greater for PHI >40 when exclusively screening for significant PCa. PHI > 40 combined with PCA-3 > 35 was more specific in both cases. In multivariate analyses, PCA-3 >35 (OR = 5.68; 95%CI = [2.21-14.59]; P < 0.001) was significantly correlated with the presence of PCa overall, but PHI >40 (OR = 9.60; 95%CI = [1.72-91.32]; P = 0.001) was the only independent predictor for detecting significant PCa. Although PCA-3 score is the best predictor for PCa overall at initial biopsy, our findings strongly indicate that PHI should be used for population-based screening to avoid over-diagnosis of indolent tumors that are unlikely to cause death. © 2014 Wiley Periodicals, Inc.

  19. InterFace: A software package for face image warping, averaging, and principal components analysis.

    PubMed

    Kramer, Robin S S; Jenkins, Rob; Burton, A Mike

    2017-12-01

    We describe InterFace, a software package for research in face recognition. The package supports image warping, reshaping, averaging of multiple face images, and morphing between faces. It also supports principal components analysis (PCA) of face images, along with tools for exploring the "face space" produced by PCA. The package uses a simple graphical user interface, allowing users to perform these sophisticated image manipulations without any need for programming knowledge. The program is available for download in the form of an app, which requires that users also have access to the (freely available) MATLAB Runtime environment.

  20. Optimization of critical medium components using response surface methodology for phenazine-1-carboxylic acid production by Pseudomonas sp. M-18Q.

    PubMed

    Yuan, Li-Li; Li, Ya-Qian; Wang, Yi; Zhang, Xue-Hong; Xu, Yu-Quan

    2008-03-01

    The optimal flask-shaking batch fermentation medium for phenazine-1-carboxylic acid (PCA) production by Pseudomonas sp. M-18Q, a qscR chromosomal inactivated mutant of the strain M18 was studied using statistical experimental design and analysis. The Plackett-Burman design (PBD) was used to evaluate the effects of eight medium components on the production of PCA, which showed that glucose and soytone were the most significant ingredients (P<0.05). The steepest ascent experiment was adopted to determine the optimal region of the medium composition. The optimum composition of the fermentation medium for maximum PCA yield, as determined on the basis of a five-level two-factor central composite design (CCD), was obtained by response surface methodology (RSM). The high correlation between the predicted and observed values indicated the validity of the model. A maximum PCA yield of 1240 mg/l was obtained at 17.81 g/l glucose and 11.47 g/l soytone, and the production was increased by 65.3% compared with that using the original medium, which was at 750 mg/l.

  1. Application of Principal Component Analysis to NIR Spectra of Phyllosilicates: A Technique for Identifying Phyllosilicates on Mars

    NASA Technical Reports Server (NTRS)

    Rampe, E. B.; Lanza, N. L.

    2012-01-01

    Orbital near-infrared (NIR) reflectance spectra of the martian surface from the OMEGA and CRISM instruments have identified a variety of phyllosilicates in Noachian terrains. The types of phyllosilicates present on Mars have important implications for the aqueous environments in which they formed, and, thus, for recognizing locales that may have been habitable. Current identifications of phyllosilicates from martian NIR data are based on the positions of spectral absorptions relative to laboratory data of well-characterized samples and from spectral ratios; however, some phyllosilicates can be difficult to distinguish from one another with these methods (i.e. illite vs. muscovite). Here we employ a multivariate statistical technique, principal component analysis (PCA), to differentiate between spectrally similar phyllosilicate minerals. PCA is commonly used in a variety of industries (pharmaceutical, agricultural, viticultural) to discriminate between samples. Previous work using PCA to analyze raw NIR reflectance data from mineral mixtures has shown that this is a viable technique for identifying mineral types, abundances, and particle sizes. Here, we evaluate PCA of second-derivative NIR reflectance data as a method for classifying phyllosilicates and test whether this method can be used to identify phyllosilicates on Mars.

  2. Receptor modeling for source apportionment of polycyclic aromatic hydrocarbons in urban atmosphere.

    PubMed

    Singh, Kunwar P; Malik, Amrita; Kumar, Ranjan; Saxena, Puneet; Sinha, Sarita

    2008-01-01

    This study reports source apportionment of polycyclic aromatic hydrocarbons (PAHs) in particulate depositions on vegetation foliages near highway in the urban environment of Lucknow city (India) using the principal components analysis/absolute principal components scores (PCA/APCS) receptor modeling approach. The multivariate method enables identification of major PAHs sources along with their quantitative contributions with respect to individual PAH. The PCA identified three major sources of PAHs viz. combustion, vehicular emissions, and diesel based activities. The PCA/APCS receptor modeling approach revealed that the combustion sources (natural gas, wood, coal/coke, biomass) contributed 19-97% of various PAHs, vehicular emissions 0-70%, diesel based sources 0-81% and other miscellaneous sources 0-20% of different PAHs. The contributions of major pyrolytic and petrogenic sources to the total PAHs were 56 and 42%, respectively. Further, the combustion related sources contribute major fraction of the carcinogenic PAHs in the study area. High correlation coefficient (R2 > 0.75 for most PAHs) between the measured and predicted concentrations of PAHs suggests for the applicability of the PCA/APCS receptor modeling approach for estimation of source contribution to the PAHs in particulates.

  3. Comparison of discrete Fourier transform (DFT) and principal component analysis/DFT as forecasting tools for absorbance time series received by UV-visible probes installed in urban sewer systems.

    PubMed

    Plazas-Nossa, Leonardo; Torres, Andrés

    2014-01-01

    The objective of this work is to introduce a forecasting method for UV-Vis spectrometry time series that combines principal component analysis (PCA) and discrete Fourier transform (DFT), and to compare the results obtained with those obtained by using DFT. Three time series for three different study sites were used: (i) Salitre wastewater treatment plant (WWTP) in Bogotá; (ii) Gibraltar pumping station in Bogotá; and (iii) San Fernando WWTP in Itagüí (in the south part of Medellín). Each of these time series had an equal number of samples (1051). In general terms, the results obtained are hardly generalizable, as they seem to be highly dependent on specific water system dynamics; however, some trends can be outlined: (i) for UV range, DFT and PCA/DFT forecasting accuracy were almost the same; (ii) for visible range, the PCA/DFT forecasting procedure proposed gives systematically lower forecasting errors and variability than those obtained with the DFT procedure; and (iii) for short forecasting times the PCA/DFT procedure proposed is more suitable than the DFT procedure, according to processing times obtained.

  4. Analyzing brain networks with PCA and conditional Granger causality.

    PubMed

    Zhou, Zhenyu; Chen, Yonghong; Ding, Mingzhou; Wright, Paul; Lu, Zuhong; Liu, Yijun

    2009-07-01

    Identifying directional influences in anatomical and functional circuits presents one of the greatest challenges for understanding neural computations in the brain. Granger causality mapping (GCM) derived from vector autoregressive models of data has been employed for this purpose, revealing complex temporal and spatial dynamics underlying cognitive processes. However, the traditional GCM methods are computationally expensive, as signals from thousands of voxels within selected regions of interest (ROIs) are individually processed, and being based on pairwise Granger causality, they lack the ability to distinguish direct from indirect connectivity among brain regions. In this work a new algorithm called PCA based conditional GCM is proposed to overcome these problems. The algorithm implements the following two procedures: (i) dimensionality reduction in ROIs of interest with principle component analysis (PCA), and (ii) estimation of the direct causal influences in local brain networks, using conditional Granger causality. Our results show that the proposed method achieves greater accuracy in detecting network connectivity than the commonly used pairwise Granger causality method. Furthermore, the use of PCA components in conjunction with conditional GCM greatly reduces the computational cost relative to the use of individual voxel time series. Copyright 2009 Wiley-Liss, Inc

  5. Chemical Characterization and Determination of the Anti-Oxidant Capacity of Two Brown Algae with Respect to Sampling Season and Morphological Structures Using Infrared Spectroscopy and Multivariate Analyses.

    PubMed

    Beratto, Angelo; Agurto, Cristian; Freer, Juanita; Peña-Farfal, Carlos; Troncoso, Nicolás; Agurto, Andrés; Castillo, Rosario Del P

    2017-10-01

    Brown algae biomass has been shown to be a highly important industrial source for the production of alginates and different nutraceutical products. The characterization of this biomass is necessary in order to allocate its use to specific applications according to the chemical and biological characteristics of this highly variable resource. The methods commonly used for algae characterization require a long time for the analysis and rigorous pretreatments of samples. In this work, nondestructive and fast analyses of different morphological structures from Lessonia spicata and Macrocystis pyrifera, which were collected during different seasons, were performed using Fourier transform infrared (FT-IR) techniques in combination with chemometric methods. Mid-infrared (IR) and near-infrared (NIR) spectral ranges were tested to evaluate the spectral differences between the species, seasons, and morphological structures of algae using a principal component analysis (PCA). Quantitative analyses of the polyphenol and alginate contents and the anti-oxidant capacity of the samples were performed using partial least squares (PLS) with both spectral ranges in order to build a predictive model for the rapid quantification of these parameters with industrial purposes. The PCA mainly showed differences in the samples based on seasonal sampling, where changes were observed in the bands corresponding to polysaccharides, proteins, and lipids. The obtained PLS models had high correlation coefficients (r) for the polyphenol content and anti-oxidant capacity (r > 0.9) and lower values for the alginate determination (0.7 < r < 0.8). Fourier transform infrared-based techniques were suitable tools for the rapid characterization of algae biomass, in which high variability in the samples was incorporated for the qualitative and quantitative analyses, and have the potential to be used on an industrial scale.

  6. Non-invasive urinary metabolomic profiling discriminates prostate cancer from benign prostatic hyperplasia.

    PubMed

    Pérez-Rambla, Clara; Puchades-Carrasco, Leonor; García-Flores, María; Rubio-Briones, José; López-Guerrero, José Antonio; Pineda-Lucena, Antonio

    2017-01-01

    Prostate cancer (PCa) is one of the most common malignancies in men worldwide. Serum prostate specific antigen (PSA) level has been extensively used as a biomarker to detect PCa. However, PSA is not cancer-specific and various non-malignant conditions, including benign prostatic hyperplasia (BPH), can cause a rise in PSA blood levels, thus leading to many false positive results. In this study, we evaluated the potential of urinary metabolomic profiling for discriminating PCa from BPH. Urine samples from 64 PCa patients and 51 individuals diagnosed with BPH were analysed using 1 H nuclear magnetic resonance ( 1 H-NMR). Comparative analysis of urinary metabolomic profiles was carried out using multivariate and univariate statistical approaches. The urine metabolomic profile of PCa patients is characterised by increased concentrations of branched-chain amino acids (BCAA), glutamate and pseudouridine, and decreased concentrations of glycine, dimethylglycine, fumarate and 4-imidazole-acetate compared with individuals diagnosed with BPH. PCa patients have a specific urinary metabolomic profile. The results of our study underscore the clinical potential of metabolomic profiling to uncover metabolic changes that could be useful to discriminate PCa from BPH in a clinical context.

  7. Multimethod Prediction of Physical Parent-Child Aggression Risk in Expectant Mothers and Fathers with Social Information Processing Theory

    PubMed Central

    Rodriguez, Christina M.; Smith, Tamika L.; Silvia, Paul J.

    2015-01-01

    The Social Information Processing (SIP) model postulates that parents undergo a series of stages in implementing physical discipline that can escalate into physical child abuse. The current study utilized a multimethod approach to investigate whether SIP factors can predict risk of parent-child aggression (PCA) in a diverse sample of expectant mothers and fathers. SIP factors of PCA attitudes, negative child attributions, reactivity, and empathy were considered as potential predictors of PCA risk; additionally, analyses considered whether personal history of PCA predicted participants’ own PCA risk through its influence on their attitudes and attributions. Findings indicate that, for both mothers and fathers, history influenced attitudes but not attributions in predicting PCA risk, and attitudes and attributions predicted PCA risk; empathy and reactivity predicted negative child attributions for expectant mothers, but only reactivity significantly predicted attributions for expectant fathers. Path models for expectant mothers and fathers were remarkably similar. Overall, the findings provide support for major aspects of the SIP model. Continued work is needed in studying the progression of these factors across time for both mothers and fathers as well as the inclusion of other relevant ecological factors to the SIP model. PMID:26631420

  8. Activation of Beta-Catenin Signaling in Androgen Receptor–Negative Prostate Cancer Cells

    PubMed Central

    Wan, Xinhai; Liu, Jie; Lu, Jing-Fang; Tzelepi, Vassiliki; Yang, Jun; Starbuck, Michael W.; Diao, Lixia; Wang, Jing; Efstathiou, Eleni; Vazquez, Elba S.; Troncoso, Patricia; Maity, Sankar N.; Navone, Nora M.

    2012-01-01

    Purpose To study Wnt/beta-catenin in castrate-resistant prostate cancer (CRPC) and understand its function independently of the beta-catenin–androgen receptor (AR) interaction. Experimental Design We performed beta-catenin immunocytochemical analysis, evaluated TOP-flash reporter activity (a reporter of beta-catenin–mediated transcription), and sequenced the beta-catenin gene in MDA PCa 118a, MDA PCa 118b, MDA PCa 2b, and PC-3 prostate cancer (PCa) cells. We knocked down beta-catenin in AR-negative MDA PCa 118b cells and performed comparative gene-array analysis. We also immunohistochemically analyzed beta-catenin and AR in 27 bone metastases of human CRPCs. Results Beta-catenin nuclear accumulation and TOP-flash reporter activity were high in MDA PCa 118b but not in MDA PCa 2b or PC-3 cells. MDA PCa 118a and 118b cells carry a mutated beta-catenin at codon 32 (D32G). Ten genes were expressed differently (false discovery rate, 0.05) in MDA PCa 118b cells with downregulated beta-catenin. One such gene, hyaluronan synthase 2 (HAS2), synthesizes hyaluronan, a core component of the extracellular matrix. We confirmed HAS2 upregulation in PC-3 cells transfected with D32G-mutant beta-catenin. Finally, we found nuclear localization of beta-catenin in 10 of 27 human tissue specimens; this localization was inversely associated with AR expression (P = 0.056, Fisher’s exact test), suggesting that reduced AR expression enables Wnt/beta-catenin signaling. Conclusion We identified a previously unknown downstream target of beta-catenin, HAS2, in PCa, and found that high beta-catenin nuclear localization and low or no AR expression may define a subpopulation of men with bone-metastatic PCa. These findings may guide physicians in managing these patients. PMID:22298898

  9. Cerebrospinal fluid cytokines in the diagnosis of bacterial meningitis in infants.

    PubMed

    Srinivasan, Lakshmi; Kilpatrick, Laurie; Shah, Samir S; Abbasi, Soraya; Harris, Mary C

    2016-10-01

    Bacterial meningitis poses diagnostic challenges in infants. Antibiotic pretreatment and low bacterial density diminish cerebrospinal fluid (CSF) culture yield, while laboratory parameters do not reliably identify bacterial meningitis. Pro and anti-inflammatory cytokines are elevated in bacterial meningitis and may be useful diagnostic adjuncts when CSF cultures are negative. In a prospective cohort study of infants, we used cytometric bead arrays to measure tumor necrosis factor alpha (TNF-α), interleukin 1 (IL-1), IL-6, IL-8, IL-10, and IL-12 in CSF. Receiver operating characteristic (ROC) analyses and Principal component analysis (PCA) were used to determine cytokine combinations that identified bacterial meningitis. Six hundred and eighty four infants < 6 mo were included; 11 had culture-proven bacterial meningitis. IL-6 and IL-10 were the individual cytokines possessing greatest accuracy in diagnosis of culture proven bacterial meningitis (ROC analyses; area under the concentration-time curve (AUC) 0.91; 0.9103 respectively), and performed as well as, or better than combinations identified using ROC and PCA. CSF cytokines were highly correlated with each other and with CSF white blood cell count (WBC) counts in infants with meningitis. A subset of antibiotic pretreated culture-negative subjects demonstrated cytokine patterns similar to culture positive subjects. CSF cytokine levels may aid diagnosis of bacterial meningitis, and facilitate decision-making regarding treatment for culture negative meningitis.

  10. Mutational Landscape of Candidate Genes in Familial Prostate Cancer

    PubMed Central

    Johnson, Anna M.; Zuhlke, Kimberly A.; Plotts, Chris; McDonnell, Shannon K.; Middha, Sumit; Riska, Shaun M.; Thibodeau, Stephen N.; Douglas, Julie A.; Cooney, Kathleen A.

    2014-01-01

    Background Family history is a major risk factor for prostate cancer (PCa), suggesting a genetic component to the disease. However, traditional linkage and association studies have failed to fully elucidate the underlying genetic basis of familial PCa. Methods Here we use a candidate gene approach to identify potential PCa susceptibility variants in whole exome sequencing data from familial PCa cases. Six hundred ninety-seven candidate genes were identified based on function, location near a known chromosome 17 linkage signal, and/or previous association with prostate or other cancers. Single nucleotide variants (SNVs) in these candidate genes were identified in whole exome sequence data from 33 PCa cases from 11 multiplex PCa families (3 cases/family). Results Overall, 4856 candidate gene SNVs were identified, including 1052 missense and 10 nonsense variants. Twenty missense variants were shared by all 3 family members in each family in which they were observed. Additionally, 15 missense variants were shared by 2 of 3 family members and predicted to be deleterious by 5 different algorithms. Four missense variants, BLM Gln123Arg, PARP2 Arg283Gln, LRCC46 Ala295Thr and KIF2B Pro91Leu, and 1 nonsense variant, CYP3A43 Arg441Ter, showed complete co-segregation with PCa status. Twelve additional variants displayed partial co-segregation with PCa. Conclusions Forty-three nonsense and shared, missense variants were identified in our candidate genes. Further research is needed to determine the contribution of these variants to PCa susceptibility. PMID:25111073

  11. Older marijuana users: Life stressors and perceived social support.

    PubMed

    Choi, Namkee G; DiNitto, Diana M; Marti, C Nathan

    2016-12-01

    Given increasing numbers of older-adult marijuana users, this study examined the association of marijuana use and marijuana use disorder with life stressors and perceived social support in the 50+ age group. Data came from the 2012-2013 National Epidemiologic Survey on Alcohol and Related Conditions (N=14,715 respondents aged 50+). Life stressors were measured with 12 items related to interpersonal, legal, and financial problems and being a crime victim. Perceived social support was measured with the 12-item Interpersonal Support Evaluation List. Using principal component analysis (PCA), we identified four components of life stressors. Linear regression analyses was used to test associations of past-year marijuana use and use disorder with PCA scores of each component and perceived social support. Of the 50+ age group, 3.89% were past-year marijuana users and 0.68% had marijuana use disorder. Marijuana users, especially those with marijuana use disorder (17.54% of past-year users), had high rates of mental and other substance use disorders. Controlling for other potential risk factors for stress, including health status and mental and other substance use disorders, marijuana use and use disorder were still significantly associated with more life stressors and lower perceived social support, possibly from low levels of social integration. A substantial proportion of older-adult marijuana users need help with mental health and substance use problems. Further examination of older marijuana users' life stressors and social support networks may aid in developing more systematic intervention strategies to address needs and reduce marijuana use. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  12. Epileptic seizure detection in EEG signal with GModPCA and support vector machine.

    PubMed

    Jaiswal, Abeg Kumar; Banka, Haider

    2017-01-01

    Epilepsy is one of the most common neurological disorders caused by recurrent seizures. Electroencephalograms (EEGs) record neural activity and can detect epilepsy. Visual inspection of an EEG signal for epileptic seizure detection is a time-consuming process and may lead to human error; therefore, recently, a number of automated seizure detection frameworks were proposed to replace these traditional methods. Feature extraction and classification are two important steps in these procedures. Feature extraction focuses on finding the informative features that could be used for classification and correct decision-making. Therefore, proposing effective feature extraction techniques for seizure detection is of great significance. Principal Component Analysis (PCA) is a dimensionality reduction technique used in different fields of pattern recognition including EEG signal classification. Global modular PCA (GModPCA) is a variation of PCA. In this paper, an effective framework with GModPCA and Support Vector Machine (SVM) is presented for epileptic seizure detection in EEG signals. The feature extraction is performed with GModPCA, whereas SVM trained with radial basis function kernel performed the classification between seizure and nonseizure EEG signals. Seven different experimental cases were conducted on the benchmark epilepsy EEG dataset. The system performance was evaluated using 10-fold cross-validation. In addition, we prove analytically that GModPCA has less time and space complexities as compared to PCA. The experimental results show that EEG signals have strong inter-sub-pattern correlations. GModPCA and SVM have been able to achieve 100% accuracy for the classification between normal and epileptic signals. Along with this, seven different experimental cases were tested. The classification results of the proposed approach were better than were compared the results of some of the existing methods proposed in literature. It is also found that the time and space complexities of GModPCA are less as compared to PCA. This study suggests that GModPCA and SVM could be used for automated epileptic seizure detection in EEG signal.

  13. Gabor-based kernel PCA with fractional power polynomial models for face recognition.

    PubMed

    Liu, Chengjun

    2004-05-01

    This paper presents a novel Gabor-based kernel Principal Component Analysis (PCA) method by integrating the Gabor wavelet representation of face images and the kernel PCA method for face recognition. Gabor wavelets first derive desirable facial features characterized by spatial frequency, spatial locality, and orientation selectivity to cope with the variations due to illumination and facial expression changes. The kernel PCA method is then extended to include fractional power polynomial models for enhanced face recognition performance. A fractional power polynomial, however, does not necessarily define a kernel function, as it might not define a positive semidefinite Gram matrix. Note that the sigmoid kernels, one of the three classes of widely used kernel functions (polynomial kernels, Gaussian kernels, and sigmoid kernels), do not actually define a positive semidefinite Gram matrix either. Nevertheless, the sigmoid kernels have been successfully used in practice, such as in building support vector machines. In order to derive real kernel PCA features, we apply only those kernel PCA eigenvectors that are associated with positive eigenvalues. The feasibility of the Gabor-based kernel PCA method with fractional power polynomial models has been successfully tested on both frontal and pose-angled face recognition, using two data sets from the FERET database and the CMU PIE database, respectively. The FERET data set contains 600 frontal face images of 200 subjects, while the PIE data set consists of 680 images across five poses (left and right profiles, left and right half profiles, and frontal view) with two different facial expressions (neutral and smiling) of 68 subjects. The effectiveness of the Gabor-based kernel PCA method with fractional power polynomial models is shown in terms of both absolute performance indices and comparative performance against the PCA method, the kernel PCA method with polynomial kernels, the kernel PCA method with fractional power polynomial models, the Gabor wavelet-based PCA method, and the Gabor wavelet-based kernel PCA method with polynomial kernels.

  14. COMPADRE: an R and web resource for pathway activity analysis by component decompositions.

    PubMed

    Ramos-Rodriguez, Roberto-Rafael; Cuevas-Diaz-Duran, Raquel; Falciani, Francesco; Tamez-Peña, Jose-Gerardo; Trevino, Victor

    2012-10-15

    The analysis of biological networks has become essential to study functional genomic data. Compadre is a tool to estimate pathway/gene sets activity indexes using sub-matrix decompositions for biological networks analyses. The Compadre pipeline also includes one of the direct uses of activity indexes to detect altered gene sets. For this, the gene expression sub-matrix of a gene set is decomposed into components, which are used to test differences between groups of samples. This procedure is performed with and without differentially expressed genes to decrease false calls. During this process, Compadre also performs an over-representation test. Compadre already implements four decomposition methods [principal component analysis (PCA), Isomaps, independent component analysis (ICA) and non-negative matrix factorization (NMF)], six statistical tests (t- and f-test, SAM, Kruskal-Wallis, Welch and Brown-Forsythe), several gene sets (KEGG, BioCarta, Reactome, GO and MsigDB) and can be easily expanded. Our simulation results shown in Supplementary Information suggest that Compadre detects more pathways than over-representation tools like David, Babelomics and Webgestalt and less false positives than PLAGE. The output is composed of results from decomposition and over-representation analyses providing a more complete biological picture. Examples provided in Supplementary Information show the utility, versatility and simplicity of Compadre for analyses of biological networks. Compadre is freely available at http://bioinformatica.mty.itesm.mx:8080/compadre. The R package is also available at https://sourceforge.net/p/compadre.

  15. Systematic study of anharmonic features in a principal component analysis of gramicidin A.

    PubMed

    Kurylowicz, Martin; Yu, Ching-Hsing; Pomès, Régis

    2010-02-03

    We use principal component analysis (PCA) to detect functionally interesting collective motions in molecular-dynamics simulations of membrane-bound gramicidin A. We examine the statistical and structural properties of all PCA eigenvectors and eigenvalues for the backbone and side-chain atoms. All eigenvalue spectra show two distinct power-law scaling regimes, quantitatively separating large from small covariance motions. Time trajectories of the largest PCs converge to Gaussian distributions at long timescales, but groups of small-covariance PCs, which are usually ignored as noise, have subdiffusive distributions. These non-Gaussian distributions imply anharmonic motions on the free-energy surface. We characterize the anharmonic components of motion by analyzing the mean-square displacement for all PCs. The subdiffusive components reveal picosecond-scale oscillations in the mean-square displacement at frequencies consistent with infrared measurements. In this regime, the slowest backbone mode exhibits tilting of the peptide planes, which allows carbonyl oxygen atoms to provide surrogate solvation for water and cation transport in the channel lumen. Higher-frequency modes are also apparent, and we describe their vibrational spectra. Our findings expand the utility of PCA for quantifying the essential features of motion on the anharmonic free-energy surface made accessible by atomistic molecular-dynamics simulations. Copyright (c) 2010 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  16. Dihedral angle principal component analysis of molecular dynamics simulations.

    PubMed

    Altis, Alexandros; Nguyen, Phuong H; Hegger, Rainer; Stock, Gerhard

    2007-06-28

    It has recently been suggested by Mu et al. [Proteins 58, 45 (2005)] to use backbone dihedral angles instead of Cartesian coordinates in a principal component analysis of molecular dynamics simulations. Dihedral angles may be advantageous because internal coordinates naturally provide a correct separation of internal and overall motion, which was found to be essential for the construction and interpretation of the free energy landscape of a biomolecule undergoing large structural rearrangements. To account for the circular statistics of angular variables, a transformation from the space of dihedral angles {phi(n)} to the metric coordinate space {x(n)=cos phi(n),y(n)=sin phi(n)} was employed. To study the validity and the applicability of the approach, in this work the theoretical foundations underlying the dihedral angle principal component analysis (dPCA) are discussed. It is shown that the dPCA amounts to a one-to-one representation of the original angle distribution and that its principal components can readily be characterized by the corresponding conformational changes of the peptide. Furthermore, a complex version of the dPCA is introduced, in which N angular variables naturally lead to N eigenvalues and eigenvectors. Applying the methodology to the construction of the free energy landscape of decaalanine from a 300 ns molecular dynamics simulation, a critical comparison of the various methods is given.

  17. Dihedral angle principal component analysis of molecular dynamics simulations

    NASA Astrophysics Data System (ADS)

    Altis, Alexandros; Nguyen, Phuong H.; Hegger, Rainer; Stock, Gerhard

    2007-06-01

    It has recently been suggested by Mu et al. [Proteins 58, 45 (2005)] to use backbone dihedral angles instead of Cartesian coordinates in a principal component analysis of molecular dynamics simulations. Dihedral angles may be advantageous because internal coordinates naturally provide a correct separation of internal and overall motion, which was found to be essential for the construction and interpretation of the free energy landscape of a biomolecule undergoing large structural rearrangements. To account for the circular statistics of angular variables, a transformation from the space of dihedral angles {φn} to the metric coordinate space {xn=cosφn,yn=sinφn} was employed. To study the validity and the applicability of the approach, in this work the theoretical foundations underlying the dihedral angle principal component analysis (dPCA) are discussed. It is shown that the dPCA amounts to a one-to-one representation of the original angle distribution and that its principal components can readily be characterized by the corresponding conformational changes of the peptide. Furthermore, a complex version of the dPCA is introduced, in which N angular variables naturally lead to N eigenvalues and eigenvectors. Applying the methodology to the construction of the free energy landscape of decaalanine from a 300ns molecular dynamics simulation, a critical comparison of the various methods is given.

  18. The development of summary components for the Disablement in the Physically Active scale in collegiate athletes.

    PubMed

    Houston, Megan N; Hoch, Johanna M; Van Lunen, Bonnie L; Hoch, Matthew C

    2015-11-01

    The Disablement in the Physically Active scale (DPA) is a generic patient-reported outcome designed to evaluate constructs of disability in physically active populations. The purpose of this study was to analyze the DPA scale structure for summary components. Four hundred and fifty-six collegiate athletes completed a demographic form and the DPA. A principal component analysis (PCA) was conducted with oblique rotation. Factors with eigenvalues >1 that explained >5 % of the variance were retained. The PCA revealed a two-factor structure consistent with paradigms used to develop the original DPA. Items 1-12 loaded on Factors 1 and Items 13-16 loaded on Factor 2. Items 1-12 pertain to impairment, activity limitations, and participation restrictions. Items 13-16 address psychosocial and emotional well-being. Consideration of item content suggested Factor 1 concerned physical function, while Factor 2 concerned mental well-being. Thus, items clustered around Factor 1 and 2 were identified as physical (DPA-PSC) and mental (DPA-MSC) summary components, respectively. Together, the factors accounted for 65.1 % of the variance. The PCA revealed a two-factor structure for the DPA that resulted in DPA-PSC and DPA-MSC. Analyzing the DPA as separate constructs may provide distinct information that could help to prescribe treatment and rehabilitation strategies.

  19. Identification and apportionment of hazardous elements in the sediments in the Yangtze River estuary.

    PubMed

    Wang, Jiawei; Liu, Ruimin; Wang, Haotian; Yu, Wenwen; Xu, Fei; Shen, Zhenyao

    2015-12-01

    In this study, positive matrix factorization (PMF) and principal components analysis (PCA) were combined to identify and apportion pollution-based sources of hazardous elements in the surface sediments in the Yangtze River estuary (YRE). Source identification analysis indicated that PC1, including Al, Fe, Mn, Cr, Ni, As, Cu, and Zn, can be defined as a sewage component; PC2, including Pb and Sb, can be considered as an atmospheric deposition component; and PC3, containing Cd and Hg, can be considered as an agricultural nonpoint component. To better identify the sources and quantitatively apportion the concentrations to their sources, eight sources were identified with PMF: agricultural/industrial sewage mixed (18.6 %), mining wastewater (15.9 %), agricultural fertilizer (14.5 %), atmospheric deposition (12.8 %), agricultural nonpoint (10.6 %), industrial wastewater (9.8 %), marine activity (9.0 %), and nickel plating industry (8.8 %). Overall, the hazardous element content seems to be more connected to anthropogenic activity instead of natural sources. The PCA results laid the foundation for the PMF analysis by providing a general classification of sources. PMF resolves more factors with a higher explained variance than PCA; PMF provided both the internal analysis and the quantitative analysis. The combination of the two methods can provide more reasonable and reliable results.

  20. Performance evaluation of PCA-based spike sorting algorithms.

    PubMed

    Adamos, Dimitrios A; Kosmidis, Efstratios K; Theophilidis, George

    2008-09-01

    Deciphering the electrical activity of individual neurons from multi-unit noisy recordings is critical for understanding complex neural systems. A widely used spike sorting algorithm is being evaluated for single-electrode nerve trunk recordings. The algorithm is based on principal component analysis (PCA) for spike feature extraction. In the neuroscience literature it is generally assumed that the use of the first two or most commonly three principal components is sufficient. We estimate the optimum PCA-based feature space by evaluating the algorithm's performance on simulated series of action potentials. A number of modifications are made to the open source nev2lkit software to enable systematic investigation of the parameter space. We introduce a new metric to define clustering error considering over-clustering more favorable than under-clustering as proposed by experimentalists for our data. Both the program patch and the metric are available online. Correlated and white Gaussian noise processes are superimposed to account for biological and artificial jitter in the recordings. We report that the employment of more than three principal components is in general beneficial for all noise cases considered. Finally, we apply our results to experimental data and verify that the sorting process with four principal components is in agreement with a panel of electrophysiology experts.

  1. Application of Hyperspectral Imaging and Chemometric Calibrations for Variety Discrimination of Maize Seeds

    PubMed Central

    Zhang, Xiaolei; Liu, Fei; He, Yong; Li, Xiaoli

    2012-01-01

    Hyperspectral imaging in the visible and near infrared (VIS-NIR) region was used to develop a novel method for discriminating different varieties of commodity maize seeds. Firstly, hyperspectral images of 330 samples of six varieties of maize seeds were acquired using a hyperspectral imaging system in the 380–1,030 nm wavelength range. Secondly, principal component analysis (PCA) and kernel principal component analysis (KPCA) were used to explore the internal structure of the spectral data. Thirdly, three optimal wavelengths (523, 579 and 863 nm) were selected by implementing PCA directly on each image. Then four textural variables including contrast, homogeneity, energy and correlation were extracted from gray level co-occurrence matrix (GLCM) of each monochromatic image based on the optimal wavelengths. Finally, several models for maize seeds identification were established by least squares-support vector machine (LS-SVM) and back propagation neural network (BPNN) using four different combinations of principal components (PCs), kernel principal components (KPCs) and textural features as input variables, respectively. The recognition accuracy achieved in the PCA-GLCM-LS-SVM model (98.89%) was the most satisfactory one. We conclude that hyperspectral imaging combined with texture analysis can be implemented for fast classification of different varieties of maize seeds. PMID:23235456

  2. Soil Components in Heterogeneous Impact Glass in Martian Meteorite EETA79001

    NASA Technical Reports Server (NTRS)

    Schrader, C. M.; Cohen, B. A.; Donovan, J. J.; Vicenzi, E. P.

    2010-01-01

    Martian soil composition can illuminate past and ongoing near-surface processes such as impact gardening [2] and hydrothermal and volcanic activity [3,4]. Though the Mars Exploration Rovers (MER) have analyzed the major-element composition of Martian soils, no soil samples have been returned to Earth for detailed chemical analysis. Rao et al. [1] suggested that Martian meteorite EETA79001 contains melted Martian soil in its impact glass (Lithology C) based on sulfur enrichment of Lithology C relative to the meteorite s basaltic lithologies (A and B) [1,2]. If true, it may be possible to extract detailed soil chemical analyses using this meteoritic sample. We conducted high-resolution (0.3 m/pixel) element mapping of Lithology C in thin section EETA79001,18 by energy dispersive spectrometry (EDS). We use these data for principal component analysis (PCA).

  3. Evaluating motion processing algorithms for use with functional near-infrared spectroscopy data from young children.

    PubMed

    Delgado Reyes, Lourdes M; Bohache, Kevin; Wijeakumar, Sobanawartiny; Spencer, John P

    2018-04-01

    Motion artifacts are often a significant component of the measured signal in functional near-infrared spectroscopy (fNIRS) experiments. A variety of methods have been proposed to address this issue, including principal components analysis (PCA), correlation-based signal improvement (CBSI), wavelet filtering, and spline interpolation. The efficacy of these techniques has been compared using simulated data; however, our understanding of how these techniques fare when dealing with task-based cognitive data is limited. Brigadoi et al. compared motion correction techniques in a sample of adult data measured during a simple cognitive task. Wavelet filtering showed the most promise as an optimal technique for motion correction. Given that fNIRS is often used with infants and young children, it is critical to evaluate the effectiveness of motion correction techniques directly with data from these age groups. This study addresses that problem by evaluating motion correction algorithms implemented in HomER2. The efficacy of each technique was compared quantitatively using objective metrics related to the physiological properties of the hemodynamic response. Results showed that targeted PCA (tPCA), spline, and CBSI retained a higher number of trials. These techniques also performed well in direct head-to-head comparisons with the other approaches using quantitative metrics. The CBSI method corrected many of the artifacts present in our data; however, this approach produced sometimes unstable HRFs. The targeted PCA and spline methods proved to be the most robust, performing well across all comparison metrics. When compared head to head, tPCA consistently outperformed spline. We conclude, therefore, that tPCA is an effective technique for correcting motion artifacts in fNIRS data from young children.

  4. Using both principal component analysis and reduced rank regression to study dietary patterns and diabetes in Chinese adults.

    PubMed

    Batis, Carolina; Mendez, Michelle A; Gordon-Larsen, Penny; Sotres-Alvarez, Daniela; Adair, Linda; Popkin, Barry

    2016-02-01

    We examined the association between dietary patterns and diabetes using the strengths of two methods: principal component analysis (PCA) to identify the eating patterns of the population and reduced rank regression (RRR) to derive a pattern that explains the variation in glycated Hb (HbA1c), homeostasis model assessment of insulin resistance (HOMA-IR) and fasting glucose. We measured diet over a 3 d period with 24 h recalls and a household food inventory in 2006 and used it to derive PCA and RRR dietary patterns. The outcomes were measured in 2009. Adults (n 4316) from the China Health and Nutrition Survey. The adjusted odds ratio for diabetes prevalence (HbA1c≥6·5 %), comparing the highest dietary pattern score quartile with the lowest, was 1·26 (95 % CI 0·76, 2·08) for a modern high-wheat pattern (PCA; wheat products, fruits, eggs, milk, instant noodles and frozen dumplings), 0·76 (95 % CI 0·49, 1·17) for a traditional southern pattern (PCA; rice, meat, poultry and fish) and 2·37 (95 % CI 1·56, 3·60) for the pattern derived with RRR. By comparing the dietary pattern structures of RRR and PCA, we found that the RRR pattern was also behaviourally meaningful. It combined the deleterious effects of the modern high-wheat pattern (high intakes of wheat buns and breads, deep-fried wheat and soya milk) with the deleterious effects of consuming the opposite of the traditional southern pattern (low intakes of rice, poultry and game, fish and seafood). Our findings suggest that using both PCA and RRR provided useful insights when studying the association of dietary patterns with diabetes.

  5. Using both Principal Component Analysis and Reduced Rank Regression to Study Dietary Patterns and Diabetes in Chinese Adults

    PubMed Central

    Batis, Carolina; Mendez, Michelle A.; Gordon-Larsen, Penny; Sotres-Alvarez, Daniela; Adair, Linda; Popkin, Barry

    2014-01-01

    Objective We examined the association between dietary patterns and diabetes using the strengths of two methods: principal component analysis (PCA) to identify the eating patterns of the population and reduced rank regression (RRR) to derive a pattern that explains the variation in hemoglobin A1c (HbA1c), homeostasis model of insulin resistance (HOMA-IR), and fasting glucose. Design We measured diet over a 3-day period with 24-hour recalls and a household food inventory in 2006 and used it to derive PCA and RRR dietary patterns. The outcomes were measured in 2009. Setting Adults (n = 4,316) from the China Health and Nutrition Survey. Results The adjusted odds ratio for diabetes prevalence (HbA1c ≥ 6.5%), comparing the highest dietary pattern score quartile to the lowest, was 1.26 (0.76, 2.08) for a modern high-wheat pattern (PCA; wheat products, fruits, eggs, milk, instant noodles and frozen dumplings), 0.76 (0.49, 1.17) for a traditional southern pattern (PCA; rice, meat, poultry, and fish), and 2.37 (1.56, 3.60) for the pattern derived with RRR. By comparing the dietary pattern structures of RRR and PCA, we found that the RRR pattern was also behaviorally meaningful. It combined the deleterious effects of the modern high-wheat (high intake of wheat buns and breads, deep-fried wheat, and soy milk) with the deleterious effects of consuming the opposite of the traditional southern (low intake of rice, poultry and game, fish and seafood). Conclusions Our findings suggest that using both PCA and RRR provided useful insights when studying the association of dietary patterns with diabetes. PMID:26784586

  6. Pectus Carinatum Evaluation Questionnaire (PCEQ): a novel tool to improve the follow-up in patients treated with brace compression.

    PubMed

    Pessanha, Inês; Severo, Milton; Correia-Pinto, Jorge; Estevão-Costa, José; Henriques-Coelho, Tiago

    2016-03-01

    A questionnaire (Pectus Carinatum Evaluation Questionnaire, PCEQ) was developed to be applied in follow-up of patients with Pectus Carinatum (PC). After validation of the PCEQ, we aimed to quantify the compliance to brace compression and to assess factors that could influence this treatment in patients with PC. From July 2008 to July 2014, 56 patients with PC were treated with the Calgary Protocol of compressive bracing at Paediatric Surgery Department of Hospital São João. Forty patients (71%) completed the questionnaire. The PCEQ was divided into four sections: (i) compliance; (ii) symptoms; (iii) social influence; (iv) activities. For the validation process of the PCEQ, principal components analysis (PCA), orthogonal varimax or oblimin rotation and Cronbach's α coefficient were used. To evaluate the association between compliance and other sections of the questionnaire, we estimated the Pearson's correlation between compliance factor scores ('Compliance Days' and 'Compliance Hours') and the final score of each new questionnaire component identified by PCA ('Chest Pain', 'Dyspnoea', 'Back Pain', 'Parents' Influence', 'Friends' Influence', 'Activities', 'Time To Compliance'). For the sections 'Symptoms', 'Social Influence' and 'Activities', we estimated final scores as the sum of the questions that constitute each component. For the section 'Compliance', the factor scores were estimated by the regression method. After PCA analysis, the PCEQ found nine different components with high reliability. When analysing the compliance of our study group, the final score for 'Activities' revealed a significant correlation with the factor score for 'Compliance Hours' (r = 0.382, P = 0.015). The final score for 'Time To Compliance' showed a significant correlation with both factor scores for 'Compliance Hours' (r = -0.765, P < 0.001) and 'Compliance Days' (r = -0.345, P < 0.029). The PCEQ seems to be an important tool to follow up patients with PC treated by brace compression. Practical steps, such as developing a tight schedule in the early follow-up period or applying the PCEQ in first visits after initiating brace therapy, can be taken in order to increase compliance with brace therapy and improve the quality of life. © The Author 2015. Published by Oxford University Press on behalf of the European Association for Cardio-Thoracic Surgery. All rights reserved.

  7. Effect of age at onset on cortical thickness and cognition in posterior cortical atrophy

    PubMed Central

    Suárez-González, Aida; Lehmann, Manja; Shakespeare, Timothy J.; Yong, Keir X.X.; Paterson, Ross W.; Slattery, Catherine F.; Foulkes, Alexander J.M.; Rabinovici, Gil D.; Gil-Néciga, Eulogio; Roldán-Lora, Florinda; Schott, Jonathan M.; Fox, Nick C.; Crutch, Sebastian J.

    2016-01-01

    Age at onset (AAO) has been shown to influence the phenotype of Alzheimer’s disease (AD), but how it affects atypical presentations of AD remains unknown. Posterior cortical atrophy (PCA) is the most common form of atypical AD. In this study, we aimed to investigate the effect of AAO on cortical thickness and cognitive function in 98 PCA patients. We used Freesurfer (v5.3.0) to compare cortical thickness with AAO both as a continuous variable, and by dichotomizing the groups based on median age (58 years). In both the continuous and dichotomized analyses, we found a pattern suggestive of thinner cortex in precuneus and parietal areas in earlier-onset PCA, and lower cortical thickness in anterior cingulate and prefrontal cortex in later-onset PCA. These cortical thickness differences between PCA subgroups were consistent with earlier-onset PCA patients performing worse on cognitive tests involving parietal functions. Our results provide a suggestion that AAO may not only affect the clinico-anatomical characteristics in AD but may also affect atrophy patterns and cognition within atypical AD phenotypes. PMID:27318138

  8. Spatial assessment of water quality using chemometrics in the Pearl River Estuary, China

    NASA Astrophysics Data System (ADS)

    Wu, Meilin; Wang, Youshao; Dong, Junde; Sun, Fulin; Wang, Yutu; Hong, Yiguo

    2017-03-01

    A cruise was commissioned in the summer of 2009 to evaluate water quality in the Pearl River Estuary (PRE). Chemometrics such as Principal Component Analysis (PCA), Cluster analysis (CA) and Self-Organizing Map (SOM) were employed to identify anthropogenic and natural influences on estuary water quality. The scores of stations in the surface layer in the first principal component (PC1) were related to NH4-N, PO4-P, NO2-N, NO3-N, TP, and Chlorophyll a while salinity, turbidity, and SiO3-Si in the second principal component (PC2). Similarly, the scores of stations in the bottom layers in PC1 were related to PO4-P, NO2-N, NO3-N, and TP, while salinity, Chlorophyll a, NH4-N, and SiO3-Si in PC2. Results of the PCA identified the spatial distribution of the surface and bottom water quality, namely the Guangzhou urban reach, Middle reach, and Lower reach of the estuary. Both cluster analysis and PCA produced the similar results. Self-organizing map delineated the Guangzhou urban reach of the Pearl River that was mainly influenced by human activities. The middle and lower reaches of the PRE were mainly influenced by the waters in the South China Sea. The information extracted by PCA, CA, and SOM would be very useful to regional agencies in developing a strategy to carry out scientific plans for resource use based on marine system functions.

  9. A composite measure to explore visual disability in primary progressive multiple sclerosis.

    PubMed

    Poretto, Valentina; Petracca, Maria; Saiote, Catarina; Mormina, Enricomaria; Howard, Jonathan; Miller, Aaron; Lublin, Fred D; Inglese, Matilde

    2017-01-01

    Optical coherence tomography (OCT) and magnetic resonance imaging (MRI) can provide complementary information on visual system damage in multiple sclerosis (MS). The objective of this paper is to determine whether a composite OCT/MRI score, reflecting cumulative damage along the entire visual pathway, can predict visual deficits in primary progressive multiple sclerosis (PPMS). Twenty-five PPMS patients and 20 age-matched controls underwent neuro-ophthalmologic evaluation, spectral-domain OCT, and 3T brain MRI. Differences between groups were assessed by univariate general linear model and principal component analysis (PCA) grouped instrumental variables into main components. Linear regression analysis was used to assess the relationship between low-contrast visual acuity (LCVA), OCT/MRI-derived metrics and PCA-derived composite scores. PCA identified four main components explaining 80.69% of data variance. Considering each variable independently, LCVA 1.25% was significantly predicted by ganglion cell-inner plexiform layer (GCIPL) thickness, thalamic volume and optic radiation (OR) lesion volume (adjusted R 2 0.328, p  = 0.00004; adjusted R 2 0.187, p  = 0.002 and adjusted R 2 0.180, p  = 0.002). The PCA composite score of global visual pathway damage independently predicted both LCVA 1.25% (adjusted R 2 value 0.361, p  = 0.00001) and LCVA 2.50% (adjusted R 2 value 0.323, p  = 0.00003). A multiparametric score represents a more comprehensive and effective tool to explain visual disability than a single instrumental metric in PPMS.

  10. Health status monitoring for ICU patients based on locally weighted principal component analysis.

    PubMed

    Ding, Yangyang; Ma, Xin; Wang, Youqing

    2018-03-01

    Intelligent status monitoring for critically ill patients can help medical stuff quickly discover and assess the changes of disease and then make appropriate treatment strategy. However, general-type monitoring model now widely used is difficult to adapt the changes of intensive care unit (ICU) patients' status due to its fixed pattern, and a more robust, efficient and fast monitoring model should be developed to the individual. A data-driven learning approach combining locally weighted projection regression (LWPR) and principal component analysis (PCA) is firstly proposed and applied to monitor the nonlinear process of patients' health status in ICU. LWPR is used to approximate the complex nonlinear process with local linear models, in which PCA could be further applied to status monitoring, and finally a global weighted statistic will be acquired for detecting the possible abnormalities. Moreover, some improved versions are developed, such as LWPR-MPCA and LWPR-JPCA, which also have superior performance. Eighteen subjects were selected from the Physiobank's Multi-parameter Intelligent Monitoring for Intensive Care II (MIMIC II) database, and two vital signs of each subject were chosen for online monitoring. The proposed method was compared with several existing methods including traditional PCA, Partial least squares (PLS), just in time learning combined with modified PCA (L-PCA), and Kernel PCA (KPCA). The experimental results demonstrated that the mean fault detection rate (FDR) of PCA can be improved by 41.7% after adding LWPR. The mean FDR of LWPR-MPCA was increased by 8.3%, compared with the latest reported method L-PCA. Meanwhile, LWPR spent less training time than others, especially KPCA. LWPR is first introduced into ICU patients monitoring and achieves the best monitoring performance including adaptability to changes in patient status, sensitivity for abnormality detection as well as its fast learning speed and low computational complexity. The algorithm is an excellent approach to establishing a personalized model for patients, which is the mainstream direction of modern medicine in the following development, as well as improving the global monitoring performance. Copyright © 2017 Elsevier Ireland Ltd. All rights reserved.

  11. Comparison of receptor models for source apportionment of the PM10 in Zaragoza (Spain).

    PubMed

    Callén, M S; de la Cruz, M T; López, J M; Navarro, M V; Mastral, A M

    2009-08-01

    Receptor models are useful to understand the chemical and physical characteristics of air pollutants by identifying their sources and by estimating contributions of each source to receptor concentrations. In this work, three receptor models based on principal component analysis with absolute principal component scores (PCA-APCS), Unmix and positive matrix factorization (PMF) were applied to study for the first time the apportionment of the airborne particulate matter less or equal than 10microm (PM10) in Zaragoza, Spain, during 1year sampling campaign (2003-2004). The PM10 samples were characterized regarding their concentrations in inorganic components: trace elements and ions and also organic components: polycyclic aromatic hydrocarbons (PAH) not only in the solid phase but also in the gas phase. A comparison of the three receptor models was carried out in order to do a more robust characterization of the PM10. The three models predicted that the major sources of PM10 in Zaragoza were related to natural sources (60%, 75% and 47%, respectively, for PCA-APCS, Unmix and PMF) although anthropogenic sources also contributed to PM10 (28%, 25% and 39%). With regard to the anthropogenic sources, while PCA and PMF allowed high discrimination in the sources identification associated with different combustion sources such as traffic and industry, fossil fuel, biomass and fuel-oil combustion, heavy traffic and evaporative emissions, the Unmix model only allowed the identification of industry and traffic emissions, evaporative emissions and heavy-duty vehicles. The three models provided good correlations between the experimental and modelled PM10 concentrations with major precision and the closest agreement between the PMF and PCA models.

  12. PCA-based approach for subtracting thermal background emission in high-contrast imaging data

    NASA Astrophysics Data System (ADS)

    Hunziker, S.; Quanz, S. P.; Amara, A.; Meyer, M. R.

    2018-03-01

    Aims.Ground-based observations at thermal infrared wavelengths suffer from large background radiation due to the sky, telescope and warm surfaces in the instrument. This significantly limits the sensitivity of ground-based observations at wavelengths longer than 3 μm. The main purpose of this work is to analyse this background emission in infrared high-contrast imaging data as illustrative of the problem, show how it can be modelled and subtracted and demonstrate that it can improve the detection of faint sources, such as exoplanets. Methods: We used principal component analysis (PCA) to model and subtract the thermal background emission in three archival high-contrast angular differential imaging datasets in the M' and L' filter. We used an M' dataset of β Pic to describe in detail how the algorithm works and explain how it can be applied. The results of the background subtraction are compared to the results from a conventional mean background subtraction scheme applied to the same dataset. Finally, both methods for background subtraction are compared by performing complete data reductions. We analysed the results from the M' dataset of HD 100546 only qualitatively. For the M' band dataset of β Pic and the L' band dataset of HD 169142, which was obtained with an angular groove phase mask vortex vector coronagraph, we also calculated and analysed the achieved signal-to-noise ratio (S/N). Results: We show that applying PCA is an effective way to remove spatially and temporarily varying thermal background emission down to close to the background limit. The procedure also proves to be very successful at reconstructing the background that is hidden behind the point spread function. In the complete data reductions, we find at least qualitative improvements for HD 100546 and HD 169142, however, we fail to find a significant increase in S/N of β Pic b. We discuss these findings and argue that in particular datasets with strongly varying observing conditions or infrequently sampled sky background will benefit from the new approach.

  13. Development of methodology for identification the nature of the polyphenolic extracts by FTIR associated with multivariate analysis

    NASA Astrophysics Data System (ADS)

    Grasel, Fábio dos Santos; Ferrão, Marco Flôres; Wolf, Carlos Rodolfo

    2016-01-01

    Tannins are polyphenolic compounds of complex structures formed by secondary metabolism in several plants. These polyphenolic compounds have different applications, such as drugs, anti-corrosion agents, flocculants, and tanning agents. This study analyses six different type of polyphenolic extracts by Fourier transform infrared spectroscopy (FTIR) combined with multivariate analysis. Through both principal component analysis (PCA) and hierarchical cluster analysis (HCA), we observed well-defined separation between condensed (quebracho and black wattle) and hydrolysable (valonea, chestnut, myrobalan, and tara) tannins. For hydrolysable tannins, it was also possible to observe the formation of two different subgroups between samples of chestnut and valonea and between samples of tara and myrobalan. Among all samples analysed, the chestnut and valonea showed the greatest similarity, indicating that these extracts contain equivalent chemical compositions and structure and, therefore, similar properties.

  14. PCA determination of the radiometric noise of high spectral resolution infrared observations from spectral residuals: Application to IASI

    NASA Astrophysics Data System (ADS)

    Serio, C.; Masiello, G.; Camy-Peyret, C.; Jacquette, E.; Vandermarcq, O.; Bermudo, F.; Coppens, D.; Tobin, D.

    2018-02-01

    The problem of characterizing and estimating the instrumental or radiometric noise of satellite high spectral resolution infrared spectrometers directly from Earth observations is addressed in this paper. An approach has been developed, which relies on the Principal Component Analysis (PCA) with a suitable criterion to select the optimal number of PC scores. Different selection criteria have been set up and analysed, which is based on the estimation theory of Least Squares and/or Maximum Likelihood Principle. The approach is independent of any forward model and/or radiative transfer calculations. The PCA is used to define an orthogonal basis, which, in turn, is used to derive an optimal linear reconstruction of the observations. The residual vector that is the observation vector minus the calculated or reconstructed one is then used to estimate the instrumental noise. It will be shown that the use of the spectral residuals to assess the radiometric instrumental noise leads to efficient estimators, which are largely independent of possible departures of the true noise from that assumed a priori to model the observational covariance matrix. Application to the Infrared Atmospheric Sounder Interferometer (IASI) has been considered. A series of case studies has been set up, which make use of IASI observations. As a major result, the analysis confirms the high stability and radiometric performance of IASI. The approach also proved to be efficient in characterizing noise features due to mechanical micro-vibrations of the beam splitter of the IASI instrument.

  15. Trichomonas vaginalis infection and risk of prostate cancer: associations by disease aggressiveness and race/ethnicity in the PLCO Trial.

    PubMed

    Marous, Miguelle; Huang, Wen-Yi; Rabkin, Charles S; Hayes, Richard B; Alderete, John F; Rosner, Bernard; Grubb, Robert L; Winter, Anke C; Sutcliffe, Siobhan

    2017-08-01

    Results from previous sero-epidemiologic studies of Trichomonas vaginalis infection and prostate cancer (PCa) support a positive association between this sexually transmitted infection and aggressive PCa. However, findings from previous studies are not entirely consistent, and only one has investigated the possible relation between T. vaginalis seropositivity and PCa in African-American men who are at highest risk of both infection and PCa. Therefore, we examined this possible relation in the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial, including separate analyses for aggressive PCa and African-American men. We included a sample of participants from a previous nested case-control study of PCa, as well as all additional Caucasian, aggressive, and African-American cases diagnosed since the previous study (total n = 438 Gleason 7 Caucasian cases, 487 more advanced Caucasian cases (≥Gleason 8 or stage III/IV), 201 African-American cases, and 1216 controls). We tested baseline sera for T. vaginalis antibodies. No associations were observed for risk of Gleason 7 (odds ratio (OR) = 0.87, 95% confidence interval (CI) 0.55-1.37) or more advanced (OR = 0.90, 95% CI 0.58-1.38) PCa in Caucasian men, or for risk of any PCa (OR = 1.06, 95% CI 0.67-1.68) in African-American men. Our findings do not support an association between T. vaginalis infection and PCa.

  16. Patient-controlled hospital admission for patients with severe mental disorders: a nationwide prospective multicentre study.

    PubMed

    Thomsen, C T; Benros, M E; Maltesen, T; Hastrup, L H; Andersen, P K; Giacco, D; Nordentoft, M

    2018-04-01

    To assess whether implementing patient-controlled admission (PCA) can reduce coercion and improve other clinical outcomes for psychiatric in-patients. During 2013-2016, 422 patients in the PCA group were propensity score matched 1:5 with a control group (n = 2110) that received treatment as usual (TAU). Patients were followed up for at least one year using the intention to treat principle utilising nationwide registers. In a paired design, the outcomes of PCA patients during the year after signing a contract were compared with the year before. No reduction in coercion (risk difference = 0.001; 95% CI: -0.038; 0.040) or self-harming behaviour (risk difference = 0.005; 95% CI: -0.008; 0.018) was observed in the PCA group compared with the TAU group. The PCA group had more in-patient bed days (mean difference = 28.4; 95% CI: 21.3; 35.5) and more medication use (P < 0.0001) than the TAU group. Before and after analyses showed reduction in coercion (P = 0.0001) and in-patient bed days (P = 0.0003). Implementing PCA did not reduce coercion, service use or self-harm behaviour when compared with TAU. Beneficial effects of PCA were observed only in the before and after PCA comparisons. Further research should investigate whether PCA affects other outcomes to better establish its clinical value. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  17. Psychometric properties of the children's sleep habits questionnaire in children with autism spectrum disorder.

    PubMed

    Johnson, Cynthia R; DeMand, Alexandra; Lecavalier, Luc; Smith, Tristram; Aman, Michael; Foldes, Emily; Scahill, Lawrence

    2016-04-01

    Sleep disturbances in autism spectrum disorder (ASD) are very common. Psychometrically sound instruments are essential to assess these disturbances. Children's Sleep Habit Questionnaire (CSHQ) is a widely used measure in ASD. The purpose of this study was to explore the psychometric properties of the CSHQ in a sample of children with ASD. Parents/caregivers of 310 children (mean age: 4.7) with ASD completed the CSHQ at study enrollment. Correlations between intelligence quotient (IQ) scores and the original CSHQ scales were calculated. Item endorsement frequencies and percentages were also calculated. A principal component analysis (PCA) was performed, and internal consistency was assessed for the newly extracted components. Correlations between IQ scores and CSHQ subscales and total scores ranged from .015 to .001 suggesting a weak, if any, association. Item endorsement frequencies were high for bedtime resistance items, but lower for parasomnia and sleep-disordered breathing items. A PCA suggested that a five-component solution best fits the data. Internal consistency of the newly extracted five components ranged α = .87-.50. Item endorsement frequencies were highest for bedtime resistance items. A PCA suggested a five-component solution. Three of the five components (Sleep Routine Problems, Insufficient Sleep, and Sleep-onset Association Problems) were types of sleep disturbances commonly reported in ASD, but the other two components (Parasomnia/Sleep-disordered Breathing and Sleep Anxiety) were less clear. Internal consistencies ranged from mediocre to good. Further development of this measure for use in children with ASD is encouraged. Copyright © 2016 Elsevier B.V. All rights reserved.

  18. Time-oriented hierarchical method for computation of principal components using subspace learning algorithm.

    PubMed

    Jankovic, Marko; Ogawa, Hidemitsu

    2004-10-01

    Principal Component Analysis (PCA) and Principal Subspace Analysis (PSA) are classic techniques in statistical data analysis, feature extraction and data compression. Given a set of multivariate measurements, PCA and PSA provide a smaller set of "basis vectors" with less redundancy, and a subspace spanned by them, respectively. Artificial neurons and neural networks have been shown to perform PSA and PCA when gradient ascent (descent) learning rules are used, which is related to the constrained maximization (minimization) of statistical objective functions. Due to their low complexity, such algorithms and their implementation in neural networks are potentially useful in cases of tracking slow changes of correlations in the input data or in updating eigenvectors with new samples. In this paper we propose PCA learning algorithm that is fully homogeneous with respect to neurons. The algorithm is obtained by modification of one of the most famous PSA learning algorithms--Subspace Learning Algorithm (SLA). Modification of the algorithm is based on Time-Oriented Hierarchical Method (TOHM). The method uses two distinct time scales. On a faster time scale PSA algorithm is responsible for the "behavior" of all output neurons. On a slower scale, output neurons will compete for fulfillment of their "own interests". On this scale, basis vectors in the principal subspace are rotated toward the principal eigenvectors. At the end of the paper it will be briefly analyzed how (or why) time-oriented hierarchical method can be used for transformation of any of the existing neural network PSA method, into PCA method.

  19. Obesity, metabolic syndrome, impaired fasting glucose, and microvascular dysfunction: a principal component analysis approach.

    PubMed

    Panazzolo, Diogo G; Sicuro, Fernando L; Clapauch, Ruth; Maranhão, Priscila A; Bouskela, Eliete; Kraemer-Aguiar, Luiz G

    2012-11-13

    We aimed to evaluate the multivariate association between functional microvascular variables and clinical-laboratorial-anthropometrical measurements. Data from 189 female subjects (34.0 ± 15.5 years, 30.5 ± 7.1 kg/m2), who were non-smokers, non-regular drug users, without a history of diabetes and/or hypertension, were analyzed by principal component analysis (PCA). PCA is a classical multivariate exploratory tool because it highlights common variation between variables allowing inferences about possible biological meaning of associations between them, without pre-establishing cause-effect relationships. In total, 15 variables were used for PCA: body mass index (BMI), waist circumference, systolic and diastolic blood pressure (BP), fasting plasma glucose, levels of total cholesterol, high-density lipoprotein cholesterol (HDL-c), low-density lipoprotein cholesterol (LDL-c), triglycerides (TG), insulin, C-reactive protein (CRP), and functional microvascular variables measured by nailfold videocapillaroscopy. Nailfold videocapillaroscopy was used for direct visualization of nutritive capillaries, assessing functional capillary density, red blood cell velocity (RBCV) at rest and peak after 1 min of arterial occlusion (RBCV(max)), and the time taken to reach RBCV(max) (TRBCV(max)). A total of 35% of subjects had metabolic syndrome, 77% were overweight/obese, and 9.5% had impaired fasting glucose. PCA was able to recognize that functional microvascular variables and clinical-laboratorial-anthropometrical measurements had a similar variation. The first five principal components explained most of the intrinsic variation of the data. For example, principal component 1 was associated with BMI, waist circumference, systolic BP, diastolic BP, insulin, TG, CRP, and TRBCV(max) varying in the same way. Principal component 1 also showed a strong association among HDL-c, RBCV, and RBCV(max), but in the opposite way. Principal component 3 was associated only with microvascular variables in the same way (functional capillary density, RBCV and RBCV(max)). Fasting plasma glucose appeared to be related to principal component 4 and did not show any association with microvascular reactivity. In non-diabetic female subjects, a multivariate scenario of associations between classic clinical variables strictly related to obesity and metabolic syndrome suggests a significant relationship between these diseases and microvascular reactivity.

  20. Leupaxin stimulates adhesion and migration of prostate cancer cells through modulation of the phosphorylation status of the actin-binding protein caldesmon

    PubMed Central

    Schmidt, Thomas; Bremmer, Felix; Burfeind, Peter; Kaulfuß, Silke

    2015-01-01

    The focal adhesion protein leupaxin (LPXN) is overexpressed in a subset of prostate cancers (PCa) and is involved in the progression of PCa. In the present study, we analyzed the LPXN-mediated adhesive and cytoskeletal changes during PCa progression. We identified an interaction between the actin-binding protein caldesmon (CaD) and LPXN and this interaction is increased during PCa cell migration. Furthermore, knockdown of LPXN did not affect CaD expression but reduced CaD phosphorylation. This is known to destabilize the affinity of CaD to F-actin, leading to dynamic cell structures that enable cell motility. Thus, downregulation of CaD increased migration and invasion of PCa cells. To identify the kinase responsible for the LPXN-mediated phosphorylation of CaD, we used data from an antibody array, which showed decreased expression of TGF-beta-activated kinase 1 (TAK1) after LPXN knockdown in PC-3 PCa cells. Subsequent analyses of the downstream kinases revealed the extracellular signal-regulated kinase (ERK) as an interaction partner of LPXN that facilitates CaD phosphorylation during LPXN-mediated PCa cell migration. In conclusion, we demonstrate that LPXN directly influences cytoskeletal dynamics via interaction with the actin-binding protein CaD and regulates CaD phosphorylation by recruiting ERK to highly dynamic structures within PCa cells. PMID:26079947

  1. Dittrichia graveolens (L.) Greuter Essential Oil: Chemical Composition, Multivariate Analysis, and Antimicrobial Activity.

    PubMed

    Mitic, Violeta; Stankov Jovanovic, Vesna; Ilic, Marija; Jovanovic, Olga; Djordjevic, Aleksandra; Stojanovic, Gordana

    2016-01-01

    The chemical composition and in vitro antimicrobial activities of Dittrichia graveolens (L.) Greuter essential oil was studied. Moreover, using agglomerative hierarchical cluster (AHC) and principal component analyses (PCA), the interrelationships of the D. graveolens essential-oil profiles characterized so far (including the sample from this study) were investigated. To evaluate the chemical composition of the essential oil, GC-FID and GC/MS analyses were performed. Altogether, 54 compounds were identified, accounting for 92.9% of the total oil composition. The D. graveolens oil belongs to the monoterpenoid chemotype, with monoterpenoids comprising 87.4% of the totally identified compounds. The major components were borneol (43.6%) and bornyl acetate (38.3%). Multivariate analysis showed that the compounds borneol and bornyl acetate exerted the greatest influence on the spatial differences in the composition of the reported oils. The antimicrobial activity against five bacterial and one fungal strain was determined using a disk-diffusion assay. The studied essential oil was active only against Gram-positive bacteria. Copyright © 2016 Verlag Helvetica Chimica Acta AG, Zürich.

  2. Multiscale 3D Shape Analysis using Spherical Wavelets

    PubMed Central

    Nain, Delphine; Haker, Steven; Bobick, Aaron; Tannenbaum, Allen

    2013-01-01

    Shape priors attempt to represent biological variations within a population. When variations are global, Principal Component Analysis (PCA) can be used to learn major modes of variation, even from a limited training set. However, when significant local variations exist, PCA typically cannot represent such variations from a small training set. To address this issue, we present a novel algorithm that learns shape variations from data at multiple scales and locations using spherical wavelets and spectral graph partitioning. Our results show that when the training set is small, our algorithm significantly improves the approximation of shapes in a testing set over PCA, which tends to oversmooth data. PMID:16685992

  3. Multiscale 3D shape analysis using spherical wavelets.

    PubMed

    Nain, Delphine; Haker, Steven; Bobick, Aaron; Tannenbaum, Allen R

    2005-01-01

    Shape priors attempt to represent biological variations within a population. When variations are global, Principal Component Analysis (PCA) can be used to learn major modes of variation, even from a limited training set. However, when significant local variations exist, PCA typically cannot represent such variations from a small training set. To address this issue, we present a novel algorithm that learns shape variations from data at multiple scales and locations using spherical wavelets and spectral graph partitioning. Our results show that when the training set is small, our algorithm significantly improves the approximation of shapes in a testing set over PCA, which tends to oversmooth data.

  4. Heritable patterns of tooth decay in the permanent dentition: principal components and factor analyses.

    PubMed

    Shaffer, John R; Feingold, Eleanor; Wang, Xiaojing; Tcuenco, Karen T; Weeks, Daniel E; DeSensi, Rebecca S; Polk, Deborah E; Wendell, Steve; Weyant, Robert J; Crout, Richard; McNeil, Daniel W; Marazita, Mary L

    2012-03-09

    Dental caries is the result of a complex interplay among environmental, behavioral, and genetic factors, with distinct patterns of decay likely due to specific etiologies. Therefore, global measures of decay, such as the DMFS index, may not be optimal for identifying risk factors that manifest as specific decay patterns, especially if the risk factors such as genetic susceptibility loci have small individual effects. We used two methods to extract patterns of decay from surface-level caries data in order to generate novel phenotypes with which to explore the genetic regulation of caries. The 128 tooth surfaces of the permanent dentition were scored as carious or not by intra-oral examination for 1,068 participants aged 18 to 75 years from 664 biological families. Principal components analysis (PCA) and factor analysis (FA), two methods of identifying underlying patterns without a priori surface classifications, were applied to our data. The three strongest caries patterns identified by PCA recaptured variation represented by DMFS index (correlation, r = 0.97), pit and fissure surface caries (r = 0.95), and smooth surface caries (r = 0.89). However, together, these three patterns explained only 37% of the variability in the data, indicating that a priori caries measures are insufficient for fully quantifying caries variation. In comparison, the first pattern identified by FA was strongly correlated with pit and fissure surface caries (r = 0.81), but other identified patterns, including a second pattern representing caries of the maxillary incisors, were not representative of any previously defined caries indices. Some patterns identified by PCA and FA were heritable (h(2) = 30-65%, p = 0.043-0.006), whereas other patterns were not, indicating both genetic and non-genetic etiologies of individual decay patterns. This study demonstrates the use of decay patterns as novel phenotypes to assist in understanding the multifactorial nature of dental caries.

  5. Simultaneous qualitative and quantitative evaluation of Ilex kudingcha C. J. tseng by using UPLC and UHPLC-qTOF-MS/MS.

    PubMed

    Zhou, Jie; Yi, Huan; Zhao, Zhong-Xiang; Shang, Xue-Ying; Zhu, Ming-Juan; Kuang, Guo-Jun; Zhu, Chen-Chen; Zhang, Lei

    2018-06-05

    In this study, a systematic method was established for the holistic quality control of Ilex kudingcha C. J. Tseng, a popular functional drink for adjuvant treatment of diabetes, hypertension, obesity and hyperlipidemia. Both qualitative and quantitative analyses were conducted. For qualitative analysis, an ultra high performance liquid chromatography (UHPLC) coupled with an electrospray ionization quadrupole time-of-flight mass spectrometry (ESI-qTOF-MS) method was established for rapid separation and structural identification of the constituents in Ilex kudingcha. Samples were separated on an ACQUITY UPLC HSS T3C 18 column (2.1 mm × 100 mm, 1.8 μm) by gradient elution using 0.1% (v/v) formic acid (solvent A) and acetonitrile (solvent B) as mobile phases at a flow rate of 0.25 mL min -1 . The chromatographic profiling of Ilex kudingcha by UHPLC-qTOF-MS/MS resulted in the characterization of 53 compounds, comprising 18 compounds that were unambiguously identified by comparison with reference standards. For quantitative analysis, 18 major compounds from 15 batches of Ilex kudingcha samples were simultaneously detected by UPLC-DAD at wavelengths of 210 nm, 260 nm, and 326 nm. The method was validated with respect to precision, linearity, repeatability, stability, accuracy, and so on. The contents of the 18 target compounds were applied for hierarchical clustering analysis (HCA) and principal component analysis (PCA) to differentiate between the samples. The results of HCA and PCA were consistent with each other. Sample No. 1 differed significantly based on HCA and PCA, and the differentiating components were confirmed to originate from different batches of samples. Phenolic acids and triterpenes were found to be the main ingredients in Ilex kudingcha. This strategy was effective and straightforward, and provided a potential approach for holistic quality control of Ilex kudingcha. Copyright © 2018. Published by Elsevier B.V.

  6. Principal component-based weighted indices and a framework to evaluate indices: Results from the Medical Expenditure Panel Survey 1996 to 2011

    PubMed Central

    Wu, Chao-Jung

    2017-01-01

    Producing indices composed of multiple input variables has been embedded in some data processing and analytical methods. We aim to test the feasibility of creating data-driven indices by aggregating input variables according to principal component analysis (PCA) loadings. To validate the significance of both the theory-based and data-driven indices, we propose principles to review innovative indices. We generated weighted indices with the variables obtained in the first years of the two-year panels in the Medical Expenditure Panel Survey initiated between 1996 and 2011. Variables were weighted according to PCA loadings and summed. The statistical significance and residual deviance of each index to predict mortality in the second years was extracted from the results of discrete-time survival analyses. There were 237,832 surviving the first years of panels, represented 4.5 billion civilians in the United States, of which 0.62% (95% CI = 0.58% to 0.66%) died in the second years of the panels. Of all 134,689 weighted indices, there were 40,803 significantly predicting mortality in the second years with or without the adjustment of age, sex and races. The significant indices in the both models could at most lead to 10,200 years of academic tenure for individual researchers publishing four indices per year or 618.2 years of publishing for journals with annual volume of 66 articles. In conclusion, if aggregating information based on PCA loadings, there can be a large number of significant innovative indices composing input variables of various predictive powers. To justify the large quantities of innovative indices, we propose a reporting and review framework for novel indices based on the objectives to create indices, variable weighting, related outcomes and database characteristics. The indices selected by this framework could lead to a new genre of publications focusing on meaningful aggregation of information. PMID:28886057

  7. The Phenazine 2-Hydroxy-Phenazine-1-Carboxylic Acid Promotes Extracellular DNA Release and Has Broad Transcriptomic Consequences in Pseudomonas chlororaphis 30–84

    DOE PAGES

    Wang, Dongping; Yu, Jun Myoung; Dorosky, Robert J.; ...

    2016-01-26

    Enhanced production of 2-hydroxy-phenazine-1-carboxylic acid (2-OH-PCA) by the biological control strain Pseudomonas chlororaphis 30–84 derivative 30-84O* was shown previously to promote cell adhesion and alter the three-dimensional structure of surfaceattached biofilms compared to the wild type. The current study demonstrates that production of 2-OH-PCA promotes the release of extracellular DNA, which is correlated with the production of structured biofilm matrix. Moreover, the essential role of the extracellular DNA in maintaining the mass and structure of the 30–84 biofilm matrix is demonstrated. To better understand the role of different phenazines in biofilm matrix production and gene expression, transcriptomic analyses were conductedmore » comparing gene expression patterns of populations of wild type, 30-84O* and a derivative of 30–84 producing only PCA (30-84PCA) to a phenazine defective mutant (30-84ZN) when grown in static cultures. RNA-Seq analyses identified a group of 802 genes that were differentially expressed by the phenazine producing derivatives compared to 30-84ZN, including 240 genes shared by the two 2-OH-PCA producing derivatives, the wild type and 30-84O*. A gene cluster encoding a bacteriophage- derived pyocin and its lysis cassette was upregulated in 2-OH-PCA producing derivatives. A holin encoded in this gene cluster was found to contribute to the release of eDNA in 30–84 biofilm matrices, demonstrating that the influence of 2-OH-PCA on eDNA production is due in part to cell autolysis as a result of pyocin production and release. The results expand the current understanding of the functions different phenazines play in the survival of bacteria in biofilm-forming communities.« less

  8. The Phenazine 2-Hydroxy-Phenazine-1-Carboxylic Acid Promotes Extracellular DNA Release and Has Broad Transcriptomic Consequences in Pseudomonas chlororaphis 30–84

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Dongping; Yu, Jun Myoung; Dorosky, Robert J.

    Enhanced production of 2-hydroxy-phenazine-1-carboxylic acid (2-OH-PCA) by the biological control strain Pseudomonas chlororaphis 30–84 derivative 30-84O* was shown previously to promote cell adhesion and alter the three-dimensional structure of surfaceattached biofilms compared to the wild type. The current study demonstrates that production of 2-OH-PCA promotes the release of extracellular DNA, which is correlated with the production of structured biofilm matrix. Moreover, the essential role of the extracellular DNA in maintaining the mass and structure of the 30–84 biofilm matrix is demonstrated. To better understand the role of different phenazines in biofilm matrix production and gene expression, transcriptomic analyses were conductedmore » comparing gene expression patterns of populations of wild type, 30-84O* and a derivative of 30–84 producing only PCA (30-84PCA) to a phenazine defective mutant (30-84ZN) when grown in static cultures. RNA-Seq analyses identified a group of 802 genes that were differentially expressed by the phenazine producing derivatives compared to 30-84ZN, including 240 genes shared by the two 2-OH-PCA producing derivatives, the wild type and 30-84O*. A gene cluster encoding a bacteriophage- derived pyocin and its lysis cassette was upregulated in 2-OH-PCA producing derivatives. A holin encoded in this gene cluster was found to contribute to the release of eDNA in 30–84 biofilm matrices, demonstrating that the influence of 2-OH-PCA on eDNA production is due in part to cell autolysis as a result of pyocin production and release. The results expand the current understanding of the functions different phenazines play in the survival of bacteria in biofilm-forming communities.« less

  9. The Phenazine 2-Hydroxy-Phenazine-1-Carboxylic Acid Promotes Extracellular DNA Release and Has Broad Transcriptomic Consequences in Pseudomonas chlororaphis 30–84

    PubMed Central

    Wang, Dongping; Yu, Jun Myoung; Dorosky, Robert J.; Pierson, Leland S.; Pierson, Elizabeth A.

    2016-01-01

    Enhanced production of 2-hydroxy-phenazine-1-carboxylic acid (2-OH-PCA) by the biological control strain Pseudomonas chlororaphis 30–84 derivative 30-84O* was shown previously to promote cell adhesion and alter the three-dimensional structure of surface-attached biofilms compared to the wild type. The current study demonstrates that production of 2-OH-PCA promotes the release of extracellular DNA, which is correlated with the production of structured biofilm matrix. Moreover, the essential role of the extracellular DNA in maintaining the mass and structure of the 30–84 biofilm matrix is demonstrated. To better understand the role of different phenazines in biofilm matrix production and gene expression, transcriptomic analyses were conducted comparing gene expression patterns of populations of wild type, 30-84O* and a derivative of 30–84 producing only PCA (30-84PCA) to a phenazine defective mutant (30-84ZN) when grown in static cultures. RNA-Seq analyses identified a group of 802 genes that were differentially expressed by the phenazine producing derivatives compared to 30-84ZN, including 240 genes shared by the two 2-OH-PCA producing derivatives, the wild type and 30-84O*. A gene cluster encoding a bacteriophage-derived pyocin and its lysis cassette was upregulated in 2-OH-PCA producing derivatives. A holin encoded in this gene cluster was found to contribute to the release of eDNA in 30–84 biofilm matrices, demonstrating that the influence of 2-OH-PCA on eDNA production is due in part to cell autolysis as a result of pyocin production and release. The results expand the current understanding of the functions different phenazines play in the survival of bacteria in biofilm-forming communities. PMID:26812402

  10. A comparison of the usefulness of canonical analysis, principal components analysis, and band selection for extraction of features from TMS data for landcover analysis

    NASA Technical Reports Server (NTRS)

    Boyd, R. K.; Brumfield, J. O.; Campbell, W. J.

    1984-01-01

    Three feature extraction methods, canonical analysis (CA), principal component analysis (PCA), and band selection, have been applied to Thematic Mapper Simulator (TMS) data in order to evaluate the relative performance of the methods. The results obtained show that CA is capable of providing a transformation of TMS data which leads to better classification results than provided by all seven bands, by PCA, or by band selection. A second conclusion drawn from the study is that TMS bands 2, 3, 4, and 7 (thermal) are most important for landcover classification.

  11. Principal component analysis on a torus: Theory and application to protein dynamics.

    PubMed

    Sittel, Florian; Filk, Thomas; Stock, Gerhard

    2017-12-28

    A dimensionality reduction method for high-dimensional circular data is developed, which is based on a principal component analysis (PCA) of data points on a torus. Adopting a geometrical view of PCA, various distance measures on a torus are introduced and the associated problem of projecting data onto the principal subspaces is discussed. The main idea is that the (periodicity-induced) projection error can be minimized by transforming the data such that the maximal gap of the sampling is shifted to the periodic boundary. In a second step, the covariance matrix and its eigendecomposition can be computed in a standard manner. Adopting molecular dynamics simulations of two well-established biomolecular systems (Aib 9 and villin headpiece), the potential of the method to analyze the dynamics of backbone dihedral angles is demonstrated. The new approach allows for a robust and well-defined construction of metastable states and provides low-dimensional reaction coordinates that accurately describe the free energy landscape. Moreover, it offers a direct interpretation of covariances and principal components in terms of the angular variables. Apart from its application to PCA, the method of maximal gap shifting is general and can be applied to any other dimensionality reduction method for circular data.

  12. [Determination of the Plant Origin of Licorice Oil Extract, a Natural Food Additive, by Principal Component Analysis Based on Chemical Components].

    PubMed

    Tada, Atsuko; Ishizuki, Kyoko; Sugimoto, Naoki; Yoshimatsu, Kayo; Kawahara, Nobuo; Suematsu, Takako; Arifuku, Kazunori; Fukai, Toshio; Tamura, Yukiyoshi; Ohtsuki, Takashi; Tahara, Maiko; Yamazaki, Takeshi; Akiyama, Hiroshi

    2015-01-01

    "Licorice oil extract" (LOE) (antioxidant agent) is described in the notice of Japanese food additive regulations as a material obtained from the roots and/or rhizomes of Glycyrrhiza uralensis, G. inflata or G. glabra. In this study, we aimed to identify the original Glycyrrhiza species of eight food additive products using LC/MS. Glabridin, a characteristic compound in G. glabra, was specifically detected in seven products, and licochalcone A, a characteristic compound in G. inflata, was detected in one product. In addition, Principal Component Analysis (PCA) (a kind of multivariate analysis) using the data of LC/MS or (1)H-NMR analysis was performed. The data of thirty-one samples, including LOE products used as food additives, ethanol extracts of various Glycyrrhiza species and commercially available Glycyrrhiza species-derived products were assessed. Based on the PCA results, the majority of LOE products was confirmed to be derived from G. glabra. This study suggests that PCA using (1)H-NMR analysis data is a simple and useful method to identify the plant species of origin of natural food additive products.

  13. In Situ Aerosol Profile Measurements and Comparisons with SAGE 3 Aerosol Extinction and Surface Area Profiles at 68 deg North

    NASA Technical Reports Server (NTRS)

    2005-01-01

    Under funding from this proposal three in situ profile measurements of stratospheric sulfate aerosol and ozone were completed from balloon-borne platforms. The measured quantities are aerosol size resolved number concentration and ozone. The one derived product is aerosol size distribution, from which aerosol moments, such as surface area, volume, and extinction can be calculated for comparison with SAGE III measurements and SAGE III derived products, such as surface area. The analysis of these profiles and comparison with SAGE III extinction measurements and SAGE III derived surface areas are provided in Yongxiao (2005), which comprised the research thesis component of Mr. Jian Yongxiao's M.S. degree in Atmospheric Science at the University of Wyoming. In addition analysis continues on using principal component analysis (PCA) to derive aerosol surface area from the 9 wavelength extinction measurements available from SAGE III. Ths paper will present PCA components to calculate surface area from SAGE III measurements and compare these derived surface areas with those available directly from in situ size distribution measurements, as well as surface areas which would be derived from PCA and Thomason's algorithm applied to the four wavelength SAGE II extinction measurements.

  14. Principal component analysis on a torus: Theory and application to protein dynamics

    NASA Astrophysics Data System (ADS)

    Sittel, Florian; Filk, Thomas; Stock, Gerhard

    2017-12-01

    A dimensionality reduction method for high-dimensional circular data is developed, which is based on a principal component analysis (PCA) of data points on a torus. Adopting a geometrical view of PCA, various distance measures on a torus are introduced and the associated problem of projecting data onto the principal subspaces is discussed. The main idea is that the (periodicity-induced) projection error can be minimized by transforming the data such that the maximal gap of the sampling is shifted to the periodic boundary. In a second step, the covariance matrix and its eigendecomposition can be computed in a standard manner. Adopting molecular dynamics simulations of two well-established biomolecular systems (Aib9 and villin headpiece), the potential of the method to analyze the dynamics of backbone dihedral angles is demonstrated. The new approach allows for a robust and well-defined construction of metastable states and provides low-dimensional reaction coordinates that accurately describe the free energy landscape. Moreover, it offers a direct interpretation of covariances and principal components in terms of the angular variables. Apart from its application to PCA, the method of maximal gap shifting is general and can be applied to any other dimensionality reduction method for circular data.

  15. Application of principal component analysis in the pollution assessment with heavy metals of vegetable food chain in the old mining areas

    PubMed Central

    2012-01-01

    Background The aim of the paper is to assess by the principal components analysis (PCA) the heavy metal contamination of soil and vegetables widely used as food for people who live in areas contaminated by heavy metals (HMs) due to long-lasting mining activities. This chemometric technique allowed us to select the best model for determining the risk of HMs on the food chain as well as on people's health. Results Many PCA models were computed with different variables: heavy metals contents and some agro-chemical parameters which characterize the soil samples from contaminated and uncontaminated areas, HMs contents of different types of vegetables grown and consumed in these areas, and the complex parameter target hazard quotients (THQ). Results were discussed in terms of principal component analysis. Conclusion There were two major benefits in processing the data PCA: firstly, it helped in optimizing the number and type of data that are best in rendering the HMs contamination of the soil and vegetables. Secondly, it was valuable for selecting the vegetable species which present the highest/minimum risk of a negative impact on the food chain and human health. PMID:23234365

  16. Evaluating interannual vegetation anomalies in the Basilicata region using satellite spot vegetation 1999-2011 time series: preliminary results from the Mitra project

    NASA Astrophysics Data System (ADS)

    Lasaponara, Rosa; Desantis, Fortunato; Aromando, Angelo; Lanorte, Antonio

    2013-04-01

    The Basilicata region funded a fesr project, MITRA to develop reliable low cost technologies to preserve and enhance natural and cultural heritage in some relevant areas selected as test cases. " Cultural heritage and the natural heritage are increasingly threatened with destruction not only by the traditional causes of decay, but also by changing social and economic conditions which aggravate the situation with even more formidable phenomena of damage or destruction, from THE GENERAL CONFERENCE of the United Nations Educational, Scientific and Cultural Organization meeting in Paris from 17 October to 21 November 1972, at its seventeenth session, available on line " (http://whc.unesco.org/en/conventiontext/). This paper is focused on the preliminary results obtained in the framework of the Mitra project. In particular, a temporal series (1999-2011) of the yearly Maximum Value Composit of SPOT/VEGETATION NDVI was used to carried out investigation on the whole Basilicata region. The PCA was used as a first step of data transform to enhance regions of localized change in multi-temporal data sets (Lasaponara 2006). Results from PCA were further processed using Support Vector machine (SVM) to identify and map land degradation phenomenon Both naturally vegetated areas (forest, shrub-land, herbaceous cover) and agricultural lands have been investigated in order to extract the most prominent natural and/or man induced alterations affecting vegetation behavior. Such analyses can provide valuable information for monitoring the status of vegetation which is an indicator of the degree of stress namely any disturbance that adversely influences plants in response to natural hazards and/or anthropogenic activities. Our findings suggest that the jointly use of PCA and SVM PCA can provide valuable information for environmental management policies involving biodiversity preservation and rational exploitation of natural and agricultural resources. Rosa Lasaponara 2006, On the use of principal component analysis (PCA) for evaluating interannual vegetation anomalies from SPOT/VEGETATION NDVI temporal series. Ecological Modelling 1 9 4 ( 2 0 0 6 ) 429-434

  17. Decision analysis for a data collection system of patient-controlled analgesia with a multi-attribute utility model.

    PubMed

    Lee, I-Jung; Huang, Shih-Yu; Tsou, Mei-Yung; Chan, Kwok-Hon; Chang, Kuang-Yi

    2010-10-01

    Data collection systems are very important for the practice of patient-controlled analgesia (PCA). This study aimed to evaluate 3 PCA data collection systems and selected the most favorable system with the aid of multiattribute utility (MAU) theory. We developed a questionnaire with 10 items to evaluate the PCA data collection system and 1 item for overall satisfaction based on MAU theory. Three systems were compared in the questionnaire, including a paper record, optic card reader and personal digital assistant (PDA). A pilot study demonstrated a good internal and test-retest reliability of the questionnaire. A weighted utility score combining the relative importance of individual items assigned by each participant and their responses to each question was calculated for each system. Sensitivity analyses with distinct weighting protocols were conducted to evaluate the stability of the final results. Thirty potential users of a PCA data collection system were recruited in the study. The item "easy to use" had the highest median rank and received the heaviest mean weight among all items. MAU analysis showed that the PDA system had a higher utility score than that in the other 2 systems. Sensitivity analyses revealed that both inverse and reciprocal weighting processes favored the PDA system. High correlations between overall satisfaction and MAU scores from miscellaneous weighting protocols suggested a good predictive validity of our MAU-based questionnaire. The PDA system was selected as the most favorable PCA data collection system by the MAU analysis. The item "easy to use" was the most important attribute of the PCA data collection system. MAU theory can evaluate alternatives by taking into account individual preferences of stakeholders and aid in better decision-making. Copyright © 2010 Elsevier. Published by Elsevier B.V. All rights reserved.

  18. The evolution of analgesia in an 'accelerated' recovery programme for resectional laparoscopic colorectal surgery with anastomosis.

    PubMed

    Zafar, N; Davies, R; Greenslade, G L; Dixon, A R

    2010-02-01

    The study set out to analyse the outcomes of an evolving accelerated recovery programme after laparoscopic colorectal resection (LCR). The results of a prospective electronic database (March 2000 - April 2008) were analysed. There were 353 consecutive patients undergoing 'three port' high anterior resection (AR) (237 without covering stoma) and 166 a right hemicolectomy (RHC). One hundred thirty-eight had postoperative analgesia using paracetamol IV and oral analgesia (IVP); 27 (16.3%) received additional parenteral morphine and were excluded. Patient controlled morphine analgesia (PCA) was used in 138. Transversus abdominis plane (TAP) blocks, supplemented by IV paracetamol and oral analgesia were used in the last 50 patients. The time to the resumption of diet was significantly reduced with TAP analgesia (median 12 h) and IVP (median 12 h) compared with PCA median (36 h) (chi(2) = 143; 4df: P < 0.001). The postoperative hospital stay was significantly reduced with TAP analgesia (median 2 days) and IVP (median 3 days) compared with PCA (median 5 days); chi(2) = 73; 2df: P < 0.001. Seventeen (34%) TAP and nine (6.5%) IVP patients were discharged within 24 h of surgery compared with no patient in the PCA group. Ninety-three per cent of PCA, 35% IVP and 10% TAP patients were discharged in more than 3 days. The movement towards 'accelerated recovery' was not associated with any increased risk of urinary retention, return to theatre, readmission and/or 30 day mortality. Laparoscopic surgery utilizing IV paracetamol and TAP blocks for postoperative analgesia aids safe effective 'accelerated recovery' in an unselected patient population undergoing right hemicolectomy and high anterior resection. Routine epidural anaesthesia is unnecessary for LCR. Morphine PCA is associated with delayed recovery.

  19. Does Para-chloroaniline Really Form after Mixing Sodium Hypochlorite and Chlorhexidine?

    PubMed

    Orhan, Ekim Onur; Irmak, Özgür; Hür, Deniz; Yaman, Batu Can; Karabucak, Bekir

    2016-03-01

    Mixing sodium hypochlorite (NaOCl) with chlorhexidine (CHX) forms a brown-colored precipitate. Previous studies are not in agreement whether this precipitate contains para-chloroaniline (PCA). Tests used for analysis may demonstrate different outcomes. Purpose of this study was to determine whether PCA is formed through the reaction of mixing NaOCl and CHX by using high performance liquid chromatography, proton nuclear magnetic resonance spectroscopy, gas chromatography, thin layer chromatography, infrared spectroscopy, and gas chromatography/mass spectrometry. To obtain a brown precipitate, 4.99% NaOCl was mixed with 2.0% CHX. This brown precipitate was analyzed and compared with signals obtained from commercially available 4.99% NaOCl, 2% solutions, and 98% PCA in powder form. Chromatographic and spectroscopic analyses showed that brown precipitate does not contain free PCA. This study will be a cutoff proof for the argument on PCA formation from reaction of CHX and NaOCl. Copyright © 2016 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.

  20. Prostate Cancer Cell–Stromal Cell Cross-Talk via FGFR1 Mediates Antitumor Activity of Dovitinib in Bone Metastases

    PubMed Central

    Wan, Xinhai; Corn, Paul G.; Yang, Jun; Palanisamy, Nallasivam; Starbuck, Michael W.; Efstathiou, Eleni; Li-Ning Tapia, Elsa M.; Zurita, Amado J.; Aparicio, Ana; Ravoori, Murali K.; Vazquez, Elba S.; Robinson, Dan R.; Wu, Yi-Mi; Cao, Xuhong; Iyer, Matthew K.; McKeehan, Wallace; Kundra, Vikas; Wang, Fen; Troncoso, Patricia; Chinnaiyan, Arul M.; Logothetis, Christopher J.; Navone, Nora M.

    2015-01-01

    Bone is the most common site of prostate cancer (PCa) progression to a therapy-resistant, lethal phenotype. We found that blockade of fibroblast growth factor receptors (FGFRs) with the receptor tyrosine kinase inhibitor dovitinib has clinical activity in a subset of men with castration-resistant PCa and bone metastases. Our integrated analyses suggest that FGF signaling mediates a positive feedback loop between PCa cells and bone cells and that blockade of FGFR1 in osteoblasts partially mediates the antitumor activity of dovitinib by improving bone quality and by blocking PCa cell–bone cell interaction. These findings account for clinical observations such as reductions in lesion size and intensity on bone scans, lymph node size, and tumor-specific symptoms without proportional declines in prostate-specific antigen concentration. Our findings suggest that targeting FGFR has therapeutic activity in advanced PCa and provide direction for the development of therapies with FGFR inhibitors. PMID:25186177

  1. Prostate cancer cell-stromal cell crosstalk via FGFR1 mediates antitumor activity of dovitinib in bone metastases.

    PubMed

    Wan, Xinhai; Corn, Paul G; Yang, Jun; Palanisamy, Nallasivam; Starbuck, Michael W; Efstathiou, Eleni; Li Ning Tapia, Elsa M; Tapia, Elsa M Li-Ning; Zurita, Amado J; Aparicio, Ana; Ravoori, Murali K; Vazquez, Elba S; Robinson, Dan R; Wu, Yi-Mi; Cao, Xuhong; Iyer, Matthew K; McKeehan, Wallace; Kundra, Vikas; Wang, Fen; Troncoso, Patricia; Chinnaiyan, Arul M; Logothetis, Christopher J; Navone, Nora M

    2014-09-03

    Bone is the most common site of prostate cancer (PCa) progression to a therapy-resistant, lethal phenotype. We found that blockade of fibroblast growth factor receptors (FGFRs) with the receptor tyrosine kinase inhibitor dovitinib has clinical activity in a subset of men with castration-resistant PCa and bone metastases. Our integrated analyses suggest that FGF signaling mediates a positive feedback loop between PCa cells and bone cells and that blockade of FGFR1 in osteoblasts partially mediates the antitumor activity of dovitinib by improving bone quality and by blocking PCa cell-bone cell interaction. These findings account for clinical observations such as reductions in lesion size and intensity on bone scans, lymph node size, and tumor-specific symptoms without proportional declines in serum prostate-specific antigen concentration. Our findings suggest that targeting FGFR has therapeutic activity in advanced PCa and provide direction for the development of therapies with FGFR inhibitors. Copyright © 2014, American Association for the Advancement of Science.

  2. Proton Nuclear Magnetic Resonance-Spectroscopic Discrimination of Wines Reflects Genetic Homology of Several Different Grape (V. vinifera L.) Cultivars

    PubMed Central

    Zhu, Yong; Wen, Wen; Zhang, Fengmin; Hardie, Jim W.

    2015-01-01

    Background and Aims Proton nuclear magnetic resonance spectroscopy coupled multivariate analysis (1H NMR-PCA/PLS-DA) is an important tool for the discrimination of wine products. Although 1H NMR has been shown to discriminate wines of different cultivars, a grape genetic component of the discrimination has been inferred only from discrimination of cultivars of undefined genetic homology and in the presence of many confounding environmental factors. We aimed to confirm the influence of grape genotypes in the absence of those factors. Methods and Results We applied 1H NMR-PCA/PLS-DA and hierarchical cluster analysis (HCA) to wines from five, variously genetically-related grapevine (V. vinifera) cultivars; all grown similarly on the same site and vinified similarly. We also compared the semi-quantitative profiles of the discriminant metabolites of each cultivar with previously reported chemical analyses. The cultivars were clearly distinguishable and there was a general correlation between their grouping and their genetic homology as revealed by recent genomic studies. Between cultivars, the relative amounts of several of the cultivar-related discriminant metabolites conformed closely with reported chemical analyses. Conclusions Differences in grape-derived metabolites associated with genetic differences alone are a major source of 1H NMR-based discrimination of wines and 1H NMR has the capacity to discriminate between very closely related cultivars. Significance of the Study The study confirms that genetic variation among grape cultivars alone can account for the discrimination of wine by 1H NMR-PCA/PLS and indicates that 1H NMR spectra of wine of single grape cultivars may in future be used in tandem with hierarchical cluster analysis to elucidate genetic lineages and metabolomic relations of grapevine cultivars. In the absence of genetic information, for example, where predecessor varieties are no longer extant, this may be a particularly useful approach. PMID:26658757

  3. Multimethod prediction of physical parent-child aggression risk in expectant mothers and fathers with Social Information Processing theory.

    PubMed

    Rodriguez, Christina M; Smith, Tamika L; Silvia, Paul J

    2016-01-01

    The Social Information Processing (SIP) model postulates that parents undergo a series of stages in implementing physical discipline that can escalate into physical child abuse. The current study utilized a multimethod approach to investigate whether SIP factors can predict risk of parent-child aggression (PCA) in a diverse sample of expectant mothers and fathers. SIP factors of PCA attitudes, negative child attributions, reactivity, and empathy were considered as potential predictors of PCA risk; additionally, analyses considered whether personal history of PCA predicted participants' own PCA risk through its influence on their attitudes and attributions. Findings indicate that, for both mothers and fathers, history influenced attitudes but not attributions in predicting PCA risk, and attitudes and attributions predicted PCA risk; empathy and reactivity predicted negative child attributions for expectant mothers, but only reactivity significantly predicted attributions for expectant fathers. Path models for expectant mothers and fathers were remarkably similar. Overall, the findings provide support for major aspects of the SIP model. Continued work is needed in studying the progression of these factors across time for both mothers and fathers as well as the inclusion of other relevant ecological factors to the SIP model. Copyright © 2015 Elsevier Ltd. All rights reserved.

  4. Fourier Transform Infrared Spectroscopy (FTIR) and Multivariate Analysis for Identification of Different Vegetable Oils Used in Biodiesel Production

    PubMed Central

    Mueller, Daniela; Ferrão, Marco Flôres; Marder, Luciano; da Costa, Adilson Ben; de Cássia de Souza Schneider, Rosana

    2013-01-01

    The main objective of this study was to use infrared spectroscopy to identify vegetable oils used as raw material for biodiesel production and apply multivariate analysis to the data. Six different vegetable oil sources—canola, cotton, corn, palm, sunflower and soybeans—were used to produce biodiesel batches. The spectra were acquired by Fourier transform infrared spectroscopy using a universal attenuated total reflectance sensor (FTIR-UATR). For the multivariate analysis principal component analysis (PCA), hierarchical cluster analysis (HCA), interval principal component analysis (iPCA) and soft independent modeling of class analogy (SIMCA) were used. The results indicate that is possible to develop a methodology to identify vegetable oils used as raw material in the production of biodiesel by FTIR-UATR applying multivariate analysis. It was also observed that the iPCA found the best spectral range for separation of biodiesel batches using FTIR-UATR data, and with this result, the SIMCA method classified 100% of the soybean biodiesel samples. PMID:23539030

  5. Collective Dynamics of Periplasmic Glutamine Binding Protein upon Domain Closure

    PubMed Central

    Loeffler, Hannes H.; Kitao, Akio

    2009-01-01

    The glutamine binding protein is a vital component of the associated ATP binding cassette transport systems responsible for the uptake of glutamine into the cell. We have investigated the global movements of this protein by molecular dynamics simulations and principal component analysis (PCA). We confirm that the most dominant mode corresponds to the biological function of the protein, i.e., a hinge-type motion upon ligand binding. The closure itself was directly observed from two independent trajectories whereby PCA was used to elucidate the nature of this closing reaction. Two intermediary states are identified and described in detail. The ligand binding induces the structural change of the hinge regions from a discontinuous β-sheet to a continuous one, which also enhances softness of the hinge and modifies the direction of hinge motion to enable closing. We also investigated the convergence behavior of PCA modes, which were found to converge rather quickly when the associated magnitudes of the eigenvalues are well separated. PMID:19883597

  6. Prediction With Dimension Reduction of Multiple Molecular Data Sources for Patient Survival.

    PubMed

    Kaplan, Adam; Lock, Eric F

    2017-01-01

    Predictive modeling from high-dimensional genomic data is often preceded by a dimension reduction step, such as principal component analysis (PCA). However, the application of PCA is not straightforward for multisource data, wherein multiple sources of 'omics data measure different but related biological components. In this article, we use recent advances in the dimension reduction of multisource data for predictive modeling. In particular, we apply exploratory results from Joint and Individual Variation Explained (JIVE), an extension of PCA for multisource data, for prediction of differing response types. We conduct illustrative simulations to illustrate the practical advantages and interpretability of our approach. As an application example, we consider predicting survival for patients with glioblastoma multiforme from 3 data sources measuring messenger RNA expression, microRNA expression, and DNA methylation. We also introduce a method to estimate JIVE scores for new samples that were not used in the initial dimension reduction and study its theoretical properties; this method is implemented in the R package R.JIVE on CRAN, in the function jive.predict.

  7. Principal component analysis of TOF-SIMS spectra, images and depth profiles: an industrial perspective

    NASA Astrophysics Data System (ADS)

    Pacholski, Michaeleen L.

    2004-06-01

    Principal component analysis (PCA) has been successfully applied to time-of-flight secondary ion mass spectrometry (TOF-SIMS) spectra, images and depth profiles. Although SIMS spectral data sets can be small (in comparison to datasets typically discussed in literature from other analytical techniques such as gas or liquid chromatography), each spectrum has thousands of ions resulting in what can be a difficult comparison of samples. Analysis of industrially-derived samples means the identity of most surface species are unknown a priori and samples must be analyzed rapidly to satisfy customer demands. PCA enables rapid assessment of spectral differences (or lack there of) between samples and identification of chemically different areas on sample surfaces for images. Depth profile analysis helps define interfaces and identify low-level components in the system.

  8. Observation of Nonthermal Emission from the Supernova Remnant IC443 with RXTE

    NASA Technical Reports Server (NTRS)

    Sturner, S. J.; Keohane, J. W.; Reimer, O.

    2002-01-01

    In this paper we present analysis of X-ray spectra from the supernova remnant IC443 obtained using the PCA on RXTE. The spectra in the 3 - 20 keV band are well fit by a two-component model consisting of thermal and nonthermal components. We compare these results with recent results of other X-ray missions and discuss the need for a cut-off in the nonthermal spectrum. Recent Chandra and XMM-Newton observations suggest that much of the nonthermal emission from IC443 can be attributed to a pulsar wind nebula. We present the results of our search for periodic emission in the RXTE PCA data. We then discuss the origin o f the nonthermal component and its possible association with the unidentified EGRET source.

  9. Principal Components Analysis of a JWST NIRSpec Detector Subsystem

    NASA Technical Reports Server (NTRS)

    Arendt, Richard G.; Fixsen, D. J.; Greenhouse, Matthew A.; Lander, Matthew; Lindler, Don; Loose, Markus; Moseley, S. H.; Mott, D. Brent; Rauscher, Bernard J.; Wen, Yiting; hide

    2013-01-01

    We present principal component analysis (PCA) of a flight-representative James Webb Space Telescope NearInfrared Spectrograph (NIRSpec) Detector Subsystem. Although our results are specific to NIRSpec and its T - 40 K SIDECAR ASICs and 5 m cutoff H2RG detector arrays, the underlying technical approach is more general. We describe how we measured the systems response to small environmental perturbations by modulating a set of bias voltages and temperature. We used this information to compute the systems principal noise components. Together with information from the astronomical scene, we show how the zeroth principal component can be used to calibrate out the effects of small thermal and electrical instabilities to produce cosmetically cleaner images with significantly less correlated noise. Alternatively, if one were designing a new instrument, one could use a similar PCA approach to inform a set of environmental requirements (temperature stability, electrical stability, etc.) that enabled the planned instrument to meet performance requirements

  10. The utilization of Depth Invariant Index and Principle Component Analysis for mapping seagrass ecosystem of Kotok Island and Karang Bongkok, Indonesia

    NASA Astrophysics Data System (ADS)

    Manuputty, Agnestesya; Lumban Gaol, Jonson; Bahri Agus, Syamsul; Wayan Nurjaya, I.

    2017-01-01

    Seagrass perform a variety of functions within ecosystems, and have both economic and ecological values, therefore it has to be kept sustainable. One of the stages to preserve seagrass ecosystems is monitoring by utilizing thespatial data accurately. The purpose of the study was to assess and compare the accuracy of DII and PCA transformationsfor mapping of seagrass ecosystems. Fieldstudy was carried out in Karang Bongkok and Kotok Island waters, in Agustus 2014 and in March 2015. A WorldView-2 image acquisition date of 5 October 2013 was used in the study. The transformations for image processing data were Depth Invariant Index (DII) and Principle Component Analysis (PCA) using Support Vector Machine (SVM) classification. The result shows that benthic habitat mapping of Karang Bongkok using DII and PCA transformations were 72%and 81% overall’s accuracy respectively, whereas of Kotok Island were 83% and 84% overall’s accuracy respectively. There were seven benthic habitat types found in karang Bongkok waters and in Kotok Island namely seagrass, sand, rubble, coral, logoon, sand mix seagrass, and sand mix rubble. PCA transformation was effectively to improve mapping accuracy of sea grass mapping in Kotok Island and Karang Bongkok.

  11. Forecasting of UV-Vis absorbance time series using artificial neural networks combined with principal component analysis.

    PubMed

    Plazas-Nossa, Leonardo; Hofer, Thomas; Gruber, Günter; Torres, Andres

    2017-02-01

    This work proposes a methodology for the forecasting of online water quality data provided by UV-Vis spectrometry. Therefore, a combination of principal component analysis (PCA) to reduce the dimensionality of a data set and artificial neural networks (ANNs) for forecasting purposes was used. The results obtained were compared with those obtained by using discrete Fourier transform (DFT). The proposed methodology was applied to four absorbance time series data sets composed by a total number of 5705 UV-Vis spectra. Absolute percentage errors obtained by applying the proposed PCA/ANN methodology vary between 10% and 13% for all four study sites. In general terms, the results obtained were hardly generalizable, as they appeared to be highly dependent on specific dynamics of the water system; however, some trends can be outlined. PCA/ANN methodology gives better results than PCA/DFT forecasting procedure by using a specific spectra range for the following conditions: (i) for Salitre wastewater treatment plant (WWTP) (first hour) and Graz West R05 (first 18 min), from the last part of UV range to all visible range; (ii) for Gibraltar pumping station (first 6 min) for all UV-Vis absorbance spectra; and (iii) for San Fernando WWTP (first 24 min) for all of UV range to middle part of visible range.

  12. Enlightening discriminative network functional modules behind Principal Component Analysis separation in differential-omic science studies

    PubMed Central

    Ciucci, Sara; Ge, Yan; Durán, Claudio; Palladini, Alessandra; Jiménez-Jiménez, Víctor; Martínez-Sánchez, Luisa María; Wang, Yuting; Sales, Susanne; Shevchenko, Andrej; Poser, Steven W.; Herbig, Maik; Otto, Oliver; Androutsellis-Theotokis, Andreas; Guck, Jochen; Gerl, Mathias J.; Cannistraci, Carlo Vittorio

    2017-01-01

    Omic science is rapidly growing and one of the most employed techniques to explore differential patterns in omic datasets is principal component analysis (PCA). However, a method to enlighten the network of omic features that mostly contribute to the sample separation obtained by PCA is missing. An alternative is to build correlation networks between univariately-selected significant omic features, but this neglects the multivariate unsupervised feature compression responsible for the PCA sample segregation. Biologists and medical researchers often prefer effective methods that offer an immediate interpretation to complicated algorithms that in principle promise an improvement but in practice are difficult to be applied and interpreted. Here we present PC-corr: a simple algorithm that associates to any PCA segregation a discriminative network of features. Such network can be inspected in search of functional modules useful in the definition of combinatorial and multiscale biomarkers from multifaceted omic data in systems and precision biomedicine. We offer proofs of PC-corr efficacy on lipidomic, metagenomic, developmental genomic, population genetic, cancer promoteromic and cancer stem-cell mechanomic data. Finally, PC-corr is a general functional network inference approach that can be easily adopted for big data exploration in computer science and analysis of complex systems in physics. PMID:28287094

  13. Tool Wear Prediction in Ti-6Al-4V Machining through Multiple Sensor Monitoring and PCA Features Pattern Recognition.

    PubMed

    Caggiano, Alessandra

    2018-03-09

    Machining of titanium alloys is characterised by extremely rapid tool wear due to the high cutting temperature and the strong adhesion at the tool-chip and tool-workpiece interface, caused by the low thermal conductivity and high chemical reactivity of Ti alloys. With the aim to monitor the tool conditions during dry turning of Ti-6Al-4V alloy, a machine learning procedure based on the acquisition and processing of cutting force, acoustic emission and vibration sensor signals during turning is implemented. A number of sensorial features are extracted from the acquired sensor signals in order to feed machine learning paradigms based on artificial neural networks. To reduce the large dimensionality of the sensorial features, an advanced feature extraction methodology based on Principal Component Analysis (PCA) is proposed. PCA allowed to identify a smaller number of features ( k = 2 features), the principal component scores, obtained through linear projection of the original d features into a new space with reduced dimensionality k = 2, sufficient to describe the variance of the data. By feeding artificial neural networks with the PCA features, an accurate diagnosis of tool flank wear ( VB max ) was achieved, with predicted values very close to the measured tool wear values.

  14. Tool Wear Prediction in Ti-6Al-4V Machining through Multiple Sensor Monitoring and PCA Features Pattern Recognition

    PubMed Central

    2018-01-01

    Machining of titanium alloys is characterised by extremely rapid tool wear due to the high cutting temperature and the strong adhesion at the tool-chip and tool-workpiece interface, caused by the low thermal conductivity and high chemical reactivity of Ti alloys. With the aim to monitor the tool conditions during dry turning of Ti-6Al-4V alloy, a machine learning procedure based on the acquisition and processing of cutting force, acoustic emission and vibration sensor signals during turning is implemented. A number of sensorial features are extracted from the acquired sensor signals in order to feed machine learning paradigms based on artificial neural networks. To reduce the large dimensionality of the sensorial features, an advanced feature extraction methodology based on Principal Component Analysis (PCA) is proposed. PCA allowed to identify a smaller number of features (k = 2 features), the principal component scores, obtained through linear projection of the original d features into a new space with reduced dimensionality k = 2, sufficient to describe the variance of the data. By feeding artificial neural networks with the PCA features, an accurate diagnosis of tool flank wear (VBmax) was achieved, with predicted values very close to the measured tool wear values. PMID:29522443

  15. Prediagnostic circulating sex hormones are not associated with mortality for men with prostate cancer.

    PubMed

    Gershman, Boris; Shui, Irene M; Stampfer, Meir; Platz, Elizabeth A; Gann, Peter H; Sesso, Howard L; DuPre, Natalie; Giovannucci, Edward; Mucci, Lorelei A

    2014-04-01

    Sex hormones play an important role in the growth and development of the prostate, and low androgen levels have been suggested to carry an adverse prognosis for men with prostate cancer (PCa). To examine the association between prediagnostic circulating sex hormones and lethal PCa in two prospective cohort studies, the Physicians' Health Study (PHS) and the Health Professionals Follow-up Study (HPFS). We included 963 PCa cases (700 HPFS; 263 PHS) that provided prediagnostic blood samples, in 1982 for PHS and in 1993-1995 for HPFS, in which circulating sex hormone levels were assayed. The primary end point was lethal PCa (defined as cancer-specific mortality or development of metastases), and we also assessed total mortality through March 2011. We used Cox proportional hazards models to evaluate the association of prediagnostic sex hormone levels with time from diagnosis to development of lethal PCa or total mortality. PCa cases were followed for a mean of 12.0±4.9 yr after diagnosis. We confirmed 148 cases of lethal PCa and 421 deaths overall. Using Cox proportional hazard models, we found no significant association between quartile of total testosterone, sex hormone binding globulin (SHBG), SHBG-adjusted testosterone, free testosterone, dihydrotestosterone, androstanediol glucuronide, or estradiol and lethal PCa or total mortality. In subset analyses stratified by Gleason score, TNM stage, age, and interval between blood draw and diagnosis, there was also no consistent association between lethal PCa and sex hormone quartile. We found no overall association between prediagnostic circulating sex hormones and lethal PCa or total mortality. Our null results suggest that reverse causation may be responsible in prior studies that noted adverse outcomes for men with low circulating androgens. Copyright © 2013 European Association of Urology. Published by Elsevier B.V. All rights reserved.

  16. An Exploratory Study on Using Principal-Component Analysis and Confirmatory Factor Analysis to Identify Bolt-On Dimensions: The EQ-5D Case Study.

    PubMed

    Finch, Aureliano Paolo; Brazier, John Edward; Mukuria, Clara; Bjorner, Jakob Bue

    2017-12-01

    Generic preference-based measures such as the EuroQol five-dimensional questionnaire (EQ-5D) are used in economic evaluation, but may not be appropriate for all conditions. When this happens, a possible solution is adding bolt-ons to expand their descriptive systems. Using review-based methods, studies published to date claimed the relevance of bolt-ons in the presence of poor psychometric results. This approach does not identify the specific dimensions missing from the Generic preference-based measure core descriptive system, and is inappropriate for identifying dimensions that might improve the measure generically. This study explores the use of principal-component analysis (PCA) and confirmatory factor analysis (CFA) for bolt-on identification in the EQ-5D. Data were drawn from the international Multi-Instrument Comparison study, which is an online survey on health and well-being measures in five countries. Analysis was based on a pool of 92 items from nine instruments. Initial content analysis provided a theoretical framework for PCA results interpretation and CFA model development. PCA was used to investigate the underlining dimensional structure and whether EQ-5D items were represented in the identified constructs. CFA was used to confirm the structure. CFA was cross-validated in random halves of the sample. PCA suggested a nine-component solution, which was confirmed by CFA. This included psychological symptoms, physical functioning, and pain, which were covered by the EQ-5D, and satisfaction, speech/cognition,relationships, hearing, vision, and energy/sleep which were not. These latter factors may represent relevant candidate bolt-ons. PCA and CFA appear useful methods for identifying potential bolt-ons dimensions for an instrument such as the EQ-5D. Copyright © 2017 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.

  17. Independent components analysis to increase efficiency of discriminant analysis methods (FDA and LDA): Application to NMR fingerprinting of wine.

    PubMed

    Monakhova, Yulia B; Godelmann, Rolf; Kuballa, Thomas; Mushtakova, Svetlana P; Rutledge, Douglas N

    2015-08-15

    Discriminant analysis (DA) methods, such as linear discriminant analysis (LDA) or factorial discriminant analysis (FDA), are well-known chemometric approaches for solving classification problems in chemistry. In most applications, principle components analysis (PCA) is used as the first step to generate orthogonal eigenvectors and the corresponding sample scores are utilized to generate discriminant features for the discrimination. Independent components analysis (ICA) based on the minimization of mutual information can be used as an alternative to PCA as a preprocessing tool for LDA and FDA classification. To illustrate the performance of this ICA/DA methodology, four representative nuclear magnetic resonance (NMR) data sets of wine samples were used. The classification was performed regarding grape variety, year of vintage and geographical origin. The average increase for ICA/DA in comparison with PCA/DA in the percentage of correct classification varied between 6±1% and 8±2%. The maximum increase in classification efficiency of 11±2% was observed for discrimination of the year of vintage (ICA/FDA) and geographical origin (ICA/LDA). The procedure to determine the number of extracted features (PCs, ICs) for the optimum DA models was discussed. The use of independent components (ICs) instead of principle components (PCs) resulted in improved classification performance of DA methods. The ICA/LDA method is preferable to ICA/FDA for recognition tasks based on NMR spectroscopic measurements. Copyright © 2015 Elsevier B.V. All rights reserved.

  18. Non-linear principal component analysis applied to Lorenz models and to North Atlantic SLP

    NASA Astrophysics Data System (ADS)

    Russo, A.; Trigo, R. M.

    2003-04-01

    A non-linear generalisation of Principal Component Analysis (PCA), denoted Non-Linear Principal Component Analysis (NLPCA), is introduced and applied to the analysis of three data sets. Non-Linear Principal Component Analysis allows for the detection and characterisation of low-dimensional non-linear structure in multivariate data sets. This method is implemented using a 5-layer feed-forward neural network introduced originally in the chemical engineering literature (Kramer, 1991). The method is described and details of its implementation are addressed. Non-Linear Principal Component Analysis is first applied to a data set sampled from the Lorenz attractor (1963). It is found that the NLPCA approximations are more representative of the data than are the corresponding PCA approximations. The same methodology was applied to the less known Lorenz attractor (1984). However, the results obtained weren't as good as those attained with the famous 'Butterfly' attractor. Further work with this model is underway in order to assess if NLPCA techniques can be more representative of the data characteristics than are the corresponding PCA approximations. The application of NLPCA to relatively 'simple' dynamical systems, such as those proposed by Lorenz, is well understood. However, the application of NLPCA to a large climatic data set is much more challenging. Here, we have applied NLPCA to the sea level pressure (SLP) field for the entire North Atlantic area and the results show a slight imcrement of explained variance associated. Finally, directions for future work are presented.%}

  19. Common mode error in Antarctic GPS coordinate time series on its effect on bedrock-uplift estimates

    NASA Astrophysics Data System (ADS)

    Liu, Bin; King, Matt; Dai, Wujiao

    2018-05-01

    Spatially-correlated common mode error always exists in regional, or-larger, GPS networks. We applied independent component analysis (ICA) to GPS vertical coordinate time series in Antarctica from 2010 to 2014 and made a comparison with the principal component analysis (PCA). Using PCA/ICA, the time series can be decomposed into a set of temporal components and their spatial responses. We assume the components with common spatial responses are common mode error (CME). An average reduction of ˜40% about the RMS values was achieved in both PCA and ICA filtering. However, the common mode components obtained from the two approaches have different spatial and temporal features. ICA time series present interesting correlations with modeled atmospheric and non-tidal ocean loading displacements. A white noise (WN) plus power law noise (PL) model was adopted in the GPS velocity estimation using maximum likelihood estimation (MLE) analysis, with ˜55% reduction of the velocity uncertainties after filtering using ICA. Meanwhile, spatiotemporal filtering reduces the amplitude of PL and periodic terms in the GPS time series. Finally, we compare the GPS uplift velocities, after correction for elastic effects, with recent models of glacial isostatic adjustment (GIA). The agreements of the GPS observed velocities and four GIA models are generally improved after the spatiotemporal filtering, with a mean reduction of ˜0.9 mm/yr of the WRMS values, possibly allowing for more confident separation of various GIA model predictions.

  20. How Many Separable Sources? Model Selection In Independent Components Analysis

    PubMed Central

    Woods, Roger P.; Hansen, Lars Kai; Strother, Stephen

    2015-01-01

    Unlike mixtures consisting solely of non-Gaussian sources, mixtures including two or more Gaussian components cannot be separated using standard independent components analysis methods that are based on higher order statistics and independent observations. The mixed Independent Components Analysis/Principal Components Analysis (mixed ICA/PCA) model described here accommodates one or more Gaussian components in the independent components analysis model and uses principal components analysis to characterize contributions from this inseparable Gaussian subspace. Information theory can then be used to select from among potential model categories with differing numbers of Gaussian components. Based on simulation studies, the assumptions and approximations underlying the Akaike Information Criterion do not hold in this setting, even with a very large number of observations. Cross-validation is a suitable, though computationally intensive alternative for model selection. Application of the algorithm is illustrated using Fisher's iris data set and Howells' craniometric data set. Mixed ICA/PCA is of potential interest in any field of scientific investigation where the authenticity of blindly separated non-Gaussian sources might otherwise be questionable. Failure of the Akaike Information Criterion in model selection also has relevance in traditional independent components analysis where all sources are assumed non-Gaussian. PMID:25811988

  1. Regional and local background ozone in Houston during Texas Air Quality Study 2006

    NASA Astrophysics Data System (ADS)

    Langford, A. O.; Senff, C. J.; Banta, R. M.; Hardesty, R. M.; Alvarez, R. J.; Sandberg, Scott P.; Darby, Lisa S.

    2009-04-01

    Principal Component Analysis (PCA) is used to isolate the common modes of behavior in the daily maximum 8-h average ozone mixing ratios measured at 30 Continuous Ambient Monitoring Stations in the Houston-Galveston-Brazoria area during the Second Texas Air Quality Study field intensive (1 August to 15 October 2006). Three principal components suffice to explain 93% of the total variance. Nearly 84% is explained by the first component, which is attributed to changes in the "regional background" determined primarily by the large-scale winds. The second component (6%) is attributed to changes in the "local background," that is, ozone photochemically produced in the Houston area and spatially and temporally averaged by local circulations. Finally, the third component (3.5%) is attributed to short-lived plumes containing high ozone originating from industrial areas along Galveston Bay and the Houston Ship Channel. Regional background ozone concentrations derived using the first component compare well with mean ozone concentrations measured above the Gulf of Mexico by the tunable profiler for aerosols and ozone lidar aboard the NOAA Twin Otter. The PCA regional background values also agree well with background values derived using the lowest daily 8-h maximum method of Nielsen-Gammon et al. (2005), provided the Galveston Airport data (C34) are omitted from that analysis. The differences found when Galveston is included are caused by the sea breeze, which depresses ozone at Galveston relative to sites further inland. PCA removes the effects of this and other local circulations to obtain a regional background value representative of the greater Houston area.

  2. Pepper seed variety identification based on visible/near-infrared spectral technology

    NASA Astrophysics Data System (ADS)

    Li, Cuiling; Wang, Xiu; Meng, Zhijun; Fan, Pengfei; Cai, Jichen

    2016-11-01

    Pepper is a kind of important fruit vegetable, with the expansion of pepper hybrid planting area, detection of pepper seed purity is especially important. This research used visible/near infrared (VIS/NIR) spectral technology to detect the variety of single pepper seed, and chose hybrid pepper seeds "Zhuo Jiao NO.3", "Zhuo Jiao NO.4" and "Zhuo Jiao NO.5" as research sample. VIS/NIR spectral data of 80 "Zhuo Jiao NO.3", 80 "Zhuo Jiao NO.4" and 80 "Zhuo Jiao NO.5" pepper seeds were collected, and the original spectral data was pretreated with standard normal variable (SNV) transform, first derivative (FD), and Savitzky-Golay (SG) convolution smoothing methods. Principal component analysis (PCA) method was adopted to reduce the dimension of the spectral data and extract principal components, according to the distribution of the first principal component (PC1) along with the second principal component(PC2) in the twodimensional plane, similarly, the distribution of PC1 coupled with the third principal component(PC3), and the distribution of PC2 combined with PC3, distribution areas of three varieties of pepper seeds were divided in each twodimensional plane, and the discriminant accuracy of PCA was tested through observing the distribution area of samples' principal components in validation set. This study combined PCA and linear discriminant analysis (LDA) to identify single pepper seed varieties, results showed that with the FD preprocessing method, the discriminant accuracy of pepper seed varieties was 98% for validation set, it concludes that using VIS/NIR spectral technology is feasible for identification of single pepper seed varieties.

  3. Analysis of environmental variation in a Great Plains reservoir using principal components analysis and geographic information systems

    USGS Publications Warehouse

    Long, J.M.; Fisher, W.L.

    2006-01-01

    We present a method for spatial interpretation of environmental variation in a reservoir that integrates principal components analysis (PCA) of environmental data with geographic information systems (GIS). To illustrate our method, we used data from a Great Plains reservoir (Skiatook Lake, Oklahoma) with longitudinal variation in physicochemical conditions. We measured 18 physicochemical features, mapped them using GIS, and then calculated and interpreted four principal components. Principal component 1 (PC1) was readily interpreted as longitudinal variation in water chemistry, but the other principal components (PC2-4) were difficult to interpret. Site scores for PC1-4 were calculated in GIS by summing weighted overlays of the 18 measured environmental variables, with the factor loadings from the PCA as the weights. PC1-4 were then ordered into a landscape hierarchy, an emergent property of this technique, which enabled their interpretation. PC1 was interpreted as a reservoir scale change in water chemistry, PC2 was a microhabitat variable of rip-rap substrate, PC3 identified coves/embayments and PC4 consisted of shoreline microhabitats related to slope. The use of GIS improved our ability to interpret the more obscure principal components (PC2-4), which made the spatial variability of the reservoir environment more apparent. This method is applicable to a variety of aquatic systems, can be accomplished using commercially available software programs, and allows for improved interpretation of the geographic environmental variability of a system compared to using typical PCA plots. ?? Copyright by the North American Lake Management Society 2006.

  4. Principal Component Analysis for Enhancement of Infrared Spectra Monitoring

    NASA Astrophysics Data System (ADS)

    Haney, Ricky Lance

    The issue of air quality within the aircraft cabin is receiving increasing attention from both pilot and flight attendant unions. This is due to exposure events caused by poor air quality that in some cases may have contained toxic oil components due to bleed air that flows from outside the aircraft and then through the engines into the aircraft cabin. Significant short and long-term medical issues for aircraft crew have been attributed to exposure. The need for air quality monitoring is especially evident in the fact that currently within an aircraft there are no sensors to monitor the air quality and potentially harmful gas levels (detect-to-warn sensors), much less systems to monitor and purify the air (detect-to-treat sensors) within the aircraft cabin. The specific purpose of this research is to utilize a mathematical technique called principal component analysis (PCA) in conjunction with principal component regression (PCR) and proportionality constant calculations (PCC) to simplify complex, multi-component infrared (IR) spectra data sets into a reduced data set used for determination of the concentrations of the individual components. Use of PCA can significantly simplify data analysis as well as improve the ability to determine concentrations of individual target species in gas mixtures where significant band overlap occurs in the IR spectrum region. Application of this analytical numerical technique to IR spectrum analysis is important in improving performance of commercial sensors that airlines and aircraft manufacturers could potentially use in an aircraft cabin environment for multi-gas component monitoring. The approach of this research is two-fold, consisting of a PCA application to compare simulation and experimental results with the corresponding PCR and PCC to determine quantitatively the component concentrations within a mixture. The experimental data sets consist of both two and three component systems that could potentially be present as air contaminants in an aircraft cabin. In addition, experimental data sets are analyzed for a hydrogen peroxide (H2O2) aqueous solution mixture to determine H2O2 concentrations at various levels that could be produced during use of a vapor phase hydrogen peroxide (VPHP) decontamination system. After the PCA application to two and three component systems, the analysis technique is further expanded to include the monitoring of potential bleed air contaminants from engine oil combustion. Simulation data sets created from database spectra were utilized to predict gas components and concentrations in unknown engine oil samples at high temperatures as well as time-evolved gases from the heating of engine oils.

  5. Data on xylem sap proteins from Mn- and Fe-deficient tomato plants obtained using shotgun proteomics.

    PubMed

    Ceballos-Laita, Laura; Gutierrez-Carbonell, Elain; Takahashi, Daisuke; Abadía, Anunciación; Uemura, Matsuo; Abadía, Javier; López-Millán, Ana Flor

    2018-04-01

    This article contains consolidated proteomic data obtained from xylem sap collected from tomato plants grown in Fe- and Mn-sufficient control, as well as Fe-deficient and Mn-deficient conditions. Data presented here cover proteins identified and quantified by shotgun proteomics and Progenesis LC-MS analyses: proteins identified with at least two peptides and showing changes statistically significant (ANOVA; p ≤ 0.05) and above a biologically relevant selected threshold (fold ≥ 2) between treatments are listed. The comparison between Fe-deficient, Mn-deficient and control xylem sap samples using a multivariate statistical data analysis (Principal Component Analysis, PCA) is also included. Data included in this article are discussed in depth in the research article entitled "Effects of Fe and Mn deficiencies on the protein profiles of tomato ( Solanum lycopersicum) xylem sap as revealed by shotgun analyses" [1]. This dataset is made available to support the cited study as well to extend analyses at a later stage.

  6. SU-F-R-41: Regularized PCA Can Model Treatment-Related Changes in Head and Neck Patients Using Daily CBCTs

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chetvertkov, M; Henry Ford Health System, Detroit, MI; Siddiqui, F

    2016-06-15

    Purpose: To use daily cone beam CTs (CBCTs) to develop regularized principal component analysis (PCA) models of anatomical changes in head and neck (H&N) patients, to guide replanning decisions in adaptive radiation therapy (ART). Methods: Known deformations were applied to planning CT (pCT) images of 10 H&N patients to model several different systematic anatomical changes. A Pinnacle plugin was used to interpolate systematic changes over 35 fractions, generating a set of 35 synthetic CTs for each patient. Deformation vector fields (DVFs) were acquired between the pCT and synthetic CTs and random fraction-to-fraction changes were superimposed on the DVFs. Standard non-regularizedmore » and regularized patient-specific PCA models were built using the DVFs. The ability of PCA to extract the known deformations was quantified. PCA models were also generated from clinical CBCTs, for which the deformations and DVFs were not known. It was hypothesized that resulting eigenvectors/eigenfunctions with largest eigenvalues represent the major anatomical deformations during the course of treatment. Results: As demonstrated with quantitative results in the supporting document regularized PCA is more successful than standard PCA at capturing systematic changes early in the treatment. Regularized PCA is able to detect smaller systematic changes against the background of random fraction-to-fraction changes. To be successful at guiding ART, regularized PCA should be coupled with models of when anatomical changes occur: early, late or throughout the treatment course. Conclusion: The leading eigenvector/eigenfunction from the both PCA approaches can tentatively be identified as a major systematic change during radiotherapy course when systematic changes are large enough with respect to random fraction-to-fraction changes. In all cases the regularized PCA approach appears to be more reliable at capturing systematic changes, enabling dosimetric consequences to be projected once trends are established early in the treatment course. This work is supported in part by a grant from Varian Medical Systems, Palo Alto, CA.« less

  7. National economic and development indicators and international variation in prostate cancer incidence and mortality: an ecological analysis.

    PubMed

    Neupane, Subas; Bray, Freddie; Auvinen, Anssi

    2017-06-01

    Macroeconomic indicators are likely associated with prostate cancer (PCa) incidence and mortality globally, but have rarely been assessed. Data on PCa incidence in 2003-2007 for 49 countries with either nationwide cancer registry or at least two regional registries were obtained from Cancer Incidence in Five Continents Vol X and national PCa mortality for 2012 from GLOBOCAN 2012. We compared PCa incidence and mortality rates with various population-level indicators of health, economy and development in 2000. Poisson and linear regression methods were used to quantify the associations. PCa incidence varied more than 15-fold, being highest in high-income countries. PCa mortality exhibited less variation, with higher rates in many low- and middle-income countries. Healthcare expenditure (rate ratio, RR 1.46, 95 % CI 1.45-1.47) and population growth (RR 1.15, 95 % CI 1.14-1.16), as well as computer and mobile phone density, were associated with a higher PCa incidence, while gross domestic product, GDP (RR 0.94, 95 % CI 0.93-0.95) and overall mortality (RR 0.72, 95 % CI 0.71-0.73) were associated with a low incidence. GDP (RR 0.55, 95 % CI 0.46-0.66) was also associated with a low PCa mortality, while life expectancy (RR 3.93, 95 % CI 3.22-4.79) and healthcare expenditure (RR 1.20, 95 % CI 1.09-1.32) were associated with an elevated mortality. Our results show that healthcare expenditure and, thus, the availability of medical resources are an important contributor to the patterns of international variation in PCa incidence. This suggests that there is an iatrogenic component in the current global epidemic of PCa. On the other hand, higher healthcare expenditure is associated with lower PCa death rates.

  8. Improved estimation of parametric images of cerebral glucose metabolic rate from dynamic FDG-PET using volume-wise principle component analysis

    NASA Astrophysics Data System (ADS)

    Dai, Xiaoqian; Tian, Jie; Chen, Zhe

    2010-03-01

    Parametric images can represent both spatial distribution and quantification of the biological and physiological parameters of tracer kinetics. The linear least square (LLS) method is a well-estimated linear regression method for generating parametric images by fitting compartment models with good computational efficiency. However, bias exists in LLS-based parameter estimates, owing to the noise present in tissue time activity curves (TTACs) that propagates as correlated error in the LLS linearized equations. To address this problem, a volume-wise principal component analysis (PCA) based method is proposed. In this method, firstly dynamic PET data are properly pre-transformed to standardize noise variance as PCA is a data driven technique and can not itself separate signals from noise. Secondly, the volume-wise PCA is applied on PET data. The signals can be mostly represented by the first few principle components (PC) and the noise is left in the subsequent PCs. Then the noise-reduced data are obtained using the first few PCs by applying 'inverse PCA'. It should also be transformed back according to the pre-transformation method used in the first step to maintain the scale of the original data set. Finally, the obtained new data set is used to generate parametric images using the linear least squares (LLS) estimation method. Compared with other noise-removal method, the proposed method can achieve high statistical reliability in the generated parametric images. The effectiveness of the method is demonstrated both with computer simulation and with clinical dynamic FDG PET study.

  9. Landslides Identification Using Airborne Laser Scanning Data Derived Topographic Terrain Attributes and Support Vector Machine Classification

    NASA Astrophysics Data System (ADS)

    Pawłuszek, Kamila; Borkowski, Andrzej

    2016-06-01

    Since the availability of high-resolution Airborne Laser Scanning (ALS) data, substantial progress in geomorphological research, especially in landslide analysis, has been carried out. First and second order derivatives of Digital Terrain Model (DTM) have become a popular and powerful tool in landslide inventory mapping. Nevertheless, an automatic landslide mapping based on sophisticated classifiers including Support Vector Machine (SVM), Artificial Neural Network or Random Forests is often computationally time consuming. The objective of this research is to deeply explore topographic information provided by ALS data and overcome computational time limitation. For this reason, an extended set of topographic features and the Principal Component Analysis (PCA) were used to reduce redundant information. The proposed novel approach was tested on a susceptible area affected by more than 50 landslides located on Rożnów Lake in Carpathian Mountains, Poland. The initial seven PCA components with 90% of the total variability in the original topographic attributes were used for SVM classification. Comparing results with landslide inventory map, the average user's accuracy (UA), producer's accuracy (PA), and overall accuracy (OA) were calculated for two models according to the classification results. Thereby, for the PCA-feature-reduced model UA, PA, and OA were found to be 72%, 76%, and 72%, respectively. Similarly, UA, PA, and OA in the non-reduced original topographic model, was 74%, 77% and 74%, respectively. Using the initial seven PCA components instead of the twenty original topographic attributes does not significantly change identification accuracy but reduce computational time.

  10. Effect of age at onset on cortical thickness and cognition in posterior cortical atrophy.

    PubMed

    Suárez-González, Aida; Lehmann, Manja; Shakespeare, Timothy J; Yong, Keir X X; Paterson, Ross W; Slattery, Catherine F; Foulkes, Alexander J M; Rabinovici, Gil D; Gil-Néciga, Eulogio; Roldán-Lora, Florinda; Schott, Jonathan M; Fox, Nick C; Crutch, Sebastian J

    2016-08-01

    Age at onset (AAO) has been shown to influence the phenotype of Alzheimer's disease (AD), but how it affects atypical presentations of AD remains unknown. Posterior cortical atrophy (PCA) is the most common form of atypical AD. In this study, we aimed to investigate the effect of AAO on cortical thickness and cognitive function in 98 PCA patients. We used Freesurfer (v5.3.0) to compare cortical thickness with AAO both as a continuous variable, and by dichotomizing the groups based on median age (58 years). In both the continuous and dichotomized analyses, we found a pattern suggestive of thinner cortex in precuneus and parietal areas in earlier-onset PCA, and lower cortical thickness in anterior cingulate and prefrontal cortex in later-onset PCA. These cortical thickness differences between PCA subgroups were consistent with earlier-onset PCA patients performing worse on cognitive tests involving parietal functions. Our results provide a suggestion that AAO may not only affect the clinico-anatomical characteristics in AD but may also affect atrophy patterns and cognition within atypical AD phenotypes. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  11. Two worlds collide: Image analysis methods for quantifying structural variation in cluster molecular dynamics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Steenbergen, K. G., E-mail: kgsteen@gmail.com; Gaston, N.

    2014-02-14

    Inspired by methods of remote sensing image analysis, we analyze structural variation in cluster molecular dynamics (MD) simulations through a unique application of the principal component analysis (PCA) and Pearson Correlation Coefficient (PCC). The PCA analysis characterizes the geometric shape of the cluster structure at each time step, yielding a detailed and quantitative measure of structural stability and variation at finite temperature. Our PCC analysis captures bond structure variation in MD, which can be used to both supplement the PCA analysis as well as compare bond patterns between different cluster sizes. Relying only on atomic position data, without requirement formore » a priori structural input, PCA and PCC can be used to analyze both classical and ab initio MD simulations for any cluster composition or electronic configuration. Taken together, these statistical tools represent powerful new techniques for quantitative structural characterization and isomer identification in cluster MD.« less

  12. Multi-Centrality Graph Spectral Decompositions and Their Application to Cyber Intrusion Detection

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, Pin-Yu; Choudhury, Sutanay; Hero, Alfred

    Many modern datasets can be represented as graphs and hence spectral decompositions such as graph principal component analysis (PCA) can be useful. Distinct from previous graph decomposition approaches based on subspace projection of a single topological feature, e.g., the centered graph adjacency matrix (graph Laplacian), we propose spectral decomposition approaches to graph PCA and graph dictionary learning that integrate multiple features, including graph walk statistics, centrality measures and graph distances to reference nodes. In this paper we propose a new PCA method for single graph analysis, called multi-centrality graph PCA (MC-GPCA), and a new dictionary learning method for ensembles ofmore » graphs, called multi-centrality graph dictionary learning (MC-GDL), both based on spectral decomposition of multi-centrality matrices. As an application to cyber intrusion detection, MC-GPCA can be an effective indicator of anomalous connectivity pattern and MC-GDL can provide discriminative basis for attack classification.« less

  13. Prediction of BP reactivity to talking using hybrid soft computing approaches.

    PubMed

    Kaur, Gurmanik; Arora, Ajat Shatru; Jain, Vijender Kumar

    2014-01-01

    High blood pressure (BP) is associated with an increased risk of cardiovascular diseases. Therefore, optimal precision in measurement of BP is appropriate in clinical and research studies. In this work, anthropometric characteristics including age, height, weight, body mass index (BMI), and arm circumference (AC) were used as independent predictor variables for the prediction of BP reactivity to talking. Principal component analysis (PCA) was fused with artificial neural network (ANN), adaptive neurofuzzy inference system (ANFIS), and least square-support vector machine (LS-SVM) model to remove the multicollinearity effect among anthropometric predictor variables. The statistical tests in terms of coefficient of determination (R (2)), root mean square error (RMSE), and mean absolute percentage error (MAPE) revealed that PCA based LS-SVM (PCA-LS-SVM) model produced a more efficient prediction of BP reactivity as compared to other models. This assessment presents the importance and advantages posed by PCA fused prediction models for prediction of biological variables.

  14. Finessing filter scarcity problem in face recognition via multi-fold filter convolution

    NASA Astrophysics Data System (ADS)

    Low, Cheng-Yaw; Teoh, Andrew Beng-Jin

    2017-06-01

    The deep convolutional neural networks for face recognition, from DeepFace to the recent FaceNet, demand a sufficiently large volume of filters for feature extraction, in addition to being deep. The shallow filter-bank approaches, e.g., principal component analysis network (PCANet), binarized statistical image features (BSIF), and other analogous variants, endure the filter scarcity problem that not all PCA and ICA filters available are discriminative to abstract noise-free features. This paper extends our previous work on multi-fold filter convolution (ℳ-FFC), where the pre-learned PCA and ICA filter sets are exponentially diversified by ℳ folds to instantiate PCA, ICA, and PCA-ICA offspring. The experimental results unveil that the 2-FFC operation solves the filter scarcity state. The 2-FFC descriptors are also evidenced to be superior to that of PCANet, BSIF, and other face descriptors, in terms of rank-1 identification rate (%).

  15. Two worlds collide: image analysis methods for quantifying structural variation in cluster molecular dynamics.

    PubMed

    Steenbergen, K G; Gaston, N

    2014-02-14

    Inspired by methods of remote sensing image analysis, we analyze structural variation in cluster molecular dynamics (MD) simulations through a unique application of the principal component analysis (PCA) and Pearson Correlation Coefficient (PCC). The PCA analysis characterizes the geometric shape of the cluster structure at each time step, yielding a detailed and quantitative measure of structural stability and variation at finite temperature. Our PCC analysis captures bond structure variation in MD, which can be used to both supplement the PCA analysis as well as compare bond patterns between different cluster sizes. Relying only on atomic position data, without requirement for a priori structural input, PCA and PCC can be used to analyze both classical and ab initio MD simulations for any cluster composition or electronic configuration. Taken together, these statistical tools represent powerful new techniques for quantitative structural characterization and isomer identification in cluster MD.

  16. Source Evaluation and Trace Metal Contamination in Benthic Sediments from Equatorial Ecosystems Using Multivariate Statistical Techniques

    PubMed Central

    Benson, Nsikak U.; Asuquo, Francis E.; Williams, Akan B.; Essien, Joseph P.; Ekong, Cyril I.; Akpabio, Otobong; Olajire, Abaas A.

    2016-01-01

    Trace metals (Cd, Cr, Cu, Ni and Pb) concentrations in benthic sediments were analyzed through multi-step fractionation scheme to assess the levels and sources of contamination in estuarine, riverine and freshwater ecosystems in Niger Delta (Nigeria). The degree of contamination was assessed using the individual contamination factors (ICF) and global contamination factor (GCF). Multivariate statistical approaches including principal component analysis (PCA), cluster analysis and correlation test were employed to evaluate the interrelationships and associated sources of contamination. The spatial distribution of metal concentrations followed the pattern Pb>Cu>Cr>Cd>Ni. Ecological risk index by ICF showed significant potential mobility and bioavailability for Cu, Cu and Ni. The ICF contamination trend in the benthic sediments at all studied sites was Cu>Cr>Ni>Cd>Pb. The principal component and agglomerative clustering analyses indicate that trace metals contamination in the ecosystems was influenced by multiple pollution sources. PMID:27257934

  17. Untargeted Identification of Wood Type-Specific Markers in Particulate Matter from Wood Combustion.

    PubMed

    Weggler, Benedikt A; Ly-Verdu, Saray; Jennerwein, Maximilian; Sippula, Olli; Reda, Ahmed A; Orasche, Jürgen; Gröger, Thomas; Jokiniemi, Jorma; Zimmermann, Ralf

    2016-09-20

    Residential wood combustion emissions are one of the major global sources of particulate and gaseous organic pollutants. However, the detailed chemical compositions of these emissions are poorly characterized due to their highly complex molecular compositions, nonideal combustion conditions, and sample preparation steps. In this study, the particulate organic emissions from a masonry heater using three types of wood logs, namely, beech, birch, and spruce, were chemically characterized using thermal desorption in situ derivatization coupled to a GCxGC-ToF/MS system. Untargeted data analyses were performed using the comprehensive measurements. Univariate and multivariate chemometric tools, such as analysis of variance (ANOVA), principal component analysis (PCA), and ANOVA simultaneous component analysis (ASCA), were used to reduce the data to highly significant and wood type-specific features. This study reveals substances not previously considered in the literature as meaningful markers for differentiation among wood types.

  18. Preliminary study of soil permeability properties using principal component analysis

    NASA Astrophysics Data System (ADS)

    Yulianti, M.; Sudriani, Y.; Rustini, H. A.

    2018-02-01

    Soil permeability measurement is undoubtedly important in carrying out soil-water research such as rainfall-runoff modelling, irrigation water distribution systems, etc. It is also known that acquiring reliable soil permeability data is rather laborious, time-consuming, and costly. Therefore, it is desirable to develop the prediction model. Several studies of empirical equations for predicting permeability have been undertaken by many researchers. These studies derived the models from areas which soil characteristics are different from Indonesian soil, which suggest a possibility that these permeability models are site-specific. The purpose of this study is to identify which soil parameters correspond strongly to soil permeability and propose a preliminary model for permeability prediction. Principal component analysis (PCA) was applied to 16 parameters analysed from 37 sites consist of 91 samples obtained from Batanghari Watershed. Findings indicated five variables that have strong correlation with soil permeability, and we recommend a preliminary permeability model, which is potential for further development.

  19. Optimisation of a machine learning algorithm in human locomotion using principal component and discriminant function analyses.

    PubMed

    Bisele, Maria; Bencsik, Martin; Lewis, Martin G C; Barnett, Cleveland T

    2017-01-01

    Assessment methods in human locomotion often involve the description of normalised graphical profiles and/or the extraction of discrete variables. Whilst useful, these approaches may not represent the full complexity of gait data. Multivariate statistical methods, such as Principal Component Analysis (PCA) and Discriminant Function Analysis (DFA), have been adopted since they have the potential to overcome these data handling issues. The aim of the current study was to develop and optimise a specific machine learning algorithm for processing human locomotion data. Twenty participants ran at a self-selected speed across a 15m runway in barefoot and shod conditions. Ground reaction forces (BW) and kinematics were measured at 1000 Hz and 100 Hz, respectively from which joint angles (°), joint moments (N.m.kg-1) and joint powers (W.kg-1) for the hip, knee and ankle joints were calculated in all three anatomical planes. Using PCA and DFA, power spectra of the kinematic and kinetic variables were used as a training database for the development of a machine learning algorithm. All possible combinations of 10 out of 20 participants were explored to find the iteration of individuals that would optimise the machine learning algorithm. The results showed that the algorithm was able to successfully predict whether a participant ran shod or barefoot in 93.5% of cases. To the authors' knowledge, this is the first study to optimise the development of a machine learning algorithm.

  20. Optimisation of a machine learning algorithm in human locomotion using principal component and discriminant function analyses

    PubMed Central

    Bisele, Maria; Bencsik, Martin; Lewis, Martin G. C.

    2017-01-01

    Assessment methods in human locomotion often involve the description of normalised graphical profiles and/or the extraction of discrete variables. Whilst useful, these approaches may not represent the full complexity of gait data. Multivariate statistical methods, such as Principal Component Analysis (PCA) and Discriminant Function Analysis (DFA), have been adopted since they have the potential to overcome these data handling issues. The aim of the current study was to develop and optimise a specific machine learning algorithm for processing human locomotion data. Twenty participants ran at a self-selected speed across a 15m runway in barefoot and shod conditions. Ground reaction forces (BW) and kinematics were measured at 1000 Hz and 100 Hz, respectively from which joint angles (°), joint moments (N.m.kg-1) and joint powers (W.kg-1) for the hip, knee and ankle joints were calculated in all three anatomical planes. Using PCA and DFA, power spectra of the kinematic and kinetic variables were used as a training database for the development of a machine learning algorithm. All possible combinations of 10 out of 20 participants were explored to find the iteration of individuals that would optimise the machine learning algorithm. The results showed that the algorithm was able to successfully predict whether a participant ran shod or barefoot in 93.5% of cases. To the authors’ knowledge, this is the first study to optimise the development of a machine learning algorithm. PMID:28886059

  1. Type 2 diabetes mellitus: distribution of genetic markers in Kazakh population

    PubMed Central

    Sikhayeva, Nurgul; Talzhanov, Yerkebulan; Iskakova, Aisha; Dzharmukhanov, Jarkyn; Nugmanova, Raushan; Zholdybaeva, Elena; Ramanculov, Erlan

    2018-01-01

    Background Ethnic differences exist in the frequencies of genetic variations that contribute to the risk of common disease. This study aimed to analyse the distribution of several genes, previously associated with susceptibility to type 2 diabetes and obesity-related phenotypes, in a Kazakh population. Methods A total of 966 individuals belonging to the Kazakh ethnicity were recruited from an outpatient clinic. We genotyped 41 common single nucleotide polymorphisms (SNPs) previously associated with type 2 diabetes in other ethnic groups and 31 of these were in Hardy–Weinberg equilibrium. The obtained allele frequencies were further compared to publicly available data from other ethnic populations. Allele frequencies for other (compared) populations were pooled from the haplotype map (HapMap) database. Principal component analysis (PCA), cluster analysis, and multidimensional scaling (MDS) were used for the analysis of genetic relationship between the populations. Results Comparative analysis of allele frequencies of the studied SNPs showed significant differentiation among the studied populations. The Kazakh population was grouped with Asian populations according to the cluster analysis and with the Caucasian populations according to PCA. According to MDS, results of the current study show that the Kazakh population holds an intermediate position between Caucasian and Asian populations. Conclusion A high percentage of population differentiation was observed between Kazakh and world populations. The Kazakh population was clustered with Caucasian populations, and this result may indicate a significant Caucasian component in the Kazakh gene pool. PMID:29551892

  2. Effects of mutation, truncation and temperature on the folding kinetics of a WW domain

    PubMed Central

    Maisuradze, Gia G.; Zhou, Rui; Liwo, Adam; Xiao, Yi; Scheraga, Harold A.

    2013-01-01

    The purpose of this work is to show how mutation, truncation and change of temperature can influence the folding kinetics of a protein. This is accomplished by principal component analysis (PCA) of molecular dynamics (MD)-generated folding trajectories of the triple β-strand WW domain from the Formin binding protein 28 (FBP) [PDB: 1E0L] and its full-size, and singly- and doubly-truncated mutants at temperatures below and very close to the melting point. The reasons for biphasic folding kinetics [i.e., coexistence of slow (three-state) and fast (two-state) phases], including the involvement of a solvent-exposed hydrophobic cluster and another delocalized hydrophobic core in the folding kinetics, are discussed. New folding pathways are identified in free-energy landscapes determined in terms of principal components for full-size mutants. Three-state folding is found to be a main mechanism for folding FBP28 WW domain and most of the full-size and truncated mutants. The results from the theoretical analysis are compared to those from experiment. Agreements and discrepancies between the theoretical and experimental results are discussed. Because of its importance in understanding protein kinetics and function, the diffusive mechanism by which FBP28 WW domain and its full-size and truncated mutants explore their conformational space is examined in terms of the mean-square displacement, (MSD), and PCA eigenvalue spectrum analyses. Subdiffusive behavior is observed for all studied systems. PMID:22560992

  3. Epigenetic regulation of EFEMP1 in prostate cancer: biological relevance and clinical potential

    PubMed Central

    Almeida, Mafalda; Costa, Vera L; Costa, Natália R; Ramalho-Carvalho, João; Baptista, Tiago; Ribeiro, Franclim R; Paulo, Paula; Teixeira, Manuel R; Oliveira, Jorge; Lothe, Ragnhild A; Lind, Guro E; Henrique, Rui; Jerónimo, Carmen

    2014-01-01

    Epigenetic alterations are common in prostate cancer (PCa) and seem to contribute decisively to its initiation and progression. Moreover, aberrant promoter methylation is a promising biomarker for non-invasive screening. Herein, we sought to characterize EFEMP1 as biomarker for PCa, unveiling its biological relevance in prostate carcinogenesis. Microarray analyses of treated PCa cell lines and primary tissues enabled the selection of differentially methylated genes, among which EFEMP1 was further validated by MSP and bisulfite sequencing. Assessment of biomarker performance was accomplished by qMSP. Expression analysis of EFEMP1 and characterization of histone marks were performed in tissue samples and cancer cell lines to determine the impact of epigenetic mechanisms on EFEMP1 transcriptional regulation. Phenotypic assays, using transfected cell lines, permitted the evaluation of EFEMP1’s role in PCa development. EFEMP1 methylation assay discriminated PCa from normal prostate tissue (NPT; P < 0.001, Kruskall–Wallis test) and renal and bladder cancers (96% sensitivity and 98% specificity). EFEMP1 transcription levels inversely correlated with promoter methylation and histone deacetylation, suggesting that both epigenetic mechanisms are involved in gene regulation. Phenotypic assays showed that EFEMP1 de novo expression reduces malignant phenotype of PCa cells. EFEMP1 promoter methylation is prevalent in PCa and accurately discriminates PCa from non-cancerous prostate tissues and other urological neoplasms. This epigenetic alteration occurs early in prostate carcinogenesis and, in association with histone deacetylation, progressively leads to gene down-regulation, fostering cell proliferation, invasion and evasion of apoptosis. PMID:25211630

  4. Extracting factors for interest rate scenarios

    NASA Astrophysics Data System (ADS)

    Molgedey, L.; Galic, E.

    2001-04-01

    Factor based interest rate models are widely used for risk managing purposes, for option pricing and for identifying and capturing yield curve anomalies. The movements of a term structure of interest rates are commonly assumed to be driven by a small number of orthogonal factors such as SHIFT, TWIST and BUTTERFLY (BOW). These factors are usually obtained by a Principal Component Analysis (PCA) of historical bond prices (interest rates). Although PCA diagonalizes the covariance matrix of either the interest rates or the interest rate changes, it does not use both covariance matrices simultaneously. Furthermore higher linear and nonlinear correlations are neglected. These correlations as well as the mean reverting properties of the interest rates become crucial, if one is interested in a longer time horizon (infrequent hedging or trading). We will show that Independent Component Analysis (ICA) is a more appropriate tool than PCA, since ICA uses the covariance matrix of the interest rates as well as the covariance matrix of the interest rate changes simultaneously. Additionally higher linear and nonlinear correlations may be easily incorporated. The resulting factors are uncorrelated for various time delays, approximately independent but nonorthogonal. This is in contrast to the factors obtained from the PCA, which are orthogonal and uncorrelated for identical times only. Although factors from the ICA are nonorthogonal, it is sufficient to consider only a few factors in order to explain most of the variation in the original data. Finally we will present examples that ICA based hedges outperforms PCA based hedges specifically if the portfolio is sensitive to structural changes of the yield curve.

  5. The Application of Principal Component Analysis Using Fixed Eigenvectors to the Infrared Thermographic Inspection of the Space Shuttle Thermal Protection System

    NASA Technical Reports Server (NTRS)

    Cramer, K. Elliott; Winfree, William P.

    2006-01-01

    The Nondestructive Evaluation Sciences Branch at NASA s Langley Research Center has been actively involved in the development of thermographic inspection techniques for more than 15 years. Since the Space Shuttle Columbia accident, NASA has focused on the improvement of advanced NDE techniques for the Reinforced Carbon-Carbon (RCC) panels that comprise the orbiter s wing leading edge. Various nondestructive inspection techniques have been used in the examination of the RCC, but thermography has emerged as an effective inspection alternative to more traditional methods. Thermography is a non-contact inspection method as compared to ultrasonic techniques which typically require the use of a coupling medium between the transducer and material. Like radiographic techniques, thermography can be used to inspect large areas, but has the advantage of minimal safety concerns and the ability for single-sided measurements. Principal Component Analysis (PCA) has been shown effective for reducing thermographic NDE data. A typical implementation of PCA is when the eigenvectors are generated from the data set being analyzed. Although it is a powerful tool for enhancing the visibility of defects in thermal data, PCA can be computationally intense and time consuming when applied to the large data sets typical in thermography. Additionally, PCA can experience problems when very large defects are present (defects that dominate the field-of-view), since the calculation of the eigenvectors is now governed by the presence of the defect, not the good material. To increase the processing speed and to minimize the negative effects of large defects, an alternative method of PCA is being pursued when a fixed set of eigenvectors is used to process the thermal data from the RCC materials. These eigen vectors can be generated either from an analytic model of the thermal response of the material under examination, or from a large cross section of experimental data. This paper will provide the details of the analytic model; an overview of the PCA process; as well as a quantitative signal-to-noise comparison of the results of performing both embodiments of PCA on thermographic data from various RCC specimens. Details of a system that has been developed to allow insitu inspection of a majority of shuttle RCC components will be presented along with the acceptance test results for this system. Additionally, the results of applying this technology to the Space Shuttle Discovery after its return from flight will be presented.

  6. Electronic nose for the identification of pig feeding and ripening time in Iberian hams.

    PubMed

    Santos, J P; García, M; Aleixandre, M; Horrillo, M C; Gutiérrez, J; Sayago, I; Fernández, M J; Arés, L

    2004-03-01

    An electronic nose system to control the processing of dry-cured Iberian ham is presented. The sensors involved are tin oxide semiconductors thin films. They were prepared by RF sputtering. Some of the sensors were doped with metal catalysts as Pt and Pd, in order to improve the selectivity of the sensors. The multisensor with 16 semiconductor sensors, gave different responses from two types of dry-cured Iberian hams which differ in the feeding and curing time. The data has been analysed using the PCA (principal component analysis) and backpropagation and probabilistic neural networks. The analysis shows that different types of Iberian ham can be discriminated and identified successfully.

  7. An improved PCA method with application to boiler leak detection.

    PubMed

    Sun, Xi; Marquez, Horacio J; Chen, Tongwen; Riaz, Muhammad

    2005-07-01

    Principal component analysis (PCA) is a popular fault detection technique. It has been widely used in process industries, especially in the chemical industry. In industrial applications, achieving a sensitive system capable of detecting incipient faults, which maintains the false alarm rate to a minimum, is a crucial issue. Although a lot of research has been focused on these issues for PCA-based fault detection and diagnosis methods, sensitivity of the fault detection scheme versus false alarm rate continues to be an important issue. In this paper, an improved PCA method is proposed to address this problem. In this method, a new data preprocessing scheme and a new fault detection scheme designed for Hotelling's T2 as well as the squared prediction error are developed. A dynamic PCA model is also developed for boiler leak detection. This new method is applied to boiler water/steam leak detection with real data from Syncrude Canada's utility plant in Fort McMurray, Canada. Our results demonstrate that the proposed method can effectively reduce false alarm rate, provide effective and correct leak alarms, and give early warning to operators.

  8. Performance evaluation of BPM system in SSRF using PCA method

    NASA Astrophysics Data System (ADS)

    Chen, Zhi-Chu; Leng, Yong-Bin; Yan, Ying-Bing; Yuan, Ren-Xian; Lai, Long-Wei

    2014-07-01

    The beam position monitor (BPM) system is of most importance in a light source. The capability of the BPM depends on the resolution of the system. The traditional standard deviation on the raw data method merely gives the upper limit of the resolution. Principal component analysis (PCA) had been introduced in the accelerator physics and it could be used to get rid of the actual signals. Beam related information was extracted before the evaluation of the BPM performance. A series of studies had been made in the Shanghai Synchrotron Radiation Facility (SSRF) and PCA was proved to be an effective and robust method in the performance evaluations of our BPM system.

  9. Antidiabetic and antihyperlipidemic activity of p-coumaric acid in diabetic rats, role of pancreatic GLUT 2: In vivo approach.

    PubMed

    Amalan, Venkatesan; Vijayakumar, Natesan; Indumathi, Dhananjayan; Ramakrishnan, Arumugam

    2016-12-01

    P-coumaric acid (p-CA, 3-[4-hydroxyphenyl]-2-propenoic acid), the major component widely found in nutritious plant foods, has various antioxidant, antiinflammatory and anticancer property. To evaluate the antidiabetic and antihyperlipidemic mechanisms, via the effects on carbohydrate, lipids and lipoproteins responses in adult male albino Wistar rats were examined by treated with p-CA. Rats were injected with streptozotocin (STZ, 40mg/kg b.w.) by intraperitonially (i.p.) 30days for the induction of experimental diabetes mellitus. Diabetic rats were treated with p-CA orally at a dose of 100mg/kg b.w. The potential defending character of p-CA against diabetic rats was evaluated by performing the various biochemical parameters and glucose transporter such as GLUT2 mRNA expression of pancreas. Administration of p-CA significantly lowers the blood glucose level, gluconeogenic enzymes such as glucose-6-phosphatase and fructose-1,6-bisphosphatase whereas increases the activities of hexokinase, glucose-6 phosphatase dehydrogenase and GSH via by increasing level of insulin. p-CA reduces the total cholesterol and triglycerides in both plasma and tissues i.e. liver and kidney. p-CA also decreases the LDL-C, VLDL-C and it considerably increase the level of HDL-C. A significant decreased expression of GLUT 2 mRNA in the pancreas was recorded in the supplementation of p-CA treated groups. Taken together, these results suggest that p-CA modulates glucose and lipid metabolism via GLUT 2 activation in the pancreatic and has potentially beneficial effects in improving or treating metabolic disorders. Copyright © 2016 Elsevier Masson SAS. All rights reserved.

  10. Prostate Cancer Associated Lipid Signatures in Serum Studied by ESI-Tandem Mass Spectrometryas Potential New Biomarkers.

    PubMed

    Duscharla, Divya; Bhumireddy, Sudarshana Reddy; Lakshetti, Sridhar; Pospisil, Heike; Murthy, P V L N; Walther, Reinhard; Sripadi, Prabhakar; Ummanni, Ramesh

    2016-01-01

    Prostate cancer (PCa) is one amongst the most common cancersin western men. Incidence rate ofPCa is on the rise worldwide. The present study deals with theserum lipidome profiling of patients diagnosed with PCa to identify potential new biomarkers. We employed ESI-MS/MS and GC-MS for identification of significantly altered lipids in cancer patient's serum compared to controls. Lipidomic data revealed 24 lipids are significantly altered in cancer patinet's serum (n = 18) compared to normal (n = 18) with no history of PCa. By using hierarchical clustering and principal component analysis (PCA) we could clearly separate cancer patients from control group. Correlation and partition analysis along with Formal Concept Analysis (FCA) have identified that PC (39:6) and FA (22:3) could classify samples with higher certainty. Both the lipids, PC (39:6) and FA (22:3) could influence the cataloging of patients with 100% sensitivity (all 18 control samples are classified correctly) and 77.7% specificity (of 18 tumor samples 4 samples are misclassified) with p-value of 1.612×10-6 in Fischer's exact test. Further, we performed GC-MS to denote fatty acids altered in PCa patients and found that alpha-linolenic acid (ALA) levels are altered in PCa. We also performed an in vitro proliferation assay to determine the effect of ALA in survival of classical human PCa cell lines LNCaP and PC3. We hereby report that the altered lipids PC (39:6) and FA (22:3) offer a new set of biomarkers in addition to the existing diagnostic tests that could significantly improve sensitivity and specificity in PCa diagnosis.

  11. Prostate Cancer Associated Lipid Signatures in Serum Studied by ESI-Tandem Mass Spectrometryas Potential New Biomarkers

    PubMed Central

    Duscharla, Divya; Bhumireddy, Sudarshana Reddy; Lakshetti, Sridhar; Pospisil, Heike; Murthy, P. V. L. N.; Walther, Reinhard; Sripadi, Prabhakar; Ummanni, Ramesh

    2016-01-01

    Prostate cancer (PCa) is one amongst the most common cancersin western men. Incidence rate ofPCa is on the rise worldwide. The present study deals with theserum lipidome profiling of patients diagnosed with PCa to identify potential new biomarkers. We employed ESI-MS/MS and GC-MS for identification of significantly altered lipids in cancer patient’s serum compared to controls. Lipidomic data revealed 24 lipids are significantly altered in cancer patinet’s serum (n = 18) compared to normal (n = 18) with no history of PCa. By using hierarchical clustering and principal component analysis (PCA) we could clearly separate cancer patients from control group. Correlation and partition analysis along with Formal Concept Analysis (FCA) have identified that PC (39:6) and FA (22:3) could classify samples with higher certainty. Both the lipids, PC (39:6) and FA (22:3) could influence the cataloging of patients with 100% sensitivity (all 18 control samples are classified correctly) and 77.7% specificity (of 18 tumor samples 4 samples are misclassified) with p-value of 1.612×10−6 in Fischer’s exact test. Further, we performed GC-MS to denote fatty acids altered in PCa patients and found that alpha-linolenic acid (ALA) levels are altered in PCa. We also performed an in vitro proliferation assay to determine the effect of ALA in survival of classical human PCa cell lines LNCaP and PC3. We hereby report that the altered lipids PC (39:6) and FA (22:3) offer a new set of biomarkers in addition to the existing diagnostic tests that could significantly improve sensitivity and specificity in PCa diagnosis. PMID:26958841

  12. General Platform for Systematic Quantitative Evaluation of Small-Molecule Permeability in Bacteria

    PubMed Central

    2015-01-01

    The chemical features that impact small-molecule permeability across bacterial membranes are poorly understood, and the resulting lack of tools to predict permeability presents a major obstacle to the discovery and development of novel antibiotics. Antibacterials are known to have vastly different structural and physicochemical properties compared to nonantiinfective drugs, as illustrated herein by principal component analysis (PCA). To understand how these properties influence bacterial permeability, we have developed a systematic approach to evaluate the penetration of diverse compounds into bacteria with distinct cellular envelopes. Intracellular compound accumulation is quantitated using LC-MS/MS, then PCA and Pearson pairwise correlations are used to identify structural and physicochemical parameters that correlate with accumulation. An initial study using 10 sulfonyladenosines in Escherichia coli, Bacillus subtilis, and Mycobacterium smegmatis has identified nonobvious correlations between chemical structure and permeability that differ among the various bacteria. Effects of cotreatment with efflux pump inhibitors were also investigated. This sets the stage for use of this platform in larger prospective analyses of diverse chemotypes to identify global relationships between chemical structure and bacterial permeability that would enable the development of predictive tools to accelerate antibiotic drug discovery. PMID:25198656

  13. Qualitative and quantitative differentiation of gases using ZnO thin film gas sensors and pattern recognition analysis.

    PubMed

    Pati, Sumati; Maity, A; Banerji, P; Majumder, S B

    2014-04-07

    In the present work we have grown highly textured, ultra-thin, nano-crystalline zinc oxide thin films using a metal organic chemical vapor deposition technique and addressed their selectivity towards hydrogen, carbon dioxide and methane gas sensing. Structural and microstructural characteristics of the synthesized films were investigated utilizing X-ray diffraction and electron microscopy techniques respectively. Using a dynamic flow gas sensing measurement set up, the sensing characteristics of these films were investigated as a function of gas concentration (10-1660 ppm) and operating temperature (250-380 °C). ZnO thin film sensing elements were found to be sensitive to all of these gases. Thus at a sensor operating temperature of ~300 °C, the response% of the ZnO thin films were ~68, 59, and 52% for hydrogen, carbon monoxide and methane gases respectively. The data matrices extracted from first Fourier transform analyses (FFT) of the conductance transients were used as input parameters in a linear unsupervised principal component analysis (PCA) pattern recognition technique. We have demonstrated that FFT combined with PCA is an excellent tool for the differentiation of these reducing gases.

  14. An Evaluation of Feature Learning Methods for High Resolution Image Classification

    NASA Astrophysics Data System (ADS)

    Tokarczyk, P.; Montoya, J.; Schindler, K.

    2012-07-01

    Automatic image classification is one of the fundamental problems of remote sensing research. The classification problem is even more challenging in high-resolution images of urban areas, where the objects are small and heterogeneous. Two questions arise, namely which features to extract from the raw sensor data to capture the local radiometry and image structure at each pixel or segment, and which classification method to apply to the feature vectors. While classifiers are nowadays well understood, selecting the right features remains a largely empirical process. Here we concentrate on the features. Several methods are evaluated which allow one to learn suitable features from unlabelled image data by analysing the image statistics. In a comparative study, we evaluate unsupervised feature learning with different linear and non-linear learning methods, including principal component analysis (PCA) and deep belief networks (DBN). We also compare these automatically learned features with popular choices of ad-hoc features including raw intensity values, standard combinations like the NDVI, a few PCA channels, and texture filters. The comparison is done in a unified framework using the same images, the target classes, reference data and a Random Forest classifier.

  15. Two-dimensional statistical linear discriminant analysis for real-time robust vehicle-type recognition

    NASA Astrophysics Data System (ADS)

    Zafar, I.; Edirisinghe, E. A.; Acar, S.; Bez, H. E.

    2007-02-01

    Automatic vehicle Make and Model Recognition (MMR) systems provide useful performance enhancements to vehicle recognitions systems that are solely based on Automatic License Plate Recognition (ALPR) systems. Several car MMR systems have been proposed in literature. However these approaches are based on feature detection algorithms that can perform sub-optimally under adverse lighting and/or occlusion conditions. In this paper we propose a real time, appearance based, car MMR approach using Two Dimensional Linear Discriminant Analysis that is capable of addressing this limitation. We provide experimental results to analyse the proposed algorithm's robustness under varying illumination and occlusions conditions. We have shown that the best performance with the proposed 2D-LDA based car MMR approach is obtained when the eigenvectors of lower significance are ignored. For the given database of 200 car images of 25 different make-model classifications, a best accuracy of 91% was obtained with the 2D-LDA approach. We use a direct Principle Component Analysis (PCA) based approach as a benchmark to compare and contrast the performance of the proposed 2D-LDA approach to car MMR. We conclude that in general the 2D-LDA based algorithm supersedes the performance of the PCA based approach.

  16. Baccharis dracunculifolia-based mouthrinse alters the exopolysaccharide structure in cariogenic biofilms.

    PubMed

    Aires, Carolina P; Sassaki, Guilherme L; Santana-Filho, Arquimedes P; Spadaro, Augusto C C; Cury, Jaime A

    2016-03-01

    Baccharis dracunculifolia is a native plant from Brazil with antimicrobial activity. The purpose of this study was to investigate whether a B. dracunculifolia-based mouthrinse (Bd) changes the structure of insoluble exopolysaccharides (IEPS) in Streptococcus mutans UA159 cariogenic biofilm. Biofilms were grown on glass slides and treated with Bd, its vehicle (VC), chlorhexidine digluconate (CHX), or saline solution (NaCl). Among the treatments, only CHX significantly reduced the biofilm biomass and bacterial viability (p<0.05). Gas chromatography-mass spectrometry and nuclear magnetic resonance analyses revealed that IEPS from the four biofilm samples were α- glucans containing different proportions of (1→6) and (1→3) glycosidic linkages. The structural differences among the four IEPS were compared by principal component analysis (PCA). PCA analysis indicated that IEPS from VC- and NaCl-treated biofilms were structurally similar to each other. Compared with the control, IEPS from Bd- and CHX-treated biofilms were structurally different and had distinct chemical profiles. In summary, the fact that Bd changed the IEPS chemical composition indicates that this mouthrinse may affect the cariogenic properties of the S. mutans biofilm formed. Copyright © 2015. Published by Elsevier B.V.

  17. Development of methodology for identification the nature of the polyphenolic extracts by FTIR associated with multivariate analysis.

    PubMed

    Grasel, Fábio dos Santos; Ferrão, Marco Flôres; Wolf, Carlos Rodolfo

    2016-01-15

    Tannins are polyphenolic compounds of complex structures formed by secondary metabolism in several plants. These polyphenolic compounds have different applications, such as drugs, anti-corrosion agents, flocculants, and tanning agents. This study analyses six different type of polyphenolic extracts by Fourier transform infrared spectroscopy (FTIR) combined with multivariate analysis. Through both principal component analysis (PCA) and hierarchical cluster analysis (HCA), we observed well-defined separation between condensed (quebracho and black wattle) and hydrolysable (valonea, chestnut, myrobalan, and tara) tannins. For hydrolysable tannins, it was also possible to observe the formation of two different subgroups between samples of chestnut and valonea and between samples of tara and myrobalan. Among all samples analysed, the chestnut and valonea showed the greatest similarity, indicating that these extracts contain equivalent chemical compositions and structure and, therefore, similar properties. Copyright © 2015 Elsevier B.V. All rights reserved.

  18. A chemometric approach to the characterisation of historical mortars

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rampazzi, L.; Pozzi, A.; Sansonetti, A.

    2006-06-15

    The compositional knowledge of historical mortars is of great concern in case of provenance and dating investigations and of conservation works since the nature of the raw materials suggests the most compatible conservation products. The classic characterisation usually goes through various analytical determinations, while conservation laboratories call for simple and quick analyses able to enlighten the nature of mortars, usually in terms of the binder fraction. A chemometric approach to the matter is here undertaken. Specimens of mortars were prepared with calcitic and dolomitic binders and analysed by Atomic Spectroscopy. Principal Components Analysis (PCA) was used to investigate the featuresmore » of specimens and samples. A Partial Least Square (PLS1) regression was done in order to predict the binder/aggregate ratio. The model was applied to historical mortars from the churches of St. Lorenzo (Milan) and St. Abbondio (Como). The accordance between the predictive model and the real samples is discussed.« less

  19. Metabolomics combined with chemometric tools (PCA, HCA, PLS-DA and SVM) for screening cassava (Manihot esculenta Crantz) roots during postharvest physiological deterioration.

    PubMed

    Uarrota, Virgílio Gavicho; Moresco, Rodolfo; Coelho, Bianca; Nunes, Eduardo da Costa; Peruch, Luiz Augusto Martins; Neubert, Enilto de Oliveira; Rocha, Miguel; Maraschin, Marcelo

    2014-10-15

    Cassava roots are an important source of dietary and industrial carbohydrates and suffer markedly from postharvest physiological deterioration (PPD). This paper deals with metabolomics combined with chemometric tools for screening the chemical and enzymatic composition in several genotypes of cassava roots during PPD. Metabolome analyses showed increases in carotenoids, flavonoids, anthocyanins, phenolics, reactive scavenging species, and enzymes (superoxide dismutase family, hydrogen peroxide, and catalase) until 3-5days postharvest. PPD correlated negatively with phenolics and carotenoids and positively with anthocyanins and flavonoids. Chemometric tools such as principal component analysis, partial least squares discriminant analysis, and support vector machines discriminated well cassava samples and enabled a good prediction of samples. Hierarchical clustering analyses grouped samples according to their levels of PPD and chemical compositions. Copyright © 2014 Elsevier Ltd. All rights reserved.

  20. Solid-phase extraction versus matrix solid-phase dispersion: Application to white grapes.

    PubMed

    Dopico-García, M S; Valentão, P; Jagodziñska, A; Klepczyñska, J; Guerra, L; Andrade, P B; Seabra, R M

    2007-11-15

    The use of matrix solid-phase dispersion (MSPD) was tested to, separately, extract phenolic compounds and organic acids from white grapes. This method was compared with a more conventional analytical method previously developed that combines solid liquid extraction (SL) to simultaneously extract phenolic compounds and organic acids followed by a solid-phase extraction (SPE) to separate the two types of compounds. Although the results were qualitatively similar for both techniques, the levels of extracted compounds were in general quite lower on using MSPD, especially for organic acids. Therefore, SL-SPE method was preferred to analyse white "Vinho Verde" grapes. Twenty samples of 10 different varieties (Alvarinho, Avesso, Asal-Branco, Batoca, Douradinha, Esganoso de Castelo Paiva, Loureiro, Pedernã, Rabigato and Trajadura) from four different locations in Minho (Portugal) were analysed in order to study the effects of variety and origin on the profile of the above mentioned compounds. Principal component analysis (PCA) was applied separately to establish the main sources of variability present in the data sets for phenolic compounds, organic acids and for the global data. PCA of phenolic compounds accounted for the highest variability (77.9%) with two PCs, enabling characterization of the varieties of samples according to their higher content in flavonol derivatives or epicatechin. Additionally, a strong effect of sample origin was observed. Stepwise linear discriminant analysis (SLDA) was used for differentiation of grapes according to the origin and variety, resulting in a correct classification of 100 and 70%, respectively.

  1. Biomechanical metrics of aesthetic perception in dance.

    PubMed

    Bronner, Shaw; Shippen, James

    2015-12-01

    The brain may be tuned to evaluate aesthetic perception through perceptual chunking when we observe the grace of the dancer. We modelled biomechanical metrics to explain biological determinants of aesthetic perception in dance. Eighteen expert (EXP) and intermediate (INT) dancers performed développé arabesque in three conditions: (1) slow tempo, (2) slow tempo with relevé, and (3) fast tempo. To compare biomechanical metrics of kinematic data, we calculated intra-excursion variability, principal component analysis (PCA), and dimensionless jerk for the gesture limb. Observers, all trained dancers, viewed motion capture stick figures of the trials and ranked each for aesthetic (1) proficiency and (2) movement smoothness. Statistical analyses included group by condition repeated-measures ANOVA for metric data; Mann-Whitney U rank and Friedman's rank tests for nonparametric rank data; Spearman's rho correlations to compare aesthetic rankings and metrics; and linear regression to examine which metric best quantified observers' aesthetic rankings, p < 0.05. The goodness of fit of the proposed models was determined using Akaike information criteria. Aesthetic proficiency and smoothness rankings of the dance movements revealed differences between groups and condition, p < 0.0001. EXP dancers were rated more aesthetically proficient than INT dancers. The slow and fast conditions were judged more aesthetically proficient than slow with relevé (p < 0.0001). Of the metrics, PCA best captured the differences due to group and condition. PCA also provided the most parsimonious model to explain aesthetic proficiency and smoothness rankings. By permitting organization of large data sets into simpler groupings, PCA may mirror the phenomenon of chunking in which the brain combines sensory motor elements into integrated units of behaviour. In this representation, the chunk of information which is remembered, and to which the observer reacts, is the elemental mode shape of the motion rather than physical displacements. This suggests that reduction in redundant information to a simplistic dimensionality is related to the experienced observer's aesthetic perception.

  2. Using the prostate imaging reporting and data system version 2 (PI-RIDS v2) to detect prostate cancer can prevent unnecessary biopsies and invasive treatment.

    PubMed

    Liu, Chang; Liu, Shi-Liang; Wang, Zhi-Xian; Yu, Kai; Feng, Chun-Xiang; Ke, Zan; Wang, Liang; Zeng, Xiao-Yong

    2018-04-13

    Prostate cancer (PCa) is one of the most common cancers among men globally. The authors aimed to evaluate the ability of the Prostate Imaging Reporting and Data System version 2 (PI-RADS v2) to classify men with PCa, clinically significant PCa (CSPCa), or no PCa, especially among those with serum total prostate-specific antigen (tPSA) levels in the "gray zone" (4-10 ng ml -1 ). A total of 308 patients (355 lesions) were enrolled in this study. Diagnostic efficiency was determined. Univariate and multivariate analyses, receiver operating characteristic curve analysis, and decision curve analysis were performed to determine and compare the predictors of PCa and CSPCa. The results suggested that PI-RADS v2, tPSA, and prostate-specific antigen density (PSAD) were independent predictors of PCa and CSPCa. A PI-RADS v2 score ≥4 provided high negative predictive values (91.39% for PCa and 95.69% for CSPCa). A model of PI-RADS combined with PSA and PSAD helped to define a high-risk group (PI-RADS score = 5 and PSAD ≥0.15 ng ml -1 cm -3 , with tPSA in the gray zone, or PI-RADS score ≥4 with high tPSA level) with a detection rate of 96.1% for PCa and 93.0% for CSPCa while a low-risk group with a detection rate of 6.1% for PCa and 2.2% for CSPCa. It was concluded that the PI-RADS v2 could be used as a reliable and independent predictor of PCa and CSPCa. The combination of PI-RADS v2 score with PSA and PSAD could be helpful in the prediction and diagnosis of PCa and CSPCa and, thus, may help in preventing unnecessary invasive procedures.

  3. Principal component analysis of phenolic acid spectra

    USDA-ARS?s Scientific Manuscript database

    Phenolic acids are common plant metabolites that exhibit bioactive properties and have applications in functional food and animal feed formulations. The ultraviolet (UV) and infrared (IR) spectra of four closely related phenolic acid structures were evaluated by principal component analysis (PCA) to...

  4. Metabolomics and transcriptomics profiles reveal the dysregulation of the tricarboxylic acid cycle and related mechanisms in prostate cancer.

    PubMed

    Shao, Yaping; Ye, Guozhu; Ren, Shancheng; Piao, Hai-Long; Zhao, Xinjie; Lu, Xin; Wang, Fubo; Ma, Wang; Li, Jia; Yin, Peiyuan; Xia, Tian; Xu, Chuanliang; Yu, Jane J; Sun, Yinghao; Xu, Guowang

    2018-07-15

    Genetic alterations drive metabolic reprograming to meet increased biosynthetic precursor and energy demands for cancer cell proliferation and survival in unfavorable environments. A systematic study of gene-metabolite regulatory networks and metabolic dysregulation should reveal the molecular mechanisms underlying prostate cancer (PCa) pathogenesis. Herein, we performed gas chromatography-mass spectrometry (GC-MS)-based metabolomics and RNA-seq analyses in prostate tumors and matched adjacent normal tissues (ANTs) to elucidate the molecular alterations and potential underlying regulatory mechanisms in PCa. Significant accumulation of metabolic intermediates and enrichment of genes in the tricarboxylic acid (TCA) cycle were observed in tumor tissues, indicating TCA cycle hyperactivation in PCa tissues. In addition, the levels of fumarate and malate were highly correlated with the Gleason score, tumor stage and expression of genes encoding related enzymes and were significantly related to the expression of genes involved in branched chain amino acid degradation. Using an integrated omics approach, we further revealed the potential anaplerotic routes from pyruvate, glutamine catabolism and branched chain amino acid (BCAA) degradation contributing to replenishing metabolites for TCA cycle. Integrated omics techniques enable the performance of network-based analyses to gain a comprehensive and in-depth understanding of PCa pathophysiology and may facilitate the development of new and effective therapeutic strategies. © 2018 UICC.

  5. Method of Real-Time Principal-Component Analysis

    NASA Technical Reports Server (NTRS)

    Duong, Tuan; Duong, Vu

    2005-01-01

    Dominant-element-based gradient descent and dynamic initial learning rate (DOGEDYN) is a method of sequential principal-component analysis (PCA) that is well suited for such applications as data compression and extraction of features from sets of data. In comparison with a prior method of gradient-descent-based sequential PCA, this method offers a greater rate of learning convergence. Like the prior method, DOGEDYN can be implemented in software. However, the main advantage of DOGEDYN over the prior method lies in the facts that it requires less computation and can be implemented in simpler hardware. It should be possible to implement DOGEDYN in compact, low-power, very-large-scale integrated (VLSI) circuitry that could process data in real time.

  6. Study of ionospheric anomalies due to impact of typhoon using Principal Component Analysis and image processing

    NASA Astrophysics Data System (ADS)

    LIN, JYH-WOEI

    2012-08-01

    Principal Component Analysis (PCA) and image processing are used to determine Total Electron Content (TEC) anomalies in the F-layer of the ionosphere relating to Typhoon Nakri for 29 May, 2008 (UTC). PCA and image processing are applied to the global ionospheric map (GIM) with transforms conducted for the time period 12:00-14:00 UT on 29 May, 2008 when the wind was most intense. Results show that at a height of approximately 150-200 km the TEC anomaly is highly localized; however, it becomes more intense and widespread with height. Potential causes of these results are discussed with emphasis given to acoustic gravity waves caused by wind force.

  7. An inter-comparison of PM10 source apportionment using PCA and PMF receptor models in three European sites.

    PubMed

    Cesari, Daniela; Amato, F; Pandolfi, M; Alastuey, A; Querol, X; Contini, D

    2016-08-01

    Source apportionment of aerosol is an important approach to investigate aerosol formation and transformation processes as well as to assess appropriate mitigation strategies and to investigate causes of non-compliance with air quality standards (Directive 2008/50/CE). Receptor models (RMs) based on chemical composition of aerosol measured at specific sites are a useful, and widely used, tool to perform source apportionment. However, an analysis of available studies in the scientific literature reveals heterogeneities in the approaches used, in terms of "working variables" such as the number of samples in the dataset and the number of chemical species used as well as in the modeling tools used. In this work, an inter-comparison of PM10 source apportionment results obtained at three European measurement sites is presented, using two receptor models: principal component analysis coupled with multi-linear regression analysis (PCA-MLRA) and positive matrix factorization (PMF). The inter-comparison focuses on source identification, quantification of source contribution to PM10, robustness of the results, and how these are influenced by the number of chemical species available in the datasets. Results show very similar component/factor profiles identified by PCA and PMF, with some discrepancies in the number of factors. The PMF model appears to be more suitable to separate secondary sulfate and secondary nitrate with respect to PCA at least in the datasets analyzed. Further, some difficulties have been observed with PCA in separating industrial and heavy oil combustion contributions. Commonly at all sites, the crustal contributions found with PCA were larger than those found with PMF, and the secondary inorganic aerosol contributions found by PCA were lower than those found by PMF. Site-dependent differences were also observed for traffic and marine contributions. The inter-comparison of source apportionment performed on complete datasets (using the full range of available chemical species) and incomplete datasets (with reduced number of chemical species) allowed to investigate the sensitivity of source apportionment (SA) results to the working variables used in the RMs. Results show that, at both sites, the profiles and the contributions of the different sources calculated with PMF are comparable within the estimated uncertainties indicating a good stability and robustness of PMF results. In contrast, PCA outputs are more sensitive to the chemical species present in the datasets. In PCA, the crustal contributions are higher in the incomplete datasets and the traffic contributions are significantly lower for incomplete datasets.

  8. Differentiating Organic and Conventional Sage by Chromatographic and Mass Spectrometry Flow-Injection Fingerprints Combined with Principal Component Analysis

    PubMed Central

    Gao, Boyan; Lu, Yingjian; Sheng, Yi; Chen, Pei; Yu, Liangli (Lucy)

    2013-01-01

    High performance liquid chromatography (HPLC) and flow injection electrospray ionization with ion trap mass spectrometry (FIMS) fingerprints combined with the principal component analysis (PCA) were examined for their potential in differentiating commercial organic and conventional sage samples. The individual components in the sage samples were also characterized with an ultra-performance liquid chromatography with a quadrupole-time of flight mass spectrometer (UPLC Q-TOF MS). The results suggested that both HPLC and FIMS fingerprints combined with PCA could differentiate organic and conventional sage samples effectively. FIMS may serve as a quick test capable of distinguishing organic and conventional sages in 1 min, and could potentially be developed for high-throughput applications; whereas HPLC fingerprints could provide more chemical composition information with a longer analytical time. PMID:23464755

  9. Fault Detection of Bearing Systems through EEMD and Optimization Algorithm

    PubMed Central

    Lee, Dong-Han; Ahn, Jong-Hyo; Koh, Bong-Hwan

    2017-01-01

    This study proposes a fault detection and diagnosis method for bearing systems using ensemble empirical mode decomposition (EEMD) based feature extraction, in conjunction with particle swarm optimization (PSO), principal component analysis (PCA), and Isomap. First, a mathematical model is assumed to generate vibration signals from damaged bearing components, such as the inner-race, outer-race, and rolling elements. The process of decomposing vibration signals into intrinsic mode functions (IMFs) and extracting statistical features is introduced to develop a damage-sensitive parameter vector. Finally, PCA and Isomap algorithm are used to classify and visualize this parameter vector, to separate damage characteristics from healthy bearing components. Moreover, the PSO-based optimization algorithm improves the classification performance by selecting proper weightings for the parameter vector, to maximize the visualization effect of separating and grouping of parameter vectors in three-dimensional space. PMID:29143772

  10. The variance needed to accurately describe jump height from vertical ground reaction force data.

    PubMed

    Richter, Chris; McGuinness, Kevin; O'Connor, Noel E; Moran, Kieran

    2014-12-01

    In functional principal component analysis (fPCA) a threshold is chosen to define the number of retained principal components, which corresponds to the amount of preserved information. A variety of thresholds have been used in previous studies and the chosen threshold is often not evaluated. The aim of this study is to identify the optimal threshold that preserves the information needed to describe a jump height accurately utilizing vertical ground reaction force (vGRF) curves. To find an optimal threshold, a neural network was used to predict jump height from vGRF curve measures generated using different fPCA thresholds. The findings indicate that a threshold from 99% to 99.9% (6-11 principal components) is optimal for describing jump height, as these thresholds generated significantly lower jump height prediction errors than other thresholds.

  11. The Application of Infrared Thermographic Inspection Techniques to the Space Shuttle Thermal Protection System

    NASA Technical Reports Server (NTRS)

    Cramer, K. E.; Winfree, W. P.

    2005-01-01

    The Nondestructive Evaluation Sciences Branch at NASA s Langley Research Center has been actively involved in the development of thermographic inspection techniques for more than 15 years. Since the Space Shuttle Columbia accident, NASA has focused on the improvement of advanced NDE techniques for the Reinforced Carbon-Carbon (RCC) panels that comprise the orbiter s wing leading edge. Various nondestructive inspection techniques have been used in the examination of the RCC, but thermography has emerged as an effective inspection alternative to more traditional methods. Thermography is a non-contact inspection method as compared to ultrasonic techniques which typically require the use of a coupling medium between the transducer and material. Like radiographic techniques, thermography can be used to inspect large areas, but has the advantage of minimal safety concerns and the ability for single-sided measurements. Principal Component Analysis (PCA) has been shown effective for reducing thermographic NDE data. A typical implementation of PCA is when the eigenvectors are generated from the data set being analyzed. Although it is a powerful tool for enhancing the visibility of defects in thermal data, PCA can be computationally intense and time consuming when applied to the large data sets typical in thermography. Additionally, PCA can experience problems when very large defects are present (defects that dominate the field-of-view), since the calculation of the eigenvectors is now governed by the presence of the defect, not the "good" material. To increase the processing speed and to minimize the negative effects of large defects, an alternative method of PCA is being pursued where a fixed set of eigenvectors, generated from an analytic model of the thermal response of the material under examination, is used to process the thermal data from the RCC materials. Details of a one-dimensional analytic model and a two-dimensional finite-element model will be presented. An overview of the PCA process as well as a quantitative signal-to-noise comparison of the results of performing both embodiments of PCA on thermographic data from various RCC specimens will be shown. Finally, a number of different applications of this technology to various RCC components will be presented.

  12. Model Reduction via Principe Component Analysis and Markov Chain Monte Carlo (MCMC) Methods

    NASA Astrophysics Data System (ADS)

    Gong, R.; Chen, J.; Hoversten, M. G.; Luo, J.

    2011-12-01

    Geophysical and hydrogeological inverse problems often include a large number of unknown parameters, ranging from hundreds to millions, depending on parameterization and problems undertaking. This makes inverse estimation and uncertainty quantification very challenging, especially for those problems in two- or three-dimensional spatial domains. Model reduction technique has the potential of mitigating the curse of dimensionality by reducing total numbers of unknowns while describing the complex subsurface systems adequately. In this study, we explore the use of principal component analysis (PCA) and Markov chain Monte Carlo (MCMC) sampling methods for model reduction through the use of synthetic datasets. We compare the performances of three different but closely related model reduction approaches: (1) PCA methods with geometric sampling (referred to as 'Method 1'), (2) PCA methods with MCMC sampling (referred to as 'Method 2'), and (3) PCA methods with MCMC sampling and inclusion of random effects (referred to as 'Method 3'). We consider a simple convolution model with five unknown parameters as our goal is to understand and visualize the advantages and disadvantages of each method by comparing their inversion results with the corresponding analytical solutions. We generated synthetic data with noise added and invert them under two different situations: (1) the noised data and the covariance matrix for PCA analysis are consistent (referred to as the unbiased case), and (2) the noise data and the covariance matrix are inconsistent (referred to as biased case). In the unbiased case, comparison between the analytical solutions and the inversion results show that all three methods provide good estimates of the true values and Method 1 is computationally more efficient. In terms of uncertainty quantification, Method 1 performs poorly because of relatively small number of samples obtained, Method 2 performs best, and Method 3 overestimates uncertainty due to inclusion of random effects. However, in the biased case, only Method 3 correctly estimates all the unknown parameters, and both Methods 1 and 2 provide wrong values for the biased parameters. The synthetic case study demonstrates that if the covariance matrix for PCA analysis is inconsistent with true models, the PCA methods with geometric or MCMC sampling will provide incorrect estimates.

  13. Non-destructive analysis of the conformational differences among feedstock sources and their corresponding co-products from bioethanol production with molecular spectroscopy.

    PubMed

    Gamage, I H; Jonker, A; Zhang, X; Yu, P

    2014-01-24

    The objective of this study was to determine the possibility of using molecular spectroscopy with multivariate technique as a fast method to detect the source effects among original feedstock sources of wheat and their corresponding co-products, wheat DDGS, from bioethanol production. Different sources of the bioethanol feedstock and their corresponding bioethanol co-products, three samples per source, were collected from the same newly-built bioethanol plant with current bioethanol processing technology. Multivariate molecular spectral analyses were carried out using agglomerative hierarchical cluster analysis (AHCA) and principal component analysis (PCA). The molecular spectral data of different feedstock sources and their corresponding co-products were compared at four different regions of ca. 1800-1725 cm(-1) (carbonyl CO ester, mainly related to lipid structure conformation), ca. 1725-1482 cm(-1) (amide I and amide II region mainly related to protein structure conformation), ca. 1482-1180 cm(-1) (mainly associated with structural carbohydrate) and ca. 1180-800 cm(-1) (mainly related to carbohydrates) in complex plant-based system. The results showed that the molecular spectroscopy with multivariate technique could reveal the structural differences among the bioethanol feedstock sources and among their corresponding co-products. The AHCA and PCA analyses were able to distinguish the molecular structure differences associated with chemical functional groups among the different sources of the feedstock and their corresponding co-products. The molecular spectral differences indicated the differences in functional, biomolecular and biopolymer groups which were confirmed by wet chemical analysis. These biomolecular and biopolymer structural differences were associated with chemical and nutrient profiles and nutrient utilization and availability. Molecular spectral analyses had the potential to identify molecular structure difference among bioethanol feedstock sources and their corresponding co-products. Copyright © 2013 Elsevier B.V. All rights reserved.

  14. Non-destructive analysis of the conformational differences among feedstock sources and their corresponding co-products from bioethanol production with molecular spectroscopy

    NASA Astrophysics Data System (ADS)

    Gamage, I. H.; Jonker, A.; Zhang, X.; Yu, P.

    2014-01-01

    The objective of this study was to determine the possibility of using molecular spectroscopy with multivariate technique as a fast method to detect the source effects among original feedstock sources of wheat and their corresponding co-products, wheat DDGS, from bioethanol production. Different sources of the bioethanol feedstock and their corresponding bioethanol co-products, three samples per source, were collected from the same newly-built bioethanol plant with current bioethanol processing technology. Multivariate molecular spectral analyses were carried out using agglomerative hierarchical cluster analysis (AHCA) and principal component analysis (PCA). The molecular spectral data of different feedstock sources and their corresponding co-products were compared at four different regions of ca. 1800-1725 cm-1 (carbonyl Cdbnd O ester, mainly related to lipid structure conformation), ca. 1725-1482 cm-1 (amide I and amide II region mainly related to protein structure conformation), ca. 1482-1180 cm-1 (mainly associated with structural carbohydrate) and ca. 1180-800 cm-1 (mainly related to carbohydrates) in complex plant-based system. The results showed that the molecular spectroscopy with multivariate technique could reveal the structural differences among the bioethanol feedstock sources and among their corresponding co-products. The AHCA and PCA analyses were able to distinguish the molecular structure differences associated with chemical functional groups among the different sources of the feedstock and their corresponding co-products. The molecular spectral differences indicated the differences in functional, biomolecular and biopolymer groups which were confirmed by wet chemical analysis. These biomolecular and biopolymer structural differences were associated with chemical and nutrient profiles and nutrient utilization and availability. Molecular spectral analyses had the potential to identify molecular structure difference among bioethanol feedstock sources and their corresponding co-products.

  15. Analyzing the Association of Polymorphisms in the CRYBB2 Gene with Prostate Cancer Risk in African Americans

    PubMed Central

    FARUQUE, MEZBAH U.; PAUL, RABINDRA; RICKS-SANTI, LUISEL; JINGWI, EMMANUEL Y.; AHAGHOTU, CHILEDUM A.; DUNSTON, GEORGIA M.

    2016-01-01

    Background/Aim Prostate cancer (PCa) shows disproportionately higher incidence and disease-associated mortality in African Americans. The human crystallin beta B2 (CRYBB2) gene has been reported as one tumor signature gene differentially expressed between African American and European American cancer patients. We investigated the role of CRYBB2 genetic variants in PCa in African Americans. Materials and Methods Subjects comprised of 233 PCa cases and 294 controls. Nine haplotype-tagged single nucleotide polymorphisms (SNPs) in and around the CRYBB2 gene were genotyped by pyrosequencing. Association analyses were performed for PCa with adjustment for age and prostate-specific antigen (PSA), under an additive genetic model. Results Out of the nine SNPs examined, rs9608380 was found to be nominally associated with PCa (odds ratio (OR)=2.619 (95% confidence interval (CI)=1.156–5.935), p=0.021). rs9306412 was in strong linkage disequilibrium with rs9608380 that showed an association p-value of 0.077. Using ENCODE data, we found rs9608380 mapped to a region annotated with regulatory motifs, such as DNase hypersensitive sites and histone modifications. Conclusion This is the first study to analyze the association between genetic variations in the CRYBB2 gene with PCa. rs9608380, associated with PCa, is a potentially functional variant. PMID:25964531

  16. Genetic variations in genes involved in testosterone metabolism are associated with prostate cancer progression: A Spanish multicenter study.

    PubMed

    Henríquez-Hernández, Luis Alberto; Valenciano, Almudena; Foro-Arnalot, Palmira; Álvarez-Cubero, María Jesús; Cozar, José Manuel; Suárez-Novo, José Francisco; Castells-Esteve, Manel; Fernández-Gonzalo, Pablo; De-Paula-Carranza, Belén; Ferrer, Montse; Guedea, Ferrán; Sancho-Pardo, Gemma; Craven-Bartle, Jordi; Ortiz-Gordillo, María José; Cabrera-Roldán, Patricia; Rodríguez-Melcón, Juan Ignacio; Herrera-Ramos, Estefanía; Rodríguez-Gallego, Carlos; Lara, Pedro C

    2015-07-01

    Prostate cancer (PCa) is an androgen-dependent disease. Nonetheless, the role of single nucleotide polymorphisms (SNPs) in genes encoding androgen metabolism remains an unexplored area. To investigate the role of germline variations in cytochrome P450 17A1 (CYP17A1) and steroid-5α-reductase, α-polypeptides 1 and 2 (SRD5A1 and SRD5A2) genes in PCa. In total, 494 consecutive Spanish patients diagnosed with nonmetastatic localized PCa were included in this multicenter study and were genotyped for 32 SNPs in SRD5A1, SRD5A2, and CYP17A1 genes using a Biotrove OpenArray NT Cycler. Clinical data were available. Genotypic and allelic frequencies, as well as haplotype analyses, were determined using the web-based environment SNPator. All additional statistical analyses comparing clinical data and SNPs were performed using PASW Statistics 15. The call rate obtained (determined as the percentage of successful determinations) was 97.3% of detection. A total of 2 SNPs in SRD5A1-rs3822430 and rs1691053-were associated with prostate-specific antigen level at diagnosis. Moreover, G carriers for both SNPs were at higher risk of presenting initial prostate-specific antigen levels>20ng/ml (Exp(B) = 2.812, 95% CI: 1.397-5.657, P = 0.004) than those who are AA-AA carriers. Haplotype analyses showed that patients with PCa nonhomozygous for the haplotype GCTTGTAGTA were at an elevated risk of presenting bigger clinical tumor size (Exp(B) = 3.823, 95% CI: 1.280-11.416, P = 0.016), and higher Gleason score (Exp(B) = 2.808, 95% CI: 1.134-6.953, P = 0.026). SNPs in SRD5A1 seem to affect the clinical characteristics of Spanish patients with PCa. Copyright © 2015 Elsevier Inc. All rights reserved.

  17. Petrologic and oxygen isotopic study of ALH 85085-like chondrites

    NASA Astrophysics Data System (ADS)

    Prinz, M.; Weisberg, M. K.; Clayton, R. N.; Mayeda, T. K.; Ebihara, M.

    1994-07-01

    Four meteorites (PAT 91546, PCA 91328, PCA 91452, PCA 91467) petrologically similar to ALH 85085 chondrite have now been found. Previous studies of ALH 85085 showed it be a new kind of CR-related microchondrule-bearing chondrite, although one called it a sub-chondrite. The purpose of this study is to learn more about ALH 85085-like meteorites and their relationship to CR and CR-related (LEW 85332, Acfer 182, Bencubbin) chondrites. The methods used included petrology, INA bulk chemical analysis (PAT 91546, PCA 91467), and O isotopic analyses of the whole rocks and separated chondrules and dark inclusions (DIs) from PAT 91546. Since microchondrules and fragments are approximately 20 microns it was necessary to analyze composite samples for O; one was of approximately 100 chondrules, and another was of 5 DIs. Petrologically, the four meteorites are similar to ALH 85085, and there is no basis for determining if all of them, or any combinations, are paired. Mineralogically, olivine and pyroxene are highly magnesian FeNi metal generally has 3-10% Ni, and has a positive Ni-Co correlation similar to that in CR and CR-related chondrites. Refractory inclusions are similar in size to the chondrules and have the following assemblages: (1) hibonite-perovskite, (2) melilite-fassaite-forsterite, (3) grossite (Ca-dialuminate)-melilite-perovskite, (4) spinel-melilite, and (5) spinel-pyroxene aggregates. Chemically, INA analyses indicate that PAT 91546 and PCA 91467 are generally similar to ALH 85085. Oxygen isotopic analyses of the four whole-rock compositions fall along the CR mixing line as does ALH 85085; they are also close to LEW 85332, Acfer 182, and Bencubbin. This supports the concept that these are all CR-related chondrites. Even stronger support is found in the compositions of the chondrules and DIs in PAT 91546, which also plot on or near the CR line.

  18. Descriptive sensory analysis in different classes of orange juice by a robust free-choice profile method.

    PubMed

    Pérez Aparicio, Jesús; Toledano Medina, M Angeles; Lafuente Rosales, Victoria

    2007-07-09

    Free-choice profile (FCP), developed in the 1980s, is a sensory analysis method that can be carried out by untrained panels. The participants need only to be able to use a scale and be consumers of the product under evaluation. The data are analysed by sophisticated statistical methodologies like Generalized Procrustean Analysis (GPA) or STATIS. To facilitate a wider use of the free-choice profiling procedure, different authors have advocated simpler methods based on principal components analysis (PCA) of merged data sets. The purpose of this work was to apply another easy procedure to this type of data by means of a robust PCA. The most important characteristic of the proposed method is that quality responsible managers could use this methodology without any scale evaluation. Only the free terms generated by the assessors are necessary to apply the script, thus avoiding the error associated with scale utilization by inexpert assessors. Also, it is possible to use the application with missing data and with differences in the assessors' attendance at sessions. An example was performed to generate the descriptors from different orange juice types. The results were compared with the STATIS method and with the PCA on the merged data sets. The samples evaluated were fresh orange juices with differences in storage days and pasteurized, concentrated and orange nectar drinks from different brands. Eighteen assessors with a low-level training program were used in a six-session free-choice profile framework. The results proved that this script could be of use in marketing decisions and product quality program development.

  19. Advances in Raman spectroscopy for the diagnosis of Alzheimer's disease

    NASA Astrophysics Data System (ADS)

    Sudworth, Caroline D.; Archer, John K. J.; Black, Richard A.; Mann, David

    2006-02-01

    Within the next 50 years Alzheimer's disease is expected to affect 100 million people worldwide. The progressive decline in the mental health of the patient is caused by severe brain atrophy generated by the breakdown and aggregation of proteins, resulting in β-amyloid plaques and neurofibrillary tangles. The greatest challenge to Alzheimer's disease lies in the pursuit of an early and definitive diagnosis, in order that suitable treatment can be administered. At the present time, definitive diagnosis is restricted to post-mortem examination. Alzheimer's disease also remains without a long-term cure. This research demonstrates the potential role of Raman spectroscopy, combined with principle components analysis (PCA), as a diagnostic method. Analyses of ethically approved ex vivo post-mortem brain tissues (originating from frontal and occipital lobes) from control (3 normal elderly subjects and 3 Huntingdon's disease subjects) and Alzheimer's disease (12 subjects) brain sections, and a further set of 12 blinded samples are presented. Spectra originating from these tissues are highly reproducible, and initial results indicate a vital difference in protein content and conformation, relating to the abnormally high levels of aggregated proteins in the diseased tissues. Further examination of these spectra using PCA allows for the separation of control from diseased tissues. The validation of the PCA models using blinded samples also displays promise for the identification of Alzheimer's disease, in conjunction with secondary information regarding other brain diseases and dementias. These results provide a route for Raman spectroscopy as a possible non-invasive, non-destructive tool for the early diagnosis of Alzheimer's disease.

  20. Remote sensing detection of gold related alteration zones in Um Rus area, Central Eastern Desert of Egypt

    NASA Astrophysics Data System (ADS)

    Amer, Reda; Kusky, Timothy; El Mezayen, Ahmed

    2012-01-01

    Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER) and Phased Array L-band Synthetic Aperture Radar (PALSAR) images covering the Um Rus area in the Central Eastern Desert of Egypt were evaluated for mapping geologic structure, lithology, and gold-related alteration zones. The study area is covered by Pan-African basement rocks including gabbro and granodiorite intruded into a variable mixture of metavolcanics and metasediments. The first three principal component analyses (PCA1, PCA2, PCA3) in a Red-Green-Blue (RGB) of the visible through shortwave-infrared (VNIR + SWIR) ASTER bands enabled the discrimination between lithological units. The results show that ASTER band ratios ((2 + 4)/3, (5 + 7)/6, (7 + 9)/8) in RGB identifies the lithological units and discriminates the granodiorite very well from the adjacent rock units.The granodiorites are dissected by gold-bearing quartz veins surrounded by alteration zones. The microscopic examination of samples collected from the alteration zones shows sericitic and argillic alteration zones. The Spectral Angle Mapper (SAM) and Spectral Information Divergence (SID) supervised classification methods were applied using the reference spectra of the USGS spectral library. The results show that these classification methods are capable of mapping the alteration zones as indicated by field verification work. The PALSAR image was enhanced for fracture mapping using the second moment co-occurrence filter. Overlying extracted faults and alteration zone classification images show that the N30E and N-S fractures represent potential zones for gold exploration. It is concluded that the proposed methods can be used as a powerful tool for ore deposit exploration.

  1. Apo adenylate kinase encodes its holo form: a principal component and varimax analysis.

    PubMed

    Cukier, Robert I

    2009-02-12

    Adenylate kinase undergoes large-scale motions of its LID and AMP-binding (AMPbd) domains when its apo, open form closes over its substrates, AMP and Mg2+-ATP. It may be an example of an enzyme that provides an ensemble of conformations in its apo state from which its substrates can select and bind to produce catalytically competent conformations. In this work, the fluctuations of the enzyme apo Escherichia coli adenylate kinase (AKE) are obtained with molecular dynamics. The resulting trajectory is analyzed with principal component analysis (PCA) that decomposes the atom motions into orthogonal modes ordered by their decreasing contributions to the total protein fluctuation. In apo AKE, a small set of the PCA modes describes the bulk of the fluctuations. Identification of the atom motions that are important contributors to these modes is improved with the use of a varimax rotation method that rotates the PCA modes to a new mode set that concentrates the atom contributions to a smaller set of atoms in these new modes. In this way, the nature of the important motions of the LID and AMPbd domains are clarified. The dominant PCA modes are used to investigate if apo AKE can fluctuate to conformations that are holo-like, even though the apo trajectory is mainly confined to a region around the initial apo structure. This is accomplished by expressing the difference between the protein coordinates, obtained from the holo and apo crystal structures, using as a basis the PCA modes from the apo AKE trajectory. The coherent motion described by a small set of the apo PCA modes is shown to be able to produce protein conformations that are quite similar to the holo conformation of the protein. In this sense, apo AKE does encode in its fluctuations information about holo-like conformations.

  2. miR-503 suppresses tumor cell proliferation and metastasis by directly targeting RNF31 in prostate cancer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Guo, Jia; Liu, Xiuheng, E-mail: l_xiuheng@163.com; Wang, Min

    2015-09-04

    Microarray data analyses were performed to search for metastasis-associated oncogenes in prostate cancer (PCa). RNF31 mRNA expressions in tumor tissues and benign prostate tissues were evaluated. The RNF31 protein expression levels were also analyzed by western blot and immunohistochemistry. Luciferase reporter assays were used to identify miRNAs that can regulate RNF31. The effect of RNF31 on PCa progression was studied in vitro and in vivo. We found that RNF31 was significantly increased in PCa and its expression level was highly correlated with seminal vesicle invasion, clinical stage, prostate specific antigen (PSA) level, Gleason score, and BCR. Silence of RNF31 suppressed PCa cellmore » proliferation and metastasis in vitro and in vivo. miR-503 can directly regulate RNF31. Enforced expression of miR-503 inhibited the expression of RNF31 significantly and the restoration of RNF31 expression reversed the inhibitory effects of miR-503 on PCa cell proliferation and metastasis. These findings collectively indicated an oncogene role of RNF31 in PCa progression which can be regulated by miR-503, suggesting that RNF31 could serve as a potential prognostic biomarker and therapeutic target for PCa. - Highlights: • RNF31 is a potential metastasis associated gene and is associated with prostate cancer progression. • Silence of RNF31 inhibits PCa cell colony formation, migration and invasion. • RNF31 as a direct target of miR-503. • miR-503 can regulate cell proliferation, invasion and migration by targeting RNF31. • RNF31 plays an important role in PCa growth and metastasis in vivo.« less

  3. Investigation of cell wall composition related to stem lodging resistance in wheat (Triticum aestivum L.) by FTIR spectroscopy.

    PubMed

    Wang, Jian; Zhu, Jinmao; Huang, RuZhu; Yang, YuSheng

    2012-07-01

    We explored the rapid qualitative analysis of wheat cultivars with good lodging resistances by Fourier transform infrared resonance (FTIR) spectroscopy and multivariate statistical analysis. FTIR imaging showing that wheat stem cell walls were mainly composed of cellulose, pectin, protein, and lignin. Principal components analysis (PCA) was used to eliminate multicollinearity among multiple peak absorptions. PCA revealed the developmental internodes of wheat stems could be distributed from low to high along the load of the second principal component, which was consistent with the corresponding bands of cellulose in the FTIR spectra of the cell walls. Furthermore, four distinct stem populations could also be identified by spectral features related to their corresponding mechanical properties via PCA and cluster analysis. Histochemical staining of four types of wheat stems with various abilities to resist lodging revealed that cellulose contributed more than lignin to the ability to resist lodging. These results strongly suggested that the main cell wall component responsible for these differences was cellulose. Therefore, the combination of multivariate analysis and FTIR could rapidly screen wheat cultivars with good lodging resistance. Furthermore, the application of these methods to a much wider range of cultivars of unknown mechanical properties promises to be of interest.

  4. ClustVis: a web tool for visualizing clustering of multivariate data using Principal Component Analysis and heatmap

    PubMed Central

    Metsalu, Tauno; Vilo, Jaak

    2015-01-01

    The Principal Component Analysis (PCA) is a widely used method of reducing the dimensionality of high-dimensional data, often followed by visualizing two of the components on the scatterplot. Although widely used, the method is lacking an easy-to-use web interface that scientists with little programming skills could use to make plots of their own data. The same applies to creating heatmaps: it is possible to add conditional formatting for Excel cells to show colored heatmaps, but for more advanced features such as clustering and experimental annotations, more sophisticated analysis tools have to be used. We present a web tool called ClustVis that aims to have an intuitive user interface. Users can upload data from a simple delimited text file that can be created in a spreadsheet program. It is possible to modify data processing methods and the final appearance of the PCA and heatmap plots by using drop-down menus, text boxes, sliders etc. Appropriate defaults are given to reduce the time needed by the user to specify input parameters. As an output, users can download PCA plot and heatmap in one of the preferred file formats. This web server is freely available at http://biit.cs.ut.ee/clustvis/. PMID:25969447

  5. Classification of time-of-flight secondary ion mass spectrometry spectra from complex Cu-Fe sulphides by principal component analysis and artificial neural networks.

    PubMed

    Kalegowda, Yogesh; Harmer, Sarah L

    2013-01-08

    Artificial neural network (ANN) and a hybrid principal component analysis-artificial neural network (PCA-ANN) classifiers have been successfully implemented for classification of static time-of-flight secondary ion mass spectrometry (ToF-SIMS) mass spectra collected from complex Cu-Fe sulphides (chalcopyrite, bornite, chalcocite and pyrite) at different flotation conditions. ANNs are very good pattern classifiers because of: their ability to learn and generalise patterns that are not linearly separable; their fault and noise tolerance capability; and high parallelism. In the first approach, fragments from the whole ToF-SIMS spectrum were used as input to the ANN, the model yielded high overall correct classification rates of 100% for feed samples, 88% for conditioned feed samples and 91% for Eh modified samples. In the second approach, the hybrid pattern classifier PCA-ANN was integrated. PCA is a very effective multivariate data analysis tool applied to enhance species features and reduce data dimensionality. Principal component (PC) scores which accounted for 95% of the raw spectral data variance, were used as input to the ANN, the model yielded high overall correct classification rates of 88% for conditioned feed samples and 95% for Eh modified samples. Copyright © 2012 Elsevier B.V. All rights reserved.

  6. Ripening-dependent metabolic changes in the volatiles of pineapple (Ananas comosus (L.) Merr.) fruit: II. Multivariate statistical profiling of pineapple aroma compounds based on comprehensive two-dimensional gas chromatography-mass spectrometry.

    PubMed

    Steingass, Christof Björn; Jutzi, Manfred; Müller, Jenny; Carle, Reinhold; Schmarr, Hans-Georg

    2015-03-01

    Ripening-dependent changes of pineapple volatiles were studied in a nontargeted profiling analysis. Volatiles were isolated via headspace solid phase microextraction and analyzed by comprehensive 2D gas chromatography and mass spectrometry (HS-SPME-GC×GC-qMS). Profile patterns presented in the contour plots were evaluated applying image processing techniques and subsequent multivariate statistical data analysis. Statistical methods comprised unsupervised hierarchical cluster analysis (HCA) and principal component analysis (PCA) to classify the samples. Supervised partial least squares discriminant analysis (PLS-DA) and partial least squares (PLS) regression were applied to discriminate different ripening stages and describe the development of volatiles during postharvest storage, respectively. Hereby, substantial chemical markers allowing for class separation were revealed. The workflow permitted the rapid distinction between premature green-ripe pineapples and postharvest-ripened sea-freighted fruits. Volatile profiles of fully ripe air-freighted pineapples were similar to those of green-ripe fruits postharvest ripened for 6 days after simulated sea freight export, after PCA with only two principal components. However, PCA considering also the third principal component allowed differentiation between air-freighted fruits and the four progressing postharvest maturity stages of sea-freighted pineapples.

  7. Nanostructures formed by cyclodextrin covered procainamide through supramolecular self assembly - Spectral and molecular modeling study

    NASA Astrophysics Data System (ADS)

    Rajendiran, N.; Mohandoss, T.; Sankaranarayanan, R. K.

    2015-02-01

    Inclusion complexation behavior of procainamide (PCA) with two cyclodextrins (α-CD and β-CD) were analyzed by absorption, fluorescence, scanning electron microscope (SEM), transmission electron microscope (TEM), Raman image, FT-IR, differential scanning colorimeter (DSC), Powder X ray diffraction (XRD) and 1H NMR. Blue shift was observed in β-CD whereas no significant spectral shift observed in α-CD. The inclusion complex formation results suggest that water molecules also present in the inside of the CD cavity. The present study revealed that the phenyl ring of the PCA drug is entrapped in the CD cavity. Cyclodextrin studies show that PCA forms 1:2 inclusion complex with α-CD and β-CD. PCA:α-CD complex form nano-sized particles (46 nm) and PCA:β-CD complex form self-assembled to micro-sized tubular structures. The shape-shifting of 2D nanosheets into 1D microtubes by simple rolling mechanism were analysed by micro-Raman and TEM images. Thermodynamic parameters (ΔH, ΔG and ΔS) of inclusion process were determined from semiempirical PM3 calculations.

  8. Discovering phases, phase transitions, and crossovers through unsupervised machine learning: A critical examination

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hu, Wenjian; Singh, Rajiv R. P.; Scalettar, Richard T.

    Here, we apply unsupervised machine learning techniques, mainly principal component analysis (PCA), to compare and contrast the phase behavior and phase transitions in several classical spin models - the square and triangular-lattice Ising models, the Blume-Capel model, a highly degenerate biquadratic-exchange spin-one Ising (BSI) model, and the 2D XY model, and examine critically what machine learning is teaching us. We find that quantified principal components from PCA not only allow exploration of different phases and symmetry-breaking, but can distinguish phase transition types and locate critical points. We show that the corresponding weight vectors have a clear physical interpretation, which ismore » particularly interesting in the frustrated models such as the triangular antiferromagnet, where they can point to incipient orders. Unlike the other well-studied models, the properties of the BSI model are less well known. Using both PCA and conventional Monte Carlo analysis, we demonstrate that the BSI model shows an absence of phase transition and macroscopic ground-state degeneracy. The failure to capture the 'charge' correlations (vorticity) in the BSI model (XY model) from raw spin configurations points to some of the limitations of PCA. Finally, we employ a nonlinear unsupervised machine learning procedure, the 'antoencoder method', and demonstrate that it too can be trained to capture phase transitions and critical points.« less

  9. Principal component analysis to assess the composition and fate of impurities in a large river-embedded reservoir: Qingcaosha Reservoir.

    PubMed

    Ou, Hua-Se; Wei, Chao-Hai; Deng, Yang; Gao, Nai-Yun

    2013-08-01

    Qingcaosha Reservoir (QR) is the largest river-embedded reservoir in east China, which receives its source water from the Yangtze River (YR). The temporal and spatial variations in dissolved organic matter (DOM), chromophoric DOM (CDOM), nitrogen, phosphorus and phytoplankton biomass were investigated from June to September in 2012 and were integrated by principal component analysis (PCA). Three PCA factors were identified: (1) phytoplankton related factor 1, (2) total DOM related factor 2, and (3) eutrophication related factor 3. Factor 1 was a lake-type parameter which correlated with chlorophyll-a and protein-like CDOM (r = 0.793 and r = 0.831, respectively). Factor 2 was a river-type parameter which correlated with total DOC and humic-like CDOM (r = 0.668 and r = 0.726, respectively). Factor 3 correlated with total nitrogen and phosphorus (r = 0.864 and r = 0.621, respectively). The low flow speed, self-sedimentation and nutrient accumulation in QR resulted in increases in PCA factor 1 scores (phytoplankton biomass and derived CDOM) in the spatial scale, indicating a change of river-type water (YR) to lake-type water (QR). In summer, the water temperature variation induced a growth-bloom-decay process of phytoplankton combined with the increase of PCA factor 2 (humic-like CDOM) in the QR, which was absent in the YR.

  10. Discovering phases, phase transitions, and crossovers through unsupervised machine learning: A critical examination

    DOE PAGES

    Hu, Wenjian; Singh, Rajiv R. P.; Scalettar, Richard T.

    2017-06-19

    Here, we apply unsupervised machine learning techniques, mainly principal component analysis (PCA), to compare and contrast the phase behavior and phase transitions in several classical spin models - the square and triangular-lattice Ising models, the Blume-Capel model, a highly degenerate biquadratic-exchange spin-one Ising (BSI) model, and the 2D XY model, and examine critically what machine learning is teaching us. We find that quantified principal components from PCA not only allow exploration of different phases and symmetry-breaking, but can distinguish phase transition types and locate critical points. We show that the corresponding weight vectors have a clear physical interpretation, which ismore » particularly interesting in the frustrated models such as the triangular antiferromagnet, where they can point to incipient orders. Unlike the other well-studied models, the properties of the BSI model are less well known. Using both PCA and conventional Monte Carlo analysis, we demonstrate that the BSI model shows an absence of phase transition and macroscopic ground-state degeneracy. The failure to capture the 'charge' correlations (vorticity) in the BSI model (XY model) from raw spin configurations points to some of the limitations of PCA. Finally, we employ a nonlinear unsupervised machine learning procedure, the 'antoencoder method', and demonstrate that it too can be trained to capture phase transitions and critical points.« less

  11. Discovering phases, phase transitions, and crossovers through unsupervised machine learning: A critical examination

    NASA Astrophysics Data System (ADS)

    Hu, Wenjian; Singh, Rajiv R. P.; Scalettar, Richard T.

    2017-06-01

    We apply unsupervised machine learning techniques, mainly principal component analysis (PCA), to compare and contrast the phase behavior and phase transitions in several classical spin models—the square- and triangular-lattice Ising models, the Blume-Capel model, a highly degenerate biquadratic-exchange spin-1 Ising (BSI) model, and the two-dimensional X Y model—and we examine critically what machine learning is teaching us. We find that quantified principal components from PCA not only allow the exploration of different phases and symmetry-breaking, but they can distinguish phase-transition types and locate critical points. We show that the corresponding weight vectors have a clear physical interpretation, which is particularly interesting in the frustrated models such as the triangular antiferromagnet, where they can point to incipient orders. Unlike the other well-studied models, the properties of the BSI model are less well known. Using both PCA and conventional Monte Carlo analysis, we demonstrate that the BSI model shows an absence of phase transition and macroscopic ground-state degeneracy. The failure to capture the "charge" correlations (vorticity) in the BSI model (X Y model) from raw spin configurations points to some of the limitations of PCA. Finally, we employ a nonlinear unsupervised machine learning procedure, the "autoencoder method," and we demonstrate that it too can be trained to capture phase transitions and critical points.

  12. Fast and Accurate Radiative Transfer Calculations Using Principal Component Analysis for (Exo-)Planetary Retrieval Models

    NASA Astrophysics Data System (ADS)

    Kopparla, P.; Natraj, V.; Shia, R. L.; Spurr, R. J. D.; Crisp, D.; Yung, Y. L.

    2015-12-01

    Radiative transfer (RT) computations form the engine of atmospheric retrieval codes. However, full treatment of RT processes is computationally expensive, prompting usage of two-stream approximations in current exoplanetary atmospheric retrieval codes [Line et al., 2013]. Natraj et al. [2005, 2010] and Spurr and Natraj [2013] demonstrated the ability of a technique using principal component analysis (PCA) to speed up RT computations. In the PCA method for RT performance enhancement, empirical orthogonal functions are developed for binned sets of inherent optical properties that possess some redundancy; costly multiple-scattering RT calculations are only done for those few optical states corresponding to the most important principal components, and correction factors are applied to approximate radiation fields. Kopparla et al. [2015, in preparation] extended the PCA method to a broadband spectral region from the ultraviolet to the shortwave infrared (0.3-3 micron), accounting for major gas absorptions in this region. Here, we apply the PCA method to a some typical (exo-)planetary retrieval problems. Comparisons between the new model, called Universal Principal Component Analysis Radiative Transfer (UPCART) model, two-stream models and line-by-line RT models are performed, for spectral radiances, spectral fluxes and broadband fluxes. Each of these are calculated at the top of the atmosphere for several scenarios with varying aerosol types, extinction and scattering optical depth profiles, and stellar and viewing geometries. We demonstrate that very accurate radiance and flux estimates can be obtained, with better than 1% accuracy in all spectral regions and better than 0.1% in most cases, as compared to a numerically exact line-by-line RT model. The accuracy is enhanced when the results are convolved to typical instrument resolutions. The operational speed and accuracy of UPCART can be further improved by optimizing binning schemes and parallelizing the codes, work on which is under way.

  13. Prostate health index and prostate cancer gene 3 score but not percent-free Prostate Specific Antigen have a predictive role in differentiating histological prostatitis from PCa and other nonneoplastic lesions (BPH and HG-PIN) at repeat biopsy.

    PubMed

    De Luca, Stefano; Passera, Roberto; Fiori, Cristian; Bollito, Enrico; Cappia, Susanna; Mario Scarpa, Roberto; Sottile, Antonino; Franco Randone, Donato; Porpiglia, Francesco

    2015-10-01

    To determine if prostate health index (PHI), prostate cancer antigen gene 3 (PCA3) score, and percentage of free prostate-specific antigen (%fPSA) may be used to differentiate asymptomatic acute and chronic prostatitis from prostate cancer (PCa), benign prostatic hyperplasia (BPH), and high-grade prostate intraepithelial neoplasia (HG-PIN) in patients with elevated PSA levels and negative findings on digital rectal examination at repeat biopsy (re-Bx). In this prospective study, 252 patients were enrolled, undergoing PHI, PCA3 score, and %fPSA assessments before re-Bx. We used 3 multivariate logistic regression models to test the PHI, PCA3 score, and %fPSA as risk factors for prostatitis vs. PCa, vs. BPH, and vs. HG-PIN. All the analyses were performed for the whole patient cohort and for the "gray zone" of PSA (4-10ng/ml) cohort (171 individuals). Of the 252 patients, 43 (17.1%) had diagnosis of PCa. The median PHI was significantly different between men with a negative biopsy and those with a positive biopsy (34.9 vs. 48.1, P<0.001), as for the PCA3 score (24 vs. 54, P<0.001) and %fPSA (11.8% vs. 15.8%, P = 0.012). The net benefit of using PCA3 and PHI to differentiate prostatitis and PCa was moderate, although it extended to a good range of threshold probabilities (40%-100%), whereas that from using %fPSA was negligible: this pattern was reported for the whole population as for the "gray zone" PSA cohort. In front of a good diagnostic performance of all the 3 biomarkers in distinguishing negative biopsy vs. positive biopsy, the clinical benefit of using the PCA3 score and PHI to estimate prostatitis vs. PCa was comparable. PHI was the only determinant for prostatitis vs. BPH, whereas no biomarkers could differentiate prostate inflammation from HG-PIN. Copyright © 2015 Elsevier Inc. All rights reserved.

  14. Local distributions of wealth to describe health inequalities in India: a new approach for analyzing nationally representative household survey data, 1992-2008.

    PubMed

    Bassani, Diego G; Corsi, Daniel J; Gaffey, Michelle F; Barros, Aluisio J D

    2014-01-01

    Worse health outcomes including higher morbidity and mortality are most often observed among the poorest fractions of a population. In this paper we present and validate national, regional and state-level distributions of national wealth index scores, for urban and rural populations, derived from household asset data collected in six survey rounds in India between 1992-3 and 2007-8. These new indices and their sub-national distributions allow for comparative analyses of a standardized measure of wealth across time and at various levels of population aggregation in India. Indices were derived through principal components analysis (PCA) performed using standardized variables from a correlation matrix to minimize differences in variance. Valid and simple indices were constructed with the minimum number of assets needed to produce scores with enough variability to allow definition of unique decile cut-off points in each urban and rural area of all states. For all indices, the first PCA components explained between 36% and 43% of the variance in household assets. Using sub-national distributions of national wealth index scores, mean height-for-age z-scores increased from the poorest to the richest wealth quintiles for all surveys, and stunting prevalence was higher among the poorest and lower among the wealthiest. Urban and rural decile cut-off values for India, for the six regions and for the 24 major states revealed large variability in wealth by geographical area and level, and rural wealth score gaps exceeded those observed in urban areas. The large variability in sub-national distributions of national wealth index scores indicates the importance of accounting for such variation when constructing wealth indices and deriving score distribution cut-off points. Such an approach allows for proper within-sample economic classification, resulting in scores that are valid indicators of wealth and correlate well with health outcomes, and enables wealth-related analyses at whichever geographical area and level may be most informative for policy-making processes.

  15. A probability index for surface zonda wind occurrence at Mendoza city through vertical sounding principal components analysis

    NASA Astrophysics Data System (ADS)

    Otero, Federico; Norte, Federico; Araneo, Diego

    2018-01-01

    The aim of this work is to obtain an index for predicting the probability of occurrence of zonda event at surface level from sounding data at Mendoza city, Argentine. To accomplish this goal, surface zonda wind events were previously found with an objective classification method (OCM) only considering the surface station values. Once obtained the dates and the onset time of each event, the prior closest sounding for each event was taken to realize a principal component analysis (PCA) that is used to identify the leading patterns of the vertical structure of the atmosphere previously to a zonda wind event. These components were used to construct the index model. For the PCA an entry matrix of temperature ( T) and dew point temperature (Td) anomalies for the standard levels between 850 and 300 hPa was build. The analysis yielded six significant components with a 94 % of the variance explained and the leading patterns of favorable weather conditions for the development of the phenomenon were obtained. A zonda/non-zonda indicator c can be estimated by a logistic multiple regressions depending on the PCA component loadings, determining a zonda probability index \\widehat{c} calculable from T and Td profiles and it depends on the climatological features of the region. The index showed 74.7 % efficiency. The same analysis was performed by adding surface values of T and Td from Mendoza Aero station increasing the index efficiency to 87.8 %. The results revealed four significantly correlated PCs with a major improvement in differentiating zonda cases and a reducing of the uncertainty interval.

  16. PAin SoluTions In the Emergency Setting (PASTIES)--patient controlled analgesia versus routine care in emergency department patients with pain from traumatic injuries: randomised trial.

    PubMed

    Smith, Jason E; Rockett, Mark; S, Siobhan Creanor; Squire, Rosalyn; Hayward, Chris; Ewings, Paul; Barton, Andy; Pritchard, Colin; Eyre, Victoria; Cocking, Laura; Benger, Jonathan

    2015-06-21

    To determine whether patient controlled analgesia (PCA) is better than routine care in patients presenting to emergency departments with moderate to severe pain from traumatic injuries. Pragmatic, multicentre, parallel group, randomised controlled trial. Five English hospitals. 200 adults (71% (n = 142) male), aged 18 to 75 years, who presented to the emergency department requiring intravenous opioid analgesia for the treatment of moderate to severe pain from traumatic injuries and were expected to be admitted to hospital for at least 12 hours. PCA (n = 99) or nurse titrated analgesia (treatment as usual; n = 101). The primary outcome was total pain experienced over the 12 hour study period, derived by standardised area under the curve (scaled from 0 to 100) of each participant's hourly pain scores, captured using a visual analogue scale. Pre-specified secondary outcomes included total morphine use, percentage of study period in moderate/severe pain, percentage of study period asleep, length of hospital stay, and satisfaction with pain management. 200 participants were included in the primary analyses. Mean total pain experienced was 47.2 (SD 21.9) for the treatment as usual group and 44.0 (24.0) for the PCA group. Adjusted analyses indicated slightly (but not statistically significantly) lower total pain experienced in the PCA group than in the routine care group (mean difference 2.7, 95% confidence interval -2.4 to 7.8). Participants allocated to PCA used more morphine in total than did participants in the treatment as usual group (mean 44.3 (23.2) v 27.2 (18.2) mg; mean difference 17.0, 11.3 to 22.7). PCA participants spent, on average, less time in moderate/severe pain (36.2% (31.0) v 44.1% (31.6)), but the difference was not statistically significant. A higher proportion of PCA participants reported being perfectly or very satisfied compared with the treatment as usual group (86% (78/91) v 76% (74/98)), but this was also not statistically significant. PCA provided no statistically significant reduction in pain compared with routine care for emergency department patients with traumatic injuries. Trial registration European Clinical Trials Database EudraCT2011-000194-31; Current Controlled Trials ISRCTN25343280. © Smith et al 2015.

  17. Increased serum N-terminal pro-B-type natriuretic peptide levels in patients with medial arterial calcification and poorly compressible leg arteries.

    PubMed

    Jouni, Hayan; Rodeheffer, Richard J; Kullo, Iftikhar J

    2011-01-01

    To determine whether serum levels of N-terminal (NT) pro-B-type natriuretic peptide (pro-BNP) are higher in patients with poorly compressible arteries (PCA) than in patients with peripheral artery disease (PAD) and control subjects without PCA or PAD. Medial arterial calcification in the lower extremities results in PCA and may be associated with increased arterial stiffness and hemodynamic/myocardial stress. PCA was defined as having an ankle-brachial index >1.4 or an ankle blood pressure >255 mm Hg, whereas PAD was defined as having an ankle-brachial index ≤0.9. Study participants with PCA (n=100; aged 71±10 years; 70% men) and age- and sex-matched patients with PAD (n=300) were recruited from the noninvasive vascular laboratory. Age- and sex-matched controls (n=300) were identified from a community-based cohort and had no history of PAD. NT pro-BNP levels were approximately 2.5-fold higher in patients with PCA than in patients with PAD and approximately 4-fold higher than in age- and sex-matched controls. In multivariable regression analyses that adjusted for age, sex, smoking, hypertension, history of coronary heart disease/stroke, systolic blood pressure, and serum creatinine, NT pro-BNP levels remained significantly higher in patients with PCA than in patients with PAD and controls (P<0.001). Patients with medial arterial calcification and PCA have higher serum levels of NT pro-BNP than patients with PAD and controls, which is suggestive of an adverse hemodynamic milieu and increased risk for adverse cardiovascular outcomes.

  18. Combination of 1H nuclear magnetic resonance spectroscopy and principal component analysis to evaluate the lipid fluidity of flutamide-encapsulated lipid nanoemulsions.

    PubMed

    Takegami, Shigehiko; Ueyama, Keita; Konishi, Atsuko; Kitade, Tatsuya

    2018-06-06

    The lipid fluidity of various lipid nanoemulsions (LNEs) without and with flutamide (FT) and containing one of two neutral lipids, one of four phosphatidylcholines as a surfactant, and sodium palmitate as a cosurfactant was investigated by the combination of 1 H nuclear magnetic resonance (NMR) spectroscopy and principal component analysis (PCA). In the 1 H NMR spectra, the peaks from the methylene groups of the neutral lipids and surfactants for all LNE preparations showed downfield shifts with increasing temperature from 20 to 60 °C. PCA was applied to the 1 H NMR spectral data obtained for the LNEs. The PCA resulted in a model in which the first two principal components (PCs) extracted 88% of the total spectral variation; the first PC (PC-1) axis and second PC (PC-2) axis accounted for 73 and 15%, respectively, of the total spectral variation. The Score-1 values for PC-1 plotted against temperature revealed the existence of two clusters, which were defined by the neutral lipid of the LNE preparations. Meanwhile, the Score-2 values decreased with rising temperature and reflected the increase in lipid fluidity of each LNE preparation, consistent with fluorescence anisotropy measurements. In addition, the changes of Score-2 values with temperature for LNE preparations with FT were smaller than those for LNE preparations without FT. This indicates that FT encapsulated in LNE particles markedly suppressed the increase in lipid fluidity of LNE particles with rising temperature. Thus, PCA of 1 H NMR spectra will become a powerful tool to analyze the lipid fluidity of lipid nanoparticles. Graphical abstract ᅟ.

  19. Mapping brain activity in gradient-echo functional MRI using principal component analysis

    NASA Astrophysics Data System (ADS)

    Khosla, Deepak; Singh, Manbir; Don, Manuel

    1997-05-01

    The detection of sites of brain activation in functional MRI has been a topic of immense research interest and many technique shave been proposed to this end. Recently, principal component analysis (PCA) has been applied to extract the activated regions and their time course of activation. This method is based on the assumption that the activation is orthogonal to other signal variations such as brain motion, physiological oscillations and other uncorrelated noises. A distinct advantage of this method is that it does not require any knowledge of the time course of the true stimulus paradigm. This technique is well suited to EPI image sequences where the sampling rate is high enough to capture the effects of physiological oscillations. In this work, we propose and apply tow methods that are based on PCA to conventional gradient-echo images and investigate their usefulness as tools to extract reliable information on brain activation. The first method is a conventional technique where a single image sequence with alternating on and off stages is subject to a principal component analysis. The second method is a PCA-based approach called the common spatial factor analysis technique (CSF). As the name suggests, this method relies on common spatial factors between the above fMRI image sequence and a background fMRI. We have applied these methods to identify active brain ares during visual stimulation and motor tasks. The results from these methods are compared to those obtained by using the standard cross-correlation technique. We found good agreement in the areas identified as active across all three techniques. The results suggest that PCA and CSF methods have good potential in detecting the true stimulus correlated changes in the presence of other interfering signals.

  20. Discrimination of Rhizoma Gastrodiae (Tianma) using 3D synchronous fluorescence spectroscopy coupled with principal component analysis

    NASA Astrophysics Data System (ADS)

    Fan, Qimeng; Chen, Chaoyin; Huang, Zaiqiang; Zhang, Chunmei; Liang, Pengjuan; Zhao, Shenglan

    2015-02-01

    Rhizoma Gastrodiae (Tianma) of different variants and different geographical origins has vital difference in quality and physiological efficacy. This paper focused on the classification and identification of Tianma of six types (two variants from three different geographical origins) using three dimensional synchronous fluorescence spectroscopy (3D-SFS) coupled with principal component analysis (PCA). 3D-SF spectra of aqueous extracts, which were obtained from Tianma of the six types, were measured by a LS-50B luminescence spectrofluorometer. The experimental results showed that the characteristic fluorescent spectral regions of the 3D-SF spectra were similar, while the intensities of characteristic regions are different significantly. Coupled these differences in peak intensities with PCA, Tianma of six types could be discriminated successfully. In conclusion, 3D-SFS coupled with PCA, which has such advantages as effective, specific, rapid, non-polluting, has an edge for discrimination of the similar Chinese herbal medicine. And the proposed methodology is a useful tool to classify and identify Tianma of different variants and different geographical origins.

Top