The impact of moderate wine consumption on the risk of developing prostate cancer.
Vartolomei, Mihai Dorin; Kimura, Shoji; Ferro, Matteo; Foerster, Beat; Abufaraj, Mohammad; Briganti, Alberto; Karakiewicz, Pierre I; Shariat, Shahrokh F
2018-01-01
To investigate the impact of moderate wine consumption on the risk of prostate cancer (PCa). We focused on the differential effect of moderate consumption of red versus white wine. This study was a meta-analysis that includes data from case-control and cohort studies. A systematic search of Web of Science, Medline/PubMed, and Cochrane library was performed on December 1, 2017. Studies were deemed eligible if they assessed the risk of PCa due to red, white, or any wine using multivariable logistic regression analysis. We performed a formal meta-analysis for the risk of PCa according to moderate wine and wine type consumption (white or red). Heterogeneity between studies was assessed using Cochrane's Q test and I 2 statistics. Publication bias was assessed using Egger's regression test. A total of 930 abstracts and titles were initially identified. After removal of duplicates, reviews, and conference abstracts, 83 full-text original articles were screened. Seventeen studies (611,169 subjects) were included for final evaluation and fulfilled the inclusion criteria. In the case of moderate wine consumption: the pooled risk ratio (RR) for the risk of PCa was 0.98 (95% CI 0.92-1.05, p =0.57) in the multivariable analysis. Moderate white wine consumption increased the risk of PCa with a pooled RR of 1.26 (95% CI 1.10-1.43, p =0.001) in the multi-variable analysis. Meanwhile, moderate red wine consumption had a protective role reducing the risk by 12% (RR 0.88, 95% CI 0.78-0.999, p =0.047) in the multivariable analysis that comprised 222,447 subjects. In this meta-analysis, moderate wine consumption did not impact the risk of PCa. Interestingly, regarding the type of wine, moderate consumption of white wine increased the risk of PCa, whereas moderate consumption of red wine had a protective effect. Further analyses are needed to assess the differential molecular effect of white and red wine conferring their impact on PCa risk.
Mueller, Daniela; Ferrão, Marco Flôres; Marder, Luciano; da Costa, Adilson Ben; de Cássia de Souza Schneider, Rosana
2013-01-01
The main objective of this study was to use infrared spectroscopy to identify vegetable oils used as raw material for biodiesel production and apply multivariate analysis to the data. Six different vegetable oil sources—canola, cotton, corn, palm, sunflower and soybeans—were used to produce biodiesel batches. The spectra were acquired by Fourier transform infrared spectroscopy using a universal attenuated total reflectance sensor (FTIR-UATR). For the multivariate analysis principal component analysis (PCA), hierarchical cluster analysis (HCA), interval principal component analysis (iPCA) and soft independent modeling of class analogy (SIMCA) were used. The results indicate that is possible to develop a methodology to identify vegetable oils used as raw material in the production of biodiesel by FTIR-UATR applying multivariate analysis. It was also observed that the iPCA found the best spectral range for separation of biodiesel batches using FTIR-UATR data, and with this result, the SIMCA method classified 100% of the soybean biodiesel samples. PMID:23539030
The impact of moderate wine consumption on the risk of developing prostate cancer
Ferro, Matteo; Foerster, Beat; Abufaraj, Mohammad; Briganti, Alberto; Karakiewicz, Pierre I; Shariat, Shahrokh F
2018-01-01
Objective To investigate the impact of moderate wine consumption on the risk of prostate cancer (PCa). We focused on the differential effect of moderate consumption of red versus white wine. Design This study was a meta-analysis that includes data from case–control and cohort studies. Materials and methods A systematic search of Web of Science, Medline/PubMed, and Cochrane library was performed on December 1, 2017. Studies were deemed eligible if they assessed the risk of PCa due to red, white, or any wine using multivariable logistic regression analysis. We performed a formal meta-analysis for the risk of PCa according to moderate wine and wine type consumption (white or red). Heterogeneity between studies was assessed using Cochrane’s Q test and I2 statistics. Publication bias was assessed using Egger’s regression test. Results A total of 930 abstracts and titles were initially identified. After removal of duplicates, reviews, and conference abstracts, 83 full-text original articles were screened. Seventeen studies (611,169 subjects) were included for final evaluation and fulfilled the inclusion criteria. In the case of moderate wine consumption: the pooled risk ratio (RR) for the risk of PCa was 0.98 (95% CI 0.92–1.05, p=0.57) in the multivariable analysis. Moderate white wine consumption increased the risk of PCa with a pooled RR of 1.26 (95% CI 1.10–1.43, p=0.001) in the multi-variable analysis. Meanwhile, moderate red wine consumption had a protective role reducing the risk by 12% (RR 0.88, 95% CI 0.78–0.999, p=0.047) in the multivariable analysis that comprised 222,447 subjects. Conclusions In this meta-analysis, moderate wine consumption did not impact the risk of PCa. Interestingly, regarding the type of wine, moderate consumption of white wine increased the risk of PCa, whereas moderate consumption of red wine had a protective effect. Further analyses are needed to assess the differential molecular effect of white and red wine conferring their impact on PCa risk. PMID:29713200
Cantiello, Francesco; Russo, Giorgio Ivan; Cicione, Antonio; Ferro, Matteo; Cimino, Sebastiano; Favilla, Vincenzo; Perdonà, Sisto; De Cobelli, Ottavio; Magno, Carlo; Morgia, Giuseppe; Damiano, Rocco
2016-04-01
To assess the performance of prostate health index (PHI) and prostate cancer antigen 3 (PCA3) when added to the PRIAS or Epstein criteria in predicting the presence of pathologically insignificant prostate cancer (IPCa) in patients who underwent radical prostatectomy (RP) but eligible for active surveillance (AS). An observational retrospective study was performed in 188 PCa patients treated with laparoscopic or robot-assisted RP but eligible for AS according to Epstein or PRIAS criteria. Blood and urinary specimens were collected before initial prostate biopsy for PHI and PCA3 measurements. Multivariate logistic regression analyses and decision curve analysis were carried out to identify predictors of IPCa using the updated ERSPC definition. At the multivariate analyses, the inclusion of both PCA3 and PHI significantly increased the accuracy of the Epstein multivariate model in predicting IPCa with an increase of 17 % (AUC = 0.77) and of 32 % (AUC = 0.92), respectively. The inclusion of both PCA3 and PHI also increased the predictive accuracy of the PRIAS multivariate model with an increase of 29 % (AUC = 0.87) and of 39 % (AUC = 0.97), respectively. DCA revealed that the multivariable models with the addition of PHI or PCA3 showed a greater net benefit and performed better than the reference models. In a direct comparison, PHI outperformed PCA3 performance resulting in higher net benefit. In a same cohort of patients eligible for AS, the addition of PHI and PCA3 to Epstein or PRIAS models improved their prognostic performance. PHI resulted in greater net benefit in predicting IPCa compared to PCA3.
Multivariate analysis for scanning tunneling spectroscopy data
NASA Astrophysics Data System (ADS)
Yamanishi, Junsuke; Iwase, Shigeru; Ishida, Nobuyuki; Fujita, Daisuke
2018-01-01
We applied principal component analysis (PCA) to two-dimensional tunneling spectroscopy (2DTS) data obtained on a Si(111)-(7 × 7) surface to explore the effectiveness of multivariate analysis for interpreting 2DTS data. We demonstrated that several components that originated mainly from specific atoms at the Si(111)-(7 × 7) surface can be extracted by PCA. Furthermore, we showed that hidden components in the tunneling spectra can be decomposed (peak separation), which is difficult to achieve with normal 2DTS analysis without the support of theoretical calculations. Our analysis showed that multivariate analysis can be an additional powerful way to analyze 2DTS data and extract hidden information from a large amount of spectroscopic data.
NASA Astrophysics Data System (ADS)
Feng, Shangyuan; Lin, Juqiang; Huang, Zufang; Chen, Guannan; Chen, Weisheng; Wang, Yue; Chen, Rong; Zeng, Haishan
2013-01-01
The capability of using silver nanoparticle based near-infrared surface enhanced Raman scattering (SERS) spectroscopy combined with principal component analysis (PCA) and linear discriminate analysis (LDA) to differentiate esophageal cancer tissue from normal tissue was presented. Significant differences in Raman intensities of prominent SERS bands were observed between normal and cancer tissues. PCA-LDA multivariate analysis of the measured tissue SERS spectra achieved diagnostic sensitivity of 90.9% and specificity of 97.8%. This exploratory study demonstrated great potential for developing label-free tissue SERS analysis into a clinical tool for esophageal cancer detection.
Nonlinear multivariate and time series analysis by neural network methods
NASA Astrophysics Data System (ADS)
Hsieh, William W.
2004-03-01
Methods in multivariate statistical analysis are essential for working with large amounts of geophysical data, data from observational arrays, from satellites, or from numerical model output. In classical multivariate statistical analysis, there is a hierarchy of methods, starting with linear regression at the base, followed by principal component analysis (PCA) and finally canonical correlation analysis (CCA). A multivariate time series method, the singular spectrum analysis (SSA), has been a fruitful extension of the PCA technique. The common drawback of these classical methods is that only linear structures can be correctly extracted from the data. Since the late 1980s, neural network methods have become popular for performing nonlinear regression and classification. More recently, neural network methods have been extended to perform nonlinear PCA (NLPCA), nonlinear CCA (NLCCA), and nonlinear SSA (NLSSA). This paper presents a unified view of the NLPCA, NLCCA, and NLSSA techniques and their applications to various data sets of the atmosphere and the ocean (especially for the El Niño-Southern Oscillation and the stratospheric quasi-biennial oscillation). These data sets reveal that the linear methods are often too simplistic to describe real-world systems, with a tendency to scatter a single oscillatory phenomenon into numerous unphysical modes or higher harmonics, which can be largely alleviated in the new nonlinear paradigm.
Docking and multivariate methods to explore HIV-1 drug-resistance: a comparative analysis
NASA Astrophysics Data System (ADS)
Almerico, Anna Maria; Tutone, Marco; Lauria, Antonino
2008-05-01
In this paper we describe a comparative analysis between multivariate and docking methods in the study of the drug resistance to the reverse transcriptase and the protease inhibitors. In our early papers we developed a simple but efficient method to evaluate the features of compounds that are less likely to trigger resistance or are effective against mutant HIV strains, using the multivariate statistical procedures PCA and DA. In the attempt to create a more solid background for the prediction of susceptibility or resistance, we carried out a comparative analysis between our previous multivariate approach and molecular docking study. The intent of this paper is not only to find further support to the results obtained by the combined use of PCA and DA, but also to evidence the structural features, in terms of molecular descriptors, similarity, and energetic contributions, derived from docking, which can account for the arising of drug-resistance against mutant strains.
Chieng, Norman; Trnka, Hjalte; Boetker, Johan; Pikal, Michael; Rantanen, Jukka; Grohganz, Holger
2013-09-15
The purpose of this study is to investigate the use of multivariate data analysis for powder X-ray diffraction-pair-wise distribution function (PXRD-PDF) data to detect phase separation in freeze-dried binary amorphous systems. Polymer-polymer and polymer-sugar binary systems at various ratios were freeze-dried. All samples were analyzed by PXRD, transformed to PDF and analyzed by principal component analysis (PCA). These results were validated by differential scanning calorimetry (DSC) through characterization of glass transition of the maximally freeze-concentrate solute (Tg'). Analysis of PXRD-PDF data using PCA provides a more clear 'miscible' or 'phase separated' interpretation through the distribution pattern of samples on a score plot presentation compared to residual plot method. In a phase separated system, samples were found to be evenly distributed around the theoretical PDF profile. For systems that were miscible, a clear deviation of samples away from the theoretical PDF profile was observed. Moreover, PCA analysis allows simultaneous analysis of replicate samples. Comparatively, the phase behavior analysis from PXRD-PDF-PCA method was in agreement with the DSC results. Overall, the combined PXRD-PDF-PCA approach improves the clarity of the PXRD-PDF results and can be used as an alternative explorative data analytical tool in detecting phase separation in freeze-dried binary amorphous systems. Copyright © 2013 Elsevier B.V. All rights reserved.
Multivariate frequency domain analysis of protein dynamics
NASA Astrophysics Data System (ADS)
Matsunaga, Yasuhiro; Fuchigami, Sotaro; Kidera, Akinori
2009-03-01
Multivariate frequency domain analysis (MFDA) is proposed to characterize collective vibrational dynamics of protein obtained by a molecular dynamics (MD) simulation. MFDA performs principal component analysis (PCA) for a bandpass filtered multivariate time series using the multitaper method of spectral estimation. By applying MFDA to MD trajectories of bovine pancreatic trypsin inhibitor, we determined the collective vibrational modes in the frequency domain, which were identified by their vibrational frequencies and eigenvectors. At near zero temperature, the vibrational modes determined by MFDA agreed well with those calculated by normal mode analysis. At 300 K, the vibrational modes exhibited characteristic features that were considerably different from the principal modes of the static distribution given by the standard PCA. The influences of aqueous environments were discussed based on two different sets of vibrational modes, one derived from a MD simulation in water and the other from a simulation in vacuum. Using the varimax rotation, an algorithm of the multivariate statistical analysis, the representative orthogonal set of eigenmodes was determined at each vibrational frequency.
Water quality analysis of the Rapur area, Andhra Pradesh, South India using multivariate techniques
NASA Astrophysics Data System (ADS)
Nagaraju, A.; Sreedhar, Y.; Thejaswi, A.; Sayadi, Mohammad Hossein
2017-10-01
The groundwater samples from Rapur area were collected from different sites to evaluate the major ion chemistry. The large number of data can lead to difficulties in the integration, interpretation, and representation of the results. Two multivariate statistical methods, hierarchical cluster analysis (HCA) and factor analysis (FA), were applied to evaluate their usefulness to classify and identify geochemical processes controlling groundwater geochemistry. Four statistically significant clusters were obtained from 30 sampling stations. This has resulted two important clusters viz., cluster 1 (pH, Si, CO3, Mg, SO4, Ca, K, HCO3, alkalinity, Na, Na + K, Cl, and hardness) and cluster 2 (EC and TDS) which are released to the study area from different sources. The application of different multivariate statistical techniques, such as principal component analysis (PCA), assists in the interpretation of complex data matrices for a better understanding of water quality of a study area. From PCA, it is clear that the first factor (factor 1), accounted for 36.2% of the total variance, was high positive loading in EC, Mg, Cl, TDS, and hardness. Based on the PCA scores, four significant cluster groups of sampling locations were detected on the basis of similarity of their water quality.
Zhou, Fei; Zhao, Yajing; Peng, Jiyu; Jiang, Yirong; Li, Maiquan; Jiang, Yuan; Lu, Baiyi
2017-07-01
Osmanthus fragrans flowers are used as folk medicine and additives for teas, beverages and foods. The metabolites of O. fragrans flowers from different geographical origins were inconsistent in some extent. Chromatography and mass spectrometry combined with multivariable analysis methods provides an approach for discriminating the origin of O. fragrans flowers. To discriminate the Osmanthus fragrans var. thunbergii flowers from different origins with the identified metabolites. GC-MS and UPLC-PDA were conducted to analyse the metabolites in O. fragrans var. thunbergii flowers (in total 150 samples). Principal component analysis (PCA), soft independent modelling of class analogy analysis (SIMCA) and random forest (RF) analysis were applied to group the GC-MS and UPLC-PDA data. GC-MS identified 32 compounds common to all samples while UPLC-PDA/QTOF-MS identified 16 common compounds. PCA of the UPLC-PDA data generated a better clustering than PCA of the GC-MS data. Ten metabolites (six from GC-MS and four from UPLC-PDA) were selected as effective compounds for discrimination by PCA loadings. SIMCA and RF analysis were used to build classification models, and the RF model, based on the four effective compounds (caffeic acid derivative, acteoside, ligustroside and compound 15), yielded better results with the classification rate of 100% in the calibration set and 97.8% in the prediction set. GC-MS and UPLC-PDA combined with multivariable analysis methods can discriminate the origin of Osmanthus fragrans var. thunbergii flowers. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.
NASA Astrophysics Data System (ADS)
Oh, Han Bin; Leach, Franklin E.; Arungundram, Sailaja; Al-Mafraji, Kanar; Venot, Andre; Boons, Geert-Jan; Amster, I. Jonathan
2011-03-01
The structural characterization of glycosaminoglycan (GAG) carbohydrates by mass spectrometry has been a long-standing analytical challenge due to the inherent heterogeneity of these biomolecules, specifically polydispersity, variability in sulfation, and hexuronic acid stereochemistry. Recent advances in tandem mass spectrometry methods employing threshold and electron-based ion activation have resulted in the ability to determine the location of the labile sulfate modification as well as assign the stereochemistry of hexuronic acid residues. To facilitate the analysis of complex electron detachment dissociation (EDD) spectra, principal component analysis (PCA) is employed to differentiate the hexuronic acid stereochemistry of four synthetic GAG epimers whose EDD spectra are nearly identical upon visual inspection. For comparison, PCA is also applied to infrared multiphoton dissociation spectra (IRMPD) of the examined epimers. To assess the applicability of multivariate methods in GAG mixture analysis, PCA is utilized to identify the relative content of two epimers in a binary mixture.
Geladi, Paul; Nelson, Andrew; Lindholm-Sethson, Britta
2007-07-09
Electrical impedance gives multivariate complex number data as results. Two examples of multivariate electrical impedance data measured on lipid monolayers in different solutions give rise to matrices (16x50 and 38x50) of complex numbers. Multivariate data analysis by principal component analysis (PCA) or singular value decomposition (SVD) can be used for complex data and the necessary equations are given. The scores and loadings obtained are vectors of complex numbers. It is shown that the complex number PCA and SVD are better at concentrating information in a few components than the naïve juxtaposition method and that Argand diagrams can replace score and loading plots. Different concentrations of Magainin and Gramicidin A give different responses and also the role of the electrolyte medium can be studied. An interaction of Gramicidin A in the solution with the monolayer over time can be observed.
Spectral discrimination of serum from liver cancer and liver cirrhosis using Raman spectroscopy
NASA Astrophysics Data System (ADS)
Yang, Tianyue; Li, Xiaozhou; Yu, Ting; Sun, Ruomin; Li, Siqi
2011-07-01
In this paper, Raman spectra of human serum were measured using Raman spectroscopy, then the spectra was analyzed by multivariate statistical methods of principal component analysis (PCA). Then linear discriminant analysis (LDA) was utilized to differentiate the loading score of different diseases as the diagnosing algorithm. Artificial neural network (ANN) was used for cross-validation. The diagnosis sensitivity and specificity by PCA-LDA are 88% and 79%, while that of the PCA-ANN are 89% and 95%. It can be seen that modern analyzing method is a useful tool for the analysis of serum spectra for diagnosing diseases.
Evaluation of Meterorite Amono Acid Analysis Data Using Multivariate Techniques
NASA Technical Reports Server (NTRS)
McDonald, G.; Storrie-Lombardi, M.; Nealson, K.
1999-01-01
The amino acid distributions in the Murchison carbonaceous chondrite, Mars meteorite ALH84001, and ice from the Allan Hills region of Antarctica are shown, using a multivariate technique known as Principal Component Analysis (PCA), to be statistically distinct from the average amino acid compostion of 101 terrestrial protein superfamilies.
Friedman, David B
2012-01-01
All quantitative proteomics experiments measure variation between samples. When performing large-scale experiments that involve multiple conditions or treatments, the experimental design should include the appropriate number of individual biological replicates from each condition to enable the distinction between a relevant biological signal from technical noise. Multivariate statistical analyses, such as principal component analysis (PCA), provide a global perspective on experimental variation, thereby enabling the assessment of whether the variation describes the expected biological signal or the unanticipated technical/biological noise inherent in the system. Examples will be shown from high-resolution multivariable DIGE experiments where PCA was instrumental in demonstrating biologically significant variation as well as sample outliers, fouled samples, and overriding technical variation that would not be readily observed using standard univariate tests.
Prognostic value of transformer 2β expression in prostate cancer.
Diao, Yan; Wu, Dong; Dai, Zhijun; Kang, Huafeng; Wang, Ziming; Wang, Xijing
2015-01-01
Deregulation of transformer 2β (Tra2β) has been implicated in several cancers. However, the role of Tra2β expression in prostate cancer (PCa) is unclear. Therefore, this study was to investigate the expression of Tra2β in PCa and evaluated its association with clinicopathological variables and prognosis. Thirty paired fresh PCa samples were analyzed for Tra2β expression by Western blot analysis. Immunohistochemistry (IHC) assay was performed in 160 PCa samples after radical prostatectomy and adjacent non-cancerous tissues. Tra2β protein expression was divided into high expression group and low expression group by IHC. We also investigated the association of Tra2β expression with clinical and pathologic parameters. Kaplan-Meier plots and Cox proportional hazards regression model were used to analyze the association between Tra2β protein expression and prognosis of PCa patients. Our results showed that Tra2β was significantly upregulated in PCa tissues by western blot and IHC. Our data indicated that high expression of Tra2β was significantly associated with lymph node metastasis (P=0.002), clinical stage (P=0.015), preoperative prostate-specific antigen (P=0.003), Gleason score (P=0.001), and biochemical recurrence (P=0.021). High Tra2β expression was a significant predictor of poor biochemical recurrence free survival and overall survival both in univariate and multivariate analysis. We show that Tra2β was significantly upregulated in PCa patients after radical prostatectomy, and multivariate analysis confirmed Tra2β as an independent prognostic factor.
Once upon Multivariate Analyses: When They Tell Several Stories about Biological Evolution.
Renaud, Sabrina; Dufour, Anne-Béatrice; Hardouin, Emilie A; Ledevin, Ronan; Auffray, Jean-Christophe
2015-01-01
Geometric morphometrics aims to characterize of the geometry of complex traits. It is therefore by essence multivariate. The most popular methods to investigate patterns of differentiation in this context are (1) the Principal Component Analysis (PCA), which is an eigenvalue decomposition of the total variance-covariance matrix among all specimens; (2) the Canonical Variate Analysis (CVA, a.k.a. linear discriminant analysis (LDA) for more than two groups), which aims at separating the groups by maximizing the between-group to within-group variance ratio; (3) the between-group PCA (bgPCA) which investigates patterns of between-group variation, without standardizing by the within-group variance. Standardizing within-group variance, as performed in the CVA, distorts the relationships among groups, an effect that is particularly strong if the variance is similarly oriented in a comparable way in all groups. Such shared direction of main morphological variance may occur and have a biological meaning, for instance corresponding to the most frequent standing genetic variation in a population. Here we undertake a case study of the evolution of house mouse molar shape across various islands, based on the real dataset and simulations. We investigated how patterns of main variance influence the depiction of among-group differentiation according to the interpretation of the PCA, bgPCA and CVA. Without arguing about a method performing 'better' than another, it rather emerges that working on the total or between-group variance (PCA and bgPCA) will tend to put the focus on the role of direction of main variance as line of least resistance to evolution. Standardizing by the within-group variance (CVA), by dampening the expression of this line of least resistance, has the potential to reveal other relevant patterns of differentiation that may otherwise be blurred.
Identification of the isomers using principal component analysis (PCA) method
NASA Astrophysics Data System (ADS)
Kepceoǧlu, Abdullah; Gündoǧdu, Yasemin; Ledingham, Kenneth William David; Kilic, Hamdi Sukur
2016-03-01
In this work, we have carried out a detailed statistical analysis for experimental data of mass spectra from xylene isomers. Principle Component Analysis (PCA) was used to identify the isomers which cannot be distinguished using conventional statistical methods for interpretation of their mass spectra. Experiments have been carried out using a linear TOF-MS coupled to a femtosecond laser system as an energy source for the ionisation processes. We have performed experiments and collected data which has been analysed and interpreted using PCA as a multivariate analysis of these spectra. This demonstrates the strength of the method to get an insight for distinguishing the isomers which cannot be identified using conventional mass analysis obtained through dissociative ionisation processes on these molecules. The PCA results dependending on the laser pulse energy and the background pressure in the spectrometers have been presented in this work.
NASA Astrophysics Data System (ADS)
Hristian, L.; Ostafe, M. M.; Manea, L. R.; Apostol, L. L.
2017-06-01
The work pursued the distribution of combed wool fabrics destined to manufacturing of external articles of clothing in terms of the values of durability and physiological comfort indices, using the mathematical model of Principal Component Analysis (PCA). Principal Components Analysis (PCA) applied in this study is a descriptive method of the multivariate analysis/multi-dimensional data, and aims to reduce, under control, the number of variables (columns) of the matrix data as much as possible to two or three. Therefore, based on the information about each group/assortment of fabrics, it is desired that, instead of nine inter-correlated variables, to have only two or three new variables called components. The PCA target is to extract the smallest number of components which recover the most of the total information contained in the initial data.
Descriptive Characteristics of Surface Water Quality in Hong Kong by a Self-Organising Map
An, Yan; Zou, Zhihong; Li, Ranran
2016-01-01
In this study, principal component analysis (PCA) and a self-organising map (SOM) were used to analyse a complex dataset obtained from the river water monitoring stations in the Tolo Harbor and Channel Water Control Zone (Hong Kong), covering the period of 2009–2011. PCA was initially applied to identify the principal components (PCs) among the nonlinear and complex surface water quality parameters. SOM followed PCA, and was implemented to analyze the complex relationships and behaviors of the parameters. The results reveal that PCA reduced the multidimensional parameters to four significant PCs which are combinations of the original ones. The positive and inverse relationships of the parameters were shown explicitly by pattern analysis in the component planes. It was found that PCA and SOM are efficient tools to capture and analyze the behavior of multivariable, complex, and nonlinear related surface water quality data. PMID:26761018
Descriptive Characteristics of Surface Water Quality in Hong Kong by a Self-Organising Map.
An, Yan; Zou, Zhihong; Li, Ranran
2016-01-08
In this study, principal component analysis (PCA) and a self-organising map (SOM) were used to analyse a complex dataset obtained from the river water monitoring stations in the Tolo Harbor and Channel Water Control Zone (Hong Kong), covering the period of 2009-2011. PCA was initially applied to identify the principal components (PCs) among the nonlinear and complex surface water quality parameters. SOM followed PCA, and was implemented to analyze the complex relationships and behaviors of the parameters. The results reveal that PCA reduced the multidimensional parameters to four significant PCs which are combinations of the original ones. The positive and inverse relationships of the parameters were shown explicitly by pattern analysis in the component planes. It was found that PCA and SOM are efficient tools to capture and analyze the behavior of multivariable, complex, and nonlinear related surface water quality data.
Keenan, Michael R; Smentkowski, Vincent S; Ulfig, Robert M; Oltman, Edward; Larson, David J; Kelly, Thomas F
2011-06-01
We demonstrate for the first time that multivariate statistical analysis techniques can be applied to atom probe tomography data to estimate the chemical composition of a sample at the full spatial resolution of the atom probe in three dimensions. Whereas the raw atom probe data provide the specific identity of an atom at a precise location, the multivariate results can be interpreted in terms of the probabilities that an atom representing a particular chemical phase is situated there. When aggregated to the size scale of a single atom (∼0.2 nm), atom probe spectral-image datasets are huge and extremely sparse. In fact, the average spectrum will have somewhat less than one total count per spectrum due to imperfect detection efficiency. These conditions, under which the variance in the data is completely dominated by counting noise, test the limits of multivariate analysis, and an extensive discussion of how to extract the chemical information is presented. Efficient numerical approaches to performing principal component analysis (PCA) on these datasets, which may number hundreds of millions of individual spectra, are put forward, and it is shown that PCA can be computed in a few seconds on a typical laptop computer.
NASA Astrophysics Data System (ADS)
Chen, Long; Wang, Yue; Liu, Nenrong; Lin, Duo; Weng, Cuncheng; Zhang, Jixue; Zhu, Lihuan; Chen, Weisheng; Chen, Rong; Feng, Shangyuan
2013-06-01
The diagnostic capability of using tissue intrinsic micro-Raman signals to obtain biochemical information from human esophageal tissue is presented in this paper. Near-infrared micro-Raman spectroscopy combined with multivariate analysis was applied for discrimination of esophageal cancer tissue from normal tissue samples. Micro-Raman spectroscopy measurements were performed on 54 esophageal cancer tissues and 55 normal tissues in the 400-1750 cm-1 range. The mean Raman spectra showed significant differences between the two groups. Tentative assignments of the Raman bands in the measured tissue spectra suggested some changes in protein structure, a decrease in the relative amount of lactose, and increases in the percentages of tryptophan, collagen and phenylalanine content in esophageal cancer tissue as compared to those of a normal subject. The diagnostic algorithms based on principal component analysis (PCA) and linear discriminate analysis (LDA) achieved a diagnostic sensitivity of 87.0% and specificity of 70.9% for separating cancer from normal esophageal tissue samples. The result demonstrated that near-infrared micro-Raman spectroscopy combined with PCA-LDA analysis could be an effective and sensitive tool for identification of esophageal cancer.
NASA Astrophysics Data System (ADS)
Rojek, Barbara; Wesolowski, Marek; Suchacz, Bogdan
2013-12-01
In the paper infrared (IR) spectroscopy and multivariate exploration techniques: principal component analysis (PCA) and cluster analysis (CA) were applied as supportive methods for the detection of physicochemical incompatibilities between baclofen and excipients. In the course of research, the most useful rotational strategy in PCA proved to be varimax normalized, while in CA Ward's hierarchical agglomeration with Euclidean distance measure enabled to yield the most interpretable results. Chemometrical calculations confirmed the suitability of PCA and CA as the auxiliary methods for interpretation of infrared spectra in order to recognize whether compatibilities or incompatibilities between active substance and excipients occur. On the basis of IR spectra and the results of PCA and CA it was possible to demonstrate that the presence of lactose, β-cyclodextrin and meglumine in binary mixtures produce interactions with baclofen. The results were verified using differential scanning calorimetry, differential thermal analysis, thermogravimetry/differential thermogravimetry and X-ray powder diffraction analyses.
Principal components analysis in clinical studies.
Zhang, Zhongheng; Castelló, Adela
2017-09-01
In multivariate analysis, independent variables are usually correlated to each other which can introduce multicollinearity in the regression models. One approach to solve this problem is to apply principal components analysis (PCA) over these variables. This method uses orthogonal transformation to represent sets of potentially correlated variables with principal components (PC) that are linearly uncorrelated. PCs are ordered so that the first PC has the largest possible variance and only some components are selected to represent the correlated variables. As a result, the dimension of the variable space is reduced. This tutorial illustrates how to perform PCA in R environment, the example is a simulated dataset in which two PCs are responsible for the majority of the variance in the data. Furthermore, the visualization of PCA is highlighted.
Portable XRF and principal component analysis for bill characterization in forensic science.
Appoloni, C R; Melquiades, F L
2014-02-01
Several modern techniques have been applied to prevent counterfeiting of money bills. The objective of this study was to demonstrate the potential of Portable X-ray Fluorescence (PXRF) technique and the multivariate analysis method of Principal Component Analysis (PCA) for classification of bills in order to use it in forensic science. Bills of Dollar, Euro and Real (Brazilian currency) were measured directly at different colored regions, without any previous preparation. Spectra interpretation allowed the identification of Ca, Ti, Fe, Cu, Sr, Y, Zr and Pb. PCA analysis separated the bills in three groups and subgroups among Brazilian currency. In conclusion, the samples were classified according to its origin identifying the elements responsible for differentiation and basic pigment composition. PXRF allied to multivariate discriminate methods is a promising technique for rapid and no destructive identification of false bills in forensic science. Copyright © 2013 Elsevier Ltd. All rights reserved.
Perdonà, Sisto; Marino, Ada; Mazzarella, Claudia; Perruolo, Giuseppe; D’Esposito, Vittoria; Cosimato, Vincenzo; Buonerba, Carlo; Di Lorenzo, Giuseppe; Musi, Gennaro; De Cobelli, Ottavio; Chun, Felix K.; Terracciano, Daniela
2013-01-01
Many efforts to reduce prostate specific antigen (PSA) overdiagnosis and overtreatment have been made. To this aim, Prostate Health Index (Phi) and Prostate Cancer Antigen 3 (PCA3) have been proposed as new more specific biomarkers. We evaluated the ability of phi and PCA3 to identify prostate cancer (PCa) at initial prostate biopsy in men with total PSA range of 2–10 ng/ml. The performance of phi and PCA3 were evaluated in 300 patients undergoing first prostate biopsy. ROC curve analyses tested the accuracy (AUC) of phi and PCA3 in predicting PCa. Decision curve analyses (DCA) were used to compare the clinical benefit of the two biomarkers. We found that the AUC value of phi (0.77) was comparable to those of %p2PSA (0.76) and PCA3 (0.73) with no significant differences in pairwise comparison (%p2PSA vs phi p = 0.673, %p2PSA vs. PCA3 p = 0.417 and phi vs. PCA3 p = 0.247). These three biomarkers significantly outperformed fPSA (AUC = 0.60), % fPSA (AUC = 0.62) and p2PSA (AUC = 0.63). At DCA, phi and PCA3 exhibited a very close net benefit profile until the threshold probability of 25%, then phi index showed higher net benefit than PCA3. Multivariable analysis showed that the addition of phi and PCA3 to the base multivariable model (age, PSA, %fPSA, DRE, prostate volume) increased predictive accuracy, whereas no model improved single biomarker performance. Finally we showed that subjects with active surveillance (AS) compatible cancer had significantly lower phi and PCA3 values (p<0.001 and p = 0.01, respectively). In conclusion, both phi and PCA3 comparably increase the accuracy in predicting the presence of PCa in total PSA range 2–10 ng/ml at initial biopsy, outperforming currently used %fPSA. PMID:23861782
Ferro, Matteo; Bruzzese, Dario; Perdonà, Sisto; Marino, Ada; Mazzarella, Claudia; Perruolo, Giuseppe; D'Esposito, Vittoria; Cosimato, Vincenzo; Buonerba, Carlo; Di Lorenzo, Giuseppe; Musi, Gennaro; De Cobelli, Ottavio; Chun, Felix K; Terracciano, Daniela
2013-01-01
Many efforts to reduce prostate specific antigen (PSA) overdiagnosis and overtreatment have been made. To this aim, Prostate Health Index (Phi) and Prostate Cancer Antigen 3 (PCA3) have been proposed as new more specific biomarkers. We evaluated the ability of phi and PCA3 to identify prostate cancer (PCa) at initial prostate biopsy in men with total PSA range of 2-10 ng/ml. The performance of phi and PCA3 were evaluated in 300 patients undergoing first prostate biopsy. ROC curve analyses tested the accuracy (AUC) of phi and PCA3 in predicting PCa. Decision curve analyses (DCA) were used to compare the clinical benefit of the two biomarkers. We found that the AUC value of phi (0.77) was comparable to those of %p2PSA (0.76) and PCA3 (0.73) with no significant differences in pairwise comparison (%p2PSA vs phi p = 0.673, %p2PSA vs. PCA3 p = 0.417 and phi vs. PCA3 p = 0.247). These three biomarkers significantly outperformed fPSA (AUC = 0.60), % fPSA (AUC = 0.62) and p2PSA (AUC = 0.63). At DCA, phi and PCA3 exhibited a very close net benefit profile until the threshold probability of 25%, then phi index showed higher net benefit than PCA3. Multivariable analysis showed that the addition of phi and PCA3 to the base multivariable model (age, PSA, %fPSA, DRE, prostate volume) increased predictive accuracy, whereas no model improved single biomarker performance. Finally we showed that subjects with active surveillance (AS) compatible cancer had significantly lower phi and PCA3 values (p<0.001 and p = 0.01, respectively). In conclusion, both phi and PCA3 comparably increase the accuracy in predicting the presence of PCa in total PSA range 2-10 ng/ml at initial biopsy, outperforming currently used %fPSA.
Tianniam, Sukanda; Tarachiwin, Lucksanaporn; Bamba, Takeshi; Kobayashi, Akio; Fukusaki, Eiichiro
2008-06-01
Gas chromatography time-of-flight mass spectrometry was applied to elucidate the profiling of primary metabolites and to evaluate the differences between quality differences in Angelica acutiloba (or Yamato-toki) roots through the utilization of multivariate pattern recognition-principal component analysis (PCA). Twenty-two metabolites consisting of sugars, amino and organic acids were identified. PCA analysis successfully discriminated the good, the moderate and the bad quality Yamato-toki roots in accordance to their cultivation areas. The results signified two reducing sugars, fructose and glucose being the most accumulated in the bad quality, whereas higher quantity of phosphoric acid, proline, malic acid and citric acid were found in the good and the moderate quality toki roots. PCA was also effective in discriminating samples derive from different cultivars. Yamato-toki roots with the moderate quality were compared by means of PCA, and the results illustrated good discrimination which was influenced most by malic acid. Overall, this study demonstrated that metabolomics technique is accurate and efficient in determining the quality differences in Yamato-toki roots, and has a potential to be a superior and suitable method to assess the quality of this medicinal plant.
NASA Astrophysics Data System (ADS)
Huang, Shaohua; Wang, Lan; Chen, Weisheng; Feng, Shangyuan; Lin, Juqiang; Huang, Zufang; Chen, Guannan; Li, Buhong; Chen, Rong
2014-11-01
Non-invasive esophagus cancer detection based on urine surface-enhanced Raman spectroscopy (SERS) analysis was presented. Urine SERS spectra were measured on esophagus cancer patients (n = 56) and healthy volunteers (n = 36) for control analysis. Tentative assignments of the urine SERS spectra indicated some interesting esophagus cancer-specific biomolecular changes, including a decrease in the relative content of urea and an increase in the percentage of uric acid in the urine of esophagus cancer patients compared to that of healthy subjects. Principal component analysis (PCA) combined with linear discriminant analysis (LDA) was employed to analyze and differentiate the SERS spectra between normal and esophagus cancer urine. The diagnostic algorithms utilizing a multivariate analysis method achieved a diagnostic sensitivity of 89.3% and specificity of 83.3% for separating esophagus cancer samples from normal urine samples. These results from the explorative work suggested that silver nano particle-based urine SERS analysis coupled with PCA-LDA multivariate analysis has potential for non-invasive detection of esophagus cancer.
Wang, Jian; Zhu, Jinmao; Huang, RuZhu; Yang, YuSheng
2012-07-01
We explored the rapid qualitative analysis of wheat cultivars with good lodging resistances by Fourier transform infrared resonance (FTIR) spectroscopy and multivariate statistical analysis. FTIR imaging showing that wheat stem cell walls were mainly composed of cellulose, pectin, protein, and lignin. Principal components analysis (PCA) was used to eliminate multicollinearity among multiple peak absorptions. PCA revealed the developmental internodes of wheat stems could be distributed from low to high along the load of the second principal component, which was consistent with the corresponding bands of cellulose in the FTIR spectra of the cell walls. Furthermore, four distinct stem populations could also be identified by spectral features related to their corresponding mechanical properties via PCA and cluster analysis. Histochemical staining of four types of wheat stems with various abilities to resist lodging revealed that cellulose contributed more than lignin to the ability to resist lodging. These results strongly suggested that the main cell wall component responsible for these differences was cellulose. Therefore, the combination of multivariate analysis and FTIR could rapidly screen wheat cultivars with good lodging resistance. Furthermore, the application of these methods to a much wider range of cultivars of unknown mechanical properties promises to be of interest.
Steingass, Christof Björn; Jutzi, Manfred; Müller, Jenny; Carle, Reinhold; Schmarr, Hans-Georg
2015-03-01
Ripening-dependent changes of pineapple volatiles were studied in a nontargeted profiling analysis. Volatiles were isolated via headspace solid phase microextraction and analyzed by comprehensive 2D gas chromatography and mass spectrometry (HS-SPME-GC×GC-qMS). Profile patterns presented in the contour plots were evaluated applying image processing techniques and subsequent multivariate statistical data analysis. Statistical methods comprised unsupervised hierarchical cluster analysis (HCA) and principal component analysis (PCA) to classify the samples. Supervised partial least squares discriminant analysis (PLS-DA) and partial least squares (PLS) regression were applied to discriminate different ripening stages and describe the development of volatiles during postharvest storage, respectively. Hereby, substantial chemical markers allowing for class separation were revealed. The workflow permitted the rapid distinction between premature green-ripe pineapples and postharvest-ripened sea-freighted fruits. Volatile profiles of fully ripe air-freighted pineapples were similar to those of green-ripe fruits postharvest ripened for 6 days after simulated sea freight export, after PCA with only two principal components. However, PCA considering also the third principal component allowed differentiation between air-freighted fruits and the four progressing postharvest maturity stages of sea-freighted pineapples.
Perdonà, Sisto; Bruzzese, Dario; Ferro, Matteo; Autorino, Riccardo; Marino, Ada; Mazzarella, Claudia; Perruolo, Giuseppe; Longo, Michele; Spinelli, Rosa; Di Lorenzo, Giuseppe; Oliva, Andrea; De Sio, Marco; Damiano, Rocco; Altieri, Vincenzo; Terracciano, Daniela
2013-02-15
Prostate health index (phi) and prostate cancer antigen 3 (PCA3) have been recently proposed as novel biomarkers for prostate cancer (PCa). We assessed the diagnostic performance of these biomarkers, alone or in combination, in men undergoing first prostate biopsy for suspicion of PCa. One hundred sixty male subjects were enrolled in this prospective observational study. PSA molecular forms, phi index (Beckman coulter immunoassay), PCA3 score (Progensa PCA3 assay), and other established biomarkers (tPSA, fPSA, and %fPSA) were assessed before patients underwent a 18-core first prostate biopsy. The discriminating ability between PCa-negative and PCa-positive biopsies of Beckman coulter phi and PCA3 score and other used biomarkers were determined. One hundred sixty patients met inclusion criteria. %p2PSA (p2PSA/fPSA × 100), phi and PCA3 were significantly higher in patients with PCa compared to PCa-negative group (median values: 1.92 vs. 1.55, 49.97 vs. 36.84, and 50 vs. 32, respectively, P ≤ 0.001). ROC curve analysis showed that %p2PSA, phi, and PCA3 are good indicator of malignancy (AUCs = 0.68, 0.71, and 0.66, respectively). A multivariable logistic regression model consisting of both the phi index and PCA3 score allowed to reach an overall diagnostic accuracy of 0.77. Decision curve analysis revealed that this "combined" marker achieved the highest net benefit over the examined range of the threshold probability. phi and PCA3 showed no significant difference in the ability to predict PCa diagnosis in men undergoing first prostate biopsy. However, diagnostic performance is significantly improved by combining phi and PCA3. Copyright © 2012 Wiley Periodicals, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Koch, C.D.; Pirkle, F.L.; Schmidt, J.S.
1981-01-01
A Principal Components Analysis (PCA) has been written to aid in the interpretation of multivariate aerial radiometric data collected by the US Department of Energy (DOE) under the National Uranium Resource Evaluation (NURE) program. The variations exhibited by these data have been reduced and classified into a number of linear combinations by using the PCA program. The PCA program then generates histograms and outlier maps of the individual variates. Black and white plots can be made on a Calcomp plotter by the application of follow-up programs. All programs referred to in this guide were written for a DEC-10. From thismore » analysis a geologist may begin to interpret the data structure. Insight into geological processes underlying the data may be obtained.« less
Multivariate analysis of molecular and morphological diversity in fig (Ficus carica L.)
USDA-ARS?s Scientific Manuscript database
Genetic polymorphism across 15 microsatellite loci among 194 fig accessions including Common, Smyrna, San Pedro, and Caprifig were analyzed using a cluster analysis (CA) and the principal components analysis (PCA). The collection was moderately variable with observed number of alleles per locus rang...
Indelicato, Serena; Bongiorno, David; Tuzzolino, Nicola; Mannino, Maria Rosaria; Muscarella, Rosalia; Fradella, Pasquale; Gargano, Maria Elena; Nicosia, Salvatore; Ceraulo, Leopoldo
2018-03-14
Multivariate analysis was performed on a large data set of groundwater and leachate samples collected during 9 years of operation of the Bellolampo municipal solid waste landfill (located above Palermo, Italy). The aim was to obtain the most likely correlations among the data. The analysis results are presented. Groundwater samples were collected in the period 2004-2013, whereas the leachate analysis refers to the period 2006-2013. For groundwater, statistical data evaluation revealed notable differences among the samples taken from the numerous wells located around the landfill. Characteristic parameters revealed by principal component analysis (PCA) were more deeply investigated, and corresponding thematic maps were drawn. The composition of the leachate was also thoroughly investigated. Several chemical macro-descriptors were calculated, and the results are presented. A comparison of PCA results for the leachate and groundwater data clearly reveals that the groundwater's main components substantially differ from those of the leachate. This outcome strongly suggests excluding leachate permeation through the multiple landfill lining.
NASA Astrophysics Data System (ADS)
He, Shixuan; Xie, Wanyi; Zhang, Wei; Zhang, Liqun; Wang, Yunxia; Liu, Xiaoling; Liu, Yulong; Du, Chunlei
2015-02-01
A novel strategy which combines iteratively cubic spline fitting baseline correction method with discriminant partial least squares qualitative analysis is employed to analyze the surface enhanced Raman scattering (SERS) spectroscopy of banned food additives, such as Sudan I dye and Rhodamine B in food, Malachite green residues in aquaculture fish. Multivariate qualitative analysis methods, using the combination of spectra preprocessing iteratively cubic spline fitting (ICSF) baseline correction with principal component analysis (PCA) and discriminant partial least squares (DPLS) classification respectively, are applied to investigate the effectiveness of SERS spectroscopy for predicting the class assignments of unknown banned food additives. PCA cannot be used to predict the class assignments of unknown samples. However, the DPLS classification can discriminate the class assignment of unknown banned additives using the information of differences in relative intensities. The results demonstrate that SERS spectroscopy combined with ICSF baseline correction method and exploratory analysis methodology DPLS classification can be potentially used for distinguishing the banned food additives in field of food safety.
Effect of Statins and Anticoagulants on Prostate Cancer Aggressiveness
DOE Office of Scientific and Technical Information (OSTI.GOV)
Alizadeh, Moein; Sylvestre, Marie-Pierre; Zilli, Thomas
2012-07-15
Purpose: Statins and anticoagulants (ACs) have both been associated with a less-aggressive prostate cancer (PCa) and a better outcome after treatment of localized PCa. The results of these studies might have been confounded because patients might often take both medications. We examined their respective influence on PCa aggressiveness at initial diagnosis. Materials and Methods: We analyzed 381 patients treated with either external beam radiotherapy or brachytherapy for low-risk (n = 152), intermediate-risk (n = 142), or high-risk (n = 87) localized PCa. Univariate and multivariate logistic regression analyses were used to investigate an association between these drug classes and prostatemore » cancer aggressiveness. We tested whether the concomitant use of statins and ACs had a different effect than that of either AC or statin use alone. Results: Of the 381 patients, 172 (45.1%) were taking statins and 141 (37.0%) ACs; 105 patients (27.6%) used both. On univariate analysis, the statin and AC users were associated with the prostate-specific antigen (PSA) level (p = .017) and National Comprehensive Cancer Network risk group (p = .0022). On multivariate analysis, statin use was associated with a PSA level <10 ng/mL (odds ratio, 2.9; 95% confidence interval, 1.3-6.8; p = .012) and a PSA level >20 ng/mL (odds ratio, 0.29; 95% confidence interval, 0.08-0.83; p = .03). The use of ACs was associated with a PSA level >20 ng/mL (odds ratio, 0.13; 95% confidence interval, 0.02-0.59, p = .02). Conclusion: Both AC and statins have an effect on PCa aggressiveness, with statins having a more stringent relationship with the PSA level, highlighting the importance of considering statin use in studies of PCa aggressiveness.« less
Meller, Sebastian; Zipfel, Lisa; Gevensleben, Heidrun; Dietrich, Jörn; Ellinger, Jörg; Majores, Michael; Stein, Johannes; Sailer, Verena; Jung, Maria; Kristiansen, Glen; Dietrich, Dimo
2016-12-01
Molecular biomarkers may facilitate the distinction between aggressive and clinically insignificant prostate cancer (PCa), thereby potentially aiding individualized treatment. We analyzed cysteine dioxygenase 1 (CDO1) promoter methylation and mRNA expression in order to evaluate its potential as prognostic biomarker. CDO1 methylation and mRNA expression were determined in cell lines and formalin-fixed paraffin-embedded prostatectomy specimens from a first cohort of 300 PCa patients using methylation-specific qPCR and qRT-PCR. Univariate and multivariate Cox proportional hazards and Kaplan-Meier analyses were performed to evaluate biochemical recurrence (BCR)-free survival. Results were confirmed in an independent second cohort comprising 498 PCa cases. Methylation and mRNA expression data from the second cohort were generated by The Cancer Genome Atlas (TCGA) Research Network by means of Infinium HumanMethylation450 BeadChip and RNASeq. CDO1 was hypermethylated in PCa compared to normal adjacent tissues and benign prostatic hyperplasia (P < 0.001) and was associated with reduced gene expression (ρ = -0.91, P = 0.005). Using two different methodologies for methylation quantification, high CDO1 methylation as continuous variable was associated with BCR in univariate analysis (first cohort: HR = 1.02, P = 0.002, 95% CI [1.01-1.03]; second cohort: HR = 1.02, P = 0.032, 95% CI [1.00-1.03]) but failed to reach statistical significance in multivariate analysis. CDO1 promoter methylation is involved in gene regulation and is a potential prognostic biomarker for BCR-free survival in PCa patients following radical prostatectomy. Further studies are needed to validate CDO1 methylation assays and to evaluate the clinical utility of CDO1 methylation for the management of PCa.
Geurts, Brigitte P; Neerincx, Anne H; Bertrand, Samuel; Leemans, Manja A A P; Postma, Geert J; Wolfender, Jean-Luc; Cristescu, Simona M; Buydens, Lutgarde M C; Jansen, Jeroen J
2017-04-22
Revealing the biochemistry associated to micro-organismal interspecies interactions is highly relevant for many purposes. Each pathogen has a characteristic metabolic fingerprint that allows identification based on their unique multivariate biochemistry. When pathogen species come into mutual contact, their co-culture will display a chemistry that may be attributed both to mixing of the characteristic chemistries of the mono-cultures and to competition between the pathogens. Therefore, investigating pathogen development in a polymicrobial environment requires dedicated chemometric methods to untangle and focus upon these sources of variation. The multivariate data analysis method Projected Orthogonalised Chemical Encounter Monitoring (POCHEMON) is dedicated to highlight metabolites characteristic for the interaction of two micro-organisms in co-culture. However, this approach is currently limited to a single time-point, while development of polymicrobial interactions may be highly dynamic. A well-known multivariate implementation of Analysis of Variance (ANOVA) uses Principal Component Analysis (ANOVA-PCA). This allows the overall dynamics to be separated from the pathogen-specific chemistry to analyse the contributions of both aspects separately. For this reason, we propose to integrate ANOVA-PCA with the POCHEMON approach to disentangle the pathogen dynamics and the specific biochemistry in interspecies interactions. Two complementary case studies show great potential for both liquid and gas chromatography - mass spectrometry to reveal novel information on chemistry specific to interspecies interaction during pathogen development. Copyright © 2017 The Author(s). Published by Elsevier B.V. All rights reserved.
Li, Jian; Ma, Guowei; Ma, Lin; Bao, Xiaolin; Li, Liping; Zhao, Qian
2018-01-01
Effects of 1-methylcyclopropene (1-MCP) and vacuum precooling on quality and antioxidant properties of blackberries (Rubus spp.) were evaluated using one-way analysis of variance, principal component analysis (PCA), partial least squares (PLS), and path analysis. Results showed that the activities of antioxidant enzymes were enhanced by both 1-MCP treatment and vacuum precooling. PCA could discriminate 1-MCP treated fruit and the vacuum precooled fruit and showed that the radical-scavenging activities in vacuum precooled fruit were higher than those in 1-MCP treated fruit. The scores of PCA showed that H2O2 content was the most important variables of blackberry fruit. PLSR results showed that peroxidase (POD) activity negatively correlated with H2O2 content. The results of path coefficient analysis indicated that glutathione (GSH) also had an indirect effect on H2O2 content. PMID:29487622
NASA Astrophysics Data System (ADS)
Chatterjee, Shiladitya; Singh, Bhupinder; Diwan, Anubhav; Lee, Zheng Rong; Engelhard, Mark H.; Terry, Jeff; Tolley, H. Dennis; Gallagher, Neal B.; Linford, Matthew R.
2018-03-01
X-ray photoelectron spectroscopy (XPS) and time-of-flight secondary ion mass spectrometry (ToF-SIMS) are much used analytical techniques that provide information about the outermost atomic and molecular layers of materials. In this work, we discuss the application of multivariate spectral techniques, including principal component analysis (PCA) and multivariate curve resolution (MCR), to the analysis of XPS and ToF-SIMS depth profiles. Multivariate analyses often provide insight into data sets that is not easily obtained in a univariate fashion. Pattern recognition entropy (PRE), which has its roots in Shannon's information theory, is also introduced. This approach is not the same as the mutual information/entropy approaches sometimes used in data processing. A discussion of the theory of each technique is presented. PCA, MCR, and PRE are applied to four different data sets obtained from: a ToF-SIMS depth profile through ca. 100 nm of plasma polymerized C3F6 on Si, a ToF-SIMS depth profile through ca. 100 nm of plasma polymerized PNIPAM (poly (N-isopropylacrylamide)) on Si, an XPS depth profile through a film of SiO2 on Si, and an XPS depth profile through a film of Ta2O5 on Ta. PCA, MCR, and PRE reveal the presence of interfaces in the films, and often indicate that the first few scans in the depth profiles are different from those that follow. PRE and backward difference PRE provide this information in a straightforward fashion. Rises in the PRE signals at interfaces suggest greater complexity to the corresponding spectra. Results from PCA, especially for the higher principal components, were sometimes difficult to understand. MCR analyses were generally more interpretable.
Corvucci, Francesca; Nobili, Lara; Melucci, Dora; Grillenzoni, Francesca-Vittoria
2015-02-15
Honey traceability to food quality is required by consumers and food control institutions. Melissopalynologists traditionally use percentages of nectariferous pollens to discriminate the botanical origin and the entire pollen spectrum (presence/absence, type and quantities and association of some pollen types) to determinate the geographical origin of honeys. To improve melissopalynological routine analysis, principal components analysis (PCA) was used. A remarkable and innovative result was that the most significant pollens for the traditional discrimination of the botanical and geographical origin of honeys were the same as those individuated with the chemometric model. The reliability of assignments of samples to honey classes was estimated through explained variance (85%). This confirms that the chemometric model properly describes the melissopalynological data. With the aim to improve honey discrimination, FT-microRaman spectrography and multivariate analysis were also applied. Well performing PCA models and good agreement with known classes were achieved. Encouraging results were obtained for botanical discrimination. Copyright © 2014 Elsevier Ltd. All rights reserved.
Shoji, Koichi; Teishima, Jun; Hayashi, Tetsutaro; Shinmei, Shunsuke; Akita, Tomoyuki; Sentani, Kazuhiro; Takeshima, Yukio; Arihiro, Koji; Tanaka, Junko; Yasui, Wataru; Matsubara, Akio
2017-01-01
An atomic bomb (A-bomb) was dropped on Hiroshima on 6th August 1945. Although numerous studies have investigated cancer incidence and mortality among A-bomb survivors, only a small number have addressed urological cancer in these survivors. The aim of the present study was to investigate the clinicopathological features of prostate cancer (PCa) in A-bomb survivors. The clinicopathological features and prognosis of PCa were retrospectively reviewed in 212 survivors and 595 control patients between November 1996 and December 2010. The histopathological and clinical outcomes of surgical treatment of PCa were also evaluated in 69 survivors and 162 control patients. Despite the higher age at diagnosis compared with the control group (P=0.0031), survivors were more likely to have been diagnosed with PCa from a health check compared with the control group (P<0.0001). As a consequence, the survivors were found to exhibit metastasis significantly less frequently (199/212, 93.9%) compared with the control patients (521/595, 87.6%; P=0.0076). Prognosis in the two groups was examined, subsequent to a mean length of follow-up of 44 months. Overall survival (OS) and PCa-specific survival (CS) were similar between the two groups (OS, P=0.2196; CS, P=0.1017). A-bomb exposure was not found to be an independent predictor for prognosis by multivariate analysis (OS, P=0.7800; CS, P=0.8688). The clinicopathological features of patients who underwent a prostatectomy were similar except for the diagnosis opportunity between the two groups. Progression-free survival rates were similar between the two groups (P=0.5630). A-bomb exposure was not a significant and independent predictor for worsening of progression-free prognosis by multivariate analysis (P=0.3763). A-bomb exposure does not appear to exert deleterious effects on the biological aggressiveness of PCa and the prognosis of patients with PCa. PMID:28693168
Shoji, Koichi; Teishima, Jun; Hayashi, Tetsutaro; Shinmei, Shunsuke; Akita, Tomoyuki; Sentani, Kazuhiro; Takeshima, Yukio; Arihiro, Koji; Tanaka, Junko; Yasui, Wataru; Matsubara, Akio
2017-07-01
An atomic bomb (A-bomb) was dropped on Hiroshima on 6th August 1945. Although numerous studies have investigated cancer incidence and mortality among A-bomb survivors, only a small number have addressed urological cancer in these survivors. The aim of the present study was to investigate the clinicopathological features of prostate cancer (PCa) in A-bomb survivors. The clinicopathological features and prognosis of PCa were retrospectively reviewed in 212 survivors and 595 control patients between November 1996 and December 2010. The histopathological and clinical outcomes of surgical treatment of PCa were also evaluated in 69 survivors and 162 control patients. Despite the higher age at diagnosis compared with the control group (P=0.0031), survivors were more likely to have been diagnosed with PCa from a health check compared with the control group (P<0.0001). As a consequence, the survivors were found to exhibit metastasis significantly less frequently (199/212, 93.9%) compared with the control patients (521/595, 87.6%; P=0.0076). Prognosis in the two groups was examined, subsequent to a mean length of follow-up of 44 months. Overall survival (OS) and PCa-specific survival (CS) were similar between the two groups (OS, P=0.2196; CS, P=0.1017). A-bomb exposure was not found to be an independent predictor for prognosis by multivariate analysis (OS, P=0.7800; CS, P=0.8688). The clinicopathological features of patients who underwent a prostatectomy were similar except for the diagnosis opportunity between the two groups. Progression-free survival rates were similar between the two groups (P=0.5630). A-bomb exposure was not a significant and independent predictor for worsening of progression-free prognosis by multivariate analysis (P=0.3763). A-bomb exposure does not appear to exert deleterious effects on the biological aggressiveness of PCa and the prognosis of patients with PCa.
Kalegowda, Yogesh; Harmer, Sarah L
2012-03-20
Time-of-flight secondary ion mass spectrometry (TOF-SIMS) spectra of mineral samples are complex, comprised of large mass ranges and many peaks. Consequently, characterization and classification analysis of these systems is challenging. In this study, different chemometric and statistical data evaluation methods, based on monolayer sensitive TOF-SIMS data, have been tested for the characterization and classification of copper-iron sulfide minerals (chalcopyrite, chalcocite, bornite, and pyrite) at different flotation pulp conditions (feed, conditioned feed, and Eh modified). The complex mass spectral data sets were analyzed using the following chemometric and statistical techniques: principal component analysis (PCA); principal component-discriminant functional analysis (PC-DFA); soft independent modeling of class analogy (SIMCA); and k-Nearest Neighbor (k-NN) classification. PCA was found to be an important first step in multivariate analysis, providing insight into both the relative grouping of samples and the elemental/molecular basis for those groupings. For samples exposed to oxidative conditions (at Eh ~430 mV), each technique (PCA, PC-DFA, SIMCA, and k-NN) was found to produce excellent classification. For samples at reductive conditions (at Eh ~ -200 mV SHE), k-NN and SIMCA produced the most accurate classification. Phase identification of particles that contain the same elements but a different crystal structure in a mixed multimetal mineral system has been achieved.
Vedder, Moniek M; de Bekker-Grob, Esther W; Lilja, Hans G; Vickers, Andrew J; van Leenders, Geert J L H; Steyerberg, Ewout W; Roobol, Monique J
2014-12-01
Prostate-specific antigen (PSA) testing has limited accuracy for the early detection of prostate cancer (PCa). To assess the value added by percentage of free to total PSA (%fPSA), prostate cancer antigen 3 (PCA3), and a kallikrein panel (4k-panel) to the European Randomised Study of Screening for Prostate Cancer (ERSPC) multivariable prediction models: risk calculator (RC) 4, including transrectal ultrasound, and RC 4 plus digital rectal examination (4+DRE) for prescreened men. Participants were invited for rescreening between October 2007 and February 2009 within the Dutch part of the ERSPC study. Biopsies were taken in men with a PSA level ≥3.0 ng/ml or a PCA3 score ≥10. Additional analyses of the 4k-panel were done on serum samples. Outcome was defined as PCa detectable by sextant biopsy. Receiver operating characteristic curve and decision curve analyses were performed to compare the predictive capabilities of %fPSA, PCA3, 4k-panel, the ERSPC RCs, and their combinations in logistic regression models. PCa was detected in 119 of 708 men. The %fPSA did not perform better univariately or added to the RCs compared with the RCs alone. In 202 men with an elevated PSA, the 4k-panel discriminated better than PCA3 when modelled univariately (area under the curve [AUC]: 0.78 vs. 0.62; p=0.01). The multivariable models with PCA3 or the 4k-panel were equivalent (AUC: 0.80 for RC 4+DRE). In the total population, PCA3 discriminated better than the 4k-panel (univariate AUC: 0.63 vs. 0.56; p=0.05). There was no statistically significant difference between the multivariable model with PCA3 (AUC: 0.73) versus the model with the 4k-panel (AUC: 0.71; p=0.18). The multivariable model with PCA3 performed better than the reference model (0.73 vs. 0.70; p=0.02). Decision curves confirmed these patterns, although numbers were small. Both PCA3 and, to a lesser extent, a 4k-panel have added value to the DRE-based ERSPC RC in detecting PCa in prescreened men. We studied the added value of novel biomarkers to previously developed risk prediction models for prostate cancer. We found that inclusion of these biomarkers resulted in an increase in predictive ability. Copyright © 2014. Published by Elsevier B.V.
Söhn, Matthias; Alber, Markus; Yan, Di
2007-09-01
The variability of dose-volume histogram (DVH) shapes in a patient population can be quantified using principal component analysis (PCA). We applied this to rectal DVHs of prostate cancer patients and investigated the correlation of the PCA parameters with late bleeding. PCA was applied to the rectal wall DVHs of 262 patients, who had been treated with a four-field box, conformal adaptive radiotherapy technique. The correlated changes in the DVH pattern were revealed as "eigenmodes," which were ordered by their importance to represent data set variability. Each DVH is uniquely characterized by its principal components (PCs). The correlation of the first three PCs and chronic rectal bleeding of Grade 2 or greater was investigated with uni- and multivariate logistic regression analyses. Rectal wall DVHs in four-field conformal RT can primarily be represented by the first two or three PCs, which describe approximately 94% or 96% of the DVH shape variability, respectively. The first eigenmode models the total irradiated rectal volume; thus, PC1 correlates to the mean dose. Mode 2 describes the interpatient differences of the relative rectal volume in the two- or four-field overlap region. Mode 3 reveals correlations of volumes with intermediate doses ( approximately 40-45 Gy) and volumes with doses >70 Gy; thus, PC3 is associated with the maximal dose. According to univariate logistic regression analysis, only PC2 correlated significantly with toxicity. However, multivariate logistic regression analysis with the first two or three PCs revealed an increased probability of bleeding for DVHs with more than one large PC. PCA can reveal the correlation structure of DVHs for a patient population as imposed by the treatment technique and provide information about its relationship to toxicity. It proves useful for augmenting normal tissue complication probability modeling approaches.
Carlesi, Serena; Ricci, Marilena; Cucci, Costanza; La Nasa, Jacopo; Lofrumento, Cristiana; Picollo, Marcello; Becucci, Maurizio
2015-07-01
This work explores the application of chemometric techniques to the analysis of lipidic paint binders (i.e., drying oils) by means of Raman and near-infrared spectroscopy. These binders have been widely used by artists throughout history, both individually and in mixtures. We prepared various model samples of the pure binders (linseed, poppy seed, and walnut oils) obtained from different manufacturers. These model samples were left to dry and then characterized by Raman and reflectance near-infrared spectroscopy. Multivariate analysis was performed by applying principal component analysis (PCA) on the first derivative of the corresponding Raman spectra (1800-750 cm(-1)), near-infrared spectra (6000-3900 cm(-1)), and their combination to test whether spectral differences could enable samples to be distinguished on the basis of their composition. The vibrational bands we found most useful to discriminate between the different products we studied are the fundamental ν(C=C) stretching and methylenic stretching and bending combination bands. The results of the multivariate analysis demonstrated the potential of chemometric approaches for characterizing and identifying drying oils, and also for gaining a deeper insight into the aging process. Comparison with high-performance liquid chromatography data was conducted to check the PCA results.
Hayman, Jonathan; Phillips, Ryan; Chen, Di; Perin, Jamie; Narang, Amol K; Trieu, Janson; Radwan, Noura; Greco, Stephen; Deville, Curtiland; McNutt, Todd; Song, Daniel Y; DeWeese, Theodore L; Tran, Phuoc T
2018-06-01
Undetectable End of Radiation PSA (EOR-PSA) has been shown to predict improved survival in prostate cancer (PCa). While validating the unfavorable intermediate-risk (UIR) and favorable intermediate-risk (FIR) stratifications among Johns Hopkins PCa patients treated with radiotherapy, we examined whether EOR-PSA could further risk stratify UIR men for survival. A total of 302 IR patients were identified in the Johns Hopkins PCa database (178 UIR, 124 FIR). Kaplan-Meier curves and multivariable analysis was performed via Cox regression for biochemical recurrence free survival (bRFS), distant metastasis free survival (DMFS), and overall survival (OS), while a competing risks model was used for PCa specific survival (PCSS). Among the 235 patients with known EOR-PSA values, we then stratified by EOR-PSA and performed the aforementioned analysis. The median follow-up time was 11.5 years (138 months). UIR was predictive of worse DMFS and PCSS (P = 0.008 and P = 0.023) on multivariable analysis (MVA). Increased radiation dose was significant for improved DMFS (P = 0.016) on MVA. EOR-PSA was excluded from the models because it did not trend towards significance as a continuous or binary variable due to interaction with UIR, and we were unable to converge a multivariable model with a variable to control for this interaction. However, when stratifying by detectable versus undetectable EOR-PSA, UIR had worse DMFS and PCSS among detectable EOR-PSA patients, but not undetectable patients. UIR was significant on MVA among detectable EOR-PSA patients for DMFS (P = 0.021) and PCSS (P = 0.033), while RT dose also predicted PCSS (P = 0.013). EOR-PSA can assist in predicting DMFS and PCSS among UIR patients, suggesting a clinically meaningful time point for considering intensification of treatment in clinical trials of intermediate-risk men. © 2018 Wiley Periodicals, Inc.
Zeng, Shanshan; Wang, Lu; Chen, Teng; Wang, Yuefei; Mo, Huanbiao; Qu, Haibin
2012-07-06
The paper presents a novel strategy to identify analytical markers of traditional Chinese medicine preparation (TCMP) rapidly via direct analysis in real time mass spectrometry (DART-MS). A commonly used TCMP, Danshen injection, was employed as a model. The optimal analysis conditions were achieved by measuring the contribution of various experimental parameters to the mass spectra. Salvianolic acids and saccharides were simultaneously determined within a single 1-min DART-MS run. Furthermore, spectra of Danshen injections supplied by five manufacturers were processed with principal component analysis (PCA). Obvious clustering was observed in the PCA score plot, and candidate markers were recognized from the contribution plots of PCA. The suitability of potential markers was then confirmed by contrasting with the results of traditional analysis methods. Using this strategy, fructose, glucose, sucrose, protocatechuic aldehyde and salvianolic acid A were rapidly identified as the markers of Danshen injections. The combination of DART-MS with PCA provides a reliable approach to the identification of analytical markers for quality control of TCMP. Copyright © 2012 Elsevier B.V. All rights reserved.
Iorgulescu, E; Voicu, V A; Sârbu, C; Tache, F; Albu, F; Medvedovici, A
2016-08-01
The influence of the experimental variability (instrumental repeatability, instrumental intermediate precision and sample preparation variability) and data pre-processing (normalization, peak alignment, background subtraction) on the discrimination power of multivariate data analysis methods (Principal Component Analysis -PCA- and Cluster Analysis -CA-) as well as a new algorithm based on linear regression was studied. Data used in the study were obtained through positive or negative ion monitoring electrospray mass spectrometry (+/-ESI/MS) and reversed phase liquid chromatography/UV spectrometric detection (RPLC/UV) applied to green tea extracts. Extractions in ethanol and heated water infusion were used as sample preparation procedures. The multivariate methods were directly applied to mass spectra and chromatograms, involving strictly a holistic comparison of shapes, without assignment of any structural identity to compounds. An alternative data interpretation based on linear regression analysis mutually applied to data series is also discussed. Slopes, intercepts and correlation coefficients produced by the linear regression analysis applied on pairs of very large experimental data series successfully retain information resulting from high frequency instrumental acquisition rates, obviously better defining the profiles being compared. Consequently, each type of sample or comparison between samples produces in the Cartesian space an ellipsoidal volume defined by the normal variation intervals of the slope, intercept and correlation coefficient. Distances between volumes graphically illustrates (dis)similarities between compared data. The instrumental intermediate precision had the major effect on the discrimination power of the multivariate data analysis methods. Mass spectra produced through ionization from liquid state in atmospheric pressure conditions of bulk complex mixtures resulting from extracted materials of natural origins provided an excellent data basis for multivariate analysis methods, equivalent to data resulting from chromatographic separations. The alternative evaluation of very large data series based on linear regression analysis produced information equivalent to results obtained through application of PCA an CA. Copyright © 2016 Elsevier B.V. All rights reserved.
Felker, Ely R.; Raman, Steven S.; Margolis, Daniel J.; Lu, David S. K.; Shaheen, Nicholas; Natarajan, Shyam; Sharma, Devi; Huang, Jiaoti; Dorey, Fred; Marks, Leonard S.
2017-01-01
OBJECTIVE The objective of our study was to determine the clinical and MRI characteristics of clinically significant prostate cancer (PCA) (Gleason score ≥ 3 + 4) in men with Prostate Imaging Reporting and Data System version 2 (PI-RADSv2) category 3 transition zone (TZ) lesions. MATERIALS AND METHODS From 2014 to 2016, 865 men underwent prostate MRI and MRI/ultrasound (US) fusion biopsy (FB). A subset of 90 FB-naïve men with 96 PI-RADSv2 category 3 TZ lesions was identified. Patients were imaged at 3 T using a body coil. Images were assigned a PI-RADSv2 category by an experienced radiologist. Using clinical data and imaging features, we performed univariate and multivariate analyses to identify predictors of clinically significant PCA. RESULTS The mean patient age was 66 years, and the mean prostate-specific antigen density (PSAD) was 0.13 ng/mL2. PCA was detected in 34 of 96 (35%) lesions, 14 of which (15%) harbored clinically significant PCA. In univariate analysis, DWI score, prostate volume, and PSAD were significant predictors (p < 0.05) of clinically significant PCA with a suggested significance for apparent diffusion coefficient (ADC) and prostate-specific antigen value (p < 0.10). On multivariate analysis, PSAD and lesion ADC were the most important covariates. The combination of both PSAD of 0.15 ng/mL2 or greater and an ADC value of less than 1000 mm2/s yielded an AUC of 0.91 for clinically significant PCA (p < 0.001). If FB had been restricted to these criteria, only 10 of 90 men would have undergone biopsy, resulting in diagnosis of clinically significant PCA in 60% with eight men (9%) misdiagnosed (false-negative). CONCLUSION The yield of FB in men with PI-RADSv2 category 3 TZ lesions for clinically significant PCA is 15% but significantly improves to 60% (AUC > 0.9) among men with PSAD of 0.15 ng/mL2 or greater and lesion ADC value of less than 1000 mm2/s. PMID:28858541
Varekar, Vikas; Karmakar, Subhankar; Jha, Ramakar
2016-02-01
The design of surface water quality sampling location is a crucial decision-making process for rationalization of monitoring network. The quantity, quality, and types of available dataset (watershed characteristics and water quality data) may affect the selection of appropriate design methodology. The modified Sanders approach and multivariate statistical techniques [particularly factor analysis (FA)/principal component analysis (PCA)] are well-accepted and widely used techniques for design of sampling locations. However, their performance may vary significantly with quantity, quality, and types of available dataset. In this paper, an attempt has been made to evaluate performance of these techniques by accounting the effect of seasonal variation, under a situation of limited water quality data but extensive watershed characteristics information, as continuous and consistent river water quality data is usually difficult to obtain, whereas watershed information may be made available through application of geospatial techniques. A case study of Kali River, Western Uttar Pradesh, India, is selected for the analysis. The monitoring was carried out at 16 sampling locations. The discrete and diffuse pollution loads at different sampling sites were estimated and accounted using modified Sanders approach, whereas the monitored physical and chemical water quality parameters were utilized as inputs for FA/PCA. The designed optimum number of sampling locations for monsoon and non-monsoon seasons by modified Sanders approach are eight and seven while that for FA/PCA are eleven and nine, respectively. Less variation in the number and locations of designed sampling sites were obtained by both techniques, which shows stability of results. A geospatial analysis has also been carried out to check the significance of designed sampling location with respect to river basin characteristics and land use of the study area. Both methods are equally efficient; however, modified Sanders approach outperforms FA/PCA when limited water quality and extensive watershed information is available. The available water quality dataset is limited and FA/PCA-based approach fails to identify monitoring locations with higher variation, as these multivariate statistical approaches are data-driven. The priority/hierarchy and number of sampling sites designed by modified Sanders approach are well justified by the land use practices and observed river basin characteristics of the study area.
Lazzeri, Massimo; Haese, Alexander; Abrate, Alberto; de la Taille, Alexandre; Redorta, Joan Palou; McNicholas, Thomas; Lughezzani, Giovanni; Lista, Giuliana; Larcher, Alessandro; Bini, Vittorio; Cestari, Andrea; Buffi, Nicolòmaria; Graefen, Markus; Bosset, Olivier; Le Corvoisier, Philippe; Breda, Alberto; de la Torre, Pablo; Fowler, Linda; Roux, Jacques; Guazzoni, Giorgio
2013-08-01
To test the sensitivity, specificity and accuracy of serum prostate-specific antigen isoform [-2]proPSA (p2PSA), %p2PSA and the prostate health index (PHI), in men with a family history of prostate cancer (PCa) undergoing prostate biopsy for suspected PCa. To evaluate the potential reduction in unnecessary biopsies and the characteristics of potentially missed cases of PCa that would result from using serum p2PSA, %p2PSA and PHI. The analysis consisted of a nested case-control study from the PRO-PSA Multicentric European Study, the PROMEtheuS project. All patients had a first-degree relative (father, brother, son) with PCa. Multivariable logistic regression models were complemented by predictive accuracy analysis and decision-curve analysis. Of the 1026 patients included in the PROMEtheuS cohort, 158 (15.4%) had a first-degree relative with PCa. p2PSA, %p2PSA and PHI values were significantly higher (P < 0.001), and free/total PSA (%fPSA) values significantly lower (P < 0.001) in the 71 patients with PCa (44.9%) than in patients without PCa. Univariable accuracy analysis showed %p2PSA (area under the receiver-operating characteristic curve [AUC]: 0.733) and PHI (AUC: 0.733) to be the most accurate predictors of PCa at biopsy, significantly outperforming total PSA ([tPSA] AUC: 0.549), free PSA ([fPSA] AUC: 0.489) and %fPSA (AUC: 0.600) (P ≤ 0.001). For %p2PSA a threshold of 1.66 was found to have the best balance between sensitivity and specificity (70.4 and 70.1%; 95% confidence interval [CI]: 58.4-80.7 and 59.4-79.5 respectively). A PHI threshold of 40 was found to have the best balance between sensitivity and specificity (64.8 and 71.3%, respectively; 95% CI 52.5-75.8 and 60.6-80.5). At 90% sensitivity, the thresholds for %p2PSA and PHI were 1.20 and 25.5, with a specificity of 37.9 and 25.5%, respectively. At a %p2PSA threshold of 1.20, a total of 39 (24.8%) biopsies could have been avoided, but two cancers with a Gleason score (GS) of 7 would have been missed. At a PHI threshold of 25.5 a total of 27 (17.2%) biopsies could have been avoided and two (3.8%) cancers with a GS of 7 would have been missed. In multivariable logistic regression models, %p2PSA and PHI achieved independent predictor status and significantly increased the accuracy of multivariable models including PSA and prostate volume by 8.7 and 10%, respectively (P ≤ 0.001). p2PSA, %p2PSA and PHI were directly correlated with Gleason score (ρ: 0.247, P = 0.038; ρ: 0.366, P = 0.002; ρ: 0.464, P < 0.001, respectively). %p2PSA and PHI are more accurate than tPSA, fPSA and %fPSA in predicting PCa in men with a family history of PCa. Consideration of %p2PSA and PHI results in the avoidance of several unnecessary biopsies. p2PSA, %p2PSA and PHI correlate with cancer aggressiveness. © 2013 BJU International.
Hakimzadeh, Neda; Parastar, Hadi; Fattahi, Mohammad
2014-01-24
In this study, multivariate curve resolution (MCR) and multivariate classification methods are proposed to develop a new chemometric strategy for comprehensive analysis of high-performance liquid chromatography-diode array absorbance detection (HPLC-DAD) fingerprints of sixty Salvia reuterana samples from five different geographical regions. Different chromatographic problems occurred during HPLC-DAD analysis of S. reuterana samples, such as baseline/background contribution and noise, low signal-to-noise ratio (S/N), asymmetric peaks, elution time shifts, and peak overlap are handled using the proposed strategy. In this way, chromatographic fingerprints of sixty samples are properly segmented to ten common chromatographic regions using local rank analysis and then, the corresponding segments are column-wise augmented for subsequent MCR analysis. Extended multivariate curve resolution-alternating least squares (MCR-ALS) is used to obtain pure component profiles in each segment. In general, thirty-one chemical components were resolved using MCR-ALS in sixty S. reuterana samples and the lack of fit (LOF) values of MCR-ALS models were below 10.0% in all cases. Pure spectral profiles are considered for identification of chemical components by comparing their resolved spectra with the standard ones and twenty-four components out of thirty-one components were identified. Additionally, pure elution profiles are used to obtain relative concentrations of chemical components in different samples for multivariate classification analysis by principal component analysis (PCA) and k-nearest neighbors (kNN). Inspection of the PCA score plot (explaining 76.1% of variance accounted for three PCs) showed that S. reuterana samples belong to four clusters. The degree of class separation (DCS) which quantifies the distance separating clusters in relation to the scatter within each cluster is calculated for four clusters and it was in the range of 1.6-5.8. These results are then confirmed by kNN. In addition, according to the PCA loading plot and kNN dendrogram of thirty-one variables, five chemical constituents of luteolin-7-o-glucoside, salvianolic acid D, rosmarinic acid, lithospermic acid and trijuganone A are identified as the most important variables (i.e., chemical markers) for clusters discrimination. Finally, the effect of different chemical markers on samples differentiation is investigated using counter-propagation artificial neural network (CP-ANN) method. It is concluded that the proposed strategy can be successfully applied for comprehensive analysis of chromatographic fingerprints of complex natural samples. Copyright © 2013 Elsevier B.V. All rights reserved.
He, Shixuan; Xie, Wanyi; Zhang, Wei; Zhang, Liqun; Wang, Yunxia; Liu, Xiaoling; Liu, Yulong; Du, Chunlei
2015-02-25
A novel strategy which combines iteratively cubic spline fitting baseline correction method with discriminant partial least squares qualitative analysis is employed to analyze the surface enhanced Raman scattering (SERS) spectroscopy of banned food additives, such as Sudan I dye and Rhodamine B in food, Malachite green residues in aquaculture fish. Multivariate qualitative analysis methods, using the combination of spectra preprocessing iteratively cubic spline fitting (ICSF) baseline correction with principal component analysis (PCA) and discriminant partial least squares (DPLS) classification respectively, are applied to investigate the effectiveness of SERS spectroscopy for predicting the class assignments of unknown banned food additives. PCA cannot be used to predict the class assignments of unknown samples. However, the DPLS classification can discriminate the class assignment of unknown banned additives using the information of differences in relative intensities. The results demonstrate that SERS spectroscopy combined with ICSF baseline correction method and exploratory analysis methodology DPLS classification can be potentially used for distinguishing the banned food additives in field of food safety. Copyright © 2014 Elsevier B.V. All rights reserved.
[Quality evaluation of American ginseng using UPLC coupled with multivariate analysis].
Tang, Yan; Yan, Shu-Mo; Wang, Jing-Jing; Yuan, Yuan; Yang, Bin
2016-05-01
An ultra performance liquid chromatography (UPLC)method combined with multivariate data analysis was developed to evaluate the quality of American ginseng by simultaneously determining the concentrations of six ginsenosides (Rg₁, Re, Rb₁, Rc, Ro and Rd)in the samples. For UPLC, acetonitrile with 0.01% formic acid and water with 0.01% formic acid were used as the mobile phase with gradient elution. Under the established chromatographic conditions, the six ginsenosides could be well separated and the results of linearity, stability, precision, repeatability, and recovery rate all reached the requirement of quantification analysis, respectively. The total contents of Rg₁, Re, and Rb₁ in 57 samples all reached the requirement of the 2015 edition of Chinese Pharmacopoeia. At the same time, the experimental data were analyzed by principle component analysis (PCA) and partial least squares discriminant analysis (PLS-DA). The crude drugs and the decoction pieces can be discriminated by a PCA method and the samples with different age can be distinguished by a PLS-DA method. Copyright© by the Chinese Pharmaceutical Association.
Waskitho, Dri; Lukitaningsih, Endang; Sudjadi; Rohman, Abdul
2016-01-01
Analysis of lard extracted from lipstick formulation containing castor oil has been performed using FTIR spectroscopic method combined with multivariate calibration. Three different extraction methods were compared, namely saponification method followed by liquid/liquid extraction with hexane/dichlorometane/ethanol/water, saponification method followed by liquid/liquid extraction with dichloromethane/ethanol/water, and Bligh & Dyer method using chloroform/methanol/water as extracting solvent. Qualitative and quantitative analysis of lard were performed using principle component (PCA) and partial least square (PLS) analysis, respectively. The results showed that, in all samples prepared by the three extraction methods, PCA was capable of identifying lard at wavelength region of 1200-800 cm -1 with the best result was obtained by Bligh & Dyer method. Furthermore, PLS analysis at the same wavelength region used for qualification showed that Bligh and Dyer was the most suitable extraction method with the highest determination coefficient (R 2 ) and the lowest root mean square error of calibration (RMSEC) as well as root mean square error of prediction (RMSEP) values.
NASA Astrophysics Data System (ADS)
He, Shixuan; Xie, Wanyi; Zhang, Ping; Fang, Shaoxi; Li, Zhe; Tang, Peng; Gao, Xia; Guo, Jinsong; Tlili, Chaker; Wang, Deqiang
2018-02-01
The analysis of algae and dominant alga plays important roles in ecological and environmental fields since it can be used to forecast water bloom and control its potential deleterious effects. Herein, we combine in vivo confocal resonance Raman spectroscopy with multivariate analysis methods to preliminary identify the three algal genera in water blooms at unicellular scale. Statistical analysis of characteristic Raman peaks demonstrates that certain shifts and different normalized intensities, resulting from composition of different carotenoids, exist in Raman spectra of three algal cells. Principal component analysis (PCA) scores and corresponding loading weights show some differences from Raman spectral characteristics which are caused by vibrations of carotenoids in unicellular algae. Then, discriminant partial least squares (DPLS) classification method is used to verify the effectiveness of algal identification with confocal resonance Raman spectroscopy. Our results show that confocal resonance Raman spectroscopy combined with PCA and DPLS could handle the preliminary identification of dominant alga for forecasting and controlling of water blooms.
Busetto, Gian Maria; De Berardinis, Ettore; Sciarra, Alessandro; Panebianco, Valeria; Giovannone, Riccardo; Rosato, Stefano; D'Errigo, Paola; Di Silverio, Franco; Gentile, Vincenzo; Salciccia, Stefano
2013-12-01
To overcome the well-known prostate-specific antigen limits, several new biomarkers have been proposed. Since its introduction in clinical practice, the urinary prostate cancer gene 3 (PCA3) assay has shown promising results for prostate cancer (PC) detection. Furthermore, multiparametric magnetic resonance imaging (mMRI) has the ability to better describe several aspects of PC. A prospective study of 171 patients with negative prostate biopsy findings and a persistent high prostate-specific antigen level was conducted to assess the role of mMRI and PCA3 in identifying PC. All patients underwent the PCA3 test and mMRI before a second transrectal ultrasound-guided prostate biopsy. The accuracy and reliability of PCA3 (3 different cutoff points) and mMRI were evaluated. Four multivariate logistic regression models were analyzed, in terms of discrimination and the cost benefit, to assess the clinical role of PCA3 and mMRI in predicting the biopsy outcome. A decision curve analysis was also plotted. Repeated transrectal ultrasound-guided biopsy identified 68 new cases (41.7%) of PC. The sensitivity and specificity of the PCA3 test and mMRI was 68% and 49% and 74% and 90%, respectively. Evaluating the regression models, the best discrimination (area under the curve 0.808) was obtained using the full model (base clinical model plus mMRI and PCA3). The decision curve analysis, to evaluate the cost/benefit ratio, showed good performance in predicting PC with the model that included mMRI and PCA3. mMRI increased the accuracy and sensitivity of the PCA3 test, and the use of the full model significantly improved the cost/benefit ratio, avoiding unnecessary biopsies. Copyright © 2013 Elsevier Inc. All rights reserved.
Verma, Priyanka; Kumar, Manoj; Mishra, Girish; Sahoo, Dinabandhu
2017-02-01
In the present study bio prospecting of thirty seaweeds from Indian coasts was analyzed for their biochemical components including pigments, fatty acid and ash content. Multivariate analysis of biochemical components and fatty acids was done using Principal Component Analysis (PCA) and Agglomerative hierarchical clustering (AHC) to manifest chemotaxonomic relationship among various seaweeds. The overall analysis suggests that these seaweeds have multi-functional properties and can be utilized as promising bioresource for proteins, lipids, pigments and carbohydrates for the food/feed and biofuel industry. Copyright © 2016. Published by Elsevier Ltd.
The Raman spectrum character of skin tumor induced by UVB
NASA Astrophysics Data System (ADS)
Wu, Shulian; Hu, Liangjun; Wang, Yunxia; Li, Yongzeng
2016-03-01
In our study, the skin canceration processes induced by UVB were analyzed from the perspective of tissue spectrum. A home-made Raman spectral system with a millimeter order excitation laser spot size combined with a multivariate statistical analysis for monitoring the skin changed irradiated by UVB was studied and the discrimination were evaluated. Raman scattering signals of the SCC and normal skin were acquired. Spectral differences in Raman spectra were revealed. Linear discriminant analysis (LDA) based on principal component analysis (PCA) were employed to generate diagnostic algorithms for the classification of skin SCC and normal. The results indicated that Raman spectroscopy combined with PCA-LDA demonstrated good potential for improving the diagnosis of skin cancers.
NIR monitoring of in-service wood structures
Michela Zanetti; Timothy G. Rials; Douglas Rammer
2005-01-01
Near infrared spectroscopy (NIRS) was used to study a set of Southern Yellow Pine boards exposed to natural weathering for different periods of exposure time. This non-destructive spectroscopic technique is a very powerful tool to predict the weathering of wood when used in combination with multivariate analysis (Principal Component Analysis, PCA, and Projection to...
NASA Astrophysics Data System (ADS)
Di Anibal, Carolina V.; Marsal, Lluís F.; Callao, M. Pilar; Ruisánchez, Itziar
2012-02-01
Raman spectroscopy combined with multivariate analysis was evaluated as a tool for detecting Sudan I dye in culinary spices. Three Raman modalities were studied: normal Raman, FT-Raman and SERS. The results show that SERS is the most appropriate modality capable of providing a proper Raman signal when a complex matrix is analyzed. To get rid of the spectral noise and background, Savitzky-Golay smoothing with polynomial baseline correction and wavelet transform were applied. Finally, to check whether unadulterated samples can be differentiated from samples adulterated with Sudan I dye, an exploratory analysis such as principal component analysis (PCA) was applied to raw data and data processed with the two mentioned strategies. The results obtained by PCA show that Raman spectra need to be properly treated if useful information is to be obtained and both spectra treatments are appropriate for processing the Raman signal. The proposed methodology shows that SERS combined with appropriate spectra treatment can be used as a practical screening tool to distinguish samples suspicious to be adulterated with Sudan I dye.
Analysis of antique bronze coins by Laser Induced Breakdown Spectroscopy and multivariate analysis
NASA Astrophysics Data System (ADS)
Bachler, M. Orlić; Bišćan, M.; Kregar, Z.; Jelovica Badovinac, I.; Dobrinić, J.; Milošević, S.
2016-09-01
This work presents a feasibility study of applying the Principal Component Analysis (PCA) to data obtained by Laser-Induced Breakdown Spectroscopy (LIBS) with the aim of determining correlation between different samples. The samples were antique bronze coins coated in silver (follis) dated in the Roman Empire period and were made during different rulers in different mints. While raw LIBS data revealed that in the period from the year 286 to 383 CE content of silver was constantly decreasing, the PCA showed that the samples can be somewhat grouped together based on their place of origin, which could be a useful hint when analysing unknown samples. It was also found that PCA can help in discriminating spectra corresponding to ablation from the surface and from the bulk. Furthermore, Partial Least Squares method (PLS) was used to obtain, based on a set of samples with known composition, an estimation of relative copper concentration in studied ancient coins. This analysis showed that copper concentration in surface layers ranged from 83% to 90%.
Bonetti, Jennifer; Quarino, Lawrence
2014-05-01
This study has shown that the combination of simple techniques with the use of multivariate statistics offers the potential for the comparative analysis of soil samples. Five samples were obtained from each of twelve state parks across New Jersey in both the summer and fall seasons. Each sample was examined using particle-size distribution, pH analysis in both water and 1 M CaCl2 , and a loss on ignition technique. Data from each of the techniques were combined, and principal component analysis (PCA) and canonical discriminant analysis (CDA) were used for multivariate data transformation. Samples from different locations could be visually differentiated from one another using these multivariate plots. Hold-one-out cross-validation analysis showed error rates as low as 3.33%. Ten blind study samples were analyzed resulting in no misclassifications using Mahalanobis distance calculations and visual examinations of multivariate plots. Seasonal variation was minimal between corresponding samples, suggesting potential success in forensic applications. © 2014 American Academy of Forensic Sciences.
Pérez-Rambla, Clara; Puchades-Carrasco, Leonor; García-Flores, María; Rubio-Briones, José; López-Guerrero, José Antonio; Pineda-Lucena, Antonio
2017-01-01
Prostate cancer (PCa) is one of the most common malignancies in men worldwide. Serum prostate specific antigen (PSA) level has been extensively used as a biomarker to detect PCa. However, PSA is not cancer-specific and various non-malignant conditions, including benign prostatic hyperplasia (BPH), can cause a rise in PSA blood levels, thus leading to many false positive results. In this study, we evaluated the potential of urinary metabolomic profiling for discriminating PCa from BPH. Urine samples from 64 PCa patients and 51 individuals diagnosed with BPH were analysed using 1 H nuclear magnetic resonance ( 1 H-NMR). Comparative analysis of urinary metabolomic profiles was carried out using multivariate and univariate statistical approaches. The urine metabolomic profile of PCa patients is characterised by increased concentrations of branched-chain amino acids (BCAA), glutamate and pseudouridine, and decreased concentrations of glycine, dimethylglycine, fumarate and 4-imidazole-acetate compared with individuals diagnosed with BPH. PCa patients have a specific urinary metabolomic profile. The results of our study underscore the clinical potential of metabolomic profiling to uncover metabolic changes that could be useful to discriminate PCa from BPH in a clinical context.
Veyhe, Anna Sofía; Hofoss, Dag; Hansen, Solrunn; Thomassen, Yngvar; Sandanger, Torkjel M; Odland, Jon Øyvind; Nieboer, Evert
2015-03-01
Although predictors of contaminants in serum or whole blood are usually examined by chemical groups (e.g., POPs, toxic and/or essential elements; dietary sources), principal component analysis (PCA) permits consideration of both individual substances and combined variables. Our study had two primary objectives: (i) Characterize the sources and predictors of a suite of eight PCBs, four organochlorine (OC) pesticides, five essential and five toxic elements in serum and/or whole blood of pregnant women recruited as part of the Mother-and-Child Contaminant Cohort Study conducted in Northern Norway (The MISA study); and (ii) determine the influence of personal and social characteristics on both dietary and contaminant factors. Recruitment and sampling started in May 2007 and continued for the next 31 months until December 2009. Blood/serum samples were collected during the 2nd trimester (mean: 18.2 weeks, range 9.0-36.0). A validated questionnaire was administered to obtain personal information. The samples were analysed by established laboratories employing verified methods and reference standards. PCA involved Varimax rotation, and significant predictors (p≤0.05) in linear regression models were included in the multivariable linear regression analysis. When considering all the contaminants, three prominent PCA axes stood out with prominent loadings of: all POPs; arsenic, selenium and mercury; and cadmium and lead. Respectively, in the multivariate models the following were predictors: maternal age, parity and consumption of freshwater fish and land-based wild animals; marine fish; cigarette smoking, dietary PCA axes reflecting consumption of grains and cereals, and food items involving hunting. PCA of only the POPs separated them into two axes that, in terms of recently published findings, could be understood to reflect longitudinal trends and their relative contributions to summed POPs. The linear combinations of variables generated by PCA identified prominent dietary sources of OC groups and of prominent toxic elements and highlighted the importance of maternal characteristics. Copyright © 2014 Elsevier GmbH. All rights reserved.
Chen, Rui; Xie, Liping; Xue, Wei; Ye, Zhangqun; Ma, Lulin; Gao, Xu; Ren, Shancheng; Wang, Fubo; Zhao, Lin; Xu, Chuanliang; Sun, Yinghao
2016-09-01
Substantial differences exist in the relationship of prostate cancer (PCa) detection rate and prostate-specific antigen (PSA) level between Western and Asian populations. Classic Western risk calculators, European Randomized Study for Screening of Prostate Cancer Risk Calculator, and Prostate Cancer Prevention Trial Risk Calculator, were shown to be not applicable in Asian populations. We aimed to develop and validate a risk calculator for predicting the probability of PCa and high-grade PCa (defined as Gleason Score sum 7 or higher) at initial prostate biopsy in Chinese men. Urology outpatients who underwent initial prostate biopsy according to the inclusion criteria were included. The multivariate logistic regression-based Chinese Prostate Cancer Consortium Risk Calculator (CPCC-RC) was constructed with cases from 2 hospitals in Shanghai. Discriminative ability, calibration and decision curve analysis were externally validated in 3 CPCC member hospitals. Of the 1,835 patients involved, PCa was identified in 338/924 (36.6%) and 294/911 (32.3%) men in the development and validation cohort, respectively. Multivariate logistic regression analyses showed that 5 predictors (age, logPSA, logPV, free PSA ratio, and digital rectal examination) were associated with PCa (Model 1) or high-grade PCa (Model 2), respectively. The area under the curve of Model 1 and Model 2 was 0.801 (95% CI: 0.771-0.831) and 0.826 (95% CI: 0.796-0.857), respectively. Both models illustrated good calibration and substantial improvement in decision curve analyses than any single predictors at all threshold probabilities. Higher predicting accuracy, better calibration, and greater clinical benefit were achieved by CPCC-RC, compared with European Randomized Study for Screening of Prostate Cancer Risk Calculator and Prostate Cancer Prevention Trial Risk Calculator in predicting PCa. CPCC-RC performed well in discrimination and calibration and decision curve analysis in external validation compared with Western risk calculators. CPCC-RC may aid in decision-making of prostate biopsy in Chinese or in other Asian populations with similar genetic and environmental backgrounds. Copyright © 2016 Elsevier Inc. All rights reserved.
The application of near infrared (NIR) spectroscopy to inorganic preservative-treated wood
Chi-Leung So; Stan T. Lebow; Leslie H. Groom; Timothy G. Rials
2004-01-01
There is a growing need to find a rapid, inexpensive, and reliable method to distinguish between treated and untreated waste wood. This paper evaluates the ability of near infrared (NIR) spectroscopy with multivariate analysis (MVA) to distinguish preservative types and retentions. It is demonstrated that principal component analysis (PCA) can differentiate lumber...
ERIC Educational Resources Information Center
Brusco, Michael J.; Singh, Renu; Steinley, Douglas
2009-01-01
The selection of a subset of variables from a pool of candidates is an important problem in several areas of multivariate statistics. Within the context of principal component analysis (PCA), a number of authors have argued that subset selection is crucial for identifying those variables that are required for correct interpretation of the…
Tailored multivariate analysis for modulated enhanced diffraction
DOE Office of Scientific and Technical Information (OSTI.GOV)
Caliandro, Rocco; Guccione, Pietro; Nico, Giovanni
2015-10-21
Modulated enhanced diffraction (MED) is a technique allowing the dynamic structural characterization of crystalline materials subjected to an external stimulus, which is particularly suited forin situandoperandostructural investigations at synchrotron sources. Contributions from the (active) part of the crystal system that varies synchronously with the stimulus can be extracted by an offline analysis, which can only be applied in the case of periodic stimuli and linear system responses. In this paper a new decomposition approach based on multivariate analysis is proposed. The standard principal component analysis (PCA) is adapted to treat MED data: specific figures of merit based on their scoresmore » and loadings are found, and the directions of the principal components obtained by PCA are modified to maximize such figures of merit. As a result, a general method to decompose MED data, called optimum constrained components rotation (OCCR), is developed, which produces very precise results on simulated data, even in the case of nonperiodic stimuli and/or nonlinear responses. The multivariate analysis approach is able to supply in one shot both the diffraction pattern related to the active atoms (through the OCCR loadings) and the time dependence of the system response (through the OCCR scores). When applied to real data, OCCR was able to supply only the latter information, as the former was hindered by changes in abundances of different crystal phases, which occurred besides structural variations in the specific case considered. To develop a decomposition procedure able to cope with this combined effect represents the next challenge in MED analysis.« less
Badran, M; Morsy, R; Soliman, H; Elnimr, T
2016-01-01
The trace elements metabolism has been reported to possess specific roles in the pathogenesis and progress of diabetes mellitus. Due to the continuous increase in the population of patients with Type 2 diabetes (T2D), this study aims to assess the levels and inter-relationships of fast blood glucose (FBG) and serum trace elements in Type 2 diabetic patients. This study was conducted on 40 Egyptian Type 2 diabetic patients and 36 healthy volunteers (Hospital of Tanta University, Tanta, Egypt). The blood serum was digested and then used to determine the levels of 24 trace elements using an inductive coupled plasma mass spectroscopy (ICP-MS). Multivariate statistical analysis depended on correlation coefficient, cluster analysis (CA) and principal component analysis (PCA), were used to analysis the data. The results exhibited significant changes in FBG and eight of trace elements, Zn, Cu, Se, Fe, Mn, Cr, Mg, and As, levels in the blood serum of Type 2 diabetic patients relative to those of healthy controls. The statistical analyses using multivariate statistical techniques were obvious in the reduction of the experimental variables, and grouping the trace elements in patients into three clusters. The application of PCA revealed a distinct difference in associations of trace elements and their clustering patterns in control and patients group in particular for Mg, Fe, Cu, and Zn that appeared to be the most crucial factors which related with Type 2 diabetes. Therefore, on the basis of this study, the contributors of trace elements content in Type 2 diabetic patients can be determine and specify with correlation relationship and multivariate statistical analysis, which confirm that the alteration of some essential trace metals may play a role in the development of diabetes mellitus. Copyright © 2015 Elsevier GmbH. All rights reserved.
Measuring the Indonesian provinces competitiveness by using PCA technique
NASA Astrophysics Data System (ADS)
Runita, Ditha; Fajriyah, Rohmatul
2017-12-01
Indonesia is a country which has vast teritoty. It has 34 provinces. Building local competitiveness is critical to enhance the long-term national competitiveness especially for a country as diverse as Indonesia. A competitive local government can attract and maintain successful firms and increase living standards for its inhabitants, because investment and skilled workers gravitate from uncompetitive regions to more competitive ones. Altough there are other methods to measuring competitiveness, but here we have demonstrated a simple method using principal component analysis (PCA). It can directly be applied to correlated, multivariate data. The analysis on Indonesian provinces provides 3 clusters based on the competitiveness measurement and the clusters are Bad, Good and Best perform provinces.
Carneiro, Fabíola B.; Lopes, Pablo Q.; Ramalho, Ricardo C.; Scotti, Marcus T.; Santos, Sócrates G.; Soares, Luiz A. L.
2017-01-01
Background: Schinus terebinthifolius Raddi belongs to Anacardiacea family and is widely known as “aroeira.” This species originates from South America, and its extracts are used in folk medicine due to its therapeutic properties, which include antimicrobial, anti-inflammatory, and antipyretic effects. The complexity and variability of the chemical constitution of the herbal raw material establishes the quality of the respective herbal medicine products. Objective: Thus, the purpose of this study was to investigate the variability of the volatile compounds from leaves of S. terebinthifolius. Materials and Methods: The samples were collected from different states of the Northeast region of Brazil and analyzed with a gas chromatograph coupled to a mass spectrometer (GC-MS). The collected data were analyzed using multivariate data analysis. Results: The samples’ chromatograms, obtained by GC-MS, showed similar chemical profiles in a number of peaks, but some differences were observed in the intensity of these analytical markers. The chromatographic fingerprints obtained by GC-MS were suitable for discrimination of the samples; these results along with a statistical treatment (principal component analysis [PCA]) were used as a tool for comparative analysis between the different samples of S. terebinthifolius. Conclusion: The experimental data show that the PCA used in this study clustered the samples into groups with similar chemical profiles, which builds an appropriate approach to evaluate the similarity in the phytochemical pattern found in the different leaf samples. SUMMARY The leave extracts of Schinus terebinthifolius were obtained by turbo-extractionThe extracts were partitioned with hexane and analyzed by GC-MSThe chromatographic data were analyzed using the principal component analysis (PCA)The PCA plots showed the main compounds (phellandrene, limonene, and carene), which were used to group the samples from a different geographical location in accordance to their chemical similarity. Abbreviations used: AL: Alagoas, BA: Bahia, CE: Ceará, CPETEC: Center for Weather Forecasting and Climate Studies, GC-MS: Gas chromatograph coupled to a mass spectrometer, MA: Maranhão, MVA: Multivariate data analysis, PB: Paraíba, PC1: Direction that describes the maximum variance of the original data, PC2: Maximum direction variance of the data in the subspace orthogonal to PC1, PCA: Principal component analysis, PE: Pernambuco, PI: Piauí, RN: Rio Grande do Norte, SE: Sergipe. PMID:29142431
Application of multivariable statistical techniques in plant-wide WWTP control strategies analysis.
Flores, X; Comas, J; Roda, I R; Jiménez, L; Gernaey, K V
2007-01-01
The main objective of this paper is to present the application of selected multivariable statistical techniques in plant-wide wastewater treatment plant (WWTP) control strategies analysis. In this study, cluster analysis (CA), principal component analysis/factor analysis (PCA/FA) and discriminant analysis (DA) are applied to the evaluation matrix data set obtained by simulation of several control strategies applied to the plant-wide IWA Benchmark Simulation Model No 2 (BSM2). These techniques allow i) to determine natural groups or clusters of control strategies with a similar behaviour, ii) to find and interpret hidden, complex and casual relation features in the data set and iii) to identify important discriminant variables within the groups found by the cluster analysis. This study illustrates the usefulness of multivariable statistical techniques for both analysis and interpretation of the complex multicriteria data sets and allows an improved use of information for effective evaluation of control strategies.
Liu, Chang; Liu, Shi-Liang; Wang, Zhi-Xian; Yu, Kai; Feng, Chun-Xiang; Ke, Zan; Wang, Liang; Zeng, Xiao-Yong
2018-04-13
Prostate cancer (PCa) is one of the most common cancers among men globally. The authors aimed to evaluate the ability of the Prostate Imaging Reporting and Data System version 2 (PI-RADS v2) to classify men with PCa, clinically significant PCa (CSPCa), or no PCa, especially among those with serum total prostate-specific antigen (tPSA) levels in the "gray zone" (4-10 ng ml -1 ). A total of 308 patients (355 lesions) were enrolled in this study. Diagnostic efficiency was determined. Univariate and multivariate analyses, receiver operating characteristic curve analysis, and decision curve analysis were performed to determine and compare the predictors of PCa and CSPCa. The results suggested that PI-RADS v2, tPSA, and prostate-specific antigen density (PSAD) were independent predictors of PCa and CSPCa. A PI-RADS v2 score ≥4 provided high negative predictive values (91.39% for PCa and 95.69% for CSPCa). A model of PI-RADS combined with PSA and PSAD helped to define a high-risk group (PI-RADS score = 5 and PSAD ≥0.15 ng ml -1 cm -3 , with tPSA in the gray zone, or PI-RADS score ≥4 with high tPSA level) with a detection rate of 96.1% for PCa and 93.0% for CSPCa while a low-risk group with a detection rate of 6.1% for PCa and 2.2% for CSPCa. It was concluded that the PI-RADS v2 could be used as a reliable and independent predictor of PCa and CSPCa. The combination of PI-RADS v2 score with PSA and PSAD could be helpful in the prediction and diagnosis of PCa and CSPCa and, thus, may help in preventing unnecessary invasive procedures.
Thavamani, Palanisami; Megharaj, Mallavarapu; Naidu, Ravi
2012-06-01
Principal component analysis (PCA) was used to provide an overview of the distribution pattern of polycyclic aromatic hydrocarbons (PAHs) and heavy metals in former manufactured gas plant (MGP) site soils. PCA is the powerful multivariate method to identify the patterns in data and expressing their similarities and differences. Ten PAHs (naphthalene, acenapthylene, acenaphthene, fluorene, phenanthrene, anthracene, fluoranthene, pyrene, chrysene, benzo[a]pyrene) and four toxic heavy metals - lead (Pb), cadmium (Cd), chromium (Cr) and zinc (Zn) - were detected in the site soils. PAH contamination was contributed equally by both low and high molecular weight PAHs. PCA was performed using the varimax rotation method in SPSS, 17.0. Two principal components accounting for 91.7% of the total variance was retained using scree test. Principle component 1 (PC1) substantially explained the dominance of PAH contamination in the MGP site soils. All PAHs, except anthracene, were positively correlated in PC1. There was a common thread in high molecular weight PAHs loadings, where the loadings were inversely proportional to the hydrophobicity and molecular weight of individual PAHs. Anthracene, which was less correlated with other individual PAHs, deviated well from the origin which can be ascribed to its lower toxicity and different origin than its isomer phenanthrene. Among the four major heavy metals studied in MGP sites, Pb, Cd and Cr were negatively correlated in PC1 but showed strong positive correlation in principle component 2 (PC2). Although metals may not have originated directly from gaswork processes, the correlation between PAHs and metals suggests that the materials used in these sites may have contributed to high concentrations of Pb, Cd, Cr and Zn. Thus, multivariate analysis helped to identify the sources of PAHs, heavy metals and their association in MGP site, and thereby better characterise the site risk, which would not be possible if one uses chemical analysis alone.
Tang, Bo; Han, Cheng-Tao; Zhang, Gui-Ming; Zhang, Cui-Zhu; Yang, Wei-Yi; Shen, Ying; Vidal, Adriana C; Freedland, Stephen J; Zhu, Yao; Ye, Ding-Wei
2017-03-08
To investigate whether waist-hip ratio (WHR) is a better predictor of prostate cancer (PCa) incidence than body mass index (BMI) in Chinese men. Of consecutive patients who underwent prostate biopsies in one tertiary center between 2013 and 2015, we examined data on 1018 with PSA ≤20 ng/ml. Clinical data and biopsy outcomes were collected. Logistic regression was used to evaluate the associations between BMI, WHR and PCa incidence. Area under the ROC (AUC) was used to evaluate the accuracy of different prognostic models. A total of 255 men and 103 men were diagnosed with PCa and high grade PCa (HGPCa, Gleason score ≥8). WHR was an independent risk factor for both PCa (OR = 1.07 95%Cl 1.03-1.11) and HGPCa (OR = 1.14 95%Cl 1.09-1.19) detection, while BMI had no relationship with either PCa or HGPCa detection. Adding WHR to a multivariable model increased the AUC for detecting HGPCa from 0.66 (95%Cl 0.60-0.72) to 0.71 (95%Cl 0.65-0.76). In this Chinese cohort, WHR was significantly predictive of PCa and HGPCa. Adding WHR to a multivariable model increased the diagnostic accuracy for detecting HGPCa. If confirmed, including WHR measurement may improve PCa and HGPCa detection.
Tang, Bo; Han, Cheng-Tao; Zhang, Gui-Ming; Zhang, Cui-Zhu; Yang, Wei-Yi; Shen, Ying; Vidal, Adriana C.; Freedland, Stephen J.; Zhu, Yao; Ye, Ding-Wei
2017-01-01
To investigate whether waist-hip ratio (WHR) is a better predictor of prostate cancer (PCa) incidence than body mass index (BMI) in Chinese men. Of consecutive patients who underwent prostate biopsies in one tertiary center between 2013 and 2015, we examined data on 1018 with PSA ≤20 ng/ml. Clinical data and biopsy outcomes were collected. Logistic regression was used to evaluate the associations between BMI, WHR and PCa incidence. Area under the ROC (AUC) was used to evaluate the accuracy of different prognostic models. A total of 255 men and 103 men were diagnosed with PCa and high grade PCa (HGPCa, Gleason score ≥8). WHR was an independent risk factor for both PCa (OR = 1.07 95%Cl 1.03–1.11) and HGPCa (OR = 1.14 95%Cl 1.09–1.19) detection, while BMI had no relationship with either PCa or HGPCa detection. Adding WHR to a multivariable model increased the AUC for detecting HGPCa from 0.66 (95%Cl 0.60–0.72) to 0.71 (95%Cl 0.65–0.76). In this Chinese cohort, WHR was significantly predictive of PCa and HGPCa. Adding WHR to a multivariable model increased the diagnostic accuracy for detecting HGPCa. If confirmed, including WHR measurement may improve PCa and HGPCa detection. PMID:28272469
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ladd-Lively, Jennifer L
2014-01-01
The objective of this work was to determine the feasibility of using on-line multivariate statistical process control (MSPC) for safeguards applications in natural uranium conversion plants. Multivariate statistical process control is commonly used throughout industry for the detection of faults. For safeguards applications in uranium conversion plants, faults could include the diversion of intermediate products such as uranium dioxide, uranium tetrafluoride, and uranium hexafluoride. This study was limited to a 100 metric ton of uranium (MTU) per year natural uranium conversion plant (NUCP) using the wet solvent extraction method for the purification of uranium ore concentrate. A key component inmore » the multivariate statistical methodology is the Principal Component Analysis (PCA) approach for the analysis of data, development of the base case model, and evaluation of future operations. The PCA approach was implemented through the use of singular value decomposition of the data matrix where the data matrix represents normal operation of the plant. Component mole balances were used to model each of the process units in the NUCP. However, this approach could be applied to any data set. The monitoring framework developed in this research could be used to determine whether or not a diversion of material has occurred at an NUCP as part of an International Atomic Energy Agency (IAEA) safeguards system. This approach can be used to identify the key monitoring locations, as well as locations where monitoring is unimportant. Detection limits at the key monitoring locations can also be established using this technique. Several faulty scenarios were developed to test the monitoring framework after the base case or normal operating conditions of the PCA model were established. In all of the scenarios, the monitoring framework was able to detect the fault. Overall this study was successful at meeting the stated objective.« less
Spatial assessment of air quality patterns in Malaysia using multivariate analysis
NASA Astrophysics Data System (ADS)
Dominick, Doreena; Juahir, Hafizan; Latif, Mohd Talib; Zain, Sharifuddin M.; Aris, Ahmad Zaharin
2012-12-01
This study aims to investigate possible sources of air pollutants and the spatial patterns within the eight selected Malaysian air monitoring stations based on a two-year database (2008-2009). The multivariate analysis was applied on the dataset. It incorporated Hierarchical Agglomerative Cluster Analysis (HACA) to access the spatial patterns, Principal Component Analysis (PCA) to determine the major sources of the air pollution and Multiple Linear Regression (MLR) to assess the percentage contribution of each air pollutant. The HACA results grouped the eight monitoring stations into three different clusters, based on the characteristics of the air pollutants and meteorological parameters. The PCA analysis showed that the major sources of air pollution were emissions from motor vehicles, aircraft, industries and areas of high population density. The MLR analysis demonstrated that the main pollutant contributing to variability in the Air Pollutant Index (API) at all stations was particulate matter with a diameter of less than 10 μm (PM10). Further MLR analysis showed that the main air pollutant influencing the high concentration of PM10 was carbon monoxide (CO). This was due to combustion processes, particularly originating from motor vehicles. Meteorological factors such as ambient temperature, wind speed and humidity were also noted to influence the concentration of PM10.
Differences in chewing sounds of dry-crisp snacks by multivariate data analysis
NASA Astrophysics Data System (ADS)
De Belie, N.; Sivertsvik, M.; De Baerdemaeker, J.
2003-09-01
Chewing sounds of different types of dry-crisp snacks (two types of potato chips, prawn crackers, cornflakes and low calorie snacks from extruded starch) were analysed to assess differences in sound emission patterns. The emitted sounds were recorded by a microphone placed over the ear canal. The first bite and the first subsequent chew were selected from the time signal and a fast Fourier transformation provided the power spectra. Different multivariate analysis techniques were used for classification of the snack groups. This included principal component analysis (PCA) and unfold partial least-squares (PLS) algorithms, as well as multi-way techniques such as three-way PLS, three-way PCA (Tucker3), and parallel factor analysis (PARAFAC) on the first bite and subsequent chew. The models were evaluated by calculating the classification errors and the root mean square error of prediction (RMSEP) for independent validation sets. It appeared that the logarithm of the power spectra obtained from the chewing sounds could be used successfully to distinguish the different snack groups. When different chewers were used, recalibration of the models was necessary. Multi-way models distinguished better between chewing sounds of different snack groups than PCA on bite or chew separately and than unfold PLS. From all three-way models applied, N-PLS with three components showed the best classification capabilities, resulting in classification errors of 14-18%. The major amount of incorrect classifications was due to one type of potato chips that had a very irregular shape, resulting in a wide variation of the emitted sounds.
Wang, Kai; Chen, Xinguang; Bird, Victoria Y; Gerke, Travis A; Manini, Todd M; Prosperi, Mattia
2017-11-01
The relationship between serum total testosterone and prostate cancer (PCa) risk is controversial. The hypothesis that faster age-related reduction in testosterone is linked with increased PCa risk remains untested. We conducted our study at a tertiary-level hospital in southeast of the USA, and derived data from the Medical Registry Database of individuals that were diagnosed of any prostate-related disease from 2001 to 2015. Cases were those diagnosed of PCa and had one or more measurements of testosterone prior to PCa diagnosis. Controls were those without PCa and had one or more testosterone measurements. Multivariable logistic regression models for PCa risk of absolute levels (one-time measure and 5-year average) and annual change in testosterone were respectively constructed. Among a total of 1,559 patients, 217 were PCa cases, and neither one-time measure nor 5-year average of testosterone was found to be significantly associated with PCa risk. Among the 379 patients with two or more testosterone measurements, 27 were PCa cases. For every 10 ng/dL increment in annual reduction of testosterone, the risk of PCa would increase by 14% [adjusted odds ratio, 1.14; 95% confidence interval (CI), 1.03-1.25]. Compared to patients with a relatively stable testosterone, patients with an annual testosterone reduction of more than 30 ng/dL had 5.03 [95% CI: 1.53, 16.55] fold increase in PCa risk. This implies a faster age-related reduction in, but not absolute level of serum total testosterone as a risk factor for PCa. Further longitudinal studies are needed to confirm this finding. © 2017 UICC.
Xu, Ning; Wu, Yu-Peng; Chen, Dong-Ning; Ke, Zhi-Bin; Cai, Hai; Wei, Yong; Zheng, Qing-Shui; Huang, Jin-Bei; Li, Xiao-Dong; Xue, Xue-Yi
2018-05-01
To explore the value of Prostate Imaging Reporting and Data System Version 2 (PI-RADS v2) for predicting prostate biopsy results in patients with prostate specific antigen (PSA) levels of 4-10 ng/ml. We retrospectively reviewed multi-parameter magnetic resonance images from 528 patients with PSA levels of 4-10 ng/ml who underwent transrectal ultrasound-guided prostate biopsies between May 2015 and May 2017. Among them, 137 were diagnosed with prostate cancer (PCa), and we further subdivided them according to pathological results into the significant PCa (S-PCa) and insignificant significant PCa (Ins-PCa) groups (121 cases were defined by surgical pathological specimen and 16 by biopsy). Age, PSA, percent free PSA, PSA density (PSAD), prostate volume (PV), and PI-RADS score were collected. Logistic regression analysis was performed to determine predictors of pathological results. Receiver operating characteristic curves were constructed to analyze the diagnostic value of PI-RADS v2 in PCa. Multivariate analysis indicated that age, PV, percent free PSA, and PI-RADS score were independent predictors of biopsy findings, while only PI-RADS score was an independent predictor of S-PCa (P < 0.05). The areas under the receiver operating characteristic curve for diagnosing PCa with respect to age, PV, percent free PSA, and PI-RADS score were 0.570, 0.430, 0.589 and 0.836, respectively. The area under the curve for diagnosing S-PCa with respect to PI-RADS score was 0.732. A PI-RADS score of 3 was the best cutoff for predicting PCa, and 4 was the best cutoff for predicting S-PCa. Thus, 92.8% of patients with PI-RADS scores of 1-2 would have avoided biopsy, but at the cost of missing 2.2% of the potential PCa cases. Similarly, 83.82% of patients with a PI-RADS score ≤ 3 would have avoided biopsy, but at the cost of missing 3.3% of the potential S-PCa cases. PI-RADS v2 could be used to reduce unnecessary prostate biopsies in patients with PSA levels of 4-10 ng/ml.
Fourier Transform Infrared Spectroscopy as a Tool in Analysis of Proteus mirabilis Endotoxins.
Żarnowiec, Paulina; Czerwonka, Grzegorz; Kaca, Wiesław
2017-01-01
Fourier transform infrared spectroscopy (FT-IR) was used to scan whole bacterial cells as well as lipopolysaccharides (LPSs, endotoxins) isolated from them. Proteus mirabilis cells, with chemically defined LPSs, served as a model for the ATR FT-IR method. The paper focuses on three steps of infrared spectroscopy: (1) sample preparation, (2) IR scanning, and (3) multivariate analysis of IR data (principal component analysis, PCA).
Jo, Jung Ku; Oh, Jong Jin; Kim, Yong Tae; Moon, Hong Sang; Choi, Hong Yong; Park, Seunghyun; Ho, Jin-Nyoung; Yoon, Sungroh; Park, Hae Young; Byun, Seok-Soo
2017-11-14
Genetic variation which related with progression to castration-resistant prostate cancer (CRPC) during androgen-deprivation therapy (ADT) has not been elucidated in patients with metastatic prostate cancer (mPCa). Therefore, we assessed the association between genetic variats in mPCa and progession to CRPC. Analysis of exome genotypes revealed that 42 SNPs were significantly associated with mPCa. The top five polymorphisms were statistically significantly associated with metastatic disease. In addition, one of these SNPs, rs56350726, was significantly associated with time to CRPC in Kaplan-Meier analysis (Log-rank test, p = 0.011). In multivariable Cox regression, rs56350726 was strongly associated with progression to CRPC (HR = 4.172 95% CI = 1.223-14.239, p = 0.023). We assessed genetic variation among 1000 patients with PCa with or without metastasis, using 242,221 single nucleotide polymorphisms (SNPs) on the custom HumanExome BeadChip v1.0 (Illuminam Inc.). We analyzed the time to CRPC in 110 of the 1000 patients who were treated with ADT. Genetic data were analyzed using unconditional logistic regression and odds ratios calculated as estimates of relative risk of metastasis. We identified SNPs associated with metastasis and analyzed the relationship between these SNPs and time to CRPC in mPCa. Based on a genetic variation, the five top SNPs were observed to associate with mPCa. And one (SLC28A3, rs56350726) of five SNP was found the association with the progression to CRPC in patients with mPCa.
Ardila, Jorge Armando; Funari, Cristiano Soleo; Andrade, André Marques; Cavalheiro, Alberto José; Carneiro, Renato Lajarim
2015-01-01
Bauhinia forficata Link. is recognised by the Brazilian Health Ministry as a treatment of hypoglycemia and diabetes. Analytical methods are useful to assess the plant identity due the similarities found in plants from Bauhinia spp. HPLC-UV/PDA in combination with chemometric tools is an alternative widely used and suitable for authentication of plant material, however, the shifts of retention times for similar compounds in different samples is a problem. To perform comparisons between the authentic medicinal plant (Bauhinia forficata Link.) and samples commercially available in drugstores claiming to be "Bauhinia spp. to treat diabetes" and to evaluate the performance of multivariate curve resolution - alternating least squares (MCR-ALS) associated to principal component analysis (PCA) when compared to pure PCA. HPLC-UV/PDA data obtained from extracts of leaves were evaluated employing a combination of MCR-ALS and PCA, which allowed the use of the full chromatographic and spectrometric information without the need of peak alignment procedures. The use of MCR-ALS/PCA showed better results than the conventional PCA using only one wavelength. Only two of nine commercial samples presented characteristics similar to the authentic Bauhinia forficata spp., considering the full HPLC-UV/PDA data. The combination of MCR-ALS and PCA is very useful when applied to a group of samples where a general alignment procedure could not be applied due to the different chromatographic profiles. This work also demonstrates the need of more strict control from the health authorities regarding herbal products available on the market. Copyright © 2015 John Wiley & Sons, Ltd.
Vanderhaeghe, F; Smolders, A J P; Roelofs, J G M; Hoffmann, M
2012-03-01
Selecting an appropriate variable subset in linear multivariate methods is an important methodological issue for ecologists. Interest often exists in obtaining general predictive capacity or in finding causal inferences from predictor variables. Because of a lack of solid knowledge on a studied phenomenon, scientists explore predictor variables in order to find the most meaningful (i.e. discriminating) ones. As an example, we modelled the response of the amphibious softwater plant Eleocharis multicaulis using canonical discriminant function analysis. We asked how variables can be selected through comparison of several methods: univariate Pearson chi-square screening, principal components analysis (PCA) and step-wise analysis, as well as combinations of some methods. We expected PCA to perform best. The selected methods were evaluated through fit and stability of the resulting discriminant functions and through correlations between these functions and the predictor variables. The chi-square subset, at P < 0.05, followed by a step-wise sub-selection, gave the best results. In contrast to expectations, PCA performed poorly, as so did step-wise analysis. The different chi-square subset methods all yielded ecologically meaningful variables, while probable noise variables were also selected by PCA and step-wise analysis. We advise against the simple use of PCA or step-wise discriminant analysis to obtain an ecologically meaningful variable subset; the former because it does not take into account the response variable, the latter because noise variables are likely to be selected. We suggest that univariate screening techniques are a worthwhile alternative for variable selection in ecology. © 2011 German Botanical Society and The Royal Botanical Society of the Netherlands.
NASA Astrophysics Data System (ADS)
Anggraeni, Anni; Arianto, Fernando; Mutalib, Abdul; Pratomo, Uji; Bahti, Husein H.
2017-05-01
Rare Earth Elements (REE) are elements that a lot of function for life, such as metallurgy, optical devices, and manufacture of electronic devices. Sources of REE is present in the mineral, in which each element has similar properties. Currently, to determining the content of REE is used instruments such as ICP-OES, ICP-MS, XRF, and HPLC. But in each instruments, there are still have some weaknesses. Therefore we need an alternative analytical method for the determination of rare earth metal content, one of them is by a combination of UV-Visible spectrophotometry and multivariate analysis, including Principal Component Analysis (PCA), Principal Component Regression (PCR), and Partial Least Square Regression (PLS). The purpose of this experiment is to determine the content of light and medium rare earth elements in the mineral monazite without chemical separation by using a combination of multivariate analysis and UV-Visible spectrophotometric methods. Training set created 22 variations of concentration and absorbance was measured using a UV-Vis spectrophotometer, then the data is processed by PCA, PCR, and PLSR. The results were compared and validated to obtain the mathematical equation with the smallest percent error. From this experiment, mathematical equation used PLS methods was better than PCR after validated, which has RMSE value for La, Ce, Pr, Nd, Gd, Sm, Eu, and Tb respectively 0.095; 0.573; 0.538; 0.440; 3.387; 1.240; 1.870; and 0.639.
The Burden of Urinary Incontinence and Urinary Bother Among Elderly Prostate Cancer Survivors
Kopp, Ryan P.; Marshall, Lynn M.; Wang, Patty Y.; Bauer, Douglas C.; Barrett-Connor, Elizabeth; Parsons, J. Kellogg
2014-01-01
Background Data describing urinary health in elderly, community-dwelling prostate cancer (PCa) survivors are limited. Objective To elucidate the prevalence of lower urinary tract symptoms, urinary bother, and incontinence in elderly PCa survivors compared with peers without PCa. Design, setting, and participants A cross-sectional analysis of 5990 participants in the Osteoporotic Fractures in Men Research Group, a cohort study of community-dwelling men ≥65 yr. Outcome measurements and statistical analysis We characterized urinary health using self-reported urinary incontinence and the American Urological Association Symptom Index (AUA-SI). We compared urinary health measures according to type of PCa treatment in men with PCa and men without PCa using multivariate log-binomial regression to generate prevalence ratios (PRs). Results and limitations At baseline, 706 men (12%) reported a history of PCa, with a median time since diagnosis of 6.3 yr. Of these men, 426 (60%) reported urinary incontinence. In adjusted analyses, observation (PR: 1.92; 95% confidence interval [CI], 1.15–3.21; p = 0.01), surgery (PR: 4.68; 95% CI, 4.11–5.32; p < 0.0001), radiation therapy (PR: 1.64; 95% CI, 1.20– 2.23; p = 0.002), and androgen-deprivation therapy (ADT) (PR: 2.01; 95% CI, 1.35–2.99; p = 0.0006) were each associated with daily incontinence. Daily incontinence risk increased with time since diagnosis independently of age. Observation (PR: 1.33; 95% CI, 1.00–1.78; p = 0.05), surgery (PR: 1.25; 95% CI, 1.10–1.42; p = 0.0008), and ADT (PR: 1.50; 95% CI, 1.26–1.79; p < 0.0001) were associated with increased AUA-SI bother scores. Cancer stage and use of adjuvant or salvage therapies were not available for analysis. Conclusions Compared with their peers without PCa, elderly PCa survivors had a two-fold to five-fold greater prevalence of urinary incontinence, which rose with increasing survivorship duration. Observation, surgery, and ADT were each associated with increased urinary bother. These data suggest a substantially greater burden of urinary health problems among elderly PCa survivors than previously recognized. PMID:23587870
Owari, Takuya; Miyake, Makito; Nakai, Yasushi; Morizawa, Yosuke; Itami, Yoshitaka; Hori, Shunta; Anai, Satoshi; Tanaka, Nobumichi; Fujimoto, Kiyohide
2018-06-06
The objective of the present study was to report the incidence of skeletal-related events (SREs) and identify risk factors for SREs in patients with genitourinary cancer with newly diagnosed bone metastasis. This retrospective study included 180 patients with bone metastasis from prostate cancer (PCa; n = 111), renal cell carcinoma (RCC; n = 43), and urothelial carcinoma (UC; n = 26). Clinical factors at the time of diagnosis of bone metastasis were evaluated with Cox proportional hazards regression analysis to identify independent risk factors for SREs. During follow-up, 29 (26%) patients with PCa, 30 (70%) with RCC, and 15 (58%) with UC developed SREs. Treatment with bone-modifying agents (BMAs) before the development of SREs and within 6 months from the diagnosis of bone metastasis significantly delayed the time to first SRE as compared to nonuse of BMAs. Multivariate analysis identified type of primary cancer (PCa vs. RCC, PCa vs. UC), performance status, and bone pain as significant independent predictive risk factors for SREs. Treatment with BMAs significantly delayed the development of first SREs. The identified predictors of SREs might be useful to select patients who would benefit most from early treatment with BMAs. © 2018 S. Karger AG, Basel.
Roopwani, Rahul; Buckner, Ira S
2011-10-14
Principal component analysis (PCA) was applied to pharmaceutical powder compaction. A solid fraction parameter (SF(c/d)) and a mechanical work parameter (W(c/d)) representing irreversible compression behavior were determined as functions of applied load. Multivariate analysis of the compression data was carried out using PCA. The first principal component (PC1) showed loadings for the solid fraction and work values that agreed with changes in the relative significance of plastic deformation to consolidation at different pressures. The PC1 scores showed the same rank order as the relative plasticity ranking derived from the literature for common pharmaceutical materials. The utility of PC1 in understanding deformation was extended to binary mixtures using a subset of the original materials. Combinations of brittle and plastic materials were characterized using the PCA method. The relationships between PC1 scores and the weight fractions of the mixtures were typically linear showing ideal mixing in their deformation behaviors. The mixture consisting of two plastic materials was the only combination to show a consistent positive deviation from ideality. The application of PCA to solid fraction and mechanical work data appears to be an effective means of predicting deformation behavior during compaction of simple powder mixtures. Copyright © 2011 Elsevier B.V. All rights reserved.
Willard, Melissa A Bodnar; McGuffin, Victoria L; Smith, Ruth Waddell
2012-01-01
Salvia divinorum is a hallucinogenic herb that is internationally regulated. In this study, salvinorin A, the active compound in S. divinorum, was extracted from S. divinorum plant leaves using a 5-min extraction with dichloromethane. Four additional Salvia species (Salvia officinalis, Salvia guaranitica, Salvia splendens, and Salvia nemorosa) were extracted using this procedure, and all extracts were analyzed by gas chromatography-mass spectrometry. Differentiation of S. divinorum from other Salvia species was successful based on visual assessment of the resulting chromatograms. To provide a more objective comparison, the total ion chromatograms (TICs) were subjected to principal components analysis (PCA). Prior to PCA, the TICs were subjected to a series of data pretreatment procedures to minimize non-chemical sources of variance in the data set. Successful discrimination of S. divinorum from the other four Salvia species was possible based on visual assessment of the PCA scores plot. To provide a numerical assessment of the discrimination, a series of statistical procedures such as Euclidean distance measurement, hierarchical cluster analysis, Student's t tests, Wilcoxon rank-sum tests, and Pearson product moment correlation were also applied to the PCA scores. The statistical procedures were then compared to determine the advantages and disadvantages for forensic applications.
Ciucci, Sara; Ge, Yan; Durán, Claudio; Palladini, Alessandra; Jiménez-Jiménez, Víctor; Martínez-Sánchez, Luisa María; Wang, Yuting; Sales, Susanne; Shevchenko, Andrej; Poser, Steven W.; Herbig, Maik; Otto, Oliver; Androutsellis-Theotokis, Andreas; Guck, Jochen; Gerl, Mathias J.; Cannistraci, Carlo Vittorio
2017-01-01
Omic science is rapidly growing and one of the most employed techniques to explore differential patterns in omic datasets is principal component analysis (PCA). However, a method to enlighten the network of omic features that mostly contribute to the sample separation obtained by PCA is missing. An alternative is to build correlation networks between univariately-selected significant omic features, but this neglects the multivariate unsupervised feature compression responsible for the PCA sample segregation. Biologists and medical researchers often prefer effective methods that offer an immediate interpretation to complicated algorithms that in principle promise an improvement but in practice are difficult to be applied and interpreted. Here we present PC-corr: a simple algorithm that associates to any PCA segregation a discriminative network of features. Such network can be inspected in search of functional modules useful in the definition of combinatorial and multiscale biomarkers from multifaceted omic data in systems and precision biomedicine. We offer proofs of PC-corr efficacy on lipidomic, metagenomic, developmental genomic, population genetic, cancer promoteromic and cancer stem-cell mechanomic data. Finally, PC-corr is a general functional network inference approach that can be easily adopted for big data exploration in computer science and analysis of complex systems in physics. PMID:28287094
PCA based clustering for brain tumor segmentation of T1w MRI images.
Kaya, Irem Ersöz; Pehlivanlı, Ayça Çakmak; Sekizkardeş, Emine Gezmez; Ibrikci, Turgay
2017-03-01
Medical images are huge collections of information that are difficult to store and process consuming extensive computing time. Therefore, the reduction techniques are commonly used as a data pre-processing step to make the image data less complex so that a high-dimensional data can be identified by an appropriate low-dimensional representation. PCA is one of the most popular multivariate methods for data reduction. This paper is focused on T1-weighted MRI images clustering for brain tumor segmentation with dimension reduction by different common Principle Component Analysis (PCA) algorithms. Our primary aim is to present a comparison between different variations of PCA algorithms on MRIs for two cluster methods. Five most common PCA algorithms; namely the conventional PCA, Probabilistic Principal Component Analysis (PPCA), Expectation Maximization Based Principal Component Analysis (EM-PCA), Generalize Hebbian Algorithm (GHA), and Adaptive Principal Component Extraction (APEX) were applied to reduce dimensionality in advance of two clustering algorithms, K-Means and Fuzzy C-Means. In the study, the T1-weighted MRI images of the human brain with brain tumor were used for clustering. In addition to the original size of 512 lines and 512 pixels per line, three more different sizes, 256 × 256, 128 × 128 and 64 × 64, were included in the study to examine their effect on the methods. The obtained results were compared in terms of both the reconstruction errors and the Euclidean distance errors among the clustered images containing the same number of principle components. According to the findings, the PPCA obtained the best results among all others. Furthermore, the EM-PCA and the PPCA assisted K-Means algorithm to accomplish the best clustering performance in the majority as well as achieving significant results with both clustering algorithms for all size of T1w MRI images. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Ruela-de-Sousa, Roberta R; Hoekstra, Elmer; Hoogland, A Marije; Queiroz, Karla C Souza; Peppelenbosch, Maikel P; Stubbs, Andrew P; Pelizzaro-Rocha, Karin; van Leenders, Geert J L H; Jenster, Guido; Aoyama, Hiroshi; Ferreira, Carmen V; Fuhler, Gwenny M
2016-04-01
Low-risk patients suffering from prostate cancer (PCa) are currently placed under active surveillance rather than undergoing radical prostatectomy. However, clear parameters for selecting the right patient for each strategy are not available, and new biomarkers and treatment modalities are needed. Low-molecular-weight protein tyrosine phosphatase (LMWPTP) could present such a target. To correlate expression levels of LMWPTP in primary PCa to clinical outcome, and determine the role of LMWPTP in prostate tumor cell biology. Acid phosphatase 1, soluble (ACP1) expression was analyzed on microarray data sets, which were subsequently used in Ingenuity Pathway Analysis. Immunohistochemistry was performed on a tissue microarray containing material of 481 PCa patients whose clinicopathologic data were recorded. PCa cell line models were used to investigate the role of LMWPTP in cell proliferation, migration, adhesion, and anoikis resistance. The association between LMWPTP expression and clinical and pathologic outcomes was calculated using chi-square correlations and multivariable Cox regression analysis. Functional consequences of LMWPTP overexpression or downregulation were determined using migration and adhesion assays, confocal microscopy, Western blotting, and proliferation assays. LMWPTP expression was significantly increased in human PCa and correlated with earlier recurrence of disease (hazard ratio [HR]:1.99; p<0.001) and reduced patient survival (HR: 1.53; p=0.04). Unbiased Ingenuity analysis comparing cancer and normal prostate suggests migratory propensities in PCa. Indeed, overexpression of LMWPTP increases PCa cell migration, anoikis resistance, and reduces activation of focal adhesion kinase/paxillin, corresponding to decreased adherence. Overexpression of LMWPTP in PCa confers a malignant phenotype with worse clinical outcome. Prospective follow-up should determine the clinical potential of LMWPTP overexpression. These findings implicate low-molecular-weight protein tyrosine phosphatase as a novel oncogene in prostate cancer and could offer the possibility of using this protein as biomarker or target for treatment of this disease. Copyright © 2015 European Association of Urology. Published by Elsevier B.V. All rights reserved.
Mishra, Gautam; Easton, Christopher D.; McArthur, Sally L.
2009-01-01
Physical and photolithographic techniques are commonly used to create chemical patterns for a range of technologies including cell culture studies, bioarrays and other biomedical applications. In this paper, we describe the fabrication of chemical micropatterns from commonly used plasma polymers. Atomic force microcopy (AFM) imaging, Time-of-Flight Static Secondary Ion Mass Spectrometry (ToF-SSIMS) imaging and multivariate analysis have been employed to visualize the chemical boundaries created by these patterning techniques and assess the spatial and chemical resolution of the patterns. ToF-SSIMS analysis demonstrated that well defined chemical and spatial boundaries were obtained from photolithographic patterning, while the resolution of physical patterning via a transmission electron microscopy (TEM) grid varied depending on the properties of the plasma system including the substrate material. In general, physical masking allowed diffusion of the plasma species below the mask and bleeding of the surface chemistries. Multivariate analysis techniques including Principal Component Analysis (PCA) and Region of Interest (ROI) assessment were used to investigate the ToF-SSIMS images of a range of different plasma polymer patterns. In the most challenging case, where two strongly reacting polymers, allylamine and acrylic acid were deposited, PCA confirmed the fabrication of micropatterns with defined spatial resolution. ROI analysis allowed for the identification of an interface between the two plasma polymers for patterns fabricated using the photolithographic technique which has been previously overlooked. This study clearly demonstrated the versatility of photolithographic patterning for the production of multichemistry plasma polymer arrays and highlighted the need for complimentary characterization and analytical techniques during the fabrication plasma polymer micropatterns. PMID:19950941
Tailored multivariate analysis for modulated enhanced diffraction
Caliandro, Rocco; Guccione, Pietro; Nico, Giovanni; ...
2015-10-21
Modulated enhanced diffraction (MED) is a technique allowing the dynamic structural characterization of crystalline materials subjected to an external stimulus, which is particularly suited forin situandoperandostructural investigations at synchrotron sources. Contributions from the (active) part of the crystal system that varies synchronously with the stimulus can be extracted by an offline analysis, which can only be applied in the case of periodic stimuli and linear system responses. In this paper a new decomposition approach based on multivariate analysis is proposed. The standard principal component analysis (PCA) is adapted to treat MED data: specific figures of merit based on their scoresmore » and loadings are found, and the directions of the principal components obtained by PCA are modified to maximize such figures of merit. As a result, a general method to decompose MED data, called optimum constrained components rotation (OCCR), is developed, which produces very precise results on simulated data, even in the case of nonperiodic stimuli and/or nonlinear responses. Furthermore, the multivariate analysis approach is able to supply in one shot both the diffraction pattern related to the active atoms (through the OCCR loadings) and the time dependence of the system response (through the OCCR scores). Furthermore, when applied to real data, OCCR was able to supply only the latter information, as the former was hindered by changes in abundances of different crystal phases, which occurred besides structural variations in the specific case considered. In order to develop a decomposition procedure able to cope with this combined effect represents the next challenge in MED analysis.« less
NASA Technical Reports Server (NTRS)
Rampe, E. B.; Lanza, N. L.
2012-01-01
Orbital near-infrared (NIR) reflectance spectra of the martian surface from the OMEGA and CRISM instruments have identified a variety of phyllosilicates in Noachian terrains. The types of phyllosilicates present on Mars have important implications for the aqueous environments in which they formed, and, thus, for recognizing locales that may have been habitable. Current identifications of phyllosilicates from martian NIR data are based on the positions of spectral absorptions relative to laboratory data of well-characterized samples and from spectral ratios; however, some phyllosilicates can be difficult to distinguish from one another with these methods (i.e. illite vs. muscovite). Here we employ a multivariate statistical technique, principal component analysis (PCA), to differentiate between spectrally similar phyllosilicate minerals. PCA is commonly used in a variety of industries (pharmaceutical, agricultural, viticultural) to discriminate between samples. Previous work using PCA to analyze raw NIR reflectance data from mineral mixtures has shown that this is a viable technique for identifying mineral types, abundances, and particle sizes. Here, we evaluate PCA of second-derivative NIR reflectance data as a method for classifying phyllosilicates and test whether this method can be used to identify phyllosilicates on Mars.
Oral contraceptive use is associated with prostate cancer: an ecological study
Fleshner, Neil E
2011-01-01
Background Several recent studies have suggested that oestrogen exposure may increase the risk of prostate cancer (PCa). Objectives To examine associations between PCa incidence and mortality and population-based use of oral contraceptives (OCs). It was hypothesised that OC by-products may cause environmental contamination, leading to an increased low level oestrogen exposure and therefore higher PCa incidence and mortality. Methods The hypothesis was tested in an ecological study. Data from the International Agency for Research on Cancer were used to retrieve age-standardised rates of prostate cancer in 2007, and data from the United Nations World Contraceptive Use 2007 report were used to retrieve data on contraceptive use. A Pearson correlation and multivariable linear regression were used to associate the percentage of women using OCs, intrauterine devices, condoms or vaginal barriers to the age standardised prostate cancer incidence and mortality. These analyses were performed by individual nations and by continents worldwide. Results OC use was significantly associated with prostate cancer incidence and mortality in the individual nations worldwide (r=0.61 and r=0.53, respectively; p<0.05 for all). PCa incidence was also associated with OC use in Europe (r=0.545, p<0.05) and by continent (r=0.522, p<0.05). All other forms of contraceptives (ie, intra-uterine devices, condoms or vaginal barriers) were not correlated with prostate cancer incidence or mortality. On multivariable analysis the correlation with OC was independent of a nation's wealth. Conclusion A significant association between OCs and PCa has been shown. It is hypothesised that the OC effect may be mediated through environmental oestrogen levels; this novel concept is worth further investigation. PMID:22102643
Oral contraceptive use is associated with prostate cancer: an ecological study.
Margel, David; Fleshner, Neil E
2011-01-01
Background Several recent studies have suggested that oestrogen exposure may increase the risk of prostate cancer (PCa). Objectives To examine associations between PCa incidence and mortality and population-based use of oral contraceptives (OCs). It was hypothesised that OC by-products may cause environmental contamination, leading to an increased low level oestrogen exposure and therefore higher PCa incidence and mortality. Methods The hypothesis was tested in an ecological study. Data from the International Agency for Research on Cancer were used to retrieve age-standardised rates of prostate cancer in 2007, and data from the United Nations World Contraceptive Use 2007 report were used to retrieve data on contraceptive use. A Pearson correlation and multivariable linear regression were used to associate the percentage of women using OCs, intrauterine devices, condoms or vaginal barriers to the age standardised prostate cancer incidence and mortality. These analyses were performed by individual nations and by continents worldwide. Results OC use was significantly associated with prostate cancer incidence and mortality in the individual nations worldwide (r=0.61 and r=0.53, respectively; p<0.05 for all). PCa incidence was also associated with OC use in Europe (r=0.545, p<0.05) and by continent (r=0.522, p<0.05). All other forms of contraceptives (ie, intra-uterine devices, condoms or vaginal barriers) were not correlated with prostate cancer incidence or mortality. On multivariable analysis the correlation with OC was independent of a nation's wealth. Conclusion A significant association between OCs and PCa has been shown. It is hypothesised that the OC effect may be mediated through environmental oestrogen levels; this novel concept is worth further investigation.
Huang, Jun; Kaul, Goldi; Cai, Chunsheng; Chatlapalli, Ramarao; Hernandez-Abad, Pedro; Ghosh, Krishnendu; Nagi, Arwinder
2009-12-01
To facilitate an in-depth process understanding, and offer opportunities for developing control strategies to ensure product quality, a combination of experimental design, optimization and multivariate techniques was integrated into the process development of a drug product. A process DOE was used to evaluate effects of the design factors on manufacturability and final product CQAs, and establish design space to ensure desired CQAs. Two types of analyses were performed to extract maximal information, DOE effect & response surface analysis and multivariate analysis (PCA and PLS). The DOE effect analysis was used to evaluate the interactions and effects of three design factors (water amount, wet massing time and lubrication time), on response variables (blend flow, compressibility and tablet dissolution). The design space was established by the combined use of DOE, optimization and multivariate analysis to ensure desired CQAs. Multivariate analysis of all variables from the DOE batches was conducted to study relationships between the variables and to evaluate the impact of material attributes/process parameters on manufacturability and final product CQAs. The integrated multivariate approach exemplifies application of QbD principles and tools to drug product and process development.
Urbini, Marco; Petito, Valentina; de Notaristefani, Francesco; Scaldaferri, Franco; Gasbarrini, Antonio; Tortora, Luca
2017-10-01
Here, time of flight secondary ion mass spectrometry (ToF-SIMS) and multivariate analysis were combined to study the role of ulcerative colitis (UC), a type of inflammatory bowel disease (IBD), in the colon cancer progression. ToF-SIMS was used to obtain mass spectra and chemical maps from the mucosal surface of human normal (NC), inflamed (IC), and dysplastic (DC) colon tissues. Chemical mapping with a lateral resolution of ≈ 1 μm allowed to evaluate zonation of fatty acids and amino acids as well as the morphological condition of the intestinal glands. High mass resolution ToF-SIMS spectra showed chemical differences in lipid and amino acid composition as a function of pathological state. In positive ion mode, mono- (MAG), di- (DAG), and triacylglycerol (TAG) signals were detected in NC tissues, while in IC and DC tissues, the only cholesterol was present as lipid class representative. Signals from fatty acids, collected in negative ion mode, were subjected to principal component analysis (PCA). PCA showed a strict correlation between IC and DC samples, due to an increase of stearic, arachidonic, and linoleic acid. In the same way, differences in the amino acid composition were highlighted through multivariate analysis. PCA revealed that glutamic acid, leucine/isoleucine, and valine fragments are related to IC tissues. On the other hand, tyrosine, methionine, and tryptophan peaks contributed highly to the separation of DC tissues. Finally, a classification of NC, IC, and DC patients was also achieved through hierarchical cluster analysis of amino acid fragments. In this case, human colonic inflammation showed a stronger relationship with normal than dysplastic condition. Graphical Abstract ᅟ.
Wan, Yongshan; Qian, Yun; Migliaccio, Kati White; Li, Yuncong; Conrad, Cecilia
2014-03-01
Most studies using multivariate techniques for pollution source evaluation are conducted in free-flowing rivers with distinct point and nonpoint sources. This study expanded on previous research to a managed "canal" system discharging into the Indian River Lagoon, Florida, where water and land management is the single most important anthropogenic factor influencing water quality. Hydrometric and land use data of four drainage basins were uniquely integrated into the analysis of 25 yr of monthly water quality data collected at seven stations to determine the impact of water and land management on the spatial variability of water quality. Cluster analysis (CA) classified seven monitoring stations into four groups (CA groups). All water quality parameters identified by discriminant analysis showed distinct spatial patterns among the four CA groups. Two-step principal component analysis/factor analysis (PCA/FA) was conducted with (i) water quality data alone and (ii) water quality data in conjunction with rainfall, flow, and land use data. The results indicated that PCA/FA of water quality data alone was unable to identify factors associated with management activities. The addition of hydrometric and land use data into PCA/FA revealed close associations of nutrients and color with land management and storm-water retention in pasture and citrus lands; total suspended solids, turbidity, and NO + NO with flow and Lake Okeechobee releases; specific conductivity with supplemental irrigation supply; and dissolved O with wetland preservation. The practical implication emphasizes the importance of basin-specific land and water management for ongoing pollutant loading reduction and ecosystem restoration programs. Copyright © by the American Society of Agronomy, Crop Science Society of America, and Soil Science Society of America, Inc.
Liu, Zechang; Wang, Liping; Liu, Yumei
2018-01-18
Hops impart flavor to beer, with the volatile components characterizing the various hop varieties and qualities. Fingerprinting, especially flavor fingerprinting, is often used to identify 'flavor products' because inconsistencies in the description of flavor may lead to an incorrect definition of beer quality. Compared to flavor fingerprinting, volatile fingerprinting is simpler and easier. We performed volatile fingerprinting using head space-solid phase micro-extraction gas chromatography-mass spectrometry combined with similarity analysis and principal component analysis (PCA) for evaluating and distinguishing between three major Chinese hops. Eighty-four volatiles were identified, which were classified into seven categories. Volatile fingerprinting based on similarity analysis did not yield any obvious result. By contrast, hop varieties and qualities were identified using volatile fingerprinting based on PCA. The potential variables explained the variance in the three hop varieties. In addition, the dendrogram and principal component score plot described the differences and classifications of hops. Volatile fingerprinting plus multivariate statistical analysis can rapidly differentiate between the different varieties and qualities of the three major Chinese hops. Furthermore, this method can be used as a reference in other fields. © 2018 Society of Chemical Industry. © 2018 Society of Chemical Industry.
Jankovic, Marko; Ogawa, Hidemitsu
2004-10-01
Principal Component Analysis (PCA) and Principal Subspace Analysis (PSA) are classic techniques in statistical data analysis, feature extraction and data compression. Given a set of multivariate measurements, PCA and PSA provide a smaller set of "basis vectors" with less redundancy, and a subspace spanned by them, respectively. Artificial neurons and neural networks have been shown to perform PSA and PCA when gradient ascent (descent) learning rules are used, which is related to the constrained maximization (minimization) of statistical objective functions. Due to their low complexity, such algorithms and their implementation in neural networks are potentially useful in cases of tracking slow changes of correlations in the input data or in updating eigenvectors with new samples. In this paper we propose PCA learning algorithm that is fully homogeneous with respect to neurons. The algorithm is obtained by modification of one of the most famous PSA learning algorithms--Subspace Learning Algorithm (SLA). Modification of the algorithm is based on Time-Oriented Hierarchical Method (TOHM). The method uses two distinct time scales. On a faster time scale PSA algorithm is responsible for the "behavior" of all output neurons. On a slower scale, output neurons will compete for fulfillment of their "own interests". On this scale, basis vectors in the principal subspace are rotated toward the principal eigenvectors. At the end of the paper it will be briefly analyzed how (or why) time-oriented hierarchical method can be used for transformation of any of the existing neural network PSA method, into PCA method.
Guan, Yangbo; Wu, You; Liu, Yifei; Ni, Jian; Nong, Shaojun
2016-08-01
Despite androgen deprivation therapy (ADT) remains the mainstay therapy for advanced prostate cancer (PCa), the patients have widely variable durations of response to ADT. Unfortunately, there is limited knowledge of pre-treatment prognostic factors for response to ADT. Recently, microRNA-21 (miR-21) has been reported to play an important role in development of castration resistance of CaP. However, little is known about the expression of miR-21 in advanced PCa biopsy tissues, and data on its potential predictive value in advanced PCa are completely lacking. In this study, paraffin-embedded prostate carcinoma tissues obtained by needle biopsy from 85 advanced PCa patients were evaluated for the expression levels of miR-21 by quantitative real-time PCR (qRT-PCR). In situ hybridization (ISH) analysis was performed to further confirm the qRT-PCR results. Kaplan-Meier analysis and Cox proportional hazards regression models were performed to investigate the correlation between miR-21 expression and time to progression of advanced PCa patients. Compared with adjacent non-cancerous prostate tissues, the expression level of miR-21 was significantly increased in PCa tissues (PCa vs. non-cancerous prostate: 1.3273 ± 0.3207 vs. 0.9970 ± 0.2054, P < 0.001). By and large, in ISH analysis miR-21 was expressed at a higher level in tumor areas than in adjacent non-cancerous areas. Additionally, PCa patients with higher expression of miR-21 were significantly more likely to be of high Gleason score and high clinical stage (P < 0.05). There was no significant association between miR-21 expression and the initial prostate-specific antigen (PSA) level or age at diagnosis. Moreover, Kaplan-Meier survival analysis found that PCa patients with high miR-21 expression have shorter progression-free survival than those with low miR-21 expression. Furthermore, Multivariate Cox analysis revealed both miR-21 expression status (P = 0.040) and clinical stage (P = 0.042) were all independent predictive factor for progression-free survival for advanced PCa. These findings suggest for the first time that the up-regulation of miR-21 may serve as an independent predictor of progress-free survival in patients with advanced PCa. Prostate 76:986-993, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Tada, Atsuko; Ishizuki, Kyoko; Sugimoto, Naoki; Yoshimatsu, Kayo; Kawahara, Nobuo; Suematsu, Takako; Arifuku, Kazunori; Fukai, Toshio; Tamura, Yukiyoshi; Ohtsuki, Takashi; Tahara, Maiko; Yamazaki, Takeshi; Akiyama, Hiroshi
2015-01-01
"Licorice oil extract" (LOE) (antioxidant agent) is described in the notice of Japanese food additive regulations as a material obtained from the roots and/or rhizomes of Glycyrrhiza uralensis, G. inflata or G. glabra. In this study, we aimed to identify the original Glycyrrhiza species of eight food additive products using LC/MS. Glabridin, a characteristic compound in G. glabra, was specifically detected in seven products, and licochalcone A, a characteristic compound in G. inflata, was detected in one product. In addition, Principal Component Analysis (PCA) (a kind of multivariate analysis) using the data of LC/MS or (1)H-NMR analysis was performed. The data of thirty-one samples, including LOE products used as food additives, ethanol extracts of various Glycyrrhiza species and commercially available Glycyrrhiza species-derived products were assessed. Based on the PCA results, the majority of LOE products was confirmed to be derived from G. glabra. This study suggests that PCA using (1)H-NMR analysis data is a simple and useful method to identify the plant species of origin of natural food additive products.
Panazzolo, Diogo G; Sicuro, Fernando L; Clapauch, Ruth; Maranhão, Priscila A; Bouskela, Eliete; Kraemer-Aguiar, Luiz G
2012-11-13
We aimed to evaluate the multivariate association between functional microvascular variables and clinical-laboratorial-anthropometrical measurements. Data from 189 female subjects (34.0 ± 15.5 years, 30.5 ± 7.1 kg/m2), who were non-smokers, non-regular drug users, without a history of diabetes and/or hypertension, were analyzed by principal component analysis (PCA). PCA is a classical multivariate exploratory tool because it highlights common variation between variables allowing inferences about possible biological meaning of associations between them, without pre-establishing cause-effect relationships. In total, 15 variables were used for PCA: body mass index (BMI), waist circumference, systolic and diastolic blood pressure (BP), fasting plasma glucose, levels of total cholesterol, high-density lipoprotein cholesterol (HDL-c), low-density lipoprotein cholesterol (LDL-c), triglycerides (TG), insulin, C-reactive protein (CRP), and functional microvascular variables measured by nailfold videocapillaroscopy. Nailfold videocapillaroscopy was used for direct visualization of nutritive capillaries, assessing functional capillary density, red blood cell velocity (RBCV) at rest and peak after 1 min of arterial occlusion (RBCV(max)), and the time taken to reach RBCV(max) (TRBCV(max)). A total of 35% of subjects had metabolic syndrome, 77% were overweight/obese, and 9.5% had impaired fasting glucose. PCA was able to recognize that functional microvascular variables and clinical-laboratorial-anthropometrical measurements had a similar variation. The first five principal components explained most of the intrinsic variation of the data. For example, principal component 1 was associated with BMI, waist circumference, systolic BP, diastolic BP, insulin, TG, CRP, and TRBCV(max) varying in the same way. Principal component 1 also showed a strong association among HDL-c, RBCV, and RBCV(max), but in the opposite way. Principal component 3 was associated only with microvascular variables in the same way (functional capillary density, RBCV and RBCV(max)). Fasting plasma glucose appeared to be related to principal component 4 and did not show any association with microvascular reactivity. In non-diabetic female subjects, a multivariate scenario of associations between classic clinical variables strictly related to obesity and metabolic syndrome suggests a significant relationship between these diseases and microvascular reactivity.
Boiret, Mathieu; de Juan, Anna; Gorretta, Nathalie; Ginot, Yves-Michel; Roger, Jean-Michel
2015-01-25
In this work, Raman hyperspectral images and multivariate curve resolution-alternating least squares (MCR-ALS) are used to study the distribution of actives and excipients within a pharmaceutical drug product. This article is mainly focused on the distribution of a low dose constituent. Different approaches are compared, using initially filtered or non-filtered data, or using a column-wise augmented dataset before starting the MCR-ALS iterative process including appended information on the low dose component. In the studied formulation, magnesium stearate is used as a lubricant to improve powder flowability. With a theoretical concentration of 0.5% (w/w) in the drug product, the spectral variance contained in the data is weak. By using a principal component analysis (PCA) filtered dataset as a first step of the MCR-ALS approach, the lubricant information is lost in the non-explained variance and its associated distribution in the tablet cannot be highlighted. A sufficient number of components to generate the PCA noise-filtered matrix has to be used in order to keep the lubricant variability within the data set analyzed or, otherwise, work with the raw non-filtered data. Different models are built using an increasing number of components to perform the PCA reduction. It is shown that the magnesium stearate information can be extracted from a PCA model using a minimum of 20 components. In the last part, a column-wise augmented matrix, including a reference spectrum of the lubricant, is used before starting MCR-ALS process. PCA reduction is performed on the augmented matrix, so the magnesium stearate contribution is included within the MCR-ALS calculations. By using an appropriate PCA reduction, with a sufficient number of components, or by using an augmented dataset including appended information on the low dose component, the distribution of the two actives, the two main excipients and the low dose lubricant are correctly recovered. Copyright © 2014 Elsevier B.V. All rights reserved.
Tahir, Haroon Elrasheid; Xiaobo, Zou; Xiaowei, Huang; Jiyong, Shi; Mariod, Abdalbasit Adam
2016-09-01
Aroma profiles of six honey varieties of different botanical origins were investigated using colorimetric sensor array, gas chromatography-mass spectrometry (GC-MS) and descriptive sensory analysis. Fifty-eight aroma compounds were identified, including 2 norisoprenoids, 5 hydrocarbons, 4 terpenes, 6 phenols, 7 ketones, 9 acids, 12 aldehydes and 13 alcohols. Twenty abundant or active compounds were chosen as key compounds to characterize honey aroma. Discrimination of the honeys was subsequently implemented using multivariate analysis, including hierarchical clustering analysis (HCA) and principal component analysis (PCA). Honeys of the same botanical origin were grouped together in the PCA score plot and HCA dendrogram. SPME-GC/MS and colorimetric sensor array were able to discriminate the honeys effectively with the advantages of being rapid, simple and low-cost. Moreover, partial least squares regression (PLSR) was applied to indicate the relationship between sensory descriptors and aroma compounds. Copyright © 2016 Elsevier Ltd. All rights reserved.
Peng, Shengmeng; Du, Tao; Wu, Wanhua; Chen, Xianju; Lai, Yiming; Zhu, Dingjun; Wang, Qiong; Ma, Xiaoming; Lin, Chunhao; Li, Zean; Guo, Zhenghui; Huang, Hai
2018-06-11
The aim of this study was to investigate the associations of serine proteinase inhibitor family G1 (SERPING1) down-regulation with poor prognosis in patients with prostate cancer (PCa). Furthermore, we aim to find more novel and effective PCa molecular markers to provide an early screening of PCa, distinguish patients with aggressive PCa, predict the prognosis, or reduce the economic burden of PCa. SERPING1 protein expression in both human PCa and normal prostate tissues was detected by immunohistochemical staining, which intensity was analyzed in association with clinical pathological parameters such Gleason score, pathological grade, clinical stage, tumor stage, lymph node metastasis, and distant metastasis. Moreover, we used The Cancer Genome Atlas (TCGA) Database, Taylor Database, and Oncomine dataset to validate our immunohistochemical results and investigated the value of SERPING1 in PCa at mRNA level. Kaplan-Meier analysis and Cox regression analysis were performed to evaluate the relationship between SERPING1 and prognosis of patients with PCa. The outcome showed that SERPING1 was expressed mainly in cytoplasm of grand cells of prostate tissue and was significantly expressed less in PCa (P<0.001). Furthermore, in the tissue microarray of our samples, decreasing expression of SERPING1 was correlated with the higher Gleason score (P = 0.004), the higher pathological grade (P = 0.01) and the advanced tumor stage (P = 0.005) at protein level. In TCGA dataset and Taylor Dataset, low-expressed SERPING1 was correlated with the younger patient (P = 0.02 in TCGA, P = 0.044 in Taylor) and the higher Gleason score (P = 0.019 in TCGA, P<0.001 in Taylor) at mRNA level. Kaplan-Meier analysis revealed that the lower mRNA of SERPING1 predicted lower overall survivals (P = 0.027 in TCGA), lower disease-free survival (P = 0.029) and lower biochemical recurrence-free survival (P = 0.011 in Taylor). Data from Oncomine database shown that SERPING1 low expression implying higher malignancy of prostate lesions. Using multivariate analysis, we also found that SERPING1 expression was independent prognostic marker of poor disease-free survival and biochemical recurrence-free survival. SERPING1 may play an important role in PCa and can be serve as a novel marker in diagnosis and prognostic prediction in PCa. In addition, levels of SERPING1 can help identify low-risk prostate to provide reference for patients with PCa to accept active surveillance and reduce overtreatment. Copyright © 2018 Elsevier Inc. All rights reserved.
Carleton, Neil M; Zhu, Guangjing; Gorbounov, Mikhail; Miller, M Craig; Pienta, Kenneth J; Resar, Linda M S; Veltri, Robert W
2018-05-01
There are few tissue-based biomarkers that can accurately predict prostate cancer (PCa) progression and aggressiveness. We sought to evaluate the clinical utility of prostate and breast overexpressed 1 (PBOV1) as a potential PCa biomarker. Patient tumor samples were designated by Grade Groups using the 2014 Gleason grading system. Primary radical prostatectomy tumors were obtained from 48 patients and evaluated for PBOV1 levels using Western blot analysis in matched cancer and benign cancer-adjacent regions. Immunohistochemical evaluation of PBOV1 was subsequently performed in 80 cancer and 80 benign cancer-adjacent patient samples across two tissue microarrays (TMAs) to verify protein levels in epithelial tissue and to assess correlation between PBOV1 proteins and nuclear architectural changes in PCa cells. Digital histomorphometric analysis was used to track 22 parameters that characterized nuclear changes in PBOV1-stained cells. Using a training and test set for validation, multivariate logistic regression (MLR) models were used to identify significant nuclear parameters that distinguish Grade Group 3 and above PCa from Grade Group 1 and 2 PCa regions. PBOV1 protein levels were increased in tumors from Grade Group 3 and above (GS 4 + 3 and ≥ 8) regions versus Grade Groups 1 and 2 (GS 3 + 3 and 3 + 4) regions (P = 0.005) as assessed by densitometry of immunoblots. Additionally, by immunoblotting, PBOV1 protein levels differed significantly between Grade Group 2 (GS 3 + 4) and Grade Group 3 (GS 4 + 3) PCa samples (P = 0.028). In the immunohistochemical analysis, measures of PBOV1 staining intensity strongly correlated with nuclear alterations in cancer cells. An MLR model retaining eight parameters describing PBOV1 staining intensity and nuclear architecture discriminated Grade Group 3 and above PCa from Grade Group 1 and 2 PCa and benign cancer-adjacent regions with a ROC-AUC of 0.90 and 0.80, respectively, in training and test sets. Our study demonstrates that the PBOV1 protein could be used to discriminate Grade Group 3 and above PCa. Additionally, the PBOV1 protein could be involved in modulating changes to the nuclear architecture of PCa cells. Confirmatory studies are warranted in an independent population for further validation. © 2018 Wiley Periodicals, Inc.
Eide, Ingvar; Westad, Frank
2018-01-01
A pilot study demonstrating real-time environmental monitoring with automated multivariate analysis of multi-sensor data submitted online has been performed at the cabled LoVe Ocean Observatory located at 258 m depth 20 km off the coast of Lofoten-Vesterålen, Norway. The major purpose was efficient monitoring of many variables simultaneously and early detection of changes and time-trends in the overall response pattern before changes were evident in individual variables. The pilot study was performed with 12 sensors from May 16 to August 31, 2015. The sensors provided data for chlorophyll, turbidity, conductivity, temperature (three sensors), salinity (calculated from temperature and conductivity), biomass at three different depth intervals (5-50, 50-120, 120-250 m), and current speed measured in two directions (east and north) using two sensors covering different depths with overlap. A total of 88 variables were monitored, 78 from the two current speed sensors. The time-resolution varied, thus the data had to be aligned to a common time resolution. After alignment, the data were interpreted using principal component analysis (PCA). Initially, a calibration model was established using data from May 16 to July 31. The data on current speed from two sensors were subject to two separate PCA models and the score vectors from these two models were combined with the other 10 variables in a multi-block PCA model. The observations from August were projected on the calibration model consecutively one at a time and the result was visualized in a score plot. Automated PCA of multi-sensor data submitted online is illustrated with an attached time-lapse video covering the relative short time period used in the pilot study. Methods for statistical validation, and warning and alarm limits are described. Redundant sensors enable sensor diagnostics and quality assurance. In a future perspective, the concept may be used in integrated environmental monitoring.
Wang, Chaolong; Zöllner, Sebastian; Rosenberg, Noah A.
2012-01-01
Multivariate statistical techniques such as principal components analysis (PCA) and multidimensional scaling (MDS) have been widely used to summarize the structure of human genetic variation, often in easily visualized two-dimensional maps. Many recent studies have reported similarity between geographic maps of population locations and MDS or PCA maps of genetic variation inferred from single-nucleotide polymorphisms (SNPs). However, this similarity has been evident primarily in a qualitative sense; and, because different multivariate techniques and marker sets have been used in different studies, it has not been possible to formally compare genetic variation datasets in terms of their levels of similarity with geography. In this study, using genome-wide SNP data from 128 populations worldwide, we perform a systematic analysis to quantitatively evaluate the similarity of genes and geography in different geographic regions. For each of a series of regions, we apply a Procrustes analysis approach to find an optimal transformation that maximizes the similarity between PCA maps of genetic variation and geographic maps of population locations. We consider examples in Europe, Sub-Saharan Africa, Asia, East Asia, and Central/South Asia, as well as in a worldwide sample, finding that significant similarity between genes and geography exists in general at different geographic levels. The similarity is highest in our examples for Asia and, once highly distinctive populations have been removed, Sub-Saharan Africa. Our results provide a quantitative assessment of the geographic structure of human genetic variation worldwide, supporting the view that geography plays a strong role in giving rise to human population structure. PMID:22927824
Wang, Chaolong; Zöllner, Sebastian; Rosenberg, Noah A
2012-08-01
Multivariate statistical techniques such as principal components analysis (PCA) and multidimensional scaling (MDS) have been widely used to summarize the structure of human genetic variation, often in easily visualized two-dimensional maps. Many recent studies have reported similarity between geographic maps of population locations and MDS or PCA maps of genetic variation inferred from single-nucleotide polymorphisms (SNPs). However, this similarity has been evident primarily in a qualitative sense; and, because different multivariate techniques and marker sets have been used in different studies, it has not been possible to formally compare genetic variation datasets in terms of their levels of similarity with geography. In this study, using genome-wide SNP data from 128 populations worldwide, we perform a systematic analysis to quantitatively evaluate the similarity of genes and geography in different geographic regions. For each of a series of regions, we apply a Procrustes analysis approach to find an optimal transformation that maximizes the similarity between PCA maps of genetic variation and geographic maps of population locations. We consider examples in Europe, Sub-Saharan Africa, Asia, East Asia, and Central/South Asia, as well as in a worldwide sample, finding that significant similarity between genes and geography exists in general at different geographic levels. The similarity is highest in our examples for Asia and, once highly distinctive populations have been removed, Sub-Saharan Africa. Our results provide a quantitative assessment of the geographic structure of human genetic variation worldwide, supporting the view that geography plays a strong role in giving rise to human population structure.
Chen, Cheng; Chen, Ye; Hu, Lin-Kun; Jiang, Chang-Chuan; Xu, Ren-Fang; He, Xiao-Zhou
2018-02-27
We evaluated the prognosis of the new grade groups and American Joint Committee on Cancer (AJCC) stage groups in men with prostate cancer (PCa) who were treated conservatively. A total of 13 798 eligible men were chosen from the Surveillance Epidemiology and End Results database. The new grade and AJCC stage groups were investigated on prostate biopsy specimens. Kaplan-Meier survival analysis and multivariable hazards models were applied to estimate the association of new grade and stage groups with overall survival (OS) and PCa-specific survival (CSS). Mean follow-up was 42.65 months (95% confidence interval: 42.47-42.84) in the entire cohort. The 3-year OS and CSS rates stepped down for grade groups 1-5 and AJCC stage groups I-IVB, respectively. After adjusting for clinical and pathological characteristics, all grade groups and AJCC stage groups were associated with higher all-cause and PCa-specific mortality compared to the reference group (all P ≤ 0.003). In conclusion, we evaluated the oncological outcome of the new grade and AJCC stage groups on biopsy specimens of conservatively treated PCa. These two novel clinically relevant classifications can assist physicians to determine different therapeutic strategies for PCa patients.
Sciutto, Giorgia; Oliveri, Paolo; Catelli, Emilio; Bonacini, Irene
2017-01-01
In the field of applied researches in heritage science, the use of multivariate approach is still quite limited and often chemometric results obtained are often underinterpreted. Within this scenario, the present paper is aimed at disseminating the use of suitable multivariate methodologies and proposes a procedural workflow applied on a representative group of case studies, of considerable importance for conservation purposes, as a sort of guideline on the processing and on the interpretation of this FTIR data. Initially, principal component analysis (PCA) is performed and the score values are converted into chemical maps. Successively, the brushing approach is applied, demonstrating its usefulness for a deep understanding of the relationships between the multivariate map and PC score space, as well as for the identification of the spectral bands mainly involved in the definition of each area localised within the score maps. PMID:29333162
Quantitative analysis of NMR spectra with chemometrics
NASA Astrophysics Data System (ADS)
Winning, H.; Larsen, F. H.; Bro, R.; Engelsen, S. B.
2008-01-01
The number of applications of chemometrics to series of NMR spectra is rapidly increasing due to an emerging interest for quantitative NMR spectroscopy e.g. in the pharmaceutical and food industries. This paper gives an analysis of advantages and limitations of applying the two most common chemometric procedures, Principal Component Analysis (PCA) and Multivariate Curve Resolution (MCR), to a designed set of 231 simple alcohol mixture (propanol, butanol and pentanol) 1H 400 MHz spectra. The study clearly demonstrates that the major advantage of chemometrics is the visualisation of larger data structures which adds a new exploratory dimension to NMR research. While robustness and powerful data visualisation and exploration are the main qualities of the PCA method, the study demonstrates that the bilinear MCR method is an even more powerful method for resolving pure component NMR spectra from mixtures when certain conditions are met.
Gabriele, Domenico; Garibaldi, Monica; Girelli, Giuseppe; Taraglio, Stefano; Duregon, Eleonora; Gabriele, Pietro; Guiot, Caterina; Bollito, Enrico
2016-06-01
This work aims to definitely show the ability of percentage of positive biopsy cores (%PC) to independently predict biochemical outcome beyond traditional pretreatment risk-factors in prostate cancer (PCa) patients treated with radiotherapy. A cohort of 2493 men belonging to the EUREKA-2 retrospective multicentric database on (PCa) and treated with external-beam radiation therapy (EBRT) as primary treatment comprised the study population (median follow-up 50 months). A Cox regression time to prostate-specific antigen (PSA) failure analysis was performed to evaluate the predictive power of %PC, both in univariate and multivariate settings, with age, pretreatment PSA, clinical-radiological staging, bioptic Gleason Score (bGS), RT dose and RT +/- ADT as covariates. P statistics for %PC is lower than 0.001 both in univariate and multivariate models. %PC as a continuous variable yields an AUC of 69% in ROC curve analysis for biochemical relapse. Four classes of %PC (1-20%, 21-50%, 51-80% and 81-100%) distinctly split patients for risk of biochemical relapse (overall log-rank test P<0.0001), with biochemical progression free survival (bPFS) at 5-years ranging from 88% to 58% and 10-years bPFS ranging from 80% to 38%. We strongly affirm the usefulness of %PC information beyond main risk factors (PSA, staging and bGS) in predicting biochemical recurrence after EBRT for PCa. The stratification of patients according to %PC may be valuable to further discriminate cases with favourable or adverse prognosis.
Xiao, Zuobing; Liu, Shengjiang; Gu, Yongbo; Xu, Na; Shang, Yi; Zhu, Jiancai
2014-03-01
Volatiles of cherry wines were extracted by headspace solid phase microextraction (HS-SPME) and analyzed by gas chromatography mass spectrometry (GC-MS), multivariate statistical techniques (such as principal component analysis (PCA) and cluster analysis (CA) and correlation analysis) to differentiate sensory attributes of 3 groups of the wines through characterization of volatiles of cherry wine. Seventy-five volatiles were identified in 9 samples, including 29 esters, 22 alcohols, 8 acids, 3 ketones, 5 aldehydes, and 8 miscellaneous compounds. The PCA results showed that the cherry wines were mainly differentiated by 8 sensory attributes. The samples W2, W4, and W7 were grouped around sweet aromatic and the samples W1, W5, and W9 were highly associated with the sweet, esters, green, bitter, and fermented. Nevertheless, the samples W3, W6, and W8 were located close to the sour, alcoholic, and fruity. The final result of correlation analysis was in conformity with the conclusion of PCA. The CA results showed that the group of W2, W4, and W7, and the group of W1, W5, and W9 had less difference than the group of W3, W6, and W8. The reason should be that esterification reactions and fermentation process during the ageing period was more extended. The results of analyzing revealed that HS-SPME-GC-MS coupled with chemometrics could give an appropriate way of characterizing and classifying the cherry wines. Attributes that represent and discriminate among cherry wines might be made use of a better comprehending of the wines and for being utilized in future work. In addition, several chemometrics were used to classify the type of wines and try to install the relationship between volatiles and sensory property. Especially, PCA clearly revealed that the most contributing compounds for sensory attributes of cherry wines, CA was a more applicable way to distinguish types of cherry wines. Therefore, a feasible method that would be helpful to promote the quality of the wines by improving the winemaking process and analyzing aromatic characteristics of wines. © 2014 Institute of Food Technologists®
Method for factor analysis of GC/MS data
Van Benthem, Mark H; Kotula, Paul G; Keenan, Michael R
2012-09-11
The method of the present invention provides a fast, robust, and automated multivariate statistical analysis of gas chromatography/mass spectroscopy (GC/MS) data sets. The method can involve systematic elimination of undesired, saturated peak masses to yield data that follow a linear, additive model. The cleaned data can then be subjected to a combination of PCA and orthogonal factor rotation followed by refinement with MCR-ALS to yield highly interpretable results.
Tumor volume in insignificant prostate cancer: increasing threshold gains increasing risk.
Schiffmann, Jonas; Connan, Judith; Salomon, Georg; Boehm, Katharina; Beyer, Burkhard; Schlomm, Thorsten; Tennstedt, Pierre; Sauter, Guido; Karakiewicz, Pierre I; Graefen, Markus; Huland, Hartwig
2015-01-01
An increased tumor volume threshold (<2.5 ml) is suggested to define insignificant prostate cancer (iPCa). We hypothesize that an increasing tumor volume within iPCa patients increases the risk of biochemical recurrence (BCR) after radical prostatectomy (RP). We relied on RP patients treated between 1992 and 2008. Multivariable Cox regression analyses predicting BCR within patients harboring favorable pathological characteristics (≤pT2, pN0/Nx, Gleason 3 + 3). Kaplan-Meier analysis was performed for BCR-free survival within iPCa patients (≤pT2, pN0/Nx, Gleason 3 + 3, tumor volume: <0.5 vs. 0.5-2.49 ml). From 1,829 patients, 141 (7.7%) and 310 (16.9%) harbored iPCa (tumor volume: <0.5 vs. 0.5-2.49 ml), respectively. Of those, 21 (14.9%) versus 31 (10.0%) had PSA >10 ng/ml. Tumor volume achieved independent predictor status for BCR. Specifically, iPCa patients with increasing tumor volume (0.5-2.49 ml) were at higher risk of BCR after RP than those with tumor volume <0.5 ml (HR: 8.8, 95% CI: 1.2-65.9, P = 0.04). Kaplan-Meier analysis recorded superior BCR-free survival in iPCa patients with lower tumor volume (<0.5 ml) (log-rank P = 0.009). The 10-year cancer-specific death rate was 0 versus 0.5%. Contemporary iPCa definition incorporates intermediate and high-risk patients (PSA: 10-20 and >20 ng/ml). Despite most favorable pathological characteristics, iPCa patients are not devoid of BCR after RP. Moreover, iPCa patients were at higher risk of BCR, when increasing tumor volume up to 2.49 ml was at play. Taken together the contemporary concept of iPCa is suboptimal. Especially, an increased tumor volume threshold for defining iPCa cannot be recommended according to our data. Clinicians might take these considerations into account during decision-making process. © 2014 Wiley Periodicals, Inc.
Phung, Dung; Huang, Cunrui; Rutherford, Shannon; Dwirahmadi, Febi; Chu, Cordia; Wang, Xiaoming; Nguyen, Minh; Nguyen, Nga Huy; Do, Cuong Manh; Nguyen, Trung Hieu; Dinh, Tuan Anh Diep
2015-05-01
The present study is an evaluation of temporal/spatial variations of surface water quality using multivariate statistical techniques, comprising cluster analysis (CA), principal component analysis (PCA), factor analysis (FA) and discriminant analysis (DA). Eleven water quality parameters were monitored at 38 different sites in Can Tho City, a Mekong Delta area of Vietnam from 2008 to 2012. Hierarchical cluster analysis grouped the 38 sampling sites into three clusters, representing mixed urban-rural areas, agricultural areas and industrial zone. FA/PCA resulted in three latent factors for the entire research location, three for cluster 1, four for cluster 2, and four for cluster 3 explaining 60, 60.2, 80.9, and 70% of the total variance in the respective water quality. The varifactors from FA indicated that the parameters responsible for water quality variations are related to erosion from disturbed land or inflow of effluent from sewage plants and industry, discharges from wastewater treatment plants and domestic wastewater, agricultural activities and industrial effluents, and contamination by sewage waste with faecal coliform bacteria through sewer and septic systems. Discriminant analysis (DA) revealed that nephelometric turbidity units (NTU), chemical oxygen demand (COD) and NH₃ are the discriminating parameters in space, affording 67% correct assignation in spatial analysis; pH and NO₂ are the discriminating parameters according to season, assigning approximately 60% of cases correctly. The findings suggest a possible revised sampling strategy that can reduce the number of sampling sites and the indicator parameters responsible for large variations in water quality. This study demonstrates the usefulness of multivariate statistical techniques for evaluation of temporal/spatial variations in water quality assessment and management.
Liu, Yong; Su, Chao; Zhang, Hong; Li, Xiaoting; Pei, Jingfei
2014-01-01
Many studies indicated that industrialization and urbanization caused serious soil heavy metal pollution from industrialized age. However, fewer previous studies have conducted a combined analysis of the landscape pattern, urbanization, industrialization, and heavy metal pollution. This paper was aimed at exploring the relationships of heavy metals in the soil (Pb, Cu, Ni, As, Cd, Cr, Hg, and Zn) with landscape pattern, industrialisation, urbanisation in Taiyuan city using multivariate analysis. The multivariate analysis included correlation analysis, analysis of variance (ANOVA), independent-sample T test, and principal component analysis (PCA). Geographic information system (GIS) was also applied to determine the spatial distribution of the heavy metals. The spatial distribution maps showed that the heavy metal pollution of the soil was more serious in the centre of the study area. The results of the multivariate analysis indicated that the correlations among heavy metals were significant, and industrialisation could significantly affect the concentrations of some heavy metals. Landscape diversity showed a significant negative correlation with the heavy metal concentrations. The PCA showed that a two-factor model for heavy metal pollution, industrialisation, and the landscape pattern could effectively demonstrate the relationships between these variables. The model explained 86.71% of the total variance of the data. Moreover, the first factor was mainly loaded with the comprehensive pollution index (P), and the second factor was primarily loaded with landscape diversity and dominance (H and D). An ordination of 80 samples could show the pollution pattern of all the samples. The results revealed that local industrialisation caused heavy metal pollution of the soil, but such pollution could respond negatively to the landscape pattern. The results of the study could provide a basis for agricultural, suburban, and urban planning. PMID:25251460
Liu, Yong; Su, Chao; Zhang, Hong; Li, Xiaoting; Pei, Jingfei
2014-01-01
Many studies indicated that industrialization and urbanization caused serious soil heavy metal pollution from industrialized age. However, fewer previous studies have conducted a combined analysis of the landscape pattern, urbanization, industrialization, and heavy metal pollution. This paper was aimed at exploring the relationships of heavy metals in the soil (Pb, Cu, Ni, As, Cd, Cr, Hg, and Zn) with landscape pattern, industrialisation, urbanisation in Taiyuan city using multivariate analysis. The multivariate analysis included correlation analysis, analysis of variance (ANOVA), independent-sample T test, and principal component analysis (PCA). Geographic information system (GIS) was also applied to determine the spatial distribution of the heavy metals. The spatial distribution maps showed that the heavy metal pollution of the soil was more serious in the centre of the study area. The results of the multivariate analysis indicated that the correlations among heavy metals were significant, and industrialisation could significantly affect the concentrations of some heavy metals. Landscape diversity showed a significant negative correlation with the heavy metal concentrations. The PCA showed that a two-factor model for heavy metal pollution, industrialisation, and the landscape pattern could effectively demonstrate the relationships between these variables. The model explained 86.71% of the total variance of the data. Moreover, the first factor was mainly loaded with the comprehensive pollution index (P), and the second factor was primarily loaded with landscape diversity and dominance (H and D). An ordination of 80 samples could show the pollution pattern of all the samples. The results revealed that local industrialisation caused heavy metal pollution of the soil, but such pollution could respond negatively to the landscape pattern. The results of the study could provide a basis for agricultural, suburban, and urban planning.
Rahmania, Halida; Sudjadi; Rohman, Abdul
2015-02-01
For Indonesian community, meatball is one of the favorite meat food products. In order to gain economical benefits, the substitution of beef meat with rat meat can happen due to the different prices between rat meat and beef. In this present research, the feasibility of FTIR spectroscopy in combination with multivariate calibration of partial least square (PLS) was used for the quantitative analysis of rat meat in the binary mixture of beef in meatball formulation. Meanwhile, the chemometrics of principal component analysis (PCA) was used for the classification between rat meat and beef meatballs. Some frequency regions in mid infrared region were optimized, and finally, the frequency region of 750-1000 cm(-1) was selected during PLS and PCA modeling.For quantitative analysis, the relationship between actual values (x-axis) and FTIR predicted values (y-axis) of rat meat is described by the equation of y= 0.9417x+ 2.8410 with coefficient of determination (R2) of 0.993, and root mean square error of calibration (RMSEC) of 1.79%. Furthermore, PCA was successfully used for the classification of rat meat meatball and beef meatball.
NASA Astrophysics Data System (ADS)
Han, H. W.; Yan, X. L.; Dong, R. X.; Ban, G.; Li, K.
2009-03-01
In this paper, we show surface-enhanced Raman spectra (SERS) of serums from type II diabetes mellitus and diabetic complication (coronary disease, glaucoma and cerebral infarction), and analyze the SERS through the multivariate statistical methods of principal component analysis (PCA). In particular, we find that there exist many adenines in these serums, which maybe come from DNA (RNA) damage. The relative intensity of the band at 725±2 cm-1 assigned to adenine is higher for patients than for the healthy volunteers; therefore, it can be used as an important ‘fingerprint’ in order to diagnose these diseases. It is also shown that serums from type II diabetes mellitus group, diabetic complication group and healthy volunteers group can be discriminated by PCA.
Li, Siyue; Zhang, Quanfa
2010-04-15
A data matrix (4032 observations), obtained during a 2-year monitoring period (2005-2006) from 42 sites in the upper Han River is subjected to various multivariate statistical techniques including cluster analysis, principal component analysis (PCA), factor analysis (FA), correlation analysis and analysis of variance to determine the spatial characterization of dissolved trace elements and heavy metals. Our results indicate that waters in the upper Han River are primarily polluted by Al, As, Cd, Pb, Sb and Se, and the potential pollutants include Ba, Cr, Hg, Mn and Ni. Spatial distribution of trace metals indicates the polluted sections mainly concentrate in the Danjiang, Danjiangkou Reservoir catchment and Hanzhong Plain, and the most contaminated river is in the Hanzhong Plain. Q-model clustering depends on geographical location of sampling sites and groups the 42 sampling sites into four clusters, i.e., Danjiang, Danjiangkou Reservoir region (lower catchment), upper catchment and one river in headwaters pertaining to water quality. The headwaters, Danjiang and lower catchment, and upper catchment correspond to very high polluted, moderate polluted and relatively low polluted regions, respectively. Additionally, PCA/FA and correlation analysis demonstrates that Al, Cd, Mn, Ni, Fe, Si and Sr are controlled by natural sources, whereas the other metals appear to be primarily controlled by anthropogenic origins though geogenic source contributing to them. 2009 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Gaudio, P.; Malizia, A.; Gelfusa, M.; Martinelli, E.; Di Natale, C.; Poggi, L. A.; Bellecci, C.
2017-01-01
Nowadays Toxic Industrial Components (TICs) and Toxic Industrial Materials (TIMs) are one of the most dangerous and diffuse vehicle of contamination in urban and industrial areas. The academic world together with the industrial and military one are working on innovative solutions to monitor the diffusion in atmosphere of such pollutants. In this phase the most common commercial sensors are based on “point detection” technology but it is clear that such instruments cannot satisfy the needs of the smart cities. The new challenge is developing stand-off systems to continuously monitor the atmosphere. Quantum Electronics and Plasma Physics (QEP) research group has a long experience in laser system development and has built two demonstrators based on DIAL (Differential Absorption of Light) technology could be able to identify chemical agents in atmosphere. In this work the authors will present one of those DIAL system, the miniaturized one, together with the preliminary results of an experimental campaign conducted on TICs and TIMs simulants in cell with aim of use the absorption database for the further atmospheric an analysis using the same DIAL system. The experimental results are analysed with standard multivariate data analysis technique as Principal Component Analysis (PCA) to develop a classification model aimed at identifying organic chemical compound in atmosphere. The preliminary results of absorption coefficients of some chemical compound are shown together pre PCA analysis.
Gavin, Anna T; Donnelly, David; Donnelly, Conan; Drummond, Frances J; Morgan, Eileen; Gormley, Gerard J; Sharp, Linda
2016-12-19
To investigate effects on men's health and well-being of higher prostate cancer (PCa) investigation and treatment levels in similar populations. PCa survivors in Ireland where the Republic of Ireland (RoI) has a 50% higher PCa incidence than Northern Ireland (NI). A cross-sectional postal questionnaire was sent to PCa survivors 2-18 years post-treatment, seeking information about current physical effects of treatment, health-related quality of life (HRQoL; EORTC QLQ-C30; EQ-5D-5L) and psychological well-being (21 question version of the Depression, Anxiety and Stress Scale, DASS-21). Outcomes in RoI and NI survivors were compared, stratifying into 'late disease' (stage III/IV and any Gleason grade (GG) at diagnosis) and 'early disease' (stage I/II and GG 2-7). Responses were weighted by age, jurisdiction and time since diagnosis. Between-country differences were investigated using multivariate logistic and linear regression. 3348 men responded (RoI n=2567; NI n=781; reflecting population sizes, response rate 54%). RoI responders were younger; less often had comorbidities (45% vs 38%); were more likely to present asymptomatically (66%; 41%) or with early disease (56%; 35%); and less often currently used androgen deprivation therapy (ADT; 2%; 28%). Current prevalence of incontinence (16%) and impotence (56% early disease, 67% late disease) did not differ between RoI and NI. In early disease, only current bowel problems (RoI 12%; NI 21%) differed significantly in multivariate analysis. In late disease, NI men reported significantly higher levels of gynaecomastia (23% vs 9%) and hot flashes(41% vs 19%), but when ADT users were analysed separately, differences disappeared. For HRQoL, in multivariate analysis, only pain (early disease: RoI 11.1, NI 19.4) and financial difficulties (late disease: RoI 10.4, NI 7.9) differed significantly between countries. There were no significant between-country differences in DASS-21 or index ED-5D-5L score. Treatment side effects were commonly reported and increased PCa detection in RoI has left more men with these side effects. We recommended that men be offered a PSA test only after informed discussion. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.
A hybrid PCA-CART-MARS-based prognostic approach of the remaining useful life for aircraft engines.
Sánchez Lasheras, Fernando; García Nieto, Paulino José; de Cos Juez, Francisco Javier; Mayo Bayón, Ricardo; González Suárez, Victor Manuel
2015-03-23
Prognostics is an engineering discipline that predicts the future health of a system. In this research work, a data-driven approach for prognostics is proposed. Indeed, the present paper describes a data-driven hybrid model for the successful prediction of the remaining useful life of aircraft engines. The approach combines the multivariate adaptive regression splines (MARS) technique with the principal component analysis (PCA), dendrograms and classification and regression trees (CARTs). Elements extracted from sensor signals are used to train this hybrid model, representing different levels of health for aircraft engines. In this way, this hybrid algorithm is used to predict the trends of these elements. Based on this fitting, one can determine the future health state of a system and estimate its remaining useful life (RUL) with accuracy. To evaluate the proposed approach, a test was carried out using aircraft engine signals collected from physical sensors (temperature, pressure, speed, fuel flow, etc.). Simulation results show that the PCA-CART-MARS-based approach can forecast faults long before they occur and can predict the RUL. The proposed hybrid model presents as its main advantage the fact that it does not require information about the previous operation states of the input variables of the engine. The performance of this model was compared with those obtained by other benchmark models (multivariate linear regression and artificial neural networks) also applied in recent years for the modeling of remaining useful life. Therefore, the PCA-CART-MARS-based approach is very promising in the field of prognostics of the RUL for aircraft engines.
A Hybrid PCA-CART-MARS-Based Prognostic Approach of the Remaining Useful Life for Aircraft Engines
Lasheras, Fernando Sánchez; Nieto, Paulino José García; de Cos Juez, Francisco Javier; Bayón, Ricardo Mayo; Suárez, Victor Manuel González
2015-01-01
Prognostics is an engineering discipline that predicts the future health of a system. In this research work, a data-driven approach for prognostics is proposed. Indeed, the present paper describes a data-driven hybrid model for the successful prediction of the remaining useful life of aircraft engines. The approach combines the multivariate adaptive regression splines (MARS) technique with the principal component analysis (PCA), dendrograms and classification and regression trees (CARTs). Elements extracted from sensor signals are used to train this hybrid model, representing different levels of health for aircraft engines. In this way, this hybrid algorithm is used to predict the trends of these elements. Based on this fitting, one can determine the future health state of a system and estimate its remaining useful life (RUL) with accuracy. To evaluate the proposed approach, a test was carried out using aircraft engine signals collected from physical sensors (temperature, pressure, speed, fuel flow, etc.). Simulation results show that the PCA-CART-MARS-based approach can forecast faults long before they occur and can predict the RUL. The proposed hybrid model presents as its main advantage the fact that it does not require information about the previous operation states of the input variables of the engine. The performance of this model was compared with those obtained by other benchmark models (multivariate linear regression and artificial neural networks) also applied in recent years for the modeling of remaining useful life. Therefore, the PCA-CART-MARS-based approach is very promising in the field of prognostics of the RUL for aircraft engines. PMID:25806876
Hwang, Eu-Chang; Choi, Hyang-Sik; Im, Chang-Min; Jung, Seung-Il; Kim, Sun-Ouck; Kang, Taek-Won; Kwon, Dong-Deuk; Park, Kwang-Sung; Ryu, Soo-Bang
2010-01-01
Prostatic calculi are common and are associated with inflammation of the prostate. Recently, it has been suggested that this inflammation may be associated with prostate carcinogenesis. The aim of this study was to investigate the relationship between prostatic calculi and prostate cancer (PCa) in prostate biopsy specimens. We retrospectively analyzed 417 consecutive patients who underwent transrectal ultrasonography (TRUS) and prostate biopsies between January 2005 and January 2008. Based on the biopsy findings, patients were divided into benign prostatic hyperplasia and PCa groups. TRUS was used to detect prostatic calculi and to measure prostate volume. The correlations between PCa risk and age, serum total PSA levels, prostate volume, and prostatic calculi were analyzed. Patient age and PSA, as well as the frequency of prostatic calculi in the biopsy specimens, differed significantly between both the groups (P < 0.05). In the PCa group, the Gleason scores (GSs) were higher in patients with prostatic calculi than in patients without prostatic calculi (P = 0.023). Using multivariate logistic regression analysis, we found that patient age, serum total PSA and prostate volume were risk factors for PCa (P = 0.001), but that the presence of prostatic calculi was not associated with an increased risk of PCa (P = 0.13). In conclusion, although the presence of prostatic calculi was not shown to be a risk factor for PCa, prostatic calculi were more common in patients with PCa and were associated with a higher GS among these men. PMID:20037598
Shami, Vanessa M; Mahajan, Anshu; Sundaram, Vinay; Davis, Eric M; Loch, Michelle M; Kahaleh, Michel
2008-11-01
Accurate preoperative staging of pancreatic cancer (Pca) is crucial to direct management. There is a perception that endoscopic ultrasound (EUS) staging should be performed before biliary decompression because of artifact caused by self-expandable metal stents (SEMS). Our aim is to determine whether placement of SEMS affects the staging of Pca. Fifty-five patients (35 men; mean age, 67 years) with newly diagnosed Pca staged in the last 5 years and captured prospectively were divided into 2 groups matched by age, sex, and final staging. The staging accuracy of EUS in patients who had a SEMS (n = 28) was compared with patients without a SEMS (n = 27). The gold standard was surgical pathology, or cytologic confirmation of metastatic disease. Multivariate analysis was effected on age, sex, presence of SEMS, and presence of metastasis to assess prediction of staging inaccuracy. Endoscopic ultrasound correctly staged 14 (52%) of 27 patients in the no-SEMS group and 13 (46%) of 28 in the SEMS group. Logistic regression analysis identified only metastasis as a predictor of inaccuracy in EUS staging. Endoscopic ultrasound staging of Pca does not seem to be affected by the presence of a SEMS. The major reason for misstaging in both groups was failure to detect metastatic disease.
Chen, Kang; Park, Junyong; Li, Feng; Patil, Sharadrao M; Keire, David A
2018-04-01
NMR spectroscopy is an emerging analytical tool for measuring complex drug product qualities, e.g., protein higher order structure (HOS) or heparin chemical composition. Most drug NMR spectra have been visually analyzed; however, NMR spectra are inherently quantitative and multivariate and thus suitable for chemometric analysis. Therefore, quantitative measurements derived from chemometric comparisons between spectra could be a key step in establishing acceptance criteria for a new generic drug or a new batch after manufacture change. To measure the capability of chemometric methods to differentiate comparator NMR spectra, we calculated inter-spectra difference metrics on 1D/2D spectra of two insulin drugs, Humulin R® and Novolin R®, from different manufacturers. Both insulin drugs have an identical drug substance but differ in formulation. Chemometric methods (i.e., principal component analysis (PCA), 3-way Tucker3 or graph invariant (GI)) were performed to calculate Mahalanobis distance (D M ) between the two brands (inter-brand) and distance ratio (D R ) among the different lots (intra-brand). The PCA on 1D inter-brand spectral comparison yielded a D M value of 213. In comparing 2D spectra, the Tucker3 analysis yielded the highest differentiability value (D M = 305) in the comparisons made followed by PCA (D M = 255) then the GI method (D M = 40). In conclusion, drug quality comparisons among different lots might benefit from PCA on 1D spectra for rapidly comparing many samples, while higher resolution but more time-consuming 2D-NMR-data-based comparisons using Tucker3 analysis or PCA provide a greater level of assurance for drug structural similarity evaluation between drug brands.
A novel principal component analysis for spatially misaligned multivariate air pollution data.
Jandarov, Roman A; Sheppard, Lianne A; Sampson, Paul D; Szpiro, Adam A
2017-01-01
We propose novel methods for predictive (sparse) PCA with spatially misaligned data. These methods identify principal component loading vectors that explain as much variability in the observed data as possible, while also ensuring the corresponding principal component scores can be predicted accurately by means of spatial statistics at locations where air pollution measurements are not available. This will make it possible to identify important mixtures of air pollutants and to quantify their health effects in cohort studies, where currently available methods cannot be used. We demonstrate the utility of predictive (sparse) PCA in simulated data and apply the approach to annual averages of particulate matter speciation data from national Environmental Protection Agency (EPA) regulatory monitors.
The burden of urinary incontinence and urinary bother among elderly prostate cancer survivors.
Kopp, Ryan P; Marshall, Lynn M; Wang, Patty Y; Bauer, Douglas C; Barrett-Connor, Elizabeth; Parsons, J Kellogg
2013-10-01
Data describing urinary health in elderly, community-dwelling prostate cancer (PCa) survivors are limited. To elucidate the prevalence of lower urinary tract symptoms, urinary bother, and incontinence in elderly PCa survivors compared with peers without PCa. A cross-sectional analysis of 5990 participants in the Osteoporotic Fractures in Men Research Group, a cohort study of community-dwelling men ≥ 65 yr. We characterized urinary health using self-reported urinary incontinence and the American Urological Association Symptom Index (AUA-SI). We compared urinary health measures according to type of PCa treatment in men with PCa and men without PCa using multivariate log-binomial regression to generate prevalence ratios (PRs). At baseline, 706 men (12%) reported a history of PCa, with a mean time since diagnosis of 6.3 yr. Of these men, 426 (60%) reported urinary incontinence. In adjusted analyses, observation (PR: 2.11; 95% confidence interval [CI], 1.22-3.65; p=0.007), surgery (PR: 4.41; 95% CI, 3.79-5.13; p<0.0001), radiation therapy (PR: 1.49; 95% CI, 1.06-2.08; p=0.02), and androgen-deprivation therapy (ADT) (PR: 2.02; 95% CI, 1.31-3.13; p=0.002) were each associated with daily incontinence. Daily incontinence risk increased with time since diagnosis independently of age. Observation (PR: 1.33; 95% CI, 1.00-1.78; p=0.05), surgery (PR: 1.25; 95% CI, 1.10-1.42; p=0.0008), and ADT (PR: 1.50; 95% CI, 1.26-1.79; p<0.0001) were associated with increased AUA-SI bother scores. Cancer stage and use of adjuvant or salvage therapies were not available for analysis. Compared with their peers without PCa, elderly PCa survivors had a two-fold to five-fold greater prevalence of urinary incontinence, which rose with increasing survivorship duration. Observation, surgery, and ADT were each associated with increased urinary bother. These data suggest a substantially greater burden of urinary health problems among elderly PCa survivors than previously recognized. Copyright © 2013 European Association of Urology. Published by Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Hashim, Noor Haslinda Noor; Latip, Jalifah; Khatib, Alfi
2016-11-01
The metabolites of Clinacanthus nutans leaves extracts and their dependence on drying process were systematically characterized using 1H nuclear magnetic resonance spectroscopy (NMR) multivariate data analysis. Principal component analysis (PCA) and partial least square-discriminant analysis (PLS-DA) were able to distinguish the leaves extracts obtained from different drying methods. The identified metabolites were carbohydrates, amino acid, flavonoids and sulfur glucoside compounds. The major metabolites responsible for the separation in PLS-DA loading plots were lupeol, cycloclinacosides, betulin, cerebrosides and choline. The results showed that the combination of 1H NMR spectroscopy and multivariate data analyses could act as an efficient technique to understand the C. nutans composition and its variation.
Fasoula, S; Zisi, Ch; Sampsonidis, I; Virgiliou, Ch; Theodoridis, G; Gika, H; Nikitas, P; Pappa-Louisi, A
2015-03-27
In the present study a series of 45 metabolite standards belonging to four chemically similar metabolite classes (sugars, amino acids, nucleosides and nucleobases, and amines) was subjected to LC analysis on three HILIC columns under 21 different gradient conditions with the aim to explore whether the retention properties of these analytes are determined from the chemical group they belong. Two multivariate techniques, principal component analysis (PCA) and discriminant analysis (DA), were used for statistical evaluation of the chromatographic data and extraction similarities between chemically related compounds. The total variance explained by the first two principal components of PCA was found to be about 98%, whereas both statistical analyses indicated that all analytes are successfully grouped in four clusters of chemical structure based on the retention obtained in four or at least three chromatographic runs, which, however should be performed on two different HILIC columns. Moreover, leave-one-out cross-validation of the above retention data set showed that the chemical group in which an analyte belongs can be 95.6% correctly predicted when the analyte is subjected to LC analysis under the same four or three experimental conditions as the all set of analytes was run beforehand. That, in turn, may assist with disambiguation of analyte identification in complex biological extracts. Copyright © 2015 Elsevier B.V. All rights reserved.
Ruckebusch, C; Vilmin, F; Coste, N; Huvenne, J P
2008-07-01
We evaluate the contribution made by multivariate curve resolution-alternating least squares (MCR-ALS) for resolving gel permeation chromatography-Fourier transform infrared (GPC-FT-IR) data collected on butadiene rubber (BR) and styrene butadiene rubber (SBR) blends in order to access in-depth knowledge of polymers along the molecular weight distribution (MWD). In the BR-SBR case, individual polymers differ in chemical composition but share almost the same MWD. Principal component analysis (PCA) gives a general overview of the data structure and attests to the feasibility of modeling blends as a binary system. MCR-ALS is then performed. It allows resolving the chromatographic coelution and validates the chosen methodology. For SBR-SBR blends, the problem is more challenging since the individual elastomers present the same chemical composition. Rank deficiency is detected from the PCA data structure analysis. MCR-ALS is thus performed on column-wise augmented matrices. It brings very useful insight into the composition of the analyzed blends. In particular, a weak change in the composition of individual SBR in the MWD's lowest mass region is revealed.
Buriani, Alessandro; Fortinguerra, Stefano; Sorrenti, Vincenzo; Dall'Acqua, Stefano; Innocenti, Gabbriella; Montopoli, Monica; Gabbia, Daniela; Carrara, Maria
2017-08-11
Principal component analysis (PCA) multivariate analysis was applied to study the cytotoxic activity of essential oils from various species of the Pistacia genus on human tumor cell lines. In particular, the cytotoxic activity of essential oils obtained from P. lentiscus , P. lentiscus var. chia (mastic gum), P. terebinthus , P. vera , and P. integerrima , was screened on three human adenocarcinoma cell lines: MCF-7 (breast), 2008 (ovarian), and LoVo (colon). The results indicate that all the Pistacia phytocomplexes, with the exception of mastic gum oil, induce cytotoxic effects on one or more of the three cell lines. PCA highlighted the presence of different cooperating clusters of bioactive molecules. Cluster variability among species, and even within the same species, could explain some of the differences seen among samples suggesting the presence of both common and species-specific mechanisms. Single molecules from one of the most significant clusters were tested, but only bornyl-acetate presented cytotoxic activity, although at much higher concentrations (IC 50 = 138.5 µg/mL) than those present in the essential oils, indicating that understanding of the full biological effect requires a holistic vision of the phytocomplexes with all its constituents.
Application of FT-IR spectroscopy on breast cancer serum analysis
NASA Astrophysics Data System (ADS)
Elmi, Fatemeh; Movaghar, Afshin Fayyaz; Elmi, Maryam Mitra; Alinezhad, Heshmatollah; Nikbakhsh, Novin
2017-12-01
Breast cancer is regarded as the most malignant tumor among women throughout the world. Therefore, early detection and proper diagnostic methods have been known to help save women's lives. Fourier Transform Infrared (FT-IR) spectroscopy, coupled with PCA-LDA analysis, is a new technique to investigate the characteristics of serum in breast cancer. In this study, 43 breast cancer and 43 healthy serum samples were collected, and the FT-IR spectra were recorded for each one. Then, PCA analysis and linear discriminant analysis (LDA) were used to analyze the spectral data. The results showed that there were differences between the spectra of the two groups. Discriminating wavenumbers were associated with several spectral differences over the 950-1200 cm- 1(sugar), 1190-1350 cm- 1 (collagen), 1475-1710 cm- 1 (protein), 1710-1760 cm- 1 (ester), 2800-3000 cm- 1 (stretching motions of -CH2 & -CH3), and 3090-3700 cm- 1 (NH stretching) regions. PCA-LDA performance on serum IR could recognize changes between the control and the breast cancer cases. The diagnostic accuracy, sensitivity, and specificity of PCA-LDA analysis for 3000-3600 cm- 1 (NH stretching) were found to be 83%, 84%, 74% for the control and 80%, 76%, 72% for the breast cancer cases, respectively. The results showed that the major spectral differences between the two groups were related to the differences in protein conformation in serum samples. It can be concluded that FT-IR spectroscopy, together with multivariate data analysis, is able to discriminate between breast cancer and healthy serum samples.
Henriksson, S; Hagberg, J; Bäckström, M; Persson, I; Lindström, G
2013-09-01
Polychlorinated dibenzo-p-dioxins and polychlorinated dibenzo-p-furans (PCDD/Fs) were analysed in soil from a Swedish sawmill site where chlorophenols (CPs) had been used more than 40 years ago. The most contaminated area at the site was the preservation subarea where the PCDD/F WHO2005-TEQ level was 3450 times higher than the current Swedish guideline value of 200 ng TEQ/kg soil for land for industrial use. It was also shown that a fire which destroyed the sawmill might have affected the congener distribution at the concerned areas. To get a broader picture of the contamination both GIS (spatial interpolation analysis) and multivariate data analysis (PCA) were applied to visualize and compare PCDD/F levels as well as congener distributions at different areas at the site. It is shown that GIS and PCA are powerful tools in decisions on future investigations, risk assessments and remediation of contaminated sites. Copyright © 2013 Elsevier Ltd. All rights reserved.
Grigatti, Marco; Cavani, Luciano; Ciavatta, Claudio
2007-01-01
Three blends formed by: agro-industrial waste, wastewater sewage sludge, and their mixture, blended with tree pruning as bulking agent, were composted over a 3-month period. During the composting process the blends were monitored for the main physical and chemical characteristics. Electrofocusing (EF) was carried out on the extracted organic matter. The EF profiles were analyzed by principal component analysis (PCA) in order to assess the suitability of EF to evaluate the stabilisation level during the composting process. Throughout the process, the blends showed a general shifting of focused bands, from low to high pH, even though the compost origin affected the EF profiles. If the EF profile is analyzed by dividing it into pH regions, the interpretation of the results can be affected by the origin of compost. A good clustering of compost samples depending on the process time was obtained by analyzing the whole profile by PCA. Analysis of EF results with PCA represents a useful analytical technique to study the evolution and the stabilisation of composted organic matter.
NASA Astrophysics Data System (ADS)
Luo, Shuwen; Chen, Changshui; Mao, Hua; Jin, Shaoqin
2013-06-01
The feasibility of early detection of gastric cancer using near-infrared (NIR) Raman spectroscopy (RS) by distinguishing premalignant lesions (adenomatous polyp, n=27) and cancer tissues (adenocarcinoma, n=33) from normal gastric tissues (n=45) is evaluated. Significant differences in Raman spectra are observed among the normal, adenomatous polyp, and adenocarcinoma gastric tissues at 936, 1003, 1032, 1174, 1208, 1323, 1335, 1450, and 1655 cm-1. Diverse statistical methods are employed to develop effective diagnostic algorithms for classifying the Raman spectra of different types of ex vivo gastric tissues, including principal component analysis (PCA), linear discriminant analysis (LDA), and naive Bayesian classifier (NBC) techniques. Compared with PCA-LDA algorithms, PCA-NBC techniques together with leave-one-out, cross-validation method provide better discriminative results of normal, adenomatous polyp, and adenocarcinoma gastric tissues, resulting in superior sensitivities of 96.3%, 96.9%, and 96.9%, and specificities of 93%, 100%, and 95.2%, respectively. Therefore, NIR RS associated with multivariate statistical algorithms has the potential for early diagnosis of gastric premalignant lesions and cancer tissues in molecular level.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Beppler, Christina L
2015-12-01
A new approach was created for studying energetic material degradation. This approach involved detecting and tentatively identifying non-volatile chemical species by liquid chromatography-mass spectrometry (LC-MS) with multivariate statistical data analysis that form as the CL-20 energetic material thermally degraded. Multivariate data analysis showed clear separation and clustering of samples based on sample group: either pristine or aged material. Further analysis showed counter-clockwise trends in the principal components analysis (PCA), a type of multivariate data analysis, Scores plots. These trends may indicate that there was a discrete shift in the chemical markers as the went from pristine to aged material, andmore » then again when the aged CL-20 mixed with a potentially incompatible material was thermally aged for 4, 6, or 9 months. This new approach to studying energetic material degradation should provide greater knowledge of potential degradation markers in these materials.« less
NASA Astrophysics Data System (ADS)
Hollmach, Julia; Schweizer, Julia; Steiner, Gerald; Knels, Lilla; Funk, Richard H. W.; Thalheim, Silko; Koch, Edmund
2011-07-01
Retinal diseases like age-related macular degeneration have become an important cause of visual loss depending on increasing life expectancy and lifestyle habits. Due to the fact that no satisfying treatment exists, early diagnosis and prevention are the only possibilities to stop the degeneration. The protein cytochrome c (cyt c) is a suitable marker for degeneration processes and apoptosis because it is a part of the respiratory chain and involved in the apoptotic pathway. The determination of the local distribution and oxidative state of cyt c in living cells allows the characterization of cell degeneration processes. Since cyt c exhibits characteristic absorption bands between 400 and 650 nm wavelength, uv/vis in situ spectroscopic imaging was used for its characterization in retinal ganglion cells. The large amount of data, consisting of spatial and spectral information, was processed by multivariate data analysis. The challenge consists in the identification of the molecular information of cyt c. Baseline correction, principle component analysis (PCA) and cluster analysis (CA) were performed in order to identify cyt c within the spectral dataset. The combination of PCA and CA reveals cyt c and its oxidative state. The results demonstrate that uv/vis spectroscopic imaging in conjunction with sophisticated multivariate methods is a suitable tool to characterize cyt c under in situ conditions.
Zuendorf, Gerhard; Kerrouche, Nacer; Herholz, Karl; Baron, Jean-Claude
2003-01-01
Principal component analysis (PCA) is a well-known technique for reduction of dimensionality of functional imaging data. PCA can be looked at as the projection of the original images onto a new orthogonal coordinate system with lower dimensions. The new axes explain the variance in the images in decreasing order of importance, showing correlations between brain regions. We used an efficient, stable and analytical method to work out the PCA of Positron Emission Tomography (PET) images of 74 normal subjects using [(18)F]fluoro-2-deoxy-D-glucose (FDG) as a tracer. Principal components (PCs) and their relation to age effects were investigated. Correlations between the projections of the images on the new axes and the age of the subjects were carried out. The first two PCs could be identified as being the only PCs significantly correlated to age. The first principal component, which explained 10% of the data set variance, was reduced only in subjects of age 55 or older and was related to loss of signal in and adjacent to ventricles and basal cisterns, reflecting expected age-related brain atrophy with enlarging CSF spaces. The second principal component, which accounted for 8% of the total variance, had high loadings from prefrontal, posterior parietal and posterior cingulate cortices and showed the strongest correlation with age (r = -0.56), entirely consistent with previously documented age-related declines in brain glucose utilization. Thus, our method showed that the effect of aging on brain metabolism has at least two independent dimensions. This method should have widespread applications in multivariate analysis of brain functional images. Copyright 2002 Wiley-Liss, Inc.
Gu, Xiaobin; Gao, Xianshu; Cui, Ming; Xie, Mu; Ma, Mingwei; Qin, Shangbin; Li, Xiaoying; Qi, Xin; Bai, Yun; Wang, Dian
2018-01-01
Objective This study was aimed to compare survival outcomes in high-risk prostate cancer (PCa) patients receiving external beam radiotherapy (EBRT) or radical prostatectomy (RP). Materials and methods The Surveillance, Epidemiology, and End Results (SEER) database was used to identify PCa patients with high-risk features who received RP alone or EBRT alone from 2004 to 2008. Propensity-score matching (PSM) was performed. Kaplan–Meier survival analysis was used to compare cancer-specific survival (CSS) and overall survival (OS). Multivariate Cox regression analysis was used to identify independent prognostic factors. Results A total of 24,293 patients were identified, 14,460 patients receiving RP and 9833 patients receiving EBRT. Through PSM, 3828 patients were identified in each group. The mean CSS was 128.6 and 126.7 months for RP and EBRT groups, respectively (P<0.001). The subgroup analyses showed that CSS of the RP group was better than that of the EBRT group for patients aged <65 years (P<0.001), White race (P<0.001), and married status (P<0.001). However, there was no significant difference in CSS for patients aged ≥65 years, Black race, other race, and unmarried status. Similar trends were observed for OS. Multivariate analysis showed that EBRT treatment modality, T3–T4 stage, Gleason score 8–10, and prostate-specific antigen >20 ng/mL were significant risk factors for both CSS and OS. Conclusion This study suggested that survival outcomes might be better with RP than EBRT in high-risk PCa patients aged <65 years; however, RP and EBRT provided equivalent survival outcomes in older patients, which argues for primary radiotherapy in this older cohort.
Nonomura, N; Takayama, H; Nishimura, K; Oka, D; Nakai, Y; Shiba, M; Tsujimura, A; Nakayama, M; Aozasa, K; Okuyama, A
2007-01-01
Mast cell infiltration is often observed around human tumours. Inflammatory cells such as macrophages, neutrophils and mast cells infiltrating around tumours are known to contribute to tumour growth; however, the clinical significance of mast cell invasion in prostate cancer (PCa) has not been investigated. Mast cell infiltration was evaluated in 104 patients (age range, 45–88 years; median, 72 years), who underwent needle biopsy of the prostate and were confirmed to have PCa. Needle biopsy specimens of prostate were sliced into 5-μm-thick sections and immunostained for mast cells with monoclonal antibody against mast cell-specific tryptase. Mast cells were counted systematically under a microscope (× 400 magnification), and the relations between mast cell numbers and clinicopathologic findings were evaluated. The mast cell count was evaluated for prognostic value by multivariate analysis. Mast cells were immunostained around the cancer foci. The median number of mast cells in each case was 16. The mast cell count was higher around cancer foci in patients with higher Gleason scores than in those with low Gleason scores. The mast cell number correlated well with clinical stage (P<0.001). Prostate-specific antigen-free survival of patients with higher mast cell counts was better than that in patients with lower mast cell counts (P<0.001). Multivariate analysis revealed that mast cell count was a significant prognostic factor (P<0.005). The number of mast cells infiltrating around cancer foci in prostate biopsy specimens can be a significant prognostic factor of PCa. PMID:17848955
1H NMR Metabolomics Study of Spleen from C57BL/6 Mice Exposed to Gamma Radiation
Xiao, X; Hu, M; Liu, M; Hu, JZ
2016-01-01
Due to the potential risk of accidental exposure to gamma radiation, it’s critical to identify the biomarkers of radiation exposed creatures. In the present study, NMR based metabolomics combined with multivariate data analysis to evaluate the metabolites changed in the C57BL/6 mouse spleen after 4 days whole body exposure to 3.0 Gy and 7.8 Gy gamma radiations. Principal component analysis (PCA) and orthogonal projection to latent structures analysis (OPLS) are employed for classification and identification potential biomarkers associated with gamma irradiation. Two different strategies for NMR spectral data reduction (i.e., spectral binning and spectral deconvolution) are combined with normalize to constant sum and unit weight before multivariate data analysis, respectively. The combination of spectral deconvolution and normalization to unit weight is the best way for identifying discriminatory metabolites between the irradiation and control groups. Normalized to the constant sum may achieve some pseudo biomarkers. PCA and OPLS results shown that the exposed groups can be well separated from the control group. Leucine, 2-aminobutyrate, valine, lactate, arginine, glutathione, 2-oxoglutarate, creatine, tyrosine, phenylalanine, π-methylhistidine, taurine, myoinositol, glycerol and uracil are significantly elevated while ADP is decreased significantly. These significantly changed metabolites are associated with multiple metabolic pathways and may be potential biomarkers in the spleen exposed to gamma irradiation. PMID:27019763
1H NMR metabolomics study of spleen from C57BL/6 mice exposed to gamma radiation
Xiao, Xiongjie; Hu, M.; Liu, M.; ...
2016-01-27
Due to the potential risk of accidental exposure to gamma radiation, it’s critical to identify the biomarkers of radiation exposed creatures. In the present study, NMR based metabolomics combined with multivariate data analysis to evaluate the metabolites changed in the C57BL/6 mouse spleen after 4 days whole body exposure to 3.0 Gy and 7.8 Gy gamma radiations. Principal component analysis (PCA) and orthogonal projection to latent structures analysis (OPLS) are employed for classification and identification potential biomarkers associated with gamma irradiation. Two different strategies for NMR spectral data reduction (i.e., spectral binning and spectral deconvolution) are combined with normalize tomore » constant sum and unit weight before multivariate data analysis, respectively. The combination of spectral deconvolution and normalization to unit weight is the best way for identifying discriminatory metabolites between the irradiation and control groups. Normalized to the constant sum may achieve some pseudo biomarkers. PCA and OPLS results shown that the exposed groups can be well separated from the control group. Leucine, 2-aminobutyrate, valine, lactate, arginine, glutathione, 2-oxoglutarate, creatine, tyrosine, phenylalanine, π-methylhistidine, taurine, myoinositol, glycerol and uracil are significantly elevated while ADP is decreased significantly. As a result, these significantly changed metabolites are associated with multiple metabolic pathways and may be potential biomarkers in the spleen exposed to gamma irradiation.« less
Sun, Li-Li; Wang, Meng; Zhang, Hui-Jie; Liu, Ya-Nan; Ren, Xiao-Liang; Deng, Yan-Ru; Qi, Ai-Di
2018-01-01
Polygoni Multiflori Radix (PMR) is increasingly being used not just as a traditional herbal medicine but also as a popular functional food. In this study, multivariate chemometric methods and mass spectrometry were combined to analyze the ultra-high-performance liquid chromatograph (UPLC) fingerprints of PMR from six different geographical origins. A chemometric strategy based on multivariate curve resolution-alternating least squares (MCR-ALS) and three classification methods is proposed to analyze the UPLC fingerprints obtained. Common chromatographic problems, including the background contribution, baseline contribution, and peak overlap, were handled by the established MCR-ALS model. A total of 22 components were resolved. Moreover, relative species concentrations were obtained from the MCR-ALS model, which was used for multivariate classification analysis. Principal component analysis (PCA) and Ward's method have been applied to classify 72 PMR samples from six different geographical regions. The PCA score plot showed that the PMR samples fell into four clusters, which related to the geographical location and climate of the source areas. The results were then corroborated by Ward's method. In addition, according to the variance-weighted distance between cluster centers obtained from Ward's method, five components were identified as the most significant variables (chemical markers) for cluster discrimination. A counter-propagation artificial neural network has been applied to confirm and predict the effects of chemical markers on different samples. Finally, the five chemical markers were identified by UPLC-quadrupole time-of-flight mass spectrometer. Components 3, 12, 16, 18, and 19 were identified as 2,3,5,4'-tetrahydroxy-stilbene-2-O-β-d-glucoside, emodin-8-O-β-d-glucopyranoside, emodin-8-O-(6'-O-acetyl)-β-d-glucopyranoside, emodin, and physcion, respectively. In conclusion, the proposed method can be applied for the comprehensive analysis of natural samples. Copyright © 2016. Published by Elsevier B.V.
Castro, Elena; Goh, Chee; Olmos, David; Saunders, Ed; Leongamornlert, Daniel; Tymrakiewicz, Malgorzata; Mahmud, Nadiya; Dadaev, Tokhir; Govindasami, Koveela; Guy, Michelle; Sawyer, Emma; Wilkinson, Rosemary; Ardern-Jones, Audrey; Ellis, Steve; Frost, Debra; Peock, Susan; Evans, D Gareth; Tischkowitz, Marc; Cole, Trevor; Davidson, Rosemarie; Eccles, Diana; Brewer, Carole; Douglas, Fiona; Porteous, Mary E; Donaldson, Alan; Dorkins, Huw; Izatt, Louise; Cook, Jackie; Hodgson, Shirley; Kennedy, M John; Side, Lucy E; Eason, Jacqueline; Murray, Alex; Antoniou, Antonis C; Easton, Douglas F; Kote-Jarai, Zsofia; Eeles, Rosalind
2013-05-10
To analyze the baseline clinicopathologic characteristics of prostate tumors with germline BRCA1 and BRCA2 (BRCA1/2) mutations and the prognostic value of those mutations on prostate cancer (PCa) outcomes. This study analyzed the tumor features and outcomes of 2,019 patients with PCa (18 BRCA1 carriers, 61 BRCA2 carriers, and 1,940 noncarriers). The Kaplan-Meier method and Cox regression analysis were used to evaluate the associations between BRCA1/2 status and other PCa prognostic factors with overall survival (OS), cause-specific OS (CSS), CSS in localized PCa (CSS_M0), metastasis-free survival (MFS), and CSS from metastasis (CSS_M1). PCa with germline BRCA1/2 mutations were more frequently associated with Gleason ≥ 8 (P = .00003), T3/T4 stage (P = .003), nodal involvement (P = .00005), and metastases at diagnosis (P = .005) than PCa in noncarriers. CSS was significantly longer in noncarriers than in carriers (15.7 v 8.6 years, multivariable analyses [MVA] P = .015; hazard ratio [HR] = 1.8). For localized PCa, 5-year CSS and MFS were significantly higher in noncarriers (96% v 82%; MVA P = .01; HR = 2.6%; and 93% v 77%; MVA P = .009; HR = 2.7, respectively). Subgroup analyses confirmed the poor outcomes in BRCA2 patients, whereas the role of BRCA1 was not well defined due to the limited size and follow-up in this subgroup. Our results confirm that BRCA1/2 mutations confer a more aggressive PCa phenotype with a higher probability of nodal involvement and distant metastasis. BRCA mutations are associated with poor survival outcomes and this should be considered for tailoring clinical management of these patients.
Russo, Giorgio Ivan; Campisi, Daniele; Di Mauro, Marina; Regis, Federica; Reale, Giulio; Marranzano, Marina; Ragusa, Rosalia; Solinas, Tatiana; Madonia, Massimo; Cimino, Sebastiano; Morgia, Giuseppe
2017-12-05
Dietary polyphenols gained the interest of the scientific community due to their wide content in a variety of plant-derived foods and beverages commonly consumed, such as fruits, vegetables, coffee, tea, and cocoa . We aimed to investigate whether there was an association between dietary phenolic acid consumption and prostate cancer (PCa) in South Italy. We conducted a population-based case-control study from January 2015 to December 2016 in a single institution of the municipality of Catania, southern Italy (Registration number: 41/2015). Patients with elevated PSA and/or suspicious PCa underwent transperineal prostate biopsy. A total of 118 histopathological-verified PCa cases were collected and a total of 222 controls were selected from a sample of 2044 individuals. Dietary data were collected by using two food frequency questionnaires and data on the phenolic acids content in foods was obtained from the Phenol-Explorer database (www.phenol-explorer.eu). Association between dietary intake of phenolic acids and PCa was calculated through logistic regression analysis. We found lower levels of caffeic acid (2.28 mg/day vs. 2.76 mg/day; p < 0.05) and ferulic acid (2.80 mg/day vs. 4.04 mg/day; p < 0.01) in PCa when compared to controls. The multivariate logistic regression showed that both caffeic acid (OR = 0.32; p < 0.05) and ferulic acid (OR = 0.30; p < 0.05) were associated with reduced risk of PCa. Higher intake of hydroxybenzoic acids and caffeic acids were associated with lower risk of advanced PCa. High intake of caffeic acid and ferulic acid may be associated with reduced risk of PCa.
Castro, Elena; Goh, Chee; Olmos, David; Saunders, Ed; Leongamornlert, Daniel; Tymrakiewicz, Malgorzata; Mahmud, Nadiya; Dadaev, Tokhir; Govindasami, Koveela; Guy, Michelle; Sawyer, Emma; Wilkinson, Rosemary; Ardern-Jones, Audrey; Ellis, Steve; Frost, Debra; Peock, Susan; Evans, D. Gareth; Tischkowitz, Marc; Cole, Trevor; Davidson, Rosemarie; Eccles, Diana; Brewer, Carole; Douglas, Fiona; Porteous, Mary E.; Donaldson, Alan; Dorkins, Huw; Izatt, Louise; Cook, Jackie; Hodgson, Shirley; Kennedy, M. John; Side, Lucy E.; Eason, Jacqueline; Murray, Alex; Antoniou, Antonis C.; Easton, Douglas F.; Kote-Jarai, Zsofia; Eeles, Rosalind
2013-01-01
Purpose To analyze the baseline clinicopathologic characteristics of prostate tumors with germline BRCA1 and BRCA2 (BRCA1/2) mutations and the prognostic value of those mutations on prostate cancer (PCa) outcomes. Patients and Methods This study analyzed the tumor features and outcomes of 2,019 patients with PCa (18 BRCA1 carriers, 61 BRCA2 carriers, and 1,940 noncarriers). The Kaplan-Meier method and Cox regression analysis were used to evaluate the associations between BRCA1/2 status and other PCa prognostic factors with overall survival (OS), cause-specific OS (CSS), CSS in localized PCa (CSS_M0), metastasis-free survival (MFS), and CSS from metastasis (CSS_M1). Results PCa with germline BRCA1/2 mutations were more frequently associated with Gleason ≥ 8 (P = .00003), T3/T4 stage (P = .003), nodal involvement (P = .00005), and metastases at diagnosis (P = .005) than PCa in noncarriers. CSS was significantly longer in noncarriers than in carriers (15.7 v 8.6 years, multivariable analyses [MVA] P = .015; hazard ratio [HR] = 1.8). For localized PCa, 5-year CSS and MFS were significantly higher in noncarriers (96% v 82%; MVA P = .01; HR = 2.6%; and 93% v 77%; MVA P = .009; HR = 2.7, respectively). Subgroup analyses confirmed the poor outcomes in BRCA2 patients, whereas the role of BRCA1 was not well defined due to the limited size and follow-up in this subgroup. Conclusion Our results confirm that BRCA1/2 mutations confer a more aggressive PCa phenotype with a higher probability of nodal involvement and distant metastasis. BRCA mutations are associated with poor survival outcomes and this should be considered for tailoring clinical management of these patients. PMID:23569316
Gamage, I H; Jonker, A; Zhang, X; Yu, P
2014-01-24
The objective of this study was to determine the possibility of using molecular spectroscopy with multivariate technique as a fast method to detect the source effects among original feedstock sources of wheat and their corresponding co-products, wheat DDGS, from bioethanol production. Different sources of the bioethanol feedstock and their corresponding bioethanol co-products, three samples per source, were collected from the same newly-built bioethanol plant with current bioethanol processing technology. Multivariate molecular spectral analyses were carried out using agglomerative hierarchical cluster analysis (AHCA) and principal component analysis (PCA). The molecular spectral data of different feedstock sources and their corresponding co-products were compared at four different regions of ca. 1800-1725 cm(-1) (carbonyl CO ester, mainly related to lipid structure conformation), ca. 1725-1482 cm(-1) (amide I and amide II region mainly related to protein structure conformation), ca. 1482-1180 cm(-1) (mainly associated with structural carbohydrate) and ca. 1180-800 cm(-1) (mainly related to carbohydrates) in complex plant-based system. The results showed that the molecular spectroscopy with multivariate technique could reveal the structural differences among the bioethanol feedstock sources and among their corresponding co-products. The AHCA and PCA analyses were able to distinguish the molecular structure differences associated with chemical functional groups among the different sources of the feedstock and their corresponding co-products. The molecular spectral differences indicated the differences in functional, biomolecular and biopolymer groups which were confirmed by wet chemical analysis. These biomolecular and biopolymer structural differences were associated with chemical and nutrient profiles and nutrient utilization and availability. Molecular spectral analyses had the potential to identify molecular structure difference among bioethanol feedstock sources and their corresponding co-products. Copyright © 2013 Elsevier B.V. All rights reserved.
Mujica Ascencio, Saul; Choe, ChunSik; Meinke, Martina C; Müller, Rainer H; Maksimov, George V; Wigger-Alberti, Walter; Lademann, Juergen; Darvin, Maxim E
2016-07-01
Propylene glycol is one of the known substances added in cosmetic formulations as a penetration enhancer. Recently, nanocrystals have been employed also to increase the skin penetration of active components. Caffeine is a component with many applications and its penetration into the epidermis is controversially discussed in the literature. In the present study, the penetration ability of two components - caffeine nanocrystals and propylene glycol, applied topically on porcine ear skin in the form of a gel, was investigated ex vivo using two confocal Raman microscopes operated at different excitation wavelengths (785nm and 633nm). Several depth profiles were acquired in the fingerprint region and different spectral ranges, i.e., 526-600cm(-1) and 810-880cm(-1) were chosen for independent analysis of caffeine and propylene glycol penetration into the skin, respectively. Multivariate statistical methods such as principal component analysis (PCA) and linear discriminant analysis (LDA) combined with Student's t-test were employed to calculate the maximum penetration depths of each substance (caffeine and propylene glycol). The results show that propylene glycol penetrates significantly deeper than caffeine (20.7-22.0μm versus 12.3-13.0μm) without any penetration enhancement effect on caffeine. The results confirm that different substances, even if applied onto the skin as a mixture, can penetrate differently. The penetration depths of caffeine and propylene glycol obtained using two different confocal Raman microscopes are comparable showing that both types of microscopes are well suited for such investigations and that multivariate statistical PCA-LDA methods combined with Student's t-test are very useful for analyzing the penetration of different substances into the skin. Copyright © 2016 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Gamage, I. H.; Jonker, A.; Zhang, X.; Yu, P.
2014-01-01
The objective of this study was to determine the possibility of using molecular spectroscopy with multivariate technique as a fast method to detect the source effects among original feedstock sources of wheat and their corresponding co-products, wheat DDGS, from bioethanol production. Different sources of the bioethanol feedstock and their corresponding bioethanol co-products, three samples per source, were collected from the same newly-built bioethanol plant with current bioethanol processing technology. Multivariate molecular spectral analyses were carried out using agglomerative hierarchical cluster analysis (AHCA) and principal component analysis (PCA). The molecular spectral data of different feedstock sources and their corresponding co-products were compared at four different regions of ca. 1800-1725 cm-1 (carbonyl Cdbnd O ester, mainly related to lipid structure conformation), ca. 1725-1482 cm-1 (amide I and amide II region mainly related to protein structure conformation), ca. 1482-1180 cm-1 (mainly associated with structural carbohydrate) and ca. 1180-800 cm-1 (mainly related to carbohydrates) in complex plant-based system. The results showed that the molecular spectroscopy with multivariate technique could reveal the structural differences among the bioethanol feedstock sources and among their corresponding co-products. The AHCA and PCA analyses were able to distinguish the molecular structure differences associated with chemical functional groups among the different sources of the feedstock and their corresponding co-products. The molecular spectral differences indicated the differences in functional, biomolecular and biopolymer groups which were confirmed by wet chemical analysis. These biomolecular and biopolymer structural differences were associated with chemical and nutrient profiles and nutrient utilization and availability. Molecular spectral analyses had the potential to identify molecular structure difference among bioethanol feedstock sources and their corresponding co-products.
Peters, Max; van der Voort van Zyp, Jochem R N; Moerland, Marinus A; Hoekstra, Carel J; van de Pol, Sandrine; Westendorp, Hendrik; Maenhout, Metha; Kattevilder, Rob; Verkooijen, Helena M; van Rossum, Peter S N; Ahmed, Hashim U; Shah, Taimur T; Emberton, Mark; van Vulpen, Marco
2016-04-01
Whole-gland salvage Iodine-125-brachytherapy is a potentially curative treatment strategy for localised prostate cancer (PCa) recurrences after radiotherapy. Prognostic factors influencing PCa-specific and overall survival (PCaSS & OS) are not known. The objective of this study was to develop a multivariable, internally validated prognostic model for survival after whole-gland salvage I-125-brachytherapy. Whole-gland salvage I-125-brachytherapy patients treated in the Netherlands from 1993-2010 were included. Eligible patients had a transrectal ultrasound-guided biopsy-confirmed localised recurrence after biochemical failure (clinical judgement, ASTRO or Phoenix-definition). Recurrences were assessed clinically and with CT and/or MRI. Metastases were excluded using CT/MRI and technetium-99m scintigraphy. Multivariable Cox-regression was used to assess the predictive value of clinical characteristics in relation to PCa-specific and overall mortality. PCa-specific mortality was defined as patients dying with distant metastases present. Missing data were handled using multiple imputation (20 imputed sets). Internal validation was performed and the C-statistic calculated. Calibration plots were created to visually assess the goodness-of-fit of the final model. Optimism-corrected survival proportions were calculated. All analyses were performed according to the TRIPOD statement. Median total follow-up was 78months (range 5-139). A total of 62 patients were treated, of which 28 (45%) died from PCa after mean (±SD) 82 (±36) months. Overall, 36 patients (58%) patients died after mean 84 (±40) months. PSA doubling time (PSADT) remained a predictive factor for both types of mortality (PCa-specific and overall): corrected hazard ratio's (HR's) 0.92 (95% CI: 0.86-0.98, p=0.02) and 0.94 (95% CI: 0.90-0.99, p=0.01), respectively (C-statistics 0.71 and 0.69, respectively). Calibration was accurate up to 96month follow-up. Over 80% of patients can survive 8years if PSADT>24months (PCaSS) and >33months (OS). Only approximately 50% survival is achieved with a PSADT of 12months. A PSADT of respectively >24months and >33months can result in >80% probability of PCa- specific and overall survival 8years after whole-gland salvage I-125-brachytherapy. Survival should be weighed against toxicity from a salvage procedure. Larger series and external validation are necessary. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
An anthropometric study of Serbian metal industry workers.
Omić, S; Brkić, V K Spasojevic; Golubović, T A; Brkić, A D; Klarin, M M
2017-01-01
There are recent studies using new industrial workers' anthropometric data in different countries, but for Serbia such data are not available. This study is the first anthropometric study of Serbian metal industry workers in the country, whose labor force is increasingly employed both on local and international markets. The metal industry is one of Serbia's most important economic sectors. To this end, we collected the basic static anthropometric dimensions of 122 industrial workers and used principal components analysis (PCA) to obtain multivariate anthropometric models. To confirm the results, the dimensions of an additional 50 workers were collected. The PCA methodology was also compared with the percentile method. Comparing both data samples, we found that 96% of the participants are within the tolerance ellipsoid. According to this study, multivariate modeling covers a larger extent of the intended population proportion compared to percentiles. The results of this research are useful for the designers of metal industry workstations. This information can be used in dimensioning the workplace, thus increasing job satisfaction, reducing the risk of injuries and fatalities, and consequently increasing productivity and safety.
Nonomura, Norio; Takayama, Hitoshi; Nakayama, Masashi; Nakai, Yasutomo; Kawashima, Atsunari; Mukai, Masatoshi; Nagahara, Akira; Aozasa, Katsuyuki; Tsujimura, Akira
2011-06-01
• To evaluate tumour-associated macrophage (TAM) infiltration in prostate biopsy specimens as a possible prognostic factor for prostate cancer (PCa) after hormonal therapy. • Immunostaining of TAMs in prostate biopsy specimens was performed using a monoclonal antibody CD68 for 71 patients having PCa treated with hormonal therapy. • Six microscopic (×400) fields around the cancer foci were selected for TAM counting. • The median value of serum prostate-specific antigen (PSA) was 50.1 ng/mL, and the median TAM count was 22. • Recurrence-free survival was significantly better in patients with fewer TAMs (<22) than in those with higher numbers of TAMs (≥22) (P < 0.001). • TAM count was higher in those with higher serum PSA (PSA), higher Gleason score, clinical T stage or those with PSA failure. Cox multivariate analysis showed that TAM count is one of the prognostic factors for PCa treated by hormonal therapy (P < 0.0001). • TAM infiltration in prostate needle biopsy specimens is a useful predictive factor for PSA failure or progression of PCa after hormonal therapy. © 2010 THE AUTHORS. BJU INTERNATIONAL © 2010 BJU INTERNATIONAL.
Receptor modeling for source apportionment of polycyclic aromatic hydrocarbons in urban atmosphere.
Singh, Kunwar P; Malik, Amrita; Kumar, Ranjan; Saxena, Puneet; Sinha, Sarita
2008-01-01
This study reports source apportionment of polycyclic aromatic hydrocarbons (PAHs) in particulate depositions on vegetation foliages near highway in the urban environment of Lucknow city (India) using the principal components analysis/absolute principal components scores (PCA/APCS) receptor modeling approach. The multivariate method enables identification of major PAHs sources along with their quantitative contributions with respect to individual PAH. The PCA identified three major sources of PAHs viz. combustion, vehicular emissions, and diesel based activities. The PCA/APCS receptor modeling approach revealed that the combustion sources (natural gas, wood, coal/coke, biomass) contributed 19-97% of various PAHs, vehicular emissions 0-70%, diesel based sources 0-81% and other miscellaneous sources 0-20% of different PAHs. The contributions of major pyrolytic and petrogenic sources to the total PAHs were 56 and 42%, respectively. Further, the combustion related sources contribute major fraction of the carcinogenic PAHs in the study area. High correlation coefficient (R2 > 0.75 for most PAHs) between the measured and predicted concentrations of PAHs suggests for the applicability of the PCA/APCS receptor modeling approach for estimation of source contribution to the PAHs in particulates.
Jha, Dilip Kumar; Vinithkumar, Nambali Valsalan; Sahu, Biraja Kumar; Dheenan, Palaiya Sukumaran; Das, Apurba Kumar; Begum, Mehmuna; Devi, Marimuthu Prashanthi; Kirubagaran, Ramalingam
2015-07-15
Chidiyatappu Bay is one of the least disturbed marine environments of Andaman & Nicobar Islands, the union territory of India. Oceanic flushing from southeast and northwest direction is prevalent in this bay. Further, anthropogenic activity is minimal in the adjoining environment. Considering the pristine nature of this bay, seawater samples collected from 12 sampling stations covering three seasons were analyzed. Principal Component Analysis (PCA) revealed 69.9% of total variance and exhibited strong factor loading for nitrite, chlorophyll a and phaeophytin. In addition, analysis of variance (ANOVA-one way), regression analysis, box-whisker plots and Geographical Information System based hot spot analysis further simplified and supported multivariate results. The results obtained are important to establish reference conditions for comparative study with other similar ecosystems in the region. Copyright © 2015 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Grasel, Fábio dos Santos; Ferrão, Marco Flôres; Wolf, Carlos Rodolfo
2016-01-01
Tannins are polyphenolic compounds of complex structures formed by secondary metabolism in several plants. These polyphenolic compounds have different applications, such as drugs, anti-corrosion agents, flocculants, and tanning agents. This study analyses six different type of polyphenolic extracts by Fourier transform infrared spectroscopy (FTIR) combined with multivariate analysis. Through both principal component analysis (PCA) and hierarchical cluster analysis (HCA), we observed well-defined separation between condensed (quebracho and black wattle) and hydrolysable (valonea, chestnut, myrobalan, and tara) tannins. For hydrolysable tannins, it was also possible to observe the formation of two different subgroups between samples of chestnut and valonea and between samples of tara and myrobalan. Among all samples analysed, the chestnut and valonea showed the greatest similarity, indicating that these extracts contain equivalent chemical compositions and structure and, therefore, similar properties.
Fast classification of hazelnut cultivars through portable infrared spectroscopy and chemometrics
NASA Astrophysics Data System (ADS)
Manfredi, Marcello; Robotti, Elisa; Quasso, Fabio; Mazzucco, Eleonora; Calabrese, Giorgio; Marengo, Emilio
2018-01-01
The authentication and traceability of hazelnuts is very important for both the consumer and the food industry, to safeguard the protected varieties and the food quality. This study investigates the use of a portable FTIR spectrometer coupled to multivariate statistical analysis for the classification of raw hazelnuts. The method discriminates hazelnuts from different origins/cultivars based on differences of the signal intensities of their IR spectra. The multivariate classification methods, namely principal component analysis (PCA) followed by linear discriminant analysis (LDA) and partial least square discriminant analysis (PLS-DA), with or without variable selection, allowed a very good discrimination among the groups, with PLS-DA coupled to variable selection providing the best results. Due to the fast analysis, high sensitivity, simplicity and no sample preparation, the proposed analytical methodology could be successfully used to verify the cultivar of hazelnuts, and the analysis can be performed quickly and directly on site.
NASA Astrophysics Data System (ADS)
Pujiwati, Arie; Nakamura, K.; Watanabe, N.; Komai, T.
2018-02-01
Multivariate analysis is applied to investigate geochemistry of several trace elements in top soils and their relation with the contamination source as the influence of coal mines in Jorong, South Kalimantan. Total concentration of Cd, V, Co, Ni, Cr, Zn, As, Pb, Sb, Cu and Ba was determined in 20 soil samples by the bulk analysis. Pearson correlation is applied to specify the linear correlation among the elements. Principal Component Analysis (PCA) and Cluster Analysis (CA) were applied to observe the classification of trace elements and contamination sources. The results suggest that contamination loading is contributed by Cr, Cu, Ni, Zn, As, and Pb. The elemental loading mostly affects the non-coal mining area, for instances the area near settlement and agricultural land use. Moreover, the contamination source is classified into the areas that are influenced by the coal mining activity, the agricultural types, and the river mixing zone. Multivariate analysis could elucidate the elemental loading and the contamination sources of trace elements in the vicinity of coal mine area.
DOE Office of Scientific and Technical Information (OSTI.GOV)
University of Illinois at Chicago; Montana State University; Bhardwaj, Chhavi
2013-04-01
7.87 to 10.5 eV vacuum ultraviolet (VUV) photon energies were used in laser desorption postionization mass spectrometry (LDPI-MS) to analyze biofilms comprised of binary cultures of interacting microorganisms. The effect of photon energy was examined using both tunable synchrotron and laser sources of VUV radiation. Principal components analysis (PCA) was applied to the MS data to differentiate species in Escherichia coli-Saccharomyces cerevisiae coculture biofilms. PCA of LDPI-MS also differentiated individual E. coli strains in a biofilm comprised of two interacting gene deletion strains, even though these strains differed from the wild type K-12 strain by no more than four genemore » deletions each out of approximately 2000 genes. PCA treatment of 7.87 eV LDPI-MS data separated the E. coli strains into three distinct groups two ?pure? groups and a mixed region. Furthermore, the ?pure? regions of the E. coli cocultures showed greater variance by PCA when analyzed by 7.87 eV photon energies than by 10.5 eV radiation. Comparison of the 7.87 and 10.5 eV data is consistent with the expectation that the lower photon energy selects a subset of low ionization energy analytes while 10.5 eV is more inclusive, detecting a wider range of analytes. These two VUV photon energies therefore give different spreads via PCA and their respective use in LDPI-MS constitute an additional experimental parameter to differentiate strains and species.« less
Goltz, Diane; Gevensleben, Heidrun; Dietrich, Jörn; Ellinger, Jörg; Landsberg, Jennifer; Kristiansen, Glen; Dietrich, Dimo
2016-01-01
Biomarkers that facilitate the prediction of disease recurrence in prostate cancer (PCa) may enable physicians to personalize treatment for individual patients. In the current study, PD-1 ( PDCD1 ) promoter methylation was assessed in a cohort of 498 PCa patients included in The Cancer Genome Atlas (TCGA) and a second cohort of 300 PCa cases treated at the University Hospital of Bonn. In the TCGA cohort, the PD-1 promoter was significantly hypermethylated in carcinomas versus normal prostatic epithelium (55.5% vs. 38.2%, p < 0.001) and PD-1 methylation ( mPD-1 ) inversely correlated with PD-1 mRNA expression in PCa (Spearman's ρ = -0.415, p < 0.001). In both cohorts, mPD-1 significantly correlated with preoperative prostate specific antigen (PSA). In univariate Cox Proportional Hazard analysis, mPD-1 served as a significant prognostic factor for biochemical recurrence (BCR)-free survival (Hazard ratio: HR = 2.35 [1.35-4.10], p = 0.003, n = 410) in the TCGA cohort. In multivariate analysis, mPD-1 was shown to add significant independent prognostic information adjunct to pathologic tumor category (pT) and Gleason grading group (HR = 2.08 [1.16-3.74], p = 0.014, n = 350). PD-1 promoter methylation analyses could thus potentially aid the identification of patients which might benefit from adjuvant treatment after radical prostatectomy. Moreover, our data suggest an intrinsic role of PD-1 in PCa carcinogenesis and disease progression, which needs to be addressed in future studies.
L-dopa decarboxylase (DDC) gene expression is related to outcome in patients with prostate cancer.
Koutalellis, Georgios; Stravodimos, Konstantinos; Avgeris, Margaritis; Mavridis, Konstantinos; Scorilas, Andreas; Lazaris, Andreas; Constantinides, Constantinos
2012-09-01
What's known on the subject? and What does the study add? L-dopa decarboxylase (DDC) has been documented as a novel co-activator of androgen receptor transcriptional activity. Recently, it was shown that DDC gene expression is significantly higher in patients with PCa than in those with BPH. In the present study, there was a significant association between the DDC gene expression levels and the pathological stage and Gleason score of patients with prostate cancer (PCa). Moreover, DDC expression was shown to be an unfavourable prognostic marker of biochemical recurrence and disease-free survival in patients with PCa treated by radical prostatectomy. To determine whether L-dopa decarboxylase gene (DDC) expression levels in patients with prostate cancer (PCa) correlate to biochemical recurrence and disease prognosis after radical prostatectomy (RP). The present study consisted of 56 samples with confirmed malignancy from patients with PCa who had undergone RP at a single tertiary academic centre. Total RNA was isolated from tissue specimens and a SYBR Green fluorescence-based quantitative real-time polymerase chain reaction methodology was developed for the determination of DDC mRNA expression levels of the tested tissues. Follow-up time ranged between 1.0 and 62.0 months (mean ± SE, 28.6 ± 2.1 month; median, 31.5 months). Time to biochemical recurrence was defined as the interval between the surgery and the measurement of two consecutive values of prostate-specific antigen (PSA) ≥0.2 ng/mL. DDC expression levels were found to be positively correlated with the tumour-node-metastasis stage (P = 0.021) and Gleason score (P = 0.036) of the patients with PCa. Patients with PCa with raised DDC expression levels run a significantly higher risk of biochemical recurrence after RP, as indicated by Cox proportional regression analysis (P = 0.021). Multivariate Cox proportional regression models revealed the preoperative PSA-, age- and digital rectal examination-independent prognostic value of DDC expression for the prediction of disease-free survival (DFS) among patients with PCa (P = 0.036). Kaplan-Meier survival analysis confirms the significantly shorter DFS after RP of PCa with higher DDC expression levels (P = 0.015). This is the first study indicating the potential of DDC expression as a novel prognostic biomarker in patients with PCa who have undergone RP. For further evaluation and clinical application of the findings of the present study, a direct analysis of mRNA and/or its protein expression level in preoperative biopsy, blood serum and urine should be conducted. © 2012 BJU INTERNATIONAL.
Beer fermentation: monitoring of process parameters by FT-NIR and multivariate data analysis.
Grassi, Silvia; Amigo, José Manuel; Lyndgaard, Christian Bøge; Foschino, Roberto; Casiraghi, Ernestina
2014-07-15
This work investigates the capability of Fourier-Transform near infrared (FT-NIR) spectroscopy to monitor and assess process parameters in beer fermentation at different operative conditions. For this purpose, the fermentation of wort with two different yeast strains and at different temperatures was monitored for nine days by FT-NIR. To correlate the collected spectra with °Brix, pH and biomass, different multivariate data methodologies were applied. Principal component analysis (PCA), partial least squares (PLS) and locally weighted regression (LWR) were used to assess the relationship between FT-NIR spectra and the abovementioned process parameters that define the beer fermentation. The accuracy and robustness of the obtained results clearly show the suitability of FT-NIR spectroscopy, combined with multivariate data analysis, to be used as a quality control tool in the beer fermentation process. FT-NIR spectroscopy, when combined with LWR, demonstrates to be a perfectly suitable quantitative method to be implemented in the production of beer. Copyright © 2014 Elsevier Ltd. All rights reserved.
Partial Least Squares for Discrimination in fMRI Data
Andersen, Anders H.; Rayens, William S.; Liu, Yushu; Smith, Charles D.
2011-01-01
Multivariate methods for discrimination were used in the comparison of brain activation patterns between groups of cognitively normal women who are at either high or low Alzheimer's disease risk based on family history and apolipoprotein-E4 status. Linear discriminant analysis (LDA) was preceded by dimension reduction using either principal component analysis (PCA), partial least squares (PLS), or a new oriented partial least squares (OrPLS) method. The aim was to identify a spatial pattern of functionally connected brain regions that was differentially expressed by the risk groups and yielded optimal classification accuracy. Multivariate dimension reduction is required prior to LDA when the data contains more feature variables than there are observations on individual subjects. Whereas PCA has been commonly used to identify covariance patterns in neuroimaging data, this approach only identifies gross variability and is not capable of distinguishing among-groups from within-groups variability. PLS and OrPLS provide a more focused dimension reduction by incorporating information on class structure and therefore lead to more parsimonious models for discrimination. Performance was evaluated in terms of the cross-validated misclassification rates. The results support the potential of using fMRI as an imaging biomarker or diagnostic tool to discriminate individuals with disease or high risk. PMID:22227352
Bastidas, Camila Y; von Plessing, Carlos; Troncoso, José; Del P Castillo, Rosario
2018-04-15
Fourier Transform infrared imaging and multivariate analysis were used to identify, at the microscopic level, the presence of florfenicol (FF), a heavily-used antibiotic in the salmon industry, supplied to fishes in feed pellets for the treatment of salmonid rickettsial septicemia (SRS). The FF distribution was evaluated using Principal Component Analysis (PCA) and Augmented Multivariate Curve Resolution with Alternating Least Squares (augmented MCR-ALS) on the spectra obtained from images with pixel sizes of 6.25 μm × 6.25 μm and 1.56 μm × 1.56 μm, in different zones of feed pellets. Since the concentration of the drug was 3.44 mg FF/g pellet, this is the first report showing the powerful ability of the used of spectroscopic techniques and multivariate analysis, especially the augmented MCR-ALS, to describe the FF distribution in both the surface and inner parts of feed pellets at low concentration, in a complex matrix and at the microscopic level. The results allow monitoring the incorporation of the drug into the feed pellets. Copyright © 2018 Elsevier B.V. All rights reserved.
Metsalu, Tauno; Vilo, Jaak
2015-01-01
The Principal Component Analysis (PCA) is a widely used method of reducing the dimensionality of high-dimensional data, often followed by visualizing two of the components on the scatterplot. Although widely used, the method is lacking an easy-to-use web interface that scientists with little programming skills could use to make plots of their own data. The same applies to creating heatmaps: it is possible to add conditional formatting for Excel cells to show colored heatmaps, but for more advanced features such as clustering and experimental annotations, more sophisticated analysis tools have to be used. We present a web tool called ClustVis that aims to have an intuitive user interface. Users can upload data from a simple delimited text file that can be created in a spreadsheet program. It is possible to modify data processing methods and the final appearance of the PCA and heatmap plots by using drop-down menus, text boxes, sliders etc. Appropriate defaults are given to reduce the time needed by the user to specify input parameters. As an output, users can download PCA plot and heatmap in one of the preferred file formats. This web server is freely available at http://biit.cs.ut.ee/clustvis/. PMID:25969447
Kalegowda, Yogesh; Harmer, Sarah L
2013-01-08
Artificial neural network (ANN) and a hybrid principal component analysis-artificial neural network (PCA-ANN) classifiers have been successfully implemented for classification of static time-of-flight secondary ion mass spectrometry (ToF-SIMS) mass spectra collected from complex Cu-Fe sulphides (chalcopyrite, bornite, chalcocite and pyrite) at different flotation conditions. ANNs are very good pattern classifiers because of: their ability to learn and generalise patterns that are not linearly separable; their fault and noise tolerance capability; and high parallelism. In the first approach, fragments from the whole ToF-SIMS spectrum were used as input to the ANN, the model yielded high overall correct classification rates of 100% for feed samples, 88% for conditioned feed samples and 91% for Eh modified samples. In the second approach, the hybrid pattern classifier PCA-ANN was integrated. PCA is a very effective multivariate data analysis tool applied to enhance species features and reduce data dimensionality. Principal component (PC) scores which accounted for 95% of the raw spectral data variance, were used as input to the ANN, the model yielded high overall correct classification rates of 88% for conditioned feed samples and 95% for Eh modified samples. Copyright © 2012 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Webb-Robertson, Bobbie-Jo M.; Jarman, Kristin H.; Harvey, Scott D.
2005-05-28
A fundamental problem in analysis of highly multivariate spectral or chromatographic data is reduction of dimensionality. Principal components analysis (PCA), concerned with explaining the variance-covariance structure of the data, is a commonly used approach to dimension reduction. Recently an attractive alternative to PCA, sequential projection pursuit (SPP), has been introduced. Designed to elicit clustering tendencies in the data, SPP may be more appropriate when performing clustering or classification analysis. However, the existing genetic algorithm (GA) implementation of SPP has two shortcomings, computation time and inability to determine the number of factors necessary to explain the majority of the structure inmore » the data. We address both these shortcomings. First, we introduce a new SPP algorithm, a random scan sampling algorithm (RSSA), that significantly reduces computation time. We compare the computational burden of the RSS and GA implementation for SPP on a dataset containing Raman spectra of twelve organic compounds. Second, we propose a Bayes factor criterion, BFC, as an effective measure for selecting the number of factors needed to explain the majority of the structure in the data. We compare SPP to PCA on two datasets varying in type, size, and difficulty; in both cases SPP achieves a higher accuracy with a lower number of latent variables.« less
Zhang, Wei; Ren, Shan-Cheng; Shi, Xiao-Lei; Liu, Ya-Wei; Zhu, Ya-Sheng; Jing, Tai-Le; Wang, Fu-Bo; Chen, Rui; Xu, Chuan-Liang; Wang, Hui-Qing; Wang, Hai-Feng; Wang, Yan; Liu, Bing; Li, Yao-Ming; Fang, Zi-Yu; Guo, Fei; Lu, Xin; Shen, Dan; Gao, Xu; Hou, Jian-Guo; Sun, Ying-Hao
2015-05-01
Long non-coding RNA (LncRNA) PCA3 has been a well-established urine biomarker for the detection of prostate cancer (PCa). Our previous study showed a novel LncRNA FR0348383 is up-regulated in over 70% of PCa compared with matched benign tissues. The aim of this study was to evaluate the diagnostic value of urinary FR0348383 for men undergoing prostate biopsy due to elevated PSA (PSA > 4.0 ng/ml) and/or abnormal digital rectal examination (DRE). Post-DRE first-catch urine specimens prior to prostate biopsies were prospectively collected. After the whole transcriptome amplification, quantitative real time polymerase chain reaction was applied to quantify urine FR0348383 and PSA levels. The FR0348383 score was calculated as the ratio of PSA and FR0348383 mRNA (PSA mRNA/FR0348383 mRNA × 1000). The diagnostic value of FR0348383 score was evaluated by logistic regression and decision curve analysis. 213 cases with urine samples containing sufficient mRNA were included, 94 cases had serum PSA level 4.0-10.0 ng/ml. PCa was identified in 72 cases. An increasing FR0348383 score was correlated with an increasing probability of a positive biopsy (P < 0.001). Multivariable logistic analysis indicated FR0348383 score (P < 0.001), PSA (P = 0.004), age (P = 0.007), prostate volume (P < 0.001) were independent predictors of PCa. ROC analysis demonstrated FR0348383 score outperformed PSA, %free PSA, and PSA Density in the prediction of PCa in the subgroup of patients with grey area PSA (AUC: 0.815 vs. 0.562 vs. 0.599 vs. 0.645). When using a probability threshold of 30% in the grey zone cohort, The FR0348383 score would save 52.0% of avoidable biopsies without missing any high grade cancers. FR0348383 transcript in post-DRE urine may be a novel biomarker for detection of PCa with great diagnostic value, especially in the grey zone cohort. The application of FR0348383 score in clinical practice might avoid unnecessary prostate biopsies and increase the specificity of PCa diagnosis. © 2015 Wiley Periodicals, Inc.
Autonomic specificity of basic emotions: evidence from pattern classification and cluster analysis.
Stephens, Chad L; Christie, Israel C; Friedman, Bruce H
2010-07-01
Autonomic nervous system (ANS) specificity of emotion remains controversial in contemporary emotion research, and has received mixed support over decades of investigation. This study was designed to replicate and extend psychophysiological research, which has used multivariate pattern classification analysis (PCA) in support of ANS specificity. Forty-nine undergraduates (27 women) listened to emotion-inducing music and viewed affective films while a montage of ANS variables, including heart rate variability indices, peripheral vascular activity, systolic time intervals, and electrodermal activity, were recorded. Evidence for ANS discrimination of emotion was found via PCA with 44.6% of overall observations correctly classified into the predicted emotion conditions, using ANS variables (z=16.05, p<.001). Cluster analysis of these data indicated a lack of distinct clusters, which suggests that ANS responses to the stimuli were nomothetic and stimulus-specific rather than idiosyncratic and individual-specific. Collectively these results further confirm and extend support for the notion that basic emotions have distinct ANS signatures. Copyright © 2010 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Bhattacharjee, T.; Kumar, P.; Fillipe, L.
2018-02-01
Vibrational spectroscopy, especially FTIR and Raman, has shown enormous potential in disease diagnosis, especially in cancers. Their potential for detecting varied pathological conditions are regularly reported. However, to prove their applicability in clinics, large multi-center multi-national studies need to be undertaken; and these will result in enormous amount of data. A parallel effort to develop analytical methods, including user-friendly software that can quickly pre-process data and subject them to required multivariate analysis is warranted in order to obtain results in real time. This study reports a MATLAB based script that can automatically import data, preprocess spectra— interpolation, derivatives, normalization, and then carry out Principal Component Analysis (PCA) followed by Linear Discriminant Analysis (LDA) of the first 10 PCs; all with a single click. The software has been verified on data obtained from cell lines, animal models, and in vivo patient datasets, and gives results comparable to Minitab 16 software. The software can be used to import variety of file extensions, asc, .txt., .xls, and many others. Options to ignore noisy data, plot all possible graphs with PCA factors 1 to 5, and save loading factors, confusion matrices and other parameters are also present. The software can provide results for a dataset of 300 spectra within 0.01 s. We believe that the software will be vital not only in clinical trials using vibrational spectroscopic data, but also to obtain rapid results when these tools get translated into clinics.
Guo, Jing; Yuan, Yahong; Dou, Pei; Yue, Tianli
2017-10-01
Fifty-one kiwifruit juice samples of seven kiwifruit varieties from five regions in China were analyzed to determine their polyphenols contents and to trace fruit varieties and geographical origins by multivariate statistical analysis. Twenty-one polyphenols belonging to four compound classes were determined by ultra-high-performance liquid chromatography coupled with ultra-high-resolution TOF mass spectrometry. (-)-Epicatechin, (+)-catechin, procyanidin B1 and caffeic acid derivatives were the predominant phenolic compounds in the juices. Principal component analysis (PCA) allowed a clear separation of the juices according to kiwifruit varieties. Stepwise linear discriminant analysis (SLDA) yielded satisfactory categorization of samples, provided 100% success rate according to kiwifruit varieties and 92.2% success rate according to geographical origins. The result showed that polyphenolic profiles of kiwifruit juices contain enough information to trace fruit varieties and geographical origins. Copyright © 2017 Elsevier Ltd. All rights reserved.
Grasel, Fábio dos Santos; Ferrão, Marco Flôres; Wolf, Carlos Rodolfo
2016-01-15
Tannins are polyphenolic compounds of complex structures formed by secondary metabolism in several plants. These polyphenolic compounds have different applications, such as drugs, anti-corrosion agents, flocculants, and tanning agents. This study analyses six different type of polyphenolic extracts by Fourier transform infrared spectroscopy (FTIR) combined with multivariate analysis. Through both principal component analysis (PCA) and hierarchical cluster analysis (HCA), we observed well-defined separation between condensed (quebracho and black wattle) and hydrolysable (valonea, chestnut, myrobalan, and tara) tannins. For hydrolysable tannins, it was also possible to observe the formation of two different subgroups between samples of chestnut and valonea and between samples of tara and myrobalan. Among all samples analysed, the chestnut and valonea showed the greatest similarity, indicating that these extracts contain equivalent chemical compositions and structure and, therefore, similar properties. Copyright © 2015 Elsevier B.V. All rights reserved.
Classification of adulterated honeys by multivariate analysis.
Amiry, Saber; Esmaiili, Mohsen; Alizadeh, Mohammad
2017-06-01
In this research, honey samples were adulterated with date syrup (DS) and invert sugar syrup (IS) at three concentrations (7%, 15% and 30%). 102 adulterated samples were prepared in six batches with 17 replications for each batch. For each sample, 32 parameters including color indices, rheological, physical, and chemical parameters were determined. To classify the samples, based on type and concentrations of adulterant, a multivariate analysis was applied using principal component analysis (PCA) followed by a linear discriminant analysis (LDA). Then, 21 principal components (PCs) were selected in five sets. Approximately two-thirds were identified correctly using color indices (62.75%) or rheological properties (67.65%). A power discrimination was obtained using physical properties (97.06%), and the best separations were achieved using two sets of chemical properties (set 1: lactone, diastase activity, sucrose - 100%) (set 2: free acidity, HMF, ash - 95%). Copyright © 2016 Elsevier Ltd. All rights reserved.
Levin-Schwartz, Yuri; Song, Yang; Schreier, Peter J.; Calhoun, Vince D.; Adalı, Tülay
2016-01-01
Due to their data-driven nature, multivariate methods such as canonical correlation analysis (CCA) have proven very useful for fusion of multimodal neurological data. However, being able to determine the degree of similarity between datasets and appropriate order selection are crucial to the success of such techniques. The standard methods for calculating the order of multimodal data focus only on sources with the greatest individual energy and ignore relations across datasets. Additionally, these techniques as well as the most widely-used methods for determining the degree of similarity between datasets assume sufficient sample support and are not effective in the sample-poor regime. In this paper, we propose to jointly estimate the degree of similarity between datasets and their order when few samples are present using principal component analysis and canonical correlation analysis (PCA-CCA). By considering these two problems simultaneously, we are able to minimize the assumptions placed on the data and achieve superior performance in the sample-poor regime compared to traditional techniques. We apply PCA-CCA to the pairwise combinations of functional magnetic resonance imaging (fMRI), structural magnetic resonance imaging (sMRI), and electroencephalogram (EEG) data drawn from patients with schizophrenia and healthy controls while performing an auditory oddball task. The PCA-CCA results indicate that the fMRI and sMRI datasets are the most similar, whereas the sMRI and EEG datasets share the least similarity. We also demonstrate that the degree of similarity obtained by PCA-CCA is highly predictive of the degree of significance found for components generated using CCA. PMID:27039696
Vermathen, Martina; Marzorati, Mattia; Vermathen, Peter
2012-01-01
Classical liquid-state high-resolution (HR) NMR spectroscopy has proved a powerful tool in the metabonomic analysis of liquid food samples like fruit juices. In this paper the application of (1)H high-resolution magic angle spinning (HR-MAS) NMR spectroscopy to apple tissue is presented probing its potential for metabonomic studies. The (1)H HR-MAS NMR spectra are discussed in terms of the chemical composition of apple tissue and compared to liquid-state NMR spectra of apple juice. Differences indicate that specific metabolic changes are induced by juice preparation. The feasibility of HR-MAS NMR-based multivariate analysis is demonstrated by a study distinguishing three different apple cultivars by principal component analysis (PCA). Preliminary results are shown from subsequent studies comparing three different cultivation methods by means of PCA and partial least squares discriminant analysis (PLS-DA) of the HR-MAS NMR data. The compounds responsible for discriminating organically grown apples are discussed. Finally, an outlook of our ongoing work is given including a longitudinal study on apples.
Automated Classification and Analysis of Non-metallic Inclusion Data Sets
NASA Astrophysics Data System (ADS)
Abdulsalam, Mohammad; Zhang, Tongsheng; Tan, Jia; Webler, Bryan A.
2018-05-01
The aim of this study is to utilize principal component analysis (PCA), clustering methods, and correlation analysis to condense and examine large, multivariate data sets produced from automated analysis of non-metallic inclusions. Non-metallic inclusions play a major role in defining the properties of steel and their examination has been greatly aided by automated analysis in scanning electron microscopes equipped with energy dispersive X-ray spectroscopy. The methods were applied to analyze inclusions on two sets of samples: two laboratory-scale samples and four industrial samples from a near-finished 4140 alloy steel components with varying machinability. The laboratory samples had well-defined inclusions chemistries, composed of MgO-Al2O3-CaO, spinel (MgO-Al2O3), and calcium aluminate inclusions. The industrial samples contained MnS inclusions as well as (Ca,Mn)S + calcium aluminate oxide inclusions. PCA could be used to reduce inclusion chemistry variables to a 2D plot, which revealed inclusion chemistry groupings in the samples. Clustering methods were used to automatically classify inclusion chemistry measurements into groups, i.e., no user-defined rules were required.
Non-linear principal component analysis applied to Lorenz models and to North Atlantic SLP
NASA Astrophysics Data System (ADS)
Russo, A.; Trigo, R. M.
2003-04-01
A non-linear generalisation of Principal Component Analysis (PCA), denoted Non-Linear Principal Component Analysis (NLPCA), is introduced and applied to the analysis of three data sets. Non-Linear Principal Component Analysis allows for the detection and characterisation of low-dimensional non-linear structure in multivariate data sets. This method is implemented using a 5-layer feed-forward neural network introduced originally in the chemical engineering literature (Kramer, 1991). The method is described and details of its implementation are addressed. Non-Linear Principal Component Analysis is first applied to a data set sampled from the Lorenz attractor (1963). It is found that the NLPCA approximations are more representative of the data than are the corresponding PCA approximations. The same methodology was applied to the less known Lorenz attractor (1984). However, the results obtained weren't as good as those attained with the famous 'Butterfly' attractor. Further work with this model is underway in order to assess if NLPCA techniques can be more representative of the data characteristics than are the corresponding PCA approximations. The application of NLPCA to relatively 'simple' dynamical systems, such as those proposed by Lorenz, is well understood. However, the application of NLPCA to a large climatic data set is much more challenging. Here, we have applied NLPCA to the sea level pressure (SLP) field for the entire North Atlantic area and the results show a slight imcrement of explained variance associated. Finally, directions for future work are presented.%}
Quantitative Ultrasound Using Texture Analysis of Myofascial Pain Syndrome in the Trapezius.
Kumbhare, Dinesh A; Ahmed, Sara; Behr, Michael G; Noseworthy, Michael D
2018-01-01
Objective-The objective of this study is to assess the discriminative ability of textural analyses to assist in the differentiation of the myofascial trigger point (MTrP) region from normal regions of skeletal muscle. Also, to measure the ability to reliably differentiate between three clinically relevant groups: healthy asymptomatic, latent MTrPs, and active MTrP. Methods-18 and 19 patients were identified with having active and latent MTrPs in the trapezius muscle, respectively. We included 24 healthy volunteers. Images were obtained by research personnel, who were blinded with respect to the clinical status of the study participant. Histograms provided first-order parameters associated with image grayscale. Haralick, Galloway, and histogram-related features were used in texture analysis. Blob analysis was conducted on the regions of interest (ROIs). Principal component analysis (PCA) was performed followed by multivariate analysis of variance (MANOVA) to determine the statistical significance of the features. Results-92 texture features were analyzed for factorability using Bartlett's test of sphericity, which was significant. The Kaiser-Meyer-Olkin measure of sampling adequacy was 0.94. PCA demonstrated rotated eigenvalues of the first eight components (each comprised of multiple texture features) explained 94.92% of the cumulative variance in the ultrasound image characteristics. The 24 features identified by PCA were included in the MANOVA as dependent variables, and the presence of a latent or active MTrP or healthy muscle were independent variables. Conclusion-Texture analysis techniques can discriminate between the three clinically relevant groups.
NASA Astrophysics Data System (ADS)
Yehia, Ali M.; Mohamed, Heba M.
2016-01-01
Three advanced chemmometric-assisted spectrophotometric methods namely; Concentration Residuals Augmented Classical Least Squares (CRACLS), Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS) and Principal Component Analysis-Artificial Neural Networks (PCA-ANN) were developed, validated and benchmarked to PLS calibration; to resolve the severely overlapped spectra and simultaneously determine; Paracetamol (PAR), Guaifenesin (GUA) and Phenylephrine (PHE) in their ternary mixture and in presence of p-aminophenol (AP) the main degradation product and synthesis impurity of Paracetamol. The analytical performance of the proposed methods was described by percentage recoveries, root mean square error of calibration and standard error of prediction. The four multivariate calibration methods could be directly used without any preliminary separation step and successfully applied for pharmaceutical formulation analysis, showing no excipients' interference.
Influence factors and forecast of carbon emission in China: structure adjustment for emission peak
NASA Astrophysics Data System (ADS)
Wang, B.; Cui, C. Q.; Li, Z. P.
2018-02-01
This paper introduced Principal Component Analysis and Multivariate Linear Regression Model to verify long-term balance relationships between Carbon Emissions and the impact factors. The integrated model of improved PCA and multivariate regression analysis model is attainable to figure out the pattern of carbon emission sources. Main empirical results indicate that among all selected variables, the role of energy consumption scale was largest. GDP and Population follow and also have significant impacts on carbon emission. Industrialization rate and fossil fuel proportion, which is the indicator of reflecting the economic structure and energy structure, have a higher importance than the factor of urbanization rate and the dweller consumption level of urban areas. In this way, some suggestions are put forward for government to achieve the peak of carbon emissions.
Use of direct gradient analysis to uncover biological hypotheses in 16s survey data and beyond.
Erb-Downward, John R; Sadighi Akha, Amir A; Wang, Juan; Shen, Ning; He, Bei; Martinez, Fernando J; Gyetko, Margaret R; Curtis, Jeffrey L; Huffnagle, Gary B
2012-01-01
This study investigated the use of direct gradient analysis of bacterial 16S pyrosequencing surveys to identify relevant bacterial community signals in the midst of a "noisy" background, and to facilitate hypothesis-testing both within and beyond the realm of ecological surveys. The results, utilizing 3 different real world data sets, demonstrate the utility of adding direct gradient analysis to any analysis that draws conclusions from indirect methods such as Principal Component Analysis (PCA) and Principal Coordinates Analysis (PCoA). Direct gradient analysis produces testable models, and can identify significant patterns in the midst of noisy data. Additionally, we demonstrate that direct gradient analysis can be used with other kinds of multivariate data sets, such as flow cytometric data, to identify differentially expressed populations. The results of this study demonstrate the utility of direct gradient analysis in microbial ecology and in other areas of research where large multivariate data sets are involved.
Goltz, Diane; Gevensleben, Heidrun; Dietrich, Jörn; Ellinger, Jörg; Landsberg, Jennifer; Kristiansen, Glen; Dietrich, Dimo
2016-01-01
ABSTRACT Biomarkers that facilitate the prediction of disease recurrence in prostate cancer (PCa) may enable physicians to personalize treatment for individual patients. In the current study, PD-1 (PDCD1) promoter methylation was assessed in a cohort of 498 PCa patients included in The Cancer Genome Atlas (TCGA) and a second cohort of 300 PCa cases treated at the University Hospital of Bonn. In the TCGA cohort, the PD-1 promoter was significantly hypermethylated in carcinomas versus normal prostatic epithelium (55.5% vs. 38.2%, p < 0.001) and PD-1 methylation (mPD-1) inversely correlated with PD-1 mRNA expression in PCa (Spearman's ρ = −0.415, p < 0.001). In both cohorts, mPD-1 significantly correlated with preoperative prostate specific antigen (PSA). In univariate Cox Proportional Hazard analysis, mPD-1 served as a significant prognostic factor for biochemical recurrence (BCR)-free survival (Hazard ratio: HR = 2.35 [1.35–4.10], p = 0.003, n = 410) in the TCGA cohort. In multivariate analysis, mPD-1 was shown to add significant independent prognostic information adjunct to pathologic tumor category (pT) and Gleason grading group (HR = 2.08 [1.16–3.74], p = 0.014, n = 350). PD-1 promoter methylation analyses could thus potentially aid the identification of patients which might benefit from adjuvant treatment after radical prostatectomy. Moreover, our data suggest an intrinsic role of PD-1 in PCa carcinogenesis and disease progression, which needs to be addressed in future studies. PMID:27853645
Hagiwara, Kazuhisa; Tobisawa, Yuki; Kaya, Takatoshi; Kaneko, Tomonori; Hatakeyama, Shingo; Mori, Kazuyuki; Hashimoto, Yasuhiro; Koie, Takuya; Suda, Yoshihiko; Ohyama, Chikara; Yoneyama, Tohru
2017-01-01
Wisteria floribunda agglutinin (WFA) preferably binds to LacdiNAc glycans, and its reactivity is associated with tumor progression. The aim of this study to examine whether the serum LacdiNAc carrying prostate-specific antigen–glycosylation isomer (PSA-Gi) and WFA-reactivity of tumor tissue can be applied as a diagnostic and prognostic marker of prostate cancer (PCa). Between 2007 and 2016, serum PSA-Gi levels before prostate biopsy (Pbx) were measured in 184 biopsy-proven benign prostatic hyperplasia patients and 244 PCa patients using an automated lectin-antibody immunoassay. WFA-reactivity on tumor was analyzed in 260 radical prostatectomy (RP) patients. Diagnostic and prognostic performance of serum PSA-Gi was evaluated using area under the receiver-operator characteristic curve (AUC). Prognostic performance of WFA-reactivity on tumor was evaluated via Cox proportional hazards regression analysis and nomogram. The AUC of serum PSA-Gi detecting PCa and predicting Pbx Grade Group (GG) 3 and GG ≥ 3 after RP was much higher than those of conventional PSA. Multivariate analysis showed that WFA-reactivity on prostate tumor was an independent risk factor of PSA recurrence. The nomogram was a strong model for predicting PSA-free survival provability with a c-index ≥0.7. Serum PSA-Gi levels and WFA-reactivity on prostate tumor may be a novel diagnostic and pre- and post-operative prognostic biomarkers of PCa, respectively. PMID:28134773
Hagiwara, Kazuhisa; Tobisawa, Yuki; Kaya, Takatoshi; Kaneko, Tomonori; Hatakeyama, Shingo; Mori, Kazuyuki; Hashimoto, Yasuhiro; Koie, Takuya; Suda, Yoshihiko; Ohyama, Chikara; Yoneyama, Tohru
2017-01-26
Wisteria floribunda agglutinin (WFA) preferably binds to LacdiNAc glycans, and its reactivity is associated with tumor progression. The aim of this study to examine whether the serum LacdiNAc carrying prostate-specific antigen-glycosylation isomer (PSA-Gi) and WFA-reactivity of tumor tissue can be applied as a diagnostic and prognostic marker of prostate cancer (PCa). Between 2007 and 2016, serum PSA-Gi levels before prostate biopsy (Pbx) were measured in 184 biopsy-proven benign prostatic hyperplasia patients and 244 PCa patients using an automated lectin-antibody immunoassay. WFA-reactivity on tumor was analyzed in 260 radical prostatectomy (RP) patients. Diagnostic and prognostic performance of serum PSA-Gi was evaluated using area under the receiver-operator characteristic curve (AUC). Prognostic performance of WFA-reactivity on tumor was evaluated via Cox proportional hazards regression analysis and nomogram. The AUC of serum PSA-Gi detecting PCa and predicting Pbx Grade Group (GG) 3 and GG ≥ 3 after RP was much higher than those of conventional PSA. Multivariate analysis showed that WFA-reactivity on prostate tumor was an independent risk factor of PSA recurrence. The nomogram was a strong model for predicting PSA-free survival provability with a c -index ≥0.7. Serum PSA-Gi levels and WFA-reactivity on prostate tumor may be a novel diagnostic and pre- and post-operative prognostic biomarkers of PCa, respectively.
Multivariate Statistical Analysis of Water Quality data in Indian River Lagoon, Florida
NASA Astrophysics Data System (ADS)
Sayemuzzaman, M.; Ye, M.
2015-12-01
The Indian River Lagoon, is part of the longest barrier island complex in the United States, is a region of particular concern to the environmental scientist because of the rapid rate of human development throughout the region and the geographical position in between the colder temperate zone and warmer sub-tropical zone. Thus, the surface water quality analysis in this region always brings the newer information. In this present study, multivariate statistical procedures were applied to analyze the spatial and temporal water quality in the Indian River Lagoon over the period 1998-2013. Twelve parameters have been analyzed on twelve key water monitoring stations in and beside the lagoon on monthly datasets (total of 27,648 observations). The dataset was treated using cluster analysis (CA), principle component analysis (PCA) and non-parametric trend analysis. The CA was used to cluster twelve monitoring stations into four groups, with stations on the similar surrounding characteristics being in the same group. The PCA was then applied to the similar groups to find the important water quality parameters. The principal components (PCs), PC1 to PC5 was considered based on the explained cumulative variances 75% to 85% in each cluster groups. Nutrient species (phosphorus and nitrogen), salinity, specific conductivity and erosion factors (TSS, Turbidity) were major variables involved in the construction of the PCs. Statistical significant positive or negative trends and the abrupt trend shift were detected applying Mann-Kendall trend test and Sequential Mann-Kendall (SQMK), for each individual stations for the important water quality parameters. Land use land cover change pattern, local anthropogenic activities and extreme climate such as drought might be associated with these trends. This study presents the multivariate statistical assessment in order to get better information about the quality of surface water. Thus, effective pollution control/management of the surface waters can be undertaken.
Tomlins, Scott A.; Alshalalfa, Mohammed; Davicioni, Elai; Erho, Nicholas; Yousefi, Kasra; Zhao, Shuang; Haddad, Zaid; Den, Robert B.; Dicker, Adam P.; Trock, Bruce; DeMarzo, Angelo; Ross, Ashley; Schaeffer, Edward M.; Klein, Eric A.; Magi-Galluzzi, Cristina; Karnes, Jeffery R.; Jenkins, Robert B.; Feng, Felix Y.
2015-01-01
Background Prostate cancer (PCa) molecular subtypes have been defined by essentially mutually exclusive events, including ETS gene fusions (most commonly involving ERG) and SPINK1 over-expression. Clinical assessment may aid in disease stratification, complementing available prognostic tests. Objective To determine the analytical validity and clinicopatholgical associations of microarray-based molecular subtyping. Design, Setting and Participants We analyzed Affymetrix GeneChip expression profiles for 1,577 patients from eight radical prostatectomy (RP) cohorts, including 1,351 cases assessed using the Decipher prognostic assay (performed in a CLIA-certified laboratory). A microarray-based (m-) random forest ERG classification model was trained and validated. Outlier expression analysis was used to predict other mutually exclusive non-ERG ETS gene rearrangements (ETS+) or SPINK1 over-expression (SPINK1+). Outcome Measurements Associations with clinical features and outcomes by multivariable logistic regression analysis and receiver operating curves. Results and Limitations The m-ERG classifier showed 95% accuracy in an independent validation subset (n=155 samples). Across cohorts, 45%, 9%, 8% and 38% of PCa were classified as m-ERG+, m-ETS+, m-SPINK1+, and triple negative (m-ERG−/m-ETS−/m-SPINK1−), respectively. Gene expression profiling supports three underlying molecularly defined groups (m-ERG+, m-ETS+ and m-SPINK1+/triple negative). On multivariable analysis, m-ERG+ tumors were associated with lower preoperative serum PSA and Gleason scores, but enriched for extraprostatic extension (p<0.001). m-ETS+ tumors were associated with seminal vesicle invasion (p=0.01), while m-SPINK1+/triple negative tumors had higher Gleason scores and were more frequent in Black/African American patients (p<0.001). Clinical outcomes were not significantly different between subtypes. Conclusions A clinically available prognostic test (Decipher) can also assess PCa molecular subtypes, obviating the need for additional testing. Clinicopathological differences were found among subtypes based on global expression patterns. PMID:25964175
Matiatos, Ioannis; Alexopoulos, Apostolos; Godelitsas, Athanasios
2014-04-01
The present study involves an integration of the hydrogeological, hydrochemical and isotopic (both stable and radiogenic) data of the groundwater samples taken from aquifers occurring in the region of northeastern Peloponnesus. Special emphasis has been given to health-related ions and isotopes in relation to the WHO and USEPA guidelines, to highlight the concentrations of compounds (e.g., As and Ba) exceeding the drinking water thresholds. Multivariate statistical analyses, i.e. two principal component analyses (PCA) and one discriminant analysis (DA), combined with conventional hydrochemical methodologies, were applied, with the aim to interpret the spatial variations in the groundwater quality and to identify the main hydrogeochemical factors and human activities responsible for the high ion concentrations and isotopic content in the groundwater analysed. The first PCA resulted in a three component model, which explained approximately 82% of the total variance of the data sets and enabled the identification of the hydrogeological processes responsible for the isotopic content i.e., δ(18)Ο, tritium and (222)Rn. The second PCA, involving the trace element presence in the water samples, revealed a four component model, which explained approximately 89% of the total variance of the data sets, giving more insight into the geochemical and anthropogenic controls on the groundwater composition (e.g., water-rock interaction, hydrothermal activity and agricultural activities). Using discriminant analysis, a four parameter (δ(18)O, (Ca+Mg)/(HCO3+SO4), EC and Cl) discriminant function concerning the (222)Rn content was derived, which favoured a classification of the samples according to the concentration of (222)Rn as (222)Rn-safe (<11 Bq·L(-1)) and (222)Rn-contaminated (>11 Bq·L(-1)). The selection of radon builds on the fact that this radiogenic isotope has been generally related to increased health risk when consumed. Copyright © 2014 Elsevier B.V. All rights reserved.
Sun, Hui; Wang, Huiyu; Zhang, Aihua; Yan, Guangli; Han, Ying; Li, Yuan; Wu, Xiuhong; Meng, Xiangcai; Wang, Xijun
2016-01-01
As herbal medicines have an important position in health care systems worldwide, their current assessment, and quality control are a major bottleneck. Cortex Phellodendri chinensis (CPC) and Cortex Phellodendri amurensis (CPA) are widely used in China, however, how to identify species of CPA and CPC has become urgent. In this study, multivariate analysis approach was performed to the investigation of chemical discrimination of CPA and CPC. Principal component analysis showed that two herbs could be separated clearly. The chemical markers such as berberine, palmatine, phellodendrine, magnoflorine, obacunone, and obaculactone were identified through the orthogonal partial least squared discriminant analysis, and were identified tentatively by the accurate mass of quadruple-time-of-flight mass spectrometry. A total of 29 components can be used as the chemical markers for discrimination of CPA and CPC. Of them, phellodenrine is significantly higher in CPC than that of CPA, whereas obacunone and obaculactone are significantly higher in CPA than that of CPC. The present study proves that multivariate analysis approach based chemical analysis greatly contributes to the investigation of CPA and CPC, and showed that the identified chemical markers as a whole should be used to discriminate the two herbal medicines, and simultaneously the results also provided chemical information for their quality assessment. Multivariate analysis approach was performed to the investigate the herbal medicineThe chemical markers were identified through multivariate analysis approachA total of 29 components can be used as the chemical markers. UPLC-Q/TOF-MS-based multivariate analysis method for the herbal medicine samples Abbreviations used: CPC: Cortex Phellodendri chinensis, CPA: Cortex Phellodendri amurensis, PCA: Principal component analysis, OPLS-DA: Orthogonal partial least squares discriminant analysis, BPI: Base peaks ion intensity.
Prostate-Specific Antigen (PSA) Screening and New Biomarkers for Prostate Cancer (PCa)
Rittenhouse, Harry; Hu, Xinhai; Cammann, Henning; Jung, Klaus
2014-01-01
Abstract PSA screening reduces PCa-mortality but the disadvantages overdiagnosis and overtreatment require multivariable risk-prediction tools to select appropriate treatment or active surveillance. This review explains the differences between the two largest screening trials and discusses the drawbacks of screening and its meta-analysisxs. The current American and European screening strategies are described. Nonetheless, PSA is one of the most widely used tumor markers and strongly correlates with the risk of harboring PCa. However, while PSA has limitations for PCa detection with its low specificity there are several potential biomarkers presented in this review with utility for PCa currently being studied. There is an urgent need for new biomarkers especially to detect clinically significant and aggressive PCa. From all PSA-based markers, the FDA-approved prostate health index (phi) shows improved specificity over percent free and total PSA. Another kallikrein panel, 4K, which includes KLK2 has recently shown promise in clinical research studies but has not yet undergone formal validation studies. In urine, prostate cancer gene 3 (PCA3) has also been validated and approved by the FDA for its utility to detect PCa. The potential correlation of PCA3 with cancer aggressiveness requires more clinical studies. The detection of the fusion of androgen-regulated genes with genes of the regulatory transcription factors in tissue of ~50% of all PCa-patients is a milestone in PCa research. A combination of the urinary assays for TMPRSS2:ERG gene fusion and PCA3 shows an improved accuracy for PCa detection. Overall, the field of PCa biomarker discovery is very exciting and prospective. PMID:27683457
NASA Astrophysics Data System (ADS)
Rocha-Osornio, L. N.; Pichardo-Molina, J. L.; Barbosa-Garcia, O.; Frausto-Reyes, C.; Araujo-Andrade, C.; Huerta-Franco, R.; Gutiérrez-Juárez, G.
2008-02-01
Raman spectroscopy and Multivariate methods were used to study serum blood samples of control and breast cancer patients. Blood samples were obtained from 11 patients and 12 controls from the central region of Mexico. Our results show that principal component analysis is able to discriminate serum sample of breast cancer patients from those of control group, also the loading vectors of PCA plotted as a function of Raman shift shown which bands permitted to make the maximum discrimination between both groups of samples.
Assel, Melissa J.; Gerdtsson, Axel; Thorek, Daniel L.J.; Carlsson, Sigrid V.; Malm, Johan; Scardino, Peter T.; Vickers, Andrew; Lilja, Hans; Ulmert, David
2018-01-01
Objectives To evaluate whether anthropometric parameters add to PSA measurements in middle-aged men for risk assessment of prostate cancer (PCa) diagnosis and death. Results After adjusting for PSA, both BMI and weight were significantly associated with an increased risk of PCa death with the odds of a death corresponding to a 10 kg/m2 or 10 kg increase being 1.58 (95% CI 1.10, 2.28; p = 0.013) and 1.14 (95% CI 1.02, 1.26; p = 0.016) times greater, respectively. AUCs did not meaningfully increase with the addition of weight or BMI to prediction models including PSA. Materials and Methods In 1974 to 1986, 22,444 Swedish men aged 44 to 50 enrolled in Malmö Preventive Project, Sweden, and provided blood samples and anthropometric data. Rates of PSA screening in the cohort were very low. Documentation of PCa diagnosis and disease-specific death up to 2014 was retrieved through national registries. Among men with anthropometric measurements available at baseline, a total of 1692 men diagnosed with PCa were matched to 4190 controls, and 464 men who died of disease were matched to 1390 controls. Multivariable conditional logistic regression was used to determine whether diagnosis or death from PCa were associated with weight and body mass index (BMI) at adulthood after adjusting for PSA. Conclusions Men with higher BMI and weight at early middle age have an increased risk of PCa diagnosis and death after adjusting for PSA. However, in a multi-variable numerical statistical model, BMI and weight do not importantly improve the predictive accuracy of PSA. Risk-stratification of screening should be based on PSA without reference to anthropometrics. PMID:29464033
Assel, Melissa J; Gerdtsson, Axel; Thorek, Daniel L J; Carlsson, Sigrid V; Malm, Johan; Scardino, Peter T; Vickers, Andrew; Lilja, Hans; Ulmert, David
2018-01-19
To evaluate whether anthropometric parameters add to PSA measurements in middle-aged men for risk assessment of prostate cancer (PCa) diagnosis and death. After adjusting for PSA, both BMI and weight were significantly associated with an increased risk of PCa death with the odds of a death corresponding to a 10 kg/m2 or 10 kg increase being 1.58 (95% CI 1.10, 2.28; p = 0.013) and 1.14 (95% CI 1.02, 1.26; p = 0.016) times greater, respectively. AUCs did not meaningfully increase with the addition of weight or BMI to prediction models including PSA. In 1974 to 1986, 22,444 Swedish men aged 44 to 50 enrolled in Malmö Preventive Project, Sweden, and provided blood samples and anthropometric data. Rates of PSA screening in the cohort were very low. Documentation of PCa diagnosis and disease-specific death up to 2014 was retrieved through national registries. Among men with anthropometric measurements available at baseline, a total of 1692 men diagnosed with PCa were matched to 4190 controls, and 464 men who died of disease were matched to 1390 controls. Multivariable conditional logistic regression was used to determine whether diagnosis or death from PCa were associated with weight and body mass index (BMI) at adulthood after adjusting for PSA. Men with higher BMI and weight at early middle age have an increased risk of PCa diagnosis and death after adjusting for PSA. However, in a multi-variable numerical statistical model, BMI and weight do not importantly improve the predictive accuracy of PSA. Risk-stratification of screening should be based on PSA without reference to anthropometrics.
Bertani, Francesca R; Mozetic, Pamela; Fioramonti, Marco; Iuliani, Michele; Ribelli, Giulia; Pantano, Francesco; Santini, Daniele; Tonini, Giuseppe; Trombetta, Marcella; Businaro, Luca; Selci, Stefano; Rainer, Alberto
2017-08-21
The possibility of detecting and classifying living cells in a label-free and non-invasive manner holds significant theranostic potential. In this work, Hyperspectral Imaging (HSI) has been successfully applied to the analysis of macrophagic polarization, given its central role in several pathological settings, including the regulation of tumour microenvironment. Human monocyte derived macrophages have been investigated using hyperspectral reflectance confocal microscopy, and hyperspectral datasets have been analysed in terms of M1 vs. M2 polarization by Principal Components Analysis (PCA). Following PCA, Linear Discriminant Analysis has been implemented for semi-automatic classification of macrophagic polarization from HSI data. Our results confirm the possibility to perform single-cell-level in vitro classification of M1 vs. M2 macrophages in a non-invasive and label-free manner with a high accuracy (above 98% for cells deriving from the same donor), supporting the idea of applying the technique to the study of complex interacting cellular systems, such in the case of tumour-immunity in vitro models.
Jiang, Jheng Jie; Lee, Chon Lin; Fang, Meng Der; Boyd, Kenneth G.; Gibb, Stuart W.
2015-01-01
This paper presents a methodology based on multivariate data analysis for characterizing potential source contributions of emerging contaminants (ECs) detected in 26 river water samples across multi-scape regions during dry and wet seasons. Based on this methodology, we unveil an approach toward potential source contributions of ECs, a concept we refer to as the “Pharmaco-signature.” Exploratory analysis of data points has been carried out by unsupervised pattern recognition (hierarchical cluster analysis, HCA) and receptor model (principal component analysis-multiple linear regression, PCA-MLR) in an attempt to demonstrate significant source contributions of ECs in different land-use zone. Robust cluster solutions grouped the database according to different EC profiles. PCA-MLR identified that 58.9% of the mean summed ECs were contributed by domestic impact, 9.7% by antibiotics application, and 31.4% by drug abuse. Diclofenac, ibuprofen, codeine, ampicillin, tetracycline, and erythromycin-H2O have significant pollution risk quotients (RQ>1), indicating potentially high risk to aquatic organisms in Taiwan. PMID:25874375
Chemical information obtained from Auger depth profiles by means of advanced factor analysis (MLCFA)
NASA Astrophysics Data System (ADS)
De Volder, P.; Hoogewijs, R.; De Gryse, R.; Fiermans, L.; Vennik, J.
1993-01-01
The advanced multivariate statistical technique "maximum likelihood common factor analysis (MLCFA)" is shown to be superior to "principal component analysis (PCA)" for decomposing overlapping peaks into their individual component spectra of which neither the number of components nor the peak shape of the component spectra is known. An examination of the maximum resolving power of both techniques, MLCFA and PCA, by means of artificially created series of multicomponent spectra confirms this finding unambiguously. Substantial progress in the use of AES as a chemical-analysis technique is accomplished through the implementation of MLCFA. Chemical information from Auger depth profiles is extracted by investigating the variation of the line shape of the Auger signal as a function of the changing chemical state of the element. In particular, MLCFA combined with Auger depth profiling has been applied to problems related to steelcord-rubber tyre adhesion. MLCFA allows one to elucidate the precise nature of the interfacial layer of reaction products between natural rubber vulcanized on a thin brass layer. This study reveals many interesting chemical aspects of the oxi-sulfidation of brass undetectable with classical AES.
NASA Astrophysics Data System (ADS)
Chen, Quansheng; Qi, Shuai; Li, Huanhuan; Han, Xiaoyan; Ouyang, Qin; Zhao, Jiewen
2014-10-01
To rapidly and efficiently detect the presence of adulterants in honey, three-dimensional fluorescence spectroscopy (3DFS) technique was employed with the help of multivariate calibration. The data of 3D fluorescence spectra were compressed using characteristic extraction and the principal component analysis (PCA). Then, partial least squares (PLS) and back propagation neural network (BP-ANN) algorithms were used for modeling. The model was optimized by cross validation, and its performance was evaluated according to root mean square error of prediction (RMSEP) and correlation coefficient (R) in prediction set. The results showed that BP-ANN model was superior to PLS models, and the optimum prediction results of the mixed group (sunflower ± longan ± buckwheat ± rape) model were achieved as follow: RMSEP = 0.0235 and R = 0.9787 in the prediction set. The study demonstrated that the 3D fluorescence spectroscopy technique combined with multivariate calibration has high potential in rapid, nondestructive, and accurate quantitative analysis of honey adulteration.
Lotan, Tamara L.; Wei, Wei; Morais, Carlos L.; Hawley, Sarah T.; Fazli, Ladan; Hurtado-Coll, Antonio; Troyer, Dean; McKenney, Jesse K.; Simko, Jeffrey; Carroll, Peter R.; Gleave, Martin; Lance, Raymond; Lin, Daniel W.; Nelson, Peter S.; Thompson, Ian M.; True, Lawrence D.; Feng, Ziding; Brooks, James D.
2015-01-01
Background PTEN is the most commonly deleted tumor suppressor gene in primary prostate cancer (PCa) and its loss is associated with poor clinical outcomes and ERG gene rearrangement. Objective We tested whether PTEN loss is associated with shorter recurrence-free survival (RFS) in surgically treated PCa patients with known ERG status. Design, setting, and participants A genetically validated, automated PTEN immunohistochemistry (IHC) protocol was used for 1275 primary prostate tumors from the Canary Foundation retrospective PCa tissue microarray cohort to assess homogeneous (in all tumor tissue sampled) or heterogeneous (in a subset of tumor tissue sampled) PTEN loss. ERG status as determined by a genetically validated IHC assay was available for a subset of 938 tumors. Outcome measurements and statistical analysis Associations between PTEN and ERG status were assessed using Fisher’s exact test. Kaplan-Meier and multivariate weighted Cox proportional models for RFS were constructed. Results and limitations When compared to intact PTEN, homogeneous (hazard ratio [HR] 1.66, p = 0.001) but not heterogeneous (HR 1.24, p = 0.14) PTEN loss was significantly associated with shorter RFS in multivariate models. Among ERG-positive tumors, homogeneous (HR 3.07, p < 0.0001) but not heterogeneous (HR 1.46, p = 0.10) PTEN loss was significantly associated with shorter RFS. Among ERG-negative tumors, PTEN did not reach significance for inclusion in the final multivariate models. The interaction term for PTEN and ERG status with respect to RFS did not reach statistical significance (p = 0.11) for the current sample size. Conclusions These data suggest that PTEN is a useful prognostic biomarker and that there is no statistically significant interaction between PTEN and ERG status for RFS. Patient summary We found that loss of the PTEN tumor suppressor gene in prostate tumors as assessed by tissue staining is correlated with shorter time to prostate cancer recurrence after radical prostatectomy. PMID:27617307
Bhatt, Chet R; Alfarraj, Bader; Ghany, Charles T; Yueh, Fang Y; Singh, Jagdish P
2017-04-01
In this study, the laser-induced breakdown spectroscopy (LIBS) technique was used to identify and compare the presence of major nutrient elements in organic and conventional vegetables. Different parts of cauliflowers and broccolis were used as working samples. Laser-induced breakdown spectra from these samples were acquired at optimum values of laser energy, gate delay, and gate width. Both univariate and multivariate analyses were performed for the comparison of these organic and conventional vegetable flowers. Principal component analysis (PCA) was taken into account for multivariate analysis while for univariate analysis, the intensity of selected atomic lines of different elements and their intensity ratio with some reference lines of organic cauliflower and broccoli samples were compared with those of conventional ones. In addition, different parts of the cauliflower and broccoli were compared in terms of intensity and intensity ratio of elemental lines.
NASA Astrophysics Data System (ADS)
Song, Biao; Lu, Dan; Peng, Ming; Li, Xia; Zou, Ye; Huang, Meizhen; Lu, Feng
2017-02-01
Raman spectroscopy is developed as a fast and non-destructive method for the discrimination and classification of hydroxypropyl methyl cellulose (HPMC) samples. 44 E series and 41 K series of HPMC samples are measured by a self-developed portable Raman spectrometer (Hx-Raman) which is excited by a 785 nm diode laser and the spectrum range is 200-2700 cm-1 with a resolution (FWHM) of 6 cm-1. Multivariate analysis is applied for discrimination of E series from K series. By methods of principal components analysis (PCA) and Fisher discriminant analysis (FDA), a discrimination result with sensitivity of 90.91% and specificity of 95.12% is achieved. The corresponding receiver operating characteristic (ROC) is 0.99, indicting the accuracy of the predictive model. This result demonstrates the prospect of portable Raman spectrometer for rapid, non-destructive classification and discrimination of E series and K series samples of HPMC.
NASA Astrophysics Data System (ADS)
Haq, Quazi M. I.; Mabood, Fazal; Naureen, Zakira; Al-Harrasi, Ahmed; Gilani, Sayed A.; Hussain, Javid; Jabeen, Farah; Khan, Ajmal; Al-Sabari, Ruqaya S. M.; Al-khanbashi, Fatema H. S.; Al-Fahdi, Amira A. M.; Al-Zaabi, Ahoud K. A.; Al-Shuraiqi, Fatma A. M.; Al-Bahaisi, Iman M.
2018-06-01
Nucleic acid & serology based methods have revolutionized plant disease detection, however, they are not very reliable at asymptomatic stage, especially in case of pathogen with systemic infection, in addition, they need at least 1-2 days for sample harvesting, processing, and analysis. In this study, two reflectance spectroscopies i.e. Near Infrared reflectance spectroscopy (NIR) and Fourier-Transform-Infrared spectroscopy with Attenuated Total Reflection (FT-IR, ATR) coupled with multivariate exploratory methods like Principle Component Analysis (PCA) and Partial least square discriminant analysis (PLS-DA) have been deployed to detect begomovirus infection in papaya leaves. The application of those techniques demonstrates that they are very useful for robust in vivo detection of plant begomovirus infection. These methods are simple, sensitive, reproducible, precise, and do not require any lengthy samples preparation procedures.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hu, M; Wang, Xiliang
Melanoma is a malignant tumor of melanocytes with high capability of invasion and rapid metastasis to other organs. Malignant melanoma is the most common metastatic malignancy found in gastrointestinal tract (GI). To the best of our knowledge, previous studies of melanoma in gastrointestinal tract are all clinical case reports. In this work, 1H NMR-based metabolomics approach is used to investigate the metabolite profiles differences of stomach tissue extracts of metastatic B16-F10 melanoma in C57BL/6J mouse and search for specific metabolite biomarker candidates. Principal Component Analysis (PCA), an unsupervised multivariate data analysis method, is used to detect possible outliers, while Orthogonalmore » Projection to Latent Structure (OPLS), a supervised multivariate data analysis method, is employed to evaluate important metabolites responsible for discriminating the control and the melanoma groups. Both PCA and OPLS results reveal that the melanoma group can be well separated from its control group. Among the 50 identified metabolites, it is found that the concentrations of 19 metabolites are statistically and significantly changed with the levels of O-phosphocholine and hypoxanthine down-regulated while the levels of isoleucine, leucine, valine, isobutyrate, threonine, cadaverine, alanine, glutamate, glutamine, methionine, citrate, asparagine, tryptophan, glycine, serine, uracil, and formate up-regulated in the melanoma group. These significantly changed metabolites are associated with multiple biological pathways and may be potential biomarkers for metastatic melanoma in stomach.« less
Hu, M; Wang, Xiliang
2014-12-05
Melanoma is a malignant tumor of melanocytes with high capability of invasion and rapid metastasis to other organs. Malignant melanoma is the most common metastatic malignancy found in gastrointestinal tract (GI). To the best of our knowledge, previous studies of melanoma in gastrointestinal tract are all clinical case reports. In this work, 1H NMR-based metabolomics approach is used to investigate the metabolite profiles differences of stomach tissue extracts of metastatic B16-F10 melanoma in C57BL/6J mouse and search for specific metabolite biomarker candidates. Principal Component Analysis (PCA), an unsupervised multivariate data analysis method, is used to detect possible outliers, while Orthogonalmore » Projection to Latent Structure (OPLS), a supervised multivariate data analysis method, is employed to evaluate important metabolites responsible for discriminating the control and the melanoma groups. Both PCA and OPLS results reveal that the melanoma group can be well separated from its control group. Among the 50 identified metabolites, it is found that the concentrations of 19 metabolites are statistically and significantly changed with the levels of O-phosphocholine and hypoxanthine down-regulated while the levels of isoleucine, leucine, valine, isobutyrate, threonine, cadaverine, alanine, glutamate, glutamine, methionine, citrate, asparagine, tryptophan, glycine, serine, uracil, and formate up-regulated in the melanoma group. These significantly changed metabolites are associated with multiple biological pathways and may be potential biomarkers for metastatic melanoma in stomach.« less
Ebqa'ai, Mohammad; Ibrahim, Bashar
2017-12-01
This study aims to analyse the heavy metal pollutants in Jeddah, the second largest city in the Gulf Cooperation Council with a population exceeding 3.5 million, and many vehicles. Ninety-eight street dust samples were collected seasonally from the six major roads as well as the Jeddah Beach, and subsequently digested using modified Leeds Public Analyst method. The heavy metals (Fe, Zn, Mn, Cu, Cd, and Pb) were extracted from the ash using methyl isobutyl ketone as solvent extraction and eventually analysed by atomic absorption spectroscopy. Multivariate statistical techniques, principal component analysis (PCA), and hierarchical cluster analysis were applied to these data. Heavy metal concentrations were ranked according to the following descending order: Fe > Zn > Mn > Cu > Pb > Cd. In order to study the pollution and health risk from these heavy metals as well as estimating their effect on the environment, pollution indices, integrated pollution index, enrichment factor, daily dose average, hazard quotient, and hazard index were all analysed. The PCA showed high levels of Zn, Fe, and Cd in Al Kurnish road, while these elements were consistently detected on King Abdulaziz and Al Madina roads. The study indicates that high levels of Zn and Pb pollution were recorded for major roads in Jeddah. Six out of seven roads had high pollution indices. This study is the first step towards further investigations into current health problems in Jeddah, such as anaemia and asthma.
Colihueque, Nelson; Corrales, Olga; Yáñez, Miguel
2017-01-01
Trichomycterus areolatus Valenciennes, 1846 is a small endemic catfish inhabiting the Andean river basins of Chile. In this study, the morphological variability of three T. areolatus populations, collected in two river basins from southern Chile, was assessed with multivariate analyses, including principal component analysis (PCA) and discriminant function analysis (DFA). It is hypothesized that populations must segregate morphologically from each other based on the river basin that they were sampled from, since each basin presents relatively particular hydrological characteristics. Significant morphological differences among the three populations were found with PCA (ANOSIM test, r = 0.552, p < 0.0001) and DFA (Wilks's λ = 0.036, p < 0.01). PCA accounted for a total variation of 56.16% by the first two principal components. The first Principal Component (PC1) and PC2 explained 34.72 and 21.44% of the total variation, respectively. The scatter-plot of the first two discriminant functions (DF1 on DF2) also validated the existence of three different populations. In group classification using DFA, 93.3% of the specimens were correctly-classified into their original populations. Of the total of 22 transformed truss measurements, 17 exhibited highly significant ( p < 0.01) differences among populations. The data support the existence of T. areolatus morphological variation across different rivers in southern Chile, likely reflecting the geographic isolation underlying population structure of the species.
2016-01-01
The motivation behind this research is to innovatively combine new methods like wavelet, principal component analysis (PCA), and artificial neural network (ANN) approaches to analyze trade in today’s increasingly difficult and volatile financial futures markets. The main focus of this study is to facilitate forecasting by using an enhanced denoising process on market data, taken as a multivariate signal, in order to deduct the same noise from the open-high-low-close signal of a market. This research offers evidence on the predictive ability and the profitability of abnormal returns of a new hybrid forecasting model using Wavelet-PCA denoising and ANN (named WPCA-NN) on futures contracts of Hong Kong’s Hang Seng futures, Japan’s NIKKEI 225 futures, Singapore’s MSCI futures, South Korea’s KOSPI 200 futures, and Taiwan’s TAIEX futures from 2005 to 2014. Using a host of technical analysis indicators consisting of RSI, MACD, MACD Signal, Stochastic Fast %K, Stochastic Slow %K, Stochastic %D, and Ultimate Oscillator, empirical results show that the annual mean returns of WPCA-NN are more than the threshold buy-and-hold for the validation, test, and evaluation periods; this is inconsistent with the traditional random walk hypothesis, which insists that mechanical rules cannot outperform the threshold buy-and-hold. The findings, however, are consistent with literature that advocates technical analysis. PMID:27248692
Chan Phooi M'ng, Jacinta; Mehralizadeh, Mohammadali
2016-01-01
The motivation behind this research is to innovatively combine new methods like wavelet, principal component analysis (PCA), and artificial neural network (ANN) approaches to analyze trade in today's increasingly difficult and volatile financial futures markets. The main focus of this study is to facilitate forecasting by using an enhanced denoising process on market data, taken as a multivariate signal, in order to deduct the same noise from the open-high-low-close signal of a market. This research offers evidence on the predictive ability and the profitability of abnormal returns of a new hybrid forecasting model using Wavelet-PCA denoising and ANN (named WPCA-NN) on futures contracts of Hong Kong's Hang Seng futures, Japan's NIKKEI 225 futures, Singapore's MSCI futures, South Korea's KOSPI 200 futures, and Taiwan's TAIEX futures from 2005 to 2014. Using a host of technical analysis indicators consisting of RSI, MACD, MACD Signal, Stochastic Fast %K, Stochastic Slow %K, Stochastic %D, and Ultimate Oscillator, empirical results show that the annual mean returns of WPCA-NN are more than the threshold buy-and-hold for the validation, test, and evaluation periods; this is inconsistent with the traditional random walk hypothesis, which insists that mechanical rules cannot outperform the threshold buy-and-hold. The findings, however, are consistent with literature that advocates technical analysis.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Clegg, Samuel M; Barefield, James E; Wiens, Roger C
2008-01-01
Quantitative analysis with LIBS traditionally employs calibration curves that are complicated by the chemical matrix effects. These chemical matrix effects influence the LIBS plasma and the ratio of elemental composition to elemental emission line intensity. Consequently, LIBS calibration typically requires a priori knowledge of the unknown, in order for a series of calibration standards similar to the unknown to be employed. In this paper, three new Multivariate Analysis (MV A) techniques are employed to analyze the LIBS spectra of 18 disparate igneous and highly-metamorphosed rock samples. Partial Least Squares (PLS) analysis is used to generate a calibration model from whichmore » unknown samples can be analyzed. Principal Components Analysis (PCA) and Soft Independent Modeling of Class Analogy (SIMCA) are employed to generate a model and predict the rock type of the samples. These MV A techniques appear to exploit the matrix effects associated with the chemistries of these 18 samples.« less
Zhang, Tan; Li, Fangxuan; Mu, Jiali; Liu, Juntian; Zhang, Sheng
2017-06-01
To explore the significance of ultrasonic features in differential diagnosis of thyroid nodules via combining the thyroid imaging reporting and data system (TI-RADS) and multivariate statistical analysis. Patients who received surgical treatment and was diagnosed with single thyroid nodule by postoperative pathology and preoperative ultrasound were enrolled in this study. Multivariate analysis was applied to assess the significant ultrasonic features which correlated with identifying benign or malignance and grading the TI-RADS classification of thyroid nodule. There were significant differences in the nodule size, aspect ratio, internal, echogenicity, boundary, presence or absence of calcifications, calcification type and CDFI between benign and malignant thyroid nodules. Multivariate analysis showed clear-cut distinction both between benign and malignance and among different TI-RADS categories of malignancy nodules. The shape and calcification of the nodule were important factors for distinguish the benign and malignance. Height of the nodule, aspect and calcification was important factors for grading TI-RADS categories of malignancy thyroid nodules. Ill-defined boundary, irregular shape and presence of calcification related with highly malignant risk for thyroid nodule. The larger height and aspect and presence of calcification related with higher TI-RADS classification of malignancy thyroid nodule.
Catelani, Tiago A; Santos, João Rodrigo; Páscoa, Ricardo N M J; Pezza, Leonardo; Pezza, Helena R; Lopes, João A
2018-03-01
This work proposes the use of near infrared (NIR) spectroscopy in diffuse reflectance mode and multivariate statistical process control (MSPC) based on principal component analysis (PCA) for real-time monitoring of the coffee roasting process. The main objective was the development of a MSPC methodology able to early detect disturbances to the roasting process resourcing to real-time acquisition of NIR spectra. A total of fifteen roasting batches were defined according to an experimental design to develop the MSPC models. This methodology was tested on a set of five batches where disturbances of different nature were imposed to simulate real faulty situations. Some of these batches were used to optimize the model while the remaining was used to test the methodology. A modelling strategy based on a time sliding window provided the best results in terms of distinguishing batches with and without disturbances, resourcing to typical MSPC charts: Hotelling's T 2 and squared predicted error statistics. A PCA model encompassing a time window of four minutes with three principal components was able to efficiently detect all disturbances assayed. NIR spectroscopy combined with the MSPC approach proved to be an adequate auxiliary tool for coffee roasters to detect faults in a conventional roasting process in real-time. Copyright © 2017 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Abeysekara, Saman; Damiran, Daalkhaijav; Yu, Peiqiang
2013-02-01
The objectives of this study were (i) to determine lipid related molecular structures components (functional groups) in feed combination of cereal grain (barley, Hordeum vulgare) and wheat (Triticum aestivum) based dried distillers grain solubles (wheat DDGSs) from bioethanol processing at five different combination ratios using univariate and multivariate molecular spectral analyses with infrared Fourier transform molecular spectroscopy, and (ii) to correlate lipid-related molecular-functional structure spectral profile to nutrient profiles. The spectral intensity of (i) CH3 asymmetric, CH2 asymmetric, CH3 symmetric and CH2 symmetric groups, (ii) unsaturation (Cdbnd C) group, and (iii) carbonyl ester (Cdbnd O) group were determined. Spectral differences of functional groups were detected by hierarchical cluster analysis (HCA) and principal components analysis (PCA). The results showed that the combination treatments significantly inflicted modifications (P < 0.05) in nutrient profile and lipid related molecular spectral intensity (CH2 asymmetric stretching peak height, CH2 symmetric stretching peak height, ratio of CH2 to CH3 symmetric stretching peak intensity, and carbonyl peak area). Ratio of CH2 to CH3 symmetric stretching peak intensity, and carbonyl peak significantly correlated with nutrient profiles. Both PCA and HCA differentiated lipid-related spectrum. In conclusion, the changes of lipid molecular structure spectral profiles through feed combination could be detected using molecular spectroscopy. These changes were associated with nutrient profiles and functionality.
Monitoring of an antigen manufacturing process.
Zavatti, Vanessa; Budman, Hector; Legge, Raymond; Tamer, Melih
2016-06-01
Fluorescence spectroscopy in combination with multivariate statistical methods was employed as a tool for monitoring the manufacturing process of pertactin (PRN), one of the virulence factors of Bordetella pertussis utilized in whopping cough vaccines. Fluorophores such as amino acids and co-enzymes were detected throughout the process. The fluorescence data collected at different stages of the fermentation and purification process were treated employing principal component analysis (PCA). Through PCA, it was feasible to identify sources of variability in PRN production. Then, partial least square (PLS) was employed to correlate the fluorescence spectra obtained from pure PRN samples and the final protein content measured by a Kjeldahl test from these samples. In view that a statistically significant correlation was found between fluorescence and PRN levels, this approach could be further used as a method to predict the final protein content.
Giri, Veda N; Obeid, Elias; Hegarty, Sarah E; Gross, Laura; Bealin, Lisa; Hyatt, Colette; Fang, Carolyn Y; Leader, Amy
2018-04-14
Genetic testing (GT) for prostate cancer (PCA) is rising, with limited insights regarding genetic counseling (GC) needs of males. Genetic Evaluation of Men (GEM) is a prospective multigene testing study for inherited PCA. Men undergoing GC were surveyed on knowledge of cancer risk and genetics (CRG) and understanding of personal GT results to identify GC needs. GEM participants with or high-risk for PCA were recruited. Pre-test GC was in-person, with video and handout, or via telehealth. Post-test disclosure was in-person, by phone, or via telehealth. Clinical and family history data were obtained from participant surveys and medical records. Participants completed measures of knowledge of CRG, literacy, and numeracy pre-test and post-test. Understanding of personal genetic results was assessed post-test. Factors associated with knowledge of CRG and understanding of personal genetic results were examined using multivariable linear regression or McNemar's test. Among 109 men who completed pre- and post-GT surveys, multivariable analysis revealed family history meeting hereditary cancer syndrome (HCS) criteria was significantly predictive of higher baseline knowledge (P = 0.040). Of 101 men who responded definitively regarding understanding of results, 13 incorrectly reported their result (McNemar's P < 0.001). Factors significantly associated with discordance between reported and actual results included having a variant of uncertain significance (VUS) (P < 0.001) and undergoing GC via pre-test video and post-test phone disclosure (P = 0.015). While meeting criteria for HCS was associated with higher knowledge of CRG, understanding of personal GT results was lacking among a subset of males with VUS. A more exploratory finding was lack of understanding of results among men who underwent GC utilizing video and phone. Studies optimizing GC strategies for males undergoing multigene testing for inherited PCA are warranted. © 2018 Wiley Periodicals, Inc.
NASA Astrophysics Data System (ADS)
Meksiarun, Phiranuphon; Ishigaki, Mika; Huck-Pezzei, Verena A. C.; Huck, Christian W.; Wongravee, Kanet; Sato, Hidetoshi; Ozaki, Yukihiro
2017-03-01
This study aimed to extract the paraffin component from paraffin-embedded oral cancer tissue spectra using three multivariate analysis (MVA) methods; Independent Component Analysis (ICA), Partial Least Squares (PLS) and Independent Component - Partial Least Square (IC-PLS). The estimated paraffin components were used for removing the contribution of paraffin from the tissue spectra. These three methods were compared in terms of the efficiency of paraffin removal and the ability to retain the tissue information. It was found that ICA, PLS and IC-PLS could remove the paraffin component from the spectra at almost the same level while Principal Component Analysis (PCA) was incapable. In terms of retaining cancer tissue spectral integrity, effects of PLS and IC-PLS on the non-paraffin region were significantly less than that of ICA where cancer tissue spectral areas were deteriorated. The paraffin-removed spectra were used for constructing Raman images of oral cancer tissue and compared with Hematoxylin and Eosin (H&E) stained tissues for verification. This study has demonstrated the capability of Raman spectroscopy together with multivariate analysis methods as a diagnostic tool for the paraffin-embedded tissue section.
Yehia, Ali M; Mohamed, Heba M
2016-01-05
Three advanced chemmometric-assisted spectrophotometric methods namely; Concentration Residuals Augmented Classical Least Squares (CRACLS), Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS) and Principal Component Analysis-Artificial Neural Networks (PCA-ANN) were developed, validated and benchmarked to PLS calibration; to resolve the severely overlapped spectra and simultaneously determine; Paracetamol (PAR), Guaifenesin (GUA) and Phenylephrine (PHE) in their ternary mixture and in presence of p-aminophenol (AP) the main degradation product and synthesis impurity of Paracetamol. The analytical performance of the proposed methods was described by percentage recoveries, root mean square error of calibration and standard error of prediction. The four multivariate calibration methods could be directly used without any preliminary separation step and successfully applied for pharmaceutical formulation analysis, showing no excipients' interference. Copyright © 2015 Elsevier B.V. All rights reserved.
Afshar-Oromieh, Ali; Holland-Letz, Tim; Giesel, Frederik L; Kratochwil, Clemens; Mier, Walter; Haufe, Sabine; Debus, Nils; Eder, Matthias; Eisenhut, Michael; Schäfer, Martin; Neels, Oliver; Hohenfellner, Markus; Kopka, Klaus; Kauczor, Hans-Ulrich; Debus, Jürgen; Haberkorn, Uwe
2017-08-01
Since the clinical introduction of 68 Ga-PSMA-11 PET/CT, this imaging method has rapidly spread and is now regarded as a significant step forward in the diagnosis of recurrent prostate cancer (PCa). The aim of this study was to analyse the influence of several variables with possible influence on PSMA ligand uptake in a large cohort. We performed a retrospective analysis of 1007 consecutive patients who were scanned with 68 Ga-PSMA-11 PET/CT (1 h after injection) from January 2014 to January 2017 to detect recurrent disease. Patients with untreated primary PCa or patients referred for PSMA radioligand therapy were excluded. The possible effects of different variables including PSA level and PSA doubling time (PSA DT ), PSA velocity (PSA Vel ), Gleason score (GSC, including separate analysis of GSC 7a and 7b), ongoing androgen deprivation therapy (ADT), patient age and amount of injected activity were evaluated. In 79.5% of patients at least one lesion with characteristics suggestive of recurrent PCa was detected. A pathological (positive) PET/CT scan was associated with PSA level and ADT. GSC, amount of injected activity, patient age, PSA DT and PSA Vel were not associated with a positive PET/CT scan in multivariate analysis. 68 Ga-PSMA-11 PET/CT detects tumour lesions in a high percentage of patients with recurrent PCa. Tumour detection is clearly associated with PSA level and ADT. Only a tendency for an association without statistical significance was found between higher GSC and a higher probability of a pathological PET/CT scan. No associations were found between a pathological 68 Ga-PSMA-11 PET/CT scan and patient age, amount of injected activity, PSA DT or PSA Vel.
Jayadevappa, Ravishankar; Chhatre, Sumedha; Wong, Yu-Ning; Wittink, Marsha N; Cook, Ratna; Morales, Knashawn H; Vapiwala, Neha; Newman, Diane K; Guzzo, Thomas; Wein, Alan J; Malkowicz, Stanley B; Lee, David I; Schwartz, Jerome S; Gallo, Joseph J
2017-05-01
In the context of prostate cancer (PCa) characterized by the multiple alternative treatment strategies, comparative effectiveness analysis is essential for informed decision-making. We analyzed the comparative effectiveness of PCa treatments through systematic review and meta-analysis with a focus on outcomes that matter most to newly diagnosed localized PCa patients. We performed a systematic review of literature published in English from 1995 to October 2016. A search strategy was employed using terms "prostate cancer," "localized," "outcomes," "mortality," "health related quality of life," and "complications" to identify relevant randomized controlled trials (RCTs), prospective, and retrospective studies. For observational studies, only those adjusting for selection bias using propensity-score or instrumental-variables approaches were included. Multivariable adjusted hazard ratio was used to assess all-cause and disease-specific mortality. Funnel plots were used to assess the level of bias. Our search strategy yielded 58 articles, of which 29 were RCTs, 6 were prospective studies, and 23 were retrospective studies. The studies provided moderate data for the patient-centered outcome of mortality. Radical prostatectomy demonstrated mortality benefit compared to watchful waiting (all-cause HR = 0.63 CI = 0.45, 0.87; disease-specific HR = 0.48 CI = 0.40, 0.58), and radiation therapy (all-cause HR = 0.65 CI = 0.57, 0.74; disease-specific HR = 0.51 CI = 0.40, 0.65). However, we had minimal comparative information about tradeoffs between and within treatment for other patient-centered outcomes in the short and long-term. Lack of patient-centered outcomes in comparative effectiveness research in localized PCa is a major hurdle to informed and shared decision-making. More rigorous studies that can integrate patient-centered and intermediate outcomes in addition to mortality are needed.
Vegetation characteristics important to common songbirds in east Texas
Conner, Richard N.; Dickson, James G.; Locke, Brian A.; Segelquist, Charles A.
1983-01-01
Multivariate studies of breeding bird communities have used principal component analysis (PCA) or several-group (three or more groups) discriminant function analysis (DFA) to ordinate bird species on vegetational continua (Cody 1968, James 1971, Whitmore 1975). In community studies, high resolution of habitat requirements for individual species is not always possible with either PCA or several-group DFA. When habitat characteristics of several species are examined with a DFA the resultant axes optimally discriminate among all species simultaneously. Hence, the characteristics assigned to a particular species reflect in part the presence of other species in the analyses. A better resolution of each species' habitat requirements may be obtained from a two-group DFA, wherein habitats selected by a species are discriminated from all other available habitats. Analyses using two-group DFAs to compare habitat used by a species with habitat unused by the same species have the potential to provide an optimal frame of reference from which to examine habitat variables (Martinka 1972, Conner and Adkisson 1976, Whitmore 1981). Mathematically (DFA) it is possible to maximally separate two groups of multivariate observations with a single axis (Harner and whitmore 1977). A line drawn in three or n-dimensional space can easily be positioned to intersect two multivariate means (centroids). If three or more centroids for species are analyzed simultaneously, a single line can no longer intersect all centroids unless a perfectly linear relationship exists for the species being examined. The probability of such an occurrence is extremely low. Thus, a high degree of resolution can be realized when a two-group DFA is used to determine habitat parameters important to individual species. We have used two-group DFA to identify vegetation variable important to 12 common species of songbirds in East Texas.
Classification of Malaysia aromatic rice using multivariate statistical analysis
NASA Astrophysics Data System (ADS)
Abdullah, A. H.; Adom, A. H.; Shakaff, A. Y. Md; Masnan, M. J.; Zakaria, A.; Rahim, N. A.; Omar, O.
2015-05-01
Aromatic rice (Oryza sativa L.) is considered as the best quality premium rice. The varieties are preferred by consumers because of its preference criteria such as shape, colour, distinctive aroma and flavour. The price of aromatic rice is higher than ordinary rice due to its special needed growth condition for instance specific climate and soil. Presently, the aromatic rice quality is identified by using its key elements and isotopic variables. The rice can also be classified via Gas Chromatography Mass Spectrometry (GC-MS) or human sensory panels. However, the uses of human sensory panels have significant drawbacks such as lengthy training time, and prone to fatigue as the number of sample increased and inconsistent. The GC-MS analysis techniques on the other hand, require detailed procedures, lengthy analysis and quite costly. This paper presents the application of in-house developed Electronic Nose (e-nose) to classify new aromatic rice varieties. The e-nose is used to classify the variety of aromatic rice based on the samples odour. The samples were taken from the variety of rice. The instrument utilizes multivariate statistical data analysis, including Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA) and K-Nearest Neighbours (KNN) to classify the unknown rice samples. The Leave-One-Out (LOO) validation approach is applied to evaluate the ability of KNN to perform recognition and classification of the unspecified samples. The visual observation of the PCA and LDA plots of the rice proves that the instrument was able to separate the samples into different clusters accordingly. The results of LDA and KNN with low misclassification error support the above findings and we may conclude that the e-nose is successfully applied to the classification of the aromatic rice varieties.
Shuai, Wang; Yongrui, Bao; Shanshan, Guan; Bo, Liu; Lu, Chen; Lei, Wang; Xiaorong, Ran
2014-01-01
Metabolomics, the systematic analysis of potential metabolites in a biological specimen, has been increasingly applied to discovering biomarkers, identifying perturbed pathways, measuring therapeutic targets, and discovering new drugs. By analyzing and verifying the significant difference in metabolic profiles and changes of metabolite biomarkers, metabolomics enables us to better understand substance metabolic pathways which can clarify the mechanism of Traditional Chinese Medicines (TCM). Corydalis yanhusuo alkaloid (CA) is a major component of Qizhiweitong (QZWT) prescription which has been used for treating gastric ulcer for centuries and its mechanism remains unclear completely. Metabolite profiling was performed by high-performance liquid chromatography combined with time-of-flight mass spectrometry (HPLC/ESI-TOF-MS) and in conjunction with multivariate data analysis and pathway analysis. The statistic software Mass Profiller Prossional (MPP) and statistic method including ANOVA and principal component analysis (PCA) were used for discovering novel potential biomarkers to clarify mechanism of CA in treating acid injected rats with gastric ulcer. The changes in metabolic profiling were restored to their base-line values after CA treatment according to the PCA score plots. Ten different potential biomarkers and seven key metabolic pathways contributing to the treatment of gastric ulcer were discovered and identified. Among the pathways, sphingophospholipid metabolism and fatty acid metabolism related network were acutely perturbed. Quantitative real time polymerase chain reaction (RT-PCR) analysis were performed to evaluate the expression of genes related to the two pathways for verifying the above results. The results show that changed biomarkers and pathways may provide evidence to insight into drug action mechanisms and enable us to increase research productivity toward metabolomics drug discovery. PMID:24454691
Moore, John P; Zhang, Song-Lei; Nieuwoudt, Hélène; Divol, Benoit; Trygg, Johan; Bauer, Florian F
2015-11-18
Yeast cells possess a cell wall comprising primarily glycoproteins, mannans, and glucan polymers. Several yeast phenotypes relevant for fermentation, wine processing, and wine quality are correlated with cell wall properties. To investigate the effect of wine fermentation on cell wall composition, a study was performed using mid-infrared (MIR) spectroscopy coupled with multivariate methods (i.e., PCA and OPLS-DA). A total of 40 yeast strains were evaluated, including Saccharomyces strains (laboratory and industrial) and non-Saccharomyces species. Cells were fermented in both synthetic MS300 and Chardonnay grape must to stationery phase, processed, and scanned in the MIR spectrum. PCA of the fingerprint spectral region showed distinct separation of Saccharomyces strains from non-Saccharomyces species; furthermore, industrial wine yeast strains separated from laboratory strains. PCA loading plots and the use of OPLS-DA to the data sets suggested that industrial strains were enriched with cell wall proteins (e.g., mannoproteins), whereas laboratory strains were composed mainly of mannan and glucan polymers.
2011-01-01
where r << P. The use of PCA for finding outliers in multivariate data is surveyed by Gnanadesikan and Kettenring16 and Rao.17 As alluded to earlier...1984. 16. Gnanadesikan R and Kettenring JR. Robust estimates, residu als, and outlier detection with multiresponse data. Biometrics 1972; 28: 81–124
NASA Astrophysics Data System (ADS)
Li, Shaoxin; Zhang, Yanjiao; Xu, Junfa; Li, Linfang; Zeng, Qiuyao; Lin, Lin; Guo, Zhouyi; Liu, Zhiming; Xiong, Honglian; Liu, Songhao
2014-09-01
This study aims to present a noninvasive prostate cancer screening methods using serum surface-enhanced Raman scattering (SERS) and support vector machine (SVM) techniques through peripheral blood sample. SERS measurements are performed using serum samples from 93 prostate cancer patients and 68 healthy volunteers by silver nanoparticles. Three types of kernel functions including linear, polynomial, and Gaussian radial basis function (RBF) are employed to build SVM diagnostic models for classifying measured SERS spectra. For comparably evaluating the performance of SVM classification models, the standard multivariate statistic analysis method of principal component analysis (PCA) is also applied to classify the same datasets. The study results show that for the RBF kernel SVM diagnostic model, the diagnostic accuracy of 98.1% is acquired, which is superior to the results of 91.3% obtained from PCA methods. The receiver operating characteristic curve of diagnostic models further confirm above research results. This study demonstrates that label-free serum SERS analysis technique combined with SVM diagnostic algorithm has great potential for noninvasive prostate cancer screening.
Szymanska-Chargot, M; Chylinska, M; Kruk, B; Zdunek, A
2015-01-22
The aim of this work was to quantitatively and qualitatively determine the composition of the cell wall material from apples during development by means of Fourier transform infrared (FT-IR) spectroscopy. The FT-IR region of 1500-800 cm(-1), containing characteristic bands for galacturonic acid, hemicellulose and cellulose, was examined using principal component analysis (PCA), k-means clustering and partial least squares (PLS). The samples were differentiated by development stage and cultivar using PCA and k-means clustering. PLS calibration models for galacturonic acid, hemicellulose and cellulose content from FT-IR spectra were developed and validated with the reference data. PLS models were tested using the root-mean-square errors of cross-validation for contents of galacturonic acid, hemicellulose and cellulose which was 8.30 mg/g, 4.08% and 1.74%, respectively. It was proven that FT-IR spectroscopy combined with chemometric methods has potential for fast and reliable determination of the main constituents of fruit cell walls. Copyright © 2014 Elsevier Ltd. All rights reserved.
Longobardi, Francesco; Innamorato, Valentina; Di Gioia, Annalisa; Ventrella, Andrea; Lippolis, Vincenzo; Logrieco, Antonio F; Catucci, Lucia; Agostiano, Angela
2017-12-15
Lentil samples coming from two different countries, i.e. Italy and Canada, were analysed using untargeted 1 H NMR fingerprinting in combination with chemometrics in order to build models able to classify them according to their geographical origin. For such aim, Soft Independent Modelling of Class Analogy (SIMCA), k-Nearest Neighbor (k-NN), Principal Component Analysis followed by Linear Discriminant Analysis (PCA-LDA) and Partial Least Squares-Discriminant Analysis (PLS-DA) were applied to the NMR data and the results were compared. The best combination of average recognition (100%) and cross-validation prediction abilities (96.7%) was obtained for the PCA-LDA. All the statistical models were validated both by using a test set and by carrying out a Monte Carlo Cross Validation: the obtained performances were found to be satisfying for all the models, with prediction abilities higher than 95% demonstrating the suitability of the developed methods. Finally, the metabolites that mostly contributed to the lentil discrimination were indicated. Copyright © 2017 Elsevier Ltd. All rights reserved.
Fontes, Cristiano Hora; Budman, Hector
2017-11-01
A clustering problem involving multivariate time series (MTS) requires the selection of similarity metrics. This paper shows the limitations of the PCA similarity factor (SPCA) as a single metric in nonlinear problems where there are differences in magnitude of the same process variables due to expected changes in operation conditions. A novel method for clustering MTS based on a combination between SPCA and the average-based Euclidean distance (AED) within a fuzzy clustering approach is proposed. Case studies involving either simulated or real industrial data collected from a large scale gas turbine are used to illustrate that the hybrid approach enhances the ability to recognize normal and fault operating patterns. This paper also proposes an oversampling procedure to create synthetic multivariate time series that can be useful in commonly occurring situations involving unbalanced data sets. Copyright © 2017 ISA. Published by Elsevier Ltd. All rights reserved.
Generation of Boundary Manikin Anthropometry
NASA Technical Reports Server (NTRS)
Young, Karen S.; Margerum, Sarah; Barr, Abbe; Ferrer, Mike A.; Rajulu, Sudhakar
2008-01-01
The purpose of this study was to develop 3D digital boundary manikins that are representative of the anthropometry of a unique population. These digital manikins can be used by designers to verify and validate that the components of the spacesuit design satisfy the requirements specified in the Human Systems Integration Requirements (HSIR) document. Currently, the HSIR requires the suit to accommodate the 1st percentile American female to the 99th percentile American male. The manikin anthropometry was derived using two methods: Principal Component Analysis (PCA) and Whole Body Posture Based Analysis (WBPBA). PCA is a statistical method for reducing a multidimensional data set by using eigenvectors and eigenvalues. The goal is to create a reduced data set that encapsulates the majority of the variation in the population. WBPBA is a multivariate analytical approach that was developed by the Anthropometry and Biomechanics Facility (ABF) to identify the extremes of the population for a given body posture. WBPBA is a simulation-based method that finds extremes in a population based on anthropometry and posture whereas PCA is based solely on anthropometry. Both methods yield a list of subjects and their anthropometry from the target population; PCA resulted in 20 female and 22 male subjects anthropometry and WBPBA resulted in 7 subjects' anthropometry representing the extreme subjects in the target population. The subjects anthropometry is then used to 'morph' a baseline digital scan of a person with the same body type to create a 3D digital model that can be used as a tool for designers, the details of which will be discussed in subsequent papers.
Spahn, Martin; Morlacco, Alessandro; Boxler, Silvan; Joniau, Steven; Briganti, Alberto; Montorsi, Francesco; Gontero, Paolo; Bader, Pia; Frohneberg, Detlef; van Poppel, Hein; Karnes, Robert Jeffrey
2017-11-01
To identify which patients with macroscopic bladder-infiltrating T4 prostate cancer (PCa) might have favourable outcomes when treated with radical cystectomy (RC). We evaluated 62 patients with cT4cN0-1 cM0 PCa treated with RC and pelvic lymph node dissection between 1972 and 2011. In addition to descriptive statistics, the Kaplan-Meier method and log-rank tests were used to depict survival rates. Univariate and multivariate Cox regression analysis tested the association between predictors and progression-free, PCa-specific and overall survival. Of the 62 patients, 19 (30.6%) did not have clinical progression during follow-up, two (3.2%) had local recurrence, and 32 (51.6%) had haematogenous and nine (14.5%) combined pelvic and distant metastasis. Forty patients (64.5%) died, 34 (54.8%) from PCa and six (9.7%) from other causes. The median (range) survival time of the 19 patients who were metastasis-free at last follow-up was 86 (1-314) months, 8/19 patients had a follow-up of >5 years, and five patients survived metastasis-free for >15 years. Patients without seminal vesicle invasion (SVI) had the best outcomes, with an estimated 10-year PCa-specific survival of 75% compared with 24% for patients with SVI. For cT4 PCa RC can be an appropriate treatment for local control and part of a multimodality-treatment approach. Although recurrences are probable, these do not necessarily translate into cancer-specific death. Men without SVI had a 75% 10-year PCa-specific survival. Although outcomes for patients with SVI are not as favourable, there can be good local control; however, these patients are at higher risk of progression and may need more aggressive systemic treatment. © 2017 The Authors BJU International © 2017 BJU International Published by John Wiley & Sons Ltd.
Kaufman, Leyla V; Wright, Mark G
2011-08-01
Understanding what ecological factors might predispose indigenous habitats to invasion by invasive species is an important aspect of conservation and invasive species management, particularly when biological control is considered for suppression of the invasive species. This study seeks to identify ecological factors that might play a role in determining the structure of the parasitoid assemblage associated with caterpillars of the endemic Hawaiian moth Udea stellata (Crambidae). Parasitoids were reared from field-collected U. stellata larvae at 18 locations. Fourteen environmental variables were measured at each site. Two multivariate analyses, principal component analysis (PCA) and partial redundancy analysis (RDA), were used to analyze the parasitoid assemblage across a range of habitats varying in environmental characteristics. The PCA analysis showed that the occurrence of some species were highly correlated, and associated with less disturbed sites, whereas other species were associated with sites of medium and high levels of disturbance. The RDA analysis showed that only three of the measured environmental variables (U. stellata density, elevation, and level of habitat disturbance) significantly explained variability in the parasitoid assemblage among sites. There was greater parasitoid species richness associated with U. stellata larvae at higher elevation sites with a lower degree of habitat disturbance by exotic vegetation. The purposely introduced parasitoid species were associated with the non-target moth at sites located at higher elevations with low levels of disturbance. Multivariate analysis has the potential to provide valuable insights into the identification of important environmental factors that mediate parasitoid assemblage structure and level of parasitism on a particular target or non-target species, and therefore facilitate identification of suitable target habitats or susceptible non-target habitats.
Msimanga, Huggins Z; Ollis, Robert J
2010-06-01
Principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) were used to classify acetaminophen-containing medicines using their attenuated total reflection Fourier transform infrared (ATR-FT-IR) spectra. Four formulations of Tylenol (Arthritis Pain Relief, Extra Strength Pain Relief, 8 Hour Pain Relief, and Extra Strength Pain Relief Rapid Release) along with 98% pure acetaminophen were selected for this study because of the similarity of their spectral features, with correlation coefficients ranging from 0.9857 to 0.9988. Before acquiring spectra for the predictor matrix, the effects on spectral precision with respect to sample particle size (determined by sieve size opening), force gauge of the ATR accessory, sample reloading, and between-tablet variation were examined. Spectra were baseline corrected and normalized to unity before multivariate analysis. Analysis of variance (ANOVA) was used to study spectral precision. The large particles (35 mesh) showed large variance between spectra, while fine particles (120 mesh) indicated good spectral precision based on the F-test. Force gauge setting did not significantly affect precision. Sample reloading using the fine particle size and a constant force gauge setting of 50 units also did not compromise precision. Based on these observations, data acquisition for the predictor matrix was carried out with the fine particles (sieve size opening of 120 mesh) at a constant force gauge setting of 50 units. After removing outliers, PCA successfully classified the five samples in the first and second components, accounting for 45.0% and 24.5% of the variances, respectively. The four-component PLS-DA model (R(2)=0.925 and Q(2)=0.906) gave good test spectra predictions with an overall average of 0.961 +/- 7.1% RSD versus the expected 1.0 prediction for the 20 test spectra used.
Piecewise multivariate modelling of sequential metabolic profiling data.
Rantalainen, Mattias; Cloarec, Olivier; Ebbels, Timothy M D; Lundstedt, Torbjörn; Nicholson, Jeremy K; Holmes, Elaine; Trygg, Johan
2008-02-19
Modelling the time-related behaviour of biological systems is essential for understanding their dynamic responses to perturbations. In metabolic profiling studies, the sampling rate and number of sampling points are often restricted due to experimental and biological constraints. A supervised multivariate modelling approach with the objective to model the time-related variation in the data for short and sparsely sampled time-series is described. A set of piecewise Orthogonal Projections to Latent Structures (OPLS) models are estimated, describing changes between successive time points. The individual OPLS models are linear, but the piecewise combination of several models accommodates modelling and prediction of changes which are non-linear with respect to the time course. We demonstrate the method on both simulated and metabolic profiling data, illustrating how time related changes are successfully modelled and predicted. The proposed method is effective for modelling and prediction of short and multivariate time series data. A key advantage of the method is model transparency, allowing easy interpretation of time-related variation in the data. The method provides a competitive complement to commonly applied multivariate methods such as OPLS and Principal Component Analysis (PCA) for modelling and analysis of short time-series data.
Systems genetic analysis of multivariate response to iron deficiency in mice
Yin, Lina; Unger, Erica L.; Jellen, Leslie C.; Earley, Christopher J.; Allen, Richard P.; Tomaszewicz, Ann; Fleet, James C.
2012-01-01
The aim of this study was to identify genes that influence iron regulation under varying dietary iron availability. Male and female mice from 20+ BXD recombinant inbred strains were fed iron-poor or iron-adequate diets from weaning until 4 mo of age. At death, the spleen, liver, and blood were harvested for the measurement of hemoglobin, hematocrit, total iron binding capacity, transferrin saturation, and liver, spleen and plasma iron concentration. For each measure and diet, we found large, strain-related variability. A principal-components analysis (PCA) was performed on the strain means for the seven parameters under each dietary condition for each sex, followed by quantitative trait loci (QTL) analysis on the factors. Compared with the iron-adequate diet, iron deficiency altered the factor structure of the principal components. QTL analysis, combined with PosMed (a candidate gene searching system) published gene expression data and literature citations, identified seven candidate genes, Ptprd, Mdm1, Picalm, lip1, Tcerg1, Skp2, and Frzb based on PCA factor, diet, and sex. Expression of each of these is cis-regulated, significantly correlated with the corresponding PCA factor, and previously reported to regulate iron, directly or indirectly. We propose that polymorphisms in multiple genes underlie individual differences in iron regulation, especially in response to dietary iron challenge. This research shows that iron management is a highly complex trait, influenced by multiple genes. Systems genetics analysis of iron homeostasis holds promise for developing new methods for prevention and treatment of iron deficiency anemia and related diseases. PMID:22461179
Song, Seung Yeob; Lee, Young Koung; Kim, In-Jung
2016-01-01
A high-throughput screening system for Citrus lines were established with higher sugar and acid contents using Fourier transform infrared (FT-IR) spectroscopy in combination with multivariate analysis. FT-IR spectra confirmed typical spectral differences between the frequency regions of 950-1100 cm(-1), 1300-1500 cm(-1), and 1500-1700 cm(-1). Principal component analysis (PCA) and subsequent partial least square-discriminant analysis (PLS-DA) were able to discriminate five Citrus lines into three separate clusters corresponding to their taxonomic relationships. The quantitative predictive modeling of sugar and acid contents from Citrus fruits was established using partial least square regression algorithms from FT-IR spectra. The regression coefficients (R(2)) between predicted values and estimated sugar and acid content values were 0.99. These results demonstrate that by using FT-IR spectra and applying quantitative prediction modeling to Citrus sugar and acid contents, excellent Citrus lines can be early detected with greater accuracy. Copyright © 2015 Elsevier Ltd. All rights reserved.
Classification of white wine aromas with an electronic nose.
Lozano, J; Santos, J P; Horrillo, M C
2005-09-15
This paper reports the use of a tin dioxide multisensor array based electronic nose for recognition of 29 typical aromas in white wine. Headspace technique has been used to extract aroma of the wine. Multivariate analysis, including principal component analysis (PCA) as well as probabilistic neural networks (PNNs), has been used to identify the main aroma added to the wine. The results showed that in spite of the strong influence of ethanol and other majority compounds of wine, the system could discriminate correctly the aromatic compounds added to the wine with a minimum accuracy of 97.2%.
Interval to Testosterone Recovery After Hormonal Therapy for Prostate Cancer and Risk of Death
DOE Office of Scientific and Technical Information (OSTI.GOV)
D'Amico, Anthony V.; Chen, M.-H.; Renshaw, Andrew A.
Purpose: To assess whether the risk of death is associated with the time to testosterone recovery (TTR) after radiotherapy (RT) and hormonal therapy (HT) for prostate cancer (PCa). Patients and Methods: Between 1995 and 2001, 206 men with localized, unfavorable-risk PCa were randomized to receive RT or RT plus 6 months of HT. A multivariate postrandomization Cox regression analysis was used to assess whether the TTR in years was associated with the risk of death after adjusting for the known prognostic factors, age, Adult Comorbidity Evaluation-27 score, and the use of HT for recurrence. Results: Of the 102 men randomizedmore » to receive RT and HT, 57 (56%) had a TTR of >2 years, and none of these men had died of PCa after a median follow-up of 7.6 years. As the TTR increased, the risk of death decreased significantly (adjusted hazard ratio, 0.60; 95% confidence interval, 0.43-0.84; p = .003). A significant interaction was noted between the TTR and the comorbidity score (p = .002). The survival estimates were similar (p = 0.17) across the TTR values in men with moderate to severe comorbidity; however, these estimates increased significantly (p < .001) with decreasing PCa-specific mortality (p = .006) as the TTR increased in men with no or minimal comorbidity. Conclusion: The results of our study have shown that a longer TTR after RT plus 6 months of HT for unfavorable-risk PCa is associated with a lower risk of death in men with no or minimal comorbidity.« less
Jeldres, Claudio; Cullen, Jennifer; Hurwitz, Lauren M.; Wolff, Erika M.; Levie, Katherine; Odem-Davis, Katherine; Johnston, Richard B.; Pham, Khanh N.; Rosner, Inger L; Brand, Timothy C.; L’Esperance, James O.; Sterbis, Joseph R.; Etzioni, Ruth B.; Porter, Christopher R.
2015-01-01
Background For low-risk prostate cancer (PCa), active surveillance (AS) may confer comparable oncological outcomes to radical prostatectomy (RP). Health-related quality of life (HRQoL) outcomes are important to consider, yet few studies have examined HRQoL for patients managed with AS. This study compared longitudinal HRQoL in a prospective, racially diverse, and contemporary cohort of patients who underwent RP or AS for low-risk PCa. Methods Beginning in 2007, HRQoL data from validated questionnaires (EPIC and SF-36) were collected by the Center for Prostate Disease Research in a multi-center national database. Patients aged ≤75 that were diagnosed with low-risk PCa and elected RP or AS for initial disease management were followed for three years. Mean scores were estimated using generalized estimating equations, adjusting for baseline HRQoL, demographic and clinical patient characteristics. Results Of the patients with low-risk PCa, 228 underwent RP and 77 underwent AS. Multivariable analysis revealed that RP patients had significantly worse sexual function, sexual bother, and urinary function at all time points compared to patients on AS. Differences in mental health between groups were below the threshold for clinical significance at one year. Conclusions This study found no differences in mental health outcomes but worse urinary and sexual HRQoL for RP patients compared to AS patients for up to three years. These data offer support for management of low risk PCa with AS as a means for postponing the morbidity associated with RP without concomitant mental health declines. PMID:25845467
Antioxidant Characterization of Oak Extracts Combining Spectrophotometric Assays and Chemometrics
Popović, Boris M.; Štajner, Dubravka; Orlović, Saša; Galić, Zoran
2013-01-01
Antioxidant characteristics of leaves, twigs, and acorns from two Serbian oak species Quercus robur L. and Quercus petraea L. from Vojvodina province (northern Serbia) were investigated. 80% ethanol (in water) extracts were used for antiradical power (ARP) determinations against DPPH•, •NO, and O2 •− radicals, ferric reducing antioxidant power (FRAP), total phenol, tannin, flavonoid, and proanthocyanidin contents. Permanganate reducing antioxidant capacity (PRAC) was determined using water extracts. Beside, mentioned parameters, soluble proteins, lipid peroxidation (LP), pigments and proline contents were also determined. The data of different procedures were compared and analyzed by multivariate techniques (correlation matrix calculation and principal component analysis (PCA)). PCA found that investigated organs of two different oak tree species possess similar antioxidant characteristics. The superior antioxidant characteristics showed oak leaves over twigs and acorns and seem to be promising source of antioxidants with possible use in industry and pharmacy. PMID:24453789
Horning, Aaron M; Awe, Julius A; Wang, Chiou-Miin; Liu, Joseph; Lai, Zhao; Wang, Vickie Yao; Jadhav, Rohit R; Louie, Anna D; Lin, Chun-Lin; Kroczak, Tad; Chen, Yidong; Jin, Victor X; Abboud-Werner, Sherry L; Leach, Robin J; Hernandez, Javior; Thompson, Ian M; Saranchuk, Jeff; Drachenberg, Darrel; Chen, Chun-Liang; Mai, Sabine; Huang, Tim Hui-Ming
2015-11-01
Altered DNA methylation in CpG islands of gene promoters has been implicated in prostate cancer (PCa) progression and can be used to predict disease outcome. In this study, we determine whether methylation changes of androgen biosynthesis pathway (ABP)-related genes in patients' plasma cell-free DNA (cfDNA) can serve as prognostic markers for biochemical recurrence (BCR). Methyl-binding domain capture sequencing (MBDCap-seq) was used to identify differentially methylated regions (DMRs) in primary tumors of patients who subsequently developed BCR or not, respectively. Methylation pyrosequencing of candidate loci was validated in cfDNA samples of 86 PCa patients taken at and/or post-radical prostatectomy (RP) using univariate and multivariate prediction analyses. Putative DMRs in 13 of 30 ABP-related genes were found between tumors of BCR (n = 12) versus no evidence of disease (NED) (n = 15). In silico analysis of The Cancer Genome Atlas data confirmed increased DNA methylation of two loci-SRD5A2 and CYP11A1, which also correlated with their decreased expression, in tumors with subsequent BCR development. Their aberrant cfDNA methylation was also associated with detectable levels of PSA taken after patients' post-RP. Multivariate analysis of the change in cfDNA methylation at all of CpG sites measured along with patient's treatment history predicted if a patient will develop BCR with 77.5% overall accuracy. Overall, increased DNA methylation of SRD5A2 and CYP11A1 related to androgen biosynthesis functions may play a role in BCR after patients' RP. The correlation between aberrant cfDNA methylation and detectable PSA in post-RP further suggests their utility as predictive markers for PCa recurrence. . © 2015 Wiley Periodicals, Inc.
Randazzo, Marco; Müller, Alexander; Carlsson, Sigrid; Eberli, Daniel; Huber, Andreas; Grobholz, Rainer; Manka, Lukas; Mortezavi, Ashkan; Sulser, Tullio; Recker, Franz; Kwiatkowski, Maciej
2016-01-01
Objective To assess the value of positive family history (FH) as a risk factor for prostate cancer (PCa) incidence and grade among men undergoing organized PSA-screening in a population-based study. Patients and Methods The study cohort comprised all attendees of the Swiss arm of the European Randomized Study of Screening for Prostate Cancer (ERSPC) with systematic PSA-tests every 4 years. Men reporting first-degree relative(s) diagnosed with PCa were considered to have a positive FH. Biopsy was exclusively PSA-triggered with a threshold of 3 ng/ml. Primary endpoint was PCa diagnosis. Kaplan-Meier and Cox regression analyses were used. Results Of 4,932 attendees with a median age of 60.9 (IQR 57.6–65.1) years, 334 (6.8%) reported a positive FH. Median follow-up duration was 11.6 years (IQR 10.3–13.3). Cumulative PCa incidence was 60/334 (18%, positive FH) and 550/4,598 (12%, negative FH) (OR 1.6, 95% CI 1.2–2.2, p=0.001), respectively. In both groups, most PCa diagnosed had a low grade. There were no significant differences in PSA at diagnosis, biopsy Gleason score or Gleason score on pathologic specimen among men who underwent radical prostatectomy between both groups, respectively. On multivariable analysis, age (HR 1.04, 95% CI 1.02–1.06), baseline PSA (HR 1.13 95% CI 1.12–1.14), and FH (HR 1.6, CI 1.24–2.14) were independent predictors for overall PCa incidence (p<0.0001 each). Only baseline PSA (HR 1.14, 95% CI 1.12–1.16, p<0.0001) was an independent predictor of Gleason score ≥7 PCa on prostate biopsy. The proportion of interval PCa diagnosed in between the screening rounds was non-significantly different. Conclusion Irrespective of the FH status, the current PSA-based screening setting detects the majority of aggressive PCa and missed only a minority of interval cancers with a 4-year screening algorithm. Our results suggest that men with a positive FH are at increased risk for low grade but not aggressive PCa. PMID:26332304
Arthur, Rhonda; Møller, Henrik; Garmo, Hans; Holmberg, Lars; Stattin, Pår; Malmstrom, Håkan; Lambe, Mats; Hammar, Niklas; Walldius, Göran; Robinson, David; Jungner, Ingmar; Hemelrijck, Mieke Van
2016-06-01
Lifestyle-related risk factors such as hyperglycemia and dyslipidemia have been associated with several cancers. However, studies exploring their link with prostate cancer (PCa) clinicopathological characteristics are sparse and inconclusive. Here, we investigated the associations between serum metabolic markers and PCa clinicopathological characteristics. The study comprised 14,294 men from the Swedish Apolipoprotein MOrtality RISk (AMORIS) cohort who were diagnosed with PCa between 1996 and 2011. Univariate and multivariable logistic regression were used to investigate the relation between glucose, triglycerides and total cholesterol and PCa risk categories, PSA, Gleason score, and T-stage. Mean age at time of PCa diagnosis was 69 years. Men with glucose levels >6.9 mmol/L tend to have PSA<4 μg/L, while those with glucose levels of 5.6-6.9 mmol/L had a greater odds of PSA>20 μg/L compared to PSA 4.0-9.9 μg/L. Hypertriglyceridemia was also positively associated with PSA>20 μg/L. Hyperglycemic men had a greater odds of intermediate- and high-grade PCa and advanced stage or metastatic PCa. Similarly, hypertriglyceridemia was positively associated with high-grade PCa. There was also a trend toward an increased odds of intermediate risk localized PCa and advanced stage PCa among men with hypertriglyceridemia. Total cholesterol did not have any statistically significant association with any of the outcomes studied. Our findings suggest that high serum levels of glucose and triglycerides may influence PCa aggressiveness and severity. Further investigation on the role of markers of glucose and lipid metabolism in influencing PCa aggressiveness and severity is needed as this may help define important targets for intervention. © 2016 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.
Age-Specific Prostate Specific Antigen Cutoffs for Guiding Biopsy Decision in Chinese Population
Xu, Jianfeng; Jiang, Haowen; Ding, Qiang
2013-01-01
Background Age-specific prostate specific antigen (PSA) cutoffs for prostate biopsy have been widely used in the USA and European countries. However, the application of age-specific PSA remains poorly understood in China. Methods Between 2003 and 2012, 1,848 men over the age of 40, underwent prostate biopsy for prostate cancer (PCa) at Huashan Hospital, Shanghai, China. Clinical information and blood samples were collected prior to biopsy for each patient. Men were divided into three age groups (≤60, 61 to 80, and >80) for analyses. Digital rectal examination (DRE), transrectal ultrasound (prostate volume and nodule), total PSA (tPSA), and free PSA (fPSA) were also included in the analyses. Logistic regression was used to build the multi-variate model. Results Serum tPSA levels were age-dependent (P = 0.008), while %fPSA (P = 0.051) and PSAD (P = 0.284) were age-independent. At a specificity of 80%, the sensitivities for predicting PCa were 83%, 71% and 68% with tPSA cutoff values of 19.0 ng/mL (age≤60),21.0 ng/mL (age 61–80), and 23.0 ng/mL (age≥81). Also, sensitivities at the same tPSA levels were able to reach relatively high levels (70%–88%) for predicting high-grade PCa. Area (AUC) under the receive operating curves (ROCs) of tPSA, %fPSA, PSAD and multi-variate model were different in age groups. When predicting PCa, the AUC of tPSA, %fPSA, PSAD and multi-variate model were 0.90, 0.57, 0.93 and 0.87 respectively in men ≤60 yr; 0.82, 0.70, 0.88 and 0.86 respectively in men 61–80 yr; 0.79, 0.78, 0.87 and 0.88 respectively in men>80 yr. When predicting Gleason Score ≥7 or 8 PCa, there were no significant differences between AUCs of each variable. Conclusion Age-specific PSA cutoff values for prostate biopsy should be considered in the Chinese population. Indications for prostate biopsies (tPSA, %fPSA and PSAD) should be considered based on age in the Chinese population. PMID:23825670
De Luca, Stefano; Passera, Roberto; Bollito, Enrico; Manfredi, Matteo; Scarpa, Roberto Mario; Sottile, Antonino; Randone, Donato Franco; Porpiglia, Francesco
2014-12-01
To determine if prostate cancer gene 3 (PCA3) score, Prostate Health Index (PHI), and percent free prostate-specific antigen (%fPSA) may be used to differentiate prostatitis from prostate cancer (PCa), benign prostatic hyperplasia (BPH) and high-grade prostate intraepithelial neoplasia (HG-PIN) in patients with elevated PSA and negative digital rectal examination (DRE). in the present prospective study, 274 patients, undergoing PCA3 score, PHI and %fPSA assessments before initial biopsy, were enrolled. Three multivariate logistic regression models were used to test PCA3 score, PHI and %fPSA as risk factors for prostatitis vs. PCa, vs. BPH, and vs. HG-PIN. All the analyses were performed for the whole patient cohort and for the 'gray zone' of PSA (4-10 ng/ml) cohort (188 individuals). The determinants for prostatitis vs. PCa were PCA3 score, PHI and %fPSA (Odds Ratio [OR]=0.97, 0.96 and 0.94, respectively). Unit increase of PHI was the only risk factor for prostatitis vs. BPH (OR=1.06), and unit increase of PCA3 score for HG-PIN vs. prostatitis (OR=0.98). In the 'gray zone' PSA cohort, the determinants for prostatitis vs. PCa were PCA3 score, PHI and %fPSA (OR=0.96, 0.94 and 0.92, respectively), PCA3 score and PHI for prostatitis vs. BPH (OR=0.96 and 1.08, respectively), and PCA3 score for prostatitis vs. HG-PIN (OR=0.97). The clinical benefit of using PCA3 score and PHI to estimate prostatitis vs. PCa was comparable; even %fPSA had good diagnostic performance, being a faster and cheaper marker. PHI was the only determinant for prostatitis vs. BPH, while PCA3 score for prostatitis vs. HG-PIN. Copyright© 2014 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.
Hamchevici, Carmen; Udrea, Ion
2013-11-01
The concept of basin-wide Joint Danube Survey (JDS) was launched by the International Commission for the Protection of the Danube River (ICPDR) as a tool for investigative monitoring under the Water Framework Directive (WFD), with a frequency of 6 years. The first JDS was carried out in 2001 and its success in providing key information for characterisation of the Danube River Basin District as required by WFD lead to the organisation of the second JDS in 2007, which was the world's biggest river research expedition in that year. The present paper presents an approach for improving the survey strategy for the next planned survey JDS3 (2013) by means of several multivariate statistical techniques. In order to design the optimum structure in terms of parameters and sampling sites, principal component analysis (PCA), factor analysis (FA) and cluster analysis were applied on JDS2 data for 13 selected physico-chemical and one biological element measured in 78 sampling sites located on the main course of the Danube. Results from PCA/FA showed that most of the dataset variance (above 75%) was explained by five varifactors loaded with 8 out of 14 variables: physical (transparency and total suspended solids), relevant nutrients (N-nitrates and P-orthophosphates), feedback effects of primary production (pH, alkalinity and dissolved oxygen) and algal biomass. Taking into account the representation of the factor scores given by FA versus sampling sites and the major groups generated by the clustering procedure, the spatial network of the next survey could be carefully tailored, leading to a decreasing of sampling sites by more than 30%. The approach of target oriented sampling strategy based on the selected multivariate statistics can provide a strong reduction in dimensionality of the original data and corresponding costs as well, without any loss of information.
Alves, Darlan Daniel; Riegel, Roberta Plangg; de Quevedo, Daniela Müller; Osório, Daniela Montanari Migliavacca; da Costa, Gustavo Marques; do Nascimento, Carlos Augusto; Telöken, Franko
2018-06-08
Assessment of surface water quality is an issue of currently high importance, especially in polluted rivers which provide water for treatment and distribution as drinking water, as is the case of the Sinos River, southern Brazil. Multivariate statistical techniques allow a better understanding of the seasonal variations in water quality, as well as the source identification and source apportionment of water pollution. In this study, the multivariate statistical techniques of cluster analysis (CA), principal component analysis (PCA), and positive matrix factorization (PMF) were used, along with the Kruskal-Wallis test and Spearman's correlation analysis in order to interpret a water quality data set resulting from a monitoring program conducted over a period of almost two years (May 2013 to April 2015). The water samples were collected from the raw water inlet of the municipal water treatment plant (WTP) operated by the Water and Sewage Services of Novo Hamburgo (COMUSA). CA allowed the data to be grouped into three periods (autumn and summer (AUT-SUM); winter (WIN); spring (SPR)). Through the PCA, it was possible to identify that the most important parameters in contribution to water quality variations are total coliforms (TCOLI) in SUM-AUT, water level (WL), water temperature (WT), and electrical conductivity (EC) in WIN and color (COLOR) and turbidity (TURB) in SPR. PMF was applied to the complete data set and enabled the source apportionment water pollution through three factors, which are related to anthropogenic sources, such as the discharge of domestic sewage (mostly represented by Escherichia coli (ECOLI)), industrial wastewaters, and agriculture runoff. The results provided by this study demonstrate the contribution provided by the use of integrated statistical techniques in the interpretation and understanding of large data sets of water quality, showing also that this approach can be used as an efficient methodology to optimize indicators for water quality assessment.
Benson, Nsikak U.; Asuquo, Francis E.; Williams, Akan B.; Essien, Joseph P.; Ekong, Cyril I.; Akpabio, Otobong; Olajire, Abaas A.
2016-01-01
Trace metals (Cd, Cr, Cu, Ni and Pb) concentrations in benthic sediments were analyzed through multi-step fractionation scheme to assess the levels and sources of contamination in estuarine, riverine and freshwater ecosystems in Niger Delta (Nigeria). The degree of contamination was assessed using the individual contamination factors (ICF) and global contamination factor (GCF). Multivariate statistical approaches including principal component analysis (PCA), cluster analysis and correlation test were employed to evaluate the interrelationships and associated sources of contamination. The spatial distribution of metal concentrations followed the pattern Pb>Cu>Cr>Cd>Ni. Ecological risk index by ICF showed significant potential mobility and bioavailability for Cu, Cu and Ni. The ICF contamination trend in the benthic sediments at all studied sites was Cu>Cr>Ni>Cd>Pb. The principal component and agglomerative clustering analyses indicate that trace metals contamination in the ecosystems was influenced by multiple pollution sources. PMID:27257934
Multivariate statistical approach to estimate mixing proportions for unknown end members
Valder, Joshua F.; Long, Andrew J.; Davis, Arden D.; Kenner, Scott J.
2012-01-01
A multivariate statistical method is presented, which includes principal components analysis (PCA) and an end-member mixing model to estimate unknown end-member hydrochemical compositions and the relative mixing proportions of those end members in mixed waters. PCA, together with the Hotelling T2 statistic and a conceptual model of groundwater flow and mixing, was used in selecting samples that best approximate end members, which then were used as initial values in optimization of the end-member mixing model. This method was tested on controlled datasets (i.e., true values of estimates were known a priori) and found effective in estimating these end members and mixing proportions. The controlled datasets included synthetically generated hydrochemical data, synthetically generated mixing proportions, and laboratory analyses of sample mixtures, which were used in an evaluation of the effectiveness of this method for potential use in actual hydrological settings. For three different scenarios tested, correlation coefficients (R2) for linear regression between the estimated and known values ranged from 0.968 to 0.993 for mixing proportions and from 0.839 to 0.998 for end-member compositions. The method also was applied to field data from a study of end-member mixing in groundwater as a field example and partial method validation.
Mottese, Antonio Francesco; Naccari, Clara; Vadalà, Rossella; Bua, Giuseppe Daniel; Bartolomeo, Giovanni; Rando, Rossana; Cicero, Nicola; Dugo, Giacomo
2018-01-01
Opuntia ficus-indica L. Miller fruits, particularly 'Ficodindia dell'Etna' of Biancavilla (POD), 'Fico d'india tradizionale di Roccapalumba' with protected brand and samples from an experimental field in Pezzolo (Sicily) were analyzed by inductively coupled plasma mass spectrometry in order to determine the multi-element profile. A multivariate chemometric approach, specifically principal component analysis (PCA), was applied to individuate how mineral elements may represent a marker of geographic origin, which would be useful for traceability. PCA has allowed us to verify that the geographical origin of prickly pear fruits is significantly influenced by trace element content, and the results found in Biancavilla PDO samples were linked to the geological composition of this volcanic areas. It was observed that two principal components accounted for 72.03% of the total variance in the data and, in more detail, PC1 explains 45.51% and PC2 26.52%, respectively. This study demonstrated that PCA is an integrated tool for the traceability of food products and, at the same time, a useful method of authentication of typical local fruits such as prickly pear. © 2017 Society of Chemical Industry. © 2017 Society of Chemical Industry.
NASA Astrophysics Data System (ADS)
Suzuki, Noriaki
Genetically engineered proteins for inorganics (GEPIs) belong to a new class of polypeptides that are designed to have specific affinities to inorganic materials. A "gold binding protein (GBP)" was chosen as a model protein for GEPIs to study the molecular origins of binding specificity to gold using Time-of-flight secondary ion mass spectrometry (TOF-SIMS) and X-ray photoelectron spectroscopy (XPS). TOF-SIMS, a surface-sensitive analytical instrument with extremely high mass resolutions, provides information on specific amino acid-surface interactions. We used "principal component analysis (PCA)" to analyze the data. We also introduced a new multivariate technique, "hierarchical cluster analysis (HCA)" to organize the data into meaningful structures by measuring a degree of "similarity" and "dissimilarity" of the data. This report discusses a combined use of PCA and HCA to elucidate the binding specificity of GBP to Au. Based on the knowledge gained from TOF-SIMS measurements, we further investigated the nature of the interaction between selected amino acids and noble metal surfaces by using X-ray photoelectron spectroscopy (XPS). We developed a unique capability to introduce water vapor during the adsorption of a single amino acid and applied this method to study the intrinsic nature of sidechain/Au interactions. To further apply this unique research protocol, we characterized another type of GEPI, "quartz binding protein (QBP)," to identify the possible binding sites. This thesis research aims to provide experimental protocols for analyzing short peptide-substrate interface from complex spectroscopic data by using multivariate analysis techniques.
Minia, Egypt: Principal Component Analysis
Abdelrehim, Marwa G; Mahfouz, Eman M; Ewis, Ashraf A; Seedhom, Amany E; Afifi, Hassan M; Shebl, Fatma M
2018-02-26
Background: Pancreatic cancer (PC) is a serious and rapidly progressing malignancy. Identifying risk factors including dietary elements is important to develop preventive strategies. This study focused on possible links between diet and PC. Methods: We conducted a case-control study including all PC patients diagnosed at Minia Cancer Center and controls from general population from June 2014 to December 2015. Dietary data were collected directly through personal interviews. Principal component analysis (PCA) was performed to identify dietary groups. The data were analyzed using crude odds ratios (ORs) and multivariable logistic regression with adjusted ORs and 95% confidence intervals (CIs). Results: A total of 75 cases and 149 controls were included in the study. PCA identified six dietary groups, labeled as cereals and grains, vegetables, proteins, dairy products, fruits, and sugars. Bivariate analysis showed that consumption of vegetables, fruits, sugars, and total energy intake were associated with change in PC risk. In multivariable-adjusted models comparing highest versus lowest levels of intake, we observed significant lower odds of PC in association with vegetable intake (OR 0.24; 95% CI, 0.07-0.85, P=0.012) and a higher likelihood with the total energy intake (OR 9.88; 95% CI, 2.56-38.09, P<0.0001). There was also a suggested link between high fruit consumption and reduced odds of PC. Conclusions: The study supports the association between dietary factors and the odds of PC development in Egypt. It was found that higher energy intake is associated with an increase in likelihood of PC, while increased vegetable consumption is associated with a lower odds ratio. Creative Commons Attribution License
Is social interaction associated with alcohol consumption in Uganda?
Tumwesigye, Nazarius Mbona; Kasirye, Rogers; Nansubuga, Elizabeth
2009-07-01
Little is documented about the association of alcohol consumption and social interaction in Uganda, a country with one of the highest per capita alcohol consumptions in the world. This paper describes the pattern of social interaction by sex and establishes the relationship between social interaction and alcohol consumption with and without the consideration of confounders. The data used had 1479 records and were collected in a survey in 2003. The study was part of a multinational study on Gender, Alcohol, and Culture International Study (GENACIS). Each question on social interaction had been pre-coded in a way that quantified the extent of social interaction. The sum of responses on interaction questions gave a summative score which was used to compute summary indices on social interaction. Principal component analysis (PCA) was used to identify the best combination of variables for a social interaction index. The index was computed by a prediction using a PCA model developed from the selected variables. The index was categorised into quintiles and used in bivariate and multivariate logistic regression analysis of alcohol consumption and social interaction. The stronger the social interaction the more the likelihood of taking alcohol frequently (chi(trend)(2)=4.72, p<0.001). The strength of the association remains significant even after controlling for sex, age group and education level (p=0.008). The strength of relationship between social interaction and heavy consumption of alcohol gets weak in multivariate analysis. Communication messages meant to improve health, well-being and public order need to incorporate dangers of negative influence of social interaction.
Locomotor Recovery in Spinal Cord Injury: Insights Beyond Walking Speed and Distance.
Awai, Lea; Curt, Armin
2016-08-01
Recovery of locomotor function after incomplete spinal cord injury (iSCI) is clinically assessed through walking speed and distance, while improvements in these measures might not be in line with a normalization of gait quality and are, on their own, insensitive at revealing potential mechanisms underlying recovery. The objective of this study was to relate changes of gait parameters to the recovery of walking speed while distinguishing between parameters that rather reflect speed improvements from factors contributing to overall recovery. Kinematic data of 16 iSCI subjects were repeatedly recorded during in-patient rehabilitation. The responsiveness of gait parameters to walking speed was assessed by linear regression. Principal component analysis (PCA) was applied on the multivariate data across time to identify factors that contribute to recovery after iSCI. Parameters of gait cycle and movement dynamics were both responsive and closely related to the recovery of walking speed, which increased by 96%. Multivariate analysis revealed specific gait parameters (intralimb shape normality and consistency) that, although less related to speed increments, loaded highly on principal component one (PC1) (58.6%) explaining the highest proportion of variance (i.e., recovery of outcome over time). Interestingly, measures of hip, knee, and ankle range of motion showed varying degrees of responsiveness (from very high to very low) while not contributing to gait recovery as revealed by PCA. The conjunct application of two analysis methods distinguishes gait parameters that simply reflect increased walking speed from parameters that actually contribute to gait recovery in iSCI. This distinction may be of value for the evaluation of interventions for locomotor recovery.
Colihueque, Nelson; Corrales, Olga; Yáñez, Miguel
2017-01-01
Abstract Trichomycterus areolatus Valenciennes, 1846 is a small endemic catfish inhabiting the Andean river basins of Chile. In this study, the morphological variability of three T. areolatus populations, collected in two river basins from southern Chile, was assessed with multivariate analyses, including principal component analysis (PCA) and discriminant function analysis (DFA). It is hypothesized that populations must segregate morphologically from each other based on the river basin that they were sampled from, since each basin presents relatively particular hydrological characteristics. Significant morphological differences among the three populations were found with PCA (ANOSIM test, r = 0.552, p < 0.0001) and DFA (Wilks’s λ = 0.036, p < 0.01). PCA accounted for a total variation of 56.16% by the first two principal components. The first Principal Component (PC1) and PC2 explained 34.72 and 21.44% of the total variation, respectively. The scatter-plot of the first two discriminant functions (DF1 on DF2) also validated the existence of three different populations. In group classification using DFA, 93.3% of the specimens were correctly-classified into their original populations. Of the total of 22 transformed truss measurements, 17 exhibited highly significant (p < 0.01) differences among populations. The data support the existence of T. areolatus morphological variation across different rivers in southern Chile, likely reflecting the geographic isolation underlying population structure of the species. PMID:29134012
Guldbrandsen, Niels; Kostidis, Sarantos; Schäfer, Hartmut; De Mieri, Maria; Spraul, Manfred; Skaltsounis, Alexios-Leandros; Mikros, Emmanuel; Hamburger, Matthias
2015-05-22
Isatis tinctoria is an ancient dye and medicinal plant with potent anti-inflammatory and antiallergic properties. Metabolic differences were investigated by NMR spectroscopy of accessions from different origins that were grown under identical conditions on experimental plots. For these accessions, metabolite profiles at different harvesting dates were analyzed, and single and repeatedly harvested plants were compared. Leaf samples were shock-frozen in liquid N2 immediately after being harvested, freeze-dried, and cryomilled prior to extraction. Extracts were prepared by pressurized liquid extraction with ethyl acetate and 70% aqueous methanol. NMR spectra were analyzed using a combination of different methods of multivariate data analysis such as principal component analysis (PCA), canonical analysis (CA), and k-nearest neighbor concept (k-NN). Accessions and harvesting dates were well separated in the PCA/CA/k-NN analysis in both extracts. Pairwise statistical total correlation spectroscopy (STOCSY) revealed unsaturated fatty acids, porphyrins, carbohydrates, indole derivatives, isoprenoids, phenylpropanoids, and minor aromatic compounds as the cause of these differences. In addition, the metabolite profile was affected by the repeated harvest regime, causing a decrease of 1,5-anhydroglucitol, sucrose, unsaturated fatty acids, porphyrins, isoprenoids, and a flavonoid.
Estimation of failure criteria in multivariate sensory shelf life testing using survival analysis.
Giménez, Ana; Gagliardi, Andrés; Ares, Gastón
2017-09-01
For most food products, shelf life is determined by changes in their sensory characteristics. A predetermined increase or decrease in the intensity of a sensory characteristic has frequently been used to signal that a product has reached the end of its shelf life. Considering all attributes change simultaneously, the concept of multivariate shelf life allows a single measurement of deterioration that takes into account all these sensory changes at a certain storage time. The aim of the present work was to apply survival analysis to estimate failure criteria in multivariate sensory shelf life testing using two case studies, hamburger buns and orange juice, by modelling the relationship between consumers' rejection of the product and the deterioration index estimated using PCA. In both studies, a panel of 13 trained assessors evaluated the samples using descriptive analysis whereas a panel of 100 consumers answered a "yes" or "no" question regarding intention to buy or consume the product. PC1 explained the great majority of the variance, indicating all sensory characteristics evolved similarly with storage time. Thus, PC1 could be regarded as index of sensory deterioration and a single failure criterion could be estimated through survival analysis for 25 and 50% consumers' rejection. The proposed approach based on multivariate shelf life testing may increase the accuracy of shelf life estimations. Copyright © 2017 Elsevier Ltd. All rights reserved.
Abeysekara, Saman; Damiran, Daalkhaijav; Yu, Peiqiang
2013-02-01
The objectives of this study were (i) to determine lipid related molecular structures components (functional groups) in feed combination of cereal grain (barley, Hordeum vulgare) and wheat (Triticum aestivum) based dried distillers grain solubles (wheat DDGSs) from bioethanol processing at five different combination ratios using univariate and multivariate molecular spectral analyses with infrared Fourier transform molecular spectroscopy, and (ii) to correlate lipid-related molecular-functional structure spectral profile to nutrient profiles. The spectral intensity of (i) CH(3) asymmetric, CH(2) asymmetric, CH(3) symmetric and CH(2) symmetric groups, (ii) unsaturation (CC) group, and (iii) carbonyl ester (CO) group were determined. Spectral differences of functional groups were detected by hierarchical cluster analysis (HCA) and principal components analysis (PCA). The results showed that the combination treatments significantly inflicted modifications (P<0.05) in nutrient profile and lipid related molecular spectral intensity (CH(2) asymmetric stretching peak height, CH(2) symmetric stretching peak height, ratio of CH(2) to CH(3) symmetric stretching peak intensity, and carbonyl peak area). Ratio of CH(2) to CH(3) symmetric stretching peak intensity, and carbonyl peak significantly correlated with nutrient profiles. Both PCA and HCA differentiated lipid-related spectrum. In conclusion, the changes of lipid molecular structure spectral profiles through feed combination could be detected using molecular spectroscopy. These changes were associated with nutrient profiles and functionality. Copyright © 2012 Elsevier B.V. All rights reserved.
Untargeted Metabolomic Analysis of Capsicum spp. by GC-MS.
Aranha, Bianca Camargo; Hoffmann, Jessica Fernanda; Barbieri, Rosa Lia; Rombaldi, Cesar Valmor; Chaves, Fábio Clasen
2017-09-01
In order to conserve the biodiversity of Capsicum species and find genotypes with potential to be utilised commercially, Embrapa Clima Temperado maintains an active germplasm collection (AGC) that requires characterisation, enabling genotype selection and support for breeding programmes. The objective of this study was to characterise pepper accessions from the Embrapa Clima Temperado AGC and differentiate species based on their metabolic profile using an untargeted metabolomics approach. Cold (-20°C) methanol extraction residue of freeze-dried fruit samples was partitioned into water/methanol (A) and chloroform (B) fractions. The polar fraction (A) was derivatised and both fractions (A and B) were analysed by gas chromatography coupled to mass spectrometry (GC-MS). Data from each fraction was analysed using a multivariate principal component analysis (PCA) with XCMS software. Amino acids, sugars, organic acids, capsaicinoids, and hydrocarbons were identified. Outlying accessions including P116 (C. chinense), P46, and P76 (C. annuum) were observed in a PCA plot mainly due to their high sucrose and fructose contents. PCA also indicated a separation of P221 (C. annuum) and P200 (C. chinense), because of their high dihydrocapsaicin content. Although the metabolic profiling did not allow for grouping by species, it permitted the simultaneous identification and quantification of several compounds complementing and expanding the metabolic database of the studied Capsicum spp. in the AGC. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.
Arbogast, Luke W; Delaglio, Frank; Schiel, John E; Marino, John P
2017-11-07
Two-dimensional (2D) 1 H- 13 C methyl NMR provides a powerful tool to probe the higher order structure (HOS) of monoclonal antibodies (mAbs), since spectra can readily be acquired on intact mAbs at natural isotopic abundance, and small changes in chemical environment and structure give rise to observable changes in corresponding spectra, which can be interpreted at atomic resolution. This makes it possible to apply 2D NMR spectral fingerprinting approaches directly to drug products in order to systematically characterize structure and excipient effects. Systematic collections of NMR spectra are often analyzed in terms of the changes in specifically identified peak positions, as well as changes in peak height and line widths. A complementary approach is to apply principal component analysis (PCA) directly to the matrix of spectral data, correlating spectra according to similarities and differences in their overall shapes, rather than according to parameters of individually identified peaks. This is particularly well-suited for spectra of mAbs, where some of the individual peaks might not be well resolved. Here we demonstrate the performance of the PCA method for discriminating structural variation among systematic sets of 2D NMR fingerprint spectra using the NISTmAb and illustrate how spectral variability identified by PCA may be correlated to structure.
Effect of sexual steroids on boar kinematic sperm subpopulations.
Ayala, E M E; Aragón, M A
2017-11-01
Here, we show the effects of sexual steroids, progesterone, testosterone, or estradiol on motility parameters of boar sperm. Sixteen commercial seminal doses, four each of four adult boars, were analyzed using computer assisted sperm analysis (CASA). Mean values of motility parameters were analyzed by bivariate and multivariate statistics. Principal component analysis (PCA), followed by hierarchical clustering, was applied on data of motility parameters, provided automatically as intervals by the CASA system. Effects of sexual steroids were described in the kinematic subpopulations identified from multivariate statistics. Mean values of motility parameters were not significantly changed after addition of sexual steroids. Multivariate graphics showed that sperm subpopulations were not sensitive to the addition of either testosterone or estradiol, but sperm subpopulations responsive to progesterone were found. Distribution of motility parameters were wide in controls but sharpened at distinct concentrations of progesterone. We conclude that kinematic sperm subpopulations responsive to progesterone are present in boar semen, and these subpopulations are masked in evaluations of mean values of motility parameters. © 2017 International Society for Advancement of Cytometry. © 2017 International Society for Advancement of Cytometry.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Molina, Sarah; Department of Radiation Oncology, CHU/Université de Poitiers, Poitiers; Guerif, Stéphane
Purpose: Predictive factors for biochemical recurrence (BCR) in localized prostate cancer (PCa) after brachytherapy are insufficient to date. Cellular radiosensitivity depends on DNA double-strand breaks, mainly repaired by the nonhomologous end-joining (NHEJ) system. We analyzed whether the expression of NHEJ proteins can predict BCR in patients treated by brachytherapy for localized PCa. Methods and Materials: From 983 PCa cases treated by brachytherapy between March 2000 and March 2012, 167 patients with available biopsy material suitable for in situ analysis were included in the study. The median follow-up time was 47 months. Twenty-nine patients experienced BCR. All slides were reviewed to reassessmore » the Gleason score. Expression of the key NHEJ proteins DNA-PKcs, Ku70, and Ku80, and the proliferation marker Ki67, was studied by immunohistochemistry performed on tissue microarrays. Results: The Gleason scores after review (P=.06) tended to be associated with BCR when compared with the score initially reported (P=.74). Both the clinical stage (P=.02) and the pretreatment prostate-specific antigen level (P=.01) were associated with biochemical failure. Whereas the expression of Ku80 and Ki67 were not predictive of relapse, positive DNA-PKcs nuclear staining (P=.003) and higher Ku70 expression (P=.05) were associated with BCR. On multivariate analysis, among pretreatment variables, only DNA-PKcs (P=.03) and clinical stage (P=.02) remained predictive of recurrence. None of the patients without palpable PCa and negative DNA-PKcs expression experienced biochemical failure, compared with 32% of men with palpable and positive DNA-PKcs staining that recurred. Conclusions: Our results suggest that DNA-PKcs could be a predictive marker of BCR after brachytherapy, and this might be a useful tool for optimizing the choice of treatment in low-risk PCa patients.« less
Multivariate Statistical Analysis of MSL APXS Bulk Geochemical Data
NASA Astrophysics Data System (ADS)
Hamilton, V. E.; Edwards, C. S.; Thompson, L. M.; Schmidt, M. E.
2014-12-01
We apply cluster and factor analyses to bulk chemical data of 130 soil and rock samples measured by the Alpha Particle X-ray Spectrometer (APXS) on the Mars Science Laboratory (MSL) rover Curiosity through sol 650. Multivariate approaches such as principal components analysis (PCA), cluster analysis, and factor analysis compliment more traditional approaches (e.g., Harker diagrams), with the advantage of simultaneously examining the relationships between multiple variables for large numbers of samples. Principal components analysis has been applied with success to APXS, Pancam, and Mössbauer data from the Mars Exploration Rovers. Factor analysis and cluster analysis have been applied with success to thermal infrared (TIR) spectral data of Mars. Cluster analyses group the input data by similarity, where there are a number of different methods for defining similarity (hierarchical, density, distribution, etc.). For example, without any assumptions about the chemical contributions of surface dust, preliminary hierarchical and K-means cluster analyses clearly distinguish the physically adjacent rock targets Windjana and Stephen as being distinctly different than lithologies observed prior to Curiosity's arrival at The Kimberley. In addition, they are separated from each other, consistent with chemical trends observed in variation diagrams but without requiring assumptions about chemical relationships. We will discuss the variation in cluster analysis results as a function of clustering method and pre-processing (e.g., log transformation, correction for dust cover) and implications for interpreting chemical data. Factor analysis shares some similarities with PCA, and examines the variability among observed components of a dataset so as to reveal variations attributable to unobserved components. Factor analysis has been used to extract the TIR spectra of components that are typically observed in mixtures and only rarely in isolation; there is the potential for similar results with data from APXS. These techniques offer new ways to understand the chemical relationships between the materials interrogated by Curiosity, and potentially their relation to materials observed by APXS instruments on other landed missions.
Tallon, Lucile; Luangphakdy, Devillier; Ruffion, Alain; Colombel, Marc; Devonec, Marian; Champetier, Denis; Paparel, Philippe; Decaussin-Petrucci, Myriam; Perrin, Paul; Vlaeminck-Guillem, Virginie
2014-07-30
It has been suggested that urinary PCA3 and TMPRSS2:ERG fusion tests and serum PHI correlate to cancer aggressiveness-related pathological criteria at prostatectomy. To evaluate and compare their ability in predicting prostate cancer aggressiveness, PHI and urinary PCA3 and TMPRSS2:ERG (T2) scores were assessed in 154 patients who underwent radical prostatectomy for biopsy-proven prostate cancer. Univariate and multivariate analyses using logistic regression and decision curve analyses were performed. All three markers were predictors of a tumor volume≥0.5 mL. Only PHI predicted Gleason score≥7. T2 score and PHI were both independent predictors of extracapsular extension(≥pT3), while multifocality was only predicted by PCA3 score. Moreover, when compared to a base model (age, digital rectal examination, serum PSA, and Gleason sum at biopsy), the addition of both PCA3 score and PHI to the base model induced a significant increase (+12%) when predicting tumor volume>0.5 mL. PHI and urinary PCA3 and T2 scores can be considered as complementary predictors of cancer aggressiveness at prostatectomy.
Use of multivariate statistics to identify unreliable data obtained using CASA.
Martínez, Luis Becerril; Crispín, Rubén Huerta; Mendoza, Maximino Méndez; Gallegos, Oswaldo Hernández; Martínez, Andrés Aragón
2013-06-01
In order to identify unreliable data in a dataset of motility parameters obtained from a pilot study acquired by a veterinarian with experience in boar semen handling, but without experience in the operation of a computer assisted sperm analysis (CASA) system, a multivariate graphical and statistical analysis was performed. Sixteen boar semen samples were aliquoted then incubated with varying concentrations of progesterone from 0 to 3.33 µg/ml and analyzed in a CASA system. After standardization of the data, Chernoff faces were pictured for each measurement, and a principal component analysis (PCA) was used to reduce the dimensionality and pre-process the data before hierarchical clustering. The first twelve individual measurements showed abnormal features when Chernoff faces were drawn. PCA revealed that principal components 1 and 2 explained 63.08% of the variance in the dataset. Values of principal components for each individual measurement of semen samples were mapped to identify differences among treatment or among boars. Twelve individual measurements presented low values of principal component 1. Confidence ellipses on the map of principal components showed no statistically significant effects for treatment or boar. Hierarchical clustering realized on two first principal components produced three clusters. Cluster 1 contained evaluations of the two first samples in each treatment, each one of a different boar. With the exception of one individual measurement, all other measurements in cluster 1 were the same as observed in abnormal Chernoff faces. Unreliable data in cluster 1 are probably related to the operator inexperience with a CASA system. These findings could be used to objectively evaluate the skill level of an operator of a CASA system. This may be particularly useful in the quality control of semen analysis using CASA systems.
Selecting minimum dataset soil variables using PLSR as a regressive multivariate method
NASA Astrophysics Data System (ADS)
Stellacci, Anna Maria; Armenise, Elena; Castellini, Mirko; Rossi, Roberta; Vitti, Carolina; Leogrande, Rita; De Benedetto, Daniela; Ferrara, Rossana M.; Vivaldi, Gaetano A.
2017-04-01
Long-term field experiments and science-based tools that characterize soil status (namely the soil quality indices, SQIs) assume a strategic role in assessing the effect of agronomic techniques and thus in improving soil management especially in marginal environments. Selecting key soil variables able to best represent soil status is a critical step for the calculation of SQIs. Current studies show the effectiveness of statistical methods for variable selection to extract relevant information deriving from multivariate datasets. Principal component analysis (PCA) has been mainly used, however supervised multivariate methods and regressive techniques are progressively being evaluated (Armenise et al., 2013; de Paul Obade et al., 2016; Pulido Moncada et al., 2014). The present study explores the effectiveness of partial least square regression (PLSR) in selecting critical soil variables, using a dataset comparing conventional tillage and sod-seeding on durum wheat. The results were compared to those obtained using PCA and stepwise discriminant analysis (SDA). The soil data derived from a long-term field experiment in Southern Italy. On samples collected in April 2015, the following set of variables was quantified: (i) chemical: total organic carbon and nitrogen (TOC and TN), alkali-extractable C (TEC and humic substances - HA-FA), water extractable N and organic C (WEN and WEOC), Olsen extractable P, exchangeable cations, pH and EC; (ii) physical: texture, dry bulk density (BD), macroporosity (Pmac), air capacity (AC), and relative field capacity (RFC); (iii) biological: carbon of the microbial biomass quantified with the fumigation-extraction method. PCA and SDA were previously applied to the multivariate dataset (Stellacci et al., 2016). PLSR was carried out on mean centered and variance scaled data of predictors (soil variables) and response (wheat yield) variables using the PLS procedure of SAS/STAT. In addition, variable importance for projection (VIP) statistics was used to quantitatively assess the predictors most relevant for response variable estimation and then for variable selection (Andersen and Bro, 2010). PCA and SDA returned TOC and RFC as influential variables both on the set of chemical and physical data analyzed separately as well as on the whole dataset (Stellacci et al., 2016). Highly weighted variables in PCA were also TEC, followed by K, and AC, followed by Pmac and BD, in the first PC (41.2% of total variance); Olsen P and HA-FA in the second PC (12.6%), Ca in the third (10.6%) component. Variables enabling maximum discrimination among treatments for SDA were WEOC, on the whole dataset, humic substances, followed by Olsen P, EC and clay, in the separate data analyses. The highest PLS-VIP statistics were recorded for Olsen P and Pmac, followed by TOC, TEC, pH and Mg for chemical variables and clay, RFC and AC for the physical variables. Results show that different methods may provide different ranking of the selected variables and the presence of a response variable, in regressive techniques, may affect variable selection. Further investigation with different response variables and with multi-year datasets would allow to better define advantages and limits of single or combined approaches. Acknowledgment The work was supported by the projects "BIOTILLAGE, approcci innovative per il miglioramento delle performances ambientali e produttive dei sistemi cerealicoli no-tillage", financed by PSR-Basilicata 2007-2013, and "DESERT, Low-cost water desalination and sensor technology compact module" financed by ERANET-WATERWORKS 2014. References Andersen C.M. and Bro R., 2010. Variable selection in regression - a tutorial. Journal of Chemometrics, 24 728-737. Armenise et al., 2013. Developing a soil quality index to compare soil fitness for agricultural use under different managements in the mediterranean environment. Soil and Tillage Research, 130:91-98. de Paul Obade et al., 2016. A standardized soil quality index for diverse field conditions. Sci. Total Env. 541:424-434. Pulido Moncada et al., 2014. Data-driven analysis of soil quality indicators using limited data. Geoderma, 235:271-278. Stellacci et al., 2016. Comparison of different multivariate methods to select key soil variables for soil quality indices computation. XLV Congress of the Italian Society of Agronomy (SIA), Sassari, 20-22 September 2016.
Lourenço, Vera; Herdling, Thorsten; Reich, Gabriele; Menezes, José C; Lochmann, Dirk
2011-08-01
A set of 192 fluid bed granulation batches at industrial scale were in-line monitored using microwave resonance technology (MRT) to determine moisture, temperature and density of the granules. Multivariate data analysis techniques such as multiway partial least squares (PLS), multiway principal component analysis (PCA) and multivariate batch control charts were applied onto collected batch data sets. The combination of all these techniques, along with off-line particle size measurements, led to significantly increased process understanding. A seasonality effect could be put into evidence that impacted further processing through its influence on the final granule size. Moreover, it was demonstrated by means of a PLS that a relation between the particle size and the MRT measurements can be quantitatively defined, highlighting a potential ability of the MRT sensor to predict information about the final granule size. This study has contributed to improve a fluid bed granulation process, and the process knowledge obtained shows that the product quality can be built in process design, following Quality by Design (QbD) and Process Analytical Technology (PAT) principles. Copyright © 2011. Published by Elsevier B.V.
Pilatti, Fernanda Kokowicz; Ramlov, Fernanda; Schmidt, Eder Carlos; Costa, Christopher; Oliveira, Eva Regina de; Bauer, Claudia M; Rocha, Miguel; Bouzon, Zenilda Laurita; Maraschin, Marcelo
2017-01-30
Fossil fuels, e.g. gasoline and diesel oil, account for substantial share of the pollution that affects marine ecosystems. Environmental metabolomics is an emerging field that may help unravel the effect of these xenobiotics on seaweeds and provide methodologies for biomonitoring coastal ecosystems. In the present study, FTIR and multivariate analysis were used to discriminate metabolic profiles of Ulva lactuca after in vitro exposure to diesel oil and gasoline, in combinations of concentrations (0.001%, 0.01%, 0.1%, and 1.0% - v/v) and times of exposure (30min, 1h, 12h, and 24h). PCA and HCA performed on entire mid-infrared spectral window were able to discriminate diesel oil-exposed thalli from the gasoline-exposed ones. HCA performed on spectral window related to the protein absorbance (1700-1500cm -1 ) enabled the best discrimination between gasoline-exposed samples regarding the time of exposure, and between diesel oil-exposed samples according to the concentration. The results indicate that the combination of FTIR with multivariate analysis is a simple and efficient methodology for metabolic profiling with potential use for biomonitoring strategies. Copyright © 2016 Elsevier Ltd. All rights reserved.
Ríos-Reina, Rocío; Morales, M Lourdes; García-González, Diego L; Amigo, José M; Callejón, Raquel M
2018-03-01
High-quality wine vinegars have been registered in Spain under protected designation of origin (PDO): "Vinagre de Jerez", "Vinagre de Condado de Huelva" and "Vinagre de Montilla-Moriles". The raw material, production and aging processes determine their quality and their aromatic composition. Vinegar volatile profile is usually analyzed by gas chromatography-mass spectrometry (GC-MS), being necessary a previous extraction step. Thus, three different sampling methods (Headspace solid phase microextraction "HS-SPME", Headspace stir bar sorptive extraction "HSSE" and Dynamic headspace extraction "DHS") were studied for the analysis of the volatile composition of Spanish PDO wine vinegars. Multivariate curve resolution (MCR) was used to solve chromatographic problems, improving the results obtained. Principal component analysis (PCA) showed that not all the sampling methods were equally suitable for the characterization and differentiation between PDOs and categories, being HSSE the technique that made able the best vinegar characterization. Copyright © 2017 Elsevier Ltd. All rights reserved.
Detection of Leukemia with Blood Samples Using Raman Spectroscopy and Multivariate Analysis
NASA Astrophysics Data System (ADS)
Martínez-Espinosa, J. C.; González-Solís, J. L.; Frausto-Reyes, C.; Miranda-Beltrán, M. L.; Soria-Fregoso, C.; Medina-Valtierra, J.
2009-06-01
The use of Raman spectroscopy to analyze blood biochemistry and hence distinguish between normal and abnormal blood was investigated. Blood samples were obtained from 6 patients who were clinically diagnosed with leukemia and 6 healthy volunteers. The imprint was put under the microscope and several points were chosen for Raman measurement. All the spectra were collected by a confocal Raman micro-spectroscopy (Renishaw) with a NIR 830 nm laser. It is shown that the serum samples from patients with leukemia and from the control group can be discriminated when the multivariate statistical methods of principal component analysis (PCA) and linear discriminated analysis (LDA) are applied to their Raman spectra. The ratios of some band intensities were analyzed and some band ratios were significant and corresponded to proteins, phospholipids, and polysaccharides. The preliminary results suggest that Raman Spectroscopy could be a new technique to study the degree of damage to the bone marrow using just blood samples instead of biopsies, treatment very painful for patients.
Study of archaeological coins of different dynasties using libs coupled with multivariate analysis
NASA Astrophysics Data System (ADS)
Awasthi, Shikha; Kumar, Rohit; Rai, G. K.; Rai, A. K.
2016-04-01
Laser Induced Breakdown Spectroscopy (LIBS) is an atomic emission spectroscopic technique having unique capability of an in-situ monitoring tool for detection and quantification of elements present in different artifacts. Archaeological coins collected form G.R. Sharma Memorial Museum; University of Allahabad, India has been analyzed using LIBS technique. These coins were obtained from excavation of Kausambi, Uttar Pradesh, India. LIBS system assembled in the laboratory (laser Nd:YAG 532 nm, 4 ns pulse width FWHM with Ocean Optics LIBS 2000+ spectrometer) is employed for spectral acquisition. The spectral lines of Ag, Cu, Ca, Sn, Si, Fe and Mg are identified in the LIBS spectra of different coins. LIBS along with Multivariate Analysis play an effective role for classification and contribution of spectral lines in different coins. The discrimination between five coins with Archaeological interest has been carried out using Principal Component Analysis (PCA). The results show the potential relevancy of the methodology used in the elemental identification and classification of artifacts with high accuracy and robustness.
H, Maulidiani; Khatib, Alfi; Shaari, Khozirah; Abas, Faridah; Shitan, Mahendran; Kneer, Ralf; Neto, Victor; Lajis, Nordin H
2012-01-11
The metabolites of three species of Apiaceae, also known as Pegaga, were analyzed utilizing (1)H NMR spectroscopy and multivariate data analysis. Principal component analysis (PCA) and hierarchical cluster analysis (HCA) resolved the species, Centella asiatica, Hydrocotyle bonariensis, and Hydrocotyle sibthorpioides, into three clusters. The saponins, asiaticoside and madecassoside, along with chlorogenic acids were the metabolites that contributed most to the separation. Furthermore, the effects of growth-lighting condition to metabolite contents were also investigated. The extracts of C. asiatica grown in full-day light exposure exhibited a stronger radical scavenging activity and contained more triterpenes (asiaticoside and madecassoside), flavonoids, and chlorogenic acids as compared to plants grown in 50% shade. This study established the potential of using a combination of (1)H NMR spectroscopy and multivariate data analyses in differentiating three closely related species and the effects of growth lighting, based on their metabolite contents and identification of the markers contributing to their differences.
Hou, Deyi; O'Connor, David; Nathanail, Paul; Tian, Li; Ma, Yan
2017-12-01
Heavy metal soil contamination is associated with potential toxicity to humans or ecotoxicity. Scholars have increasingly used a combination of geographical information science (GIS) with geostatistical and multivariate statistical analysis techniques to examine the spatial distribution of heavy metals in soils at a regional scale. A review of such studies showed that most soil sampling programs were based on grid patterns and composite sampling methodologies. Many programs intended to characterize various soil types and land use types. The most often used sampling depth intervals were 0-0.10 m, or 0-0.20 m, below surface; and the sampling densities used ranged from 0.0004 to 6.1 samples per km 2 , with a median of 0.4 samples per km 2 . The most widely used spatial interpolators were inverse distance weighted interpolation and ordinary kriging; and the most often used multivariate statistical analysis techniques were principal component analysis and cluster analysis. The review also identified several determining and correlating factors in heavy metal distribution in soils, including soil type, soil pH, soil organic matter, land use type, Fe, Al, and heavy metal concentrations. The major natural and anthropogenic sources of heavy metals were found to derive from lithogenic origin, roadway and transportation, atmospheric deposition, wastewater and runoff from industrial and mining facilities, fertilizer application, livestock manure, and sewage sludge. This review argues that the full potential of integrated GIS and multivariate statistical analysis for assessing heavy metal distribution in soils on a regional scale has not yet been fully realized. It is proposed that future research be conducted to map multivariate results in GIS to pinpoint specific anthropogenic sources, to analyze temporal trends in addition to spatial patterns, to optimize modeling parameters, and to expand the use of different multivariate analysis tools beyond principal component analysis (PCA) and cluster analysis (CA). Copyright © 2017 Elsevier Ltd. All rights reserved.
Katayama, K; Sato, T; Arai, T; Amao, H; Ohta, Y; Ozawa, T; Kenyon, P R; Hickson, R E; Tazaki, H
2013-02-01
Simple liquid chromatography-mass spectrometry (LC-MS) was applied to non-targeted metabolic analyses to discover new metabolic markers in animal plasma. Principle component analysis (PCA) and partial least squares-discriminate analysis (PLS-DA) were used to analyse LC-MS multivariate data. PCA clearly generated two separate clusters for artificially induced diabetic mice and healthy control mice. PLS-DA of time-course changes in plasma metabolites of chicks after feeding generated three clusters (pre- and immediately after feeding, 0.5-3 h after feeding and 4 h after feeding). Two separate clusters were also generated for plasma metabolites of pregnant Angus heifers with differing live-weight change profiles (gaining or losing). The accompanying PLS-DA loading plot detailed the metabolites that contribute the most to the cluster separation. In each case, the same highly hydrophilic metabolite was strongly correlated to the group separation. The metabolite was identified as betaine by LC-MS/MS. This result indicates that betaine and its metabolic precursor, choline, may be useful biomarkers to evaluate the nutritional and metabolic status of animals. © 2011 Blackwell Verlag GmbH.
Son, Hong-Seok; Kim, Ki Myong; van den Berg, Frans; Hwang, Geum-Sook; Park, Won-Mok; Lee, Cherl-Ho; Hong, Young-Shick
2008-09-10
(1)H NMR spectroscopy was used to investigate the metabolic differences in wines produced from different grape varieties and different regions. A significant separation among wines from Campbell Early, Cabernet Sauvignon, and Shiraz grapes was observed using principal component analysis (PCA) and partial least squares-discriminant analysis (PLS-DA). The metabolites contributing to the separation were assigned to be 2,3-butanediol, lactate, acetate, proline, succinate, malate, glycerol, tartarate, glucose, and phenolic compounds by PCA and PLS-DA loading plots. Wines produced from Cabernet Sauvignon grapes harvested in the continental areas of Australia, France, and California were also separated. PLS-DA loading plots revealed that the level of proline in Californian Cabernet Sauvignon wines was higher than that in Australian and French Cabernet Sauvignon, Australian Shiraz, and Korean Campbell Early wines, showing that the chemical composition of the grape berries varies with the variety and growing area. This study highlights the applicability of NMR-based metabolomics with multivariate statistical data sets in determining wine quality and product origin.
Liu, Yu; Zhang, Xufeng; Li, Ying; Wang, Haixia
2017-11-01
Geographical origin traceability is an important issue for controlling the quality of seafood and safeguarding the interest of consumers. In the present study, a new method of compound-specific isotope analysis (CSIA) of fatty acids was established to evaluate its applicability in establishing the origin traceability of Apostichopus japonicus in the coastal areas of China. Moreover, principal component analysis (PCA) and discriminant analysis (DA) were applied to distinguish between the origins of A. japonicus. The results show that the stable carbon isotope compositions of fatty acids of A. japonicus significantly differ in terms of both season and origin. They also indicate that the stable carbon isotope composition of fatty acids could effectively discriminate between the origins of A. japonicus, except for between Changhai Island and Zhangzi Island in the spring of 2016 because of geographical proximity or the similarity of food sources. The fatty acids that have the highest contribution to identifying the geographical origins of A. japonicus are C22:6n-3, C16:1n-7, C20:5n-3, C18:0 and C23:1n-9, when considering the fatty acid contents, the stable carbon isotope composition of fatty acids and the results of the PCA and DA. We conclude that CSIA of fatty acids, combined with multivariate statistical analysis such as PCA and DA, may be an effective tool for establishing the traceability of A. japonicus in the coastal areas of China. The relevant conclusions of the present study provide a new method for determining the traceability of seafood or other food products. © 2017 Society of Chemical Industry. © 2017 Society of Chemical Industry.
Soneson, Charlotte; Lilljebjörn, Henrik; Fioretos, Thoas; Fontes, Magnus
2010-04-15
With the rapid development of new genetic measurement methods, several types of genetic alterations can be quantified in a high-throughput manner. While the initial focus has been on investigating each data set separately, there is an increasing interest in studying the correlation structure between two or more data sets. Multivariate methods based on Canonical Correlation Analysis (CCA) have been proposed for integrating paired genetic data sets. The high dimensionality of microarray data imposes computational difficulties, which have been addressed for instance by studying the covariance structure of the data, or by reducing the number of variables prior to applying the CCA. In this work, we propose a new method for analyzing high-dimensional paired genetic data sets, which mainly emphasizes the correlation structure and still permits efficient application to very large data sets. The method is implemented by translating a regularized CCA to its dual form, where the computational complexity depends mainly on the number of samples instead of the number of variables. The optimal regularization parameters are chosen by cross-validation. We apply the regularized dual CCA, as well as a classical CCA preceded by a dimension-reducing Principal Components Analysis (PCA), to a paired data set of gene expression changes and copy number alterations in leukemia. Using the correlation-maximizing methods, regularized dual CCA and PCA+CCA, we show that without pre-selection of known disease-relevant genes, and without using information about clinical class membership, an exploratory analysis singles out two patient groups, corresponding to well-known leukemia subtypes. Furthermore, the variables showing the highest relevance to the extracted features agree with previous biological knowledge concerning copy number alterations and gene expression changes in these subtypes. Finally, the correlation-maximizing methods are shown to yield results which are more biologically interpretable than those resulting from a covariance-maximizing method, and provide different insight compared to when each variable set is studied separately using PCA. We conclude that regularized dual CCA as well as PCA+CCA are useful methods for exploratory analysis of paired genetic data sets, and can be efficiently implemented also when the number of variables is very large.
Seisen, Thomas; Rouprêt, Morgan; Brault, Didier; Léon, Priscilla; Cancel-Tassin, Géraldine; Compérat, Eva; Renard-Penna, Raphaële; Mozer, Pierre; Guechot, Jérome; Cussenot, Olivier
2015-01-01
It remains unclear whether the Prostate Health Index (PHI) or the urinary Prostate-Cancer Antigen 3 (PCA-3) score is more accurate at screening for prostate cancer (PCa). The aim of this study was to prospectively compare the accuracy of PHI and PCA-3 scores to predict overall and significant PCa in men undergoing an initial prostate biopsy. Double-blind assessments of PHI and PCA-3 were conducted by referent physicians in 138 patients who subsequently underwent trans-rectal ultrasound-guided prostate biopsy according to a 12-core scheme. Predictive accuracies of PHI and PCA-3 were assessed using AUC and compared according to the DeLong method. Diagnostic performances with usual cut-off values for positivity (i.e., PHI >40 and PCA-3 >35) were calculated, and odds ratios associated with predicting PCa overall and significant PCa as defined by pathological updated Epstein criteria (i.e., Gleason score ≥7, more than three positive cores, or >50% cancer involvement in any core) were estimated using logistic regression. Prevalences of overall and significant PCa were 44.9% and 28.3%, respectively. PCA-3 (AUC = 0.71) was the most accurate predictor of PCa overall, and significantly outperformed PHI (AUC = 0.65; P = 0.03). However, PHI (AUC = 0.80) remained the most accurate predictor when screening exclusively for significant PCa and significantly outperformed PCA-3 (AUC = 0.55; P = 0.03). Furthermore, PCA-3 >35 had the best accuracy, and positive or negative predictive values when screening for PCa overall whereas these diagnostic performances were greater for PHI >40 when exclusively screening for significant PCa. PHI > 40 combined with PCA-3 > 35 was more specific in both cases. In multivariate analyses, PCA-3 >35 (OR = 5.68; 95%CI = [2.21-14.59]; P < 0.001) was significantly correlated with the presence of PCa overall, but PHI >40 (OR = 9.60; 95%CI = [1.72-91.32]; P = 0.001) was the only independent predictor for detecting significant PCa. Although PCA-3 score is the best predictor for PCa overall at initial biopsy, our findings strongly indicate that PHI should be used for population-based screening to avoid over-diagnosis of indolent tumors that are unlikely to cause death. © 2014 Wiley Periodicals, Inc.
NASA Astrophysics Data System (ADS)
Carneiro, Renato Lajarim; Poppi, Ronei Jesus
2014-01-01
In the present work the homogeneity of a pharmaceutical formulation presented as a cream was studied using infrared imaging spectroscopy and chemometric methodologies such as principal component analysis (PCA) and multivariate curve resolution with alternating least squares (MCR-ALS). A cream formulation, presented as an emulsion, was prepared using imiquimod as the active pharmaceutical ingredient (API) and the excipients: water, vaseline, an emulsifier and a carboxylic acid in order to dissolve the API. After exposure at 45 °C during 3 months to perform accelerated stability test, the presence of some crystals was observed, indicating homogeneity problems in the formulation. PCA exploratory analysis showed that the crystal composition was different from the composition of the emulsion, since the score maps presented crystal structures in the emulsion. MCR-ALS estimated the spectra of the crystals and the emulsion. The crystals presented amine and C-H bands, suggesting that the precipitate was a salt formed by carboxylic acid and imiquimod. These results indicate the potential of infrared imaging spectroscopy in conjunction with chemometric methodologies as an analytical tool to ensure the quality of cream formulations in the pharmaceutical industry.
Lee, Jae Won; Choi, Bo-Ram; Kim, Young-Chang; Choi, Doo Jin; Lee, Young-Seob; Kim, Geum-Soog; Baek, Nam-In; Kim, Seung-Yu; Lee, Dae Young
2017-12-04
The effective production and usage of ginsenosides, given their distinct pharmacological effects, are receiving increasing amounts of attention. As the ginsenosides content differs in different parts of Panax ginseng, we wanted to assess and compare the ginsenosides content in the ginseng roots, leave, stems, and berries. To extract the ginsenosides, 70% (v/v) methanol was used. The optimal ultra-performance liquid chromatography-quadrupole time of flight mass spectrometry (UPLC-QTOF/MS) method was used to profile various ginsenosides from the different parts of P. ginseng. The datasets were then subjected to multivariate analysis including principal component analysis (PCA) and hierarchical clustering analysis (HCA). A UPLC-QTOF/MS method with an in-house library was constructed to profile 58 ginsenosides. With this method, a total of 39 ginsenosides were successfully identified and quantified in the ginseng roots, leave, stem, and berries. PCA and HCA characterized the different ginsenosides compositions from the different parts. The quantitative ginsenoside contents were also characterized from each plant part. The results of this study indicate that the UPLC-QTOF/MS method can be an effective tool to characterize various ginsenosides from the different parts of P. ginseng.
Schnall, Rebecca; Bakken, Suzanne
2011-09-01
To assess the applicability of the Technology Acceptance Model (TAM) constructs in explaining HIV case managers' behavioural intention to use a continuity of care record (CCR) with context-specific links designed to meet their information needs. Data were collected from 94 case managers who provide care to persons living with HIV (PLWH) using an online survey comprising three components: (1) demographic information: age, gender, ethnicity, race, Internet usage and computer experience; (2) mock-up of CCR with context-specific links; and items related to TAM constructs. Data analysis included: principal components factor analysis (PCA), assessment of internal consistency reliability and univariate and multivariate analysis. PCA extracted three factors (Perceived Ease of Use, Perceived Usefulness and Perceived Barriers to Use), explained variance = 84.9%, Cronbach's ά = 0.69-0.91. In a linear regression model, Perceived Ease of Use, Perceived Usefulness and Perceived Barriers to Use explained 43.6% (p < 0.001) of the variance in Behavioural Intention to use a CCR with context-specific links. Our study contributes to the evidence base regarding TAM in health care through expanding the type of professional surveyed, study setting and Health Information Technology assessed.
Zhu, Yanzhong; Song, Yonghui; Yu, Huibin; Liu, Ruixia; Liu, Lusan; Lv, Chunjian
2017-08-08
UV-visible absorption spectroscopy coupled with principal component analysis (PCA) and hierarchical cluster analysis (HCA) was applied to characterize spectroscopic components, detect latent factors, and investigate spatial variations of dissolved organic matter (DOM) in a large-scale lake. Twelve surface water samples were collected from Dongjianghu Lake in China. DOM contained lignin and quinine moieties, carboxylic acid, microbial products, and aromatic and alkyl groups, which in the northern part of the lake was largely different from the southern part. Fifteen spectroscopic indices were deduced from the absorption spectra to indicate molecular weight or humification degree of DOM. The northern part of the lake presented the smaller molecular weight or the lower humification degree of DOM than the southern part. E 2/4 , E 3/4 , E 2/3 , and S 2 were latent factors of characterizing the molecular weight of DOM, while E 2/5 , E 3/5 , E 2/6 , E 4/5 , E 3/6 , and A 2/1 were latent factors of evaluating the humification degree of DOM. The UV-visible absorption spectroscopy combined with PCA and HCA may not only characterize DOM fractions of lakes, but may be transferred to other types of waterscape.
Hadjisolomou, Ekaterini; Stefanidis, Konstantinos; Papatheodorou, George; Papastergiadou, Evanthia
2018-03-19
During the last decades, Mediterranean freshwater ecosystems, especially lakes, have been under severe pressure due to increasing eutrophication and water quality deterioration. In this article, we compared the effectiveness of different data analysis methods by assessing the contribution of environmental parameters to eutrophication processes. For this purpose, principal components analysis (PCA), cluster analysis, and a self-organizing map (SOM) were applied, using water quality data from two transboundary lakes of North Greece. SOM is considered as an advanced and powerful data analysis tool because of its ability to represent complex and nonlinear relationships among multivariate data sets. The results of PCA and cluster analysis agreed with the SOM results, although the latter provided more information because of the visualization abilities regarding the parameters' relationships. Besides nutrients that were found to be a key factor for controlling chlorophyll-a (Chl - a), water temperature was related positively with algal production, while the Secchi disk depth parameter was found to be highly important and negatively related toeutrophic conditions. In general, the SOM results were more specific and allowed direct associations between the water quality variables. Our work showed that SOMs can be used effectively in limnological studies to produce robust and interpretable results, aiding scientists and managers to cope with environmental problems such as eutrophication.
Loeb, Stacy; Drevin, Linda; Robinson, David; Holmberg, Erik; Carlsson, Sigrid; Lambe, Mats; Stattin, Pär
2016-01-01
Purpose Prostate cancer (PCa) incidence and prognosis vary geographically. We examined possible differences in PCa risk by clinical risk category between native-born and immigrant populations in Sweden. Our hypothesis was that lower PSA-testing uptake among foreign-born men would result in lower rates of localized disease, and similar or higher risk of metastatic disease. Methods Using the Prostate Cancer database Sweden (PCBaSe), we identified 117,328 men with PCa diagnosed from 1991–2008, of which 8,332 were foreign-born. For each case, 5 cancer-free matched controls were randomly selected from the population register. Conditional logistic regression was used to compare low-risk, intermediate-risk, high-risk, regionally metastatic, and distant metastatic PCa based upon region of origin. Results Across all risk categories, immigrants had significantly lower PCa risk than native-born Swedish men, except North Americans and Northern Europeans. The lowest PCa risk was observed in men from the Middle East, Southern Europe and Asia. Multivariable adjustment for socioeconomic factors and comorbidities did not materially change risk estimates. Older age at immigration and more recent arrival in Sweden were associated with lower PCa risk. Non-native men were less likely to be diagnosed with PCa through PSA-testing during a health check-up. Conclusions The risk for all stages of PCa was lower among first-generation immigrants to Sweden compared to native-born men. Older age at immigration and more recent immigration were associated with particularly low risks. Patterns of PSA testing appeared to only partly explain the differences in PCa risk, since immigrant men also had a lower risk of metastatic disease. PMID:23266834
Multivariate Analysis and Prediction of Dioxin-Furan ...
Peer Review Draft of Regional Methods Initiative Final Report Dioxins, which are bioaccumulative and environmentally persistent, pose an ongoing risk to human and ecosystem health. Fish constitute a significant source of dioxin exposure for humans and fish-eating wildlife. Current dioxin analytical methods are costly, time-consuming, and produce hazardous by-products. A Danish team developed a novel, multivariate statistical methodology based on the covariance of dioxin-furan congener Toxic Equivalences (TEQs) and fatty acid methyl esters (FAMEs) and applied it to North Atlantic Ocean fishmeal samples. The goal of the current study was to attempt to extend this Danish methodology to 77 whole and composite fish samples from three trophic groups: predator (whole largemouth bass), benthic (whole flathead and channel catfish) and forage fish (composite bluegill, pumpkinseed and green sunfish) from two dioxin contaminated rivers (Pocatalico R. and Kanawha R.) in West Virginia, USA. Multivariate statistical analyses, including, Principal Components Analysis (PCA), Hierarchical Clustering, and Partial Least Squares Regression (PLS), were used to assess the relationship between the FAMEs and TEQs in these dioxin contaminated freshwater fish from the Kanawha and Pocatalico Rivers. These three multivariate statistical methods all confirm that the pattern of Fatty Acid Methyl Esters (FAMEs) in these freshwater fish covaries with and is predictive of the WHO TE
Selvarasu, Suresh; Kim, Do Yun; Karimi, Iftekhar A; Lee, Dong-Yup
2010-10-01
We present an integrated framework for characterizing fed-batch cultures of mouse hybridoma cells producing monoclonal antibody (mAb). This framework systematically combines data preprocessing, elemental balancing and statistical analysis technique. Initially, specific rates of cell growth, glucose/amino acid consumptions and mAb/metabolite productions were calculated via curve fitting using logistic equations, with subsequent elemental balancing of the preprocessed data indicating the presence of experimental measurement errors. Multivariate statistical analysis was then employed to understand physiological characteristics of the cellular system. The results from principal component analysis (PCA) revealed three major clusters of amino acids with similar trends in their consumption profiles: (i) arginine, threonine and serine, (ii) glycine, tyrosine, phenylalanine, methionine, histidine and asparagine, and (iii) lysine, valine and isoleucine. Further analysis using partial least square (PLS) regression identified key amino acids which were positively or negatively correlated with the cell growth, mAb production and the generation of lactate and ammonia. Based on these results, the optimal concentrations of key amino acids in the feed medium can be inferred, potentially leading to an increase in cell viability and productivity, as well as a decrease in toxic waste production. The study demonstrated how the current methodological framework using multivariate statistical analysis techniques can serve as a potential tool for deriving rational medium design strategies. Copyright © 2010 Elsevier B.V. All rights reserved.
Pec, Jaroslav; Flores-Sanchez, Isvett Josefina; Choi, Young Hae; Verpoorte, Robert
2010-07-01
Cannabis sativa L. plants produce a diverse array of secondary metabolites. Cannabis cell cultures were treated with jasmonic acid (JA) and pectin as elicitors to evaluate their effect on metabolism from two cell lines using NMR spectroscopy and multivariate data analysis. According to principal component analysis (PCA) and partial least square-discriminant analysis (PLS-DA), the chloroform extract of the pectin-treated cultures were more different than control and JA-treated cultures; but in the methanol/water extract the metabolome of the JA-treated cells showed clear differences with control and pectin-treated cultures. Tyrosol, an antioxidant metabolite, was detected in cannabis cell cultures. The tyrosol content increased after eliciting with JA.
NASA Astrophysics Data System (ADS)
Ryu, D.; Liu, S.; Western, A. W.; Webb, J. A.; Lintern, A.; Leahy, P.; Wilson, P.; Watson, M.; Waters, D.; Bende-Michl, U.
2016-12-01
The Great Barrier Reef (GBR) lagoon has been experiencing significant water quality deterioration due in part to agricultural intensification and urban settlement in adjacent catchments. The degradation of water quality in rivers is caused by land-derived pollutants (i.e. sediment, nutrient and pesticide). A better understanding of dynamics of water quality is essential for land management to improve the GBR ecosystem. However, water quality is also greatly influenced by natural hydrological processes. To assess influencing factors and predict the water quality accurately, selection of the most important predictors of water quality is necessary. In this work, multivariate statistical techniques - cluster analysis (CA), principal component analysis (PCA) and factor analysis (FA) - are used to reduce the complexity derived from the multidimensional water quality monitoring data. Seventeen stations are selected across the GBR catchments, and the event-based measurements of 12 variables monitored during 9 years (2006 - 2014) were analysed by means of CA and PCA/FA. The key findings are: (1) 17 stations can be grouped into two clusters according to the hierarchical CA, and the spatial dissimilarity between these sites is characterised by the different climatic and land use in the GBR catchments. (2) PCA results indicate that the first 3 PCs explain 85% of the total variance, and FA on the entire data set shows that the varifactor (VF) loadings can be used to interpret the sources of spatial variation in water quality on the GBR catchments level. The impact of soil erosion and non-point source of pollutants from agriculture contribution to VF1 and the variability in hydrological conditions and biogeochemical processes can explain the loadings in VF2. (3) FA is also performed on two groups of sites identified in CA individually, to evaluate the underlying sources that are responsible for spatial variability in water quality in the two groups. For the Cluster 1 sites, spatial variations in water quality are likely from the agricultural inputs (fertilises) and for the Cluster 2 sites, the differences in hydrological transport is responsible for large spatial variations in water quality. These findings can be applied to water quality assessment along with establish effective water and land management in the future.
Application of Multivariate Statistical Analysis to Biomarkers in Se-Turkey Crude Oils
NASA Astrophysics Data System (ADS)
Gürgey, K.; Canbolat, S.
2017-11-01
Twenty-four crude oil samples were collected from the 24 oil fields distributed in different districts of SE-Turkey. API and Sulphur content (%), Stable Carbon Isotope, Gas Chromatography (GC), and Gas Chromatography-Mass Spectrometry (GC-MS) data were used to construct a geochemical data matrix. The aim of this study is to examine the genetic grouping or correlations in the crude oil samples, hence the number of source rocks present in the SE-Turkey. To achieve these aims, two of the multivariate statistical analysis techniques (Principle Component Analysis [PCA] and Cluster Analysis were applied to data matrix of 24 samples and 8 source specific biomarker variables/parameters. The results showed that there are 3 genetically different oil groups: Batman-Nusaybin Oils, Adıyaman-Kozluk Oils and Diyarbakir Oils, in addition to a one mixed group. These groupings imply that at least, three different source rocks are present in South-Eastern (SE) Turkey. Grouping of the crude oil samples appears to be consistent with the geographic locations of the oils fields, subsurface stratigraphy as well as geology of the area.
Thomassen, Yvonne E; van Sprang, Eric N M; van der Pol, Leo A; Bakker, Wilfried A M
2010-09-01
Historical manufacturing data can potentially harbor a wealth of information for process optimization and enhancement of efficiency and robustness. To extract useful data multivariate data analysis (MVDA) using projection methods is often applied. In this contribution, the results obtained from applying MVDA on data from inactivated polio vaccine (IPV) production runs are described. Data from over 50 batches at two different production scales (700-L and 1,500-L) were available. The explorative analysis performed on single unit operations indicated consistent manufacturing. Known outliers (e.g., rejected batches) were identified using principal component analysis (PCA). The source of operational variation was pinpointed to variation of input such as media. Other relevant process parameters were in control and, using this manufacturing data, could not be correlated to product quality attributes. The gained knowledge of the IPV production process, not only from the MVDA, but also from digitalizing the available historical data, has proven to be useful for troubleshooting, understanding limitations of available data and seeing the opportunity for improvements. 2010 Wiley Periodicals, Inc.
NASA Astrophysics Data System (ADS)
Prochazka, D.; Mazura, M.; Samek, O.; Rebrošová, K.; Pořízka, P.; Klus, J.; Prochazková, P.; Novotný, J.; Novotný, K.; Kaiser, J.
2018-01-01
In this work, we investigate the impact of data provided by complementary laser-based spectroscopic methods on multivariate classification accuracy. Discrimination and classification of five Staphylococcus bacterial strains and one strain of Escherichia coli is presented. The technique that we used for measurements is a combination of Raman spectroscopy and Laser-Induced Breakdown Spectroscopy (LIBS). Obtained spectroscopic data were then processed using Multivariate Data Analysis algorithms. Principal Components Analysis (PCA) was selected as the most suitable technique for visualization of bacterial strains data. To classify the bacterial strains, we used Neural Networks, namely a supervised version of Kohonen's self-organizing maps (SOM). We were processing results in three different ways - separately from LIBS measurements, from Raman measurements, and we also merged data from both mentioned methods. The three types of results were then compared. By applying the PCA to Raman spectroscopy data, we observed that two bacterial strains were fully distinguished from the rest of the data set. In the case of LIBS data, three bacterial strains were fully discriminated. Using a combination of data from both methods, we achieved the complete discrimination of all bacterial strains. All the data were classified with a high success rate using SOM algorithm. The most accurate classification was obtained using a combination of data from both techniques. The classification accuracy varied, depending on specific samples and techniques. As for LIBS, the classification accuracy ranged from 45% to 100%, as for Raman Spectroscopy from 50% to 100% and in case of merged data, all samples were classified correctly. Based on the results of the experiments presented in this work, we can assume that the combination of Raman spectroscopy and LIBS significantly enhances discrimination and classification accuracy of bacterial species and strains. The reason is the complementarity in obtained chemical information while using these two methods.
NASA Astrophysics Data System (ADS)
Dodson, J. B.; Taylor, P. C.
2016-12-01
The diurnal cycle of convection (CDC) greatly influences the water, radiative, and energy budgets in convectively active regions. For example, previous research of the Amazonian CDC has identified significant monthly covariability between the satellite-observed radiative and precipitation diurnal and multiple reanalysis-derived atmospheric state variables (ASVs) representing convective instability. However, disagreements between retrospective analysis products (reanalyses) over monthly ASV anomalies create significant uncertainty in the resulting covariability. Satellite observations of convective clouds can be used to characterize monthly anomalies in convective activity. CloudSat observes multiple properties of both deep convective cores and the associated anvils, and so is useful as an alternative to the use of reanalyses. CloudSat cannot observe the full diurnal cycle, but it can detect differences between daytime and nighttime convection. Initial efforts to use CloudSat data to characterize convective activity showed that the results are highly dependent on the choice of variable used to characterize the cloud. This is caused by a series of inverse relationships between convective frequency, cloud top height, radar reflectivity vertical profile, and other variables. A single, multi-variable index for convective activity based on CloudSat data may be useful to clarify the results. Principal component analysis (PCA) provides a method to create a multivariable index, where the first principal component (PC1) corresponds with convective instability. The time series of PC1 can then be used as a proxy for monthly variability in convective activity. The primary challenge presented involves determining the utility of PCA for creating a robust index for convective activity that accounts for the complex relationships of multiple convective cloud variables, and yields information about the interactions between convection, the convective environment, and radiation beyond the previous single-variable approaches. The choice of variables used to calculate PC1 may influence any results based on PC1, so it is necessary to test the sensitivity of the results to different variable combinations.
A Semi-parametric Multivariate Gap-filling Model for Eddy Covariance Latent Heat Flux
NASA Astrophysics Data System (ADS)
Li, M.; Chen, Y.
2010-12-01
Quantitative descriptions of latent heat fluxes are important to study the water and energy exchanges between terrestrial ecosystems and the atmosphere. The eddy covariance approaches have been recognized as the most reliable technique for measuring surface fluxes over time scales ranging from hours to years. However, unfavorable micrometeorological conditions, instrument failures, and applicable measurement limitations may cause inevitable flux gaps in time series data. Development and application of suitable gap-filling techniques are crucial to estimate long term fluxes. In this study, a semi-parametric multivariate gap-filling model was developed to fill latent heat flux gaps for eddy covariance measurements. Our approach combines the advantages of a multivariate statistical analysis (principal component analysis, PCA) and a nonlinear interpolation technique (K-nearest-neighbors, KNN). The PCA method was first used to resolve the multicollinearity relationships among various hydrometeorological factors, such as radiation, soil moisture deficit, LAI, and wind speed. The KNN method was then applied as a nonlinear interpolation tool to estimate the flux gaps as the weighted sum latent heat fluxes with the K-nearest distances in the PCs’ domain. Two years, 2008 and 2009, of eddy covariance and hydrometeorological data from a subtropical mixed evergreen forest (the Lien-Hua-Chih Site) were collected to calibrate and validate the proposed approach with artificial gaps after standard QC/QA procedures. The optimal K values and weighting factors were determined by the maximum likelihood test. The results of gap-filled latent heat fluxes conclude that developed model successful preserving energy balances of daily, monthly, and yearly time scales. Annual amounts of evapotranspiration from this study forest were 747 mm and 708 mm for 2008 and 2009, respectively. Nocturnal evapotranspiration was estimated with filled gaps and results are comparable with other studies. Seasonal and daily variability of latent heat fluxes were also discussed.
Fernandez, Anne C; Amoyal, Nicole R; Paiva, Andrea L; Prochaska, James O
2016-01-01
In the United States, 36% of human papillomavirus (HPV)-related cancers occur among men. HPV vaccination can substantially reduce the risk of HPV infection; however, the vast majority of men are unvaccinated. This study developed and validated transtheoretical model-based measures for HPV vaccination in young adult men. Cross-sectional measurement development. Online survey of young adult men. Three hundred twenty-nine mostly college-attending men, ages 18 to 26. Stage of change, decisional balance (pros/cons), and self-efficacy. The sample was randomly split into halves for exploratory principal components analysis (PCA), followed by confirmatory factor analyses (CFA) to test measurement models. Multivariate analyses examined relationships between scales. For decisional balance, PCA revealed two uncorrelated five-item factors (pros α = .78; cons α = .83). For the self-efficacy scale, PCA revealed a single-factor solution (α = .83). CFA confirmed that the two-factor uncorrelated model for decisional balance and a single-factor model for self-efficacy. Follow-up analyses of variance supported the theoretically predicted relationships between stage of change, pros, and self-efficacy. This study resulted in reliable and valid measures of pros and self-efficacy for HPV vaccination that can be used in future clinical research.
NASA Astrophysics Data System (ADS)
Ganiyu, S. A.; Badmus, B. S.; Olurin, O. T.; Ojekunle, Z. O.
2018-03-01
The variation of groundwater quality across different regions is of great importance in the study of groundwater so as to ascertain the sources of contaminants to available water sources. Geochemical assessment of groundwater samples from hand-dug wells were done within the vicinity of Ajakanga dumpsite, Ibadan, Southwestern, Nigeria, with the aim of assessing their suitability for domestic and irrigation purposes. Ten groundwater samples were collected both in dry and wet seasons for analysis of physicochemical parameters such as: pH, EC, TDS, Na+, K+, Ca2+, Mg2+, HCO3^{ - } Cl-, SO4^{2 - }, NO3^{2 - } principal component analysis (PCA) and cluster analysis (CA) were used to determine probable sources of groundwater contamination. The results of the analyses showed the groundwater samples to be within permissible limits of WHO/NSDWQ, while elevated values of concentrations of most analyzed chemical constituents in water samples were noticed in S1 and S10 due to their nearness to the dumpsite and agricultural overflow, respectively. Groundwater in the study area is of hard, fresh and alkaline nature. There are very strong associations between EC and TDS, HCO3^{ - } and CO3^{2 - } in both seasons. PCA identified five and three major factors accounting for 95.7 and 88.7% of total variation in water quality for dry and wet seasons, respectively. PCA also identified factors influencing water quality as those probably related to mineral dissolution, groundwater-rock interaction, weathering process and anthropogenic activities from the dumpsite. Results of CA show groups based on similar water quality characteristics and on the extent of proximity to the dumpsite. Assessment for irrigation purpose showed that most of the water samples were suitable for agricultural purpose except in a few locations.
Del Coco, Laura; Schena, Francesco Paolo; Fanizzi, Francesco Paolo
2012-01-01
Multivariate analysis of 1H NMR data has been used for the characterization of 12 blended olive oils commercially available in the U.S. as Italian products. Chemometric methods such as unsupervised Principal Component Analysis (PCA) allowed good discrimination and gave some affinity indications for the U.S. market olive oils compared to other single cultivars of extra virgin olive oil such as Coratina and Ogliarola from Apulia, one of Italy’s leading olive oil producers, Picual (Spain), Kalamata (Greece) and Sfax (Tunisia). The olive oils commercially available as Italian products in the U.S. market clustered into 3 groups. Among them only the first (7 samples) and the second group (2 samples) showed PCA ranges similar to European references. Two oils of the third group (3 samples) were more similar to Tunisian references. In conclusion, our study revealed that most EVOO (extra virgin olive oils) tested were closer to Greek (in particular) and Spanish olive oils than Apulia EVOO. The PCA loadings disclose the components responsible for the discrimination as unsaturated (oleic, linoleic, linolenic) and saturated fatty acids. All are of great importance because of their nutritional value and differential effects on the oxidative stability of oils. It is evident that this approach has the potential to reveal the origin of EVOO, although the results support the need for a larger database, including EVOO from other Italian regions. PMID:22690321
Xu, Huan; Fu, Shi; Chen, Qi; Gu, Meng; Zhou, Juan; Liu, Chong; Chen, Yanbo; Wang, Zhong
2017-05-09
To measure the level of oxytocin in serum and prostate cancer (PCa) tissue and study its effect on the proliferation of PCa cells. Oxytocin level in serum was significantly increased in PCa patients compared with the no-carcinoma individuals. Additionally, the levels of oxytocin and its receptor were also elevated in the PCa tissue. However, no significant difference existed among the PCa of various Gleason grades. Western blot analysis confirmed the previous results and revealed an increased expression level of APPL1. The level of oxytocin in serum was measured by ELISA analysis. The expression of oxytocin and its receptor in prostate was analyzed by immunohistochemistry. The proliferation and apoptosis of PCa cells were assessed by the Cell Counting Kit 8 (CCK8) assay, cell cycle analysis and caspase3 activity analysis, respectively. Western blot analysis was used for the detection of PCNA, Caspase3 and APPL1 protein levels. Serum and prostatic oxytocin levels are increased in the PCa subjects. Serum oxytocin level may be a biomarker for PCa in the future. Oxytocin increases PCa growth and APPL1 expression.
NASA Astrophysics Data System (ADS)
Freitas, Renato P.; Coelho, Filipe A.; Felix, Valter S.; Pereira, Marcelo O.; de Souza, Marcos André Torres; Anjos, Marcelino J.
2018-03-01
This study used Raman, FT-IR and XRF spectroscopy and SEM to analyze ceramic fragments dating from the 19th century, excavated from an old farm in the municipality of Pirenópolis, Goiás, Brazil. The results show that the samples were produced in an open oven at a firing temperature below 500 °C, using raw materials including kaolinite, hematite, magnetite, quartz, microcline, albite, anhydrite, calcite, illite, orthoclase and MnO2. Although the analyses showed similarities in the manufacturing process and the presence of many minerals was common in all samples, multivariate statistical methods (PCA) allowed a more detailed assessment of similarities and differences in the mineral composition of the samples. The results of the PCA showed that the samples excavated in one of the slave quarters (senzalas) group with those excavated at the farmhouse, where the landowner lived, which indicates a paternalistic attitude towards captives, including the sharing of ceramic materials of everyday use.
Popović, Boris M; Stajner, Dubravka; Slavko, Kevrešan; Sandra, Bijelić
2012-09-15
Ethanol extracts (80% in water) of 10 cornelian cherry (Cornus mas L.) genotypes were studied for antioxidant properties, using methods including DPPH(), ()NO, O(2)(-) and ()OH antiradical powers, FRAP, total phenolic and anthocyanin content (TPC and ACC) and also one relatively new, permanganate method (permanganate reducing antioxidant capacity-PRAC). Lipid peroxidation (LP) was also determined as an indicator of oxidative stress. The data from different procedures were compared and analysed by multivariate techniques (correlation matrix calculation and principal component analysis (PCA)). Significant positive correlations were obtained between TPC, ACC and DPPH(), ()NO, O(2)(-), and ()OH antiradical powers, and also between PRAC and TPC, ACC and FRAP. PCA found two major clusters of cornelian cherry, based on antiradical power, FRAP and PRAC and also on chemical composition. Chemometric evaluation showed close interdependence between PRAC method and FRAP and ACC. There was a huge variation between C. mas genotypes in terms of antioxidant activity. Copyright © 2012 Elsevier Ltd. All rights reserved.
Boehm, Katharina; Valdivieso, Roger; Meskawi, Malek; Larcher, Alessandro; Schiffmann, Jonas; Sun, Maxine; Graefen, Markus; Saad, Fred; Parent, Marie-Élise; Karakiewicz, Pierre I
2016-03-01
We relied on a population-based case-control study (PROtEuS) to examine a potential association between the presence of histologically confirmed prostate cancer (PCa) and history of genitourinary infections, e.g., prostatitis, urethritis, orchitis and epididymitis. Cases were 1933 men with incident PCa, diagnosed across Montreal hospitals between 2005 and 2009. Population controls were 1994 men from the same residential area and age distribution. In-person interviews collected information about socio-demographic characteristics, lifestyle and medical history, e.g., self-reported history of several genitourinary infections, as well as on PCa screening. Logistic regression analyses tested overall and grade-specific associations, including subgroup analyses with frequent PSA testing. After multivariable adjustment, prostatitis was associated with an increased risk of any PCa (OR 1.81 [1.44-2.27]), but not urethritis (OR 1.05 [0.84-1.30]), orchitis (OR 1.28 [0.92-1.78]) or epididymitis (OR 0.98 [0.57-1.68]). The association between prostatitis and PCa was more pronounced for low-grade PCa (Gleason ≤ 6: OR 2.11 [1.61-2.77]; Gleason ≥ 7: OR 1.59 [1.22-2.07]). Adjusting for frequency of physician visits, PSA testing frequency or restricting analyses to frequently screened subjects did not affect these results. Prostatitis was associated with an increased probability for detecting PCa even after adjustment for frequency of PSA testing and physician visits, but not urethritis, orchitis or epididymitis. These considerations may be helpful in clinical risk stratification of individuals in whom the risk of PCa is pertinent.
Jouni, Hayan; Rodeheffer, Richard J; Kullo, Iftikhar J
2011-01-01
To determine whether serum levels of N-terminal (NT) pro-B-type natriuretic peptide (pro-BNP) are higher in patients with poorly compressible arteries (PCA) than in patients with peripheral artery disease (PAD) and control subjects without PCA or PAD. Medial arterial calcification in the lower extremities results in PCA and may be associated with increased arterial stiffness and hemodynamic/myocardial stress. PCA was defined as having an ankle-brachial index >1.4 or an ankle blood pressure >255 mm Hg, whereas PAD was defined as having an ankle-brachial index ≤0.9. Study participants with PCA (n=100; aged 71±10 years; 70% men) and age- and sex-matched patients with PAD (n=300) were recruited from the noninvasive vascular laboratory. Age- and sex-matched controls (n=300) were identified from a community-based cohort and had no history of PAD. NT pro-BNP levels were approximately 2.5-fold higher in patients with PCA than in patients with PAD and approximately 4-fold higher than in age- and sex-matched controls. In multivariable regression analyses that adjusted for age, sex, smoking, hypertension, history of coronary heart disease/stroke, systolic blood pressure, and serum creatinine, NT pro-BNP levels remained significantly higher in patients with PCA than in patients with PAD and controls (P<0.001). Patients with medial arterial calcification and PCA have higher serum levels of NT pro-BNP than patients with PAD and controls, which is suggestive of an adverse hemodynamic milieu and increased risk for adverse cardiovascular outcomes.
Blind source separation problem in GPS time series
NASA Astrophysics Data System (ADS)
Gualandi, A.; Serpelloni, E.; Belardinelli, M. E.
2016-04-01
A critical point in the analysis of ground displacement time series, as those recorded by space geodetic techniques, is the development of data-driven methods that allow the different sources of deformation to be discerned and characterized in the space and time domains. Multivariate statistic includes several approaches that can be considered as a part of data-driven methods. A widely used technique is the principal component analysis (PCA), which allows us to reduce the dimensionality of the data space while maintaining most of the variance of the dataset explained. However, PCA does not perform well in finding the solution to the so-called blind source separation (BSS) problem, i.e., in recovering and separating the original sources that generate the observed data. This is mainly due to the fact that PCA minimizes the misfit calculated using an L2 norm (χ 2), looking for a new Euclidean space where the projected data are uncorrelated. The independent component analysis (ICA) is a popular technique adopted to approach the BSS problem. However, the independence condition is not easy to impose, and it is often necessary to introduce some approximations. To work around this problem, we test the use of a modified variational Bayesian ICA (vbICA) method to recover the multiple sources of ground deformation even in the presence of missing data. The vbICA method models the probability density function (pdf) of each source signal using a mix of Gaussian distributions, allowing for more flexibility in the description of the pdf of the sources with respect to standard ICA, and giving a more reliable estimate of them. Here we present its application to synthetic global positioning system (GPS) position time series, generated by simulating deformation near an active fault, including inter-seismic, co-seismic, and post-seismic signals, plus seasonal signals and noise, and an additional time-dependent volcanic source. We evaluate the ability of the PCA and ICA decomposition techniques in explaining the data and in recovering the original (known) sources. Using the same number of components, we find that the vbICA method fits the data almost as well as a PCA method, since the χ 2 increase is less than 10 % the value calculated using a PCA decomposition. Unlike PCA, the vbICA algorithm is found to correctly separate the sources if the correlation of the dataset is low (<0.67) and the geodetic network is sufficiently dense (ten continuous GPS stations within a box of side equal to two times the locking depth of a fault where an earthquake of Mw >6 occurred). We also provide a cookbook for the use of the vbICA algorithm in analyses of position time series for tectonic and non-tectonic applications.
Lapolla, Annunziata; Ragazzi, Eugenio; Andretta, Barbara; Fedele, Domenico; Tubaro, Michela; Seraglia, Roberta; Molin, Laura; Traldi, Pietro
2007-06-01
To clarify the possible pathogenetic role of oxidation products originated from the glycation of proteins, human globins from nephropathic patients have been studied by matrix-assisted laser desorption/ionization mass spectrometry (MALDI), revealing not only unglycated and monoglycated globins, but also a series of different species. For the last ones, structural assignments were tentatively done on the basis of observed masses and expectations for the Maillard reaction pattern. Consequently, they must be considered only propositive, and the discussion which will follow must be considered in this view. In our opinion this approach does not seem to compromise the intended diagnostic use of the data because distinctions are valid even if the assignments are uncertain. We studied nine healthy subjects and 19 nephropathic patients and processed the data obtained from the MALDI spectra using a multivariate analysis. Our results showed that multivariate analytical techniques enable differential aspects of the profile of molecular species to be identified in the blood of end stage nephropathic patients. A correct grouping can be achieved by principal component analysis (PCA) and the results suggest that several products involved in carbonyl stress exist in nephropathic patients. These compounds may have a relevant role as specific markers of the pathological state.
Exploratory analysis of TOF-SIMS data from biological surfaces
NASA Astrophysics Data System (ADS)
Vaidyanathan, Seetharaman; Fletcher, John S.; Henderson, Alex; Lockyer, Nicholas P.; Vickerman, John C.
2008-12-01
The application of multivariate analytical tools enables simplification of TOF-SIMS datasets so that useful information can be extracted from complex spectra and images, especially those that do not give readily interpretable results. There is however a challenge in understanding the outputs from such analyses. The problem is complicated when analysing images, given the additional dimensions in the dataset. Here we demonstrate how the application of simple pre-processing routines can enable the interpretation of TOF-SIMS spectra and images. For the spectral data, TOF-SIMS spectra used to discriminate bacterial isolates associated with urinary tract infection were studied. Using different criteria for picking peaks before carrying out PC-DFA enabled identification of the discriminatory information with greater certainty. For the image data, an air-dried salt stressed bacterial sample, discussed in another paper by us in this issue, was studied. Exploration of the image datasets with and without normalisation prior to multivariate analysis by PCA or MAF resulted in different regions of the image being highlighted by the techniques.
Wang, Jinjia; Liu, Yuan
2015-04-01
This paper presents a feature extraction method based on multivariate empirical mode decomposition (MEMD) combining with the power spectrum feature, and the method aims at the non-stationary electroencephalogram (EEG) or magnetoencephalogram (MEG) signal in brain-computer interface (BCI) system. Firstly, we utilized MEMD algorithm to decompose multichannel brain signals into a series of multiple intrinsic mode function (IMF), which was proximate stationary and with multi-scale. Then we extracted and reduced the power characteristic from each IMF to a lower dimensions using principal component analysis (PCA). Finally, we classified the motor imagery tasks by linear discriminant analysis classifier. The experimental verification showed that the correct recognition rates of the two-class and four-class tasks of the BCI competition III and competition IV reached 92.0% and 46.2%, respectively, which were superior to the winner of the BCI competition. The experimental proved that the proposed method was reasonably effective and stable and it would provide a new way for feature extraction.
Jåstad, Eirik O; Torheim, Turid; Villeneuve, Kathleen M; Kvaal, Knut; Hole, Eli O; Sagstuen, Einar; Malinen, Eirik; Futsaether, Cecilia M
2017-09-28
The amino acid l-α-alanine is the most commonly used material for solid-state electron paramagnetic resonance (EPR) dosimetry, due to the formation of highly stable radicals upon irradiation, with yields proportional to the radiation dose. Two major alanine radical components designated R1 and R2 have previously been uniquely characterized from EPR and electron-nuclear double resonance (ENDOR) studies as well as from quantum chemical calculations. There is also convincing experimental evidence of a third minor radical component R3, and a tentative radical structure has been suggested, even though no well-defined spectral signature has been observed experimentally. In the present study, temperature dependent EPR spectra of X-ray irradiated polycrystalline alanine were analyzed using five multivariate methods in further attempts to understand the composite nature of the alanine dosimeter EPR spectrum. Principal component analysis (PCA), maximum likelihood common factor analysis (MLCFA), independent component analysis (ICA), self-modeling mixture analysis (SMA), and multivariate curve resolution (MCR) were used to extract pure radical spectra and their fractional contributions from the experimental EPR spectra. All methods yielded spectral estimates resembling the established R1 spectrum. Furthermore, SMA and MCR consistently predicted both the established R2 spectrum and the shape of the R3 spectrum. The predicted shape of the R3 spectrum corresponded well with the proposed tentative spectrum derived from spectrum simulations. Thus, results from two independent multivariate data analysis techniques strongly support the previous evidence that three radicals are indeed present in irradiated alanine samples.
Dynamic analysis environment for nuclear forensic analyses
NASA Astrophysics Data System (ADS)
Stork, C. L.; Ummel, C. C.; Stuart, D. S.; Bodily, S.; Goldblum, B. L.
2017-01-01
A Dynamic Analysis Environment (DAE) software package is introduced to facilitate group inclusion/exclusion method testing, evaluation and comparison for pre-detonation nuclear forensics applications. Employing DAE, the multivariate signatures of a questioned material can be compared to the signatures for different, known groups, enabling the linking of the questioned material to its potential process, location, or fabrication facility. Advantages of using DAE for group inclusion/exclusion include built-in query tools for retrieving data of interest from a database, the recording and documentation of all analysis steps, a clear visualization of the analysis steps intelligible to a non-expert, and the ability to integrate analysis tools developed in different programming languages. Two group inclusion/exclusion methods are implemented in DAE: principal component analysis, a parametric feature extraction method, and k nearest neighbors, a nonparametric pattern recognition method. Spent Fuel Isotopic Composition (SFCOMPO), an open source international database of isotopic compositions for spent nuclear fuels (SNF) from 14 reactors, is used to construct PCA and KNN models for known reactor groups, and 20 simulated SNF samples are utilized in evaluating the performance of these group inclusion/exclusion models. For all 20 simulated samples, PCA in conjunction with the Q statistic correctly excludes a large percentage of reactor groups and correctly includes the true reactor of origination. Employing KNN, 14 of the 20 simulated samples are classified to their true reactor of origination.
Hussain, Hazilia; Yusoff, Mohd Kamil; Ramli, Mohd Firuz; Abd Latif, Puziah; Juahir, Hafizan; Zawawi, Mohamed Azwan Mohammed
2013-11-15
Nitrate-nitrogen leaching from agricultural areas is a major cause for groundwater pollution. Polluted groundwater with high levels of nitrate is hazardous and cause adverse health effects. Human consumption of water with elevated levels of NO3-N has been linked to the infant disorder methemoglobinemia and also to non-Hodgkin's disease lymphoma in adults. This research aims to study the temporal patterns and source apportionment of nitrate-nitrogen leaching in a paddy soil at Ladang Merdeka Ismail Mulong in Kelantan, Malaysia. The complex data matrix (128 x 16) of nitrate-nitrogen parameters was subjected to multivariate analysis mainly Principal Component Analysis (PCA) and Discriminant Analysis (DA). PCA extracted four principal components from this data set which explained 86.4% of the total variance. The most important contributors were soil physical properties confirmed using Alyuda Forecaster software (R2 = 0.98). Discriminant analysis was used to evaluate the temporal variation in soil nitrate-nitrogen on leaching process. Discriminant analysis gave four parameters (hydraulic head, evapotranspiration, rainfall and temperature) contributing more than 98% correct assignments in temporal analysis. DA allowed reduction in dimensionality of the large data set which defines the four operating parameters most efficient and economical to be monitored for temporal variations. This knowledge is important so as to protect the precious groundwater from contamination with nitrate.
Metabolic changes in different developmental stages of Vanilla planifolia pods.
Palama, Tony Lionel; Khatib, Alfi; Choi, Young Hae; Payet, Bertrand; Fock, Isabelle; Verpoorte, Robert; Kodja, Hippolyte
2009-09-09
The metabolomic analysis of developing Vanilla planifolia green pods (between 3 and 8 months after pollination) was carried out by nuclear magnetic resonance (NMR) spectroscopy and multivariate data analysis. Multivariate data analysis of the (1)H NMR spectra, such as principal component analysis (PCA) and partial least-squares-discriminant analysis (PLS-DA), showed a trend of separation of those samples based on the metabolites present in the methanol/water (1:1) extract. Older pods had a higher content of glucovanillin, vanillin, p-hydroxybenzaldehyde glucoside, p-hydroxybenzaldehyde, and sucrose, while younger pods had more bis[4-(beta-D-glucopyranosyloxy)-benzyl]-2-isopropyltartrate (glucoside A), bis[4-(beta-D-glucopyranosyloxy)-benzyl]-2-(2-butyl)tartrate (glucoside B), glucose, malic acid, and homocitric acid. A liquid chromatography-mass spectrometry (LC-MS) analysis targeted at phenolic compound content was also performed on the developing pods and confirmed the NMR results. Ratios of aglycones/glucosides were estimated and thus allowed for detection of more minor metabolites in the green vanilla pods. Quantification of compounds based on both LC-MS and NMR analyses showed that free vanillin can reach 24% of the total vanillin content after 8 months of development in the vanilla green pods.
Predictive equations for the estimation of body size in seals and sea lions (Carnivora: Pinnipedia)
Churchill, Morgan; Clementz, Mark T; Kohno, Naoki
2014-01-01
Body size plays an important role in pinniped ecology and life history. However, body size data is often absent for historical, archaeological, and fossil specimens. To estimate the body size of pinnipeds (seals, sea lions, and walruses) for today and the past, we used 14 commonly preserved cranial measurements to develop sets of single variable and multivariate predictive equations for pinniped body mass and total length. Principal components analysis (PCA) was used to test whether separate family specific regressions were more appropriate than single predictive equations for Pinnipedia. The influence of phylogeny was tested with phylogenetic independent contrasts (PIC). The accuracy of these regressions was then assessed using a combination of coefficient of determination, percent prediction error, and standard error of estimation. Three different methods of multivariate analysis were examined: bidirectional stepwise model selection using Akaike information criteria; all-subsets model selection using Bayesian information criteria (BIC); and partial least squares regression. The PCA showed clear discrimination between Otariidae (fur seals and sea lions) and Phocidae (earless seals) for the 14 measurements, indicating the need for family-specific regression equations. The PIC analysis found that phylogeny had a minor influence on relationship between morphological variables and body size. The regressions for total length were more accurate than those for body mass, and equations specific to Otariidae were more accurate than those for Phocidae. Of the three multivariate methods, the all-subsets approach required the fewest number of variables to estimate body size accurately. We then used the single variable predictive equations and the all-subsets approach to estimate the body size of two recently extinct pinniped taxa, the Caribbean monk seal (Monachus tropicalis) and the Japanese sea lion (Zalophus japonicus). Body size estimates using single variable regressions generally under or over-estimated body size; however, the all-subset regression produced body size estimates that were close to historically recorded body length for these two species. This indicates that the all-subset regression equations developed in this study can estimate body size accurately. PMID:24916814
Yim, Jae Hyun; Kim, Chan Kyo; Kim, Jae-Hun
2018-04-01
Active surveillance (AS) is an important treatment strategy for prostate cancer (PCa). Prostate Imaging-Reporting and Data System (PI-RADS) v2 has been addressed, but few studies have reported the value of PI-RADS v2 for assessing risk stratification in patients with PCa, especially on selecting potential candidates for AS. To investigate the utility of PI-RADS v2 and apparent diffusion coefficient (ADC) in evaluating patients with insignificant PCa, who are suitable for AS. Retrospective. In all, 238 patients with PCa who met the Prostate Cancer Research International: Active Surveillance criteria underwent radical prostatectomy. 3.0T, including T 2 -weighted, diffusion-weighted, and dynamic contrast-enhanced imaging. Insignificant cancer was defined histopathologically as an organ-confined disease with a tumor volume <0.5 cm 3 without Gleason score 4-5. Patients were divided into two groups based on the PI-RADS v2 and tumor ADC: A, PI-RADS score ≤3 and ADC ≥1.095 × 10 -3 mm 2 /s; and B, PI-RADS score 4-5 or ADC <1.095 × 10 -3 mm 2 /s. Preoperative clinical and imaging variables were evaluated regarding the associations with insignificant cancer. Of the 238 patients, 101 (42.8%) were diagnosed with insignificant cancer on pathological findings. The number of positive cores, prostate-specific antigen density (PSAD), PI-RADS v2 and tumor ADC were significantly associated with insignificant cancer on univariate analysis (P < 0.05). However, multivariate analysis indicated tumor ADC (odds ratio [OR] = 4.57, P < 0.001) and PI-RADS v2 (OR = 3.60, P < 0.001) were independent predictors of insignificant cancer. Area under the receiver operating characteristics curve (AUC) reached 0.803 when PI-RADS v2 (AUC = 0.747) was combined with tumor ADC (AUC = 0.786). The PI-RADS v2 together with tumor ADC may be a useful marker for predicting patients with insignificant PCa when considering AS. 4 Technical Efficacy: Stage 2 J. Magn. Reson. Imaging 2018;47:1072-1079. © 2017 International Society for Magnetic Resonance in Medicine.
Predictive spectroscopy and chemical imaging based on novel optical systems
NASA Astrophysics Data System (ADS)
Nelson, Matthew Paul
1998-10-01
This thesis describes two futuristic optical systems designed to surpass contemporary spectroscopic methods for predictive spectroscopy and chemical imaging. These systems are advantageous to current techniques in a number of ways including lower cost, enhanced portability, shorter analysis time, and improved S/N. First, a novel optical approach to predicting chemical and physical properties based on principal component analysis (PCA) is proposed and evaluated. A regression vector produced by PCA is designed into the structure of a set of paired optical filters. Light passing through the paired filters produces an analog detector signal directly proportional to the chemical/physical property for which the regression vector was designed. Second, a novel optical system is described which takes a single-shot approach to chemical imaging with high spectroscopic resolution using a dimension-reduction fiber-optic array. Images are focused onto a two- dimensional matrix of optical fibers which are drawn into a linear distal array with specific ordering. The distal end is imaged with a spectrograph equipped with an ICCD camera for spectral analysis. Software is used to extract the spatial/spectral information contained in the ICCD images and deconvolute them into wave length-specific reconstructed images or position-specific spectra which span a multi-wavelength space. This thesis includes a description of the fabrication of two dimension-reduction arrays as well as an evaluation of the system for spatial and spectral resolution, throughput, image brightness, resolving power, depth of focus, and channel cross-talk. PCA is performed on the images by treating rows of the ICCD images as spectra and plotting the scores of each PC as a function of reconstruction position. In addition, iterative target transformation factor analysis (ITTFA) is performed on the spectroscopic images to generate ``true'' chemical maps of samples. Univariate zero-order images, univariate first-order spectroscopic images, bivariate first-order spectroscopic images, and multivariate first-order spectroscopic images of the temporal development of laser-induced plumes are presented and interpreted. Reconstructed chemical images generated using bivariate and trivariate wavelength techniques, bimodal and trimodal PCA methods, and bimodal and trimodal ITTFA approaches are also included.
NASA Astrophysics Data System (ADS)
Hu, Chongqing; Li, Aihua; Zhao, Xingyang
2011-02-01
This paper proposes a multivariate statistical analysis approach to processing the instantaneous engine speed signal for the purpose of locating multiple misfire events in internal combustion engines. The state of each cylinder is described with a characteristic vector extracted from the instantaneous engine speed signal following a three-step procedure. These characteristic vectors are considered as the values of various procedure parameters of an engine cycle. Therefore, determination of occurrence of misfire events and identification of misfiring cylinders can be accomplished by a principal component analysis (PCA) based pattern recognition methodology. The proposed algorithm can be implemented easily in practice because the threshold can be defined adaptively without the information of operating conditions. Besides, the effect of torsional vibration on the engine speed waveform is interpreted as the presence of super powerful cylinder, which is also isolated by the algorithm. The misfiring cylinder and the super powerful cylinder are often adjacent in the firing sequence, thus missing detections and false alarms can be avoided effectively by checking the relationship between the cylinders.
NASA Astrophysics Data System (ADS)
Tsai, Yu-Hsuan; Garrett, Timothy J.; Carter, Christy S.; Yost, Richard A.
2015-06-01
Skeletal muscles are composed of heterogeneous muscle fibers that have different physiological, morphological, biochemical, and histological characteristics. In this work, skeletal muscles extensor digitorum longus, soleus, and whole gastrocnemius were analyzed by matrix-assisted laser desorption/ionization mass spectrometry to characterize small molecule metabolites of oxidative and glycolytic muscle fiber types as well as to visualize biomarker localization. Multivariate data analysis such as principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) were performed to extract significant features. Different metabolic fingerprints were observed from oxidative and glycolytic fibers. Higher abundances of biomolecules such as antioxidant anserine as well as acylcarnitines were observed in the glycolytic fibers, whereas taurine and some nucleotides were found to be localized in the oxidative fibers.
Liu, Xiaona; Zhang, Qiao; Wu, Zhisheng; Shi, Xinyuan; Zhao, Na; Qiao, Yanjiang
2015-01-01
Laser-induced breakdown spectroscopy (LIBS) was applied to perform a rapid elemental analysis and provenance study of Blumea balsamifera DC. Principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) were implemented to exploit the multivariate nature of the LIBS data. Scores and loadings of computed principal components visually illustrated the differing spectral data. The PLS-DA algorithm showed good classification performance. The PLS-DA model using complete spectra as input variables had similar discrimination performance to using selected spectral lines as input variables. The down-selection of spectral lines was specifically focused on the major elements of B. balsamifera samples. Results indicated that LIBS could be used to rapidly analyze elements and to perform provenance study of B. balsamifera. PMID:25558999
NASA Astrophysics Data System (ADS)
Vítková, Gabriela; Prokeš, Lubomír; Novotný, Karel; Pořízka, Pavel; Novotný, Jan; Všianský, Dalibor; Čelko, Ladislav; Kaiser, Jozef
2014-11-01
Focusing on historical aspect, during archeological excavation or restoration works of buildings or different structures built from bricks it is important to determine, preferably in-situ and in real-time, the locality of bricks origin. Fast classification of bricks on the base of Laser-Induced Breakdown Spectroscopy (LIBS) spectra is possible using multivariate statistical methods. Combination of principal component analysis (PCA) and linear discriminant analysis (LDA) was applied in this case. LIBS was used to classify altogether the 29 brick samples from 7 different localities. Realizing comparative study using two different LIBS setups - stand-off and table-top it is shown that stand-off LIBS has a big potential for archeological in-field measurements.
A Novel Acoustic Sensor Approach to Classify Seeds Based on Sound Absorption Spectra
Gasso-Tortajada, Vicent; Ward, Alastair J.; Mansur, Hasib; Brøchner, Torben; Sørensen, Claus G.; Green, Ole
2010-01-01
A non-destructive and novel in situ acoustic sensor approach based on the sound absorption spectra was developed for identifying and classifying different seed types. The absorption coefficient spectra were determined by using the impedance tube measurement method. Subsequently, a multivariate statistical analysis, i.e., principal component analysis (PCA), was performed as a way to generate a classification of the seeds based on the soft independent modelling of class analogy (SIMCA) method. The results show that the sound absorption coefficient spectra of different seed types present characteristic patterns which are highly dependent on seed size and shape. In general, seed particle size and sphericity were inversely related with the absorption coefficient. PCA presented reliable grouping capabilities within the diverse seed types, since the 95% of the total spectral variance was described by the first two principal components. Furthermore, the SIMCA classification model based on the absorption spectra achieved optimal results as 100% of the evaluation samples were correctly classified. This study contains the initial structuring of an innovative method that will present new possibilities in agriculture and industry for classifying and determining physical properties of seeds and other materials. PMID:22163455
Simeonov, V; Massart, D L; Andreev, G; Tsakovski, S
2000-11-01
The paper deals with application of different statistical methods like cluster and principal components analysis (PCA), partial least squares (PLSs) modeling. These approaches are an efficient tool in achieving better understanding about the contamination of two gulf regions in Black Sea. As objects of the study, a collection of marine sediment samples from Varna and Bourgas "hot spots" gulf areas are used. In the present case the use of cluster and PCA make it possible to separate three zones of the marine environment with different levels of pollution by interpretation of the sediment analysis (Bourgas gulf, Varna gulf and lake buffer zone). Further, the extraction of four latent factors offers a specific interpretation of the possible pollution sources and separates natural from anthropogenic factors, the latter originating from contamination by chemical, oil refinery and steel-work enterprises. Finally, the PLSs modeling gives a better opportunity in predicting contaminant concentration on tracer (or tracers) element as compared to the one-dimensional approach of the baseline models. The results of the study are important not only in local aspect as they allow quick response in finding solutions and decision making but also in broader sense as a useful environmetrical methodology.
Shawky, Eman; Abou El Kheir, Rasha M
2018-02-11
Species of Apiaceae are used in folk medicine as spices and in officinal medicinal preparations of drugs. They are an excellent source of phenolics exhibiting antioxidant activity, which are of great benefit to human health. Discrimination among Apiaceae medicinal herbs remains an intricate challenge due to their morphological similarity. In this study, a combined "untargeted" and "targeted" approach to investigate different Apiaceae plants species was proposed by using the merging of high-performance thin layer chromatography (HPTLC)-image analysis and pattern recognition methods which were used for fingerprinting and classification of 42 different Apiaceae samples collected from Egypt. Software for image processing was applied for fingerprinting and data acquisition. HPTLC fingerprint assisted by principal component analysis (PCA) and hierarchical cluster analysis (HCA)-heat maps resulted in a reliable untargeted approach for discrimination and classification of different samples. The "targeted" approach was performed by developing and validating an HPTLC method allowing the quantification of eight flavonoids. The combination of quantitative data with PCA and HCA-heat-maps allowed the different samples to be discriminated from each other. The use of chemometrics tools for evaluation of fingerprints reduced expense and analysis time. The proposed method can be adopted for routine discrimination and evaluation of the phytochemical variability in different Apiaceae species extracts. Copyright © 2018 John Wiley & Sons, Ltd.
Stefanidis, Konstantinos; Papatheodorou, George
2018-01-01
During the last decades, Mediterranean freshwater ecosystems, especially lakes, have been under severe pressure due to increasing eutrophication and water quality deterioration. In this article, we compared the effectiveness of different data analysis methods by assessing the contribution of environmental parameters to eutrophication processes. For this purpose, principal components analysis (PCA), cluster analysis, and a self-organizing map (SOM) were applied, using water quality data from two transboundary lakes of North Greece. SOM is considered as an advanced and powerful data analysis tool because of its ability to represent complex and nonlinear relationships among multivariate data sets. The results of PCA and cluster analysis agreed with the SOM results, although the latter provided more information because of the visualization abilities regarding the parameters’ relationships. Besides nutrients that were found to be a key factor for controlling chlorophyll-a (Chl-a), water temperature was related positively with algal production, while the Secchi disk depth parameter was found to be highly important and negatively related toeutrophic conditions. In general, the SOM results were more specific and allowed direct associations between the water quality variables. Our work showed that SOMs can be used effectively in limnological studies to produce robust and interpretable results, aiding scientists and managers to cope with environmental problems such as eutrophication. PMID:29562675
Lotan, Tamara L; Wei, Wei; Morais, Carlos L; Hawley, Sarah T; Fazli, Ladan; Hurtado-Coll, Antonio; Troyer, Dean; McKenney, Jesse K; Simko, Jeffrey; Carroll, Peter R; Gleave, Martin; Lance, Raymond; Lin, Daniel W; Nelson, Peter S; Thompson, Ian M; True, Lawrence D; Feng, Ziding; Brooks, James D
2016-06-01
PTEN is the most commonly deleted tumor suppressor gene in primary prostate cancer (PCa) and its loss is associated with poor clinical outcomes and ERG gene rearrangement. We tested whether PTEN loss is associated with shorter recurrence-free survival (RFS) in surgically treated PCa patients with known ERG status. A genetically validated, automated PTEN immunohistochemistry (IHC) protocol was used for 1275 primary prostate tumors from the Canary Foundation retrospective PCa tissue microarray cohort to assess homogeneous (in all tumor tissue sampled) or heterogeneous (in a subset of tumor tissue sampled) PTEN loss. ERG status as determined by a genetically validated IHC assay was available for a subset of 938 tumors. Associations between PTEN and ERG status were assessed using Fisher's exact test. Kaplan-Meier and multivariate weighted Cox proportional models for RFS were constructed. When compared to intact PTEN, homogeneous (hazard ratio [HR] 1.66, p = 0.001) but not heterogeneous (HR 1.24, p = 0.14) PTEN loss was significantly associated with shorter RFS in multivariate models. Among ERG-positive tumors, homogeneous (HR 3.07, p < 0.0001) but not heterogeneous (HR 1.46, p = 0.10) PTEN loss was significantly associated with shorter RFS. Among ERG-negative tumors, PTEN did not reach significance for inclusion in the final multivariate models. The interaction term for PTEN and ERG status with respect to RFS did not reach statistical significance ( p = 0.11) for the current sample size. These data suggest that PTEN is a useful prognostic biomarker and that there is no statistically significant interaction between PTEN and ERG status for RFS. We found that loss of the PTEN tumor suppressor gene in prostate tumors as assessed by tissue staining is correlated with shorter time to prostate cancer recurrence after radical prostatectomy.
Qi, Xiubin; Crooke, Emma; Ross, Andrew; Bastow, Trevor P; Stalvies, Charlotte
2011-09-21
This paper presents a system and method developed to identify a source oil's characteristic properties by testing the oil's dissolved components in water. Through close examination of the oil dissolution process in water, we hypothesise that when oil is in contact with water, the resulting oil-water extract, a complex hydrocarbon mixture, carries the signature property information of the parent oil. If the dominating differences in compositions between such extracts of different oils can be identified, this information could guide the selection of various sensors, capable of capturing such chemical variations. When used as an array, such a sensor system can be used to determine parent oil information from the oil-water extract. To test this hypothesis, 22 oils' water extracts were prepared and selected dominant hydrocarbons analyzed with Gas Chromatography-Mass Spectrometry (GC-MS); the subsequent Principal Component Analysis (PCA) indicates that the major difference between the extract solutions is the relative concentration between the volatile mono-aromatics and fluorescent polyaromatics. An integrated sensor array system that is composed of 3 volatile hydrocarbon sensors and 2 polyaromatic hydrocarbon sensors was built accordingly to capture the major and subtle differences of these extracts. It was tested by exposure to a total of 110 water extract solutions diluted from the 22 extracts. The sensor response data collected from the testing were processed with two multivariate analysis tools to reveal information retained in the response patterns of the arrayed sensors: by conducting PCA, we were able to demonstrate the ability to qualitatively identify and distinguish different oil samples from their sensor array response patterns. When a supervised PCA, Linear Discriminate Analysis (LDA), was applied, even quantitative classification can be achieved: the multivariate model generated from the LDA achieved 89.7% of successful classification of the type of the oil samples. By grouping the samples based on the level of viscosity and density we were able to reveal the correlation between the oil extracts' sensor array responses and their original oils' feature properties. The equipment and method developed in this study have promising potential to be readily applied in field studies and marine surveys for oil exploration or oil spill monitoring.
NASA Astrophysics Data System (ADS)
Eglinton, Timothy I.; Sinninghe Damsté, Jaap S.; Pool, Wim; de Leeuw, Jan W.; Eijk, Gert; Boon, Jaap J.
1992-04-01
This study describes the analysis of sulphur-containing products from Curie-point pyrolysis (Py) of eighty-five samples (kerogens, bitumen, and petroleum asphaltenes and coals) using gas chromatography (GC) in combination with sulphur-selective detection. Peak areas of approximately forty individual organic sulphur pyrolysis products (OSPP) were measured, and the results analysed with the aid of multivariate data reduction techniques (principal components analysis, (PCA)). The structural relationships proposed in an earlier publication ( SINNINGHE DAMSTé et al., 1989a) in which OSPP can be grouped according to common "carbon skeletons" are supported by PCA. The distribution of OSPP varies both as a function of kerogen type (as defined by elemental composition) and maturity, reflecting differences in the relative abundance of the various carbon skeleton types. Sulphur-containing products from Type I, Type II, and, to some extent, Type II-S kerogens are dominated by OSPP derived from "moieties" (i.e., discrete structural components within the macromolecule) possessing linear carbon skeletons, while coals and Type III kerogens give rise to higher relative abundances of OSPP with branched carbon skeletons. Type I kerogens are distinguished from Type II kerogens due to the type of linear carbon skeleton, the former yielding higher relative amounts of 2- n-alkylthiophenes and thiolanes and the latter 2,5-di-substituted sulphur-containing products. Products from sulphur-rich (Type II-S) kerogens differ by higher relative abundances of OSPP derived from precursors with isoprenoid and/or steroidal side-chain carbon skeletons, and by higher absolute abundances of all OSPP. Petroleum and, to a lesser extent, bitumen asphaltenes give rise to OSPP with longer carbon skeletons than do kerogens or coals. This observation supports the models proposed by SINNINGHE DAMSTé et al. (1990a) in which sulphur-containing moieties in asphaltenes are bound by fewer intermolecular bridges (i.e., are less extensively cross-linked) and, consequently, more readily yield longer chain products on pyrolysis. From these observations, we suggest that Py-GC in combination with PCA provides useful information concerning the chemical nature of organically bound sulphur in geomacromolecules. This information can be rationalised based on carbon skeleton relationships established for low molecular weight organic sulphur compounds, and in terms of kerogen type and overall sulphur content
De Luca, Stefano; Passera, Roberto; Fiori, Cristian; Bollito, Enrico; Cappia, Susanna; Mario Scarpa, Roberto; Sottile, Antonino; Franco Randone, Donato; Porpiglia, Francesco
2015-10-01
To determine if prostate health index (PHI), prostate cancer antigen gene 3 (PCA3) score, and percentage of free prostate-specific antigen (%fPSA) may be used to differentiate asymptomatic acute and chronic prostatitis from prostate cancer (PCa), benign prostatic hyperplasia (BPH), and high-grade prostate intraepithelial neoplasia (HG-PIN) in patients with elevated PSA levels and negative findings on digital rectal examination at repeat biopsy (re-Bx). In this prospective study, 252 patients were enrolled, undergoing PHI, PCA3 score, and %fPSA assessments before re-Bx. We used 3 multivariate logistic regression models to test the PHI, PCA3 score, and %fPSA as risk factors for prostatitis vs. PCa, vs. BPH, and vs. HG-PIN. All the analyses were performed for the whole patient cohort and for the "gray zone" of PSA (4-10ng/ml) cohort (171 individuals). Of the 252 patients, 43 (17.1%) had diagnosis of PCa. The median PHI was significantly different between men with a negative biopsy and those with a positive biopsy (34.9 vs. 48.1, P<0.001), as for the PCA3 score (24 vs. 54, P<0.001) and %fPSA (11.8% vs. 15.8%, P = 0.012). The net benefit of using PCA3 and PHI to differentiate prostatitis and PCa was moderate, although it extended to a good range of threshold probabilities (40%-100%), whereas that from using %fPSA was negligible: this pattern was reported for the whole population as for the "gray zone" PSA cohort. In front of a good diagnostic performance of all the 3 biomarkers in distinguishing negative biopsy vs. positive biopsy, the clinical benefit of using the PCA3 score and PHI to estimate prostatitis vs. PCa was comparable. PHI was the only determinant for prostatitis vs. BPH, whereas no biomarkers could differentiate prostate inflammation from HG-PIN. Copyright © 2015 Elsevier Inc. All rights reserved.
Park, Sung Yoon; Shin, Su-Jin; Jung, Dae Chul; Cho, Nam Hoon; Choi, Young Deuk; Rha, Koon Ho; Hong, Sung Joon; Oh, Young Taik
2017-06-01
To analyze whether Prostate Imaging Reporting and Data System (PI-RADSv2) scores are associated with a risk of normal-sized pelvic lymph node metastasis (PLNM) in prostate cancer (PCa). A consecutive series of 221 patients who underwent magnetic resonance imaging and radical prostatectomy with pelvic lymph node dissection (PLND) for PCa were retrospectively analyzed under the approval of institutional review board in our institution. No patients had enlarged (≥0.8cm in short-axis diameter) lymph nodes. Clinical parameters [prostate-specific antigen (PSA), greatest percentage of biopsy core, and percentage of positive cores], and PI-RADSv2 score from two independent readers were analyzed with multivariate logistic regression and receiver operating-characteristic curve for PLNM. Diagnostic performance of PI-RADSv2 and Briganti nomogram was compared. Weighted kappa was investigated regarding PI-RADSv2 scoring. Normal-sized PLNM was found in 9.5% (21/221) of patients. In multivariate analysis, PI-RADSv2 (reader 1, p=0.009; reader 2, p=0.026) and PSA (reader 1, p=0.008; reader 2, p=0.037) were predictive of normal-sized PLNM. Threshold of PI-RADSv2 was a score of 5, where PI-RADSv2 was associated with high sensitivity (reader 1, 95.2% [20/21]; reader 2, 90.5% [19/21]) and negative predictive value (reader 1, 99.2% [124/125]; reader 2, 98.6% [136/138]). However, diagnostic performance of PI-RADSv2 (AUC=0.786-0.788) was significantly lower than that of Briganti nomogram (AUC=0.890) for normal-sized PLNM (p<0.05). The inter-reader agreement was excellent for PI-RADSv2 of 5 or not (weighted kappa=0.804). PI-RADSv2 scores may be associated with the risk of normal-sized PLNM in PCa. Copyright © 2017. Published by Elsevier B.V.
Recognizing different tissues in human fetal femur cartilage by label-free Raman microspectroscopy
NASA Astrophysics Data System (ADS)
Kunstar, Aliz; Leijten, Jeroen; van Leuveren, Stefan; Hilderink, Janneke; Otto, Cees; van Blitterswijk, Clemens A.; Karperien, Marcel; van Apeldoorn, Aart A.
2012-11-01
Traditionally, the composition of bone and cartilage is determined by standard histological methods. We used Raman microscopy, which provides a molecular "fingerprint" of the investigated sample, to detect differences between the zones in human fetal femur cartilage without the need for additional staining or labeling. Raman area scans were made from the (pre)articular cartilage, resting, proliferative, and hypertrophic zones of growth plate and endochondral bone within human fetal femora. Multivariate data analysis was performed on Raman spectral datasets to construct cluster images with corresponding cluster averages. Cluster analysis resulted in detection of individual chondrocyte spectra that could be separated from cartilage extracellular matrix (ECM) spectra and was verified by comparing cluster images with intensity-based Raman images for the deoxyribonucleic acid/ribonucleic acid (DNA/RNA) band. Specific dendrograms were created using Ward's clustering method, and principal component analysis (PCA) was performed with the separated and averaged Raman spectra of cells and ECM of all measured zones. Overall (dis)similarities between measured zones were effectively visualized on the dendrograms and main spectral differences were revealed by PCA allowing for label-free detection of individual cartilaginous zones and for label-free evaluation of proper cartilaginous matrix formation for future tissue engineering and clinical purposes.
Structure-seeking multilinear methods for the analysis of fMRI data.
Andersen, Anders H; Rayens, William S
2004-06-01
In comprehensive fMRI studies of brain function, the data structures often contain higher-order ways such as trial, task condition, subject, and group in addition to the intrinsic dimensions of time and space. While multivariate bilinear methods such as principal component analysis (PCA) have been used successfully for extracting information about spatial and temporal features in data from a single fMRI run, the need to unfold higher-order data sets into bilinear arrays has led to decompositions that are nonunique and to the loss of multiway linkages and interactions present in the data. These additional dimensions or ways can be retained in multilinear models to produce structures that are unique and which admit interpretations that are neurophysiologically meaningful. Multiway analysis of fMRI data from multiple runs of a bilateral finger-tapping paradigm was performed using the parallel factor (PARAFAC) model. A trilinear model was fitted to a data cube of dimensions voxels by time by run. Similarly, a quadrilinear model was fitted to a higher-way structure of dimensions voxels by time by trial by run. The spatial and temporal response components were extracted and validated by comparison to results from traditional SVD/PCA analyses based on scenarios of unfolding into lower-order bilinear structures.
Exploring patterns enriched in a dataset with contrastive principal component analysis.
Abid, Abubakar; Zhang, Martin J; Bagaria, Vivek K; Zou, James
2018-05-30
Visualization and exploration of high-dimensional data is a ubiquitous challenge across disciplines. Widely used techniques such as principal component analysis (PCA) aim to identify dominant trends in one dataset. However, in many settings we have datasets collected under different conditions, e.g., a treatment and a control experiment, and we are interested in visualizing and exploring patterns that are specific to one dataset. This paper proposes a method, contrastive principal component analysis (cPCA), which identifies low-dimensional structures that are enriched in a dataset relative to comparison data. In a wide variety of experiments, we demonstrate that cPCA with a background dataset enables us to visualize dataset-specific patterns missed by PCA and other standard methods. We further provide a geometric interpretation of cPCA and strong mathematical guarantees. An implementation of cPCA is publicly available, and can be used for exploratory data analysis in many applications where PCA is currently used.
Hu, Boran; Yue, Yaqing; Zhu, Yong; Wen, Wen; Zhang, Fengmin; Hardie, Jim W
2015-01-01
Proton nuclear magnetic resonance spectroscopy coupled multivariate analysis (1H NMR-PCA/PLS-DA) is an important tool for the discrimination of wine products. Although 1H NMR has been shown to discriminate wines of different cultivars, a grape genetic component of the discrimination has been inferred only from discrimination of cultivars of undefined genetic homology and in the presence of many confounding environmental factors. We aimed to confirm the influence of grape genotypes in the absence of those factors. We applied 1H NMR-PCA/PLS-DA and hierarchical cluster analysis (HCA) to wines from five, variously genetically-related grapevine (V. vinifera) cultivars; all grown similarly on the same site and vinified similarly. We also compared the semi-quantitative profiles of the discriminant metabolites of each cultivar with previously reported chemical analyses. The cultivars were clearly distinguishable and there was a general correlation between their grouping and their genetic homology as revealed by recent genomic studies. Between cultivars, the relative amounts of several of the cultivar-related discriminant metabolites conformed closely with reported chemical analyses. Differences in grape-derived metabolites associated with genetic differences alone are a major source of 1H NMR-based discrimination of wines and 1H NMR has the capacity to discriminate between very closely related cultivars. The study confirms that genetic variation among grape cultivars alone can account for the discrimination of wine by 1H NMR-PCA/PLS and indicates that 1H NMR spectra of wine of single grape cultivars may in future be used in tandem with hierarchical cluster analysis to elucidate genetic lineages and metabolomic relations of grapevine cultivars. In the absence of genetic information, for example, where predecessor varieties are no longer extant, this may be a particularly useful approach.
Zuo, Yamin; Deng, Xuehua; Wu, Qing
2018-05-04
Discrimination of Gastrodia elata ( G. elata ) geographical origin is of great importance to pharmaceutical companies and consumers in China. this paper focuses on the feasibility of near infrared spectrum (NIRS) combined multivariate analysis as a rapid and non-destructive method to prove its fit for this purpose. Firstly, 16 batches of G. elata samples from four main-cultivation regions in China were quantified by traditional HPLC method. It showed that samples from different origins could not be efficiently differentiated by the contents of four phenolic compounds in this study. Secondly, the raw near infrared (NIR) spectra of those samples were acquired and two different pattern recognition techniques were used to classify the geographical origins. The results showed that with spectral transformation optimized, discriminant analysis (DA) provided 97% and 99% correct classification for the calibration and validation sets of samples from discriminating of four different main-cultivation regions, and provided 98% and 99% correct classifications for the calibration and validation sets of samples from eight different cities, respectively, which all performed better than the principal component analysis (PCA) method. Thirdly, as phenolic compounds content (PCC) is highly related with the quality of G. elata , synergy interval partial least squares (Si-PLS) was applied to build the PCC prediction model. The coefficient of determination for prediction (R p ²) of the Si-PLS model was 0.9209, and root mean square error for prediction (RMSEP) was 0.338. The two regions (4800 cm −1 ⁻5200 cm −1 , and 5600 cm −1 ⁻6000 cm −1 ) selected by Si-PLS corresponded to the absorptions of aromatic ring in the basic phenolic structure. It can be concluded that NIR spectroscopy combined with PCA, DA and Si-PLS would be a potential tool to provide a reference for the quality control of G. elata.
Jesse, Stephen; Kalinin, Sergei V
2009-02-25
An approach for the analysis of multi-dimensional, spectroscopic-imaging data based on principal component analysis (PCA) is explored. PCA selects and ranks relevant response components based on variance within the data. It is shown that for examples with small relative variations between spectra, the first few PCA components closely coincide with results obtained using model fitting, and this is achieved at rates approximately four orders of magnitude faster. For cases with strong response variations, PCA allows an effective approach to rapidly process, de-noise, and compress data. The prospects for PCA combined with correlation function analysis of component maps as a universal tool for data analysis and representation in microscopy are discussed.
NASA Astrophysics Data System (ADS)
Su, Shiliang; Zhi, Junjun; Lou, Liping; Huang, Fang; Chen, Xia; Wu, Jiaping
Characterizing the spatio-temporal patterns and apportioning the pollution sources of water bodies are important for the management and protection of water resources. The main objective of this study is to describe the dynamics of water quality and provide references for improving river pollution control practices. Comprehensive application of neural-based modeling and different multivariate methods was used to evaluate the spatio-temporal patterns and source apportionment of pollution in Qiantang River, China. Measurement data were obtained and pretreated for 13 variables from 41 monitoring sites for the period of 2001-2004. A self-organizing map classified the 41 monitoring sites into three groups (Group A, B and C), representing different pollution characteristics. Four significant parameters (dissolved oxygen, biochemical oxygen demand, total phosphorus and total lead) were identified by discriminant analysis for distinguishing variations of different years, with about 80% correct assignment for temporal variation. Rotated principal component analysis (PCA) identified four potential pollution sources for Group A (domestic sewage and agricultural pollution, industrial wastewater pollution, mineral weathering, vehicle exhaust and sand mining), five for Group B (heavy metal pollution, agricultural runoff, vehicle exhaust and sand mining, mineral weathering, chemical plants discharge) and another five for Group C (vehicle exhaust and sand mining, chemical plants discharge, soil weathering, biochemical pollution, mineral weathering). The identified potential pollution sources explained 75.6% of the total variances for Group A, 75.0% for Group B and 80.0% for Group C, respectively. Receptor-based source apportionment was applied to further estimate source contributions for each pollution variable in the three groups, which facilitated and supported the PCA results. These results could assist managers to develop optimal strategies and determine priorities for river pollution control and effective water resources management.
Mutheneni, Srinivasa Rao; Upadhyayula, Suryanaryana Murty; Kumaraswamy, Sriram; Kadiri, Madhusudhan Rao; Nagalla, Balakrishna
2016-01-01
Lymphatic filariasis (LF) is a major public health problem in India. The objective of the study was to assess the impact of socioeconomic conditions on LF in Chittoor district of Andhra Pradesh, India. A survey was carried out from 2004 to 2007 during which, an epidemiological and socioeconomic data were collected and analysed. The microfilaria (mf) positive samples were taken as cases and matched with control group by sex and age (1:1) for case-control study. Bivariate and multivariate logistic regression was used to identify the potential risk factors for filariasis. Using principal component analysis (PCA), a socioeconomic index was developed and the data/scores were classified into low, medium and high categories. In total 5,133 blood smears were collected, of which 77 samples were found positive for microfilaria (1.52%). Multivariate analysis showed that the risk of filariasis was higher in groups of people with income < ₹1000 per month [OR = 2.752 (95%CI, 0.435-17.429)]; ₹ 1000-3000 per month [3.079 (0.923-0.275)]; people living in tiled house structure [1.641 (0.534-5.048)], with kutcha (uncemented) drainage system [19.427 (2.985- 126.410)], respondents who did not implemented mosquito avoidance measures [1.737 (0.563-5.358)]; and in people who were not aware about prevention and control of filariasis [1.042 (0.368-2.956)]. PCA showed that respondents with low (41.6%) and medium (33.8%) socioeconomic status are more prone to filariasis (p=0.036). The cross sectional study showed that the population with low and medium socioeconomic status are at higher risk of filariasis. The identified socioeconomic risk factors can be used as a guideline for improving the conditions for effective management of filariasis.
Baez-Cazull, S. E.; McGuire, J.T.; Cozzarelli, I.M.; Voytek, M.A.
2008-01-01
Determining the processes governing aqueous biogeochemistry in a wetland hydrologically linked to an underlying contaminated aquifer is challenging due to the complex exchange between the systems and their distinct responses to changes in precipitation, recharge, and biological activities. To evaluate temporal and spatial processes in the wetland-aquifer system, water samples were collected using cm-scale multichambered passive diffusion samplers (peepers) to span the wetland-aquifer interface over a period of 3 yr. Samples were analyzed for major cations and anions, methane, and a suite of organic acids resulting in a large dataset of over 8000 points, which was evaluated using multivariate statistics. Principal component analysis (PCA) was chosen with the purpose of exploring the sources of variation in the dataset to expose related variables and provide insight into the biogeochemical processes that control the water chemistry of the system. Factor scores computed from PCA were mapped by date and depth. Patterns observed suggest that (i) fermentation is the process controlling the greatest variability in the dataset and it peaks in May; (ii) iron and sulfate reduction were the dominant terminal electron-accepting processes in the system and were associated with fermentation but had more complex seasonal variability than fermentation; (iii) methanogenesis was also important and associated with bacterial utilization of minerals as a source of electron acceptors (e.g., barite BaSO4); and (iv) seasonal hydrological patterns (wet and dry periods) control the availability of electron acceptors through the reoxidation of reduced iron-sulfur species enhancing iron and sulfate reduction. Copyright ?? 2008 by the American Society of Agronomy, Crop Science Society of America, and Soil Science Society of America. All rights reserved.
Brachytherapy Boost Utilization and Survival in Unfavorable-risk Prostate Cancer.
Johnson, Skyler B; Lester-Coll, Nataniel H; Kelly, Jacqueline R; Kann, Benjamin H; Yu, James B; Nath, Sameer K
2017-11-01
There are limited comparative survival data for prostate cancer (PCa) patients managed with a low-dose rate brachytherapy (LDR-B) boost and dose-escalated external-beam radiotherapy (DE-EBRT) alone. To compare overall survival (OS) for men with unfavorable PCa between LDR-B and DE-EBRT groups. Using the National Cancer Data Base, we identified men with unfavorable PCa treated between 2004 and 2012 with androgen suppression (AS) and either EBRT followed by LDR-B or DE-EBRT (75.6-86.4Gy). Treatment selection was evaluated using logistic regression and annual percentage proportions. OS was analyzed using the Kaplan-Meier method, log-rank test, Cox proportional hazards, and propensity score matching. We identified 25038 men between 2004 and 2012, during which LDR-B boost utilization decreased from 29% to 14%. LDR-B was associated with better OS on univariate (7-yr OS: 82% vs 73%; p<0.001) and multivariate analyses (hazard ratio [HR] 0.70, 95% confidence interval [CI] 0.64-0.77). Propensity score matching verified an OS benefit associated with LDR-B boost (HR 0.74, 95% CI 0.66-0.89). The OS benefit of LDR-B boost persisted when limited to men aged <60 yr with no comorbidities. On subset analysis, there was no interaction between treatment and age, risk group, or radiation dose. Limitations include the retrospective design, nonrandomized selection bias, and the absence of treatment toxicity, hormone duration, and cancer-specific outcomes. Between 2004 and 2012, LDR-B boost utilization declined and was associated with better OS compared to DE-EBRT alone. LDR-B boost is probably the ideal treatment option for men with unfavorable PCa, pending long-term results of randomized trials. We compared radiotherapy utilization and survival for prostate cancer (PCa) patients using a national database. We found that low-dose rate brachytherapy (LDR-B) boost, a method being used less frequently, was associated with better overall survival when compared to dose-escalated external-beam radiotherapy alone for men with unfavorable PCa. Randomized trials are needed to confirm that LDR-B boost is the ideal treatment. Copyright © 2017 European Association of Urology. Published by Elsevier B.V. All rights reserved.
Foahom Kamwa, A D; Vian, E; Agoua, G; Sénéchal, C; Bentaleb, Y; Fofana, M; Manip-M'ebobisse, N; Blanchet, P
2012-11-01
To analyze in a Caribbean population at 90% of African descent, the results of radiotherapy with androgen deprivation (AD) in high-risk prostate cancer (PCa). Fifty-nine consecutive patients with a high-risk PCa as defined by the D'AMICO classification and treated by radiotherapy with AD between January 2003 and April 2009 in our center were analyzed. The median dose of radiation and the median duration of AD were 70Gy and 37months respectively. Biochemical recurrence (BF), as primary outcome was defined according to the PHOENIX criteria (nadir PSA+2ng/mL). Multivariate analysis was performed to identify predictive factors of BF. The median follow-up was 47months. Eight (13.6%) patients had BF and four (6.8%) developed metastases. Six (10.2%) died during the follow-up. The 5years acturial biochemical disease-free survival was 79.7%. Multivariate analyses have shown that Gleason sum (GS) superior to 7 (P=0.029), AD duration less than 24months (P=0.004) and the rate of Nadir PSA greater or equal to 0.5ng/mL (P=0.011) were independent predictive factors of BF. This study was the first to our knowledge, to provide that radiotherapy associate with AD for HRPC among Caribbean men is effective as observed in other populations. Patients with GS superior to 7 could be considered for more aggressive treatments in clinical trials. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
Rollins, Derrick K; Teh, Ailing
2010-12-17
Microarray data sets provide relative expression levels for thousands of genes for a small number, in comparison, of different experimental conditions called assays. Data mining techniques are used to extract specific information of genes as they relate to the assays. The multivariate statistical technique of principal component analysis (PCA) has proven useful in providing effective data mining methods. This article extends the PCA approach of Rollins et al. to the development of ranking genes of microarray data sets that express most differently between two biologically different grouping of assays. This method is evaluated on real and simulated data and compared to a current approach on the basis of false discovery rate (FDR) and statistical power (SP) which is the ability to correctly identify important genes. This work developed and evaluated two new test statistics based on PCA and compared them to a popular method that is not PCA based. Both test statistics were found to be effective as evaluated in three case studies: (i) exposing E. coli cells to two different ethanol levels; (ii) application of myostatin to two groups of mice; and (iii) a simulated data study derived from the properties of (ii). The proposed method (PM) effectively identified critical genes in these studies based on comparison with the current method (CM). The simulation study supports higher identification accuracy for PM over CM for both proposed test statistics when the gene variance is constant and for one of the test statistics when the gene variance is non-constant. PM compares quite favorably to CM in terms of lower FDR and much higher SP. Thus, PM can be quite effective in producing accurate signatures from large microarray data sets for differential expression between assays groups identified in a preliminary step of the PCA procedure and is, therefore, recommended for use in these applications.
Upadhyay, Rohit; Mishra, Hari Niwas
2016-04-01
The simultaneous optimization of a synergistic blend of oleoresin sage (SAG) and ascorbyl palmitate (AP) in sunflower oil (SO) was performed using central composite and rotatable design coupled with principal component analysis (PCA) and response surface methodology (RSM). The physicochemical parameters viz., peroxide value, anisidine value, free fatty acids, induction period, total polar matter, antioxidant capacity and conjugated diene value were considered as response variables. PCA reduced the original set of correlated responses to few uncorrelated principal components (PC). The PC1 (eigen value, 5.78; data variance explained, 82.53 %) was selected for optimization using RSM. The quadratic model adequately described the data (R (2) = 0. 91, p < 0.05) and lack of fit was insignificant (p > 0.05). The contour plot of PC 1 score indicated the optimal synergistic combination of 1289.19 and 218.06 ppm for SAG and AP, respectively. This combination of SAG and AP resulted in shelf life of 320 days at 25 °C estimated using linear shelf life prediction model. In conclusion, the versatility of PCA-RSM approach has resulted in an easy interpretation in multiple response optimizations. This approach can be considered as a useful guide to develop new oil blends stabilized with food additives from natural sources.
NASA Astrophysics Data System (ADS)
Chen, Yi-Ying; Chu, Chia-Ren; Li, Ming-Hsu
2012-10-01
SummaryIn this paper we present a semi-parametric multivariate gap-filling model for tower-based measurement of latent heat flux (LE). Two statistical techniques, the principal component analysis (PCA) and a nonlinear interpolation approach were integrated into this LE gap-filling model. The PCA was first used to resolve the multicollinearity relationships among various environmental variables, including radiation, soil moisture deficit, leaf area index, wind speed, etc. Two nonlinear interpolation methods, multiple regressions (MRS) and the K-nearest neighbors (KNNs) were examined with random selected flux gaps for both clear sky and nighttime/cloudy data to incorporate into this LE gap-filling model. Experimental results indicated that the KNN interpolation approach is able to provide consistent LE estimations while MRS presents over estimations during nighttime/cloudy. Rather than using empirical regression parameters, the KNN approach resolves the nonlinear relationship between the gap-filled LE flux and principal components with adaptive K values under different atmospheric states. The developed LE gap-filling model (PCA with KNN) works with a RMSE of 2.4 W m-2 (˜0.09 mm day-1) at a weekly time scale by adding 40% artificial flux gaps into original dataset. Annual evapotranspiration at this study site were estimated at 736 mm (1803 MJ) and 728 mm (1785 MJ) for year 2008 and 2009, respectively.
NASA Astrophysics Data System (ADS)
Lan, C. Y.; Li, M. H.; Chen, Y. Y.
2016-12-01
Appropriate estimations of gaps appeared in eddy covariance (EC) flux observations are critical to the reliability of long-term EC applications. In this study we present a semi-parametric multivariate gap-filling model for tower-based measurement of CO2 flux. The raw EC data passing QC/QA was separated into two groups, clear sky, having net radiation greater than 50 W/m2, and nighttime/cloudy. For the clear sky conditions, the principle component analysis (PCA) was used to resolve the multicollinearity relationships among various environmental variables, including net radiation, wind speed, vapor pressure deficit, soil moisture deficit, leaf area index, and soil temperature, in association with CO2 assimilated by forest. After the principal domains were determined by the PCA, the relationships between CO2 fluxes and selected PCs (key factors) were built up by nonlinear interpolations to estimate the gap-filled CO2 flux. In view of limited photosynthesis at nighttime/cloudy conditions, respiration rate of the forest ecosystem was estimated by the Lloyd-Tylor equation. Artificial gaps were randomly selected to exam the applicability of our PCA approach. Based on tower-based measurement of CO2 flux at the Lien-Hua-Chih site, a total of 5.8 ton-C/ha/yr was assimilated in 2012.
Villa-Manríquez, J F; Castro-Ramos, J; Gutiérrez-Delgado, F; Lopéz-Pacheco, M A; Villanueva-Luna, A E
2017-08-01
In this study we identify and classify high and low levels of glycated hemoglobin (HbA1c) in healthy volunteers (HV) and diabetic patients (DP). Overall, 86 subjects were evaluated. The Raman spectrum was measured in three anatomical regions of the body: index fingertip, right ear lobe, and forehead. The measurements were performed to compare the difference between the HV and DP (22 well controlled diabetic patients (WCDP) (HbA1c <6.5%), and 49 not controlled diabetic patients (NCDP) (HbA1c ≥6.5%)). Multivariable methods such as principal components analysis (PCA) combined with support vector machine (SVM) were used to develop effective diagnostic algorithms for classification among these groups. The forehead of HV versus WCDP showed the highest sensitivity (100%) and specificity (100%). Sensitivity (100%) and specificity (60%), were highest in the forehead of WCDP, versus NCDP. In HV versus NCDP, the fingertip had the highest sensitivity (100%) and specificity (80%). The efficacy of the diagnostic algorithm by receiver operating characteristic (ROC) curve was confirmed. Overall, our study demonstrated that the combination of Raman spectroscopy and PCA-SVM are feasible non-invasive diagnostic tool in diabetes to classify qualitatively high and low levels of HbA1c in vivo. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Cuypers, Maarten; Lamers, Romy E D; de Vries, Marieke; Husson, Olga; Kil, Paul J M; van de Poll-Franse, Lonneke V
2016-11-01
To investigate decision-making role preferences and their association with the evaluation of information received in a sample of low-risk and intermediate-risk prostate cancer (Pca) survivors. Cross-sectional study involved 562 men diagnosed with low-risk or intermediate-risk Pca (median time since diagnosis, 48mo), measuring preferred decision-making role (Control Preference Scale) and the evaluation of information received (EORTC QLQ-INFO25). Analyses were performed using analysis of variance, chi-square tests, and multivariable linear regression models. Men who preferred a passive role were older and less educated than other preference groups and more often selected a noninvasive treatment option (all with P<0.001). The passive role preference group reported having received less information, judged the received information as less helpful, and indicated lower overall satisfaction with information received (all with P<0.05). Role preference groups did not differ in their desire to receive more information. Compared with nonpassive preference groups, the preference for a passive role in Pca treatment decision-making is associated with less satisfaction with the information received. Assessment of role preferences and tailored information provision could improve satisfaction with information received and perhaps may ultimately lead to improved patient participation in treatment decision-making. Copyright © 2016 Elsevier Inc. All rights reserved.
Russo, Giorgio I; Cimino, Sebastiano; Giranio, Giorgia; Regis, Federica; Favilla, Vincenzo; Privitera, Salvatore; Motta, Fabio; Caltabiano, Rosario; Stenzl, Arnulf; Todenhöfer, Tilman; Morgia, Giuseppe
2018-05-01
To evaluate the association between metabolic syndrome (MetS) and proliferative inflammatory atrophy (PIA) in patients with suspected prostate cancer (PCa). From June 2015 to July 2016, we conducted the FIERY (Flogosis Increased Events of pRostatic biopsY) study at the Urology section, Department of Surgery of the University of Catania (Local registration number: #131/2015). A total of 205 patients with elevated prostate-specific antigen (≥ 4 ng/ml) or clinical suspicion of PCa who underwent primary transperineal prostate biopsy were included in this cross-sectional study. The assessment of PIA, HGPIN, and PCa were performed by 2 experienced pathologists and samples were investigated for the presence of an inflammatory infiltrate, according to the Irani score. Primary and secondary Gleason grade of tumor in positive biopsies were evaluated according to the 2016 ISUP Modified Gleason System. In the entire cohort, median age was 68.0 (interquartile range: 62.0-74.5), median prostate-specific antigen was 6.5 (interquartile range: 5.51-9.57). The prevalence of MetS was 34.1%, the detection rate of PCa was 32.7%, the rate of PIA was 28.3%, the rate of HGPIN was 32.2%, whereas the rate of severe intraprostatic inflammation (Irani-score ≥4) was 28.8%. When comparing clinical and histological variables in patients without and with PIA, metabolic aberrations where not significantly different in both groups. We did not find statistical association in detection rate of PCa (29.3% vs. 34.0%; P = 0.07) and HGPIN (27.6% vs. 34.0%; P = 0.37) in patients with and without PIA, respectively. When considering metabolic aberrations, MetS was not associated with Irani-score ≥4 (28.6% vs. 28.4%; P = 0.96) and none of each component was statistically predictive of severe inflammation. At the multivariable logistic regression analysis, PIA, HGPIN, and MetS were not associated with greater risk of PCa. In this study, we did not show an association between MetS and PIA and PCa. Although the small sample size and the cross-sectional nature of the study, we do not suppose that MetS could be associated with greater evidence of PIA. Further studies should be conducted to evaluate the exact nature of this pathological lesion. Copyright © 2018 Elsevier Inc. All rights reserved.
Priority of VHS Development Based in Potential Area using Principal Component Analysis
NASA Astrophysics Data System (ADS)
Meirawan, D.; Ana, A.; Saripudin, S.
2018-02-01
The current condition of VHS is still inadequate in quality, quantity and relevance. The purpose of this research is to analyse the development of VHS based on the development of regional potential by using principal component analysis (PCA) in Bandung, Indonesia. This study used descriptive qualitative data analysis using the principle of secondary data reduction component. The method used is Principal Component Analysis (PCA) analysis with Minitab Statistics Software tool. The results of this study indicate the value of the lowest requirement is a priority of the construction of development VHS with a program of majors in accordance with the development of regional potential. Based on the PCA score found that the main priority in the development of VHS in Bandung is in Saguling, which has the lowest PCA value of 416.92 in area 1, Cihampelas with the lowest PCA value in region 2 and Padalarang with the lowest PCA value.
Gao, Boyan; Qin, Fang; Ding, Tingting; Chen, Yineng; Lu, Weiying; Yu, Liangli Lucy
2014-08-13
Ultraperformance liquid chromatography mass spectrometry (UPLC-MS), flow injection mass spectrometry (FIMS), and headspace gas chromatography (headspace-GC) combined with multivariate data analysis techniques were examined and compared in differentiating organically grown oregano from that grown conventionally. It is the first time that headspace-GC fingerprinting technology is reported in differentiating organically and conventionally grown spice samples. The results also indicated that UPLC-MS, FIMS, and headspace-GC-FID fingerprints with OPLS-DA were able to effectively distinguish oreganos under different growing conditions, whereas with PCA, only FIMS fingerprint could differentiate the organically and conventionally grown oregano samples. UPLC fingerprinting provided detailed information about the chemical composition of oregano with a longer analysis time, whereas FIMS finished a sample analysis within 1 min. On the other hand, headspace GC-FID fingerprinting required no sample pretreatment, suggesting its potential as a high-throughput method in distinguishing organically and conventionally grown oregano samples. In addition, chemical components in oregano were identified by their molecular weight using QTOF-MS and headspace-GC-MS.
Moldes, Carlos Alberto; Cantarelli, Miguel Angel; Camiña, José Manuel; Tsai, Siu Mui; Azevedo, Ricardo Antunes
2017-10-11
Amino acid profiles are useful to analyze the responses to glyphosate in susceptible and resistant soybean lines. Comparisons of profiles for 10 amino acids (Asp, Asn, Glu, Gln, Ser, His, Gly, Thr, Tyr, Leu) by HPLC in soybean roots were performed in two near isogenic pairs (four varieties). Foliar application of glyphosate was made to soybean plants after 5 weeks of seeding. Roots of four varieties were collected at 0 and 72 h after glyphosate application (AGA) for amino acid analysis by HPLC. Univariate analysis showed a significant increase of several amino acids in susceptible as well as resistant soybean lines; however, amino acids from the major pathways of carbon (C) and nitrogen (N) metabolism, such as Asp, Asn, Glu and Gln, and Ser, increased significantly in susceptible varieties at 72 h AGA. Multivariate analysis using principal component analysis (2D PCA and 3D PCA) allowed different groups to be identified and discriminated based on the soybean genetic origin, showing the amino acid responses on susceptible and resistant varieties. Based on the results, it is possible to infer that the increase of Asn, Asp, Glu, Gln, and Ser in susceptible varieties would be related to the deregulation of C and N metabolism, as well as changes in the growth mechanisms regulated by Ser.
Lust, Andres; Strachan, Clare J; Veski, Peep; Aaltonen, Jaakko; Heinämäki, Jyrki; Yliruusi, Jouko; Kogermann, Karin
2015-01-01
The conversion of active pharmaceutical ingredient (API) from amorphous to crystalline form is the primary stability issue in formulating amorphous solid dispersions (SDs). The aim of the present study was to carry out qualitative and quantitative analysis of the physical solid-state stability of the SDs of poorly water-soluble piroxicam (PRX) and polyvinyl caprolactam-polyvinyl acetate-polyethylene-glycol graft copolymer (Soluplus(®)). The SDs were prepared by a solvent evaporation method and stored for six months at 0% RH/6 °C, 0% RH/25 °C, 40% RH/25 °C and 75% RH/25 °C. Fourier transform infrared spectroscopy equipped with attenuated total reflection accessory (ATR-FTIR) and Raman spectroscopy were used for characterizing the physical solid-state changes and drug-polymer interactions. The principal component analysis (PCA) and multivariate curve resolution alternating least squares (MCR-ALS) were used for the qualitative and quantitative analysis of Raman spectra collected during storage. When stored at 0% RH/6 °C and at 0% RH/25 °C, PRX in SDs remained in an amorphous form since no recrystallization was observed by ATR-FTIR and Raman spectroscopy. Raman spectroscopy coupled with PCA and MCR-ALS and ATR-FTIR spectroscopy enabled to detect the recrystallization of amorphous PRX in the samples stored at higher humidity. Copyright © 2015 Elsevier B.V. All rights reserved.
Racial Differences in the Surgical Care of Medicare Beneficiaries With Localized Prostate Cancer.
Schmid, Marianne; Meyer, Christian P; Reznor, Gally; Choueiri, Toni K; Hanske, Julian; Sammon, Jesse D; Abdollah, Firas; Chun, Felix K H; Kibel, Adam S; Tucker-Seeley, Reginald D; Kantoff, Philip W; Lipsitz, Stuart R; Menon, Mani; Nguyen, Paul L; Trinh, Quoc-Dien
2016-01-01
There is extensive evidence suggesting that black men with localized prostate cancer (PCa) have worse cancer-specific mortality compared with their non-Hispanic white counterparts. To evaluate racial disparities in the use, quality of care, and outcomes of radical prostatectomy (RP) in elderly men (≥ 65 years) with nonmetastatic PCa. This retrospective analysis of outcomes stratified according to race (black vs non-Hispanic white) included 2020 elderly black patients (7.6%) and 24,462 elderly non-Hispanic white patients (92.4%) with localized PCa who underwent RP within the first year of PCa diagnosis in the Surveillance, Epidemiology, and End Results (SEER)-Medicare database between 1992 and 2009. The study was performed in 2014. Process of care (ie, time to treatment, lymph node dissection), as well as outcome measures (ie, complications, emergency department visits, readmissions, PCa-specific and all-cause mortality, costs) were evaluated using Cox proportional hazards regression. Multivariable conditional logistic regression and quantile regression were used to study the association of racial disparities with process of care and outcome measures. The proportion of black patients with localized prostate cancer who underwent RP within 90 days was 59.4% vs 69.5% of non-Hispanic white patients (P < 001). In quantile regression of the top 50% of patients, blacks had a 7-day treatment delay compared with non-Hispanic whites. (P < 001). Black patients were less likely to undergo lymph node dissection (odds ratio [OR], 0.76 [95% CI, 0.66-0.80]; P < .001) but had higher odds of postoperative visits to the emergency department (within 30 days: OR, 1.48 [95% CI, 1.18-1.86]); after 30 days or more (OR, 1.45 [95% CI, 1.19-1.76]) and readmissions (within 30 days: OR, 1.28 [95% CI, 1.02-1.61]); ≥ 30 days (OR, 1.27 [95% CI, 1.07-1.51]) compared with non-Hispanic whites. The surgical treatment of black patients was associated with a higher incremental annual cost (the top 50% of blacks spent $1185.50 (95% CI , $804.85-1 $1566.10; P < .001) more than the top 50% of non-Hispanic whites). There was no difference in PCa-specific mortality (P = .16) or all-cause mortality (P = .64) between black and non-Hispanic white men. Blacks treated with RP for localized PCa are more likely to experience adverse events and incur higher costs compared with non-Hispanic white men; however, this does not translate into a difference in PCa-specific or all-cause mortality.
Effect of dehydration on the development of collaterals in acute middle cerebral artery occlusion.
Chang, S-W; Huang, Y-C; Lin, L-C; Yang, J-T; Weng, H-H; Tsai, Y-H; Lee, T-H
2016-03-01
Recent large series studies have demonstrated that dehydration is common amongst stroke subjects and is associated with poor outcome. However, the effects of hydration status on the development of collaterals have never been discussed. In this study, the hypothesis that hydration status is an important factor for developing collaterals after acute middle cerebral artery (MCA) infarction was tested. Eighty-seven patients with acute infarction due to occlusion of the MCA were enrolled. Two collateral markers, posterior cerebral artery (PCA) laterality and fluid-attenuated inversion recovery hyperintense vessels (HVs) were assessed from magnetic resonance imaging. Dehydration status was defined by a nitrogen to creatinine ratio ≧ of 15. The associations between dehydration status and the development of collaterals were estimated. Sixty-one of 87 patients (70.1%) were identified as dehydrated. The development of PCA laterality and HVs shows a significant difference between dehydrated and euhydrated patients. A serum nitrogen to creatinine ratio <15, diastolic blood pressure and the presence of a dense MCA on computed tomography were significantly associated with the development of PCA laterality. A serum nitrogen to creatinine ratio <15, the initial National Institutes of Health Stroke Scale score, the presence of a dense MCA and calcifications of the internal carotid artery on computed tomography were significantly associated with the development of HVs. Dehydration remained an independent negative predictor for the development of PCA laterality and HVs in the multivariate analysis. Hydration status is associated with the development of collateral flow after acute MCA occlusion. This preliminary study provides an imaging clue that hydration status and early hydration therapy could be important for acute stroke management. © 2016 EAN.
Gökce, Mehmet I; Wang, Xuemei; Frost, Jacqueline; Roberson, Pamela; Volk, Robert J; Brooks, Durado; Canfield, Steven E; Pettaway, Curtis A
2017-02-15
The American Cancer Society (ACS) recommends men have the opportunity to make an informed decision about screening for prostate cancer (PCa). The ACS developed a unique decision aid (ACS-DA) for this purpose. However, to date, studies evaluating the efficacy of the ACS-DA are lacking. The authors evaluated the ACS-DA among a cohort of medically underserved men (MUM). A multiethnic cohort of MUM (n = 285) was prospectively included between June 2010 and December 2014. The ACS-DA was presented in a group format. Levels of knowledge on PCa were evaluated before and after the presentation. Participants' decisional conflict and thoughts about the presentation also were evaluated. Logistic regression analyses were performed to determine factors associated with having an adequate level of knowledge. Before receiving the ACS-DA, 33.1% of participants had adequate knowledge on PCa, and this increased to 77% after the DA (P < .0001). On multivariate analysis, higher education level (odds ratio, 11.19; P = .001) and history of another cancer (odds ratio, 7.45; P = .03) were associated with having adequate knowledge after receiving the DA. Levels of decisional conflict were low and were correlated with levels of knowledge after receiving the DA. The majority of men also rated the presentation as favorable and would recommend the ACS-DA to others. Use of the ACS-DA was feasible among MUM and led to increased PCa knowledge. This also correlated with low levels of decisional conflict. The ACS-DA presented to groups of men may serve as a feasible tool for informed decision making in a MUM population. Cancer 2017;123:583-591. © 2016 American Cancer Society. © 2016 American Cancer Society.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Van Benthem, Mark Hilary; Mowry, Curtis Dale; Kotula, Paul Gabriel
Thermal decomposition of poly dimethyl siloxane compounds, Sylgard{reg_sign} 184 and 186, were examined using thermal desorption coupled gas chromatography-mass spectrometry (TD/GC-MS) and multivariate analysis. This work describes a method of producing multiway data using a stepped thermal desorption. The technique involves sequentially heating a sample of the material of interest with subsequent analysis in a commercial GC/MS system. The decomposition chromatograms were analyzed using multivariate analysis tools including principal component analysis (PCA), factor rotation employing the varimax criterion, and multivariate curve resolution. The results of the analysis show seven components related to offgassing of various fractions of siloxanes that varymore » as a function of temperature. Thermal desorption coupled with gas chromatography-mass spectrometry (TD/GC-MS) is a powerful analytical technique for analyzing chemical mixtures. It has great potential in numerous analytic areas including materials analysis, sports medicine, in the detection of designer drugs; and biological research for metabolomics. Data analysis is complicated, far from automated and can result in high false positive or false negative rates. We have demonstrated a step-wise TD/GC-MS technique that removes more volatile compounds from a sample before extracting the less volatile compounds. This creates an additional dimension of separation before the GC column, while simultaneously generating three-way data. Sandia's proven multivariate analysis methods, when applied to these data, have several advantages over current commercial options. It also has demonstrated potential for success in finding and enabling identification of trace compounds. Several challenges remain, however, including understanding the sources of noise in the data, outlier detection, improving the data pretreatment and analysis methods, developing a software tool for ease of use by the chemist, and demonstrating our belief that this multivariate analysis will enable superior differentiation capabilities. In addition, noise and system artifacts challenge the analysis of GC-MS data collected on lower cost equipment, ubiquitous in commercial laboratories. This research has the potential to affect many areas of analytical chemistry including materials analysis, medical testing, and environmental surveillance. It could also provide a method to measure adsorption parameters for chemical interactions on various surfaces by measuring desorption as a function of temperature for mixtures. We have presented results of a novel method for examining offgas products of a common PDMS material. Our method involves utilizing a stepped TD/GC-MS data acquisition scheme that may be almost totally automated, coupled with multivariate analysis schemes. This method of data generation and analysis can be applied to a number of materials aging and thermal degradation studies.« less
An adaptive confidence limit for periodic non-steady conditions fault detection
NASA Astrophysics Data System (ADS)
Wang, Tianzhen; Wu, Hao; Ni, Mengqi; Zhang, Milu; Dong, Jingjing; Benbouzid, Mohamed El Hachemi; Hu, Xiong
2016-05-01
System monitoring has become a major concern in batch process due to the fact that failure rate in non-steady conditions is much higher than in steady ones. A series of approaches based on PCA have already solved problems such as data dimensionality reduction, multivariable decorrelation, and processing non-changing signal. However, if the data follows non-Gaussian distribution or the variables contain some signal changes, the above approaches are not applicable. To deal with these concerns and to enhance performance in multiperiod data processing, this paper proposes a fault detection method using adaptive confidence limit (ACL) in periodic non-steady conditions. The proposed ACL method achieves four main enhancements: Longitudinal-Standardization could convert non-Gaussian sampling data to Gaussian ones; the multiperiod PCA algorithm could reduce dimensionality, remove correlation, and improve the monitoring accuracy; the adaptive confidence limit could detect faults under non-steady conditions; the fault sections determination procedure could select the appropriate parameter of the adaptive confidence limit. The achieved result analysis clearly shows that the proposed ACL method is superior to other fault detection approaches under periodic non-steady conditions.
Fully optimized discrimination of physiological responses to auditory stimuli
Kruglikov, Stepan Y; Chari, Sharmila; Rapp, Paul E; Weinstein, Steven L; Given, Barbara K; Schiff, Steven J
2008-01-01
The use of multivariate measurements to characterize brain activity (electrical, magnetic, optical) is widespread. The most common approaches to reduce the complexity of such observations include principal and independent component analyses (PCA and ICA), which are not well suited for discrimination tasks. We addressed two questions: first, how do the neurophysiological responses to elongated phonemes relate to tone and phoneme responses in normal children, and, second, how discriminable are these responses. We employed fully optimized linear discrimination analysis to maximally separate the multi-electrode responses to tones and phonemes, and classified the response to elongated phonemes. We find that discrimination between tones and phonemes is dependent upon responses from associative regions of the brain apparently distinct from the primary sensory cortices typically emphasized by PCA or ICA, and that the neuronal correlates corresponding to elongated phonemes are highly variable in normal children (about half respond with neural correlates of tones and half as phonemes). Our approach is made feasible by the increase in computational power of ordinary personal computers and has significant advantages for a wide range of neuronal imaging modalities. PMID:18430975
Cormio, Luigi; Lucarelli, Giuseppe; Netti, Giuseppe Stefano; Stallone, Giovanni; Selvaggio, Oscar; Troiano, Francesco; Di Fino, Giuseppe; Sanguedolce, Francesca; Bufo, Pantaleo; Grandaliano, Giuseppe; Carrieri, Giuseppe
2015-04-01
to determine whether peak flow rate (PFR) and post-void residual urinary volume (PVRUV) predict prostate biopsy outcome. The study population consisted of 1780 patients undergoing first prostate biopsy. Patients with prostate cancer (PCa) had significantly greater prostate-specific antigen (PSA) and PFR but lower prostate volume (PVol) and PVRUV than those without PCa. Receiver operator characteristic curve analysis showed that PVol and PVRUV were the most accurate predictors of biopsy outcome. The addition of PVRUV to the multivariate logistic regression model based on standard clinical parameters (age, PSA, digital rectal examination, PVol) significantly increased the predictive accuracy of the model in both the population overall (79% vs. 77%; p=0.001) and patients with PSA levels up to 10 ng/ml (74.3% vs. 71.7%; p=0.005). PVRUV seems to be an accurate non-invasive test to predict biopsy outcome that can be used alone or in combination with PVol in the decision-making process for men potentially facing a prostate biopsy. Copyright© 2015 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.
Inoue, Koichi; Tanada, Chihiro; Sakamoto, Tasuku; Tsutsui, Haruhito; Akiba, Takashi; Min, Jun Zhe; Todoroki, Kenichiro; Yamano, Yutaka; Toyo'oka, Toshimasa
2015-08-15
In this study including the field of metabolomics approach for food, the evaluation of untargeted compounds using HILIC-ESI/TOF/MS and multivariate statistical analysis method is proposed for the assessment of classification, contamination and degradation of infant formula. HILIC mode is used to monitor more detected numbers in infant formulas in the ESI-positive scan mode than the reversed phase. The repeatability of the non-targeted contents from 4 kinds of infant formulas based on PCA was less than the relative standard deviation of 15% in all groups. The PCA pattern showed that significant differences in the classification of types and origins, the contamination of melamine and the degradations for one week were evaluated using HILIC-ESI/TOF/MS. In the S-plot from the degradation test, we could identify two markers by comparison to standards as nicotinic acid and nicotinamide. With this strategy, the differences from the untargeted compounds could be utilized for quality and safety assessment of infant formula. Copyright © 2015 Elsevier Ltd. All rights reserved.
Behavior of the PCA3 gene in the urine of men with high grade prostatic intraepithelial neoplasia.
Morote, Juan; Rigau, Marina; Garcia, Marta; Mir, Carmen; Ballesteros, Carlos; Planas, Jacques; Raventós, Carles X; Placer, José; de Torres, Inés M; Reventós, Jaume; Doll, Andreas
2010-12-01
An ideal marker for the early detection of prostate cancer (PCa) should also differentiate between men with isolated high grade prostatic intraepithelial neoplasia (HGPIN) and those with PCa. Prostate Cancer Gene 3 (PCA3) is a highly specific PCa gene and its score, in relation to the PSA gene in post-prostate massage urine (PMU-PCA3), seems to be useful in ruling out PCa, especially after a negative prostate biopsy. Because PCA3 is also expressed in the HGPIN lesion, the aim of this study was to determine the efficacy of PMU-PCA3 scores for ruling out PCa in men with previous HGPIN. The PMU-PCA3 score was assessed by quantitative PCR (multiplex research assay) in 244 men subjected to prostate biopsy: 64 men with an isolated HGPIN (no cancer detected after two or more repeated biopsies), 83 men with PCa and 97 men with benign pathology findings (BP: no PCa, HGPIN or ASAP). The median PMU-PCA3 score was 1.56 in men with BP, 2.01 in men with HGPIN (p = 0.128) and 9.06 in men with PCa (p = 0.008). The AUC in the ROC analysis was 0.705 in the subset of men with BP and PCa, while it decreased to 0.629 when only men with isolated HGPIN and PCa were included in the analysis. Fixing the sensitivity of the PMU-PCA3 score at 90%, its specificity was 79% in men with BP and 69% in men with isolated HGPIN. The efficacy of the PMU-PCA3 score to rule out PCa in men with HGPIN is lower than in men with BP.
Common factor analysis versus principal component analysis: choice for symptom cluster research.
Kim, Hee-Ju
2008-03-01
The purpose of this paper is to examine differences between two factor analytical methods and their relevance for symptom cluster research: common factor analysis (CFA) versus principal component analysis (PCA). Literature was critically reviewed to elucidate the differences between CFA and PCA. A secondary analysis (N = 84) was utilized to show the actual result differences from the two methods. CFA analyzes only the reliable common variance of data, while PCA analyzes all the variance of data. An underlying hypothetical process or construct is involved in CFA but not in PCA. PCA tends to increase factor loadings especially in a study with a small number of variables and/or low estimated communality. Thus, PCA is not appropriate for examining the structure of data. If the study purpose is to explain correlations among variables and to examine the structure of the data (this is usual for most cases in symptom cluster research), CFA provides a more accurate result. If the purpose of a study is to summarize data with a smaller number of variables, PCA is the choice. PCA can also be used as an initial step in CFA because it provides information regarding the maximum number and nature of factors. In using factor analysis for symptom cluster research, several issues need to be considered, including subjectivity of solution, sample size, symptom selection, and level of measure.
Analysis of the principal component algorithm in phase-shifting interferometry.
Vargas, J; Quiroga, J Antonio; Belenguer, T
2011-06-15
We recently presented a new asynchronous demodulation method for phase-sampling interferometry. The method is based in the principal component analysis (PCA) technique. In the former work, the PCA method was derived heuristically. In this work, we present an in-depth analysis of the PCA demodulation method.
Rider, Jennifer R.; Wilson, Kathryn M.; Sinnott, Jennifer A.; Kelly, Rachel S.; Mucci, Lorelei A.; Giovannucci, Edward L.
2016-01-01
Background Evidence suggests that ejaculation frequency may be inversely related to the risk of prostate cancer (PCa), a disease for which few modifiable risk factors have been identified. Objective To incorporate an additional 10 yr of follow-up into an original analysis and to comprehensively evaluate the association between ejaculation frequency and PCa, accounting for screening, clinically relevant disease subgroups, and the impact of mortality from other causes. Design, setting, and participants A prospective cohort study of participants in the Health Professionals Follow-up Study utilizing self-reported data on average monthly ejaculation frequency. The study includes 31 925 men who answered questions on ejaculation frequency on a 1992 questionnaire and followed through to 2010. The average monthly ejaculation frequency was assessed at three time points: age 20–29 yr, age 40–49 yr, and the year before questionnaire distribution. Outcome measurements and statistical analysis Incidence of total PCa and clinically relevant disease subgroups. Cox models were used to estimate hazard ratios (HRs) and 95% confidence intervals (CIs). Results and limitations During 480 831 person-years, 3839 men were diagnosed with PCa. Ejaculation frequency at age 40–49 yr was positively associated with age-standardized body mass index, physical activity, divorce, history of sexually transmitted infections, and consumption of total calories and alcohol. Prostate-specific antigen (PSA) test utilization by 2008, number of PSA tests, and frequency of prostate biopsy were similar across frequency categories. In multivariable analyses, the hazard ratio for PCa incidence for ≥21 compared to 4–7 ejaculations per month was 0.81 (95% confidence interval [CI] 0.72–0.92; p < 0.0001 for trend) for frequency at age 20–29 yr and 0.78 (95% CI 0.69–0.89; p < 0.0001 for trend) for frequency at age 40–49 yr. Associations were driven by low-risk disease, were similar when restricted to a PSA-screened cohort, and were unlikely to be explained by competing causes of death. Conclusions These findings provide additional evidence of a beneficial role of more frequent ejaculation throughout adult life in the etiology of PCa, particularly for low-risk disease. Patient summary We evaluated whether ejaculation frequency throughout adulthood is related to prostate cancer risk in a large US-based study. We found that men reporting higher compared to lower ejaculatory frequency in adulthood were less likely to be subsequently diagnosed with prostate cancer. PMID:27033442
Wójcicki, Krzysztof; Khmelinskii, Igor; Sikorski, Marek; Sikorska, Ewa
2015-11-15
Infrared spectroscopic techniques and chemometric methods were used to study oxidation of olive, sunflower and rapeseed oils. Accelerated oxidative degradation of oils at 60°C was monitored using peroxide values and FT-MIR ATR and FT-NIR transmittance spectroscopy. Principal component analysis (PCA) facilitated visualization and interpretation of spectral changes occurring during oxidation. Multivariate curve resolution (MCR) method found three spectral components in the NIR and MIR spectral matrix, corresponding to the oxidation products, and saturated and unsaturated structures. Good quantitative relation was found between peroxide value and contribution of oxidation products evaluated using MCR--based on NIR (R(2) = 0.890), MIR (R(2) = 0.707) and combined NIR and MIR (R(2) = 0.747) data. Calibration models for prediction peroxide value established using partial least squares (PLS) regression were characterized for MIR (R(2) = 0.701, RPD = 1.7), NIR (R(2) = 0.970, RPD = 5.3), and combined NIR and MIR data (R(2) = 0.954, RPD = 3.1). Copyright © 2015 Elsevier Ltd. All rights reserved.
Elkhoudary, Mahmoud M; Abdel Salam, Randa A; Hadad, Ghada M
2014-09-15
Metronidazole (MNZ) is a widely used antibacterial and amoebicide drug. Therefore, it is important to develop a rapid and specific analytical method for the determination of MNZ in mixture with Spiramycin (SPY), Diloxanide (DIX) and Cliquinol (CLQ) in pharmaceutical preparations. This work describes simple, sensitive and reliable six multivariate calibration methods, namely linear and nonlinear artificial neural networks preceded by genetic algorithm (GA-ANN) and principle component analysis (PCA-ANN) as well as partial least squares (PLS) either alone or preceded by genetic algorithm (GA-PLS) for UV spectrophotometric determination of MNZ, SPY, DIX and CLQ in pharmaceutical preparations with no interference of pharmaceutical additives. The results manifest the problem of nonlinearity and how models like ANN can handle it. Analytical performance of these methods was statistically validated with respect to linearity, accuracy, precision and specificity. The developed methods indicate the ability of the previously mentioned multivariate calibration models to handle and solve UV spectra of the four components' mixtures using easy and widely used UV spectrophotometer. Copyright © 2014 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Yang, Haiqing; Wu, Di; He, Yong
2007-11-01
Near-infrared spectroscopy (NIRS) with the characteristics of high speed, non-destructiveness, high precision and reliable detection data, etc. is a pollution-free, rapid, quantitative and qualitative analysis method. A new approach for variety discrimination of brown sugars using short-wave NIR spectroscopy (800-1050nm) was developed in this work. The relationship between the absorbance spectra and brown sugar varieties was established. The spectral data were compressed by the principal component analysis (PCA). The resulting features can be visualized in principal component (PC) space, which can lead to discovery of structures correlative with the different class of spectral samples. It appears to provide a reasonable variety clustering of brown sugars. The 2-D PCs plot obtained using the first two PCs can be used for the pattern recognition. Least-squares support vector machines (LS-SVM) was applied to solve the multivariate calibration problems in a relatively fast way. The work has shown that short-wave NIR spectroscopy technique is available for the brand identification of brown sugar, and LS-SVM has the better identification ability than PLS when the calibration set is small.
Mitic, Violeta; Stankov Jovanovic, Vesna; Ilic, Marija; Jovanovic, Olga; Djordjevic, Aleksandra; Stojanovic, Gordana
2016-01-01
The chemical composition and in vitro antimicrobial activities of Dittrichia graveolens (L.) Greuter essential oil was studied. Moreover, using agglomerative hierarchical cluster (AHC) and principal component analyses (PCA), the interrelationships of the D. graveolens essential-oil profiles characterized so far (including the sample from this study) were investigated. To evaluate the chemical composition of the essential oil, GC-FID and GC/MS analyses were performed. Altogether, 54 compounds were identified, accounting for 92.9% of the total oil composition. The D. graveolens oil belongs to the monoterpenoid chemotype, with monoterpenoids comprising 87.4% of the totally identified compounds. The major components were borneol (43.6%) and bornyl acetate (38.3%). Multivariate analysis showed that the compounds borneol and bornyl acetate exerted the greatest influence on the spatial differences in the composition of the reported oils. The antimicrobial activity against five bacterial and one fungal strain was determined using a disk-diffusion assay. The studied essential oil was active only against Gram-positive bacteria. Copyright © 2016 Verlag Helvetica Chimica Acta AG, Zürich.
Irvine, Karen-Amanda; Ferguson, Adam R.; Mitchell, Kathleen D.; Beattie, Stephanie B.; Lin, Amity; Stuck, Ellen D.; Huie, J. Russell; Nielson, Jessica L.; Talbott, Jason F.; Inoue, Tomoo; Beattie, Michael S.; Bresnahan, Jacqueline C.
2014-01-01
The IBB scale is a recently developed forelimb scale for the assessment of fine control of the forelimb and digits after cervical spinal cord injury [SCI; (1)]. The present paper describes the assessment of inter-rater reliability and face, concurrent and construct validity of this scale following SCI. It demonstrates that the IBB is a reliable and valid scale that is sensitive to severity of SCI and to recovery over time. In addition, the IBB correlates with other outcome measures and is highly predictive of biological measures of tissue pathology. Multivariate analysis using principal component analysis (PCA) demonstrates that the IBB is highly predictive of the syndromic outcome after SCI (2), and is among the best predictors of bio-behavioral function, based on strong construct validity. Altogether, the data suggest that the IBB, especially in concert with other measures, is a reliable and valid tool for assessing neurological deficits in fine motor control of the distal forelimb, and represents a powerful addition to multivariate outcome batteries aimed at documenting recovery of function after cervical SCI in rats. PMID:25071704
NASA Astrophysics Data System (ADS)
Elkhoudary, Mahmoud M.; Abdel Salam, Randa A.; Hadad, Ghada M.
2014-09-01
Metronidazole (MNZ) is a widely used antibacterial and amoebicide drug. Therefore, it is important to develop a rapid and specific analytical method for the determination of MNZ in mixture with Spiramycin (SPY), Diloxanide (DIX) and Cliquinol (CLQ) in pharmaceutical preparations. This work describes simple, sensitive and reliable six multivariate calibration methods, namely linear and nonlinear artificial neural networks preceded by genetic algorithm (GA-ANN) and principle component analysis (PCA-ANN) as well as partial least squares (PLS) either alone or preceded by genetic algorithm (GA-PLS) for UV spectrophotometric determination of MNZ, SPY, DIX and CLQ in pharmaceutical preparations with no interference of pharmaceutical additives. The results manifest the problem of nonlinearity and how models like ANN can handle it. Analytical performance of these methods was statistically validated with respect to linearity, accuracy, precision and specificity. The developed methods indicate the ability of the previously mentioned multivariate calibration models to handle and solve UV spectra of the four components’ mixtures using easy and widely used UV spectrophotometer.
Hurwitz, Lauren M; Cullen, Jennifer; Elsamanoudi, Sally; Kim, Daniel J; Hudak, Jane; Colston, Maryellen; Travis, Judith; Kuo, Huai-Ching; Porter, Christopher R; Rosner, Inger L
2016-05-01
Patients diagnosed with prostate cancer (PCa) are presented with several treatment options of similar efficacy but varying side effects. Understanding how and why patients make their treatment decisions, as well as the effect of treatment choice on long-term outcomes, is critical to ensuring effective, patient-centered care. This study examined treatment decision-making in a racially diverse, equal-access, contemporary cohort of patients with PCa counseled on treatment options at a multidisciplinary clinic. A prospective cohort study was initiated at the Walter Reed National Military Medical Center (formerly Walter Reed Army Medical Center) in 2006. Newly diagnosed patients with PCa were enrolled before attending a multidisciplinary clinic. Patients completed surveys preclinic and postclinic to assess treatment preferences, reasons for treatment choice, and decisional regret. As of January 2014, 925 patients with PCa enrolled in this study. Surgery (54%), external radiation (20%), and active surveillance (12%) were the most common primary treatments for patients with low- and intermediate-risk PCa, whereas patients with high-risk PCa chose surgery (34%) or external radiation with neoadjuvant hormones (57%). Treatment choice differed by age at diagnosis, race, comorbidity status, and calendar year in both univariable and multivariable analyses. Patients preferred to play an active role in the decision-making process and cited doctors at the clinic as the most helpful source of treatment-related information. Almost all patients reported satisfaction with their decision. This is one of the first prospective cohort studies to examine treatment decision-making in an equal-access, multidisciplinary clinic setting. Studies of this cohort would aid in understanding and improving the PCa decision-making process. Published by Elsevier Inc.
Yang, James J; Li, Jia; Williams, L Keoki; Buu, Anne
2016-01-05
In genome-wide association studies (GWAS) for complex diseases, the association between a SNP and each phenotype is usually weak. Combining multiple related phenotypic traits can increase the power of gene search and thus is a practically important area that requires methodology work. This study provides a comprehensive review of existing methods for conducting GWAS on complex diseases with multiple phenotypes including the multivariate analysis of variance (MANOVA), the principal component analysis (PCA), the generalizing estimating equations (GEE), the trait-based association test involving the extended Simes procedure (TATES), and the classical Fisher combination test. We propose a new method that relaxes the unrealistic independence assumption of the classical Fisher combination test and is computationally efficient. To demonstrate applications of the proposed method, we also present the results of statistical analysis on the Study of Addiction: Genetics and Environment (SAGE) data. Our simulation study shows that the proposed method has higher power than existing methods while controlling for the type I error rate. The GEE and the classical Fisher combination test, on the other hand, do not control the type I error rate and thus are not recommended. In general, the power of the competing methods decreases as the correlation between phenotypes increases. All the methods tend to have lower power when the multivariate phenotypes come from long tailed distributions. The real data analysis also demonstrates that the proposed method allows us to compare the marginal results with the multivariate results and specify which SNPs are specific to a particular phenotype or contribute to the common construct. The proposed method outperforms existing methods in most settings and also has great applications in GWAS on complex diseases with multiple phenotypes such as the substance abuse disorders.
Pertoldi, Cino; Sonne, Christian; Wiig, Øystein; Baagøe, Hans J; Loeschcke, Volker; Bechshøft, Thea Østergaard
2012-06-01
A morphometric study was conducted on four skull traits of 37 male and 18 female adult East Greenland polar bears (Ursus maritimus) collected 1892-1968, and on 54 male and 44 female adult Barents Sea polar bears collected 1950-1969. The aim was to compare differences in size and shape of the bear skulls using a multivariate approach, characterizing the variation between the two populations using morphometric traits as an indicator of environmental and genetic differences. Mixture analysis testing for geographic differentiation within each population revealed three clusters for Barents Sea males and three clusters for Barents Sea females. East Greenland consisted of one female and one male cluster. A principal component analysis (PCA) conducted on the clusters defined by the mixture analysis, showed that East Greenland and Barents Sea polar bear populations overlapped to a large degree, especially with regards to females. Multivariate analyses of variance (MANOVA) showed no significant differences in morphometric means between the two populations, but differences were detected between clusters from each respective geographic locality. To estimate the importance of genetics and environment in the morphometric differences between the bears, a PCA was performed on the covariance matrix derived from the skull measurements. Skull trait size (PC1) explained approx. 80% of the morphometric variation, whereas shape (PC2) defined approx. 15%, indicating some genetic differentiation. Hence, both environmental and genetic factors seem to have contributed to the observed skull differences between the two populations. Overall, results indicate that many Barents Sea polar bears are morphometrically similar to the East Greenland ones, suggesting an exchange of individuals between the two populations. Furthermore, a subpopulation structure in the Barents Sea population was also indicated from the present analyses, which should be considered with regards to future management decisions. © 2012 The Authors.
Al-Shair, Khaled; Muldoon, Eavan G; Morris, Julie; Atherton, Graham T; Kosmidis, Chris; Denning, David W
2016-05-01
Fatigue is a prominent disabling symptom in several pulmonary diseases. Its impact on health status in patients with chronic pulmonary aspergillosis (CPA) has not been investigated. A total of 151 CPA patients attending the National Aspergillosis Centre completed Manchester COPD Fatigue Scale (MCFS), St. George's Respiratory Questionnaire (SGRQ) and Medical Research Council (MRC) dyspnoea score. Lung function and BMI were measured. Univariate, multivariate linear and binary analyses, and principal component analysis (PCA) were used. Female patients accounted for 44%. The mean (range) of age was 59.6 (31-83) years, FEV1% was 64 (14-140), BMI was 23.6 (16.3-43.4), SGRQ total score was 56 (4-96.2) and MCFS total score was 30.6 (0-54). PCA showed that 27 items of MCFS loaded on three components; physical, psychosocial and cognitive fatigue, explaining 78.4% of fatigue variance. MCFS score correlated strongly with total SGRQ score (r = 0.83, p < 0.001). Using linear multivariate analysis, fatigue was the strongest factor (beta = 0.7 p < 0.0001) associated with impaired health status, after adjusting for age, BMI, FEV1%, and MRC dyspnoea score. Using patients' 5 self-assessment grades of their health, one-way ANOVA showed that those with "very poor" health status had the highest fatigue scores (45 (±6) (p < 0.001)). Logistic regression analysis showed that fatigue score (OR = 0.9, 95% CI 0.84-0.97; p = 0.005) and FEV1% (OR = 1.03, 95% CI 1.01-1.07, p = 0.02) are significantly associated with self-assessed impaired health status after correcting for age, gender and DLCO%. Fatigue is a major component of impaired health status of CPA patients. Copyright © 2016 Elsevier Ltd. All rights reserved.
Rapid analysis of pharmaceutical drugs using LIBS coupled with multivariate analysis.
Tiwari, P K; Awasthi, S; Kumar, R; Anand, R K; Rai, P K; Rai, A K
2018-02-01
Type 2 diabetes drug tablets containing voglibose having dose strengths of 0.2 and 0.3 mg of various brands have been examined, using laser-induced breakdown spectroscopy (LIBS) technique. The statistical methods such as the principal component analysis (PCA) and the partial least square regression analysis (PLSR) have been employed on LIBS spectral data for classifying and developing the calibration models of drug samples. We have developed the ratio-based calibration model applying PLSR in which relative spectral intensity ratios H/C, H/N and O/N are used. Further, the developed model has been employed to predict the relative concentration of element in unknown drug samples. The experiment has been performed in air and argon atmosphere, respectively, and the obtained results have been compared. The present model provides rapid spectroscopic method for drug analysis with high statistical significance for online control and measurement process in a wide variety of pharmaceutical industrial applications.
NASA Astrophysics Data System (ADS)
Lipovsky, B.; Funning, G. J.
2009-12-01
We compare several techniques for the analysis of geodetic time series with the ultimate aim to characterize the physical processes which are represented therein. We compare three methods for the analysis of these data: Principal Component Analysis (PCA), Non-Linear PCA (NLPCA), and Rotated PCA (RPCA). We evaluate each method by its ability to isolate signals which may be any combination of low amplitude (near noise level), temporally transient, unaccompanied by seismic emissions, and small scale with respect to the spatial domain. PCA is a powerful tool for extracting structure from large datasets which is traditionally realized through either the solution of an eigenvalue problem or through iterative methods. PCA is an transformation of the coordinate system of our data such that the new "principal" data axes retain maximal variance and minimal reconstruction error (Pearson, 1901; Hotelling, 1933). RPCA is achieved by an orthogonal transformation of the principal axes determined in PCA. In the analysis of meteorological data sets, RPCA has been seen to overcome domain shape dependencies, correct for sampling errors, and to determine principal axes which more closely represent physical processes (e.g., Richman, 1986). NLPCA generalizes PCA such that principal axes are replaced by principal curves (e.g., Hsieh 2004). We achieve NLPCA through an auto-associative feed-forward neural network (Scholz, 2005). We show the geophysical relevance of these techniques by application of each to a synthetic data set. Results are compared by inverting principal axes to determine deformation source parameters. Temporal variability in source parameters, estimated by each method, are also compared.
Ielpo, Pierina; Leardi, Riccardo; Pappagallo, Giuseppe; Uricchio, Vito Felice
2017-06-01
In this paper, the results obtained from multivariate statistical techniques such as PCA (Principal component analysis) and LDA (Linear discriminant analysis) applied to a wide soil data set are presented. The results have been compared with those obtained on a groundwater data set, whose samples were collected together with soil ones, within the project "Improvement of the Regional Agro-meteorological Monitoring Network (2004-2007)". LDA, applied to soil data, has allowed to distinguish the geographical origin of the sample from either one of the two macroaeras: Bari and Foggia provinces vs Brindisi, Lecce e Taranto provinces, with a percentage of correct prediction in cross validation of 87%. In the case of the groundwater data set, the best classification was obtained when the samples were grouped into three macroareas: Foggia province, Bari province and Brindisi, Lecce and Taranto provinces, by reaching a percentage of correct predictions in cross validation of 84%. The obtained information can be very useful in supporting soil and water resource management, such as the reduction of water consumption and the reduction of energy and chemical (nutrients and pesticides) inputs in agriculture.
NASA Astrophysics Data System (ADS)
Liu, Yue; Zhang, Ying; Zhang, Jing; Fan, Gang; Tu, Ya; Sun, Suqin; Shen, Xudong; Li, Qingzhu; Zhang, Yi
2018-03-01
As an important ethnic medicine, sea buckthorn was widely used to prevent and treat various diseases due to its nutritional and medicinal properties. According to the Chinese Pharmacopoeia, sea buckthorn was originated from H. rhamnoides, which includes five subspecies distributed in China. Confusion and misidentification usually occurred due to their similar morphology, especially in dried and powdered forms. Additionally, these five subspecies have vital differences in quality and physiological efficacy. This paper focused on the quick classification and identification method of sea buckthorn berry powders from five H. rhamnoides subspecies using multi-step IR spectroscopy coupled with multivariate data analysis. The holistic chemical compositions revealed by the FT-IR spectra demonstrated that flavonoids, fatty acids and sugars were the main chemical components. Further, the differences in FT-IR spectra regarding their peaks, positions and intensities were used to identify H. rhamnoides subspecies samples. The discrimination was achieved using principal component analysis (PCA) and partial least square-discriminant analysis (PLS-DA). The results showed that the combination of multi-step IR spectroscopy and chemometric analysis offered a simple, fast and reliable method for the classification and identification of the sea buckthorn berry powders from different H. rhamnoides subspecies.
Lavergne, Céline; Jeison, David; Ortega, Valentina; Chamy, Rolando; Donoso-Bravo, Andrés
2018-09-15
An important variability in the experimental results in anaerobic digestion lab test has been reported. This study presents a meta-analysis coupled with multivariate analysis aiming to assess the impact of this experimental variability in batch and continuous operation at mesophilic and thermophilic anaerobic digestion of waste activated sludge. An analysis of variance showed that there was no significant difference between mesophilic and thermophilic conditions in both continuous and batch conditions. Concerning the operation mode, the values of methane yield were significantly higher in batch experiment than in continuous reactors. According to the PCA, for both cases, the methane yield is positive correlated to the temperature rises. Interestingly, in the batch experiments, the higher the volatile solids in the substrate was, the lowest was the methane production, which is correlated to experimental flaws when setting up those tests. In continuous mode, unlike the batch test, the methane yield is strongly (positively) correlated to the organic content of the substrate. Experimental standardization, above all, in batch conditions are urgently necessary or move to continuous experiments for reporting results. The modeling can also be a source of disturbance in batch test. Copyright © 2018 Elsevier Ltd. All rights reserved.
Raman spectroscopic analysis of cytotoxic effect of cisplatin-treated leukemic cells
NASA Astrophysics Data System (ADS)
Lin, Juqiang; Li, Yongzeng; Feng, Shangyuan; Chen, Rong; Chen, Guannan; Chen, Qisong; Pan, Jianji; Lin, Shaojun; Yu, Yun
2009-08-01
An antitumor drug cisplatin was employed to treat the leukemic cells and induce apoptosis of the cancer cells. Confocal Raman micro-spectroscopy has been applied to investigate the effectiveness of the treatment using near-infrared laser (785nm) excitation, scanning range from 500 to 2000 cm-1. The Raman spectra of leukemic cell treated with cisplatin for 4, 6, 8, 12 and 14 h were measured separately. The major difference of the apoptotic cells from the cancer cells are the reduction in intensities of vibration bands generated by cellular lipids, proteins and nucleic acids. In particular, large intensity reduction in nucleic vibrations at 782, 1092, 1320, 1340, and 1578 cm-1 was observed upon apoptosis of the leukemic cells. Up to 45% reduction in the magnitude of the 782 cm-1 peak in Raman spectra of the apoptotic cells was observed, which suggests the breakdown of phosphodiester bonds and DNA bases. We showed that the principal components analysis (PCA), a multivariate statistical tool, can be used to distinguish single apoptotic cells and leukemic cells based on their Raman spectra. Our results indicate that the Raman spectroscopy with PCA is a novel, nondestructive mean for studying the cisplatin -treated leukemic cells, which could also provide useful data for clinical dosage optimization for cisplatin.
Use of Standing Gold Nanorods for Detection of Malachite Green and Crystal Violet in Fish by SERS.
Chen, Xiaowei; Nguyen, Trang H D; Gu, Liqun; Lin, Mengshi
2017-07-01
With growing consumption of aquaculture products, there is increasing demand on rapid and sensitive techniques that can detect prohibited substances in the seafood products. This study aimed to develop a novel surface-enhanced Raman spectroscopy (SERS) method coupled with simplified extraction protocol and novel gold nanorod (AuNR) substrates to detect banned aquaculture substances (malachite green [MG] and crystal violet [CV]) and their mixture (1:1) in aqueous solution and fish samples. Multivariate statistical tools such as principal component analysis (PCA) and partial least squares regression (PLSR) were used in data analysis. PCA results demonstrate that SERS can distinguish MG, CV and their mixture (1:1) in aqueous solution and in fish samples. The detection limit of SERS coupled with standing AuNR substrates is 1 ppb for both MG and CV in fish samples. A good linear relationship between the actual concentration and predicted concentration of analytes based on PLSR models with R 2 values from 0.87 to 0.99 were obtained, indicating satisfactory quantification results of this method. These results demonstrate that the SERS method coupled with AuNR substrates can be used for rapid and accurate detection of MG and CV in fish samples. © 2017 Institute of Food Technologists®.
Zhang, Qianqian; Wang, Huiwei; Wang, Yanchao; Yang, Mingnan; Zhu, Liang
2017-07-01
Deterioration in groundwater quality has attracted wide social interest in China. In this study, groundwater quality was monitored during December 2014 at 115 sites in the Hutuo River alluvial-pluvial fan region of northern China. Results showed that 21.7% of NO 3 - and 51.3% of total hardness samples exceeded grade III of the national quality standards for Chinese groundwater. In addition, results of gray relationship analysis (GRA) show that 64.3, 10.4, 21.7, and 3.6% of samples were within the I, II, IV, and V grades of groundwater in the Hutuo River region, respectively. The poor water quality in the study region is due to intense anthropogenic activities as well as aquifer vulnerability to contamination. Results of principal component analysis (PCA) revealed three major factors: (1) domestic wastewater and agricultural runoff pollution (anthropogenic activities), (2) water-rock interactions (natural processes), and (3) industrial wastewater pollution (anthropogenic activities). Using PCA and absolute principal component scores-multivariate linear regression (APCS-MLR), results show that domestic wastewater and agricultural runoff are the main sources of groundwater pollution in the Hutuo River alluvial-pluvial fan area. Thus, the most appropriate methods to prevent groundwater quality degradation are to improve capacities for wastewater treatment and to optimize fertilization strategies.
Pedersen, Kristine B; Lejon, Tore; Jensen, Pernille E; Ottosen, Lisbeth M
2016-01-01
A highly oil-polluted soil from Krasnoe in North-West Russia was used to investigate the degradation of organic pollutants during electrodialytic remediation. Removal efficiencies were up to 70 % for total hydrocarbons (THC) and up to 65 % for polyaromatic hydrocarbons (PAH). Relatively more of the lighter PAH compounds and THC fractions were degraded. A principal component analysis (PCA) revealed a difference in the distribution of PAH compounds after the remediation. The observed clustering of experiments in the PCA scores plot was assessed to be related to the stirring rate. Multivariate analysis of the experimental settings and final concentrations in the 12 experiments revealed that the stirring rate of the soil suspension was by far the most important parameter for the remediation for both THC and PAH. Light was the second most important variable for PAH and seems to influence degradation. The experimental variables current density and remediation time did not significantly influence the degradation of the organic pollutants. Despite current density not influencing the remediation, there is potential for degrading organic pollutants during electrodialytic removal of heavy metals, as long as a stirred set-up is applied. Depending on remediation objectives, further optimisation may be needed in order to develop efficient remediation strategies.
Vohra, V.; Niranjan, S. K.; Mishra, A. K.; Jamuna, V.; Chopra, A.; Sharma, Neelesh; Jeong, Dong Kee
2015-01-01
Phenotypic characterization and body biometric in 13 traits (height at withers, body length, chest girth, paunch girth, ear length, tail length, length of tail up to switch, face length, face width, horn length, circumference of horn at base, distances between pin bone and hip bone) were recorded in 233 adult Gojri buffaloes from Punjab and Himachal Pradesh states of India. Traits were analysed by using varimax rotated principal component analysis (PCA) with Kaiser Normalization to explain body conformation. PCA revealed four components which explained about 70.9% of the total variation. First component described the general body conformation and explained 31.5% of total variation. It was represented by significant positive high loading of height at wither, body length, heart girth, face length and face width. The communality ranged from 0.83 (hip bone distance) to 0.45 (horn length) and unique factors ranged from 0.16 to 0.55 for all these 13 different biometric traits. Present study suggests that first principal component can be used in the evaluation and comparison of body conformation in buffaloes and thus provides an opportunity to distinguish between early and late maturing to adult, based on a small group of biometric traits to explain body conformation in adult buffaloes. PMID:25656215
NASA Astrophysics Data System (ADS)
Chen, Weiwei; Lin, Jia; Chen, Rong; Feng, Shangyuan; Yu, Yun; Lin, Duo; Huang, Meizhen; Shi, Hong; Huang, Hao
2015-04-01
We have evaluated the capabilities of surface-enhanced Raman scattering (SERS) technology for analyzing two Huo-Xue-Hua-Yu decoctions (HXHYDs) prepared according to different prescriptions. The aim of this study was to evaluate the relevance of SERS technology applied to decoction of traditional Chinese medicines (TCM). HXHYD I was prepared according to the original prescription; the same preparation method was used for the HXHYD II, except for the crudeweight ratio described in the original prescription. There was no Raman signal in conventional Raman spectra of HXHYDs. Silver nanoparticles were directly mixed with HXHYDs to enhance the Raman scattering of biochemical constituents, and high quality SERS spectra were obtained. Significant differences in SERS spectra between HXHYD I and II can be observed, which showed special changes in the percentage of biochemical constituents in different decoctions. Principal components analysis (PCA) combined with linear discriminant analysis (LDA) were employed to generate diagnostic algorithms for classification of SERS spectra of two HXHYDs, and showed that a diagnostic accuracy of 100% can be achieved. This work demonstrated that the SERS technique has potential for spectral characteristic detection for decoction of TCM with high sensitivity, and that this technique, combined with PCA-LDA, can be used for quality control of the extracted decoction of TCM and production management of Chinese herbal preparations.
Classification of upper Mississippi River pools based on contiguous aquatic/geomorphic habitats
Koel, Todd M.
2001-01-01
Navigation pools of the upper Mississippi River (UMR) vary greatly in terms of available contiguous aquatic/geomorphic habitats. These habitats are critical for the biotic diversity and overall productivity of the floodplain corridor of each pool. In this study, similarities among pools 4-26 and an open river reach (river kilometer 47-129) of the UMR were determined from multivariate analysis of eleven habitat types that were hydrologically-contiguous (non-leveed). Isolated floodplain habitats were not included in final analyses because this isolation limits their contribution to overall riverine productivity, in part due to a lack of hydrological connectivity to the main channel during the flood pulse. Cluster analysis based on simple Euclidean distance was used to produce two major pool groups and five pool subgroups. Important habitat variables in defining pool groups, as interpreted from principal components analysis (PCA) axis 1, were contiguous floodplain shallow aquatic area and contiguous impounded area. The habitat variable most important in defining pool subgroups, as interpreted from PCA axis 2, was tertiary channel. Most notably, pool 6 was more similar to pools 14-24 than other upper pools, and pools 19 and 25 were more similar to pools 4-13 than other lower pools. These results were quite different from those of two previous investigators, primarily because only areas of non-isolated aquatic habitat were considered.
Mat-Desa, Wan N S; Ismail, Dzulkiflee; NicDaeid, Niamh
2011-10-15
Three different medium petroleum distillate (MPD) products (white spirit, paint brush cleaner, and lamp oil) were purchased from commercial stores in Glasgow, Scotland. Samples of 10, 25, 50, 75, 90, and 95% evaporated product were prepared, resulting in 56 samples in total which were analyzed using gas chromatography-mass spectrometry. Data sets from the chromatographic patterns were examined and preprocessed for unsupervised multivariate analyses using principal component analysis (PCA), hierarchical cluster analysis (HCA), and a self organizing feature map (SOFM) artificial neural network. It was revealed that data sets comprised of higher boiling point hydrocarbon compounds provided a good means for the classification of the samples and successfully linked highly weathered samples back to their unevaporated counterpart in every case. The classification abilities of SOFM were further tested and validated for their predictive abilities where one set of weather data in each case was withdrawn from the sample set and used as a test set of the retrained network. This revealed SOFM to be an outstanding mechanism for sample discrimination and linkage over the more conventional PCA and HCA methods often suggested for such data analysis. SOFM also has the advantage of providing additional information through the evaluation of component planes facilitating the investigation of underlying variables that account for the classification. © 2011 American Chemical Society
Does adding ketamine to morphine patient-controlled analgesia safely improve post-thoracotomy pain?
Mathews, Timothy J; Churchhouse, Antonia M D; Housden, Tessa; Dunning, Joel
2012-02-01
A best evidence topic in thoracic surgery was written according to a structured protocol. The question addressed was 'is the addition of ketamine to morphine patient-controlled analgesia (PCA) following thoracic surgery superior to morphine alone'. Altogether 201 papers were found using the reported search, of which nine represented the best evidence to answer the clinical question. The authors, journal, date and country of publication, patient group studied, study type, relevant outcomes and results of these papers are tabulated. This consisted of one systematic review of PCA morphine with ketamine (PCA-MK) trials, one meta-analysis of PCA-MK trials, four randomized controlled trials of PCA-MK, one meta-analysis of trials using a variety of peri-operative ketamine regimes and two cohort studies of PCA-MK. Main outcomes measured included pain score rated on visual analogue scale, morphine consumption and incidence of psychotomimetic side effects/hallucination. Two papers reported the measurements of respiratory function. This evidence shows that adding ketamine to morphine PCA is safe, with a reported incidence of hallucination requiring intervention of 2.9%, and a meta-analysis finding an incidence of all central nervous system side effects of 18% compared with 15% with morphine alone, P = 0.31, RR 1.27 with 95% CI (0.8-2.01). All randomized controlled trials of its use following thoracic surgery found no hallucination or psychological side effect. All five studies in thoracic surgery (n = 243) found reduced morphine requirements with PCA-MK. Pain scores were significantly lower in PCA-MK patients in thoracic surgery papers, with one paper additionally reporting increased patient satisfaction. However, no significant improvement was found in a meta-analysis of five papers studying PCA-MK in a variety of surgical settings. Both papers reporting respiratory outcomes found improved oxygen saturations and PaCO(2) levels in PCA-MK patients following thoracic surgery. We conclude that adding low-dose ketamine to morphine PCA is safe and post-thoracotomy may provide better pain control than PCA with morphine alone (PCA-MO), with reduced morphine consumption and possible improvement in respiratory function. These studies thus support the routine use of PCA-MK instead of PCA-MO to improve post-thoracotomy pain control.
NASA Astrophysics Data System (ADS)
Gualandi, A.; Serpelloni, E.; Belardinelli, M. E.
2014-12-01
A critical point in the analysis of ground displacements time series is the development of data driven methods that allow to discern and characterize the different sources that generate the observed displacements. A widely used multivariate statistical technique is the Principal Component Analysis (PCA), which allows to reduce the dimensionality of the data space maintaining most of the variance of the dataset explained. It reproduces the original data using a limited number of Principal Components, but it also shows some deficiencies. Indeed, PCA does not perform well in finding the solution to the so-called Blind Source Separation (BSS) problem, i.e. in recovering and separating the original sources that generated the observed data. This is mainly due to the assumptions on which PCA relies: it looks for a new Euclidean space where the projected data are uncorrelated. Usually, the uncorrelation condition is not strong enough and it has been proven that the BSS problem can be tackled imposing on the components to be independent. The Independent Component Analysis (ICA) is, in fact, another popular technique adopted to approach this problem, and it can be used in all those fields where PCA is also applied. An ICA approach enables us to explain the time series imposing a fewer number of constraints on the model, and to reveal anomalies in the data such as transient signals. However, the independence condition is not easy to impose, and it is often necessary to introduce some approximations. To work around this problem, we use a variational bayesian ICA (vbICA) method, which models the probability density function (pdf) of each source signal using a mix of Gaussian distributions. This technique allows for more flexibility in the description of the pdf of the sources, giving a more reliable estimate of them. Here we present the application of the vbICA technique to GPS position time series. First, we use vbICA on synthetic data that simulate a seismic cycle (interseismic + coseismic + postseismic + seasonal + noise), and study the ability of the algorithm to recover the original (known) sources of deformation. Secondly, we apply vbICA to different tectonically active scenarios, such as earthquakes in central and northern Italy, as well as the study of slow slip events in Cascadia.
Soy Consumption and the Risk of Prostate Cancer: An Updated Systematic Review and Meta-Analysis
Ranard, Katherine M.; Jeon, Sookyoung; Erdman, John W.
2018-01-01
Prostate cancer (PCa) is the second most commonly diagnosed cancer in men, accounting for 15% of all cancers in men worldwide. Asian populations consume soy foods as part of a regular diet, which may contribute to the lower PCa incidence observed in these countries. This meta-analysis provides a comprehensive updated analysis that builds on previously published meta-analyses, demonstrating that soy foods and their isoflavones (genistein and daidzein) are associated with a lower risk of prostate carcinogenesis. Thirty articles were included for analysis of the potential impacts of soy food intake, isoflavone intake, and circulating isoflavone levels, on both primary and advanced PCa. Total soy food (p < 0.001), genistein (p = 0.008), daidzein (p = 0.018), and unfermented soy food (p < 0.001) intakes were significantly associated with a reduced risk of PCa. Fermented soy food intake, total isoflavone intake, and circulating isoflavones were not associated with PCa risk. Neither soy food intake nor circulating isoflavones were associated with advanced PCa risk, although very few studies currently exist to examine potential associations. Combined, this evidence from observational studies shows a statistically significant association between soy consumption and decreased PCa risk. Further studies are required to support soy consumption as a prophylactic dietary approach to reduce PCa carcinogenesis. PMID:29300347
NASA Astrophysics Data System (ADS)
Praca, Emilie; Gannier, Alexandre; Das, Krishna; Laran, Sophie
2009-04-01
Cetaceans are mobile and spend long periods underwater. Because of this, modelling their habitat could be subject to a serious problem of false absence. Furthermore, extensive surveys at sea are time and money consuming, and presence-absence data are difficult to apply. This study compares the ability of two presence-absence and two presence-only habitat modelling methods and uses the example of the sperm whale ( Physeter macrocephalus) in the northwestern Mediterranean Sea. The data consist of summer visual and acoustical detections of sperm whales, compiled between 1998 and 2005. Habitat maps were computed using topographical and hydrological eco-geographical variables. Four methods were compared: principal component analysis (PCA), ecological niche factor analysis (ENFA), generalized linear model (GLM) and multivariate adaptive regression splines (MARS). The evaluation of the models was achieved by calculating the receiver operating characteristic (ROC) of the models and their respective area under the curve (AUC). Presence-absence methods (GLM, AUC=0.70, and MARS, AUC=0.79) presented better AUC than presence-only methods (PCA, AUC=0.58, and ENFA, AUC=0.66), but this difference was not statistically significant, except between the MARS and the PCA models. The four models showed an influence of both topographical and hydrological factors, but the resulting habitat suitability maps differed. The core habitat on the continental slope was well highlighted by the four models, while GLM and MARS maps also showed a suitable habitat in the offshore waters. Presence-absence methods are therefore recommended for modelling the habitat suitability of cetaceans, as they seem more accurate to highlight complex habitat. However, the use of presence-only techniques, in particular ENFA, could be very useful for a first model of the habitat range or when important surveys at sea are not possible.
da Silva, Givaldo Souza; Canuto, Kirley Marques; Ribeiro, Paulo Riceli Vasconcelos; de Brito, Edy Sousa; Nascimento, Madson Moreira; Zocolo, Guilherme Julião; Coutinho, Janclei Pereira; de Jesus, Raildo Mota
2017-12-01
Paullinia cupana, commonly known as guarana, is an Amazonian fruit whose seeds are used to produce the powdered guarana, which is rich in caffeine and consumed for its stimulating activity. The metabolic profile of guarana from the two largest producing regions was investigated using UPLC-MS combined with multivariate statistical analysis. The principal component analysis (PCA) showed significant differences between samples produced in the states of Bahia and Amazonas. The metabolites responsible for the differentiation were identified by orthogonal partial least squares discriminant analysis (OPLS-DA). Fourteen phenolic compounds were characterized in guarana powder samples, and catechin, epicatechin, B-type procyanidin dimer, A-type procyanidin trimer and A-type procyanidin dimer were the main compounds responsible for the geographical variation of the samples. Copyright © 2017. Published by Elsevier Ltd.
1 H NMR study and multivariate data analysis of reindeer skin tanning methods.
Zhu, Lizheng; Ilott, Andrew J; Del Federico, Eleonora; Kehlet, Cindie; Klokkernes, Torunn; Jerschow, Alexej
2017-04-01
Reindeer skin clothing has been an essential component in the lives of indigenous people of the arctic and sub-arctic regions, keeping them warm during harsh winters. However, the skin processing technology, which often conveys the history and tradition of the indigenous group, has not been well documented. In this study, NMR spectra and relaxation behaviors of reindeer skin samples treated with a variety of vegetable tannin extracts, oils and fatty substances are studied and compared. With the assistance of principal component analysis (PCA), one can recognize patterns and identify groupings of differently treated samples. These methods could be important aids in efforts to conserve museum leather artifacts with unknown treatment methods and in the analysis of reindeer skin tanning processes. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Ferrero, A; Campos, J; Rabal, A M; Pons, A; Hernanz, M L; Corróns, A
2011-09-26
The Bidirectional Reflectance Distribution Function (BRDF) is essential to characterize an object's reflectance properties. This function depends both on the various illumination-observation geometries as well as on the wavelength. As a result, the comprehensive interpretation of the data becomes rather complex. In this work we assess the use of the multivariable analysis technique of Principal Components Analysis (PCA) applied to the experimental BRDF data of a ceramic colour standard. It will be shown that the result may be linked to the various reflection processes occurring on the surface, assuming that the incoming spectral distribution is affected by each one of these processes in a specific manner. Moreover, this procedure facilitates the task of interpolating a series of BRDF measurements obtained for a particular sample. © 2011 Optical Society of America
Kulkarni, Purva; Dost, Mina; Bulut, Özgül Demir; Welle, Alexander; Böcker, Sebastian; Boland, Wilhelm; Svatoš, Aleš
2018-01-01
Spatially resolved analysis of a multitude of compound classes has become feasible with the rapid advancement in mass spectrometry imaging strategies. In this study, we present a protocol that combines high lateral resolution time-of-flight secondary ion mass spectrometry (TOF-SIMS) imaging with a multivariate data analysis (MVA) approach to probe the complex leaf surface chemistry of Populus trichocarpa. Here, epicuticular waxes (EWs) found on the adaxial leaf surface of P. trichocarpa were blotted on silicon wafers and imaged using TOF-SIMS at 10 μm and 1 μm lateral resolution. Intense M +● and M -● molecular ions were clearly visible, which made it possible to resolve the individual compound classes present in EWs. Series of long-chain aliphatic saturated alcohols (C 21 -C 30 ), hydrocarbons (C 25 -C 33 ) and wax esters (WEs; C 44 -C 48 ) were clearly observed. These data correlated with the 7 Li-chelation matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) analysis, which yielded mostly molecular adduct ions of the analyzed compounds. Subsequently, MVA was used to interrogate the TOF-SIMS dataset for identifying hidden patterns on the leaf's surface based on its chemical profile. After the application of principal component analysis (PCA), a small number of principal components (PCs) were found to be sufficient to explain maximum variance in the data. To further confirm the contributions from pure components, a five-factor multivariate curve resolution (MCR) model was applied. Two distinct patterns of small islets, here termed 'crystals', were apparent from the resulting score plots. Based on PCA and MCR results, the crystals were found to be formed by C 23 or C 29 alcohols. Other less obvious patterns observed in the PCs revealed that the adaxial leaf surface is coated with a relatively homogenous layer of alcohols, hydrocarbons and WEs. The ultra-high-resolution TOF-SIMS imaging combined with the MVA approach helped to highlight the diverse patterns underlying the leaf's surface. Currently, the methods available to analyze the surface chemistry of waxes in conjunction with the spatial information related to the distribution of compounds are limited. This study uses tools that may provide important biological insights into the composition of the wax layer, how this layer is repaired after mechanical damage or insect feeding, and which transport mechanisms are involved in deploying wax constituents to specific regions on the leaf surface. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
Environmental assessment of Al-Hammar Marsh, Southern Iraq.
Al-Gburi, Hind Fadhil Abdullah; Al-Tawash, Balsam Salim; Al-Lafta, Hadi Salim
2017-02-01
(a) To determine the spatial distributions and levels of major and minor elements, as well as heavy metals, in water, sediment, and biota (plant and fish) in Al-Hammar Marsh, southern Iraq, and ultimately to supply more comprehensive information for policy-makers to manage the contaminants input into the marsh so that their concentrations do not reach toxic levels. (b) to characterize the seasonal changes in the marsh surface water quality. (c) to address the potential environmental risk of these elements by comparison with the historical levels and global quality guidelines (i.e., World Health Organization (WHO) standard limits). (d) to define the sources of these elements (i.e., natural and/or anthropogenic) using combined multivariate statistical techniques such as Principal Component Analysis (PCA) and Agglomerative Hierarchical Cluster Analysis (AHCA) along with pollution analysis (i.e., enrichment factor analysis). Water, sediment, plant, and fish samples were collected from the marsh, and analyzed for major and minor ions, as well as heavy metals, and then compared to historical levels and global quality guidelines (WHO guidelines). Then, multivariate statistical techniques, such as PCA and AHCA, were used to determine the element sourcing. Water analyses revealed unacceptable values for almost all physio-chemical and biological properties, according to WHO standard limits for drinking water. Almost all major ions and heavy metal concentrations in water showed a distinct decreasing trend at the marsh outlet station compared to other stations. In general, major and minor ions, as well as heavy metals exhibit higher concentrations in winter than in summer. Sediment analyses using multivariate statistical techniques revealed that Mg, Fe, S, P, V, Zn, As, Se, Mo, Co, Ni, Cu, Sr, Br, Cd, Ca, N, Mn, Cr, and Pb were derived from anthropogenic sources, while Al, Si, Ti, K, and Zr were primarily derived from natural sources. Enrichment factor analysis gave results compatible with multivariate statistical techniques findings. Analysis of heavy metals in plant samples revealed that there is no pollution in plants in Al-Hammar Marsh. However, the concentrations of heavy metals in fish samples showed that all samples were contaminated by Pb, Mn, and Ni, while some samples were contaminated by Pb, Mn, and Ni. Decreasing of Tigris and Euphrates discharges during the past decades due to drought conditions and upstream damming, as well as the increasing stress of wastewater effluents from anthropogenic activities, led to degradation of the downstream Al-Hammar Marsh water quality in terms of physical, chemical, and biological properties. As such properties were found to consistently exceed the historical and global quality objectives. However, element concentration decreasing trend at the marsh outlet station compared to other stations indicate that the marsh plays an important role as a natural filtration and bioremediation system. Higher element concentrations in winter were due to runoff from the washing of the surrounding Sabkha during flooding by winter rainstorms. Finally, the high concentrations of heavy metals in fish samples can be attributed to bioaccumulation and biomagnification processes.
Farnell, D J J; Popat, H; Richmond, S
2016-06-01
Methods used in image processing should reflect any multilevel structures inherent in the image dataset or they run the risk of functioning inadequately. We wish to test the feasibility of multilevel principal components analysis (PCA) to build active shape models (ASMs) for cases relevant to medical and dental imaging. Multilevel PCA was used to carry out model fitting to sets of landmark points and it was compared to the results of "standard" (single-level) PCA. Proof of principle was tested by applying mPCA to model basic peri-oral expressions (happy, neutral, sad) approximated to the junction between the mouth/lips. Monte Carlo simulations were used to create this data which allowed exploration of practical implementation issues such as the number of landmark points, number of images, and number of groups (i.e., "expressions" for this example). To further test the robustness of the method, mPCA was subsequently applied to a dental imaging dataset utilising landmark points (placed by different clinicians) along the boundary of mandibular cortical bone in panoramic radiographs of the face. Changes of expression that varied between groups were modelled correctly at one level of the model and changes in lip width that varied within groups at another for the Monte Carlo dataset. Extreme cases in the test dataset were modelled adequately by mPCA but not by standard PCA. Similarly, variations in the shape of the cortical bone were modelled by one level of mPCA and variations between the experts at another for the panoramic radiographs dataset. Results for mPCA were found to be comparable to those of standard PCA for point-to-point errors via miss-one-out testing for this dataset. These errors reduce with increasing number of eigenvectors/values retained, as expected. We have shown that mPCA can be used in shape models for dental and medical image processing. mPCA was found to provide more control and flexibility when compared to standard "single-level" PCA. Specifically, mPCA is preferable to "standard" PCA when multiple levels occur naturally in the dataset. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Detection of Cell Wall Chemical Variation in Zea Mays Mutants Using Near-Infrared Spectroscopy
DOE Office of Scientific and Technical Information (OSTI.GOV)
Buyck, N.; Thomas, S.
Corn stover is regarded as the prime candidate feedstock material for commercial biomass conversion in the United States. Variations in chemical composition of Zea mays cell walls can affect biomass conversion process yields and economics. Mutant lines were constructed by activating a Mu transposon system. The cell wall chemical composition of 48 mutant families was characterized using near-infrared (NIR) spectroscopy. NIR data were analyzed using a multivariate statistical analysis technique called Principal Component Analysis (PCA). PCA of the NIR data from 349 maize leaf samples reveals 57 individuals as outliers on one or more of six Principal Components (PCs) atmore » the 95% confidence interval. Of these, 19 individuals from 16 families are outliers on either PC3 (9% of the variation) or PC6 (1% of the variation), the two PCs that contain information about cell wall polymers. Those individuals for which altered cell wall chemistry is confirmed with wet chemical analysis will then be subjected to fermentation analysis to determine whether or not biomass conversion process kinetics, yields and/or economics are significantly affected. Those mutants that provide indications for a decrease in process cost will be pursued further to identify the gene(s) responsible for the observed changes in cell wall composition and associated changes in process economics. These genes will eventually be incorporated into maize breeding programs directed at the development of a truly dual use crop.« less
Evaluation of natural mandibular shape asymmetry: an approach by using elliptical Fourier analysis.
Niño-Sandoval, Tania C; Morantes Ariza, Carlos F; Infante-Contreras, Clementina; Vasconcelos, Belmiro Ce
2018-04-05
The purpose of this study was to demonstrate that asymmetry is a natural occurring phenomenon in the mandibular shape by using elliptical Fourier analysis. 164 digital orthopantomographs from Colombian patients of both sexes aged 18 to 25 years were collected. Curves from left and right hemimandible were digitized. An elliptical Fourier analysis was performed with 20 harmonics. In the general sexual dimorphism a principal component analysis (PCA) and a hotelling T 2 from the multivariate warp space were employed. Exploratory analysis of general asymmetry and sexual dimorphism by side was made with a Procrustes Fit. A non-parametric multivariate analysis of variance (MANOVA) was applied to assess differentiation of skeletal classes of each hemimandible, and a Procrustes analysis of variance (ANOVA) was applied to search any relation between skeletal class and side in both sexes. Significant values were found in general asymmetry, general sexual dimorphism, in dimorphism by side (p < 0.0001), asymmetry by sex, and differences between Class I, II, and III (p < 0.005). However, a relation of skeletal classes and side was not found. The mandibular asymmetry by shape is present in all patients and should not be articulated exclusively to pathological processes, therefore, along with sexual dimorphism and differences between skeletal classes must be taken into account for improving mandibular prediction systems.
NASA Astrophysics Data System (ADS)
Wu, Xia; Zheng, Kang; Zhao, Fengjia; Zheng, Yongjun; Li, Yantuan
2014-08-01
Meretricis concha is a kind of marine traditional Chinese medicine (TCM), and has been commonly used for the treatment of asthma and scald burns. In order to investigate the relationship between the inorganic elemental fingerprint and the geographical origin identification of Meretricis concha, the elemental contents of M. concha from five sampling points in Rushan Bay have been determined by means of inductively coupled plasma optical emission spectrometry (ICP-OES). Based on the contents of 14 inorganic elements (Al, As, Cd, Co, Cr, Cu, Fe, Hg, Mn, Mo, Ni, Pb, Se, and Zn), the inorganic elemental fingerprint which well reflects the elemental characteristics was constructed. All the data from the five sampling points were discriminated with accuracy through hierarchical cluster analysis (HCA) and principle component analysis (PCA), indicating that a four-factor model which could explain approximately 80% of the detection data was established, and the elements Al, As, Cd, Cu, Ni and Pb could be viewed as the characteristic elements. This investigation suggests that the inorganic elemental fingerprint combined with multivariate statistical analysis is a promising method for verifying the geographical origin of M. concha, and this strategy should be valuable for the authenticity discrimination of some marine TCM.
Targeted metabolomic profiling in rat tissues reveals sex differences.
Ruoppolo, Margherita; Caterino, Marianna; Albano, Lucia; Pecce, Rita; Di Girolamo, Maria Grazia; Crisci, Daniela; Costanzo, Michele; Milella, Luigi; Franconi, Flavia; Campesi, Ilaria
2018-03-16
Sex differences affect several diseases and are organ-and parameter-specific. In humans and animals, sex differences also influence the metabolism and homeostasis of amino acids and fatty acids, which are linked to the onset of diseases. Thus, the use of targeted metabolite profiles in tissues represents a powerful approach to examine the intermediary metabolism and evidence for any sex differences. To clarify the sex-specific activities of liver, heart and kidney tissues, we used targeted metabolomics, linear discriminant analysis (LDA), principal component analysis (PCA), cluster analysis and linear correlation models to evaluate sex and organ-specific differences in amino acids, free carnitine and acylcarnitine levels in male and female Sprague-Dawley rats. Several intra-sex differences affect tissues, indicating that metabolite profiles in rat hearts, livers and kidneys are organ-dependent. Amino acids and carnitine levels in rat hearts, livers and kidneys are affected by sex: male and female hearts show the greatest sexual dimorphism, both qualitatively and quantitatively. Finally, multivariate analysis confirmed the influence of sex on the metabolomics profiling. Our data demonstrate that the metabolomics approach together with a multivariate approach can capture the dynamics of physiological and pathological states, which are essential for explaining the basis of the sex differences observed in physiological and pathological conditions.
Nonlinear Principal Components Analysis: Introduction and Application
ERIC Educational Resources Information Center
Linting, Marielle; Meulman, Jacqueline J.; Groenen, Patrick J. F.; van der Koojj, Anita J.
2007-01-01
The authors provide a didactic treatment of nonlinear (categorical) principal components analysis (PCA). This method is the nonlinear equivalent of standard PCA and reduces the observed variables to a number of uncorrelated principal components. The most important advantages of nonlinear over linear PCA are that it incorporates nominal and ordinal…
Yang, Lei; Wei, Ran; Shen, Henggen
2017-01-01
New principal component analysis (PCA) respirator fit test panels had been developed for current American and Chinese civilian workers based on anthropometric surveys. The PCA panels used the first two principal components (PCs) obtained from a set of 10 facial dimensions. Although the PCA panels for American and Chinese subjects adopted the bivairate framework with two PCs, the number of the PCs retained in the PCA analysis was different between Chinese subjects and Americans. For the Chinese youth group, the third PC should be retained in the PCA analysis for developing new fit test panels. In this article, an additional number label (ANL) is used to explain the third PC in PCA analysis when the first two PCs are used to construct the PCA half-facepiece respirator fit test panel for Chinese group. The three-dimensional box-counting method is proposed to estimate the ANLs by calculating fractal dimensions of the facial anthropometric data of the Chinese youth. The linear regression coefficients of scale-free range R 2 are all over 0.960, which demonstrates that the facial anthropometric data of the Chinese youth has fractal characteristic. The youth subjects born in Henan province has an ANL of 2.002, which is lower than the composite facial anthropometric data of Chinese subjects born in many provinces. Hence, Henan youth subjects have the self-similar facial anthropometric characteristic and should use the particular ANL (2.002) as the important tool along with using the PCA panel. The ANL method proposed in this article not only provides a new methodology in quantifying the characteristics of facial anthropometric dimensions for any ethnic/racial group, but also extends the scope of PCA panel studies to higher dimensions.
Functional Data Analysis in NTCP Modeling: A New Method to Explore the Radiation Dose-Volume Effects
DOE Office of Scientific and Technical Information (OSTI.GOV)
Benadjaoud, Mohamed Amine, E-mail: mohamedamine.benadjaoud@gustaveroussy.fr; Université Paris sud, Le Kremlin-Bicêtre; Institut Gustave Roussy, Villejuif
2014-11-01
Purpose/Objective(s): To describe a novel method to explore radiation dose-volume effects. Functional data analysis is used to investigate the information contained in differential dose-volume histograms. The method is applied to the normal tissue complication probability modeling of rectal bleeding (RB) for patients irradiated in the prostatic bed by 3-dimensional conformal radiation therapy. Methods and Materials: Kernel density estimation was used to estimate the individual probability density functions from each of the 141 rectum differential dose-volume histograms. Functional principal component analysis was performed on the estimated probability density functions to explore the variation modes in the dose distribution. The functional principalmore » components were then tested for association with RB using logistic regression adapted to functional covariates (FLR). For comparison, 3 other normal tissue complication probability models were considered: the Lyman-Kutcher-Burman model, logistic model based on standard dosimetric parameters (LM), and logistic model based on multivariate principal component analysis (PCA). Results: The incidence rate of grade ≥2 RB was 14%. V{sub 65Gy} was the most predictive factor for the LM (P=.058). The best fit for the Lyman-Kutcher-Burman model was obtained with n=0.12, m = 0.17, and TD50 = 72.6 Gy. In PCA and FLR, the components that describe the interdependence between the relative volumes exposed at intermediate and high doses were the most correlated to the complication. The FLR parameter function leads to a better understanding of the volume effect by including the treatment specificity in the delivered mechanistic information. For RB grade ≥2, patients with advanced age are significantly at risk (odds ratio, 1.123; 95% confidence interval, 1.03-1.22), and the fits of the LM, PCA, and functional principal component analysis models are significantly improved by including this clinical factor. Conclusion: Functional data analysis provides an attractive method for flexibly estimating the dose-volume effect for normal tissues in external radiation therapy.« less
Time-dependent analysis of dosage delivery information for patient-controlled analgesia services.
Kuo, I-Ting; Chang, Kuang-Yi; Juan, De-Fong; Hsu, Steen J; Chan, Chia-Tai; Tsou, Mei-Yung
2018-01-01
Pain relief always plays the essential part of perioperative care and an important role of medical quality improvement. Patient-controlled analgesia (PCA) is a method that allows a patient to self-administer small boluses of analgesic to relieve the subjective pain. PCA logs from the infusion pump consisted of a lot of text messages which record all events during the therapies. The dosage information can be extracted from PCA logs to provide easily understanding features. The analysis of dosage information with time has great help to figure out the variance of a patient's pain relief condition. To explore the trend of pain relief requirement, we developed a PCA dosage information generator (PCA DIG) to extract meaningful messages from PCA logs during the first 48 hours of therapies. PCA dosage information including consumption, delivery, infusion rate, and the ratio between demand and delivery is presented with corresponding values in 4 successive time frames. Time-dependent statistical analysis demonstrated the trends of analgesia requirements decreased gradually along with time. These findings are compatible with clinical observations and further provide valuable information about the strategy to customize postoperative pain management.
Study of jojoba oil aging by FTIR.
Le Dréau, Y; Dupuy, N; Gaydou, V; Joachim, J; Kister, J
2009-05-29
As the jojoba oil was used in cosmetic, pharmaceutical, dietetic food, animal feeding, lubrication, polishing and bio-diesel fields, it was important to study its aging at high temperature by oxidative process. In this work a FT-MIR methodology was developed for monitoring accelerate oxidative degradation of jojoba oils. Principal component analysis (PCA) was used to differentiate various samples according to their origin and obtaining process, and to differentiate oxidative conditions applied on oils. Two spectroscopic indices were calculated to report simply the oxidation phenomenon. Results were confirmed and deepened by multivariate curve resolution-alternative least square method (MCR-ALS). It allowed identifying chemical species produced or degraded during the thermal treatment according to a SIMPLISMA pretreatment.
Lin, Yuxin; Chen, Feifei; Shen, Li; Tang, Xiaoyu; Du, Cui; Sun, Zhandong; Ding, Huijie; Chen, Jiajia; Shen, Bairong
2018-05-21
Prostate cancer (PCa) is a fatal malignant tumor among males in the world and the metastasis is a leading cause for PCa death. Biomarkers are therefore urgently needed to detect PCa metastatic signature at the early time. MicroRNAs are small non-coding RNAs with the potential to be biomarkers for disease prediction. In addition, computer-aided biomarker discovery is now becoming an attractive paradigm for precision diagnosis and prognosis of complex diseases. In this study, we identified key microRNAs as biomarkers for predicting PCa metastasis based on network vulnerability analysis. We first extracted microRNAs and mRNAs that were differentially expressed between primary PCa and metastatic PCa (MPCa) samples. Then we constructed the MPCa-specific microRNA-mRNA network and screened microRNA biomarkers by a novel bioinformatics model. The model emphasized the characterization of systems stability changes and the network vulnerability with three measurements, i.e. the structurally single-line regulation, the functional importance of microRNA targets and the percentage of transcription factor genes in microRNA unique targets. With this model, we identified five microRNAs as putative biomarkers for PCa metastasis. Among them, miR-101-3p and miR-145-5p have been previously reported as biomarkers for PCa metastasis and the remaining three, i.e. miR-204-5p, miR-198 and miR-152, were screened as novel biomarkers for PCa metastasis. The results were further confirmed by the assessment of their predictive power and biological function analysis. Five microRNAs were identified as candidate biomarkers for predicting PCa metastasis based on our network vulnerability analysis model. The prediction performance, literature exploration and functional enrichment analysis convinced our findings. This novel bioinformatics model could be applied to biomarker discovery for other complex diseases.
Sanchez, Tino W.; Zhang, Guangyu; Li, Jitian; Dai, Liping; Mirshahidi, Saied; Wall, Nathan R.; Yates, Clayton; Wilson, Colwick; Montgomery, Susanne; Zhang, Jian-Ying; Casiano, Carlos A.
2016-01-01
African American (AA) men suffer from a disproportionately high incidence and mortality of prostate cancer (PCa) compared with other racial/ethnic groups. Despite these disparities, African American men are underrepresented in clinical trials and in studies on PCa biology and biomarker discovery. We used immunoseroproteomics to profile antitumor autoantibody responses in AA and European American (EA) men with PCa, and explored differences in these responses. This minimally invasive approach detects autoantibodies to tumor-associated antigens that could serve as clinical biomarkers and immunotherapeutic agents. Sera from AA and EA men with PCa were probed by immunoblotting against PC3 cell proteins, with AA sera showing stronger immunoreactivity. Mass spectrometry analysis of immunoreactive protein spots revealed that several AA sera contained autoantibodies to a number of proteins associated with both the glycolysis and plasminogen pathways, particularly to alpha-enolase (ENO1). The proteomic data is deposited in ProteomeXchange with identifier PXD003968. Analysis of sera from 340 racially diverse men by enzyme-linked immunosorbent assays (ELISA) showed higher frequency of anti-ENO1 autoantibodies in PCa sera compared with control sera. We observed differences between AA-PCa and EA-PCa patients in their immunoreactivity against ENO1. Although EA-PCa sera reacted with higher frequency against purified ENO1 in ELISA and recognized by immunoblotting the endogenous cellular ENO1 across a panel of prostate cell lines, AA-PCa sera reacted weakly against this protein by ELISA but recognized it by immunoblotting preferentially in metastatic cell lines. These race-related differences in immunoreactivity to ENO1 could not be accounted by differential autoantibody recognition of phosphoepitopes within this antigen. Proteomic analysis revealed differences in the posttranslational modification profiles of ENO1 variants differentially recognized by AA-PCa and EA-PCa sera. These intriguing results suggest the possibility of race-related differences in the antitumor autoantibody response in PCa, and have implications for defining novel biological determinants of PCa health disparities. PMID:27742740
Guo, H; Wang, T; Louie, P K K
2004-06-01
Receptor-oriented source apportionment models are often used to identify sources of ambient air pollutants and to estimate source contributions to air pollutant concentrations. In this study, a PCA/APCS model was applied to the data on non-methane hydrocarbons (NMHCs) measured from January to December 2001 at two sampling sites: Tsuen Wan (TW) and Central & Western (CW) Toxic Air Pollutants Monitoring Stations in Hong Kong. This multivariate method enables the identification of major air pollution sources along with the quantitative apportionment of each source to pollutant species. The PCA analysis identified four major pollution sources at TW site and five major sources at CW site. The extracted pollution sources included vehicular internal engine combustion with unburned fuel emissions, use of solvent particularly paints, liquefied petroleum gas (LPG) or natural gas leakage, and industrial, commercial and domestic sources such as solvents, decoration, fuel combustion, chemical factories and power plants. The results of APCS receptor model indicated that 39% and 48% of the total NMHCs mass concentrations measured at CW and TW were originated from vehicle emissions, respectively. 32% and 36.4% of the total NMHCs were emitted from the use of solvent and 11% and 19.4% were apportioned to the LPG or natural gas leakage, respectively. 5.2% and 9% of the total NMHCs mass concentrations were attributed to other industrial, commercial and domestic sources, respectively. It was also found that vehicle emissions and LPG or natural gas leakage were the main sources of C(3)-C(5) alkanes and C(3)-C(5) alkenes while aromatics were predominantly released from paints. Comparison of source contributions to ambient NMHCs at the two sites indicated that the contribution of LPG or natural gas at CW site was almost twice that at TW site. High correlation coefficients (R(2) > 0.8) between the measured and predicted values suggested that the PCA/APCS model was applicable for estimation of sources of NMHCs in ambient air.
Zhu, Yong; Wen, Wen; Zhang, Fengmin; Hardie, Jim W.
2015-01-01
Background and Aims Proton nuclear magnetic resonance spectroscopy coupled multivariate analysis (1H NMR-PCA/PLS-DA) is an important tool for the discrimination of wine products. Although 1H NMR has been shown to discriminate wines of different cultivars, a grape genetic component of the discrimination has been inferred only from discrimination of cultivars of undefined genetic homology and in the presence of many confounding environmental factors. We aimed to confirm the influence of grape genotypes in the absence of those factors. Methods and Results We applied 1H NMR-PCA/PLS-DA and hierarchical cluster analysis (HCA) to wines from five, variously genetically-related grapevine (V. vinifera) cultivars; all grown similarly on the same site and vinified similarly. We also compared the semi-quantitative profiles of the discriminant metabolites of each cultivar with previously reported chemical analyses. The cultivars were clearly distinguishable and there was a general correlation between their grouping and their genetic homology as revealed by recent genomic studies. Between cultivars, the relative amounts of several of the cultivar-related discriminant metabolites conformed closely with reported chemical analyses. Conclusions Differences in grape-derived metabolites associated with genetic differences alone are a major source of 1H NMR-based discrimination of wines and 1H NMR has the capacity to discriminate between very closely related cultivars. Significance of the Study The study confirms that genetic variation among grape cultivars alone can account for the discrimination of wine by 1H NMR-PCA/PLS and indicates that 1H NMR spectra of wine of single grape cultivars may in future be used in tandem with hierarchical cluster analysis to elucidate genetic lineages and metabolomic relations of grapevine cultivars. In the absence of genetic information, for example, where predecessor varieties are no longer extant, this may be a particularly useful approach. PMID:26658757
Lycopene and Risk of Prostate Cancer
Chen, Ping; Zhang, Wenhao; Wang, Xiao; Zhao, Keke; Negi, Devendra Singh; Zhuo, Li; Qi, Mao; Wang, Xinghuan; Zhang, Xinhua
2015-01-01
Abstract Prostate cancer (PCa) is a common illness for aging males. Lycopene has been identified as an antioxidant agent with potential anticancer properties. Studies investigating the relation between lycopene and PCa risk have produced inconsistent results. This study aims to determine dietary lycopene consumption/circulating concentration and any potential dose–response associations with the risk of PCa. Eligible studies published in English up to April 10, 2014, were searched and identified from Pubmed, Sciencedirect Online, Wiley online library databases and hand searching. The STATA (version 12.0) was applied to process the dose–response meta-analysis. Random effects models were used to calculate pooled relative risks (RRs) and 95% confidence intervals (CIs) and to incorporate variation between studies. The linear and nonlinear dose–response relations were evaluated with data from categories of lycopene consumption/circulating concentrations. Twenty-six studies were included with 17,517 cases of PCa reported from 563,299 participants. Although inverse association between lycopene consumption and PCa risk was not found in all studies, there was a trend that with higher lycopene intake, there was reduced incidence of PCa (P = 0.078). Removal of one Chinese study in sensitivity analysis, or recalculation using data from only high-quality studies for subgroup analysis, indicated that higher lycopene consumption significantly lowered PCa risk. Furthermore, our dose–response meta-analysis demonstrated that higher lycopene consumption was linearly associated with a reduced risk of PCa with a threshold between 9 and 21 mg/day. Consistently, higher circulating lycopene levels significantly reduced the risk of PCa. Interestingly, the concentration of circulating lycopene between 2.17 and 85 μg/dL was linearly inversed with PCa risk whereas there was no linear association >85 μg/dL. In addition, greater efficacy for the circulating lycopene concentration on preventing PCa was found for studies with high quality, follow-up >10 years and where results were adjusted by the age or the body mass index. In conclusion, our novel data demonstrates that higher lycopene consumption/circulating concentration is associated with a lower risk of PCa. However, further studies are required to determine the mechanism by which lycopene reduces the risk of PCa and if there are other factors in tomato products that might potentially decrease PCa risk and progression. PMID:26287411
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nee, K.; Bryan, S.; Levitskaia, T.
The reliability of chemical processes can be greatly improved by implementing inline monitoring systems. Combining multivariate analysis with non-destructive sensors can enhance the process without interfering with the operation. Here, we present here hierarchical models using both principal component analysis and partial least square analysis developed for different chemical components representative of solvent extraction process streams. A training set of 380 samples and an external validation set of 95 samples were prepared and Near infrared and Raman spectral data as well as conductivity under variable temperature conditions were collected. The results from the models indicate that careful selection of themore » spectral range is important. By compressing the data through Principal Component Analysis (PCA), we lower the rank of the data set to its most dominant features while maintaining the key principal components to be used in the regression analysis. Within the studied data set, concentration of five chemical components were modeled; total nitrate (NO 3 -), total acid (H +), neodymium (Nd 3+), sodium (Na +), and ionic strength (I.S.). The best overall model prediction for each of the species studied used a combined data set comprised of complementary techniques including NIR, Raman, and conductivity. Finally, our study shows that chemometric models are powerful but requires significant amount of carefully analyzed data to capture variations in the chemistry.« less
Nee, K.; Bryan, S.; Levitskaia, T.; ...
2017-12-28
The reliability of chemical processes can be greatly improved by implementing inline monitoring systems. Combining multivariate analysis with non-destructive sensors can enhance the process without interfering with the operation. Here, we present here hierarchical models using both principal component analysis and partial least square analysis developed for different chemical components representative of solvent extraction process streams. A training set of 380 samples and an external validation set of 95 samples were prepared and Near infrared and Raman spectral data as well as conductivity under variable temperature conditions were collected. The results from the models indicate that careful selection of themore » spectral range is important. By compressing the data through Principal Component Analysis (PCA), we lower the rank of the data set to its most dominant features while maintaining the key principal components to be used in the regression analysis. Within the studied data set, concentration of five chemical components were modeled; total nitrate (NO 3 -), total acid (H +), neodymium (Nd 3+), sodium (Na +), and ionic strength (I.S.). The best overall model prediction for each of the species studied used a combined data set comprised of complementary techniques including NIR, Raman, and conductivity. Finally, our study shows that chemometric models are powerful but requires significant amount of carefully analyzed data to capture variations in the chemistry.« less
ERIC Educational Resources Information Center
Linting, Marielle; Meulman, Jacqueline J.; Groenen, Patrick J. F.; van der Kooij, Anita J.
2007-01-01
Principal components analysis (PCA) is used to explore the structure of data sets containing linearly related numeric variables. Alternatively, nonlinear PCA can handle possibly nonlinearly related numeric as well as nonnumeric variables. For linear PCA, the stability of its solution can be established under the assumption of multivariate…
2014-01-01
Background The chemical composition of aerosols and particle size distributions are the most significant factors affecting air quality. In particular, the exposure to finer particles can cause short and long-term effects on human health. In the present paper PM10 (particulate matter with aerodynamic diameter lower than 10 μm), CO, NOx (NO and NO2), Benzene and Toluene trends monitored in six monitoring stations of Bari province are shown. The data set used was composed by bi-hourly means for all parameters (12 bi-hourly means per day for each parameter) and it’s referred to the period of time from January 2005 and May 2007. The main aim of the paper is to provide a clear illustration of how large data sets from monitoring stations can give information about the number and nature of the pollutant sources, and mainly to assess the contribution of the traffic source to PM10 concentration level by using multivariate statistical techniques such as Principal Component Analysis (PCA) and Absolute Principal Component Scores (APCS). Results Comparing the night and day mean concentrations (per day) for each parameter it has been pointed out that there is a different night and day behavior for some parameters such as CO, Benzene and Toluene than PM10. This suggests that CO, Benzene and Toluene concentrations are mainly connected with transport systems, whereas PM10 is mostly influenced by different factors. The statistical techniques identified three recurrent sources, associated with vehicular traffic and particulate transport, covering over 90% of variance. The contemporaneous analysis of gas and PM10 has allowed underlining the differences between the sources of these pollutants. Conclusions The analysis of the pollutant trends from large data set and the application of multivariate statistical techniques such as PCA and APCS can give useful information about air quality and pollutant’s sources. These knowledge can provide useful advices to environmental policies in order to reach the WHO recommended levels. PMID:24555534
Wu, Chen-Jiang; Wang, Qing; Li, Hai; Wang, Xiao-Ning; Liu, Xi-Sheng; Shi, Hai-Bin; Zhang, Yu-Dong
2015-10-01
To investigate diagnostic efficiency of DWI using entire-tumor histogram analysis in differentiating the low-grade (LG) prostate cancer (PCa) from intermediate-high-grade (HG) PCa in comparison with conventional ROI-based measurement. DW images (b of 0-1400 s/mm(2)) from 126 pathology-confirmed PCa (diameter >0.5 cm) in 110 patients were retrospectively collected and processed by mono-exponential model. The measurement of tumor apparent diffusion coefficients (ADCs) was performed with using histogram-based and ROI-based approach, respectively. The diagnostic ability of ADCs from two methods for differentiating LG-PCa (Gleason score, GS ≤ 6) from HG-PCa (GS > 6) was determined by ROC regression, and compared by McNemar's test. There were 49 LG-tumor and 77 HG-tumor at pathologic findings. Histogram-based ADCs (mean, median, 10th and 90th) and ROI-based ADCs (mean) showed dominant relationships with ordinal GS of Pca (ρ = -0.225 to -0.406, p < 0.05). All above imaging indices reflected significant difference between LG-PCa and HG-PCa (all p values <0.01). Histogram 10th ADCs had dominantly high Az (0.738), Youden index (0.415), and positive likelihood ratio (LR+, 2.45) in stratifying tumor GS against mean, median and 90th ADCs, and ROI-based ADCs. Histogram mean, median, and 10th ADCs showed higher specificity (65.3%-74.1% vs. 44.9%, p < 0.01), but lower sensitivity (57.1%-71.3% vs. 84.4%, p < 0.05) than ROI-based ADCs in differentiating LG-PCa from HG-PCa. DWI-associated histogram analysis had higher specificity, Az, Youden index, and LR+ for differentiation of PCa Gleason grade than ROI-based approach.
de Falco, Bruna; Incerti, Guido; Pepe, Rosa; Amato, Mariana; Lanzotti, Virginia
2016-09-01
Globe artichoke (Cynara cardunculus L. var. scolymus L. Fiori) and cardoon (Cynara cardunculus L. var. altilis DC) are sources of nutraceuticals and bioactive compounds. To apply a NMR metabolomic fingerprinting approach to Cynara cardunculus heads to obtain simultaneous identification and quantitation of the major classes of organic compounds. The edible part of 14 Globe artichoke populations, belonging to the Romaneschi varietal group, were extracted to obtain apolar and polar organic extracts. The analysis was also extended to one species of cultivated cardoon for comparison. The (1) H-NMR of the extracts allowed simultaneous identification of the bioactive metabolites whose quantitation have been obtained by spectral integration followed by principal component analysis (PCA). Apolar organic extracts were mainly based on highly unsaturated long chain lipids. Polar organic extracts contained organic acids, amino acids, sugars (mainly inulin), caffeoyl derivatives (mainly cynarin), flavonoids, and terpenes. The level of nutraceuticals was found to be highest in the Italian landraces Bianco di Pertosa zia E and Natalina while cardoon showed the lowest content of all metabolites thus confirming the genetic distance between artichokes and cardoon. Metabolomic approach coupling NMR spectroscopy with multivariate data analysis allowed for a detailed metabolite profile of artichoke and cardoon varieties to be obtained. Relevant differences in the relative content of the metabolites were observed for the species analysed. This work is the first application of (1) H-NMR with multivariate statistics to provide a metabolomic fingerprinting of Cynara scolymus. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
NASA Astrophysics Data System (ADS)
Tsai, Jinn-Tsong; Chou, Ping-Yi; Chou, Jyh-Horng
2015-11-01
The aim of this study is to generate vector quantisation (VQ) codebooks by integrating principle component analysis (PCA) algorithm, Linde-Buzo-Gray (LBG) algorithm, and evolutionary algorithms (EAs). The EAs include genetic algorithm (GA), particle swarm optimisation (PSO), honey bee mating optimisation (HBMO), and firefly algorithm (FF). The study is to provide performance comparisons between PCA-EA-LBG and PCA-LBG-EA approaches. The PCA-EA-LBG approaches contain PCA-GA-LBG, PCA-PSO-LBG, PCA-HBMO-LBG, and PCA-FF-LBG, while the PCA-LBG-EA approaches contain PCA-LBG, PCA-LBG-GA, PCA-LBG-PSO, PCA-LBG-HBMO, and PCA-LBG-FF. All training vectors of test images are grouped according to PCA. The PCA-EA-LBG used the vectors grouped by PCA as initial individuals, and the best solution gained by the EAs was given for LBG to discover a codebook. The PCA-LBG approach is to use the PCA to select vectors as initial individuals for LBG to find a codebook. The PCA-LBG-EA used the final result of PCA-LBG as an initial individual for EAs to find a codebook. The search schemes in PCA-EA-LBG first used global search and then applied local search skill, while in PCA-LBG-EA first used local search and then employed global search skill. The results verify that the PCA-EA-LBG indeed gain superior results compared to the PCA-LBG-EA, because the PCA-EA-LBG explores a global area to find a solution, and then exploits a better one from the local area of the solution. Furthermore the proposed PCA-EA-LBG approaches in designing VQ codebooks outperform existing approaches shown in the literature.
Association between dietary phytoestrogens intakes and prostate cancer risk in Sicily.
Russo, Giorgio I; Di Mauro, Marina; Regis, Federica; Reale, Giulio; Campisi, Daniele; Marranzano, Marina; Lo Giudice, Arturo; Solinas, Tatiana; Madonia, Massimo; Cimino, Sebastiano; Morgia, Giuseppe
2018-03-01
In this study we aimed to investigate the association between dietary phytoestrogen consumption and prostate cancer in a sample of southern Italian individuals. A population-based case-control study on the association between prostate cancer and dietary factors was conducted from January 2015 to December 2016 in a single institution of the municipality of Catania, southern Italy (Registration number: 41/2015). A total of 118 histopathological-verified prostate cancer (PCa) cases and a total of 222 controls were collected. Dietary data was collected by using two food frequency questionnaires. Patients with PCa consumed significantly higher levels of phytoestrogens. Multivariate logistic regression showed that lignans (Q[quartile]4 vs. Q1, OR [odds ratio] = 4.72; p < .05) and specifically, lariciresinol (Q4 vs. Q1, OR = 4.60; p < .05), pinoresinol (Q4 vs. Q1, OR = 5.62; p < .05), matairesinol (Q4 vs. Q1, OR = 3.63; p < .05), secoisolariciresinol (Q4 vs. Q1, OR = 4.10; p < .05) were associated with increased risk of PCa. Furthermore, we found that isoflavones (Q3 vs. Q1, OR = 0.28; p < .05) and specifically, genistein (Q4 vs. Q1, OR = 0.40; p < .05) were associated with reduced risk of PCa. We found of an inverse association between dietary isoflavone intake and PCa, while a positive association was found with lignans intake.
Wang, Jing; Wu, Chen-Jiang; Bao, Mei-Ling; Zhang, Jing; Wang, Xiao-Ning; Zhang, Yu-Dong
2017-10-01
To investigate whether machine learning-based analysis of MR radiomics can help improve the performance PI-RADS v2 in clinically relevant prostate cancer (PCa). This IRB-approved study included 54 patients with PCa undergoing multi-parametric (mp) MRI before prostatectomy. Imaging analysis was performed on 54 tumours, 47 normal peripheral (PZ) and 48 normal transitional (TZ) zone based on histological-radiological correlation. Mp-MRI was scored via PI-RADS, and quantified by measuring radiomic features. Predictive model was developed using a novel support vector machine trained with: (i) radiomics, (ii) PI-RADS scores, (iii) radiomics and PI-RADS scores. Paired comparison was made via ROC analysis. For PCa versus normal TZ, the model trained with radiomics had a significantly higher area under the ROC curve (Az) (0.955 [95% CI 0.923-0.976]) than PI-RADS (Az: 0.878 [0.834-0.914], p < 0.001). The Az between them was insignificant for PCa versus PZ (0.972 [0.945-0.988] vs. 0.940 [0.905-0.965], p = 0.097). When radiomics was added, performance of PI-RADS was significantly improved for PCa versus PZ (Az: 0.983 [0.960-0.995]) and PCa versus TZ (Az: 0.968 [0.940-0.985]). Machine learning analysis of MR radiomics can help improve the performance of PI-RADS in clinically relevant PCa. • Machine-based analysis of MR radiomics outperformed in TZ cancer against PI-RADS. • Adding MR radiomics significantly improved the performance of PI-RADS. • DKI-derived Dapp and Kapp were two strong markers for the diagnosis of PCa.
NASA Astrophysics Data System (ADS)
Gharibnezhad, Fahit; Mujica, Luis E.; Rodellar, José
2015-01-01
Using Principal Component Analysis (PCA) for Structural Health Monitoring (SHM) has received considerable attention over the past few years. PCA has been used not only as a direct method to identify, classify and localize damages but also as a significant primary step for other methods. Despite several positive specifications that PCA conveys, it is very sensitive to outliers. Outliers are anomalous observations that can affect the variance and the covariance as vital parts of PCA method. Therefore, the results based on PCA in the presence of outliers are not fully satisfactory. As a main contribution, this work suggests the use of robust variant of PCA not sensitive to outliers, as an effective way to deal with this problem in SHM field. In addition, the robust PCA is compared with the classical PCA in the sense of detecting probable damages. The comparison between the results shows that robust PCA can distinguish the damages much better than using classical one, and even in many cases allows the detection where classic PCA is not able to discern between damaged and non-damaged structures. Moreover, different types of robust PCA are compared with each other as well as with classical counterpart in the term of damage detection. All the results are obtained through experiments with an aircraft turbine blade using piezoelectric transducers as sensors and actuators and adding simulated damages.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wu, J; Gong, G; Cui, Y
Purpose: To predict early pathological response of breast cancer to neoadjuvant chemotherapy (NAC) based on quantitative, multi-region analysis of dynamic contrast enhancement magnetic resonance imaging (DCE-MRI). Methods: In this institution review board-approved study, 35 patients diagnosed with stage II/III breast cancer were retrospectively investigated using DCE-MR images acquired before and after the first cycle of NAC. First, principal component analysis (PCA) was used to reduce the dimensionality of the DCE-MRI data with a high-temporal resolution. We then partitioned the whole tumor into multiple subregions using k-means clustering based on the PCA-defined eigenmaps. Within each tumor subregion, we extracted four quantitativemore » Haralick texture features based on the gray-level co-occurrence matrix (GLCM). The change in texture features in each tumor subregion between pre- and during-NAC was used to predict pathological complete response after NAC. Results: Three tumor subregions were identified through clustering, each with distinct enhancement characteristics. In univariate analysis, all imaging predictors except one extracted from the tumor subregion associated with fast wash-out were statistically significant (p< 0.05) after correcting for multiple testing, with area under the ROC curve or AUCs between 0.75 and 0.80. In multivariate analysis, the proposed imaging predictors achieved an AUC of 0.79 (p = 0.002) in leave-one-out cross validation. This improved upon conventional imaging predictors such as tumor volume (AUC=0.53) and texture features based on whole-tumor analysis (AUC=0.65). Conclusion: The heterogeneity of the tumor subregion associated with fast wash-out on DCE-MRI predicted early pathological response to neoadjuvant chemotherapy in breast cancer.« less
Fraga, Angelina Bossi; de Lima Silva, Fabiane; Hongyu, Kuang; Da Silva Santos, Darlim; Murphy, Thomas Wayne; Lopes, Fernando Brito
2016-03-01
The objective of this research was to try to unveil the relationship between production traits and genotypic proportions of crossbred dairy cattle using principal component analysis (PCA) and cluster analysis. The herd consists of crossbred animals of Holstein (H) and Zebu (Z) (Gir and Guzerat) in different genotypic proportions; the composition of which varies from 12.5 to 100.0 % of the genetic group H. For this study, 834 milk production records from 257 cows from the years 1997 to 2014 were analyzed. The animals were all managed at a farm located in northeastern Brazil. The variables in the PCA were total milk yield per lactation (MY), milk yield adjusted to 305 days (MY305), lactation length (LL), and proportion of H and Z breeding. This analysis reduced the size of the sample space from the original five variables to two principal components (PCs) that together explained 89.4 % of the total variation. MY, MY305, LL, and genotypic proportion of H all contributed positively to PC1. The genotypic proportion of Z contributed negatively, which established a contrast between H and Z. Further cluster analysis identified two distinct groups when considering production performance and genotype of the animals. The high-performance group was predominantly Holstein breeding, while the lower performing group consisted mostly of Zebu. Under the environmental and management conditions in which this research was conducted, the best performances for the traits considered were achieved from cows whose genotypic proportion was between 38.0 and 94.0 % Holstein breeding.
Kim, Yong-June; Yoon, Hyung-Yoon; Kim, Seon-Kyu; Kim, Young-Won; Kim, Eun-Jung; Kim, Isaac Yi; Kim, Wun-Jae
2011-07-01
Abnormal DNA methylation is associated with many human cancers. The aim of the present study was to identify novel methylation markers in prostate cancer (PCa) by microarray analysis and to test whether these markers could discriminate normal and PCa cells. Microarray-based DNA methylation and gene expression profiling was carried out using a panel of PCa cell lines and a control normal prostate cell line. The methylation status of candidate genes in prostate cell lines was confirmed by real-time reverse transcriptase-PCR, bisulfite sequencing analysis, and treatment with a demethylation agent. DNA methylation and gene expression analysis in 203 human prostate specimens, including 106 PCa and 97 benign prostate hyperplasia (BPH), were carried out. Further validation using microarray gene expression data from the Gene Expression Omnibus (GEO) was carried out. Epidermal growth factor-containing fibulin-like extracellular matrix protein 1 (EFEMP1) was identified as a lead candidate methylation marker for PCa. The gene expression level of EFEMP1 was significantly higher in tissue samples from patients with BPH than in those with PCa (P < 0.001). The sensitivity and specificity of EFEMP1 methylation status in discriminating between PCa and BPH reached 95.3% (101 of 106) and 86.6% (84 of 97), respectively. From the GEO data set, we confirmed that the expression level of EFEMP1 was significantly different between PCa and BPH. Genome-wide characterization of DNA methylation profiles enabled the identification of EFEMP1 aberrant methylation patterns in PCa. EFEMP1 might be a useful indicator for the detection of PCa.
NASA Astrophysics Data System (ADS)
Du, Jun; Wang, Ping; Yue, Shuhua
2016-10-01
Most prostate cancers (PCa) are slowly growing, and only the aggressive ones require early diagnosis and effective treatment. The current standard for PCa diagnosis remains histopathology. Nonetheless, for the differentiation between Gleason score 6 (low-risk PCa), which can be left without treatment, and Gleason score 7 (high-risk PCa), which requires active treatment, the inter-observer discordance can be up to 40%. Our previous study reveals that cholesteryl ester (CE) accumulation induced by PI3K/AKT activation underlies human PCa aggressiveness. However, Raman spectromicroscopy used in this study could only provide compositional information of certain lipid droplets (LDs) selected by the observer, which overlooked cell-to-cell variation and hindered translation to accurate automated diagnosis. Here, we demonstrated quantitative mapping of CE level in human prostate tissues using hyperspectral stimulated Raman scattering (SRS) microscopy that renders compositional information for every pixel in the image. Specifically, hundreds of SRS images at Raman shift between 1620-1800 cm-1 were taken, and multivariate curve resolution algorism was used to retrieve concentration images of acyl C=C bond, sterol C=C bond, and ester C=O bond. Given that the ratio between images of sterol C=C and ester C=O (sterol C=C/C=O) is nonlinearly proportional to CE percentage out of total lipid, we were able to quantitatively map CE level. Our data showed that CE level was significantly greater in high Gleason grade compared to low Gleason grade, and could be a factor that significantly contributed to cancer recurrence. Our study provides an opportunity towards more accurate PCa diagnosis and prediction of aggressiveness.
NASA Astrophysics Data System (ADS)
Xu, M. L.; Yu, Y.; Ramaswamy, H. S.; Zhu, S. M.
2017-01-01
Chinese liquor aroma components were characterized during the aging process using gas chromatography (GC). Principal component and cluster analysis (PCA, CA) were used to discriminate the Chinese liquor age which has a great economic value. Of a total of 21 major aroma components identified and quantified, 13 components which included several acids, alcohols, esters, aldehydes and furans decreased significantly in the first year of aging, maintained the same levels (p > 0.05) for next three years and decreased again (p < 0.05) in the fifth year. On the contrary, a significant increase was observed in propionic acid, furfural and phenylethanol. Ethyl lactate was found to be the most stable aroma component during aging process. Results of PCA and CA demonstrated that young liquor (fresh) and aged liquors were well separated from each other, which is in consistent with the evolution of aroma components along with the aging process. These findings provide a quantitative basis for discriminating the Chinese liquor age and a scientific basis for further research on elucidating the liquor aging process, and a possible tool to guard against counterfeit and defective products.
Dai, Liping; Li, Jitian; Xing, Mengtao; Sanchez, Tino W; Casiano, Carlos A; Zhang, Jian-Ying
2016-11-01
The prostate-specific antigen (PSA) testing has been widely implemented for the early detection and management of prostate cancer (PCa). However, the lack of specificity has led to overdiagnosis, resulting in many possibly unnecessary biopsies and overtreatment. Therefore, novel serological biomarkers with high sensitivity and specificity are of vital importance needed to complement PSA testing in the early diagnosis and effective management of PCa. This is particularly critical in the context of PCa health disparities, where early detection and management could help reduce the disproportionately high PCa mortality observed in African-American men. Previous studies have demonstrated that sera from patients with PCa contain autoantibodies that react with tumor-associated antigens (TAAs). The serological proteome analysis (SERPA) approach was used to identify tumor-associated antigens (TAAs) of PCa. In evaluation study, the level of anti-NPM1 antibody was examined in sera from test cohort, validation cohort, as well as European-American (EA) and African-American (AA) men with PCa by using immunoassay. Nucleophosmin 1 (NPM1) as a 33 kDa TAA in PCa was identified and characterized by SERPA approach. Anti-NPM1 antibody level in PCa was higher than in benign prostatic hyperplasia (BPH) patients and healthy individuals. Receiver operating characteristic (ROC) curve analysis showed similar high diagnostic value for PCa in the test cohort (area under the curve (AUC):0.860) and validation cohort (AUC: 0.822) to differentiate from normal individuals and BPH. Interestingly, AUC values were significantly higher for AA PCa patients. When considering concurrent serum measurements of anti-NPM1 antibody and PSA, 97.1% PCa patients at early stage were identified correctly, while 69.2% BPH patients who had elevated PSA levels were found to be anti-NPM1 negative. Additionally, anti-NPM1 antibody levels in PCa patients at early stage significantly increased after surgery treatment. This intriguing data suggested that NPM1 can elicit autoantibody response in PCa and might be a potential biomarker for the immunodiagnosis and prognosis of PCa, and for supplementing PSA testing in distinguishing PCa from BPH. Prostate 76:1375-1386, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Kumar, Raj G; Rubin, Jonathan E; Berger, Rachel P; Kochanek, Patrick M; Wagner, Amy K
2016-03-01
Studies have characterized absolute levels of multiple inflammatory markers as significant risk factors for poor outcomes after traumatic brain injury (TBI). However, inflammatory marker concentrations are highly inter-related, and production of one may result in the production or regulation of another. Therefore, a more comprehensive characterization of the inflammatory response post-TBI should consider relative levels of markers in the inflammatory pathway. We used principal component analysis (PCA) as a dimension-reduction technique to characterize the sets of markers that contribute independently to variability in cerebrospinal (CSF) inflammatory profiles after TBI. Using PCA results, we defined groups (or clusters) of individuals (n=111) with similar patterns of acute CSF inflammation that were then evaluated in the context of outcome and other relevant CSF and serum biomarkers collected days 0-3 and 4-5 post-injury. We identified four significant principal components (PC1-PC4) for CSF inflammation from days 0-3, and PC1 accounted for the greatest (31%) percentage of variance. PC1 was characterized by relatively higher CSF sICAM-1, sFAS, IL-10, IL-6, sVCAM-1, IL-5, and IL-8 levels. Cluster analysis then defined two distinct clusters, such that individuals in cluster 1 had highly positive PC1 scores and relatively higher levels of CSF cortisol, progesterone, estradiol, testosterone, brain derived neurotrophic factor (BDNF), and S100b; this group also had higher serum cortisol and lower serum BDNF. Multinomial logistic regression analyses showed that individuals in cluster 1 had a 10.9 times increased likelihood of GOS scores of 2/3 vs. 4/5 at 6 months compared to cluster 2, after controlling for covariates. Cluster group did not discriminate between mortality compared to GOS scores of 4/5 after controlling for age and other covariates. Cluster groupings also did not discriminate mortality or 12 month outcomes in multivariate models. PCA and cluster analysis establish that a subset of CSF inflammatory markers measured in days 0-3 post-TBI may distinguish individuals with poor 6-month outcome, and future studies should prospectively validate these findings. PCA of inflammatory mediators after TBI could aid in prognostication and in identifying patient subgroups for therapeutic interventions. Copyright © 2015 Elsevier Inc. All rights reserved.
Ragazzo-Sánchez, Juan Arturo; Ortiz-Basurto, Rosa Isela; Luna-Solano, Guadalupe; Calderón-Santoyo, Montserrat
2014-01-01
Blackberry (Rubus sp.) juice was fermented using four different strains of Saccharomyces cerevisiae (Vitilevure-CM4457, Enoferm-T306, ICV-K1, and Greroche Rhona-L3574) recognized because of their use in the wine industry. A medium alcoholic graduation spirit (<6°GL) with potential to be produced at an industrial scale was obtained. Alcoholic fermentations were performed at 28°C, 200 rpm, and noncontrolled pH. The synergistic effect on the aromatic compounds production during fermentation in mixed culture was compared with those obtained by monoculture and physic mixture of spirits produced in monoculture. The aromatic composition was determined by HS-SPME-GC. The differences in aromatic profile principally rely on the proportions in aromatic compounds and not on the number of those compounds. The multivariance analysis, principal component analysis (PCA), and factorial discriminant analysis (DFA) permit to demonstrate the synergism between the strains. PMID:25506606
Surface-enhanced Raman spectroscopy for differentiation between benign and malignant thyroid tissues
NASA Astrophysics Data System (ADS)
Li, Zuanfang; Li, Chao; Lin, Duo; Huang, Zufang; Pan, Jianji; Chen, Guannan; Lin, Juqiang; Liu, Nenrong; Yu, Yun; Feng, Shangyuan; Chen, Rong
2014-04-01
The aim of this study was to evaluate the potential of applying silver nano-particle based surface-enhanced Raman scattering (SERS) to discriminate different types of human thyroid tissues. SERS measurements were performed on three groups of tissue samples including thyroid cancers (n = 32), nodular goiters (n = 20) and normal thyroid tissues (n = 25). Tentative assignments of the measured tissue SERS spectra suggest interesting cancer specific biomolecular differences. The principal component analysis (PCA) and linear discriminate analysis (LDA) together with the leave-one-out, cross-validated technique yielded diagnostic sensitivities of 92%, 75% and 87.5%; and specificities of 82.6%, 89.4% and 84.4%, respectively, for differentiation among normal, nodular and malignant thyroid tissue samples. This work demonstrates that tissue SERS spectroscopy associated with multivariate analysis diagnostic algorithms has great potential for detection of thyroid cancer at the molecular level.
Untargeted Identification of Wood Type-Specific Markers in Particulate Matter from Wood Combustion.
Weggler, Benedikt A; Ly-Verdu, Saray; Jennerwein, Maximilian; Sippula, Olli; Reda, Ahmed A; Orasche, Jürgen; Gröger, Thomas; Jokiniemi, Jorma; Zimmermann, Ralf
2016-09-20
Residential wood combustion emissions are one of the major global sources of particulate and gaseous organic pollutants. However, the detailed chemical compositions of these emissions are poorly characterized due to their highly complex molecular compositions, nonideal combustion conditions, and sample preparation steps. In this study, the particulate organic emissions from a masonry heater using three types of wood logs, namely, beech, birch, and spruce, were chemically characterized using thermal desorption in situ derivatization coupled to a GCxGC-ToF/MS system. Untargeted data analyses were performed using the comprehensive measurements. Univariate and multivariate chemometric tools, such as analysis of variance (ANOVA), principal component analysis (PCA), and ANOVA simultaneous component analysis (ASCA), were used to reduce the data to highly significant and wood type-specific features. This study reveals substances not previously considered in the literature as meaningful markers for differentiation among wood types.
DOE Office of Scientific and Technical Information (OSTI.GOV)
2012-01-05
SandiaMCR was developed to identify pure components and their concentrations from spectral data. This software efficiently implements the multivariate calibration regression alternating least squares (MCR-ALS), principal component analysis (PCA), and singular value decomposition (SVD). Version 3.37 also includes the PARAFAC-ALS Tucker-1 (for trilinear analysis) algorithms. The alternating least squares methods can be used to determine the composition without or with incomplete prior information on the constituents and their concentrations. It allows the specification of numerous preprocessing, initialization and data selection and compression options for the efficient processing of large data sets. The software includes numerous options including the definition ofmore » equality and non-negativety constraints to realistically restrict the solution set, various normalization or weighting options based on the statistics of the data, several initialization choices and data compression. The software has been designed to provide a practicing spectroscopist the tools required to routinely analysis data in a reasonable time and without requiring expert intervention.« less
Spyropoulos, Evangelos; Kotsiris, Dimitrios; Spyropoulos, Katherine; Panagopoulos, Aggelos; Galanakis, Ioannis; Mavrikos, Stamatios
2017-02-01
We developed a mathematical "prostate cancer (PCa) conditions simulating" predictive model (PCP-SMART), from which we derived a novel PCa predictor (prostate cancer risk determinator [PCRD] index) and a PCa risk equation. We used these to estimate the probability of finding PCa on prostate biopsy, on an individual basis. A total of 371 men who had undergone transrectal ultrasound-guided prostate biopsy were enrolled in the present study. Given that PCa risk relates to the total prostate-specific antigen (tPSA) level, age, prostate volume, free PSA (fPSA), fPSA/tPSA ratio, and PSA density and that tPSA ≥ 50 ng/mL has a 98.5% positive predictive value for a PCa diagnosis, we hypothesized that correlating 2 variables composed of 3 ratios (1, tPSA/age; 2, tPSA/prostate volume; and 3, fPSA/tPSA; 1 variable including the patient's tPSA and the other, a tPSA value of 50 ng/mL) could operate as a PCa conditions imitating/simulating model. Linear regression analysis was used to derive the coefficient of determination (R 2 ), termed the PCRD index. To estimate the PCRD index's predictive validity, we used the χ 2 test, multiple logistic regression analysis with PCa risk equation formation, calculation of test performance characteristics, and area under the receiver operating characteristic curve analysis using SPSS, version 22 (P < .05). The biopsy findings were positive for PCa in 167 patients (45.1%) and negative in 164 (44.2%). The PCRD index was positively signed in 89.82% positive PCa cases and negative in 91.46% negative PCa cases (χ 2 test; P < .001; relative risk, 8.98). The sensitivity was 89.8%, specificity was 91.5%, positive predictive value was 91.5%, negative predictive value was 89.8%, positive likelihood ratio was 10.5, negative likelihood ratio was 0.11, and accuracy was 90.6%. Multiple logistic regression revealed the PCRD index as an independent PCa predictor, and the formulated risk equation was 91% accurate in predicting the probability of finding PCa. On the receiver operating characteristic analysis, the PCRD index (area under the curve, 0.926) significantly (P < .001) outperformed other, established PCa predictors. The PCRD index effectively predicted the prostate biopsy outcome, correctly identifying 9 of 10 men who were eventually diagnosed with PCa and correctly ruling out PCa for 9 of 10 men who did not have PCa. Its predictive power significantly outperformed established PCa predictors, and the formulated risk equation accurately calculated the probability of finding cancer on biopsy, on an individual patient basis. Copyright © 2016 Elsevier Inc. All rights reserved.
GO-PCA: An Unsupervised Method to Explore Gene Expression Data Using Prior Knowledge
Wagner, Florian
2015-01-01
Method Genome-wide expression profiling is a widely used approach for characterizing heterogeneous populations of cells, tissues, biopsies, or other biological specimen. The exploratory analysis of such data typically relies on generic unsupervised methods, e.g. principal component analysis (PCA) or hierarchical clustering. However, generic methods fail to exploit prior knowledge about the molecular functions of genes. Here, I introduce GO-PCA, an unsupervised method that combines PCA with nonparametric GO enrichment analysis, in order to systematically search for sets of genes that are both strongly correlated and closely functionally related. These gene sets are then used to automatically generate expression signatures with functional labels, which collectively aim to provide a readily interpretable representation of biologically relevant similarities and differences. The robustness of the results obtained can be assessed by bootstrapping. Results I first applied GO-PCA to datasets containing diverse hematopoietic cell types from human and mouse, respectively. In both cases, GO-PCA generated a small number of signatures that represented the majority of lineages present, and whose labels reflected their respective biological characteristics. I then applied GO-PCA to human glioblastoma (GBM) data, and recovered signatures associated with four out of five previously defined GBM subtypes. My results demonstrate that GO-PCA is a powerful and versatile exploratory method that reduces an expression matrix containing thousands of genes to a much smaller set of interpretable signatures. In this way, GO-PCA aims to facilitate hypothesis generation, design of further analyses, and functional comparisons across datasets. PMID:26575370
GO-PCA: An Unsupervised Method to Explore Gene Expression Data Using Prior Knowledge.
Wagner, Florian
2015-01-01
Genome-wide expression profiling is a widely used approach for characterizing heterogeneous populations of cells, tissues, biopsies, or other biological specimen. The exploratory analysis of such data typically relies on generic unsupervised methods, e.g. principal component analysis (PCA) or hierarchical clustering. However, generic methods fail to exploit prior knowledge about the molecular functions of genes. Here, I introduce GO-PCA, an unsupervised method that combines PCA with nonparametric GO enrichment analysis, in order to systematically search for sets of genes that are both strongly correlated and closely functionally related. These gene sets are then used to automatically generate expression signatures with functional labels, which collectively aim to provide a readily interpretable representation of biologically relevant similarities and differences. The robustness of the results obtained can be assessed by bootstrapping. I first applied GO-PCA to datasets containing diverse hematopoietic cell types from human and mouse, respectively. In both cases, GO-PCA generated a small number of signatures that represented the majority of lineages present, and whose labels reflected their respective biological characteristics. I then applied GO-PCA to human glioblastoma (GBM) data, and recovered signatures associated with four out of five previously defined GBM subtypes. My results demonstrate that GO-PCA is a powerful and versatile exploratory method that reduces an expression matrix containing thousands of genes to a much smaller set of interpretable signatures. In this way, GO-PCA aims to facilitate hypothesis generation, design of further analyses, and functional comparisons across datasets.
Li, Ziyi; Safo, Sandra E; Long, Qi
2017-07-11
Sparse principal component analysis (PCA) is a popular tool for dimensionality reduction, pattern recognition, and visualization of high dimensional data. It has been recognized that complex biological mechanisms occur through concerted relationships of multiple genes working in networks that are often represented by graphs. Recent work has shown that incorporating such biological information improves feature selection and prediction performance in regression analysis, but there has been limited work on extending this approach to PCA. In this article, we propose two new sparse PCA methods called Fused and Grouped sparse PCA that enable incorporation of prior biological information in variable selection. Our simulation studies suggest that, compared to existing sparse PCA methods, the proposed methods achieve higher sensitivity and specificity when the graph structure is correctly specified, and are fairly robust to misspecified graph structures. Application to a glioblastoma gene expression dataset identified pathways that are suggested in the literature to be related with glioblastoma. The proposed sparse PCA methods Fused and Grouped sparse PCA can effectively incorporate prior biological information in variable selection, leading to improved feature selection and more interpretable principal component loadings and potentially providing insights on molecular underpinnings of complex diseases.
Rodrigues-Filho, J L; Abe, D S; Gatti-Junior, P; Medeiros, G R; Degani, R M; Blanco, F P; Faria, C R L; Campanelli, L; Soares, F S; Sidagis-Galli, C V; Teixeira-Silva, V; Tundisi, J E M; Matsmura-Tundisi, T; Tundisi, J G
2015-08-01
The Xingu River, one of the most important of the Amazon Basin, is characterized by clear and transparent waters that drain a 509.685 km2 watershed with distinct hydrological and ecological conditions and anthropogenic pressures along its course. As in other basins of the Amazon system, studies in the Xingu are scarce. Furthermore, the eminent construction of the Belo Monte for hydropower production, which will alter the environmental conditions in the basin in its lower middle portion, denotes high importance of studies that generate relevant information that may subsidize a more balanced and equitable development in the Amazon region. Thus, the aim of this study was to analyze the water quality in the Xingu River and its tributaries focusing on spatial patterns by the use of multivariate statistical techniques, identifying which water quality parameters were more important for the environmental changes in the watershed. Data sampling were carried out during two complete hydrological cycles in twenty-five sampling stations. The data of twenty seven variables were analyzed by Spearman's correlation coefficients, cluster analysis (CA), and principal component analysis (PCA). The results showed a high auto-correlation between variables (> 0.7). These variables were removed from multivariate analyzes because they provided redundant information about the environment. The CA resulted in the formation of six clusters, which were clearly observed in the PCA and were characterized by different water quality. The statistical results allowed to identify a high spatial variation in the water quality, which were related to specific features of the environment, different uses, influences of anthropogenic activities and geochemical characteristics of the drained basins. It was also demonstrated that most of the sampling stations in the Xingu River basin showed good water quality, due to the absence of local impacts and high power of depuration of the river itself.
Haware, Rahul V; Bauer-Brandl, Annette; Tho, Ingunn
2010-01-01
The present work challenges a newly developed approach to tablet formulation development by using chemically identical materials (grades and brands of microcrystalline cellulose). Tablet properties with respect to process and formulation parameters (e.g. compression speed, added lubricant and Emcompress fractions) were evaluated by 2(3)-factorial designs. Tablets of constant true volume were prepared on a compaction simulator at constant pressure (approx. 100 MPa). The highly repeatable and accurate force-displacement data obtained was evaluated by simple 'in-die' Heckel method and work descriptors. Relationships and interactions between formulation, process and tablet parameters were identified and quantified by multivariate analysis techniques; principal component analysis (PCA) and partial least square regressions (PLS). The method proved to be able to distinguish between different grades of MCC and even between two different brands of the same grade (Avicel PH 101 and Vivapur 101). One example of interaction was studied in more detail by mixed level design: The interaction effect of lubricant and Emcompress on elastic recovery of Avicel PH 102 was demonstrated to be complex and non-linear using the development tool under investigation.
Camelo-Méndez, G A; Ragazzo-Sánchez, J A; Jiménez-Aparicio, A R; Vanegas-Espinoza, P E; Paredes-López, O; Del Villar-Martínez, A A
2013-09-01
Anthocyanins are a group of water-soluble pigments that provide red, purple or blue color to the leaves, flowers, and fruits. In addition, benefits have been attributed to hypertension and cardiovascular diseases. This study compared the content of total anthocyanins and volatile compounds in aqueous and ethanolic extracts of four varieties of Mexican roselle, with different levels of pigmentation. The multivariable analysis of categorical data demonstrated that ethanol was the best solvent for the extraction of both anthocyanins and volatile compounds. The concentration of anthocyanin in pigmented varieties ranged from 17.3 to 32.2 mg of cyanidin 3-glucoside/g dry weight, while volatile compounds analysis showed that geraniol was the main compound in extracts from the four varieties. The principal component analysis (PCA) allowed description of results with 77.38% of variance establishing a clear grouping for each variety in addition to similarities among some of these varieties. These results were validated by the confusion matrix obtained in the classification by the factorial discriminate analysis (FDA); it can be useful for roselle varieties classification. Small differences in anthocyanin and volatile compounds content could be detected, and it may be of interest for the food industry in order to classify a new individual into one of several groups using different variables at once.
NASA Astrophysics Data System (ADS)
Shimada, Toru; Hasegawa, Takeshi
2017-10-01
The pH dependent chemical structures of bromothymol blue (BTB), which have long been under controversy, are determined by employing a combined technique of multivariate analysis of electronic absorption spectra and quantum chemistry. Principle component analysis (PCA) of the pH dependent spectra apparently reveals that only two chemical species are adequate to fully account for the color changes, with which the spectral decomposition is readily performed by using augmented alternative least-squares (ALS) regression analysis. The quantity variation by the ALS analysis also reveals the practical acid dissociation constant, pKa‧. The determination of pKa‧ is performed for various ionic strengths, which reveals the thermodynamic acid constant (pKa = 7.5) and the number of charge on each chemical species; the yellow form is negatively charged species of - 1 and the blue form that of - 2. On this chemical information, the quantum chemical calculation is carried out to find that BTB molecules take the pure quinoid form in an acid solution and the quinoid-phenolate form in an alkaline solution. The time-dependent density functional theory (TD-DFT) calculations for the theoretically determined chemical structures account for the peak shift of the electronic spectra. In this manner, the structures of all the chemical species appeared in equilibrium have finally been confirmed.
Shimada, Toru; Hasegawa, Takeshi
2017-10-05
The pH dependent chemical structures of bromothymol blue (BTB), which have long been under controversy, are determined by employing a combined technique of multivariate analysis of electronic absorption spectra and quantum chemistry. Principle component analysis (PCA) of the pH dependent spectra apparently reveals that only two chemical species are adequate to fully account for the color changes, with which the spectral decomposition is readily performed by using augmented alternative least-squares (ALS) regression analysis. The quantity variation by the ALS analysis also reveals the practical acid dissociation constant, pK a '. The determination of pK a ' is performed for various ionic strengths, which reveals the thermodynamic acid constant (pK a =7.5) and the number of charge on each chemical species; the yellow form is negatively charged species of -1 and the blue form that of -2. On this chemical information, the quantum chemical calculation is carried out to find that BTB molecules take the pure quinoid form in an acid solution and the quinoid-phenolate form in an alkaline solution. The time-dependent density functional theory (TD-DFT) calculations for the theoretically determined chemical structures account for the peak shift of the electronic spectra. In this manner, the structures of all the chemical species appeared in equilibrium have finally been confirmed. Copyright © 2017 Elsevier B.V. All rights reserved.
A precipitation regionalization and regime for Iran based on multivariate analysis
NASA Astrophysics Data System (ADS)
Raziei, Tayeb
2018-02-01
Monthly precipitation time series of 155 synoptic stations distributed over Iran, covering 1990-2014 time period, were used to identify areas with different precipitation time variability and regimes utilizing S-mode principal component analysis (PCA) and cluster analysis (CA) preceded by T-mode PCA, respectively. Taking into account the maximum loading values of the rotated components, the first approach revealed five sub-regions characterized by different precipitation time variability, while the second method delineated eight sub-regions featured with different precipitation regimes. The sub-regions identified by the two used methods, although partly overlapping, are different considering their areal extent and complement each other as they are useful for different purposes and applications. Northwestern Iran and the Caspian Sea area were found as the two most distinctive Iranian precipitation sub-regions considering both time variability and precipitation regime since they were well captured with relatively identical areas by the two used approaches. However, the areal extents of the other three sub-regions identified by the first approach were not coincident with the coverage of their counterpart sub-regions defined by the second approach. Results suggest that the precipitation sub-region identified by the two methods would not be necessarily the same, as the first method which accounts for the variance of the data grouped stations with similar temporal variability while the second one which considers a fixed climatology defined by the average over the period 1990-2014 clusters stations having a similar march of monthly precipitation.
Konieczynski, Pawel; Viapiana, Agnieszka; Lysiuk, Roman; Wesolowski, Marek
2018-03-01
Infusions prepared from medicinal herbs that are rich in flavonoids are very popular herbal remedies in societies of Eastern Europe. Therefore, the content of essential elements together with total flavonoids was analyzed in 65 commercially available samples of herbal drugs originating from Ukraine, Romania, and Belarus. The results showed that metallic elements (in mg kg -1 d.w.) have occurred in the following order: Fe > Mn > Zn > Cu, both for total and water-extractable species. Total flavonoids were determined in the range from 10.0 to 191.8 mg g -1 d.w. Several significant correlations have been found between the analytes, especially among water-extractable Fe with other metals, and total flavonoids and Fe, Zn, and Mn. Analysis of variance has revealed significant differences among studied samples due to their origin from different countries, especially between Belarussian samples and others. Differences owing to belonging to various plant species were also found, as it was noticed in the case of Polygoni aviculare herba in comparison with other botanical plant species. Moreover, multivariate statistical techniques, such as cluster analysis (CA) and principal component analysis (PCA) were used to gather herbal drugs based on similarity of chemical composition. CA grouped the samples into clusters with similar level of elements and total flavonoid contents, and PCA has indicated Hyperici herba, Tiliae flores, and Crataegi fructus as herbal remedies with close concentration of studied elements and flavonoids.
Abdul-Hamid, Nur Ashikin; Mediani, Ahmed; Maulidiani, M; Shadid, Khalid; Ismail, Intan Safinar; Abas, Faridah; Lajis, Nordin H
2018-04-01
The aim of this study was to examine the variation in metabolite constituents of five commercial varieties of date fruits; Ajwa, Safawi and Ambar which originated from Madinah, the Iranian Bam and Tunisian Deglet Noor. The differences of metabolome were investigated using proton nuclear magnetic resonance ( 1 H NMR) spectroscopy combined with multivariate data analysis (MVDA). Principal Component Analysis (PCA) revealed clear separation between the date varieties. The Tunisian Deglet Noor demonstrated distinct cluster from the rest of the palm date samples based on the metabolite composition as shown by the pattern observed in Hierarchical Clustering Analysis (HCA) and PCA. Deglet Noor exhibited a significant higher level of sucrose (δ 5.40) and fructose (δ 4.16) in comparison with the other four varieties which can be associated with the distinctive sweet taste of this variety. Dates originated from Madinah and Tunisia exhibited a contrast manner in the amount of xylose and moisture content. These two aspects may contribute towards the soft texture of Tunisian dates. All Madinah dates were found to contain phenolic compounds which were well established as great antioxidant and anti-inflammatory agent. Ajwa dates exerted greater effect in inhibiting the generation of nitric oxide (NO) from the stimulated RAW264.7 cells at 95.37% inhibition. Succinic acid was suggested to have the most significant correlation with the trend of NO inhibitory shown by the selected date palm varieties.
Yang, Liping; Mei, Kun; Liu, Xingmei; Wu, Laosheng; Zhang, Minghua; Xu, Jianming; Wang, Fan
2013-08-01
Water quality degradation in river systems has caused great concerns all over the world. Identifying the spatial distribution and sources of water pollutants is the very first step for efficient water quality management. A set of water samples collected bimonthly at 12 monitoring sites in 2009 and 2010 were analyzed to determine the spatial distribution of critical parameters and to apportion the sources of pollutants in Wen-Rui-Tang (WRT) river watershed, near the East China Sea. The 12 monitoring sites were divided into three administrative zones of urban, suburban, and rural zones considering differences in land use and population density. Multivariate statistical methods [one-way analysis of variance, principal component analysis (PCA), and absolute principal component score-multiple linear regression (APCS-MLR) methods] were used to investigate the spatial distribution of water quality and to apportion the pollution sources. Results showed that most water quality parameters had no significant difference between the urban and suburban zones, whereas these two zones showed worse water quality than the rural zone. Based on PCA and APCS-MLR analysis, urban domestic sewage and commercial/service pollution, suburban domestic sewage along with fluorine point source pollution, and agricultural nonpoint source pollution with rural domestic sewage pollution were identified to the main pollution sources in urban, suburban, and rural zones, respectively. Understanding the water pollution characteristics of different administrative zones could put insights into effective water management policy-making especially in the area across various administrative zones.
A feasibility study on age-related factors of wrist pulse using principal component analysis.
Jang-Han Bae; Young Ju Jeon; Sanghun Lee; Jaeuk U Kim
2016-08-01
Various analysis methods for examining wrist pulse characteristics are needed for accurate pulse diagnosis. In this feasibility study, principal component analysis (PCA) was performed to observe age-related factors of wrist pulse from various analysis parameters. Forty subjects in the age group of 20s and 40s were participated, and their wrist pulse signal and respiration signal were acquired with the pulse tonometric device. After pre-processing of the signals, twenty analysis parameters which have been regarded as values reflecting pulse characteristics were calculated and PCA was performed. As a results, we could reduce complex parameters to lower dimension and age-related factors of wrist pulse were observed by combining-new analysis parameter derived from PCA. These results demonstrate that PCA can be useful tool for analyzing wrist pulse signal.
NASA Astrophysics Data System (ADS)
Halassa, Younes; Zeddouri, Aziez; Mouhamadou, Ould Babasy; Kechiched, Rabah; Benhamida, Abdeldjebbar Slimane
2018-05-01
The aquifer system in The Algero-Tunisian border and Chotts region is mainly composed of two aquifers: The first is the Complex Terminal (CT) and the second is the Intercalary aquifer (CI). This study aims the identification and spatial evolution of factors that controlling the water quality in the Complex Terminal aquifer (CT) in the Chotts region (Oued Souf region - Southeastern of Algeria). The concentration of major elements, temperature, pH and salinity were monitored during 2015 in 34 wells from the CT aquifer. The geological, geophysical, hydrogeological and hydrochemical methods were applied in order to carried out a model for the investigated aquifer system and to characterize the hydrogeological and the geochemical behavior, as well as the geometrical and the lithological configuration. Multivariate statistical analyses such as Principal Component Analysis (PCA) were also used for the treatment of several data. Results show that the salinity follows the same regional distribution of Chloride, Sodium, Magnesium, Sulfate and Calcium. Note that the salinity shows low contents in the upstream part of investigated region suggesting restricted dissolution of salts. Hydro-chemical study and saturation indexes highlight the dominance of the dissolution and the precipitation of calcite, dolomite, anhydrite, gypsum and halite. The PCA analysis indicates that Na+, Cl-, Ca2+, Mg2+, SO42- and K+ variables that influence the water mineralization.
Souza, Isys Mascarenhas; Funch, Ligia Silveira; de Queiroz, Luciano Paganucci
2014-01-01
Abstract Hymenaea is a genus of the Resin-producing Clade of the tribe Detarieae (Leguminosae: Caesalpinioideae) with 14 species. Hymenaea courbaril is the most widespread species of the genus, ranging from southern Mexico to southeastern Brazil. As currently circumscribed, Hymenaea courbaril is a polytypic species with six varieties: var. altissima, var. courbaril, var. longifolia, var. stilbocarpa, var. subsessilis, and var. villosa. These varieties are distinguishable mostly by traits related to leaflet shape and indumentation, and calyx indumentation. We carried out morphometric analyses of 14 quantitative (continuous) leaf characters in order to assess the taxonomy of Hymenaea courbaril under the Unified Species Concept framework. Cluster analysis used the Unweighted Pair Group Method with Arithmetic Mean (UPGMA) based on Bray-Curtis dissimilarity matrices. Principal Component Analyses (PCA) were carried out based on the same morphometric matrix. Two sets of Analyses of Similarity and Non Parametric Multivariate Analysis of Variance were carried out to evaluate statistical support (1) for the major groups recovered using UPGMA and PCA, and (2) for the varieties. All analyses recovered three major groups coincident with (1) var. altissima, (2) var. longifolia, and (3) all other varieties. These results, together with geographical and habitat information, were taken as evidence of three separate metapopulation lineages recognized here as three distinct species. Nomenclatural adjustments, including reclassifying formerly misapplied types, are proposed. PMID:25009440
Souza, Isys Mascarenhas; Funch, Ligia Silveira; de Queiroz, Luciano Paganucci
2014-01-01
Hymenaea is a genus of the Resin-producing Clade of the tribe Detarieae (Leguminosae: Caesalpinioideae) with 14 species. Hymenaea courbaril is the most widespread species of the genus, ranging from southern Mexico to southeastern Brazil. As currently circumscribed, Hymenaea courbaril is a polytypic species with six varieties: var. altissima, var. courbaril, var. longifolia, var. stilbocarpa, var. subsessilis, and var. villosa. These varieties are distinguishable mostly by traits related to leaflet shape and indumentation, and calyx indumentation. We carried out morphometric analyses of 14 quantitative (continuous) leaf characters in order to assess the taxonomy of Hymenaea courbaril under the Unified Species Concept framework. Cluster analysis used the Unweighted Pair Group Method with Arithmetic Mean (UPGMA) based on Bray-Curtis dissimilarity matrices. Principal Component Analyses (PCA) were carried out based on the same morphometric matrix. Two sets of Analyses of Similarity and Non Parametric Multivariate Analysis of Variance were carried out to evaluate statistical support (1) for the major groups recovered using UPGMA and PCA, and (2) for the varieties. All analyses recovered three major groups coincident with (1) var. altissima, (2) var. longifolia, and (3) all other varieties. These results, together with geographical and habitat information, were taken as evidence of three separate metapopulation lineages recognized here as three distinct species. Nomenclatural adjustments, including reclassifying formerly misapplied types, are proposed.
Tracking Equilibrium and Nonequilibrium Shifts in Data with TREND.
Xu, Jia; Van Doren, Steven R
2017-01-24
Principal component analysis (PCA) discovers patterns in multivariate data that include spectra, microscopy, and other biophysical measurements. Direct application of PCA to crowded spectra, images, and movies (without selecting peaks or features) was shown recently to identify their equilibrium or temporal changes. To enable the community to utilize these capabilities with a wide range of measurements, we have developed multiplatform software named TREND to Track Equilibrium and Nonequilibrium population shifts among two-dimensional Data frames. TREND can also carry this out by independent component analysis. We highlight a few examples of finding concurrent processes. TREND extracts dual phases of binding to two sites directly from the NMR spectra of the titrations. In a cardiac movie from magnetic resonance imaging, TREND resolves principal components (PCs) representing breathing and the cardiac cycle. TREND can also reconstruct the series of measurements from selected PCs, as illustrated for a biphasic, NMR-detected titration and the cardiac MRI movie. Fidelity of reconstruction of series of NMR spectra or images requires more PCs than needed to plot the largest population shifts. TREND reads spectra from many spectroscopies in the most common formats (JCAMP-DX and NMR) and multiple movie formats. The TREND package thus provides convenient tools to resolve the processes recorded by diverse biophysical methods. Copyright © 2017 Biophysical Society. Published by Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Baccar, D.; Söffker, D.
2017-11-01
Acoustic Emission (AE) is a suitable method to monitor the health of composite structures in real-time. However, AE-based failure mode identification and classification are still complex to apply due to the fact that AE waves are generally released simultaneously from all AE-emitting damage sources. Hence, the use of advanced signal processing techniques in combination with pattern recognition approaches is required. In this paper, AE signals generated from laminated carbon fiber reinforced polymer (CFRP) subjected to indentation test are examined and analyzed. A new pattern recognition approach involving a number of processing steps able to be implemented in real-time is developed. Unlike common classification approaches, here only CWT coefficients are extracted as relevant features. Firstly, Continuous Wavelet Transform (CWT) is applied to the AE signals. Furthermore, dimensionality reduction process using Principal Component Analysis (PCA) is carried out on the coefficient matrices. The PCA-based feature distribution is analyzed using Kernel Density Estimation (KDE) allowing the determination of a specific pattern for each fault-specific AE signal. Moreover, waveform and frequency content of AE signals are in depth examined and compared with fundamental assumptions reported in this field. A correlation between the identified patterns and failure modes is achieved. The introduced method improves the damage classification and can be used as a non-destructive evaluation tool.
de Cobelli, Ottavio; Terracciano, Daniela; Tagliabue, Elena; Raimondi, Sara; Galasso, Giacomo; Cioffi, Antonio; Cordima, Giovanni; Musi, Gennaro; Damiano, Rocco; Cantiello, Francesco; Detti, Serena; Victor Matei, Deliu; Bottero, Danilo; Renne, Giuseppe; Ferro, Matteo
2015-05-01
Obesity is associated with an increased risk of high-grade prostate cancer (PCa). The effect of body mass index (BMI) as a predictor of progression in men with low-risk PCa has been only poorly assessed. In this study, we evaluated the association of BMI with progression in patients with low-risk PCa who met the inclusion criteria for the active surveillance (AS) protocol. We assessed 311 patients who underwent radical prostatectomy and were eligible for AS according to the following criteria: clinical stage T2a or less, prostate-specific antigen level < 10 ng/ml, 2 or fewer cores involved with cancer, Gleason score ≤ 6 grade, and prostate-specific antigen density < 0.2 ng/ml/cc. Reclassification was defined as upstaged (pathological stage > pT2) and upgraded (Gleason score ≥ 7; primary Gleason pattern 4) disease. Seminal vesicle invasion, positive lymph nodes, and tumor volume ≥ 0.5 ml were also recorded. We found that high BMI was significantly associated with upgrading, upstaging, and seminal vesicle invasion, whereas it was not associated with positive lymph nodes or large tumor volume. At multivariate analysis, 1 unit increase of BMI significantly increased the risk of upgrading, upstaging, seminal vesicle invasion, and any outcome by 21%, 23%, 27%, and 20%, respectively. The differences between areas under the receiver operating characteristics curves comparing models with and without BMI were statistically significant for upgrading (P = 0.0002), upstaging (P = 0.0007), and any outcome (P = 0.0001). BMI should be a selection criterion for inclusion of patients with low-risk PCa in AS programs. Our results support the idea that obesity is associated with worse prognosis and suggest that a close AS program is an appropriate treatment option for obese subjects. Copyright © 2015 Elsevier Inc. All rights reserved.
Han, Yuyan; Xu, Junfeng; Kim, Jeri; Wu, Xifeng; Gu, Jian
2017-08-01
Global DNA methylation may affect chromosome structure and genomic stability and is involved in carcinogenesis. In this study, we aimed to investigate whether methylation of pericentromeric repeat NBL2 and subtelomeric repeat D4Z4 in peripheral blood was associated with the aggressiveness of prostate cancer (PCa). We measured the methylation status of different CpG sites of NBL2 and D4Z4 in 795 PCa patients and compared their methylation levels among patients with different Gleason Score at diagnosis. We then analyzed the association of the NBL2 and D4Z4 methylation with the risk of biochemical recurrence (BCR) in patients receiving radical prostatectomy or radiotherapy using a multivariate Cox proportional hazards model. In addition, we used the Kaplan-Meier survival function and log-rank tests to assess BCR-free survival associated with D4Z4 methylation. There was no significant difference in methylation level of NBL2 and D4Z4 between clinically defined aggressive and non-aggressive PCa at diagnosis. However, the methylation of D4Z4 was associated with BCR, while the methylation of NBL2 was not. In tertile analysis, patients in the highest tertile of D4Z4 methylation had an increased risk of BCR (HR = 2.17, 95% CI 1.36-3.48) compared to patients in the lower tertiles after adjustment of age, body mass index, smoking status, pack year, D'Amico risk groups and treatments. Among the four CpG sites in this region, the association was mostly attributable to the methylation of the second CpG site of D4Z4. These data suggest that higher methylation in D4Z4 was associated with worse prognosis of localized PCa patients. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Chen, Jiabo; Li, Fayun; Fan, Zhiping; Wang, Yanjie
2016-01-01
Source apportionment of river water pollution is critical in water resource management and aquatic conservation. Comprehensive application of various GIS-based multivariate statistical methods was performed to analyze datasets (2009–2011) on water quality in the Liao River system (China). Cluster analysis (CA) classified the 12 months of the year into three groups (May–October, February–April and November–January) and the 66 sampling sites into three groups (groups A, B and C) based on similarities in water quality characteristics. Discriminant analysis (DA) determined that temperature, dissolved oxygen (DO), pH, chemical oxygen demand (CODMn), 5-day biochemical oxygen demand (BOD5), NH4+–N, total phosphorus (TP) and volatile phenols were significant variables affecting temporal variations, with 81.2% correct assignments. Principal component analysis (PCA) and positive matrix factorization (PMF) identified eight potential pollution factors for each part of the data structure, explaining more than 61% of the total variance. Oxygen-consuming organics from cropland and woodland runoff were the main latent pollution factor for group A. For group B, the main pollutants were oxygen-consuming organics, oil, nutrients and fecal matter. For group C, the evaluated pollutants primarily included oxygen-consuming organics, oil and toxic organics. PMID:27775679
Goode, C; LeRoy, J; Allen, D G
2007-01-01
This study reports on a multivariate analysis of the moving bed biofilm reactor (MBBR) wastewater treatment system at a Canadian pulp mill. The modelling approach involved a data overview by principal component analysis (PCA) followed by partial least squares (PLS) modelling with the objective of explaining and predicting changes in the BOD output of the reactor. Over two years of data with 87 process measurements were used to build the models. Variables were collected from the MBBR control scheme as well as upstream in the bleach plant and in digestion. To account for process dynamics, a variable lagging approach was used for variables with significant temporal correlations. It was found that wood type pulped at the mill was a significant variable governing reactor performance. Other important variables included flow parameters, faults in the temperature or pH control of the reactor, and some potential indirect indicators of biomass activity (residual nitrogen and pH out). The most predictive model was found to have an RMSEP value of 606 kgBOD/d, representing a 14.5% average error. This was a good fit, given the measurement error of the BOD test. Overall, the statistical approach was effective in describing and predicting MBBR treatment performance.
Nutritional Evaluation of Non-Conventional Vegetables in Brazil.
Silva, Luis Felipe Lima E; Souza, Douglas C DE; Resende, Luciane V; Nassur, Rita DE Cássia M R; Samartini, Carolina Q; Gonçalves, Wilson M
2018-01-01
The objective of this study was to characterize the nutritional compounds of interest present in vegetables known as non-conventional, in Brazil. The following evaluations were carried out: antioxidant activity, phenolic compounds, vitamin C, calories, carbohydrates, humidity, lipids, proteins, fiber, acidity and quantification of minerals (P, K, Ca, Mg, S, Cu, Fe, Mn, Zn and B). The species studied were Amaranthus hybridus L., Amaranthus viridis L., Basella alba L., Eryngium campestre L., Hibiscus sabdariffa L., Lactuca canadensis L., Rumex acetosa L., Stachys byzantina K. Koch, Tropaeolum majus L. and Xanthosoma sagittifolium L. Representative samples of plant structures of interest were harvested from each species suitable for human consumption such as leaves, flowers and flower buds. The results were submitted to multivariate analysis - principal components analysis (PCA). All the species present nutritional compounds of interest in different levels among the evaluated structures.
Kamal, Ghulam Mustafa; Wang, Xiaohua; Bin Yuan; Wang, Jie; Sun, Peng; Zhang, Xu; Liu, Maili
2016-09-01
Soy sauce a well known seasoning all over the world, especially in Asia, is available in global market in a wide range of types based on its purpose and the processing methods. Its composition varies with respect to the fermentation processes and addition of additives, preservatives and flavor enhancers. A comprehensive (1)H NMR based study regarding the metabonomic variations of soy sauce to differentiate among different types of soy sauce available on the global market has been limited due to the complexity of the mixture. In present study, (13)C NMR spectroscopy coupled with multivariate statistical data analysis like principle component analysis (PCA), and orthogonal partial least square-discriminant analysis (OPLS-DA) was applied to investigate metabonomic variations among different types of soy sauce, namely super light, super dark, red cooking and mushroom soy sauce. The main additives in soy sauce like glutamate, sucrose and glucose were easily distinguished and quantified using (13)C NMR spectroscopy which were otherwise difficult to be assigned and quantified due to serious signal overlaps in (1)H NMR spectra. The significantly higher concentration of sucrose in dark, red cooking and mushroom flavored soy sauce can directly be linked to the addition of caramel in soy sauce. Similarly, significantly higher level of glutamate in super light as compared to super dark and mushroom flavored soy sauce may come from the addition of monosodium glutamate. The study highlights the potentiality of (13)C NMR based metabonomics coupled with multivariate statistical data analysis in differentiating between the types of soy sauce on the basis of level of additives, raw materials and fermentation procedures. Copyright © 2016 Elsevier B.V. All rights reserved.
Giménez-Forcada, Elena; Vega-Alegre, Marisol; Timón-Sánchez, Susana
2017-09-01
Naturally occurring arsenic in groundwater exceeding the limit for potability has been reported along the southern edge of the Cenozoic Duero Basin (CDB) near its contact with the Spanish Central System (SCS). In this area, spatial variability of arsenic is high, peaking at 241μg/L. Forty-seven percent of samples collected contained arsenic above the maximum allowable concentration for drinking water (10μg/L). Correlations of As with other hydrochemical variables were investigated using multivariate statistical analysis (Hierarchical Cluster Analysis, HCA and Principal Component Analysis, PCA). It was found that As, V, Cr and pH are closely related and that there were also close correlations with temperature and Na + . The highest concentrations of arsenic and other associated Potentially Toxic Geogenic Trace Elements (PTGTE) are linked to alkaline NaHCO 3 waters (pH≈9), moderate oxic conditions and temperatures of around 18°C-19°C. The most plausible hypothesis to explain the high arsenic concentrations is the contribution of deeper regional flows with a significant hydrothermal component (cold-hydrothermal waters), flowing through faults in the basement rock. Water mixing and water-rock interactions occur both in the fissured aquifer media (igneous and metasedimentary bedrock) and in the sedimentary environment of the CDB, where agricultural pollution phenomena are also active. A combination of multivariate statistical tools and hydrochemical analysis enabled the distribution pattern of dissolved As and other PTGTE in groundwaters in the study area to be interpreted, and their most likely origin to be established. This methodology could be applied to other sedimentary areas with similar characteristics and problems. Copyright © 2017 Elsevier B.V. All rights reserved.
Mostafa, Hamza; Amin, Arwa M; Teh, Chin-Hoe; Murugaiyah, Vikneswaran; Arif, Nor Hayati; Ibrahim, Baharudin
2016-12-01
Alcohol-dependence (AD) is a ravaging public health and social problem. AD diagnosis depends on questionnaires and some biomarkers, which lack specificity and sensitivity, however, often leading to less precise diagnosis, as well as delaying treatment. This represents a great burden, not only on AD individuals but also on their families. Metabolomics using nuclear magnetic resonance spectroscopy (NMR) can provide novel techniques for the identification of novel biomarkers of AD. These putative biomarkers can facilitate early diagnosis of AD. To identify novel biomarkers able to discriminate between alcohol-dependent, non-AD alcohol drinkers and controls using metabolomics. Urine samples were collected from 30 alcohol-dependent persons who did not yet start AD treatment, 54 social drinkers and 60 controls, who were then analysed using NMR. Data analysis was done using multivariate analysis including principal component analysis (PCA) and orthogonal partial least square-discriminate analysis (OPLS-DA), followed by univariate and multivariate logistic regression to develop the discriminatory model. The reproducibility was done using intraclass correlation coefficient (ICC). The OPLS-DA revealed significant discrimination between AD and other groups with sensitivity 86.21%, specificity 97.25% and accuracy 94.93%. Six biomarkers were significantly associated with AD in the multivariate logistic regression model. These biomarkers were cis-aconitic acid, citric acid, alanine, lactic acid, 1,2-propanediol and 2-hydroxyisovaleric acid. The reproducibility of all biomarkers was excellent (0.81-1.0). This study revealed that metabolomics analysis of urine using NMR identified AD novel biomarkers which can discriminate AD from social drinkers and controls with high accuracy. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Q-mode versus R-mode principal component analysis for linear discriminant analysis (LDA)
NASA Astrophysics Data System (ADS)
Lee, Loong Chuen; Liong, Choong-Yeun; Jemain, Abdul Aziz
2017-05-01
Many literature apply Principal Component Analysis (PCA) as either preliminary visualization or variable con-struction methods or both. Focus of PCA can be on the samples (R-mode PCA) or variables (Q-mode PCA). Traditionally, R-mode PCA has been the usual approach to reduce high-dimensionality data before the application of Linear Discriminant Analysis (LDA), to solve classification problems. Output from PCA composed of two new matrices known as loadings and scores matrices. Each matrix can then be used to produce a plot, i.e. loadings plot aids identification of important variables whereas scores plot presents spatial distribution of samples on new axes that are also known as Principal Components (PCs). Fundamentally, the scores matrix always be the input variables for building classification model. A recent paper uses Q-mode PCA but the focus of analysis was not on the variables but instead on the samples. As a result, the authors have exchanged the use of both loadings and scores plots in which clustering of samples was studied using loadings plot whereas scores plot has been used to identify important manifest variables. Therefore, the aim of this study is to statistically validate the proposed practice. Evaluation is based on performance of external error obtained from LDA models according to number of PCs. On top of that, bootstrapping was also conducted to evaluate the external error of each of the LDA models. Results show that LDA models produced by PCs from R-mode PCA give logical performance and the matched external error are also unbiased whereas the ones produced with Q-mode PCA show the opposites. With that, we concluded that PCs produced from Q-mode is not statistically stable and thus should not be applied to problems of classifying samples, but variables. We hope this paper will provide some insights on the disputable issues.
Bhattacharya, Sayanti; Granger, Christopher B; Craig, Damian; Haynes, Carol; Bain, James; Stevens, Robert D; Hauser, Elizabeth R; Newgard, Christopher B; Kraus, William E; Newby, L Kristin; Shah, Svati H
2014-01-01
To validate independent associations between branched-chain amino acids (BCAA) and other metabolites with coronary artery disease (CAD). We conducted mass-spectrometry-based profiling of 63 metabolites in fasting plasma from 1983 sequential patients undergoing cardiac catheterization. Significant CAD was defined as CADindex ≥ 32 (at least one vessel with ≥ 95% stenosis; N = 995) and no CAD as CADindex ≤ 23 and no previous cardiac events (N = 610). Individuals (N = 378) with CAD severity between these extremes were excluded. Principal components analysis (PCA) reduced large numbers of correlated metabolites into uncorrelated factors. Association between metabolite factors and significant CAD vs. no CAD was tested using logistic regression; and between metabolite factors and severity of CAD was tested using linear regression. Of twelve PCA-derived metabolite factors, two were associated with CAD in multivariable models: factor 10, composed of BCAA (adjusted odds ratio, OR, 1.20; 95% CI 1.05-1.35, p = 0.005) and factor 7, composed of short-chain acylcarnitines, which include byproducts of BCAA metabolism (adjusted OR 1.30; 95% CI 1.14-1.48, p = 0.001). After adjustment for glycated albumin (marker of insulin resistance [IR]) both factors 7 (p = 0.0001) and 10 (p = 0.004) remained associated with CAD. Severity of CAD as a continuous variable (including patients with non-obstructive disease) was associated with metabolite factors 2, 3, 6, 7, 8 and 9; only factors 7 and 10 were associated in multivariable models. We validated the independent association of metabolites involved in BCAA metabolism with CAD extremes. These metabolites may be reporting on novel mechanisms of CAD pathogenesis that are independent of IR and diabetes. Copyright © 2013. Published by Elsevier Ireland Ltd.
Yılmaz, Hasan; Yavuz, Ufuk; Üstüner, Murat; Çiftçi, Seyfettin; Yaşar, Hikmet; Müezzinoğlu, Bahar; Uslubaş, Ali Kemal; Dillioğlugil, Özdal
2017-01-01
Objective Only a few papers in the literature aimed to evaluate biopsy core lengths. Additionally, studies evaluated the core length with different approaches. We aimed to determine whether prostate cancer (PCa) detection is affected from core lengths according to three different approaches in a large standard cohort and compare our cut-off values with the published cut-offs. Material and methods We retrospectively analyzed 1,523 initial consecutive transrectal ultrasound-guided 12-core prostate biopsies. Biopsies were evaluated with respect to total core length (total length of each patients’ core) average core length (total core length divided by total number of cores in each patient), and mean core length (mean length of all cores pooled), and compared our cut-off values with the published cut-offs. The prostate volumes were categorized into four groups (<30, 30–59.99, 60–119.99, ≥120 cm3) and PCa detection rates in these categories were examined. Results PCa was found in 41.5% patients. There was no difference between benign and malignant mean core lengths of the pooled cores (p>0.05). Total core length and average core length were not significantly associated with PCa in multivariate logistic regression analyses (p>0.05). The core lengths (mean, average and total core lengths) increased (p<0.001) and PCa rates decreased (p<0.001) steadily with increasing prostate volume categories. PCa percentages decreased in all categories above the utilized cut-offs for mean (p>0.05), average (p<0.05), and total core lengths (p>0.05). Conclusion There was no difference between mean core lengths of benign and malignant cores. Total core length and average core length were not significantly associated with PCa. Contrary to the cut-offs used for mean and average core lengths in the published studies, PCa rates decrease as these core lengths increase. Larger studies are necessary for the determination and acceptance of accurate cut-offs. PMID:28861301
NASA Astrophysics Data System (ADS)
Chen, Zhe; Parker, B. J.; Feng, D. D.; Fulton, R.
2004-10-01
In this paper, we compare various temporal analysis schemes applied to dynamic PET for improved quantification, image quality and temporal compression purposes. We compare an optimal sampling schedule (OSS) design, principal component analysis (PCA) applied in the image domain, and principal component analysis applied in the sinogram domain; for region-of-interest quantification, sinogram-domain PCA is combined with the Huesman algorithm to quantify from the sinograms directly without requiring reconstruction of all PCA channels. Using a simulated phantom FDG brain study and three clinical studies, we evaluate the fidelity of the compressed data for estimation of local cerebral metabolic rate of glucose by a four-compartment model. Our results show that using a noise-normalized PCA in the sinogram domain gives similar compression ratio and quantitative accuracy to OSS, but with substantially better precision. These results indicate that sinogram-domain PCA for dynamic PET can be a useful preprocessing stage for PET compression and quantification applications.
Poniah, Prevathe; Mohd Zain, Shamsul; Abdul Razack, Azad Hassan; Kuppusamy, Shanggar; Karuppayah, Shankar; Sian Eng, Hooi; Mohamed, Zahurin
2017-09-01
Two key issues in prostate cancer (PCa) that demand attention currently are the need for a more precise and minimally invasive screening test owing to the inaccuracy of prostate-specific antigen and differential diagnosis to distinguish advanced vs. indolent cancers. This continues to pose a tremendous challenge in diagnosis and prognosis of PCa and could potentially lead to overdiagnosis and overtreatment complications. Copy number variations (CNVs) in the human genome have been linked to various carcinomas including PCa. Detection of these variants may improve clinical treatment as well as an understanding of the pathobiology underlying this complex disease. To this end, we undertook a pilot genome-wide CNV analysis approach in 36 subjects (18 patients with high-grade PCa and 18 controls that were matched by age and ethnicity) in search of more accurate biomarkers that could potentially explain susceptibility toward high-grade PCa. We conducted this study using the array comparative genomic hybridization technique. Array results were validated in 92 independent samples (46 high-grade PCa, 23 benign prostatic hyperplasia, and 23 healthy controls) using polymerase chain reaction-based copy number counting method. A total of 314 CNV regions were found to be unique to PCa subjects in this cohort (P<0.05). A log 2 ratio-based copy number analysis revealed 5 putative rare or novel CNV loci or both associated with susceptibility to PCa. The CNV gain regions were 1q21.3, 15q15, 7p12.1, and a novel CNV in PCa 12q23.1, harboring ARNT, THBS1, SLC5A8, and DDC genes that are crucial in the p53 and cancer pathways. A CNV loss and deletion event was observed at 8p11.21, which contains the SFRP1 gene from the Wnt signaling pathway. Cross-comparison analysis with genes associated to PCa revealed significant CNVs involved in biological processes that elicit cancer pathogenesis via cytokine production and endothelial cell proliferation. In conclusion, we postulated that the CNVs identified in this study could provide an insight into the development of advanced PCa. Copyright © 2017 Elsevier Inc. All rights reserved.
Monakhova, Yulia B; Diehl, Bernd W K; Fareed, Jawed
2018-02-05
High resolution (600MHz) nuclear magnetic resonance (NMR) spectroscopy is used to distinguish heparin and low-molecular weight heparins (LMWHs) produced from porcine, bovine and ovine mucosal tissues as well as their blends. For multivariate analysis several statistical methods such as principal component analysis (PCA), factor discriminant analysis (FDA), partial least squares - discriminant analysis (PLS-DA), linear discriminant analysis (LDA) were utilized for the modeling of NMR data of more than 100 authentic samples. Heparin and LMWH samples from the independent test set (n=15) were 100% correctly classified according to its animal origin. Moreover, by using 1 H NMR coupled with chemometrics and several batches of bovine heparins from two producers were differentiated. Thus, NMR spectroscopy combined with chemometrics is an efficient tool for simultaneous identification of animal origin and process based manufacturing difference in heparin products. Copyright © 2017 Elsevier B.V. All rights reserved.
Identification of offal adulteration in beef by laser induced breakdown spectroscopy (LIBS).
Velioglu, Hasan Murat; Sezer, Banu; Bilge, Gonca; Baytur, Süleyman Efe; Boyaci, Ismail Hakki
2018-04-01
Minced meat is the major ingredient in sausages, beef burgers, and similar products; and thus it is the main product subjected to adulteration with meat offal. Determination of this kind of meat adulteration is crucial due to religious, economic and ethical concerns. The aim of the present study is to discriminate the beef meat and offal samples by using laser induced breakdown spectroscopy (LIBS). To this end, LIBS and multivariate data analysis were used to discriminate pure beef and offal samples qualitatively and to determine the offal mixture adulteration quantitatively. In this analysis, meat samples were frozen and LIBS analysis were performed. The results indicate that by using principal component analysis (PCA), discrimination of pure offal and offal mixture adulterated beef samples can be achieved successfully. Besides, adulteration ratio can be determined using partial least square analysis method (PLS) with 0.947 coefficient of determination (R 2 ) and 3.8% of limit of detection (LOD) values for offal mixture adulterated beef samples. Copyright © 2017 Elsevier Ltd. All rights reserved.
Moncayo, S; Manzoor, S; Rosales, J D; Anzano, J; Caceres, J O
2017-10-01
The present work focuses on the development of a fast and cost effective method based on Laser Induced Breakdown Spectroscopy (LIBS) to the quality control, traceability and detection of adulteration in milk. Two adulteration cases have been studied; a qualitative analysis for the discrimination between different milk blends and quantification of melamine in adulterated toddler milk powder. Principal Component Analysis (PCA) and neural networks (NN) have been used to analyze LIBS spectra obtaining a correct classification rate of 98% with a 100% of robustness. For the quantification of melamine, two methodologies have been developed; univariate analysis using CN emission band and multivariate calibration NN model obtaining correlation coefficient (R 2 ) values of 0.982 and 0.999 respectively. The results of the use of LIBS technique coupled with chemometric analysis are discussed in terms of its potential use in the food industry to perform the quality control of this dairy product. Copyright © 2017 Elsevier Ltd. All rights reserved.
Tang, Jingyuan; Xu, Lingyan; Xu, Haoxiang; Li, Ran; Han, Peng; Yang, Haiwei
2017-01-01
Previous studies have investigated the association between NAT2 polymorphism and the risk of prostate cancer (PCa). However, the findings from these studies remained inconsistent. Hence, we performed a meta-analysis to provide a more reliable conclusion about such associations. In the present meta-analysis, 13 independent case-control studies were included with a total of 14,469 PCa patients and 10,689 controls. All relevant studies published were searched in the databates PubMed, EMBASE, and Web of Science, till March 1st, 2017. We used the pooled odds ratios (ORs) with 95% confidence intervals (CIs) to evaluate the strength of the association between NAT2*4 allele and susceptibility to PCa. Subgroup analysis was carried out by ethnicity, source of controls and genotyping method. What's more, we also performed trial sequential analysis (TSA) to reduce the risk of type I error and evaluate whether the evidence of the results was firm. Firstly, our results indicated that NAT2*4 allele was not associated with PCa susceptibility (OR = 1.00, 95% CI= 0.95–1.05; P = 0.100). However, after excluding two studies for its heterogeneity and publication bias, no significant relationship was also detected between NAT2*4 allele and the increased risk of PCa, in fixed-effect model (OR = 0.99, 95% CI= 0.94–1.04; P = 0.451). Meanwhile, no significant increased risk of PCa was found in the subgroup analyses by ethnicity, source of controls and genotyping method. Moreover, TSA demonstrated that such association was confirmed in the present study. Therefore, this meta-analysis suggested that no significant association between NAT2 polymorphism and the risk of PCa was found. PMID:28915684
Dou, MengMeng; Zhou, XueLiang; Fan, ZhiRui; Ding, XianFei; Li, LiFeng; Wang, ShuLing; Xue, Wenhua; Wang, Hui; Suo, Zhenhe; Deng, XiaoMing
2018-01-01
Retinoic acid receptor beta (RAR beta) is a retinoic acid receptor gene that has been shown to play key roles during multiple cancer processes, including cell proliferation, apoptosis, migration and invasion. Numerous studies have found that methylation of the RAR beta promoter contributed to the occurrence and development of malignant tumors. However, the connection between RAR beta promoter methylation and prostate cancer (PCa) remains unknown. This meta-analysis evaluated the clinical significance of RAR beta promoter methylation in PCa. We searched all published records relevant to RAR beta and PCa in a series of databases, including PubMed, Embase, Cochrane Library, ISI Web of Science and CNKI. The rates of RAR beta promoter methylation in the PCa and control groups (including benign prostatic hyperplasia and normal prostate tissues) were summarized. In addition, we evaluated the source region of available samples and the methods used to detect methylation. To compare the incidence and variation in RAR beta promoter methylation in PCa and non-PCa tissues, the odds ratio (OR) and 95% confidence interval (CI) were calculated accordingly. All the data were analyzed with the statistical software STATA 12.0. Based on the inclusion and exclusion criteria, 15 articles assessing 1,339 samples were further analyzed. These data showed that the RAR beta promoter methylation rates in PCa tissues were significantly higher than the rates in the non-PCa group (OR=21.65, 95% CI: 9.27-50.57). Subgroup analysis according to the source region of samples showed that heterogeneity in Asia was small (I2=0.0%, P=0.430). Additional subgroup analysis based on the method used to detect RAR beta promoter methylation showed that the heterogeneity detected by MSP (methylation-specific PCR) was relatively small (I2=11.3%, P=0.343). Although studies reported different rates for RAR beta promoter methylation in PCa tissues, the total analysis demonstrated that RAR beta promoter methylation may be correlated with PCa carcinogenesis and that the RAR beta gene is particularly susceptible. Additional studies with sufficient data are essential to further evaluate the clinical features and prognostic utility of RAR beta promoter methylation in PCa. © 2018 The Author(s). Published by S. Karger AG, Basel.
Gad, Haidy A; El-Ahmady, Sherweit H; Abou-Shoer, Mohamed I; Al-Azizi, Mohamed M
2013-01-01
Recently, the fields of chemometrics and multivariate analysis have been widely implemented in the quality control of herbal drugs to produce precise results, which is crucial in the field of medicine. Thyme represents an essential medicinal herb that is constantly adulterated due to its resemblance to many other plants with similar organoleptic properties. To establish a simple model for the quality assessment of Thymus species using UV spectroscopy together with known chemometric techniques. The success of this model may also serve as a technique for the quality control of other herbal drugs. The model was constructed using 30 samples of authenticated Thymus vulgaris and challenged with 20 samples of different botanical origins. The methanolic extracts of all samples were assessed using UV spectroscopy together with chemometric techniques: principal component analysis (PCA), soft independent modeling of class analogy (SIMCA) and hierarchical cluster analysis (HCA). The model was able to discriminate T. vulgaris from other Thymus, Satureja, Origanum, Plectranthus and Eriocephalus species, all traded in the Egyptian market as different types of thyme. The model was also able to classify closely related species in clusters using PCA and HCA. The model was finally used to classify 12 commercial thyme varieties into clusters of species incorporated in the model as thyme or non-thyme. The model constructed is highly recommended as a simple and efficient method for distinguishing T. vulgaris from other related species as well as the classification of marketed herbs as thyme or non-thyme. Copyright © 2013 John Wiley & Sons, Ltd.
Wu, Jia; Gong, Guanghua; Cui, Yi; Li, Ruijiang
2016-11-01
To predict pathological response of breast cancer to neoadjuvant chemotherapy (NAC) based on quantitative, multiregion analysis of dynamic contrast enhancement magnetic resonance imaging (DCE-MRI). In this Institutional Review Board-approved study, 35 patients diagnosed with stage II/III breast cancer were retrospectively investigated using 3T DCE-MR images acquired before and after the first cycle of NAC. First, principal component analysis (PCA) was used to reduce the dimensionality of the DCE-MRI data with high temporal resolution. We then partitioned the whole tumor into multiple subregions using k-means clustering based on the PCA-defined eigenmaps. Within each tumor subregion, we extracted four quantitative Haralick texture features based on the gray-level co-occurrence matrix (GLCM). The change in texture features in each tumor subregion between pre- and during-NAC was used to predict pathological complete response after NAC. Three tumor subregions were identified through clustering, each with distinct enhancement characteristics. In univariate analysis, all imaging predictors except one extracted from the tumor subregion associated with fast washout were statistically significant (P < 0.05) after correcting for multiple testing, with area under the receiver operating characteristic (ROC) curve (AUC) or AUCs between 0.75 and 0.80. In multivariate analysis, the proposed imaging predictors achieved an AUC of 0.79 (P = 0.002) in leave-one-out cross-validation. This improved upon conventional imaging predictors such as tumor volume (AUC = 0.53) and texture features based on whole-tumor analysis (AUC = 0.65). The heterogeneity of the tumor subregion associated with fast washout on DCE-MRI predicted pathological response to NAC in breast cancer. J. Magn. Reson. Imaging 2016;44:1107-1115. © 2016 International Society for Magnetic Resonance in Medicine.
Zhang, Yang; Ma, Xin-Ming; Wang, Xiao-Chun; Liu, Ji-Hong; Huang, Bing-Yan; Guo, Xiao-Yang; Xiong, Shu-Ping; La, Gui-Xiao
2017-02-01
Wheat is one of the most important grain crop plants worldwide. Nitrogen (N) is an essential macronutrient for the growth and development of wheat and exerts a marked influence on its metabolites. To investigate the influence of low nitrogen stress on various metabolites of the flag leaf of wheat (Triticum aestivum L.), a metabolomic analysis of two wheat cultivars under different induced nitrogen levels was conducted during two important growth periods based on large-scale untargeted metabolomic analysis using ultra-performance liquid chromatography-quadrupole time-of-flight mass spectrometry (UPLC-QTOF). Multivariate analyses-such as principle components analysis (PCA) and orthogonal partial least square discriminant analysis (OPLS-DA)-were used for data analysis. PCA yielded distinctive clustering information among the samples, classifying the wheat flag samples into two categories: those under normal N treatment and low N treatment. By processing OPLS-DA, eleven secondary metabolites were shown to be responsible for classifying the two groups. The secondary metabolites may be considered potential biomarkers of low nitrogen stress. Chemical analyses showed that most of the identified secondary metabolites were flavonoids and their related derivatives, such as iso-vitexin, iso-orientin and methylisoorientin-2″-O-rhamnoside, etc. This study confirmed the effect of low nitrogen stress on the metabolism of wheat, and revealed that the accumulation of secondary metabolites is a response to abiotic stresses. Meanwhile, we aimed to identify markers which could be used to monitor the nitrogen status of wheat crops, presumably to guide appropriate fertilization regimens. Furthermore, the UPLC-QTOF metabolic platform technology can be used to study metabolomic variations of wheat under abiotic stresses. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Isolation of candidate genes for apomictic development in buffelgrass (Pennisetum ciliare).
Singh, Manjit; Burson, Byron L; Finlayson, Scott A
2007-08-01
Asexual reproduction through seeds, or apomixis, is a process that holds much promise for agricultural advances. However, the molecular mechanisms underlying apomixis are currently poorly understood. To identify genes related to female gametophyte development in apomictic ovaries of buffelgrass (Pennisetum ciliare (L.) Link), Suppression Subtractive Hybridization of ovary cDNA with leaf cDNA was performed. Through macroarray screening of subtracted cDNAs two genes were identified, Pca21 and Pca24, that showed differential expression between apomictic and sexual ovaries. Sequence analysis showed that both Pca21 and Pca24 are novel genes not previously characterized in plants. Pca21 shows homology to two wheat genes that are also expressed during reproductive development. Pca24 has similarity to coiled-coil-helix-coiled-coil-helix (CHCH) domain containing proteins from maize and sugarcane. Northern blot analysis revealed that both of these genes are expressed throughout female gametophyte development in apomictic ovaries. In situ hybridizations localized the transcript of these two genes to the developing embryo sacs in the apomictic ovaries. Based on the expression patterns it was concluded that Pca21 and Pca24 likely play a role during apomictic development in buffelgrass.
Cao, Zipei; Wei, Lijuan; Zhu, Weizhi; Yao, Xuping
2018-03-01
Reduction of cyclin-dependent kinase inhibitor 2A (CDKN2A) (p16 and p14) expression through DNA methylation has been reported in prostate cancer (PCa). This meta-analysis was conducted to assess the difference of p16 and p14 methylation between PCa and different histological types of nonmalignant controls and the correlation of p16 or p14 methylation with clinicopathological features of PCa. According to the preferred reporting items for systematic reviews and meta-analyses (PRISMA) statement criteria, articles were searched in PubMed, Embase, EBSCO, Wanfang, and CNKI databases. The strength of correlation was calculated by the pooled odds ratios (ORs) and their corresponding 95% confidence intervals (95% CIs). Trial sequential analysis (TSA) was used to estimate the required population information for significant results. A total of 20 studies published from 1997 to 2017 were identified in this meta-analysis, including 1140 PCa patients and 530 cases without cancer. Only p16 methylation in PCa was significantly higher than in benign prostatic lesions (OR = 4.72, P = .011), but had a similar level in PCa and adjacent tissues or high-grade prostatic intraepithelial neoplasias (HGPIN). TSA revealed that this analysis on p16 methylation is a false positive result in cancer versus benign prostatic lesions (the estimated required information size of 5116 participants). p16 methylation was not correlated with PCa in the urine and blood. Besides, p16 methylation was not linked to clinical stage, prostate-specific antigen (PSA) level, and Gleason score (GS) of patients with PCa. p14 methylation was not correlated with PCa in tissue and urine samples. No correlation was observed between p14 methylation and clinical stage or GS. CDKN2A mutation and copy number alteration were not associated with prognosis of PCa in overall survival and disease-free survival. CDKN2A expression was not correlated with the prognosis of PCa in overall survival (492 cases) (P > .1), while CDKN2A expression was significantly associated with a poor disease-free survival (P < .01). CDKN2A methylation may not be significantly associated with the development, progression of PCa. Although CDKN2A expression had an unfavorable prognosis in disease-free survival. More studies are needed to confirm our results.
Shaikhibrahim, Zaki; Lindstrot, Andreas; Ochsenfahrt, Jacqueline; Fuchs, Kerstin; Wernert, Nicolas
2013-01-01
Epigenetic changes have been suggested to drive prostate cancer (PCa) development and progression. Therefore, in this study, we aimed to identify novel epigenetics-related genes in PCa tissues, and to examine their expression in metastatic PCa cell lines. We analyzed the expression of epigenetics-related genes via a clustering analysis based on gene function in moderately and poorly differentiated PCa glands compared to normal glands of the peripheral zone (prostate proper) from PCa patients using Whole Human Genome Oligo Microarrays. Our analysis identified 12 epigenetics-related genes with a more than 2-fold increase or decrease in expression and a p-value <0.01. In modera-tely differentiated tumors compared to normal glands of the peripheral zone, we found the genes, TDRD1, IGF2, DICER1, ADARB1, HILS1, GLMN and TRIM27, to be upregulated, whereas TNRC6A and DGCR8 were found to be downregulated. In poorly differentiated tumors, we found TDRD1, ADARB and RBM3 to be upregulated, whereas DGCR8, PIWIL2 and BC069781 were downregulated. Our analysis of the expression level for each gene in the metastatic androgen-sensitive VCaP and LNCaP, and -insensitive PC3 and DU-145 PCa cell lines revealed differences in expression among the cell lines which may reflect the different biological properties of each cell line, and the potential role of each gene at different metastatic sites. The novel epigenetics-related genes that we identified in primary PCa tissues may provide further insight into the role that epigenetic changes play in PCa. Moreover, some of the genes that we identified may play important roles in primary PCa and metastasis, in primary PCa only, or in metastasis only. Follow-up studies are required to investigate the functional role and the role that the expression of these genes play in the outcome and progression of PCa using tissue microarrays.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Steenbergen, K. G., E-mail: kgsteen@gmail.com; Gaston, N.
2014-02-14
Inspired by methods of remote sensing image analysis, we analyze structural variation in cluster molecular dynamics (MD) simulations through a unique application of the principal component analysis (PCA) and Pearson Correlation Coefficient (PCC). The PCA analysis characterizes the geometric shape of the cluster structure at each time step, yielding a detailed and quantitative measure of structural stability and variation at finite temperature. Our PCC analysis captures bond structure variation in MD, which can be used to both supplement the PCA analysis as well as compare bond patterns between different cluster sizes. Relying only on atomic position data, without requirement formore » a priori structural input, PCA and PCC can be used to analyze both classical and ab initio MD simulations for any cluster composition or electronic configuration. Taken together, these statistical tools represent powerful new techniques for quantitative structural characterization and isomer identification in cluster MD.« less
Steenbergen, K G; Gaston, N
2014-02-14
Inspired by methods of remote sensing image analysis, we analyze structural variation in cluster molecular dynamics (MD) simulations through a unique application of the principal component analysis (PCA) and Pearson Correlation Coefficient (PCC). The PCA analysis characterizes the geometric shape of the cluster structure at each time step, yielding a detailed and quantitative measure of structural stability and variation at finite temperature. Our PCC analysis captures bond structure variation in MD, which can be used to both supplement the PCA analysis as well as compare bond patterns between different cluster sizes. Relying only on atomic position data, without requirement for a priori structural input, PCA and PCC can be used to analyze both classical and ab initio MD simulations for any cluster composition or electronic configuration. Taken together, these statistical tools represent powerful new techniques for quantitative structural characterization and isomer identification in cluster MD.
The Classification of Ground Roasted Decaffeinated Coffee Using UV-VIS Spectroscopy and SIMCA Method
NASA Astrophysics Data System (ADS)
Yulia, M.; Asnaning, A. R.; Suhandy, D.
2018-05-01
In this work, an investigation on the classification between decaffeinated and non- decaffeinated coffee samples using UV-VIS spectroscopy and SIMCA method was investigated. Total 200 samples of ground roasted coffee were used (100 samples for decaffeinated coffee and 100 samples for non-decaffeinated coffee). After extraction and dilution, the spectra of coffee samples solution were acquired using a UV-VIS spectrometer (Genesys™ 10S UV-VIS, Thermo Scientific, USA) in the range of 190-1100 nm. The multivariate analyses of the spectra were performed using principal component analysis (PCA) and soft independent modeling of class analogy (SIMCA). The SIMCA model showed that the classification between decaffeinated and non-decaffeinated coffee samples was detected with 100% sensitivity and specificity.
Chabukdhara, Mayuri; Gupta, Sanjay Kumar; Kotecha, Yatharth; Nema, Arvind K
2017-07-01
This study aimed to assess the quality of groundwater and potential health risk due to ingestion of heavy metals in the peri-urban and urban-industrial clusters of Ghaziabad district, Uttar Pradesh, India. Furthermore, the study aimed to evaluate heavy metals sources and their pollution level using multivariate analysis and fuzzy comprehensive assessment (FCA), respectively. Multivariate analysis using principle component analysis (PCA) showed mixed origin for Pb, Cd, Zn, Fe, and Ni, natural source for Cu and Mn and anthropogenic source for Cr. Among all the metals, Pb, Cd, Fe and Ni were above the safe limits of Bureau of Indian Standards (BIS) and World Health Organization (WHO) except Ni. Health risk in terms of hazard quotient (HQ) showed that the HQ values for children were higher than the safe level (HQ = 1) for Pb (2.4) and Cd (2.1) in pre-monsoon while in post-monsoon the value exceeded only for Pb (HQ = 1.23). The health risks of heavy metals for the adults were well within safe limits. The finding of this study indicates potential health risks to the children due to chronic exposure to contaminated groundwater in the region. Based on FCA, groundwater pollution could be categorized as quite high in the peri-urban region, and absolutely high in the urban region of Ghaziabad district. This study showed that different approaches are required for the integrated assessment of the groundwater pollution, and provides a scientific basis for the strategic future planning and comprehensive management. Copyright © 2017 Elsevier Ltd. All rights reserved.
Joshi, Amit D; John, Esther M; Koo, Jocelyn; Ingles, Sue A; Stern, Mariana C
2012-03-01
Studies conducted to assess the association between fish consumption and prostate cancer (PCA) risk are inconclusive. However, few studies have distinguished between fatty and lean fish, and no studies have considered the role of different cooking practices, which may lead to differential accumulation of chemical carcinogens. In this study, we investigated the association between fish intake and localized and advanced PCA taking into account fish types (lean vs. fatty) and cooking practices. We analyzed data for 1,096 controls, 717 localized and 1,140 advanced cases from the California Collaborative Prostate Cancer Study, a multiethnic, population-based case-control study. We used multivariate conditional logistic regression to estimate odds ratios using nutrient density converted variables of fried fish, tuna, dark fish and white fish consumption. We tested for effect modification by cooking methods (high- vs. low-temperature methods) and levels of doneness. We observed that high white fish intake was associated with increased risk of advanced PCA among men who cooked with high-temperature methods (pan-frying, oven-broiling and grilling) until fish was well done (p (trend) = 0.001). No associations were found among men who cooked fish at low temperature and/or just until done (white fish x cooking method p (interaction) = 0.040). Our results indicate that consideration of fish type (oily vs. lean), specific fish cooking practices and levels of doneness of cooked fish helps elucidate the association between fish intake and PCA risk and suggest that avoiding high-temperature cooking methods for white fish may lower PCA risk.
Thomas-Jardin, Shayna E; Kanchwala, Mohammed S; Jacob, Joan; Merchant, Sana; Meade, Rachel K; Gahnim, Nagham M; Nawas, Afshan F; Xing, Chao; Delk, Nikki A
2018-06-01
In immunosurveillance, bone-derived immune cells infiltrate the tumor and secrete inflammatory cytokines to destroy cancer cells. However, cancer cells have evolved mechanisms to usurp inflammatory cytokines to promote tumor progression. In particular, the inflammatory cytokine, interleukin-1 (IL-1), is elevated in prostate cancer (PCa) patient tissue and serum, and promotes PCa bone metastasis. IL-1 also represses androgen receptor (AR) accumulation and activity in PCa cells, yet the cells remain viable and tumorigenic; suggesting that IL-1 may also contribute to AR-targeted therapy resistance. Furthermore, IL-1 and AR protein levels negatively correlate in PCa tumor cells. Taken together, we hypothesize that IL-1 reprograms AR positive (AR + ) PCa cells into AR negative (AR - ) PCa cells that co-opt IL-1 signaling to ensure AR-independent survival and tumor progression in the inflammatory tumor microenvironment. LNCaP and PC3 PCa cells were treated with IL-1β or HS-5 bone marrow stromal cell (BMSC) conditioned medium and analyzed by RNA sequencing and RT-QPCR. To verify genes identified by RNA sequencing, LNCaP, MDA-PCa-2b, PC3, and DU145 PCa cell lines were treated with the IL-1 family members, IL-1α or IL-1β, or exposed to HS-5 BMSC in the presence or absence of Interleukin-1 Receptor Antagonist (IL-1RA). Treated cells were analyzed by western blot and/or RT-QPCR. Comparative analysis of sequencing data from the AR + LNCaP PCa cell line versus the AR - PC3 PCa cell line reveals an IL-1-conferred gene suite in LNCaP cells that is constitutive in PC3 cells. Bioinformatics analysis of the IL-1 regulated gene suite revealed that inflammatory and immune response pathways are primarily elicited; likely facilitating PCa cell survival and tumorigenicity in an inflammatory tumor microenvironment. Our data supports that IL-1 reprograms AR + PCa cells to mimic AR - PCa gene expression patterns that favor AR-targeted treatment resistance and cell survival. © 2018 Wiley Periodicals, Inc.
PCA-LBG-based algorithms for VQ codebook generation
NASA Astrophysics Data System (ADS)
Tsai, Jinn-Tsong; Yang, Po-Yuan
2015-04-01
Vector quantisation (VQ) codebooks are generated by combining principal component analysis (PCA) algorithms with Linde-Buzo-Gray (LBG) algorithms. All training vectors are grouped according to the projected values of the principal components. The PCA-LBG-based algorithms include (1) PCA-LBG-Median, which selects the median vector of each group, (2) PCA-LBG-Centroid, which adopts the centroid vector of each group, and (3) PCA-LBG-Random, which randomly selects a vector of each group. The LBG algorithm finds a codebook based on the better vectors sent to an initial codebook by the PCA. The PCA performs an orthogonal transformation to convert a set of potentially correlated variables into a set of variables that are not linearly correlated. Because the orthogonal transformation efficiently distinguishes test image vectors, the proposed PCA-LBG-based algorithm is expected to outperform conventional algorithms in designing VQ codebooks. The experimental results confirm that the proposed PCA-LBG-based algorithms indeed obtain better results compared to existing methods reported in the literature.
NASA Astrophysics Data System (ADS)
Chitrakar, S.; Miller, S. N.; Liu, T.; Caffrey, P. A.
2015-12-01
Water quality data have been collected from three representative stream reaches in a coalbed methane (CBM) development area for over five years to improve the understanding of salt loading in the system. These streams are located within Atlantic Rim development area of the Muddy Creek in south-central Wyoming. Significant development of CBM wells is ongoing in the study area. Three representative sampling stream reaches included the Duck Pond Draw and Cow Creek, which receive co-produced water, and; South Fork Creek, and upstream Cow Creek which do not receive co-produced water. Water samples were assayed for various parameters which included sodium, calcium, magnesium, fluoride, chlorine, nitrate, O-phosphate, sulfate, carbonate, bicarbonates, and other water quality parameters such as pH, conductivity, and TDS. Based on these water quality parameters we have investigated various hydrochemical and geochemical processes responsible for the high variability in water quality in the region. However, effective interpretation of complex databases to understand aforementioned processes has been a challenging task due to the system's complexity. In this work we applied multivariate statistical techniques including cluster analysis (CA), principle component analysis (PCA) and discriminant analysis (DA) to analyze water quality data and identify similarities and differences among our locations. First, CA technique was applied to group the monitoring sites based on the multivariate similarities. Second, PCA technique was applied to identify the prevalent parameters responsible for the variation of water quality in each group. Third, the DA technique was used to identify the most important factors responsible for variation of water quality during low flow season and high flow season. The purpose of this study is to improve the understanding of factors or sources influencing the spatial and temporal variation of water quality. The ultimate goal of this whole research is to develop coupled salt loading and GIS-based hydrological modelling tool that will be able to simulate the salt loadings under various user defined scenarios in the regions undergoing CBM development. Therefore, the findings from this study will be used to formulate the predominant processes responsible for solute loading.
Ma, Xiaoling; Zuo, Hang; Tian, Mengjing; Zhang, Liyang; Meng, Jia; Zhou, Xuening; Min, Na; Chang, Xinyuan; Liu, Ying
2016-02-01
Metal chemical fractions obtained by optimized BCR three-stage extraction procedure and multivariate analysis techniques were exploited for assessing 7 heavy metals (Cr, Pb, Cd, Co, Cu, Zn and Ni) in sediments from Gansu province, Ningxia and Inner Mongolia Autonomous Regions of the Yellow River in Northern China. The results indicated that higher susceptibility and bioavailability of Cr and Cd with a strong anthropogenic source were due to their higher availability in the exchangeable fraction. A portion of Pb, Cd, Co, Zn, and Ni in reducible fraction may be due to the fact that they can form stable complexes with Fe and Mn oxides. Substantial amount of Pb, Co, Ni and Cu was observed as oxidizable fraction because of their strong affinity to the organic matters so that they can complex with humic substances in sediments. The high geo-accumulation indexes (I(geo)) for Cr and Cd showed their higher environmental risk to the aquatic biota. Principal component analysis (PCA) revealed that high toxic Cr and Cd in polluted sites (Cd in S10, S11 and Cr in S13) may be contributed to anthropogenic sources, it was consistent with the results of dual hierarchical clustering analysis (DHCA), which could give more details about contributing sources. Copyright © 2015 Elsevier Ltd. All rights reserved.
Leachate/domestic wastewater aerobic co-treatment: A pilot-scale study using multivariate analysis.
Ferraz, F M; Bruni, A T; Povinelli, J; Vieira, E M
2016-01-15
Multivariate analysis was used to identify the variables affecting the performance of pilot-scale activated sludge (AS) reactors treating old leachate from a landfill and from domestic wastewater. Raw leachate was pre-treated using air stripping to partially remove the total ammoniacal nitrogen (TAN). The control AS reactor (AS-0%) was loaded only with domestic wastewater, whereas the other reactor was loaded with mixtures containing leachate at volumetric ratios of 2 and 5%. The best removal efficiencies were obtained for a ratio of 2%, as follows: 70 ± 4% for total suspended solids (TSS), 70 ± 3% for soluble chemical oxygen demand (SCOD), 70 ± 4% for dissolved organic carbon (DOC), and 51 ± 9% for the leachate slowly biodegradable organic matter (SBOM). Fourier transform infrared (FTIR) spectroscopic analysis confirmed that most of the SBOM was removed by partial biodegradation rather than dilution or adsorption of organics in the sludge. Nitrification was approximately 80% in the AS-0% and AS-2% reactors. No significant accumulation of heavy metals was observed for any of the tested volumetric ratios. Principal component analysis (PCA) and partial least squares (PLS) indicated that the data dimension could be reduced and that TAN, SCOD, DOC and nitrification efficiency were the main variables that affected the performance of the AS reactors. Copyright © 2015 Elsevier Ltd. All rights reserved.
Liu, Tsang-Sen; Lin, Jhen-Nan; Peng, Tsung-Ren
2018-01-16
Isotopic compositions of δ 2 H, δ 18 O, δ 13 C, and δ 15 N and concentrations of 22 trace elements from garlic samples were analyzed and processed with stepwise principal component analysis (PCA) to discriminate garlic's country of origin among Asian regions including South Korea, Vietnam, Taiwan, and China. Results indicate that there is no single trace-element concentration or isotopic composition that can accomplish the study's purpose and the stepwise PCA approach proposed does allow for discrimination between countries on a regional basis. Sequentially, Step-1 PCA distinguishes garlic's country of origin among Taiwanese, South Korean, and Vietnamese samples; Step-2 PCA discriminates Chinese garlic from South Korean garlic; and Step-3 and Step-4 PCA, Chinese garlic from Vietnamese garlic. In model tests, countries of origin of all audit samples were correctly discriminated by stepwise PCA. Consequently, this study demonstrates that stepwise PCA as applied is a simple and effective approach to discriminating country of origin among Asian garlics. © 2018 American Academy of Forensic Sciences.
Plaque echodensity and textural features are associated with histologic carotid plaque instability.
Doonan, Robert J; Gorgui, Jessica; Veinot, Jean P; Lai, Chi; Kyriacou, Efthyvoulos; Corriveau, Marc M; Steinmetz, Oren K; Daskalopoulou, Stella S
2016-09-01
Carotid plaque echodensity and texture features predict cerebrovascular symptomatology. Our purpose was to determine the association of echodensity and textural features obtained from a digital image analysis (DIA) program with histologic features of plaque instability as well as to identify the specific morphologic characteristics of unstable plaques. Patients scheduled to undergo carotid endarterectomy were recruited and underwent carotid ultrasound imaging. DIA was performed to extract echodensity and textural features using Plaque Texture Analysis software (LifeQ Medical Ltd, Nicosia, Cyprus). Carotid plaque surgical specimens were obtained and analyzed histologically. Principal component analysis (PCA) was performed to reduce imaging variables. Logistic regression models were used to determine if PCA variables and individual imaging variables predicted histologic features of plaque instability. Image analysis data from 160 patients were analyzed. Individual imaging features of plaque echolucency and homogeneity were associated with a more unstable plaque phenotype on histology. These results were independent of age, sex, and degree of carotid stenosis. PCA reduced 39 individual imaging variables to five PCA variables. PCA1 and PCA2 were significantly associated with overall plaque instability on histology (both P = .02), whereas PCA3 did not achieve statistical significance (P = .07). DIA features of carotid plaques are associated with histologic plaque instability as assessed by multiple histologic features. Importantly, unstable plaques on histology appear more echolucent and homogeneous on ultrasound imaging. These results are independent of stenosis, suggesting that image analysis may have a role in refining the selection of patients who undergo carotid endarterectomy. Copyright © 2016 Society for Vascular Surgery. Published by Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Gualandi, Adriano; Serpelloni, Enrico; Elina Belardinelli, Maria; Bonafede, Maurizio; Pezzo, Giuseppe; Tolomei, Cristiano
2015-04-01
A critical point in the analysis of ground displacement time series, as those measured by modern space geodetic techniques (primarly continuous GPS/GNSS and InSAR) is the development of data driven methods that allow to discern and characterize the different sources that generate the observed displacements. A widely used multivariate statistical technique is the Principal Component Analysis (PCA), which allows to reduce the dimensionality of the data space maintaining most of the variance of the dataset explained. It reproduces the original data using a limited number of Principal Components, but it also shows some deficiencies, since PCA does not perform well in finding the solution to the so-called Blind Source Separation (BSS) problem. The recovering and separation of the different sources that generate the observed ground deformation is a fundamental task in order to provide a physical meaning to the possible different sources. PCA fails in the BSS problem since it looks for a new Euclidean space where the projected data are uncorrelated. Usually, the uncorrelation condition is not strong enough and it has been proven that the BSS problem can be tackled imposing on the components to be independent. The Independent Component Analysis (ICA) is, in fact, another popular technique adopted to approach this problem, and it can be used in all those fields where PCA is also applied. An ICA approach enables us to explain the displacement time series imposing a fewer number of constraints on the model, and to reveal anomalies in the data such as transient deformation signals. However, the independence condition is not easy to impose, and it is often necessary to introduce some approximations. To work around this problem, we use a variational bayesian ICA (vbICA) method, which models the probability density function (pdf) of each source signal using a mix of Gaussian distributions. This technique allows for more flexibility in the description of the pdf of the sources, giving a more reliable estimate of them. Here we introduce the vbICA technique and present its application on synthetic data that simulate a GPS network recording ground deformation in a tectonically active region, with synthetic time-series containing interseismic, coseismic, and postseismic deformation, plus seasonal deformation, and white and coloured noise. We study the ability of the algorithm to recover the original (known) sources of deformation, and then apply it to a real scenario: the Emilia seismic sequence (2012, northern Italy), which is an example of seismic sequence occurred in a slowly converging tectonic setting, characterized by several local to regional anthropogenic or natural sources of deformation, mainly subsidence due to fluid withdrawal and sediments compaction. We apply both PCA and vbICA to displacement time-series recorded by continuous GPS and InSAR (Pezzo et al., EGU2015-8950).
NASA Astrophysics Data System (ADS)
Teye, Ernest; Huang, Xingyi; Dai, Huang; Chen, Quansheng
2013-10-01
Quick, accurate and reliable technique for discrimination of cocoa beans according to geographical origin is essential for quality control and traceability management. This current study presents the application of Near Infrared Spectroscopy technique and multivariate classification for the differentiation of Ghana cocoa beans. A total of 194 cocoa bean samples from seven cocoa growing regions were used. Principal component analysis (PCA) was used to extract relevant information from the spectral data and this gave visible cluster trends. The performance of four multivariate classification methods: Linear discriminant analysis (LDA), K-nearest neighbors (KNN), Back propagation artificial neural network (BPANN) and Support vector machine (SVM) were compared. The performances of the models were optimized by cross validation. The results revealed that; SVM model was superior to all the mathematical methods with a discrimination rate of 100% in both the training and prediction set after preprocessing with Mean centering (MC). BPANN had a discrimination rate of 99.23% for the training set and 96.88% for prediction set. While LDA model had 96.15% and 90.63% for the training and prediction sets respectively. KNN model had 75.01% for the training set and 72.31% for prediction set. The non-linear classification methods used were superior to the linear ones. Generally, the results revealed that NIR Spectroscopy coupled with SVM model could be used successfully to discriminate cocoa beans according to their geographical origins for effective quality assurance.
Prabitha, Vasumathi Gopala; Suchetha, Sambasivan; Jayanthi, Jayaraj Lalitha; Baiju, Kamalasanan Vijayakumary; Rema, Prabhakaran; Anuraj, Koyippurath; Mathews, Anita; Sebastian, Paul; Subhash, Narayanan
2016-01-01
Diffuse reflectance (DR) spectroscopy is a non-invasive, real-time, and cost-effective tool for early detection of malignant changes in squamous epithelial tissues. The present study aims to evaluate the diagnostic power of diffuse reflectance spectroscopy for non-invasive discrimination of cervical lesions in vivo. A clinical trial was carried out on 48 sites in 34 patients by recording DR spectra using a point-monitoring device with white light illumination. The acquired data were analyzed and classified using multivariate statistical analysis based on principal component analysis (PCA) and linear discriminant analysis (LDA). Diagnostic accuracies were validated using random number generators. The receiver operating characteristic (ROC) curves were plotted for evaluating the discriminating power of the proposed statistical technique. An algorithm was developed and used to classify non-diseased (normal) from diseased sites (abnormal) with a sensitivity of 72 % and specificity of 87 %. While low-grade squamous intraepithelial lesion (LSIL) could be discriminated from normal with a sensitivity of 56 % and specificity of 80 %, and high-grade squamous intraepithelial lesion (HSIL) from normal with a sensitivity of 89 % and specificity of 97 %, LSIL could be discriminated from HSIL with 100 % sensitivity and specificity. The areas under the ROC curves were 0.993 (95 % confidence interval (CI) 0.0 to 1) and 1 (95 % CI 1) for the discrimination of HSIL from normal and HSIL from LSIL, respectively. The results of the study show that DR spectroscopy could be used along with multivariate analytical techniques as a non-invasive technique to monitor cervical disease status in real time.
Influence of shifting cultivation practices on soil-plant-beetle interactions.
Ibrahim, Kalibulla Syed; Momin, Marcy D; Lalrotluanga, R; Rosangliana, David; Ghatak, Souvik; Zothansanga, R; Kumar, Nachimuthu Senthil; Gurusubramanian, Guruswami
2016-08-01
Shifting cultivation (jhum) is a major land use practice in Mizoram. It was considered as an eco-friendly and efficient method when the cycle duration was long (15-30 years), but it poses the problem of land degradation and threat to ecology when shortened (4-5 years) due to increased intensification of farming systems. Studying beetle community structure is very helpful in understanding how shifting cultivation affects the biodiversity features compared to natural forest system. The present study examines the beetle species diversity and estimates the effects of shifting cultivation practices on the beetle assemblages in relation to change in tree species composition and soil nutrients. Scarabaeidae and Carabidae were observed to be the dominant families in the land use systems studied. Shifting cultivation practice significantly (P < 0.05) affected the beetle and tree species diversity as well as the soil nutrients as shown by univariate (one-way analysis of variance (ANOVA), correlation and regression, diversity indices) and multivariate (cluster analysis, principal component analysis (PCA), detrended correspondence analysis (DCA), canonical variate analysis (CVA), permutational multivariate analysis of variance (PERMANOVA), permutational multivariate analysis of dispersion (PERMDISP)) statistical analyses. Besides changing the tree species composition and affecting the soil fertility, shifting cultivation provides less suitable habitat conditions for the beetle species. Bioindicator analysis categorized the beetle species into forest specialists, anthropogenic specialists (shifting cultivation habitat specialist), and habitat generalists. Molecular analysis of bioindicator beetle species was done using mitochondrial cytochrome oxidase subunit I (COI) marker to validate the beetle species and describe genetic variation among them in relation to heterogeneity, transition/transversion bias, codon usage bias, evolutionary distance, and substitution pattern. The present study revealed the fact that shifting cultivation practice significantly affects the beetle species in terms of biodiversity pattern as well as evolutionary features. Spatiotemporal assessment of soil-plant-beetle interactions in shifting cultivation system and their influence in land degradation and ecology will be helpful in making biodiversity conservation decisions in the near future.
Predicting survival of men with recurrent prostate cancer after radical prostatectomy.
Dell'Oglio, Paolo; Suardi, Nazareno; Boorjian, Stephen A; Fossati, Nicola; Gandaglia, Giorgio; Tian, Zhe; Moschini, Marco; Capitanio, Umberto; Karakiewicz, Pierre I; Montorsi, Francesco; Karnes, R Jeffrey; Briganti, Alberto
2016-02-01
To develop and externally validate a novel nomogram aimed at predicting cancer-specific mortality (CSM) after biochemical recurrence (BCR) among prostate cancer (PCa) patients treated with radical prostatectomy (RP) with or without adjuvant external beam radiotherapy (aRT) and/or hormonal therapy (aHT). The development cohort included 689 consecutive PCa patients treated with RP between 1987 and 2011 with subsequent BCR, defined as two subsequent prostate-specific antigen values >0.2 ng/ml. Multivariable competing-risks regression analyses tested the predictors of CSM after BCR for the purpose of 5-year CSM nomogram development. Validation (2000 bootstrap resamples) was internally tested. External validation was performed into a population of 6734 PCa patients with BCR after treatment with RP at the Mayo Clinic from 1987 to 2011. The predictive accuracy (PA) was quantified using the receiver operating characteristic-derived area under the curve and the calibration plot method. The 5-year CSM-free survival rate was 83.6% (confidence interval [CI]: 79.6-87.2). In multivariable analyses, pathologic stage T3b or more (hazard ratio [HR]: 7.42; p = 0.008), pathologic Gleason score 8-10 (HR: 2.19; p = 0.003), lymph node invasion (HR: 3.57; p = 0.001), time to BCR (HR: 0.99; p = 0.03) and age at BCR (HR: 1.04; p = 0.04), were each significantly associated with the risk of CSM after BCR. The bootstrap-corrected PA was 87.4% (bootstrap 95% CI: 82.0-91.7%). External validation of our nomogram showed a good PA at 83.2%. We developed and externally validated the first nomogram predicting 5-year CSM applicable to contemporary patients with BCR after RP with or without adjuvant treatment. Copyright © 2015 Elsevier Ltd. All rights reserved.
Kortesniemi, Maaria; Vuorinen, Anssi L; Sinkkonen, Jari; Yang, Baoru; Rajala, Ari; Kallio, Heikki
2015-04-01
The oilseeds of the commercially important oilseed rape (Brassica napus) and turnip rape (Brassica rapa) were investigated with (1)H NMR metabolomics. The compositions of ripened (cultivated in field trials) and developing seeds (cultivated in controlled conditions) were compared in multivariate models using principal component analysis (PCA), partial least squares discriminant analysis (PLS-DA), and orthogonal partial least squares discriminant analysis (OPLS-DA). Differences in the major lipids and the minor metabolites between the two species were found. A higher content of polyunsaturated fatty acids and sucrose were observed in turnip rape, while the overall oil content and sinapine levels were higher in oilseed rape. The genotype traits were negligible compared to the effect of the growing site and concomitant conditions on the oilseed metabolome. This study demonstrates the applicability of NMR-based analysis in determining the species, geographical origin, developmental stage, and quality of oilseed Brassicas. Copyright © 2014 Elsevier Ltd. All rights reserved.
Ibrahim, Reham S; Fathy, Hoda
2018-03-30
Tracking the impact of commonly applied post-harvesting and industrial processing practices on the compositional integrity of ginger rhizome was implemented in this work. Untargeted metabolite profiling was performed using digitally-enhanced HPTLC method where the chromatographic fingerprints were extracted using ImageJ software then analysed with multivariate Principal Component Analysis (PCA) for pattern recognition. A targeted approach was applied using a new, validated, simple and fast HPTLC image analysis method for simultaneous quantification of the officially recognized markers 6-, 8-, 10-gingerol and 6-shogaol in conjunction with chemometric Hierarchical Clustering Analysis (HCA). The results of both targeted and untargeted metabolite profiling revealed that peeling, drying in addition to storage employed during processing have a great influence on ginger chemo-profile, the different forms of processed ginger shouldn't be used interchangeably. Moreover, it deemed necessary to consider the holistic metabolic profile for comprehensive evaluation of ginger during processing. Copyright © 2018. Published by Elsevier B.V.
Zhu, Yao; Han, Cheng-Tao; Zhang, Gui-Ming; Liu, Fang; Ding, Qiang; Xu, Jian-Feng; Vidal, Adriana C.; Freedland, Stephen J.; Ng, Chi-Fai; Ye, Ding-Wei
2015-01-01
To develop and externally validate a prostate health index (PHI)-based nomogram for predicting the presence of prostate cancer (PCa) at biopsy in Chinese men with prostate-specific antigen 4–10 ng/mL and normal digital rectal examination (DRE). 347 men were recruited from two hospitals between 2012 and 2014 to develop a PHI-based nomogram to predict PCa. To validate these results, we used a separate cohort of 230 men recruited at another center between 2008 and 2013. Receiver operator curves (ROC) were used to assess the ability to predict PCa. A nomogram was derived from the multivariable logistic regression model and its accuracy was assessed by the area under the ROC (AUC). PHI achieved the highest AUC of 0.839 in the development cohort compared to the other predictors (p < 0.001). Including age and prostate volume, a PHI-based nomogram was constructed and rendered an AUC of 0.877 (95% CI 0.813–0.938). The AUC of the nomogram in the validation cohort was 0.786 (95% CI 0.678–0.894). In clinical effectiveness analyses, the PHI-based nomogram reduced unnecessary biopsies from 42.6% to 27% using a 5% threshold risk of PCa to avoid biopsy with no increase in the number of missed cases relative to conventional biopsy decision. PMID:26471350