Sample records for analysis pls-da model

  1. External characteristic determination of eggs and cracked eggs identification using spectral signature

    PubMed Central

    Xie, Chuanqi; He, Yong

    2016-01-01

    This study was carried out to use hyperspectral imaging technique for determining color (L*, a* and b*) and eggshell strength and identifying cracked chicken eggs. Partial least squares (PLS) models based on full and selected wavelengths suggested by regression coefficient (RC) method were established to predict the four parameters, respectively. Partial least squares-discriminant analysis (PLS-DA) and RC-partial least squares-discriminant analysis (RC-PLS-DA) models were applied to identify cracked eggs. PLS models performed well with the correlation coefficient (rp) of 0.788 for L*, 0.810 for a*, 0.766 for b* and 0.835 for eggshell strength. RC-PLS models also obtained the rp of 0.771 for L*, 0.806 for a*, 0.767 for b* and 0.841 for eggshell strength. The classification results were 97.06% in PLS-DA model and 88.24% in RC-PLS-DA model. It demonstrated that hyperspectral imaging technique has the potential to be used to detect color and eggshell strength values and identify cracked chicken eggs. PMID:26882990
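
The discriminant step named in the abstract above can be sketched as a PLS regression on 0/1 class labels followed by a cut-off. This is a minimal sketch, not the authors' implementation: it assumes a single latent variable, a 0.5 threshold, and invented three-variable "spectra"; the function names `fit_pls_da_1lv` and `predict_pls_da` are hypothetical.

```python
from math import sqrt


def fit_pls_da_1lv(X, y):
    """Fit a one-latent-variable PLS model to 0/1 class labels (assumed setup)."""
    n, p = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    # Mean-centre predictors and response.
    Xc = [[row[j] - x_mean[j] for j in range(p)] for row in X]
    yc = [v - y_mean for v in y]
    # Weight vector: direction in X with maximum covariance with y.
    w = [sum(Xc[i][j] * yc[i] for i in range(n)) for j in range(p)]
    norm = sqrt(sum(v * v for v in w))
    w = [v / norm for v in w]
    # Scores of the training samples on the latent variable.
    t = [sum(Xc[i][j] * w[j] for j in range(p)) for i in range(n)]
    # Regression of centred y on the scores.
    q = sum(t[i] * yc[i] for i in range(n)) / sum(v * v for v in t)
    return x_mean, y_mean, w, q


def predict_pls_da(model, x, threshold=0.5):
    """Predict the continuous response, then cut at the (assumed) threshold."""
    x_mean, y_mean, w, q = model
    score = sum((x[j] - x_mean[j]) * w[j] for j in range(len(x)))
    y_hat = y_mean + q * score
    return 1 if y_hat > threshold else 0


# Invented toy data: class 0 = "intact", class 1 = "cracked" (illustrative only).
X_toy = [[1.0, 2.0, 1.0], [1.1, 2.1, 0.9], [0.9, 1.9, 1.1],
         [2.0, 1.0, 1.0], [2.1, 0.9, 1.1], [1.9, 1.1, 0.9]]
y_toy = [0, 0, 0, 1, 1, 1]
toy_model = fit_pls_da_1lv(X_toy, y_toy)
print([predict_pls_da(toy_model, x) for x in X_toy])
```

Real spectral data would normally need several latent variables chosen by cross-validation; the single component here only keeps the sketch short.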

  2. Identification of solid state fermentation degree with FT-NIR spectroscopy: Comparison of wavelength variable selection methods of CARS and SCARS.

    PubMed

    Jiang, Hui; Zhang, Hang; Chen, Quansheng; Mei, Congli; Liu, Guohai

    2015-01-01

    The use of wavelength variable selection before partial least squares discriminant analysis (PLS-DA) for qualitative identification of solid state fermentation degree by FT-NIR spectroscopy was investigated in this study. Two wavelength variable selection methods, competitive adaptive reweighted sampling (CARS) and stability competitive adaptive reweighted sampling (SCARS), were employed to select the important wavelengths. PLS-DA was applied to calibrate identification models using the wavelength variables selected by CARS and SCARS. Experimental results showed that CARS and SCARS selected 58 and 47 wavelength variables, respectively, from the 1557 original wavelength variables. Compared with full-spectrum PLS-DA, both wavelength variable selection methods enhanced the performance of the identification models. Moreover, compared with the CARS-PLS-DA model, the SCARS-PLS-DA model achieved better results, with an identification rate of 91.43% in the validation process. The overall results demonstrate that a PLS-DA model constructed using wavelength variables selected by a proper wavelength variable selection method can identify solid state fermentation degree more accurately. Copyright © 2015 Elsevier B.V. All rights reserved.

  3. Identification of solid state fermentation degree with FT-NIR spectroscopy: Comparison of wavelength variable selection methods of CARS and SCARS

    NASA Astrophysics Data System (ADS)

    Jiang, Hui; Zhang, Hang; Chen, Quansheng; Mei, Congli; Liu, Guohai

    2015-10-01

    The use of wavelength variable selection before partial least squares discriminant analysis (PLS-DA) for qualitative identification of solid state fermentation degree by FT-NIR spectroscopy was investigated in this study. Two wavelength variable selection methods, competitive adaptive reweighted sampling (CARS) and stability competitive adaptive reweighted sampling (SCARS), were employed to select the important wavelengths. PLS-DA was applied to calibrate identification models using the wavelength variables selected by CARS and SCARS. Experimental results showed that CARS and SCARS selected 58 and 47 wavelength variables, respectively, from the 1557 original wavelength variables. Compared with full-spectrum PLS-DA, both wavelength variable selection methods enhanced the performance of the identification models. Moreover, compared with the CARS-PLS-DA model, the SCARS-PLS-DA model achieved better results, with an identification rate of 91.43% in the validation process. The overall results demonstrate that a PLS-DA model constructed using wavelength variables selected by a proper wavelength variable selection method can identify solid state fermentation degree more accurately.

  4. Classification and quantitation of milk powder by near-infrared spectroscopy and mutual information-based variable selection and partial least squares

    NASA Astrophysics Data System (ADS)

    Chen, Hui; Tan, Chao; Lin, Zan; Wu, Tong

    2018-01-01

    Milk is among the most popular nutrient sources worldwide and is of great interest due to its beneficial medicinal properties. The feasibility of classifying milk powder samples with respect to their brands and of determining protein concentration is investigated by NIR spectroscopy along with chemometrics. Two datasets were prepared for the experiment. One contains 179 samples of four brands for classification and the other contains 30 samples for quantitative analysis. Principal component analysis (PCA) was used for exploratory analysis. Based on an effective model-independent variable selection method, i.e., minimal-redundancy maximal-relevance (MRMR), only 18 variables were selected to construct a partial least-square discriminant analysis (PLS-DA) model. On the test set, the PLS-DA model based on the selected variable set was compared with the full-spectrum PLS-DA model, both of which achieved 100% accuracy. In quantitative analysis, the partial least-square regression (PLSR) model constructed from the selected subset of 260 variables significantly outperforms the full-spectrum model. It seems that the combination of NIR spectroscopy, MRMR and PLS-DA or PLSR is a powerful tool for classifying different brands of milk and determining the protein content.

  5. Development and validation of a Partial Least Squares-Discriminant Analysis (PLS-DA) model based on the determination of ethyl glucuronide (EtG) and fatty acid ethyl esters (FAEEs) in hair for the diagnosis of chronic alcohol abuse.

    PubMed

    Alladio, E; Giacomelli, L; Biosa, G; Corcia, D Di; Gerace, E; Salomone, A; Vincenti, M

    2018-01-01

    The chronic intake of an excessive amount of alcohol is currently ascertained by determining the concentration of direct alcohol metabolites in the hair samples of the alleged abusers, including ethyl glucuronide (EtG) and, less frequently, fatty acid ethyl esters (FAEEs). Indirect blood biomarkers of alcohol abuse are still determined to support hair EtG results and diagnose a consequent liver impairment. In the present study, the supporting role of hair FAEEs is compared with indirect blood biomarkers with respect to the contexts in which hair EtG interpretation is uncertain. Receiver Operating Characteristics (ROC) curves and multivariate Principal Component Analysis (PCA) demonstrated much stronger correlation of EtG results with FAEEs than with any single indirect biomarker or their combinations. Partial Least Squares Discriminant Analysis (PLS-DA) models based on hair EtG and FAEEs were developed to maximize the biomarkers' information content on a multivariate background. The final PLS-DA model yielded 100% correct classification on a training/evaluation dataset of 155 subjects, including both chronic alcohol abusers and social drinkers. Then, the PLS-DA model was validated on an external dataset of 81 individuals, providing optimal discrimination ability between chronic alcohol abusers and social drinkers in terms of specificity and sensitivity. The PLS-DA scores obtained for each subject, with respect to the PLS-DA model threshold that separates the probabilistic distributions for the two classes, furnished a likelihood ratio value, which in turn conveys the strength of the experimental data support to the classification decision, within a Bayesian logic. Typical boundary real cases from daily work are discussed, too. Copyright © 2017 Elsevier B.V. All rights reserved.
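
The likelihood ratio mentioned in the abstract above compares how probable a subject's PLS-DA score is under each class distribution. A minimal sketch follows, assuming (this is not stated in the abstract) that the scores of each class are modelled as Gaussian; the means, standard deviations, and function names are invented for illustration.

```python
from math import exp, pi, sqrt


def gaussian_pdf(x, mean, sd):
    """Density of a normal distribution (assumed model for class scores)."""
    return exp(-0.5 * ((x - mean) / sd) ** 2) / (sd * sqrt(2 * pi))


def likelihood_ratio(score, abuser_params, drinker_params):
    """LR > 1 supports 'chronic abuser'; LR < 1 supports 'social drinker'."""
    return gaussian_pdf(score, *abuser_params) / gaussian_pdf(score, *drinker_params)


# Hypothetical (mean, sd) of PLS-DA scores for each class.
abusers = (1.0, 0.3)
drinkers = (0.0, 0.3)
for s in (0.2, 0.5, 0.8):
    print(s, likelihood_ratio(s, abusers, drinkers))
```

With equal spreads, a score exactly midway between the two class means gives a ratio of 1, i.e. the data support neither hypothesis over the other.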

  6. Local classification: Locally weighted-partial least squares-discriminant analysis (LW-PLS-DA).

    PubMed

    Bevilacqua, Marta; Marini, Federico

    2014-08-01

    The possibility of devising a simple, flexible and accurate non-linear classification method, by extending the locally weighted partial least squares (LW-PLS) approach to the cases where the algorithm is used in a discriminant way (partial least squares discriminant analysis, PLS-DA), is presented. In particular, to assess which category an unknown sample belongs to, the proposed algorithm operates by identifying which training objects are most similar to the one to be predicted and building a PLS-DA model using these calibration samples only. Moreover, the influence of the selected training samples on the local model can be further modulated by adopting a non-uniform distance-based weighting scheme which allows the farthest calibration objects to have less impact than the closest ones. The performance of the proposed locally weighted-partial least squares-discriminant analysis (LW-PLS-DA) algorithm has been tested on three simulated data sets characterized by a varying degree of non-linearity: in all cases, a classification accuracy higher than 99% on external validation samples was achieved. Moreover, when also applied to a real data set (classification of rice varieties), characterized by a high extent of non-linearity, the proposed method provided an average correct classification rate of about 93% on the test set. Based on the preliminary results shown in this paper, the performance of the proposed LW-PLS-DA approach has proved to be comparable to, and in some cases better than, that obtained by other non-linear methods (k nearest neighbors, kernel-PLS-DA and, in the case of rice, counterpropagation neural networks). Copyright © 2014 Elsevier B.V. All rights reserved.
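
The local step described in the abstract above (pick the training objects most similar to the query, then down-weight the farther ones) can be sketched as follows. This covers only the neighbour selection and weighting, not the local PLS-DA fit itself. The Euclidean distance, the Gaussian-style kernel, and the names `local_calibration_set`, `k`, and `bandwidth` are assumptions, since the abstract does not specify them.

```python
from math import dist, exp


def local_calibration_set(X_train, x_query, k, bandwidth):
    """Return (index, weight) pairs for the k training objects nearest to x_query."""
    # Rank all training objects by Euclidean distance to the query.
    distances = sorted((dist(x, x_query), i) for i, x in enumerate(X_train))
    nearest = distances[:k]
    # Assumed kernel: weight decays smoothly as distance grows.
    return [(i, exp(-(d / bandwidth) ** 2)) for d, i in nearest]


# Invented two-variable training objects.
X_train = [[0.0, 0.0], [1.0, 0.0], [5.0, 5.0], [0.5, 0.0]]
print(local_calibration_set(X_train, [0.0, 0.0], k=3, bandwidth=1.0))
```

The returned weights would then enter a weighted PLS-DA fit on the selected samples only, so a separate local model is built for every query.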

  7. Terahertz time-domain attenuated total reflection spectroscopy applied to the rapid discrimination of the botanical origin of honeys

    NASA Astrophysics Data System (ADS)

    Liu, Wen; Zhang, Yuying; Yang, Si; Han, Donghai

    2018-05-01

    A new technique to identify the floral sources of honeys is needed. Terahertz time-domain attenuated total reflection spectroscopy combined with chemometric methods was applied to discriminate different categories (Medlar honey, Vitex honey, and Acacia honey). Principal component analysis (PCA), cluster analysis (CA) and partial least squares-discriminant analysis (PLS-DA) were used to extract information on the botanical origins of honeys. The spectral range was also examined to increase the precision of the PLS-DA model. An accuracy of 88.46% for the validation set was obtained using the PLS-DA model in the 0.5-1.5 THz range. This work indicated that terahertz time-domain attenuated total reflection spectroscopy is a viable approach for rapidly evaluating the quality of honey.

  8. Nearest clusters based partial least squares discriminant analysis for the classification of spectral data.

    PubMed

    Song, Weiran; Wang, Hui; Maguire, Paul; Nibouche, Omar

    2018-06-07

    Partial Least Squares Discriminant Analysis (PLS-DA) is one of the most effective multivariate analysis methods for spectral data analysis, which extracts latent variables and uses them to predict responses. In particular, it is an effective method for handling high-dimensional and collinear spectral data. However, PLS-DA does not explicitly address data multimodality, i.e., within-class multimodal distribution of data. In this paper, we present a novel method termed nearest clusters based PLS-DA (NCPLS-DA) for addressing the multimodality and nonlinearity issues explicitly and improving the performance of PLS-DA on spectral data classification. The new method applies hierarchical clustering to divide samples into clusters and calculates the corresponding centre of every cluster. For a given query point, only clusters whose centres are nearest to such a query point are used for PLS-DA. Such a method can provide a simple and effective tool for separating multimodal and nonlinear classes into clusters which are locally linear and unimodal. Experimental results on 17 datasets, including 12 UCI and 5 spectral datasets, show that NCPLS-DA can outperform 4 baseline methods, namely, PLS-DA, kernel PLS-DA, local PLS-DA and k-NN, achieving the highest classification accuracy most of the time. Copyright © 2018 Elsevier B.V. All rights reserved.
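
The selection step described in the abstract above (compute a centre for every cluster, then keep only the clusters whose centres lie nearest to the query) can be sketched as below. The cluster labels are taken as given here; in the paper they come from hierarchical clustering, which this sketch does not implement. The function names and the parameter `m` (number of clusters kept) are hypothetical.

```python
from math import dist


def cluster_centres(X, labels):
    """Mean vector of the samples in each cluster."""
    groups = {}
    for x, lab in zip(X, labels):
        groups.setdefault(lab, []).append(x)
    return {lab: [sum(col) / len(rows) for col in zip(*rows)]
            for lab, rows in groups.items()}


def nearest_cluster_samples(X, labels, x_query, m):
    """Indices of samples belonging to the m clusters closest to x_query."""
    centres = cluster_centres(X, labels)
    ranked = sorted(centres, key=lambda lab: dist(centres[lab], x_query))
    keep = set(ranked[:m])
    return [i for i, lab in enumerate(labels) if lab in keep]


# Invented data: three clusters of two samples each.
X = [[0, 0], [0, 2], [10, 10], [10, 12], [5, 5], [5, 7]]
labels = [0, 0, 1, 1, 2, 2]
print(nearest_cluster_samples(X, labels, [1, 1], m=2))
```

Only the returned samples would be passed to PLS-DA for that query, which is what makes the model local.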

  9. Data Mining Methods for Omics and Knowledge of Crude Medicinal Plants toward Big Data Biology

    PubMed Central

    Afendi, Farit M.; Ono, Naoaki; Nakamura, Yukiko; Nakamura, Kensuke; Darusman, Latifah K.; Kibinge, Nelson; Morita, Aki Hirai; Tanaka, Ken; Horai, Hisayuki; Altaf-Ul-Amin, Md.; Kanaya, Shigehiko

    2013-01-01

    Molecular biological data has rapidly increased with the recent progress of the Omics fields, e.g., genomics, transcriptomics, proteomics and metabolomics that necessitates the development of databases and methods for efficient storage, retrieval, integration and analysis of massive data. The present study reviews the usage of KNApSAcK Family DB in metabolomics and related area, discusses several statistical methods for handling multivariate data and shows their application on Indonesian blended herbal medicines (Jamu) as a case study. Exploration using Biplot reveals many plants are rarely utilized while some plants are highly utilized toward specific efficacy. Furthermore, the ingredients of Jamu formulas are modeled using Partial Least Squares Discriminant Analysis (PLS-DA) in order to predict their efficacy. The plants used in each Jamu medicine served as the predictors, whereas the efficacy of each Jamu provided the responses. This model produces 71.6% correct classification in predicting efficacy. Permutation test then is used to determine plants that serve as main ingredients in Jamu formula by evaluating the significance of the PLS-DA coefficients. Next, in order to explain the role of plants that serve as main ingredients in Jamu medicines, information of pharmacological activity of the plants is added to the predictor block. Then N-PLS-DA model, multiway version of PLS-DA, is utilized to handle the three-dimensional array of the predictor block. The resulting N-PLS-DA model reveals that the effects of some pharmacological activities are specific for certain efficacy and the other activities are diverse toward many efficacies. Mathematical modeling introduced in the present study can be utilized in global analysis of big data targeting to reveal the underlying biology. PMID:24688691

  10. Sample classification for improved performance of PLS models applied to the quality control of deep-frying oils of different botanic origins analyzed using ATR-FTIR spectroscopy.

    PubMed

    Kuligowski, Julia; Carrión, David; Quintás, Guillermo; Garrigues, Salvador; de la Guardia, Miguel

    2011-01-01

    The selection of an appropriate calibration set is a critical step in multivariate method development. In this work, the effect of using different calibration sets, based on a previous classification of unknown samples, on the partial least squares (PLS) regression model performance has been discussed. As an example, attenuated total reflection (ATR) mid-infrared spectra of deep-fried vegetable oil samples from three botanical origins (olive, sunflower, and corn oil), with increasing polymerized triacylglyceride (PTG) content induced by a deep-frying process were employed. The use of a one-class-classifier partial least squares-discriminant analysis (PLS-DA) and a rooted binary directed acyclic graph tree provided accurate oil classification. Oil samples fried without foodstuff could be classified correctly, independent of their PTG content. However, class separation of oil samples fried with foodstuff, was less evident. The combined use of double-cross model validation with permutation testing was used to validate the obtained PLS-DA classification models, confirming the results. To discuss the usefulness of the selection of an appropriate PLS calibration set, the PTG content was determined by calculating a PLS model based on the previously selected classes. In comparison to a PLS model calculated using a pooled calibration set containing samples from all classes, the root mean square error of prediction could be improved significantly using PLS models based on the selected calibration sets using PLS-DA, ranging between 1.06 and 2.91% (w/w).

  11. Rapid Elemental Analysis and Provenance Study of Blumea balsamifera DC Using Laser-Induced Breakdown Spectroscopy

    PubMed Central

    Liu, Xiaona; Zhang, Qiao; Wu, Zhisheng; Shi, Xinyuan; Zhao, Na; Qiao, Yanjiang

    2015-01-01

    Laser-induced breakdown spectroscopy (LIBS) was applied to perform a rapid elemental analysis and provenance study of Blumea balsamifera DC. Principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) were implemented to exploit the multivariate nature of the LIBS data. Scores and loadings of computed principal components visually illustrated the differing spectral data. The PLS-DA algorithm showed good classification performance. The PLS-DA model using complete spectra as input variables had similar discrimination performance to using selected spectral lines as input variables. The down-selection of spectral lines was specifically focused on the major elements of B. balsamifera samples. Results indicated that LIBS could be used to rapidly analyze elements and to perform provenance study of B. balsamifera. PMID:25558999

  12. Multivariate analysis of standoff laser-induced breakdown spectroscopy spectra for classification of explosive-containing residues

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    De Lucia, Frank C. Jr.; Gottfried, Jennifer L.; Munson, Chase A.

    2008-11-01

    A technique being evaluated for standoff explosives detection is laser-induced breakdown spectroscopy (LIBS). LIBS is a real-time sensor technology that uses components that can be configured into a ruggedized standoff instrument. The U.S. Army Research Laboratory has been coupling standoff LIBS spectra with chemometrics for several years now in order to discriminate between explosives and nonexplosives. We have investigated the use of partial least squares discriminant analysis (PLS-DA) for explosives detection. We have extended our study of PLS-DA to more complex sample types, including binary mixtures, different types of explosives, and samples not included in the model. We demonstrate the importance of building the PLS-DA model by iteratively testing it against sample test sets. Independent test sets are used to test the robustness of the final model.

  13. Influence of variable selection on partial least squares discriminant analysis models for explosive residue classification

    NASA Astrophysics Data System (ADS)

    De Lucia, Frank C., Jr.; Gottfried, Jennifer L.

    2011-02-01

    Using a series of thirteen organic materials that includes novel high-nitrogen energetic materials, conventional organic military explosives, and benign organic materials, we have demonstrated the importance of variable selection for maximizing residue discrimination with partial least squares discriminant analysis (PLS-DA). We built several PLS-DA models using different variable sets based on laser induced breakdown spectroscopy (LIBS) spectra of the organic residues on an aluminum substrate under an argon atmosphere. The model classification results for each sample are presented and the influence of the variables on these results is discussed. We found that using the whole spectra as the data input for the PLS-DA model gave the best results. However, variables due to the surrounding atmosphere and the substrate contribute to discrimination when the whole spectra are used, indicating this may not be the most robust model. Further iterative testing with additional validation data sets is necessary to determine the most robust model.

  14. Discrimination of biological and chemical threat simulants in residue mixtures on multiple substrates.

    PubMed

    Gottfried, Jennifer L

    2011-07-01

    The potential of laser-induced breakdown spectroscopy (LIBS) to discriminate biological and chemical threat simulant residues prepared on multiple substrates and in the presence of interferents has been explored. The simulant samples tested include Bacillus atrophaeus spores, Escherichia coli, MS-2 bacteriophage, α-hemolysin from Staphylococcus aureus, 2-chloroethyl ethyl sulfide, and dimethyl methylphosphonate. The residue samples were prepared on polycarbonate, stainless steel and aluminum foil substrates by Battelle Eastern Science and Technology Center. LIBS spectra were collected by Battelle on a portable LIBS instrument developed by A3 Technologies. This paper presents the chemometric analysis of the LIBS spectra using partial least-squares discriminant analysis (PLS-DA). The performance of PLS-DA models developed based on the full LIBS spectra, and selected emission intensities and ratios have been compared. The full-spectra models generally provided better classification results based on the inclusion of substrate emission features; however, the intensity/ratio models were able to correctly identify more types of simulant residues in the presence of interferents. The fusion of the two types of PLS-DA models resulted in a significant improvement in classification performance for models built using multiple substrates. In addition to identifying the major components of residue mixtures, minor components such as growth media and solvents can be identified with an appropriately designed PLS-DA model.

  15. Discrimination of healthy and osteoarthritic articular cartilages by Fourier transform infrared imaging and partial least squares-discriminant analysis

    PubMed Central

    Zhang, Xue-Xi; Yin, Jian-Hua; Mao, Zhi-Hua; Xia, Yang

    2015-01-01

    Fourier transform infrared imaging (FTIRI) combined with chemometric algorithms has strong potential to obtain complex chemical information from biological tissues. FTIRI and partial least squares-discriminant analysis (PLS-DA) were used to differentiate healthy and osteoarthritic (OA) cartilages for the first time. A PLS model was built on the calibration matrix of spectra that was randomly selected from the FTIRI spectral datasets of healthy and lesioned cartilage. Leave-one-out cross-validation was performed in the PLS model, and the fitting coefficient between actual and predicted categorical values of the calibration matrix reached 0.95. In the calibration and prediction matrices, the correct identification rates of healthy and lesioned cartilage spectra were 100% and 90.24%, respectively. These results demonstrated that FTIRI combined with PLS-DA could provide a promising approach for the categorical identification of healthy and OA cartilage specimens. PMID:26057029

  16. Discrimination of healthy and osteoarthritic articular cartilages by Fourier transform infrared imaging and partial least squares-discriminant analysis.

    PubMed

    Zhang, Xue-Xi; Yin, Jian-Hua; Mao, Zhi-Hua; Xia, Yang

    2015-06-01

    Fourier transform infrared imaging (FTIRI) combined with chemometric algorithms has strong potential to obtain complex chemical information from biological tissues. FTIRI and partial least squares-discriminant analysis (PLS-DA) were used to differentiate healthy and osteoarthritic (OA) cartilages for the first time. A PLS model was built on the calibration matrix of spectra that was randomly selected from the FTIRI spectral datasets of healthy and lesioned cartilage. Leave-one-out cross-validation was performed in the PLS model, and the fitting coefficient between actual and predicted categorical values of the calibration matrix reached 0.95. In the calibration and prediction matrices, the correct identification rates of healthy and lesioned cartilage spectra were 100% and 90.24%, respectively. These results demonstrated that FTIRI combined with PLS-DA could provide a promising approach for the categorical identification of healthy and OA cartilage specimens.

  17. Commercial tree species discrimination using airborne AISA Eagle hyperspectral imagery and partial least squares discriminant analysis (PLS-DA) in KwaZulu-Natal, South Africa

    NASA Astrophysics Data System (ADS)

    Peerbhay, Kabir Yunus; Mutanga, Onisimo; Ismail, Riyad

    2013-05-01

    Discriminating commercial tree species using hyperspectral remote sensing techniques is critical in monitoring the spatial distributions and compositions of commercial forests. However, issues related to data dimensionality and multicollinearity limit the successful application of the technology. The aim of this study was to examine the utility of the partial least squares discriminant analysis (PLS-DA) technique in accurately classifying six exotic commercial forest species (Eucalyptus grandis, Eucalyptus nitens, Eucalyptus smithii, Pinus patula, Pinus elliotii and Acacia mearnsii) using airborne AISA Eagle hyperspectral imagery (393-900 nm). Additionally, the variable importance in the projection (VIP) method was used to identify subsets of bands that could successfully discriminate the forest species. Results indicated that the PLS-DA model that used all the AISA Eagle bands (n = 230) produced an overall accuracy of 80.61% and a kappa value of 0.77, with user's and producer's accuracies ranging from 50% to 100%. In comparison, incorporating the optimal subset of VIP selected wavebands (n = 78) in the PLS-DA model resulted in an improved overall accuracy of 88.78% and a kappa value of 0.87, with user's and producer's accuracies ranging from 70% to 100%. Bands located predominantly within the visible region of the electromagnetic spectrum (393-723 nm) showed the most capability in terms of discriminating between the six commercial forest species. Overall, the research has demonstrated the potential of using PLS-DA for reducing the dimensionality of hyperspectral datasets as well as determining the optimal subset of bands to produce the highest classification accuracies.
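
The overall accuracy and kappa value quoted in the abstract above are both derived from a confusion matrix. A minimal sketch of that arithmetic follows, using Cohen's kappa, kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement and p_e the agreement expected by chance. The 2x2 matrix is invented for illustration and is not the study's data; the function name is hypothetical.

```python
def accuracy_and_kappa(confusion):
    """Overall accuracy and Cohen's kappa from a square confusion matrix."""
    n = sum(sum(row) for row in confusion)
    # Observed agreement: fraction of samples on the diagonal.
    observed = sum(confusion[i][i] for i in range(len(confusion))) / n
    row_totals = [sum(row) for row in confusion]
    col_totals = [sum(col) for col in zip(*confusion)]
    # Agreement expected by chance from the marginal totals.
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / n ** 2
    kappa = (observed - expected) / (1 - expected)
    return observed, kappa


# Invented example: rows = reference classes, columns = predicted classes.
print(accuracy_and_kappa([[8, 2], [1, 9]]))
```

Kappa is lower than accuracy whenever some agreement would be expected by chance alone, which is why the abstract reports both.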

  18. Texture analysis of pulmonary parenchymateous changes related to pulmonary thromboembolism in dogs - a novel approach using quantitative methods.

    PubMed

    Marschner, C B; Kokla, M; Amigo, J M; Rozanski, E A; Wiinberg, B; McEvoy, F J

    2017-07-11

    Diagnosis of pulmonary thromboembolism (PTE) in dogs relies on computed tomography pulmonary angiography (CTPA), but detailed interpretation of CTPA images is demanding for the radiologist and only large vessels may be evaluated. New approaches for better detection of smaller thrombi include dual energy computed tomography (DECT) as well as computer assisted diagnosis (CAD) techniques. The purpose of this study was to investigate the performance of quantitative texture analysis for detecting dogs with PTE using grey-level co-occurrence matrices (GLCM) and multivariate statistical classification analyses. CT images from healthy (n = 6) and diseased (n = 29) dogs with and without PTE confirmed on CTPA were segmented so that only tissue with CT numbers between -1024 and -250 Hounsfield Units (HU) was preserved. GLCM analysis and subsequent multivariate classification analyses were performed on texture parameters extracted from these images. Leave-one-dog-out cross validation and receiver operator characteristic (ROC) showed that the models generated from the texture analysis were able to predict healthy dogs with optimal levels of performance. Partial Least Square Discriminant Analysis (PLS-DA) obtained a sensitivity of 94% and a specificity of 96%, while Support Vector Machines (SVM) yielded a sensitivity of 99% and a specificity of 100%. The models, however, performed worse in classifying the type of disease in the diseased dog group: In diseased dogs with PTE sensitivities were 30% (PLS-DA) and 38% (SVM), and specificities were 80% (PLS-DA) and 89% (SVM). In diseased dogs without PTE the sensitivities of the models were 59% (PLS-DA) and 79% (SVM) and specificities were 79% (PLS-DA) and 82% (SVM). The results indicate that texture analysis of CTPA images using GLCM is an effective tool for distinguishing healthy from abnormal lung. Furthermore, the texture of pulmonary parenchyma in dogs with PTE is altered when compared to the texture of pulmonary parenchyma of healthy dogs. The models' poorer performance in classifying dogs within the diseased group may be related to the low number of dogs compared to texture variables, a lack of balanced number of dogs within each group or a real lack of difference in the texture features among the diseased dogs.
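
The validation scheme and metrics named in the abstract above can be sketched in a few lines. Leave-one-dog-out cross-validation holds out all samples from one animal at a time so that images of the same dog never appear in both training and test sets; sensitivity and specificity are then computed from the pooled predictions. The function names, dog identifiers, and labels below are invented for illustration, and no classifier is fitted here.

```python
def leave_one_group_out(groups):
    """Yield (held_out_group, train_indices, test_indices) for each group."""
    for held in sorted(set(groups)):
        train = [i for i, g in enumerate(groups) if g != held]
        test = [i for i, g in enumerate(groups) if g == held]
        yield held, train, test


def sensitivity_specificity(y_true, y_pred, positive=1):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical image-to-dog assignment: two images each from dogs A and C.
for held, train, test in leave_one_group_out(["dogA", "dogA", "dogB", "dogC", "dogC"]):
    print(held, train, test)

# Hypothetical pooled labels and predictions (1 = diseased, 0 = healthy).
print(sensitivity_specificity([1, 1, 1, 0, 0, 0, 0], [1, 1, 0, 0, 0, 0, 1]))
```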

  19. Detection of drug active ingredients by chemometric processing of solid-state NMR spectrometry data -- the case of acetaminophen.

    PubMed

    Paradowska, Katarzyna; Jamróz, Marta Katarzyna; Kobyłka, Mariola; Gowin, Ewelina; Maczka, Paulina; Skibiński, Robert; Komsta, Łukasz

    2012-01-01

    This paper presents a preliminary study in building discriminant models from solid-state NMR spectrometry data to detect the presence of acetaminophen in over-the-counter pharmaceutical formulations. The dataset, containing 11 spectra of pure substances and 21 spectra of various formulations, was processed by partial least squares discriminant analysis (PLS-DA). The resulting model achieved the discrimination, and its quality parameters were acceptable. It was found that standard normal variate preprocessing had almost no influence on unsupervised investigation of the dataset. The influence of variable selection with the uninformative variable elimination by PLS method was studied, reducing the dataset from 7601 variables to around 300 informative variables, but not improving the model performance. The results showed the possibility to construct well-working PLS-DA models from such small datasets without a full experimental design.

  20. Mass Spectrometry and Fourier Transform Infrared Spectroscopy for Analysis of Biological Materials

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Anderson, Timothy J.

    Time-of-flight mass spectrometry along with statistical analysis was utilized to study metabolic profiles among rats fed resistant starch (RS) diets. Fischer 344 rats were fed four starch diets consisting of 55% (w/w, dbs) starch. A control starch diet consisting of corn starch was compared against three RS diets. The RS diets were high-amylose corn starch (HA7), HA7 chemically modified with octenyl succinic anhydride, and stearic-acid-complexed HA7 starch. A subgroup received antibiotic treatment to determine if perturbations in the gut microbiome were long lasting. A second subgroup was treated with azoxymethane (AOM), a carcinogen. At the end of the eight-week study, cecal and distal-colon content samples were collected from the sacrificed rats. Metabolites were extracted from cecal and distal colon samples into acetonitrile. The extracts were then analyzed on an accurate-mass time-of-flight mass spectrometer to obtain their metabolic profile. The data were analyzed using partial least-squares discriminant analysis (PLS-DA). The PLS-DA analysis utilized a training set and verification set to classify samples within diet and treatment groups. PLS-DA could reliably differentiate the diet treatments for both cecal and distal colon samples. The PLS-DA analyses of the antibiotic and no antibiotic treated subgroups were well classified for cecal samples and modestly separated for distal-colon samples. PLS-DA analysis had limited success separating distal colon samples for rats given AOM from those not treated; the cecal samples from AOM had very poor classification. Mass spectrometry profiling coupled with PLS-DA can readily classify metabolite differences among rats given RS diets.

  1. High-resolution time-of-flight mass spectrometry fingerprinting of metabolites from cecum and distal colon contents of rats fed resistant starch

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Anderson, Timothy J.; Jones, Roger W.; Ai, Yongfeng

    Time-of-flight mass spectrometry along with statistical analysis was utilized to study metabolic profiles among rats fed resistant starch (RS) diets. Fischer 344 rats were fed four starch diets consisting of 55 % (w/w, dbs) starch. A control starch diet consisting of corn starch was compared against three RS diets. The RS diets were high-amylose corn starch (HA7), HA7 chemically modified with octenyl succinic anhydride, and stearic-acid-complexed HA7 starch. A subgroup received antibiotic treatment to determine if perturbations in the gut microbiome were long lasting. A second subgroup was treated with azoxymethane (AOM), a carcinogen. At the end of the 8-week study, cecal and distal colon content samples were collected from the sacrificed rats. Metabolites were extracted from cecal and distal colon samples into acetonitrile. The extracts were then analyzed on an accurate-mass time-of-flight mass spectrometer to obtain their metabolic profile. The data were analyzed using partial least-squares discriminant analysis (PLS-DA). The PLS-DA analysis utilized a training set and verification set to classify samples within diet and treatment groups. PLS-DA could reliably differentiate the diet treatments for both cecal and distal colon samples. The PLS-DA analyses of the antibiotic and no antibiotic-treated subgroups were well classified for cecal samples and modestly separated for distal colon samples. PLS-DA analysis had limited success separating distal colon samples for rats given AOM from those not treated; the cecal samples from AOM had very poor classification. Mass spectrometry profiling coupled with PLS-DA can readily classify metabolite differences among rats given RS diets.

  2. Iterative random vs. Kennard-Stone sampling for IR spectrum-based classification task using PLS2-DA

    NASA Astrophysics Data System (ADS)

    Lee, Loong Chuen; Liong, Choong-Yeun; Jemain, Abdul Aziz

    2018-04-01

    External testing (ET) is preferred over auto-prediction (AP) or k-fold cross-validation for obtaining a more realistic estimate of the predictive ability of a statistical model. With IR spectra, the Kennard-Stone (KS) sampling algorithm is often used to split the data into training and test sets, i.e. respectively for model construction and for model testing. On the other hand, iterative random sampling (IRS) has not been the favored choice though it is theoretically more likely to produce reliable estimation. The aim of this preliminary work is to compare performances of KS and IRS in sampling a representative training set from an attenuated total reflectance - Fourier transform infrared spectral dataset (of four varieties of blue gel pen inks) for PLS2-DA modeling. The 'best' performance achievable from the dataset is estimated with AP on the full dataset (APF, error). Both IRS (n = 200) and KS were used to split the dataset in the ratio of 7:3. The classic decision rule (i.e. maximum value-based) is employed for new sample prediction via partial least squares - discriminant analysis (PLS2-DA). Error rate of each model was estimated repeatedly via: (a) AP on full data (APF, error); (b) AP on training set (APS, error); and (c) ET on the respective test set (ETS, error). A good PLS2-DA model is expected to produce APS, error and ETS, error that are similar to the APF, error. Bearing that in mind, the similarities between (a) APS, error vs. APF, error; (b) ETS, error vs. APF, error and; (c) APS, error vs. ETS, error were evaluated using correlation tests (i.e. Pearson and Spearman's rank test), using series of PLS2-DA models computed from KS-set and IRS-set, respectively. Overall, models constructed from IRS-set exhibit more similarities between the internal and external error rates than the respective KS-set, i.e. less risk of overfitting. In conclusion, IRS is more reliable than KS in sampling a representative training set.
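
    The two splitting strategies compared in this record can be sketched as follows. This is a minimal illustration and not the authors' code: it assumes Euclidean distance between spectra and the usual max-min formulation of Kennard-Stone, and the function names kennard_stone and random_split are hypothetical.

```python
import math
import random


def kennard_stone(X, n_select):
    """Return indices of n_select rows of X picked by Kennard-Stone (max-min) sampling.

    Assumes Euclidean distance; X is a list of equal-length numeric rows.
    """
    n = len(X)
    if not 2 <= n_select <= n:
        raise ValueError("n_select must be between 2 and len(X)")
    dist = [[math.dist(a, b) for b in X] for a in X]
    # Seed the training set with the two most distant samples.
    first, second = max(
        ((i, j) for i in range(n) for j in range(i + 1, n)),
        key=lambda pair: dist[pair[0]][pair[1]],
    )
    selected = [first, second]
    remaining = set(range(n)) - {first, second}
    while len(selected) < n_select:
        # Add the sample whose nearest selected neighbour is farthest away.
        nxt = max(sorted(remaining),
                  key=lambda r: min(dist[r][s] for s in selected))
        selected.append(nxt)
        remaining.remove(nxt)
    return selected


def random_split(n, train_fraction, rng):
    """One random train/test split of range(n); repeat it to get iterative random sampling."""
    idx = list(range(n))
    rng.shuffle(idx)
    cut = round(train_fraction * n)
    return sorted(idx[:cut]), sorted(idx[cut:])


if __name__ == "__main__":
    toy = [[0.0], [1.0], [2.0], [10.0]]          # toy one-variable "spectra"
    print(kennard_stone(toy, 3))
    rng = random.Random(0)
    splits = [random_split(10, 0.7, rng) for _ in range(200)]  # 200 repeats, 7:3
    print(len(splits), len(splits[0][0]), len(splits[0][1]))
```

    Kennard-Stone is deterministic and favours samples at the edges of the data cloud, whereas each random split differs, which is why the random variant is repeated many times and the resulting error rates are summarised.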

  3. MALDI-TOF-MS with PLS Modeling Enables Strain Typing of the Bacterial Plant Pathogen Xanthomonas axonopodis

    NASA Astrophysics Data System (ADS)

    Sindt, Nathan M.; Robison, Faith; Brick, Mark A.; Schwartz, Howard F.; Heuberger, Adam L.; Prenni, Jessica E.

    2018-02-01

    Matrix-assisted laser desorption/ionization time of flight mass spectrometry (MALDI-TOF-MS) is a fast and effective tool for microbial species identification. However, current approaches are limited to species-level identification even when genetic differences are known. Here, we present a novel workflow that applies the statistical method of partial least squares discriminant analysis (PLS-DA) to MALDI-TOF-MS protein fingerprint data of Xanthomonas axonopodis, an important bacterial plant pathogen of fruit and vegetable crops. Mass spectra of 32 X. axonopodis strains were used to create a mass spectral library and PLS-DA was employed to model the closely related strains. A robust workflow was designed to optimize the PLS-DA model by assessing the model performance over a range of signal-to-noise ratios (s/n) and mass filter (MF) thresholds. The optimized parameters were observed to be s/n = 3 and MF = 0.7. The model correctly classified 83% of spectra withheld from the model as a test set. A new decision rule was developed, termed the rolled-up Maximum Decision Rule (ruMDR), and this method improved identification rates to 92%. These results demonstrate that MALDI-TOF-MS protein fingerprints of bacterial isolates can be utilized to enable identification at the strain level. Furthermore, the open-source framework of this workflow allows for broad implementation across various instrument platforms as well as integration with alternative modeling and classification algorithms.

  4. Nontargeted metabolomics approach for the differentiation of cultivation ages of mountain cultivated ginseng leaves using UHPLC/QTOF-MS.

    PubMed

    Chang, Xiangwei; Zhang, Juanjuan; Li, Dekun; Zhou, Dazheng; Zhang, Yuling; Wang, Jincheng; Hu, Bing; Ju, Aichun; Ye, Zhengliang

    2017-07-15

    The adulteration or falsification of the cultivation age of mountain cultivated ginseng (MCG) has been a serious problem in the commercial MCG market. To develop an efficient discrimination tool for the cultivation age and to explore potential age-dependent markers, an optimized ultra high-performance liquid chromatography/quadrupole time-of-flight mass spectrometry (UHPLC/QTOF-MS)-based metabolomics approach was applied in the global metabolite profiling of 156 MCG leaf (MGL) samples aged from 6 to 18 years. Multivariate statistical methods such as principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) were used to compare the derived patterns between MGL samples of different cultivation ages. The present study demonstrated that 6-18-year-old MGL samples can be successfully discriminated using two simple successive steps, together with four PLS-DA discrimination models. Furthermore, 39 robust age-dependent markers enabling differentiation among the 6-18-year-old MGL samples were discovered. The results were validated by a permutation test and an external test set to verify the predictability and reliability of the established discrimination models. More importantly, without destroying the MCG roots, the proposed approach could also be applied to discriminate MCG root ages indirectly, using a minimum amount of homophyletic MGL samples combined with the established four PLS-DA models and identified markers. Additionally, to the best of our knowledge, this is the first study in which 6-18-year-old MCG root ages have been nondestructively differentiated by analyzing homophyletic MGL samples using UHPLC/QTOF-MS analysis and two simple successive steps together with four PLS-DA models. The method developed in this study can be used as a standard protocol for discriminating and predicting MGL ages directly and homophyletic MCG root ages indirectly. Copyright © 2017 Elsevier B.V. All rights reserved.

  5. Non-Destructive Quality Evaluation of Pepper (Capsicum annuum L.) Seeds Using LED-Induced Hyperspectral Reflectance Imaging

    PubMed Central

    Mo, Changyeun; Kim, Giyoung; Lee, Kangjin; Kim, Moon S.; Cho, Byoung-Kwan; Lim, Jongguk; Kang, Sukwon

    2014-01-01

    In this study, we developed a viability evaluation method for pepper (Capsicum annuum L.) seeds based on hyperspectral reflectance imaging. The reflectance spectra of pepper seeds in the 400–700 nm range are collected from hyperspectral reflectance images obtained using blue, green, and red LED illumination. A partial least squares–discriminant analysis (PLS-DA) model is developed to classify viable and non-viable seeds. Four spectral ranges generated with four types of LEDs (blue, green, red, and RGB), which were pretreated using various methods, are investigated to develop the classification models. The optimal PLS-DA model based on the standard normal variate for RGB LED illumination (400–700 nm) yields discrimination accuracies of 96.7% and 99.4% for viable seeds and nonviable seeds, respectively. The use of images based on the PLS-DA model with the first-order derivative of a 31.5-nm gap for red LED illumination (600–700 nm) yields 100% discrimination accuracy for both viable and nonviable seeds. The results indicate that a hyperspectral imaging technique based on LED light can be potentially applied to high-quality pepper seed sorting. PMID:24763251
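
    The standard normal variate (SNV) pretreatment named in this record centres and scales each spectrum by its own mean and standard deviation. The sketch below is an illustration only: the function name snv is hypothetical, and the use of the sample (n - 1) standard deviation is an assumption, since the record does not say which form was used.

```python
import statistics


def snv(spectrum):
    """Standard normal variate: (x - mean) / standard deviation, per spectrum.

    Uses the sample (n - 1) standard deviation, which is an assumption here.
    """
    mean = statistics.fmean(spectrum)
    sd = statistics.stdev(spectrum)
    if sd == 0:
        raise ValueError("a flat spectrum cannot be SNV-scaled")
    return [(value - mean) / sd for value in spectrum]


if __name__ == "__main__":
    reflectance = [0.21, 0.25, 0.32, 0.30, 0.27]  # made-up reflectance values
    print(snv(reflectance))
```

    Because every spectrum is scaled independently, SNV removes additive offsets and multiplicative scatter effects without needing a reference spectrum.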

  6. Non-destructive quality evaluation of pepper (Capsicum annuum L.) seeds using LED-induced hyperspectral reflectance imaging.

    PubMed

    Mo, Changyeun; Kim, Giyoung; Lee, Kangjin; Kim, Moon S; Cho, Byoung-Kwan; Lim, Jongguk; Kang, Sukwon

    2014-04-24

    In this study, we developed a viability evaluation method for pepper (Capsicum annuum L.) seeds based on hyperspectral reflectance imaging. The reflectance spectra of pepper seeds in the 400-700 nm range are collected from hyperspectral reflectance images obtained using blue, green, and red LED illumination. A partial least squares-discriminant analysis (PLS-DA) model is developed to classify viable and non-viable seeds. Four spectral ranges generated with four types of LEDs (blue, green, red, and RGB), which were pretreated using various methods, are investigated to develop the classification models. The optimal PLS-DA model based on the standard normal variate for RGB LED illumination (400-700 nm) yields discrimination accuracies of 96.7% and 99.4% for viable seeds and nonviable seeds, respectively. The use of images based on the PLS-DA model with the first-order derivative of a 31.5-nm gap for red LED illumination (600-700 nm) yields 100% discrimination accuracy for both viable and nonviable seeds. The results indicate that a hyperspectral imaging technique based on LED light can be potentially applied to high-quality pepper seed sorting.

  7. Phenolic Analysis and Theoretic Design for Chinese Commercial Wines' Authentication.

    PubMed

    Li, Si-Yu; Zhu, Bao-Qing; Reeves, Malcolm J; Duan, Chang-Qing

    2018-01-01

    To develop a robust tool for Chinese commercial wines' varietal, regional, and vintage authentication, phenolic compounds in 121 Chinese commercial dry red wines were detected and quantified by using high-performance liquid chromatography triple-quadrupole mass spectrometry (HPLC-QqQ-MS/MS), and differentiation abilities of principal component analysis (PCA), partial least squares discriminant analysis (PLS-DA), and orthogonal partial least squares discriminant analysis (OPLS-DA) were compared. Better than PCA and PLS-DA, OPLS-DA models used to differentiate wines according to their varieties (Cabernet Sauvignon or other varieties), regions (east or west Cabernet Sauvignon wines), and vintages (young or old Cabernet Sauvignon wines) were ideally established. The S-plot provided in OPLS-DA models showed the key phenolic compounds which were both statistically and biochemically significant in sample differentiation. Besides, the potential of the OPLS-DA models in deeper sample differentiating of more detailed regional and vintage information of wines was proved optimistic. On the basis of our results, a promising theoretic design for wine authentication was further proposed for the first time, which might be helpful in practical authentication of more commercial wines. The phenolic data of 121 Chinese commercial dry red wines was processed with different statistical tools for varietal, regional, and vintage differentiation. A promising theoretical design was summarized, which might be helpful for wine authentication in practical situation. © 2017 Institute of Food Technologists®.

  8. Voltammetric fingerprinting of oils and its combination with chemometrics for the detection of extra virgin olive oil adulteration.

    PubMed

    Tsopelas, Fotios; Konstantopoulos, Dimitris; Kakoulidou, Anna Tsantili

    2018-07-26

    In the present work, two approaches for the voltammetric fingerprinting of oils and their combination with chemometrics were investigated in order to detect the adulteration of extra virgin olive oil with olive pomace oil as well as the most common seed oils, namely sunflower, soybean and corn oil. In particular, cyclic voltammograms of diluted extra virgin olive oils, regular (pure) olive oils (blends of refined olive oils with virgin olive oils), olive pomace oils and seed oils in presence of dichloromethane and 0.1 M of LiClO4 in EtOH as electrolyte were recorded at a glassy carbon working electrode. Cyclic voltammetry was also employed in methanolic extracts of olive and seed oils. Datapoints of cyclic voltammograms were exported and submitted to Principal Component Analysis (PCA), Partial Least Square-Discriminant Analysis (PLS-DA) and soft independent modeling of class analogy (SIMCA). In diluted oils, PLS-DA provided a clear discrimination between olive oils (extra virgin and regular) and olive pomace/seed oils, while SIMCA showed a clear discrimination of extra virgin olive oil in regard to all other samples. Using methanolic extracts and considering datapoints recorded between 0.6 and 1.3 V, PLS-DA provided more information, resulting in three clusters (extra virgin olive oils, regular olive oils and seed/olive pomace oils), while SIMCA showed inferior performance. For the quantification of extra virgin olive oil adulteration with olive pomace oil or seed oils, a model based on Partial Least Square (PLS) analysis was developed. Detection limit of adulteration in olive oil was found to be 2% (v/v) and the linearity range up to 33% (v/v). Validation and applicability of all models was proved using a suitable test set. In the case of PLS, synthetic oil mixtures with 4 known adulteration levels in the range of 4-26% were also employed as a blind test set. Copyright © 2018 Elsevier B.V. All rights reserved.

  9. Chemometric analysis for discrimination of extra virgin olive oils from whole and stoned olive pastes.

    PubMed

    De Luca, Michele; Restuccia, Donatella; Clodoveo, Maria Lisa; Puoci, Francesco; Ragno, Gaetano

    2016-07-01

    Chemometric discrimination of extra virgin olive oils (EVOO) from whole and stoned olive pastes was carried out by using Fourier transform infrared (FTIR) data and partial least squares-discriminant analysis (PLS1-DA) approach. Four Italian commercial EVOO brands, all in both whole and stoned version, were considered in this study. The adopted chemometric methodologies were able to describe the different chemical features in phenolic and volatile compounds contained in the two types of oil by using unspecific IR spectral information. Principal component analysis (PCA) was employed in cluster analysis to capture data patterns and to highlight differences between technological processes and EVOO brands. The PLS1-DA algorithm was used as supervised discriminant analysis to identify the different oil extraction procedures. Discriminant analysis was extended to the evaluation of possible adulteration by addition of aliquots of oil from whole paste to the most valuable oil from stoned olives. The statistical parameters from external validation of all the PLS models were very satisfactory, with low root mean square error of prediction (RMSEP) and relative error (RE%). Copyright © 2016 Elsevier Ltd. All rights reserved.

  10. A metabolic fingerprinting approach based on selected ion flow tube mass spectrometry (SIFT-MS) and chemometrics: A reliable tool for Mediterranean origin-labeled olive oils authentication.

    PubMed

    Bajoub, Aadil; Medina-Rodríguez, Santiago; Ajal, El Amine; Cuadros-Rodríguez, Luis; Monasterio, Romina Paula; Vercammen, Joeri; Fernández-Gutiérrez, Alberto; Carrasco-Pancorbo, Alegría

    2018-04-01

    Selected ion flow tube mass spectrometry (SIFT-MS) in combination with chemometrics was used to authenticate the geographical origin of Mediterranean virgin olive oils (VOOs) produced under geographical origin labels. In particular, 130 oil samples from six different Mediterranean regions (Kalamata (Greece); Toscana (Italy); Meknès and Tyout (Morocco); and Priego de Córdoba and Baena (Spain)) were considered. The headspace volatile fingerprints were measured by SIFT-MS in full scan with H3O+, NO+ and O2+ as precursor ions and the results were subjected to chemometric treatments. Principal Component Analysis (PCA) was used for preliminary multivariate data analysis and Partial Least Squares-Discriminant Analysis (PLS-DA) was applied to build different models (considering the three reagent ions) to classify samples according to the country of origin and regions (within the same country). The multi-class PLS-DA models showed very good performance in terms of fitting accuracy (98.90-100%) and prediction accuracy (96.70-100% accuracy for cross validation and 97.30-100% accuracy for external validation (test set)). Considering the two-class PLS-DA models, the one for the Spanish samples showed 100% sensitivity, specificity and accuracy in calibration, cross validation and external validation; the model for Moroccan oils also showed very satisfactory results (with perfect scores for almost every parameter in all the cases). Copyright © 2017 Elsevier Ltd. All rights reserved.

  11. Discrimination of geographical origin and detection of adulteration of kudzu root by fluorescence spectroscopy coupled with multi-way pattern recognition

    NASA Astrophysics Data System (ADS)

    Hu, Leqian; Ma, Shuai; Yin, Chunling

    2018-03-01

    In this work, fluorescence spectroscopy combined with multi-way pattern recognition techniques was developed for determining the geographical origin of kudzu root and for detecting and quantifying adulterants in kudzu root. Excitation-emission matrix (EEM) spectra were obtained for 150 pure kudzu root samples of different geographical origins and 150 fake kudzu roots with different adulteration proportions by recording emission from 330 to 570 nm with excitation in the range of 320-480 nm. Multi-way principal components analysis (M-PCA) and multilinear partial least squares discriminant analysis (N-PLS-DA) methods were used to decompose the excitation-emission matrix datasets. The 150 pure kudzu root samples could be differentiated exactly from each other according to their geographical origins by the M-PCA and N-PLS-DA models. For the adulterated kudzu root samples, N-PLS-DA gave better and more reliable classification results than the M-PCA model. The results obtained in this study indicated that EEM spectroscopy coupled with multi-way pattern recognition could be used as an easy, rapid and novel tool to distinguish the geographical origin of kudzu root and detect adulterated kudzu root. Besides, this method was also suitable for determining the geographic origin and detecting the adulteration of other foodstuffs that fluoresce.

  12. Multi-element fingerprinting as a tool in origin authentication of four east China marine species.

    PubMed

    Guo, Lipan; Gong, Like; Yu, Yanlei; Zhang, Hong

    2013-12-01

    The contents of 25 elements in 4 types of commercial marine species from the East China Sea were determined by inductively coupled plasma mass spectrometry and atomic absorption spectrometry. The elemental composition was used to differentiate marine species according to geographical origin by multivariate statistical analysis. The results showed that principal component analysis could distinguish samples from different areas and reveal the elements which played the most important role in origin diversity. The established models by partial least squares discriminant analysis (PLS-DA) and by probabilistic neural network (PNN) can both precisely predict the origin of the marine species. Further study indicated that PLS-DA and PNN were efficacious in regional discrimination. The models from these 2 statistical methods, with an accuracy of 97.92% and 100%, respectively, could both distinguish samples from different areas without the need for species differentiation. © 2013 Institute of Food Technologists®

  13. Paper spray mass spectrometry and PLS-DA improved by variable selection for the forensic discrimination of beers.

    PubMed

    Pereira, Hebert Vinicius; Amador, Victória Silva; Sena, Marcelo Martins; Augusti, Rodinei; Piccin, Evandro

    2016-10-12

    Paper spray mass spectrometry (PS-MS) combined with partial least squares discriminant analysis (PLS-DA) was applied for the first time in a forensic context to a fast and effective differentiation of beers. Eight different brands of American standard lager beers produced by four different breweries (141 samples from 55 batches) were studied with the aim of performing a differentiation according to their market prices. The three leading brands in the Brazilian beer market, which have been subject to fraud, were modeled as the higher-price class, while the five brands most used for counterfeiting were modeled as the lower-price class. Parameters affecting the paper spray ionization were examined and optimized. The best MS signal stability and intensity was obtained while using the positive ion mode, with PS(+) mass spectra characterized by intense pairs of signals corresponding to sodium and potassium adducts of malto-oligosaccharides. Discrimination was not apparent by either visual inspection or principal component analysis (PCA). However, supervised classification models provided high rates of sensitivity and specificity. A PLS-DA model using full scan mass spectra was improved by variable selection with ordered predictors selection (OPS), providing a 100% reliability rate and reducing the number of variables from 1701 to 60. This model was interpreted by detecting fifteen variables as the most significant VIP (variable importance in projection) scores, which were therefore considered diagnostic ions for this type of beer counterfeit. Copyright © 2016 Elsevier B.V. All rights reserved.

  14. Sparse PLS discriminant analysis: biologically relevant feature selection and graphical displays for multiclass problems.

    PubMed

    Lê Cao, Kim-Anh; Boitard, Simon; Besse, Philippe

    2011-06-22

    Variable selection on high throughput biological data, such as gene expression or single nucleotide polymorphisms (SNPs), becomes inevitable to select relevant information and, therefore, to better characterize diseases or assess genetic structure. There are different ways to perform variable selection in large data sets. Statistical tests are commonly used to identify differentially expressed features for explanatory purposes, whereas Machine Learning wrapper approaches can be used for predictive purposes. In the case of multiple highly correlated variables, another option is to use multivariate exploratory approaches to give more insight into cell biology, biological pathways or complex traits. A simple extension of a sparse PLS exploratory approach is proposed to perform variable selection in a multiclass classification framework. sPLS-DA has a classification performance similar to other wrapper or sparse discriminant analysis approaches on public microarray and SNP data sets. More importantly, sPLS-DA is clearly competitive in terms of computational efficiency and superior in terms of interpretability of the results via valuable graphical outputs. sPLS-DA is available in the R package mixOmics, which is dedicated to the analysis of large biological data sets.

  15. Differentiation of Crataegus spp. guided by nuclear magnetic resonance spectrometry with chemometric analyses.

    PubMed

    Lund, Jensen A; Brown, Paula N; Shipley, Paul R

    2017-09-01

    For compliance with US Current Good Manufacturing Practice regulations for dietary supplements, manufacturers must provide identity of source plant material. Despite the popularity of hawthorn as a dietary supplement, relatively little is known about the comparative phytochemistry of different hawthorn species, and in particular North American hawthorns. The combination of NMR spectrometry with chemometric analyses offers an innovative approach to differentiating hawthorn species and exploring the phytochemistry. Two European and two North American species, harvested from a farm trial in late summer 2008, were analyzed by standard 1D 1H and J-resolved (JRES) experiments. The data were preprocessed and modelled by principal component analysis (PCA). A supervised model was then generated by partial least squares-discriminant analysis (PLS-DA) for classification and evaluated by cross validation. Supervised random forests models were constructed from the dataset to explore the potential of machine learning for identification of unique patterns across species. 1D 1H NMR data yielded increased differentiation over the JRES data. The random forests results correlated with PLS-DA results and outperformed PLS-DA in classification accuracy. In all of these analyses differentiation of the Crataegus spp. was best achieved by focusing on the NMR spectral region that contains signals unique to plant phenolic compounds. Identification of potentially significant metabolites for differentiation between species was approached using univariate techniques including significance analysis of microarrays and Kruskal-Wallis tests. Copyright © 2017 Elsevier Ltd. All rights reserved.

  16. Traceability of Boletaceae mushrooms using data fusion of UV-visible and FTIR combined with chemometrics methods.

    PubMed

    Yao, Sen; Li, Tao; Liu, HongGao; Li, JieQing; Wang, YuanZhong

    2018-04-01

    Boletaceae mushrooms are wild-grown edible mushrooms with high nutritional value, delicious flavor and large economic value that are distributed in Yunnan Province, China. Traceability is important for the authentication and quality assessment of Boletaceae mushrooms. In this study, UV-visible and Fourier transform infrared (FTIR) spectroscopies were applied for traceability of 247 Boletaceae mushroom samples in combination with chemometrics. Compared with a single spectroscopy technique, data fusion strategy can obviously improve the classification performance in partial least square discriminant analysis (PLS-DA) and grid-search support vector machine (GS-SVM) models, for both species and geographical origin traceability. In addition, PLS-DA and GS-SVM models can provide 100.00% accuracy for species traceability and have reliable evaluation parameters. For geographical origin traceability, the accuracy of prediction in the PLS-DA model by data fusion was just 64.63%, but the GS-SVM model based on data fusion was 100.00%. The results demonstrated that the data fusion strategy of UV-visible and FTIR combined with GS-SVM could provide a higher synergic effect for traceability of Boletaceae mushrooms and have a good generalization ability for the comprehensive quality control and evaluation of similar foods. © 2017 Society of Chemical Industry.

  17. The Plasmin-Sensitive Protein Pls in Methicillin-Resistant Staphylococcus aureus (MRSA) Is a Glycoprotein.

    PubMed

    Bleiziffer, Isabelle; Eikmeier, Julian; Pohlentz, Gottfried; McAulay, Kathryn; Xia, Guoqing; Hussain, Muzaffar; Peschel, Andreas; Foster, Simon; Peters, Georg; Heilmann, Christine

    2017-01-01

    Most bacterial glycoproteins identified to date are virulence factors of pathogenic bacteria, i.e. adhesins and invasins. However, the impact of protein glycosylation on the major human pathogen Staphylococcus aureus remains incompletely understood. To study protein glycosylation in staphylococci, we analyzed lysostaphin lysates of methicillin-resistant Staphylococcus aureus (MRSA) strains by SDS-PAGE and subsequent periodic acid-Schiff's staining. We detected four (>300, ∼250, ∼165, and ∼120 kDa) and two (>300 and ∼175 kDa) glycosylated surface proteins with strain COL and strain 1061, respectively. The ∼250, ∼165, and ∼175 kDa proteins were identified as plasmin-sensitive protein (Pls) by mass spectrometry. Previously, Pls has been demonstrated to be a virulence factor in a mouse septic arthritis model. The pls gene is encoded by the staphylococcal cassette chromosome (SCC)mec type I in MRSA that also encodes the methicillin resistance-conferring mecA and further genes. In a search for glycosyltransferases, we identified two open reading frames encoded downstream of pls on the SCCmec element, which we termed gtfC and gtfD. Expression and deletion analysis revealed that both gtfC and gtfD mediate glycosylation of Pls. Additionally, the recently reported glycosyltransferases SdgA and SdgB are involved in Pls glycosylation. Glycosylation occurs at serine residues in the Pls SD-repeat region and modifying carbohydrates are N-acetylhexosaminyl residues. Functional characterization revealed that Pls can confer increased biofilm formation, which seems to involve two distinct mechanisms. The first mechanism depends on glycosylation of the SD-repeat region by GtfC/GtfD and probably also involves eDNA, while the second seems to be independent of glycosylation as well as eDNA and may involve the centrally located G5 domains. Other previously known Pls properties are not related to the sugar modifications. 
    In conclusion, Pls is a glycoprotein and Pls glycosyl residues can stimulate biofilm formation. Thus, sugar modifications may represent promising new targets for novel therapeutic or prophylactic measures against life-threatening S. aureus infections.

  18. Geographical provenance of palm oil by fatty acid and volatile compound fingerprinting techniques.

    PubMed

    Tres, A; Ruiz-Samblas, C; van der Veer, G; van Ruth, S M

    2013-04-15

    Analytical methods are required in addition to administrative controls to verify the geographical origin of vegetable oils such as palm oil in an objective manner. In this study the application of fatty acid and volatile organic compound fingerprinting in combination with chemometrics have been applied to verify the geographical origin of crude palm oil (continental scale). For this purpose 94 crude palm oil samples were collected from South East Asia (55), South America (11) and Africa (28). Partial least squares discriminant analysis (PLS-DA) was used to develop a hierarchical classification model by combining two consecutive binary PLS-DA models. First, a PLS-DA model was built to distinguish South East Asian from non-South East Asian palm oil samples. Then a second model was developed, only for the non-Asian samples, to discriminate African from South American crude palm oil. Models were externally validated by using them to predict the identity of new authentic samples. The fatty acid fingerprinting model revealed three misclassified samples. The volatile compound fingerprinting models showed an 88%, 100% and 100% accuracy for the South East Asian, African and American class, respectively. The verification of the geographical origin of crude palm oil is feasible by fatty acid and volatile compound fingerprinting. Further research is required to further validate the approach and to increase its spatial specificity to country/province scale. Copyright © 2012 Elsevier Ltd. All rights reserved.
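
    The hierarchical scheme described in this record, two consecutive binary models, can be sketched as a simple cascade. This is an illustration under stated assumptions and not the authors' implementation: each binary model is represented by a scoring function returning a predicted class-membership value, the 0.5 cut-off is an assumed convention for 0/1-coded responses, and every name below (hierarchical_origin, toy_score_sea, toy_score_africa) is hypothetical.

```python
from typing import Callable, Sequence

Score = Callable[[Sequence[float]], float]


def hierarchical_origin(sample: Sequence[float],
                        score_sea: Score,
                        score_africa: Score,
                        threshold: float = 0.5) -> str:
    """Classify a sample with two consecutive binary decisions.

    score_sea and score_africa stand in for fitted binary PLS-DA models;
    the 0.5 threshold is an assumption, not taken from the record.
    """
    # Step 1: South East Asian vs. non-South East Asian.
    if score_sea(sample) >= threshold:
        return "South East Asia"
    # Step 2: only non-Asian samples reach the second model.
    if score_africa(sample) >= threshold:
        return "Africa"
    return "South America"


# Toy stand-ins for fitted models, for demonstration only.
def toy_score_sea(sample: Sequence[float]) -> float:
    return 1.0 if sample[0] > 40.0 else 0.0


def toy_score_africa(sample: Sequence[float]) -> float:
    return 1.0 if sample[1] > 10.0 else 0.0


if __name__ == "__main__":
    for s in ([45.0, 5.0], [30.0, 12.0], [30.0, 5.0]):
        print(s, "->", hierarchical_origin(s, toy_score_sea, toy_score_africa))
```

    Splitting a three-class problem into two binary steps lets each model be trained and validated on the contrast it is best suited to, at the cost that an error in the first step cannot be corrected by the second.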

  19. Recognition of beer brand based on multivariate analysis of volatile fingerprint.

    PubMed

    Cajka, Tomas; Riddellova, Katerina; Tomaniova, Monika; Hajslova, Jana

    2010-06-18

    An automated head-space solid-phase microextraction (HS-SPME)-based sampling procedure, coupled to gas chromatography-time-of-flight mass spectrometry (GC-TOFMS), was developed and employed to obtain fingerprints (GC profiles) of beer volatiles. In total, 265 speciality beer samples were collected over a 1-year period with the aim to distinguish, based on analytical (profiling) data, (i) the beers labelled as Rochefort 8; (ii) a group consisting of Rochefort 6, 8, 10 beers; and (iii) Trappist beers. For the chemometric evaluation of the data, partial least squares discriminant analysis (PLS-DA), linear discriminant analysis (LDA), and artificial neural networks with multilayer perceptrons (ANN-MLP) were tested. The best prediction ability was obtained for the model that distinguished a group of Rochefort 6, 8, 10 beers from the rest of the beers. In this case, all chemometric tools employed provided 100% correct classification. Slightly worse prediction abilities were achieved for the models "Trappist vs. non-Trappist beers" with the values of 93.9% (PLS-DA), 91.9% (LDA) and 97.0% (ANN-MLP) and "Rochefort 8 vs. the rest" with the values of 87.9% (PLS-DA), 84.8% (LDA) and 93.9% (ANN-MLP). In addition to chromatographic profiling, the potential of direct coupling of SPME (extraction/pre-concentration device) with high-resolution TOFMS employing a direct analysis in real time (DART) ion source has also been demonstrated as a challenging profiling approach. Copyright (c) 2010 Elsevier B.V. All rights reserved.

  20. Screening method for rapid classification of psychoactive substances in illicit tablets using mid infrared spectroscopy and PLS-DA.

    PubMed

    Pereira, Leandro S A; Lisboa, Fernanda L C; Coelho Neto, José; Valladão, Frederico N; Sena, Marcelo M

    2018-05-09

    Several new psychoactive substances (NPS) have reached the illegal drug market in recent years, and ecstasy-like tablets are one of the forms affected by this change. Cathinones and tryptamines have increasingly been found in ecstasy-like seized samples as well as other amphetamine type stimulants. A presumptive method for identifying different drugs in seized ecstasy tablets (n=92) using ATR-FTIR (attenuated total reflectance - Fourier transform infrared spectroscopy) and PLS-DA (partial least squares discriminant analysis) was developed. A hierarchical strategy of sequential modeling was performed with PLS-DA. The main model discriminated four classes: 5-MeO-MIPT, methylenedioxyamphetamines (MDMA and MDA), methamphetamine, and cathinones. Two submodels were built to identify drugs present in MDs and cathinones classes. Models were validated through the estimate of figures of merit. The average reliability rate (RLR) of the main model was 96.8% and accordance (ACC) was 100%. For the submodels, RLR and ACC were 100%. The reliability of the models was corroborated through their spectral interpretation. Thus, spectral assignments were performed by associating informative vectors of each specific modeled class to the respective drugs. The developed method is simple, fast, and can be applied to the forensic laboratory routine, leading to objective results reports useful for forensic scientists and law enforcement. Copyright © 2018 Elsevier B.V. All rights reserved.

  1. Detection of Genetically Modified Sugarcane by Using Terahertz Spectroscopy and Chemometrics

    NASA Astrophysics Data System (ADS)

    Liu, J.; Xie, H.; Zha, B.; Ding, W.; Luo, J.; Hu, C.

    2018-03-01

    A methodology is proposed to identify genetically modified sugarcane from non-genetically modified sugarcane by using terahertz spectroscopy and chemometrics techniques, including linear discriminant analysis (LDA), support vector machine-discriminant analysis (SVM-DA), and partial least squares-discriminant analysis (PLS-DA). The classification rate of the above mentioned methods is compared, and different types of preprocessing are considered. According to the experimental results, the best option is PLS-DA, with an identification rate of 98%. The results indicated that THz spectroscopy and chemometrics techniques are a powerful tool to identify genetically modified and non-genetically modified sugarcane.

  2. Detection of Lipitor counterfeits: a comparison of NIR and Raman spectroscopy in combination with chemometrics.

    PubMed

    de Peinder, P; Vredenbregt, M J; Visser, T; de Kaste, D

    2008-08-05

    Research has been carried out on the feasibility of near infrared (NIR) and Raman spectroscopy as rapid screening methods to discriminate between genuine and counterfeit tablets of the cholesterol-lowering medicine Lipitor. Classification, based on partial least squares discriminant analysis (PLS-DA) models, appears to be successful for both spectroscopic techniques, irrespective of whether atorvastatine or lovastatine has been used as the active pharmaceutical ingredient (API). The discriminative power of the NIR model, in particular, largely relies on the spectral differences of the tablet matrix. This is due to the relatively large sample volume that is probed with NIR and the strong spectroscopic activity of the excipients. PLS-DA models based on NIR or Raman spectra can also be applied to distinguish between atorvastatine and lovastatine as the API used in the counterfeits tested in this study. A disadvantage of Raman microscopy for this type of analysis is that it is primarily a surface technique. As a consequence, spectra of the coating and the tablet core might differ. Besides, spectra may change with the position of the laser if the sample is inhomogeneous. However, the robustness of the PLS-DA models turned out to be sufficiently large to allow a reliable discrimination. Principal component analysis (PCA) of the spectra revealed that the conditions under which tablets have been stored affect the NIR data. This effect is attributed to the adsorption of water from the atmosphere after unpacking from the blister. It implies that storage conditions should be taken into account when the NIR technique is used for discriminating purposes. However, in this study both models based on NIR spectra and Raman data enabled reliable discrimination between genuine and counterfeited Lipitor tablets, regardless of their storage conditions.

  3. Noninvasive Recognition and Biomarkers of Early Allergic Asthma in Cats Using Multivariate Statistical Analysis of NMR Spectra of Exhaled Breath Condensate

    PubMed Central

    Fulcher, Yan G.; Fotso, Martial; Chang, Chee-Hoon; Rindt, Hans; Reinero, Carol R.

    2016-01-01

    Asthma is prevalent in children and cats, and needs means of noninvasive diagnosis. We sought to distinguish noninvasively the differences in 53 cats before and soon after induction of allergic asthma, using NMR spectra of exhaled breath condensate (EBC). Statistical pattern recognition was improved considerably by preprocessing the spectra with probabilistic quotient normalization and glog transformation. Classification of the 106 preprocessed spectra by principal component analysis and partial least squares with discriminant analysis (PLS-DA) appears to be impaired by variances unrelated to eosinophilic asthma. By filtering out confounding variances, orthogonal signal correction (OSC) PLS-DA greatly improved the separation of the healthy and early asthmatic states, attaining 94% specificity and 94% sensitivity in predictions. OSC enhancement of multi-level PLS-DA boosted the specificity of the prediction to 100%. OSC-PLS-DA of the normalized spectra suggest the most promising biomarkers of allergic asthma in cats to include increased acetone, metabolite(s) with overlapped NMR peaks near 5.8 ppm, and a hydroxyphenyl-containing metabolite, as well as decreased phthalate. Acetone is elevated in the EBC of 74% of the cats with early asthma. The noninvasive detection of early experimental asthma, biomarkers in EBC, and metabolic perturbation invite further investigation of the diagnostic potential in humans. PMID:27764146
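
    The two preprocessing steps named in this abstract can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' pipeline: the reference spectrum is taken as the per-variable median across samples, any prior total-area normalization is omitted, and the glog form ln(x + sqrt(x^2 + lambda)) is one common variant whose lambda parameter here is arbitrary.

```python
import math
from statistics import median


def pqn_normalize(spectra):
    """Probabilistic quotient normalization of a list of spectra (one row per sample)."""
    n_vars = len(spectra[0])
    # Reference spectrum: median intensity of each variable across all samples.
    reference = [median(row[j] for row in spectra) for j in range(n_vars)]
    normalized = []
    for row in spectra:
        # Quotients of the sample against the reference (zero reference points skipped).
        quotients = [row[j] / reference[j] for j in range(n_vars) if reference[j] != 0]
        # The median quotient is taken as the most probable dilution factor.
        factor = median(quotients)
        normalized.append([x / factor for x in row])
    return normalized


def glog(x, lam=1e-8):
    """Generalized log transform (assumed variant); behaves like ln(2x) for large x."""
    return math.log(x + math.sqrt(x * x + lam))


# Two made-up "spectra" that differ only by a dilution factor of 2.
spectra = [[1.0, 2.0, 3.0], [2.0, 4.0, 6.0]]
normalized = pqn_normalize(spectra)
transformed = [[glog(x) for x in row] for row in normalized]
print(normalized)
```

    Because the two toy spectra differ only in overall dilution, they coincide after normalization, which is the effect PQN is meant to achieve.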

  4. Serum and urine metabolomic fingerprinting in diagnostics of inflammatory bowel diseases.

    PubMed

    Dawiskiba, Tomasz; Deja, Stanisław; Mulak, Agata; Ząbek, Adam; Jawień, Ewa; Pawełka, Dorota; Banasik, Mirosław; Mastalerz-Migas, Agnieszka; Balcerzak, Waldemar; Kaliszewski, Krzysztof; Skóra, Jan; Barć, Piotr; Korta, Krzysztof; Pormańczuk, Kornel; Szyber, Przemyslaw; Litarski, Adam; Młynarz, Piotr

    2014-01-07

    To evaluate the utility of serum and urine metabolomic analysis in diagnosing and monitoring of inflammatory bowel diseases (IBD). Serum and urine samples were collected from 24 patients with ulcerative colitis (UC), 19 patients with the Crohn's disease (CD) and 17 healthy controls. The activity of UC was assessed with the Simple Clinical Colitis Activity Index, while the activity of CD was determined using the Harvey-Bradshaw Index. The analysis of serum and urine samples was performed using proton nuclear magnetic resonance (NMR) spectroscopy. All spectra were exported to Matlab for preprocessing which resulted in two data matrixes for serum and urine. Prior to the chemometric analysis, both data sets were unit variance scaled. The differences in metabolite fingerprints were assessed using partial least-squares-discriminant analysis (PLS-DA). Receiver operating characteristic curves and area under curves were used to evaluate the quality and prediction performance of the obtained PLS-DA models. Metabolites responsible for separation in models were tested using STATISTICA 10 with the Mann-Whitney-Wilcoxon test and the Student's t test (α = 0.05). The comparison between the group of patients with active IBD and the group with IBD in remission provided good PLS-DA models (P value 0.002 for serum and 0.003 for urine). The metabolites that allowed to distinguish these groups were: N-acetylated compounds and phenylalanine (up-regulated in serum), low-density lipoproteins and very low-density lipoproteins (decreased in serum) as well as glycine (increased in urine) and acetoacetate (decreased in urine). The significant differences in metabolomic profiles were also found between the group of patients with active IBD and healthy control subjects providing the PLS-DA models with a very good separation (P value < 0.001 for serum and 0.003 for urine). 
The metabolites that were found to be the strongest biomarkers included in this case: leucine, isoleucine, 3-hydroxybutyric acid, N-acetylated compounds, acetoacetate, glycine, phenylalanine and lactate (increased in serum), creatine, dimethyl sulfone, histidine, choline and its derivatives (decreased in serum), as well as citrate, hippurate, trigonelline, taurine, succinate and 2-hydroxyisobutyrate (decreased in urine). No clear separation in PLS-DA models was found between CD and UC patients based on the analysis of serum and urine samples, although one metabolite (formate) in univariate statistical analysis was significantly lower in serum of patients with active CD, and two metabolites (alanine and N-acetylated compounds) were significantly higher in serum of patients with CD when comparing jointly patients in the remission and active phase of the diseases. Contrary to the results obtained from the serum samples, the analysis of urine samples allowed to distinguish patients with IBD in remission from healthy control subjects. The metabolites of importance included in this case up-regulated acetoacetate and down-regulated citrate, hippurate, taurine, succinate, glycine, alanine and formate. NMR-based metabolomic fingerprinting of serum and urine has the potential to be a useful tool in distinguishing patients with active IBD from those in remission.
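
    Two of the steps mentioned in this abstract, unit variance scaling and the area under the ROC curve, can be illustrated as follows. This is a hedged sketch with made-up numbers, not the study's Matlab code; unit variance scaling is assumed here to mean mean-centering followed by division by the sample standard deviation, and the AUC is computed through its pairwise-ranking interpretation.

```python
from statistics import mean, stdev


def unit_variance_scale(column):
    """Center one variable and scale it to unit sample variance (assumed definition)."""
    m, s = mean(column), stdev(column)
    return [(x - m) / s for x in column]


def roc_auc(scores_pos, scores_neg):
    """AUC as the probability that a positive sample outscores a negative one (ties count 0.5)."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Hypothetical intensities of one spectral variable and hypothetical classifier scores.
scaled = unit_variance_scale([1.0, 2.0, 3.0])
auc = roc_auc([0.9, 0.8, 0.4], [0.5, 0.3])
print(scaled, auc)
```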

  5. Serum and urine metabolomic fingerprinting in diagnostics of inflammatory bowel diseases

    PubMed Central

    Dawiskiba, Tomasz; Deja, Stanisław; Mulak, Agata; Ząbek, Adam; Jawień, Ewa; Pawełka, Dorota; Banasik, Mirosław; Mastalerz-Migas, Agnieszka; Balcerzak, Waldemar; Kaliszewski, Krzysztof; Skóra, Jan; Barć, Piotr; Korta, Krzysztof; Pormańczuk, Kornel; Szyber, Przemyslaw; Litarski, Adam; Młynarz, Piotr

    2014-01-01

    AIM: To evaluate the utility of serum and urine metabolomic analysis in diagnosing and monitoring of inflammatory bowel diseases (IBD). METHODS: Serum and urine samples were collected from 24 patients with ulcerative colitis (UC), 19 patients with the Crohn’s disease (CD) and 17 healthy controls. The activity of UC was assessed with the Simple Clinical Colitis Activity Index, while the activity of CD was determined using the Harvey-Bradshaw Index. The analysis of serum and urine samples was performed using proton nuclear magnetic resonance (NMR) spectroscopy. All spectra were exported to Matlab for preprocessing which resulted in two data matrixes for serum and urine. Prior to the chemometric analysis, both data sets were unit variance scaled. The differences in metabolite fingerprints were assessed using partial least-squares-discriminant analysis (PLS-DA). Receiver operating characteristic curves and area under curves were used to evaluate the quality and prediction performance of the obtained PLS-DA models. Metabolites responsible for separation in models were tested using STATISTICA 10 with the Mann-Whitney-Wilcoxon test and the Student’s t test (α = 0.05). RESULTS: The comparison between the group of patients with active IBD and the group with IBD in remission provided good PLS-DA models (P value 0.002 for serum and 0.003 for urine). The metabolites that allowed to distinguish these groups were: N-acetylated compounds and phenylalanine (up-regulated in serum), low-density lipoproteins and very low-density lipoproteins (decreased in serum) as well as glycine (increased in urine) and acetoacetate (decreased in urine). The significant differences in metabolomic profiles were also found between the group of patients with active IBD and healthy control subjects providing the PLS-DA models with a very good separation (P value < 0.001 for serum and 0.003 for urine). 
The metabolites that were found to be the strongest biomarkers included in this case: leucine, isoleucine, 3-hydroxybutyric acid, N-acetylated compounds, acetoacetate, glycine, phenylalanine and lactate (increased in serum), creatine, dimethyl sulfone, histidine, choline and its derivatives (decreased in serum), as well as citrate, hippurate, trigonelline, taurine, succinate and 2-hydroxyisobutyrate (decreased in urine). No clear separation in PLS-DA models was found between CD and UC patients based on the analysis of serum and urine samples, although one metabolite (formate) in univariate statistical analysis was significantly lower in serum of patients with active CD, and two metabolites (alanine and N-acetylated compounds) were significantly higher in serum of patients with CD when comparing jointly patients in the remission and active phase of the diseases. Contrary to the results obtained from the serum samples, the analysis of urine samples allowed to distinguish patients with IBD in remission from healthy control subjects. The metabolites of importance included in this case up-regulated acetoacetate and down-regulated citrate, hippurate, taurine, succinate, glycine, alanine and formate. CONCLUSION: NMR-based metabolomic fingerprinting of serum and urine has the potential to be a useful tool in distinguishing patients with active IBD from those in remission. PMID:24415869

  6. Metabolomic analysis of avocado fruits by GC-APCI-TOF MS: effects of ripening degrees and fruit varieties.

    PubMed

    Hurtado-Fernández, E; Pacchiarotta, T; Mayboroda, O A; Fernández-Gutiérrez, A; Carrasco-Pancorbo, A

    2015-01-01

    In order to investigate avocado fruit ripening, nontargeted GC-APCI-TOF MS metabolic profiling analyses were carried out. Principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) were used to explore the metabolic profiles from fruit samples of 13 varieties at two different ripening degrees. Mannoheptulose; pentadecylfuran; aspartic, malic, stearic, citric and pantothenic acids; mannitol; and β-sitosterol were some of the metabolites found as more influential for the PLS-DA model. The similarities among genetically related samples (putative mutants of "Hass") and their metabolic differences from the rest of the varieties under study have also been evaluated. The achieved results reveal new insights into avocado fruit composition and metabolite changes, demonstrating therefore the value of metabolomics as a functional genomics tool in characterizing the mechanism of fruit ripening development, a key developmental stage in most economically important fruit crops.

  7. Improved detection of highly energetic materials traces on surfaces by standoff laser-induced thermal emission incorporating neural networks

    NASA Astrophysics Data System (ADS)

    Figueroa-Navedo, Amanda; Galán-Freyle, Nataly Y.; Pacheco-Londoño, Leonardo C.; Hernández-Rivera, Samuel P.

    2013-05-01

    Terrorists conceal highly energetic materials (HEM) as Improvised Explosive Devices (IED) in various types of materials such as PVC, wood, Teflon, aluminum, acrylic, carton and rubber to disguise them from detection equipment used by military and security agency personnel. Infrared emissions (IREs) of substrates, with and without HEM, were measured to generate models for detection and discrimination. Multivariable analysis techniques such as principal component analysis (PCA), soft independent modeling by class analogy (SIMCA), partial least squares-discriminant analysis (PLS-DA), support vector machine (SVM) and neural networks (NN) were employed to generate models, in which the emission of IR light from heated samples was stimulated using a CO2 laser giving rise to laser induced thermal emission (LITE) of HEMs. Traces of a specific target threat chemical explosive, PETN, in surface concentrations of 10 to 300 µg/cm² were studied on the surfaces mentioned. A custom-built experimental setup used a CO2 laser as a heating source positioned with a telescope, where a minimal loss in reflective optics was reported, for the Mid-IR at a distance of 4 m and 32 scans at 10 s. SVM-DA was the best-performing statistical technique, with a discrimination performance of 97%. PLS-DA accurately predicted over 94% and NN 88%.

  8. Comparison of partial least squares and random forests for evaluating relationship between phenolics and bioactivities of Neptunia oleracea.

    PubMed

    Lee, Soo Yee; Mediani, Ahmed; Maulidiani, Maulidiani; Khatib, Alfi; Ismail, Intan Safinar; Zawawi, Norhasnida; Abas, Faridah

    2018-01-01

    Neptunia oleracea is a plant consumed as a vegetable and which has been used as a folk remedy for several diseases. Herein, two regression models (partial least squares, PLS; and random forest, RF) in a metabolomics approach were compared and applied to the evaluation of the relationship between phenolics and bioactivities of N. oleracea. In addition, the effects of different extraction conditions on the phenolic constituents were assessed by pattern recognition analysis. Comparison of the PLS and RF showed that RF exhibited poorer generalization and hence poorer predictive performance. Both the regression coefficient of PLS and the variable importance of RF revealed that quercetin and kaempferol derivatives, caffeic acid and vitexin-2-O-rhamnoside were significant towards the tested bioactivities. Furthermore, principal component analysis (PCA) and partial least squares-discriminant analysis (PLS-DA) results showed that sonication and absolute ethanol are the preferable extraction method and ethanol ratio, respectively, to produce N. oleracea extracts with high phenolic levels and therefore high DPPH scavenging and α-glucosidase inhibitory activities. Both PLS and RF are useful regression models in metabolomics studies. This work provides insight into the performance of different multivariate data analysis tools and the effects of different extraction conditions on the extraction of desired phenolics from plants. © 2017 Society of Chemical Industry.

  9. An integrated approach utilising chemometrics and GC/MS for classification of chamomile flowers, essential oils and commercial products.

    PubMed

    Wang, Mei; Avula, Bharathi; Wang, Yan-Hong; Zhao, Jianping; Avonto, Cristina; Parcher, Jon F; Raman, Vijayasankar; Zweigenbaum, Jerry A; Wylie, Philip L; Khan, Ikhlas A

    2014-01-01

    As part of an ongoing research program on authentication, safety and biological evaluation of phytochemicals and dietary supplements, an in-depth chemical investigation of different types of chamomile was performed. A collection of chamomile samples including authenticated plants, commercial products and essential oils was analysed by GC/MS. Twenty-seven authenticated plant samples representing three types of chamomile, viz. German chamomile, Roman chamomile and Juhua were analysed. This set of data was employed to construct a sample class prediction (SCP) model based on stepwise reduction of data dimensionality followed by principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA). The model was cross-validated with samples including authenticated plants and commercial products. The model demonstrated 100.0% accuracy for both recognition and prediction abilities. In addition, 35 commercial products and 11 essential oils purported to contain chamomile were subsequently predicted by the validated PLS-DA model. Furthermore, tentative identification of the marker compounds correlated with different types of chamomile was explored. Copyright © 2013 Elsevier Ltd. All rights reserved.

  10. Standoff detection of chemical and biological threats using laser-induced breakdown spectroscopy.

    PubMed

    Gottfried, Jennifer L; De Lucia, Frank C; Munson, Chase A; Miziolek, Andrzej W

    2008-04-01

    Laser-induced breakdown spectroscopy (LIBS) is a promising technique for real-time chemical and biological warfare agent detection in the field. We have demonstrated the detection and discrimination of the biological warfare agent surrogates Bacillus subtilis (BG) (2% false negatives, 0% false positives) and ovalbumin (0% false negatives, 1% false positives) at 20 meters using standoff laser-induced breakdown spectroscopy (ST-LIBS) and linear correlation. Unknown interferent samples (not included in the model), samples on different substrates, and mixtures of BG and Arizona road dust have been classified with reasonable success using partial least squares discriminant analysis (PLS-DA). A few of the samples tested such as the soot (not included in the model) and the 25% BG:75% dust mixture resulted in a significant number of false positives or false negatives, respectively. Our preliminary results indicate that while LIBS is able to discriminate biomaterials with similar elemental compositions at standoff distances based on differences in key intensity ratios, further work is needed to reduce the number of false positives/negatives by refining the PLS-DA model to include a sufficient range of material classes and carefully selecting a detection threshold. In addition, we have demonstrated that LIBS can distinguish five different organophosphate nerve agent simulants at 20 meters, despite their similar stoichiometric formulas. Finally, a combined PLS-DA model for chemical, biological, and explosives detection using a single ST-LIBS sensor has been developed in order to demonstrate the potential of standoff LIBS for universal hazardous materials detection.

  11. Potential of near-infrared spectroscopy for quality evaluation of cattle leather.

    PubMed

    Braz, Carlos Eduardo M; Jacinto, Manuel Antonio C; Pereira-Filho, Edenir R; Souza, Gilberto B; Nogueira, Ana Rita A

    2018-05-09

    Models using near-infrared spectroscopy (NIRS) were constructed based on physical-mechanical tests to determine the quality of cattle leather. The following official parameters were used, considering the industry requirements: tensile strength (TS), percentage elongation (%E), tear strength (TT), and double hole tear strength (DHS). Classification models were constructed with the use of k-nearest neighbor (kNN), soft independent modeling of class analogy (SIMCA), and partial least squares-discriminant analysis (PLS-DA). The evaluated figures of merit, accuracy, sensitivity, and specificity presented results between 85% and 93%, and the false alarm rates from 9% to 14%. The model with lowest validation percentage (92%) was kNN, and the highest was PLS-DA (100%). For TS, lower values were obtained, from 52% for kNN and 74% for SIMCA. The other parameters %E, TT, and DHS presented hit rates between 87 and 100%. The abilities of the models were similar, showing they can be used to predict the quality of cattle leather. Copyright © 2018 Elsevier B.V. All rights reserved.
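
    The figures of merit listed in this abstract are usually derived from a binary confusion matrix. The sketch below uses the conventional definitions, which the abstract itself does not spell out, so treat them as an assumption; in particular the false alarm rate is taken as the false positive rate (1 minus specificity). The counts are invented for illustration.

```python
def figures_of_merit(tp, fp, tn, fn):
    """Common classification figures of merit from confusion-matrix counts."""
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "sensitivity": tp / (tp + fn),       # true positive rate
        "specificity": tn / (tn + fp),       # true negative rate
        "false_alarm_rate": fp / (fp + tn),  # false positive rate (assumed definition)
    }


# Hypothetical counts: 50 positive and 50 negative samples.
fom = figures_of_merit(tp=45, fp=10, tn=40, fn=5)
for name, value in fom.items():
    print(f"{name}: {value:.2f}")
```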

  12. 6C.04: INTEGRATED SNP ANALYSIS AND METABOLOMIC PROFILES OF METABOLIC SYNDROME.

    PubMed

    Marrachelli, V; Monleon, D; Morales, J M; Rentero, P; Martínez, F; Chaves, F J; Martin-Escudero, J C; Redon, J

    2015-06-01

    Metabolic syndrome (MS) has become a health and financial burden worldwide. Susceptibility of genetically determined metabotype of MS has not yet been investigated. We aimed to identify a distinctive metabolic profile of blood serum which might correlate with the early detection of the development of MS associated with genetic polymorphism. We applied high resolution NMR spectroscopy to profile blood serum from patients without MS (n = 945) or with (n = 291). Principal component analysis (PCA) and projection to latent structures for discriminant analysis (PLS-DA) were applied to NMR spectral datasets. Results were cross-validated using the Venetian Blinds approach. Additionally, five SNPs previously associated with MS were genotyped with SNPlex and tested for associations between the metabolic profiles and the genetic variants. Statistical analysis was performed using in-house MATLAB scripts and the PLS Toolbox statistical multivariate analysis library. Our analysis provided a PLS-DA Metabolic Syndrome discrimination model based on NMR metabolic profile (AUC = 0.86) with 84% sensitivity and 72% specificity. The model identified 11 metabolites differentially regulated in patients with MS. Among others, fatty acids, glucose, alanine, hydroxyisovalerate, acetone, trimethylamine, 2-phenylpropionate, isobutyrate and valine significantly contributed to the model. The combined analysis of metabolomics and SNP data revealed an association between the metabolic profile of MS and gene polymorphisms involved in the adiposity regulation and fatty acids metabolism: rs2272903_TT (TFAP2B), rs3803_TT (GATA2), rs174589_CC (FADS2) and rs174577_AA (FADS2). In addition, individuals with the rs2272903-TT genotype seem to develop MS earlier than the general population. Our study provides new insights on the metabolic alterations associated with a MS high-risk genotype. These results could help in future development of risk assessment and predictive models for subclinical cardiovascular disease.
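
    The Venetian Blinds cross-validation mentioned in this abstract assigns samples to folds in an interleaved pattern. A minimal sketch of the index bookkeeping is given below; the function name and the choice of 3 splits are illustrative assumptions, and the abstract does not state how many splits were used.

```python
def venetian_blinds(n_samples, n_splits):
    """Yield (train_indices, test_indices); fold k tests samples k, k + n_splits, k + 2*n_splits, ..."""
    for k in range(n_splits):
        test = [i for i in range(n_samples) if i % n_splits == k]
        train = [i for i in range(n_samples) if i % n_splits != k]
        yield train, test


# Example with 7 samples and 3 interleaved folds.
folds = list(venetian_blinds(7, 3))
for train, test in folds:
    print("train:", train, "test:", test)
```

    Because neighbouring samples fall into different folds, this scheme is sensitive to sample ordering; replicates stored next to each other can end up in both the training and the test set.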

  13. Paper spray mass spectrometry and chemometric tools for a fast and reliable identification of counterfeit blended Scottish whiskies.

    PubMed

    Teodoro, Janaína Aparecida Reis; Pereira, Hebert Vinicius; Sena, Marcelo Martins; Piccin, Evandro; Zacca, Jorge Jardim; Augusti, Rodinei

    2017-12-15

    A direct method based on the application of paper spray mass spectrometry (PS-MS) combined with a chemometric supervised method (partial least squares discriminant analysis, PLS-DA) was developed and applied to the discrimination of authentic and counterfeit samples of blended Scottish whiskies. The developed methodology employed negative ion mode MS and included 44 authentic whiskies from diverse brands and batches and 44 counterfeit samples of the same brands seized during operations of the Brazilian Federal Police, totaling 88 samples. An exploratory principal component analysis (PCA) model showed a reasonable discrimination of the counterfeit whiskies in PC2. In spite of the sample heterogeneity, a robust, reliable and accurate PLS-DA model was generated and validated, which was able to correctly classify the samples with nearly 100% success rate. The use of PS-MS also allowed the identification of the main marker compounds associated with each type of sample analyzed: authentic or counterfeit. Copyright © 2017 Elsevier Ltd. All rights reserved.

  14. The effect of thermal treatment on the enhancement of detection of adulteration in extra virgin olive oils by synchronous fluorescence spectroscopy and chemometric analysis.

    PubMed

    Mabood, F; Boqué, R; Folcarelli, R; Busto, O; Jabeen, F; Al-Harrasi, Ahmed; Hussain, J

    2016-05-15

    In this study the effect of thermal treatment on the enhancement of synchronous fluorescence spectroscopic method for discrimination and quantification of pure extra virgin olive oil (EVOO) samples from EVOO samples adulterated with refined oil was investigated. Two groups of samples were used. One group was analyzed at room temperature (25 °C) and the other group was thermally treated in a thermostatic water bath at 75 °C for 8h, in contact with air and with light exposure, to favor oxidation. All the samples were then measured with synchronous fluorescence spectroscopy. Synchronous fluorescence spectra were acquired by varying the wavelength in the region from 250 to 720 nm at 20 nm wavelength differential interval of excitation and emission. Pure and adulterated olive oils were discriminated by using partial least-squares discriminant analysis (PLS-DA). It was found that the best PLS-DA models were those built with the difference spectra (75 °C-25 °C), which were able to discriminate pure from adulterated oils at a 2% level of adulteration of refined olive oils. Furthermore, PLS regression models were also built to quantify the level of adulteration. Again, the best model was the one built with the difference spectra, with a prediction error of 3.18% of adulteration. Copyright © 2016 Elsevier B.V. All rights reserved.

  15. The effect of thermal treatment on the enhancement of detection of adulteration in extra virgin olive oils by synchronous fluorescence spectroscopy and chemometric analysis

    NASA Astrophysics Data System (ADS)

    Mabood, F.; Boqué, R.; Folcarelli, R.; Busto, O.; Jabeen, F.; Al-Harrasi, Ahmed; Hussain, J.

    2016-05-01

    In this study the effect of thermal treatment on the enhancement of synchronous fluorescence spectroscopic method for discrimination and quantification of pure extra virgin olive oil (EVOO) samples from EVOO samples adulterated with refined oil was investigated. Two groups of samples were used. One group was analyzed at room temperature (25 °C) and the other group was thermally treated in a thermostatic water bath at 75 °C for 8 h, in contact with air and with light exposure, to favor oxidation. All the samples were then measured with synchronous fluorescence spectroscopy. Synchronous fluorescence spectra were acquired by varying the wavelength in the region from 250 to 720 nm at 20 nm wavelength differential interval of excitation and emission. Pure and adulterated olive oils were discriminated by using partial least-squares discriminant analysis (PLS-DA). It was found that the best PLS-DA models were those built with the difference spectra (75 °C-25 °C), which were able to discriminate pure from adulterated oils at a 2% level of adulteration of refined olive oils. Furthermore, PLS regression models were also built to quantify the level of adulteration. Again, the best model was the one built with the difference spectra, with a prediction error of 3.18% of adulteration.

  16. Classification of Brazilian and foreign gasolines adulterated with alcohol using infrared spectroscopy.

    PubMed

    da Silva, Neirivaldo C; Pimentel, Maria Fernanda; Honorato, Ricardo S; Talhavini, Marcio; Maldaner, Adriano O; Honorato, Fernanda A

    2015-08-01

    The smuggling of products across the border regions of many countries is a practice to be fought. Brazilian authorities are increasingly worried about the illicit trade of fuels along the frontiers of the country. In order to confirm this as a crime, the Federal Police must have a means of identifying the origin of the fuel. This work describes the development of a rapid and nondestructive methodology to classify gasoline as to its origin (Brazil, Venezuela and Peru), using infrared spectroscopy and multivariate classification. Partial Least Squares Discriminant Analysis (PLS-DA) and Soft Independent Modeling Class Analogy (SIMCA) models were built. Direct standardization (DS) was employed aiming to standardize the spectra obtained in different laboratories of the border units of the Federal Police. Two approaches were considered in this work: (1) local and (2) global classification models. When using Approach 1, the PLS-DA achieved 100% correct classification, and the deviation of the predicted values for the secondary instrument considerably decreased after performing DS. In this case, SIMCA models were not efficient in the classification, even after standardization. Using a global model (Approach 2), both PLS-DA and SIMCA techniques were effective after performing DS. Considering that real situations may involve questioned samples from other nations (such as Peru), the SIMCA method developed according to Approach 2 is more adequate, since the sample will be classified as neither Brazilian nor Venezuelan. This methodology could be applied to other forensic problems involving the chemical classification of a product, provided that a specific modeling is performed. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  17. Raman microspectroscopy of nucleus and cytoplasm for human colon cancer diagnosis.

    PubMed

    Liu, Wenjing; Wang, Hongbo; Du, Jingjing; Jing, Chuanyong

    2017-11-15

    Subcellular Raman analysis is a promising clinical tool for cancer diagnosis, but it is constrained by the difficulty of deciphering subcellular spectra in actual human tissues. We report a label-free subcellular Raman analysis for use in cancer diagnosis that integrates subcellular signature spectra by subtracting cytoplasm from nucleus spectra (Nuc.-Cyt.) with a partial least squares-discriminant analysis (PLS-DA) model. Raman mapping with the classical least-squares (CLS) model allowed direct visualization of the distribution of the cytoplasm and nucleus. The PLS-DA model was employed to evaluate the diagnostic performance of five types of spectral datasets, including non-selective, nucleus, cytoplasm, ratio of nucleus to cytoplasm (Nuc./Cyt.), and nucleus minus cytoplasm (Nuc.-Cyt.), resulting in diagnostic sensitivity of 88.3%, 84.0%, 98.4%, 84.5%, and 98.9%, respectively. Discriminating between normal and cancerous cells of actual human tissues through subcellular Raman markers is feasible, especially when using the nucleus-cytoplasm difference spectra. The subcellular Raman approach had good stability, and had excellent diagnostic performance for rectal as well as colon tissues. The insights gained from this study shed new light on the general applicability of subcellular Raman analysis in clinical trials. Copyright © 2017 Elsevier B.V. All rights reserved.

  18. Classification of Fusarium-Infected Korean Hulled Barley Using Near-Infrared Reflectance Spectroscopy and Partial Least Squares Discriminant Analysis

    PubMed Central

    Lim, Jongguk; Kim, Giyoung; Mo, Changyeun; Oh, Kyoungmin; Yoo, Hyeonchae; Ham, Hyeonheui; Kim, Moon S.

    2017-01-01

    The purpose of this study is to use near-infrared reflectance (NIR) spectroscopy equipment to nondestructively and rapidly discriminate Fusarium-infected hulled barley. Both normal hulled barley and Fusarium-infected hulled barley were scanned by using a NIR spectrometer with a wavelength range of 1175 to 2170 nm. Multiple mathematical pretreatments were applied to the reflectance spectra obtained for Fusarium discrimination and the multivariate analysis method of partial least squares discriminant analysis (PLS-DA) was used for discriminant prediction. The PLS-DA prediction model developed by applying the second-order derivative pretreatment to the reflectance spectra obtained from the side of hulled barley without crease achieved 100% accuracy in discriminating the normal hulled barley and the Fusarium-infected hulled barley. These results demonstrated the feasibility of rapid discrimination of the Fusarium-infected hulled barley by combining multivariate analysis with the NIR spectroscopic technique, which is utilized as a nondestructive detection method. PMID:28974012
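    For a two-class problem such as normal versus infected samples, the core of PLS-DA can be sketched as a PLS regression onto 0/1 class labels followed by a threshold. The sketch below is illustrative only: it uses a single latent variable, a 0.5 decision threshold, and made-up three-point "spectra", none of which come from the study, and it omits the derivative pretreatment and the component-number selection a real model would need.

```python
import math

def fit_pls_da_1comp(X, y):
    """Fit a one-component PLS1 model to 0/1 class labels (illustrative)."""
    n, p = len(X), len(X[0])
    # Mean-center the spectra and the class labels.
    x_mean = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    Xc = [[row[j] - x_mean[j] for j in range(p)] for row in X]
    yc = [v - y_mean for v in y]
    # Weight vector: direction of maximum covariance between X and y.
    w = [sum(Xc[i][j] * yc[i] for i in range(n)) for j in range(p)]
    norm = math.sqrt(sum(v * v for v in w))
    w = [v / norm for v in w]
    # Scores of the training samples on the latent variable.
    t = [sum(Xc[i][j] * w[j] for j in range(p)) for i in range(n)]
    # Regression coefficient of y on the scores.
    q = sum(t[i] * yc[i] for i in range(n)) / sum(v * v for v in t)
    return {"x_mean": x_mean, "y_mean": y_mean, "w": w, "q": q}

def predict_class(model, x, threshold=0.5):
    """Project a spectrum on the latent variable and threshold the prediction."""
    t = sum((x[j] - model["x_mean"][j]) * model["w"][j] for j in range(len(x)))
    y_hat = model["y_mean"] + model["q"] * t
    return 1 if y_hat >= threshold else 0

# Hypothetical toy data: three "wavelengths", class 0 = normal, class 1 = infected.
X_train = [
    [0.10, 0.50, 0.30], [0.12, 0.48, 0.31], [0.11, 0.52, 0.29],
    [0.30, 0.40, 0.35], [0.32, 0.41, 0.33], [0.29, 0.39, 0.36],
]
y_train = [0, 0, 0, 1, 1, 1]

model = fit_pls_da_1comp(X_train, y_train)
train_predictions = [predict_class(model, x) for x in X_train]
```

    Real spectra have hundreds of correlated wavelengths and usually need several latent variables, chosen by cross-validation; an established PLS implementation would normally be used instead of this hand-rolled version.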

  19. Characterization of oils and fats by 1H NMR and GC/MS fingerprinting: classification, prediction and detection of adulteration.

    PubMed

    Fang, Guihua; Goh, Jing Yeen; Tay, Manjun; Lau, Hiu Fung; Li, Sam Fong Yau

    2013-06-01

    The correct identification of oils and fats is important to consumers from both commercial and health perspectives. Proton nuclear magnetic resonance ((1)H NMR) spectroscopy, gas chromatography-mass spectrometry (GC/MS) fingerprinting and chemometrics were employed successfully for the quality control of oils and fats. Principal component analysis (PCA) of both techniques showed group clustering of 14 types of oils and fats. Partial least squares discriminant analysis (PLS-DA) and orthogonal projections to latent structures discriminant analysis (OPLS-DA) using GC/MS data had excellent classification sensitivity and specificity compared to models using NMR data. Depending on the availability of the instruments, data from either technique can effectively be applied for the establishment of an oils and fats database to identify unknown samples. Partial least squares (PLS) models were successfully established for the detection of as low as 5% of lard and beef tallow spiked into canola oil, thus illustrating possible applications in Islamic and Jewish countries. Copyright © 2012 Elsevier Ltd. All rights reserved.

  20. Preliminary antifungal and cytotoxic evaluation of synthetic cycloalkyl[b]thiophene derivatives with PLS-DA analysis.

    PubMed

    Souza, Beatriz C C; De Oliveira, Tiago B; Aquino, Thiago M; de Lima, Maria C A; Pitta, Ivan R; Galdino, Suely L; Lima, Edeltrudes O; Gonçalves-Silva, Teresinha; Militão, Gardênia C G; Scotti, Luciana; Scotti, Marcus T; Mendonça, Francisco J B

    2012-06-01

    A series of 2-[(arylidene)amino]-cycloalkyl[b]thiophene-3-carbonitriles (2a-x) was synthesized by incorporation of substituted aromatic aldehydes in Gewald adducts (1a-c). The title compounds were screened for their antifungal activity against Candida krusei and Cryptococcus neoformans and for their antiproliferative activity against a panel of 3 human cancer cell lines (HT29, NCI H-292 and HEP). For antiproliferative activity, the partial least squares (PLS) methodology was applied. Some of the prepared compounds exhibited promising antifungal and antiproliferative properties. The most active compounds for antifungal activity were cyclohexyl[b]thiophene derivatives, and for antiproliferative activity cycloheptyl[b]thiophene derivatives, especially 2-[(1H-indol-2-yl-methylidene)amino]- 5,6,7,8-tetrahydro-4H-cyclohepta[b]thiophene-3-carbonitrile (2r), which inhibited more than 97 % growth of the three cell lines. The PLS discriminant analysis (PLS-DA) applied generated good exploratory and predictive results and showed that the descriptors having shape characteristics were strongly correlated with the biological data.

  1. Thermal oxidation process accelerates degradation of the olive oil mixed with sunflower oil and enables its discrimination using synchronous fluorescence spectroscopy and chemometric analysis

    NASA Astrophysics Data System (ADS)

    Mabood, Fazal; Boqué, Ricard; Folcarelli, Rita; Busto, Olga; Al-Harrasi, Ahmed; Hussain, Javid

    2015-05-01

    We have investigated the effect of thermal treatment on the discrimination of pure extra virgin olive oil (EVOO) samples from EVOO samples adulterated with sunflower oil. Two groups of samples were used. One group was analyzed at room temperature (25 °C) and the other group was thermally treated in a thermostatic water bath at 75 °C for 8 h, in contact with air and with light exposure, to favor oxidation. All samples were then measured with synchronous fluorescence spectroscopy. Fluorescence spectra were acquired by varying the excitation wavelength in the region from 250 to 720 nm. In order to optimize the differences between excitation and emission wavelengths, four constant differential wavelengths, i.e., 20 nm, 40 nm, 60 nm and 80 nm, were tried. Partial least-squares discriminant analysis (PLS-DA) was used to discriminate between pure and adulterated oils. It was found that the 20 nm difference was the optimal, at which the discrimination models showed the best results. The best PLS-DA models were those built with the difference spectra (75-25 °C), which were able to discriminate pure from adulterated oils at a 2% level of adulteration. Furthermore, PLS regression models were built to quantify the level of adulteration. Again, the best model was the one built with the difference spectra, with a prediction error of 1.75% of adulteration.
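    The two data-handling steps described here, scanning excitation and emission at a constant wavelength offset and subtracting the room-temperature spectrum from the heated one, can be sketched in a few lines. The function names, the scan step and the intensity values below are hypothetical; the abstract gives only the 250-720 nm range and the tested offsets.

```python
def synchronous_pairs(start_nm, stop_nm, delta_nm, step_nm):
    """Return (excitation, emission) pairs scanned at a constant offset."""
    pairs = []
    excitation = start_nm
    while excitation <= stop_nm:
        pairs.append((excitation, excitation + delta_nm))
        excitation += step_nm
    return pairs

def difference_spectrum(heated, room):
    """Point-by-point difference: heated (75 °C) minus room-temperature (25 °C)."""
    if len(heated) != len(room):
        raise ValueError("spectra must be sampled at the same wavelengths")
    return [h - r for h, r in zip(heated, room)]

# Hypothetical example: a 2 nm scan step (assumed) and made-up intensities.
pairs = synchronous_pairs(250, 254, 20, 2)
diff = difference_spectrum([3.0, 5.0, 4.5], [1.0, 2.0, 4.0])
```

    The difference spectra, rather than either raw set, would then be passed to the PLS-DA or PLS regression model.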

  2. Metabolomic Profile of ARDS by Nuclear Magnetic Resonance Spectroscopy in Patients with H1N1 Influenza Virus Pneumonia.

    PubMed

    Izquierdo-Garcia, Jose L; Nin, Nicolas; Jimenez-Clemente, Jorge; Horcajada, Juan P; Arenas-Miras, Maria Del Mar; Gea, Joaquim; Esteban, Andres; Ruiz-Cabello, Jesus; Lorente, Jose A

    2017-12-29

    The integrated analysis of changes in the metabolic profile could be critical for the discovery of biomarkers of lung injury, and also for generating new pathophysiological hypotheses and designing novel therapeutic targets for the acute respiratory distress syndrome (ARDS). This study aimed at developing a Nuclear Magnetic Resonance (NMR)-based approach for the identification of the metabolomic profile of ARDS in patients with H1N1 influenza virus pneumonia. Serum samples from 30 patients (derivation set) diagnosed with H1N1 influenza virus pneumonia were analysed by unsupervised Principal Component Analysis (PCA) to identify metabolic differences between patients with and without ARDS by NMR-spectroscopy. A predictive model of partial least squares discriminant analysis (PLS-DA) was developed for the identification of ARDS. PLS-DA was trained with the derivation set and tested in another set of samples from 26 patients also diagnosed with H1N1 influenza virus pneumonia (validation set). Decreased serum glucose, alanine, glutamine, methylhistidine and fatty acids concentrations, and elevated serum phenylalanine and methylguanidine concentrations, discriminated patients with ARDS versus patients without ARDS. PLS-DA model successfully identified the presence of ARDS in the validation set with a success rate of 92% (sensitivity 100% and specificity 91%). The classification functions showed a good correlation with the Sequential Organ Failure Assessment (SOFA) score (R = 0.74, p < 0.0001) and the PaO2/FiO2 ratio (R = 0.41, p = 0.03). The serum metabolomic profile is sensitive and specific to identify ARDS in patients with H1N1 influenza A pneumonia. Future studies are needed to determine the role of NMR-spectroscopy as a biomarker of ARDS.

  3. Simultaneous data pre-processing and SVM classification model selection based on a parallel genetic algorithm applied to spectroscopic data of olive oils.

    PubMed

    Devos, Olivier; Downey, Gerard; Duponchel, Ludovic

    2014-04-01

    Classification is an important task in chemometrics. For several years now, support vector machines (SVMs) have proven to be powerful for infrared spectral data classification. However such methods require optimisation of parameters in order to control the risk of overfitting and the complexity of the boundary. Furthermore, it is established that the prediction ability of classification models can be improved using pre-processing in order to remove unwanted variance in the spectra. In this paper we propose a new methodology based on genetic algorithm (GA) for the simultaneous optimisation of SVM parameters and pre-processing (GENOPT-SVM). The method has been tested for the discrimination of the geographical origin of Italian olive oil (Ligurian and non-Ligurian) on the basis of near infrared (NIR) or mid infrared (FTIR) spectra. Different classification models (PLS-DA, SVM with mean centre data, GENOPT-SVM) have been tested and statistically compared using McNemar's statistical test. For the two datasets, SVM with optimised pre-processing give models with higher accuracy than the one obtained with PLS-DA on pre-processed data. In the case of the NIR dataset, most of this accuracy improvement (86.3% compared with 82.8% for PLS-DA) occurred using only a single pre-processing step. For the FTIR dataset, three optimised pre-processing steps are required to obtain SVM model with significant accuracy improvement (82.2%) compared to the one obtained with PLS-DA (78.6%). Furthermore, this study demonstrates that even SVM models have to be developed on the basis of well-corrected spectral data in order to obtain higher classification rates. Copyright © 2013 Elsevier Ltd. All rights reserved.
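    McNemar's test compares two classifiers evaluated on the same samples by looking only at the samples on which they disagree. The sketch below uses the exact binomial form of the test; the abstract does not say which variant (exact or chi-squared) the authors applied, so this is an assumption, and the function names are illustrative.

```python
from math import comb

def discordant_counts(y_true, pred_a, pred_b):
    """Count samples where exactly one of the two classifiers is correct."""
    b = sum(1 for t, a, p in zip(y_true, pred_a, pred_b) if a == t and p != t)
    c = sum(1 for t, a, p in zip(y_true, pred_a, pred_b) if a != t and p == t)
    return b, c

def mcnemar_exact(b, c):
    """Two-sided exact McNemar p-value from the two discordant counts."""
    n = b + c
    if n == 0:
        return 1.0  # the classifiers never disagree
    k = min(b, c)
    # Under the null hypothesis each disagreement favours either model
    # with probability 0.5, so the smaller count is Binomial(n, 0.5).
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)

# Hypothetical labels and predictions for two models (for example PLS-DA and SVM).
y_true = [1, 1, 0, 0, 1, 0]
pred_a = [1, 0, 0, 1, 1, 0]
pred_b = [1, 1, 0, 0, 0, 0]
b, c = discordant_counts(y_true, pred_a, pred_b)
p_value = mcnemar_exact(b, c)
```

    A small p-value suggests the difference in error rates between the two models is unlikely to be due to chance on that test set.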

  4. Partial Least Squares with Structured Output for Modelling the Metabolomics Data Obtained from Complex Experimental Designs: A Study into the Y-Block Coding.

    PubMed

    Xu, Yun; Muhamadali, Howbeer; Sayqal, Ali; Dixon, Neil; Goodacre, Royston

    2016-10-28

    Partial least squares (PLS) is one of the most commonly used supervised modelling approaches for analysing multivariate metabolomics data. PLS is typically employed as either a regression model (PLS-R) or a classification model (PLS-DA). However, in metabolomics studies it is common to investigate multiple, potentially interacting, factors simultaneously following a specific experimental design. Such data often cannot be considered as a "pure" regression or a classification problem. Nevertheless, these data have often still been treated as a regression or classification problem and this could lead to ambiguous results. In this study, we investigated the feasibility of designing a hybrid target matrix Y that better reflects the experimental design than simple regression or binary class membership coding commonly used in PLS modelling. The new design of Y coding was based on the same principle used by structural modelling in machine learning techniques. Two real metabolomics datasets were used as examples to illustrate how the new Y coding can improve the interpretability of the PLS model compared to classic regression/classification coding.
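    For reference, the "binary class membership coding" that the hybrid design is compared against can be sketched as follows: each sample gets one column per class, with a 1 in the column of its own class. This shows only the classic coding; the hybrid target matrix proposed in the study is not specified in the abstract and is not reproduced here. The function names and labels are illustrative.

```python
def dummy_code(labels):
    """Build a binary class-membership Y matrix (one column per class)."""
    classes = sorted(set(labels))
    column = {cls: j for j, cls in enumerate(classes)}
    Y = [[1 if column[label] == j else 0 for j in range(len(classes))]
         for label in labels]
    return classes, Y

def decode(classes, y_row):
    """Assign the class whose column has the largest predicted value."""
    best = max(range(len(classes)), key=lambda j: y_row[j])
    return classes[best]

# Hypothetical experimental groups.
classes, Y = dummy_code(["treated", "control", "treated"])
predicted = decode(classes, [0.2, 0.7])
```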

  5. Detection of residues from explosive manipulation by near infrared hyperspectral imaging: a promising forensic tool.

    PubMed

    Fernández de la Ossa, Mª Ángeles; Amigo, José Manuel; García-Ruiz, Carmen

    2014-09-01

    In this study near infrared hyperspectral imaging (NIR-HSI) is used to provide a fast, non-contact, non-invasive and non-destructive method for the analysis of explosive residues on human handprints. For this purpose, classical explosives potentially used as part of improvised explosive devices (IEDs), such as ammonium nitrate, blackpowder, single- and double-base smokeless gunpowders and dynamite, were studied. Volunteers individually manipulated each of these explosives and afterwards deposited their handprints on plastic sheets. A partial-least squares discriminant analysis (PLS-DA) model was built to detect and classify the presence of explosive residues in handprints. High levels of sensitivity and specificity for the PLS-DA classification model created to identify ammonium nitrate, blackpowder, single- and double-base smokeless gunpowders and dynamite residues were obtained, allowing the development of a preliminary library and facilitating the direct and in situ detection of explosives by NIR-HSI. Consequently, this technique is shown to be a promising forensic tool for the detection of explosive residues and other related samples. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  6. Characterization of human breast cancer tissues by infrared imaging.

    PubMed

    Verdonck, M; Denayer, A; Delvaux, B; Garaud, S; De Wind, R; Desmedt, C; Sotiriou, C; Willard-Gallo, K; Goormaghtigh, E

    2016-01-21

    Fourier Transform InfraRed (FTIR) spectroscopy coupled to microscopy (IR imaging) has shown unique advantages in detecting morphological and molecular pathologic alterations in biological tissues. The aim of this study was to evaluate the potential of IR imaging as a diagnostic tool to identify characteristics of breast epithelial cells and the stroma. In this study a total of 19 breast tissue samples were obtained from 13 patients. For 6 of the patients, we also obtained Non-Adjacent Non-Tumor tissue samples. Infrared images were recorded on the main cell/tissue types identified in all breast tissue samples. Unsupervised Principal Component Analyses and supervised Partial Least Square Discriminant Analyses (PLS-DA) were used to discriminate spectra. Leave-one-out cross-validation was used to evaluate the performance of PLS-DA models. Our results show that IR imaging coupled with PLS-DA can efficiently identify the main cell types present in FFPE breast tissue sections, i.e. epithelial cells, lymphocytes, connective tissue, vascular tissue and erythrocytes. A second PLS-DA model could distinguish normal and tumor breast epithelial cells in the breast tissue sections. A patient-specific model reached particularly high sensitivity, specificity and MCC rates. Finally, we showed that the stroma located close or at distance from the tumor exhibits distinct spectral characteristics. In conclusion FTIR imaging combined with computational algorithms could be an accurate, rapid and objective tool to identify/quantify breast epithelial cells and differentiate tumor from normal breast tissue as well as normal from tumor-associated stroma, paving the way to the establishment of a potential complementary tool to ensure safe tumor margins.
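    The figures of merit mentioned here (sensitivity, specificity and the Matthews correlation coefficient, MCC) all derive from the four cells of a two-class confusion matrix. A minimal sketch, with made-up labels and illustrative function names:

```python
import math

def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, tn, fn) for a two-class problem."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            if truth == positive:
                tp += 1
            else:
                fp += 1
        else:
            if truth == positive:
                fn += 1
            else:
                tn += 1
    return tp, fp, tn, fn

def sensitivity_specificity_mcc(tp, fp, tn, fn):
    """Sensitivity, specificity and Matthews correlation coefficient."""
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # MCC is conventionally reported as 0 when a row or column is empty.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return sensitivity, specificity, mcc

# Hypothetical labels: 1 = tumor, 0 = normal.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]
counts = confusion_counts(y_true, y_pred)
sens, spec, mcc = sensitivity_specificity_mcc(*counts)
```

    In a leave-one-out scheme such as the one described, each prediction in `y_pred` would come from a model trained on all the other samples.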

  7. Rapid high-throughput characterisation, classification and selection of recombinant mammalian cell line phenotypes using intact cell MALDI-ToF mass spectrometry fingerprinting and PLS-DA modelling.

    PubMed

    Povey, Jane F; O'Malley, Christopher J; Root, Tracy; Martin, Elaine B; Montague, Gary A; Feary, Marc; Trim, Carol; Lang, Dietmar A; Alldread, Richard; Racher, Andrew J; Smales, C Mark

    2014-08-20

    Despite many advances in the generation of high producing recombinant mammalian cell lines over the last few decades, cell line selection and development is often slowed by the inability to predict a cell line's phenotypic characteristics (e.g. growth or recombinant protein productivity) at larger scale (large volume bioreactors) using data from early cell line construction at small culture scale. Here we describe the development of an intact cell MALDI-ToF mass spectrometry fingerprinting method for mammalian cells early in the cell line construction process whereby the resulting mass spectrometry data are used to predict the phenotype of mammalian cell lines at larger culture scale using a Partial Least Squares Discriminant Analysis (PLS-DA) model. Using MALDI-ToF mass spectrometry, a library of mass spectrometry fingerprints was generated for individual cell lines at the 96 deep well plate stage of cell line development. The growth and productivity of these cell lines were evaluated in a 10L bioreactor model of Lonza's large-scale (up to 20,000L) fed-batch cell culture processes. Using the mass spectrometry information at the 96 deep well plate stage and phenotype information at the 10L bioreactor scale a PLS-DA model was developed to predict the productivity of unknown cell lines at the 10L scale based upon their MALDI-ToF fingerprint at the 96 deep well plate scale. This approach provides the basis for the very early prediction of cell lines' performance in cGMP manufacturing-scale bioreactors and the foundation for methods and models for predicting other mammalian cell phenotypes from rapid, intact-cell mass spectrometry based measurements. Copyright © 2014 Elsevier B.V. All rights reserved.

  8. Non-targeted analyses of animal plasma: betaine and choline represent the nutritional and metabolic status.

    PubMed

    Katayama, K; Sato, T; Arai, T; Amao, H; Ohta, Y; Ozawa, T; Kenyon, P R; Hickson, R E; Tazaki, H

    2013-02-01

    Simple liquid chromatography-mass spectrometry (LC-MS) was applied to non-targeted metabolic analyses to discover new metabolic markers in animal plasma. Principal component analysis (PCA) and partial least squares-discriminant analysis (PLS-DA) were used to analyse LC-MS multivariate data. PCA clearly generated two separate clusters for artificially induced diabetic mice and healthy control mice. PLS-DA of time-course changes in plasma metabolites of chicks after feeding generated three clusters (pre- and immediately after feeding, 0.5-3 h after feeding and 4 h after feeding). Two separate clusters were also generated for plasma metabolites of pregnant Angus heifers with differing live-weight change profiles (gaining or losing). The accompanying PLS-DA loading plot detailed the metabolites that contribute the most to the cluster separation. In each case, the same highly hydrophilic metabolite was strongly correlated to the group separation. The metabolite was identified as betaine by LC-MS/MS. This result indicates that betaine and its metabolic precursor, choline, may be useful biomarkers to evaluate the nutritional and metabolic status of animals. © 2011 Blackwell Verlag GmbH.

  9. 1H nuclear magnetic resonance-based metabolomic characterization of wines by grape varieties and production areas.

    PubMed

    Son, Hong-Seok; Kim, Ki Myong; van den Berg, Frans; Hwang, Geum-Sook; Park, Won-Mok; Lee, Cherl-Ho; Hong, Young-Shick

    2008-09-10

    (1)H NMR spectroscopy was used to investigate the metabolic differences in wines produced from different grape varieties and different regions. A significant separation among wines from Campbell Early, Cabernet Sauvignon, and Shiraz grapes was observed using principal component analysis (PCA) and partial least squares-discriminant analysis (PLS-DA). The metabolites contributing to the separation were assigned to be 2,3-butanediol, lactate, acetate, proline, succinate, malate, glycerol, tartrate, glucose, and phenolic compounds by PCA and PLS-DA loading plots. Wines produced from Cabernet Sauvignon grapes harvested in the continental areas of Australia, France, and California were also separated. PLS-DA loading plots revealed that the level of proline in Californian Cabernet Sauvignon wines was higher than that in Australian and French Cabernet Sauvignon, Australian Shiraz, and Korean Campbell Early wines, showing that the chemical composition of the grape berries varies with the variety and growing area. This study highlights the applicability of NMR-based metabolomics with multivariate statistical data sets in determining wine quality and product origin.

  10. Atmospheric pressure chemical ionisation mass spectrometry analysis linked with chemometrics for food classification - a case study: geographical provenance and cultivar classification of monovarietal clarified apple juices.

    PubMed

    Gan, Heng-Hui; Soukoulis, Christos; Fisk, Ian

    2014-03-01

    In the present work, we have evaluated for the first time the feasibility of APCI-MS volatile compound fingerprinting in conjunction with chemometrics (PLS-DA) as a new strategy for rapid and non-destructive food classification. For this purpose 202 clarified monovarietal juices extracted from apples differing in their botanical and geographical origin were used for evaluation of the performance of APCI-MS as a classification tool. For an independent test set PLS-DA analyses of pre-treated spectral data gave 100% and 94.2% correct classification rate for the classification by cultivar and geographical origin, respectively. Moreover, PLS-DA analysis of APCI-MS in conjunction with GC-MS data revealed that masses within the spectral APCI-MS data set were related with parent ions or fragments of alkyl esters, carbonyl compounds (hexanal, trans-2-hexenal) and alcohols (1-hexanol, 1-butanol, cis-3-hexenol) and had significant discriminating power both in terms of cultivar and geographical origin. Copyright © 2013 The Authors. Published by Elsevier Ltd. All rights reserved.

  11. Discrimination of edible oils and fats by combination of multivariate pattern recognition and FT-IR spectroscopy: A comparative study between different modeling methods

    NASA Astrophysics Data System (ADS)

    Javidnia, Katayoun; Parish, Maryam; Karimi, Sadegh; Hemmateenejad, Bahram

    2013-03-01

    By using FT-IR spectroscopy, many researchers from different disciplines enrich the experimental complexity of their research for obtaining more precise information. Moreover, chemometrics techniques have boosted the use of IR instruments. In the present study we aimed to emphasize the power of FT-IR spectroscopy for discrimination between different oil samples (especially fat from vegetable oils). Our data were also used to compare the performance of different classification methods. FT-IR transmittance spectra of oil samples (Corn, Colona, Sunflower, Soya, Olive, and Butter) were measured in the wave-number interval of 450-4000 cm⁻¹. Classification analysis was performed utilizing PLS-DA, interval PLS-DA, extended canonical variate analysis (ECVA) and interval ECVA methods. The effect of data preprocessing by extended multiplicative signal correction was investigated. Whilst all employed methods could distinguish butter from vegetable oils, iECVA resulted in the best performances for calibration and external test set with 100% sensitivity and specificity.

  12. Discrimination of Gastrodia elata from Different Geographical Origin for Quality Evaluation Using Newly-Build Near Infrared Spectrum Coupled with Multivariate Analysis.

    PubMed

    Zuo, Yamin; Deng, Xuehua; Wu, Qing

    2018-05-04

    Discrimination of Gastrodia elata (G. elata) geographical origin is of great importance to pharmaceutical companies and consumers in China. This paper focuses on the feasibility of near infrared spectrum (NIRS) combined with multivariate analysis as a rapid and non-destructive method to prove its fit for this purpose. Firstly, 16 batches of G. elata samples from four main-cultivation regions in China were quantified by traditional HPLC method. It showed that samples from different origins could not be efficiently differentiated by the contents of four phenolic compounds in this study. Secondly, the raw near infrared (NIR) spectra of those samples were acquired and two different pattern recognition techniques were used to classify the geographical origins. The results showed that with spectral transformation optimized, discriminant analysis (DA) provided 97% and 99% correct classification for the calibration and validation sets of samples when discriminating the four different main-cultivation regions, and provided 98% and 99% correct classifications for the calibration and validation sets of samples from eight different cities, respectively, which all performed better than the principal component analysis (PCA) method. Thirdly, as phenolic compounds content (PCC) is highly related with the quality of G. elata, synergy interval partial least squares (Si-PLS) was applied to build the PCC prediction model. The coefficient of determination for prediction (Rp²) of the Si-PLS model was 0.9209, and root mean square error for prediction (RMSEP) was 0.338. The two regions (4800-5200 cm⁻¹ and 5600-6000 cm⁻¹) selected by Si-PLS corresponded to the absorptions of aromatic ring in the basic phenolic structure. It can be concluded that NIR spectroscopy combined with PCA, DA and Si-PLS would be a potential tool to provide a reference for the quality control of G. elata.
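    The two regression figures of merit quoted here can be computed from reference and predicted values on a prediction set. The sketch below assumes the common definition of the coefficient of determination as one minus the ratio of residual to total sum of squares; some chemometrics software reports the squared correlation coefficient instead, and the abstract does not say which was used. The values are made up.

```python
import math

def rmsep(y_ref, y_pred):
    """Root mean square error of prediction."""
    n = len(y_ref)
    return math.sqrt(sum((r - p) ** 2 for r, p in zip(y_ref, y_pred)) / n)

def r2_prediction(y_ref, y_pred):
    """Coefficient of determination: 1 - SS_residual / SS_total (assumed definition)."""
    mean_ref = sum(y_ref) / len(y_ref)
    ss_res = sum((r - p) ** 2 for r, p in zip(y_ref, y_pred))
    ss_tot = sum((r - mean_ref) ** 2 for r in y_ref)
    return 1.0 - ss_res / ss_tot

# Hypothetical reference (HPLC) and predicted (NIR model) contents.
y_ref = [1.0, 2.0, 3.0, 4.0]
y_pred = [1.5, 2.0, 3.0, 3.5]
error = rmsep(y_ref, y_pred)
r2 = r2_prediction(y_ref, y_pred)
```

    RMSEP is expressed in the units of the predicted property, so it is only meaningful alongside the range of the reference values.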

  13. Characteristic fingerprinting based on macamides for discrimination of maca (Lepidium meyenii) by LC/MS/MS and multivariate statistical analysis.

    PubMed

    Pan, Yu; Zhang, Ji; Li, Hong; Wang, Yuan-Zhong; Li, Wan-Yi

    2016-10-01

    Macamides with a benzylalkylamide nucleus are characteristic and major bioactive compounds in the functional food maca (Lepidium meyenii Walp). The aim of this study was to explore variations in macamide content among maca from China and Peru. Twenty-seven batches of maca hypocotyls with different phenotypes, sampled from different geographical origins, were extracted and profiled by liquid chromatography with ultraviolet detection/tandem mass spectrometry (LC-UV/MS/MS). Twelve macamides were identified by MS operated in multiple scanning modes. Similarity analysis showed that maca samples differed significantly in their macamide fingerprinting. Partial least squares discriminant analysis (PLS-DA) was used to differentiate samples according to their geographical origin and to identify the most relevant variables in the classification model. The prediction accuracy for raw maca was 91% and five macamides were selected and considered as chemical markers for sample classification. When combined with a PLS-DA model, characteristic fingerprinting based on macamides could be recommended for labelling for the authentication of maca from different geographical origins. The results provided potential evidence for the relationships between environmental or other factors and distribution of macamides. © 2016 Society of Chemical Industry.

  14. LIBS data analysis using a predictor-corrector based digital signal processor algorithm

    NASA Astrophysics Data System (ADS)

    Sanders, Alex; Griffin, Steven T.; Robinson, Aaron

    2012-06-01

    There are many accepted sensor technologies for generating spectra for material classification. Once the spectra are generated, communication bandwidth limitations favor local material classification with its attendant reduction in data transfer rates and power consumption. Transferring sensor technologies such as Cavity Ring-Down Spectroscopy (CRDS) and Laser Induced Breakdown Spectroscopy (LIBS) require effective material classifiers. A result of recent efforts has been emphasis on Partial Least Squares - Discriminant Analysis (PLS-DA) and Principal Component Analysis (PCA). Implementation of these via general purpose computers is difficult in small portable sensor configurations. This paper addresses the creation of a low mass, low power, robust hardware spectra classifier for a limited set of predetermined materials in an atmospheric matrix. Crucial to this is the incorporation of PCA or PLS-DA classifiers into a predictor-corrector style implementation. The system configuration guarantees rapid convergence. Software running on multi-core Digital Signal Processors (DSPs) simulates a stream-lined plasma physics model estimator, reducing Analog-to-Digital Converter (ADC) power requirements. This paper presents the results of a predictor-corrector model implemented on a low power multi-core DSP to perform substance classification. This configuration emphasizes the hardware system and software design via a predictor-corrector model that simultaneously decreases the sample rate while performing the classification.

  15. Rapid characterization of transgenic and non-transgenic soybean oils by chemometric methods using NIR spectroscopy

    NASA Astrophysics Data System (ADS)

    Luna, Aderval S.; da Silva, Arnaldo P.; Pinho, Jéssica S. A.; Ferré, Joan; Boqué, Ricard

    Near infrared (NIR) spectroscopy and multivariate classification were applied to discriminate soybean oil samples into non-transgenic and transgenic. Principal Component Analysis (PCA) was applied to extract relevant features from the spectral data and to remove the anomalous samples. The best results were obtained with Support Vector Machine-Discriminant Analysis (SVM-DA) and Partial Least Squares-Discriminant Analysis (PLS-DA) after mean centering plus multiplicative scatter correction. For SVM-DA the percentage of successful classification was 100% for the training group and 100% and 90% in the validation group for non-transgenic and transgenic soybean oil samples, respectively. For PLS-DA the percentage of successful classification was 95% and 100% in the training group for non-transgenic and transgenic soybean oil samples, respectively, and 100% and 80% in the validation group for non-transgenic and transgenic samples, respectively. The results demonstrate that NIR spectroscopy can provide a rapid, nondestructive and reliable method to distinguish non-transgenic and transgenic soybean oils.
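
The abstract above names mean centering and multiplicative scatter correction (MSC) as preprocessing but gives no details. The following is a minimal sketch of the textbook form of both steps, not the authors' code: each spectrum is regressed on a reference spectrum (assumed here to be the mean spectrum unless one is supplied), then the fitted offset and slope are removed. The function names `msc` and `mean_center`, and applying MSC before centering, are assumptions made for illustration.

```python
def msc(spectra, reference=None):
    """Multiplicative scatter correction.

    Each spectrum x is modelled as x = intercept + slope * reference and
    replaced by (x - intercept) / slope. `reference` defaults to the mean
    spectrum of the set (an assumption; other references are possible).
    """
    n_points = len(spectra[0])
    if reference is None:
        reference = [sum(s[k] for s in spectra) / len(spectra)
                     for k in range(n_points)]
    ref_mean = sum(reference) / n_points
    ref_var = sum((r - ref_mean) ** 2 for r in reference)
    corrected = []
    for s in spectra:
        s_mean = sum(s) / n_points
        # Least-squares slope and intercept of the spectrum against the reference
        slope = sum((r - ref_mean) * (x - s_mean)
                    for r, x in zip(reference, s)) / ref_var
        intercept = s_mean - slope * ref_mean
        corrected.append([(x - intercept) / slope for x in s])
    return corrected


def mean_center(rows):
    """Subtract the column (per-wavelength) mean from every row."""
    n_rows = len(rows)
    col_means = [sum(r[k] for r in rows) / n_rows for k in range(len(rows[0]))]
    return [[x - m for x, m in zip(r, col_means)] for r in rows]
```

Two spectra that differ from the reference only by an offset and a scale factor collapse onto the reference after `msc`, which is the scatter effect the correction is meant to remove.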

  16. Statistical analysis of fragmentation patterns of electron ionization mass spectra of enolized-trimethylsilylated anabolic androgenic steroids

    NASA Astrophysics Data System (ADS)

    Fragkaki, A. G.; Angelis, Y. S.; Tsantili-Kakoulidou, A.; Koupparis, M.; Georgakopoulos, C.

    2009-08-01

    Anabolic androgenic steroids (AAS) are included in the List of prohibited substances of the World Anti-Doping Agency (WADA) as substances abused to enhance athletic performance. Gas chromatography coupled to mass spectrometry (GC-MS) plays an important role in doping control analyses identifying AAS as their enolized-trimethylsilyl (TMS)-derivatives using the electron ionization (EI) mode. This paper explores the suitability of complementary GC-MS mass spectra and statistical analysis (principal component analysis, PCA and partial least squares-discriminant analysis, PLS-DA) to differentiate AAS as a function of their structural and conformational features expressed by their fragment ions. The results obtained showed that the application of PCA yielded a classification among the AAS molecules which became more apparent after applying PLS-DA to the dataset. The application of PLS-DA yielded a clear separation among the AAS molecules which were, thus, classified as: 1-ene-3-keto, 3-hydroxyl with saturated A-ring, 1-ene-3-hydroxyl, 4-ene-3-keto, 1,4-diene-3-keto and 3-keto with saturated A-ring anabolic steroids. The study of this paper also presents structurally diagnostic fragment ions and dissociation routes providing evidence for the presence of unknown AAS or chemically modified molecules known as designer steroids.

  17. Application of FTIR-ATR spectroscopy coupled with multivariate analysis for rapid estimation of butter adulteration.

    PubMed

    Fadzlillah, Nurrulhidayah Ahmad; Rohman, Abdul; Ismail, Amin; Mustafa, Shuhaimi; Khatib, Alfi

    2013-01-01

    In the dairy product sector, butter is one of the potential sources of fat-soluble vitamins, namely vitamins A, D, E and K; consequently, butter commands a higher price than other dairy products. This fact has attracted unscrupulous market players to blend butter with other animal fats to gain economic profit. Animal fats like mutton fat (MF) have the potential to be mixed with butter due to the similarity in fatty acid composition. This study focused on the application of FTIR-ATR spectroscopy in conjunction with chemometrics for classification and quantification of MF as an adulterant in butter. The FTIR spectral region of 3910-710 cm⁻¹ was used for classification between butter and butter blended with MF at various concentrations with the aid of discriminant analysis (DA). DA was able to classify butter and adulterated butter without any samples being mistakenly grouped. For quantitative analysis, partial least square (PLS) regression was used to develop a calibration model at the frequency region of 3910-710 cm⁻¹. The equation obtained for the relationship between the actual value of MF and the FTIR predicted value of MF in the PLS calibration model was y = 0.998x + 1.033, with a coefficient of determination (R²) of 0.998 and a root mean square error of calibration of 0.046% (v/v). The PLS calibration model was subsequently used for the prediction of independent samples containing butter in binary mixtures with MF. Using 9 principal components, the root mean square error of prediction (RMSEP) was 1.68% (v/v). The results showed that FTIR spectroscopy can be used for the classification and quantification of MF in butter formulation for verification purposes.
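
The figures of merit quoted in the abstract above (a calibration line relating actual and predicted values, R², and root mean square errors) can be computed as in the sketch below. This is an illustration under stated assumptions, not the authors' procedure: `rmse` divides by the number of samples (some chemometrics packages subtract degrees of freedom for calibration errors), and `fit_line` is an ordinary least-squares fit. Both function names are hypothetical.

```python
import math


def rmse(actual, predicted):
    """Root mean square error between reference and predicted values.

    Assumes a plain mean over n samples; RMSEC is sometimes computed with
    a degrees-of-freedom correction instead.
    """
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def fit_line(x, y):
    """Ordinary least-squares line y = slope * x + intercept, plus R^2."""
    n = len(x)
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - y_mean) ** 2 for yi in y)
    return slope, intercept, 1.0 - ss_res / ss_tot
```

Applied to calibration samples, `rmse` gives an RMSEC-style value; applied to independent samples that were not used to build the model, it gives an RMSEP-style value.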

  18. Metabolite profiling of Clinacanthus nutans leaves extracts obtained from different drying methods by 1H NMR-based metabolomics

    NASA Astrophysics Data System (ADS)

    Hashim, Noor Haslinda Noor; Latip, Jalifah; Khatib, Alfi

    2016-11-01

    The metabolites of Clinacanthus nutans leaves extracts and their dependence on drying process were systematically characterized using 1H nuclear magnetic resonance spectroscopy (NMR) multivariate data analysis. Principal component analysis (PCA) and partial least square-discriminant analysis (PLS-DA) were able to distinguish the leaves extracts obtained from different drying methods. The identified metabolites were carbohydrates, amino acid, flavonoids and sulfur glucoside compounds. The major metabolites responsible for the separation in PLS-DA loading plots were lupeol, cycloclinacosides, betulin, cerebrosides and choline. The results showed that the combination of 1H NMR spectroscopy and multivariate data analyses could act as an efficient technique to understand the C. nutans composition and its variation.

  19. Non-targeted 1H NMR fingerprinting and multivariate statistical analyses for the characterisation of the geographical origin of Italian sweet cherries.

    PubMed

    Longobardi, F; Ventrella, A; Bianco, A; Catucci, L; Cafagna, I; Gallo, V; Mastrorilli, P; Agostiano, A

    2013-12-01

    In this study, non-targeted 1H NMR fingerprinting was used in combination with multivariate statistical techniques for the classification of Italian sweet cherries based on their different geographical origins (Emilia Romagna and Puglia). As classification techniques, Soft Independent Modelling of Class Analogy (SIMCA), Partial Least Squares Discriminant Analysis (PLS-DA), and Linear Discriminant Analysis (LDA) were carried out and the results were compared. For LDA, before performing a refined selection of the number/combination of variables, two different strategies for a preliminary reduction of the variable number were tested. The best average recognition and CV prediction abilities (both 100.0%) were obtained for all the LDA models, although PLS-DA also showed remarkable performances (94.6%). All the statistical models were validated by observing the prediction abilities with respect to an external set of cherry samples. The best result (94.9%) was obtained with LDA by performing a best subset selection procedure on a set of 30 principal components previously selected by a stepwise decorrelation. The metabolites that mostly contributed to the classification performances of such LDA model, were found to be malate, glucose, fructose, glutamine and succinate. Copyright © 2013 Elsevier Ltd. All rights reserved.

  20. Authentication of Organically and Conventionally Grown Basils by Gas Chromatography/Mass Spectrometry Chemical Profiles

    PubMed Central

    Wang, Zhengfang; Chen, Pei; Yu, Liangli; Harrington, Peter de B.

    2013-01-01

    Basil plants cultivated by organic and conventional farming practices were accurately classified by pattern recognition of gas chromatography/mass spectrometry (GC/MS) data. A novel extraction procedure was devised to extract characteristic compounds from ground basil powders. Two in-house fuzzy classifiers, i.e., the fuzzy rule-building expert system (FuRES) and the fuzzy optimal associative memory (FOAM) for the first time, were used to build classification models. Two crisp classifiers, i.e., soft independent modeling by class analogy (SIMCA) and the partial least-squares discriminant analysis (PLS-DA), were used as control methods. Prior to data processing, baseline correction and retention time alignment were performed. Classifiers were built with the two-way data sets, the total ion chromatogram representation of data sets, and the total mass spectrum representation of data sets, separately. Bootstrapped Latin partition (BLP) was used as an unbiased evaluation of the classifiers. By using two-way data sets, average classification rates with FuRES, FOAM, SIMCA, and PLS-DA were 100 ± 0%, 94.4 ± 0.4%, 93.3 ± 0.4%, and 100 ± 0%, respectively, for 100 independent evaluations. The established classifiers were used to classify a new validation set collected 2.5 months later with no parametric changes except that the training set and validation set were individually mean-centered. For the new two-way validation set, classification rates with FuRES, FOAM, SIMCA, and PLS-DA were 100%, 83%, 97%, and 100%, respectively. Thereby, the GC/MS analysis was demonstrated as a viable approach for organic basil authentication. It is the first time that a FOAM has been applied to classification. A novel baseline correction method was used also for the first time. The FuRES and the FOAM are demonstrated as powerful tools for modeling and classifying GC/MS data of complex samples and the data pretreatments are demonstrated to be useful to improve the performance of classifiers. PMID:23398171
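
The basil record above evaluates its classifiers with bootstrapped Latin partitions (BLP) but does not describe the procedure. The sketch below is one simplified reading of the idea, offered as an assumption rather than the authors' implementation: samples are dealt into partitions so that class proportions are roughly preserved, each partition serves once as the test set, and the whole split is repeated with fresh random shuffles. The function names, the round-robin dealing, and the seed handling are all illustrative choices.

```python
import random
from collections import defaultdict


def latin_partitions(labels, n_partitions, rng):
    """Deal sample indices into folds, class by class, so every index lands
    in exactly one fold and class proportions stay roughly balanced."""
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    folds = [[] for _ in range(n_partitions)]
    offset = 0
    for label in sorted(by_class):
        members = by_class[label]
        rng.shuffle(members)
        for pos, idx in enumerate(members):
            folds[(pos + offset) % n_partitions].append(idx)
        offset += len(members)
    return folds


def bootstrapped_latin_partitions(labels, n_partitions, n_bootstraps, seed=0):
    """Yield (train_indices, test_indices) pairs; each bootstrap reshuffles
    the data and uses every partition once as the held-out test set."""
    rng = random.Random(seed)
    for _ in range(n_bootstraps):
        for test_idx in latin_partitions(labels, n_partitions, rng):
            held_out = set(test_idx)
            train_idx = [i for i in range(len(labels)) if i not in held_out]
            yield train_idx, sorted(test_idx)
```

Averaging a classifier's accuracy over the partitions of one bootstrap gives one pooled estimate, and the spread across bootstraps gives an interval of the kind reported as "± 0.4%" in the abstract.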

  1. Analysis of the polymeric fractions of scrap from mobile phones using laser-induced breakdown spectroscopy: chemometric applications for better data interpretation.

    PubMed

    Aquino, Francisco W B; Pereira-Filho, Edenir R

    2015-03-01

    Because of their short life span and high production and consumption rates, mobile phones are one of the contributors to WEEE (waste electrical and electronic equipment) growth in many countries. If incorrectly managed, the hazardous materials used in the assembly of these devices can pollute the environment and pose dangers for workers involved in the recycling of these materials. In this study, 144 polymer fragments originating from 50 broken or obsolete mobile phones were analyzed via laser-induced breakdown spectroscopy (LIBS) without previous treatment. The coated polymers were mainly characterized by the presence of Ag, whereas the uncoated polymers were related to the presence of Al, K, Na, Si and Ti. Classification models were proposed using black and white polymers separately in order to identify the manufacturer and origin using KNN (K-nearest neighbor), SIMCA (Soft Independent Modeling of Class Analogy) and PLS-DA (Partial Least Squares for Discriminant Analysis). For the black polymers the percentage of correct predictions was, on average, 58% taking into consideration the models for manufacturer and origin identification. In the case of white polymers, the percentage of correct predictions ranged from 72.8% (PLS-DA) to 100% (KNN). Copyright © 2014 Elsevier B.V. All rights reserved.

  2. Assessment of Fecal Microbiota and Fecal Metabolome in Symptomatic Uncomplicated Diverticular Disease of the Colon.

    PubMed

    Tursi, Antonio; Mastromarino, Paola; Capobianco, Daniela; Elisei, Walter; Miccheli, Alfredo; Capuani, Giorgio; Tomassini, Alberta; Campagna, Giuseppe; Picchio, Marcello; Giorgetti, GianMarco; Fabiocchi, Federica; Brandimarte, Giovanni

    2016-10-01

    The aim of this study was to assess fecal microbiota and metabolome in a population with symptomatic uncomplicated diverticular disease (SUDD). Whether intestinal microbiota and metabolic profiling may be altered in patients with SUDD is unknown. Stool samples from 44 consecutive women [15 patients with SUDD, 13 with asymptomatic diverticulosis (AD), and 16 healthy controls (HCs)] were analyzed. Real-time polymerase chain reaction was used to quantify targeted microorganisms. High-resolution proton nuclear magnetic resonance spectroscopy associated with multivariate analysis with partial least-square discriminant analysis (PLS-DA) was applied on the metabolite data set. The overall bacterial quantity did not differ among the 3 groups (P=0.449), with no difference in Bacteroides/Prevotella, Clostridium coccoides, Bifidobacterium, Lactobacillus, and Escherichia coli subgroups. The amount of Akkermansia muciniphila species was significantly different between HC, AD, and SUDD subjects (P=0.017). PLS-DA analysis of nuclear magnetic resonance-based metabolomics associated with microbiological data showed significant discrimination between HCs and AD patients (R=0.733; Q=0.383; P<0.05, LV=2). PLS analysis showed lower N-acetyl compound and isovalerate levels in AD, associated with higher levels of A. muciniphila, as compared with the HC group. PLS-DA applied on AD and SUDD samples showed a good discrimination between these 2 groups (R=0.69; Q=0.35; LV=2). SUDD patients were characterized by low levels of valerate, butyrate, and choline and by high levels of N-acetyl derivatives and U1. SUDD and AD do not show colonic bacterial overgrowth, but a significant difference in the levels of fecal A. muciniphila was observed. Moreover, increasing expression of some metabolites as expression of different AD and SUDD metabolic activity was found.

  3. FT-Raman and chemometric tools for rapid determination of quality parameters in milk powder: Classification of samples for the presence of lactose and fraud detection by addition of maltodextrin.

    PubMed

    Rodrigues Júnior, Paulo Henrique; de Sá Oliveira, Kamila; de Almeida, Carlos Eduardo Rocha; De Oliveira, Luiz Fernando Cappa; Stephani, Rodrigo; Pinto, Michele da Silva; de Carvalho, Antônio Fernandes; Perrone, Ítalo Tuler

    2016-04-01

    FT-Raman spectroscopy has been explored as a quick screening method to evaluate the presence of lactose and identify milk powder samples adulterated with maltodextrin (2.5-50% w/w). Raman measurements can easily differentiate samples of milk powder, without the need for sample preparation, while traditional quality control methods, including high performance liquid chromatography, are cumbersome and slow. FT-Raman spectra were obtained from samples of whole lactose and low-lactose milk powder, both without and with addition of maltodextrin. Differences were observed between the spectra involved in identifying samples with low lactose content, as well as adulterated samples. Exploratory data analysis using Raman spectroscopy and multivariate analysis was also developed to classify samples with PCA and PLS-DA. The PLS-DA models obtained correctly classified all samples. These results demonstrate the utility of FT-Raman spectroscopy in combination with chemometrics for assessing the quality of milk powder. Copyright © 2015 Elsevier Ltd. All rights reserved.

  4. Discrimination of edible oils and fats by combination of multivariate pattern recognition and FT-IR spectroscopy: a comparative study between different modeling methods.

    PubMed

    Javidnia, Katayoun; Parish, Maryam; Karimi, Sadegh; Hemmateenejad, Bahram

    2013-03-01

    By using FT-IR spectroscopy, many researchers from different disciplines enrich the experimental complexity of their research for obtaining more precise information. Moreover, chemometrics techniques have boosted the use of IR instruments. In the present study we aimed to emphasize the power of FT-IR spectroscopy for discrimination between different oil samples (especially fat from vegetable oils). Our data were also used to compare the performance of different classification methods. FT-IR transmittance spectra of oil samples (Corn, Colona, Sunflower, Soya, Olive, and Butter) were measured in the wave-number interval of 450-4000 cm⁻¹. Classification analysis was performed utilizing PLS-DA, interval PLS-DA, extended canonical variate analysis (ECVA) and interval ECVA methods. The effect of data preprocessing by extended multiplicative signal correction was investigated. Whilst all employed methods could distinguish butter from vegetable oils, iECVA resulted in the best performance for calibration and external test sets with 100% sensitivity and specificity. Copyright © 2012 Elsevier B.V. All rights reserved.

  5. Fast classification of hazelnut cultivars through portable infrared spectroscopy and chemometrics

    NASA Astrophysics Data System (ADS)

    Manfredi, Marcello; Robotti, Elisa; Quasso, Fabio; Mazzucco, Eleonora; Calabrese, Giorgio; Marengo, Emilio

    2018-01-01

    The authentication and traceability of hazelnuts is very important for both the consumer and the food industry, to safeguard the protected varieties and the food quality. This study investigates the use of a portable FTIR spectrometer coupled to multivariate statistical analysis for the classification of raw hazelnuts. The method discriminates hazelnuts from different origins/cultivars based on differences of the signal intensities of their IR spectra. The multivariate classification methods, namely principal component analysis (PCA) followed by linear discriminant analysis (LDA) and partial least square discriminant analysis (PLS-DA), with or without variable selection, allowed a very good discrimination among the groups, with PLS-DA coupled to variable selection providing the best results. Due to the fast analysis, high sensitivity, simplicity and no sample preparation, the proposed analytical methodology could be successfully used to verify the cultivar of hazelnuts, and the analysis can be performed quickly and directly on site.

  6. Identification of Terpenoid Chemotypes Among High (-)-trans-Δ9- Tetrahydrocannabinol-Producing Cannabis sativa L. Cultivars.

    PubMed

    Fischedick, Justin T

    2017-01-01

    Introduction: With laws changing around the world regarding the legal status of Cannabis sativa (cannabis) it is important to develop objective classification systems that help explain the chemical variation found among various cultivars. Currently cannabis cultivars are named using obscure and inconsistent nomenclature. Terpenoids, responsible for the aroma of cannabis, are a useful group of compounds for distinguishing cannabis cultivars with similar cannabinoid content. Methods: In this study we analyzed terpenoid content of cannabis samples obtained from a single medical cannabis dispensary in California over the course of a year. Terpenoids were quantified by gas chromatography with flame ionization detection and peak identification was confirmed with gas chromatography mass spectrometry. Quantitative data from 16 major terpenoids were analyzed using hierarchical clustering analysis (HCA), principal component analysis (PCA), partial least squares discriminant analysis (PLS-DA), and orthogonal partial least squares discriminant analysis (OPLS-DA). Results: A total of 233 samples representing 30 cultivars were used to develop a classification scheme based on quantitative data, HCA, PCA, and OPLS-DA. Initially cultivars were divided into five major groups, which were subdivided into 13 classes based on differences in terpenoid profile. Different classification models were compared with PLS-DA and found to perform best when many representative samples of a particular class were included. Conclusion: A hierarchy of terpenoid chemotypes was observed in the data set. Some cultivars fit into distinct chemotypes, whereas others seemed to represent a continuum of chemotypes. This study has demonstrated an approach to classifying cannabis cultivars based on terpenoid profile.

  7. Rapid classification of pharmaceutical ingredients with Raman spectroscopy using compressive detection strategy with PLS-DA multivariate filters.

    PubMed

    Cebeci Maltaş, Derya; Kwok, Kaho; Wang, Ping; Taylor, Lynne S; Ben-Amotz, Dor

    2013-06-01

    Identifying pharmaceutical ingredients is a routine procedure required during industrial manufacturing. Here we show that a recently developed Raman compressive detection strategy can be employed to classify various widely used pharmaceutical materials using a hybrid supervised/unsupervised strategy in which only two ingredients are used for training and yet six other ingredients can also be distinguished. More specifically, our liquid crystal spatial light modulator (LC-SLM) based compressive detection instrument is trained using only the active ingredient, tadalafil, and the excipient, lactose, but is tested using these and various other excipients; microcrystalline cellulose, magnesium stearate, titanium (IV) oxide, talc, sodium lauryl sulfate and hydroxypropyl cellulose. Partial least squares discriminant analysis (PLS-DA) is used to generate the compressive detection filters necessary for fast chemical classification. Although the filters used in this study are trained on only lactose and tadalafil, we show that all the pharmaceutical ingredients mentioned above can be differentiated and classified using PLS-DA compressive detection filters with an accumulation time of 10ms per filter. Copyright © 2013 Elsevier B.V. All rights reserved.

  8. Application of Near Infrared Reflectance Spectroscopy for Rapid and Non-Destructive Discrimination of Hulled Barley, Naked Barley, and Wheat Contaminated with Fusarium

    PubMed Central

    Lim, Jongguk; Kim, Giyoung; Mo, Changyeun; Oh, Kyoungmin; Kim, Geonseob; Ham, Hyeonheui; Kim, Seongmin; Kim, Moon S.

    2018-01-01

    Fusarium is a common fungal disease in grains that reduces the yield of barley and wheat. In this study, a near infrared reflectance spectroscopic technique was used with a statistical prediction model to rapidly and non-destructively discriminate grain samples contaminated with Fusarium. Reflectance spectra were acquired from hulled barley, naked barley, and wheat samples contaminated with Fusarium using near infrared reflectance (NIR) spectroscopy with a wavelength range of 1175–2170 nm. After measurement, the samples were cultured in a medium to discriminate contaminated samples. A partial least square discrimination analysis (PLS-DA) prediction model was developed using the acquired reflectance spectra and the culture results. The correct classification rate (CCR) of Fusarium for the hulled barley, naked barley, and wheat samples developed using raw spectra was 98% or higher. The accuracy of discrimination prediction improved when second and third-order derivative pretreatments were applied. The grains contaminated with Fusarium could be rapidly discriminated using spectroscopy technology and a PLS-DA discrimination model, and the potential of the non-destructive discrimination method could be verified. PMID:29301319

  9. Identification of Coffee Varieties Using Laser-Induced Breakdown Spectroscopy and Chemometrics.

    PubMed

    Zhang, Chu; Shen, Tingting; Liu, Fei; He, Yong

    2017-12-31

    We linked coffee quality to its different varieties. This is of interest because the identification of coffee varieties should help coffee trading and consumption. Laser-induced breakdown spectroscopy (LIBS) combined with chemometric methods was used to identify coffee varieties. Wavelet transform (WT) was used to reduce LIBS spectra noise. Partial least squares-discriminant analysis (PLS-DA), radial basis function neural network (RBFNN), and support vector machine (SVM) were used to build classification models. Loadings of principal component analysis (PCA) were used to select the spectral variables contributing most to the identification of coffee varieties. Twenty wavelength variables corresponding to C I, Mg I, Mg II, Al II, CN, H, Ca II, Fe I, K I, Na I, N I, and O I were selected. PLS-DA, RBFNN, and SVM models on selected wavelength variables showed acceptable results. SVM and RBFNN models performed better with a classification accuracy of over 80% in the prediction set, for both full spectra and the selected variables. The overall results indicated that it was feasible to use LIBS and chemometric methods to identify coffee varieties. For further studies, more samples are needed to produce robust classification models, research should be conducted on which methods to use to select spectral peaks that correspond to the elements contributing most to identification, and the methods for acquiring stable spectra should also be studied.

  10. Identification of Coffee Varieties Using Laser-Induced Breakdown Spectroscopy and Chemometrics

    PubMed Central

    Zhang, Chu; Shen, Tingting

    2017-01-01

    We linked coffee quality to its different varieties. This is of interest because the identification of coffee varieties should help coffee trading and consumption. Laser-induced breakdown spectroscopy (LIBS) combined with chemometric methods was used to identify coffee varieties. Wavelet transform (WT) was used to reduce LIBS spectra noise. Partial least squares-discriminant analysis (PLS-DA), radial basis function neural network (RBFNN), and support vector machine (SVM) were used to build classification models. Loadings of principal component analysis (PCA) were used to select the spectral variables contributing most to the identification of coffee varieties. Twenty wavelength variables corresponding to C I, Mg I, Mg II, Al II, CN, H, Ca II, Fe I, K I, Na I, N I, and O I were selected. PLS-DA, RBFNN, and SVM models on selected wavelength variables showed acceptable results. SVM and RBFNN models performed better with a classification accuracy of over 80% in the prediction set, for both full spectra and the selected variables. The overall results indicated that it was feasible to use LIBS and chemometric methods to identify coffee varieties. For further studies, more samples are needed to produce robust classification models, research should be conducted on which methods to use to select spectral peaks that correspond to the elements contributing most to identification, and the methods for acquiring stable spectra should also be studied. PMID:29301228

  11. Non-destructive technique for determining the viability of soybean (Glycine max) seeds using FT-NIR spectroscopy.

    PubMed

    Kusumaningrum, Dewi; Lee, Hoonsoo; Lohumi, Santosh; Mo, Changyeun; Kim, Moon S; Cho, Byoung-Kwan

    2018-03-01

    The viability of seeds is important for determining their quality. A high-quality seed is one that has a high capability of germination that is necessary to ensure high productivity. Hence, developing technology for the detection of seed viability is a high priority in agriculture. Fourier transform near-infrared (FT-NIR) spectroscopy is one of the most popular vibrational spectroscopy techniques. This study aims to use FT-NIR spectroscopy to determine the viability of soybean seeds. Viable seeds and artificially aged seeds (as non-viable soybeans) were used in this research. The FT-NIR spectra of soybean seeds were collected and analysed using a partial least-squares discriminant analysis (PLS-DA) to classify viable and non-viable soybean seeds. Moreover, the variable importance in projection (VIP) method for variable selection combined with the PLS-DA was employed. The most effective wavelengths were selected by the VIP method, which selected 146 optimal variables from the full set of 1557 variables. The results demonstrated that the FT-NIR spectral analysis with the PLS-DA method that uses all variables or the selected variables showed good performance based on the high value of prediction accuracy for soybean viability with an accuracy close to 100%. Hence, FT-NIR techniques with a chemometric analysis have the potential for rapidly measuring soybean seed viability. © 2017 Society of Chemical Industry.
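
The soybean record above selects wavelengths by variable importance in projection (VIP) without giving the formula. The sketch below uses the commonly cited definition for a single-response PLS model, VIP_j = sqrt(p * sum_a(SS_a * (w_ja / ||w_a||)^2) / sum_a(SS_a)), where SS_a = q_a^2 * t_a't_a is the response variance explained by latent variable a. The inputs (weight vectors, score vectors, y-loadings) are assumed to come from an already fitted PLS model; the function name and the "VIP > 1" cut-off in the usage lines are a common rule of thumb, not a threshold stated in the abstract.

```python
import math


def vip_scores(weights, scores, y_loadings):
    """VIP for each of p variables in a single-response PLS model.

    weights:    A weight vectors, each of length p (one per latent variable)
    scores:     A score vectors, each of length n (one per latent variable)
    y_loadings: A floats, the y-loading q_a of each latent variable
    """
    n_vars = len(weights[0])
    # Response variance explained by each latent variable: q_a^2 * t_a't_a
    explained = [q * q * sum(t * t for t in t_a)
                 for q, t_a in zip(y_loadings, scores)]
    total = sum(explained)
    vips = []
    for j in range(n_vars):
        acc = 0.0
        for w_a, ss_a in zip(weights, explained):
            norm_sq = sum(w * w for w in w_a)
            acc += ss_a * (w_a[j] ** 2) / norm_sq
        vips.append(math.sqrt(n_vars * acc / total))
    return vips


# Hypothetical two-component model with three variables
vips = vip_scores(
    weights=[[0.8, 0.6, 0.0], [0.0, 0.6, 0.8]],
    scores=[[1.0, -1.0, 2.0], [0.5, 0.5, -1.0]],
    y_loadings=[0.9, 0.3],
)
selected = [j for j, v in enumerate(vips) if v > 1.0]  # rule-of-thumb cut-off
```

By construction the squared VIP values average to 1 across variables, which is why 1 is a popular threshold for keeping a variable.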

  12. Characterization of the volatile components in green tea by IRAE-HS-SPME/GC-MS combined with multivariate analysis.

    PubMed

    Yang, Yan-Qin; Yin, Hong-Xu; Yuan, Hai-Bo; Jiang, Yong-Wen; Dong, Chun-Wang; Deng, Yu-Liang

    2018-01-01

    In the present work, a novel infrared-assisted extraction coupled to headspace solid-phase microextraction (IRAE-HS-SPME) followed by gas chromatography-mass spectrometry (GC-MS) was developed for rapid determination of the volatile components in green tea. The extraction parameters such as fiber type, sample amount, infrared power, extraction time, and infrared lamp distance were optimized by orthogonal experimental design. Under optimum conditions, a total of 82 volatile compounds in 21 green tea samples from different geographical origins were identified. Compared with classical water-bath heating, the proposed technique has remarkable advantages of considerably reducing the analytical time and high efficiency. In addition, an effective classification of green teas based on their volatile profiles was achieved by partial least square-discriminant analysis (PLS-DA) and hierarchical clustering analysis (HCA). Furthermore, the application of a dual criterion based on the variable importance in the projection (VIP) values of the PLS-DA models and on the category from one-way univariate analysis (ANOVA) allowed the identification of 12 potential volatile markers, which were considered to make the most important contribution to the discrimination of the samples. The results suggest that IRAE-HS-SPME/GC-MS technique combined with multivariate analysis offers a valuable tool to assess geographical traceability of different tea varieties.

  13. Characterization of the volatile components in green tea by IRAE-HS-SPME/GC-MS combined with multivariate analysis

    PubMed Central

    Yin, Hong-Xu; Yuan, Hai-Bo; Jiang, Yong-Wen; Dong, Chun-Wang; Deng, Yu-Liang

    2018-01-01

    In the present work, a novel infrared-assisted extraction coupled to headspace solid-phase microextraction (IRAE-HS-SPME) followed by gas chromatography-mass spectrometry (GC-MS) was developed for rapid determination of the volatile components in green tea. The extraction parameters such as fiber type, sample amount, infrared power, extraction time, and infrared lamp distance were optimized by orthogonal experimental design. Under optimum conditions, a total of 82 volatile compounds in 21 green tea samples from different geographical origins were identified. Compared with classical water-bath heating, the proposed technique has remarkable advantages of considerably reducing the analytical time and high efficiency. In addition, an effective classification of green teas based on their volatile profiles was achieved by partial least square-discriminant analysis (PLS-DA) and hierarchical clustering analysis (HCA). Furthermore, the application of a dual criterion based on the variable importance in the projection (VIP) values of the PLS-DA models and on the category from one-way univariate analysis (ANOVA) allowed the identification of 12 potential volatile markers, which were considered to make the most important contribution to the discrimination of the samples. The results suggest that IRAE-HS-SPME/GC-MS technique combined with multivariate analysis offers a valuable tool to assess geographical traceability of different tea varieties. PMID:29494626

  14. PCA as a practical indicator of OPLS-DA model reliability.

    PubMed

    Worley, Bradley; Powers, Robert

    Principal Component Analysis (PCA) and Orthogonal Projections to Latent Structures Discriminant Analysis (OPLS-DA) are powerful statistical modeling tools that provide insights into separations between experimental groups based on high-dimensional spectral measurements from NMR, MS or other analytical instrumentation. However, when used without validation, these tools may lead investigators to statistically unreliable conclusions. This danger is especially real for Partial Least Squares (PLS) and OPLS, which aggressively force separations between experimental groups. As a result, OPLS-DA is often used as an alternative method when PCA fails to expose group separation, but this practice is highly dangerous. Without rigorous validation, OPLS-DA can easily yield statistically unreliable group separation. A Monte Carlo analysis of PCA group separations and OPLS-DA cross-validation metrics was performed on NMR datasets with statistically significant separations in scores-space. A linearly increasing amount of Gaussian noise was added to each data matrix followed by the construction and validation of PCA and OPLS-DA models. With increasing added noise, the PCA scores-space distance between groups rapidly decreased and the OPLS-DA cross-validation statistics simultaneously deteriorated. A decrease in correlation between the estimated loadings (added noise) and the true (original) loadings was also observed. While the validity of the OPLS-DA model diminished with increasing added noise, the group separation in scores-space remained basically unaffected. Supported by the results of Monte Carlo analyses of PCA group separations and OPLS-DA cross-validation metrics, we provide practical guidelines and cross-validatory recommendations for reliable inference from PCA and OPLS-DA models.

  15. In vivo diagnosis of cervical precancer using Raman spectroscopy and genetic algorithm techniques.

    PubMed

    Duraipandian, Shiyamala; Zheng, Wei; Ng, Joseph; Low, Jeffrey J H; Ilancheran, A; Huang, Zhiwei

    2011-10-21

    This study aimed to evaluate the clinical utility of applying near-infrared (NIR) Raman spectroscopy and genetic algorithm-partial least squares-discriminant analysis (GA-PLS-DA) to identify biomolecular changes of cervical tissues associated with dysplastic transformation during colposcopic examination. A total of 105 in vivo Raman spectra were measured from 57 cervical sites (35 normal and 22 precancer sites) of 29 patients recruited, in which 65 spectra were from normal sites, while 40 spectra were from cervical precancerous lesions (i.e., 7 low-grade CIN and 33 high-grade CIN). The GA feature selection technique incorporated with PLS was utilized to study the significant biochemical Raman bands for differentiation between normal and precancer cervical tissues. The GA-PLS-DA algorithm with double cross-validation (dCV) identified seven diagnostically significant Raman bands in the ranges of 925-935, 979-999, 1080-1090, 1240-1260, 1320-1340, 1400-1420, and 1625-1645 cm-1 related to proteins, nucleic acids and lipids in tissue, and yielded a diagnostic accuracy of 82.9% (sensitivity of 72.5% (29/40) and specificity of 89.2% (58/65)) for precancer detection. The results of this exploratory study suggest that Raman spectroscopy in conjunction with GA-PLS-DA and dCV methods has the potential to provide clinically significant discrimination between normal and precancer cervical tissues at the molecular level.
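
    The accuracy, sensitivity and specificity quoted in this abstract follow directly from the counts it gives (29 of 40 precancer spectra and 58 of 65 normal spectra classified correctly). The short sketch below only redoes that arithmetic; the function name and argument names are illustrative and do not come from the paper.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Return (sensitivity, specificity, accuracy) as fractions."""
    sensitivity = tp / (tp + fn)          # correctly detected positives
    specificity = tn / (tn + fp)          # correctly rejected negatives
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return sensitivity, specificity, accuracy


# Counts taken from the abstract: 29/40 precancer and 58/65 normal spectra correct.
sens, spec, acc = diagnostic_metrics(tp=29, fn=40 - 29, tn=58, fp=65 - 58)
print(f"sensitivity={sens:.1%} specificity={spec:.1%} accuracy={acc:.1%}")
```

    The overall accuracy is (29 + 58) / 105, which rounds to the reported 82.9%.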

  16. Application of a Novel S3 Nanowire Gas Sensor Device in Parallel with GC-MS for the Identification of Rind Percentage of Grated Parmigiano Reggiano.

    PubMed

    Abbatangelo, Marco; Núñez-Carmona, Estefanía; Sberveglieri, Veronica; Zappa, Dario; Comini, Elisabetta; Sberveglieri, Giorgio

    2018-05-18

    Parmigiano Reggiano cheese is one of the most appreciated and consumed foods worldwide, especially in Italy, for its high content of nutrients and taste. However, these characteristics make this product subject to counterfeiting in different forms. In this study, a novel method based on an electronic nose has been developed to investigate the potential of this tool to distinguish rind percentages in grated Parmigiano Reggiano packages that should be lower than 18%. Different samples, in terms of percentage, seasoning and rind working process, were considered to tackle the problem at 360°. In parallel, GC-MS technique was used to give a name to the compounds that characterize Parmigiano and to relate them to sensor responses. Data analysis consisted of two stages: Multivariate analysis (PLS) and classification made in a hierarchical way with PLS-DA and ANNs. Results were promising, in terms of correct classification of the samples. The correct classification rate (%) was higher for ANNs than PLS-DA, with correct identification approaching 100 percent.

  17. Classification of type 2 diabetes rats based on urine amino acids metabolic profiling by liquid chromatography coupled with tandem mass spectrometry.

    PubMed

    Wang, Chunyan; Zhu, Hongbin; Pi, Zifeng; Song, Fengrui; Liu, Zhiqiang; Liu, Shuying

    2013-09-15

    An analytical method for quantifying underivatized amino acids (AAs) in urine samples of rats was developed by using liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS). Classification of type 2 diabetes rats was based on urine amino acids metabolic profiling. LC-MS/MS analysis was applied through chromatographic separation and multiple reaction monitoring (MRM) transitions of MS/MS. Multivariate profile-wide predictive models were constructed using partial least squares discriminant analysis (PLS-DA) by SIMCA-P 11.5 version software package and hierarchical cluster analysis (HCA) by SPSS 18.0 version software. Some amino acids in the urine of the rats changed significantly. The results of the present study prove that this method could perform the quantification of free AAs in urine of rats by using LC-MS/MS. In summary, the PLS-DA and HCA statistical analysis in our research were preferable to differentiate healthy rats and type 2 diabetes rats by the quantification of AAs in their urine samples. In addition, compared with the healthy group, the seven increased amino acids in urine of type 2 diabetes rats were returned to normal under the treatment of acarbose. Copyright © 2013 Elsevier B.V. All rights reserved.

  18. Cerebrospinal fluid metabolomic profiling in tuberculous and viral meningitis: Screening potential markers for differential diagnosis.

    PubMed

    Li, Zihui; Du, Boping; Li, Jing; Zhang, Jinli; Zheng, Xiaojing; Jia, Hongyan; Xing, Aiying; Sun, Qi; Liu, Fei; Zhang, Zongde

    2017-03-01

    Tuberculous meningitis (TBM) is the most severe and frequent form of central nervous system tuberculosis. The current lack of efficient diagnostic tests makes it difficult to differentiate TBM from other common types of meningitis, especially viral meningitis (VM). Metabolomics is an important tool to identify disease-specific biomarkers. However, little metabolomic information is available on adult TBM. We used 1H nuclear magnetic resonance-based metabolomics to investigate the metabolic features of the CSF from 18 TBM and 20 VM patients. Principal component analysis and orthogonal signal correction-partial least squares-discriminant analysis (OSC-PLS-DA) were applied to analyze profiling data. Metabolites were identified using the Human Metabolome Database and pathway analysis was performed with MetaboAnalyst 3.0. The OSC-PLS-DA model could distinguish TBM from VM with high reliability. A total of 25 key metabolites that contributed to their discrimination were identified, including some, such as betaine and cyclohexane, rarely reported before in TBM. Pathway analysis indicated that amino acid and energy metabolism was significantly different in the CSF of TBM compared with VM. Twenty-five key metabolites identified in our study may be potential biomarkers for TBM differential diagnosis and are worthy of further investigation. Copyright © 2017 Elsevier B.V. All rights reserved.

  19. Using near infrared spectroscopy to classify soybean oil according to expiration date.

    PubMed

    da Costa, Gean Bezerra; Fernandes, David Douglas Sousa; Gomes, Adriano A; de Almeida, Valber Elias; Veras, Germano

    2016-04-01

    A rapid and non-destructive methodology is proposed for the screening of edible vegetable oils according to conservation state expiration date employing near infrared (NIR) spectroscopy and chemometric tools. A total of fifty samples of soybean vegetable oil, of different brands and lots, were used in this study; these included thirty expired and twenty non-expired samples. The oil oxidation was measured by peroxide index. NIR spectra were employed in raw form and preprocessed by offset baseline correction and Savitzky-Golay derivative procedure, followed by PCA exploratory analysis, which showed that NIR spectra would be suitable for the classification task of soybean oil samples. The classification models were based on SPA-LDA (Linear Discriminant Analysis coupled with Successive Projection Algorithm) and PLS-DA (Discriminant Analysis by Partial Least Squares). The set of samples (50) was partitioned into two groups of training (35 samples: 15 non-expired and 20 expired) and test samples (15 samples: 5 non-expired and 10 expired) using sample-selection approaches: (i) Kennard-Stone, (ii) Duplex, and (iii) Random, in order to evaluate the robustness of the models. The obtained results for the independent test set (in terms of correct classification rate) were 96% and 98% for SPA-LDA and PLS-DA, respectively, indicating that the NIR spectra can be used as an alternative to evaluate the degree of oxidation of soybean oil samples. Copyright © 2015 Elsevier Ltd. All rights reserved.
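
    For readers unfamiliar with the Kennard-Stone partition mentioned in this abstract, the following is a minimal sketch of the usual textbook rule, assuming Euclidean distance: start from the two most distant samples, then repeatedly add the sample whose nearest already-selected neighbour is farthest away. The abstract does not describe the authors' implementation; the function name and the toy data are hypothetical.

```python
from math import dist


def kennard_stone(samples, n_select):
    """Return indices of n_select samples chosen by the Kennard-Stone rule."""
    n = len(samples)
    if not 2 <= n_select <= n:
        raise ValueError("n_select must be between 2 and len(samples)")

    # Pairwise Euclidean distances between all samples.
    d = [[dist(a, b) for b in samples] for a in samples]

    # Seed the selection with the two most distant samples.
    pairs = ((i, j) for i in range(n) for j in range(i + 1, n))
    first, second = max(pairs, key=lambda p: d[p[0]][p[1]])
    selected = [first, second]
    remaining = [k for k in range(n) if k not in selected]

    while len(selected) < n_select:
        # Add the candidate that is farthest from its nearest selected sample.
        nxt = max(remaining, key=lambda k: min(d[k][s] for s in selected))
        selected.append(nxt)
        remaining.remove(nxt)
    return selected


# Toy one-dimensional "spectra" (made-up values, one feature per sample).
toy = [(0.0,), (1.0,), (2.0,), (10.0,)]
train_idx = kennard_stone(toy, 3)
test_idx = [k for k in range(len(toy)) if k not in train_idx]
```

    In a setting like the one described, the selected indices would form the training set and the remaining samples the test set; applying the rule within each class is one way to obtain fixed per-class counts, although the abstract does not say whether this was done.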

  20. Gait dynamics to optimize fall risk assessment in geriatric patients admitted to an outpatient diagnostic clinic

    PubMed Central

    de Groot, Maartje H.; van Campen, Jos P.; Beijnen, Jos H.; Hortobágyi, Tibor; Vuillerme, Nicolas; Lamoth, Claudine C. J.

    2017-01-01

    Fall prediction in geriatric patients remains challenging because the increased fall risk involves multiple, interrelated factors caused by natural aging and/or pathology. Therefore, we used a multi-factorial statistical approach to model categories of modifiable fall risk factors among geriatric patients to identify fallers with highest sensitivity and specificity with a focus on gait performance. Patients (n = 61, age = 79; 41% fallers) underwent extensive screening in three categories: (1) patient characteristics (e.g., handgrip strength, medication use, osteoporosis-related factors) (2) cognitive function (global cognition, memory, executive function), and (3) gait performance (speed-related and dynamic outcomes assessed by tri-axial trunk accelerometry). Falls were registered prospectively (mean follow-up 8.6 months) and one year retrospectively. Principal Component Analysis (PCA) on 11 gait variables was performed to determine underlying gait properties. Three fall-classification models were then built using Partial Least Squares–Discriminant Analysis (PLS-DA), with separate and combined analyses of the fall risk factors. PCA identified ‘pace’, ‘variability’, and ‘coordination’ as key properties of gait. The best PLS-DA model produced a fall classification accuracy of AUC = 0.93. The specificity of the model using patient characteristics was 60% but reached 80% when cognitive and gait outcomes were added. The inclusion of cognition and gait dynamics in fall classification models reduced misclassification. We therefore recommend assessing geriatric patients’ fall risk using a multi-factorial approach that incorporates patient characteristics, cognition, and gait dynamics. PMID:28575126

  1. Gait dynamics to optimize fall risk assessment in geriatric patients admitted to an outpatient diagnostic clinic.

    PubMed

    Kikkert, Lisette H J; de Groot, Maartje H; van Campen, Jos P; Beijnen, Jos H; Hortobágyi, Tibor; Vuillerme, Nicolas; Lamoth, Claudine C J

    2017-01-01

    Fall prediction in geriatric patients remains challenging because the increased fall risk involves multiple, interrelated factors caused by natural aging and/or pathology. Therefore, we used a multi-factorial statistical approach to model categories of modifiable fall risk factors among geriatric patients to identify fallers with highest sensitivity and specificity with a focus on gait performance. Patients (n = 61, age = 79; 41% fallers) underwent extensive screening in three categories: (1) patient characteristics (e.g., handgrip strength, medication use, osteoporosis-related factors) (2) cognitive function (global cognition, memory, executive function), and (3) gait performance (speed-related and dynamic outcomes assessed by tri-axial trunk accelerometry). Falls were registered prospectively (mean follow-up 8.6 months) and one year retrospectively. Principal Component Analysis (PCA) on 11 gait variables was performed to determine underlying gait properties. Three fall-classification models were then built using Partial Least Squares-Discriminant Analysis (PLS-DA), with separate and combined analyses of the fall risk factors. PCA identified 'pace', 'variability', and 'coordination' as key properties of gait. The best PLS-DA model produced a fall classification accuracy of AUC = 0.93. The specificity of the model using patient characteristics was 60% but reached 80% when cognitive and gait outcomes were added. The inclusion of cognition and gait dynamics in fall classification models reduced misclassification. We therefore recommend assessing geriatric patients' fall risk using a multi-factorial approach that incorporates patient characteristics, cognition, and gait dynamics.

  2. Nuclear Forensic Inferences Using Iterative Multidimensional Statistics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Robel, M; Kristo, M J; Heller, M A

    2009-06-09

    Nuclear forensics involves the analysis of interdicted nuclear material for specific material characteristics (referred to as 'signatures') that imply specific geographical locations, production processes, culprit intentions, etc. Predictive signatures rely on expert knowledge of physics, chemistry, and engineering to develop inferences from these material characteristics. Comparative signatures, on the other hand, rely on comparison of the material characteristics of the interdicted sample (the 'questioned sample' in FBI parlance) with those of a set of known samples. In the ideal case, the set of known samples would be a comprehensive nuclear forensics database, a database which does not currently exist. In fact, our ability to analyze interdicted samples and produce an extensive list of precise materials characteristics far exceeds our ability to interpret the results. Therefore, as we seek to develop the extensive databases necessary for nuclear forensics, we must also develop the methods necessary to produce the necessary inferences from comparison of our analytical results with these large, multidimensional sets of data. In the work reported here, we used a large, multidimensional dataset of results from quality control analyses of uranium ore concentrate (UOC, sometimes called 'yellowcake'). We have found that traditional multidimensional techniques, such as principal components analysis (PCA), are especially useful for understanding such datasets and drawing relevant conclusions. In particular, we have developed an iterative partial least squares-discriminant analysis (PLS-DA) procedure that has proven especially adept at identifying the production location of unknown UOC samples. By removing classes which fell far outside the initial decision boundary, and then rebuilding the PLS-DA model, we have consistently produced better and more definitive attributions than with a single pass classification approach. Performance of the iterative PLS-DA method compared favorably to that of classification and regression tree (CART) and k nearest neighbor (KNN) algorithms, with the best combination of accuracy and robustness, as tested by classifying samples measured independently in our laboratories against the vendor QC based reference set.

  3. Detection of sunn pest-damaged wheat samples using visible/near-infrared spectroscopy based on pattern recognition.

    PubMed

    Basati, Zahra; Jamshidi, Bahareh; Rasekh, Mansour; Abbaspour-Gilandeh, Yousef

    2018-05-30

    The presence of sunn pest-damaged grains in wheat mass reduces the quality of flour and bread produced from it. Therefore, it is essential to assess the quality of the samples in collecting and storage centers of wheat and flour mills. In this research, the capability of visible/near-infrared (Vis/NIR) spectroscopy combined with pattern recognition methods was investigated for discrimination of wheat samples with different percentages of sunn pest damage. To this end, various samples belonging to five classes (healthy and 5%, 10%, 15% and 20% unhealthy) were analyzed using Vis/NIR spectroscopy (wavelength range of 350-1000 nm) based on both supervised and unsupervised pattern recognition methods. Principal component analysis (PCA) and hierarchical cluster analysis (HCA) as the unsupervised techniques and soft independent modeling of class analogies (SIMCA) and partial least squares-discriminant analysis (PLS-DA) as supervised methods were used. The results showed that Vis/NIR spectra of healthy samples were correctly clustered using both PCA and HCA. Due to the high overlapping between the four unhealthy classes (5%, 10%, 15% and 20%), it was not possible to discriminate all the unhealthy samples in individual classes. However, when considering only the two main categories of healthy and unhealthy, an acceptable degree of separation between the classes can be obtained after classification with supervised pattern recognition methods of SIMCA and PLS-DA. SIMCA based on PCA modeling correctly classified samples in two classes of healthy and unhealthy with classification accuracy of 100%. Moreover, the wavelengths of 839 nm, 918 nm and 995 nm had more power than other wavelengths to discriminate the two classes of healthy and unhealthy. It was also concluded that PLS-DA provides excellent classification results of healthy and unhealthy samples (R2 = 0.973 and RMSECV = 0.057). Therefore, Vis/NIR spectroscopy based on pattern recognition techniques can be useful for rapidly distinguishing the healthy wheat samples from those damaged by sunn pest in the maintenance and processing centers. Copyright © 2018 Elsevier B.V. All rights reserved.

  4. [NIR Fingerprints of Different Medicinal Parts of Angelicae Sinensis Radix].

    PubMed

    Zhang, Ya-ya; Gu, Zhi-rong; Ding, Jun-xia; Wang, Yao-peng; Sun, Yu-jing; Wang, Ya-li

    2015-07-01

    To investigate the spectrum characteristics of near-infrared diffuse reflectance spectroscopy (NIR) fingerprint of different medicinal parts of Angelicae Sinensis Radix. 96 batches of samples were collected from 14 counties of Gansu Province and Yunnan Province. The NIR fingerprints were collected by integrated sphere. Similarity analysis and partial least square discriminant analysis (PLS-DA) were used to analyze the fingerprint. The average spectrum of NIR fingerprint of different medicinal parts of Angelicae Sinensis Radix showed some differences; the absorbance in characteristic absorption was in a decreasing order of body > tail > head > whole. Most NIR fingerprint similarities of different medicinal parts of Angelicae Sinensis Radix exceeded 0.95. The established model of PLS-DA could be used to accurately classify the medicinal parts of Angelicae Sinensis Radix. The differences of NIR fingerprints of different medicinal parts of Angelicae Sinensis Radix were mainly existing in the wave number ranges of 8,443 - 8,284 cm-1, 7,003 - 6,896 cm-1, 6,102 - 5,864 cm-1, 4,847 - 4,674 cm-1, and 4,386 - 4,208 cm-1. The different medicinal parts of Angelicae Sinensis Radix have some differences in chemical components.

  5. Influence of metal substrates on the detection of explosive residues with laser-induced breakdown spectroscopy.

    PubMed

    Gottfried, Jennifer L

    2013-02-01

    Laser-induced breakdown spectroscopy is a promising approach for explosive residue detection, but several limitations to its widespread use remain. One issue is that the emission spectra of the residues are dependent on the substrate composition because some of the substrate is usually entrained in the laser-induced plasma and the laser-material interaction can be significantly affected by the substrate type. Here, we have demonstrated that despite the strong spectral variation in cyclotrimethylenetrinitramine (RDX) residues applied to various metal substrates, classification of the RDX residue independent of substrate type is feasible. Several approaches to improving the chemometric models based on partial least squares discriminant analysis (PLS-DA) have been described: classifying the RDX residue spectra together in one class independent of substrate, using selected emission intensities and ratios to increase the true positive rate (TPR) and decrease the false positive rate (FPR), and fusing the results from two PLS-DA models generated using the full broadband spectra and selected intensities and ratios. The combination of these approaches resulted in a TPR of 97.5% and a FPR of 1.0% for RDX classification on metal substrates.

  6. Second trimester maternal urine for the diagnosis of trisomy 21 and prediction of poor pregnancy outcomes.

    PubMed

    Diaz, Sílvia O; Barros, António S; Goodfellow, Brian J; Duarte, Iola F; Galhano, Eulália; Pita, Cristina; Almeida, Maria do Céu; Carreira, Isabel M; Gil, Ana M

    2013-06-07

    Given the recognized lack of prenatal clinical methods for the early diagnosis of preterm delivery, intrauterine growth restriction, preeclampsia and gestational diabetes mellitus, and the continuing need for optimized diagnosis methods for specific chromosomal disorders (e.g., trisomy 21) and fetal malformations, this work sought specific metabolic signatures of these conditions in second trimester maternal urine, using 1H Nuclear Magnetic Resonance (1H NMR) metabolomics. Several variable importance to the projection (VIP)- and b-coefficient-based variable selection methods were tested, both individually and through their intersection, and the resulting data sets were analyzed by partial least-squares discriminant analysis (PLS-DA) and submitted to Monte Carlo cross validation (MCCV) and permutation tests to evaluate model predictive power. The NMR data subsets produced significantly improved PLS-DA models for all conditions except for pre-premature rupture of membranes. Specific urinary metabolic signatures were unveiled for central nervous system malformations, trisomy 21, preterm delivery, gestational diabetes, intrauterine growth restriction and preeclampsia, and biochemical interpretations were proposed. This work demonstrated, for the first time, the value of maternal urine profiling as a complementary means of prenatal diagnostics and early prediction of several poor pregnancy outcomes.

  7. Variable selection based on clustering analysis for improvement of polyphenols prediction in green tea using synchronous fluorescence spectra.

    PubMed

    Shan, Jiajia; Wang, Xue; Zhou, Hao; Han, Shuqing; Riza, Dimas Firmanda Al; Kondo, Naoshi

    2018-03-13

    Synchronous fluorescence spectra combined with multivariate analysis were used to predict flavonoids content in green tea rapidly and nondestructively. This paper presented a new and efficient spectral intervals selection method called clustering based partial least square (CL-PLS), which selected informative wavelengths by combining clustering concept and partial least square (PLS) methods to improve models' performance by synchronous fluorescence spectra. The fluorescence spectra of tea samples were obtained and k-means and Kohonen self-organizing map clustering algorithms were carried out to cluster full spectra into several clusters, and a sub-PLS regression model was developed on each cluster. Finally, CL-PLS models consisting of gradually selected clusters were built. Correlation coefficient (R) was used to evaluate the effect on prediction performance of PLS models. In addition, variable influence on projection partial least square (VIP-PLS), selectivity ratio partial least square (SR-PLS), interval partial least square (iPLS) models and full spectra PLS model were investigated and the results were compared. The results showed that CL-PLS presented the best result for flavonoids prediction using synchronous fluorescence spectra.

  8. Evaluation of Classifier Performance for Multiclass Phenotype Discrimination in Untargeted Metabolomics.

    PubMed

    Trainor, Patrick J; DeFilippis, Andrew P; Rai, Shesh N

    2017-06-21

    Statistical classification is a critical component of utilizing metabolomics data for examining the molecular determinants of phenotypes. Despite this, a comprehensive and rigorous evaluation of the accuracy of classification techniques for phenotype discrimination given metabolomics data has not been conducted. We conducted such an evaluation using both simulated and real metabolomics datasets, comparing Partial Least Squares-Discriminant Analysis (PLS-DA), Sparse PLS-DA, Random Forests, Support Vector Machines (SVM), Artificial Neural Network, k-Nearest Neighbors (k-NN), and Naïve Bayes classification techniques for discrimination. We evaluated the techniques on simulated data generated to mimic global untargeted metabolomics data by incorporating realistic block-wise correlation and partial correlation structures for mimicking the correlations and metabolite clustering generated by biological processes. Over the simulation studies, covariance structures, means, and effect sizes were stochastically varied to provide consistent estimates of classifier performance over a wide range of possible scenarios. The effects of the presence of non-normal error distributions, the introduction of biological and technical outliers, unbalanced phenotype allocation, missing values due to abundances below a limit of detection, and the effect of prior-significance filtering (dimension reduction) were evaluated via simulation. In each simulation, classifier parameters, such as the number of hidden nodes in a Neural Network, were optimized by cross-validation to minimize the probability of detecting spurious results due to poorly tuned classifiers. Classifier performance was then evaluated using real metabolomics datasets of varying sample medium, sample size, and experimental design. We report that in the most realistic simulation studies that incorporated non-normal error distributions, unbalanced phenotype allocation, outliers, missing values, and dimension reduction, classifier performance (least to greatest error) was ranked as follows: SVM, Random Forest, Naïve Bayes, sPLS-DA, Neural Networks, PLS-DA and k-NN classifiers. When non-normal error distributions were introduced, the performance of PLS-DA and k-NN classifiers deteriorated further relative to the remaining techniques. Over the real datasets, a trend of better performance of SVM and Random Forest classifier performance was observed.

  9. Detection and identification of multiple adulterants in plant food supplements using attenuated total reflectance-Infrared spectroscopy.

    PubMed

    Deconinck, E; Aouadi, C; Bothy, J L; Courselle, P

    2018-04-15

    Due to the rising popularity of dietary supplements, especially plant food supplements, and alternative herbal medicines, a whole market developed and these products became freely available through the internet. However, several studies revealed that at least a part of these products, especially the ones obtained from websites disclosing their physical identity, are adulterated with pharmaceutical compounds. This causes a threat for public health, since these compounds are not declared and therefore adverse effects will not immediately be related to the product. Moreover, the adulterants can interfere with other medicinal treatments. Since the present active pharmaceutical ingredients are not declared on the package and the products are sold as 100% natural or herbal in nature, it is very difficult for customs personnel to discriminate between products to be confiscated or not. Therefore easy to apply analytical approaches to discriminate between adulterated and non-adulterated products are necessary. This paper presents an approach based on infrared spectroscopy combined with attenuated total reflectance (ATR) and partial least squares-discriminant analysis (PLS-DA) to easily differentiate between adulterated and non-adulterated plant food supplements and to get a first idea of the nature of the adulterant present. The performance of PLS-DA models based on Mid-IR and NIR data were compared as well as models based on the combined data. Further, three preprocessing strategies were compared. The best performance was obtained for a PLS-DA model using Mid-IR data with the second derivative as preprocessing method. This model showed a correct classification rate of 98.3% for an external test set. Also eight real samples were screened using the model and for seven of these samples a correct classification was obtained. Generally it could be concluded that the obtained model and the presented approach could be used at customs to discriminate between adulterated and non-adulterated herbal food supplements and even get a first idea of the nature of the adulterant present. Moreover, the presented approach hardly needs any sample preparation. Copyright © 2018 Elsevier B.V. All rights reserved.

  10. Variable selection based on clustering analysis for improvement of polyphenols prediction in green tea using synchronous fluorescence spectra

    NASA Astrophysics Data System (ADS)

    Shan, Jiajia; Wang, Xue; Zhou, Hao; Han, Shuqing; Riza, Dimas Firmanda Al; Kondo, Naoshi

    2018-04-01

    Synchronous fluorescence spectra combined with multivariate analysis were used to predict flavonoids content in green tea rapidly and nondestructively. This paper presented a new and efficient spectral intervals selection method called clustering based partial least square (CL-PLS), which selected informative wavelengths by combining clustering concept and partial least square (PLS) methods to improve models’ performance by synchronous fluorescence spectra. The fluorescence spectra of tea samples were obtained and k-means and Kohonen self-organizing map clustering algorithms were carried out to cluster full spectra into several clusters, and a sub-PLS regression model was developed on each cluster. Finally, CL-PLS models consisting of gradually selected clusters were built. Correlation coefficient (R) was used to evaluate the effect on prediction performance of PLS models. In addition, variable influence on projection partial least square (VIP-PLS), selectivity ratio partial least square (SR-PLS), interval partial least square (iPLS) models and full spectra PLS model were investigated and the results were compared. The results showed that CL-PLS presented the best result for flavonoids prediction using synchronous fluorescence spectra.

  11. Metabolite analysis distinguishes between mice with epidermolysis bullosa acquisita and healthy mice.

    PubMed

    Schönig, Sarah; Recke, Andreas; Hirose, Misa; Ludwig, Ralf J; Seeger, Karsten

    2013-06-26

    Epidermolysis bullosa acquisita (EBA) is a rare skin blistering disease with a prevalence of 0.2 per million people. EBA is characterized by autoantibodies against type VII collagen. Type VII collagen builds anchoring fibrils that are essential for the dermal-epidermal junction. The pathogenic relevance of antibodies against type VII collagen subdomains has been demonstrated both in vitro and in vivo. Despite the multitude of clinical and immunological data, no information on metabolic changes exists. We used an animal model of EBA to obtain insights into metabolomic changes during EBA. Sera from mice with immunization-induced EBA and control mice were obtained and metabolites were isolated by filtration. Proton nuclear magnetic resonance (NMR) spectra were recorded and analyzed by principal component analysis (PCA), partial least squares discrimination analysis (PLS-DA) and random forest. The metabolic pattern of immunized mice and control mice could be clearly distinguished with PCA and PLS-DA. Metabolites that contribute to the discrimination could be identified via random forest. The observed changes in the metabolic pattern of EBA sera, i.e. increased levels of amino acids, point toward an increased energy demand in EBA. Knowledge about metabolic changes due to EBA could help in future to assess the disease status during treatment. Confirming the metabolic changes in patients will probably require large cohorts.

  12. Diagnosis of coinfection by schistosomiasis and viral hepatitis B or C using 1H NMR-based metabonomics.

    PubMed

    Gouveia, Liana Ribeiro; Santos, Joelma Carvalho; Silva, Ronaldo Dionísio; Batista, Andrea Dória; Domingues, Ana Lúcia Coutinho; Lopes, Edmundo Pessoa de Almeida; Silva, Ricardo Oliveira

    2017-01-01

    Diagnosis of liver involvement due to schistosomiasis in asymptomatic patients from endemic areas previously diagnosed with chronic hepatitis B (HBV) or C (HCV) and periportal fibrosis is challenging. H-1 Nuclear Magnetic Resonance (NMR)-based metabonomics strategy is a powerful tool for providing a profile of endogenous metabolites of low molecular weight in biofluids in a non-invasive way. The aim of this study was to diagnose periportal fibrosis due to schistosomiasis mansoni in patients with chronic HBV or HCV infection through NMR-based metabonomics models. The study included 40 patients divided into two groups: (i) 18 coinfected patients with schistosomiasis mansoni and HBV or HCV; and (ii) 22 HBV or HCV monoinfected patients. The serum samples were analyzed through H-1 NMR spectroscopy and the models were based on Principal Component Analysis (PCA) and Partial Least Squares-Discriminant Analysis (PLS-DA). Ultrasonography examination was used to ascertain the diagnosis of periportal fibrosis. Exploratory analysis showed a clear separation between coinfected and monoinfected samples. The supervised model built from PLS-DA showed accuracy, R2 and Q2 values equal to 100%, 98.1% and 97.5%, respectively. According to the variable importance in the projection plot, lactate serum levels were higher in the coinfected group, while the signals attributed to HDL serum cholesterol were more intense in the monoinfected group. The metabonomics models constructed in this study are promising as an alternative tool for diagnosis of periportal fibrosis by schistosomiasis in patients with chronic HBV or HCV infection from endemic areas for Schistosoma mansoni.
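
    The abstract reports R2 and Q2 without saying how they were computed. Under the convention commonly used for PLS-DA models (an assumption here, not a statement about this study), both are one minus a residual sum of squares divided by the total sum of squares of the coded class labels: R2 uses predictions from the model fitted on all samples, Q2 uses cross-validated predictions. The sketch below illustrates that convention with made-up numbers; every name and value in it is hypothetical.

```python
def explained_fraction(y_true, y_pred):
    """Return 1 - (residual sum of squares) / (total sum of squares about the mean)."""
    mean_y = sum(y_true) / len(y_true)
    ss_tot = sum((y - mean_y) ** 2 for y in y_true)
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    return 1.0 - ss_res / ss_tot


# Hypothetical class labels coded as 1/0, with invented predictions.
y = [1, 1, 0, 0]
y_fitted = [0.9, 0.8, 0.1, 0.2]      # predictions from a model fitted on all samples
y_cross_val = [0.8, 0.7, 0.3, 0.2]   # prediction for each sample while it was held out

r2 = explained_fraction(y, y_fitted)       # goodness of fit
q2 = explained_fraction(y, y_cross_val)    # cross-validated predictive ability
print(f"R2={r2:.2f} Q2={q2:.2f}")
```

    A Q2 that sits well below R2 is usually read as a sign of overfitting, which is why the two values are reported side by side.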

  13. Comparing two metabolic profiling approaches (liquid chromatography and gas chromatography coupled to mass spectrometry) for extra-virgin olive oil phenolic compounds analysis: A botanical classification perspective.

    PubMed

    Bajoub, Aadil; Pacchiarotta, Tiziana; Hurtado-Fernández, Elena; Olmo-García, Lucía; García-Villalba, Rocío; Fernández-Gutiérrez, Alberto; Mayboroda, Oleg A; Carrasco-Pancorbo, Alegría

    2016-01-08

    Over the last decades, the phenolic compounds from virgin olive oil (VOO) have become the subject of intensive research because of their biological activities and their influence on some of the most relevant attributes of this interesting matrix. Developing metabolic profiling approaches to determine them in monovarietal virgin olive oils could help to gain a deeper insight into olive oil phenolic compounds composition as well as to promote their use for botanical origin tracing purposes. To this end, two approaches were comparatively investigated (LC-ESI-TOF MS and GC-APCI-TOF MS) to evaluate their capacity to properly classify 25 olive oil samples belonging to five different varieties (Arbequina, Cornicabra, Hojiblanca, Frantoio and Picual), using the entire chromatographic phenolic profiles combined to chemometrics (principal component analysis (PCA) and partial least square-discriminant analysis (PLS-DA)). The application of PCA to LC-MS and GC-MS data showed the natural clustering of the samples, seeing that 2 varieties were dominating the models (Arbequina and Frantoio), suppressing any possible discrimination among the other cultivars. Afterwards, PLS-DA was used to build four different efficient predictive models for varietal classification of the samples under study. The varietal markers pointed out by each platform were compared. In general, with the exception of one GC-MS model, all exhibited proper quality parameters. The models constructed by using the LC-MS data demonstrated superior classification ability. Copyright © 2015 Elsevier B.V. All rights reserved.

  14. Use of metabolomics and lipidomics to evaluate the hypocholestreolemic effect of Proanthocyanidins from grape seed in a pig model.

    PubMed

    Quifer-Rada, Paola; Choy, Ying Yng; Calvert, Christopher C; Waterhouse, Andrew L; Lamuela-Raventos, Rosa M

    2016-10-01

    This work aims to evaluate changes in the fecal metabolomic profile due to grape seed extract (GSE) intake by untargeted and targeted analysis using high resolution mass spectrometry in conjunction with multivariate statistics. An intervention study with six crossbred female pigs was performed. The pigs followed a standard diet for 3 days, then they were fed with a supplemented diet containing 1% (w/w) of MegaNatural® Gold grape seed extract for 6 days. Fresh pig fecal samples were collected daily. A combination of untargeted high resolution mass spectrometry, multivariate analysis (PLS-DA), data-dependent MS/MS scan, and accurate mass database matching was used to measure the effect of the treatment on fecal composition. The resultant PLS-DA models showed a good discrimination among classes with great robustness and predictability. A total of 14 metabolites related to the GSE consumption were identified including biliary acid, dicarboxylic fatty acid, cholesterol metabolites, purine metabolites, and eicosanoid metabolites among others. Moreover, targeted metabolomics using GC-MS showed that cholesterol and its metabolites fecal excretion was increased due to the proanthocyanidins from grape seed extract. The results show that oligomeric procyanidins from GSE modifies bile acid and steroid excretion, which could exert a hypocholesterolemic effect. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  15. HPLC-based metabolic profiling and quality control of leaves of different Panax species

    PubMed Central

    Yang, Seung-Ok; Lee, Sang Won; Kim, Young Ock; Sohn, Sang-Hyun; Kim, Young Chang; Hyun, Dong Yoon; Hong, Yoon Pyo; Shin, Yu Su

    2013-01-01

    Leaves from Panax ginseng Meyer (Korean origin and Chinese origin of Korean ginseng) and P. quinquefolius (American ginseng) were harvested in Haenam province, Korea, and were analyzed to investigate patterns in major metabolites using HPLC-based metabolic profiling. Partial least squares discriminant analysis (PLS-DA) was used to analyze the HPLC chromatogram data. There was a clear separation between Panax species and/or origins from different countries in the PLS-DA score plots. The ginsenoside compounds of Rg1, Re, Rg2, Rb2, Rb3, and Rd in Korean leaves were higher than in Chinese and American ginseng leaves, and the Rb1 level in P. quinquefolius leaves was higher than in P. ginseng (Korean origin or Chinese origin). HPLC chromatogram data coupled with multivariate statistical analysis can be used to profile the metabolite content and undertake quality control of Panax products. PMID:23717177

  16. [Determination of wine original regions using information fusion of NIR and MIR spectroscopy].

    PubMed

    Xiang, Ling-Li; Li, Meng-Hua; Li, Jing-Mingz; Li, Jun-Hui; Zhang, Lu-Da; Zhao, Long-Lian

    2014-10-01

    Geographical origins of wine grapes are significant factors affecting wine quality and wine prices. Tasters' evaluation is a good method but has some limitations. It is important to discriminate different wine original regions quickly and accurately. The present paper proposed a method to determine wine original regions based on Bayesian information fusion that fused near-infrared (NIR) transmission spectra information and mid-infrared (MIR) ATR spectra information of wines. This method improved the determination results by expanding the sources of analysis information. NIR spectra and MIR spectra of 153 wine samples from four different grape-growing regions were collected separately by near-infrared and mid-infrared Fourier transform spectrometers. These four regions are Huailai, Yantai, Gansu and Changli, which are all typical geographical origins for Chinese wines. NIR and MIR discriminant models for wine regions were established using partial least squares discriminant analysis (PLS-DA) based on NIR spectra and MIR spectra separately. In PLS-DA, the regions of wine samples are represented as groups of binary codes. There are four wine regions in this paper, so four output nodes stand for the categorical variables. The output node values for each sample in the NIR and MIR models were normalized first. These values stand for the probabilities of each sample belonging to each category. They served as the input to the Bayesian discriminant formula as a priori probability values. The probabilities were substituted into the Bayesian formula to get posterior probabilities, by which the class membership of these samples can be judged. Considering the stability of PLS-DA models, all the wine samples were randomly divided into calibration sets and validation sets ten times.
The results of NIR and MIR discriminant models of four wine regions were as follows: the average accuracy rates of calibration sets were 78.21% (NIR) and 82.57% (MIR), and the average accuracy rates of validation sets were 82.50% (NIR) and 81.98% (MIR). After using the method proposed in this paper, the accuracy rates of calibration and validation changed to 87.11% and 90.87%, respectively, both better than the results of either spectroscopy alone. These results suggest that Bayesian information fusion of NIR and MIR spectra is feasible for fast identification of wine original regions.
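
    The fusion step described above can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not give the exact Bayesian formula, so the sketch treats the normalized NIR outputs as the prior and the normalized MIR outputs as the likelihood, and the output values and function names are hypothetical.

```python
REGIONS = ["Huailai", "Yantai", "Gansu", "Changli"]

def normalize(outputs):
    """Clip negative model outputs to zero and rescale so they sum to 1."""
    clipped = [max(v, 0.0) for v in outputs]
    total = sum(clipped)
    if total == 0.0:
        # No evidence for any class: fall back to a uniform distribution.
        return [1.0 / len(outputs)] * len(outputs)
    return [v / total for v in clipped]

def fuse(nir_outputs, mir_outputs):
    """Combine two sets of class scores with Bayes' rule (assumed form)."""
    prior = normalize(nir_outputs)
    likelihood = normalize(mir_outputs)
    joint = [p * l for p, l in zip(prior, likelihood)]
    evidence = sum(joint)
    if evidence == 0.0:
        return prior
    return [j / evidence for j in joint]

# Hypothetical output-node values for one wine sample.
nir = [0.45, 0.40, 0.10, 0.05]   # the NIR model alone slightly favours Huailai
mir = [0.20, 0.60, 0.15, 0.05]   # the MIR model alone favours Yantai

posterior = fuse(nir, mir)
predicted = REGIONS[posterior.index(max(posterior))]
```

    In this toy case the two models disagree, and the fused posterior sides with the class that both models rate reasonably highly.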

  17. A Reliable Methodology for Determining Seed Viability by Using Hyperspectral Data from Two Sides of Wheat Seeds.

    PubMed

    Zhang, Tingting; Wei, Wensong; Zhao, Bin; Wang, Ranran; Li, Mingliu; Yang, Liming; Wang, Jianhua; Sun, Qun

    2018-03-08

    This study investigated the possibility of using visible and near-infrared (VIS/NIR) hyperspectral imaging techniques to discriminate viable and non-viable wheat seeds. Both sides of individual seeds were subjected to hyperspectral imaging (400-1000 nm) to acquire reflectance spectral data. Four spectral datasets, including the ventral groove side, reverse side, mean (the mean of two sides' spectra of every seed), and mixture datasets (two sides' spectra of every seed), were used to construct the models. Classification models, partial least squares discriminant analysis (PLS-DA), and support vector machines (SVM), coupled with some pre-processing methods and successive projections algorithm (SPA), were built for the identification of viable and non-viable seeds. Our results showed that the standard normal variate (SNV)-SPA-PLS-DA model had high classification accuracy for whole seeds (>85.2%) and for viable seeds (>89.5%), and that the prediction set was based on a mixed spectral dataset by only using 16 wavebands. After screening with this model, the final germination of the seed lot could be higher than 89.5%. Here, we develop a reliable methodology for predicting the viability of wheat seeds, showing that the VIS/NIR hyperspectral imaging is an accurate technique for the classification of viable and non-viable wheat seeds in a non-destructive manner.
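
    The standard normal variate (SNV) pre-processing named above centres each spectrum on its own mean and scales it by its own standard deviation. A minimal sketch, assuming one reflectance spectrum is held as a list of floats; the sample values are hypothetical.

```python
import statistics

def snv(spectrum):
    """Standard normal variate: centre and scale a single spectrum."""
    mean = statistics.fmean(spectrum)
    sd = statistics.stdev(spectrum)  # sample standard deviation (n - 1)
    if sd == 0.0:
        raise ValueError("a flat spectrum cannot be SNV-scaled")
    return [(value - mean) / sd for value in spectrum]

# Hypothetical reflectance values at three wavebands.
corrected = snv([1.0, 2.0, 3.0])
```

    Because every spectrum is scaled independently, SNV needs no reference spectrum and can be applied to each seed side separately.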

  18. A Reliable Methodology for Determining Seed Viability by Using Hyperspectral Data from Two Sides of Wheat Seeds

    PubMed Central

    Zhang, Tingting; Wei, Wensong; Zhao, Bin; Wang, Ranran; Li, Mingliu; Yang, Liming; Wang, Jianhua; Sun, Qun

    2018-01-01

    This study investigated the possibility of using visible and near-infrared (VIS/NIR) hyperspectral imaging techniques to discriminate viable and non-viable wheat seeds. Both sides of individual seeds were subjected to hyperspectral imaging (400–1000 nm) to acquire reflectance spectral data. Four spectral datasets, including the ventral groove side, reverse side, mean (the mean of two sides’ spectra of every seed), and mixture datasets (two sides’ spectra of every seed), were used to construct the models. Classification models, partial least squares discriminant analysis (PLS-DA), and support vector machines (SVM), coupled with some pre-processing methods and successive projections algorithm (SPA), were built for the identification of viable and non-viable seeds. Our results showed that the standard normal variate (SNV)-SPA-PLS-DA model had high classification accuracy for whole seeds (>85.2%) and for viable seeds (>89.5%), and that the prediction set was based on a mixed spectral dataset by only using 16 wavebands. After screening with this model, the final germination of the seed lot could be higher than 89.5%. Here, we develop a reliable methodology for predicting the viability of wheat seeds, showing that the VIS/NIR hyperspectral imaging is an accurate technique for the classification of viable and non-viable wheat seeds in a non-destructive manner. PMID:29517991

  19. Tumour xenograft detection through quantitative analysis of the metabolic profile of urine in mice

    NASA Astrophysics Data System (ADS)

    Moroz, Jennifer; Turner, Joan; Slupsky, Carolyn; Fallone, Gino; Syme, Alasdair

    2011-02-01

    The metabolic content of urine from NIH III nude mice (n = 22) was analysed before and after inoculation with human glioblastoma multiforme (GBM) cancer cells. An age- and gender-matched control population (n = 14) was also studied to identify non-tumour-related changes. Urine samples were collected daily for 6 weeks, beginning 1 week before cell injection. Metabolite concentrations were obtained via targeted profiling with Chenomx Suite 5.1, based on nuclear magnetic resonance (NMR) spectra acquired on an Oxford 800 MHz cold probe NMR spectrometer. The Wilcoxon rank sum test was used to evaluate the significance of the change in metabolite concentration between the two time points. Both the metabolite concentrations and the ratios of pairs of metabolites were studied. The complicated inter-relationships between metabolites were assessed through partial least-squares discriminant analysis (PLS-DA). Receiver operating characteristic (ROC) curves were generated for all variables and the area under the curve (AUC) calculated. The data indicate that the number of statistically significant changes in metabolite concentrations was more pronounced in the tumour-bearing population than in the control animals. This was also true of the ratios of pairs of metabolites. ROC analysis suggests that the ratios were better able to differentiate between the pre- and post-injection samples compared to the metabolite concentrations. PLS-DA models produced good separation between the populations and had the best AUC results (all models exceeded 0.937). These results demonstrate that metabolomics may be used as a screening tool for GBM cells grown in xenograft models in mice.
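
    The AUC reported above can be read as the probability that a randomly chosen post-injection value exceeds a randomly chosen pre-injection value, which is the statistic underlying the rank sum test as well. A minimal sketch of that pairwise definition; the metabolite ratios below are hypothetical, not data from the study.

```python
def roc_auc(negatives, positives):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties count as half a correct pair.
    """
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))

# Hypothetical values of one metabolite ratio.
pre_injection = [0.8, 1.1, 0.9, 1.0]
post_injection = [1.2, 1.5, 1.0, 1.4]

auc = roc_auc(pre_injection, post_injection)
```

    An AUC of 0.5 means the variable does not separate the two time points at all, and 1.0 means perfect separation.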

  20. [Study on brand traceability of vinegar based on near infrared spectroscopy technology].

    PubMed

    Guan, Xiao; Liu, Jing; Gu, Fang-Qing; Yang, Yong-Jian

    2014-09-01

    In the present paper, 152 vinegar samples from four different brands were chosen as research targets, and their near infrared spectra were collected in diffuse reflection mode and transmission mode, respectively. Furthermore, the brand traceability models for edible vinegar were constructed. The effects of the collection mode and spectral pretreatment methods on the precision of the traceability models were investigated intensively. The models constructed by the PLS1-DA modeling method using spectral data of 114 training samples were applied to predict 38 test samples. R2, RMSEC and RMSEP of the model based on transmission mode data were 0.92, 0.113 and 0.127, respectively, with a recognition rate of 76.32%, and those of the model based on diffuse reflection mode data were 0.97, 0.102 and 0.119, with a recognition rate of 86.84%. The results demonstrated that near infrared spectroscopy combined with PLS1-DA can be used to establish brand traceability models for edible vinegar, and that diffuse reflection mode is more beneficial to the predictive ability of the model.

  1. Multivariate analysis applied to the study of spatial distributions found in drug-eluting stent coatings by confocal Raman microscopy.

    PubMed

    Balss, Karin M; Long, Frederick H; Veselov, Vladimir; Orana, Argjenta; Akerman-Revis, Eugena; Papandreou, George; Maryanoff, Cynthia A

    2008-07-01

    Multivariate data analysis was applied to confocal Raman measurements on stents coated with the polymers and drug used in the CYPHER Sirolimus-eluting Coronary Stents. Partial least-squares (PLS) regression was used to establish three independent calibration curves for the coating constituents: sirolimus, poly(n-butyl methacrylate) [PBMA], and poly(ethylene-co-vinyl acetate) [PEVA]. The PLS calibrations were based on average spectra generated from each spatial location profiled. The PLS models were tested on six unknown stent samples to assess accuracy and precision. The wt % difference between PLS predictions and laboratory assay values for sirolimus was less than 1 wt % for the composite of the six unknowns, while the polymer models were estimated to be less than 0.5 wt % difference for the combined samples. The linearity and specificity of the three PLS models were also demonstrated. In contrast to earlier univariate models, the PLS models achieved mass balance with better accuracy. This analysis was extended to evaluate the spatial distribution of the three constituents. Quantitative bitmap images of drug-eluting stent coatings are presented for the first time to assess the local distribution of components.

  2. Metabolite analysis distinguishes between mice with epidermolysis bullosa acquisita and healthy mice

    PubMed Central

    2013-01-01

    Background Epidermolysis bullosa acquisita (EBA) is a rare skin blistering disease with a prevalence of 0.2 per million people. EBA is characterized by autoantibodies against type VII collagen. Type VII collagen builds anchoring fibrils that are essential for the dermal-epidermal junction. The pathogenic relevance of antibodies against type VII collagen subdomains has been demonstrated both in vitro and in vivo. Despite the multitude of clinical and immunological data, no information on metabolic changes exists. Methods We used an animal model of EBA to obtain insights into metabolomic changes during EBA. Sera from mice with immunization-induced EBA and control mice were obtained and metabolites were isolated by filtration. Proton nuclear magnetic resonance (NMR) spectra were recorded and analyzed by principal component analysis (PCA), partial least squares discriminant analysis (PLS-DA) and random forest. Results The metabolic pattern of immunized mice and control mice could be clearly distinguished with PCA and PLS-DA. Metabolites that contribute to the discrimination could be identified via random forest. The observed changes in the metabolic pattern of EBA sera, i.e. increased levels of amino acids, point toward an increased energy demand in EBA. Conclusions Knowledge about metabolic changes due to EBA could help in the future to assess the disease status during treatment. Confirming the metabolic changes in patients will probably require large cohorts. PMID:23800341

  3. [Quality evaluation of American ginseng using UPLC coupled with multivariate analysis].

    PubMed

    Tang, Yan; Yan, Shu-Mo; Wang, Jing-Jing; Yuan, Yuan; Yang, Bin

    2016-05-01

    An ultra performance liquid chromatography (UPLC) method combined with multivariate data analysis was developed to evaluate the quality of American ginseng by simultaneously determining the concentrations of six ginsenosides (Rg₁, Re, Rb₁, Rc, Ro and Rd) in the samples. For UPLC, acetonitrile with 0.01% formic acid and water with 0.01% formic acid were used as the mobile phase with gradient elution. Under the established chromatographic conditions, the six ginsenosides could be well separated, and the results for linearity, stability, precision, repeatability, and recovery rate all met the requirements of quantitative analysis. The total contents of Rg₁, Re, and Rb₁ in 57 samples all met the requirement of the 2015 edition of the Chinese Pharmacopoeia. At the same time, the experimental data were analyzed by principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA). The crude drugs and the decoction pieces can be discriminated by PCA, and samples of different ages can be distinguished by PLS-DA. Copyright© by the Chinese Pharmaceutical Association.

  4. [Spectral quantitative analysis by nonlinear partial least squares based on neural network internal model for flue gas of thermal power plant].

    PubMed

    Cao, Hui; Li, Yao-Jiang; Zhou, Yan; Wang, Yan-Xia

    2014-11-01

    To deal with the nonlinear characteristics of spectral data for thermal power plant flue gas, a nonlinear partial least squares (PLS) analysis method with an internal model based on a neural network is adopted in the paper. The latent variables of the independent variables and the dependent variables are first extracted by PLS regression, and they are then used as the inputs and outputs of the neural network, respectively, to build the nonlinear internal model through training. For spectral data of flue gases of the thermal power plant, PLS, the nonlinear PLS with the internal model of back propagation neural network (BP-NPLS), the nonlinear PLS with the internal model of radial basis function neural network (RBF-NPLS) and the nonlinear PLS with the internal model of adaptive fuzzy inference system (ANFIS-NPLS) are compared. The root mean square errors of prediction (RMSEP) of sulfur dioxide for BP-NPLS, RBF-NPLS and ANFIS-NPLS are reduced by 16.96%, 16.60% and 19.55%, respectively, relative to that of PLS. The RMSEP of nitric oxide for BP-NPLS, RBF-NPLS and ANFIS-NPLS are reduced by 8.60%, 8.47% and 10.09%, respectively, relative to that of PLS. The RMSEP of nitrogen dioxide for BP-NPLS, RBF-NPLS and ANFIS-NPLS are reduced by 2.11%, 3.91% and 3.97%, respectively, relative to that of PLS. Experimental results show that the nonlinear PLS is more suitable for the quantitative analysis of flue gas than PLS. Moreover, by using a neural network function that can closely approximate nonlinear characteristics, the nonlinear partial least squares method with the internal model described in this paper has good predictive capability and robustness, and can to a certain extent overcome the limitations of nonlinear partial least squares methods with other internal models such as polynomial and spline functions. ANFIS-NPLS has the best performance, since the internal model of the adaptive fuzzy inference system is able to learn more and reduce the residuals effectively.
Hence, ANFIS-NPLS is an accurate and useful quantitative thermal power plant flue gas analysis method.

  5. Real-time Raman spectroscopy for in vivo, online gastric cancer diagnosis during clinical endoscopic examination.

    PubMed

    Duraipandian, Shiyamala; Sylvest Bergholt, Mads; Zheng, Wei; Yu Ho, Khek; Teh, Ming; Guan Yeoh, Khay; Bok Yan So, Jimmy; Shabbir, Asim; Huang, Zhiwei

    2012-08-01

    Optical spectroscopic techniques including reflectance, fluorescence and Raman spectroscopy have shown promising potential for in vivo precancer and cancer diagnostics in a variety of organs. However, data-analysis has mostly been limited to post-processing and off-line algorithm development. In this work, we develop a fully automated on-line Raman spectral diagnostics framework integrated with a multimodal image-guided Raman technique for real-time in vivo cancer detection at endoscopy. A total of 2748 in vivo gastric tissue spectra (2465 normal and 283 cancer) were acquired from 305 patients recruited to construct a spectral database for diagnostic algorithms development. The novel diagnostic scheme developed implements on-line preprocessing, outlier detection based on principal component analysis statistics (i.e., Hotelling's T2 and Q-residuals) for tissue Raman spectra verification as well as for organ specific probabilistic diagnostics using different diagnostic algorithms. Free-running optical diagnosis and processing time of < 0.5 s can be achieved, which is critical to realizing real-time in vivo tissue diagnostics during clinical endoscopic examination. The optimized partial least squares-discriminant analysis (PLS-DA) models based on the randomly resampled training database (80% for learning and 20% for testing) provide the diagnostic accuracy of 85.6% [95% confidence interval (CI): 82.9% to 88.2%] [sensitivity of 80.5% (95% CI: 71.4% to 89.6%) and specificity of 86.2% (95% CI: 83.6% to 88.7%)] for the detection of gastric cancer. The PLS-DA algorithms are further applied prospectively on 10 gastric patients at gastroscopy, achieving the predictive accuracy of 80.0% (60/75) [sensitivity of 90.0% (27/30) and specificity of 73.3% (33/45)] for in vivo diagnosis of gastric cancer. 
The receiver operating characteristics curves further confirmed the efficacy of Raman endoscopy together with PLS-DA algorithms for in vivo prospective diagnosis of gastric cancer. This work successfully moves biomedical Raman spectroscopic technique into real-time, on-line clinical cancer diagnosis, especially in routine endoscopic diagnostic applications.
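
    The prospective figures quoted above follow directly from the reported counts (27 of 30 cancer spectra and 33 of 45 normal spectra classified correctly). A short worked check of that arithmetic; the function name is illustrative only.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Sensitivity, specificity and accuracy from confusion-matrix counts."""
    return {
        "sensitivity": tp / (tp + fn),                 # true positives / all cancer
        "specificity": tn / (tn + fp),                 # true negatives / all normal
        "accuracy": (tp + tn) / (tp + fn + tn + fp),   # all correct / all spectra
    }

# Counts taken from the abstract: 27/30 cancer and 33/45 normal spectra correct.
metrics = diagnostic_metrics(tp=27, fn=3, tn=33, fp=12)
percentages = {name: round(value * 100, 1) for name, value in metrics.items()}
```

    The resulting percentages match the 90.0% sensitivity, 73.3% specificity and 80.0% (60/75) accuracy stated in the abstract.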

  6. Real-time Raman spectroscopy for in vivo, online gastric cancer diagnosis during clinical endoscopic examination

    NASA Astrophysics Data System (ADS)

    Duraipandian, Shiyamala; Sylvest Bergholt, Mads; Zheng, Wei; Yu Ho, Khek; Teh, Ming; Guan Yeoh, Khay; Bok Yan So, Jimmy; Shabbir, Asim; Huang, Zhiwei

    2012-08-01

    Optical spectroscopic techniques including reflectance, fluorescence and Raman spectroscopy have shown promising potential for in vivo precancer and cancer diagnostics in a variety of organs. However, data-analysis has mostly been limited to post-processing and off-line algorithm development. In this work, we develop a fully automated on-line Raman spectral diagnostics framework integrated with a multimodal image-guided Raman technique for real-time in vivo cancer detection at endoscopy. A total of 2748 in vivo gastric tissue spectra (2465 normal and 283 cancer) were acquired from 305 patients recruited to construct a spectral database for diagnostic algorithms development. The novel diagnostic scheme developed implements on-line preprocessing, outlier detection based on principal component analysis statistics (i.e., Hotelling's T2 and Q-residuals) for tissue Raman spectra verification as well as for organ specific probabilistic diagnostics using different diagnostic algorithms. Free-running optical diagnosis and processing time of < 0.5 s can be achieved, which is critical to realizing real-time in vivo tissue diagnostics during clinical endoscopic examination. The optimized partial least squares-discriminant analysis (PLS-DA) models based on the randomly resampled training database (80% for learning and 20% for testing) provide the diagnostic accuracy of 85.6% [95% confidence interval (CI): 82.9% to 88.2%] [sensitivity of 80.5% (95% CI: 71.4% to 89.6%) and specificity of 86.2% (95% CI: 83.6% to 88.7%)] for the detection of gastric cancer. The PLS-DA algorithms are further applied prospectively on 10 gastric patients at gastroscopy, achieving the predictive accuracy of 80.0% (60/75) [sensitivity of 90.0% (27/30) and specificity of 73.3% (33/45)] for in vivo diagnosis of gastric cancer. 
The receiver operating characteristics curves further confirmed the efficacy of Raman endoscopy together with PLS-DA algorithms for in vivo prospective diagnosis of gastric cancer. This work successfully moves biomedical Raman spectroscopic technique into real-time, on-line clinical cancer diagnosis, especially in routine endoscopic diagnostic applications.

  7. Investigation of crimson-dyed fibres for a new approach on the characterization of cochineal and kermes dyes in historical textiles.

    PubMed

    Serrano, Ana; van den Doel, Andre; van Bommel, Maarten; Hallett, Jessica; Joosten, Ineke; van den Berg, Klaas J

    2015-10-15

    The colorant behaviour of cochineal and kermes insect dyes in 141 experimentally-dyed and 28 artificially-aged samples of silk and wool was investigated using ultra-high performance liquid chromatography coupled to photodiode array detector (UHPLC-PDA), liquid chromatography electrospray ionisation mass spectrometry (LC-ESI-MS) and image scanning electron microscopy - energy dispersive X-ray spectroscopy (SEM-EDX). Partial-least squares discriminant analysis (PLS-DA) was then used to model the acquired UHPLC-PDA data and assess the possibility of discriminating cochineal insect species, as well as their correspondent dyed and aged reference fibres. The resulting models helped to characterize a set of 117 red samples from 95 historical textiles, in which UHPLC-PDA analyses have reported the presence of cochineal and kermes insect dyes. Analytical investigation of the experimentally-dyed and artificially-aged fibres has demonstrated that the ratio of compounds in the insects dye composition can change, depending on the dyeing conditions applied and the type of fibres used. Similarities were observed when comparing the UHPLC-MS and SEM-EDX results from the dyed and aged references with the historical samples. This was verified with PLS-DA models of the chromatographic data, facilitating the classification of the cochineal species present in the historical samples. The majority of these samples were identified to contain American cochineal, which is in agreement with historical and dye identification literature that describe the impact of this dyestuff into European and Asian dyeing practices, after the Iberian Expansion in the 16th century. The analytical results emphasize the importance of using statistical data interpretation for the discrimination of cochineal dyes, besides qualitative and quantitative evaluation of chromatograms. 
Hence, the combination of UHPLC-PDA with a statistical classification method, such as PLS-DA, has been demonstrated to be an advisable approach in future investigations to assess closely related species of natural dyes in historical textile samples. This is particularly important when aiming to achieve more accurate interpretations about the history of works of art, or the application of natural dyes in old textile production. Copyright © 2015 Elsevier B.V. All rights reserved.

  8. Authenticity identification and classification of Rhodiola species in traditional Tibetan medicine based on Fourier transform near-infrared spectroscopy and chemometrics analysis.

    PubMed

    Li, Tao; Su, Chen

    2018-06-02

    Rhodiola is an increasingly widely used traditional Tibetan medicine and traditional Chinese medicine in China. The composition profiles of bioactive compounds are somewhat uneven across different species, which makes it crucial to identify authentic Rhodiola species accurately so as to ensure clinical application of Rhodiola. In this paper, a nondestructive, rapid, and efficient method for the classification of Rhodiola was developed using Fourier transform near-infrared (FT-NIR) spectroscopy combined with chemometrics analysis. A total of 160 batches of raw spectra were obtained by FT-NIR from four different species of Rhodiola: Rhodiola crenulata, Rhodiola fastigiata, Rhodiola kirilowii, and Rhodiola brevipetiolata. After excluding the outliers, the performances of 3 sample dividing methods, 12 spectral preprocessing methods, 2 wavelength selection methods, and 2 modeling evaluation methods were compared. The results indicated that the best combination for authenticity identification was FT-NIR with sample set partitioning based on joint x-y distances (SPXY), standard normal variate transformation (SNV) + Norris-Williams (NW) + 2nd derivative, competitive adaptive reweighted sampling (CARS), and kernel extreme learning machine (KELM). The accuracy (ACCU), sensitivity (SENS), and specificity (SPEC) of the optimal model were all 1, which showed that this combination of FT-NIR and chemometrics methods had the optimal authenticity identification performance. The classification performance of the partial least squares discriminant analysis (PLS-DA) model was slightly lower than that of the KELM model, with ACCU = 0.97, SENS = 0.93, and SPEC = 0.98.
It can be concluded that FT-NIR combined with chemometrics analysis has great potential in authenticity identification and classification of Rhodiola, which can provide a valuable reference for the safety and effectiveness of clinical application of Rhodiola. Copyright © 2018 Elsevier B.V. All rights reserved.

  9. [Research on Rapid Discrimination of Edible Oil by ATR Infrared Spectroscopy].

    PubMed

    Ma, Xiao; Yuan, Hong-fu; Song, Chun-feng; Hu, Ai-qin; Li, Xiao-yu; Zhao, Zhong; Li, Xiu-qin; Guo Zhen; Zhu, Zhi-qiang

    2015-07-01

    A rapid discrimination method for edible oils, the KL-BP model, based on attenuated total reflectance infrared spectroscopy was proposed. The model extracts classification features from the source data by KL while reducing the data dimension. A neural network model is then constructed with the new data as its input. 84 edible oil samples, including sesame oil, corn oil, canola oil, blend oil, sunflower oil, peanut oil, olive oil, soybean oil and tea seed oil, were collected and their infrared spectra were measured using an ATR FT-IR spectrometer. In order to compare method performance, a principal component analysis (PCA) direct-classification model, a KL direct-classification model, a PLS-DA model, a PCA-BP model and a KL-BP model were constructed in this paper. The results show that the recognition rates of PCA, PCA-BP, KL, PLS-DA and KL-BP are 59.1%, 68.2%, 77.3%, 77.3% and 90.9%, respectively, for discriminating the 9 kinds of edible oils. KL extracts the eigenvectors that maximize the ratio of between-class distance to within-class distance, so the method captures much more classification information than PCA. The BP neural network can effectively enhance classification ability and accuracy. By taking full advantage of KL in extracting more category information during dimension reduction and of the self-learning, adaptive, nonlinear features of the BP neural network, the KL-BP method has the best classification ability and recognition accuracy and is of great importance for rapidly recognizing edible oils in practice.

  10. Classification of Amazonian rosewood essential oil by Raman spectroscopy and PLS-DA with reliability estimation.

    PubMed

    Almeida, Mariana R; Fidelis, Carlos H V; Barata, Lauro E S; Poppi, Ronei J

    2013-12-15

    The Amazon tree Aniba rosaeodora Ducke (rosewood) provides an essential oil valuable for the perfume industry, but after decades of predatory extraction it is at risk of extinction. Extracting the essential oil from wood requires cutting down the tree, so the study of oil extracted from the leaves is important as a sustainable alternative. The goal of this study was to test the applicability of Raman spectroscopy and Partial Least Squares Discriminant Analysis (PLS-DA) as a means to classify the essential oil extracted from different parts (wood, leaves and branches) of the Brazilian tree A. rosaeodora. For the development of classification models, the Raman spectra were split into two sets: training and test. The threshold value that separates the classes was calculated from the distribution of the training samples, in such a way that the classes are divided with a lower probability of incorrect classification for future estimates. The best model presented sensitivity, specificity, predictive accuracy and efficiency of 100%. These results give an overall view of the behavior of the model, but do not give information about individual samples; for this reason, a confidence interval for the classification of each sample was also calculated using the bootstrap resampling technique. The methodology developed has the potential to be an alternative to standard procedures used for oil analysis and can be employed as a screening method, since it is fast, non-destructive and robust. © 2013 Elsevier B.V. All rights reserved.

  11. The feasibility of using explicit method for linear correction of the particle size variation using NIR Spectroscopy combined with PLS2 regression method

    NASA Astrophysics Data System (ADS)

    Yulia, M.; Suhandy, D.

    2018-03-01

    NIR spectra obtained from a spectral data acquisition system contain both chemical information about the samples and physical information, such as particle size and bulk density. Several methods have been established for developing calibration models that can compensate for variations in sample physical information. One common approach is to include physical information variation in the calibration model both explicitly and implicitly. The objective of this study was to evaluate the feasibility of using the explicit method to compensate for the influence of different particle sizes of coffee powder on NIR calibration model performance. A total of 220 coffee powder samples with two different types of coffee (civet and non-civet) and two different particle sizes (212 and 500 µm) were prepared. Spectral data were acquired using an NIR spectrometer equipped with an integrating sphere for diffuse reflectance measurement. A discrimination method based on PLS-DA was applied and the influence of different particle sizes on the performance of PLS-DA was investigated. In the explicit method, the particle size is added directly as a predicted variable, which results in an X block containing only the NIR spectra and a Y block containing the particle size and type of coffee. The explicit inclusion of the particle size in the calibration model is expected to improve the accuracy of the type of coffee determination. The results show that, using the explicit method, the quality of the developed calibration model for type of coffee determination is slightly superior, with a coefficient of determination (R2) of 0.99 and a root mean square error of cross-validation (RMSECV) of 0.041. The performance of the PLS2 calibration model for type of coffee determination with particle size compensation was quite good, and the model was able to predict the type of coffee at two different particle sizes with relatively high R2 pred values. The prediction also resulted in low bias and RMSEP values.

  12. Improved discrimination between monocotyledonous and dicotyledonous plants for weed control based on the blue-green region of ultraviolet-induced fluorescence spectra.

    PubMed

    Panneton, Bernard; Guillaume, Serge; Roger, Jean-Michel; Samson, Guy

    2010-01-01

    Precision weeding by spot spraying in real time requires sensors to discriminate between weeds and crop without contact. Among the optical based solutions, the ultraviolet (UV) induced fluorescence of the plants appears as a promising alternative. In a first paper, the feasibility of discriminating between corn hybrids, monocotyledonous, and dicotyledonous weeds was demonstrated on the basis of the complete spectra. Some considerations about the different sources of fluorescence oriented the focus to the blue-green fluorescence (BGF) part, ignoring the chlorophyll fluorescence that is inherently more variable in time. This paper investigates the potential of performing weed/crop discrimination on the basis of several large spectral bands in the BGF area. A partial least squares discriminant analysis (PLS-DA) was performed on a set of 1908 spectra of corn and weed plants over 3 years and various growing conditions. The discrimination between monocotyledonous and dicotyledonous plants based on the blue-green fluorescence yielded robust models (classification error between 1.3 and 4.6% for between-year validation). On the basis of the analysis of the PLS-DA model, two large bands were chosen in the blue-green fluorescence zone (400-425 nm and 425-490 nm). A linear discriminant analysis based on the signal from these two bands also provided very robust inter-year results (classification error from 1.5% to 5.2%). The same selection process was applied to discriminate between monocotyledonous weeds and maize but yielded no robust models (up to 50% inter-year error). Further work will be required to solve this problem and provide a complete UV fluorescence based sensor for weed-maize discrimination.

  13. Analysis of Exhaled Breath Volatile Organic Compounds in Inflammatory Bowel Disease: A Pilot Study.

    PubMed

    Hicks, Lucy C; Huang, Juzheng; Kumar, Sacheen; Powles, Sam T; Orchard, Timothy R; Hanna, George B; Williams, Horace R T

    2015-09-01

    Distinguishing between the inflammatory bowel diseases [IBD], Crohn's disease [CD] and ulcerative colitis [UC], is important for determining management and prognosis. Selected ion flow tube mass spectrometry [SIFT-MS] may be used to analyse volatile organic compounds [VOCs] in exhaled breath: these may be altered in disease states, and distinguishing breath VOC profiles can be identified. The aim of this pilot study was to identify, quantify, and analyse VOCs present in the breath of IBD patients and controls, potentially providing insights into disease pathogenesis and complementing current diagnostic algorithms. SIFT-MS breath profiling of 56 individuals [20 UC, 18 CD, and 18 healthy controls] was undertaken. Multivariate analysis included principal components analysis and partial least squares discriminant analysis with orthogonal signal correction [OSC-PLS-DA]. Receiver operating characteristic [ROC] analysis was performed for each comparative analysis using statistically significant VOCs. OSC-PLS-DA modelling was able to distinguish both CD and UC from healthy controls and from one another with good sensitivity and specificity. ROC analysis using combinations of statistically significant VOCs [dimethyl sulphide, hydrogen sulphide, hydrogen cyanide, ammonia, butanal, and nonanal] gave integrated areas under the curve of 0.86 [CD vs healthy controls], 0.74 [UC vs healthy controls], and 0.83 [CD vs UC]. Exhaled breath VOC profiling was able to distinguish IBD patients from controls, as well as to separate UC from CD, using both multivariate and univariate statistical techniques. Copyright © 2015 European Crohn’s and Colitis Organisation (ECCO). Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
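
    The areas under the curve quoted above can be read as the probability that a randomly chosen case receives a higher score than a randomly chosen control. A minimal Python sketch of that rank-based calculation follows; the function name and the scores are hypothetical illustrations, not code or data from the study.

```python
def roc_auc(case_scores, control_scores):
    """Area under the ROC curve by pairwise comparison (ties count as half)."""
    wins = 0.0
    for case in case_scores:
        for control in control_scores:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))


# Hypothetical classifier scores, invented for illustration only.
cases = [0.9, 0.8, 0.4]
controls = [0.7, 0.3, 0.2]
auc = roc_auc(cases, controls)  # 8 of the 9 case/control pairs are ordered correctly
```

    An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation of the two groups.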

  14. Rare earth elements minimal harvest year variation facilitates robust geographical origin discrimination: The case of PDO "Fava Santorinis".

    PubMed

    Drivelos, Spiros A; Danezis, Georgios P; Haroutounian, Serkos A; Georgiou, Constantinos A

    2016-12-15

    This study examines the trace and rare earth elemental (REE) fingerprint variations of PDO (Protected Designation of Origin) "Fava Santorinis" over three consecutive harvesting years (2011-2013). Classification of samples in harvesting years was studied by performing discriminant analysis (DA), k nearest neighbours (κ-NN), partial least squares (PLS) analysis and probabilistic neural networks (PNN) using rare earth elements and trace metals determined using ICP-MS. DA performed better than κ-NN, producing 100% discrimination using trace elements and 79% using REEs. PLS was found to be superior to PNN, achieving 99% and 90% classification for trace and REEs, respectively, while PNN achieved 96% and 71% classification for trace and REEs, respectively. The information obtained using REEs did not enhance classification, indicating that REEs vary minimally per harvesting year, providing robust geographical origin discrimination. The results show that seasonal patterns can occur in the elemental composition of "Fava Santorinis", probably reflecting seasonality of climate. Copyright © 2016 Elsevier Ltd. All rights reserved.

  15. Quantitative determination and evaluation of Paris polyphylla var. yunnanensis with different harvesting times using UPLC-UV-MS and FT-IR spectroscopy in combination with partial least squares discriminant analysis.

    PubMed

    Yang, Yuan-Gui; Zhang, Ji; Zhao, Yan-Li; Zhang, Jin-Yu; Wang, Yuan-Zhong

    2017-07-01

    A rapid method was developed and validated by ultra-performance liquid chromatography-triple quadrupole mass spectroscopy with ultraviolet detection (UPLC-UV-MS) for simultaneous determination of paris saponin I, paris saponin II, paris saponin VI and paris saponin VII. Partial least squares discriminant analysis (PLS-DA) based on UPLC and Fourier transform infrared (FT-IR) spectroscopy was employed to evaluate Paris polyphylla var. yunnanensis (PPY) at different harvesting times. Quantitative determination implied that the various contents of bioactive compounds with different harvesting times may lead to different pharmacological effects; the average content of total saponins for PPY harvested at 8 years was higher than that from other samples. The PLS-DA of FT-IR spectra had a better performance than that of UPLC for discrimination of PPY from different harvesting times. Copyright © 2016 John Wiley & Sons, Ltd.

  16. Classification of edible oils and modeling of their physico-chemical properties by chemometric methods using mid-IR spectroscopy

    NASA Astrophysics Data System (ADS)

    Luna, Aderval S.; da Silva, Arnaldo P.; Ferré, Joan; Boqué, Ricard

    This research work describes two studies for the classification and characterization of edible oils and their quality parameters through Fourier transform mid infrared spectroscopy (FT-mid-IR) together with chemometric methods. The discrimination of canola, sunflower, corn and soybean oils was investigated using SVM-DA, SIMCA and PLS-DA. Using FT-mid-IR, DPLS was able to classify 100% of the samples from the validation set, but SIMCA and SVM-DA were not. The quality parameters, refraction index and relative density of edible oils, were obtained from reference methods. Prediction models for FT-mid-IR spectra were calculated for these quality parameters using partial least squares (PLS) and support vector machines (SVM). Several preprocessing alternatives (first derivative, multiplicative scatter correction, mean centering, and standard normal variate) were investigated. The best result for the refraction index was achieved with SVM, as was the best result for the relative density, except when the preprocessing combination of mean centering and first derivative was used. For both quality parameters, the best results obtained for the figures of merit expressed by the root mean square error of cross validation (RMSECV) and prediction (RMSEP) were equal to 0.0001.

  17. Patterns of Circulating Inflammatory Biomarkers in Older Persons with Varying Levels of Physical Performance: A Partial Least Squares-Discriminant Analysis Approach

    PubMed Central

    Marzetti, Emanuele; Landi, Francesco; Marini, Federico; Cesari, Matteo; Buford, Thomas W.; Manini, Todd M.; Onder, Graziano; Pahor, Marco; Bernabei, Roberto; Leeuwenburgh, Christiaan; Calvani, Riccardo

    2014-01-01

    Background: Chronic, low-grade inflammation and declining physical function are hallmarks of the aging process. However, previous attempts to correlate individual inflammatory biomarkers with physical performance in older people have produced mixed results. Given the complexity of the inflammatory response, the simultaneous analysis of an array of inflammatory mediators may provide more insights into the relationship between inflammation and age-related physical function decline. This study was designed to explore the association between a panel of inflammatory markers and physical performance in older adults through a multivariate statistical approach. Methods: Community-dwelling older persons were categorized into “normal walkers” (NWs; n = 27) or “slow walkers” (SWs; n = 11) groups using 0.8 m s−1 as the 4-m gait speed cutoff. A panel of 14 circulating inflammatory biomarkers was assayed by multiplex analysis. Partial least squares-discriminant analysis (PLS-DA) was used to identify patterns of inflammatory mediators associated with gait speed categories. Results: The optimal complexity of the PLS-DA model was found to be five latent variables. The proportion of correct classification was 88.9% for NW subjects (74.1% in cross-validation) and 90.9% for SW individuals (81.8% in cross-validation). Discriminant biomarkers in the model were interleukin 8, myeloperoxidase, and tumor necrosis factor alpha (all higher in the SW group), and P-selectin, interferon gamma, and granulocyte–macrophage colony-stimulating factor (all higher in the NW group). Conclusion: Distinct profiles of circulating inflammatory biomarkers characterize older subjects with different levels of physical performance. The dissection of these patterns may provide novel insights into the role played by inflammation in the disabling cascade and possible new targets for interventions. PMID:25593902

  18. Non-destructive profiling of volatile organic compounds using HS-SPME/GC-MS and its application for the geographical discrimination of white rice.

    PubMed

    Lim, Dong Kyu; Mo, Changyeun; Lee, Dong-Kyu; Long, Nguyen Phuoc; Lim, Jongguk; Kwon, Sung Won

    2018-01-01

    The authenticity determination of white rice is crucial to prevent deceptive origin labeling and dishonest trading. However, a non-destructive and comprehensive method for rapidly discriminating the geographical origins of white rice between countries is still lacking. In the current study, we developed a volatile organic compound based geographical discrimination method using headspace solid-phase microextraction coupled to gas chromatography-mass spectrometry (HS-SPME/GC-MS) to discriminate rice samples from Korea and China. A partial least squares discriminant analysis (PLS-DA) model exhibited a good classification of white rice between Korea and China (accuracy = 0.958, goodness of fit = 0.937, goodness of prediction = 0.831, and permutation test p-value = 0.043). Combining the PLS-DA based feature selection with the differentially expressed features from the unpaired t-test and significance analysis of microarrays, 12 discriminatory biomarkers were found. Among them, hexanal and 1-hexanol have been previously known to be associated with the cultivation environment and storage conditions. Other hydrocarbon biomarkers are novel, and their impact on rice production and storage remains to be elucidated. In conclusion, our findings highlight the ability to rapidly discriminate white rice from Korea and China. The developed method may be useful for the authenticity and quality control of white rice. Copyright © 2017. Published by Elsevier B.V.
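
    A permutation test p-value such as the one quoted above comes from repeatedly shuffling the class labels and counting how often the shuffled data score at least as well as the real labelling. The Python sketch below illustrates the idea under a loud simplification: the study permuted labels for a PLS-DA model, whereas this example swaps in the absolute difference of two group means as the test statistic. The function name, parameters and data are hypothetical.

```python
import random


def permutation_p_value(group_a, group_b, n_perm=999, seed=0):
    """Permutation p-value for the absolute difference of group means.

    Stand-in statistic (an assumption): a real PLS-DA permutation test
    would refit the model on every shuffled labelling instead.
    """
    rng = random.Random(seed)

    def stat(a, b):
        return abs(sum(a) / len(a) - sum(b) / len(b))

    observed = stat(group_a, group_b)
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)  # random reassignment of the class labels
        if stat(pooled[:n_a], pooled[n_a:]) >= observed:
            hits += 1
    # Add one to both counts so the observed labelling is included.
    return (hits + 1) / (n_perm + 1)


# Hypothetical, clearly separated measurements for two origins.
origin_a = [5.1, 5.3, 4.9, 5.2, 5.0, 5.4]
origin_b = [1.0, 1.2, 0.9, 1.1, 1.3, 0.8]
p = permutation_p_value(origin_a, origin_b)
```

    With 999 permutations the smallest attainable p-value is 0.001, which is why the number of permutations is usually reported alongside the result.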

  19. Discrimination of Aurantii Fructus Immaturus and Fructus Poniciri Trifoliatae Immaturus by Flow Injection UV Spectroscopy (FIUV) and 1H NMR using Partial Least-squares Discriminant Analysis (PLS-DA)

    USDA-ARS's Scientific Manuscript database

    Two simple fingerprinting methods, flow-injection UV spectroscopy (FIUV) and 1H nuclear magnetic resonance (NMR), for discrimination of Aurantii Fructus Immaturus and Fructus Poniciri Trifoliatae Immaturus were described. Both methods were combined with partial least-squares discriminant analysis...

  20. Kernel analysis of partial least squares (PLS) regression models.

    PubMed

    Shinzawa, Hideyuki; Ritthiruangdej, Pitiporn; Ozaki, Yukihiro

    2011-05-01

    An analytical technique based on kernel matrix representation is demonstrated to provide further chemically meaningful insight into partial least squares (PLS) regression models. The kernel matrix condenses essential information about scores derived from PLS or principal component analysis (PCA). Thus, it becomes possible to establish the proper interpretation of the scores. A PLS model for the total nitrogen (TN) content in multiple Thai fish sauces is built with a set of near-infrared (NIR) transmittance spectra of the fish sauce samples. The kernel analysis of the scores effectively reveals that the variation of the spectral feature induced by the change in protein content is substantially associated with the total water content and the protein hydration. Kernel analysis is also carried out on a set of time-dependent infrared (IR) spectra representing transient evaporation of ethanol from a binary mixture solution of ethanol and oleic acid. A PLS model to predict the elapsed time is built with the IR spectra and the kernel matrix is derived from the scores. The detailed analysis of the kernel matrix provides penetrating insight into the interaction between the ethanol and the oleic acid.

  1. Electrospray ionization mass spectrometry and partial least squares discriminant analysis applied to the quality control of olive oil.

    PubMed

    Alves, Junia O; Botelho, Bruno G; Sena, Marcelo M; Augusti, Rodinei

    2013-10-01

    Direct infusion electrospray ionization mass spectrometry in the positive ion mode [ESI(+)-MS] is used to obtain fingerprints of aqueous-methanolic extracts of two types of olive oils, extra virgin (EV) and ordinary (OR), as well as of samples of EV olive oil adulterated by the addition of OR olive oil and other edible oils: corn (CO), sunflower (SF), soybean (SO) and canola (CA). The MS data are treated by the partial least squares discriminant analysis (PLS-DA) protocol, aiming at discriminating the above-mentioned classes formed by the genuine olive oils, EV (1) and OR (2), as well as the EV adulterated samples, i.e. EV/SO (3), EV/CO (4), EV/SF (5), EV/CA (6) and EV/OR (7). The PLS-DA model employed is built with 190 and 70 samples for the training and test sets, respectively. For all classes (1-7), EV and OR olive oils as well as the adulterated samples (in a proportion varying from 0.5 to 20.0% w/w) are properly classified. The developed methodology required no ion identification and proved to be fast, as each measurement lasted about 3 min including the extraction step and MS analysis, and reliable, because high sensitivities (rate of true positives) and specificities (rate of true negatives) were achieved. Finally, it can be envisaged that this approach has the potential to be applied in quality control of EV olive oils. Copyright © 2013 John Wiley & Sons, Ltd.
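
    The sensitivity (rate of true positives) and specificity (rate of true negatives) mentioned above can be computed per class by treating one class as positive and all others as negative. The Python sketch below is a minimal illustration; the function name and the label lists are hypothetical and do not reproduce the study's data.

```python
def sensitivity_specificity(y_true, y_pred, positive):
    """One-vs-rest sensitivity and specificity for the class `positive`."""
    tp = fn = tn = fp = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            if pred == positive:
                tp += 1  # positive sample correctly recognised
            else:
                fn += 1  # positive sample missed
        else:
            if pred == positive:
                fp += 1  # other class wrongly assigned to `positive`
            else:
                tn += 1  # other class correctly kept out of `positive`
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical labels for five test samples (invented for illustration).
truth = ["EV", "EV", "OR", "EV/SO", "OR"]
predicted = ["EV", "OR", "OR", "EV/SO", "EV"]
sens, spec = sensitivity_specificity(truth, predicted, positive="EV")
```

    Repeating the call with each class label in turn gives the per-class figures that multi-class PLS-DA studies typically report.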

  2. A proposed metabolic strategy for monitoring disease progression in Alzheimer's disease.

    PubMed

    Greenberg, Nicola; Grassano, Antonio; Thambisetty, Madhav; Lovestone, Simon; Legido-Quigley, Cristina

    2009-04-01

    A specific, sensitive and essentially non-invasive assay to diagnose and monitor Alzheimer's disease (AD) would be valuable to both clinicians and medical researchers. The aim of this study was to perform a metabonomic statistical analysis on plasma fingerprints. Objectives were to investigate novel biomarkers indicative of AD, to consider the role of bile acids as AD biomarkers and to consider whether mild cognitive impairment (MCI) is a separate disease from AD. Samples were analysed by ultraperformance liquid chromatography-MS and resulting data sets were interpreted using soft-independent modelling of class analogy statistical analysis methods. PCA models did not show any grouping of subjects by disease state. Partial least-squares discriminant analysis (PLS-DA) models yielded class separation for AD. However, as with earlier studies, model validation revealed a predictive power of Q(2) < 0.5, indicating their unsuitability for predicting disease state. Three bile acids were extracted from the data and quantified; up-regulation was observed for MCI and AD patients. PLS-DA did not support MCI being considered as a separate disease from AD, with MCI patient metabolic profiles being significantly closer to AD patients than controls. This study suggested that further investigation into the lipid fraction of the metabolome may yield useful biomarkers for AD and metabolomic profiles could be used to predict disease state in a clinical setting.

  3. Hyperspectral Imaging and Support Vector Machine: A Powerful Combination to Differentiate Black Cohosh (Actaea racemosa) from Other Cohosh Species.

    PubMed

    Tankeu, Sidonie; Vermaak, Ilze; Chen, Weiyang; Sandasi, Maxleene; Kamatou, Guy; Viljoen, Alvaro

    2018-04-01

    Actaea racemosa (black cohosh) has a history of traditional use in the treatment of general gynecological problems. However, the plant is known to be vulnerable to adulteration with other cohosh species. This study evaluated the use of shortwave infrared hyperspectral imaging (SWIR-HSI) in tandem with chemometric data analysis as a fast alternative method for the discrimination of four cohosh species (Actaea racemosa, Actaea podocarpa, Actaea pachypoda, Actaea cimicifuga) and 36 commercial products labelled as black cohosh. The raw material and commercial products were analyzed using SWIR-HSI and ultra-high-performance liquid chromatography coupled to mass spectrometry (UHPLC-MS) followed by chemometric modeling. From SWIR-HSI data (920-2514 nm), the range containing the discriminating information of the four species was identified as 1204-1480 nm using Matlab software. After reduction of the data set range, partial least squares discriminant analysis (PLS-DA) and support vector machine discriminant analysis (SVM-DA) models with coefficients of determination (R2) of ≥ 0.8 were created. The novel SVM-DA model showed better predictions and was used to predict the commercial product content. Seven out of 36 commercial products were recognized by the SVM-DA model as being true black cohosh while 29 products indicated adulteration. Analysis of the UHPLC-MS data demonstrated that six commercial products could be authentic black cohosh. This was confirmed using the fragmentation patterns of three black cohosh markers (cimiracemoside C; 12-β,21-dihydroxycimigenol-3-O-L-arabinoside; and 24-O-acetylhydroshengmanol-3-O-β-D-xylopyranoside). SWIR-HSI in conjunction with chemometric tools (SVM-DA) could identify 80% adulteration of commercial products labelled as black cohosh. Georg Thieme Verlag KG Stuttgart · New York.

  4. Rapid detection of milk adulteration using intact protein flow injection mass spectrometric fingerprints combined with chemometrics.

    PubMed

    Du, Lijuan; Lu, Weiying; Cai, Zhenzhen Julia; Bao, Lei; Hartmann, Christoph; Gao, Boyan; Yu, Liangli Lucy

    2018-02-01

    Flow injection mass spectrometry (FIMS) combined with chemometrics was evaluated for rapidly detecting economically motivated adulteration (EMA) of milk. Twenty-two pure milk and thirty-five counterparts adulterated with soybean, pea, and whey protein isolates at 0.5, 1, 3, 5, and 10% (w/w) levels were analyzed. The principal component analysis (PCA), partial least-squares-discriminant analysis (PLS-DA), and support vector machine (SVM) classification models indicated that the adulterated milks could successfully be classified from the pure milks. FIMS combined with chemometrics might be an effective method to detect possible EMA in milk. Copyright © 2017 Elsevier Ltd. All rights reserved.

  5. Quantification and statistical significance analysis of group separation in NMR-based metabonomics studies

    PubMed Central

    Goodpaster, Aaron M.; Kennedy, Michael A.

    2015-01-01

    Currently, no standard metrics are used to quantify cluster separation in PCA or PLS-DA scores plots for metabonomics studies or to determine if cluster separation is statistically significant. Lack of such measures makes it virtually impossible to compare independent or inter-laboratory studies and can lead to confusion in the metabonomics literature when authors putatively identify metabolites distinguishing classes of samples based on visual and qualitative inspection of scores plots that exhibit marginal separation. While previous papers have addressed quantification of cluster separation in PCA scores plots, none have advocated routine use of a quantitative measure of separation that is supported by a standard and rigorous assessment of whether or not the cluster separation is statistically significant. Here quantification and statistical significance of separation of group centroids in PCA and PLS-DA scores plots are considered. The Mahalanobis distance is used to quantify the distance between group centroids, and the two-sample Hotelling's T2 test is computed for the data, related to an F-statistic, and then an F-test is applied to determine if the cluster separation is statistically significant. We demonstrate the value of this approach using four datasets containing various degrees of separation, ranging from groups that had no apparent visual cluster separation to groups that had no visual cluster overlap. Widespread adoption of such concrete metrics to quantify and evaluate the statistical significance of PCA and PLS-DA cluster separation would help standardize reporting of metabonomics data. PMID:26246647
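
    The chain described above (Mahalanobis distance between group centroids, two-sample Hotelling's T2, then conversion to an F-statistic) can be sketched in a few lines of Python for a two-dimensional scores plot. This is an illustrative sketch using the standard textbook formulas with a pooled covariance matrix, not the authors' code; the function names and coordinates are hypothetical, and the final p-value lookup is omitted because the standard library has no F distribution.

```python
def centroid(points):
    """Mean of a list of (x, y) score coordinates."""
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)


def scatter(points, c):
    """Sums of squares and cross-products about the centroid c."""
    sxx = sum((p[0] - c[0]) ** 2 for p in points)
    sxy = sum((p[0] - c[0]) * (p[1] - c[1]) for p in points)
    syy = sum((p[1] - c[1]) ** 2 for p in points)
    return sxx, sxy, syy


def hotelling_two_sample(a, b):
    """Return (Mahalanobis distance, T2, F) for two groups of 2-D scores."""
    n1, n2, p = len(a), len(b), 2
    ca, cb = centroid(a), centroid(b)
    sa, sb = scatter(a, ca), scatter(b, cb)
    # Pooled covariance matrix [[sxx, sxy], [sxy, syy]].
    sxx, sxy, syy = ((sa[i] + sb[i]) / (n1 + n2 - 2) for i in range(3))
    det = sxx * syy - sxy * sxy
    dx, dy = ca[0] - cb[0], ca[1] - cb[1]
    # Squared Mahalanobis distance d' S^-1 d, using the 2x2 inverse.
    d2 = (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det
    t2 = n1 * n2 / (n1 + n2) * d2
    # F has (p, n1 + n2 - p - 1) degrees of freedom.
    f = (n1 + n2 - p - 1) / (p * (n1 + n2 - 2)) * t2
    return d2 ** 0.5, t2, f


# Hypothetical scores for two well separated groups.
group_1 = [(0, 0), (1, 0), (0, 1), (1, 1)]
group_2 = [(3, 3), (4, 3), (3, 4), (4, 4)]
distance, t2, f_stat = hotelling_two_sample(group_1, group_2)
# For these points: squared distance 54, T2 = 108, F = 45 with (2, 5) degrees of freedom.
```

    Comparing the F value with the critical value of the F distribution at the stated degrees of freedom completes the significance test.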

  6. Metabolic Profiling and Classification of Propolis Samples from Southern Brazil: An NMR-Based Platform Coupled with Machine Learning.

    PubMed

    Maraschin, Marcelo; Somensi-Zeggio, Amélia; Oliveira, Simone K; Kuhnen, Shirley; Tomazzoli, Maíra M; Raguzzoni, Josiane C; Zeri, Ana C M; Carreira, Rafael; Correia, Sara; Costa, Christopher; Rocha, Miguel

    2016-01-22

    The chemical composition of propolis is affected by environmental factors and harvest season, making it difficult to standardize its extracts for medicinal usage. By detecting a typical chemical profile associated with propolis from a specific production region or season, certain types of propolis may be used to obtain a specific pharmacological activity. In this study, propolis from three agroecological regions (plain, plateau, and highlands) from southern Brazil, collected over the four seasons of 2010, were investigated through a novel NMR-based metabolomics data analysis workflow. Chemometrics and machine learning algorithms (PLS-DA and RF), including methods to estimate variable importance in classification, were used in this study. The machine learning and feature selection methods permitted construction of models for propolis sample classification with high accuracy (>75%, reaching ∼90% in the best case), discriminating samples better by collection season than by harvest region. PLS-DA and RF allowed the identification of biomarkers for sample discrimination, expanding the set of discriminating features and adding relevant information for the identification of the class-determining metabolites. The NMR-based metabolomics analytical platform, coupled to bioinformatic tools, allowed characterization and classification of Brazilian propolis samples regarding the metabolite signature of important compounds, i.e., chemical fingerprint, harvest seasons, and production regions.

  7. Potential metabolomic biomarkers for reliable diagnosis of Behcet's disease using gas chromatography/ time-of-flight-mass spectrometry.

    PubMed

    Ahn, Joong Kyong; Kim, Jungyeon; Hwang, Jiwon; Song, Juhwan; Kim, Kyoung Heon; Cha, Hoon-Suk

    2018-05-01

    Although many diagnostic criteria of Behcet's disease (BD) have been developed and revised by experts, diagnosing BD is still complicated and challenging. No metabolomic studies on serum have been attempted to improve the diagnosis and to identify potential biomarkers of BD. The purposes of this study were to investigate distinctive metabolic changes in serum samples of BD patients and to identify metabolic candidate biomarkers for reliable diagnosis of BD using the metabolomics platform. Metabolomic profiling of 90 serum samples from 45 BD patients and 45 healthy controls (HCs) was performed via gas chromatography with time-of-flight mass spectrometry (GC/TOF-MS) with multivariate statistical analyses. A total of 104 metabolites were identified from samples. The serum metabolite profiles obtained from GC/TOF-MS analysis can distinguish BD patients from the HC group in the discovery set. The variation values of the partial least squares-discriminant analysis (PLS-DA) model are R2X of 0.246, R2Y of 0.913 and Q2 of 0.852, indicating strong explanation and prediction capabilities of the model. A panel of five metabolic biomarkers, namely, decanoic acid, fructose, tagatose, linoleic acid and oleic acid, was selected and adequately validated as putative biomarkers of BD (sensitivity 100%, specificity 97.1%, area under the curve 0.998) in the discovery set and independent set. The PLS-DA model showed clear discrimination of BD and HC groups by the five metabolic biomarkers in the independent set. This is the first report on characteristic metabolic profiles and potential metabolite biomarkers in serum for reliable diagnosis of BD using GC/TOF-MS. Copyright © 2017. Published by Elsevier SAS.
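
    R2Y and Q2 values such as those quoted above are commonly defined as one minus a residual sum of squares divided by the total sum of squares of the response, with R2Y using fitted values and Q2 using cross-validated predictions. The Python sketch below shows that arithmetic for a single dummy-coded class variable. It assumes this common definition (the abstract does not state how the software computed the values), and the function name and numbers are hypothetical.

```python
def r2y_and_q2(y, y_fitted, y_cv):
    """Return (R2Y, Q2) for one response column.

    y        : observed values (e.g. 1 = patient, 0 = control)
    y_fitted : predictions from the model fitted on all samples
    y_cv     : predictions made while each sample was held out
    """
    mean_y = sum(y) / len(y)
    tss = sum((v - mean_y) ** 2 for v in y)               # total sum of squares
    rss = sum((v - f) ** 2 for v, f in zip(y, y_fitted))  # residuals of the fit
    press = sum((v - c) ** 2 for v, c in zip(y, y_cv))    # cross-validated residuals
    return 1 - rss / tss, 1 - press / tss


# Hypothetical values for four samples, invented for illustration.
observed = [1, 1, 0, 0]
fitted = [0.9, 0.8, 0.1, 0.2]
cross_validated = [0.7, 0.6, 0.3, 0.4]
r2y, q2 = r2y_and_q2(observed, fitted, cross_validated)
# Here R2Y is 0.9 and Q2 is 0.5, up to floating-point rounding.
```

    Q2 is lower than R2Y in this example because held-out predictions are less accurate than fitted ones; a large gap between the two is a common warning sign of overfitting.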

  8. Simultaneous determination of penicillin G salts by infrared spectroscopy: Evaluation of combining orthogonal signal correction with radial basis function-partial least squares regression

    NASA Astrophysics Data System (ADS)

    Talebpour, Zahra; Tavallaie, Roya; Ahmadi, Seyyed Hamid; Abdollahpour, Assem

    2010-09-01

    In this study, a new method for the simultaneous determination of penicillin G salts in a pharmaceutical mixture via FT-IR spectroscopy combined with chemometrics was investigated. The mixture of penicillin G salts is a complex system due to the similar analytical characteristics of its components. Partial least squares (PLS) and radial basis function-partial least squares (RBF-PLS) were used to develop the linear and nonlinear relation between spectra and components, respectively. The orthogonal signal correction (OSC) preprocessing method was used to correct unexpected information, such as spectral overlapping and scattering effects. In order to compare the influence of OSC on PLS and RBF-PLS models, the optimal linear (PLS) and nonlinear (RBF-PLS) models based on conventional and OSC preprocessed spectra were established and compared. The obtained results demonstrated that OSC clearly enhanced the performance of both RBF-PLS and PLS calibration models. Also, in the case of some nonlinear relation between spectra and component, OSC-RBF-PLS gave more satisfactory results than the OSC-PLS model, which indicated that OSC was helpful in removing extrinsic deviations from linearity without eliminating nonlinear information related to the component. The chemometric models were tested on an external dataset and finally applied to the analysis of a commercial injection product of penicillin G salts.

  9. Biological variation of Vanilla planifolia leaf metabolome.

    PubMed

    Palama, Tony Lionel; Fock, Isabelle; Choi, Young Hae; Verpoorte, Robert; Kodja, Hippolyte

    2010-04-01

    The metabolomic analysis of Vanilla planifolia leaves collected at different developmental stages was carried out using (1)H-nuclear magnetic resonance (NMR) spectroscopy and multivariate data analysis in order to evaluate their variation. Ontogenic changes of the metabolome were considered since leaves of different ages were collected at two different times of the day and in two different seasons. Principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) of (1)H NMR data provided a clear separation according to leaf age, time of the day and season of collection. Young leaves were found to have higher levels of glucose, bis[4-(beta-D-glucopyranosyloxy)-benzyl]-2-isopropyltartrate (glucoside A) and bis[4-(beta-D-glucopyranosyloxy)-benzyl]-2-(2-butyl)-tartrate (glucoside B), whereas older leaves had more sucrose, acetic acid, homocitric acid and malic acid. Results obtained from PLS-DA showed that leaves collected in March 2008 had higher levels of glucosides A and B as compared to those collected in August 2007. However, the relative standard deviation (RSD) exhibited by the individual values of glucosides A and B showed that those compounds vary more according to their developmental stage (50%) than to the time of day or the season in which they were collected (19%). Although morphological variations of the V. planifolia accessions were observed, no clear separation of the accessions was determined from the analysis of the NMR spectra. The results obtained in this study show that this method, based on the use of (1)H NMR spectroscopy in combination with multivariate analysis, has great potential for further applications in the study of the vanilla leaf metabolome. Copyright 2009 Elsevier Ltd. All rights reserved.
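The comparison of relative standard deviations in this record (50% versus 19%) rests on a simple formula: the sample standard deviation divided by the mean, expressed as a percentage. A minimal sketch follows; the input values are invented for illustration and are not glucoside measurements from the study.

```python
# Minimal sketch: relative standard deviation (RSD, also called the
# coefficient of variation) as a percentage. Input values are hypothetical.
import statistics

def rsd_percent(values):
    """Sample standard deviation divided by the mean, times 100."""
    return statistics.stdev(values) / statistics.mean(values) * 100.0

levels = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]  # made-up relative levels
print(round(rsd_percent(levels), 2))
```

Computing the RSD separately for each grouping factor (for example by leaf age and by season) gives the kind of side-by-side comparison the abstract describes.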

  10. NMR metabolomics of ripened and developing oilseed rape (Brassica napus) and turnip rape (Brassica rapa).

    PubMed

    Kortesniemi, Maaria; Vuorinen, Anssi L; Sinkkonen, Jari; Yang, Baoru; Rajala, Ari; Kallio, Heikki

    2015-04-01

    The oilseeds of the commercially important oilseed rape (Brassica napus) and turnip rape (Brassica rapa) were investigated with (1)H NMR metabolomics. The compositions of ripened (cultivated in field trials) and developing seeds (cultivated in controlled conditions) were compared in multivariate models using principal component analysis (PCA), partial least squares discriminant analysis (PLS-DA), and orthogonal partial least squares discriminant analysis (OPLS-DA). Differences in the major lipids and the minor metabolites between the two species were found. Higher contents of polyunsaturated fatty acids and sucrose were observed in turnip rape, while the overall oil content and sinapine levels were higher in oilseed rape. The genotype traits were negligible compared to the effect of the growing site and concomitant conditions on the oilseed metabolome. This study demonstrates the applicability of NMR-based analysis in determining the species, geographical origin, developmental stage, and quality of oilseed Brassicas. Copyright © 2014 Elsevier Ltd. All rights reserved.

  11. Detection of pit fragments in fresh cherries using near infrared spectroscopy

    USDA-ARS's Scientific Manuscript database

    NIR spectroscopy in the wavelength region from 900nm to 2600nm was evaluated as the basis for a rapid, non-destructive method for the detection of pits and pit fragments in fresh cherries. Partial Least Squares discriminant analysis (PLS-DA) following various spectral pretreatments was applied to sp...

  12. Multivariate methods on the excitation emission matrix fluorescence spectroscopic data of diesel-kerosene mixtures: a comparative study.

    PubMed

    Divya, O; Mishra, Ashok K

    2007-05-29

    Quantitative determination of kerosene fraction present in diesel has been carried out based on excitation emission matrix fluorescence (EEMF) along with parallel factor analysis (PARAFAC) and N-way partial least squares regression (N-PLS). EEMF is a simple, sensitive and nondestructive method suitable for the analysis of multifluorophoric mixtures. Calibration models consisting of varying compositions of diesel and kerosene were constructed and their validation was carried out using leave-one-out cross validation method. The accuracy of the model was evaluated through the root mean square error of prediction (RMSEP) for the PARAFAC, N-PLS and unfold PLS methods. N-PLS was found to be a better method compared to PARAFAC and unfold PLS method because of its low RMSEP values.
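Leave-one-out cross-validation and a root-mean-square error are the two ingredients used to judge the calibration models in this record. The sketch below illustrates the procedure with a deliberately simple stand-in model, a univariate straight-line calibration, rather than PARAFAC or N-PLS; the signal values and kerosene fractions are invented, and the function names are hypothetical.

```python
# Minimal sketch: leave-one-out cross-validation (LOO-CV) of a straight-line
# calibration, summarized as a root-mean-square error. The line fit is an
# assumed stand-in for the multiway models named in the abstract.
import math

def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def loo_rmse(xs, ys):
    """Refit with each sample held out in turn and predict the held-out sample."""
    squared_errors = []
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        slope, intercept = fit_line(train_x, train_y)
        prediction = slope * xs[i] + intercept
        squared_errors.append((ys[i] - prediction) ** 2)
    return math.sqrt(sum(squared_errors) / len(squared_errors))

signal = [0.05, 1.02, 1.98, 3.05, 3.96, 5.01]     # hypothetical instrument response
fraction = [0.0, 10.0, 20.0, 30.0, 40.0, 50.0]    # hypothetical % kerosene in diesel
rmse_cv = loo_rmse(signal, fraction)
print(rmse_cv)
```

When several candidate models are run through the same loop, the one with the lowest cross-validated error is preferred, which is the comparison logic the abstract applies to PARAFAC, N-PLS and unfold PLS.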

  13. Analysis of spreadable cheese by Raman spectroscopy and chemometric tools.

    PubMed

    Oliveira, Kamila de Sá; Callegaro, Layce de Souza; Stephani, Rodrigo; Almeida, Mariana Ramos; de Oliveira, Luiz Fernando Cappa

    2016-03-01

    In this work, FT-Raman spectroscopy was explored to evaluate spreadable cheese samples. A partial least squares discriminant analysis was employed to identify the spreadable cheese samples containing starch. To build the models, two types of samples were used: commercial samples and samples manufactured in local industries. The method of supervised classification PLS-DA was employed to classify the samples as adulterated or without starch. Multivariate regression was performed using the partial least squares method to quantify the starch in the spreadable cheese. The limit of detection obtained for the model was 0.34% (w/w) and the limit of quantification was 1.14% (w/w). The reliability of the models was evaluated by determining the confidence interval, which was calculated using the bootstrap re-sampling technique. The results show that the classification models can be used to complement classical analysis and as screening methods. Copyright © 2015 Elsevier Ltd. All rights reserved.
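The bootstrap confidence interval mentioned in this record can be illustrated with a percentile bootstrap. The abstract does not say which statistic was resampled, so the sketch below assumes the mean of a set of hypothetical predicted starch contents; the data, the number of resamples and the fixed seed are all illustrative choices.

```python
# Minimal sketch: percentile bootstrap confidence interval for a mean.
# The resampled statistic and the data are assumptions, not from the study.
import random
import statistics

def bootstrap_ci(values, n_resamples=2000, alpha=0.05, seed=0):
    """Resample with replacement, collect the means, and read off percentiles."""
    rng = random.Random(seed)          # fixed seed so the sketch is repeatable
    n = len(values)
    means = sorted(
        statistics.mean(rng.choices(values, k=n)) for _ in range(n_resamples)
    )
    lower = means[int((alpha / 2) * n_resamples)]
    upper = means[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper

predicted = [0.31, 0.36, 0.29, 0.40, 0.34, 0.33, 0.38, 0.30, 0.35, 0.37]  # made-up % (w/w)
ci_low, ci_high = bootstrap_ci(predicted)
print(ci_low, ci_high)
```

The same resampling loop works for other statistics (for example a regression coefficient or a classification rate) by replacing the call to `statistics.mean`.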

  14. Effects of soil type and organic fertilizers on fatty acids and vitamin E in Korean ginseng (Panax ginseng Meyer).

    PubMed

    Chung, Ill-Min; Kim, Jae-Kwang; Yang, Jin-Hee; Lee, Ji-Hee; Park, Sung-Kyu; Son, Na-Young; Kim, Seung-Hyun

    2017-12-01

    This study examined the effects of soil type and fertilizer regimes on variations in fatty acids (FAs) and vitamin E (Vit-E) in 6-year-old ginseng roots. We observed significant variation in both FA and Vit-E contents owing to the type and quantity of organic fertilizer used in each soil type during cultivation. Unsaturated FAs were approximately 2.7-fold higher than saturated FAs in ginseng. Linoleic, palmitic, and oleic acids were the most abundant FAs detected in ginseng roots. Additionally, α-tocopherol was the major Vit-E detected. In particular, the increased application of rice straw compost or food waste fertilizer elevated the quantity of nutritionally desirable FAs and bioactive Vit-E in ginseng root. Partial least squares-discriminant analysis (PLS-DA) score plots showed that soil type might be the main cause of differences in FA and Vit-E levels in ginseng. Specifically, the PLS-DA model indicated that palmitic acid is a suitable FA marker for determining whether ginseng plants were grown in a paddy-converted field or an upland field. Moreover, linoleic acid levels were highly correlated with α-linolenic acid (r=0.8374; p<0.0001) according to Pearson's correlations and hierarchical clustering analysis. Hence, these preliminary results should prove useful for the reliable production of ginseng containing high phytonutrient quantities according to cultivation conditions. Copyright © 2017 Elsevier Ltd. All rights reserved.

  15. Metabolomic changes in follicular fluid induced by soy isoflavones administered to rats from weaning until sexual maturity

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Wenxiang; Zhang, Wenchang, E-mail: wenchang2002@sina.com; Liu, Jin

    Female Wistar rats at 21 days of age were treated with one of three concentrations of soy isoflavones (SIF) (50, 100 or 200 mg/kg body weight, orally, once per day) from weaning until sexual maturity (3 months) in order to evaluate the influence of SIF on ovarian follicle development. After treatment, the serum sex hormone levels were measured and the ovarian follicles were counted. The metabolic profile of follicular fluid was determined using HPLC-MS. Principal component analysis (PCA) and partial least-squares-discriminant analysis (PLS-DA) were used to identify differences in metabolites and reveal useful toxic biomarkers. The results indicated that modest doses of SIF affect ovarian follicle development, as demonstrated by decreased serum estradiol levels and increases in both ovarian follicle atresia and corpora lutea number in the ovary. SIF treatment-related metabolic alterations in follicular fluid were also found in the PCA and PLS-DA models. The 24 most significantly altered metabolites were identified, including primary sex hormones, amino acids, fatty acids and metabolites involved in energy metabolism. These findings may indicate that soy isoflavones affect ovarian follicle development by inducing metabolomic variations in the follicular fluid. - Highlights: ► Modest doses of soy isoflavones (SIF) do affect ovarian follicle development. ► SIF treatment-related metabolic alterations in follicular fluid were found. ► The 24 most significantly altered metabolites were identified.

  16. Olive oil sensory defects classification with data fusion of instrumental techniques and multivariate analysis (PLS-DA).

    PubMed

    Borràs, Eva; Ferré, Joan; Boqué, Ricard; Mestres, Montserrat; Aceña, Laura; Calvo, Angels; Busto, Olga

    2016-07-15

    Three instrumental techniques, headspace-mass spectrometry (HS-MS), mid-infrared spectroscopy (MIR) and UV-visible spectrophotometry (UV-vis), have been combined to classify virgin olive oil samples based on the presence or absence of sensory defects. The reference sensory values were provided by an official taste panel. Different data fusion strategies were studied to improve the discrimination capability compared to using each instrumental technique individually. A general model was applied to discriminate high-quality non-defective olive oils (extra-virgin) and the lowest-quality olive oils considered non-edible (lampante). A specific identification of key off-flavours, such as musty, winey, fusty and rancid, was also studied. The data fusion of the three techniques improved the classification results in most of the cases. Low-level data fusion was the best strategy to discriminate musty, winey and fusty defects, using HS-MS, MIR and UV-vis, and the rancid defect using only HS-MS and MIR. The mid-level data fusion approach using partial least squares-discriminant analysis (PLS-DA) scores was found to be the best strategy for defective vs non-defective and edible vs non-edible oil discrimination. However, the data fusion did not sufficiently improve the results obtained by a single technique (HS-MS) to classify non-defective classes. These results indicate that instrumental data fusion can be useful for the identification of sensory defects in virgin olive oils. Copyright © 2016 Elsevier Ltd. All rights reserved.

  17. Discerning some Tylenol brands using attenuated total reflection Fourier transform infrared data and multivariate analysis techniques.

    PubMed

    Msimanga, Huggins Z; Ollis, Robert J

    2010-06-01

    Principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) were used to classify acetaminophen-containing medicines using their attenuated total reflection Fourier transform infrared (ATR-FT-IR) spectra. Four formulations of Tylenol (Arthritis Pain Relief, Extra Strength Pain Relief, 8 Hour Pain Relief, and Extra Strength Pain Relief Rapid Release) along with 98% pure acetaminophen were selected for this study because of the similarity of their spectral features, with correlation coefficients ranging from 0.9857 to 0.9988. Before acquiring spectra for the predictor matrix, the effects on spectral precision with respect to sample particle size (determined by sieve size opening), force gauge of the ATR accessory, sample reloading, and between-tablet variation were examined. Spectra were baseline corrected and normalized to unity before multivariate analysis. Analysis of variance (ANOVA) was used to study spectral precision. The large particles (35 mesh) showed large variance between spectra, while fine particles (120 mesh) indicated good spectral precision based on the F-test. Force gauge setting did not significantly affect precision. Sample reloading using the fine particle size and a constant force gauge setting of 50 units also did not compromise precision. Based on these observations, data acquisition for the predictor matrix was carried out with the fine particles (sieve size opening of 120 mesh) at a constant force gauge setting of 50 units. After removing outliers, PCA successfully classified the five samples in the first and second components, accounting for 45.0% and 24.5% of the variances, respectively. The four-component PLS-DA model (R(2)=0.925 and Q(2)=0.906) gave good test spectra predictions with an overall average of 0.961 +/- 7.1% RSD versus the expected 1.0 prediction for the 20 test spectra used.

  18. Unbiased plasma metabolomics reveal the correlation of metabolic pathways and Prakritis of humans.

    PubMed

    Shirolkar, Amey; Chakraborty, Sutapa; Mandal, Tusharkanti; Dabur, Rajesh

    2017-11-25

    Ayurveda, an ancient Indian medicinal system, has categorized human body constitutions into three broad constitutional types (prakritis), i.e. Vata, Pitta and Kapha. The aim was to analyze plasma metabolites and related pathways in order to classify Prakriti-specific dominant marker metabolites and metabolic pathways. 38 healthy male individuals were assessed for dominant Prakritis and their fasting blood samples were collected. The processed plasma samples were subjected to rapid resolution liquid chromatography-electrospray ionization-quadrupole time of flight mass spectrometry (RRLC-ESI-QTOFMS). Mass profiles were aligned and subjected to multivariate analysis. The partial least squares discriminant analysis (PLS-DA) model showed 97.87% recognition capability. The list of PLS-DA metabolites was subjected to permutative Benjamini-Hochberg false discovery rate (FDR) correction, and a final list of 76 metabolites with p < 0.05 and fold-change > 2.0 was identified. Pathway analysis using metascape and JEPETTO plugins in Cytoscape revealed that steroidal hormone biosynthesis, amino acid, and arachidonic acid metabolism are the major pathways varying with different constitutions. Biological GO process analysis showed that aromatic amino acid, sphingolipid, and pyrimidine nucleotide metabolic processes were dominant in the kapha type of body constitution. Fat soluble vitamin, cellular amino acid, and androgen biosynthesis processes, along with branched chain amino acid and glycerolipid catabolic processes, were dominant in pitta type individuals. Vata Prakriti was found to have dominant catecholamine, arachidonic acid and hydrogen peroxide metabolomics processes. The neurotransmission and oxidative stress in vata, BCAA catabolic, androgen, xenobiotics metabolic processes in pitta, and aromatic amino acids, sphingolipid, and pyrimidine metabolic process in kapha Prakriti were the dominant marker pathways. Copyright © 2017 Transdisciplinary University, Bangalore and World Ayurveda Foundation. Published by Elsevier B.V. All rights reserved.
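The metabolite filtering step in this record combines a Benjamini-Hochberg false discovery rate correction with a fold-change cutoff. The sketch below shows the standard step-up Benjamini-Hochberg adjustment followed by such a filter; it is not the "permutative" variant the abstract mentions, whose details are not given, and the p-values, fold changes and metabolite names are invented.

```python
# Minimal sketch: standard Benjamini-Hochberg adjusted p-values, then a
# combined filter (adjusted p < 0.05 and fold change > 2.0). All inputs
# below are hypothetical.

def benjamini_hochberg(pvalues):
    """Return BH-adjusted p-values in the same order as the input."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])   # indices by ascending p
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted

def select_features(names, pvalues, fold_changes, alpha=0.05, min_fold=2.0):
    """Keep features passing both the FDR and the fold-change threshold."""
    adjusted = benjamini_hochberg(pvalues)
    return [
        name
        for name, q, fc in zip(names, adjusted, fold_changes)
        if q < alpha and fc > min_fold
    ]

names = ["m1", "m2", "m3", "m4"]          # placeholder metabolite labels
pvals = [0.01, 0.04, 0.03, 0.005]
folds = [2.5, 3.0, 1.2, 4.1]
q_values = benjamini_hochberg(pvals)
selected = select_features(names, pvals, folds)
print(q_values, selected)
```

In this toy input every adjusted p-value stays below 0.05, so the fold-change cutoff alone removes the third feature.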

  19. Different approaches in Partial Least Squares and Artificial Neural Network models applied for the analysis of a ternary mixture of Amlodipine, Valsartan and Hydrochlorothiazide

    NASA Astrophysics Data System (ADS)

    Darwish, Hany W.; Hassan, Said A.; Salem, Maissa Y.; El-Zeany, Badr A.

    2014-03-01

    Different chemometric models were applied for the quantitative analysis of Amlodipine (AML), Valsartan (VAL) and Hydrochlorothiazide (HCT) in ternary mixture, namely, Partial Least Squares (PLS) as traditional chemometric model and Artificial Neural Networks (ANN) as advanced model. PLS and ANN were applied with and without variable selection procedure (Genetic Algorithm GA) and data compression procedure (Principal Component Analysis PCA). The chemometric methods applied are PLS-1, GA-PLS, ANN, GA-ANN and PCA-ANN. The methods were used for the quantitative analysis of the drugs in raw materials and pharmaceutical dosage form via handling the UV spectral data. A 3-factor 5-level experimental design was established resulting in 25 mixtures containing different ratios of the drugs. Fifteen mixtures were used as a calibration set and the other ten mixtures were used as validation set to validate the prediction ability of the suggested methods. The validity of the proposed methods was assessed using the standard addition technique.

  20. Preliminary construction of integral analysis for characteristic components in complex matrices by in-house fabricated solid-phase microextraction fibers combined with gas chromatography-mass spectrometry.

    PubMed

    Tang, Zhentao; Hou, Wenqian; Liu, Xiuming; Wang, Mingfeng; Duan, Yixiang

    2016-08-26

    Integral analysis plays an important role in the study and quality control of substances with complex matrices in our daily life. As the preliminary construction of integral analysis of substances with complex matrices, developing a relatively comprehensive and sensitive methodology might offer more informative and reliable characteristic components. Flavoring mixtures, which are representative of substances with complex matrices, are now widely used in various fields. To better study and control the quality of flavoring mixtures as additives in the food industry, an in-house fabricated solid-phase microextraction (SPME) fiber was prepared based on sol-gel technology in this work. The active organic component of the fiber coating was multi-walled carbon nanotubes (MWCNTs) functionalized with hydroxyl-terminated polydimethyldiphenylsiloxane, which integrates the non-polar and polar chains of both materials. In this way, more sensitive extraction capability for a wider range of compounds can be obtained in comparison with commercial SPME fibers. Preliminary integral analysis of three similar types of samples was realized by the optimized SPME-GC-MS method. With the obtained GC-MS data, a valid and well-fit model was established by partial least squares discriminant analysis (PLS-DA) for classification of these samples (R2X=0.661, R2Y=0.996, Q2=0.986). The validity of the model (R2=0.266, Q2=-0.465) also confirmed its potential to predict the "belongingness" of new samples. With the PLS-DA and SPSS methods, further screening of the markers among three similar batches of samples may be helpful for monitoring and controlling the quality of the flavoring mixtures as additives in the food industry. Conversely, the reliability and effectiveness of the GC-MS data verified the comprehensive and efficient extraction performance of the in-house fabricated fiber. Copyright © 2016 Elsevier B.V. All rights reserved.
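For readers unfamiliar with the R2Y and Q2 figures quoted in this and other records, the block below gives the definitions most commonly used in chemometrics software. It is a general reminder under that assumption, not a statement of how this particular study computed its values; the symbols are introduced here for illustration.

```latex
% y_i: observed response (class code) of sample i
% \hat{y}_i: value fitted by the model built on all samples
% \hat{y}_{(i)}: value predicted for sample i by a model built without it
%                (or without its cross-validation segment)
% \bar{y}: mean of the observed responses
R^2Y = 1 - \frac{\sum_{i} (y_i - \hat{y}_i)^2}{\sum_{i} (y_i - \bar{y})^2},
\qquad
Q^2 = 1 - \frac{\sum_{i} (y_i - \hat{y}_{(i)})^2}{\sum_{i} (y_i - \bar{y})^2}
```

R2X is defined analogously for the fraction of variation in the predictor matrix reproduced by the model. Because the numerator of Q2 uses out-of-sample predictions, Q2 can be negative when the model predicts worse than the mean, which is consistent with a value such as -0.465 appearing in a validity check.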

  1. Metabolomics fingerprint of coffee species determined by untargeted-profiling study using LC-HRMS.

    PubMed

    Souard, Florence; Delporte, Cédric; Stoffelen, Piet; Thévenot, Etienne A; Noret, Nausicaa; Dauvergne, Bastien; Kauffmann, Jean-Michel; Van Antwerpen, Pierre; Stévigny, Caroline

    2018-04-15

    Coffee bean extracts are consumed all over the world as a beverage, and there is a growing interest in coffee leaf extracts as food supplements. The wild diversity in the Coffea (Rubiaceae) genus is large and could offer new opportunities and challenges. In the present work, a metabolomics approach was implemented to examine the leaf chemical composition of 9 Coffea species grown in the same environmental conditions. Leaves were analyzed by LC-HRMS and a comprehensive statistical workflow was designed. It served for univariate hypothesis testing and multivariate modeling by PCA and partial PLS-DA on the Workflow4Metabolomics infrastructure. The first two axes of PCA and PLS-DA describe more than 40% of the variance, with good values of explained variance. This strategy made it possible to investigate the metabolomics data and their relation to botanical and genetic information. Finally, several key metabolites for the discrimination between species were identified and further characterized. Copyright © 2017 Elsevier Ltd. All rights reserved.

  2. A comparison of different strategies in multivariate regression models for the direct determination of Mn, Cr, and Ni in steel samples using laser-induced breakdown spectroscopy

    NASA Astrophysics Data System (ADS)

    Luna, Aderval S.; Gonzaga, Fabiano B.; da Rocha, Werickson F. C.; Lima, Igor C. A.

    2018-01-01

    Laser-induced breakdown spectroscopy (LIBS) analysis was carried out on eleven steel samples to quantify the concentrations of chromium, nickel, and manganese. LIBS spectral data were correlated to known concentrations of the samples using different strategies in partial least squares (PLS) regression models. For the PLS analysis, one predictive model was separately generated for each element, while different approaches were used for the selection of variables (VIP: variable importance in projection and iPLS: interval partial least squares) in the PLS model to quantify the contents of the elements. The comparison of the performance of the models showed that there was no significant statistical difference using the Wilcoxon signed rank test. The elliptical joint confidence region (EJCR) did not detect systematic errors in these proposed methodologies for each metal.

  3. PLS modelling of structure—activity relationships of catechol O-methyltransferase inhibitors

    NASA Astrophysics Data System (ADS)

    Lotta, Timo; Taskinen, Jyrki; Bäckström, Reijo; Nissinen, Erkki

    1992-06-01

    Quantitative structure-activity analysis was carried out for in vitro inhibition of rat brain soluble catechol O-methyltransferase by a series (N=99) of 1,5-substituted-3,4-dihydroxybenzenes using computational chemistry and multivariate PLS modelling of data sets. The molecular structural descriptors (N=19) associated with the electronics of the catecholic ring and sizes of substituents were derived theoretically. For the whole set of molecules two separate PLS models have to be used. A PLS model with two significant (crossvalidated) model dimensions describing 82.2% of the variance in inhibition activity data was capable of predicting all molecules except those having the largest R1 substituent or having a large R5 substituent compared to the NO2 group. The other PLS model with three significant (crossvalidated) model dimensions described 83.3% of the variance in inhibition activity data. This model could not handle compounds having a small R5 substituent, compared to the NO2 group, or the largest R1 substituent. The predictive capability of these PLS models was good. The models reveal that inhibition activity is nonlinearly related to the size of the R5 substituent. The analysis of the PLS models also shows that the binding affinity is greatly dependent on the electronic nature of both R1 and R5 substituents. The electron-withdrawing nature of the substituents enhances inhibition activity. In addition, the size of the R1 substituent and its lipophilicity are important in the binding of inhibitors. The size of the R1 substituent has an upper limit. On the other hand, ionized R1 substituents decrease inhibition activity.

  4. Rapid and Simultaneous Prediction of Eight Diesel Quality Parameters through ATR-FTIR Analysis.

    PubMed

    Nespeca, Maurilio Gustavo; Hatanaka, Rafael Rodrigues; Flumignan, Danilo Luiz; de Oliveira, José Eduardo

    2018-01-01

    Quality assessment of diesel fuel is highly necessary for society, but the costs and time spent are very high while using standard methods. Therefore, this study aimed to develop an analytical method capable of simultaneously determining eight diesel quality parameters (density; flash point; total sulfur content; distillation temperatures at 10% (T10), 50% (T50), and 85% (T85) recovery; cetane index; and biodiesel content) through attenuated total reflection Fourier transform infrared (ATR-FTIR) spectroscopy and the multivariate regression method, partial least square (PLS). For this purpose, the quality parameters of 409 samples were determined using standard methods, and their spectra were acquired in ranges of 4000–650 cm−1. The use of the multivariate filters, generalized least squares weighting (GLSW) and orthogonal signal correction (OSC), was evaluated to improve the signal-to-noise ratio of the models. Likewise, four variable selection approaches were tested: manual exclusion, forward interval PLS (FiPLS), backward interval PLS (BiPLS), and genetic algorithm (GA). The multivariate filters and variables selection algorithms generated more fitted and accurate PLS models. According to the validation, the FTIR/PLS models presented accuracy comparable to the reference methods and, therefore, the proposed method can be applied in the diesel routine monitoring to significantly reduce costs and analysis time.

  5. Rapid and Simultaneous Prediction of Eight Diesel Quality Parameters through ATR-FTIR Analysis

    PubMed Central

    Hatanaka, Rafael Rodrigues; Flumignan, Danilo Luiz; de Oliveira, José Eduardo

    2018-01-01

    Quality assessment of diesel fuel is highly necessary for society, but the costs and time spent are very high while using standard methods. Therefore, this study aimed to develop an analytical method capable of simultaneously determining eight diesel quality parameters (density; flash point; total sulfur content; distillation temperatures at 10% (T10), 50% (T50), and 85% (T85) recovery; cetane index; and biodiesel content) through attenuated total reflection Fourier transform infrared (ATR-FTIR) spectroscopy and the multivariate regression method, partial least square (PLS). For this purpose, the quality parameters of 409 samples were determined using standard methods, and their spectra were acquired in ranges of 4000–650 cm−1. The use of the multivariate filters, generalized least squares weighting (GLSW) and orthogonal signal correction (OSC), was evaluated to improve the signal-to-noise ratio of the models. Likewise, four variable selection approaches were tested: manual exclusion, forward interval PLS (FiPLS), backward interval PLS (BiPLS), and genetic algorithm (GA). The multivariate filters and variables selection algorithms generated more fitted and accurate PLS models. According to the validation, the FTIR/PLS models presented accuracy comparable to the reference methods and, therefore, the proposed method can be applied in the diesel routine monitoring to significantly reduce costs and analysis time. PMID:29629209

  6. Incorporation of support vector machines in the LIBS toolbox for sensitive and robust classification amidst unexpected sample and system variability

    PubMed Central

    ChariDingari, Narahara; Barman, Ishan; Myakalwar, Ashwin Kumar; Tewari, Surya P.; Kumar, G. Manoj

    2012-01-01

    Despite the intrinsic elemental analysis capability and lack of sample preparation requirements, laser-induced breakdown spectroscopy (LIBS) has not been extensively used for real world applications, e.g. quality assurance and process monitoring. Specifically, variability in sample, system and experimental parameters in LIBS studies presents a substantive hurdle for robust classification, even when standard multivariate chemometric techniques are used for analysis. Considering pharmaceutical sample investigation as an example, we propose the use of support vector machines (SVM) as a non-linear classification method over conventional linear techniques such as soft independent modeling of class analogy (SIMCA) and partial least-squares discriminant analysis (PLS-DA) for discrimination based on LIBS measurements. Using over-the-counter pharmaceutical samples, we demonstrate that application of SVM enables statistically significant improvements in prospective classification accuracy (sensitivity), due to its ability to address variability in LIBS sample ablation and plasma self-absorption behavior. Furthermore, our results reveal that SVM provides nearly 10% improvement in correct allocation rate and a concomitant reduction in misclassification rates of 75% (cf. PLS-DA) and 80% (cf. SIMCA) when measurements from samples not included in the training set are incorporated in the test data, highlighting its robustness. While further studies on a wider matrix of sample types performed using different LIBS systems are needed to fully characterize the capability of SVM to provide superior predictions, we anticipate that the improved sensitivity and robustness observed here will facilitate application of the proposed LIBS-SVM toolbox for screening drugs and detecting counterfeit samples as well as in related areas of forensic and biological sample analysis. PMID:22292496

  7. The influence of a sports drink on the postexercise metabolism of elite athletes as investigated by NMR-based metabolomics.

    PubMed

    Miccheli, Alfredo; Marini, Federico; Capuani, Giorgio; Miccheli, Alberta Tomassini; Delfini, Maurizio; Di Cocco, Maria Enrica; Puccetti, Caterina; Paci, Maurizio; Rizzo, Marta; Spataro, Antonio

    2009-10-01

    The aim of this study is to evaluate the systemic effects of an isotonic sports drink on the metabolic status of athletes of the Italian Olympic rowing team during recovery after strenuous and prolonged physical exercise by means of nuclear magnetic resonance (NMR)-based metabolomics analysis on plasma and urine. Forty-four male athletes of the Italian Olympic rowing team were enrolled in a double-blind crossover study. All subjects underwent 2 evaluations at 1-week intervals. The evaluation was performed on a rowing ergometer after strenuous physical exercise to produce a state of dehydration. Afterward, the athletes were rehydrated either with a green tea-based carbohydrate-hydroelectrolyte drink or with oligomineral water. Three blood samples were drawn for each subject: at rest, after the exercise, and following rehydration, while 2 urine samples were collected: at rest and after the rehydration period. Biofluid samples were analyzed by high-resolution (1)H NMR metabolic profiling combined with multilevel simultaneous component analysis (MSCA) and partial least squares-discriminant analysis (PLS-DA). The between-subject variations, as evaluated by MSCA, reflected the variations of lactate levels induced by the physical exercise. Analysis of the within-individual variance using multilevel PLS-DA models of plasma and urine metabolic profiles showed an effect of the green tea-based sports drink on glucose, citrate, and lactate levels in plasma and on acetone, 3-OH-butyrate, and lactate levels in urine. The increase of caffeine and hippuric acid levels in urine indicated the absorption of green tea extract components. NMR-based metabolomics allowed the complex effects of a green tea extract-based carbohydrate/hydroelectrolyte beverage on the energy metabolism of athletes during recovery by postexercise rehydration to be evaluated.

  8. Analysis of protein glycation in human fingernail clippings with near-infrared (NIR) spectroscopy as an alternative technique for the diagnosis of diabetes mellitus.

    PubMed

    Monteyne, Tinne; Coopman, Renaat; Kishabongo, Antoine S; Himpe, Jonas; Lapauw, Bruno; Shadid, Samyah; Van Aken, Elisabeth H; Berenson, Darja; Speeckaert, Marijn M; De Beer, Thomas; Delanghe, Joris R

    2018-05-11

    Glycated keratin allows the monitoring of average tissue glucose exposure over previous weeks. In the present study, we wanted to explore if near-infrared (NIR) spectroscopy could be used as a non-invasive diagnostic tool for assessing glycation in diabetes mellitus. A total of 52 patients with diabetes mellitus and 107 healthy subjects were enrolled in this study. A limited number (n=21) of nails of healthy subjects were glycated in vitro with 0.278 mol/L, 0.556 mol/L and 0.833 mol/L glucose solution to study the effect of glucose on the nail spectrum. Consequently, the nail clippings of the patients were analyzed using a Thermo Fisher Antaris II Near-IR Analyzer Spectrometer and near infrared (NIR) chemical imaging. Spectral classification (patients with diabetes mellitus vs. healthy subjects) was performed using partial least squares discriminant analysis (PLS-DA). In vitro glycation resulted in peak sharpening between 4300 and 4400 cm-1 and spectral variations at 5270 cm-1 and between 6600 and 7500 cm-1. Similar regions encountered spectral deviations during analysis of the patients' nails. Optimization of the spectral collection parameters was necessary in order to distinguish a large dataset. Spectra had to be collected at 16 cm-1, 128 scans, region 4000-7500 cm-1. Using standard normal variate, Savitzky-Golay smoothing (7 points) and first derivative preprocessing allowed for the prediction of the test set with 100% correct assignments utilizing a PLS-DA model. Analysis of protein glycation in human fingernail clippings with NIR spectroscopy could be an alternative affordable technique for the diagnosis of diabetes mellitus.
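The preprocessing chain named in this record (standard normal variate, 7-point Savitzky-Golay smoothing, first derivative) can be sketched in a few lines. The filter coefficients below are the classical 7-point quadratic Savitzky-Golay values; the polynomial order, the use of a Savitzky-Golay derivative filter, the handling of edge points (dropped here) and the toy spectrum are assumptions, since the abstract does not specify them.

```python
# Minimal sketch: SNV scaling, then 7-point Savitzky-Golay smoothing and a
# 7-point Savitzky-Golay first derivative. Edge points are simply dropped.
import statistics

SG7_SMOOTH = ([-2, 3, 6, 7, 6, 3, -2], 21)      # quadratic/cubic smoothing weights
SG7_DERIV1 = ([-3, -2, -1, 0, 1, 2, 3], 28)     # quadratic first-derivative weights

def snv(spectrum):
    """Standard normal variate: center and scale one spectrum by its own stats."""
    mean = statistics.mean(spectrum)
    sd = statistics.stdev(spectrum)
    return [(v - mean) / sd for v in spectrum]

def apply_filter(values, kernel):
    """Slide the window over the interior points and take the weighted sum."""
    coeffs, norm = kernel
    half = len(coeffs) // 2
    return [
        sum(c * values[i + k - half] for k, c in enumerate(coeffs)) / norm
        for i in range(half, len(values) - half)
    ]

def preprocess(spectrum):
    """SNV, then smoothing, then first derivative (per index step)."""
    scaled = snv(spectrum)
    smoothed = apply_filter(scaled, SG7_SMOOTH)
    return apply_filter(smoothed, SG7_DERIV1)

toy_spectrum = [0.10, 0.12, 0.15, 0.21, 0.30, 0.42, 0.55, 0.66, 0.72,
                0.74, 0.71, 0.63, 0.52, 0.40, 0.31, 0.25, 0.21, 0.19]
processed = preprocess(toy_spectrum)
print(len(processed), processed[:3])
```

Each 7-point pass removes three points from either end, so an 18-point input yields 6 output points here; production code usually pads or fits the edges instead of discarding them.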

  9. Lipidomics study of plasma phospholipid metabolism in early type 2 diabetes rats with ancient prescription Huang-Qi-San intervention by UPLC/Q-TOF-MS and correlation coefficient.

    PubMed

    Wu, Xia; Zhu, Jian-Cheng; Zhang, Yu; Li, Wei-Min; Rong, Xiang-Lu; Feng, Yi-Fan

    2016-08-25

    The potential impact of lipid research has been increasingly realized in both disease treatment and prevention. An effective metabolomics approach based on ultra-performance liquid chromatography/quadrupole-time-of-flight mass spectrometry (UPLC/Q-TOF-MS) along with multivariate statistical analysis has been applied to investigate the dynamic change of plasma phospholipid composition in early type 2 diabetic rats after treatment with Huang-Qi-San, an ancient prescription of Chinese medicine. The exported UPLC/Q-TOF-MS data of plasma samples were subjected to SIMCA-P and processed by the bioMark, mixOmics and Rcomdr packages with R software. Clear score plots of the plasma sample groups, including normal control group (NC), model group (MC), positive medicine control group (Flu) and Huang-Qi-San group (HQS), were achieved by principal-components analysis (PCA), partial least-squares discriminant analysis (PLS-DA) and orthogonal partial least-squares discriminant analysis (OPLS-DA). Biomarkers were screened out using Student's t-test, principal component regression (PCR), partial least-squares regression (PLS) and an important-variable method (variable influence on projection, VIP). Structures of metabolites were identified and metabolic pathways were deduced by correlation coefficient. The relationship between compounds was explained by the correlation coefficient diagram, and the metabolic differences between similar compounds were illustrated. Based on the KEGG database, the biological significance of the identified biomarkers was described. The correlation coefficient was applied for the first time to identify the structures and deduce the metabolic pathways of phospholipid metabolites, and the study provides a new methodological cue for further understanding the molecular mechanisms of metabolites in the course of Huang-Qi-San treatment of early type 2 diabetes. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
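
    The VIP screening mentioned in this abstract is commonly computed from the PLS weight vectors and the response variance explained by each component. The sketch below uses the usual textbook definition; it is not taken from the study or from the SIMCA-P or R packages it names, and the function and argument names are hypothetical.

```python
from math import sqrt


def vip_scores(weights, ss_explained):
    """Variable influence on projection for each variable.

    weights[a][j]   : PLS weight of variable j on component a (hypothetical layout)
    ss_explained[a] : sum of squares of the response explained by component a
    """
    n_vars = len(weights[0])
    total_ss = sum(ss_explained)
    scores = []
    for j in range(n_vars):
        acc = 0.0
        for w_a, ss_a in zip(weights, ss_explained):
            norm_sq = sum(w * w for w in w_a)       # normalise each weight vector
            acc += ss_a * (w_a[j] ** 2) / norm_sq
        scores.append(sqrt(n_vars * acc / total_ss))
    return scores
```

    With this definition the squared VIP values average to 1 across variables, which is why VIP > 1 is a frequently used (though not universal) cut-off for candidate biomarkers.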

  10. Incorporation of support vector machines in the LIBS toolbox for sensitive and robust classification amidst unexpected sample and system variability.

    PubMed

    Dingari, Narahara Chari; Barman, Ishan; Myakalwar, Ashwin Kumar; Tewari, Surya P; Kumar Gundawar, Manoj

    2012-03-20

    Despite the intrinsic elemental analysis capability and lack of sample preparation requirements, laser-induced breakdown spectroscopy (LIBS) has not been extensively used for real-world applications, e.g., quality assurance and process monitoring. Specifically, variability in sample, system, and experimental parameters in LIBS studies presents a substantive hurdle for robust classification, even when standard multivariate chemometric techniques are used for analysis. Considering pharmaceutical sample investigation as an example, we propose the use of support vector machines (SVM) as a nonlinear classification method over conventional linear techniques such as soft independent modeling of class analogy (SIMCA) and partial least-squares discriminant analysis (PLS-DA) for discrimination based on LIBS measurements. Using over-the-counter pharmaceutical samples, we demonstrate that the application of SVM enables statistically significant improvements in prospective classification accuracy (sensitivity), because of its ability to address variability in LIBS sample ablation and plasma self-absorption behavior. Furthermore, our results reveal that SVM provides nearly 10% improvement in correct allocation rate and a concomitant reduction in misclassification rates of 75% (cf. PLS-DA) and 80% (cf. SIMCA) when measurements from samples not included in the training set are incorporated in the test data, highlighting its robustness. While further studies on a wider matrix of sample types performed using different LIBS systems are needed to fully characterize the capability of SVM to provide superior predictions, we anticipate that the improved sensitivity and robustness observed here will facilitate application of the proposed LIBS-SVM toolbox for screening drugs and detecting counterfeit samples, as well as in related areas of forensic and biological sample analysis.

  11. HPLC-ESI-MS(n) Analysis, Fed-Batch Cultivation Enhances Bioactive Compound Biosynthesis and Immune-Regulative Effect of Adventitious Roots in Pseudostellaria heterophylla.

    PubMed

    Wang, Juan; Li, Jing; Li, Hongfa; Wu, Xiaolei; Gao, Wenyuan

    2015-09-01

    An electrospray ionization tandem mass spectrometry (ESI-MS(n)) analysis was performed in order to identify the active composition in Pseudostellaria heterophylla adventitious roots. Pseudostellarin A, C, D, and G were identified from P. heterophylla adventitious roots on the basis of LC-MS(n) analysis. The culture conditions of adventitious roots were optimized, and datasets were subjected to a partial least squares discriminant analysis (PLS-DA), in which the growth ratio and some compounds showed a positive correlation with an aeration volume of 0.3 vvm and inoculum density of 0.15%. Fed-batch cultivation enhanced the contents of total saponin, polysaccharides, and specific oxygen uptake rate (SOUR). The maximum dry root weight (4.728 g l(-1)) was achieved in the 3/4 Murashige and Skoog (MS) medium group. PLS-DA showed that polysaccharides contributed significantly to the clustering of different groups and showed a positive correlation in the MS medium group. The delayed-type hypersensitivity (DTH) reaction on the mice induced by 2,4-dinitrofluorobenzene (DNFB) was applied to compare the immunocompetence effects of adventitious roots (AR) with field native roots (NR) of P. heterophylla. As a result, AR possessed a similar immunoregulation function as NR.

  12. Metabolic Response to XD14 Treatment in Human Breast Cancer Cell Line MCF-7

    PubMed Central

    Pan, Daqiang; Kather, Michel; Willmann, Lucas; Schlimpert, Manuel; Bauer, Christoph; Lagies, Simon; Schmidtkunz, Karin; Eisenhardt, Steffen U.; Jung, Manfred; Günther, Stefan; Kammerer, Bernd

    2016-01-01

    XD14 is a 4-acyl pyrrole derivative, which was discovered by a high-throughput virtual screening experiment. XD14 inhibits bromodomain and extra-terminal domain (BET) proteins (BRD2, BRD3, BRD4 and BRDT) and consequently suppresses cell proliferation. In this study, metabolic profiling reveals the molecular effects in the human breast cancer cell line MCF-7 (Michigan Cancer Foundation-7) treated by XD14. A three-day time series experiment with two concentrations of XD14 was performed. Gas chromatography-mass spectrometry (GC-MS) was applied for untargeted profiling of treated and non-treated MCF-7 cells. The resulting data sets were evaluated by several statistical methods: analysis of variance (ANOVA), clustering analysis, principal component analysis (PCA), and partial least squares discriminant analysis (PLS-DA). Cell proliferation was strongly inhibited by treatment with 50 µM XD14. Samples could be discriminated by time and XD14 concentration using PLS-DA. From the 117 identified metabolites, 67 were significantly altered after XD14 treatment. These metabolites include amino acids, fatty acids, Krebs cycle and glycolysis intermediates, as well as compounds of purine and pyrimidine metabolism. This massive intervention in energy metabolism and the lack of available nucleotides could explain the decreased proliferation rate of the cancer cells. PMID:27783056

  13. Characterization of potential plasma biomarkers related to cognitive impairment by untargeted profiling of phospholipids using the HILIC-ESI-IT-TOF-MS system.

    PubMed

    Song, Shuang; Cheong, Ling-Zhi; Man, Qing-Qing; Pang, Shao-Jie; Li, Yue-Qi; Ren, Biao; Zhang, Jian

    2018-05-01

    Early diagnosis of neural changes causing cognitive impairment is critical for development of preventive therapies for dementia. Biomarkers currently characterized cannot be extensively applied due to the invasive sampling of cerebrospinal fluid. The other imaging approaches are either expensive or require a high level of technical skill. Phospholipids (PLs), which are basic constituents of neurons, might be a key variable in the pathogenesis of cognitive impairment. Changes in plasma PL provide the possibility for development of novel biomarkers with minimal invasion and high patient acceptance. In this work, a HILIC-ESI-IT-TOF-MS system was introduced for untargeted profiling of plasma PLs to investigate the relationship between changes of plasma PL profiles and cognitive impairment. A total of 272 types of PL molecular structures were characterized in human plasma and quantified through the internal standard method. Univariate analysis shows 29 PLs were significantly different between the control (n = 41) and the cognitive impairment (CI) group (n = 41). Multivariate analysis (PCA and OPLS-DA) was conducted based on these 29 potential PL biomarkers. Both univariate and multivariate analyses show abnormality of PL metabolism in the CI group, and the downregulation of ethanolamine plasmalogen (pPE) supply, especially those with PUFAs, in the circulation system should be strongly associated with neurodegeneration. A discriminative model was established with satisfactory fit (R2) and prediction (Q2) abilities, and the classification test showed better recognition of the CI group than the control group, indicating that this model of PL biomarkers could be used as an indicator for screening of CI.
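
    The fit (R2) and prediction (Q2) figures quoted for discriminant models such as this one are usually the same ratio evaluated on two different sets of predictions: fitted values for R2 and cross-validated values for Q2. The sketch below shows that common definition under stated assumptions; chemometrics packages differ in details (for example, how the total sum of squares is computed per fold), and the function name is hypothetical.

```python
def explained_fraction(y_true, y_pred):
    """1 - (sum of squared residuals) / (total sum of squares about the mean)."""
    y_bar = sum(y_true) / len(y_true)
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    ss_tot = sum((y - y_bar) ** 2 for y in y_true)
    return 1.0 - ss_res / ss_tot


# R2: y_pred are values fitted on the same samples used to build the model.
# Q2: y_pred are cross-validated predictions, so ss_res is the PRESS statistic.
```

    Because Q2 is based on held-out predictions, it can be much lower than R2, or even negative, when a model is overfitted.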

  14. Application of Genetic Algorithm (GA) Assisted Partial Least Square (PLS) Analysis on Trilinear and Non-trilinear Fluorescence Data Sets to Quantify the Fluorophores in Multifluorophoric Mixtures: Improving Quantification Accuracy of Fluorimetric Estimations of Dilute Aqueous Mixtures.

    PubMed

    Kumar, Keshav

    2018-03-01

    Excitation-emission matrix fluorescence (EEMF) and total synchronous fluorescence spectroscopy (TSFS) are the 2 fluorescence techniques that are commonly used for the analysis of multifluorophoric mixtures. These 2 fluorescence techniques are conceptually different and provide certain advantages over each other. The manual analysis of such large volumes of highly correlated EEMF and TSFS data towards developing a calibration model is difficult. Partial least square (PLS) analysis can analyze the large volume of EEMF and TSFS data sets by finding important factors that maximize the correlation between the spectral and concentration information for each fluorophore. However, the application of PLS analysis on entire data sets often does not provide a robust calibration model and requires application of a suitable pre-processing step. The present work evaluates the application of genetic algorithm (GA) analysis prior to PLS analysis on EEMF and TSFS data sets towards improving the precision and accuracy of the calibration model. The GA algorithm essentially combines the advantages provided by stochastic methods with those provided by deterministic approaches and can find the set of EEMF and TSFS variables that correlate well with the concentration of each of the fluorophores present in the multifluorophoric mixtures. The utility of the GA assisted PLS analysis is successfully validated using (i) EEMF data sets acquired for dilute aqueous mixtures of four biomolecules and (ii) TSFS data sets acquired for dilute aqueous mixtures of four carcinogenic polycyclic aromatic hydrocarbons (PAHs). In the present work, it is shown that by using the GA it is possible to significantly improve the accuracy and precision of the PLS calibration model developed for both EEMF and TSFS data sets. Hence, GA must be considered as a useful pre-processing technique while developing an EEMF and TSFS calibration model.

  15. Proton Nuclear Magnetic Resonance-Spectroscopic Discrimination of Wines Reflects Genetic Homology of Several Different Grape (V. vinifera L.) Cultivars.

    PubMed

    Hu, Boran; Yue, Yaqing; Zhu, Yong; Wen, Wen; Zhang, Fengmin; Hardie, Jim W

    2015-01-01

    Proton nuclear magnetic resonance spectroscopy coupled multivariate analysis (1H NMR-PCA/PLS-DA) is an important tool for the discrimination of wine products. Although 1H NMR has been shown to discriminate wines of different cultivars, a grape genetic component of the discrimination has been inferred only from discrimination of cultivars of undefined genetic homology and in the presence of many confounding environmental factors. We aimed to confirm the influence of grape genotypes in the absence of those factors. We applied 1H NMR-PCA/PLS-DA and hierarchical cluster analysis (HCA) to wines from five, variously genetically-related grapevine (V. vinifera) cultivars; all grown similarly on the same site and vinified similarly. We also compared the semi-quantitative profiles of the discriminant metabolites of each cultivar with previously reported chemical analyses. The cultivars were clearly distinguishable and there was a general correlation between their grouping and their genetic homology as revealed by recent genomic studies. Between cultivars, the relative amounts of several of the cultivar-related discriminant metabolites conformed closely with reported chemical analyses. Differences in grape-derived metabolites associated with genetic differences alone are a major source of 1H NMR-based discrimination of wines and 1H NMR has the capacity to discriminate between very closely related cultivars. The study confirms that genetic variation among grape cultivars alone can account for the discrimination of wine by 1H NMR-PCA/PLS and indicates that 1H NMR spectra of wine of single grape cultivars may in future be used in tandem with hierarchical cluster analysis to elucidate genetic lineages and metabolomic relations of grapevine cultivars. In the absence of genetic information, for example, where predecessor varieties are no longer extant, this may be a particularly useful approach.

  16. Comprehensive analysis of serum metabolites in gestational diabetes mellitus by UPLC/Q-TOF-MS.

    PubMed

    Liu, Tianhu; Li, Jiaxun; Xu, Fengcheng; Wang, Mengni; Ding, Shijia; Xu, Hongbing; Dong, Fang

    2016-02-01

    Gestational diabetes mellitus (GDM) refers to the first sign or onset of diabetes mellitus during pregnancy rather than progestation. In recent decades, more and more research has focused on the etiology and pathogenesis of GDM in order to further understand GDM progress and recovery. Using an advanced metabolomics platform based on ultra-performance liquid chromatography quadrupole time-of-flight mass spectrometry (UPLC/Q-TOF-MS), we explored the changes in serum metabolites between women with GDM and healthy controls during and after pregnancy. Some significant differences were discovered using multivariate analysis including partial least-squares discriminant analysis (PLS-DA) and orthogonal PLS-DA (OPLS-DA). The dysregulated metabolites were further compared and verified in several databases to understand how these compounds might function as potential biomarkers. Analyses of the metabolic pathways associated with these potential biomarkers were subsequently explored. A total of 35 metabolites were identified, contributing to GDM progress to some extent. The identified biomarkers were involved in some important metabolic pathways including glycine, serine, and threonine metabolism; steroid hormone biosynthesis; tyrosine metabolism; glycerophospholipid metabolism; and fatty acid metabolism. The above mentioned metabolic pathways mainly participate in three major metabolic cycles in humans, including lipid metabolism, carbohydrate metabolism, and amino acid metabolism. In this pilot study, the valuable comprehensive analysis gave us further insight into the etiology and pathophysiology of GDM, which might benefit the feasibility of a rapid, accurate diagnosis and reasonable treatment as soon as possible but also prevent GDM and its related short- and long-term complications.

  17. The development of comparative bias index

    NASA Astrophysics Data System (ADS)

    Aimran, Ahmad Nazim; Ahmad, Sabri; Afthanorhan, Asyraf; Awang, Zainudin

    2017-08-01

    Structural Equation Modeling (SEM) is a second-generation statistical analysis technique developed for analyzing the inter-relationships among multiple variables in a model simultaneously. The two most commonly used methods in SEM are Covariance-Based Structural Equation Modeling (CB-SEM) and Partial Least Square Path Modeling (PLS-PM). There have been continuous debates among researchers over the use of PLS-PM versus CB-SEM. However, few studies have been conducted to test the bias of CB-SEM and PLS-PM in estimating simulation data. This study intends to address this problem by a) developing the Comparative Bias Index and b) testing the performance of CB-SEM and PLS-PM using the developed index. Based on a balanced experimental design, two multivariate normal simulation data sets with distinct specifications, of size 50, 100, 200 and 500, are generated and analyzed using CB-SEM and PLS-PM.

  18. Quantitative analysis of glycated albumin in serum based on ATR-FTIR spectrum combined with SiPLS and SVM.

    PubMed

    Li, Yuanpeng; Li, Fucui; Yang, Xinhao; Guo, Liu; Huang, Furong; Chen, Zhenqiang; Chen, Xingdan; Zheng, Shifu

    2018-08-05

    A rapid quantitative analysis model for determining the glycated albumin (GA) content based on attenuated total reflectance (ATR)-Fourier transform infrared spectroscopy (FTIR) combined with linear SiPLS and nonlinear SVM has been developed. Firstly, the real GA content in human serum was determined by the GA enzymatic method, and the ATR-FTIR spectra of serum samples from a health-examination population were obtained. The spectral data of the whole mid-infrared region (4000-600 cm-1) and GA's characteristic region (1800-800 cm-1) were used for quantitative analysis. Secondly, several preprocessing steps, including first derivative, second derivative, variable standardization and spectral normalization, were performed. Lastly, quantitative regression models were established using SiPLS and SVM, respectively. The SiPLS modeling results are as follows: root mean square error of cross validation (RMSECV_T) = 0.523 g/L, calibration coefficient (R_C) = 0.937, root mean square error of prediction (RMSEP_T) = 0.787 g/L, and prediction coefficient (R_P) = 0.938. The SVM modeling results are as follows: RMSECV_T = 0.0048 g/L, R_C = 0.998, RMSEP_T = 0.442 g/L, and R_P = 0.916. The results indicated that the model performance was improved significantly after preprocessing and optimization of characteristic regions, and that the modeling performance of nonlinear SVM was considerably better than that of linear SiPLS. Hence, the quantitative analysis model for GA in human serum based on ATR-FTIR combined with SiPLS and SVM is effective. It does not need sample preprocessing, is simple to operate and time-efficient, and provides a rapid and accurate method for GA content determination. Copyright © 2018 Elsevier B.V. All rights reserved.
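
    The error and correlation figures reported for calibration models like these can be reproduced from paired reference and predicted values. The sketch below assumes the plain definitions (root mean square error over n samples, Pearson correlation); whether the study used exactly these forms is not stated in the abstract, and the function names are hypothetical. RMSECV would use cross-validated predictions and RMSEP predictions on an independent set.

```python
from math import sqrt


def rmse(reference, predicted):
    """Root mean square error between reference and predicted values."""
    n = len(reference)
    return sqrt(sum((r - p) ** 2 for r, p in zip(reference, predicted)) / n)


def pearson_r(x, y):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)
```

    A very small cross-validation error combined with a much larger prediction error, as in the SVM figures above, is the pattern these two numbers are meant to expose.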

  19. Maldi-tof fingerprinting of seminal plasma lipids in the study of human male infertility.

    PubMed

    Camargo, Mariana; Intasqui, Paula; de Lima, Camila Bruna; Montani, Daniela Antunes; Nichi, Marcílio; Pilau, Eduardo Jorge; Gozzo, Fabio Cesar; Lo Turco, Edson Guimarães; Bertolla, Ricardo Pimenta

    2014-09-01

    This study proposed lipid fingerprinting of human seminal plasma by mass spectrometry as an analytical method to differentiate biological conditions. For this purpose, we chose infertile men as a model to study specific conditions, namely: high and low seminal plasma lipid peroxidation levels (sub-study 1.1), high and low sperm nuclear DNA fragmentation (sub-study 1.2), and intervention status: before and after subinguinal microsurgical varicocelectomy (study 2). Study 1 included 133 patients, of which 113 were utilized for sub-study 1.1 and 89 for sub-study 1.2. Study 2 included 17 adult men submitted to subinguinal varicocelectomy, before and 90 days after varicocelectomy. Lipids were extracted from seminal plasma and submitted to Matrix-Assisted Laser Desorption Ionization Quadrupole-Time-of-Flight Mass Spectrometry in the positive ionization mode. Spectra were processed using Waters(®) MassLynx, and MetaboAnalyst online software was used for statistical analyses. For sub-studies 1.1 and 1.2, and study 2, univariate analysis revealed 8, 87 and 34 significant ions, respectively. Multivariate analysis was performed through PCA and PLS-DA. PCA generated 56, 32 and 34 components respectively for each study and these were submitted to logistic regression. A ROC curve was plotted and the area under the curve was equal to 97.4, 92.5 and 96.5%. PLS-DA generated a list of 19, 24 and 23 VIP ions for sub-studies 1.1 and 1.2, and study 2, respectively. Therefore, this study established the lipid profile and comparison of patterns altered in response to specific biological conditions.
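
    The area under the ROC curve reported here can be computed directly from classifier scores as the probability that a randomly chosen positive sample scores higher than a randomly chosen negative one. The sketch below is a generic illustration, not the MetaboAnalyst procedure used in the study; the function name is hypothetical.

```python
def roc_auc(scores_pos, scores_neg):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))
```

    In a setting like the one described, the scores would be the fitted probabilities from the logistic regression on the PCA components.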

  20. Targeted isolation and identification of bioactive compounds lowering cholesterol in the crude extracts of crabapples using UPLC-DAD-MS-SPE/NMR based on pharmacology-guided PLS-DA.

    PubMed

    Wen, Chao; Wang, Dongshan; Li, Xing; Huang, Tao; Huang, Cheng; Hu, Kaifeng

    2018-02-20

    The anti-hyperlipidemic effects of crude crabapple extracts derived from Malus 'Red jade', Malus hupehensis (Pamp.) Rehd. and Malus prunifolia (Willd.) Borkh. were evaluated on high-fat diet induced obese (HF DIO) mice. The results revealed that some of these extracts could lower serum cholesterol levels in HF DIO mice. The same extracts were also analyzed in parallel by LC-MS in both positive and negative ionization modes. Based on the pharmacological results, 22 LC-MS variables were identified to be correlated with the anti-hyperlipidemic effects using partial least square discriminant analysis (PLS-DA) and independent samples t-test. Further, under the guidance of the bioactivity-correlated LC-MS signals, 10 compounds were isolated in a targeted manner and enriched using UPLC-DAD-MS-SPE and identified/elucidated by NMR together with MS/MS as citric acid (1), p-coumaric acid (2), hyperoside (3), myricetin (4), naringenin (5), quercetin (6), kaempferol (7), gentiopicroside (8), ursolic acid (9) and 8-epiloganic acid (10). Among these 10 compounds, 6 compounds, hyperoside (3), myricetin (4), naringenin (5), quercetin (6), kaempferol (7) and ursolic acid (9), were individually studied and reported to indeed have effects on lowering the serum lipid levels. These results demonstrated the efficiency of this strategy for drug discovery. In contrast to traditional routes to discover bioactive compounds in plant extracts, targeted isolation and identification of bioactive compounds in crude plant extracts using UPLC-DAD-MS-SPE/NMR based on pharmacology-guided PLS-DA of LC-MS data brings forward a new, efficient dereplication approach to natural products research for drug discovery. Copyright © 2017 Elsevier B.V. All rights reserved.

  1. [Study on the detection of active ingredient contents of Paecilomyces hepiali mycelium via near infrared spectroscopy].

    PubMed

    Teng, Wei-Zhuo; Song, Jia; Meng, Fan-Xin; Meng, Qing-Fan; Lu, Jia-Hui; Hu, Shuang; Teng, Li-Rong; Wang, Di; Xie, Jing

    2014-10-01

    Partial least squares (PLS) and radial basis function neural network (RBFNN) combined with near infrared spectroscopy (NIR) were applied to develop models for cordycepic acid, polysaccharide and adenosine analysis in Paecilomyces hepiali fermentation mycelium. The developed models possess good generalization and predictive ability and can be applied to the determination of crude drugs and related products. During the experiment, 214 Paecilomyces hepiali mycelium samples were obtained via chemical mutagenesis combined with submerged fermentation. The contents of cordycepic acid, polysaccharide and adenosine were determined via traditional methods and the near infrared spectroscopy data were collected. The outliers were removed and the numbers of calibration set were confirmed via Monte Carlo partial least square (MCPLS) method. Based on the values of degree of approach (Da), both moving window partial least squares (MWPLS) and moving window radial basis function neural network (MWRBFNN) were applied to optimize characteristic wavelength variables, optimum preprocessing methods and other important variables in the models. After comparison, the RBFNN, RBFNN and PLS models were developed successfully for cordycepic acid, polysaccharide and adenosine detection, and the correlation between reference values and predictive values in both calibration set (R2c) and validation set (R2p) of optimum models was 0.9417 and 0.9663, 0.9803 and 0.9850, and 0.9761 and 0.9728, respectively. All the data suggest that these models possess good fit and predictive ability.

  2. Analysis of trace metal concentrations in raw cow's milk from three dairy farms in North Gondar, Ethiopia: chemometric approach.

    PubMed

    Akele, M L; Abebe, D Z; Alemu, A K; Assefa, A G; Madhusudhan, A; de Oliveira, R R

    2017-09-11

    Concentrations of essential (Cu, Mn, and Zn) and toxic (Cr, Cd, and Pb) trace metals in 30 raw cow's milk samples were quantified using flame atomic absorption spectrometry. The samples were collected from the Nara-Awudarda, Tana-Abo, and Kosoye Amba-Rass sites in North Gondar, Ethiopia, preserved in a deep freezer (-20 °C), and then digested by Kjeldahl apparatus with HNO3/H2O2 (5:2; v/v) at 300 °C for 2.5 h. The data were subject to principal component analysis (PCA) and partial least square-discriminant analysis (PLS-DA). Overall hazard quotient (HQ) and carcinogenic risk (CR) values were also estimated to assess metal-related health risks. The mean concentrations of Cr, Mn, Cu, Zn, Cd, and Pb in the milk samples ranged 0.468-0.828, 1.614-2.806, 0.840-1.532, 1.208-5.267, ND-0.330, and ND-0.186 mg/kg, respectively. The lowest values were obtained for Kosoye Amba-Rass milk samples, while the highest were found for those collected from Nara-Awudarda milk samples, probably due to high mineral enrichment and metal leaching (especially Cd and Pb) from coal deposits. PCA revealed clustering of samples with respect to their geographic origin. Validation of PLS-DA model showed 100% classification efficiency using external validation samples and detected Cd and Cu as trace metal markers. The HQ and CR values were within the safe level; however, the former is close to the alert threshold level for Nara-Awudarda milk samples. Thus, further studies on common foodstuffs, constituting a higher proportion in the local diet, are required in this area to provide a complete risk assessment.

  3. Centrifugal ultrafiltration of human serum for improving immunoglobulin A quantification using attenuated total reflectance infrared spectroscopy.

    PubMed

    Elsohaby, Ibrahim; McClure, J Trenton; Riley, Christopher B; Bryanton, Janet; Bigsby, Kathryn; Shaw, R Anthony

    2018-02-20

    Attenuated total reflectance infrared (ATR-IR) spectroscopy is a simple, rapid and cost-effective method for the analysis of serum. However, the complex nature of serum remains a limiting factor to the reliability of this method. We investigated the benefits of coupling centrifugal ultrafiltration with ATR-IR spectroscopy for quantification of human serum IgA concentration. Human serum samples (n = 196) were analyzed for IgA using an immunoturbidimetric assay. ATR-IR spectra were acquired for whole serum samples and for the retentate (residue) reconstituted with saline following 300 kDa centrifugal ultrafiltration. IR-based analytical methods were developed for each of the two spectroscopic datasets, and the accuracy of each of the two methods compared. Analytical methods were based upon partial least squares regression (PLSR) calibration models - one with 5-PLS factors (for whole serum) and the second with 9-PLS factors (for the reconstituted retentate). Comparison of the two sets of IR-based analytical results to reference IgA values revealed improvements in the Pearson correlation coefficient (from 0.66 to 0.76), and the root mean squared error of prediction in IR-based IgA concentrations (from 102 to 79 mg/dL) for the ultrafiltration retentate-based method as compared to the method built upon whole serum spectra. Depleting human serum low molecular weight proteins using a 300 kDa centrifugal filter thus enhances the accuracy of IgA quantification by ATR-IR spectroscopy. Further evaluation and optimization of this general approach may ultimately lead to routine analysis of a range of high molecular-weight analytical targets that are otherwise unsuitable for IR-based analysis. Copyright © 2017 Elsevier B.V. All rights reserved.

  4. A (1)H NMR-based metabonomic study on the SAMP8 and SAMR1 mice and the effect of electro-acupuncture.

    PubMed

    Qiao-feng, Wu; Ling-ling, Guo; Shu-guang, Yu; Qi, Zhang; Sheng-feng, Lu; Fang, Zeng; Hai-yan, Yin; Yong, Tang; Xian-zhong, Yan

    2011-10-01

    A (1)H NMR-based metabonomic method was used to investigate the metabolic change of plasma in senescence-prone 8 (SAMP8) mice before and after electro-acupuncture (EA). Sixteen SAMP8 male mice (aged 8 months) were randomly divided into a model group and an acupuncture treatment group, and the latter group received EA treatment for 21 days. Eight senescence-resistant 1 (SAMR1) mice were used as the control group. The Morris water maze was used to evaluate the effects of EA. All mice plasma samples obtained from different groups were analyzed by using 600 MHz (1)H nuclear magnetic resonance ((1)H NMR) spectroscopy. The data sets were analyzed by Principal Components Analysis (PCA) and Partial Least Squares-Discriminant Analysis (PLS-DA) to discriminate the key plasma metabolites among different groups. Results indicated that both the escape and probe tasks of SAMP8 could be improved by EA treatment. Metabonomic study showed that SAMR1 and SAMP8 were separated clearly in both CPMG_OSC_PLS and LED_OSC_PLS score plots. Interestingly, samples obtained from EA group were distributed closely to SAMR1 group in CPMG_OSC_PLS score plot, but away from SAMP8 group in LED_OSC_PLS score plot. Corresponding loading plots showed that much less lactate was seen in SAMP8 mice plasma. Other changes including higher levels of dimethylamine (DMA), choline and α-glucose but lower levels of leucine/isoleucine, HDL, LDL/VLDL, 3-hydroxybutyrate (3-HB), and trimethylamine N-oxide (TMAO) were observed in the SAMP8 mice plasma than in the SAMR1. After EA treatment, the levels of lactate, DMA, choline and TMAO were improved. Results of this work can provide valuable clues to the understanding of the metabolic changes in the senile impairment of mice. It is also hoped that the methodology can be used in evaluating the effects of EA and understanding the underlying acupuncture mechanism in treating neurodegenerative diseases. Copyright © 2011 Elsevier Inc. All rights reserved.

  5. Identification of anisodamine tablets by Raman and near-infrared spectroscopy with chemometrics.

    PubMed

    Li, Lian; Zang, Hengchang; Li, Jun; Chen, Dejun; Li, Tao; Wang, Fengshan

    2014-06-05

    Vibrational spectroscopy including Raman and near-infrared (NIR) spectroscopy has become an attractive tool for pharmaceutical analysis. In this study, effective calibration models for the identification of anisodamine tablets and their counterfeits and for the discrimination of manufacturing plants, based on Raman and NIR spectroscopy, respectively, were built. Anisodamine counterfeit tablets were identified by Raman spectroscopy with the correlation coefficient method, and the results showed that the predictive accuracy was 100%. The genuine anisodamine tablets from 5 different manufacturing plants were distinguished by NIR spectroscopy using partial least squares discriminant analysis (PLS-DA) models based on the interval principal component analysis (iPCA) method. The results showed that the recognition rate and rejection rate were both 100%. In conclusion, Raman spectroscopy and NIR spectroscopy combined with chemometrics are feasible and promising tools for rapid pharmaceutical tablet discrimination. Copyright © 2014 Elsevier B.V. All rights reserved.
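
    PLS-DA, as used throughout these records, is PLS regression on class labels coded as numbers, followed by a decision threshold. The sketch below is a deliberately reduced illustration: one latent component, two classes coded 0/1, and a 0.5 threshold, all of which are assumptions made for brevity. It is not the model built in this study (which distinguishes five plants and would need one dummy column per class), and the function names are hypothetical.

```python
from math import sqrt


def fit_plsda_1comp(X, labels):
    """Fit a one-component PLS model to 0/1 class labels.

    X is a list of samples (rows of spectral variables); labels are 0 or 1.
    """
    n, p = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(labels) / n
    Xc = [[row[j] - x_mean[j] for j in range(p)] for row in X]   # centred X
    yc = [y - y_mean for y in labels]                            # centred y
    # Weight vector: direction in X space with maximal covariance with y.
    w = [sum(Xc[i][j] * yc[i] for i in range(n)) for j in range(p)]
    norm = sqrt(sum(v * v for v in w))
    w = [v / norm for v in w]
    # Scores of the training samples and regression of y on the scores.
    t = [sum(Xc[i][j] * w[j] for j in range(p)) for i in range(n)]
    q = sum(ti * yi for ti, yi in zip(t, yc)) / sum(ti * ti for ti in t)
    return {"x_mean": x_mean, "y_mean": y_mean, "w": w, "q": q}


def predict_plsda(model, x, threshold=0.5):
    """Project one sample onto the component and threshold the predicted label."""
    t = sum((xj - mj) * wj for xj, mj, wj in zip(x, model["x_mean"], model["w"]))
    y_hat = model["y_mean"] + model["q"] * t
    return 1 if y_hat >= threshold else 0
```

    Real applications normally extract several components and choose their number by cross-validation; the single component here only shows the mechanics.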

  6. Volatiles Mediated Interactions Between Aspergillus oryzae Strains Modulate Morphological Transition and Exometabolomes.

    PubMed

    Singh, Digar; Lee, Choong H

    2018-01-01

    Notwithstanding its mitosporic nature, an improbable morpho-transformation state, i.e., sclerotial development (SD), is vaguely known in Aspergillus oryzae. Nevertheless an intriguing phenomenon governing mold's development and stress response, the effects of exogenous factors engendering SD, especially the volatile organic compounds (VOCs) mediated interactions (VMI) pervasive in microbial niches have largely remained unexplored. Herein, we examined the effects of intra-species VMI on SD in A. oryzae RIB 40, followed by comprehensive analyses of associated growth rates, pH alterations, biochemical phenotypes, and exometabolomes. We cultivated A. oryzae RIB 40 (S1VMI: KACC 44967) opposite a non-SD partner strain, A. oryzae (S2: KCCM 60345), conditioning VMI in a specially designed "twin plate assembly." Notably, SD in S1VMI was delayed relative to its non-conditioned control (S1) cultivated without partner strain (S2) in twin plate. Selectively evaluating A. oryzae RIB 40 (S1VMI vs. S1) for altered phenotypes concomitant to SD, we observed a marked disparity for corresponding growth rates (S1VMI < S1)7days, media pH (S1VMI > S1)7days, and biochemical characteristics viz., protease (S1VMI > S1)7days, amylase (S1VMI > nS1)3-7days, and antioxidants (S1VMI > S1)7days levels. The partial least squares-discriminant analysis (PLS-DA) of gas chromatography-time of flight-mass spectrometry (GC-TOF-MS) datasets for primary metabolites exhibited a clustered pattern (PLS1, 22.04%; PLS2, 11.36%), with 7 days incubated S1VMI extracts showing a higher abundance of amino acids, sugars, and sugar alcohols with lower organic acids and fatty acids levels, relative to S1. Intriguingly, the higher amino acid and sugar alcohol levels were positively correlated with antioxidant activity, likely impeding SD in S1VMI. Further, the PLS-DA (PLS1, 18.11%; PLS2, 15.02%) based on liquid chromatography-mass spectrometry (LC-MS) datasets exhibited a notable disparity for post-SD (9-11 days) sample extracts with higher oxylipins and 13-desoxypaxilline levels in S1VMI relative to S1, intertwining Aspergillus morphogenesis and secondary metabolism. The analysis of VOCs for the 7 days incubated samples displayed considerably higher accumulation of C-8 compounds in the headspace of twin-plate experimental sets (S1VMI:S2) compared to those in non-conditioned controls (S1 and S2, without respective partner strains), potentially triggering altered morpho-transformation and concurring biochemical as well as metabolic states in molds.

  7. Simultaneous quantitative determination of paracetamol and tramadol in tablet formulation using UV spectrophotometry and chemometric methods

    NASA Astrophysics Data System (ADS)

    Glavanović, Siniša; Glavanović, Marija; Tomišić, Vladislav

    2016-03-01

    The UV spectrophotometric methods for simultaneous quantitative determination of paracetamol and tramadol in paracetamol-tramadol tablets were developed. The spectrophotometric data obtained were processed by means of partial least squares (PLS) and genetic algorithm coupled with PLS (GA-PLS) methods in order to determine the content of active substances in the tablets. The results gained by chemometric processing of the spectroscopic data were statistically compared with those obtained by means of validated ultra-high performance liquid chromatographic (UHPLC) method. The accuracy and precision of data obtained by the developed chemometric models were verified by analysing the synthetic mixture of drugs, and by calculating recovery as well as relative standard error (RSE). A statistically good agreement was found between the amounts of paracetamol determined using PLS and GA-PLS algorithms, and that obtained by UHPLC analysis, whereas for tramadol GA-PLS results were proven to be more reliable compared to those of PLS. The simplest and the most accurate and precise models were constructed by using the PLS method for paracetamol (mean recovery 99.5%, RSE 0.89%) and the GA-PLS method for tramadol (mean recovery 99.4%, RSE 1.69%).

  8. Sugar and acid content of Citrus prediction modeling using FT-IR fingerprinting in combination with multivariate statistical analysis.

    PubMed

    Song, Seung Yeob; Lee, Young Koung; Kim, In-Jung

    2016-01-01

    A high-throughput screening system for Citrus lines with higher sugar and acid contents was established using Fourier transform infrared (FT-IR) spectroscopy in combination with multivariate analysis. FT-IR spectra confirmed typical spectral differences between the frequency regions of 950-1100 cm(-1), 1300-1500 cm(-1), and 1500-1700 cm(-1). Principal component analysis (PCA) and subsequent partial least square-discriminant analysis (PLS-DA) were able to discriminate five Citrus lines into three separate clusters corresponding to their taxonomic relationships. The quantitative predictive modeling of sugar and acid contents from Citrus fruits was established using partial least square regression algorithms from FT-IR spectra. The regression coefficients (R(2)) between predicted values and estimated sugar and acid content values were 0.99. These results demonstrate that by using FT-IR spectra and applying quantitative prediction modeling to Citrus sugar and acid contents, excellent Citrus lines can be detected early with greater accuracy. Copyright © 2015 Elsevier Ltd. All rights reserved.

  9. In vivo study of dermal collagen of striae distensae by confocal Raman spectroscopy.

    PubMed

    Lung, Pam Wen; Tippavajhala, Vamshi Krishna; de Oliveira Mendes, Thiago; Téllez-Soto, Claudio A; Schuck, Desirée Cigaran; Brohem, Carla Abdo; Lorencini, Marcio; Martin, Airton Abrahão

    2018-04-01

    This research work mainly deals with studying qualitatively the changes in the dermal collagen of two forms of striae distensae (SD) namely striae rubrae (SR) and striae albae (SA) when compared to normal skin (NS) using confocal Raman spectroscopy. The methodology includes an in vivo human skin study for the comparison of confocal Raman spectra of dermis region of SR, SA, and NS by supervised multivariate analysis using partial least squares discriminant analysis (PLS-DA) to determine qualitatively the changes in dermal collagen. These groups are further analyzed for the extent of hydration of dermal collagen by studying the changes in the water content bound to it. PLS-DA score plot showed good separation of the confocal Raman spectra of dermis region into SR, SA, and NS data groups. Further analysis using loading plot and S-plot indicated the participation of various components of dermal collagen in the separation of these groups. Bound water content analysis showed that the extent of hydration of collagen is more in SD when compared to NS. Based on the results obtained, this study confirms the active involvement of dermal collagen in the formation of SD. It also emphasizes the need to study quantitatively the role of these various biochemical changes in the dermal collagen responsible for the variance between SR, SA, and NS.

  10. Classification of jet fuel properties by near-infrared spectroscopy using fuzzy rule-building expert systems and support vector machines.

    PubMed

    Xu, Zhanfeng; Bunker, Christopher E; Harrington, Peter de B

    2010-11-01

    Monitoring the changes of jet fuel physical properties is important because fuel used in high-performance aircraft must meet rigorous specifications. Near-infrared (NIR) spectroscopy is a fast method to characterize fuels. Because of the complexity of NIR spectral data, chemometric techniques are used to extract relevant information from spectral data to accurately classify physical properties of complex fuel samples. In this work, discrimination of fuel types and classification of flash point, freezing point, boiling point (10%, v/v), boiling point (50%, v/v), and boiling point (90%, v/v) of jet fuels (JP-5, JP-8, Jet A, and Jet A1) were investigated. Each physical property was divided into three classes, low, medium, and high ranges, using two evaluations with different class boundary definitions. The class boundaries function as the threshold to alarm when the fuel properties change. Optimal partial least squares discriminant analysis (oPLS-DA), fuzzy rule-building expert system (FuRES), and support vector machines (SVM) were used to build the calibration models between the NIR spectra and classes of physical property of jet fuels. OPLS-DA, FuRES, and SVM were compared with respect to prediction accuracy. The validation of the calibration model was conducted by applying bootstrap Latin partition (BLP), which gives a measure of precision. Prediction accuracy of 97 ± 2% of the flash point, 94 ± 2% of freezing point, 99 ± 1% of the boiling point (10%, v/v), 98 ± 2% of the boiling point (50%, v/v), and 96 ± 1% of the boiling point (90%, v/v) were obtained by FuRES in one boundaries definition. Both FuRES and SVM obtained statistically better prediction accuracy over those obtained by oPLS-DA. The results indicate that combined with chemometric classifiers NIR spectroscopy could be a fast method to monitor the changes of jet fuel physical properties.

  11. [Experimental study of metabonomics in the diagnosis of allergic rhinitis in mice].

    PubMed

    Wang, A; Li, Q F; Zhang, G Q; Zhao, C Q

    2016-02-01

    To investigate the application of metabonomics in the diagnosis of allergic rhinitis. Eighty male Kunming mice were randomly divided into two groups, a control group (30 mice) and an allergic rhinitis (AR) group (50 mice). After modeling, mice with a behavior score above 6 were removed and 30 mice with a behavior score equal to 6 were retained. Peripheral blood was collected from the mice and serum was prepared, followed by UPLC-MS chromatographic separation and detection. After chromatographic peak matching by MZmine, the data were pretreated with SPSS and Excel. First, interfering data were deleted in accordance with the 80% rule. Then, the data were analyzed by PLS-DA and PCA-X. Three-dimensional views of the serum data of the control group (30 mice) and the AR group (30 mice) were drawn using the PCA-X and PLS-DA methods. The two groups of samples could be completely separated in these views, which showed that there was a significant difference between the two groups of data. There were some differences in the blood metabolites between the control group and the AR group. The study showed that it was scientific and feasible to diagnose AR using metabonomics.
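
    The abstract above does not define its "80% rule". The sketch below assumes the convention commonly used in metabolomics, in which a peak is kept only if it is detected in at least 80% of the samples of at least one group. The function names, the toy peak table and the treatment of zero or missing intensities as "not detected" are all illustrative assumptions, not details taken from the study.

```python
def passes_80_percent_rule(values_by_group, threshold=0.8):
    """Return True if the peak is detected (non-missing and > 0) in at least
    `threshold` of the samples of at least one group (assumed convention)."""
    for values in values_by_group.values():
        if not values:
            continue
        detected = sum(1 for v in values if v is not None and v > 0)
        if detected / len(values) >= threshold:
            return True
    return False


def filter_features(table, groups, threshold=0.8):
    """Keep peaks that pass the rule.

    table:  {peak_name: [intensity per sample]}
    groups: [group label per sample], same order as the intensities
    """
    kept = {}
    for name, intensities in table.items():
        by_group = {}
        for label, value in zip(groups, intensities):
            by_group.setdefault(label, []).append(value)
        if passes_80_percent_rule(by_group, threshold):
            kept[name] = intensities
    return kept


# Hypothetical toy data: 5 control and 5 AR samples, two peaks.
groups = ["control"] * 5 + ["AR"] * 5
table = {
    # detected in 4 of 5 control samples, so it is kept
    "peak_A": [1.2, 0.9, 1.1, 1.0, 0.0, 0.0, 0.0, 0.3, 0.0, 0.0],
    # detected in 1 of 5 samples in each group, so it is dropped
    "peak_B": [0.0, 0.5, 0.0, 0.0, 0.0, 0.0, 0.2, 0.0, 0.0, 0.0],
}
kept = filter_features(table, groups)
print(sorted(kept))
```

    Filtering per group rather than over all samples keeps peaks that are characteristic of a single class, which is usually the point of a subsequent PLS-DA.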

  12. Assessment of the discrimination of animal fat by FT-Raman spectroscopy

    NASA Astrophysics Data System (ADS)

    Abbas, O.; Fernández Pierna, J. A.; Codony, R.; von Holst, C.; Baeten, V.

    2009-04-01

    In recent years, there has been increased attention towards the composition of feeding fats. In the aftermath of the BSE crisis, all animal by-products utilised in animal nutrition have been subjected to close scrutiny. Regulation requires that the material belongs to the category of animal by-products fit for human consumption. This implies the use of reliable techniques in order to ensure the safety of products. The feasibility of using rapid and non-destructive methods to control the composition of feedstuffs based on animal fats has been studied. Fourier Transform Raman spectroscopy has been chosen for its advantage of giving detailed structural information. Data were treated using chemometric methods such as PCA and PLS-DA, which separated the different classes of animal fats well. The same methodology was applied to fats from various types of feedstock and production technology processes. The PLS-DA model for the discrimination of animal fats from the other categories presents a sensitivity and a specificity of 0.958 and 0.914, respectively. These results encourage the use of FT-Raman spectroscopy to discriminate animal fats.
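
    Sensitivity and specificity, as reported for the PLS-DA model above, are computed from the counts of correct and incorrect class assignments. The following is a minimal sketch using the standard definitions (sensitivity = TP / (TP + FN), specificity = TN / (TN + FP)); the labels and predictions are made-up illustrative values, not data from the study.

```python
def sensitivity_specificity(y_true, y_pred, positive):
    """Return (sensitivity, specificity) for one target class.

    Samples whose true label equals `positive` are positives;
    every other label is treated as negative.
    """
    tp = fn = tn = fp = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            if pred == positive:
                tp += 1
            else:
                fn += 1
        else:
            if pred == positive:
                fp += 1
            else:
                tn += 1
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative sample")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical example: 4 animal-fat samples and 4 samples of other categories.
y_true = ["animal"] * 4 + ["other"] * 4
y_pred = ["animal", "animal", "animal", "other",   # one animal fat missed
          "other", "other", "other", "other"]      # no false alarms
sens, spec = sensitivity_specificity(y_true, y_pred, positive="animal")
print(sens, spec)
```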

  13. [Effect of near infrared spectrum on the precision of PLS model for oil yield from oil shale].

    PubMed

    Wang, Zhi-Hong; Liu, Jie; Chen, Xiao-Chao; Sun, Yu-Yang; Yu, Yang; Lin, Jun

    2012-10-01

    Present measurement methods for the oil yield of oil shale cannot realize in-situ detection, and these methods are unable to meet the requirements of oil shale resource exploration and exploitation. In-situ oil yield analysis of oil shale can, however, be achieved by the portable near infrared spectroscopy technique. Different NIR spectral data formats correlate differently with the contents of sample components, and the absorption characteristics of the sample components appear in different NIR spectral regions. Therefore, using proportioned samples, PLS modeling experiments were carried out with 3 data formats (reflectance, absorbance and K-M function) and 4 modeling spectral regions, and the effect of NIR spectral format and region on the precision of the PLS model for oil yield from oil shale was studied. The results show that, for the PLS method and proportioned samples, the best data format is reflectance and the best modeling region is the combination spectral range. Therefore, an appropriate data format and a proper characteristic spectral region can increase the precision of the PLS model for oil yield from oil shale.
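
    The three spectral data formats compared above are related by simple transforms of the measured reflectance. The sketch below uses the standard textbook definitions, apparent absorbance A = log10(1/R) and the Kubelka-Munk function F(R) = (1 - R)^2 / (2R); it is an assumption that the study used exactly these forms, and the reflectance values are illustrative only.

```python
import math


def absorbance(reflectance):
    """Apparent absorbance, A = log10(1 / R), for 0 < R <= 1."""
    if not 0 < reflectance <= 1:
        raise ValueError("reflectance must be in (0, 1]")
    return -math.log10(reflectance)


def kubelka_munk(reflectance):
    """Kubelka-Munk function, F(R) = (1 - R)^2 / (2 R), for 0 < R <= 1."""
    if not 0 < reflectance <= 1:
        raise ValueError("reflectance must be in (0, 1]")
    return (1 - reflectance) ** 2 / (2 * reflectance)


# Hypothetical reflectance spectrum (fraction of light reflected per wavelength).
spectrum_r = [0.8, 0.5, 0.25]
spectrum_a = [absorbance(r) for r in spectrum_r]
spectrum_km = [kubelka_munk(r) for r in spectrum_r]
for r, a, km in zip(spectrum_r, spectrum_a, spectrum_km):
    print(f"R={r:.2f}  A={a:.4f}  K-M={km:.4f}")
```

    Both transforms are nonlinear in R, so the same spectrum can relate more or less linearly to component content depending on the format, which is one plausible reason a PLS model's precision depends on the format chosen.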

  14. Relationship between hydroxycinnamic acids and the resistance of apple cultivars to rosy apple aphid.

    PubMed

    Berrueta, Luis A; Sasía-Arriba, Andrea; Miñarro, Marcos; Antón, María J; Alonso-Salces, Rosa M; Micheletti, Diego; Gallo, Blanca; Dapena, Enrique

    2018-09-01

    The phenolic profiles of apple cultivars from the SERIDA Asturian cider apple breeding program, including parents and progenies, were determined by ultrahigh-performance liquid chromatography-diode array detector-electrospray ionization-quadrupole time of flight/mass spectrometer in order to study the relationship between phenols and the resistance of apple tree cultivars to rosy apple aphid (RAA). A pattern recognition technique named partial least square discriminant analysis (PLS-DA) was used to classify apple cultivars based on resistance to RAA, resistant and susceptible, reaching scores with accuracy higher than 97% and 91% respectively. Hydroxycinnamic acids, particularly 4-caffeoylquinic acid (4-CQA) and 4-p-coumaroylquinic acid (4-pCoQA), were identified as the major player in RAA resistance by the PLS-DA model. Indeed, the isomerisation 5-CQA → 4-CQA is favoured in resistant cultivars, whereas the isomerisation 5-pCoQA → 4-pCoQA is favoured in susceptible cultivars. As a result, resistant cultivars accumulate higher amounts of 4-CQA than susceptible ones, and the opposite occurs for 4-pCoQA. Also, minor isomerisations of 5-CQA to 1-CQA or 3-CQA show opposite behaviour for resistant and susceptible cultivars. Cultivar resistance to RAA is concluded to be related with the phenylpropanoid pathway, the isomerisation reactions being the key metabolic reaction for a cultivar to be resistant or susceptible to RAA. Copyright © 2018 Elsevier B.V. All rights reserved.

  15. The assessment of the performance of covariance-based structural equation modeling and partial least square path modeling

    NASA Astrophysics Data System (ADS)

    Aimran, Ahmad Nazim; Ahmad, Sabri; Afthanorhan, Asyraf; Awang, Zainudin

    2017-05-01

    Structural equation modeling (SEM) is the second generation statistical analysis technique developed for analyzing the inter-relationships among multiple variables in a model. Previous studies have shown that there seemed to be at least an implicit agreement about the factors that should drive the choice between covariance-based structural equation modeling (CB-SEM) and partial least square path modeling (PLS-PM). PLS-PM appears to be the preferred method by previous scholars because of its less stringent assumption and the need to avoid the perceived difficulties in CB-SEM. Along with this issue has been the increasing debate among researchers on the use of CB-SEM and PLS-PM in studies. The present study intends to assess the performance of CB-SEM and PLS-PM as a confirmatory study in which the findings will contribute to the body of knowledge of SEM. Maximum likelihood (ML) was chosen as the estimator for CB-SEM and was expected to be more powerful than PLS-PM. Based on the balanced experimental design, the multivariate normal data with specified population parameter and sample sizes were generated using Pro-Active Monte Carlo simulation, and the data were analyzed using AMOS for CB-SEM and SmartPLS for PLS-PM. Comparative Bias Index (CBI), construct relationship, average variance extracted (AVE), composite reliability (CR), and Fornell-Larcker criterion were used to study the consequence of each estimator. The findings conclude that CB-SEM performed notably better than PLS-PM in estimation for large sample size (100 and above), particularly in terms of estimations accuracy and consistency.

  16. Metabolomic profiles illuminate the efficacy of Chinese herbal Da-Cheng-Qi decoction on acute pancreatitis in rats.

    PubMed

    Li, Juan; Zhu, Shi-Feng; Zhao, Xian-Lin; Liu, Yi-Xia; Wan, Mei-Hua; Guo, Hui; Liu, Yi-Ling; Gong, Han-Lin; Chen, Guang-Yuan; Tang, Wen-Fu

    2015-01-01

    Chinese herbal drug Da-Cheng-Qi decoction (DCQD) has been widely used for decades to treat acute pancreatitis (AP). Previous trials are mostly designed to state the potential mechanisms of the therapeutic effects rather than to detect its whole effect on metabolism. This study aimed to investigate the efficacy of DCQD on metabolism in AP. Twenty-two male adult Sprague-Dawley rats were randomized into three groups. AP was induced by retrograde ductal infusion of 3.5% sodium taurocholate solution in the DCQD and AP groups, while 0.9% saline solution was used in the sham operation (SO) group. Blood samples were obtained 12 h after drug administration and a 600 MHz superconducting Nuclear Magnetic Resonance (NMR) spectrometer was used to detect plasma metabolites. Principal Components Analysis (PCA) and Partial Least Squares-Discriminant Analysis after Orthogonal Signal Correction (OSC-PLS-DA) were applied to analyze the Longitudinal Eddy-delay (LED) and Carr-Purcell-Meiboom-Gill (CPMG) spectra. Differences in concentrations of metabolites among the three groups were detected by OSC-PLS-DA of 1H NMR spectra (both LED and CPMG). Compared with the SO group, the DCQD group had higher levels of plasma glycerol, glutamic acid, low density lipoprotein (LDL), saturated fatty acid (FA) and lower levels of alanine and glutamine, while the metabolic changes were reversed in the AP group. Our results demonstrated that DCQD was capable of altering the changed concentrations of metabolites in rats with AP and that the 1H NMR-based metabolomic approach provided a new methodological cue for systematically investigating the efficacies and mechanisms of DCQD in treating AP. Copyright © 2015 IAP and EPC. Published by Elsevier B.V. All rights reserved.

  17. A Metabolomic Approach to Target Compounds from the Asteraceae Family for Dual COX and LOX Inhibition

    PubMed Central

    Chagas-Paula, Daniela A.; Zhang, Tong; Da Costa, Fernando B.; Edrada-Ebel, RuAngelie

    2015-01-01

    The application of metabolomics in phytochemical analysis is an innovative strategy for targeting active compounds from a complex plant extract. Species of the Asteraceae family are well-known to exhibit potent anti-inflammatory (AI) activity. Dual inhibition of the enzymes COX-1 and 5-LOX is essential for the treatment of several inflammatory diseases, but there is not much investigation reported in the literature for natural products. In this study, 57 leaf extracts (EtOH-H2O 7:3, v/v) from different genera and species of the Asteraceae family were tested against COX-1 and 5-LOX while HPLC-ESI-HRMS analysis of the extracts indicated high diversity in their chemical compositions. Using O2PLS-DA (R2 > 0.92; VIP > 1 and positive Y-correlation values), dual inhibition potential of low-abundance metabolites was determined. The O2PLS-DA results exhibited good validation values (cross-validation = Q2 > 0.7 and external validation = P2 > 0.6) with 0% of false positive predictions. The metabolomic approach determined biomarkers for the required biological activity and detected active compounds in the extracts displaying unique mechanisms of action. In addition, the PCA data also gave insights on the chemotaxonomy of the family Asteraceae across its diverse range of genera and tribes. PMID:26184333

  18. Rapid analysis of composition and reactivity in cellulosic biomass feedstocks with near-infrared spectroscopy.

    PubMed

    Payne, Courtney E; Wolfrum, Edward J

    2015-01-01

    Obtaining accurate chemical composition and reactivity (measures of carbohydrate release and yield) information for biomass feedstocks in a timely manner is necessary for the commercialization of biofuels. Our objective was to use near-infrared (NIR) spectroscopy and partial least squares (PLS) multivariate analysis to develop calibration models to predict the feedstock composition and the release and yield of soluble carbohydrates generated by a bench-scale dilute acid pretreatment and enzymatic hydrolysis assay. Major feedstocks included in the calibration models are corn stover, sorghum, switchgrass, perennial cool season grasses, rice straw, and miscanthus. We present individual model statistics to demonstrate model performance and validation samples to more accurately measure predictive quality of the models. The PLS-2 model for composition predicts glucan, xylan, lignin, and ash (wt%) with uncertainties similar to primary measurement methods. A PLS-2 model was developed to predict glucose and xylose release following pretreatment and enzymatic hydrolysis. An additional PLS-2 model was developed to predict glucan and xylan yield. PLS-1 models were developed to predict the sum of glucose/glucan and xylose/xylan for release and yield (grams per gram). The release and yield models have higher uncertainties than the primary methods used to develop the models. It is possible to build effective multispecies feedstock models for composition, as well as carbohydrate release and yield. The model for composition is useful for predicting glucan, xylan, lignin, and ash with good uncertainties. The release and yield models have higher uncertainties; however, these models are useful for rapidly screening sample populations to identify unusual samples.

  19. Non-destructive fraud detection in rosehip oil by MIR spectroscopy and chemometrics.

    PubMed

    Santana, Felipe Bachion de; Gontijo, Lucas Caixeta; Mitsutake, Hery; Mazivila, Sarmento Júnior; Souza, Leticia Maria de; Borges Neto, Waldomiro

    2016-10-15

    Rosehip oil (Rosa eglanteria L.) is an important oil in the food, pharmaceutical and cosmetic industries. However, due to its high added value, it is liable to adulteration with other cheaper or lower quality oils. With this perspective, this work provides a new simple, fast and accurate methodology using mid-infrared (MIR) spectroscopy and partial least squares discriminant analysis (PLS-DA) as a means to discriminate authentic rosehip oil from adulterated rosehip oil containing soybean, corn and sunflower oils in different proportions. The model showed excellent sensitivity and specificity with 100% correct classification. Therefore, the developed methodology is a viable alternative for use in the laboratory and industry for standard quality analysis of rosehip oil since it is fast, accurate and non-destructive. Copyright © 2016 Elsevier Ltd. All rights reserved.

  20. Authentication of whisky due to its botanical origin and way of production by instrumental analysis and multivariate classification methods

    NASA Astrophysics Data System (ADS)

    Wiśniewska, Paulina; Boqué, Ricard; Borràs, Eva; Busto, Olga; Wardencki, Waldemar; Namieśnik, Jacek; Dymerski, Tomasz

    2017-02-01

    Headspace mass-spectrometry (HS-MS), mid infrared (MIR) and UV-vis spectroscopy were used to authenticate whisky samples from different origins and ways of production (Irish, Spanish, Bourbon, Tennessee Whisky and Scotch). The collected spectra were processed with partial least-squares discriminant analysis (PLS-DA) to build the classification models. In all cases the five groups of whiskies were distinguished, but the best results were obtained by HS-MS, which indicates that the biggest differences between different types of whisky are due to their aroma. Differences were also found inside groups, showing that not only raw material is important to discriminate samples but also the way of their production. The methodology is quick, easy and does not require sample preparation.

  1. Advanced spectrophotometric chemometric methods for resolving the binary mixture of doxylamine succinate and pyridoxine hydrochloride.

    PubMed

    Katsarov, Plamen; Gergov, Georgi; Alin, Aylin; Pilicheva, Bissera; Al-Degs, Yahya; Simeonov, Vasil; Kassarova, Margarita

    2018-03-01

    The prediction power of partial least squares (PLS) and multivariate curve resolution-alternating least squares (MCR-ALS) methods has been studied for simultaneous quantitative analysis of the binary drug combination of doxylamine succinate and pyridoxine hydrochloride. Analysis of first-order UV overlapped spectra was performed using different PLS models: classical PLS1 and PLS2 as well as partial robust M-regression (PRM). These linear models were compared to MCR-ALS with equality and correlation constraints (MCR-ALS-CC). All techniques operated within the full spectral region and extracted maximum information for the drugs analysed. The developed chemometric methods were validated on external sample sets and were applied to the analyses of pharmaceutical formulations. The obtained statistical parameters were satisfactory for calibration and validation sets. All developed methods can be successfully applied for simultaneous spectrophotometric determination of doxylamine and pyridoxine both in laboratory-prepared mixtures and commercial dosage forms.

  2. Heritable IUGR and adult metabolic syndrome are reversible and associated with alterations in the metabolome following dietary supplementation of 1-carbon intermediates.

    PubMed

    Seferovic, Maxim D; Goodspeed, Danielle M; Chu, Derrick M; Krannich, Laura A; Gonzalez-Rodriguez, Pablo J; Cox, James E; Aagaard, Kjersti M

    2015-06-01

    Metabolic syndrome (MetS), following intrauterine growth restriction (IUGR), is epigenetically heritable. Recently, we abrogated the F2 adult phenotype with essential nutrient supplementation (ENS) of intermediates along the 1-carbon pathway. With the use of the same grandparental uterine artery ligation model, we profiled the F2 serum metabolome at weaning [postnatal day (d)21; n = 76] and adulthood (d160; n = 12) to test if MetS is preceded by alterations in the metabolome. Indicative of developmentally programmed MetS, adult F2, formerly IUGR rats, were obese (621 vs. 461 g; P < 0.0001), dyslipidemic (133 vs. 67 mg/dl; P < 0.001), and glucose intolerant (26 vs. 15 mg/kg/min; P < 0.01). Unbiased gas chromatography-mass spectrometry (GC-MS) profiling revealed 34 peaks corresponding to 12 nonredundant metabolites and 9 unknowns to be changing at weaning [false discovery rate (FDR) < 0.05]. Markers of later-in-life MetS included citric acid, glucosamine, myoinositol, and proline (P < 0.03). Hierarchical clustering revealed grouping by IUGR lineage and supplementation at d21 and d160. Weanlings grouped distinctly for ENS and IUGR by partial least-squares discriminant analysis (PLS-DA; P < 0.01), whereas paternal and maternal IUGR (IUGR(pat)/IUGR(mat), respectively) control-fed rats, destined for MetS, had a distinct metabolome at weaning (randomForest analysis; class error < 0.1) and adulthood (PLS-DA; P < 0.05). In sum, we have found that alterations in the metabolome accompany heritable IUGR, precede adult-onset MetS, and are partially amenable to dietary intervention. © FASEB.

  3. [Study of cuttings identification using laser-induced breakdown spectroscopy].

    PubMed

    Tian, Ye; Wang, Zhen-nan; Hou, Hua-ming; Zhai, Xiao-wei; Ci, Xing-hua; Zheng, Rong-er

    2012-08-01

    Cutting identification is one of the most important links in cutting logging, which is very significant in the process of oil drilling. In the present paper, LIBS was used for the identification of four kinds of cutting samples from the logging field, and multivariate analysis was then used in data processing. The whole spectra model and the feature model were built for cuttings identification using the PLS-DA method. The accuracy of the whole spectra model was 88.3%, slightly higher than that of the feature model (86.7%). In terms of data size, however, feature extraction reduced the number of variables from 24,041 to 27, which noticeably increased the efficiency of data processing. The obtained results demonstrate that LIBS combined with chemometric methods could be developed as a rapid and valid approach to cutting identification and has great potential to be used in the logging field.

  4. Determination of benzo[a]pyrene in cigarette mainstream smoke by using mid-infrared spectroscopy associated with a novel chemometric algorithm.

    PubMed

    Zhang, Yan; Zou, Hong-Yan; Shi, Pei; Yang, Qin; Tang, Li-Juan; Jiang, Jian-Hui; Wu, Hai-Long; Yu, Ru-Qin

    2016-01-01

    Determination of benzo[a]pyrene (BaP) in cigarette smoke can be very important for tobacco quality control and the assessment of its harm to human health. In this study, mid-infrared spectroscopy (MIR) coupled to a chemometric algorithm (DPSO-WPT-PLS), which was based on the wavelet packet transform (WPT), the discrete particle swarm optimization algorithm (DPSO) and partial least squares regression (PLS), was used to quantify the harmful ingredient benzo[a]pyrene in cigarette mainstream smoke with promising results. Furthermore, the proposed method provided better performance compared to several other chemometric models, i.e., PLS, radial basis function-based PLS (RBF-PLS), PLS with stepwise regression variable selection (Stepwise-PLS) as well as WPT-PLS with informative wavelet coefficients selected by correlation coefficient test (rtest-WPT-PLS). It can be expected that the proposed strategy could become a new effective, rapid quantitative analysis technique for analyzing the harmful ingredient BaP in cigarette mainstream smoke. Copyright © 2015 Elsevier B.V. All rights reserved.

  5. Volatiles Mediated Interactions Between Aspergillus oryzae Strains Modulate Morphological Transition and Exometabolomes

    PubMed Central

    Singh, Digar; Lee, Choong H.

    2018-01-01

    Notwithstanding its mitosporic nature, an improbable morpho-transformation state, i.e., sclerotial development (SD), is vaguely known in Aspergillus oryzae. Nevertheless an intriguing phenomenon governing mold's development and stress response, the effects of exogenous factors engendering SD, especially the volatile organic compounds (VOCs) mediated interactions (VMI) pervasive in microbial niches have largely remained unexplored. Herein, we examined the effects of intra-species VMI on SD in A. oryzae RIB 40, followed by comprehensive analyses of associated growth rates, pH alterations, biochemical phenotypes, and exometabolomes. We cultivated A. oryzae RIB 40 (S1VMI: KACC 44967) opposite a non-SD partner strain, A. oryzae (S2: KCCM 60345), conditioning VMI in a specially designed “twin plate assembly.” Notably, SD in S1VMI was delayed relative to its non-conditioned control (S1) cultivated without partner strain (S2) in twin plate. Selectively evaluating A. oryzae RIB 40 (S1VMI vs. S1) for altered phenotypes concomitant to SD, we observed a marked disparity for corresponding growth rates (S1VMI < S1)7days, media pH (S1VMI > S1)7days, and biochemical characteristics viz., protease (S1VMI > S1)7days, amylase (S1VMI > nS1)3-7days, and antioxidants (S1VMI > S1)7days levels. The partial least squares-discriminant analysis (PLS-DA) of gas chromatography-time of flight-mass spectrometry (GC-TOF-MS) datasets for primary metabolites exhibited a clustered pattern (PLS1, 22.04%; PLS2, 11.36%), with 7 days incubated S1VMI extracts showing a higher abundance of amino acids, sugars, and sugar alcohols with lower organic acids and fatty acids levels, relative to S1. Intriguingly, the higher amino acid and sugar alcohol levels were positively correlated with antioxidant activity, likely impeding SD in S1VMI. Further, the PLS-DA (PLS1, 18.11%; PLS2, 15.02%) based on liquid chromatography-mass spectrometry (LC-MS) datasets exhibited a notable disparity for post-SD (9-11 days) sample extracts with higher oxylipins and 13-desoxypaxilline levels in S1VMI relative to S1, intertwining Aspergillus morphogenesis and secondary metabolism. The analysis of VOCs for the 7 days incubated samples displayed considerably higher accumulation of C-8 compounds in the headspace of twin-plate experimental sets (S1VMI:S2) compared to those in non-conditioned controls (S1 and S2, without respective partner strains), potentially triggering altered morpho-transformation and concurring biochemical as well as metabolic states in molds. PMID:29670599

  6. Quantification of adulterations in extra virgin flaxseed oil using MIR and PLS.

    PubMed

    de Souza, Letícia Maria; de Santana, Felipe Bachion; Gontijo, Lucas Caixeta; Mazivila, Sarmento Júnior; Borges Neto, Waldomiro

    2015-09-01

    This paper proposes a new method for the quantitative analysis of soybean oil (SO) and sunflower oil (SFO) as adulterants in extra virgin flaxseed oil (EFO) by applying Mid Infrared Spectroscopy (MIR) associated with chemometric technique of Partial Least Squares (PLS). The PLS models were built in accordance with standard method ASTM E1655-05 and these showed good correlation between the reference values and those calculated using the PLS models with low error values, with R = 0.998 for SFO and R = 0.999 for SO in EFO. These models were validated analytically in accordance with Brazilian and international guidelines through the estimate of figures of merit parameters, thus showing an effective and feasible method to control the quality of extra virgin flaxseed oil. Copyright © 2015 Elsevier Ltd. All rights reserved.

  7. Proton Nuclear Magnetic Resonance-Spectroscopic Discrimination of Wines Reflects Genetic Homology of Several Different Grape (V. vinifera L.) Cultivars

    PubMed Central

    Zhu, Yong; Wen, Wen; Zhang, Fengmin; Hardie, Jim W.

    2015-01-01

    Background and Aims Proton nuclear magnetic resonance spectroscopy coupled multivariate analysis (1H NMR-PCA/PLS-DA) is an important tool for the discrimination of wine products. Although 1H NMR has been shown to discriminate wines of different cultivars, a grape genetic component of the discrimination has been inferred only from discrimination of cultivars of undefined genetic homology and in the presence of many confounding environmental factors. We aimed to confirm the influence of grape genotypes in the absence of those factors. Methods and Results We applied 1H NMR-PCA/PLS-DA and hierarchical cluster analysis (HCA) to wines from five, variously genetically-related grapevine (V. vinifera) cultivars; all grown similarly on the same site and vinified similarly. We also compared the semi-quantitative profiles of the discriminant metabolites of each cultivar with previously reported chemical analyses. The cultivars were clearly distinguishable and there was a general correlation between their grouping and their genetic homology as revealed by recent genomic studies. Between cultivars, the relative amounts of several of the cultivar-related discriminant metabolites conformed closely with reported chemical analyses. Conclusions Differences in grape-derived metabolites associated with genetic differences alone are a major source of 1H NMR-based discrimination of wines and 1H NMR has the capacity to discriminate between very closely related cultivars. Significance of the Study The study confirms that genetic variation among grape cultivars alone can account for the discrimination of wine by 1H NMR-PCA/PLS and indicates that 1H NMR spectra of wine of single grape cultivars may in future be used in tandem with hierarchical cluster analysis to elucidate genetic lineages and metabolomic relations of grapevine cultivars. In the absence of genetic information, for example, where predecessor varieties are no longer extant, this may be a particularly useful approach. PMID:26658757

  8. Rapid analysis of composition and reactivity in cellulosic biomass feedstocks with near-infrared spectroscopy

    DOE PAGES

    Payne, Courtney E.; Wolfrum, Edward J.

    2015-03-12

    Obtaining accurate chemical composition and reactivity (measures of carbohydrate release and yield) information for biomass feedstocks in a timely manner is necessary for the commercialization of biofuels. Our objective was to use near-infrared (NIR) spectroscopy and partial least squares (PLS) multivariate analysis to develop calibration models to predict the feedstock composition and the release and yield of soluble carbohydrates generated by a bench-scale dilute acid pretreatment and enzymatic hydrolysis assay. Major feedstocks included in the calibration models are corn stover, sorghum, switchgrass, perennial cool season grasses, rice straw, and miscanthus. Here are the results: We present individual model statistics to demonstrate model performance and validation samples to more accurately measure predictive quality of the models. The PLS-2 model for composition predicts glucan, xylan, lignin, and ash (wt%) with uncertainties similar to primary measurement methods. A PLS-2 model was developed to predict glucose and xylose release following pretreatment and enzymatic hydrolysis. An additional PLS-2 model was developed to predict glucan and xylan yield. PLS-1 models were developed to predict the sum of glucose/glucan and xylose/xylan for release and yield (grams per gram). The release and yield models have higher uncertainties than the primary methods used to develop the models. In conclusion, it is possible to build effective multispecies feedstock models for composition, as well as carbohydrate release and yield. The model for composition is useful for predicting glucan, xylan, lignin, and ash with good uncertainties. The release and yield models have higher uncertainties; however, these models are useful for rapidly screening sample populations to identify unusual samples.

  9. Rapid analysis of composition and reactivity in cellulosic biomass feedstocks with near-infrared spectroscopy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Payne, Courtney E.; Wolfrum, Edward J.

    Obtaining accurate chemical composition and reactivity (measures of carbohydrate release and yield) information for biomass feedstocks in a timely manner is necessary for the commercialization of biofuels. Our objective was to use near-infrared (NIR) spectroscopy and partial least squares (PLS) multivariate analysis to develop calibration models to predict the feedstock composition and the release and yield of soluble carbohydrates generated by a bench-scale dilute acid pretreatment and enzymatic hydrolysis assay. Major feedstocks included in the calibration models are corn stover, sorghum, switchgrass, perennial cool season grasses, rice straw, and miscanthus. Here are the results: We present individual model statistics to demonstrate model performance and validation samples to more accurately measure predictive quality of the models. The PLS-2 model for composition predicts glucan, xylan, lignin, and ash (wt%) with uncertainties similar to primary measurement methods. A PLS-2 model was developed to predict glucose and xylose release following pretreatment and enzymatic hydrolysis. An additional PLS-2 model was developed to predict glucan and xylan yield. PLS-1 models were developed to predict the sum of glucose/glucan and xylose/xylan for release and yield (grams per gram). The release and yield models have higher uncertainties than the primary methods used to develop the models. In conclusion, it is possible to build effective multispecies feedstock models for composition, as well as carbohydrate release and yield. The model for composition is useful for predicting glucan, xylan, lignin, and ash with good uncertainties. The release and yield models have higher uncertainties; however, these models are useful for rapidly screening sample populations to identify unusual samples.

  10. Assessing statistical differences between parameters estimates in Partial Least Squares path modeling.

    PubMed

    Rodríguez-Entrena, Macario; Schuberth, Florian; Gelhard, Carsten

    2018-01-01

    Structural equation modeling using partial least squares (PLS-SEM) has become a main-stream modeling approach in various disciplines. Nevertheless, prior literature still lacks a practical guidance on how to properly test for differences between parameter estimates. Whereas existing techniques such as parametric and non-parametric approaches in PLS multi-group analysis solely allow to assess differences between parameters that are estimated for different subpopulations, the study at hand introduces a technique that allows to also assess whether two parameter estimates that are derived from the same sample are statistically different. To illustrate this advancement to PLS-SEM, we particularly refer to a reduced version of the well-established technology acceptance model.

  11. Geographical origin discrimination of lentils (Lens culinaris Medik.) using 1H NMR fingerprinting and multivariate statistical analyses.

    PubMed

    Longobardi, Francesco; Innamorato, Valentina; Di Gioia, Annalisa; Ventrella, Andrea; Lippolis, Vincenzo; Logrieco, Antonio F; Catucci, Lucia; Agostiano, Angela

    2017-12-15

    Lentil samples coming from two different countries, i.e. Italy and Canada, were analysed using untargeted 1H NMR fingerprinting in combination with chemometrics in order to build models able to classify them according to their geographical origin. For such aim, Soft Independent Modelling of Class Analogy (SIMCA), k-Nearest Neighbor (k-NN), Principal Component Analysis followed by Linear Discriminant Analysis (PCA-LDA) and Partial Least Squares-Discriminant Analysis (PLS-DA) were applied to the NMR data and the results were compared. The best combination of average recognition (100%) and cross-validation prediction abilities (96.7%) was obtained for the PCA-LDA. All the statistical models were validated both by using a test set and by carrying out a Monte Carlo Cross Validation: the obtained performances were found to be satisfying for all the models, with prediction abilities higher than 95% demonstrating the suitability of the developed methods. Finally, the metabolites that mostly contributed to the lentil discrimination were indicated. Copyright © 2017 Elsevier Ltd. All rights reserved.

  12. [Influence of Spectral Pre-Processing on PLS Quantitative Model of Detecting Cu in Navel Orange by LIBS].

    PubMed

    Li, Wen-bing; Yao, Lin-tao; Liu, Mu-hua; Huang, Lin; Yao, Ming-yin; Chen, Tian-bing; He, Xiu-wen; Yang, Ping; Hu, Hui-qin; Nie, Jiang-hui

    2015-05-01

    Cu in navel orange was detected rapidly by laser-induced breakdown spectroscopy (LIBS) combined with partial least squares (PLS) for quantitative analysis, and the effect of different spectral data pretreatment methods on the detection accuracy of the model was explored. Spectral data for the 52 Gannan navel orange samples were pretreated by different data smoothing, mean centering and standard normal variate transform methods. The 319~338 nm wavelength section containing characteristic spectral lines of Cu was then selected to build PLS models, and the main evaluation indexes of the models, namely the regression coefficient (r), the root mean square error of cross validation (RMSECV) and the root mean square error of prediction (RMSEP), were compared and analyzed. For the PLS model built after 13-point smoothing and mean centering, the three indicators reached 0.9928, 3.43 and 3.4 respectively, and the average relative error of the prediction model was only 5.55%; in short, this model gave the best calibration and prediction quality. The results show that, by selecting the appropriate data pre-processing method, the prediction accuracy of the PLS quantitative model of fruits and vegetables detected by LIBS can be improved effectively, providing a new method for fast and accurate detection of fruits and vegetables by LIBS.
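
    The pre-processing steps and error metric named in this abstract can be sketched as follows. This is only an illustration under assumptions: the abstract does not say which smoothing filter was used, so a plain moving average stands in for the "13 points smoothing", and all function names are hypothetical.

```python
import math

def moving_average(spectrum, window=13):
    """Smooth one spectrum with a centred moving average (assumed filter type).
    The window shrinks at the edges so the output keeps the input length."""
    half = window // 2
    smoothed = []
    for i in range(len(spectrum)):
        lo = max(0, i - half)
        hi = min(len(spectrum), i + half + 1)
        smoothed.append(sum(spectrum[lo:hi]) / (hi - lo))
    return smoothed

def snv(spectrum):
    """Standard normal variate: centre and scale one spectrum by its own mean and std."""
    n = len(spectrum)
    mean = sum(spectrum) / n
    std = math.sqrt(sum((x - mean) ** 2 for x in spectrum) / (n - 1))
    return [(x - mean) / std for x in spectrum]

def mean_center(spectra):
    """Subtract the per-wavelength (column) mean computed over all spectra."""
    n = len(spectra)
    col_means = [sum(col) / n for col in zip(*spectra)]
    return [[x - m for x, m in zip(row, col_means)] for row in spectra]

def rmse(reference, predicted):
    """Root mean square error; it is RMSECV or RMSEP depending on whether the
    predictions come from cross-validation or from a separate prediction set."""
    squared = [(r - p) ** 2 for r, p in zip(reference, predicted)]
    return math.sqrt(sum(squared) / len(squared))
```

    Note that SNV and smoothing act on each spectrum separately, whereas mean centering needs the whole calibration set, so the column means would have to be stored and reused for new samples.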

  13. One input-class and two input-class classifications for differentiating olive oil from other edible vegetable oils by use of the normal-phase liquid chromatography fingerprint of the methyl-transesterified fraction.

    PubMed

    Jiménez-Carvelo, Ana M; Pérez-Castaño, Estefanía; González-Casado, Antonio; Cuadros-Rodríguez, Luis

    2017-04-15

    A new method for differentiation of olive oil (independently of the quality category) from other vegetable oils (canola, safflower, corn, peanut, seeds, grapeseed, palm, linseed, sesame and soybean) has been developed. The analytical procedure for chromatographic fingerprinting of the methyl-transesterified fraction of each vegetable oil, using normal-phase liquid chromatography, is described and the chemometric strategies applied are discussed. Some chemometric methods, such as k-nearest neighbours (kNN), partial least squares-discriminant analysis (PLS-DA), support vector machine classification analysis (SVM-C), and soft independent modelling of class analogies (SIMCA), were applied to build classification models. Performance of the classification was evaluated and ranked using several classification quality metrics. The discriminant analysis, based on the use of one input-class (plus a dummy class), was applied for the first time in this study. Copyright © 2016 Elsevier Ltd. All rights reserved.

  14. Metabolomics Tools for Describing Complex Pesticide Exposure in Pregnant Women in Brittany (France)

    PubMed Central

    Bonvallot, Nathalie; Tremblay-Franco, Marie; Chevrier, Cécile; Canlet, Cécile; Warembourg, Charline; Cravedi, Jean-Pierre; Cordier, Sylvaine

    2013-01-01

    Background The use of pesticides and the related environmental contaminations can lead to human exposure to various molecules. In early-life, such exposures could be responsible for adverse developmental effects. However, human health risks associated with exposure to complex mixtures are currently under-explored. Objective This project aims at answering the following questions: What is the influence of exposures to multiple pesticides on the metabolome? What mechanistic pathways could be involved in the metabolic changes observed? Methods Based on the PELAGIE cohort (Brittany, France), 83 pregnant women who provided a urine sample in early pregnancy, were classified in 3 groups according to the surface of land dedicated to agricultural cereal activities in their town of residence. Nuclear magnetic resonance-based metabolomics analyses were performed on urine samples. Partial Least Squares Regression-Discriminant Analysis (PLS-DA) and polytomous regressions were used to separate the urinary metabolic profiles from the 3 exposure groups after adjusting for potential confounders. Results The 3 groups of exposure were correctly separated with a PLS-DA model after implementing an orthogonal signal correction with pareto standardizations (R2 = 90.7% and Q2 = 0.53). After adjusting for maternal age, parity, body mass index and smoking habits, the most statistically significant changes were observed for glycine, threonine, lactate and glycerophosphocholine (upward trend), and for citrate (downward trend). Conclusion This work suggests that an exposure to complex pesticide mixtures induces modifications of metabolic fingerprints. It can be hypothesized from identified discriminating metabolites that the pesticide mixtures could increase oxidative stress and disturb energy metabolism. PMID:23704985
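
    The R2 and Q2 figures quoted in this abstract are commonly computed from the same formula applied to two different sets of predictions. The sketch below assumes the usual definition (one minus the ratio of residual to total sum of squares); the abstract does not state the exact formula used by the authors, and the function name is hypothetical.

```python
def explained_fraction(observed, predicted):
    """Return 1 - SS_residual / SS_total for a response vector.

    With fitted values from the full model this is R2; with predictions
    obtained by cross-validation (each sample predicted by a model that
    did not see it) the same expression is usually reported as Q2.
    """
    mean = sum(observed) / len(observed)
    ss_total = sum((y - mean) ** 2 for y in observed)
    ss_residual = sum((y - p) ** 2 for y, p in zip(observed, predicted))
    return 1.0 - ss_residual / ss_total

# Toy two-class response coded 0/1 (made-up numbers, for illustration only).
y = [0.0, 0.0, 1.0, 1.0]
fitted = [0.1, 0.1, 0.9, 0.9]          # predictions from the full model
cross_validated = [0.4, 0.2, 0.6, 0.8]  # predictions for held-out samples

r2 = explained_fraction(y, fitted)
q2 = explained_fraction(y, cross_validated)
```

    A Q2 well below R2, as in the reported pair, is the usual sign that the model describes the calibration samples better than it predicts unseen ones.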

  15. High speed measurement of corn seed viability using hyperspectral imaging

    NASA Astrophysics Data System (ADS)

    Ambrose, Ashabahebwa; Kandpal, Lalit Mohan; Kim, Moon S.; Lee, Wang-Hee; Cho, Byoung-Kwan

    2016-03-01

    Corn is one of the most cultivated crops all over the world as food for humans as well as animals. Optimized agronomic practices and improved technological interventions during planting, harvesting and post-harvest handling are critical to improving the quantity and quality of corn production. Seed germination and vigor are the primary determinants of high yield notwithstanding any other factors that may play during the growth period. Seed viability may be lost during storage due to unfavorable conditions e.g. moisture content and temperatures, or physical damage during mechanical processing e.g. shelling, or overheating during drying. It is therefore vital for seed companies and farmers to test and ascertain seed viability to avoid losses of any kind. This study aimed at investigating the possibility of using hyperspectral imaging (HSI) technique to discriminate viable and nonviable corn seeds. A group of corn samples were heat treated by using a microwave process while a group of seeds were kept as control group (untreated). The hyperspectral images of corn seeds of both groups were captured between 400 and 2500 nm wave range. A partial least squares discriminant analysis (PLS-DA) model was built for the classification of aged (heat treated) and normal (untreated) corn seeds. The model showed highest classification accuracy of 97.6% (calibration) and 95.6% (prediction) in the SWIR region of the HSI. Furthermore, the PLS-DA and binary images were able to provide the visual information of treated and untreated corn seeds. The overall results suggest that HSI technique is accurate for classification of viable and non-viable seeds in a non-destructive manner.
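
    For a two-class problem such as treated versus untreated seeds, PLS-DA amounts to a PLS regression on a 0/1 class code followed by a decision threshold. The sketch below is a deliberately reduced illustration, not the authors' model: it assumes a single latent variable, a 0.5 threshold, and made-up two-band "spectra"; the function names are hypothetical.

```python
import math

def fit_plsda_one_component(X, y):
    """Fit a one-component PLS1 model to a 0/1 class vector y."""
    n, p = len(X), len(X[0])
    x_mean = [sum(col) / n for col in zip(*X)]
    y_mean = sum(y) / n
    Xc = [[x - m for x, m in zip(row, x_mean)] for row in X]
    yc = [v - y_mean for v in y]
    # Weight vector: direction in X with maximal covariance with y.
    w = [sum(Xc[i][j] * yc[i] for i in range(n)) for j in range(p)]
    norm = math.sqrt(sum(v * v for v in w))
    w = [v / norm for v in w]
    # Scores of the calibration samples on that direction.
    t = [sum(a * b for a, b in zip(row, w)) for row in Xc]
    # Regression of the centred class code on the scores.
    q = sum(a * b for a, b in zip(t, yc)) / sum(a * a for a in t)
    return x_mean, y_mean, w, q

def predict_class(model, x, threshold=0.5):
    """Predict the class code for one sample and cut it at the threshold."""
    x_mean, y_mean, w, q = model
    score = sum((a - m) * b for a, m, b in zip(x, x_mean, w))
    return 1 if y_mean + q * score > threshold else 0

# Made-up calibration data: class 0 = untreated, class 1 = heat treated.
X_cal = [[1.0, 0.2], [1.1, 0.1], [3.0, 0.3], [3.2, 0.2]]
y_cal = [0, 0, 1, 1]
model = fit_plsda_one_component(X_cal, y_cal)
```

    Real hyperspectral models would use several latent variables chosen by cross-validation; one component is used here only to keep the arithmetic visible.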

  16. Classification of Peronospora infected grapevine leaves with the use of hyperspectral imaging analysis

    NASA Astrophysics Data System (ADS)

    Serranti, S.; Bonifazi, G.; Luciani, V.; D'Aniello, L.

    2017-05-01

    The present work explores the possible utilization of hyperspectral devices, following a proximity based approach, for the diagnosis of Peronospora infection in the vineyards. It compares the performance of two hyperspectral cameras, characterized by different spectral acquisition ranges, in the identification of different levels of infection as detectable from the analysis of the leaf surface. For this purpose, healthy grapevine leaves and leaves affected by a different grade of Peronospora infection have been acquired in laboratory conditions using two different sensing devices: a Specim Imspector V10™ and a Specim Spectral Camera N17™ working in the region between 400-1000 nm and 1000-1700 nm, respectively. A Partial Least Squares Discriminant Analysis (PLS-DA) model has been built to perform the classification of healthy, infected and necrotic leaves.

  17. Discrimination of whisky brands and counterfeit identification by UV-Vis spectroscopy and multivariate data analysis.

    PubMed

    Martins, Angélica Rocha; Talhavini, Márcio; Vieira, Maurício Leite; Zacca, Jorge Jardim; Braga, Jez Willian Batista

    2017-08-15

    The discrimination of whisky brands and counterfeit identification were performed by UV-Vis spectroscopy combined with partial least squares for discriminant analysis (PLS-DA). In the proposed method all spectra were obtained with no sample preparation. The discrimination models were built with the employment of seven whisky brands: Red Label, Black Label, White Horse, Chivas Regal (12 years), Ballantine's Finest, Old Parr and Natu Nobilis. The method was validated with an independent test set of authentic samples belonging to the seven selected brands and another eleven brands not included in the training samples. Furthermore, seventy-three counterfeit samples were also used to validate the method. Results showed correct classification rates for genuine and false samples over 98.6% and 93.1%, respectively, indicating that the method can be helpful for the forensic analysis of whisky samples. Copyright © 2017 Elsevier Ltd. All rights reserved.

  18. Authentication of whisky due to its botanical origin and way of production by instrumental analysis and multivariate classification methods.

    PubMed

    Wiśniewska, Paulina; Boqué, Ricard; Borràs, Eva; Busto, Olga; Wardencki, Waldemar; Namieśnik, Jacek; Dymerski, Tomasz

    2017-02-15

    Headspace mass-spectrometry (HS-MS), mid infrared (MIR) and UV-vis spectroscopy were used to authenticate whisky samples from different origins and ways of production (Irish, Spanish, Bourbon, Tennessee Whisky and Scotch). The collected spectra were processed with partial least-squares discriminant analysis (PLS-DA) to build the classification models. In all cases the five groups of whiskies were distinguished, but the best results were obtained by HS-MS, which indicates that the biggest differences between different types of whisky are due to their aroma. Differences were also found inside groups, showing that not only raw material is important to discriminate samples but also the way of their production. The methodology is quick, easy and does not require sample preparation. Copyright © 2016 Elsevier B.V. All rights reserved.

  19. Assessment of Various Organic Matter Properties by Infrared Reflectance Spectroscopy of Sediments and Filters

    NASA Astrophysics Data System (ADS)

    Alaoui, G.; Leger, M.; Gagne, J.; Tremblay, L.

    2009-05-01

    The goal of this work was to evaluate the capability of infrared reflectance spectroscopy for a fast quantification of the elemental and molecular compositions of sedimentary and particulate organic matter (OM). A partial least-squares (PLS) regression model was used for analysis and values were compared to those obtained by traditional methods (i.e., elemental, humic and HPLC analyses). PLS tools are readily accessible from software such as GRAMS (Thermo-Fisher) used in spectroscopy. This spectroscopic-chemometric approach has several advantages including its rapidity and use of whole unaltered samples. To predict properties, a set of infrared spectra from representative samples must first be fitted to form a PLS calibration model. In this study, a large set (180) of sediments and particles on GFF filters from the St. Lawrence estuarine system were used. These samples are very heterogeneous (e.g., various tributaries, terrigenous vs. marine, events such as landslides and floods) and thus represent a challenging test for PLS prediction. For sediments, the infrared spectra were obtained with a diffuse reflectance, or DRIFT, accessory. Sedimentary carbon, nitrogen, humic substance contents as well as humic substance proportions in OM and N:C ratios were predicted by PLS. The relative root mean square error of prediction (%RMSEP) for these properties were between 5.7% (humin content) and 14.1% (total humic substance yield) using the cross-validation, or leave-one-out, approach. The %RMSEP for carbon content was lower with the PLS model (7.6%) than with an external calibration method (11.7%) (Tremblay and Gagné, 2002, Anal. Chem., 74, 2985). Moreover, the PLS approach does not require the extraction of POM needed in external calibration. Results highlighted the importance of using a PLS calibration set representative of the unknown samples (e.g., same area). For filtered particles, the infrared spectra were obtained using a novel approach based on attenuated total reflectance, or ATR, allowing the direct analysis of the filters. In addition to carbon and nitrogen contents, amino acid and muramic acid (a bacterial biomarker) yields were predicted using PLS. Calculated %RMSEP varied from 6.4% (total amino acid content) to 18.6% (muramic acid content) with cross-validation. PLS regression modeling does not require a priori knowledge of the spectral bands associated with the properties to be predicted. In turn, the spectral regions that give good PLS predictions provided valuable information on band assignment and geochemical processes. For instance, nitrogen and humin contents were greatly determined by an absorption band caused by aluminosilicate OH group. This supports the idea that OM-clay interactions, important in humin formation and OM preservation, are mediated by nitrogen-containing groups.
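
    The leave-one-out procedure behind the %RMSEP values in this abstract can be sketched as below. Two assumptions are made loudly: a univariate straight-line fit stands in for the PLS calibration model, and %RMSEP is taken to be the RMSEP divided by the mean reference value, times 100 (the abstract does not give its normalisation). All names are hypothetical.

```python
import math

def fit_line(xs, ys):
    """Least-squares straight line; a stand-in for the PLS calibration model."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx

def leave_one_out_predictions(xs, ys):
    """Predict each sample from a model calibrated on all the other samples."""
    predictions = []
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        slope, intercept = fit_line(train_x, train_y)
        predictions.append(slope * xs[i] + intercept)
    return predictions

def relative_rmsep(reference, predicted):
    """RMSEP as a percentage of the mean reference value (assumed definition)."""
    n = len(reference)
    rmsep = math.sqrt(sum((r - p) ** 2 for r, p in zip(reference, predicted)) / n)
    return 100.0 * rmsep / (sum(reference) / n)
```

    Because every prediction comes from a model that never saw that sample, the resulting error is a fairer estimate of performance on unknowns than the calibration fit itself.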

  20. Application of dielectric spectroscopy for monitoring high cell density in monoclonal antibody producing CHO cell cultivations.

    PubMed

    Párta, László; Zalai, Dénes; Borbély, Sándor; Putics, Akos

    2014-02-01

    The application of dielectric spectroscopy was frequently investigated as an on-line cell culture monitoring tool; however, it still requires supportive data and experience in order to become a robust technique. In this study, dielectric spectroscopy was used to predict viable cell density (VCD) at industrially relevant high levels in concentrated fed-batch culture of Chinese hamster ovary cells producing a monoclonal antibody for pharmaceutical purposes. For on-line dielectric spectroscopy measurements, capacitance was scanned within a wide range of frequency values (100-19,490 kHz) in six parallel cell cultivation batches. Prior to detailed mathematical analysis of the collected data, principal component analysis (PCA) was applied to compare dielectric behavior of the cultivations. PCA analysis resulted in detecting measurement disturbances. By using the measured spectroscopic data, partial least squares regression (PLS), Cole-Cole, and linear modeling were applied and compared in order to predict VCD. The Cole-Cole and the PLS model provided reliable prediction over the entire cultivation including both the early and decline phases of cell growth, while the linear model failed to estimate VCD in the later, declining cultivation phase. In regards to the measurement error sensitivity, remarkable differences were shown among PLS, Cole-Cole, and linear modeling. VCD prediction accuracy could be improved in the runs with measurement disturbances by first derivative pre-treatment in PLS and by parameter optimization of the Cole-Cole modeling.

  1. Metabolic Characterization of Peripheral Host Responses to Drainage-Resistant Klebsiella pneumoniae Liver Abscesses by Serum 1H-NMR Spectroscopy.

    PubMed

    Chang, Zhihui; Wang, Hairui; Li, Beibei; Liu, Zhaoyu; Zheng, Jiahe

    2018-01-01

    Purpose: To explore the metabolic characterization of host responses to drainage-resistant Klebsiella pneumoniae liver abscesses (DRKPLAs) with serum 1H-nuclear magnetic resonance (NMR) spectroscopy. Materials and Methods: The hospital records of all patients with a diagnosis of a liver abscess between June 2015 and December 2016 were retrieved from an electronic hospital database. Eighty-six patients with Klebsiella pneumoniae (K. pneumoniae) liver abscesses who underwent percutaneous drainage were identified. Twenty patients with confirmed DRKPLAs were studied. Moreover, we identified 20 consecutive patients with drainage-sensitive Klebsiella pneumoniae liver abscesses (DSKPLAs) as controls. Serum samples from the two groups were analyzed with 1H NMR spectroscopy. Partial least squares discriminant analysis (PLS-DA) was used to perform 1H NMR metabolic profiling. Metabolites were identified using the Human Metabolome Database, and pathway analysis was performed with MetaboAnalyst 3.0. Results: The PLS-DA test was able to discriminate between the two groups. Five key metabolites that contributed to their discrimination were identified. Glucose, lactate, and 3-hydroxybutyrate were found to be upregulated in DRKPLAs, whereas glutamine and alanine were downregulated compared with the DSKPLAs. Pathway analysis indicated that amino acid metabolisms were significantly different between the DRKPLAs and the DSKPLAs. The D-glutamine and D-glutamate metabolisms exhibited the greatest influences. Conclusions: The five key metabolites identified in our study may be potential targets for guiding novel therapeutics of DRKPLAs and are worthy of additional investigation.

  2. Variation of metabolic profiles in developing maize kernels up- and down-regulated for the hda101 gene

    PubMed Central

    Castro, Cecilia; Motto, Mario; Rossi, Vincenzo; Manetti, Cesare

    2008-01-01

    To shed light on the specific contribution of HDA101 in modulating metabolic pathways in the maize seed, changes in the metabolic profiles of kernels obtained from hda101 mutant plants have been investigated by a metabonomic approach. Dynamic properties of chromatin folding can be mediated by enzymes that modify DNA and histones. The enzymes responsible for the steady-state of histone acetylation are histone acetyltransferase and histone deacetylase (HDA). Therefore, it is interesting to evaluate the effects of up- and down-regulation of a Rpd-3 type HDA on the development of maize seeds in terms of metabolic changes. This has been achieved by analysing nuclear magnetic resonance spectra by different chemometric approaches, such as Orthogonal Projection to Latent Structure-Discriminant Analysis, Parallel Factors Analysis, and Multi-way Partial Least Squares-Discriminant Analysis (N-PLS-DA). In particular, the latter approaches were chosen because they explicitly take time into account, organizing data into a set of slices that refer to different steps of the developing process. The results show the good discriminating capabilities of the N-PLS-DA approach, even if the number of samples ought to be increased to obtain better predictive capabilities. However, using this approach, it was possible to show differences in the accumulation of metabolites during development and to highlight the changes occurring in the modified seeds. In particular, the results confirm the role of this gene in cell cycle control. PMID:18836140

  3. [Urine metabonomic study on hypertension patients of ascendant hyperactivity of gan yang syndrome by high performance liquid chromatography coupled with time of flight mass spectrometry].

    PubMed

    Jiang, Hai-Qiang; Li, Yun-Lun; Xie, Jun

    2012-03-01

    To study the changes of urine metabolites in hypertension patients of ascendant hyperactivity of Gan yang syndrome (AHGYS), and to explore its essence in hypertension patients. Ten typical hypertension patients of AHGYS were recruited as the patient group, and twelve healthy volunteers were recruited as the normal group. Urinary metabolite profiles were collected by high performance liquid chromatography coupled with time of flight mass spectrometry (HPLC-TOFMS). Principal component analysis (PCA) and partial least-squares discriminant analysis (PLS-DA) were performed using SIMCA-P software. The differential metabolites in the urine were found and identified. The possible relevant metabolic pathways were explained. PCA of the urine samples showed that the patient group and the normal group were clearly separated in the score plot. Compared with the normal group, significant changes happened to the body metabolism in the patient group. The metabolites relevant to hypertension patients of AHGYS were determined using PLS-DA. The structures and metabolic pathways of fifteen compounds were confirmed by querying the KEGG database, mainly including amino acids, free fatty acids, sphingosine, and so on. The hypertension patients of AHGYS were studied using HPLC-TOFMS combined with pattern recognition, thus finding small molecular metabolic markers at the microscopic level, which was advantageous in probing the biological nature of Chinese medicine syndromes.

  4. Green method by diffuse reflectance infrared spectroscopy and spectral region selection for the quantification of sulphamethoxazole and trimethoprim in pharmaceutical formulations.

    PubMed

    da Silva, Fabiana E B; Flores, Érico M M; Parisotto, Graciele; Müller, Edson I; Ferrão, Marco F

    2016-03-01

    An alternative method for the quantification of sulphamethoxazole (SMZ) and trimethoprim (TMP) using diffuse reflectance infrared Fourier-transform spectroscopy (DRIFTS) and partial least squares regression (PLS) was developed. Interval Partial Least Squares (iPLS) and Synergy Partial Least Squares (siPLS) were applied to select a spectral range that provided the lowest prediction error in comparison to the full-spectrum model. Fifteen commercial tablet formulations and forty-nine synthetic samples were used. The ranges of concentration considered were 400 to 900 mg g-1 SMZ and 80 to 240 mg g-1 TMP. Spectral data were recorded between 600 and 4000 cm-1 with a 4 cm-1 resolution by DRIFTS. The proposed procedure was compared to high performance liquid chromatography (HPLC). The results obtained from the root mean square error of prediction (RMSEP), during the validation of the models for samples of SMZ and TMP using siPLS, demonstrate that this approach is a valid technique for use in quantitative analysis of pharmaceutical formulations. The selected interval algorithm allowed building regression models with smaller errors when compared to the full-spectrum PLS model. An RMSEP of 13.03 mg g-1 for SMZ and 4.88 mg g-1 for TMP was obtained after selection of the best spectral regions by siPLS.

  5. Forensic Discrimination of Latent Fingerprints Using Laser-Induced Breakdown Spectroscopy (LIBS) and Chemometric Approaches.

    PubMed

    Yang, Jun-Ho; Yoh, Jack J

    2018-01-01

    A novel technique is reported for separating overlapping latent fingerprints using chemometric approaches that combine laser-induced breakdown spectroscopy (LIBS) and multivariate analysis. The LIBS technique provides the capability of real time analysis and high frequency scanning as well as the data regarding the chemical composition of overlapping latent fingerprints. These spectra offer valuable information for the classification and reconstruction of overlapping latent fingerprints by implementing appropriate statistical multivariate analysis. The current study employs principal component analysis and partial least square methods for the classification of latent fingerprints from the LIBS spectra. This technique was successfully demonstrated through a classification study of four distinct latent fingerprints using classification methods such as soft independent modeling of class analogy (SIMCA) and partial least squares discriminant analysis (PLS-DA). The novel method yielded an accuracy of more than 85% and was proven to be sufficiently robust. Furthermore, through laser scanning analysis at a spatial interval of 125 µm, the overlapping fingerprints were reconstructed as separate two-dimensional forms.

  6. Rapid characterization of chemical markers for discrimination of Moutan Cortex and its processed products by direct injection-based mass spectrometry profiling and metabolomic method.

    PubMed

    Li, Chao-Ran; Li, Meng-Ning; Yang, Hua; Li, Ping; Gao, Wen

    2018-06-01

    Processing of herbal medicines is a characteristic pharmaceutical technique in Traditional Chinese Medicine, which can reduce toxicity and side effects, improve the flavor and efficacy, and even change the pharmacological action entirely. It is significant and crucial to develop a method to find chemical markers for differentiating herbal medicines in different processed degrees. The aim of this study was to develop a rapid and reasonable method to discriminate Moutan Cortex and its processed products, and to reveal the characteristics of chemical components based on chemical markers. Thirty batches of Moutan Cortex and its processed products, including 11 batches of Raw Moutan Cortex (RMC), 9 batches of Moutan Cortex Tostus (MCT) and 10 batches of Moutan Cortex Carbonisatus (MCC), were directly injected in electrospray ionization quadrupole time-of-flight mass spectrometry (ESI-QTOF MS) for rapid analysis in positive and negative mode. Without chromatographic separation, each run was completed within 3 min. The raw MS data were automatically extracted by background deduction and a molecular feature (MF) extraction algorithm. In negative mode, a total of 452 MFs were obtained and then pretreated by data filtration and differential analysis. After that, the filtered 85 MFs were treated by principal component analysis (PCA) to reduce the dimensions. Subsequently, a partial least squares discrimination analysis (PLS-DA) model was constructed for differentiation and chemical marker detection of Moutan Cortex in different processed degrees. The positive mode data were treated in the same way as those in negative mode. RMC, MCT and MCC were successfully classified. Moreover, 14 and 3 chemical markers from negative and positive mode, respectively, were screened by the combination of their relative peak areas and the variable importance in the projection (VIP) values in the PLS-DA model. The content changes of these chemical markers were used to illustrate the chemical changes of Moutan Cortex after processing. These results showed that the proposed method, which combined non-targeted metabolomics analysis with multivariate statistical analysis, is reasonable and effective. It could not only be applied to discriminate herbal medicines and their processing products, but also to reveal the characteristics of chemical components during processing. Copyright © 2018. Published by Elsevier GmbH.
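
For readers unfamiliar with the VIP values mentioned in the abstract above, the usual textbook definition can be written as follows. This is a sketch of the common formulation, not necessarily the exact computation used in the study; the symbols are assumptions introduced here: p is the number of variables, A the number of PLS components, w_{ja} the weight of variable j in component a, t_a the score vector, and q_a the y-loading of component a.

```latex
\mathrm{VIP}_j = \sqrt{\frac{p \sum_{a=1}^{A} SS_a \left( w_{ja} / \lVert \mathbf{w}_a \rVert \right)^2}{\sum_{a=1}^{A} SS_a}},
\qquad SS_a = q_a^2 \, \mathbf{t}_a^{\top} \mathbf{t}_a
```

Because the squared VIP values average to 1 over all variables, variables with VIP greater than 1 are often treated as important by rule of thumb; the abstract does not state which cutoff was applied.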

  7. Automatic detection and classification of EOL-concrete and resulting recovered products by hyperspectral imaging

    NASA Astrophysics Data System (ADS)

    Palmieri, Roberta; Bonifazi, Giuseppe; Serranti, Silvia

    2014-05-01

    The recovery of materials from Demolition Waste (DW) represents one of the main targets of the recycling industry, and its characterization is important in order to set up efficient sorting and/or quality control systems. End-Of-Life (EOL) concrete materials identification is necessary to maximize DW conversion into useful secondary raw materials, so it is fundamental to develop strategies for the implementation of an automatic recognition system of the recovered products. In this paper, the HyperSpectral Imaging (HSI) technique was applied in order to detect DW composition. Hyperspectral images were acquired by a laboratory device equipped with an HSI sensing device working in the near infrared range (1000-1700 nm): NIR Spectral Camera™, embedding an ImSpector™ N17E (SPECIM Ltd, Finland). Acquired spectral data were analyzed adopting the PLS_Toolbox (Version 7.5, Eigenvector Research, Inc.) under the Matlab® environment (Version 7.11.1, The Mathworks, Inc.), applying different chemometric methods: Principal Component Analysis (PCA) for exploratory data analysis and Partial Least Squares-Discriminant Analysis (PLS-DA) to build classification models. Results showed that it is possible to recognize DW materials, distinguishing recycled aggregates from contaminants (e.g. bricks, gypsum, plastics, wood, foam, etc.). The developed procedure is cheap, fast and non-destructive: it could be used to make some steps of the recycling process more efficient and less expensive.

  8. Modeling RP-1 fuel advanced distillation data using comprehensive two-dimensional gas chromatography coupled with time-of-flight mass spectrometry and partial least squares analysis.

    PubMed

    Kehimkar, Benjamin; Parsons, Brendon A; Hoggard, Jamin C; Billingsley, Matthew C; Bruno, Thomas J; Synovec, Robert E

    2015-01-01

    Recent efforts in predicting rocket propulsion (RP-1) fuel performance through modeling put greater emphasis on obtaining detailed and accurate fuel properties, as well as elucidating the relationships between fuel compositions and their properties. Herein, we study multidimensional chromatographic data obtained by comprehensive two-dimensional gas chromatography combined with time-of-flight mass spectrometry (GC × GC-TOFMS) to analyze RP-1 fuels. For GC × GC separations, RTX-Wax (polar stationary phase) and RTX-1 (non-polar stationary phase) columns were implemented for the primary and secondary dimensions, respectively, to separate the chemical compound classes (alkanes, cycloalkanes, aromatics, etc.), providing a significant level of chemical compositional information. The GC × GC-TOFMS data were analyzed using partial least squares regression (PLS) chemometric analysis to model and predict advanced distillation curve (ADC) data for ten RP-1 fuels that were previously analyzed using the ADC method. The PLS modeling provides insight into the chemical species that impact the ADC data. The PLS modeling correlates compositional information found in the GC × GC-TOFMS chromatograms of each RP-1 fuel, and their respective ADC, and allows prediction of the ADC for each RP-1 fuel with good precision and accuracy. The root-mean-square error of calibration (RMSEC) ranged from 0.1 to 0.5 °C, and was typically below ∼0.2 °C, for the PLS calibration of the ADC modeling with GC × GC-TOFMS data, indicating a good fit of the model to the calibration data. Likewise, the predictive power of the overall method via PLS modeling was assessed using leave-one-out cross-validation (LOOCV) yielding root-mean-square error of cross-validation (RMSECV) ranging from 1.4 to 2.6 °C, and was typically below ∼2.0 °C, at each % distilled measurement point during the ADC analysis.
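
The difference between the calibration error (RMSEC) and the leave-one-out cross-validation error (RMSECV) reported above can be illustrated with a minimal Python sketch. It assumes a one-variable ordinary least-squares line as a stand-in for the PLS model, divides by n in both error measures (some software subtracts degrees of freedom for RMSEC), and uses made-up data; none of the names or numbers come from the study.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares line y = a + b*x (a stand-in for a PLS model)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b


def rmse(residuals):
    """Root mean square of a list of residuals."""
    return math.sqrt(sum(r * r for r in residuals) / len(residuals))


def rmsec(xs, ys):
    """Calibration error: fit on all samples, evaluate on the same samples."""
    a, b = fit_line(xs, ys)
    return rmse([y - (a + b * x) for x, y in zip(xs, ys)])


def rmsecv_loo(xs, ys):
    """Leave-one-out error: each sample is predicted by a model fitted without it."""
    residuals = []
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        a, b = fit_line(train_x, train_y)
        residuals.append(ys[i] - (a + b * xs[i]))
    return rmse(residuals)


# Hypothetical calibration data (not taken from the abstract).
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [1.1, 1.9, 3.2, 3.8, 5.1, 6.0]
print("RMSEC :", rmsec(xs, ys))
print("RMSECV:", rmsecv_loo(xs, ys))
```

For a least-squares fit the leave-one-out error is never smaller than the calibration error, which matches the pattern in the abstract, where RMSECV (1.4 to 2.6 °C) exceeds RMSEC (0.1 to 0.5 °C).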

  9. Prediction of CO concentrations based on a hybrid Partial Least Square and Support Vector Machine model

    NASA Astrophysics Data System (ADS)

    Yeganeh, B.; Motlagh, M. Shafie Pour; Rashidi, Y.; Kamalan, H.

    2012-08-01

    Due to the health impacts caused by exposure to air pollutants in urban areas, monitoring and forecasting of air quality parameters have become an important topic in atmospheric and environmental research. Knowledge of the dynamics and complexity of air pollutant behavior has made artificial intelligence models a useful tool for more accurate pollutant concentration prediction. This paper focuses on an innovative method of daily air pollution prediction using a combination of Support Vector Machine (SVM) as predictor and Partial Least Squares (PLS) as a data selection tool, based on the measured values of CO concentrations. The CO concentrations of the Rey monitoring station in the south of Tehran, from Jan. 2007 to Feb. 2011, have been used to test the effectiveness of this method. The hourly CO concentrations have been predicted using the SVM and the hybrid PLS-SVM models. Similarly, daily CO concentrations have been predicted based on the aforementioned four years of measured data. Results demonstrated that both models have good prediction ability; however, the hybrid PLS-SVM has better accuracy. In the analysis presented in this paper, statistical estimators including relative mean errors, root mean squared errors and the mean absolute relative error have been employed to compare the performance of the models. It has been concluded that the errors decrease after size reduction and that coefficients of determination increase from 56-81% for the SVM model to 65-85% for the hybrid PLS-SVM model. It was also found that the hybrid PLS-SVM model required less computational time than the SVM model, as expected, hence supporting the more accurate and faster prediction ability of the hybrid PLS-SVM model.

  10. Imaging of glia activation in people with primary lateral sclerosis.

    PubMed

    Paganoni, Sabrina; Alshikho, Mohamad J; Zürcher, Nicole R; Cernasov, Paul; Babu, Suma; Loggia, Marco L; Chan, James; Chonde, Daniel B; Garcia, David Izquierdo; Catana, Ciprian; Mainero, Caterina; Rosen, Bruce R; Cudkowicz, Merit E; Hooker, Jacob M; Atassi, Nazem

    2018-01-01

    Glia activation is thought to contribute to neuronal damage in several neurodegenerative diseases based on preclinical and human post-mortem studies, but its role in primary lateral sclerosis (PLS) is unknown. The objective was to localize and measure glia activation in people with PLS compared to healthy controls (HC). Ten participants with PLS and ten age-matched HCs underwent simultaneous magnetic resonance (MR) and positron emission tomography (PET). The radiotracer [11C]-PBR28 was used to obtain PET-based measures of 18 kDa translocator protein (TSPO) expression, a marker of activated glial cells. MR techniques included a structural sequence to measure cortical thickness and diffusion tensor imaging (DTI) to assess white matter integrity. PET data showed increased [11C]-PBR28 uptake in anatomically-relevant motor regions which co-localized with areas of regional gray matter atrophy and decreased subcortical fractional anisotropy. This study supports a link between glia activation and neuronal degeneration in PLS, and suggests that these disease mechanisms can be measured in vivo in PLS. Future studies are needed to determine the longitudinal changes of these imaging measures and to clarify if MR-PET with [11C]-PBR28 can be used as a biomarker for drug development in the context of clinical trials for PLS.

  11. Near Infrared Spectroscopy Detection and Quantification of Herbal Medicines Adulterated with Sibutramine.

    PubMed

    da Silva, Neirivaldo Cavalcante; Honorato, Ricardo Saldanha; Pimentel, Maria Fernanda; Garrigues, Salvador; Cervera, Maria Luisa; de la Guardia, Miguel

    2015-09-01

    There is an increasing demand for herbal medicines in weight loss treatment. Some synthetic chemicals, such as sibutramine (SB), have been detected as adulterants in herbal formulations. In this study, two strategies using near infrared (NIR) spectroscopy have been developed to evaluate potential adulteration of herbal medicines with SB: a qualitative screening approach and a quantitative methodology based on multivariate calibration. Samples comprised products commercialized as herbal medicines as well as laboratory-adulterated samples. Spectra were obtained in the range of 14,000-4000 cm-1. Using PLS-DA, a correct classification of 100% was achieved for the external validation set. In the quantitative approach, the root mean square error of prediction (RMSEP), for both PLS and MLR models, was 0.2% w/w. The results prove the potential of NIR spectroscopy and multivariate calibration in quantifying sibutramine in adulterated herbal medicine samples. © 2015 American Academy of Forensic Sciences.

  12. Fast and nondestructive determination of protein content in rapeseeds (Brassica napus L.) using Fourier transform infrared photoacoustic spectroscopy (FTIR-PAS).

    PubMed

    Lu, Yuzhen; Du, Changwen; Yu, Changbing; Zhou, Jianmin

    2014-08-01

    Fast and non-destructive determination of rapeseed protein content carries significant implications in rapeseed production. This study presented the first attempt at using Fourier transform mid-infrared photoacoustic spectroscopy (FTIR-PAS) to quantify the protein content of rapeseed. The full-spectrum model was first built using partial least squares (PLS). Interval selection methods including interval partial least squares (iPLS), synergy interval partial least squares (siPLS), backward elimination interval partial least squares (biPLS) and dynamic backward elimination interval partial least squares (dyn-biPLS) were then employed to select the relevant band or band combination for PLS modeling. The full-spectrum PLS model achieved a ratio of prediction to deviation (RPD) of 2.047. In comparison, all interval selection methods produced better results than full-spectrum modeling. siPLS achieved the best predictive accuracy with an RPD of 3.215 when the spectrum was sectioned into 25 intervals, and two intervals (1198-1335 and 1614-1753 cm-1) were selected. iPLS outperformed biPLS and dyn-biPLS, and dyn-biPLS performed slightly better than biPLS. FTIR-PAS was verified as a promising analytical tool to quantify rapeseed protein content. Interval selection could extract the relevant individual band or synergy band associated with the sample constituent of interest, and then improve the prediction accuracy of the full-spectrum model. © 2013 Society of Chemical Industry.
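
The RPD figures quoted above can be read with the help of a small Python sketch. It assumes the common definition of RPD as the standard deviation of the reference values divided by the RMSEP (some authors use the bias-corrected standard error of prediction instead); the abstract does not say which variant was used, and the protein values below are invented for illustration.

```python
import math
import statistics


def rmsep(y_true, y_pred):
    """Root mean square error of prediction over a validation set."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def rpd(y_true, y_pred):
    """Ratio of prediction to deviation: SD of reference values over RMSEP."""
    return statistics.stdev(y_true) / rmsep(y_true, y_pred)


# Hypothetical reference and predicted protein contents (% w/w).
reference = [18.2, 20.1, 22.4, 19.5, 24.0, 21.3]
predicted = [18.9, 19.6, 22.0, 20.3, 23.1, 21.8]
print("RMSEP:", rmsep(reference, predicted))
print("RPD  :", rpd(reference, predicted))
```

A larger RPD means the prediction error is small relative to the natural spread of the reference values, which is why the increase from 2.047 to 3.215 is reported as an improvement.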

  13. Three-way analysis of the UPLC-PDA dataset for the multicomponent quantitation of hydrochlorothiazide and olmesartan medoxomil in tablets by parallel factor analysis and three-way partial least squares.

    PubMed

    Dinç, Erdal; Ertekin, Zehra Ceren

    2016-01-01

    An application of parallel factor analysis (PARAFAC) and three-way partial least squares (3W-PLS1) regression models to ultra-performance liquid chromatography-photodiode array detection (UPLC-PDA) data with co-eluted peaks in the same wavelength and time regions was described for the multicomponent quantitation of hydrochlorothiazide (HCT) and olmesartan medoxomil (OLM) in tablets. Three-way dataset of HCT and OLM in their binary mixtures containing telmisartan (IS) as an internal standard was recorded with a UPLC-PDA instrument. Firstly, the PARAFAC algorithm was applied for the decomposition of three-way UPLC-PDA data into the chromatographic, spectral and concentration profiles to quantify the concerned compounds. Secondly, 3W-PLS1 approach was subjected to the decomposition of a tensor consisting of three-way UPLC-PDA data into a set of triads to build 3W-PLS1 regression for the analysis of the same compounds in samples. For the proposed three-way analysis methods in the regression and prediction steps, the applicability and validity of PARAFAC and 3W-PLS1 models were checked by analyzing the synthetic mixture samples, inter-day and intra-day samples, and standard addition samples containing HCT and OLM. Two different three-way analysis methods, PARAFAC and 3W-PLS1, were successfully applied to the quantitative estimation of the solid dosage form containing HCT and OLM. Regression and prediction results provided from three-way analysis were compared with those obtained by traditional UPLC method. Copyright © 2015 Elsevier B.V. All rights reserved.

  14. Differentiation of cumin seeds using a metal-oxide based gas sensor array in tandem with chemometric tools.

    PubMed

    Ghasemi-Varnamkhasti, Mahdi; Amiri, Zahra Safari; Tohidi, Mojtaba; Dowlati, Majid; Mohtasebi, Seyed Saeid; Silva, Adenilton C; Fernandes, David D S; Araujo, Mário C U

    2018-01-01

    Cumin is a plant of the Apiaceae family (Umbelliferae) which has been used since ancient times as a medicinal plant and as a spice. The difference in the percentage of aromatic compounds in cumin obtained from different locations has led to differentiation of some species of cumin from other species. The quality and price of cumin vary according to the species and may be an incentive for the adulteration of high value samples with low quality cultivars. An electronic nose simulates the human olfactory sense by using an array of sensors to distinguish complex smells. This makes it an alternative for the identification and classification of cumin species. The data, however, may have a complex structure that is difficult to interpret. Given this, chemometric tools can be used to manipulate data with two-dimensional structure (sensor responses in time) obtained by using electronic nose sensors. In this study, an electronic nose based on eight metal oxide semiconductor (MOS) sensors and 2D-LDA (two-dimensional linear discriminant analysis), U-PLS-DA (partial least squares discriminant analysis applied to the unfolded data) and PARAFAC-LDA (parallel factor analysis with linear discriminant analysis) algorithms were used in order to identify and classify different varieties of both cultivated and wild black caraway and cumin. The proposed methodology presented a correct classification rate of 87.1% for PARAFAC-LDA and 100% for 2D-LDA and U-PLS-DA, indicating a promising strategy for the classification of different varieties of cumin, caraway and other seeds. Copyright © 2017 Elsevier B.V. All rights reserved.

  15. Authentication of animal origin of heparin and low molecular weight heparin including ovine, porcine and bovine species using 1D NMR spectroscopy and chemometric tools.

    PubMed

    Monakhova, Yulia B; Diehl, Bernd W K; Fareed, Jawed

    2018-02-05

    High resolution (600 MHz) nuclear magnetic resonance (NMR) spectroscopy is used to distinguish heparin and low-molecular weight heparins (LMWHs) produced from porcine, bovine and ovine mucosal tissues as well as their blends. For multivariate analysis, several statistical methods such as principal component analysis (PCA), factor discriminant analysis (FDA), partial least squares-discriminant analysis (PLS-DA) and linear discriminant analysis (LDA) were utilized for the modeling of NMR data of more than 100 authentic samples. Heparin and LMWH samples from the independent test set (n=15) were 100% correctly classified according to their animal origin. Moreover, by using 1H NMR coupled with chemometrics, several batches of bovine heparins from two producers were differentiated. Thus, NMR spectroscopy combined with chemometrics is an efficient tool for simultaneous identification of animal origin and process-based manufacturing differences in heparin products. Copyright © 2017 Elsevier B.V. All rights reserved.

  16. Quantitative and Discriminative Evaluation of Contents of Phenolic and Flavonoid and Antioxidant Competence for Chinese Honeys from Different Botanical Origins.

    PubMed

    Shen, Shi; Wang, Jingbo; Zhuo, Qin; Chen, Xi; Liu, Tingting; Zhang, Shuang-Qing

    2018-05-08

    Phenolics and flavonoids in honey are considered as the main phytonutrients which not only act as natural antioxidants, but can also be used as floral markers for honey identification. In this study, the chemical profiles of phenolics and flavonoids, antioxidant competences including total phenolic content, DPPH and ABTS assays and discrimination using chemometric analysis of various Chinese monofloral honeys from six botanical origins (acacia, Vitex, linden, rapeseed, Astragalus and Codonopsis) were examined. A reproducible and sensitive ultra-performance liquid chromatography-tandem mass spectrometry (UPLC-MS/MS) method was optimized and validated for the simultaneous determination of 38 phenolics, flavonoids and abscisic acid in honey. Formononetin, ononin, calycosin and calycosin-7-O-β-d-glucoside were identified and quantified in honeys for the first time. Principal component analysis (PCA) showed obvious differences among the honey samples in three-dimensional space accounting for 72.63% of the total variance. Hierarchical cluster analysis (HCA) also revealed that the botanical origins of honey samples correlated with their phenolic and flavonoid contents. Partial least squares-discriminant analysis (PLS-DA) classification was performed to derive a model with high prediction ability. Orthogonal partial least squares-discriminant analysis (OPLS-DA) model was employed to identify markers specific to a particular honey type. The results indicated that Chinese honeys contained various and discriminative phenolics and flavonoids, as well as antioxidant competence from different botanical origins, which was an alternative approach to honey identification and nutritional evaluation.

  17. The classification of almonds (Prunus dulcis) by country and variety using UHPLC-HRMS-based untargeted metabolomics.

    PubMed

    Gil Solsona, R; Boix, C; Ibáñez, M; Sancho, J V

    2018-03-01

    The aim of this study was to use an untargeted UHPLC-HRMS-based metabolomics approach allowing discrimination between almonds based on their origin and variety. Samples were homogenised, extracted with ACN:H2O (80:20) containing 0.1% HCOOH and injected in a UHPLC-QTOF instrument in both positive and negative ionisation modes. Principal component analysis (PCA) was performed to ensure the absence of outliers. Partial least squares-discriminant analysis (PLS-DA) was employed to create and validate the models for country (with five different compounds) and variety (with 20 features), showing more than 95% accuracy. Additional samples were injected and the model was evaluated with blind samples, with more than 95% of samples being correctly classified using both models. MS/MS experiments were carried out to tentatively elucidate the highlighted marker compounds (pyranosides, peptides or amino acids, among others). This study has shown the potential of high-resolution mass spectrometry to perform and validate classification models, also providing information concerning the identification of the unexpected biomarkers which showed the highest discriminant power.

  18. Modified multiblock partial least squares path modeling algorithm with backpropagation neural networks approach

    NASA Astrophysics Data System (ADS)

    Yuniarto, Budi; Kurniawan, Robert

    2017-03-01

    PLS Path Modeling (PLS-PM) differs from covariance-based SEM in that PLS-PM uses a variance- or component-based approach; PLS-PM is therefore also known as component-based SEM. Multiblock Partial Least Squares (MBPLS) is a PLS regression method that can be used in PLS Path Modeling, where it is known as Multiblock PLS Path Modeling (MBPLS-PM). This method uses an iterative procedure in its algorithm. This research aims to modify MBPLS-PM with a Back Propagation Neural Network approach. The result is that the MBPLS-PM algorithm can be modified using the Back Propagation Neural Network approach to replace the iterative process in the backward and forward steps that obtain the matrix t and the matrix u in the algorithm. With this modification, the model parameters obtained are not significantly different from those obtained by the original MBPLS-PM algorithm.

  19. Decision-making in healthcare: a practical application of partial least square path modelling to coverage of newborn screening programmes.

    PubMed

    Fischer, Katharina E

    2012-08-02

    Decision-making in healthcare is complex. Research on coverage decision-making has focused on comparative studies for several countries, statistical analyses for single decision-makers, the decision outcome and appraisal criteria. Accounting for decision processes extends the complexity, as they are multidimensional and process elements need to be regarded as latent constructs (composites) that are not observed directly. The objective of this study was to present a practical application of partial least square path modelling (PLS-PM) to evaluate how it offers a method for empirical analysis of decision-making in healthcare. Empirical approaches that applied PLS-PM to decision-making in healthcare were identified through a systematic literature search. PLS-PM was used as an estimation technique for a structural equation model that specified hypotheses between the components of decision processes and the reasonableness of decision-making in terms of medical, economic and other ethical criteria. The model was estimated for a sample of 55 coverage decisions on the extension of newborn screening programmes in Europe. Results were evaluated by standard reliability and validity measures for PLS-PM. After modification by dropping two indicators that showed poor measures in the measurement models' quality assessment and were not meaningful for newborn screening, the structural equation model estimation produced plausible results. The presence of three influences was supported: the links between both stakeholder participation or transparency and the reasonableness of decision-making; and the effect of transparency on the degree of scientific rigour of assessment. Reliable and valid measurement models were obtained to describe the composites of 'transparency', 'participation', 'scientific rigour' and 'reasonableness'. The structural equation model was among the first applications of PLS-PM to coverage decision-making. It allowed testing of hypotheses in situations where there are links between several non-observable constructs. PLS-PM was compatible in accounting for the complexity of coverage decisions to obtain a more realistic perspective for empirical analysis. The model specification can be used for hypothesis testing by using larger sample sizes and for data in the full domain of health technologies.

  20. ATR-FTIR spectroscopy for the determination of Na4EDTA in detergent aqueous solutions.

    PubMed

    Suárez, Leticia; García, Roberto; Riera, Francisco A; Diez, María A

    2013-10-15

    Fourier transform infrared spectroscopy in the attenuated total reflectance mode (ATR-FTIR) combined with partial least squares (PLS) algorithms was used to design calibration and prediction models for a wide range of tetrasodium ethylenediaminetetraacetate (Na4EDTA) concentrations (0.1 to 28% w/w) in aqueous solutions. The spectra obtained using air and water as a background medium were tested for the best fit. The PLS models designed afforded a sufficient level of precision and accuracy to allow even very small amounts of Na4EDTA to be determined. A root mean square error of nearly 0.37 for the validation set was obtained. Over a concentration range below 5% w/w, the values estimated from a combination of ATR-FTIR spectroscopy and a PLS algorithm model were similar to those obtained from an HPLC analysis of NaFeEDTA complexes and subsequent detection by UV absorbance. However, the lowest detection limit for Na4EDTA concentrations afforded by this spectroscopic/chemometric method was 0.3% w/w. The PLS model was successfully used as a rapid and simple method to quantify Na4EDTA in aqueous solutions of industrial detergents as an alternative to HPLC-UV analysis, which involves time-consuming dilution and complexation processes. © 2013 Elsevier B.V. All rights reserved.

  1. An in vitro approach for lipolysis measurement using high-resolution mass spectrometry and partial least squares based analysis.

    PubMed

    Chang, Wen-Qi; Zhou, Jian-Liang; Li, Yi; Shi, Zi-Qi; Wang, Li; Yang, Jie; Li, Ping; Liu, Li-Fang; Xin, Gui-Zhong

    2017-01-15

    The elevation of free fatty acids (FFAs) has been regarded as a universal metabolic signature of excessive adipocyte lipolysis. Nowadays, an in vitro lipolysis assay is generally essential for drug screening prior to the animal study. Here, we present a novel in vitro approach for lipolysis measurement combining UHPLC-Orbitrap and partial least squares (PLS) based analysis. Firstly, the calibration matrix was constructed by serial proportions of mixed samples (blended from control and model samples). Then, lipidome profiling was performed by UHPLC-Orbitrap, and 403 variables were extracted and aligned as a dataset. Owing to the high resolution of the Orbitrap analyzer and open source lipid identification software, 28 FFAs were further screened and identified. Based on the relative intensity of the screened FFAs, a PLS regression model was constructed for lipolysis measurement. After leave-one-out cross-validation, ten principal components were designated to build the final PLS model with excellent performance (RMSECV, 0.0268; RMSEC, 0.0173; R2, 0.9977). In addition, the high predictive accuracy (R2 = 0.9907 and RMSEP = 0.0345) of the trained PLS model was also demonstrated using test samples. Finally, taking curcumin as a model compound, its antilipolytic effect on palmitic acid-induced lipolysis was successfully predicted as 31.78% by the proposed approach. Besides, supplementary evidence of curcumin-induced modification in FFA composition as well as the lipidome was given by PLS extended methods. Different from general biological assays, the high resolution MS-based method provides more sophisticated information on biological events. Thus, the novel biological evaluation model proposed here showed promising perspectives for drug evaluation or disease diagnosis. Copyright © 2016 Elsevier B.V. All rights reserved.

  2. Non-destructive evaluation of bacteria-infected watermelon seeds using visible/near-infrared hyperspectral imaging.

    PubMed

    Lee, Hoonsoo; Kim, Moon S; Song, Yu-Rim; Oh, Chang-Sik; Lim, Hyoun-Sub; Lee, Wang-Hee; Kang, Jum-Soon; Cho, Byoung-Kwan

    2017-03-01

    There is a need to minimize economic damage by sorting infected seeds from healthy seeds before seeding. However, current methods of detecting infected seeds, such as seedling grow-out, enzyme-linked immunosorbent assays, the polymerase chain reaction (PCR) and real-time PCR, have critical drawbacks in that they are time-consuming, labor-intensive and destructive procedures. The present study aimed to evaluate the potential of a visible/near-infrared (Vis/NIR) hyperspectral imaging system for detecting bacteria-infected watermelon seeds. A hyperspectral Vis/NIR reflectance imaging system (spectral region of 400-1000 nm) was constructed to obtain hyperspectral reflectance images for 336 bacteria-infected watermelon seeds, which were then subjected to partial least squares discriminant analysis (PLS-DA) and a least-squares support vector machine (LS-SVM) to classify bacteria-infected watermelon seeds from healthy watermelon seeds. The developed system detected bacteria-infected watermelon seeds with an accuracy > 90% (PLS-DA: 91.7%, LS-SVM: 90.5%), suggesting that the Vis/NIR hyperspectral imaging system is effective for quarantining bacteria-infected watermelon seeds. The results of the present study show that it is possible to use the Vis/NIR hyperspectral imaging system for detecting bacteria-infected watermelon seeds. © 2016 Society of Chemical Industry.
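
The two-class PLS-DA step described above (infected versus healthy) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the study's implementation: it uses a single latent variable, codes the classes as 0 and 1, assigns class 1 when the predicted value exceeds 0.5, and runs on invented three-variable "spectra". All function and variable names are hypothetical.

```python
def fit_plsda_1lv(X, labels):
    """Fit a one-latent-variable PLS model to 0/1 class labels (two-class PLS-DA)."""
    n, m = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(m)]
    y_mean = sum(labels) / n
    Xc = [[row[j] - x_mean[j] for j in range(m)] for row in X]
    yc = [y - y_mean for y in labels]
    # Weight vector: the direction in X with maximum covariance with the labels.
    w = [sum(Xc[i][j] * yc[i] for i in range(n)) for j in range(m)]
    norm = sum(v * v for v in w) ** 0.5
    w = [v / norm for v in w]
    # Scores on the latent variable and the inner regression coefficient.
    t = [sum(x * wj for x, wj in zip(row, w)) for row in Xc]
    c = sum(ti * yi for ti, yi in zip(t, yc)) / sum(ti * ti for ti in t)
    return x_mean, y_mean, w, c


def predict_plsda_1lv(model, X, threshold=0.5):
    """Project new samples, predict the continuous label, then threshold it."""
    x_mean, y_mean, w, c = model
    classes = []
    for row in X:
        t = sum((x - mu) * wj for x, mu, wj in zip(row, x_mean, w))
        classes.append(1 if y_mean + c * t > threshold else 0)
    return classes


# Hypothetical reflectance values at three wavelengths (0 = healthy, 1 = infected).
X_train = [
    [0.10, 0.20, 0.30], [0.12, 0.18, 0.31], [0.09, 0.22, 0.29],
    [0.50, 0.21, 0.70], [0.52, 0.19, 0.72], [0.48, 0.20, 0.69],
]
y_train = [0, 0, 0, 1, 1, 1]
X_test = [[0.11, 0.20, 0.30], [0.51, 0.20, 0.71]]

model = fit_plsda_1lv(X_train, y_train)
print(predict_plsda_1lv(model, X_test))
```

Real applications typically use several latent variables chosen by cross-validation, and multi-class problems use one dummy-coded column per class; the single-component case is kept here only because it shows the regress-then-threshold idea in the fewest lines.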

  3. Non-destructive geographical traceability of sea cucumber (Apostichopus japonicus) using near infrared spectroscopy combined with chemometric methods

    PubMed Central

    Cai, Rui; Wang, Shisheng; Tang, Bo; Li, Yueqing; Zhao, Weijie

    2018-01-01

    Sea cucumber is the major tonic seafood worldwide, and geographical origin traceability is an important part of its quality and safety control. In this work, a non-destructive method for origin traceability of sea cucumber (Apostichopus japonicus) from northern China Sea and East China Sea using near infrared spectroscopy (NIRS) and multivariate analysis methods was proposed. Total fat contents of 189 fresh sea cucumber samples were determined and partial least-squares (PLS) regression was used to establish the quantitative NIRS model. The ordered predictor selection algorithm was performed to select feasible wavelength regions for the construction of PLS and identification models. The identification model was developed by principal component analysis combined with Mahalanobis distance and scaling to the first range algorithms. In the test set of the optimum PLS models, the root mean square error of prediction was 0.45, and correlation coefficient was 0.90. The correct classification rates of 100% were obtained in both identification calibration model and test model. The overall results indicated that NIRS method combined with chemometric analysis was a suitable tool for origin traceability and identification of fresh sea cucumber samples from nine origins in China. PMID:29410795

  4. Non-destructive geographical traceability of sea cucumber (Apostichopus japonicus) using near infrared spectroscopy combined with chemometric methods.

    PubMed

    Guo, Xiuhan; Cai, Rui; Wang, Shisheng; Tang, Bo; Li, Yueqing; Zhao, Weijie

    2018-01-01

    Sea cucumber is the major tonic seafood worldwide, and geographical origin traceability is an important part of its quality and safety control. In this work, a non-destructive method for origin traceability of sea cucumber (Apostichopus japonicus) from northern China Sea and East China Sea using near infrared spectroscopy (NIRS) and multivariate analysis methods was proposed. Total fat contents of 189 fresh sea cucumber samples were determined and partial least-squares (PLS) regression was used to establish the quantitative NIRS model. The ordered predictor selection algorithm was performed to select feasible wavelength regions for the construction of PLS and identification models. The identification model was developed by principal component analysis combined with Mahalanobis distance and scaling to the first range algorithms. In the test set of the optimum PLS models, the root mean square error of prediction was 0.45, and correlation coefficient was 0.90. The correct classification rates of 100% were obtained in both identification calibration model and test model. The overall results indicated that NIRS method combined with chemometric analysis was a suitable tool for origin traceability and identification of fresh sea cucumber samples from nine origins in China.

  5. [Detection of Hawthorn Fruit Defects Using Hyperspectral Imaging].

    PubMed

    Liu, De-hua; Zhang, Shu-juan; Wang, Bin; Yu, Ke-qiang; Zhao, Yan-ru; He, Yong

    2015-11-01

    Hyperspectral imaging technology covering the range of 380-1000 nm was employed to detect defects (bruise and insect damage) of hawthorn fruit. A total of 134 samples were collected, which included 46 damaged fruits, 30 pest-damaged fruits, 10 fruits with both injury and pest damage, and 48 intact fruits. Calyx/stem-end regions and bruise/insect damage regions have a similar appearance in RGB images, so they are easily confused. Hence, five types of regions including bruise, insect damage, sound, calyx, and stem-end were collected from 230 hawthorn fruits. After acquiring hyperspectral images of the hawthorn fruits, the spectral data were extracted from the region of interest (ROI). Then, several pretreatment methods, namely standard normalized variate (SNV), Savitzky-Golay (SG), median filter (MF) and multiplicative scatter correction (MSC), were applied, and a partial least squares (PLS) model was built to find which gave the better performance. According to the PLS results, SNV was judged the best pretreatment method and was chosen. The spectral features of the five different regions were combined, and the regression coefficients (RCs) of a partial least squares-discriminant analysis (PLS-DA) model were used to identify the important wavelengths; ten wavebands at 483, 563, 645, 671, 686, 722, 777, 819, 837 and 942 nm were selected from all of the wavebands. Using the Kennard-Stone algorithm, all kinds of samples were randomly divided into a training set (173) and a test set (57) according to the proportion of 3:1. Then, a least squares-support vector machine (LS-SVM) discrimination model was established using the selected wavebands. The results showed that the discrimination accuracy of the method was 91.23%. On the other hand, principal component analysis (PCA) was applied to the images at the ten important wavebands. Using the "Sobel" operator and the region growing algorithm "Regiongrow", the edge and defect features of 86 hawthorn fruits could be recognized. Lastly, the detection precision for bruised, insect-damaged and two-defect samples was 95.65%, 86.67% and 100%, respectively. This investigation demonstrated that hyperspectral imaging technology could detect the defects of bruise, insect damage, calyx, and stem-end in hawthorn fruit in qualitative analysis and feature detection, which provides a theoretical reference for the nondestructive detection of defects in hawthorn fruit.
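
    The record splits samples into training and test sets with the Kennard-Stone algorithm. A minimal sketch of the classical selection rule follows; note that in its usual form the rule is deterministic, whereas the abstract describes the division as random. The sample values and the function name kennard_stone are hypothetical, and the paper's actual implementation is not given.

```python
import math

def kennard_stone(samples, n_select):
    """Return indices of n_select samples chosen by the Kennard-Stone rule.

    Assumes n_select >= 2 and Euclidean distance between samples.
    """
    n = len(samples)
    dist = [[math.dist(a, b) for b in samples] for a in samples]
    # Start with the two mutually farthest samples.
    first, second = max(
        ((i, j) for i in range(n) for j in range(i + 1, n)),
        key=lambda pair: dist[pair[0]][pair[1]],
    )
    selected = [first, second]
    remaining = [i for i in range(n) if i not in selected]
    while len(selected) < n_select and remaining:
        # Add the candidate whose nearest already-selected sample is farthest away.
        nxt = max(remaining, key=lambda i: min(dist[i][s] for s in selected))
        selected.append(nxt)
        remaining.remove(nxt)
    return selected

# Hypothetical one-band "spectra"; real samples would have many wavebands.
samples = [[0.0], [1.0], [2.0], [3.0], [10.0]]
train_idx = kennard_stone(samples, 3)
test_idx = [i for i in range(len(samples)) if i not in train_idx]
print(train_idx, test_idx)
```

    The selected indices form the training set and the rest the test set, so the training samples span the measured space as evenly as possible.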

  6. New strategy for determination of anthocyanins, polyphenols and antioxidant capacity of Brassica oleracea liquid extract using infrared spectroscopies and multivariate regression

    NASA Astrophysics Data System (ADS)

    de Oliveira, Isadora R. N.; Roque, Jussara V.; Maia, Mariza P.; Stringheta, Paulo C.; Teófilo, Reinaldo F.

    2018-04-01

    A new method was developed to determine the antioxidant properties of red cabbage extract (Brassica oleracea) by mid (MID) and near (NIR) infrared spectroscopies and partial least squares (PLS) regression. A 70% (v/v) ethanolic extract of red cabbage was concentrated to 9° Brix and further diluted (12 to 100%) in water. The dilutions were used as external standards for the building of PLS models. For the first time, this strategy was applied for building multivariate regression models. Reference analyses and spectral data were obtained from diluted extracts. The determinate properties were total and monomeric anthocyanins, total polyphenols and antioxidant capacity by ABTS (2,2-azino-bis(3-ethyl-benzothiazoline-6-sulfonate)) and DPPH (2,2-diphenyl-1-picrylhydrazyl) methods. Ordered predictors selection (OPS) and genetic algorithm (GA) were used for feature selection before PLS regression (PLS-1). In addition, a PLS-2 regression was applied to all properties simultaneously. PLS-1 models provided more predictive models than did PLS-2 regression. PLS-OPS and PLS-GA models presented excellent prediction results with a correlation coefficient higher than 0.98. However, the best models were obtained using PLS and variable selection with the OPS algorithm and the models based on NIR spectra were considered more predictive for all properties. Then, these models provided a simple, rapid and accurate method for determination of red cabbage extract antioxidant properties and its suitability for use in the food industry.

  7. Integrated fingerprint and high wavenumber confocal Raman spectroscopy for in vivo diagnosis of cervical precancer

    NASA Astrophysics Data System (ADS)

    Duraipandian, Shiyamala; Zheng, Wei; Ng, Joseph; Low, Jeffrey J. H.; Ilancheran, A.; Huang, Zhiwei

    2013-03-01

    Raman spectroscopy is a vibrational spectroscopic technique capable of optically probing the compositional, conformational, and structural changes in the tissue associated with disease progression. The main goal of this work is to develop an integrated fingerprint (FP) and high wavenumber (HW) in vivo confocal Raman spectroscopy for simultaneous FP/HW tissue Raman spectral measurements. This work further explores the potential of integrated FP/HW Raman spectroscopy developed as a diagnostic tool for in vivo detection of cervical precancer. A total of 473 in vivo integrated FP/HW Raman spectra (340 normal and 133 precancer) were acquired from 35 patients within 1 s during clinical colposcopy. The major tissue Raman peaks are noticed around 854, 937, 1001, 1095, 1253, 1313, 1445, 1654, 2946 and 3400 cm⁻¹, related to the molecular changes (e.g., proteins, lipids, glycogen, nucleic acids, water, etc.) that accompany the dysplastic transformation of tissue. The FP (800-1800 cm⁻¹), HW (2800-3800 cm⁻¹) and the integrated FP/HW Raman spectra were analyzed using partial least squares-discriminant analysis (PLS-DA) together with leave-one-patient-out cross-validation. The developed PLS-DA classification models and receiver operating characteristics (ROC) curves for the FP, HW and integrated FP/HW spectroscopy further disclose that the performance of integrated FP/HW Raman spectroscopy is superior to that of all others in discriminating the dysplastic cervix. The results of this work indicate that the co-contributions of underlying rich biochemical information revealed by the complementary spectral modalities (FP and HW Raman) can improve the in vivo early diagnosis of cervical precancer at clinical colposcopy.

  8. Decision-making in healthcare: a practical application of partial least square path modelling to coverage of newborn screening programmes

    PubMed Central

    2012-01-01

    Background Decision-making in healthcare is complex. Research on coverage decision-making has focused on comparative studies for several countries, statistical analyses for single decision-makers, the decision outcome and appraisal criteria. Accounting for decision processes extends the complexity, as they are multidimensional and process elements need to be regarded as latent constructs (composites) that are not observed directly. The objective of this study was to present a practical application of partial least square path modelling (PLS-PM) to evaluate how it offers a method for empirical analysis of decision-making in healthcare. Methods Empirical approaches that applied PLS-PM to decision-making in healthcare were identified through a systematic literature search. PLS-PM was used as an estimation technique for a structural equation model that specified hypotheses between the components of decision processes and the reasonableness of decision-making in terms of medical, economic and other ethical criteria. The model was estimated for a sample of 55 coverage decisions on the extension of newborn screening programmes in Europe. Results were evaluated by standard reliability and validity measures for PLS-PM. Results After modification by dropping two indicators that showed poor measures in the measurement models’ quality assessment and were not meaningful for newborn screening, the structural equation model estimation produced plausible results. The presence of three influences was supported: the links between both stakeholder participation or transparency and the reasonableness of decision-making; and the effect of transparency on the degree of scientific rigour of assessment. Reliable and valid measurement models were obtained to describe the composites of ‘transparency’, ‘participation’, ‘scientific rigour’ and ‘reasonableness’. Conclusions The structural equation model was among the first applications of PLS-PM to coverage decision-making. 
It allowed testing of hypotheses in situations where there are links between several non-observable constructs. PLS-PM was compatible in accounting for the complexity of coverage decisions to obtain a more realistic perspective for empirical analysis. The model specification can be used for hypothesis testing by using larger sample sizes and for data in the full domain of health technologies. PMID:22856325

  9. [Main Components of Xinjiang Lavender Essential Oil Determined by Partial Least Squares and Near Infrared Spectroscopy].

    PubMed

    Liao, Xiang; Wang, Qing; Fu, Ji-hong; Tang, Jun

    2015-09-01

    This work was undertaken to establish a quantitative analysis model that can rapidly determine the content of linalool and linalyl acetate in Xinjiang lavender essential oil. In total, 165 lavender essential oil samples were measured by near infrared (NIR) absorption spectroscopy. Analysis of the NIR absorption peaks of all samples showed that lavender essential oil carries abundant chemical information, and that the interference of random noise may be relatively low, in the spectral interval of 7100-4500 cm⁻¹. Thus, the PLS models were constructed using this interval for further analysis. Eight abnormal samples were eliminated. Through the clustering method, the remaining 157 lavender essential oil samples were divided into 105 calibration set samples and 52 validation set samples. Gas chromatography mass spectrometry (GC-MS) was used to determine the content of linalool and linalyl acetate in lavender essential oil. The matrix was then established from the GC-MS raw data of the two compounds in combination with the original NIR data. To optimize the model, different pretreatment methods were used to preprocess the raw NIR spectra and their spectral filtering effects were compared. In the quantitative model results for linalool and linalyl acetate, the root mean square error of prediction (RMSEP) with orthogonal signal transformation (OSC) was 0.226 and 0.558, respectively, making it the optimum pretreatment method. In addition, the forward interval partial least squares (FiPLS) method was used to exclude wavelength points that are unrelated to the determined components or show nonlinear correlation; finally, 8 spectral intervals totalling 160 wavelength points were obtained as the dataset. The dataset optimized by OSC-FiPLS was combined with partial least squares (PLS) to establish a rapid quantitative analysis model for determining the content of linalool and linalyl acetate in Xinjiang lavender essential oil; the number of hidden variables for the two components was 8 in the model. The performance of the model was evaluated by the root mean square error of cross-validation (RMSECV) and the root mean square error of prediction (RMSEP). In the model, the RMSECV values of linalool and linalyl acetate were 0.170 and 0.416, respectively, and the RMSEP values were 0.188 and 0.364. The results indicated that, with the raw data pretreated by OSC and FiPLS, the NIR-PLS quantitative analysis model has good robustness and high measurement precision; it could quickly determine the content of linalool and linalyl acetate in lavender essential oil. In addition, the model has a favorable prediction ability. The study also provides a new effective method for rapid quantitative analysis of the major components of Xinjiang lavender essential oil.

  10. Simultaneous determination of the impurity and radial tensile strength of reduced glutathione tablets by a high selective NIR-PLS method.

    PubMed

    Li, Juan; Jiang, Yue; Fan, Qi; Chen, Yang; Wu, Ruanqi

    2014-05-05

    This paper establishes a high-throughput and high selective method to determine the impurity named oxidized glutathione (GSSG) and radial tensile strength (RTS) of reduced glutathione (GSH) tablets based on near infrared (NIR) spectroscopy and partial least squares (PLS). In order to build and evaluate the calibration models, the NIR diffuse reflectance spectra (DRS) and transmittance spectra (TS) for 330 GSH tablets were accurately measured by using the optimized parameter values. For analyzing GSSG or RTS of GSH tablets, the NIR-DRS or NIR-TS were selected, subdivided reasonably into calibration and prediction sets, and processed appropriately with chemometric techniques. After selecting spectral sub-ranges and neglecting spectrum outliers, the PLS calibration models were built and the factor numbers were optimized. Then, the PLS models were evaluated by the root mean square errors of calibration (RMSEC), cross-validation (RMSECV) and prediction (RMSEP), and by the correlation coefficients of calibration (R(c)) and prediction (R(p)). The results indicate that the proposed models have good performances. It is thus clear that the NIR-PLS can simultaneously, selectively, nondestructively and rapidly analyze the GSSG and RTS of GSH tablets, although the contents of GSSG impurity were quite low while those of GSH active pharmaceutical ingredient (API) quite high. This strategy can be an important complement to the common NIR methods used in the on-line analysis of API in pharmaceutical preparations. And this work expands the NIR applications in the high-throughput and extraordinarily selective analysis. Copyright © 2014 Elsevier B.V. All rights reserved.

  11. Quantitative determination of wool in textile by near-infrared spectroscopy and multivariate models.

    PubMed

    Chen, Hui; Tan, Chao; Lin, Zan

    2018-08-05

    The wool content in textiles is a key quality index, and the corresponding quantitative analysis is important because adulteration is common in both raw and finished textiles. Conventional methods can be complicated, destructive, time-consuming and environmentally unfriendly, so a quick, easy-to-use and green alternative method is of interest. The work focuses on exploring the feasibility of combining near-infrared (NIR) spectroscopy with several partial least squares (PLS)-based algorithms and elastic component regression (ECR) algorithms for measuring wool content in textile. A total of 108 cloth samples with wool content ranging from 0% to 100% (w/w) were collected, and all the compositions actually exist in the market. The dataset was divided equally into the training and test sets for developing and validating calibration models. When using local PLS, the original spectrum axis was split into 20 sub-intervals. No obvious difference in performance can be seen among the local PLS models. The ECR model is comparable or superior to the other models due to its flexibility, i.e., being a transition state from PCR to PLS. It seems that ECR combined with the NIR technique may be a potential method for determining wool content in textile products. In addition, it might have regulatory advantages by avoiding time-consuming and environmentally unfriendly chemical analysis. Copyright © 2018 Elsevier B.V. All rights reserved.

  12. Quantification and Discrimination of in Vitro Regeneration Swertia nervosa at Different Growth Periods using the UPLC/UV Coupled with Chemometric Method.

    PubMed

    Li, Jie; Zhang, Ji; Zuo, Zhitian; Huang, Hengyu; Wang, Yuanzhong

    2018-05-09

    Background: Swertia nervosa (Wall. ex G. Don) C. B. Clarke, a promising traditional herbal medicine for the treatment of liver disorders, is endangered due to its extensive collection and unsustainable harvesting practices. Objective: The aim of this study is to discuss the diversity of metabolites (loganic acid, sweroside, swertiamarin, and gentiopicroside) at different growth stages and organs of Swertia nervosa using the ultra-high-performance LC (UPLC)/UV coupled with chemometric method. Methods: UPLC data, UV data, and data fusion were treated separately to find more useful information by partial least-squares discriminant analysis (PLS-DA). Hierarchical cluster analysis (HCA), an unsupervised method, was then employed for validating the results from PLS-DA. Results: Three strategies displayed different chemical information associated with the sample discrimination. UV information mainly contributed to the classification of different organs; UPLC information was prominently responsible for both organs and growth periods; the data fusion did not perform with apparent superiority compared with single data analysis, although it provided useful information to differentiate leaves that could not be recognized by UPLC. The quantification result showed that the content of swertiamarin was the highest compared with the other three metabolites, especially in leaves at the rooted stage (19.57 ± 5.34 mg/g). Therefore, we speculated that interactive transformations occurred among these four metabolites, facilitated by root formation. Conclusions: This work will contribute to exploitation of bioactive compounds of S. nervosa, as well as its large-scale propagation. Highlights: The roots formation may influence the distribution and accumulation of metabolites.

  13. Methodological aspects for metabolome visualization and characterization: a metabolomic evaluation of the 24 h evolution of human urine after cocoa powder consumption.

    PubMed

    Llorach-Asunción, R; Jauregui, O; Urpi-Sarda, M; Andres-Lacueva, C

    2010-01-20

    LC-MS based metabolomics studies produce large and complex datasets, which makes it mandatory to use appropriate tools to recover and interpret as much information as possible. In this context, a combined partial least square discriminant analysis (PLS-DA) and two-way hierarchical clustering (two-way HCA) using Bonferroni correction as filter is proposed to improve the analysis of human urinary metabolome modifications in a nutritional intervention context. After overnight fasting, 10 subjects consumed cocoa powder with milk. Urine samples were collected before ingestion of the product and at 0-6, 6-12, 12-24 h after test-meal consumption and analysed by LC-Q-ToF. The PLS-DA analysis showed a clear pattern related to the differences between the period before consumption and the other three periods, revealing relevant mass features in this separation; however, a weaker association between mass features and the three periods after cocoa consumption was observed. On the other hand, two-way HCA showed a separation of the four urine time periods and pointed out the mass features associated with the corresponding urine times. The correlation matrix revealed complex relations between the mass features that could be used for metabolite identifications and to infer the possible metabolite origin. The reported results prove that combining visualization strategies would be an excellent way to produce new bioinformatic applications that help the scientific community to unravel the complex relations between the consumption of phytochemicals and their expected effects on health.
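
    The abstract uses a Bonferroni correction as a filter before clustering but does not spell out the rule. A common reading, assumed here, is that each mass feature has a p-value from a per-feature test and is kept only if that p-value falls below alpha divided by the number of features tested. The p-values and the function name below are hypothetical.

```python
def bonferroni_filter(p_values, alpha=0.05):
    """Return indices of features whose p-value passes the Bonferroni threshold."""
    threshold = alpha / len(p_values)  # family-wise alpha split across all tests
    return [i for i, p in enumerate(p_values) if p < threshold]

# Hypothetical p-values for four mass features; threshold is 0.05 / 4 = 0.0125.
p_values = [0.001, 0.02, 0.0004, 0.5]
kept = bonferroni_filter(p_values)
print(kept)
```

    Only the features returned by the filter would then be passed on to the two-way clustering step.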

  14. Heritable IUGR and adult metabolic syndrome are reversible and associated with alterations in the metabolome following dietary supplementation of 1-carbon intermediates

    PubMed Central

    Seferovic, Maxim D.; Goodspeed, Danielle M.; Chu, Derrick M.; Krannich, Laura A.; Gonzalez-Rodriguez, Pablo J.; Cox, James E.; Aagaard, Kjersti M.

    2015-01-01

    Metabolic syndrome (MetS), following intrauterine growth restriction (IUGR), is epigenetically heritable. Recently, we abrogated the F2 adult phenotype with essential nutrient supplementation (ENS) of intermediates along the 1-carbon pathway. With the use of the same grandparental uterine artery ligation model, we profiled the F2 serum metabolome at weaning [postnatal day (d)21; n = 76] and adulthood (d160; n = 12) to test if MetS is preceded by alterations in the metabolome. Indicative of developmentally programmed MetS, adult F2, formerly IUGR rats, were obese (621 vs. 461 g; P < 0.0001), dyslipidemic (133 vs. 67 mg/dl; P < 0.001), and glucose intolerant (26 vs. 15 mg/kg/min; P < 0.01). Unbiased gas chromatography-mass spectrometry (GC-MS) profiling revealed 34 peaks corresponding to 12 nonredundant metabolites and 9 unknowns to be changing at weaning [false discovery rate (FDR) < 0.05]. Markers of later-in-life MetS included citric acid, glucosamine, myoinositol, and proline (P < 0.03). Hierarchical clustering revealed grouping by IUGR lineage and supplementation at d21 and d160. Weanlings grouped distinctly for ENS and IUGR by partial least-squares discriminate analysis (PLS-DA; P < 0.01), whereas paternal and maternal IUGR (IUGRpat/IUGRmat, respectively) control-fed rats, destined for MetS, had a distinct metabolome at weaning (randomForest analysis; class error < 0.1) and adulthood (PLS-DA; P < 0.05). In sum, we have found that alterations in the metabolome accompany heritable IUGR, precede adult-onset MetS, and are partially amenable to dietary intervention.—Seferovic, M. D., Goodspeed, D. M., Chu, D. M., Krannich, L. A., Gonzalez-Rodriguez, P. J., Cox, J. E., Aagaard, K. M. Heritable IUGR and adult metabolic syndrome are reversible and associated with alterations in the metabolome following dietary supplementation of one-carbon intermediates. PMID:25757570

  15. Metabolomic Analysis in Severe Childhood Pneumonia in The Gambia, West Africa: Findings from a Pilot Study

    PubMed Central

    Laiakis, Evagelia C.; Morris, Gerard A. J.; Fornace, Albert J.; Howie, Stephen R. C.

    2010-01-01

    Background Pneumonia remains the leading cause of death in young children globally and improved diagnostics are needed to better identify cases and reduce case fatality. Metabolomics, a rapidly evolving field aimed at characterizing metabolites in biofluids, has the potential to improve diagnostics in a range of diseases. The objective of this pilot study is to apply metabolomic analysis to childhood pneumonia to explore its potential to improve pneumonia diagnosis in a high-burden setting. Methodology/Principal Findings Eleven children with World Health Organization (WHO)-defined severe pneumonia of non-homogeneous aetiology were selected in The Gambia, West Africa, along with community controls. Metabolomic analysis of matched plasma and urine samples was undertaken using Ultra Performance Liquid Chromatography (UPLC) coupled to Time-of-Flight Mass Spectrometry (TOFMS). Biomarker extraction was done using SIMCA-P+ and Random Forests (RF). ‘Unsupervised’ (blinded) data were analyzed by Principal Component Analysis (PCA), while ‘supervised’ (unblinded) analysis was by Partial Least Squares-Discriminant Analysis (PLS-DA) and Orthogonal Projection to Latent Structures (OPLS). Potential markers were extracted from S-plots constructed following analysis with OPLS, and markers were chosen based on their contribution to the variation and correlation within the data set. The dataset was additionally analyzed with the machine-learning algorithm RF in order to address issues of model overfitting and markers were selected based on their variable importance ranking. Unsupervised PCA analysis revealed good separation of pneumonia and control groups, with even clearer separation of the groups with PLS-DA and OPLS analysis. 
Statistically significant differences (p<0.05) between groups were seen with the following metabolites: uric acid, hypoxanthine and glutamic acid were higher in plasma from cases, while L-tryptophan and adenosine-5′-diphosphate (ADP) were lower; uric acid and L-histidine were lower in urine from cases. The key limitation of this study is its small size. Conclusions/Significance Metabolomic analysis clearly distinguished severe pneumonia patients from community controls. The metabolites identified are important for the host response to infection through antioxidant, inflammatory and antimicrobial pathways, and energy metabolism. Larger studies are needed to determine whether these findings are pneumonia-specific and to distinguish organism-specific responses. Metabolomics has considerable potential to improve diagnostics for childhood pneumonia. PMID:20844590

  16. Metabolomic analysis in severe childhood pneumonia in the Gambia, West Africa: findings from a pilot study.

    PubMed

    Laiakis, Evagelia C; Morris, Gerard A J; Fornace, Albert J; Howie, Stephen R C

    2010-09-09

    Pneumonia remains the leading cause of death in young children globally and improved diagnostics are needed to better identify cases and reduce case fatality. Metabolomics, a rapidly evolving field aimed at characterizing metabolites in biofluids, has the potential to improve diagnostics in a range of diseases. The objective of this pilot study is to apply metabolomic analysis to childhood pneumonia to explore its potential to improve pneumonia diagnosis in a high-burden setting. Eleven children with World Health Organization (WHO)-defined severe pneumonia of non-homogeneous aetiology were selected in The Gambia, West Africa, along with community controls. Metabolomic analysis of matched plasma and urine samples was undertaken using Ultra Performance Liquid Chromatography (UPLC) coupled to Time-of-Flight Mass Spectrometry (TOFMS). Biomarker extraction was done using SIMCA-P+ and Random Forests (RF). 'Unsupervised' (blinded) data were analyzed by Principal Component Analysis (PCA), while 'supervised' (unblinded) analysis was by Partial Least Squares-Discriminant Analysis (PLS-DA) and Orthogonal Projection to Latent Structures (OPLS). Potential markers were extracted from S-plots constructed following analysis with OPLS, and markers were chosen based on their contribution to the variation and correlation within the data set. The dataset was additionally analyzed with the machine-learning algorithm RF in order to address issues of model overfitting and markers were selected based on their variable importance ranking. Unsupervised PCA analysis revealed good separation of pneumonia and control groups, with even clearer separation of the groups with PLS-DA and OPLS analysis. 
Statistically significant differences (p<0.05) between groups were seen with the following metabolites: uric acid, hypoxanthine and glutamic acid were higher in plasma from cases, while L-tryptophan and adenosine-5'-diphosphate (ADP) were lower; uric acid and L-histidine were lower in urine from cases. The key limitation of this study is its small size. Metabolomic analysis clearly distinguished severe pneumonia patients from community controls. The metabolites identified are important for the host response to infection through antioxidant, inflammatory and antimicrobial pathways, and energy metabolism. Larger studies are needed to determine whether these findings are pneumonia-specific and to distinguish organism-specific responses. Metabolomics has considerable potential to improve diagnostics for childhood pneumonia.

  17. Metabolic changes associated with papillary thyroid carcinoma: A nuclear magnetic resonance-based metabolomics study.

    PubMed

    Li, Yanyun; Chen, Minjian; Liu, Cuiping; Xia, Yankai; Xu, Bo; Hu, Yanhui; Chen, Ting; Shen, Meiping; Tang, Wei

    2018-05-01

    Papillary thyroid carcinoma (PTC) is the most common thyroid cancer. The nuclear magnetic resonance (NMR)-based metabolomic technique is the gold standard in metabolite structural elucidation, and can provide different coverage of information compared with other metabolomic techniques. Here, we conducted the first NMR-based metabolomics study of detailed metabolic changes, especially metabolic pathway changes, related to PTC pathogenesis. The 1H NMR-based metabolomic technique was adopted in conjunction with multivariate analysis to analyze matched tumor and normal thyroid tissues obtained from 16 patients. The results were further annotated with the Kyoto Encyclopedia of Genes and Genomes (KEGG) and the Human Metabolome Database, and then were analyzed using the pathway analysis and enrichment analysis modules of MetaboAnalyst 3.0. Based on the analytical techniques, we established models of principal component analysis (PCA), partial least squares-discriminant analysis (PLS-DA), and orthogonal partial least-squares discriminant analysis (OPLS-DA) which could discriminate PTC from normal thyroid tissue, and found 15 robust differentiated metabolites from two OPLS-DA models. We identified 8 KEGG pathways and 3 pathways of the small molecular pathway database which were significantly related to PTC by using pathway analysis and enrichment analysis, respectively, through which we identified metabolisms related to PTC including branched chain amino acid metabolism (leucine and valine), other amino acid metabolism (glycine and taurine), glycolysis (lactate), tricarboxylic acid cycle (citrate), choline metabolism (choline, ethanolamine and glycerolphosphocholine) and lipid metabolism (very-low-density lipoprotein and low-density lipoprotein). In conclusion, PTC was characterized by increased glycolysis and an inhibited tricarboxylic acid cycle, increased oncogenic amino acids, and abnormal choline and lipid metabolism. The findings in this study provide new insights into detailed metabolic changes of PTC, and hold great potential in the treatment of PTC.

  18. Differences in chewing sounds of dry-crisp snacks by multivariate data analysis

    NASA Astrophysics Data System (ADS)

    De Belie, N.; Sivertsvik, M.; De Baerdemaeker, J.

    2003-09-01

    Chewing sounds of different types of dry-crisp snacks (two types of potato chips, prawn crackers, cornflakes and low calorie snacks from extruded starch) were analysed to assess differences in sound emission patterns. The emitted sounds were recorded by a microphone placed over the ear canal. The first bite and the first subsequent chew were selected from the time signal and a fast Fourier transformation provided the power spectra. Different multivariate analysis techniques were used for classification of the snack groups. This included principal component analysis (PCA) and unfold partial least-squares (PLS) algorithms, as well as multi-way techniques such as three-way PLS, three-way PCA (Tucker3), and parallel factor analysis (PARAFAC) on the first bite and subsequent chew. The models were evaluated by calculating the classification errors and the root mean square error of prediction (RMSEP) for independent validation sets. It appeared that the logarithm of the power spectra obtained from the chewing sounds could be used successfully to distinguish the different snack groups. When different chewers were used, recalibration of the models was necessary. Multi-way models distinguished better between chewing sounds of different snack groups than PCA on bite or chew separately and than unfold PLS. From all three-way models applied, N-PLS with three components showed the best classification capabilities, resulting in classification errors of 14-18%. The major amount of incorrect classifications was due to one type of potato chips that had a very irregular shape, resulting in a wide variation of the emitted sounds.
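
    The log power spectrum used as model input above can be sketched as follows. This is a minimal illustration only: it uses a direct discrete Fourier transform in place of the fast Fourier transform named in the abstract (same result, slower), and the function names, the `floor` constant and the test signal are assumptions, not details from the study.

```python
import cmath
import math

def power_spectrum(signal):
    """Return |X[k]|^2 for k = 0 .. N/2 using a direct DFT (not an FFT)."""
    n = len(signal)
    spectrum = []
    for k in range(n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(signal))
        spectrum.append(abs(coeff) ** 2)
    return spectrum

def log_power_spectrum(signal, floor=1e-12):
    """Base-10 log of the power spectrum; `floor` avoids log(0)."""
    return [math.log10(p + floor) for p in power_spectrum(signal)]

# Synthetic 8-sample cosine with one cycle per window (not recorded chewing data).
signal = [math.cos(2 * math.pi * i / 8) for i in range(8)]
power = power_spectrum(signal)
log_power = log_power_spectrum(signal)
peak_bin = max(range(len(power)), key=power.__getitem__)
```

    For real recordings a library FFT would normally replace the inner loop; the direct form is kept here only so the sketch has no dependencies.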

  19. On-line monitoring of extraction process of Flos Lonicerae Japonicae using near infrared spectroscopy combined with synergy interval PLS and genetic algorithm

    NASA Astrophysics Data System (ADS)

    Yang, Yue; Wang, Lei; Wu, Yongjiang; Liu, Xuesong; Bi, Yuan; Xiao, Wei; Chen, Yong

    2017-07-01

    There is a growing need for the effective on-line process monitoring during the manufacture of traditional Chinese medicine to ensure quality consistency. In this study, the potential of near infrared (NIR) spectroscopy technique to monitor the extraction process of Flos Lonicerae Japonicae was investigated. A new algorithm of synergy interval PLS with genetic algorithm (Si-GA-PLS) was proposed for modeling. Four different PLS models, namely Full-PLS, Si-PLS, GA-PLS, and Si-GA-PLS, were established, and their performances in predicting two quality parameters (viz. total acid and soluble solid contents) were compared. In conclusion, Si-GA-PLS model got the best results due to the combination of superiority of Si-PLS and GA. For Si-GA-PLS, the determination coefficient (Rp2) and root-mean-square error for the prediction set (RMSEP) were 0.9561 and 147.6544 μg/ml for total acid, 0.9062 and 0.1078% for soluble solid contents, correspondingly. The overall results demonstrated that the NIR spectroscopy technique combined with Si-GA-PLS calibration is a reliable and non-destructive alternative method for on-line monitoring of the extraction process of TCM on the production scale.
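
    The two figures of merit quoted above can be computed as in the following sketch. It assumes Rp2 is the coefficient of determination on the prediction set (some papers instead report a squared correlation coefficient), and the numbers are made up for illustration, not taken from the study.

```python
import math

def rmsep(y_true, y_pred):
    """Root-mean-square error of prediction over a prediction set."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)

def r2_prediction(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_t = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

# Hypothetical reference and predicted values (illustrative only).
reference = [1.0, 2.0, 3.0, 4.0]
predicted = [1.1, 1.9, 3.2, 3.8]
error = rmsep(reference, predicted)
r2 = r2_prediction(reference, predicted)
```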

  20. Novel method for the determination of average molecular weight of natural polymers based on 2D DOSY NMR and chemometrics: Example of heparin.

    PubMed

    Monakhova, Yulia B; Diehl, Bernd W K; Do, Tung X; Schulze, Margit; Witzleben, Steffen

    2018-02-05

    Apart from the characterization of impurities, the full characterization of heparin and low molecular weight heparin (LMWH) also requires the determination of average molecular weight, which is closely related to the pharmaceutical properties of anticoagulant drugs. To determine average molecular weight of these animal-derived polymer products, partial least squares regression (PLS) was utilized for modelling of diffused-ordered spectroscopy NMR data (DOSY) of a representative set of heparin (n=32) and LMWH (n=30) samples. The same sets of samples were measured by gel permeation chromatography (GPC) to obtain reference data. The application of PLS to the data led to calibration models with root mean square error of prediction of 498Da and 179Da for heparin and LMWH, respectively. The average coefficients of variation (CVs) did not exceed 2.1% excluding sample preparation (by successive measuring one solution, n=5) and 2.5% including sample preparation (by preparing and analyzing separate samples, n=5). An advantage of the method is that the sample after standard 1D NMR characterization can be used for the molecular weight determination without further manipulation. The accuracy of multivariate models is better than the previous results for other matrices employing internal standards. Therefore, DOSY experiment is recommended to be employed for the calculation of molecular weight of heparin products as a complementary measurement to standard 1D NMR quality control. The method can be easily transferred to other matrices as well. Copyright © 2017 Elsevier B.V. All rights reserved.
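
    The coefficient of variation reported above is the standard deviation of replicate results divided by their mean. A minimal sketch, assuming the sample (n-1) standard deviation and using hypothetical replicate values rather than data from the study:

```python
import statistics

def coefficient_of_variation(values):
    """Sample coefficient of variation in percent: stdev / mean * 100."""
    return statistics.stdev(values) / statistics.mean(values) * 100.0

# Five hypothetical replicate molecular-weight results in Da (n = 5).
replicates = [15000.0, 15300.0, 14700.0, 15150.0, 14850.0]
cv_percent = coefficient_of_variation(replicates)
```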

  1. Chemometric classification of casework arson samples based on gasoline content.

    PubMed

    Sinkov, Nikolai A; Sandercock, P Mark L; Harynuk, James J

    2014-02-01

    Detection and identification of ignitable liquids (ILs) in arson debris is a critical part of arson investigations. The challenge of this task is due to the complex and unpredictable chemical nature of arson debris, which also contains pyrolysis products from the fire. ILs, most commonly gasoline, are complex chemical mixtures containing hundreds of compounds that will be consumed or otherwise weathered by the fire to varying extents depending on factors such as temperature, air flow, the surface on which IL was placed, etc. While methods such as ASTM E-1618 are effective, data interpretation can be a costly bottleneck in the analytical process for some laboratories. In this study, we address this issue through the application of chemometric tools. Prior to the application of chemometric tools such as PLS-DA and SIMCA, issues of chromatographic alignment and variable selection need to be addressed. Here we use an alignment strategy based on a ladder consisting of perdeuterated n-alkanes. Variable selection and model optimization was automated using a hybrid backward elimination (BE) and forward selection (FS) approach guided by the cluster resolution (CR) metric. In this work, we demonstrate the automated construction, optimization, and application of chemometric tools to casework arson data. The resulting PLS-DA and SIMCA classification models, trained with 165 training set samples, have provided classification of 55 validation set samples based on gasoline content with 100% specificity and sensitivity. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
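
    Sensitivity and specificity, as quoted for the validation set above, follow directly from the confusion counts. The sketch below assumes a binary labelling (1 = gasoline present, 0 = absent) and uses invented labels, not the casework data.

```python
def sensitivity_specificity(y_true, y_pred, positive=1):
    """Return (sensitivity, specificity) for binary class labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical true and predicted labels for seven samples.
true_labels = [1, 1, 1, 0, 0, 0, 0]
pred_labels = [1, 1, 0, 0, 0, 0, 1]
sens, spec = sensitivity_specificity(true_labels, pred_labels)
```

    A result of 100% for both, as reported in the abstract, corresponds to zero false negatives and zero false positives.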

  2. Determination of butter adulteration with margarine using Raman spectroscopy.

    PubMed

    Uysal, Reyhan Selin; Boyaci, Ismail Hakki; Genis, Hüseyin Efe; Tamer, Ugur

    2013-12-15

    In this study, adulteration of butter with margarine was analysed using Raman spectroscopy combined with chemometric methods (principal component analysis (PCA), principal component regression (PCR), partial least squares (PLS)) and artificial neural networks (ANNs). Different butter and margarine samples were mixed at various concentrations ranging from 0% to 100% w/w. PCA analysis was applied for the classification of butters, margarines and mixtures. PCR, PLS and ANN were used for the detection of adulteration ratios of butter. Models were created using a calibration data set and developed models were evaluated using a validation data set. The coefficient of determination (R(2)) values between actual and predicted values obtained for PCR, PLS and ANN for the validation data set were 0.968, 0.987 and 0.978, respectively. In conclusion, a combination of Raman spectroscopy with chemometrics and ANN methods can be applied for testing butter adulteration. Copyright © 2013 Elsevier Ltd. All rights reserved.

  3. Assessment of infant formula quality and composition using Vis-NIR, MIR and Raman process analytical technologies.

    PubMed

    Wang, Xiao; Esquerre, Carlos; Downey, Gerard; Henihan, Lisa; O'Callaghan, Donal; O'Donnell, Colm

    2018-06-01

    In this study, visible and near-infrared (Vis-NIR), mid-infrared (MIR) and Raman process analytical technologies were investigated for assessment of infant formula quality and compositional parameters namely preheat temperature, storage temperature, storage time, fluorescence of advanced Maillard products and soluble tryptophan (FAST) index, soluble protein, fat and surface free fat (SFF) content. PLS-DA models developed using spectral data with appropriate data pre-treatment and significant variables selected using Martens' uncertainty test had good accuracy for the discrimination of preheat temperature (92.3-100%) and storage temperature (91.7-100%). The best PLS regression models developed yielded values for the ratio of prediction error to deviation (RPD) of 3.6-6.1, 2.1-2.7, 1.7-2.9, 1.6-2.6 and 2.5-3.0 for storage time, FAST index, soluble protein, fat and SFF content prediction respectively. Vis-NIR, MIR and Raman were demonstrated to be potential PAT tools for process control and quality assurance applications in infant formula and dairy ingredient manufacture. Copyright © 2018 Elsevier B.V. All rights reserved.
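
    RPD is commonly computed as the standard deviation of the reference values divided by the prediction error. The sketch below assumes that common definition with RMSEP as the error term (some authors use a bias-corrected standard error instead); the values are illustrative only.

```python
import math
import statistics

def rpd(y_true, y_pred):
    """Ratio of the reference-value standard deviation to the RMSEP."""
    n = len(y_true)
    rmsep = math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)
    return statistics.stdev(y_true) / rmsep

# Hypothetical reference and predicted values.
reference = [1.0, 2.0, 3.0, 4.0]
predicted = [1.1, 1.9, 3.2, 3.8]
rpd_value = rpd(reference, predicted)
```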

  4. Multimodal Classification of Mild Cognitive Impairment Based on Partial Least Squares.

    PubMed

    Wang, Pingyue; Chen, Kewei; Yao, Li; Hu, Bin; Wu, Xia; Zhang, Jiacai; Ye, Qing; Guo, Xiaojuan

    2016-08-10

    In recent years, increasing attention has been given to the identification of the conversion of mild cognitive impairment (MCI) to Alzheimer's disease (AD). Brain neuroimaging techniques have been widely used to support the classification or prediction of MCI. The present study combined magnetic resonance imaging (MRI), 18F-fluorodeoxyglucose PET (FDG-PET), and 18F-florbetapir PET (florbetapir-PET) to discriminate MCI converters (MCI-c, individuals with MCI who convert to AD) from MCI non-converters (MCI-nc, individuals with MCI who have not converted to AD in the follow-up period) based on the partial least squares (PLS) method. Two types of PLS models (informed PLS and agnostic PLS) were built based on 64 MCI-c and 65 MCI-nc from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database. The results showed that the three-modality informed PLS model achieved better classification accuracy of 81.40%, sensitivity of 79.69%, and specificity of 83.08% compared with the single-modality model, and the three-modality agnostic PLS model also achieved better classification compared with the two-modality model. Moreover, combining the three modalities with clinical test score (ADAS-cog), the agnostic PLS model (independent data: florbetapir-PET; dependent data: FDG-PET and MRI) achieved optimal accuracy of 86.05%, sensitivity of 81.25%, and specificity of 90.77%. In addition, the comparison of PLS, support vector machine (SVM), and random forest (RF) showed greater diagnostic power of PLS. These results suggested that our multimodal PLS model has the potential to discriminate MCI-c from the MCI-nc and may therefore be helpful in the early diagnosis of AD.

  5. A Piecewise Local Partial Least Squares (PLS) Method for the Quantitative Analysis of Plutonium Nitrate Solutions

    DOE PAGES

    Lascola, Robert; O'Rourke, Patrick E.; Kyser, Edward A.

    2017-10-05

    Here, we have developed a piecewise local (PL) partial least squares (PLS) analysis method for total plutonium measurements by absorption spectroscopy in nitric acid-based nuclear material processing streams. Instead of using a single PLS model that covers all expected solution conditions, the method selects one of several local models based on an assessment of solution absorbance, acidity, and Pu oxidation state distribution. The local models match the global model for accuracy against the calibration set, but were observed in several instances to be more robust to variations associated with measurements in the process. The improvements are attributed to the relative parsimony of the local models. Not all of the sources of spectral variation are uniformly present at each part of the calibration range. Thus, the global model is locally overfitting and susceptible to increased variance when presented with new samples. A second set of models quantifies the relative concentrations of Pu(III), (IV), and (VI). Standards containing a mixture of these species were not at equilibrium due to a disproportionation reaction. Therefore, a separate principal component analysis is used to estimate the concentrations of the individual oxidation states in these standards in the absence of independent confirmatory analysis. The PL analysis approach is generalizable to other systems where the analysis of chemically complicated systems can be aided by rational division of the overall range of solution conditions into simpler sub-regions.

  6. A Piecewise Local Partial Least Squares (PLS) Method for the Quantitative Analysis of Plutonium Nitrate Solutions

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lascola, Robert; O'Rourke, Patrick E.; Kyser, Edward A.

    Here, we have developed a piecewise local (PL) partial least squares (PLS) analysis method for total plutonium measurements by absorption spectroscopy in nitric acid-based nuclear material processing streams. Instead of using a single PLS model that covers all expected solution conditions, the method selects one of several local models based on an assessment of solution absorbance, acidity, and Pu oxidation state distribution. The local models match the global model for accuracy against the calibration set, but were observed in several instances to be more robust to variations associated with measurements in the process. The improvements are attributed to the relative parsimony of the local models. Not all of the sources of spectral variation are uniformly present at each part of the calibration range. Thus, the global model is locally overfitting and susceptible to increased variance when presented with new samples. A second set of models quantifies the relative concentrations of Pu(III), (IV), and (VI). Standards containing a mixture of these species were not at equilibrium due to a disproportionation reaction. Therefore, a separate principal component analysis is used to estimate the concentrations of the individual oxidation states in these standards in the absence of independent confirmatory analysis. The PL analysis approach is generalizable to other systems where the analysis of chemically complicated systems can be aided by rational division of the overall range of solution conditions into simpler sub-regions.

  7. Real-time monitoring of process parameters in rice wine fermentation by a portable spectral analytical system combined with multivariate analysis.

    PubMed

    Ouyang, Qin; Zhao, Jiewen; Pan, Wenxiu; Chen, Quansheng

    2016-01-01

    A portable and low-cost spectral analytical system was developed and used to monitor real-time process parameters, i.e. total sugar content (TSC), alcohol content (AC) and pH during rice wine fermentation. Various partial least square (PLS) algorithms were implemented to construct models. The performance of a model was evaluated by the correlation coefficient (Rp) and the root mean square error (RMSEP) in the prediction set. Among the models used, the synergy interval PLS (Si-PLS) was found to be superior. The optimal performance by the Si-PLS model for the TSC was Rp = 0.8694, RMSEP = 0.438; the AC was Rp = 0.8097, RMSEP = 0.617; and the pH was Rp = 0.9039, RMSEP = 0.0805. The stability and reliability of the system, as well as the optimal models, were verified using coefficients of variation, most of which were found to be less than 5%. The results suggest this portable system is a promising tool that could be used as an alternative method for rapid monitoring of process parameters during rice wine fermentation. Copyright © 2015 Elsevier Ltd. All rights reserved.

  8. Chemical data as markers of the geographical origins of sugarcane spirits.

    PubMed

    Serafim, F A T; Pereira-Filho, Edenir R; Franco, D W

    2016-04-01

    In an attempt to classify sugarcane spirits according to their geographic region of origin, chemical data for 24 analytes were evaluated in 50 cachaças produced using a similar procedure in selected regions of Brazil: São Paulo - SP (15), Minas Gerais - MG (11), Rio de Janeiro - RJ (11), Paraiba - PB (9), and Ceará - CE (4). Multivariate analysis was applied to the analytical results, and the predictive abilities of different classification methods were evaluated. Principal component analysis identified five groups, and chemical similarities were observed between MG and SP samples and between RJ and PB samples. CE samples presented a distinct chemical profile. Among the samples, partial least squares discriminant analysis (PLS-DA) classified 50.2% of the samples correctly, K-nearest neighbor (KNN) 86%, and soft independent modeling of class analogy (SIMCA) 56.2%. Therefore, in this proof of concept demonstration, the proposed approach based on chemical data satisfactorily predicted the cachaças' geographic origins. Copyright © 2015 Elsevier Ltd. All rights reserved.

  9. Classification of illicit heroin by UPLC-Q-TOF analysis of acidic and neutral manufacturing impurities.

    PubMed

    Liu, Cuimei; Hua, Zhendong; Bai, Yanping

    2015-12-01

    The illicit manufacture of heroin results in the formation of trace levels of acidic and neutral manufacturing impurities that provide valuable information about the manufacturing process used. In this work, a new ultra performance liquid chromatography-quadrupole-time of flight mass spectrometry (UPLC-Q-TOF) method that features high resolution, mass accuracy and sensitivity for profiling neutral and acidic heroin manufacturing impurities was developed. After the UPLC-Q-TOF analysis, the retention times and m/z data pairs of acidic and neutral manufacturing impurities were detected, and 19 peaks were found to be evidently different between heroin samples from "Golden Triangle" and "Golden Crescent". Based on the data set of these 19 impurities in 150 authentic heroin samples, classification of heroin geographic origins was successfully achieved utilizing partial least squares discriminant analysis (PLS-DA). By analyzing another data set of 267 authentic heroin samples, the developed discriminant model was validated and proved to be accurate and reliable. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  10. Correlation of sensory bitterness in dairy protein hydrolysates: Comparison of prediction models built using sensory, chromatographic and electronic tongue data.

    PubMed

    Newman, J; Egan, T; Harbourne, N; O'Riordan, D; Jacquier, J C; O'Sullivan, M

    2014-08-01

    Sensory evaluation can be problematic for ingredients with a bitter taste during the research and development phase of new food products. In this study, 19 dairy protein hydrolysates (DPH) were analysed by an electronic tongue and by their physicochemical characteristics. The data obtained from these methods were correlated with their bitterness intensity as scored by a trained sensory panel, and each model was also assessed by its predictive capabilities. The physicochemical characteristics of the DPHs investigated were degree of hydrolysis (DH%), and data relating to peptide size and relative hydrophobicity from size exclusion chromatography (SEC) and reverse phase (RP) HPLC. Partial least square regression (PLS) was used to construct the prediction models. All PLS regressions had good correlations (0.78 to 0.93) with the strongest being the combination of data obtained from SEC and RP HPLC. However, the PLS with the strongest predictive power was based on the e-tongue which had the PLS regression with the lowest root mean predicted residual error sum of squares (PRESS) in the study. The results show that the PLS models constructed with the e-tongue and the combination of SEC and RP-HPLC have potential to be used for prediction of bitterness, thus reducing the reliance on sensory analysis of DPHs in future food research. Copyright © 2014 Elsevier B.V. All rights reserved.

  11. FT-IR spectroscopy and multivariate analysis as an auxiliary tool for diagnosis of mental disorders: Bipolar and schizophrenia cases

    NASA Astrophysics Data System (ADS)

    Ogruc Ildiz, G.; Arslan, M.; Unsalan, O.; Araujo-Andrade, C.; Kurt, E.; Karatepe, H. T.; Yilmaz, A.; Yalcinkaya, O. B.; Herken, H.

    2016-01-01

    In this study, a methodology based on Fourier-transform infrared spectroscopy and principal component analysis and partial least square methods is proposed for the analysis of blood plasma samples in order to identify spectral changes correlated with some biomarkers associated with schizophrenia and bipolarity. Our main goal was to use the spectral information for the calibration of statistical models to discriminate and classify blood plasma samples belonging to bipolar and schizophrenic patients. IR spectra of 30 blood plasma samples from each group (bipolar patients, schizophrenic patients and healthy controls) were collected. The results obtained from principal component analysis (PCA) show a clear discrimination between the bipolar (BP), schizophrenic (SZ) and control group (CG) blood samples and also make it possible to identify three main regions that show the major differences correlated with both mental disorders (biomarkers). Furthermore, a model for the classification of the blood samples was calibrated using partial least square discriminant analysis (PLS-DA), allowing the correct classification of BP, SZ and CG samples. The results obtained applying this methodology suggest that it can be used as a complementary diagnostic tool for the detection and discrimination of these mental diseases.

  12. Ripening-dependent metabolic changes in the volatiles of pineapple (Ananas comosus (L.) Merr.) fruit: II. Multivariate statistical profiling of pineapple aroma compounds based on comprehensive two-dimensional gas chromatography-mass spectrometry.

    PubMed

    Steingass, Christof Björn; Jutzi, Manfred; Müller, Jenny; Carle, Reinhold; Schmarr, Hans-Georg

    2015-03-01

    Ripening-dependent changes of pineapple volatiles were studied in a nontargeted profiling analysis. Volatiles were isolated via headspace solid phase microextraction and analyzed by comprehensive 2D gas chromatography and mass spectrometry (HS-SPME-GC×GC-qMS). Profile patterns presented in the contour plots were evaluated applying image processing techniques and subsequent multivariate statistical data analysis. Statistical methods comprised unsupervised hierarchical cluster analysis (HCA) and principal component analysis (PCA) to classify the samples. Supervised partial least squares discriminant analysis (PLS-DA) and partial least squares (PLS) regression were applied to discriminate different ripening stages and describe the development of volatiles during postharvest storage, respectively. Hereby, substantial chemical markers allowing for class separation were revealed. The workflow permitted the rapid distinction between premature green-ripe pineapples and postharvest-ripened sea-freighted fruits. Volatile profiles of fully ripe air-freighted pineapples were similar to those of green-ripe fruits postharvest ripened for 6 days after simulated sea freight export, after PCA with only two principal components. However, PCA considering also the third principal component allowed differentiation between air-freighted fruits and the four progressing postharvest maturity stages of sea-freighted pineapples.

  13. Determination of propranolol hydrochloride in pharmaceutical preparations using near infrared spectrometry with fiber optic probe and multivariate calibration methods.

    PubMed

    Marques Junior, Jucelino Medeiros; Muller, Aline Lima Hermes; Foletto, Edson Luiz; da Costa, Adilson Ben; Bizzi, Cezar Augusto; Irineu Muller, Edson

    2015-01-01

    A method for determination of propranolol hydrochloride in pharmaceutical preparation using near infrared spectrometry with fiber optic probe (FTNIR/PROBE) and combined with chemometric methods was developed. Calibration models were developed using two variable selection models: interval partial least squares (iPLS) and synergy interval partial least squares (siPLS). The treatments based on the mean centered data and multiplicative scatter correction (MSC) were selected for models construction. A root mean square error of prediction (RMSEP) of 8.2 mg g(-1) was achieved using siPLS (s2i20PLS) algorithm with spectra divided into 20 intervals and combination of 2 intervals (8501 to 8801 and 5201 to 5501 cm(-1)). Results obtained by the proposed method were compared with those using the pharmacopoeia reference method and significant difference was not observed. Therefore, proposed method allowed a fast, precise, and accurate determination of propranolol hydrochloride in pharmaceutical preparations. Furthermore, it is possible to carry out on-line analysis of this active principle in pharmaceutical formulations with use of fiber optic probe.
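
    The interval search behind siPLS (split the spectrum into equal intervals, then evaluate every combination of a few intervals) can be sketched as below. Only the enumeration step is shown; fitting a PLS model on each candidate is omitted. The figure of 1000 spectral variables and the function names are assumptions for illustration, while 20 intervals combined 2 at a time comes from the abstract.

```python
from itertools import combinations

def split_into_intervals(n_variables, n_intervals):
    """Split range(n_variables) into contiguous, near-equal (start, stop) pairs."""
    base, extra = divmod(n_variables, n_intervals)
    bounds = []
    start = 0
    for i in range(n_intervals):
        # The first `extra` intervals take one additional variable each.
        stop = start + base + (1 if i < extra else 0)
        bounds.append((start, stop))
        start = stop
    return bounds

def interval_combinations(bounds, k):
    """All index tuples choosing k of the intervals (candidate siPLS models)."""
    return list(combinations(range(len(bounds)), k))

# Hypothetical spectrum length; 20 intervals and pairs of 2 as in the abstract.
intervals = split_into_intervals(n_variables=1000, n_intervals=20)
candidates = interval_combinations(intervals, k=2)
```

    Each candidate would then be scored by a cross-validated error, and the pair with the lowest error kept.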

  14. Error propagation of partial least squares for parameters optimization in NIR modeling.

    PubMed

    Du, Chenzhao; Dai, Shengyun; Qiao, Yanjiang; Wu, Zhisheng

    2018-03-05

    A novel methodology is proposed to determine the error propagation of partial least-square (PLS) for parameters optimization in near-infrared (NIR) modeling. The parameters include spectral pretreatment, latent variables and variable selection. In this paper, an open source dataset (corn) and a complicated dataset (Gardenia) were used to establish PLS models under different modeling parameters. And error propagation of modeling parameters for water quantity in corn and geniposide quantity in Gardenia were presented by both type I and type II error. For example, when variable importance in the projection (VIP), interval partial least square (iPLS) and backward interval partial least square (BiPLS) variable selection algorithms were used for geniposide in Gardenia, compared with synergy interval partial least squares (SiPLS), the error weight varied from 5% to 65%, 55% and 15%. The results demonstrated how and to what extent the different modeling parameters affect error propagation of PLS for parameters optimization in NIR modeling. The larger the error weight, the worse the model. Finally, our trials finished a powerful process in developing robust PLS models for corn and Gardenia under the optimal modeling parameters. Furthermore, it could provide a significant guidance for the selection of modeling parameters of other multivariate calibration models. Copyright © 2017. Published by Elsevier B.V.

  15. Error propagation of partial least squares for parameters optimization in NIR modeling

    NASA Astrophysics Data System (ADS)

    Du, Chenzhao; Dai, Shengyun; Qiao, Yanjiang; Wu, Zhisheng

    2018-03-01

    A novel methodology is proposed to determine the error propagation of partial least-square (PLS) for parameters optimization in near-infrared (NIR) modeling. The parameters include spectral pretreatment, latent variables and variable selection. In this paper, an open source dataset (corn) and a complicated dataset (Gardenia) were used to establish PLS models under different modeling parameters. And error propagation of modeling parameters for water quantity in corn and geniposide quantity in Gardenia were presented by both type I and type II error. For example, when variable importance in the projection (VIP), interval partial least square (iPLS) and backward interval partial least square (BiPLS) variable selection algorithms were used for geniposide in Gardenia, compared with synergy interval partial least squares (SiPLS), the error weight varied from 5% to 65%, 55% and 15%. The results demonstrated how and to what extent the different modeling parameters affect error propagation of PLS for parameters optimization in NIR modeling. The larger the error weight, the worse the model. Finally, our trials finished a powerful process in developing robust PLS models for corn and Gardenia under the optimal modeling parameters. Furthermore, it could provide a significant guidance for the selection of modeling parameters of other multivariate calibration models.

  16. Differentiation of Organically and Conventionally Grown Tomatoes by Chemometric Analysis of Combined Data from Proton Nuclear Magnetic Resonance and Mid-infrared Spectroscopy and Stable Isotope Analysis.

    PubMed

    Hohmann, Monika; Monakhova, Yulia; Erich, Sarah; Christoph, Norbert; Wachter, Helmut; Holzgrabe, Ulrike

    2015-11-04

    Because the basic suitability of proton nuclear magnetic resonance spectroscopy ((1)H NMR) to differentiate organic versus conventional tomatoes was recently proven, the approach to optimize (1)H NMR classification models (comprising overall 205 authentic tomato samples) by including additional data of isotope ratio mass spectrometry (IRMS, δ(13)C, δ(15)N, and δ(18)O) and mid-infrared (MIR) spectroscopy was assessed. Both individual and combined analytical methods ((1)H NMR + MIR, (1)H NMR + IRMS, MIR + IRMS, and (1)H NMR + MIR + IRMS) were examined using principal component analysis (PCA), partial least squares discriminant analysis (PLS-DA), linear discriminant analysis (LDA), and common components and specific weight analysis (ComDim). With regard to classification abilities, fused data of (1)H NMR + MIR + IRMS yielded better validation results (ranging between 95.0 and 100.0%) than individual methods ((1)H NMR, 91.3-100%; MIR, 75.6-91.7%), suggesting that the combined examination of analytical profiles enhances authentication of organically produced tomatoes.

  17. Simultaneous fingerprint and high-wavenumber fiber-optic Raman spectroscopy improves in vivo diagnosis of esophageal squamous cell carcinoma at endoscopy

    NASA Astrophysics Data System (ADS)

    Wang, Jianfeng; Lin, Kan; Zheng, Wei; Yu Ho, Khek; Teh, Ming; Guan Yeoh, Khay; Huang, Zhiwei

    2015-08-01

    This work aims to evaluate clinical value of a fiber-optic Raman spectroscopy technique developed for in vivo diagnosis of esophageal squamous cell carcinoma (ESCC) during clinical endoscopy. We have developed a rapid fiber-optic Raman endoscopic system capable of simultaneously acquiring both fingerprint (FP)(800-1800 cm-1) and high-wavenumber (HW)(2800-3600 cm-1) Raman spectra from esophageal tissue in vivo. A total of 1172 in vivo FP/HW Raman spectra were acquired from 48 esophageal patients undergoing endoscopic examination. The total Raman dataset was split into two parts: 80% for training; while 20% for testing. Partial least squares-discriminant analysis (PLS-DA) and leave-one patient-out, cross validation (LOPCV) were implemented on training dataset to develop diagnostic algorithms for tissue classification. PLS-DA-LOPCV shows that simultaneous FP/HW Raman spectroscopy on training dataset provides a diagnostic sensitivity of 97.0% and specificity of 97.4% for ESCC classification. Further, the diagnostic algorithm applied to the independent testing dataset based on simultaneous FP/HW Raman technique gives a predictive diagnostic sensitivity of 92.7% and specificity of 93.6% for ESCC identification, which is superior to either FP or HW Raman technique alone. This work demonstrates that the simultaneous FP/HW fiber-optic Raman spectroscopy technique improves real-time in vivo diagnosis of esophageal neoplasia at endoscopy.
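
    Leave-one-patient-out cross validation holds out all spectra from one patient per fold, so spectra from the same patient never appear in both the training and the test part. A minimal sketch of the splitting step, with hypothetical patient identifiers and no model fitting:

```python
def leave_one_patient_out(patient_ids):
    """Yield (held_out_patient, train_indices, test_indices) for each patient."""
    for patient in sorted(set(patient_ids)):
        test_idx = [i for i, pid in enumerate(patient_ids) if pid == patient]
        train_idx = [i for i, pid in enumerate(patient_ids) if pid != patient]
        yield patient, train_idx, test_idx

# Hypothetical mapping of six spectra to three patients.
patient_ids = ["p1", "p1", "p2", "p3", "p3", "p3"]
folds = list(leave_one_patient_out(patient_ids))
```

    Grouping by patient rather than by spectrum matters here because each patient contributes many spectra (1172 spectra from 48 patients in the abstract), and splitting by spectrum would leak patient-specific signal into the held-out fold.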

  18. Gait characteristics and their discriminative power in geriatric patients with and without cognitive impairment.

    PubMed

    Kikkert, Lisette H J; Vuillerme, Nicolas; van Campen, Jos P; Appels, Bregje A; Hortobágyi, Tibor; Lamoth, Claudine J C

    2017-08-15

    A detailed gait analysis (e.g., measures related to speed, self-affinity, stability, and variability) can help to unravel the underlying causes of gait dysfunction, and identify cognitive impairment. However, because geriatric patients present with multiple conditions that also affect gait, results from healthy old adults cannot easily be extrapolated to geriatric patients. Hence, we (1) quantified gait outcomes based on dynamical systems theory, and (2) determined their discriminative power in three groups: healthy old adults, geriatric patients with cognitive impairment, and geriatric patients without cognitive impairment. For the present cross-sectional study, 25 healthy old adults recruited from the community (65 ± 5.5 years), and 70 geriatric patients with (n = 39) and without (n = 31) cognitive impairment from the geriatric dayclinic of the MC Slotervaart hospital in Amsterdam (80 ± 6.6 years) were included. Participants walked for 3 min during single- and dual-tasking at self-selected speed while 3D trunk accelerations were registered with an iPod Touch G4. We quantified 23 gait outcomes that reflect multiple gait aspects. A multivariate model was built using Partial Least Squares-Discriminant Analysis (PLS-DA) that best modelled participant group from gait outcomes. For single-task walking, the PLS-DA model consisted of 4 Latent Variables that explained 63 and 41% of the variance in gait outcomes and group, respectively. Outcomes related to speed, regularity, predictability, and stability of trunk accelerations showed the highest discriminative power (VIP > 1). A high proportion of healthy old adults (96 and 93% for single- and dual-task, respectively) was correctly classified based on the gait outcomes. The discrimination of geriatric patients with and without cognitive impairment was poor, with 57% (single-task) and 64% (dual-task) of the patients misclassified. While geriatric patients vs. healthy old adults walked slower, and less regular, predictable, and stable, we found no differences in gait between geriatric patients with and without cognitive impairment. The effects of multiple comorbidities on geriatric patients' gait possibly cause a 'floor-effect', with no room for further deterioration when patients develop cognitive impairment. An accurate identification of cognitive status thus necessitates a multifactorial approach.
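
    The VIP > 1 criterion mentioned in this record can be made concrete with a short Python sketch of the commonly used variable importance in projection formula. The abstract does not give the authors' exact computation, so this is an assumption; the function name and the argument layout (one weight vector per latent variable, plus the Y sum of squares explained by each latent variable) are hypothetical.

```python
from math import sqrt


def vip_scores(weights, ssy):
    """Variable importance in projection.

    weights[a][j]: PLS weight of variable j on latent variable a.
    ssy[a]: sum of squares of Y explained by latent variable a.
    VIP_j = sqrt(p * sum_a(ssy_a * w_ja^2 / ||w_a||^2) / sum_a(ssy_a)),
    where p is the number of variables.
    """
    n_vars = len(weights[0])
    total_ssy = sum(ssy)
    vips = []
    for j in range(n_vars):
        acc = 0.0
        for w_a, ssy_a in zip(weights, ssy):
            norm_sq = sum(v * v for v in w_a)
            acc += ssy_a * w_a[j] ** 2 / norm_sq
        vips.append(sqrt(n_vars * acc / total_ssy))
    return vips
```

    With this definition the squared VIP values average to 1 across variables, which is why 1 is the usual cutoff: a variable with VIP > 1 contributes more than the average variable to explaining the response.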

  19. Determination of polyphenolic compounds of red wines by UV-VIS-NIR spectroscopy and chemometrics tools.

    PubMed

    Martelo-Vidal, M J; Vázquez, M

    2014-09-01

    Spectral analysis is a quick and non-destructive method to analyse wine. In this work, trans-resveratrol, oenin, malvin, catechin, epicatechin, quercetin and syringic acid were determined in commercial red wines from DO Rías Baixas and DO Ribeira Sacra (Spain) by UV-VIS-NIR spectroscopy. Calibration models were developed using principal component regression (PCR) or partial least squares (PLS) regression. HPLC was used as the reference method. The results showed that reliable PLS models were obtained to quantify all polyphenols for Rías Baixas wines. For Ribeira Sacra, feasible models were obtained to determine quercetin, epicatechin, oenin and syringic acid. PCR calibration models gave less reliable predictions than PLS models. For red wines from mencía grapes, feasible models were obtained for catechin and oenin, regardless of the geographical origin. The results obtained demonstrate that UV-VIS-NIR spectroscopy can be used to determine individual polyphenolic compounds in red wines. Copyright © 2014 Elsevier Ltd. All rights reserved.

  20. A preliminary MTD-PLS study for androgen receptor binding of steroid compounds

    NASA Astrophysics Data System (ADS)

    Bora, Alina; Seclaman, E.; Kurunczi, L.; Funar-Timofei, Simona

    The relative binding affinities (RBA) of a series of 30 steroids for Human Androgen Receptor (AR) were used to initiate a MTD-PLS study. The 3D structures of all the compounds were obtained through geometry optimization in the framework of the AM1 semiempirical quantum chemical method. The MTD hypermolecule (HM) was constructed by superposing these structures on the AR-bound dihydrotestosterone (DHT) skeleton obtained from PDB (AR complex, ID 1I37). The parameters characterizing the HM vertices were collected using: AM1 charges, XlogP fragmental values, calculated fragmental polarizabilities (from refractivities), volumes, and H-bond parameters (Raevsky's thermodynamic originated scale). The resulting QSAR data matrix was submitted to PCA (Principal Component Analysis) and PLS (Projections in Latent Structures) procedures (SIMCA P 9.0); five compounds were selected as the test set, and the remaining 25 molecules were used as the training set. In the PLS procedure supplementary chemical information was introduced, i.e. the steric effect was always considered detrimental, and the hydrophobic and van der Waals interactions were imposed to be beneficial. The initial PLS model using the entire training set has the following characteristics: R2Y = 0.584, Q2 = 0.344. Based on distance-to-model criteria (DMODX and DMODY), five compounds were eliminated and the final model obtained had the following characteristics: R2Y = 0.891, Q2 = 0.591. For this model, the external predictivity on the test set was unsatisfactory. A tentative explanation for this behavior is the weak information content of the input QSAR matrix for the present series compared with other successful MTD-PLS modeling published elsewhere.

  1. Differentiation of Chinese rice wines from different wineries based on mineral elemental fingerprinting.

    PubMed

    Shen, Fei; Wu, Jian; Ying, Yibin; Li, Bobin; Jiang, Tao

    2013-12-15

    Discrimination of Chinese rice wines from three well-known wineries ("Guyuelongshan", "Kuaijishan", and "Pagoda") in China has been carried out according to mineral element contents in this study. Nineteen macro and trace mineral elements (Na, Mg, Al, K, Ca, Mn, Fe, Cu, Zn, V, Cr, Co, Ni, As, Se, Mo, Cd, Ba and Pb) were determined by inductively coupled plasma mass spectrometry (ICP-MS) in 117 samples. Then the experimental data were subjected to analysis of variance (ANOVA) and principal component analysis (PCA) to reveal significant differences and potential patterns between samples. Stepwise linear discriminant analysis (LDA) and partial least square discriminant analysis (PLS-DA) were applied to develop classification models and achieved correct classified rates of 100% and 97.4% for the prediction sample set, respectively. The discrimination could be attributed to different raw materials (mainly water) and elaboration processes employed. The results indicate that the element compositions combined with multivariate analysis can be used as fingerprinting techniques to protect prestigious wineries and enable the authenticity of Chinese rice wine. Copyright © 2013 Elsevier Ltd. All rights reserved.

  2. Ultrasound-triggered effects of the microbubbles coupled to GDNF- and Nurr1-loaded PEGylated liposomes in a rat model of Parkinson's disease.

    PubMed

    Yue, Peijian; Gao, Lin; Wang, Xuejing; Ding, Xuebing; Teng, Junfang

    2018-06-01

    The purpose of this study was to investigate ultrasound-triggered effects of the glial cell line-derived neurotrophic factor (GDNF) + nuclear receptor-related factor 1 (Nurr1)-polyethylene glycol (PEG)ylated liposomes-coupled microbubbles (PLs-GDNF + Nurr1-MBs) on behavioral impairment and neuron loss in a rat model of Parkinson's disease (PD). The unloaded PEGylated liposomes-coupled microbubbles (PLs-MBs) were characterized for zeta potential, particle size, and concentration. 6-hydroxydopamine (6-OHDA) was used to establish the PD rat model. Rotational, climbing pole, and suspension tests were used to detect behavioral impairment. The immunohistochemical staining of tyrosine hydroxylase (TH) and dopamine transporter (DAT) was used to assess the neuron loss. Western blot and quantitative real-time PCR (qRT-PCR) analysis were used to measure the expression levels of GDNF and Nurr1. The particle size of PLs-MBs gradually increased, while the concentration and absolute zeta potential gradually decreased over time. 6-OHDA increased amphetamine-induced rotations and loss of dopaminergic neurons as compared to the sham group. Interestingly, PLs-GDNF-MBs or PLs-Nurr1-MBs decreased rotations and increased the TH and DAT immunoreactivity. Combining both genes resulted in a robust reduction in the rotations and a greater increase of the dopaminergic neurons. The delivery of PLs-GDNF + Nurr1-MBs into the brain using magnetic resonance imaging (MRI)-guided focused ultrasound may be more efficacious for the treatment of PD than the single treatment. © 2017 Wiley Periodicals, Inc.

  3. Association between the metabolome and low bone mineral density in Taiwanese women determined by (1)H NMR spectroscopy.

    PubMed

    You, Ying-Shu; Lin, Ching-Yu; Liang, Hao-Jan; Lee, Shen-Hung; Tsai, Keh-Sung; Chiou, Jeng-Min; Chen, Yen-Ching; Tsao, Chwen-Keng; Chen, Jen-Hau

    2014-01-01

    Osteoporosis is related to the alteration of specific circulating metabolites. However, previous studies on only a few metabolites inadequately explain the pathogenesis of this complex syndrome. To date, no study has related the metabolome to bone mineral density (BMD), which would provide an overview of metabolism status and may be useful in clinical practice. This cross-sectional study involved 601 healthy Taiwanese women aged 40 to 55 years recruited from MJ Health Management Institution between 2009 and 2010. Participants were classified according to high (2nd tertile plus 3rd tertile) and low (1st tertile) BMD groups. The plasma metabolome was evaluated by proton nuclear magnetic resonance spectroscopy ((1)H NMR). Principal components analysis (PCA), partial least-squares discriminant analysis (PLS-DA), and logistic regression analysis were used to assess the association between the metabolome and BMD. The high and low BMD groups could be differentiated by PLS-DA but not PCA in postmenopausal women (Q(2) = 0.05, p(permutation) = 0.04). Among postmenopausal women, elevated glutamine was significantly associated with low BMD (adjusted odds ratio [AOR] = 5.10); meanwhile, elevated lactate (AOR = 0.55), acetone (AOR = 0.51), lipids (AOR = 0.04), and very low-density lipoprotein (AOR = 0.49) protected against low BMD. To the best of our knowledge, this study is the first to identify a group of metabolites for characterizing low BMD in postmenopausal women using a (1)H NMR-based metabolomic approach. The metabolic profile may be useful for predicting the risk of osteoporosis in postmenopausal women at an early age. © 2014 American Society for Bone and Mineral Research.
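
    The two model statistics quoted in this record, a cross-validated Q(2) and a permutation p-value, can be sketched in Python as follows. The abstract does not say how they were computed, so the definitions below are assumptions: Q(2) as 1 - PRESS/TSS on cross-validated predictions, and the permutation p-value with the common "+1" correction. The function names are hypothetical.

```python
def q2(y_true, y_cv_pred):
    """Q2 = 1 - PRESS / TSS, where y_cv_pred are cross-validated
    predictions (each made by a model that did not see that sample)."""
    mean_y = sum(y_true) / len(y_true)
    press = sum((t - p) ** 2 for t, p in zip(y_true, y_cv_pred))
    tss = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - press / tss


def permutation_p_value(observed, permuted):
    """Fraction of label-permuted models scoring at least as well as the
    real model; the +1 terms count the observed model itself."""
    at_least = sum(1 for value in permuted if value >= observed)
    return (at_least + 1) / (len(permuted) + 1)
```

    Under these definitions, a small Q(2) can still carry a significant permutation p-value when almost no model fitted to shuffled class labels reaches it.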

  4. Mass spectrometry for the characterization of brewing process.

    PubMed

    Vivian, Adriana Fu; Aoyagui, Caroline Tiemi; de Oliveira, Diogo Noin; Catharino, Rodrigo Ramos

    2016-11-01

    Beer is a carbonated alcoholic beverage produced by fermenting ingredients containing starch, especially malted cereals, and other compounds such as water, hops and yeast. The process comprises five main steps: malting, mashing, boiling, fermentation and maturation. There has been growing interest in the subject, since there is increasing demand for beer quality aspects and beer is a ubiquitous alcoholic beverage in the world. This study is based on the manufacturing process of a Brazilian craft brewery, which is characterized by withdrawing samples during key production stages and using electrospray ionization (ESI) high-resolution mass spectrometry (HRMS), a selective and reliable technique used in the identification of substances in an expeditious and practical way. Multivariate data analysis, namely partial least squares discriminant analysis (PLS-DA) is used to define its markers. In both positive and negative modes of PLS-DA score plot, it is possible to notice differences between each stage. VIP score analysis pointed out markers coherent with the process, such as barley components ((+)-catechin), small peptide varieties, hop content (humulone), yeast metabolic compounds and, in maturation, flavoring compounds (caproic acid, glutaric acid and 2,3-butanediol). Besides that, it was possible to identify other important substances such as off-flavor precursors and other different trace compounds, according to the focus given. This is an attractive alternative for the control of food and beverage industry, allowing a quick assessment of process status before it is finished, preventing higher production costs, ensuring quality and helping the control of desirable features, as flavor, foam stability and drinkability. Covering different classes of compounds, this approach suggests a novel analytical strategy: "processomics", aiming at understanding processes in detail, promoting control and being able to make improvements. Copyright © 2016 Elsevier Ltd. All rights reserved.

  5. Analysis of pork adulteration in beef meatball using Fourier transform infrared (FTIR) spectroscopy.

    PubMed

    Rohman, A; Sismindari; Erwanto, Y; Che Man, Yaakob B

    2011-05-01

    Meatball is one of the favorite foods in Indonesia. Adulteration of beef meatball with pork occurs frequently. This study aimed to develop a fast and non-destructive technique for the detection and quantification of pork in beef meatball using Fourier transform infrared (FTIR) spectroscopy and partial least square (PLS) calibration. The spectral bands associated with pork fat (PF), beef fat (BF), and their mixtures in meatball formulation were scanned, interpreted, and identified by relating them to those spectroscopically representative of pure PF and BF. For quantitative analysis, PLS regression was used to develop a calibration model at the selected fingerprint region of 1200-1000 cm(-1). The equation obtained for the relationship between actual PF values and FTIR predicted values in the PLS calibration model was y = 0.999x + 0.004, with a coefficient of determination (R(2)) of 0.999 and a root mean square error of calibration of 0.442. The PLS calibration model was subsequently used for the prediction of independent samples using laboratory-made meatball samples containing mixtures of BF and PF. Using 4 principal components, the root mean square error of prediction was 0.742. The results showed that FTIR spectroscopy can be used for the detection and quantification of pork in beef meatball formulation for Halal verification purposes. Copyright © 2010 The American Meat Science Association. Published by Elsevier Ltd. All rights reserved.
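
    The calibration figures in this record (a fitted line of predicted against actual values, its R(2), and root mean square errors of calibration and prediction) follow from standard formulas, sketched below in Python. The function names are hypothetical and no data from the study are reproduced; the same rmse function gives RMSEC when applied to the calibration samples and RMSEP when applied to independent samples.

```python
from math import sqrt


def fit_line(actual, predicted):
    """Least-squares line predicted = slope * actual + intercept,
    returned with the coefficient of determination R^2 of that line."""
    n = len(actual)
    mean_x = sum(actual) / n
    mean_y = sum(predicted) / n
    sxx = sum((x - mean_x) ** 2 for x in actual)
    syy = sum((y - mean_y) ** 2 for y in predicted)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(actual, predicted))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r2 = sxy ** 2 / (sxx * syy)
    return slope, intercept, r2


def rmse(actual, predicted):
    """Root mean square error between reference and predicted values."""
    n = len(actual)
    return sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / n)
```

    A slope near 1 and an intercept near 0, as in the equation reported above, indicate that predicted values track the reference values without proportional or constant bias.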

  6. Cross-cultural adaptation and validation of the teamwork climate scale.

    PubMed

    Silva, Mariana Charantola; Peduzzi, Marina; Sangaleti, Carine Teles; Silva, Dirceu da; Agreli, Heloise Fernandes; West, Michael A; Anderson, Neil R

    2016-08-22

    To adapt and validate the Team Climate Inventory scale, of teamwork climate measurement, for the Portuguese language, in the context of primary health care in Brazil. Methodological study with quantitative approach of cross-cultural adaptation (translation, back-translation, synthesis, expert committee, and pretest) and validation with 497 employees from 72 teams of the Family Health Strategy in the city of Campinas, SP, Southeastern Brazil. We verified reliability by the Cronbach's alpha, construct validity by the confirmatory factor analysis with SmartPLS software, and correlation by the job satisfaction scale. We problematized the overlap of items 9, 11, and 12 of the "participation in the team" factor and the "team goals" factor regarding its definition. The validation showed no overlapping of items and the reliability ranged from 0.92 to 0.93. The confirmatory factor analysis indicated suitability of the proposed model with distribution of the 38 items in the four factors. The correlation between teamwork climate and job satisfaction was significant. The version of the scale in Brazilian Portuguese was validated and can be used in the context of primary health care in the Country, constituting an adequate tool for the assessment and diagnosis of teamwork.

  7. Tissue metabolic changes for effects of pirfenidone in rats of acute paraquat poisoning by GC-MS.

    PubMed

    Ma, Jianshe; Sun, Fa; Chen, Bingbao; Tu, Xiaoting; Peng, Xiufa; Wen, Congcong; Hu, Lufeng; Wang, Xianqin

    2017-12-01

    We developed a metabolomic method to evaluate the effect of pirfenidone on rats with acute paraquat (PQ) poisoning, through the analysis of various tissues (lung, liver, kidney, and heart), by gas chromatography-mass spectrometry (GC-MS). Thirty-eight rats were randomly divided into a control group, an acute PQ (20 mg kg(-1)) poisoning group, a pirfenidone (20 mg kg(-1)) treatment group, and a pirfenidone (40 mg kg(-1)) treatment group. Partial least squares-discriminant analysis (PLS-DA) revealed metabolic alterations in rat tissue samples from the two pirfenidone treatment groups after acute PQ poisoning. The PLS-DA 3D score chart showed that the rats in the acute PQ poisoning group were clearly distinguished from the rats in the control group. Also, the two pirfenidone treatment groups were distinguished from the acute PQ poisoning group and control group. Additionally, the pirfenidone (40 mg kg(-1)) treatment group was separated farther than the pirfenidone (20 mg kg(-1)) treatment group from the acute PQ poisoning group. Evaluation of the pathological changes in the rat tissues revealed that treatment with pirfenidone appeared to decrease pulmonary fibrosis in the acute PQ poisoning rats. The results indicate that pirfenidone induced beneficial metabolic alterations in the tissues of rats with acute PQ poisoning. Rats with acute PQ poisoning exhibited a certain reduction in biochemical indicators after treatment with pirfenidone, indicating that pirfenidone could protect liver and kidney function. Accordingly, the developed metabolomic approach proved to be useful to elucidate the effect of pirfenidone in rats with acute PQ poisoning.

  8. Rapid detection of Pseudomonas aeruginosa biomarkers in biological fluids using surface-enhanced Raman scattering

    NASA Astrophysics Data System (ADS)

    Wu, Xiaomeng; Chen, Jing; Zhao, Yiping; Zughaier, Susu M.

    2014-05-01

    Pseudomonas aeruginosa (PA) is an opportunistic pathogen that causes major infection not only in Cystic Fibrosis (CF) patients but also in chronic obstructive pulmonary disease and in critically ill patients in intensive care units. Successful antibiotic treatment of the infection relies on accurate and rapid identification of the infectious agents. Conventional microbiological detection methods usually take more than 3 days to obtain accurate results. We have developed a rapid diagnostic technique based on surface-enhanced Raman scattering (SERS) to directly identify PA from biological fluids. P. aeruginosa strains, PAO1 and PA14, are cultured in lysogeny broth, and the SERS spectra of the broth show the signature Raman peaks from pyocyanin and pyoverdine, two major biomarkers that P. aeruginosa secretes during its growth, as well as lipopolysaccharides. This provides evidence that the presence of these biomarkers can be used to indicate P. aeruginosa infection. A total of 22 clinical exhaled breath condensate (EBC) samples were obtained from subjects with CF disease and from non-CF healthy donors. SERS spectra of these EBC samples were obtained and further analyzed by both principal component analysis and partial least square-discriminant analysis (PLS-DA). PLS-DA can discriminate the samples with P. aeruginosa infection from the ones without P. aeruginosa infection at 99.3% sensitivity and 99.6% specificity. In addition, this technique can also discriminate samples from subjects with CF disease and healthy donors with 97.5% sensitivity and 100% specificity. These results demonstrate the potential of using SERS of EBC samples as a rapid diagnostic tool to detect PA infection.

  9. Selected-ion flow-tube mass-spectrometry (SIFT-MS) fingerprinting versus chemical profiling for geographic traceability of Moroccan Argan oils.

    PubMed

    Kharbach, Mourad; Kamal, Rabie; Mansouri, Mohammed Alaoui; Marmouzi, Ilias; Viaene, Johan; Cherrah, Yahia; Alaoui, Katim; Vercammen, Joeri; Bouklouze, Abdelaziz; Vander Heyden, Yvan

    2018-10-15

    This study investigated the effectiveness of SIFT-MS versus chemical profiling, both coupled to multivariate data analysis, to classify 95 Extra Virgin Argan Oils (EVAO), originating from five Moroccan Argan forest locations. The full scan option of SIFT-MS is suitable to indicate the geographic origin of EVAO based on the fingerprints obtained using the three chemical ionization precursors (H3O+, NO+ and O2+). The chemical profiling (including acidity, peroxide value, spectrophotometric indices, fatty acids, tocopherol and sterol composition) was also used for classification. Partial least squares discriminant analysis (PLS-DA), soft independent modeling of class analogy (SIMCA), K-nearest neighbors (KNN), and support vector machines (SVM) were compared. The SIFT-MS data were therefore fed to variable-selection methods to find potential biomarkers for classification. The classification models based either on chemical profiling or SIFT-MS data were able to classify the samples with high accuracy. SIFT-MS was found to be advantageous for rapid geographic classification. Copyright © 2018 Elsevier Ltd. All rights reserved.

  10. Assessing the varietal origin of extra-virgin olive oil using liquid chromatography fingerprints of phenolic compound, data fusion and chemometrics.

    PubMed

    Bajoub, Aadil; Medina-Rodríguez, Santiago; Gómez-Romero, María; Ajal, El Amine; Bagur-González, María Gracia; Fernández-Gutiérrez, Alberto; Carrasco-Pancorbo, Alegría

    2017-01-15

    High Performance Liquid Chromatography (HPLC) with diode array (DAD) and fluorescence (FLD) detection was used to acquire the fingerprints of the phenolic fraction of monovarietal extra-virgin olive oils (extra-VOOs) collected over three consecutive crop seasons (2011/2012-2013/2014). The chromatographic fingerprints of 140 extra-VOO samples processed from olive fruits of seven olive varieties, were recorded and statistically treated for varietal authentication purposes. First, DAD and FLD chromatographic-fingerprint datasets were separately processed and, subsequently, were joined using "Low-level" and "Mid-Level" data fusion methods. After the preliminary examination by principal component analysis (PCA), three supervised pattern recognition techniques, Partial Least Squares Discriminant Analysis (PLS-DA), Soft Independent Modeling of Class Analogies (SIMCA) and K-Nearest Neighbors (k-NN) were applied to the four chromatographic-fingerprinting matrices. The classification models built were very sensitive and selective, showing considerably good recognition and prediction abilities. The combination "chromatographic dataset+chemometric technique" allowing the most accurate classification for each monovarietal extra-VOO was highlighted. Copyright © 2016 Elsevier Ltd. All rights reserved.

  11. Towards molecular design using 2D-molecular contour maps obtained from PLS regression coefficients

    NASA Astrophysics Data System (ADS)

    Borges, Cleber N.; Barigye, Stephen J.; Freitas, Matheus P.

    2017-12-01

    The multivariate image analysis descriptors used in quantitative structure-activity relationships are direct representations of chemical structures as they are simply numerical decodifications of pixels forming the 2D chemical images. These MDs have found great utility in the modeling of diverse properties of organic molecules. Given the multicollinearity and high dimensionality of the data matrices generated with the MIA-QSAR approach, modeling techniques that involve the projection of the data space onto orthogonal components e.g. Partial Least Squares (PLS) have been generally used. However, the chemical interpretation of the PLS-based MIA-QSAR models, in terms of the structural moieties affecting the modeled bioactivity has not been straightforward. This work describes the 2D-contour maps based on the PLS regression coefficients, as a means of assessing the relevance of single MIA predictors to the response variable, and thus allowing for the structural, electronic and physicochemical interpretation of the MIA-QSAR models. A sample study to demonstrate the utility of the 2D-contour maps to design novel drug-like molecules is performed using a dataset of some anti-HIV-1 2-amino-6-arylsulfonylbenzonitriles and derivatives, and the inferences obtained are consistent with other reports in the literature. In addition, the different schemes for encoding atomic properties in molecules are discussed and evaluated.

  12. Discrimination and prediction of cultivation age and parts of Panax ginseng by Fourier-transform infrared spectroscopy combined with multivariate statistical analysis.

    PubMed

    Lee, Byeong-Ju; Kim, Hye-Youn; Lim, Sa Rang; Huang, Linfang; Choi, Hyung-Kyoon

    2017-01-01

    Panax ginseng C.A. Meyer is a herb used for medicinal purposes, and its discrimination according to cultivation age has been an important and practical issue. This study employed Fourier-transform infrared (FT-IR) spectroscopy with multivariate statistical analysis to obtain a prediction model for discriminating cultivation ages (5 and 6 years) and three different parts (rhizome, tap root, and lateral root) of P. ginseng. The optimal partial-least-squares regression (PLSR) models for discriminating ginseng samples were determined by selecting normalization methods, number of partial-least-squares (PLS) components, and variable influence on projection (VIP) cutoff values. The best prediction model for discriminating 5- and 6-year-old ginseng was developed using tap root, vector normalization applied after the second differentiation, one PLS component, and a VIP cutoff of 1.0 (based on the lowest root-mean-square error of prediction value). In addition, for discriminating among the three parts of P. ginseng, optimized PLSR models were established using data sets obtained from vector normalization, two PLS components, and VIP cutoff values of 1.5 (for 5-year-old ginseng) and 1.3 (for 6-year-old ginseng). To our knowledge, this is the first study to provide a novel strategy for rapidly discriminating the cultivation ages and parts of P. ginseng using FT-IR by selected normalization methods, number of PLS components, and VIP cutoff values.

  13. Discrimination and prediction of cultivation age and parts of Panax ginseng by Fourier-transform infrared spectroscopy combined with multivariate statistical analysis

    PubMed Central

    Lim, Sa Rang; Huang, Linfang

    2017-01-01

    Panax ginseng C.A. Meyer is a herb used for medicinal purposes, and its discrimination according to cultivation age has been an important and practical issue. This study employed Fourier-transform infrared (FT-IR) spectroscopy with multivariate statistical analysis to obtain a prediction model for discriminating cultivation ages (5 and 6 years) and three different parts (rhizome, tap root, and lateral root) of P. ginseng. The optimal partial-least-squares regression (PLSR) models for discriminating ginseng samples were determined by selecting normalization methods, number of partial-least-squares (PLS) components, and variable influence on projection (VIP) cutoff values. The best prediction model for discriminating 5- and 6-year-old ginseng was developed using tap root, vector normalization applied after the second differentiation, one PLS component, and a VIP cutoff of 1.0 (based on the lowest root-mean-square error of prediction value). In addition, for discriminating among the three parts of P. ginseng, optimized PLSR models were established using data sets obtained from vector normalization, two PLS components, and VIP cutoff values of 1.5 (for 5-year-old ginseng) and 1.3 (for 6-year-old ginseng). To our knowledge, this is the first study to provide a novel strategy for rapidly discriminating the cultivation ages and parts of P. ginseng using FT-IR by selected normalization methods, number of PLS components, and VIP cutoff values. PMID:29049369

  14. Combining FT-IR spectroscopy and multivariate analysis for qualitative and quantitative analysis of the cell wall composition changes during apples development.

    PubMed

    Szymanska-Chargot, M; Chylinska, M; Kruk, B; Zdunek, A

    2015-01-22

    The aim of this work was to quantitatively and qualitatively determine the composition of the cell wall material from apples during development by means of Fourier transform infrared (FT-IR) spectroscopy. The FT-IR region of 1500-800 cm(-1), containing characteristic bands for galacturonic acid, hemicellulose and cellulose, was examined using principal component analysis (PCA), k-means clustering and partial least squares (PLS). The samples were differentiated by development stage and cultivar using PCA and k-means clustering. PLS calibration models for galacturonic acid, hemicellulose and cellulose content from FT-IR spectra were developed and validated with the reference data. PLS models were tested using the root-mean-square errors of cross-validation for contents of galacturonic acid, hemicellulose and cellulose, which were 8.30 mg/g, 4.08% and 1.74%, respectively. It was proven that FT-IR spectroscopy combined with chemometric methods has potential for fast and reliable determination of the main constituents of fruit cell walls. Copyright © 2014 Elsevier Ltd. All rights reserved.

  15. Emerging approach for analytical characterization and geographical classification of Moroccan and French honeys by means of a voltammetric electronic tongue.

    PubMed

    El Alami El Hassani, Nadia; Tahri, Khalid; Llobet, Eduard; Bouchikhi, Benachir; Errachid, Abdelhamid; Zine, Nadia; El Bari, Nezha

    2018-03-15

    Moroccan and French honeys from different geographical areas were classified and characterized by applying a voltammetric electronic tongue (VE-tongue) coupled to analytical methods. The studied parameters include color intensity, free, lactonic and total acidity, proteins, phenols, hydroxymethylfurfural content (HMF), sucrose, reducing and total sugars. The geographical classification of different honeys was developed through three pattern recognition techniques: principal component analysis (PCA), support vector machines (SVMs) and hierarchical cluster analysis (HCA). Honey characterization was achieved by partial least squares modeling (PLS). All the PLS models developed were able to accurately estimate the correct values of the parameters analyzed using the voltammetric experimental data as input (i.e. r > 0.9). This confirms the potential ability of the VE-tongue for performing a rapid characterization of honeys via PLS in which an uncomplicated, cost-effective sample preparation process that does not require the use of additional chemicals is implemented. Copyright © 2017 Elsevier Ltd. All rights reserved.

  16. Metabolic analysis of elicited cell suspension cultures of Cannabis sativa L. by (1)H-NMR spectroscopy.

    PubMed

    Pec, Jaroslav; Flores-Sanchez, Isvett Josefina; Choi, Young Hae; Verpoorte, Robert

    2010-07-01

    Cannabis sativa L. plants produce a diverse array of secondary metabolites. Cannabis cell cultures were treated with jasmonic acid (JA) and pectin as elicitors to evaluate their effect on metabolism from two cell lines using NMR spectroscopy and multivariate data analysis. According to principal component analysis (PCA) and partial least square-discriminant analysis (PLS-DA), the chloroform extract of the pectin-treated cultures were more different than control and JA-treated cultures; but in the methanol/water extract the metabolome of the JA-treated cells showed clear differences with control and pectin-treated cultures. Tyrosol, an antioxidant metabolite, was detected in cannabis cell cultures. The tyrosol content increased after eliciting with JA.

  17. Measurement of process variables in solid-state fermentation of wheat straw using FT-NIR spectroscopy and synergy interval PLS algorithm

    NASA Astrophysics Data System (ADS)

    Jiang, Hui; Liu, Guohai; Mei, Congli; Yu, Shuang; Xiao, Xiahong; Ding, Yuhan

    2012-11-01

    The feasibility of rapid determination of the process variables (i.e. pH and moisture content) in solid-state fermentation (SSF) of wheat straw using Fourier transform near infrared (FT-NIR) spectroscopy was studied. Synergy interval partial least squares (siPLS) algorithm was implemented to calibrate regression model. The number of PLS factors and the number of subintervals were optimized simultaneously by cross-validation. The performance of the prediction model was evaluated according to the root mean square error of cross-validation (RMSECV), the root mean square error of prediction (RMSEP) and the correlation coefficient (R). The measurement results of the optimal model were obtained as follows: RMSECV = 0.0776, Rc = 0.9777, RMSEP = 0.0963, and Rp = 0.9686 for pH model; RMSECV = 1.3544% w/w, Rc = 0.8871, RMSEP = 1.4946% w/w, and Rp = 0.8684 for moisture content model. Finally, compared with classic PLS and iPLS models, the siPLS model revealed its superior performance. The overall results demonstrate that FT-NIR spectroscopy combined with siPLS algorithm can be used to measure process variables in solid-state fermentation of wheat straw, and NIR spectroscopy technique has a potential to be utilized in SSF industry.
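
    The figures of merit named in this abstract (RMSECV, RMSEP and the correlation coefficient) can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes the common definition of the root mean square error (mean of squared residuals, then square root) and the Pearson correlation coefficient. RMSECV and RMSEP use the same formula and differ only in whether the predictions come from cross-validation or from an independent prediction set. The function names and the pH values are hypothetical.

```python
import math


def rmse(reference, predicted):
    """Root mean square error between reference values and model predictions."""
    if len(reference) != len(predicted) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(r - p) ** 2 for r, p in zip(reference, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def correlation(reference, predicted):
    """Pearson correlation coefficient between reference and predicted values."""
    n = len(reference)
    mean_r = sum(reference) / n
    mean_p = sum(predicted) / n
    cov = sum((r - mean_r) * (p - mean_p) for r, p in zip(reference, predicted))
    var_r = sum((r - mean_r) ** 2 for r in reference)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_r * var_p)


# Hypothetical pH readings, for illustration only (not data from the study).
reference_ph = [4.8, 5.1, 5.6, 6.0, 6.4]
predicted_ph = [4.9, 5.0, 5.7, 5.9, 6.5]
print(rmse(reference_ph, predicted_ph), correlation(reference_ph, predicted_ph))
```

    Some chemometrics packages divide by n - 1 or subtract the bias before squaring (often reported as SEP), so values computed this way may differ slightly from those of a given software package.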

  18. An In Silico Method for Screening Nicotine Derivatives as Cytochrome P450 2A6 Selective Inhibitors Based on Kernel Partial Least Squares

    PubMed Central

    Wang, Yonghua; Li, Yan; Wang, Bin

    2007-01-01

    Nicotine and a variety of other drugs and toxins are metabolized by cytochrome P450 (CYP) 2A6. The aim of the present study was to build a quantitative structure-activity relationship (QSAR) model to predict the activities of nicotine analogues on CYP2A6. Kernel partial least squares (K-PLS) regression was employed with the electro-topological descriptors to build the computational models. Both the internal and external predictabilities of the models were evaluated with test sets to ensure their validity and reliability. As a comparison to K-PLS, a standard PLS algorithm was also applied on the same training and test sets. Our results show that the K-PLS produced reasonable results that outperformed the PLS model on the datasets. The obtained K-PLS model will be helpful for the design of novel nicotine-like selective CYP2A6 inhibitors.

  19. Fingerprinting of egg and oil binders in painted artworks by matrix-assisted laser desorption ionization time-of-flight mass spectrometry analysis of lipid oxidation by-products.

    PubMed

    Calvano, C D; van der Werf, I D; Palmisano, F; Sabbatini, L

    2011-06-01

    A matrix-assisted laser desorption ionization time-of-flight mass spectrometry-based approach was applied for the detection of various lipid classes, such as triacylglycerols (TAGs) and phospholipids (PLs), and their oxidation by-products in extracts of small (50-100 μg) samples obtained from painted artworks. Ageing of test specimens under various conditions, including the presence of different pigments, was preliminarily investigated. During ageing, the TAGs and PLs content decreased, whereas the amount of diglycerides, short-chain oxidative products arising from TAGs and PLs, and oxidized TAGs and PLs components increased. The examination of a series of model paint samples gave a clear indication that specific ions produced by oxidative cleavage of PLs and/or TAGs may be used as markers for egg and drying oil-based binders. Their elemental composition and hypothetical structure are also tentatively proposed. Moreover, the simultaneous presence of egg and oil binders can be easily and unambiguously ascertained through the simultaneous occurrence of the relevant specific markers. The potential of the proposed approach was demonstrated for the first time by the analysis of real samples from a polyptych of Bartolomeo Vivarini (fifteenth century) and a "French school" canvas painting (seventeenth century).

  20. Developmental cigarette smoke exposure II: Hepatic proteome profiles in 6 month old adult offspring.

    PubMed

    Neal, Rachel E; Chen, Jing; Webb, Cindy; Stocke, Kendall; Gambrell, Caitlin; Greene, Robert M; Pisano, M Michele

    2016-10-01

    Utilizing a mouse model of 'active' developmental cigarette smoke exposure (CSE) [gestational day (GD) 1 through postnatal day (PD) 21] characterized by offspring low birth weight, the impact of developmental CSE on liver proteome profiles of adult offspring at 6 months of age was determined. Liver tissue was collected from Sham- and CSE-offspring for 2D-SDS-PAGE based proteome analysis with Partial Least Squares-Discriminant Analysis (PLS-DA). A similar study conducted at the cessation of exposure to cigarette smoke documented decreased gluconeogenesis coupled to oxidative stress in weanling offspring. In the current study, exposure throughout development to cigarette smoke resulted in impaired hepatic carbohydrate metabolism, decreased serum glucose levels, and increased gluconeogenic regulatory enzyme abundances during the fed-state coupled to decreased expression of SIRT1 as well as increased PEPCK and PGC1α expression. Together these findings indicate inappropriately timed gluconeogenesis that may reflect impaired insulin signaling in mature offspring exposed to 'active' developmental CSE. Copyright © 2016 Elsevier Inc. All rights reserved.

  1. PGI chicory (Cichorium intybus L.) traceability by means of HRMAS-NMR spectroscopy: a preliminary study.

    PubMed

    Ritota, Mena; Casciani, Lorena; Valentini, Massimiliano

    2013-05-01

    Analytical traceability of PGI and PDO foods (Protected Geographical Indication and Protected Denomination Origin respectively) is one of the most challenging tasks of current applied research. Here we proposed a metabolomic approach based on the combination of (1)H high-resolution magic angle spinning-nuclear magnetic resonance (HRMAS-NMR) spectroscopy with multivariate analysis, i.e. PLS-DA, as a reliable tool for the traceability of Italian PGI chicories (Cichorium intybus L.), i.e. Radicchio Rosso di Treviso and Radicchio Variegato di Castelfranco, also known as red and red-spotted, respectively. The metabolic profile was gained by means of HRMAS-NMR, and multivariate data analysis allowed us to build statistical models capable of providing clear discrimination among the two varieties and classification according to the geographical origin. Based on Variable Importance in Projection values, the molecular markers for classifying the different types of red chicories analysed were found accounting for both the cultivar and the place of origin. © 2012 Society of Chemical Industry.

  2. Discrimination of tomatoes bred by spaceflight mutagenesis using visible/near infrared spectroscopy and chemometrics

    NASA Astrophysics Data System (ADS)

    Shao, Yongni; Xie, Chuanqi; Jiang, Linjun; Shi, Jiahui; Zhu, Jiajin; He, Yong

    2015-04-01

    Visible/near infrared spectroscopy (Vis/NIR) based on sensitive wavelengths (SWs) and chemometrics was proposed to discriminate different tomatoes bred by spaceflight mutagenesis from their leaves or fruits (green or mature). The tomato breeds were mutant M1, M2 and their parent. Partial least squares (PLS) analysis and least squares-support vector machine (LS-SVM) were implemented for calibration models. PLS analysis was implemented for calibration models with different wavebands including the visible region (400-700 nm) and the near infrared region (700-1000 nm). The best PLS models were achieved in the visible region for the leaf and green fruit samples and in the near infrared region for the mature fruit samples. Furthermore, different latent variables (4-8 LVs for leaves, 5-9 LVs for green fruits, and 4-9 LVs for mature fruits) were used as inputs of LS-SVM to develop the LV-LS-SVM models with the grid search technique and radial basis function (RBF) kernel. The optimal LV-LS-SVM models were achieved with six LVs for the leaf samples, seven LVs for green fruits, and six LVs for mature fruits, respectively, and they outperformed the PLS models. Moreover, independent component analysis (ICA) was executed to select several SWs based on loading weights. The optimal LS-SVM model was achieved with SWs of 550-560 nm, 562-574 nm, 670-680 nm and 705-715 nm for the leaf samples; 548-556 nm, 559-564 nm, 678-685 nm and 962-974 nm for the green fruit samples; and 712-718 nm, 720-729 nm, 968-978 nm and 820-830 nm for the mature fruit samples. All of them had better performance than PLS and LV-LS-SVM, with the parameters of correlation coefficient (rp), root mean square error of prediction (RMSEP) and bias of 0.9792, 0.2632 and 0.0901 based on leaf discrimination, 0.9837, 0.2783 and 0.1758 based on green fruit discrimination, 0.9804, 0.2215 and -0.0035 based on mature fruit discrimination, respectively. The overall results indicated that ICA was an effective way for the selection of SWs, and the Vis/NIR combined with LS-SVM models had the capability to predict the different breeds (mutant M1, mutant M2 and their parent) of tomatoes from leaves and fruits.

  3. Kernel Partial Least Squares for Nonlinear Regression and Discrimination

    NASA Technical Reports Server (NTRS)

    Rosipal, Roman; Clancy, Daniel (Technical Monitor)

    2002-01-01

    This paper summarizes recent results on applying the method of partial least squares (PLS) in a reproducing kernel Hilbert space (RKHS). A previously proposed kernel PLS regression model was proven to be competitive with other regularized regression methods in RKHS. The family of nonlinear kernel-based PLS models is extended by considering the kernel PLS method for discrimination. Theoretical and experimental results on a two-class discrimination problem indicate usefulness of the method.

  4. Comparison of the meat metabolite composition of Linwu and Pekin ducks using 600 MHz 1H nuclear magnetic resonance spectroscopy.

    PubMed

    Wang, Xiangrong; Fang, Chengkun; He, Jianhua; Dai, Qiuzhong; Fang, Rejun

    2017-01-01

    In an effort to further understand the differences in meat flavor and texture between Linwu ducks and Pekin ducks at market age, we investigated the meat metabolite composition of the two breeds of ducks using 600 MHz 1H nuclear magnetic resonance (NMR) spectroscopy. Comprehensive multivariate data analysis including principal component analysis (PCA), partial least squares discriminant analysis (PLS-DA), and orthogonal projection to latent structure-discriminant analysis (OPLS-DA) was applied to analyze the 1H-NMR profiling data to identify the distinguishing metabolites of breast meat between the two breeds of ducks. Compared with 42-d-old Pekin duck meat, breast meat from 72-d-old Linwu ducks has higher concentrations of anserine, carnosine, homocarnosine, and nicotinamide, but significantly lower concentrations of succinate, creatine, and myo-inositol. These results contribute to a better understanding of the differences in meat metabolite composition between 72-d-old Linwu and 42-d-old Pekin ducks, which could be used to help assess the quality of duck meat as a food. © 2016 Poultry Science Association Inc.

  5. Measurement of process variables in solid-state fermentation of wheat straw using FT-NIR spectroscopy and synergy interval PLS algorithm.

    PubMed

    Jiang, Hui; Liu, Guohai; Mei, Congli; Yu, Shuang; Xiao, Xiahong; Ding, Yuhan

    2012-11-01

    The feasibility of rapid determination of the process variables (i.e. pH and moisture content) in solid-state fermentation (SSF) of wheat straw using Fourier transform near infrared (FT-NIR) spectroscopy was studied. Synergy interval partial least squares (siPLS) algorithm was implemented to calibrate regression model. The number of PLS factors and the number of subintervals were optimized simultaneously by cross-validation. The performance of the prediction model was evaluated according to the root mean square error of cross-validation (RMSECV), the root mean square error of prediction (RMSEP) and the correlation coefficient (R). The measurement results of the optimal model were obtained as follows: RMSECV=0.0776, R(c)=0.9777, RMSEP=0.0963, and R(p)=0.9686 for pH model; RMSECV=1.3544% w/w, R(c)=0.8871, RMSEP=1.4946% w/w, and R(p)=0.8684 for moisture content model. Finally, compared with classic PLS and iPLS models, the siPLS model revealed its superior performance. The overall results demonstrate that FT-NIR spectroscopy combined with siPLS algorithm can be used to measure process variables in solid-state fermentation of wheat straw, and NIR spectroscopy technique has a potential to be utilized in SSF industry. Copyright © 2012 Elsevier B.V. All rights reserved.

  6. Irreversible dual inhibitory mode: the novel Btk inhibitor PLS-123 demonstrates promising anti-tumor activity in human B-cell lymphoma.

    PubMed

    Ding, Ning; Li, Xitao; Shi, Yunfei; Ping, Lingyan; Wu, Lina; Fu, Kai; Feng, Lixia; Zheng, Xiaohui; Song, Yuqin; Pan, Zhengying; Zhu, Jun

    2015-06-20

    The B-cell receptor (BCR) signaling pathway has gained significant attention as a therapeutic target in B-cell malignancies. Recently, several drugs that target the BCR signaling pathway, especially the Btk inhibitor ibrutinib, have demonstrated notable therapeutic effects in relapsed/refractory patients, which indicates that pharmacological inhibition of BCR pathway holds promise in B-cell lymphoma treatment. Here we present a novel covalent irreversible Btk inhibitor PLS-123 with more potent anti-proliferative activity compared with ibrutinib in multiple cellular and in vivo models through effective apoptosis induction and dual-action inhibitory mode of Btk activation. The phosphorylation of BCR downstream activating AKT/mTOR and MAPK signal pathways was also more significantly reduced after treatment with PLS-123 than ibrutinib. Gene expression profile analysis further suggested that the different selectivity profile of PLS-123 led to significant downregulation of oncogenic gene PTPN11 expression, which might also offer new opportunities beyond what ibrutinib has achieved. In addition, PLS-123 dose-dependently attenuated BCR- and chemokine-mediated lymphoma cell adhesion and migration. Taken together, Btk inhibitor PLS-123 suggested a new direction to pharmacologically modulate Btk function and develop novel therapeutic drug for B-cell lymphoma treatment.

  7. Irreversible dual inhibitory mode: the novel Btk inhibitor PLS-123 demonstrates promising anti-tumor activity in human B-cell lymphoma

    PubMed Central

    Ding, Ning; Li, Xitao; Shi, Yunfei; Ping, Lingyan; Wu, Lina; Fu, Kai; Feng, Lixia; Zheng, Xiaohui; Song, Yuqin; Pan, Zhengying; Zhu, Jun

    2015-01-01

    The B-cell receptor (BCR) signaling pathway has gained significant attention as a therapeutic target in B-cell malignancies. Recently, several drugs that target the BCR signaling pathway, especially the Btk inhibitor ibrutinib, have demonstrated notable therapeutic effects in relapsed/refractory patients, which indicates that pharmacological inhibition of BCR pathway holds promise in B-cell lymphoma treatment. Here we present a novel covalent irreversible Btk inhibitor PLS-123 with more potent anti-proliferative activity compared with ibrutinib in multiple cellular and in vivo models through effective apoptosis induction and dual-action inhibitory mode of Btk activation. The phosphorylation of BCR downstream activating AKT/mTOR and MAPK signal pathways was also more significantly reduced after treatment with PLS-123 than ibrutinib. Gene expression profile analysis further suggested that the different selectivity profile of PLS-123 led to significant downregulation of oncogenic gene PTPN11 expression, which might also offer new opportunities beyond what ibrutinib has achieved. In addition, PLS-123 dose-dependently attenuated BCR- and chemokine-mediated lymphoma cell adhesion and migration. Taken together, Btk inhibitor PLS-123 suggested a new direction to pharmacologically modulate Btk function and develop novel therapeutic drug for B-cell lymphoma treatment. PMID:25944695

  8. Partial least squares methods for spectrally estimating lunar soil FeO abundance: A stratified approach to revealing nonlinear effect and qualitative interpretation

    NASA Astrophysics Data System (ADS)

    Li, Lin

    2008-12-01

    Partial least squares (PLS) regressions were applied to lunar highland and mare soil data characterized by the Lunar Soil Characterization Consortium (LSCC) for spectral estimation of the abundance of lunar soil chemical constituents FeO and Al2O3. The LSCC data set was split into a number of subsets including the total highland, Apollo 16, Apollo 14, and total mare soils, and then PLS was applied to each to investigate the effect of nonlinearity on the performance of the PLS method. The weight-loading vectors resulting from PLS were analyzed to identify mineral species responsible for spectral estimation of the soil chemicals. The results from PLS modeling indicate that the PLS performance depends on the correlation of constituents of interest to their major mineral carriers, and the Apollo 16 soils are responsible for the large errors of FeO and Al2O3 estimates when the soils were modeled along with other types of soils. These large errors are attributed primarily to the degraded correlation of FeO to pyroxene for the relatively mature Apollo 16 soils as a result of space weathering, and secondarily to the interference of olivine. PLS consistently yields very accurate fits to the two soil chemicals when applied to mare soils. Although Al2O3 has no spectrally diagnostic characteristics, this chemical can be predicted for all subset data by PLS modeling at high accuracies because of its correlation to FeO. This correlation is reflected in the symmetry of the PLS weight-loading vectors for FeO and Al2O3, which prove to be very useful for qualitative interpretation of the PLS results. However, this qualitative interpretation of PLS modeling cannot be achieved using principal component regression loading vectors.

  9. Human Catestatin Alters Gut Microbiota Composition in Mice

    PubMed Central

    Rabbi, Mohammad F.; Munyaka, Peris M.; Eissa, Nour; Metz-Boutigue, Marie-Hélène; Khafipour, Ehsan; Ghia, Jean Eric

    2017-01-01

    The mammalian intestinal tract is heavily colonized with dense, complex, and diversified microbial populations. In healthy individuals, an array of epithelial antimicrobial agents is secreted in the gut to aid intestinal homeostasis. Enterochromaffin cells (EC) in the intestinal epithelium are a major source of chromogranin A (CgA), which is a pro-hormone and can be cleaved into many bioactive peptides that include catestatin (CST). This study was carried out to evaluate the possible impact of CST on gut microbiota in vivo using a mouse model. The CST (Human CgA352−372) or normal saline was intrarectally administered to C57BL/6 male mice for 6 days, after which the mice were sacrificed. Feces and colonic mucosa tissue samples were collected, DNA was extracted, the V4 region of bacterial 16S rRNA gene was amplified and subjected to MiSeq Illumina sequencing. The α-diversity was calculated using Chao 1 and β-diversity was determined using QIIME. Differences at the genus level were determined using partial least square discriminant analysis (PLS-DA). Phylogenetic investigation of communities by reconstruction of unobserved states (PICRUSt) was used to predict functional capacity of bacterial community. CST treatment did not modify bacterial richness in fecal and colonic mucosa-associated microbiota; however, treatment significantly modified bacterial community composition between the groups. Also, CST-treated mice had a significantly lower relative abundance of Firmicutes and higher abundance of Bacteroidetes, observed only in fecal samples. However, at lower phylogenetic levels, PLS-DA analysis revealed that some bacterial taxa were significantly associated with the CST-treated mice in both fecal and colonic mucosa samples. In addition, differences in predicted microbial functional pathways in both fecal and colonic mucosa samples were detected. The results support the hypothesis that CST treatment modulates gut microbiota composition under non-pathophysiological conditions; however, the results of this study need to be further validated in a larger experiment. The data may open new avenues for the development of a potential new line of antimicrobial peptides and their use as therapeutic agents to treat several inflammatory conditions of the gastrointestinal tract, such as inflammatory bowel disease (IBD), inflammatory bowel syndrome (IBS), or other health conditions. PMID:28144234

  10. Use of Near-Infrared Spectroscopy and Chemometrics for the Nondestructive Identification of Concealed Damage in Raw Almonds (Prunus dulcis).

    PubMed

    Rogel-Castillo, Cristian; Boulton, Roger; Opastpongkarn, Arunwong; Huang, Guangwei; Mitchell, Alyson E

    2016-07-27

    Concealed damage (CD) is defined as a brown discoloration of the kernel interior (nutmeat) that appears only after moderate to high heat treatment (e.g., blanching, drying, roasting, etc.). Raw almonds with CD have no visible defects before heat treatment. Currently, there are no screening methods available for detecting CD in raw almonds. Herein, the feasibility of using near-infrared (NIR) spectroscopy between 1125 and 2153 nm for the detection of CD in almonds is demonstrated. Almond kernels with CD have less NIR absorbance in the region related with oil, protein, and carbohydrates. With the use of partial least squares discriminant analysis (PLS-DA) and selection of specific wavelengths, three classification models were developed. The calibration models have false-positive and false-negative error rates ranging between 12.4 and 16.1% and between 10.6 and 17.2%, respectively. The percent error rates ranged between 8.2 and 9.2%. Second-derivative preprocessing of the selected wavelength resulted in the most robust predictive model.
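
    The false-positive, false-negative and overall error rates reported above can be illustrated with a short sketch. The abstract does not state how its rates were defined, so the definitions below are assumptions: the false-positive rate is the share of sound kernels flagged as damaged, the false-negative rate is the share of damaged kernels missed, and the overall error rate is the share of all kernels misclassified. The function name and the labels are hypothetical, not data from the study.

```python
def error_rates(actual, predicted, positive=1):
    """Return false-positive, false-negative and overall error rates.

    `actual` and `predicted` are equal-length sequences of class labels;
    `positive` marks the class of interest (here: concealed damage).
    """
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    tp = fp = tn = fn = 0
    for a, p in zip(actual, predicted):
        if a == positive and p == positive:
            tp += 1
        elif a == positive:
            fn += 1
        elif p == positive:
            fp += 1
        else:
            tn += 1
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in `actual`")
    return {
        "false_positive_rate": fp / (fp + tn),
        "false_negative_rate": fn / (fn + tp),
        "error_rate": (fp + fn) / len(actual),
    }


# Hypothetical labels: 1 = concealed damage, 0 = sound kernel.
actual = [1, 1, 1, 0, 0, 0, 0, 0]
predicted = [1, 1, 0, 0, 0, 0, 0, 1]
print(error_rates(actual, predicted))
```

    With these example labels, one of five sound kernels is flagged (0.2), one of three damaged kernels is missed (about 0.33), and two of eight kernels are misclassified (0.25).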

  11. Metabolomic Analysis of Oxidative and Glycolytic Skeletal Muscles by Matrix-Assisted Laser Desorption/Ionization Mass Spectrometric Imaging (MALDI MSI)

    NASA Astrophysics Data System (ADS)

    Tsai, Yu-Hsuan; Garrett, Timothy J.; Carter, Christy S.; Yost, Richard A.

    2015-06-01

    Skeletal muscles are composed of heterogeneous muscle fibers that have different physiological, morphological, biochemical, and histological characteristics. In this work, skeletal muscles extensor digitorum longus, soleus, and whole gastrocnemius were analyzed by matrix-assisted laser desorption/ionization mass spectrometry to characterize small molecule metabolites of oxidative and glycolytic muscle fiber types as well as to visualize biomarker localization. Multivariate data analysis such as principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) were performed to extract significant features. Different metabolic fingerprints were observed from oxidative and glycolytic fibers. Higher abundances of biomolecules such as antioxidant anserine as well as acylcarnitines were observed in the glycolytic fibers, whereas taurine and some nucleotides were found to be localized in the oxidative fibers.

  12. At-line determination of pharmaceuticals small molecule's blending end point using chemometric modeling combined with Fourier transform near infrared spectroscopy

    NASA Astrophysics Data System (ADS)

    Tewari, Jagdish; Strong, Richard; Boulas, Pierre

    2017-02-01

    This article summarizes the development and validation of a Fourier transform near infrared spectroscopy (FT-NIR) method for the rapid at-line prediction of active pharmaceutical ingredient (API) in a powder blend to optimize small molecule formulations. The method was used to determine the blend uniformity end-point for a pharmaceutical solid dosage formulation containing a range of API concentrations. A set of calibration spectra from samples with concentrations ranging from 1% to 15% of API (w/w) was collected at-line from 4000 to 12,500 cm-1. The ability of the FT-NIR method to predict API concentration in the blend samples was validated against a reference high performance liquid chromatography (HPLC) method. The prediction efficiency of four different types of multivariate data modeling methods, namely partial least-squares 1 (PLS1), partial least-squares 2 (PLS2), principal component regression (PCR) and artificial neural network (ANN), was compared using relevant multivariate figures of merit. The prediction ability of the regression models was cross-validated against results generated with the reference HPLC method. PLS1 and ANN showed excellent and superior prediction abilities when compared to PLS2 and PCR. Based upon these results and because of its decreased complexity compared to ANN, PLS1 was selected as the best chemometric method to predict blend uniformity at-line. The FT-NIR measurement and the associated chemometric analysis were implemented in the production environment for rapid at-line determination of the end-point of the small molecule blending operation.
    FIGURE 1: Correlation coefficient vs Rank plot; FIGURE 2: FT-NIR spectra of different steps of Blend and final blend; FIGURE 3: Prediction ability of PCR; FIGURE 4: Blend uniformity prediction ability of PLS2; FIGURE 5: Prediction efficiency of blend uniformity using ANN; FIGURE 6: Comparison of prediction efficiency of chemometric models; TABLE 1: Order of Addition for Blending Steps

  13. Additive Partial Least Squares for efficient modelling of independent variance sources demonstrated on practical case studies.

    PubMed

    Luoma, Pekka; Natschläger, Thomas; Malli, Birgit; Pawliczek, Marcin; Brandstetter, Markus

    2018-05-12

    A model recalibration method based on additive Partial Least Squares (PLS) regression is generalized for multi-adjustment scenarios of independent variance sources (referred to as additive PLS - aPLS). aPLS allows for effortless model readjustment under changing measurement conditions and the combination of independent variance sources with the initial model by means of additive modelling. We demonstrate these distinguishing features on two NIR spectroscopic case-studies. In case study 1 aPLS was used as a readjustment method for an emerging offset. The achieved RMS error of prediction (1.91 a.u.) was of similar level as before the offset occurred (2.11 a.u.). In case-study 2 a calibration combining different variance sources was conducted. The achieved performance was of sufficient level with an absolute error being better than 0.8% of the mean concentration, therefore being able to compensate negative effects of two independent variance sources. The presented results show the applicability of the aPLS approach. The main advantages of the method are that the original model stays unadjusted and that the modelling is conducted on concrete changes in the spectra thus supporting efficient (in most cases straightforward) modelling. Additionally, the method is put into context of existing machine learning algorithms. Copyright © 2018 Elsevier B.V. All rights reserved.

  14. Crop/weed discrimination using near-infrared reflectance spectroscopy (NIRS)

    NASA Astrophysics Data System (ADS)

    Zhang, Yun; He, Yong

    2006-09-01

    Traditional uniform herbicide application often leaves excess chemical residues on soil, crop plants and agricultural produce, which imperil the environment and food security. Near-infrared reflectance spectroscopy (NIRS) offers a promising means for weed detection and site-specific herbicide application. In the laboratory, a total of 90 samples (30 for each species) of the detached leaves of two weeds, i.e., threeseeded mercury (Acalypha australis L.) and fourleafed duckweed (Marsilea quadrifolia L.), and one crop, soybean (Glycine max), were investigated by NIRS over 325-1075 nm using a field spectroradiometer. After pretreatment, 20 absorbance samples of each species were exported, and the missing Y variables were assigned independent values for partial least squares (PLS) analysis. In the combined principal component analysis (PCA) on 400-1000 nm, PC1 and PC2 together explained over 91% of the total variance and detected the three plant species with 98.3% accuracy. The full cross-validation results of PLS, i.e., standard error of prediction (SEP) 0.247, correlation coefficient (r) 0.954 and root mean square error of prediction (RMSEP) 0.245, indicated an optimum model for weed identification. When the remaining 10 samples of each species were predicted with the PLS model, the results with deviation gave a 100% crop/weed detection rate. Thus, it could be concluded that PLS is a viable alternative for qualitative weed discrimination with NIRS.

  15. Application of near-infrared spectroscopy for the rapid quality assessment of Radix Paeoniae Rubra

    NASA Astrophysics Data System (ADS)

    Zhan, Hao; Fang, Jing; Tang, Liying; Yang, Hongjun; Li, Hua; Wang, Zhuju; Yang, Bin; Wu, Hongwei; Fu, Meihong

    2017-08-01

    Near-infrared (NIR) spectroscopy with multivariate analysis was used to quantify gallic acid, catechin, albiflorin, and paeoniflorin in Radix Paeoniae Rubra, and the feasibility of classifying samples originating from different areas was investigated. A new high-performance liquid chromatography method was developed and validated to analyze gallic acid, catechin, albiflorin, and paeoniflorin in Radix Paeoniae Rubra as the reference. Partial least squares (PLS), principal component regression (PCR), and stepwise multivariate linear regression (SMLR) were performed to calibrate the regression model. Different data pretreatments such as derivatives (1st and 2nd), multiplicative scatter correction, standard normal variate, Savitzky-Golay filter, and Norris derivative filter were applied to remove the systematic errors. The performance of the model was evaluated according to the root mean square error of calibration (RMSEC), root mean square error of prediction (RMSEP), root mean square error of cross-validation (RMSECV), and correlation coefficient (r). The results show that compared to PCR and SMLR, PLS had a lower RMSEC, RMSECV, and RMSEP and higher r for all four analytes. PLS coupled with proper pretreatments showed good performance in both the fitting and predicting results. Furthermore, the areas of origin of Radix Paeoniae Rubra samples were partly distinguished by principal component analysis. This study shows that NIR with PLS is a reliable, inexpensive, and rapid tool for the quality assessment of Radix Paeoniae Rubra.
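
    Of the pretreatments listed above, standard normal variate (SNV) is the simplest to sketch: each spectrum is centered on its own mean and scaled by its own standard deviation, which removes per-sample offset and multiplicative scatter effects. The code below is an illustration under stated assumptions, not the authors' implementation: it uses the sample standard deviation (n - 1 denominator), and the function name and absorbance values are hypothetical.

```python
import statistics


def standard_normal_variate(spectrum):
    """Center one spectrum on its own mean and scale by its own standard deviation."""
    mean = statistics.mean(spectrum)
    sd = statistics.stdev(spectrum)  # sample standard deviation (n - 1 denominator)
    if sd == 0:
        raise ValueError("spectrum is constant; SNV is undefined")
    return [(value - mean) / sd for value in spectrum]


# Two hypothetical absorbance spectra with the same shape but a different
# offset and scale, as scatter effects might produce.
spectrum_a = [0.10, 0.20, 0.40, 0.30]
spectrum_b = [0.70, 0.90, 1.30, 1.10]  # equals 2 * spectrum_a + 0.5

print(standard_normal_variate(spectrum_a))
print(standard_normal_variate(spectrum_b))
```

    Because the second spectrum is a scaled and shifted copy of the first, both give the same values after SNV, up to floating-point rounding.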

  16. Design of experiments-based monitoring of critical quality attributes for the spray-drying process of insulin by NIR spectroscopy.

    PubMed

    Maltesen, Morten Jonas; van de Weert, Marco; Grohganz, Holger

    2012-09-01

    Moisture content and aerodynamic particle size are critical quality attributes for spray-dried protein formulations. In this study, spray-dried insulin powders intended for pulmonary delivery were produced applying design of experiments methodology. Near infrared spectroscopy (NIR) in combination with preprocessing and multivariate analysis in the form of partial least squares projections to latent structures (PLS) was used to correlate the spectral data with moisture content and aerodynamic particle size measured by a time of flight principle. PLS models predicting the moisture content were based on the chemical information of the water molecules in the NIR spectrum. Models yielded prediction errors (RMSEP) between 0.39% and 0.48% with thermal gravimetric analysis used as reference method. The PLS models predicting the aerodynamic particle size were based on baseline offset in the NIR spectra and yielded prediction errors between 0.27 and 0.48 μm. The morphology of the spray-dried particles had a significant impact on the predictive ability of the models. Good predictive models could be obtained for spherical particles with a calibration error (RMSECV) of 0.22 μm, whereas wrinkled particles resulted in much less robust models with a Q² of 0.69. Based on the results in this study, NIR is a suitable tool for process analysis of the spray-drying process and for control of moisture content and particle size, in particular for smooth and spherical particles.
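
    The Q² value quoted above is conventionally computed as one minus the ratio of the cross-validated prediction error sum of squares (PRESS) to the total sum of squares of the reference values. The abstract does not say which variant was used, so the sketch below assumes that common definition; `q_squared` and the sample values are hypothetical.

```python
def q_squared(y_ref, y_cv_pred):
    """Cross-validated Q2 = 1 - PRESS / SS_total.

    y_cv_pred must hold predictions made for each sample while that
    sample was left out of the calibration (assumed to be done upstream).
    """
    n = len(y_ref)
    if n < 2 or n != len(y_cv_pred):
        raise ValueError("need two equal-length sequences with at least 2 values")
    mean_ref = sum(y_ref) / n
    press = sum((r - p) ** 2 for r, p in zip(y_ref, y_cv_pred))
    ss_total = sum((r - mean_ref) ** 2 for r in y_ref)
    if ss_total == 0:
        raise ValueError("reference values are constant; Q2 is undefined")
    return 1.0 - press / ss_total

# Toy values for illustration only (not data from the study)
q2 = q_squared([1.0, 2.0, 3.0, 4.0], [1.2, 1.8, 3.3, 3.9])
```

    A Q² near 1 means the cross-validated errors are small relative to the spread of the reference values; it can be negative when the model predicts worse than the mean.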

  17. Monitoring of chicken meat freshness by means of a colorimetric sensor array.

    PubMed

    Salinas, Yolanda; Ros-Lis, José V; Vivancos, José-L; Martínez-Máñez, Ramón; Marcos, M Dolores; Aucejo, Susana; Herranz, Nuria; Lorente, Inmaculada

    2012-08-21

    A new optoelectronic nose to monitor chicken meat ageing has been developed. It is based on 16 pigments prepared by the incorporation of different dyes (pH indicators, Lewis acids, hydrogen-bonding derivatives, selective probes and natural dyes) into inorganic materials (UVM-7, silica and alumina). The colour changes of the sensor array were characteristic of chicken ageing in a modified packaging atmosphere (30% CO2-70% N2). The chromogenic array data were processed with qualitative (PCA) and quantitative (PLS) tools. The PCA statistical analysis showed a high degree of dispersion, with nine dimensions required to explain 95% of variance. Despite this high dimensionality, a tridimensional representation of the three principal components was able to differentiate ageing with 2-day intervals. Moreover, the PLS statistical analysis allows the creation of a model to correlate the chromogenic data with chicken meat ageing. The PLS prediction model for ageing gives values of 0.9937, 0.0389 and 0.994 for the slope, the intercept and the regression coefficient, respectively, in agreement with the perfect fit observed between the predicted and measured values. The results suggest the feasibility of this system to help develop optoelectronic noses that monitor food freshness.
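
    The slope and intercept reported here describe a straight line fitted to predicted versus measured values, where a slope of 1 and an intercept of 0 would indicate perfect agreement. A minimal sketch of that ordinary least-squares fit follows; `fit_line` and the numbers are hypothetical and are not the study's data.

```python
def fit_line(measured, predicted):
    """Ordinary least-squares line: predicted = slope * measured + intercept."""
    n = len(measured)
    if n < 2 or n != len(predicted):
        raise ValueError("need two equal-length sequences with at least 2 values")
    mean_x = sum(measured) / n
    mean_y = sum(predicted) / n
    sxx = sum((x - mean_x) ** 2 for x in measured)
    if sxx == 0:
        raise ValueError("measured values are constant; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(measured, predicted))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Toy ageing times in days (illustrative values only)
measured_days = [0.0, 2.0, 4.0, 6.0, 8.0]
predicted_days = [0.1, 2.0, 4.1, 5.9, 8.1]
slope, intercept = fit_line(measured_days, predicted_days)
```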

  18. Plasma metabonomics study of the patients with acute anterior uveitis based on ultra-performance liquid chromatography-mass spectrometry.

    PubMed

    Guo, Junguo; Yan, Tingqin; Bi, Hongsheng; Xie, Xiaofeng; Wang, Xingrong; Guo, Dadong; Jiang, Haiqiang

    2014-06-01

    The identification of the biomarkers of patients with acute anterior uveitis (AAU) may allow for a less invasive and more accurate diagnosis, as well as serving as a predictor in AAU progression and treatment response. The aim of this study was to identify the potential biomarkers and the metabolic pathways from plasma in patients with AAU. Both plasma metabolic biomarkers and metabolic pathways in the AAU patients versus healthy volunteers were investigated using ultra-performance liquid chromatography-mass spectrometry (UPLC-MS) and a metabonomics approach. The principal component analysis (PCA) was used to separate AAU patients from healthy volunteers as well as to identify the different biomarkers between the two groups. Metabolic compounds were matched to the KEGG, METLIN, and HMDB databases, and metabolic pathways associated with AAU were identified. The PCA for UPLC-MS data shows that the metabolites in AAU patients were significantly different from those of healthy volunteers. Of the 4,396 total features detected by UPLC-MS, 102 features were significantly different between AAU patients and healthy volunteers according to the variable importance plot (VIP) values (greater than two) of partial least squares discriminant analysis (PLS-DA). Thirty-three metabolic compounds were identified and were considered as potential biomarkers. Meanwhile, ten metabolic pathways were found that were related to the AAU according to the identified biomarkers. These data suggest that metabolomics study can identify potential metabolites that differ between AAU patients and healthy volunteers. Based on the PCA and PLS-DA, several potential metabolic biomarkers and pathways in AAU patients were found and identified. In addition, the UPLC-MS technique combined with metabonomics could be a suitable systems biology tool for research on clinical problems in ophthalmology, and can provide further insight into the pathophysiology of AAU.

  19. Partial Least Square Discriminant Analysis Based on Normalized Two-Stage Vegetation Indices for Mapping Damage from Rice Diseases Using PlanetScope Datasets.

    PubMed

    Shi, Yue; Huang, Wenjiang; Ye, Huichun; Ruan, Chao; Xing, Naichen; Geng, Yun; Dong, Yingying; Peng, Dailiang

    2018-06-11

    In recent decades, rice disease co-epidemics have caused tremendous damage to crop production in both China and Southeast Asia. A variety of remote sensing based approaches have been developed and applied to map diseases distribution using coarse- to moderate-resolution imagery. However, the detection and discrimination of various disease species infecting rice were seldom assessed using high spatial resolution data. The aims of this study were (1) to develop a set of normalized two-stage vegetation indices (VIs) for characterizing the progressive development of different diseases with rice; (2) to explore the performance of combined normalized two-stage VIs in partial least square discriminant analysis (PLS-DA); and (3) to map and evaluate the damage caused by rice diseases at fine spatial scales, for the first time using bi-temporal, high spatial resolution imagery from PlanetScope datasets at a 3 m spatial resolution. Our findings suggest that the primary biophysical parameters caused by different disease (e.g., changes in leaf area, pigment contents, or canopy morphology) can be captured using combined normalized two-stage VIs. PLS-DA was able to classify rice diseases at a sub-field scale, with an overall accuracy of 75.62% and a Kappa value of 0.47. The approach was successfully applied during a typical co-epidemic outbreak of rice dwarf (Rice dwarf virus, RDV), rice blast ( Magnaporthe oryzae ), and glume blight ( Phyllosticta glumarum ) in Guangxi Province, China. Furthermore, our approach highlighted the feasibility of the method in capturing heterogeneous disease patterns at fine spatial scales over the large spatial extents.
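
    Overall accuracy and the Kappa value quoted above are both derived from a confusion matrix; Kappa discounts the agreement expected by chance, which is why it can be much lower than the accuracy. The sketch below shows the standard Cohen's kappa calculation on a made-up two-class matrix; `accuracy_and_kappa` is a hypothetical name and the counts are not from the study.

```python
def accuracy_and_kappa(confusion):
    """confusion[i][j] = number of samples of true class i predicted as class j."""
    k = len(confusion)
    total = sum(sum(row) for row in confusion)
    if total == 0:
        raise ValueError("confusion matrix is empty")
    observed = sum(confusion[i][i] for i in range(k)) / total
    row_totals = [sum(row) for row in confusion]
    col_totals = [sum(confusion[i][j] for i in range(k)) for j in range(k)]
    # Agreement expected if predictions were independent of the true class
    expected = sum(row_totals[i] * col_totals[i] for i in range(k)) / total ** 2
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    kappa = (observed - expected) / (1.0 - expected)
    return observed, kappa

# Made-up counts for illustration only
toy_confusion = [[40, 10],
                 [5, 45]]
accuracy, kappa = accuracy_and_kappa(toy_confusion)
```

    For this toy matrix the observed agreement is 0.85 and the chance agreement is 0.50, so kappa is 0.70.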

  20. Seized cannabis seeds cultivated in greenhouse: A chemical study by gas chromatography-mass spectrometry and chemometric analysis.

    PubMed

    Mariotti, Kristiane de Cássia; Marcelo, Marcelo Caetano Alexandre; Ortiz, Rafael S; Borille, Bruna Tassi; Dos Reis, Monique; Fett, Mauro Sander; Ferrão, Marco Flôres; Limberger, Renata Pereira

    2016-01-01

    Cannabis sativa L. is cultivated in most regions of the world. In 2013, the Brazilian Federal Police (BFP) reported 220 tons of marijuana seized and about 800,000 cannabis plants eradicated. Efforts to eradicate cannabis production may have contributed to the development of a new form of international drug trafficking in Brazil: the sending of cannabis seeds in small amounts to urban centers by postal logistics. This new and increasing panorama of cannabis trafficking in Brazil encouraged the chemical study of cannabis seeds cultivated in greenhouses by gas chromatography coupled with mass spectrometry (GC-MS) associated with exploratory and discriminant analysis. Fifty cannabis seeds of different varieties and brands, seized by the BFP, were cultivated under predefined conditions for periods of 4.5 weeks, 5.5 weeks, 7.5 weeks, 10 weeks and 12 weeks. Aerial parts were analyzed and cannabigerol, cannabinol, cannabidiol, cannabichromene, Δ9-tetrahydrocannabinol (THC) and other terpenoids were detected. The chromatographic chemical profiles of the samples were significantly different, probably due to different variety, light exposure and age. THC content increased with the age of the plant; however, for other cannabinoids, this correlation was not observed. The chromatograms were arranged in a matrix with 50 rows (samples) and 3886 columns (abundance at a retention time) and submitted to PCA, HCA and PLS-DA after pretreatment (normalization, first derivative and autoscale). The PCA and HCA showed age separation between samples; however, it was not possible to verify the separation by varieties and brands. The PLS-DA classification provides a satisfactory prediction of plant age. Copyright © 2015 The Chartered Society of Forensic Sciences. Published by Elsevier Ireland Ltd. All rights reserved.
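
    The pretreatment chain named in this abstract (normalization, first derivative, autoscale) can be sketched on a small samples-by-variables matrix. The abstract does not specify the variants used, so this sketch assumes unit-sum row normalization, a simple finite difference for the derivative, and column-wise mean centering with unit sample standard deviation; all function names are hypothetical.

```python
import math

def normalize_rows(X):
    """Scale each row (sample) to unit sum; assumes positive abundances."""
    out = []
    for row in X:
        total = sum(row)
        out.append([v / total for v in row])
    return out

def first_difference(X):
    """Finite-difference first derivative along each row (one column shorter)."""
    return [[row[j + 1] - row[j] for j in range(len(row) - 1)] for row in X]

def autoscale(X):
    """Center each column to mean 0 and scale to sample standard deviation 1."""
    n = len(X)
    cols = list(zip(*X))
    means = [sum(c) / n for c in cols]
    stds = [math.sqrt(sum((v - m) ** 2 for v in c) / (n - 1))
            for c, m in zip(cols, means)]
    # Constant columns carry no information and are set to zero
    return [[(v - m) / s if s > 0 else 0.0 for v, m, s in zip(row, means, stds)]
            for row in X]

# Toy 3 x 4 matrix standing in for the 50 x 3886 chromatogram matrix
X_raw = [[1.0, 2.0, 4.0, 3.0],
         [2.0, 2.0, 3.0, 3.0],
         [1.0, 3.0, 5.0, 1.0]]
Z = autoscale(first_difference(normalize_rows(X_raw)))
```

    Autoscaling is applied last so that every variable enters PCA, HCA or PLS-DA with equal weight regardless of its original magnitude.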

  1. Quantitative analysis of red wine tannins using Fourier-transform mid-infrared spectrometry.

    PubMed

    Fernandez, Katherina; Agosin, Eduardo

    2007-09-05

    Tannin content and composition are critical quality components of red wines. No spectroscopic method assessing these phenols in wine has been described so far. We report here a new method using Fourier transform mid-infrared (FT-MIR) spectroscopy and chemometric techniques for the quantitative analysis of red wine tannins. Calibration models were developed using protein precipitation and phloroglucinolysis as analytical reference methods. After spectra preprocessing, six different predictive partial least-squares (PLS) models were evaluated, including the use of interval selection procedures such as iPLS and CSMWPLS. PLS regression with full-range (650-4000 cm(-1)), second derivative of the spectra and phloroglucinolysis as the reference method gave the most accurate determination for tannin concentration (RMSEC = 2.6%, RMSEP = 9.4%, r = 0.995). The prediction of the mean degree of polymerization (mDP) of the tannins also gave a reasonable prediction (RMSEC = 6.7%, RMSEP = 10.3%, r = 0.958). These results represent the first step in the development of a spectroscopic methodology for the quantification of several phenolic compounds that are critical for wine quality.

  2. Comparison of multivariate analysis methods for extracting the paraffin component from the paraffin-embedded cancer tissue spectra for Raman imaging

    NASA Astrophysics Data System (ADS)

    Meksiarun, Phiranuphon; Ishigaki, Mika; Huck-Pezzei, Verena A. C.; Huck, Christian W.; Wongravee, Kanet; Sato, Hidetoshi; Ozaki, Yukihiro

    2017-03-01

    This study aimed to extract the paraffin component from paraffin-embedded oral cancer tissue spectra using three multivariate analysis (MVA) methods; Independent Component Analysis (ICA), Partial Least Squares (PLS) and Independent Component - Partial Least Square (IC-PLS). The estimated paraffin components were used for removing the contribution of paraffin from the tissue spectra. These three methods were compared in terms of the efficiency of paraffin removal and the ability to retain the tissue information. It was found that ICA, PLS and IC-PLS could remove the paraffin component from the spectra at almost the same level while Principal Component Analysis (PCA) was incapable. In terms of retaining cancer tissue spectral integrity, effects of PLS and IC-PLS on the non-paraffin region were significantly less than that of ICA where cancer tissue spectral areas were deteriorated. The paraffin-removed spectra were used for constructing Raman images of oral cancer tissue and compared with Hematoxylin and Eosin (H&E) stained tissues for verification. This study has demonstrated the capability of Raman spectroscopy together with multivariate analysis methods as a diagnostic tool for the paraffin-embedded tissue section.

  3. Qualitative and quantitative analysis of ochratoxin A contamination in green coffee beans using Fourier transform near infrared spectroscopy.

    PubMed

    Taradolsirithitikul, Panchita; Sirisomboon, Panmanas; Dachoupakan Sirisomboon, Cheewanun

    2017-03-01

    Ochratoxin A (OTA) contamination is highly prevalent in a variety of agricultural products including the commercially important coffee bean. As such, rapid and accurate detection methods are considered necessary for the identification of OTA in green coffee beans. The goal of this research was to apply Fourier transform near infrared spectroscopy to detect and classify OTA contamination in green coffee beans in both a quantitative and qualitative manner. PLSR models were generated using pretreated spectroscopic data to predict the OTA concentration. The best model displayed a correlation coefficient (r) of 0.814, and a standard error of prediction (SEP) and bias of 1.965 µg kg-1 and 0.358 µg kg-1, respectively. Additionally, a PLS-DA model was also generated, displaying a classification accuracy of 96.83% for a non-OTA contaminated model and 80.95% for an OTA contaminated model, with an overall classification accuracy of 88.89%. The results demonstrate that the developed model could be used for detecting OTA contamination in green coffee beans in either a quantitative or qualitative manner. © 2016 Society of Chemical Industry.
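
    The three classification figures above are related by simple arithmetic: the overall accuracy is the class-size-weighted average of the per-class accuracies. The abstract does not give sample counts, so the sketch below uses hypothetical counts (63 samples per class) chosen only because they reproduce the reported percentages; `per_class_and_overall` is likewise a hypothetical name.

```python
def per_class_and_overall(correct_by_class, total_by_class):
    """Return per-class accuracies and overall accuracy, all in percent."""
    per_class = {
        name: 100.0 * correct_by_class[name] / total_by_class[name]
        for name in total_by_class
    }
    overall = 100.0 * sum(correct_by_class.values()) / sum(total_by_class.values())
    return per_class, overall

# Hypothetical counts, NOT taken from the paper
totals = {"non_contaminated": 63, "contaminated": 63}
correct = {"non_contaminated": 61, "contaminated": 51}
per_class, overall = per_class_and_overall(correct, totals)
```

    With equal class sizes the overall accuracy is the plain mean of the two class accuracies, (96.83 + 80.95) / 2 = 88.89; with unequal class sizes it would shift toward the larger class.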

  4. [Identification of two varieties of Citri Fructus by fingerprint and chemometrics].

    PubMed

    Su, Jing-hua; Zhang, Chao; Sun, Lei; Gu, Bing-ren; Ma, Shuang-cheng

    2015-06-01

    Citri Fructus identification by fingerprint and chemometrics was investigated in this paper. Twenty-three Citri Fructus samples were collected, representing the two varieties recorded in the Chinese Pharmacopoeia, Citrus wilsonii and C. medica. HPLC chromatograms were obtained. The components were partly identified by reference substances, and then a common pattern was established for chemometric analysis. Similarity analysis, principal component analysis (PCA), partial least squares-discriminant analysis (PLS-DA) and hierarchical cluster analysis heatmap were applied. The results indicated that C. wilsonii and C. medica could be ideally classified with a common pattern containing twenty-five characteristic peaks. Besides, preliminary pattern recognition verified the chemometric analytical results. Absolute peak area (APA) was used for relevant quantitative analysis; the results showed the differences between the two varieties and were valuable for further quality control, such as the selection of characteristic components.

  5. On-line monitoring the extract process of Fu-fang Shuanghua oral solution using near infrared spectroscopy and different PLS algorithms

    NASA Astrophysics Data System (ADS)

    Kang, Qian; Ru, Qingguo; Liu, Yan; Xu, Lingyan; Liu, Jia; Wang, Yifei; Zhang, Yewen; Li, Hui; Zhang, Qing; Wu, Qing

    2016-01-01

    An on-line near infrared (NIR) spectroscopy monitoring method with an appropriate multivariate calibration method was developed for the extraction process of Fu-fang Shuanghua oral solution (FSOS). On-line NIR spectra were collected through two fiber optic probes, which were designed to transmit NIR radiation by a 2 mm flange. Partial least squares (PLS), interval PLS (iPLS) and synergy interval PLS (siPLS) algorithms were used comparatively for building the calibration regression models. During the extraction process, NIR spectroscopy was employed to determine the chlorogenic acid (CA) content, total phenolic acids content (TPC), total flavonoids content (TFC) and soluble solid content (SSC). High performance liquid chromatography (HPLC), ultraviolet spectrophotometric method (UV) and loss on drying methods were employed as reference methods. Experimental results showed that the siPLS model performed best compared with PLS and iPLS. The calibration models for CA, TPC, TFC and SSC had high determination coefficients (R2) (0.9948, 0.9992, 0.9950 and 0.9832) and low root mean square errors of cross validation (RMSECV) (0.0113, 0.0341, 0.1787 and 1.2158), which indicate a good correlation between reference values and NIR predicted values. The overall results show that the on-line detection method could be feasible in real application and would be of great value for monitoring the mixed decoction process of FSOS and other Chinese patent medicines.

  6. Metabolomic profiling of doxycycline treatment in chronic obstructive pulmonary disease.

    PubMed

    Singh, Brajesh; Jana, Saikat K; Ghosh, Nilanjana; Das, Soumen K; Joshi, Mamata; Bhattacharyya, Parthasarathi; Chaudhury, Koel

    2017-01-05

    Serum metabolic profiling can identify the metabolites responsible for discrimination between doxycycline treated and untreated chronic obstructive pulmonary disease (COPD) and explain the possible effect of doxycycline in improving the disease conditions. 1H nuclear magnetic resonance (NMR)-based metabolomics was used to obtain serum metabolic profiles of 60 add-on doxycycline treated COPD patients and 40 patients receiving standard therapy. The acquired data were analyzed using multivariate principal component analysis (PCA), partial least-squares-discriminant analysis (PLS-DA), and orthogonal projection to latent structure with discriminant analysis (OPLS-DA). A clear metabolic differentiation was apparent between the pre and post doxycycline treated group. The distinguishing metabolites lactate and fatty acids were significantly down-regulated and formate, citrate, imidazole and l-arginine upregulated. Lactate and folate are further validated biochemically. Metabolic changes, such as decreased lactate level, inhibited arginase activity and lowered fatty acid level observed in COPD patients in response to add-on doxycycline treatment, reflect the anti-inflammatory action of the drug. Doxycycline as a possible therapeutic option for COPD seems promising. Copyright © 2016 Elsevier B.V. All rights reserved.

  7. Rapid analysis of sugars in honey by processing Raman spectrum using chemometric methods and artificial neural networks.

    PubMed

    Özbalci, Beril; Boyaci, İsmail Hakkı; Topcu, Ali; Kadılar, Cem; Tamer, Uğur

    2013-02-15

    The aim of this study was to quantify glucose, fructose, sucrose and maltose contents of honey samples using Raman spectroscopy as a rapid method. By performing a single measurement, quantifications of sugar contents have been said to be unaffordable according to the molecular similarities between sugar molecules in honey matrix. This bottleneck was overcome by coupling Raman spectroscopy with chemometric methods (principal component analysis (PCA) and partial least squares (PLS)) and an artificial neural network (ANN). Model solutions of four sugars were processed with PCA and significant separation was observed. This operation, done with the spectral features by using PLS and ANN methods, led to the discriminant analysis of sugar contents. Models/trained networks were created using a calibration data set and evaluated using a validation data set. The correlation coefficient values between actual and predicted values of glucose, fructose, sucrose and maltose were determined as 0.964, 0.965, 0.968 and 0.949 for PLS and 0.965, 0.965, 0.978 and 0.956 for ANN, respectively. The requirement of rapid analysis of sugar contents of commercial honeys has been met by the data processed within this article. Copyright © 2012 Elsevier Ltd. All rights reserved.

  8. A Cultural Diffusion Model for the Rise and Fall of Programming Languages.

    PubMed

    Valverde, Sergi; Solé, Ricard V

    2015-07-01

    Our interaction with complex computing machines is mediated by programming languages (PLs), which constitute one of the major innovations in the evolution of technology. PLs allow flexible, scalable, and fast use of hardware and are largely responsible for shaping the history of information technology since the rise of computers in the 1950s. The rapid growth and impact of computers were followed closely by the development of PLs. As occurs with natural, human languages, PLs have emerged and gone extinct. There has been always a diversity of coexisting PLs that compete somewhat while occupying special niches. Here we show that the statistical patterns of language adoption, rise, and fall can be accounted for by a simple model in which a set of programmers can use several PLs, decide to use existing PLs used by other programmers, or decide not to use them. Our results highlight the influence of strong communities of practice in the diffusion of PL innovations.

  9. Multi-analyte quantification in bioprocesses by Fourier-transform-infrared spectroscopy by partial least squares regression and multivariate curve resolution.

    PubMed

    Koch, Cosima; Posch, Andreas E; Goicoechea, Héctor C; Herwig, Christoph; Lendl, Bernhard

    2014-01-07

    This paper presents the quantification of Penicillin V and phenoxyacetic acid, a precursor, inline during Penicillium chrysogenum fermentations by FTIR spectroscopy and partial least squares (PLS) regression and multivariate curve resolution - alternating least squares (MCR-ALS). First, the applicability of an attenuated total reflection FTIR fiber optic probe was assessed offline by measuring standards of the analytes of interest and investigating matrix effects of the fermentation broth. Then measurements were performed inline during four fed-batch fermentations with online HPLC for the determination of Penicillin V and phenoxyacetic acid as reference analysis. PLS and MCR-ALS models were built using these data and validated by comparison of single analyte spectra with the selectivity ratio of the PLS models and the extracted spectral traces of the MCR-ALS models, respectively. The achieved root mean square errors of cross-validation for the PLS regressions were 0.22 g L(-1) for Penicillin V and 0.32 g L(-1) for phenoxyacetic acid and the root mean square errors of prediction for MCR-ALS were 0.23 g L(-1) for Penicillin V and 0.15 g L(-1) for phenoxyacetic acid. A general work-flow for building and assessing chemometric regression models for the quantification of multiple analytes in bioprocesses by FTIR spectroscopy is given. Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.

  10. Statistical variation in progressive scrambling

    NASA Astrophysics Data System (ADS)

    Clark, Robert D.; Fox, Peter C.

    2004-07-01

    The two methods most often used to evaluate the robustness and predictivity of partial least squares (PLS) models are cross-validation and response randomization. Both methods may be overly optimistic for data sets that contain redundant observations, however. The kinds of perturbation analysis widely used for evaluating model stability in the context of ordinary least squares regression are only applicable when the descriptors are independent of each other and errors are independent and normally distributed; neither assumption holds for QSAR in general and for PLS in particular. Progressive scrambling is a novel, non-parametric approach to perturbing models in the response space in a way that does not disturb the underlying covariance structure of the data. Here, we introduce adjustments for two of the characteristic values produced by a progressive scrambling analysis - the deprecated predictivity ($Q_s^{*2}$) and standard error of prediction ($SDEP_s^*$) - that correct for the effect of introduced perturbation. We also explore the statistical behavior of the adjusted values ($Q_0^{*2}$ and $SDEP_0^*$) and the sensitivity to perturbation ($dq^2/dr_{yy'}^2$). It is shown that the three statistics are all robust for stable PLS models, in terms of the stochastic component of their determination and of their variation due to sampling effects involved in training set selection.

  11. Evaluation of saffron (Crocus sativus L.) adulteration with plant adulterants by (1)H NMR metabolite fingerprinting.

    PubMed

    Petrakis, Eleftherios A; Cagliani, Laura R; Polissiou, Moschos G; Consonni, Roberto

    2015-04-15

    In the present work, a preliminary study for the detection of adulterated saffron and the identification of the adulterant used by means of (1)H NMR and chemometrics is reported. Authentic Greek saffron and four typical plant-derived materials utilised as bulking agents in saffron, i.e., Crocus sativus stamens, safflower, turmeric, and gardenia were investigated. A two-step approach, relying on the application of both OPLS-DA and O2PLS-DA models to the (1)H NMR data, was adopted to perform authentication and prediction of authentic and adulterated saffron. Taking into account the deficiency of established methodologies to detect saffron adulteration with plant adulterants, the method developed proved reliable in assessing the type of adulteration and could be viable for dealing with extensive saffron frauds at a minimum level of 20% (w/w). Copyright © 2014 Elsevier Ltd. All rights reserved.

  12. Evaluation of metabolites extraction strategies for identifying different brain regions and their relationship with alcohol preference and gender difference using NMR metabolomics.

    PubMed

    Wang, Jie; Zeng, Hao-Long; Du, Hongying; Liu, Zeyuan; Cheng, Ji; Liu, Taotao; Hu, Ting; Kamal, Ghulam Mustafa; Li, Xihai; Liu, Huili; Xu, Fuqiang

    2018-03-01

    Metabolomics generates a profile of small molecules from cellular/tissue metabolism, which could directly reflect the mechanisms of complex networks of biochemical reactions. Traditional metabolomics methods, such as OPLS-DA and PLS-DA, are mainly used for binary class discrimination. Multiple groups are always involved in the biological system, especially for brain research. Multiple brain regions are involved in the neuronal study of brain metabolic dysfunctions such as alcoholism, Alzheimer's disease, etc. In the current study, 10 different brain regions were utilized for comparative studies between alcohol preferring and non-preferring rats, and between male and female rats, respectively. As many classes are involved (ten different regions and four types of animals), traditional metabolomics methods are no longer efficient for showing differentiation. Here, a novel strategy based on the decision tree algorithm was employed for successfully constructing different classification models to screen out the major characteristics of ten brain regions at the same time. Subsequently, this method was also utilized to select the major effective brain regions related to alcohol preference and gender difference. Compared with the traditional multivariate statistical methods, the decision tree could construct acceptable and understandable classification models for multi-class data analysis. Therefore, the current technology could also be applied to other general metabolomics studies involving multi-class data. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. Kernel PLS-SVC for Linear and Nonlinear Discrimination

    NASA Technical Reports Server (NTRS)

    Rosipal, Roman; Trejo, Leonard J.; Matthews, Bryan

    2003-01-01

    A new methodology for discrimination is proposed. This is based on kernel orthonormalized partial least squares (PLS) dimensionality reduction of the original data space followed by support vector machines for classification. The close connection of orthonormalized PLS with Fisher's approach to linear discrimination, or equivalently with canonical correlation analysis, is described. This gives preference to the use of orthonormalized PLS over principal component analysis. Good behavior of the proposed method is demonstrated on 13 different benchmark data sets and on the real world problem of the classification of finger movement periods versus non-movement periods based on electroencephalogram.

  14. Indonesian palm civet coffee discrimination using UV-visible spectroscopy and several chemometrics methods

    NASA Astrophysics Data System (ADS)

    Yulia, M.; Suhandy, D.

    2017-05-01

    Indonesian palm civet coffee or kopi luwak (Indonesian words for coffee and palm civet) is well known as the world’s priciest and rarest coffee. To protect the authenticity of luwak coffee and protect consumer from luwak coffee adulteration, it is very important to develop a simple and inexpensive method to discriminate between civet and non-civet coffee. The discrimination between civet and non-civet coffee in ground roasted (powder) samples is very challenging since it is very difficult to distinguish between the two by using conventional method. In this research, the use of UV-Visible spectra combined with two chemometric methods, SIMCA and PLS-DA, was evaluated to discriminate civet and non-civet ground coffee samples. The spectral data of civet and non-civet coffee were acquired using UV-Vis spectrometer (Genesys™ 10S UV-Vis, Thermo Scientific, USA). The result shows that using both supervised discrimination methods: SIMCA and PLS-DA, all samples were correctly classified into their corresponding classes with 100% rate for accuracy, sensitivity and specificity, respectively.
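
    PLS-DA for a two-class problem such as civet versus non-civet coffee is commonly set up by regressing a dummy-coded class label (0 or 1) on the spectra with PLS and thresholding the predicted value. The abstract does not state the software or settings used, so the following is only a minimal sketch of that idea using a NIPALS-style PLS1 in plain Python; the function names, the 0.5 threshold and the toy "spectra" are all assumptions made for illustration.

```python
import math

def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def fit_pls1(X, y, n_components):
    """Fit PLS1 (single response) by NIPALS-style deflation on centered data."""
    n, m = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(m)]
    y_mean = sum(y) / n
    E = [[row[j] - x_mean[j] for j in range(m)] for row in X]  # deflated X
    f = [v - y_mean for v in y]                                # deflated y
    components = []
    for _ in range(n_components):
        w = [_dot([E[i][j] for i in range(n)], f) for j in range(m)]
        norm = math.sqrt(_dot(w, w))
        if norm < 1e-12:
            break  # nothing left in X that covaries with y
        w = [v / norm for v in w]               # weight vector
        t = [_dot(row, w) for row in E]         # scores
        tt = _dot(t, t)
        p = [_dot([E[i][j] for i in range(n)], t) / tt for j in range(m)]  # X loadings
        q = _dot(f, t) / tt                     # y loading
        E = [[E[i][j] - t[i] * p[j] for j in range(m)] for i in range(n)]
        f = [f[i] - q * t[i] for i in range(n)]
        components.append((w, p, q))
    return {"x_mean": x_mean, "y_mean": y_mean, "components": components}

def predict_pls1(model, x):
    """Predict the response for one sample by repeating the deflation steps."""
    e = [v - mu for v, mu in zip(x, model["x_mean"])]
    y_hat = model["y_mean"]
    for w, p, q in model["components"]:
        t = _dot(e, w)
        y_hat += q * t
        e = [ev - t * pv for ev, pv in zip(e, p)]
    return y_hat

def classify(model, x, threshold=0.5):
    """Assign class 1 when the predicted dummy variable reaches the threshold."""
    return 1 if predict_pls1(model, x) >= threshold else 0

# Toy four-variable "spectra": class 0 and class 1 differ in the last two variables
X_train = [[1.0, 2.0, 1.0, 0.5], [1.1, 2.1, 0.9, 0.6], [0.9, 1.9, 1.1, 0.4],
           [1.0, 2.0, 2.0, 1.5], [1.2, 2.2, 2.1, 1.4], [0.8, 1.8, 1.9, 1.6]]
y_train = [0, 0, 0, 1, 1, 1]
model = fit_pls1(X_train, y_train, n_components=1)
train_labels = [classify(model, x) for x in X_train]
```

    On real spectra the number of components would normally be chosen by cross-validation, and accuracy, sensitivity and specificity would be reported on samples not used for fitting.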

  15. The use of chemometrics to study multifunctional indole alkaloids from Psychotria nemorosa (Palicourea comb. nov.). Part II: Indication of peaks related to the inhibition of butyrylcholinesterase and monoamine oxidase-A.

    PubMed

    Klein-Júnior, Luiz C; Viaene, Johan; Tuenter, Emmy; Salton, Juliana; Gasper, André L; Apers, Sandra; Andries, Jan P M; Pieters, Luc; Henriques, Amélia T; Vander Heyden, Yvan

    2016-09-09

    Psychotria nemorosa is chemically characterized by indole alkaloids and displays significant inhibitory activity on butyrylcholinesterase (BChE) and monoamine oxidase-A (MAO-A), both enzymes related to neurodegenerative disorders. In the present study, 43 samples of P. nemorosa leaves were extracted and fractionated in accordance with previously optimized methods (see Part I). These fractions were analyzed by means of UPLC-DAD and assayed for their BChE and MAO-A inhibitory potencies. The chromatographic fingerprint data were first aligned using correlation optimized warping, and Principal Component Analysis was performed to explore the data structure. Multivariate calibration techniques, namely Partial Least Squares (PLS1), PLS2 and Orthogonal Projections to Latent Structure (O-PLS1), were evaluated for modelling the activities as a function of the fingerprints. Since the best results were obtained with the O-PLS1 model (RMSECV = 9.3 and 3.3 for BChE and MAO-A, respectively), the regression coefficients of the model were analyzed and plotted relative to the original fingerprints. Four peaks were indicated as multifunctional compounds, with the capacity to impair both BChE and MAO-A activities. In order to confirm these results, a semi-prep HPLC technique was used and a fraction containing the four peaks was purified and evaluated in vitro. It was observed that the fraction exhibited an IC50 of 2.12 μg mL(-1) for BChE and 1.07 μg mL(-1) for MAO-A. These results reinforce the prediction obtained by O-PLS1 modelling. Copyright © 2016 Elsevier B.V. All rights reserved.

  16. Discrimination of tomatoes bred by spaceflight mutagenesis using visible/near infrared spectroscopy and chemometrics.

    PubMed

    Shao, Yongni; Xie, Chuanqi; Jiang, Linjun; Shi, Jiahui; Zhu, Jiajin; He, Yong

    2015-04-05

    Visible/near infrared spectroscopy (Vis/NIR) based on sensitive wavelengths (SWs) and chemometrics was proposed to discriminate different tomatoes bred by spaceflight mutagenesis from their leaves or fruits (green or mature). The tomato breeds were mutant M1, M2 and their parent. Partial least squares (PLS) analysis and least squares-support vector machine (LS-SVM) were implemented for calibration models. PLS analysis was implemented for calibration models with different wavebands including the visible region (400-700 nm) and the near infrared region (700-1000 nm). The best PLS models were achieved in the visible region for the leaf and green fruit samples and in the near infrared region for the mature fruit samples. Furthermore, different latent variables (4-8 LVs for leaves, 5-9 LVs for green fruits, and 4-9 LVs for mature fruits) were used as inputs of LS-SVM to develop the LV-LS-SVM models with the grid search technique and radial basis function (RBF) kernel. The optimal LV-LS-SVM models were achieved with six LVs for the leaf samples, seven LVs for green fruits, and six LVs for mature fruits, respectively, and they outperformed the PLS models. Moreover, independent component analysis (ICA) was executed to select several SWs based on loading weights. The optimal LS-SVM model was achieved with SWs of 550-560 nm, 562-574 nm, 670-680 nm and 705-715 nm for the leaf samples; 548-556 nm, 559-564 nm, 678-685 nm and 962-974 nm for the green fruit samples; and 712-718 nm, 720-729 nm, 968-978 nm and 820-830 nm for the mature fruit samples. All of them had better performance than PLS and LV-LS-SVM, with the parameters of correlation coefficient (rp), root mean square error of prediction (RMSEP) and bias of 0.9792, 0.2632 and 0.0901 based on leaf discrimination, 0.9837, 0.2783 and 0.1758 based on green fruit discrimination, 0.9804, 0.2215 and -0.0035 based on mature fruit discrimination, respectively. The overall results indicated that ICA was an effective way for the selection of SWs, and the Vis/NIR combined with LS-SVM models had the capability to predict the different breeds (mutant M1, mutant M2 and their parent) of tomatoes from leaves and fruits. Copyright © 2015 Elsevier B.V. All rights reserved.
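The three figures of merit quoted in this record (rp, RMSEP and bias) can be computed from paired reference and predicted values. The following is a minimal sketch using only the Python standard library; the function name `prediction_metrics` is hypothetical, and the sign convention for bias (predicted minus reference) is an assumption, since the abstract does not state one.

```python
import math

def prediction_metrics(y_true, y_pred):
    """Return (rp, rmsep, bias) for paired reference and predicted values."""
    n = len(y_true)
    if n == 0 or n != len(y_pred):
        raise ValueError("inputs must be non-empty and of equal length")
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    # Bias: mean signed residual (assumed convention: predicted minus reference).
    bias = sum(p - t for t, p in zip(y_true, y_pred)) / n
    # RMSEP: root of the mean squared residual over the prediction set.
    rmsep = math.sqrt(sum((p - t) ** 2 for t, p in zip(y_true, y_pred)) / n)
    # rp: Pearson correlation between reference and predicted values.
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    rp = cov / math.sqrt(var_t * var_p)
    return rp, rmsep, bias
```

A model can show a high rp and still have a nonzero bias, which is why the three values are usually reported together.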

  17. Evaluation of in-line Raman data for end-point determination of a coating process: Comparison of Science-Based Calibration, PLS-regression and univariate data analysis.

    PubMed

    Barimani, Shirin; Kleinebudde, Peter

    2017-10-01

    A multivariate analysis method, Science-Based Calibration (SBC), was used for the first time for endpoint determination of a tablet coating process using Raman data. Two types of tablet cores, placebo and caffeine cores, received a coating suspension comprising a polyvinyl alcohol-polyethylene glycol graft-copolymer and titanium dioxide to a maximum coating thickness of 80 µm. Raman spectroscopy was used as an in-line PAT tool. The spectra were acquired every minute and correlated to the amount of applied aqueous coating suspension. SBC was compared to another well-known multivariate analysis method, Partial Least Squares-regression (PLS), and a simpler approach, Univariate Data Analysis (UVDA). All developed calibration models had coefficient of determination values (R2) higher than 0.99. The coating endpoints could be predicted with root mean square errors (RMSEP) less than 3.1% of the applied coating suspensions. Compared to PLS and UVDA, SBC proved to be an alternative multivariate calibration method with high predictive power. Copyright © 2017 Elsevier B.V. All rights reserved.

  18. Calibration sets selection strategy for the construction of robust PLS models for prediction of biodiesel/diesel blends physico-chemical properties using NIR spectroscopy

    NASA Astrophysics Data System (ADS)

    Palou, Anna; Miró, Aira; Blanco, Marcelo; Larraz, Rafael; Gómez, José Francisco; Martínez, Teresa; González, Josep Maria; Alcalà, Manel

    2017-06-01

    Although the feasibility of using near infrared (NIR) spectroscopy combined with partial least squares (PLS) regression for prediction of physico-chemical properties of biodiesel/diesel blends has been widely demonstrated, including the whole variability of diesel samples from diverse production origins in the calibration sets remains an important challenge when constructing the models. This work presents a useful strategy for the systematic selection of calibration sets of samples of biodiesel/diesel blends from diverse origins, based on a binary code, principal components analysis (PCA) and the Kennard-Stone algorithm. Results show that using this methodology the models can keep their robustness over time. PLS calculations have been done using specialized chemometric software as well as the software of the NIR instrument installed in plant, and both produced RMSEP values under the reproducibility values of the reference methods. The models have been proven for on-line simultaneous determination of seven properties: density, cetane index, fatty acid methyl esters (FAME) content, cloud point, boiling point at 95% of recovery, flash point and sulphur.
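The Kennard-Stone selection step named in this record can be sketched as follows. This is a minimal standard-library illustration, not the authors' implementation: the function name `kennard_stone` is hypothetical, the inputs are assumed to be per-sample coordinate tuples (for example PCA scores), Euclidean distance is assumed, and the binary-code step of the published strategy is not reproduced.

```python
import math

def kennard_stone(samples, n_select):
    """Return indices of n_select samples chosen by the Kennard-Stone rule."""
    n = len(samples)
    if not 2 <= n_select <= n:
        raise ValueError("n_select must be between 2 and the number of samples")
    # Pairwise Euclidean distances between all samples.
    dist = [[math.dist(a, b) for b in samples] for a in samples]
    # Start with the two samples that are farthest apart.
    i, j = max(((a, b) for a in range(n) for b in range(a + 1, n)),
               key=lambda pair: dist[pair[0]][pair[1]])
    selected = [i, j]
    remaining = [k for k in range(n) if k not in selected]
    while len(selected) < n_select:
        # Add the candidate whose nearest already-selected sample is farthest away.
        best = max(remaining, key=lambda k: min(dist[k][s] for s in selected))
        selected.append(best)
        remaining.remove(best)
    return selected
```

The max-min rule spreads the chosen samples evenly over the occupied region of the score space, which is the property that makes the method attractive for building calibration sets.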

  19. Novel near-infrared spectrum analysis tool: Synergy adaptive moving window model based on immune clone algorithm.

    PubMed

    Wang, Shenghao; Zhang, Yuyan; Cao, Fuyi; Pei, Zhenying; Gao, Xuewei; Zhang, Xu; Zhao, Yong

    2018-02-13

    This paper presents a novel spectrum analysis tool named synergy adaptive moving window modeling based on immune clone algorithm (SA-MWM-ICA), motivated by the tedious and inconvenient labor involved in selecting pre-processing methods and spectral variables by prior experience. In this work, the immune clone algorithm is introduced into the spectrum analysis field for the first time as a new optimization strategy, addressing the shortcomings of the traditional methods. Based on the working principle of the human immune system, the performance of the quantitative model is regarded as the antigen, and a special vector corresponding to this antigen is regarded as the antibody. The antibody contains a pre-processing method optimization region which is created by 11 decimal digits, and a spectrum variable optimization region which is formed by some moving windows with changeable width and position. A set of original antibodies is created by modeling with this algorithm. After calculating the affinity of these antibodies, those with high affinity are selected for cloning. The rule for cloning is that the higher the affinity, the more copies are made. In the next step, another important operation named hyper-mutation is applied to the antibodies after cloning. The rule for hyper-mutation is that the lower the affinity, the higher the probability of mutation. Several antibodies with high affinity are created on the basis of these steps. Groups of simulated dataset, gasoline near-infrared spectra dataset, and soil near-infrared spectra dataset are employed to verify and illustrate the performance of SA-MWM-ICA. Analysis results show that the performance of the quantitative models adopted by SA-MWM-ICA is better, especially for structures with relatively complex spectra, than that of traditional models such as partial least squares (PLS), moving window PLS (MWPLS), genetic algorithm PLS (GAPLS), and pretreatment method classification and adjustable parameter changeable size moving window PLS (CA-CSMWPLS). The selected pre-processing methods and spectrum variables are easily explained. The proposed method converges in a few generations and can be used not only for near-infrared spectroscopy analysis but also for other similar spectral analysis, such as infrared spectroscopy. Copyright © 2017 Elsevier B.V. All rights reserved.

  20. Urine and plasma metabonomics coupled with UHPLC-QTOF/MS and multivariate data analysis on potential biomarkers in anemia and hematinic effects of herb pair Gui-Hong.

    PubMed

    Li, Shujiao; Lin, Hang; Qu, Cheng; Tang, Yuping; Shen, Juan; Li, Weixia; Yue, Shijun; Kai, Jun; Shang, Guanxiong; Zhu, Zhenhua; Zhang, Changbin; Liu, Pei; Yan, Hui; Zhang, Li; Qian, Li; Qian, Dawei; Duan, Jin-ao

    2015-07-21

    The compatibility of Angelicae Sinensis Radix (Danggui) and Carthami Flos (Honghua), a famous herb pair Gui-Hong, can produce synergistic and complementary hematinic effects. Our previous studies have indicated that Gui-Hong has therapeutic potential in the treatment of hemolytic and aplastic anemia (HAA). The present study aimed to investigate the hematinic effects of Danggui, Honghua and Gui-Hong on HAA rats induced by acetyl phenylhydrazine (APH) and cyclophosphamide (CP) and to explore the underlying hematinic regulation mechanisms. Rats were divided into 5 groups, and drugs were administered by oral gavage once a day for 7 consecutive days from the start of the experiment. Urine and plasma were analyzed by ultra-high-performance liquid chromatography coupled to quadrupole time-of-flight mass spectrometry (UHPLC-QTOF/MS). Partial least-squares discriminant analysis (PLS-DA) models were built to evaluate the therapeutic effects of Danggui, Honghua and Gui-Hong. Pearson correlation matrix analysis was used to discover the correlations between potential biomarkers and biochemical indicators of HAA rats. Seven potential biomarkers contributing to the separation of the model group and the control group were tentatively identified. The levels of l-kynurenine, phenylalanine, nicotinic acid and sphingosine increased significantly (P<0.05) in HAA rats, while the levels of l-isoleucine, l-tyrosine and serotonin decreased significantly (P<0.05) in comparison with control rats. Those endogenous metabolites were chiefly involved in phenylalanine, tyrosine and tryptophan biosynthesis, valine, leucine and isoleucine biosynthesis, tryptophan metabolism and tyrosine metabolism. The metabolic deviations could be regulated closer to normal level after Danggui, Honghua and Gui-Hong intervention. In terms of hematinic effects, Gui-Hong was the most effective as shown by the relative distance in PLS-DA score plots and relative intensity of potential biomarkers. The result reflected the synergic action between Danggui and Honghua. The above results were found to be reasonable in explaining the hematinic effects mechanism of Gui-Hong. The results of routine blood, urinary metabolic pattern and plasma metabolic pattern show that the Danggui, Honghua and Gui-Hong groups moved toward the control group and that HAA was being prevented and alleviated. The effect of the Gui-Hong group is more remarkable than that of the Danggui and Honghua groups. Some potential biomarkers like l-kynurenine, phenylalanine, l-isoleucine, l-tyrosine, serotonin, nicotinic acid and sphingosine have been found and identified. The work shows that the metabonomics method is a promising tool in the efficacy and mechanism research of traditional Chinese medicines. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  1. Variable selection in near-infrared spectroscopy: benchmarking of feature selection methods on biodiesel data.

    PubMed

    Balabin, Roman M; Smirnov, Sergey V

    2011-04-29

    During the past several years, near-infrared (near-IR/NIR) spectroscopy has increasingly been adopted as an analytical tool in various fields from petroleum to biomedical sectors. The NIR spectrum (above 4000 cm(-1)) of a sample is typically measured by modern instruments at a few hundred wavelengths. Recently, considerable effort has been directed towards developing procedures to identify variables (wavelengths) that contribute useful information. Variable selection (VS) or feature selection, also called frequency selection or wavelength selection, is a critical step in data analysis for vibrational spectroscopy (infrared, Raman, or NIRS). In this paper, we compare the performance of 16 different feature selection methods for the prediction of properties of biodiesel fuel, including density, viscosity, methanol content, and water concentration. The feature selection algorithms tested include stepwise multiple linear regression (MLR-step), interval partial least squares regression (iPLS), backward iPLS (BiPLS), forward iPLS (FiPLS), moving window partial least squares regression (MWPLS), (modified) changeable size moving window partial least squares (CSMWPLS/MCSMWPLSR), searching combination moving window partial least squares (SCMWPLS), successive projections algorithm (SPA), uninformative variable elimination (UVE, including UVE-SPA), simulated annealing (SA), back-propagation artificial neural networks (BP-ANN), Kohonen artificial neural network (K-ANN), and genetic algorithms (GAs, including GA-iPLS). Two linear techniques for calibration model building, namely multiple linear regression (MLR) and partial least squares regression/projection to latent structures (PLS/PLSR), are used for the evaluation of biofuel properties. A comparison with a non-linear calibration model, artificial neural networks (ANN-MLP), is also provided. Discussion of gasoline, ethanol-gasoline (bioethanol), and diesel fuel data is presented. The results of other spectroscopic techniques application, such as Raman, ultraviolet-visible (UV-vis), or nuclear magnetic resonance (NMR) spectroscopies, can be greatly improved by an appropriate feature selection choice. Copyright © 2011 Elsevier B.V. All rights reserved.

  2. Baseline correction combined partial least squares algorithm and its application in on-line Fourier transform infrared quantitative analysis.

    PubMed

    Peng, Jiangtao; Peng, Silong; Xie, Qiong; Wei, Jiping

    2011-04-01

    In order to eliminate the lower order polynomial interferences, a new quantitative calibration algorithm "Baseline Correction Combined Partial Least Squares (BCC-PLS)", which combines baseline correction and conventional PLS, is proposed. By embedding baseline correction constraints into PLS weights selection, the proposed calibration algorithm overcomes the uncertainty in baseline correction and can meet the requirement of on-line attenuated total reflectance Fourier transform infrared (ATR-FTIR) quantitative analysis. The effectiveness of the algorithm is evaluated by the analysis of glucose and marzipan ATR-FTIR spectra. BCC-PLS algorithm shows improved prediction performance over PLS. The root mean square error of cross-validation (RMSECV) on marzipan spectra for the prediction of the moisture is found to be 0.53%, w/w (range 7-19%). The sugar content is predicted with a RMSECV of 2.04%, w/w (range 33-68%). Copyright © 2011 Elsevier B.V. All rights reserved.
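The RMSECV figure reported in this record is a cross-validated error. The sketch below shows only the cross-validation bookkeeping, using the Python standard library; as a loudly stated assumption, a univariate least-squares line (`fit_line`, a hypothetical helper) stands in for the PLS or BCC-PLS calibration model, which is not implemented here, and the interleaved fold layout is also an assumption.

```python
import math

def fit_line(xs, ys):
    """Ordinary least-squares line; a stand-in for a PLS calibration model."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return slope, my - slope * mx

def rmsecv(xs, ys, n_folds):
    """Root mean square error of cross-validation with interleaved folds."""
    n = len(xs)
    squared_errors = []
    for fold in range(n_folds):
        # Every n_folds-th sample, starting at `fold`, is held out.
        test_idx = set(range(fold, n, n_folds))
        train_x = [x for i, x in enumerate(xs) if i not in test_idx]
        train_y = [y for i, y in enumerate(ys) if i not in test_idx]
        slope, intercept = fit_line(train_x, train_y)
        # Each held-out sample is predicted by a model that never saw it.
        for i in test_idx:
            squared_errors.append((ys[i] - (slope * xs[i] + intercept)) ** 2)
    return math.sqrt(sum(squared_errors) / n)
```

Setting `n_folds` equal to the number of samples gives leave-one-out cross-validation.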

  3. Rapid analysis of glucose, fructose, sucrose, and maltose in honeys from different geographic regions using fourier transform infrared spectroscopy and multivariate analysis.

    PubMed

    Wang, Jun; Kliks, Michael M; Jun, Soojin; Jackson, Mel; Li, Qing X

    2010-03-01

    Quantitative analysis of glucose, fructose, sucrose, and maltose in different geographic origin honey samples in the world using the Fourier transform infrared (FTIR) spectroscopy and chemometrics such as partial least squares (PLS) and principal component regression was studied. The calibration series consisted of 45 standard mixtures, which were made up of glucose, fructose, sucrose, and maltose. There were distinct peak variations of all sugar mixtures in the spectral "fingerprint" region between 1500 and 800 cm(-1). The calibration model was successfully validated using 7 synthetic blend sets of sugars. The PLS 2nd-derivative model showed the highest degree of prediction accuracy with a highest R(2) value of 0.999. Along with the canonical variate analysis, the calibration model further validated by high-performance liquid chromatography measurements for commercial honey samples demonstrates that FTIR can qualitatively and quantitatively determine the presence of glucose, fructose, sucrose, and maltose in multiple regional honey samples.

  4. Identification and topographical characterisation of microbial nanowires in Nostoc punctiforme.

    PubMed

    Sure, Sandeep; Torriero, Angel A J; Gaur, Aditya; Li, Lu Hua; Chen, Ying; Tripathi, Chandrakant; Adholeya, Alok; Ackland, M Leigh; Kochar, Mandira

    2016-03-01

    Extracellular pili-like structures (PLS) produced by cyanobacteria have been poorly explored. We have done detailed topographical and electrical characterisation of PLS in Nostoc punctiforme PCC 73120 using transmission electron microscopy (TEM) and conductive atomic force microscopy (CAFM). TEM analysis showed that N. punctiforme produces two separate types of PLS differing in their length and diameter. The first type of PLS are 6-7.5 nm in diameter and 0.5-2 µm in length (short/thin PLS) while the second type of PLS are ~20-40 nm in diameter and more than 10 µm long (long/thick PLS). This is the first study to report long/thick PLS in N. punctiforme. Electrical characterisation of these two different PLS by CAFM showed that both are electrically conductive and can act as microbial nanowires. This is the first report to show two distinct PLS and also identifies microbial nanowires in N. punctiforme. This study paves the way for more detailed investigation of N. punctiforme nanowires and their potential role in cell physiology and symbiosis with plants.

  5. Phytochemical diversity of cranberry (Vaccinium macrocarpon Aiton) cultivars by anthocyanin determination and metabolomic profiling with chemometric analysis.

    PubMed

    Brown, Paula N; Murch, Susan J; Shipley, Paul

    2012-01-11

    Originally native to the eastern United States, American cranberry (Vaccinium macrocarpon Aiton, family Ericaceae) cultivation of native and hybrid varieties has spread across North America. Herein is reported the phytochemical diversity of five cranberry cultivars (Stevens, Ben Lear, Bergman, Pilgrim, and GH1) collected in the Greater Vancouver Regional District, by anthocyanin content and UPLC-TOF-MS metabolomic profiling. The anthocyanin content for biological replicates (n = 5) was determined as 7.98 ± 5.83, Ben Lear; 7.02 ± 1.75, Bergman; 6.05 ± 2.51, GH1; 3.28 ± 1.88, Pilgrim; and 2.81 ± 0.81, Stevens. Using subtractive metabonomic algorithms, 6481 compounds were found conserved across all varietals, with 136 (Ben Lear), 84 (Bergman), 91 (GH1), 128 (Pilgrim), and 165 (Stevens) unique compounds observed. Principal component analysis (PCA) did not differentiate varieties, whereas partial least-squares discriminant analysis (PLS-DA) exhibited clustering patterns. Univariate statistical approaches were applied to the data set, establishing significance of values and assessing quality of the models. Metabolomic profiling with chemometric analysis proved to be useful for characterizing metabonomic changes across cranberry varieties.

  6. Total sulfur determination in residues of crude oil distillation using FT-IR/ATR and variable selection methods

    NASA Astrophysics Data System (ADS)

    Müller, Aline Lima Hermes; Picoloto, Rochele Sogari; Mello, Paola de Azevedo; Ferrão, Marco Flores; dos Santos, Maria de Fátima Pereira; Guimarães, Regina Célia Lourenço; Müller, Edson Irineu; Flores, Erico Marlon Moraes

    2012-04-01

    Total sulfur concentration was determined in atmospheric residue (AR) and vacuum residue (VR) samples obtained from the petroleum distillation process by Fourier transform infrared spectroscopy with attenuated total reflectance (FT-IR/ATR) in association with chemometric methods. Calibration and prediction sets consisted of 40 and 20 samples, respectively. Calibration models were developed using two variable selection methods: interval partial least squares (iPLS) and synergy interval partial least squares (siPLS). Different treatments and pre-processing steps were also evaluated for the development of models. The pre-treatment based on multiplicative scatter correction (MSC) and the mean centered data were selected for model construction. The use of siPLS as the variable selection method provided a model with root mean square error of prediction (RMSEP) values significantly better than those obtained by the PLS model using all variables. The best model was obtained using the siPLS algorithm with spectra divided into 20 intervals and combinations of 3 intervals (911-824, 823-736 and 737-650 cm-1). This model produced an RMSECV of 400 mg kg-1 S and an RMSEP of 420 mg kg-1 S, showing a correlation coefficient of 0.990.
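The search space behind the siPLS step in this record (spectra divided into 20 intervals, combinations of 3 intervals) can be illustrated with a short enumeration. This is a sketch under stated assumptions: the helper names `split_intervals` and `candidate_subsets` are hypothetical, intervals are taken as contiguous and near-equal in width, and the PLS model that would be fitted and scored on each candidate subset is not shown.

```python
from itertools import combinations

def split_intervals(n_variables, n_intervals):
    """Split variable indices 0..n_variables-1 into contiguous, near-equal intervals."""
    base, extra = divmod(n_variables, n_intervals)
    intervals, start = [], 0
    for i in range(n_intervals):
        # The first `extra` intervals get one additional variable.
        size = base + (1 if i < extra else 0)
        intervals.append(range(start, start + size))
        start += size
    return intervals

def candidate_subsets(intervals, k):
    """Yield each combination of k intervals with its flat list of variable indices."""
    for combo in combinations(range(len(intervals)), k):
        yield combo, [idx for i in combo for idx in intervals[i]]
```

With 20 intervals taken 3 at a time there are 1140 candidate subsets, each of which would be evaluated by cross-validation before the subset with the lowest error is kept.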

  7. [Rapid determination of COD in aquaculture water based on LS-SVM with ultraviolet/visible spectroscopy].

    PubMed

    Liu, Xue-Mei; Zhang, Hai-Liang

    2014-10-01

    Ultraviolet/visible (UV/Vis) spectroscopy was studied for the rapid determination of chemical oxygen demand (COD), which is an indicator of the concentration of organic matter in aquaculture water. In order to reduce the influence of the absolute noises of the spectra, the extracted 135 absorbance spectra were preprocessed by Savitzky-Golay smoothing (SG), EMD, and wavelet transform (WT) methods. The preprocessed spectra were then used to select latent variables (LVs) by partial least squares (PLS) methods. PLS was used to build models with the full spectra, and back-propagation neural network (BPNN) and least square support vector machine (LS-SVM) were applied to build models with the selected LVs. The overall results showed that BPNN and LS-SVM models performed better than PLS models, and the LS-SVM models with LVs based on WT preprocessed spectra obtained the best results, with the determination coefficient (r2) and RMSE being 0.83 and 14.78 mg·L(-1) for the calibration set, and 0.82 and 14.82 mg·L(-1) for the prediction set, respectively. The results indicated that UV/Vis spectroscopy with LVs obtained by the PLS method, combined with LS-SVM calibration, could be applied to the rapid and accurate determination of COD in aquaculture water. Moreover, this study laid the foundation for further implementation of online analysis of aquaculture water and rapid determination of other water quality parameters.

  8. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Coble, Jamie; Orton, Christopher; Schwantes, Jon

    The Multi-Isotope Process (MIP) Monitor provides an efficient approach to monitoring the process conditions in used nuclear fuel reprocessing facilities to support process verification and validation. The MIP Monitor applies multivariate analysis to gamma spectroscopy of reprocessing streams in order to detect small changes in the gamma spectrum, which may indicate changes in process conditions. This research extends the MIP Monitor by characterizing a used fuel sample after initial dissolution according to the type of reactor of origin (pressurized or boiling water reactor), initial enrichment, burn up, and cooling time. Simulated gamma spectra were used to develop and test three fuel characterization algorithms. The classification and estimation models employed are based on the partial least squares regression (PLS) algorithm. A PLS discriminant analysis model was developed which perfectly classified reactor type. Locally weighted PLS models were fitted on-the-fly to estimate continuous fuel characteristics. Burn up was predicted within 0.1% root mean squared percent error (RMSPE) and both cooling time and initial enrichment within approximately 2% RMSPE. This automated fuel characterization can be used to independently verify operator declarations of used fuel characteristics and inform the MIP Monitor anomaly detection routines at later stages of the fuel reprocessing stream to improve sensitivity to changes in operational parameters and material diversions.

  9. Identification of Pulmonary Edema in Forensic Autopsy Cases of Sudden Cardiac Death Using Fourier Transform Infrared Microspectroscopy: A Pilot Study.

    PubMed

    Lin, Hancheng; Luo, Yiwen; Sun, Qiran; Zhang, Ji; Tuo, Ya; Zhang, Zhong; Wang, Lei; Deng, Kaifei; Chen, Yijiu; Huang, Ping; Wang, Zhenyuan

    2018-02-20

    Many studies have proven the usefulness of biofluid-based infrared spectroscopy in the clinical domain for diagnosis and monitoring the progression of diseases. Here we present a state-of-the-art study in the forensic field that employed Fourier transform infrared microspectroscopy for postmortem diagnosis of sudden cardiac death (SCD) by in situ biochemical investigation of alveolar edema fluid in lung tissue sections. The results of amide-related spectral absorbance analysis demonstrated that the pulmonary edema fluid of the SCD group was richer in protein components than that of the neurologic catastrophe (NC) and lethal multiple injuries (LMI) groups. The complementary results of unsupervised principal component analysis (PCA) and genetic algorithm-guided partial least-squares discriminant analysis (GA-PLS-DA) further indicated different global spectral band patterns of pulmonary edema fluids between these three groups. Ultimately, a random forest (RF) classification model for postmortem diagnosis of SCD was built and achieved good sensitivity and specificity scores of 97.3% and 95.5%, respectively. Classification predictions of unknown pulmonary edema fluid collected from 16 cases were also performed by the model, resulting in 100% correct discrimination. This pilot study demonstrates that FTIR microspectroscopy in combination with chemometrics has the potential to be an effective aid for postmortem diagnosis of SCD.
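Sensitivity and specificity, the two scores quoted for the classifier in this record, follow directly from counts of correct and incorrect calls for one target class. The sketch below uses only the Python standard library; the function name `sensitivity_specificity` is hypothetical, and any labels passed to it in an example are illustrative placeholders rather than data from the study.

```python
def sensitivity_specificity(y_true, y_pred, positive):
    """Return (sensitivity, specificity) treating `positive` as the target class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)  # true positives
    fn = sum(t == positive and p != positive for t, p in pairs)  # missed positives
    tn = sum(t != positive and p != positive for t, p in pairs)  # true negatives
    fp = sum(t != positive and p == positive for t, p in pairs)  # false alarms
    # Sensitivity: share of true positives found; specificity: share of negatives rejected.
    return tp / (tp + fn), tn / (tn + fp)
```

For a multi-class problem the same function can be called once per class, with all other classes pooled as negatives.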

  10. Linear and nonlinear methods in modeling the aqueous solubility of organic compounds.

    PubMed

    Catana, Cornel; Gao, Hua; Orrenius, Christian; Stouten, Pieter F W

    2005-01-01

    Solubility data for 930 diverse compounds have been analyzed using linear Partial Least Square (PLS) and nonlinear PLS methods, Continuum Regression (CR), and Neural Networks (NN). 1D and 2D descriptors from MOE package in combination with E-state or ISIS keys have been used. The best model was obtained using linear PLS for a combination between 22 MOE descriptors and 65 ISIS keys. It has a correlation coefficient (r2) of 0.935 and a root-mean-square error (RMSE) of 0.468 log molar solubility (log S(w)). The model validated on a test set of 177 compounds not included in the training set has r2 0.911 and RMSE 0.475 log S(w). The descriptors were ranked according to their importance, and at the top of the list have been found the 22 MOE descriptors. The CR model produced results as good as PLS, and because of the way in which cross-validation has been done it is expected to be a valuable tool in prediction besides PLS model. The statistics obtained using nonlinear methods did not surpass those got with linear ones. The good statistic obtained for linear PLS and CR recommends these models to be used in prediction when it is difficult or impossible to make experimental measurements, for virtual screening, combinatorial library design, and efficient leads optimization.

  11. Molecular descriptor subset selection in theoretical peptide quantitative structure-retention relationship model development using nature-inspired optimization algorithms.

    PubMed

    Žuvela, Petar; Liu, J Jay; Macur, Katarzyna; Bączek, Tomasz

    2015-10-06

    In this work, performance of five nature-inspired optimization algorithms, genetic algorithm (GA), particle swarm optimization (PSO), artificial bee colony (ABC), firefly algorithm (FA), and flower pollination algorithm (FPA), was compared in molecular descriptor selection for development of quantitative structure-retention relationship (QSRR) models for 83 peptides that originate from eight model proteins. The matrix with 423 descriptors was used as input, and QSRR models based on selected descriptors were built using partial least squares (PLS), whereas root mean square error of prediction (RMSEP) was used as a fitness function for their selection. Three performance criteria, prediction accuracy, computational cost, and the number of selected descriptors, were used to evaluate the developed QSRR models. The results show that all five variable selection methods outperform interval PLS (iPLS), sparse PLS (sPLS), and the full PLS model, whereas GA is superior because of its lowest computational cost and higher accuracy (RMSEP of 5.534%) with a smaller number of variables (nine descriptors). The GA-QSRR model was validated initially through Y-randomization. In addition, it was successfully validated with an external testing set out of 102 peptides originating from Bacillus subtilis proteomes (RMSEP of 22.030%). Its applicability domain was defined, from which it was evident that the developed GA-QSRR exhibited strong robustness. All the sources of the model's error were identified, thus allowing for further application of the developed methodology in proteomics.

  12. Laser-Induced Breakdown Spectroscopy for Rapid Discrimination of Heavy-Metal-Contaminated Seafood Tegillarca granosa

    PubMed Central

    Ji, Guoli; Ye, Pengchao; Shi, Yijian; Yuan, Leiming; Chen, Xiaojing; Yuan, Mingshun; Zhu, Dehua; Chen, Xi; Hu, Xinyu; Jiang, Jing

    2017-01-01

    Tegillarca granosa samples contaminated artificially by three kinds of toxic heavy metals including zinc (Zn), cadmium (Cd), and lead (Pb) were attempted to be distinguished using laser-induced breakdown spectroscopy (LIBS) technology and pattern recognition methods in this study. The measured spectra were firstly processed by a wavelet transform algorithm (WTA), then the generated characteristic information was subsequently expressed by an information gain algorithm (IGA). As a result, 30 variables obtained were used as input variables for three classifiers: partial least square discriminant analysis (PLS-DA), support vector machine (SVM), and random forest (RF), among which the RF model exhibited the best performance, with 93.3% discrimination accuracy among those classifiers. Besides, the extracted characteristic information was used to reconstruct the original spectra by inverse WTA, and the corresponding attribution of the reconstructed spectra was then discussed. This work indicates that the healthy shellfish samples of Tegillarca granosa could be distinguished from the toxic heavy-metal-contaminated ones by pattern recognition analysis combined with LIBS technology, which only requires minimal pretreatments. PMID:29149053

  13. Effect of management (organic vs conventional) on volatile profiles of six plum cultivars (Prunus salicina Lindl.). A chemometric approach for varietal classification and determination of potential markers.

    PubMed

    Cuevas, F J; Moreno-Rojas, J M; Arroyo, F; Daza, A; Ruiz-Moreno, M J

    2016-05-15

    The volatile profiles of six plum cultivars ('Laetitia', 'Primetime', 'Sapphire', 'Showtime', 'Songold' and 'Souvenir') produced under two management systems (conventional and organic) and harvested in two consecutive years were obtained by HS-SPME-GC-MS. Twenty-five metabolites were determined, five of which (pentanal, (E)-2-heptenal, 1-octanol, eucalyptol and 2-pentylfuran) are reported for the first time in Prunus salicina Lindl. Hexanal stood out as a major volatile compound affected by the management system. In addition, partial least square discriminant analysis (PLS-DA) achieved an effective classification of genotypes based on their volatile profiles. A high classification accuracy model was obtained with a sensitivity of 97.9% and a specificity of 99.6%. Furthermore, the application of a dual criterion, based on a method of variable selection, VIP (variable importance in projection) and the results of a univariate analysis (ANOVA), allowed the identification of potential volatile markers in 'Primetime', 'Showtime' and 'Souvenir' genotypes (cultivars not characterised to date). Copyright © 2015 Elsevier Ltd. All rights reserved.
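PLS-DA, as used for the genotype classification in this record, is commonly set up by dummy-coding the class labels into a 0/1 response matrix, regressing that matrix on the measurements with PLS, and assigning each sample to the class with the largest predicted response. The sketch below covers only the coding and assignment steps with the Python standard library; the helper names are hypothetical, the largest-response rule is one common convention rather than necessarily the one used in the study, and the PLS regression itself is not implemented.

```python
def dummy_code(labels):
    """Return (classes, Y) where Y has one 0/1 column per class."""
    classes = sorted(set(labels))
    y = [[1.0 if label == c else 0.0 for c in classes] for label in labels]
    return classes, y

def assign_classes(y_hat, classes):
    """Assign each sample to the class whose predicted column is largest."""
    return [classes[max(range(len(classes)), key=row.__getitem__)]
            for row in y_hat]
```

In use, `y_hat` would be the matrix of responses predicted by a PLS regression fitted to the dummy-coded `Y`.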

  14. Effects of Flavor and Texture on the Sensory Perception of Gouda-Type Cheese Varieties during Ripening Using Multivariate Analysis.

    PubMed

    Shiota, Makoto; Iwasawa, Ai; Suzuki-Iwashima, Ai; Iida, Fumiko

    2015-12-01

    The impact of flavor composition, texture, and other factors on the desirability of different commercial sources of Gouda-type cheese was investigated using multivariate analyses based on sensory and instrumental analyses. Volatile aroma compounds were measured using headspace solid-phase microextraction gas chromatography/mass spectrometry (GC/MS) and steam distillation extraction (SDE)-GC/MS, and fatty acid composition, low-molecular-weight compounds, including amino acids and organic acids, as well as pH, texture, and color were measured to determine their relationship with sensory perception. Orthogonal partial least squares-discriminant analysis (OPLS-DA) was performed to discriminate between 2 different ripening periods in 7 sample sets, revealing that ethanol, ethyl acetate, hexanoic acid, and octanoic acid increased with increasing sensory attribute scores for sweetness, fruity, and sulfurous. A partial least squares (PLS) regression model was constructed to predict the desirability of cheese using these parameters. Using these multivariate analyses, we showed that texture and buttery flavors are important factors affecting the desirability of Gouda-type cheeses for Japanese consumers. © 2015 Institute of Food Technologists®

  15. Menthol smokers: metabolomic profiling and smoking behavior

    PubMed Central

    Hsu, Ping-Ching; Lan, Renny S.; Brasky, Theodore M.; Marian, Catalin; Cheema, Amrita K.; Ressom, Habtom W.; Loffredo, Christopher A.; Pickworth, Wallace B.; Shields, Peter G.

    2016-01-01

    Background The use of menthol in cigarettes and marketing is under consideration for regulation by the FDA. However, the effects of menthol on smoking behavior and carcinogen exposure have been inconclusive. We previously reported metabolomic profiling for cigarette smokers, and newly identified a menthol-glucuronide (MG) as the most significant metabolite directly related to smoking. Here, MG is studied in relation to smoking behavior and metabolomic profiles. Methods A cross-sectional study of 105 smokers who smoked two cigarettes in the laboratory one hour apart. Blood nicotine, MG and exhaled carbon monoxide (CO) boosts were determined (the difference before and after smoking). Spearman's correlation, Chi-Square and ANCOVA adjusted for gender, race and cotinine levels for menthol smokers assessed the relationship of MG boost, smoking behavior, and metabolic profiles. Multivariate metabolite characterization using supervised Partial Least Squares-Discriminant Analysis (PLS-DA) was carried out for the classification of metabolomics profiles. Results MG boost was positively correlated with CO boost, nicotine boost, average puff volume, puff duration, and total smoke exposure. In classification using PLS-DA, MG was the top metabolite discriminating the metabolomes of menthol vs. non-menthol smokers. Among menthol smokers, forty-two metabolites were significantly correlated with MG boost, and these were linked to cellular functions such as cell death, survival, and movement. Conclusion Plasma MG boost is a new smoking behavior biomarker that may provide novel information over self-reported use of menthol cigarettes by integrating different smoking measures for understanding smoking behavior and harm of menthol cigarettes. Impacts These results provide insight into the biological effect of menthol smoking. PMID:27628308

  16. The Role of Safety Culture in Influencing Provider Perceptions of Patient Safety.

    PubMed

    Bishop, Andrea C; Boyle, Todd A

    2016-12-01

    To determine how provider perceptions of safety culture influence their involvement in patient safety practices. Health-care providers were surveyed in 2 tertiary hospitals located in Atlantic Canada, composed of 4 units in total. The partial least squares (PLS) approach to structural equation modeling was used to analyze the data. The latent-variable provider PLS model encompassed the hypothesized relationships between provider characteristics, safety culture, perceptions of patient safety practices, and actual performance of patient safety practices, using the Health Belief Model (HBM) as a guide. Data analysis was conducted using SmartPLS. A total of 113 health-care providers completed a survey out of an eligible 318, representing a response rate of 35.5%. The final PLS model showed acceptable internal consistency with all four latent variables having a composite reliability score above the recommended 0.70 cutoff value (safety culture = 0.86, threat = 0.76, expectations = 0.83, PS practices = 0.75). Discriminant validity was established, and all path coefficients were found to be significant at the α = 0.05 level using nonparametric bootstrapping. The survey results show that safety culture accounted for 34% of the variance in perceptions of threat and 42% of the variance in expectations. This research supports the role that safety culture plays in the promotion and maintenance of patient safety activities for health-care providers. As such, it is recommended that the introduction of new patient safety strategies follow a thorough exploration of an organization's safety culture.
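
    The composite reliability scores reported above are conventionally computed from the standardized indicator loadings of each latent variable. The sketch below uses the usual formula, (sum of loadings)^2 divided by that quantity plus the summed error variances (1 - loading^2); the loadings are hypothetical, since the abstract does not report them.

```python
def composite_reliability(loadings):
    """Composite reliability from standardized indicator loadings.

    CR = (sum of loadings)^2 / ((sum of loadings)^2 + sum of (1 - loading^2))
    """
    squared_sum = sum(loadings) ** 2
    error_variance = sum(1.0 - lam ** 2 for lam in loadings)
    return squared_sum / (squared_sum + error_variance)


# Hypothetical standardized loadings for one latent variable (not from the study).
example_loadings = [0.80, 0.70, 0.75]
cr = composite_reliability(example_loadings)
print(f"composite reliability = {cr:.3f}")
print("meets 0.70 cutoff:", cr > 0.70)
```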

  17. Improving Global Models of Remotely Sensed Ocean Chlorophyll Content Using Partial Least Squares and Geographically Weighted Regression

    NASA Astrophysics Data System (ADS)

    Gholizadeh, H.; Robeson, S. M.

    2015-12-01

    Empirical models have been widely used to estimate global chlorophyll content from remotely sensed data. Here, we focus on the standard NASA empirical models that use blue-green band ratios. These band ratio ocean color (OC) algorithms are in the form of fourth-order polynomials and the parameters of these polynomials (i.e. coefficients) are estimated from the NASA bio-Optical Marine Algorithm Data set (NOMAD). Most of the points in this data set have been sampled from tropical and temperate regions. However, polynomial coefficients obtained from this data set are used to estimate chlorophyll content in all ocean regions with different properties such as sea-surface temperature, salinity, and downwelling/upwelling patterns. Further, the polynomial terms in these models are highly correlated. In sum, the limitations of these empirical models are as follows: 1) the independent variables within the empirical models, in their current form, are correlated (multicollinear), and 2) current algorithms are global approaches and are based on the spatial stationarity assumption, so they are independent of location. The multicollinearity problem is resolved by using partial least squares (PLS). PLS, which transforms the data into a set of independent components, can be considered as a combined form of principal component regression (PCR) and multiple regression. Geographically weighted regression (GWR) is also used to investigate the validity of the spatial stationarity assumption. GWR solves a regression model over each sample point by using the observations within its neighbourhood. PLS results show that the empirical method underestimates chlorophyll content in high latitudes, including the Southern Ocean region, when compared to PLS (see Figure 1). Cluster analysis of GWR coefficients also shows that the spatial stationarity assumption in empirical models is unlikely to be valid.
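
    The multicollinearity among polynomial terms can be illustrated numerically. The sketch below assumes the predictor is R = log10(blue/green band ratio) and that R spans a hypothetical positive range of 0.1 to 1.0; both the range and the sample spacing are assumptions for illustration, not NOMAD values. It computes the Pearson correlation between R and its higher powers.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical log10 band ratios, evenly spaced from 0.1 to 1.0 (assumed range).
log_ratio = [0.1 + 0.05 * i for i in range(19)]

# Correlation of the linear term with each higher-order polynomial term.
term_correlations = {
    power: pearson(log_ratio, [r ** power for r in log_ratio])
    for power in (2, 3, 4)
}
for power, corr in term_correlations.items():
    print(f"corr(R, R^{power}) = {corr:.3f}")
```

    Strongly correlated columns like these make ordinary least squares coefficients unstable, which is the motivation the abstract gives for projecting onto PLS components instead.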

  18. Partial Least Squares for Discrimination in fMRI Data

    PubMed Central

    Andersen, Anders H.; Rayens, William S.; Liu, Yushu; Smith, Charles D.

    2011-01-01

    Multivariate methods for discrimination were used in the comparison of brain activation patterns between groups of cognitively normal women who are at either high or low Alzheimer's disease risk based on family history and apolipoprotein-E4 status. Linear discriminant analysis (LDA) was preceded by dimension reduction using either principal component analysis (PCA), partial least squares (PLS), or a new oriented partial least squares (OrPLS) method. The aim was to identify a spatial pattern of functionally connected brain regions that was differentially expressed by the risk groups and yielded optimal classification accuracy. Multivariate dimension reduction is required prior to LDA when the data contains more feature variables than there are observations on individual subjects. Whereas PCA has been commonly used to identify covariance patterns in neuroimaging data, this approach only identifies gross variability and is not capable of distinguishing among-groups from within-groups variability. PLS and OrPLS provide a more focused dimension reduction by incorporating information on class structure and therefore lead to more parsimonious models for discrimination. Performance was evaluated in terms of the cross-validated misclassification rates. The results support the potential of using fMRI as an imaging biomarker or diagnostic tool to discriminate individuals with disease or high risk. PMID:22227352

  19. Simultaneous determination of vitamin B12 and its derivatives using some of multivariate calibration 1 (MVC1) techniques

    NASA Astrophysics Data System (ADS)

    Samadi-Maybodi, Abdolraouf; Darzi, S. K. Hassani Nejad

    2008-10-01

    Resolution of binary mixtures of vitamin B12, methylcobalamin and B12 coenzyme with minimum sample pre-treatment and without analyte separation has been successfully achieved by methods of partial least squares algorithm with one dependent variable (PLS1), orthogonal signal correction/partial least squares (OSC/PLS), principal component regression (PCR) and hybrid linear analysis (HLA). Analytical data were obtained from UV-vis spectra. The UV-vis spectra of vitamin B12, methylcobalamin and B12 coenzyme were recorded under the same spectral conditions. A central composite design was used in the ranges of 10-80 mg L⁻¹ for vitamin B12 and methylcobalamin and 20-130 mg L⁻¹ for B12 coenzyme. Model refinement and validation were performed by cross-validation. The minimum root mean square error of prediction (RMSEP) was 2.26 mg L⁻¹ for vitamin B12 with PLS1, 1.33 mg L⁻¹ for methylcobalamin with OSC/PLS and 3.24 mg L⁻¹ for B12 coenzyme with HLA techniques. Figures of merit such as selectivity, sensitivity, analytical sensitivity and LOD were determined for the three compounds. The procedure was successfully applied to simultaneous determination of the three compounds in synthetic mixtures and in a pharmaceutical formulation.
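
    RMSEP, used above to compare the calibration methods, is the square root of the mean squared difference between reference and predicted concentrations in the prediction set. A minimal sketch follows; the concentration values are invented for illustration.

```python
import math


def rmsep(reference, predicted):
    """Root mean square error of prediction over paired values."""
    if len(reference) != len(predicted):
        raise ValueError("reference and predicted must have the same length")
    squared_errors = [(r - p) ** 2 for r, p in zip(reference, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical reference and predicted concentrations in mg/L (not study data).
reference_mg_per_l = [10.0, 20.0, 30.0, 40.0]
predicted_mg_per_l = [12.0, 19.0, 33.0, 38.0]

error = rmsep(reference_mg_per_l, predicted_mg_per_l)
print(f"RMSEP = {error:.2f} mg/L")
```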

  20. [Analyzing and modeling methods of near infrared spectroscopy for in-situ prediction of oil yield from oil shale].

    PubMed

    Liu, Jie; Zhang, Fu-Dong; Teng, Fei; Li, Jun; Wang, Zhi-Hong

    2014-10-01

    To detect the oil yield of oil shale in situ, modeling and analysis methods based on portable near infrared spectroscopy were studied using 66 rock core samples from the No. 2 well drilling of the Fuyu oil shale base in Jilin. With the developed portable spectrometer, spectra were acquired in 3 data formats (reflectance, absorbance and K-M function). Modeling and analysis experiments were performed to determine the optimum analysis model and method, using the same data pre-processing with 4 modeling data optimization methods (principal component-Mahalanobis distance (PCA-MD) for eliminating abnormal samples, uninformative variables elimination (UVE) for wavelength selection, and their combinations PCA-MD + UVE and UVE + PCA-MD) and 2 modeling methods (partial least square (PLS) and back propagation artificial neural network (BPANN)). The results show that the data format, the modeling data optimization method and the modeling method all affect the analysis precision of the model. Whether or not an optimization method is used, reflectance or K-M function is the proper spectrum format of the modeling database for both modeling methods. With two different modeling methods and four different data optimization methods, the model precisions obtained from the same modeling database differ. For the PLS modeling method, the PCA-MD and UVE + PCA-MD data optimization methods can improve the modeling precision of the database using the K-M function spectrum data format. For the BPANN modeling method, the UVE, UVE + PCA-MD and PCA-MD + UVE data optimization methods can improve the modeling precision of the database using any of the 3 spectrum data formats. In addition to using the reflectance spectra and PCA-MD data optimization method, modeling precision by the BPANN method is better than that by the PLS method. Modeling with reflectance spectra, the UVE optimization method and the BPANN modeling method gives the highest analysis precision, with a correlation coefficient (Rp) of 0.92 and a standard error of prediction (SEP) of 0.69%.

  1. Monitoring of beer fermentation based on hybrid electronic tongue.

    PubMed

    Kutyła-Olesiuk, Anna; Zaborowski, Michał; Prokaryn, Piotr; Ciosek, Patrycja

    2012-10-01

    Monitoring of biotechnological processes, including fermentation, is extremely important because of the rapidly occurring changes in the composition of the samples during production. In the case of beer, the analysis of physicochemical parameters allows for the determination of the stage of the fermentation process and the control of its possible perturbations. A sensor array composed of potentiometric and voltammetric sensors (a so-called hybrid Electronic Tongue, h-ET) can be used as a tool to control the beer production process. The aim of this study is to apply an electronic tongue system to distinguish samples obtained during alcoholic fermentation. The samples originate from a batch of homemade beer fermentation and from two stages of the process: fermentation reaction and maturation of beer. The applied sensor array consists of 10 miniaturized ion-selective electrodes (potentiometric ET) and silicon based 3-electrode voltammetric transducers (voltammetric ET). The obtained results were processed using Partial Least Squares (PLS) and Partial Least Squares-Discriminant Analysis (PLS-DA). For potentiometric data, voltammetric data, and combined potentiometric and voltammetric data, the classification ability was compared based on Root Mean Squared Error (RMSE), sensitivity, specificity, and coefficient F calculation. It is shown that, in contrast to the separately used techniques, the developed hybrid system allowed for a better characterization of the beer samples. Data fusion in the hybrid ET gives better results both in qualitative analysis (RMSE, specificity, sensitivity) and in quantitative analysis (RMSE, R(2), a, b). Copyright © 2012 Elsevier B.V. All rights reserved.

  2. QSAR Study of p56lck Protein Tyrosine Kinase Inhibitory Activity of Flavonoid Derivatives Using MLR and GA-PLS

    PubMed Central

    Fassihi, Afshin; Sabet, Razieh

    2008-01-01

    Quantitative relationships between molecular structure and p56lck protein tyrosine kinase inhibitory activity of 50 flavonoid derivatives are discovered by MLR and GA-PLS methods. Different QSAR models revealed that substituent electronic descriptors (SED) parameters have significant impact on protein tyrosine kinase inhibitory activity of the compounds. Between the two statistical methods employed, GA-PLS gave superior results. The resultant GA-PLS model had a high statistical quality (R2 = 0.74 and Q2 = 0.61) for predicting the activity of the inhibitors. The models proposed in the present work are more useful in describing QSAR of flavonoid derivatives as p56lck protein tyrosine kinase inhibitors than those provided previously. PMID:19325836

  3. Determination of rice syrup adulterant concentration in honey using three-dimensional fluorescence spectra and multivariate calibrations

    NASA Astrophysics Data System (ADS)

    Chen, Quansheng; Qi, Shuai; Li, Huanhuan; Han, Xiaoyan; Ouyang, Qin; Zhao, Jiewen

    2014-10-01

    To rapidly and efficiently detect the presence of adulterants in honey, the three-dimensional fluorescence spectroscopy (3DFS) technique was employed with the help of multivariate calibration. The 3D fluorescence spectra were compressed using characteristic extraction and principal component analysis (PCA). Then, partial least squares (PLS) and back propagation neural network (BP-ANN) algorithms were used for modeling. The model was optimized by cross validation, and its performance was evaluated according to the root mean square error of prediction (RMSEP) and correlation coefficient (R) in the prediction set. The results showed that the BP-ANN model was superior to the PLS models, and the optimum prediction results of the mixed group (sunflower ± longan ± buckwheat ± rape) model were achieved as follows: RMSEP = 0.0235 and R = 0.9787 in the prediction set. The study demonstrated that the 3D fluorescence spectroscopy technique combined with multivariate calibration has high potential in rapid, nondestructive, and accurate quantitative analysis of honey adulteration.

  4. Determination of total flavonoids content in fresh Ginkgo biloba leaf with different colors using near infrared spectroscopy

    NASA Astrophysics Data System (ADS)

    Shi, Ji-yong; Zou, Xiao-bo; Zhao, Jie-wen; Mel, Holmes; Wang, Kai-liang; Wang, Xue; Chen, Hong

    Total flavonoids content is often considered an important quality index of Ginkgo biloba leaf. The feasibility of using near infrared (NIR) spectra in the wavenumber range of 10,000-4000 cm-1 for rapid and nondestructive determination of total flavonoids content in G. biloba leaf was investigated. 120 fresh G. biloba leaves in different colors (green, green-yellowish and yellow) were used for spectra acquisition and total flavonoids determination. Partial least squares (PLS), interval partial least squares (iPLS) and synergy interval partial least squares (SiPLS) were used to develop calibration models for total flavonoids content in leaves of two colors (green-yellowish and yellow) and of three colors (green, green-yellowish and yellow), respectively. Total flavonoids content increased in the order green, green-yellowish, yellow. Two characteristic wavenumber regions (5840-6090 cm-1 and 6620-6880 cm-1), which corresponded to the absorptions of two aromatic rings in the basic flavonoid structure, were selected by SiPLS. The optimal SiPLS model for total flavonoids content in the two-color leaves (r2 = 0.82, RMSEP = 2.62 mg g-1) performed better than the PLS and iPLS models. It could be concluded that NIR spectroscopy has significant potential in the nondestructive determination of total flavonoids content in fresh G. biloba leaf.

  5. Total sulfur determination in residues of crude oil distillation using FT-IR/ATR and variable selection methods.

    PubMed

    Müller, Aline Lima Hermes; Picoloto, Rochele Sogari; de Azevedo Mello, Paola; Ferrão, Marco Flores; de Fátima Pereira dos Santos, Maria; Guimarães, Regina Célia Lourenço; Müller, Edson Irineu; Flores, Erico Marlon Moraes

    2012-04-01

    Total sulfur concentration was determined in atmospheric residue (AR) and vacuum residue (VR) samples obtained from petroleum distillation process by Fourier transform infrared spectroscopy with attenuated total reflectance (FT-IR/ATR) in association with chemometric methods. Calibration and prediction set consisted of 40 and 20 samples, respectively. Calibration models were developed using two variable selection models: interval partial least squares (iPLS) and synergy interval partial least squares (siPLS). Different treatments and pre-processing steps were also evaluated for the development of models. The pre-treatment based on multiplicative scatter correction (MSC) and the mean centered data were selected for models construction. The use of siPLS as variable selection method provided a model with root mean square error of prediction (RMSEP) values significantly better than those obtained by PLS model using all variables. The best model was obtained using siPLS algorithm with spectra divided in 20 intervals and combinations of 3 intervals (911-824, 823-736 and 737-650 cm(-1)). This model produced a RMSECV of 400 mg kg(-1) S and RMSEP of 420 mg kg(-1) S, showing a correlation coefficient of 0.990. Copyright © 2011 Elsevier B.V. All rights reserved.
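
    The siPLS search space described above (20 intervals, combinations of 3) can be enumerated directly. The sketch below only splits the variable indices into contiguous intervals and lists the candidate interval combinations; in an actual siPLS run each combination would be fitted with PLS and ranked by RMSECV, which is omitted here. The number of spectral variables (1000) is an assumption, not a value from the study.

```python
from itertools import combinations


def split_into_intervals(n_variables, n_intervals):
    """Split variable indices 0..n_variables-1 into contiguous, near-equal intervals."""
    base, extra = divmod(n_variables, n_intervals)
    intervals = []
    start = 0
    for k in range(n_intervals):
        size = base + (1 if k < extra else 0)  # spread the remainder over the first intervals
        intervals.append(range(start, start + size))
        start += size
    return intervals


n_variables = 1000  # hypothetical number of spectral points
intervals = split_into_intervals(n_variables, 20)

# Every way of choosing 3 of the 20 intervals to model jointly.
candidate_combinations = list(combinations(range(len(intervals)), 3))
print(len(candidate_combinations), "candidate interval combinations")

# Column indices that the first candidate model would use.
first_columns = [i for k in candidate_combinations[0] for i in intervals[k]]
print(len(first_columns), "variables in the first candidate")
```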

  6. Application of visible and near-infrared spectroscopy to classification of Miscanthus species

    DOE PAGES

    Jin, Xiaoli; Chen, Xiaoling; Xiao, Liang; ...

    2017-04-03

    Here, the feasibility of visible and near infrared (NIR) spectroscopy as a tool to classify Miscanthus samples was explored. Three types of Miscanthus plants, namely, M. sinensis, M. sacchariflorus and M. floridulus, were analyzed using a NIR spectrophotometer. Several classification models based on the NIR spectra data were developed using linear discriminant analysis (LDA), partial least squares (PLS), least squares support vector machine regression (LSSVR), radial basis function (RBF) and neural network (NN). The principal component analysis (PCA) presented rough classification with overlapping samples, while the models of Line_LSSVR, RBF_LSSVR and RBF_NN presented almost the same calibration and validation results. Due to the higher speed of Line_LSSVR than RBF_LSSVR and RBF_NN, we selected the line_LSSVR model as a representative. In our study, the model based on line_LSSVR showed higher accuracy than LDA and PLS models. The total correct classification rates of 87.79 and 96.51% were observed based on LDA and PLS model in the testing set, respectively, while the line_LSSVR showed 99.42% of total correct classification rate. Meanwhile, the lin_LSSVR model in the testing set showed correct classification rate of 100, 100 and 96.77% for M. sinensis, M. sacchariflorus and M. floridulus, respectively. The lin_LSSVR model assigned 99.42% of samples to the right groups, except one M. floridulus sample. The results demonstrated that NIR spectra combined with a preliminary morphological classification could be an effective and reliable procedure for the classification of Miscanthus species.

  7. Application of visible and near-infrared spectroscopy to classification of Miscanthus species

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jin, Xiaoli; Chen, Xiaoling; Xiao, Liang

    Here, the feasibility of visible and near infrared (NIR) spectroscopy as a tool to classify Miscanthus samples was explored. Three types of Miscanthus plants, namely, M. sinensis, M. sacchariflorus and M. floridulus, were analyzed using a NIR spectrophotometer. Several classification models based on the NIR spectra data were developed using linear discriminant analysis (LDA), partial least squares (PLS), least squares support vector machine regression (LSSVR), radial basis function (RBF) and neural network (NN). The principal component analysis (PCA) presented rough classification with overlapping samples, while the models of Line_LSSVR, RBF_LSSVR and RBF_NN presented almost the same calibration and validation results. Due to the higher speed of Line_LSSVR than RBF_LSSVR and RBF_NN, we selected the line_LSSVR model as a representative. In our study, the model based on line_LSSVR showed higher accuracy than LDA and PLS models. The total correct classification rates of 87.79 and 96.51% were observed based on LDA and PLS model in the testing set, respectively, while the line_LSSVR showed 99.42% of total correct classification rate. Meanwhile, the lin_LSSVR model in the testing set showed correct classification rate of 100, 100 and 96.77% for M. sinensis, M. sacchariflorus and M. floridulus, respectively. The lin_LSSVR model assigned 99.42% of samples to the right groups, except one M. floridulus sample. The results demonstrated that NIR spectra combined with a preliminary morphological classification could be an effective and reliable procedure for the classification of Miscanthus species.

  8. Application of visible and near-infrared spectroscopy to classification of Miscanthus species.

    PubMed

    Jin, Xiaoli; Chen, Xiaoling; Xiao, Liang; Shi, Chunhai; Chen, Liang; Yu, Bin; Yi, Zili; Yoo, Ji Hye; Heo, Kweon; Yu, Chang Yeon; Yamada, Toshihiko; Sacks, Erik J; Peng, Junhua

    2017-01-01

    The feasibility of visible and near infrared (NIR) spectroscopy as a tool to classify Miscanthus samples was explored in this study. Three types of Miscanthus plants, namely, M. sinensis, M. sacchariflorus and M. floridulus, were analyzed using a NIR spectrophotometer. Several classification models based on the NIR spectra data were developed using linear discriminant analysis (LDA), partial least squares (PLS), least squares support vector machine regression (LSSVR), radial basis function (RBF) and neural network (NN). The principal component analysis (PCA) presented rough classification with overlapping samples, while the models of Line_LSSVR, RBF_LSSVR and RBF_NN presented almost the same calibration and validation results. Due to the higher speed of Line_LSSVR than RBF_LSSVR and RBF_NN, we selected the line_LSSVR model as a representative. In our study, the model based on line_LSSVR showed higher accuracy than LDA and PLS models. The total correct classification rates of 87.79 and 96.51% were observed based on LDA and PLS model in the testing set, respectively, while the line_LSSVR showed 99.42% of total correct classification rate. Meanwhile, the lin_LSSVR model in the testing set showed correct classification rate of 100, 100 and 96.77% for M. sinensis, M. sacchariflorus and M. floridulus, respectively. The lin_LSSVR model assigned 99.42% of samples to the right groups, except one M. floridulus sample. The results demonstrated that NIR spectra combined with a preliminary morphological classification could be an effective and reliable procedure for the classification of Miscanthus species.

  9. Application of visible and near-infrared spectroscopy to classification of Miscanthus species

    PubMed Central

    Shi, Chunhai; Chen, Liang; Yu, Bin; Yi, Zili; Yoo, Ji Hye; Heo, Kweon; Yu, Chang Yeon; Yamada, Toshihiko; Sacks, Erik J.; Peng, Junhua

    2017-01-01

    The feasibility of visible and near infrared (NIR) spectroscopy as a tool to classify Miscanthus samples was explored in this study. Three types of Miscanthus plants, namely, M. sinensis, M. sacchariflorus and M. floridulus, were analyzed using a NIR spectrophotometer. Several classification models based on the NIR spectra data were developed using linear discriminant analysis (LDA), partial least squares (PLS), least squares support vector machine regression (LSSVR), radial basis function (RBF) and neural network (NN). The principal component analysis (PCA) presented rough classification with overlapping samples, while the models of Line_LSSVR, RBF_LSSVR and RBF_NN presented almost the same calibration and validation results. Due to the higher speed of Line_LSSVR than RBF_LSSVR and RBF_NN, we selected the line_LSSVR model as a representative. In our study, the model based on line_LSSVR showed higher accuracy than LDA and PLS models. The total correct classification rates of 87.79 and 96.51% were observed based on LDA and PLS model in the testing set, respectively, while the line_LSSVR showed 99.42% of total correct classification rate. Meanwhile, the lin_LSSVR model in the testing set showed correct classification rate of 100, 100 and 96.77% for M. sinensis, M. sacchariflorus and M. floridulus, respectively. The lin_LSSVR model assigned 99.42% of samples to the right groups, except one M. floridulus sample. The results demonstrated that NIR spectra combined with a preliminary morphological classification could be an effective and reliable procedure for the classification of Miscanthus species. PMID:28369059

  10. Chemiluminescence-based multivariate sensing of local equivalence ratios in premixed atmospheric methane-air flames

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tripathi, Markandey M.; Krishnan, Sundar R.; Srinivasan, Kalyan K.

    Chemiluminescence emissions from OH*, CH*, C2, and CO2 formed within the reaction zone of premixed flames depend upon the fuel-air equivalence ratio in the burning mixture. In the present paper, a new partial least square regression (PLS-R) based multivariate sensing methodology is investigated and compared with an OH*/CH* intensity ratio-based calibration model for sensing equivalence ratio in atmospheric methane-air premixed flames. Five replications of spectral data at nine different equivalence ratios ranging from 0.73 to 1.48 were used in the calibration of both models. During model development, the PLS-R model was initially validated with the calibration data set using the leave-one-out cross validation technique. Since the PLS-R model used the entire raw spectral intensities, it did not need the nonlinear background subtraction of CO2 emission that is required for typical OH*/CH* intensity ratio calibrations. An unbiased spectral data set (not used in the PLS-R model development), for 28 different equivalence ratio conditions ranging from 0.71 to 1.67, was used to predict equivalence ratios using the PLS-R and the intensity ratio calibration models. It was found that the equivalence ratios predicted with the PLS-R based multivariate calibration model matched the experimentally measured equivalence ratios within 7%, whereas the OH*/CH* intensity ratio calibration grossly underpredicted equivalence ratios in comparison to measured equivalence ratios, especially under rich conditions (equivalence ratio > 1.2). The practical implications of the chemiluminescence-based multivariate equivalence ratio sensing methodology are also discussed.
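
    Leave-one-out cross validation, as used above during model development, refits the calibration once per sample with that sample held out and then predicts it. The sketch below demonstrates the procedure with a simple one-variable least squares line standing in for the PLS-R model (a deliberate simplification, not the authors' model); the signal and equivalence ratio values are invented.

```python
import math


def fit_line(x, y):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def loocv_rmse(x, y):
    """Root mean square error of leave-one-out predictions."""
    squared_errors = []
    for i in range(len(x)):
        # Hold out sample i and calibrate on the rest.
        x_train = x[:i] + x[i + 1:]
        y_train = y[:i] + y[i + 1:]
        slope, intercept = fit_line(x_train, y_train)
        prediction = slope * x[i] + intercept
        squared_errors.append((y[i] - prediction) ** 2)
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical calibration data: a spectral signal versus equivalence ratio.
signal = [0.9, 1.1, 1.4, 1.6, 2.0, 2.3]
equivalence_ratio = [0.73, 0.85, 1.00, 1.10, 1.30, 1.48]

cv_rmse = loocv_rmse(signal, equivalence_ratio)
print(f"leave-one-out RMSE = {cv_rmse:.4f}")
```

    Because the held-out sample never contributes to the fit that predicts it, the resulting error is a less optimistic estimate than the calibration error, though the abstract's separate 28-condition data set remains the stronger test.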

  11. Identification of pulmonary edema in forensic autopsy cases of fatal anaphylactic shock using Fourier transform infrared microspectroscopy.

    PubMed

    Lin, Hancheng; Luo, Yiwen; Wang, Lei; Deng, Kaifei; Sun, Qiran; Fang, Ruoxi; Wei, Xin; Zha, Shuai; Wang, Zhenyuan; Huang, Ping

    2018-03-01

    Anaphylaxis is a rapid allergic reaction that may cause sudden death. Currently, postmortem diagnosis of anaphylactic shock is sometimes difficult and often achieved through exclusion. The aim of our study was to investigate whether Fourier transform infrared (FTIR) microspectroscopy combined with pattern recognition methods would be complementary to traditional methods and provide a more accurate postmortem diagnosis of fatal anaphylactic shock. First, the results of spectral peak area analysis showed that the pulmonary edema fluid of the fatal anaphylactic shock group was richer in protein components than that of the control group, which included mechanical asphyxia, brain injury, and acute cardiac death. Subsequently, principal component analysis (PCA) was performed and showed that the anaphylactic shock group contained more turn and α-helix protein structures as well as fewer tyrosine-rich proteins than the control group. Ultimately, a partial least-square discriminant analysis (PLS-DA) model combined with a variable selection method called the genetic algorithm (GA) was built and demonstrated good separation between these two groups. This pilot study demonstrates that FTIR microspectroscopy has the potential to be an effective aid for postmortem diagnosis of fatal anaphylactic shock.
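
    For a two-group problem like this one, PLS-DA amounts to coding the groups as 0 and 1, fitting a PLS regression to that dummy response, and thresholding the prediction. The sketch below is a single-component PLS1 version in plain Python, without the genetic algorithm variable selection; the three-variable "spectra" and the 0.5 threshold are assumptions for illustration only.

```python
import math


def fit_plsda_one_component(X, y):
    """Fit a one-component PLS1 model to a 0/1 class vector."""
    n, p = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    Xc = [[row[j] - x_mean[j] for j in range(p)] for row in X]  # centered X
    yc = [v - y_mean for v in y]                                # centered y
    # Weight vector: direction in X of maximal covariance with y.
    w = [sum(Xc[i][j] * yc[i] for i in range(n)) for j in range(p)]
    norm = math.sqrt(sum(v * v for v in w))
    w = [v / norm for v in w]
    # Scores along that direction and regression of y on the scores.
    t = [sum(Xc[i][j] * w[j] for j in range(p)) for i in range(n)]
    q = sum(ti * yi for ti, yi in zip(t, yc)) / sum(ti * ti for ti in t)
    return x_mean, y_mean, w, q


def predict_class(model, x, threshold=0.5):
    """Assign class 1 if the predicted dummy response exceeds the threshold."""
    x_mean, y_mean, w, q = model
    score = sum((x[j] - x_mean[j]) * w[j] for j in range(len(x)))
    return 1 if y_mean + q * score > threshold else 0


# Hypothetical three-variable spectra: class 0 = control, class 1 = case.
X = [[1.0, 2.0, 0.5], [1.2, 1.9, 0.4], [0.9, 2.1, 0.6],
     [3.0, 0.5, 0.5], [3.2, 0.4, 0.6], [2.9, 0.6, 0.4]]
y = [0, 0, 0, 1, 1, 1]

model = fit_plsda_one_component(X, y)
training_predictions = [predict_class(model, row) for row in X]
print(training_predictions)
print(predict_class(model, [1.1, 2.0, 0.5]), predict_class(model, [3.1, 0.5, 0.5]))
```

    Real applications typically use several components chosen by cross validation and evaluate on held-out samples; predicting the training set, as done here, only confirms that the sketch runs.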

  12. Feasibility of laser-induced breakdown spectroscopy (LIBS) for classification of sea salts.

    PubMed

    Tan, Man Minh; Cui, Sheng; Yoo, Jonghyun; Han, Song-Hee; Ham, Kyung-Sik; Nam, Sang-Ho; Lee, Yonghoon

    2012-03-01

    We have investigated the feasibility of laser-induced breakdown spectroscopy (LIBS) as a fast, reliable classification tool for sea salts. For 11 kinds of sea salts, potassium (K), magnesium (Mg), calcium (Ca), and aluminum (Al) concentrations were measured by inductively coupled plasma-atomic emission spectroscopy (ICP-AES), and the LIBS spectra were recorded in the narrow wavelength region between 760 and 800 nm where K (I), Mg (I), Ca (II), Al (I), and cyanide (CN) band emissions are observed. The ICP-AES measurements revealed that the K, Mg, Ca, and Al concentrations varied significantly with the provenance of each salt. The relative intensities of the K (I), Mg (I), Ca (II), and Al (I) peaks observed in the LIBS spectra are consistent with the ICP-AES results. Principal component analysis of the LIBS spectra gave a score plot with a high degree of clustering. This indicates that classification of sea salts by chemometric analysis of LIBS spectra is very promising. Classification models were developed by partial least squares discriminant analysis (PLS-DA) and evaluated. In addition, the Al (I) peaks enabled us to discriminate between different production methods of the salts. © 2012 Society for Applied Spectroscopy

  13. Redescription of the Holotype of Culex (Culex) Peus Speiser and Taxonomy of Culex (Culex) Stigmatosoma Dyar and Thriambus Dyar (Diptera: Culicidae)

    DTIC Science & Technology

    1988-10-12

    Verteneuil, Brussels. 257 pp. + 5 pls. Forattini, O. P. 1965. Entomologia Medica. 2° Volume. Editora da Universidade de Sao Paulo. 506 pp...Publica, Boletines Técnicos, Serie A: Entomologia Medica y Parasitologia No. 1. 66 pp. Matheson, R. 1944. Handbook of the Mosquitoes of North America

  14. Bayesian regression models outperform partial least squares methods for predicting milk components and technological properties using infrared spectral data

    PubMed Central

    Ferragina, A.; de los Campos, G.; Vazquez, A. I.; Cecchinato, A.; Bittante, G.

    2017-01-01

    The aim of this study was to assess the performance of Bayesian models commonly used for genomic selection to predict “difficult-to-predict” dairy traits, such as milk fatty acid (FA) expressed as percentage of total fatty acids, and technological properties, such as fresh cheese yield and protein recovery, using Fourier-transform infrared (FTIR) spectral data. Our main hypothesis was that Bayesian models that can estimate shrinkage and perform variable selection may improve our ability to predict FA traits and technological traits above and beyond what can be achieved using the current calibration models (e.g., partial least squares, PLS). To this end, we assessed a series of Bayesian methods and compared their prediction performance with that of PLS. The comparison between models was done using the same sets of data (i.e., same samples, same variability, same spectral treatment) for each trait. Data consisted of 1,264 individual milk samples collected from Brown Swiss cows for which gas chromatographic FA composition, milk coagulation properties, and cheese-yield traits were available. For each sample, 2 spectra in the infrared region from 5,011 to 925 cm−1 were available and averaged before data analysis. Three Bayesian models: Bayesian ridge regression (Bayes RR), Bayes A, and Bayes B, and 2 reference models: PLS and modified PLS (MPLS) procedures, were used to calibrate equations for each of the traits. The Bayesian models used were implemented in the R package BGLR (http://cran.r-project.org/web/packages/BGLR/index.html), whereas the PLS and MPLS were those implemented in the WinISI II software (Infrasoft International LLC, State College, PA). Prediction accuracy was estimated for each trait and model using 25 replicates of a training-testing validation procedure. Compared with PLS, which is currently the most widely used calibration method, MPLS and the 3 Bayesian methods showed significantly greater prediction accuracy. 
    Accuracy increased in moving from calibration to external validation methods, and in moving from PLS and MPLS to Bayesian methods, particularly Bayes A and Bayes B. The maximum R² value of validation was obtained with Bayes B and Bayes A. For the FA, C10:0 (% of each FA on total FA basis) had the highest R² (0.75, achieved with Bayes A and Bayes B), and among the technological traits, fresh cheese yield had the highest R² (0.82, achieved with Bayes B). These 2 methods have proven to be useful instruments in shrinking and selecting very informative wavelengths and inferring the structure and functions of the analyzed traits. We conclude that Bayesian models are powerful tools for deriving calibration equations, and, importantly, these equations can be easily developed using existing open-source software. As part of our study, we provide scripts based on the open source R software BGLR, which can be used to train customized prediction equations for other traits or populations. PMID:26387015

  15. Multivariate estimation of the limit of detection by orthogonal partial least squares in temperature-modulated MOX sensors.

    PubMed

    Burgués, Javier; Marco, Santiago

    2018-08-17

    Metal oxide semiconductor (MOX) sensors are usually temperature-modulated and calibrated with multivariate models such as partial least squares (PLS) to improve the inherently low selectivity of this technology. The multivariate sensor response patterns exhibit heteroscedastic and correlated noise, which suggests that maximum likelihood methods should outperform PLS. One contribution of this paper is the comparison between PLS and maximum likelihood principal components regression (MLPCR) in MOX sensors. PLS is often criticized for its lack of interpretability when the model complexity increases beyond the chemical rank of the problem. This happens in MOX sensors due to cross-sensitivities to interferences, such as temperature or humidity, and to non-linearity. Additionally, the estimation of fundamental figures of merit, such as the limit of detection (LOD), is still not standardized in multivariate models. Orthogonalization methods, such as orthogonal projection to latent structures (O-PLS), have been successfully applied in other fields to reduce the complexity of PLS models. In this work, we propose a LOD estimation method based on applying the well-accepted univariate LOD formulas to the scores of the first component of an orthogonal PLS model. The resulting LOD is compared to the multivariate LOD range derived from error-propagation. The methodology is applied to data extracted from temperature-modulated MOX sensors (FIS SB-500-12 and Figaro TGS 3870-A04), aiming at the detection of low concentrations of carbon monoxide in the presence of uncontrolled humidity (chemical noise). We found that PLS models were simpler and more accurate than MLPCR models. Average LOD values of 0.79 ppm (FIS) and 1.06 ppm (Figaro) were found using the approach described in this paper. These values were contained within the LOD ranges obtained with the error-propagation approach. 
    The mean LOD increased to 1.13 ppm (FIS) and 1.59 ppm (Figaro) when considering validation samples collected two weeks after calibration, which represents a 43% and 46% degradation, respectively. The orthogonal score-plot was a very convenient tool to visualize MOX sensor data and to validate the LOD estimates. Copyright © 2018 Elsevier B.V. All rights reserved.
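
    As a rough illustration of the idea in this record (applying a univariate LOD formula to the scores of a single predictive component), the sketch below fits a straight line of score versus concentration and divides a multiple of the blank standard deviation by the slope. The abstract does not say which univariate formula was used, so the factor k = 3.3, the function names, and the numbers in the usage lines are assumptions made purely for illustration. No O-PLS model is fitted here; the scores are simply taken as given.

```python
from statistics import mean, stdev


def fit_line(x, y):
    """Ordinary least-squares slope and intercept of y on x."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx


def univariate_lod(concentrations, scores, blank_scores, k=3.3):
    """LOD = k * s_blank / |slope|; k = 3.3 is an assumed convention."""
    slope, _ = fit_line(concentrations, scores)
    return k * stdev(blank_scores) / abs(slope)


# Made-up numbers, not data from the paper: scores rise by 2 units per ppm.
conc = [0.0, 1.0, 2.0, 3.0, 4.0]
scores = [0.0, 2.0, 4.0, 6.0, 8.0]
blanks = [-0.1, 0.0, 0.1]
lod = univariate_lod(conc, scores, blanks)
print(f"LOD = {lod:.3f} ppm")
```

    Working on a single score axis is what makes the univariate formula applicable: the multivariate sensor pattern is reduced to one number per sample before the calibration line is fitted.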

  16. Rapid and non-invasive analysis of deoxynivalenol in durum and common wheat by Fourier-Transform Near Infrared (FT-NIR) spectroscopy.

    PubMed

    De Girolamo, A; Lippolis, V; Nordkvist, E; Visconti, A

    2009-06-01

    Fourier transform near-infrared spectroscopy (FT-NIR) was used for rapid and non-invasive analysis of deoxynivalenol (DON) in durum and common wheat. The relevance of using ground wheat samples with a homogeneous particle size distribution to minimize measurement variations and avoid DON segregation among particles of different sizes was established. Calibration models for durum wheat, common wheat and durum + common wheat samples, with particle size <500 µm, were obtained by using partial least squares (PLS) regression with an external validation technique. Values of root mean square error of prediction (RMSEP, 306-379 µg kg⁻¹) were comparable and not too far from values of root mean square error of cross-validation (RMSECV, 470-555 µg kg⁻¹). Coefficients of determination (r²) indicated an "approximate to good" level of prediction of the DON content by FT-NIR spectroscopy in the PLS calibration models (r² = 0.71-0.83), and a "good" discrimination between low and high DON contents in the PLS validation models (r² = 0.58-0.63). A "limited to good" practical utility of the models was ascertained by range error ratio (RER) values higher than 6. A qualitative model, based on 197 calibration samples, was developed to discriminate between blank and naturally contaminated wheat samples by setting a cut-off at 300 µg kg⁻¹ DON to separate the two classes. The model correctly classified 69% of the 65 validation samples with most misclassified samples (16 of 20) showing DON contamination levels quite close to the cut-off level. These findings suggest that FT-NIR analysis is suitable for the determination of DON in unprocessed wheat at levels far below the maximum permitted limits set by the European Commission.
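
    The classification figures quoted in this record can be checked with a few lines of arithmetic. The sketch below uses only the counts given in the abstract (65 validation samples, 20 misclassified, 16 of those close to the cut-off); the function name and dictionary keys are hypothetical.

```python
def classification_summary(n_validation, n_misclassified, n_near_cutoff):
    """Summarise a two-class validation result from raw counts."""
    n_correct = n_validation - n_misclassified
    return {
        "correct": n_correct,
        "correct_rate_pct": 100.0 * n_correct / n_validation,
        "near_cutoff_share_pct": 100.0 * n_near_cutoff / n_misclassified,
    }


# Counts taken from the abstract above.
summary = classification_summary(n_validation=65, n_misclassified=20, n_near_cutoff=16)
print(summary)
```

    With these counts, 45 of 65 samples are classified correctly, which rounds to the reported 69%, and 80% of the errors fall near the 300 µg/kg cut-off.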

  17. A new technique for spectrophotometric determination of pseudoephedrine and guaifenesin in syrup and synthetic mixture.

    PubMed

    Riahi, Siavash; Hadiloo, Farshad; Milani, Seyed Mohammad R; Davarkhah, Nazila; Ganjali, Mohammad R; Norouzi, Parviz; Seyfi, Payam

    2011-05-01

    The predictive accuracy of different chemometric methods was compared when applied to ordinary UV spectra and first-order derivative spectra. Principal component regression (PCR) and partial least squares with one dependent variable (PLS1) and two dependent variables (PLS2) were applied to spectral data of a pharmaceutical formulation containing pseudoephedrine (PDP) and guaifenesin (GFN). The ability to derivative in resolved overlapping spectra chloropheniramine maleate was evaluated when multivariate methods are adopted for analysis of two component mixtures without using any chemical pretreatment. The chemometric models were tested on an external validation dataset and finally applied to the analysis of pharmaceuticals. Significant advantages were found in the analysis of the real samples when the calibration models from derivative spectra were used. The proposed method is simple and rapid, requires no preliminary separation steps, and can be used easily for the analysis of these compounds, especially in quality control laboratories. Copyright © 2011 John Wiley & Sons, Ltd.
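
    To make the notion of a first-order derivative spectrum concrete, the sketch below differentiates an absorbance curve with central finite differences. The abstract does not say how the derivative spectra were computed (smoothing filters are common in practice), so the central-difference scheme, the function name, and the synthetic baseline are assumptions for illustration only.

```python
def first_derivative(wavelengths, absorbances):
    """Central-difference dA/d(lambda) at the interior points of a spectrum."""
    if len(wavelengths) != len(absorbances) or len(wavelengths) < 3:
        raise ValueError("need two equal-length sequences with at least 3 points")
    return [
        (absorbances[i + 1] - absorbances[i - 1])
        / (wavelengths[i + 1] - wavelengths[i - 1])
        for i in range(1, len(wavelengths) - 1)
    ]


# Synthetic sloping baseline (not measured data): A = 0.01 * lambda + 0.5
wavelengths = [200.0 + 2.0 * i for i in range(6)]
absorbances = [0.01 * w + 0.5 for w in wavelengths]
derivative = first_derivative(wavelengths, absorbances)
print(derivative)
```

    The constant offset of 0.5 disappears in the derivative and only the slope remains, which is one reason derivative spectra are often preferred when baselines shift between samples.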

  18. Analysis of Flavonoid in Medicinal Plant Extract Using Infrared Spectroscopy and Chemometrics

    PubMed Central

    Retnaningtyas, Yuni; Nuri; Lukman, Hilmia

    2016-01-01

    Infrared (IR) spectroscopy combined with chemometrics has been developed for simple analysis of flavonoid in the medicinal plant extract. Flavonoid was extracted from medicinal plant leaves by ultrasonication and maceration. IR spectra of selected medicinal plant extract were correlated with flavonoid content using chemometrics. The chemometric method used for calibration analysis was Partial Least Squares (PLS) and the methods used for classification analysis were Linear Discriminant Analysis (LDA), Soft Independent Modelling of Class Analogies (SIMCA), and Support Vector Machines (SVM). In this study, the NIR model showed the best calibration, with R² and RMSEC values of 0.9916499 and 2.1521897, respectively, while the accuracy of all classification models (LDA, SIMCA, and SVM) was 100%. The R² and RMSEC of the FTIR calibration model were 0.8653689 and 8.8958149, respectively, while the accuracy of LDA, SIMCA, and SVM was 86.0%, 91.2%, and 77.3%, respectively. PLS and LDA of NIR models were further used to predict unknown flavonoid content in commercial samples. Using these models, the significance of flavonoid content that has been measured by NIR and UV-Vis spectrophotometry was evaluated with paired samples t-test. The flavonoid content that has been measured with both methods gave no significant difference. PMID:27529051

  19. Laser-Induced Breakdown Spectroscopy (LIBS) Measurement of Uranium in Molten Salt.

    PubMed

    Williams, Ammon; Phongikaroon, Supathorn

    2018-01-01

    In this study, the molten salt aerosol-laser-induced breakdown spectroscopy (LIBS) system was used to measure the uranium (U) content in a ternary UCl3-LiCl-KCl salt to investigate and assess a near real-time analytical approach for material safeguards and accountability. Experiments were conducted using five different U concentrations to determine the analytical figures of merit for the system with respect to U. In the analysis, three U lines were used to develop univariate calibration curves at the 367.01 nm, 385.96 nm, and 387.10 nm lines. The 367.01 nm line had the lowest limit of detection (LOD) of 0.065 wt% U. The 385.96 nm line had the best root mean square error of cross-validation (RMSECV) of 0.20 wt% U. In addition to the univariate calibration approach, a multivariate partial least squares (PLS) model was developed to further analyze the data. Using PLS modeling, an RMSECV of 0.085 wt% U was determined. The RMSECV from the multivariate approach was significantly better than the univariate case and the PLS model is recommended for future LIBS analysis. Overall, the aerosol-LIBS system performed well in monitoring the U concentration and it is expected that the system could be used to quantitatively determine the U compositions within the normal operational concentrations of U in pyroprocessing molten salts.

  20. Partial least squares density modeling (PLS-DM) - a new class-modeling strategy applied to the authentication of olives in brine by near-infrared spectroscopy.

    PubMed

    Oliveri, Paolo; López, M Isabel; Casolino, M Chiara; Ruisánchez, Itziar; Callao, M Pilar; Medini, Luca; Lanteri, Silvia

    2014-12-03

    A new class-modeling method, referred to as partial least squares density modeling (PLS-DM), is presented. The method is based on partial least squares (PLS), using a distance-based sample density measurement as the response variable. Potential function probability density is subsequently calculated on PLS scores and used, jointly with residual Q statistics, to develop efficient class models. The influence of adjustable model parameters on the resulting performances has been critically studied by means of cross-validation and application of the Pareto optimality criterion. The method has been applied to verify the authenticity of olives in brine from cultivar Taggiasca, based on near-infrared (NIR) spectra recorded on homogenized solid samples. Two independent test sets were used for model validation. The final optimal model was characterized by high efficiency and a well-balanced trade-off between sensitivity and specificity values, when compared with those obtained by application of well-established class-modeling methods, such as soft independent modeling of class analogy (SIMCA) and unequal dispersed classes (UNEQ). Copyright © 2014 Elsevier B.V. All rights reserved.

  1. Retention modelling of polychlorinated biphenyls in comprehensive two-dimensional gas chromatography.

    PubMed

    D'Archivio, Angelo Antonio; Incani, Angela; Ruggieri, Fabrizio

    2011-01-01

    In this paper, we use a quantitative structure-retention relationship (QSRR) method to predict the retention times of polychlorinated biphenyls (PCBs) in comprehensive two-dimensional gas chromatography (GC×GC). We analyse the GC×GC retention data taken from the literature by comparing predictive capability of different regression methods. The various models are generated using 70 out of 209 PCB congeners in the calibration stage, while their predictive performance is evaluated on the remaining 139 compounds. The two-dimensional chromatogram is initially estimated by separately modelling retention times of PCBs in the first and in the second column ((1) t (R) and (2) t (R), respectively). In particular, multilinear regression (MLR) combined with genetic algorithm (GA) variable selection is performed to extract two small subsets of predictors for (1) t (R) and (2) t (R) from a large set of theoretical molecular descriptors provided by the popular software Dragon, which after removal of highly correlated or almost constant variables consists of 237 structure-related quantities. Based on GA-MLR analysis, a four-dimensional and a five-dimensional relationship modelling (1) t (R) and (2) t (R), respectively, are identified. Single-response partial least square (PLS-1) regression is alternatively applied to independently model (1) t (R) and (2) t (R) without the need for preliminary GA variable selection. Further, we explore the possibility of predicting the two-dimensional chromatogram of PCBs in a single calibration procedure by using a two-response PLS (PLS-2) model or a feed-forward artificial neural network (ANN) with two output neurons. In the first case, regression is carried out on the full set of 237 descriptors, while the variables previously selected by GA-MLR are initially considered as ANN inputs and subjected to a sensitivity analysis to remove the redundant ones. 
    Results show PLS-1 regression exhibits a noticeably better descriptive and predictive performance than the other investigated approaches. The observed values of determination coefficients for (1) t (R) and (2) t (R) in calibration (0.9999 and 0.9993, respectively) and prediction (0.9987 and 0.9793, respectively) provided by PLS-1 demonstrate that GC×GC behaviour of PCBs is properly modelled. In particular, the predicted two-dimensional GC×GC chromatogram of 139 PCBs not involved in the calibration stage closely resembles the experimental one. Based on the above lines of evidence, the proposed approach ensures accurate simulation of the whole GC×GC chromatogram of PCBs using experimentally determined retention data for only about one-third of the congeners, chosen as representative.

  2. Energy dispersive X-ray fluorescence and scattering assessment of soil quality via partial least squares and artificial neural networks analytical modeling approaches.

    PubMed

    Kaniu, M I; Angeyo, K H; Mwala, A K; Mwangi, F K

    2012-08-30

    Soil quality assessment (SQA) calls for rapid, simple and affordable but accurate analysis of soil quality indicators (SQIs). Routine methods of soil analysis are tedious and expensive. Energy dispersive X-ray fluorescence and scattering (EDXRFS) spectrometry in conjunction with chemometrics is a potentially powerful method for rapid SQA. In this study, a 25 mCi ¹⁰⁹Cd isotope source XRF spectrometer was used to realize EDXRFS spectrometry of soils. Glycerol (a simulant of "organic" soil solution) and kaolin (a model clay soil) doped with soil micro (Fe, Cu, Zn) and macro (NO₃⁻, SO₄²⁻, H₂PO₄⁻) nutrients were used to train multivariate chemometric calibration models for direct (non-invasive) analysis of SQIs based on partial least squares (PLS) and artificial neural networks (ANN). The techniques were compared for each SQI with respect to speed, robustness, correction ability for matrix effects, and resolution of spectral overlap. The method was then applied to perform direct rapid analysis of SQIs in field soils. A one-way ANOVA test showed no statistical difference at 95% confidence interval between PLS and ANN results compared to reference soil nutrients. PLS was more accurate in analyzing C, N, Na, P and Zn (R² > 0.9 and low SEP of 0.05%, 0.01%, 0.01%, and 1.98 μg g⁻¹, respectively), while ANN was better suited for analysis of Mg, Cu and Fe (R² > 0.9 and SEP of 0.08%, 4.02 μg g⁻¹, and 0.88 μg g⁻¹, respectively). Copyright © 2012 Elsevier B.V. All rights reserved.

  3. Metabolic profiling and predicting the free radical scavenging activity of guava (Psidium guajava L.) leaves according to harvest time by 1H-nuclear magnetic resonance spectroscopy.

    PubMed

    Kim, So-Hyun; Cho, Somi K; Hyun, Sun-Hee; Park, Hae-Eun; Kim, Young-Suk; Choi, Hyung-Kyoon

    2011-01-01

    Guava leaves were classified and the free radical scavenging activity (FRSA) evaluated according to different harvest times by using the 1H-NMR-based metabolomic technique. A principal component analysis (PCA) of 1H-NMR data from the guava leaves provided clear clusters according to the harvesting time. A partial least squares (PLS) analysis indicated a correlation between the metabolic profile and FRSA. FRSA levels of the guava leaves harvested during May and August were high, and those leaves contained higher amounts of 3-hydroxybutyric acid, acetic acid, glutamic acid, asparagine, citric acid, malonic acid, trans-aconitic acid, ascorbic acid, maleic acid, cis-aconitic acid, epicatechin, protocatechuic acid, and xanthine than the leaves harvested during October and December. Epicatechin and protocatechuic acid among those compounds seem to have enhanced FRSA of the guava leaf samples harvested in May and August. A PLS regression model was established to predict guava leaf FRSA at different harvesting times by using a 1H-NMR data set. The predictability of the PLS model was then tested by internal and external validation. The results of this study indicate that 1H-NMR-based metabolomic data could usefully characterize guava leaves according to their time of harvesting.

  4. The influence of different diets on metabolism and atherosclerosis processes-A porcine model: Blood serum, urine and tissues 1H NMR metabolomics targeted analysis.

    PubMed

    Zabek, Adam; Paslawski, Robert; Paslawska, Urszula; Wojtowicz, Wojciech; Drozdz, Katarzyna; Polakof, Sergio; Podhorska, Marzena; Dziegiel, Piotr; Mlynarz, Piotr; Szuba, Andrzej

    2017-01-01

    The global epidemic of cardiovascular diseases leads to increased morbidity and mortality caused mainly by myocardial infarction and stroke. Atherosclerosis is the major pathological process behind this epidemic. We designed a novel model of atherosclerosis in swine. Briefly, the first group (11 pigs) received normal pig feed (balanced diet group-BDG) for 12 months, the second group (9 pigs) was fed a Western high-calorie diet (unbalanced diet group-UDG) for 12 months, the third group (8 pigs) received a Western type high-calorie diet for 9 months later replaced by a normal diet for 3 months (regression group-RG). Clinical measurements included zoometric data, arterial blood pressure, heart rate and ultrasonographic evaluation of femoral arteries. Then, the animals were sacrificed and the blood serum, urine and skeletal muscle tissue were collected and 1H NMR based metabolomics studies with the application of fingerprinting PLS-DA and univariate analysis were done. Our results have shown that the molecular disturbances might overlap with other diseases such as onset of diabetes, sleep apnea and other obesity accompanied diseases. Moreover, we revealed that once initiated, molecular changes did not return to homeostatic equilibrium, at least for the duration of this experiment.

  5. Evaluation of the diagnostic potential of ex vivo Raman spectroscopy in gastric cancers: fingerprint versus high wavenumber

    NASA Astrophysics Data System (ADS)

    Zhou, Xueqian; Dai, Jianhua; Chen, Yao; Duan, Guangjie; Liu, Yulong; Zhang, Hua; Wu, Hongbo; Peng, Guiyong

    2016-10-01

    The aim of this study was to apply Raman spectroscopy in the high wavenumber (HW) region (2800 to 3000 cm-1) for ex vivo detection of gastric cancer and compare its diagnostic potential with that of the fingerprint (FP) region (800 to 1800 cm-1). Raman spectra were collected in the FP and HW regions to differentiate between normal mucosa (n=38) and gastric cancer (n=37). The distinctive Raman spectral differences between normal and cancer tissues are observed at 853, 879, 1157, 1319, 1338, 1448, and 2932 cm-1 and are primarily related to proteins, lipids, nucleic acids, collagen, and carotenoids in the tissue. In FP and HW Raman spectroscopy for diagnosis of gastric cancer, multivariate diagnostic algorithms based on partial-least-squares discriminant analysis, together with leave-one-sample-out cross validation, yielded diagnostic sensitivities of 94.59% and 81.08%, and specificities of 86.84% and 71.05%, respectively. Receiver operating characteristic analysis further confirmed that the FP region model performance is superior to that of the HW region model. Better differentiation between normal and gastric cancer tissues can be achieved using FP Raman spectroscopy and PLS-DA techniques, but the complementary natures of the FP and HW regions make both of them useful in diagnosis of gastric cancer.
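
    The sensitivities and specificities reported in this record follow directly from confusion-matrix counts. The abstract gives only the group sizes (37 cancer, 38 normal) and the percentages; the individual counts in the sketch below are inferred from those figures rather than stated in the source, and the function name is hypothetical.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) in percent from confusion-matrix counts."""
    sensitivity = 100.0 * tp / (tp + fn)
    specificity = 100.0 * tn / (tn + fp)
    return sensitivity, specificity


# Counts inferred from the reported percentages and group sizes (assumption).
fp_sens, fp_spec = sensitivity_specificity(tp=35, fn=2, tn=33, fp=5)    # fingerprint region
hw_sens, hw_spec = sensitivity_specificity(tp=30, fn=7, tn=27, fp=11)   # high-wavenumber region

print(f"FP: sensitivity {fp_sens:.2f}%, specificity {fp_spec:.2f}%")
print(f"HW: sensitivity {hw_sens:.2f}%, specificity {hw_spec:.2f}%")
```

    Under leave-one-sample-out cross validation each of the 75 spectra is predicted once by a model trained on the other 74, so counts of this kind are tallied over held-out predictions only.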

  6. Computer aided diagnosis system for the Alzheimer's disease based on partial least squares and random forest SPECT image classification.

    PubMed

    Ramírez, J; Górriz, J M; Segovia, F; Chaves, R; Salas-Gonzalez, D; López, M; Alvarez, I; Padilla, P

    2010-03-19

    This letter shows a computer aided diagnosis (CAD) technique for the early detection of Alzheimer's disease (AD) by means of single photon emission computed tomography (SPECT) image classification. The proposed method is based on a partial least squares (PLS) regression model and a random forest (RF) predictor. The challenge of the curse of dimensionality is addressed by reducing the large dimensionality of the input data by downscaling the SPECT images and extracting score features using PLS. A RF predictor then forms an ensemble of classification and regression tree (CART)-like classifiers, with its output determined by a majority vote of the trees in the forest. A baseline principal component analysis (PCA) system is also developed for reference. The experimental results show that the combined PLS-RF system yields a generalization error that converges to a limit when increasing the number of trees in the forest. Thus, the generalization error is reduced when using PLS and depends on the strength of the individual trees in the forest and the correlation between them. Moreover, PLS feature extraction is found to be more effective than PCA for extracting discriminative information from the data, yielding peak sensitivity, specificity and accuracy values of 100%, 92.7%, and 96.9%, respectively. In addition, the proposed CAD system outperformed several other recently developed AD CAD systems. Copyright 2010 Elsevier Ireland Ltd. All rights reserved.

  7. The Application of Ultra-High-Performance Liquid Chromatography Coupled with a LTQ-Orbitrap Mass Technique to Reveal the Dynamic Accumulation of Secondary Metabolites in Licorice under ABA Stress.

    PubMed

    Li, Da; Xu, Guojie; Ren, Guangxi; Sun, Yufeng; Huang, Ying; Liu, Chunsheng

    2017-10-20

    The traditional medicine licorice is the most widely consumed herbal product in the world. Although much research work on studying the changes in the active compounds of licorice has been reported, there are still many areas, such as the dynamic accumulation of secondary metabolites in licorice, that need to be further studied. In this study, the secondary metabolites from licorice under two different methods of stress were investigated by ultra-high-performance liquid chromatography coupled with hybrid linear ion trap-Orbitrap mass spectrometry (UHPLC-LTQ-Orbitrap-MS). A complex continuous coordination of flavonoids and triterpenoids in a network was modulated by different methods of stress during growth. The results showed that a total of 51 secondary metabolites were identified in licorice under ABA stress. The partial least squares-discriminant analysis (PLS-DA) revealed the distinction of obvious compounds among stress-specific districts relative to ABA stress. The targeted results showed that there were significant differences in the accumulation patterns of the deeply targeted 41 flavonoids and 10 triterpenoids compounds by PCA and PLS-DA analyses. To survey the effects of flavonoid and triterpenoid metabolism under ABA stress, we inspected the stress-specific metabolic changes. Our study testified that the majority of flavonoids and triterpenoids were elevated in licorice under ABA stress, while the signature metabolite affecting the dynamic accumulation of secondary metabolites was detected. Taken together, our results suggest that ABA-specific metabolite profiling dynamically changed in terms of the biosynthesis of flavonoids and triterpenoids, which may offer new trains of thought on the regular pattern of dynamic accumulation of secondary metabolites in licorice at the metabolite level. Our results also provide a reference for clinical applications and directional planting and licorice breeding.

  8. GC–MS-Based Metabonomic Profiling Displayed Differing Effects of Borna Disease Virus Natural Strain Hu-H1 and Laboratory Strain V Infection in Rat Cortical Neurons

    PubMed Central

    Liu, Siwen; Bode, Liv; Zhang, Lujun; He, Peng; Huang, Rongzhong; Sun, Lin; Chen, Shigang; Zhang, Hong; Guo, Yujie; Zhou, Jingjing; Fu, Yuying; Zhu, Dan; Xie, Peng

    2015-01-01

    Borna disease virus (BDV) persists in the central nervous systems of a wide variety of vertebrates and causes behavioral disorders. Previous studies have revealed that metabolic perturbations are associated with BDV infection. However, the pathophysiological effects of different viral strains remain largely unknown. Rat cortical neurons infected with human strain BDV Hu-H1, laboratory BDV Strain V, and non-infected control (CON) cells were cultured in vitro. At day 12 post-infection, a gas chromatography coupled with mass spectrometry (GC–MS) metabonomic approach was used to differentiate the metabonomic profiles of 35 independent intracellular samples from Hu-H1-infected cells (n = 12), Strain V-infected cells (n = 12), and CON cells (n = 11). Partial least squares discriminant analysis (PLS-DA) was performed to demonstrate discrimination between the three groups. Further statistical testing determined which individual metabolites displayed significant differences between groups. PLS-DA demonstrated that the whole metabolic pattern enabled statistical discrimination between groups. We identified 31 differential metabolites in the Hu-H1 and CON groups (21 decreased and 10 increased in Hu-H1 relative to CON), 35 differential metabolites in the Strain V and CON groups (30 decreased and 5 increased in Strain V relative to CON), and 21 differential metabolites in the Hu-H1 and Strain V groups (8 decreased and 13 increased in Hu-H1 relative to Strain V). Comparative metabonomic profiling revealed divergent perturbations in key energy and amino acid metabolites between natural strain Hu-H1 and laboratory Strain V of BDV. The two BDV strains differentially alter metabolic pathways of rat cortical neurons in vitro. Their systematic classification provides a valuable template for improved BDV strain definition in future studies. PMID:26287181

  9. Random sample consensus combined with partial least squares regression (RANSAC-PLS) for microbial metabolomics data mining and phenotype improvement.

    PubMed

    Teoh, Shao Thing; Kitamura, Miki; Nakayama, Yasumune; Putri, Sastia; Mukai, Yukio; Fukusaki, Eiichiro

    2016-08-01

    In recent years, the advent of high-throughput omics technology has made possible a new class of strain engineering approaches, based on identification of possible gene targets for phenotype improvement from omic-level comparison of different strains or growth conditions. Metabolomics, with its focus on the omic level closest to the phenotype, lends itself naturally to this semi-rational methodology. When a quantitative phenotype such as growth rate under stress is considered, regression modeling using multivariate techniques such as partial least squares (PLS) is often used to identify metabolites correlated with the target phenotype. However, linear modeling techniques such as PLS require a consistent metabolite-phenotype trend across the samples, which may not be the case when outliers or multiple conflicting trends are present in the data. To address this, we proposed a data-mining strategy that utilizes random sample consensus (RANSAC) to select subsets of samples with consistent trends for construction of better regression models. By applying a combination of RANSAC and PLS (RANSAC-PLS) to a dataset from a previous study (gas chromatography/mass spectrometry metabolomics data and 1-butanol tolerance of 19 yeast mutant strains), new metabolites were indicated to be correlated with tolerance within certain subsets of the samples. The relevance of these metabolites to 1-butanol tolerance was then validated using single-deletion strains of the corresponding metabolic genes. The results showed that RANSAC-PLS is a promising strategy to identify unique metabolites that provide additional hints for phenotype improvement, which could not be detected by traditional PLS modeling using the entire dataset. Copyright © 2016 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
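
    The sample-selection idea behind RANSAC can be sketched in a few lines. To stay self-contained, the example below swaps the PLS regression for a one-variable least-squares line, so it illustrates only the consensus step and is not the authors' RANSAC-PLS procedure. The function names, the tolerance, the iteration count, and the toy data are all assumptions.

```python
import random


def fit_line(points):
    """Least-squares line through (x, y) points; None if x has no spread."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    sxx = sum((x - mx) ** 2 for x, _ in points)
    if sxx == 0:
        return None
    slope = sum((x - mx) * (y - my) for x, y in points) / sxx
    return slope, my - slope * mx


def ransac_line(points, n_iter=200, sample_size=2, tol=0.5, seed=0):
    """Keep the largest set of points consistent with one randomly seeded trend."""
    rng = random.Random(seed)
    best_inliers = []
    for _ in range(n_iter):
        model = fit_line(rng.sample(points, sample_size))
        if model is None:
            continue
        slope, intercept = model
        inliers = [(x, y) for x, y in points
                   if abs(y - (slope * x + intercept)) <= tol]
        if len(inliers) > len(best_inliers):
            best_inliers = inliers
    if not best_inliers:
        raise ValueError("no usable random sample was drawn")
    # Refit on the consensus subset only.
    return fit_line(best_inliers), best_inliers


# Toy data: ten samples on y = 2x + 1 plus three samples following no trend.
data = [(float(x), 2.0 * x + 1.0) for x in range(10)]
data += [(2.0, 30.0), (5.0, -20.0), (7.0, 40.0)]
(slope, intercept), consensus = ransac_line(data)
print(slope, intercept, len(consensus))
```

    The final model is refitted on the consensus subset, which mirrors the abstract's point that trends present only within certain subsets of samples can be missed by a single regression on the entire dataset.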

  10. SEM-PLS Analysis of Inhibiting Factors of Cost Performance for Large Construction Projects in Malaysia: Perspective of Clients and Consultants

    PubMed Central

    Memon, Aftab Hameed; Rahman, Ismail Abdul

    2014-01-01

    This study uncovered inhibiting factors to cost performance in large construction projects of Malaysia. A questionnaire survey was conducted among clients and consultants involved in large construction projects. In the questionnaire, a total of 35 inhibiting factors grouped in 7 categories were presented to the respondents for rating the significance level of each factor. A total of 300 questionnaire forms were distributed. Only 144 completed sets were received and analysed using advanced multivariate statistical software of Structural Equation Modelling (SmartPLS v2). The analysis involved three iteration processes where several of the factors were deleted in order to make the model acceptable. The result of the analysis found that the R² value of the model is 0.422, which indicates that the developed model has a substantial impact on cost performance. Based on the final form of the model, contractor's site management category is the most prominent in exhibiting effect on cost performance of large construction projects. This finding is validated using advanced techniques of power analysis. This vigorous multivariate analysis has explicitly found the significant category which consists of several causative factors to poor cost performance in large construction projects. This will benefit all parties involved in construction projects for controlling cost overrun. PMID:24693227

  11. SEM-PLS analysis of inhibiting factors of cost performance for large construction projects in Malaysia: perspective of clients and consultants.

    PubMed

    Memon, Aftab Hameed; Rahman, Ismail Abdul

    2014-01-01

    This study uncovered inhibiting factors to cost performance in large construction projects of Malaysia. A questionnaire survey was conducted among clients and consultants involved in large construction projects. In the questionnaire, a total of 35 inhibiting factors grouped in 7 categories were presented to the respondents for rating the significance level of each factor. A total of 300 questionnaire forms were distributed. Only 144 completed sets were received and analysed using advanced multivariate statistical software of Structural Equation Modelling (SmartPLS v2). The analysis involved three iteration processes where several of the factors were deleted in order to make the model acceptable. The result of the analysis found that the R(2) value of the model is 0.422, which indicates that the developed model has a substantial impact on cost performance. Based on the final form of the model, contractor's site management category is the most prominent in exhibiting effect on cost performance of large construction projects. This finding is validated using advanced techniques of power analysis. This vigorous multivariate analysis has explicitly found the significant category which consists of several causative factors to poor cost performance in large construction projects. This will benefit all parties involved in construction projects for controlling cost overrun.

  12. Hybrid robust model based on an improved functional link neural network integrating with partial least square (IFLNN-PLS) and its application to predicting key process variables.

    PubMed

    He, Yan-Lin; Xu, Yuan; Geng, Zhi-Qiang; Zhu, Qun-Xiong

    2016-03-01

    In this paper, a hybrid robust model based on an improved functional link neural network integrating with partial least square (IFLNN-PLS) is proposed. Firstly, an improved functional link neural network with small norm of expanded weights and high input-output correlation (SNEWHIOC-FLNN) was proposed for enhancing the generalization performance of FLNN. Unlike the traditional FLNN, the expanded variables of the original inputs are not directly used as the inputs in the proposed SNEWHIOC-FLNN model. The original inputs are attached to some small norm of expanded weights. As a result, the correlation coefficient between some of the expanded variables and the outputs is enhanced. The larger the correlation coefficient is, the more relevant the expanded variables tend to be. In the end, the expanded variables with larger correlation coefficients are selected as the inputs to improve the performance of the traditional FLNN. In order to test the proposed SNEWHIOC-FLNN model, three UCI (University of California, Irvine) regression datasets named Housing, Concrete Compressive Strength (CCS), and Yacht Hydro Dynamics (YHD) are selected. Then a hybrid model based on the improved FLNN integrating with partial least square (IFLNN-PLS) was built. In the IFLNN-PLS model, the connection weights are calculated using the partial least square method rather than the error back-propagation algorithm. Lastly, IFLNN-PLS was developed as an intelligent measurement model for accurately predicting the key variables in the Purified Terephthalic Acid (PTA) process and the High Density Polyethylene (HDPE) process. Simulation results illustrated that the IFLNN-PLS could significantly improve the prediction performance. Copyright © 2015 ISA. Published by Elsevier Ltd. All rights reserved.

  13. Assessing Wheat Traits by Spectral Reflectance: Do We Really Need to Focus on Predicted Trait-Values or Directly Identify the Elite Genotypes Group?

    PubMed Central

    Garriga, Miguel; Romero-Bravo, Sebastián; Estrada, Félix; Escobar, Alejandro; Matus, Iván A.; del Pozo, Alejandro; Astudillo, Cesar A.; Lobos, Gustavo A.

    2017-01-01

    Phenotyping, via remote and proximal sensing techniques, of the agronomic and physiological traits associated with yield potential and drought adaptation could contribute to improvements in breeding programs. In the present study, 384 genotypes of wheat (Triticum aestivum L.) were tested under fully irrigated (FI) and water stress (WS) conditions. The following traits were evaluated and assessed via spectral reflectance: Grain yield (GY), spikes per square meter (SM2), kernels per spike (KPS), thousand-kernel weight (TKW), chlorophyll content (SPAD), stem water soluble carbohydrate concentration and content (WSC and WSCC, respectively), carbon isotope discrimination (Δ13C), and leaf area index (LAI). The performances of spectral reflectance indices (SRIs), four regression algorithms (PCR, PLSR, ridge regression RR, and SVR), and three classification methods (PCA-LDA, PLS-DA, and kNN) were evaluated for the prediction of each trait. For the classification approaches, two classes were established for each trait: The lower 80% of the trait variability range (Class 1) and the remaining 20% (Class 2 or elite genotypes). Both the SRIs and regression methods performed better when data from FI and WS were combined. The traits that were best estimated by SRIs and regression methods were GY and Δ13C. For most traits and conditions, the estimations provided by RR and SVR were the same, or better than, those provided by the SRIs. PLS-DA showed the best performance among the categorical methods and, unlike the SRI and regression models, most traits were relatively well-classified within a specific hydric condition (FI or WS), proving that classification approach is an effective tool to be explored in future studies related to genotype selection. PMID:28337210
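
    The two-class labelling described above can be sketched in a few lines. This assumes the split is taken on the range of the trait (minimum to maximum) rather than on quantiles, and that a value exactly at the cutoff falls in Class 1; both are assumptions, and the function name and `elite_fraction` parameter are hypothetical.

```python
def elite_classes(values, elite_fraction=0.20):
    """Label each trait value 1 (lower part of the range) or 2 (elite)."""
    lo, hi = min(values), max(values)
    # Cutoff sits at 80% of the variability range when elite_fraction is 0.20.
    cutoff = lo + (1.0 - elite_fraction) * (hi - lo)
    return [2 if v > cutoff else 1 for v in values]

# Hypothetical grain-yield values; the cutoff here is 8.0.
labels = elite_classes([0.0, 5.0, 7.0, 9.0, 10.0])
```

    Labels of this kind are what a classifier such as PLS-DA would then be trained to predict from the spectra.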

  14. Assessing Wheat Traits by Spectral Reflectance: Do We Really Need to Focus on Predicted Trait-Values or Directly Identify the Elite Genotypes Group?

    PubMed

    Garriga, Miguel; Romero-Bravo, Sebastián; Estrada, Félix; Escobar, Alejandro; Matus, Iván A; Del Pozo, Alejandro; Astudillo, Cesar A; Lobos, Gustavo A

    2017-01-01

    Phenotyping, via remote and proximal sensing techniques, of the agronomic and physiological traits associated with yield potential and drought adaptation could contribute to improvements in breeding programs. In the present study, 384 genotypes of wheat (Triticum aestivum L.) were tested under fully irrigated (FI) and water stress (WS) conditions. The following traits were evaluated and assessed via spectral reflectance: Grain yield (GY), spikes per square meter (SM2), kernels per spike (KPS), thousand-kernel weight (TKW), chlorophyll content (SPAD), stem water soluble carbohydrate concentration and content (WSC and WSCC, respectively), carbon isotope discrimination (Δ13C), and leaf area index (LAI). The performances of spectral reflectance indices (SRIs), four regression algorithms (PCR, PLSR, ridge regression RR, and SVR), and three classification methods (PCA-LDA, PLS-DA, and kNN) were evaluated for the prediction of each trait. For the classification approaches, two classes were established for each trait: The lower 80% of the trait variability range (Class 1) and the remaining 20% (Class 2 or elite genotypes). Both the SRIs and regression methods performed better when data from FI and WS were combined. The traits that were best estimated by SRIs and regression methods were GY and Δ13C. For most traits and conditions, the estimations provided by RR and SVR were the same, or better than, those provided by the SRIs. PLS-DA showed the best performance among the categorical methods and, unlike the SRI and regression models, most traits were relatively well-classified within a specific hydric condition (FI or WS), proving that classification approach is an effective tool to be explored in future studies related to genotype selection.

  15. Application of reflectance spectroscopies (FTIR-ATR & FT-NIR) coupled with multivariate methods for robust in vivo detection of begomovirus infection in papaya leaves

    NASA Astrophysics Data System (ADS)

    Haq, Quazi M. I.; Mabood, Fazal; Naureen, Zakira; Al-Harrasi, Ahmed; Gilani, Sayed A.; Hussain, Javid; Jabeen, Farah; Khan, Ajmal; Al-Sabari, Ruqaya S. M.; Al-khanbashi, Fatema H. S.; Al-Fahdi, Amira A. M.; Al-Zaabi, Ahoud K. A.; Al-Shuraiqi, Fatma A. M.; Al-Bahaisi, Iman M.

    2018-06-01

    Nucleic acid and serology based methods have revolutionized plant disease detection; however, they are not very reliable at the asymptomatic stage, especially for pathogens with systemic infection. In addition, they need at least 1-2 days for sample harvesting, processing, and analysis. In this study, two reflectance spectroscopies, i.e. near-infrared reflectance spectroscopy (NIR) and Fourier-transform infrared spectroscopy with attenuated total reflection (FT-IR, ATR), coupled with multivariate exploratory methods such as Principal Component Analysis (PCA) and partial least squares discriminant analysis (PLS-DA), were deployed to detect begomovirus infection in papaya leaves. The application of these techniques demonstrates that they are very useful for robust in vivo detection of plant begomovirus infection. These methods are simple, sensitive, reproducible, precise, and do not require any lengthy sample preparation procedures.

  16. Development and Validation of a Near-Infrared Spectroscopy Method for the Prediction of Acrylamide Content in French-Fried Potato.

    PubMed

    Adedipe, Oluwatosin E; Johanningsmeier, Suzanne D; Truong, Van-Den; Yencho, G Craig

    2016-03-02

    This study investigated the ability of near-infrared spectroscopy (NIRS) to predict acrylamide content in French-fried potato. Potato flour spiked with acrylamide (50-8000 μg/kg) was used to determine if acrylamide could be accurately predicted in a potato matrix. French fries produced with various pretreatments and cook times (n = 84) and obtained from quick-service restaurants (n = 64) were used for model development and validation. Acrylamide was quantified using gas chromatography-mass spectrometry, and reflectance spectra (400-2500 nm) of each freeze-dried sample were captured on a Foss XDS Rapid Content Analyzer-NIR spectrometer. Partial least-squares (PLS) discriminant analysis and PLS regression modeling demonstrated that NIRS could accurately detect acrylamide content as low as 50 μg/kg in the model potato matrix. Prediction errors of 135 μg/kg (R(2) = 0.98) and 255 μg/kg (R(2) = 0.93) were achieved with the best PLS models for acrylamide prediction in Russet Norkotah French-fried potato and multiple samples of unknown varieties, respectively. The findings indicate that NIRS can be used as a screening tool in potato breeding and potato processing research to reduce acrylamide in the food supply.

  17. [Local Regression Algorithm Based on Net Analyte Signal and Its Application in Near Infrared Spectral Analysis].

    PubMed

    Zhang, Hong-guang; Lu, Jian-gang

    2016-02-01

    To overcome the problems of significant differences among samples and nonlinearity between the property and spectra of samples in spectral quantitative analysis, a local regression algorithm is proposed in this paper. In this algorithm, the net analyte signal (NAS) method was first used to obtain the net analyte signal of the calibration samples and unknown samples; then the Euclidean distance between the net analyte signal of the unknown sample and the net analyte signal of each calibration sample was calculated and used as a similarity index. According to the defined similarity index, a local calibration set was individually selected for each unknown sample. Finally, a local PLS regression model was built on the local calibration set of each unknown sample. The proposed method was applied to a set of near infrared spectra of meat samples. The results demonstrate that the prediction precision and model complexity of the proposed method are superior to those of the global PLS regression method and the conventional local regression algorithm based on spectral Euclidean distance.
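
    The selection step described above can be illustrated with a short sketch. It assumes the net analyte signal vectors have already been computed elsewhere and that the local set is simply the `k` nearest calibration samples; the function name and `k` are hypothetical, and the abstract does not state how the local set size was chosen.

```python
import math

def select_local_calibration(query_signal, calibration_signals, k):
    """Return indices of the k calibration samples closest to the query.

    Similarity is the Euclidean distance between net analyte signal vectors,
    which are assumed to be precomputed.
    """
    distances = [(math.dist(query_signal, s), i)
                 for i, s in enumerate(calibration_signals)]
    distances.sort()
    return [i for _, i in distances[:k]]

# Hypothetical two-channel signals for four calibration samples.
calibration = [[0.0, 0.0], [1.0, 1.0], [5.0, 5.0], [0.5, 0.5]]
local_idx = select_local_calibration([0.4, 0.4], calibration, k=2)
```

    A local PLS model would then be fitted only on the samples indexed by `local_idx`, once per unknown sample.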

  18. Sub-Model Partial Least Squares for Improved Accuracy in Quantitative Laser Induced Breakdown Spectroscopy

    NASA Astrophysics Data System (ADS)

    Anderson, R. B.; Clegg, S. M.; Frydenvang, J.

    2015-12-01

    One of the primary challenges faced by the ChemCam instrument on the Curiosity Mars rover is developing a regression model that can accurately predict the composition of the wide range of target types encountered (basalts, calcium sulfate, feldspar, oxides, etc.). The original calibration used 69 rock standards to train a partial least squares (PLS) model for each major element. By expanding the suite of calibration samples to >400 targets spanning a wider range of compositions, the accuracy of the model was improved, but some targets with "extreme" compositions (e.g. pure minerals) were still poorly predicted. We have therefore developed a simple method, referred to as "submodel PLS", to improve the performance of PLS across a wide range of target compositions. In addition to generating a "full" (0-100 wt.%) PLS model for the element of interest, we also generate several overlapping submodels (e.g. for SiO2, we generate "low" (0-50 wt.%), "mid" (30-70 wt.%), and "high" (60-100 wt.%) models). The submodels are generally more accurate than the "full" model for samples within their range because they are able to adjust for matrix effects that are specific to that range. To predict the composition of an unknown target, we first predict the composition with the submodels and the "full" model. Then, based on the predicted composition from the "full" model, the appropriate submodel prediction can be used (e.g. if the full model predicts a low composition, use the "low" model result, which is likely to be more accurate). For samples with "full" predictions that occur in a region of overlap between submodels, the submodel predictions are "blended" using a simple linear weighted sum. The submodel PLS method shows improvements in most of the major elements predicted by ChemCam and reduces the occurrence of negative predictions for low wt.% targets. Submodel PLS is currently being used in conjunction with ICA regression for the major element compositions of ChemCam data.
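
    The selection and blending logic can be sketched as below, using the SiO2 ranges quoted in the abstract. The abstract only says that overlapping submodels are blended with "a simple linear weighted sum", so the linear ramp across each overlap region is an assumption, as are the function and argument names.

```python
def blend_submodels(full_pred, low_pred, mid_pred, high_pred):
    """Pick or blend submodel predictions based on the full-model prediction.

    Ranges follow the SiO2 example: low 0-50, mid 30-70, high 60-100 wt.%.
    """
    def ramp(x, start, end):
        # Weight rising linearly from 0 at start to 1 at end (assumed form).
        return min(1.0, max(0.0, (x - start) / (end - start)))

    if full_pred < 30.0:
        return low_pred
    if full_pred <= 50.0:  # overlap of "low" and "mid"
        w = ramp(full_pred, 30.0, 50.0)
        return (1.0 - w) * low_pred + w * mid_pred
    if full_pred < 60.0:
        return mid_pred
    if full_pred <= 70.0:  # overlap of "mid" and "high"
        w = ramp(full_pred, 60.0, 70.0)
        return (1.0 - w) * mid_pred + w * high_pred
    return high_pred

# A full-model prediction of 40 wt.% lies halfway through the low/mid overlap.
blended = blend_submodels(40.0, low_pred=38.0, mid_pred=42.0, high_pred=55.0)
```

    With this ramp the result changes continuously as the full-model prediction crosses a submodel boundary, which is one plausible reason for blending rather than switching abruptly.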

  19. Quantification of amine functional groups and their influence on OM/OC in the IMPROVE network

    NASA Astrophysics Data System (ADS)

    Kamruzzaman, Mohammed; Takahama, Satoshi; Dillner, Ann M.

    2018-01-01

    Recently, we developed a method using FT-IR spectroscopy coupled with partial least squares (PLS) regression to measure the four most abundant organic functional groups, aliphatic C-H, alcohol OH, carboxylic acid OH and carbonyl C=O, in atmospheric particulate matter. These functional groups are summed to estimate organic matter (OM) while the carbon from the functional groups is summed to estimate organic carbon (OC). With this method, OM and OM/OC can be estimated for each sample rather than relying on one assumed value to convert OC measurements to OM. This study continues the development of the FT-IR and PLS method for estimating OM and OM/OC by including the amine functional group. Amines are ubiquitous in the atmosphere and come from motor vehicle exhaust, animal husbandry, biomass burning, and vegetation among other sources. In this study, calibration standards for amines are produced by aerosolizing individual amine compounds and collecting them on PTFE filters using an IMPROVE sampler, thereby mimicking the filter media and collection geometry of ambient standards. The moles of amine functional group on each standard and a narrow range of amine-specific wavenumbers in the FT-IR spectra (wavenumber range 1550-1500 cm-1) are used to develop a PLS calibration model. The PLS model is validated using three methods: prediction of a set of laboratory standards not included in the model, a peak height analysis and a PLS model with a broader wavenumber range. The model is then applied to the ambient samples collected throughout 2013 from 16 IMPROVE sites in the USA. Urban sites have higher amine concentrations than most rural sites, but amine functional groups account for a lower fraction of OM at urban sites. Amine concentrations, contributions to OM and seasonality vary by site and sample. Amine has a small impact on the annual average OM/OC for urban sites, but for some rural sites including amine in the OM/OC calculations increased OM/OC by 0.1 or more.

  20. Advanced stability indicating chemometric methods for quantitation of amlodipine and atorvastatin in their quinary mixture with acidic degradation products

    NASA Astrophysics Data System (ADS)

    Darwish, Hany W.; Hassan, Said A.; Salem, Maissa Y.; El-Zeany, Badr A.

    2016-02-01

    Two advanced, accurate and precise chemometric methods are developed for the simultaneous determination of amlodipine besylate (AML) and atorvastatin calcium (ATV) in the presence of their acidic degradation products in tablet dosage forms. The first method was Partial Least Squares (PLS-1) and the second was Artificial Neural Networks (ANN). PLS was compared to ANN models with and without variable selection procedure (genetic algorithm (GA)). For proper analysis, a 5-factor 5-level experimental design was established resulting in 25 mixtures containing different ratios of the interfering species. Fifteen mixtures were used as calibration set and the other ten mixtures were used as validation set to validate the prediction ability of the suggested models. The proposed methods were successfully applied to the analysis of pharmaceutical tablets containing AML and ATV. The methods indicated the ability of the mentioned models to solve the highly overlapped spectra of the quinary mixture, yet using inexpensive and easy to handle instruments like the UV-VIS spectrophotometer.

  1. Bayesian regression models outperform partial least squares methods for predicting milk components and technological properties using infrared spectral data.

    PubMed

    Ferragina, A; de los Campos, G; Vazquez, A I; Cecchinato, A; Bittante, G

    2015-11-01

    The aim of this study was to assess the performance of Bayesian models commonly used for genomic selection to predict "difficult-to-predict" dairy traits, such as milk fatty acid (FA) expressed as percentage of total fatty acids, and technological properties, such as fresh cheese yield and protein recovery, using Fourier-transform infrared (FTIR) spectral data. Our main hypothesis was that Bayesian models that can estimate shrinkage and perform variable selection may improve our ability to predict FA traits and technological traits above and beyond what can be achieved using the current calibration models (e.g., partial least squares, PLS). To this end, we assessed a series of Bayesian methods and compared their prediction performance with that of PLS. The comparison between models was done using the same sets of data (i.e., same samples, same variability, same spectral treatment) for each trait. Data consisted of 1,264 individual milk samples collected from Brown Swiss cows for which gas chromatographic FA composition, milk coagulation properties, and cheese-yield traits were available. For each sample, 2 spectra in the infrared region from 5,011 to 925 cm(-1) were available and averaged before data analysis. Three Bayesian models: Bayesian ridge regression (Bayes RR), Bayes A, and Bayes B, and 2 reference models: PLS and modified PLS (MPLS) procedures, were used to calibrate equations for each of the traits. The Bayesian models used were implemented in the R package BGLR (http://cran.r-project.org/web/packages/BGLR/index.html), whereas the PLS and MPLS were those implemented in the WinISI II software (Infrasoft International LLC, State College, PA). Prediction accuracy was estimated for each trait and model using 25 replicates of a training-testing validation procedure. Compared with PLS, which is currently the most widely used calibration method, MPLS and the 3 Bayesian methods showed significantly greater prediction accuracy. Accuracy increased in moving from calibration to external validation methods, and in moving from PLS and MPLS to Bayesian methods, particularly Bayes A and Bayes B. The maximum R(2) value of validation was obtained with Bayes B and Bayes A. For the FA, C10:0 (% of each FA on total FA basis) had the highest R(2) (0.75, achieved with Bayes A and Bayes B), and among the technological traits, fresh cheese yield had the highest R(2) (0.82, achieved with Bayes B). These 2 methods have proven to be useful instruments in shrinking and selecting very informative wavelengths and inferring the structure and functions of the analyzed traits. We conclude that Bayesian models are powerful tools for deriving calibration equations, and, importantly, these equations can be easily developed using existing open-source software. As part of our study, we provide scripts based on the open source R software BGLR, which can be used to train customized prediction equations for other traits or populations. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  2. The Successive Projections Algorithm for interval selection in trilinear partial least-squares with residual bilinearization.

    PubMed

    Gomes, Adriano de Araújo; Alcaraz, Mirta Raquel; Goicoechea, Hector C; Araújo, Mario Cesar U

    2014-02-06

    In this work the Successive Projection Algorithm is presented for intervals selection in N-PLS for three-way data modeling. The proposed algorithm combines noise-reduction properties of PLS with the possibility of discarding uninformative variables in SPA. In addition, second-order advantage can be achieved by the residual bilinearization (RBL) procedure when an unexpected constituent is present in a test sample. For this purpose, SPA was modified in order to select intervals for use in trilinear PLS. The ability of the proposed algorithm, namely iSPA-N-PLS, was evaluated on one simulated and two experimental data sets, comparing the results to those obtained by N-PLS. In the simulated system, two analytes were quantitated in two test sets, with and without unexpected constituent. In the first experimental system, the determination of the four fluorophores (l-phenylalanine; l-3,4-dihydroxyphenylalanine; 1,4-dihydroxybenzene and l-tryptophan) was conducted with excitation-emission data matrices. In the second experimental system, quantitation of ofloxacin was performed in water samples containing two other uncalibrated quinolones (ciprofloxacin and danofloxacin) by high performance liquid chromatography with UV-vis diode array detector. For comparison purpose, a GA algorithm coupled with N-PLS/RBL was also used in this work. In most of the studied cases iSPA-N-PLS proved to be a promising tool for selection of variables in second-order calibration, generating models with smaller RMSEP, when compared to both the global model using all of the sensors in two dimensions and GA-NPLS/RBL. Copyright © 2013 Elsevier B.V. All rights reserved.

  3. Multivariate analysis of gamma spectra to characterize used nuclear fuel

    DOE PAGES

    Coble, Jamie; Orton, Christopher; Schwantes, Jon

    2017-01-17

    The Multi-Isotope Process (MIP) Monitor provides an efficient means to monitor the process conditions in used nuclear fuel reprocessing facilities to support process verification and validation. The MIP Monitor applies multivariate analysis to gamma spectroscopy of key stages in the reprocessing stream in order to detect small changes in the gamma spectrum, which may indicate changes in process conditions. This research extends the MIP Monitor by characterizing a used fuel sample after initial dissolution according to the type of reactor of origin (pressurized or boiling water reactor; PWR and BWR, respectively), initial enrichment, burn up, and cooling time. Simulated gamma spectra were used in this paper to develop and test three fuel characterization algorithms. The classification and estimation models employed are based on the partial least squares regression (PLS) algorithm. A PLS discriminant analysis model was developed which perfectly classified reactor type for the three PWR and three BWR reactor designs studied. Locally weighted PLS models were fitted on-the-fly to estimate the remaining fuel characteristics. For the simulated gamma spectra considered, burn up was predicted with 0.1% root mean squared percent error (RMSPE) and both cooling time and initial enrichment with approximately 2% RMSPE. Finally, this approach to automated fuel characterization can be used to independently verify operator declarations of used fuel characteristics and to inform the MIP Monitor anomaly detection routines at later stages of the fuel reprocessing stream to improve sensitivity to changes in operational parameters that may indicate issues with operational control or malicious activities.

  4. Multivariate analysis of gamma spectra to characterize used nuclear fuel

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Coble, Jamie; Orton, Christopher; Schwantes, Jon

    The Multi-Isotope Process (MIP) Monitor provides an efficient means to monitor the process conditions in used nuclear fuel reprocessing facilities to support process verification and validation. The MIP Monitor applies multivariate analysis to gamma spectroscopy of key stages in the reprocessing stream in order to detect small changes in the gamma spectrum, which may indicate changes in process conditions. This research extends the MIP Monitor by characterizing a used fuel sample after initial dissolution according to the type of reactor of origin (pressurized or boiling water reactor; PWR and BWR, respectively), initial enrichment, burn up, and cooling time. Simulated gamma spectra were used in this paper to develop and test three fuel characterization algorithms. The classification and estimation models employed are based on the partial least squares regression (PLS) algorithm. A PLS discriminant analysis model was developed which perfectly classified reactor type for the three PWR and three BWR reactor designs studied. Locally weighted PLS models were fitted on-the-fly to estimate the remaining fuel characteristics. For the simulated gamma spectra considered, burn up was predicted with 0.1% root mean squared percent error (RMSPE) and both cooling time and initial enrichment with approximately 2% RMSPE. Finally, this approach to automated fuel characterization can be used to independently verify operator declarations of used fuel characteristics and to inform the MIP Monitor anomaly detection routines at later stages of the fuel reprocessing stream to improve sensitivity to changes in operational parameters that may indicate issues with operational control or malicious activities.

  5. Detection of Butter Adulteration with Lard by Employing (1)H-NMR Spectroscopy and Multivariate Data Analysis.

    PubMed

    Fadzillah, Nurrulhidayah Ahmad; Man, Yaakob bin Che; Rohman, Abdul; Rosman, Arieff Salleh; Ismail, Amin; Mustafa, Shuhaimi; Khatib, Alfi

    2015-01-01

    Authenticating food products against the presence of components, such as lard, that are not permitted by certain religions is very important. In this study, we used proton Nuclear Magnetic Resonance ((1)H-NMR) spectroscopy for the analysis of butter adulterated with lard by simultaneous quantification of all proton-bearing compounds, and consequently all relevant sample classes. Since the spectra obtained were too complex to be analyzed visually by the naked eye, classification of the spectra was carried out. The multivariate calibration of partial least square (PLS) regression was used for modelling the relationship between the actual value of lard and the predicted value. The model yielded the highest regression coefficient (R(2)) of 0.998 and the lowest root mean square error of calibration (RMSEC) of 0.0091% and root mean square error of prediction (RMSEP) of 0.0090, respectively. Cross validation testing evaluates the predictive power of the model. The PLS model was shown to be a good model, as the intercepts of R(2)Y and Q(2)Y were 0.0853 and -0.309, respectively.

  6. Application of Fourier transform infrared spectroscopy and chemometrics for differentiation of Salmonella enterica serovar Enteritidis phage types.

    PubMed

    Preisner, Ornella; Guiomar, Raquel; Machado, Jorge; Menezes, José Cardoso; Lopes, João Almeida

    2010-06-01

    Fourier transform infrared (FT-IR) spectroscopy and chemometric techniques were used to discriminate five closely related Salmonella enterica serotype Enteritidis phage types, phage type 1 (PT1), PT1b, PT4b, PT6, and PT6a. Intact cells and outer membrane protein (OMP) extracts from bacterial cell membranes were subjected to FT-IR analysis in transmittance mode. Spectra were collected over a wavenumber range from 4,000 to 600 cm(-1). Partial least-squares discriminant analysis (PLS-DA) was used to develop calibration models based on preprocessed FT-IR spectra. The analysis based on OMP extracts provided greater separation between the Salmonella Enteritidis PT1-PT1b, PT4b, and PT6-PT6a groups than the intact cell analysis. When these three phage type groups were considered, the method based on OMP extract FT-IR spectra was 100% accurate. Moreover, complementary local models that considered only the PT1-PT1b and PT6-PT6a groups were developed, and the level of discrimination increased. PT1 and PT1b isolates were differentiated successfully with the local model using the entire OMP extract spectrum (98.3% correct predictions), whereas the accuracy of discrimination between PT6 and PT6a isolates was 86.0%. Isolates belonging to different phage types (PT19, PT20, and PT21) were used with the model to test its robustness. For the first time it was demonstrated that FT-IR analysis of OMP extracts can be used for construction of robust models that allow fast and accurate discrimination of different Salmonella Enteritidis phage types.

  7. Experimental design based 3-D QSAR analysis of steroid-protein interactions: Application to human CBG complexes

    NASA Astrophysics Data System (ADS)

    Norinder, Ulf

    1990-12-01

    An experimental design based 3-D QSAR analysis using a combination of principal component and PLS analysis is presented and applied to human corticosteroid-binding globulin complexes. The predictive capability of the created model is good. The technique can also be used as guidance when selecting new compounds to be investigated.

  8. Prediction of the distillation temperatures of crude oils using ¹H NMR and support vector regression with estimated confidence intervals.

    PubMed

    Filgueiras, Paulo R; Terra, Luciana A; Castro, Eustáquio V R; Oliveira, Lize M S L; Dias, Júlio C M; Poppi, Ronei J

    2015-09-01

    This paper aims to estimate the temperature equivalent to 10% (T10%), 50% (T50%) and 90% (T90%) of distilled volume in crude oils using (1)H NMR and support vector regression (SVR). Confidence intervals for the predicted values were calculated using a boosting-type ensemble method in a procedure called ensemble support vector regression (eSVR). The estimated confidence intervals obtained by eSVR were compared with previously accepted calculations from partial least squares (PLS) models and a boosting-type ensemble applied in the PLS method (ePLS). By using the proposed boosting strategy, it was possible to identify outliers in the T10% property dataset. The eSVR procedure improved the accuracy of the distillation temperature predictions in relation to standard PLS, ePLS and SVR. For T10%, a root mean square error of prediction (RMSEP) of 11.6°C was obtained in comparison with 15.6°C for PLS, 15.1°C for ePLS and 28.4°C for SVR. The RMSEPs for T50% were 24.2°C, 23.4°C, 22.8°C and 14.4°C for PLS, ePLS, SVR and eSVR, respectively. For T90%, the values of RMSEP were 39.0°C, 39.9°C and 39.9°C for PLS, ePLS, SVR and eSVR, respectively. The confidence intervals calculated by the proposed boosting methodology presented acceptable values for the three properties analyzed; however, they were lower than those calculated by the standard methodology for PLS. Copyright © 2015 Elsevier B.V. All rights reserved.
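
    The RMSEP figure of merit quoted above is the square root of the mean squared difference between reference and predicted values on a prediction set. A minimal sketch follows; the function name rmsep and the example temperatures are illustrative assumptions and are not data from the paper.

```python
import math

def rmsep(y_true, y_pred):
    """Root mean square error of prediction over paired values."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(a - b) ** 2 for a, b in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))

# Made-up reference and predicted distillation temperatures (°C).
reference = [100.0, 200.0, 300.0]
predicted = [110.0, 190.0, 300.0]
error = rmsep(reference, predicted)
```

    The result is in the same units as the predicted property, which is why the abstract reports RMSEP in °C.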

  9. Propellant's differentiation using FTIR-photoacoustic detection for forensic studies of improvised explosive devices.

    PubMed

    Álvarez, Ángela; Yáñez, Jorge; Contreras, David; Saavedra, Renato; Sáez, Pedro; Amarasiriwardena, Dulasiri

    2017-11-01

    The use of propellant for making improvised explosive devices (IED) is an incipient criminal practice. Propellant can be used as initiator in explosive mixtures along with other components such as coal, ammonium nitrate, sulfur, etc. The identification of the propellant's brand used in homemade explosives can provide additional forensic information about this evidence. In this work, four of the most common propellant brands were characterized by Fourier-transform infrared photoacoustic spectroscopy (FTIR-PAS), which is a non-destructive micro-analytical technique. Spectra show characteristic signals of typical compounds in the propellants, such as nitrocellulose, nitroglycerin, guanidine, diphenylamine, etc. The differentiation of propellant components was achieved by using FTIR-PAS combined with chemometric methods of classification. Principal component analysis (PCA) and soft independent modelling of class analogy (SIMCA) were used to achieve an effective differentiation and classification (100%) of propellant brands. Furthermore, propellant brand differentiation was also assessed using partial least squares discriminant analysis (PLS-DA) with leave-one-out cross-validation (∼97%) and external validation (∼100%). Our results show the ability of FTIR-PAS combined with chemometric analysis to identify and differentiate propellant brands in different explosive formulations of IED. Copyright © 2017 Elsevier B.V. All rights reserved.

  10. An Advanced Analytical Chemistry Experiment Using Gas Chromatography-Mass Spectrometry, MATLAB, and Chemometrics to Predict Biodiesel Blend Percent Composition

    ERIC Educational Resources Information Center

    Pierce, Karisa M.; Schale, Stephen P.; Le, Trang M.; Larson, Joel C.

    2011-01-01

    We present a laboratory experiment for an advanced analytical chemistry course where we first focus on the chemometric technique partial least-squares (PLS) analysis applied to one-dimensional (1D) total-ion-current gas chromatography-mass spectrometry (GC-TIC) separations of biodiesel blends. Then, we focus on n-way PLS (n-PLS) applied to…

  11. Metabonomic Profiling of TASTPM Transgenic Alzheimer's Disease Mouse Model

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hu, Zeping; Browne, Edward R.; Liu, Tao

    2012-12-07

    Identification of molecular mechanisms underlying early stage Alzheimer’s disease (AD) is important for the development of new therapies against and diagnosis of AD. In this study, non-targeted metabotyping of TASTPM transgenic AD mice was performed. The metabolic profiles of both brain and plasma of TASTPM mice were characterized using gas chromatography-mass spectrometry and compared to those of wild type C57BL/6J mice. TASTPM mice were metabolically distinct compared to wild type mice (Q2Y = 0.587 and 0.766 for PLS-DA models derived from brain and plasma, respectively). A number of metabolites were found to be perturbed in TASTPM mice in both brain (D-fructose, L-valine, L-serine, L-threonine, zymosterol) and plasma (D-glucose, D-galactose, linoleic acid, arachidonic acid, palmitic acid and D-gluconic acid). In addition, enzyme immunoassay confirmed that selected endogenous steroids were significantly perturbed in brain (androstenedione and 17-OH-progesterone) and plasma (cortisol and testosterone) of TASTPM mice. Ingenuity pathway analysis revealed that perturbations related to amino acid metabolism (brain), steroid biosynthesis (brain), linoleic acid metabolism (plasma) and energy metabolism (plasma) accounted for the differentiation of TASTPM and wild-type

  12. Vis-NIR spectrometric determination of Brix and sucrose in sugar production samples using kernel partial least squares with interval selection based on the successive projections algorithm.

    PubMed

    de Almeida, Valber Elias; de Araújo Gomes, Adriano; de Sousa Fernandes, David Douglas; Goicoechea, Héctor Casimiro; Galvão, Roberto Kawakami Harrop; Araújo, Mario Cesar Ugulino

    2018-05-01

    This paper proposes a new variable selection method for nonlinear multivariate calibration, combining the Successive Projections Algorithm for interval selection (iSPA) with the Kernel Partial Least Squares (Kernel-PLS) modelling technique. The proposed iSPA-Kernel-PLS algorithm is employed in a case study involving a Vis-NIR spectrometric dataset with complex nonlinear features. The analytical problem consists of determining Brix and sucrose content in samples from a sugar production system, on the basis of transflectance spectra. As compared to full-spectrum Kernel-PLS, the iSPA-Kernel-PLS models involve a smaller number of variables and display statistically significant superiority in terms of accuracy and/or bias in the predictions. Published by Elsevier B.V.

  13. A nonpolar, nonamphiphilic molecule can accelerate adsorption of phospholipids and lower their surface tension at the air/water interface.

    PubMed

    Nguyen, Phuc Nghia; Trinh Dang, Thuan Thao; Waton, Gilles; Vandamme, Thierry; Krafft, Marie Pierre

    2011-10-04

    The adsorption dynamics of a series of phospholipids (PLs) at the interface between an aqueous solution or dispersion of the PL and a gas phase containing the nonpolar, nonamphiphilic linear perfluorocarbon perfluorohexane (PFH) was studied by bubble profile analysis tensiometry. The PLs investigated were dioctanoylphosphatidylcholine (DiC(8)-PC), dilaurylphosphatidylcholine, dimyristoylphosphatidylcholine, and dipalmitoylphosphatidylcholine. The gas phase consisted of air or air saturated with PFH. The perfluorocarbon gas was found to have an unexpected, strong effect on both the adsorption rate and the equilibrium interfacial tension (γ(eq)) of the PLs. First, for all of the PLs, and at all concentrations investigated, the γ(eq) values were significantly lower (by up to 10 mN m(-1)) when PFH was present in the gas phase. The efficacy of PFH in decreasing γ(eq) depends on the ability of PLs to form micelles or vesicles in water. For vesicles, it also depends on the gel or fluid state of the membranes. Second, the adsorption rates of all the PLs at the interface (as assessed by the time required for the initial interfacial tension to be reduced by 30%) are significantly accelerated (by up to fivefold) by the presence of PFH for the lower PL concentrations. Both the surface-tension reducing effect and the adsorption rate increasing effect establish that PFH has a strong interaction with the PL monolayer and acts as a cosurfactant at the interface, despite the absence of any amphiphilic character. Fitting the adsorption profiles of DiC(8)-PC at the PFH-saturated air/aqueous solution interface with the modified Frumkin model indicated that the PFH molecule lay horizontally at the interface. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  14. Purified human MDR 1 modulates membrane potential in reconstituted proteoliposomes.

    PubMed

    Howard, Ellen M; Roepe, Paul D

    2003-04-01

    Human multidrug resistance (hu MDR 1) cDNA was fused to a P. shermanii transcarboxylase biotin acceptor domain (TCBD), and the fusion protein was heterologously overexpressed at high yield in K(+)-uptake deficient Saccharomyces cerevisiae yeast strain 9.3, purified by avidin-biotin chromatography, and reconstituted into proteoliposomes (PLs) formed with Escherichia coli lipid. As measured by pH-dependent ATPase activity, purified, reconstituted, biotinylated MDR-TCBD protein is fully functional. Dodecyl maltoside proved to be the most effective detergent for the membrane solubilization of MDR-TCBD, and various salts were found to significantly affect reconstitution into PLs. After extensive analysis, we find that purified reconstituted MDR-TCBD protein does not catalyze measurable H(+) pumping in the presence of ATP. In the presence of physiologic [ATP], K(+)/Na(+) diffusion potentials monitored by either anionic oxonol or cationic carbocyanine are easily established upon addition of valinomycin to either control or MDR-TCBD PLs. However, in the absence of ATP, although control PLs still maintain easily measurable K(+)/Na(+) diffusion potentials upon addition of valinomycin, MDR-TCBD PLs do not. Dissipation of potential by MDR-TCBD is clearly [ATP] dependent and also appears to be Cl(-) dependent, since replacing Cl(-) with equimolar glutamate restores the ability of MDR-TCBD PLs to form a membrane potential in the absence of physiologic [ATP]. The data are difficult to reconcile with models that might propose ATP-catalyzed "pumping" of the fluorescent probes we use and are more consistent with electrically passive anion transport via MDR-TCBD protein, but only at low [ATP]. These observations may help to resolve the confusing array of data related to putative ion transport by hu MDR 1 protein.

  15. LC-MS based metabolomics and chemometrics study of the toxic effects of copper on Saccharomyces cerevisiae.

    PubMed

    Farrés, Mireia; Piña, Benjamí; Tauler, Romà

    2016-08-01

    Copper containing fungicides are used to protect vineyards from fungal infections. Higher residues of copper in grapes at toxic concentrations are potentially toxic and affect the microorganisms living in vineyards, such as Saccharomyces cerevisiae. In this study, the response of the metabolic profiles of S. cerevisiae at different concentrations of copper sulphate (control, 1 mM, 3 mM and 6 mM) was analysed by liquid chromatography coupled to mass spectrometry (LC-MS) and multivariate curve resolution-alternating least squares (MCR-ALS) using an untargeted metabolomics approach. Peak areas of the MCR-ALS resolved elution profiles in control and in Cu(ii)-treated samples were compared using partial least squares regression (PLSR) and PLS-discriminant analysis (PLS-DA), and the intracellular metabolites best contributing to sample discrimination were selected and identified. Fourteen metabolites showed significant concentration changes upon Cu(ii) exposure, following a dose-response effect. The observed changes were consistent with the expected effects of Cu(ii) toxicity, including oxidative stress and DNA damage. This research confirmed that LC-MS based metabolomics coupled to chemometric methods are a powerful approach for discerning metabolomics changes in S. cerevisiae and for elucidating modes of toxicity of environmental stressors, including heavy metals like Cu(ii).

  16. Convolutional neural networks for vibrational spectroscopic data analysis.

    PubMed

    Acquarelli, Jacopo; van Laarhoven, Twan; Gerretzen, Jan; Tran, Thanh N; Buydens, Lutgarde M C; Marchiori, Elena

    2017-02-15

    In this work we show that convolutional neural networks (CNNs) can be efficiently used to classify vibrational spectroscopic data and identify important spectral regions. CNNs are the current state-of-the-art in image classification and speech recognition and can learn interpretable representations of the data. These characteristics make CNNs a good candidate for reducing the need for preprocessing and for highlighting important spectral regions, both of which are crucial steps in the analysis of vibrational spectroscopic data. Chemometric analysis of vibrational spectroscopic data often relies on preprocessing methods involving baseline correction, scatter correction and noise removal, which are applied to the spectra prior to model building. Preprocessing is a critical step because even in simple problems using 'reasonable' preprocessing methods may decrease the performance of the final model. We develop a new CNN based method and provide an accompanying publicly available software. It is based on a simple CNN architecture with a single convolutional layer (a so-called shallow CNN). Our method outperforms standard classification algorithms used in chemometrics (e.g. PLS) in terms of accuracy when applied to non-preprocessed test data (86% average accuracy compared to the 62% achieved by PLS), and it achieves better performance even on preprocessed test data (96% average accuracy compared to the 89% achieved by PLS). For interpretability purposes, our method includes a procedure for finding important spectral regions, thereby facilitating qualitative interpretation of results. Copyright © 2016 Elsevier B.V. All rights reserved.

  17. The subgingival microbiota of Papillon-Lefèvre syndrome.

    PubMed

    Albandar, Jasim M; Khattab, Razan; Monem, Fawza; Barbuto, Sara M; Paster, Bruce J

    2012-07-01

    There is little information about the microbiologic profiles of periodontal lesions in Papillon-Lefèvre syndrome (PLS) and the significance of bacteria in the pathogenesis of periodontitis in these patients. This comprehensive analysis of the subgingival microbiota in patients with PLS used 16S ribosomal RNA (rRNA) clonal analysis and the 16S rRNA-based Human Oral Microbe Identification Microarray (HOMIM). Thirteen patients with PLS from seven unrelated families volunteered for this microbiologic study. Subgingival plaque was collected with sterile paper points from multiple sites with ≥5 mm probing depth, and whole genomic DNA was extracted. The 16S rRNA genes were amplified, cloned, and sequenced. The samples were then probed for ≈300 predominant oral bacterial species using the HOMIM. The most commonly detected phylotypes in the clonal analysis were Gemella morbillorum, Gemella haemolysans, Granulicatella adiacens, Lachnospiraceae OT 100 (EI074), Parvimonas micra, Selenomonas noxia, and Veillonella parvula. As a group, streptococci were commonly detected in these individuals. In the HOMIM analysis, a total of 170 bacterial species/phylotypes were detected, with a range of 40 to 80 species per patient with PLS. Of these, 12 bacterial species were detected in medium to high levels in ≥50% of the individuals. The high-frequency strains were clustered into eight groups: Aggregatibacter actinomycetemcomitans, Campylobacter spp., Capnocytophaga granulosa, G. morbillorum, P. micra, Porphyromonas endodontalis, Streptococcus spp., and Tannerella forsythia. The subgingival microbiota in PLS is diverse. Periodontal pathogens commonly associated with chronic and aggressive periodontitis and opportunistic pathogens may be associated with the development of severe periodontitis in patients with PLS.

  18. Novel PLS3 variants in X-linked osteoporosis: Exploring bone material properties.

    PubMed

    Balasubramanian, Meena; Fratzl-Zelman, Nadja; O'Sullivan, Rory; Bull, Mary; Fa Peel, Nicola; Pollitt, Rebecca C; Jones, Rebecca; Milne, Elizabeth; Smith, Kath; Roschger, Paul; Klaushofer, Klaus; Bishop, Nicholas J

    2018-05-07

    Idiopathic Juvenile Osteoporosis (IJO) refers to significantly lower than expected bone mass manifesting in childhood with no identifiable aetiology. IJO classically presents in early pubertal period with multiple fractures including metaphyseal and vertebral crush fractures, and low bone-mass. Here we describe two patients and provide information on their clinical phenotype, genotype and bone material analysis in one of the patients. Patient 1: 40-year old adult male diagnosed with IJO in childhood who re-presented with a hip fracture as an adult. Genetic analysis identified a pathogenic PLS3 hemizygous variant, c.1765del in exon 16. Patient 2: 15-year old boy with multiple vertebral fractures and bone biopsy findings suggestive of IJO who also has a diagnosis of autism spectrum disorder. Genetic analysis identified a maternally inherited PLS3 pathogenic c.1295T>A variant in exon 12. Analyses of the transiliac bone sample revealed severe reduction of trabecular volume and bone turnover indices and elevated bone matrix mineralisation. We propose that genetic testing for PLS3 should be undertaken in patients presenting with a current or previous history of IJO as this has implications for genetic counselling and cascade screening. The extensive evaluation of the transiliac biopsy sample of Patient 2 revealed a novel bone phenotype. This report includes a review of IJO and genetic causes of osteoporosis, and suggests that existing cases of IJO should be screened for PLS3. Through analysis of bone material properties in Patient 2, we can conclude that PLS3 does have a role in bone mineralisation. © 2018 Wiley Periodicals, Inc.

  19. Application of Fourier transform infrared spectroscopy with chemometrics on postmortem interval estimation based on pericardial fluids.

    PubMed

    Zhang, Ji; Li, Bing; Wang, Qi; Wei, Xin; Feng, Weibo; Chen, Yijiu; Huang, Ping; Wang, Zhenyuan

    2017-12-21

    Postmortem interval (PMI) evaluation remains a challenge in the forensic community due to the lack of efficient methods. Studies have focused on chemical analysis of biofluids for PMI estimation; however, no reports using spectroscopic methods in pericardial fluid (PF) are available. In this study, Fourier transform infrared (FTIR) spectroscopy with attenuated total reflectance (ATR) accessory was applied to collect comprehensive biochemical information from rabbit PF at different PMIs. The PMI-dependent spectral signature was determined by two-dimensional (2D) correlation analysis. The partial least squares (PLS) and nu-support vector machine (nu-SVM) models were then established based on the acquired spectral dataset. Spectral variables associated with amide I, amide II, COO⁻, C-H bending, and C-O or C-OH vibrations arising from proteins, polypeptides, amino acids and carbohydrates, respectively, were susceptible to PMI in 2D correlation analysis. Moreover, the nu-SVM model appeared to achieve a more satisfactory prediction than the PLS model in calibration; the reliability of both models was determined in an external validation set. The study shows the possibility of application of ATR-FTIR methods in postmortem interval estimation using PF samples.

  20. Robust PLS approach for KPI-related prediction and diagnosis against outliers and missing data

    NASA Astrophysics Data System (ADS)

    Yin, Shen; Wang, Guang; Yang, Xu

    2014-07-01

    In practical industrial applications, the key performance indicator (KPI)-related prediction and diagnosis are quite important for the product quality and economic benefits. To meet these requirements, many advanced prediction and monitoring approaches have been developed which can be classified into model-based or data-driven techniques. Among these approaches, partial least squares (PLS) is one of the most popular data-driven methods due to its simplicity and easy implementation in large-scale industrial process. As PLS is totally based on the measured process data, the characteristics of the process data are critical for the success of PLS. Outliers and missing values are two common characteristics of the measured data which can severely affect the effectiveness of PLS. To ensure the applicability of PLS in practical industrial applications, this paper introduces a robust version of PLS to deal with outliers and missing values, simultaneously. The effectiveness of the proposed method is finally demonstrated by the application results of the KPI-related prediction and diagnosis on an industrial benchmark of Tennessee Eastman process.

  1. Attenuated total reflectance-FT-IR spectroscopy for gunshot residue analysis: potential for ammunition determination.

    PubMed

    Bueno, Justin; Sikirzhytski, Vitali; Lednev, Igor K

    2013-08-06

    The ability to link a suspect to a particular shooting incident is a principal task for many forensic investigators. Here, we attempt to achieve this goal by analysis of gunshot residue (GSR) through the use of attenuated total reflectance (ATR) Fourier transform infrared spectroscopy (FT-IR) combined with statistical analysis. The firearm discharge process is analogous to a complex chemical process. Therefore, the products of this process (GSR) will vary based upon numerous factors, including the specific combination of the firearm and ammunition which was discharged. Differentiation of FT-IR data, collected from GSR particles originating from three different firearm-ammunition combinations (0.38 in., 0.40 in., and 9 mm calibers), was achieved using projection to latent structures discriminant analysis (PLS-DA). The technique was validated both internally, by leave-one-out cross-validation, and externally. External validation was achieved via assignment (caliber identification) of unknown FT-IR spectra from unknown GSR particles. The results demonstrate great potential for ATR-FT-IR spectroscopic analysis of GSR for forensic purposes.
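
    Leave-one-out cross-validation, as mentioned above, holds out each sample in turn, trains on the rest, and scores the held-out prediction. The sketch below assumes a nearest-centroid classifier as a simple stand-in for PLS-DA, which is not the method used in the study. The function names and the toy data are hypothetical.

```python
def nearest_centroid_predict(train_X, train_y, x):
    """Stand-in classifier (not PLS-DA): label of the closest class centroid."""
    groups = {}
    for row, label in zip(train_X, train_y):
        groups.setdefault(label, []).append(row)
    best_label, best_dist = None, float("inf")
    for label, rows in groups.items():
        centroid = [sum(col) / len(rows) for col in zip(*rows)]
        dist = sum((a - b) ** 2 for a, b in zip(x, centroid))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label

def leave_one_out_accuracy(X, y, predict=nearest_centroid_predict):
    """Fraction of samples classified correctly when each is held out once."""
    correct = 0
    for i in range(len(X)):
        train_X = X[:i] + X[i + 1:]
        train_y = y[:i] + y[i + 1:]
        if predict(train_X, train_y, X[i]) == y[i]:
            correct += 1
    return correct / len(X)

# Toy two-class data with well-separated clusters.
X_demo = [[0.0, 0.1], [0.1, 0.0], [0.2, 0.1], [1.0, 0.9], [0.9, 1.0], [1.1, 1.0]]
y_demo = ["a", "a", "a", "b", "b", "b"]
loo_accuracy = leave_one_out_accuracy(X_demo, y_demo)
```

    External validation differs in that the test spectra are never part of any training fold.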

  2. Nondestructive evaluation of soluble solid content in strawberry by near infrared spectroscopy

    NASA Astrophysics Data System (ADS)

    Guo, Zhiming; Huang, Wenqian; Chen, Liping; Wang, Xiu; Peng, Yankun

    This paper indicates the feasibility of using near infrared (NIR) spectroscopy combined with the synergy interval partial least squares (siPLS) algorithm as a rapid nondestructive method to estimate the soluble solid content (SSC) in strawberry. Spectral preprocessing methods were selected and optimized by cross-validation during model calibration. The partial least squares (PLS) algorithm was used to calibrate the regression model. The performance of the final model was evaluated by the root mean square error of calibration (RMSEC) and correlation coefficient (R²c) in the calibration set, and tested by the root mean square error of prediction (RMSEP) and correlation coefficient (R²p) in the prediction set. The optimal siPLS model was obtained after first-derivative preprocessing of the spectra. The results of the best model were as follows: RMSEC = 0.2259, R²c = 0.9590 in the calibration set; and RMSEP = 0.2892, R²p = 0.9390 in the prediction set. This work demonstrated that NIR spectroscopy and siPLS with efficient spectral preprocessing is a useful tool for nondestructive evaluation of SSC in strawberry.
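
    As a rough illustration of the interval search behind siPLS: the spectrum is split into contiguous intervals, and every combination of a few intervals is tried as the variable set for a PLS model. The sketch below only enumerates the combinations and picks the one with the lowest error. The abstract does not give the number of intervals or how many were combined, so those are left as parameters. The function names and the score callable (assumed to return a cross-validated error for a list of column indices) are hypothetical.

```python
from itertools import combinations

def split_intervals(n_vars, n_intervals):
    """Split variable indices 0..n_vars-1 into nearly equal contiguous ranges."""
    base, extra = divmod(n_vars, n_intervals)
    bounds, start = [], 0
    for k in range(n_intervals):
        size = base + (1 if k < extra else 0)
        bounds.append(range(start, start + size))
        start += size
    return bounds

def interval_combinations(n_vars, n_intervals, n_combined):
    """Yield (interval ids, column indices) for every combination of intervals."""
    intervals = split_intervals(n_vars, n_intervals)
    for combo in combinations(range(n_intervals), n_combined):
        cols = [j for k in combo for j in intervals[k]]
        yield combo, cols

def best_combination(n_vars, n_intervals, n_combined, score):
    """Return (error, combo, cols) minimizing the caller-supplied score(cols)."""
    candidates = (
        (score(cols), combo, cols)
        for combo, cols in interval_combinations(n_vars, n_intervals, n_combined)
    )
    return min(candidates, key=lambda item: item[0])
```

    A real score function would fit a PLS model on the selected columns and return its cross-validated error.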

  3. Evaluation of the efficiency of continuous wavelet transform as processing and preprocessing algorithm for resolution of overlapped signals in univariate and multivariate regression analyses; an application to ternary and quaternary mixtures

    NASA Astrophysics Data System (ADS)

    Hegazy, Maha A.; Lotfy, Hayam M.; Mowaka, Shereen; Mohamed, Ekram Hany

    2016-07-01

    Wavelets have been adapted for a vast number of signal-processing applications due to the amount of information that can be extracted from a signal. In this work, a comparative study on the efficiency of continuous wavelet transform (CWT) as a signal processing tool in univariate regression and a pre-processing tool in multivariate analysis using partial least squares (CWT-PLS) was conducted. These were applied to complex spectral signals of ternary and quaternary mixtures. The CWT-PLS method succeeded in the simultaneous determination of a quaternary mixture of drotaverine (DRO), caffeine (CAF), paracetamol (PAR) and p-aminophenol (PAP, the major impurity of paracetamol). In contrast, the univariate CWT failed to determine the quaternary mixture components simultaneously; it was able to determine only PAR and PAP and the ternary mixtures of DRO, CAF, and PAR and of CAF, PAR, and PAP. During the calculations of CWT, different wavelet families were tested. The univariate CWT method was validated according to the ICH guidelines. For the development of the CWT-PLS model, a calibration set was prepared by means of an orthogonal experimental design, and the absorption spectra were recorded and processed by CWT. The CWT-PLS model was constructed by regression between the wavelet coefficients and concentration matrices, and validation was performed by both cross-validation and external validation sets. Both methods were successfully applied for determination of the studied drugs in pharmaceutical formulations.

  4. Fast-HPLC Fingerprinting to Discriminate Olive Oil from Other Edible Vegetable Oils by Multivariate Classification Methods.

    PubMed

    Jiménez-Carvelo, Ana M; González-Casado, Antonio; Pérez-Castaño, Estefanía; Cuadros-Rodríguez, Luis

    2017-03-01

    A new analytical method for the differentiation of olive oil from other vegetable oils using reversed-phase LC and applying chemometric techniques was developed. A 3 cm short column was used to obtain the chromatographic fingerprint of the methyl-transesterified fraction of each vegetable oil. The chromatographic analysis took only 4 min. The multivariate classification methods used were k-nearest neighbors, partial least-squares (PLS) discriminant analysis, one-class PLS, support vector machine classification, and soft independent modeling of class analogies. The discrimination of olive oil from other vegetable edible oils was evaluated by several classification quality metrics. Several strategies for the classification of the olive oil were used: one input-class, two input-class, and pseudo two input-class.

  5. Glucose determination in human aqueous humor with Raman spectroscopy

    NASA Technical Reports Server (NTRS)

    Lambert, James L.; Pelletier, Christine C.; Borchert, Mark

    2005-01-01

    It has been suggested that spectroscopic analysis of the aqueous humor of the eye could be used to indirectly predict blood glucose levels in diabetics noninvasively. We have been investigating this potential using Raman spectroscopy in combination with partial least squares (PLS) analysis. We have determined that glucose at clinically relevant concentrations can be accurately predicted in human aqueous humor in vitro using a PLS model based on artificial aqueous humor. We have further determined that with proper instrument design, the light energy necessary to achieve clinically acceptable prediction of glucose does not damage the retinas of rabbits and can be delivered at powers below internationally acceptable safety limits. Herein we summarize our current results and address our strategies to improve instrument design. 2005 Society of Photo-Optical Instrumentation Engineers.

  6. Cole-Cole, linear and multivariate modeling of capacitance data for on-line monitoring of biomass.

    PubMed

    Dabros, Michal; Dennewald, Danielle; Currie, David J; Lee, Mark H; Todd, Robert W; Marison, Ian W; von Stockar, Urs

    2009-02-01

    This work evaluates three techniques of calibrating capacitance (dielectric) spectrometers used for on-line monitoring of biomass: modeling of cell properties using the theoretical Cole-Cole equation, linear regression of dual-frequency capacitance measurements on biomass concentration, and multivariate (PLS) modeling of scanning dielectric spectra. The performance and robustness of each technique is assessed during a sequence of validation batches in two experimental settings of differing signal noise. In more noisy conditions, the Cole-Cole model had significantly higher biomass concentration prediction errors than the linear and multivariate models. The PLS model was the most robust in handling signal noise. In less noisy conditions, the three models performed similarly. Estimates of the mean cell size were done additionally using the Cole-Cole and PLS models, the latter technique giving more satisfactory results.

  7. Simulated Annealing Based Hybrid Forecast for Improving Daily Municipal Solid Waste Generation Prediction

    PubMed Central

    Song, Jingwei; He, Jiaying; Zhu, Menghua; Tan, Debao; Zhang, Yu; Ye, Song; Shen, Dingtao; Zou, Pengfei

    2014-01-01

    A simulated annealing (SA) based variable weighted forecast model is proposed to combine and weigh local chaotic model, artificial neural network (ANN), and partial least square support vector machine (PLS-SVM) to build a more accurate forecast model. The hybrid model was built and multistep ahead prediction ability was tested based on daily MSW generation data from Seattle, Washington, the United States. The hybrid forecast model was proved to produce more accurate and reliable results and to degrade less in longer predictions than three individual models. The average one-week step ahead prediction has been raised from 11.21% (chaotic model), 12.93% (ANN), and 12.94% (PLS-SVM) to 9.38%. Five-week average has been raised from 13.02% (chaotic model), 15.69% (ANN), and 15.92% (PLS-SVM) to 11.27%. PMID:25301508
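
    A weighted combination of several forecasts tuned by simulated annealing can be sketched as follows. This is a generic annealing loop and not the authors' algorithm: the error measure (mean absolute percentage error), the Gaussian perturbation, the cooling schedule and all function names are assumptions, since the abstract does not specify them.

```python
import math
import random

def mape(actual, forecast):
    """Mean absolute percentage error, in percent."""
    return 100.0 * sum(abs((a - f) / a) for a, f in zip(actual, forecast)) / len(actual)

def combine(forecasts, weights):
    """Weighted sum of the individual forecast series."""
    steps = len(forecasts[0])
    return [sum(w * f[t] for w, f in zip(weights, forecasts)) for t in range(steps)]

def anneal_weights(actual, forecasts, steps=2000, temp0=1.0, cooling=0.995, seed=0):
    """Search non-negative weights summing to 1 that lower the combined MAPE."""
    rng = random.Random(seed)
    k = len(forecasts)
    current = [1.0 / k] * k  # start from equal weights
    current_err = mape(actual, combine(forecasts, current))
    best, best_err = current[:], current_err
    temp = temp0
    for _ in range(steps):
        # Perturb the weights, clip at zero, and renormalize to sum to 1.
        cand = [max(0.0, w + rng.gauss(0.0, 0.05)) for w in current]
        total = sum(cand)
        if total == 0.0:
            continue
        cand = [w / total for w in cand]
        err = mape(actual, combine(forecasts, cand))
        # Accept improvements always, worse moves with a temperature-dependent chance.
        if err < current_err or rng.random() < math.exp((current_err - err) / temp):
            current, current_err = cand, err
            if err < best_err:
                best, best_err = cand[:], err
        temp *= cooling
    return best, best_err
```

    Because the best weights seen so far are retained, the returned error is never worse than that of the equal-weight starting point.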

  8. A metabolomics-driven approach to predict cocoa product consumption by designing a multimetabolite biomarker model in free-living subjects from the PREDIMED study.

    PubMed

    Garcia-Aloy, Mar; Llorach, Rafael; Urpi-Sarda, Mireia; Jáuregui, Olga; Corella, Dolores; Ruiz-Canela, Miguel; Salas-Salvadó, Jordi; Fitó, Montserrat; Ros, Emilio; Estruch, Ramon; Andres-Lacueva, Cristina

    2015-02-01

    The aim of the current study was to apply an untargeted metabolomics strategy to characterize a model of cocoa intake biomarkers in a free-living population. An untargeted HPLC-q-ToF-MS based metabolomics approach was applied to human urine from 32 consumers of cocoa or derived products (CC) and 32 matched control subjects with no consumption of cocoa products (NC). The multivariate statistical analysis (OSC-PLS-DA) showed clear differences between CC and NC groups. The discriminant biomarkers identified were mainly related to the metabolic pathways of theobromine and polyphenols, as well as to cocoa processing. Consumption of cocoa products was also associated with reduced urinary excretions of methylglutarylcarnitine, which could be related to effects of cocoa exposure on insulin resistance. To improve the prediction of cocoa consumption, a combined urinary metabolite model was constructed. ROC curves were performed to evaluate the model and individual metabolites. The AUC values (95% CI) for the model were 95.7% (89.8-100%) and 92.6% (81.9-100%) in training and validation sets, respectively, whereas the AUCs for individual metabolites were <90%. The metabolic signature of cocoa consumption in free-living subjects reveals that combining different metabolites as biomarker models improves prediction of dietary exposure to cocoa. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
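
    The AUC values reported above summarize ROC curves. One standard way to compute an AUC is as the fraction of (consumer, non-consumer) pairs in which the consumer receives the higher model score, with ties counted as one half. The sketch below uses that pairwise definition; the function name roc_auc and the scores are made up for illustration and are not data from the study.

```python
def roc_auc(scores_pos, scores_neg):
    """Area under the ROC curve via pairwise comparison of scores."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))

# Made-up model scores for cocoa consumers (positive) and controls (negative).
auc_demo = roc_auc([0.9, 0.8, 0.4], [0.5, 0.3, 0.2])
```

    An AUC of 0.5 corresponds to a model that ranks the two groups no better than chance.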

  9. Metabolomics and proteomics technologies to explore the herbal preparation affecting metabolic disorders using high resolution mass spectrometry.

    PubMed

    Zhang, Aihua; Zhou, Xiaohang; Zhao, Hongwei; Zou, Shiyu; Ma, Chung Wah; Liu, Qi; Sun, Hui; Liu, Liang; Wang, Xijun

    2017-01-31

    An integrative metabolomics and proteomics approach can provide novel insights in the understanding of biological systems. We have integrated proteome and metabolome data sets for a holistic view of the molecular mechanisms in disease. Using quantitative iTRAQ-LC-MS/MS proteomics coupled with UPLC-Q-TOF-HDMS based metabolomics, we determined the protein and metabolite expression changes in the kidney-yang deficiency syndrome (KYDS) rat model and further investigated the intervention effects of the Jinkui Shenqi Pill (JSP). The VIP-plot of the orthogonal PLS-DA (OPLS-DA) was used for discovering the potential biomarkers to clarify the therapeutic mechanisms of JSP in treating KYDS. The results showed that JSP can alleviate the kidney impairment induced by KYDS. Sixty potential biomarkers, including 5-l-glutamyl-taurine, phenylacetaldehyde, 4,6-dihydroxyquinoline, and xanthurenic acid etc., were definitely up- or down-regulated. The regulatory effect of JSP on the disturbed metabolic pathways was proved by the established metabonomic method. Using pathway analyses, we identified the disturbed metabolic pathways such as taurine and hypotaurine metabolism, pyrimidine metabolism, tyrosine metabolism, tryptophan metabolism, histidine metabolism, steroid hormone biosynthesis, etc. Furthermore, using iTRAQ-based quantitative proteomics analysis, seventeen differential proteins were identified and significantly altered by the JSP treatment. These proteins appear to be involved in Wnt, chemokine, PPAR, and MAPK signaling pathways, etc. Functional pathway analysis revealed that most of the proteins were found to play a key role in the regulation of metabolism pathways. Bioinformatics analysis with the IPA software found that these differentially-expressed molecules had a strong correlation with the α-adrenergic signaling, FGF signaling, etc. Our data indicate that high-throughput metabolomics and proteomics can provide an insight on the herbal preparations affecting the metabolic disorders using high resolution mass spectrometry.

  10. A New Approach to Age-Period-Cohort Analysis Using Partial Least Squares Regression: The Trend in Blood Pressure in the Glasgow Alumni Cohort

    PubMed Central

    Tu, Yu-Kang; Davey Smith, George; Gilthorpe, Mark S.

    2011-01-01

    Due to a problem of identification, how to estimate the distinct effects of age, time period and cohort has been a controversial issue in the analysis of trends in health outcomes in epidemiology. In this study, we propose a novel approach, partial least squares (PLS) analysis, to separate the effects of age, period, and cohort. Our example for illustration is taken from the Glasgow Alumni cohort. A total of 15,322 students (11,755 men and 3,567 women) received medical screening at the Glasgow University between 1948 and 1968. The aim is to investigate the secular trends in blood pressure over 1925 and 1950 while taking into account the year of examination and age at examination. We excluded students born before 1925 or aged over 25 years at examination and those with missing values in confounders from the analyses, resulting in 12,546 and 12,516 students for analysis of systolic and diastolic blood pressure, respectively. PLS analysis shows that both systolic and diastolic blood pressure increased with students' age, and students born later had on average lower blood pressure (SBP: −0.17 mmHg/per year [95% confidence intervals: −0.19 to −0.15] for men and −0.25 [−0.28 to −0.22] for women; DBP: −0.14 [−0.15 to −0.13] for men; −0.09 [−0.11 to −0.07] for women). PLS also shows a decreasing trend in blood pressure over the examination period. As identification is not a problem for PLS, it provides a flexible modelling strategy for age-period-cohort analysis. More emphasis is then required to clarify the substantive and conceptual issues surrounding the definitions and interpretations of age, period and cohort effects. PMID:21556329

  11. Prostate Cancer Patients-Negative Biopsy Controls Discrimination by Untargeted Metabolomics Analysis of Urine by LC-QTOF: Upstream Information on Other Omics

    NASA Astrophysics Data System (ADS)

    Fernández-Peralbo, M. A.; Gómez-Gómez, E.; Calderón-Santiago, M.; Carrasco-Valiente, J.; Ruiz-García, J.; Requena-Tapia, M. J.; Luque de Castro, M. D.; Priego-Capote, F.

    2016-12-01

    The existing clinical biomarkers for prostate cancer (PCa) diagnosis are far from ideal (e.g., the prostate specific antigen (PSA) serum level suffers from lack of specificity, providing frequent false positives leading to over-diagnosis). A key step in the search for minimum invasive tests to complement or replace PSA should be supported on the changes experienced by the biochemical pathways in PCa patients as compared to negative biopsy control individuals. In this research a comprehensive global analysis by LC-QTOF was applied to urine from 62 patients with a clinically significant PCa and 42 healthy individuals, both groups confirmed by biopsy. An unpaired t-test (p-value < 0.05) provided 28 significant metabolites tentatively identified in urine, used to develop a partial least squares discriminant analysis (PLS-DA) model characterized by 88.4 and 92.9% of sensitivity and specificity, respectively. Among the 28 significant metabolites 27 were present at lower concentrations in PCa patients than in control individuals, while only one reported higher concentrations in PCa patients. The connection among the biochemical pathways in which they are involved (DNA methylation, epigenetic marks on histones and RNA cap methylation) could explain the concentration changes with PCa and supports, once again, the role of metabolomics in upstream processes.
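
    Sensitivity and specificity, as reported for the PLS-DA model above, follow directly from the counts of correct and incorrect class assignments. The sketch below shows the arithmetic on hypothetical labels; the function name, the label coding (1 = patient, 0 = control), and the example vectors are assumptions for illustration and do not reproduce the study's data.

```python
def sensitivity_specificity(y_true, y_pred, positive=1):
    """Return (sensitivity, specificity) for binary class labels."""
    tp = fn = tn = fp = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            if pred == positive:
                tp += 1          # patient correctly flagged
            else:
                fn += 1          # patient missed
        else:
            if pred == positive:
                fp += 1          # control wrongly flagged
            else:
                tn += 1          # control correctly cleared
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one sample of each class")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical labels: 4 patients followed by 5 controls.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1]
sens, spec = sensitivity_specificity(y_true, y_pred)
print(sens, spec)   # 3 of 4 patients and 4 of 5 controls are classified correctly
```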

  12. Mechanisms behind the estimation of photosynthesis traits from leaf reflectance observations

    NASA Astrophysics Data System (ADS)

    Dechant, Benjamin; Cuntz, Matthias; Doktor, Daniel; Vohland, Michael

    2016-04-01

    Many studies have investigated the reflectance-based estimation of leaf chlorophyll, water and dry matter contents of plants. Only few studies focused on photosynthesis traits, however. The maximum potential uptake of carbon dioxide under given environmental conditions is determined mainly by RuBisCO activity, limiting carboxylation, or the speed of photosynthetic electron transport. These two main limitations are represented by the maximum carboxylation capacity, Vcmax,25, and the maximum electron transport rate, Jmax,25. These traits were estimated from leaf reflectance before but the mechanisms underlying the estimation remain rather speculative. The aim of this study was therefore to reveal the mechanisms behind reflectance-based estimation of Vcmax,25 and Jmax,25. Leaf reflectance, photosynthetic response curves as well as nitrogen content per area, Narea, and leaf mass per area, LMA, were measured on 37 deciduous tree species. Vcmax,25 and Jmax,25 were determined from the response curves. Partial Least Squares (PLS) regression models for the two photosynthesis traits Vcmax,25 and Jmax,25 as well as Narea and LMA were studied using a cross-validation approach. Analyses of linear regression models based on Narea and other leaf traits estimated via PROSPECT inversion, PLS regression coefficients and model residuals were conducted in order to reveal the mechanisms behind the reflectance-based estimation. We found that Vcmax,25 and Jmax,25 can be estimated from leaf reflectance with good to moderate accuracy for a large number of species and different light conditions. The dominant mechanism behind the estimations was the strong relationship between photosynthesis traits and leaf nitrogen content. This was concluded from very strong relationships between PLS regression coefficients, the model residuals as well as the prediction performance of Narea-based linear regression models compared to PLS regression models. While the PLS regression model for Vcmax,25 was fully based on the correlation to Narea, the PLS regression model for Jmax,25 was not entirely based on it. Analyses of the contributions of different parts of the reflectance spectrum revealed that the information contributing to the Jmax,25 PLS regression model in addition to the main source of information, Narea, was mainly located in the visible part of the spectrum (500-900 nm). Estimated chlorophyll content could be excluded as potential source of this extra information. The PLS regression coefficients of the Jmax,25 model indicated possible contributions from chlorophyll fluorescence and cytochrome f content. In summary, we found that the main mechanism behind the estimation of Vcmax,25 and Jmax,25 from leaf reflectance observations is the correlation to Narea but that there is additional information related to Jmax,25 mainly in the visible part of the spectrum.

  13. Visible and near infrared spectroscopy as an authentication tool: Preliminary investigation of the prediction of the ageing time of beef steaks.

    PubMed

    Moran, Lara; Andres, Sonia; Allen, Paul; Moloney, Aidan P

    2018-08-01

    Visible-near infrared spectroscopy (Vis-NIRS) has been suggested to have potential for authentication of food products. The aim of the present preliminary study was to assess if this technology can be used to authenticate the ageing time (3, 7, 14 and 21 days post mortem) of beef steaks from three different muscles (M. Longissimus thoracis, M. Gluteus medius and M. Semitendinosus). Various mathematical pre-treatments were applied to the spectra to correct scattering and overlapping effects, and then partial least squares-discrimination analysis (PLS-DA) procedures applied. The best models were specific for each muscle, and the ability of prediction of ageing time was validated using full (leave-one-out) cross-validation, whereas authentication performance was evaluated using the parameters of sensitivity, specificity and overall correct classification. The results indicate that overall correct classification ranging from 94.2 to 100% was achieved, depending on the muscle. In conclusion, Vis-NIRS technology seems a valid tool for the authentication of ageing time of beef steaks. Copyright © 2018 Elsevier Ltd. All rights reserved.

  14. Is environmental sustainability a strategic priority for logistics service providers?

    PubMed

    Evangelista, Pietro; Colicchia, Claudia; Creazza, Alessandro

    2017-08-01

    Although an increasing number of third-party logistics service providers (3PLs) regard environmental sustainability as a key area of management, there is still great uncertainty about how 3PLs implement environmental strategies and how they translate green efforts into practice. Through a multiple case study analysis, this paper explores the environmental strategies of a sample of medium-sized 3PLs operating in Italy and the UK, in terms of environmental organizational culture, initiatives, and influencing factors. Our analysis shows that, although environmental sustainability is generally recognised as a strategic priority, a certain degree of diversity in the deployment of environmental strategies still exists. This paper is original since the extant literature on green strategies of 3PLs provides findings predominantly from a single country perspective and mainly investigates large/multinational organizations. It also provides indications to help managers of medium-sized 3PLs in positioning their business. This is particularly meaningful in the 3PL industry, where medium-sized organizations significantly contribute to the generated turnover and market value. Copyright © 2017 Elsevier Ltd. All rights reserved.

  15. The extraction of simple relationships in growth factor-specific multiple-input and multiple-output systems in cell-fate decisions by backward elimination PLS regression.

    PubMed

    Akimoto, Yuki; Yugi, Katsuyuki; Uda, Shinsuke; Kudo, Takamasa; Komori, Yasunori; Kubota, Hiroyuki; Kuroda, Shinya

    2013-01-01

    Cells use common signaling molecules for the selective control of downstream gene expression and cell-fate decisions. The relationship between signaling molecules and downstream gene expression and cellular phenotypes is a multiple-input and multiple-output (MIMO) system and is difficult to understand due to its complexity. For example, it has been reported that, in PC12 cells, different types of growth factors activate MAP kinases (MAPKs) including ERK, JNK, and p38, and CREB, for selective protein expression of immediate early genes (IEGs) such as c-FOS, c-JUN, EGR1, JUNB, and FOSB, leading to cell differentiation, proliferation and cell death; however, how multiple-inputs such as MAPKs and CREB regulate multiple-outputs such as expression of the IEGs and cellular phenotypes remains unclear. To address this issue, we employed a statistical method called partial least squares (PLS) regression, which involves a reduction of the dimensionality of the inputs and outputs into latent variables and a linear regression between these latent variables. We measured 1,200 data points for MAPKs and CREB as the inputs and 1,900 data points for IEGs and cellular phenotypes as the outputs, and we constructed the PLS model from these data. The PLS model highlighted the complexity of the MIMO system and growth factor-specific input-output relationships of cell-fate decisions in PC12 cells. Furthermore, to reduce the complexity, we applied a backward elimination method to the PLS regression, in which 60 input variables were reduced to 5 variables, including the phosphorylation of ERK at 10 min, CREB at 5 min and 60 min, AKT at 5 min and JNK at 30 min. The simple PLS model with only 5 input variables demonstrated a predictive ability comparable to that of the full PLS model. The 5 input variables effectively extracted the growth factor-specific simple relationships within the MIMO system in cell-fate decisions in PC12 cells.
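
    The abstract describes PLS regression as a projection of inputs and outputs onto latent variables followed by a linear regression between them. A minimal sketch of that idea for a single response variable (the classical PLS1 algorithm) is given below. It is a simplification: the study used multiple outputs and a backward elimination step that are not reproduced here, and the function name and synthetic data are assumptions for illustration.

```python
import numpy as np


def pls1_fit(X, y, n_components):
    """Fit single-response PLS; return (coef, intercept) in original units."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xk, yk = X - x_mean, y - y_mean          # centred working copies
    W, P, q = [], [], []
    for _ in range(n_components):
        w = Xk.T @ yk                        # direction of maximal covariance with y
        norm = np.linalg.norm(w)
        if norm < 1e-12:                     # nothing left to explain
            break
        w = w / norm
        t = Xk @ w                           # latent variable (scores)
        tt = t @ t
        p = Xk.T @ t / tt                    # X loadings
        c = yk @ t / tt                      # regression of y on the scores
        Xk = Xk - np.outer(t, p)             # deflate X
        yk = yk - c * t                      # deflate y
        W.append(w)
        P.append(p)
        q.append(c)
    W, P, q = np.array(W).T, np.array(P).T, np.array(q)
    coef = W @ np.linalg.solve(P.T @ W, q)   # map latent regression back to inputs
    intercept = y_mean - x_mean @ coef
    return coef, intercept


# Synthetic example: with as many components as inputs, PLS matches least squares.
rng = np.random.default_rng(0)
X = rng.normal(size=(30, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + 4.0
coef, intercept = pls1_fit(X, y, n_components=3)
print(np.round(coef, 6), round(float(intercept), 6))
```

    Using fewer components than inputs is the usual case in practice; the number of components would normally be chosen by cross-validation, which this sketch omits.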

  16. Coexistence of antiphospholipid antibodies and cephalalgia.

    PubMed

    Islam, Md Asiful; Alam, Fahmida; Gan, Siew Hua; Cavestro, Cinzia; Wong, Kah Keng

    2018-03-01

    Background The occurrence of antiphospholipid antibodies (aPLs) and headache comorbidity in the presence or absence of underlying autoimmune diseases remains unclear. Aim The aim of this review was to summarize the relationship between headache and aPLs based on evidences from cohort studies and case reports, in addition to examining the treatment strategies that resolved headache in aPLs-positive individuals. Methods A comprehensive literature search was conducted through PubMed, ISI Web of Science and Google Scholar. A total of 559 articles were screened and the appropriate articles were selected based on quality and level of evidence. Results Cohort studies (n = 27) from Europe, North America and Asia demonstrated comorbidity of aPLs and headache in antiphospholipid syndrome, systemic lupus erythematosus (SLE) and neuropsychiatric SLE patients. Significantly higher association between migraine and aPLs was observed (n = 170/779; p < 0.0001) in individuals without any underlying diseases. Our analysis of shortlisted case reports (n = 17) showed that a higher frequency of anticardiolipin antibodies were present in subjects with different autoimmune disorders (70.6%). Corticosteroids were highly effective in resolving headache in aPLs-positive individuals. Conclusion Higher frequency of comorbidity between aPLs and headache was observed in healthy individuals and patient cases. Therefore, experimental studies are warranted to evaluate the aPLs-induced pathogenic mechanism of headache.

  17. Prognosis Biomarkers of Severe Sepsis and Septic Shock by 1H NMR Urine Metabolomics in the Intensive Care Unit

    PubMed Central

    Modesto-Alapont, Vicente; Gonzalez-Marrachelli, Vannina; Vento-Rehues, Rosa; Jorda-Miñana, Angela; Blanquer-Olivas, Jose; Monleon, Daniel

    2015-01-01

    Early diagnosis and patient stratification may improve sepsis outcome by a timely start of the proper specific treatment. We aimed to identify metabolomic biomarkers of sepsis in urine by 1H-NMR spectroscopy to assess the severity and to predict outcomes. Urine samples were collected from 64 patients with severe sepsis or septic shock in the ICU for a 1H NMR spectra acquisition. A supervised analysis was performed on the processed spectra, and a predictive model for prognosis (30-days mortality/survival) of sepsis was constructed using partial least-squares discriminant analysis (PLS-DA). In addition, we compared the prediction power of metabolomics data respect the Sequential Organ Failure Assessment (SOFA) score. Supervised multivariate analysis afforded a good predictive model to distinguish the patient groups and detect specific metabolic patterns. Negative prognosis patients presented higher values of ethanol, glucose and hippurate, and on the contrary, lower levels of methionine, glutamine, arginine and phenylalanine. These metabolites could be part of a composite biopattern of the human metabolic response to sepsis shock and its mortality in ICU patients. The internal cross-validation showed robustness of the metabolic predictive model obtained and a better predictive ability in comparison with SOFA values. Our results indicate that NMR metabolic profiling might be helpful for determining the metabolomic phenotype of worst-prognosis septic patients in an early stage. A predictive model for the evolution of septic patients using these metabolites was able to classify cases with more sensitivity and specificity than the well-established organ dysfunction score SOFA. PMID:26565633

  18. Human Milk Plasmalogens Are Highly Enriched in Long-Chain PUFAs.

    PubMed

    Moukarzel, Sara; Dyer, Roger A; Keller, Bernd O; Elango, Rajavel; Innis, Sheila M

    2016-11-01

    Human milk contains unique glycerophospholipids, including ethanolamine-containing plasmalogens (Pls-PEs) in the milk fat globule membrane, which have been implicated in infant brain development. Brain Pls-PEs accumulate postnatally and are enriched in long-chain polyunsaturated fatty acids (LC-PUFAs), particularly docosahexaenoic acid (DHA). Fatty acid (FA) composition of Pls-PEs in milk is poorly understood because of the analytical challenges in separating Pls-PEs from other phospholipids in the predominating presence of triacylglycerols. The variability of Pls-PE FAs and the potential role of maternal diet remain unknown. Our primary objectives were to establish improved methodology for extracting Pls-PEs from human milk, enabling FA analysis, and to compare FA composition between Pls-PEs and 2 major milk phospholipids, phosphatidylcholine and phosphatidylethanolamine. Our secondary objective was to explore associations between maternal DHA intake and DHA in milk phospholipids and variability in phospholipid-DHA within a woman. Mature milk was collected from 25 women, with 4 providing 3 milk samples on 3 separate days. Lipids were extracted, and phospholipids were removed by solid phase extraction. Pls-PEs were separated by using normal-phase HPLC, recovered and analyzed for FAs by GLC. Diet was assessed by using a validated food-frequency questionnaire. Pls-PEs in human milk were significantly higher in LC-PUFAs, including arachidonic acid (AA) and DHA, than phosphatidylethanolamine and phosphatidylcholine. The mean ± SD concentration of AA in Pls-PEs was ∼2.5-fold higher than in phosphatidylethanolamine (10.5 ± 1.71 and 3.82 ± 0.92 g/100 g, respectively). DHA in Pls-PEs varied across women (0.95-6.51 g/100 g), likely independent of maternal DHA intake. Pls-PE DHA also varied within a woman across days (CV ranged from 9.8% to 28%). Human milk provides the infant with LC-PUFAs from multiple lipid pools, including a source from Pls-PEs. The biological determinants of Pls-PE FAs and physiological relevance to the breastfed infant remain to be elucidated. © 2016 American Society for Nutrition.

  19. Development of an in-line Raman spectroscopic method for continuous API quantification during twin-screw wet granulation.

    PubMed

    Harting, Julia; Kleinebudde, Peter

    2018-04-01

    Raman spectroscopy was evaluated as a process analytical technology (PAT) tool for continuous API quantification during twin-screw wet granulation. Therefore, a Raman probe was implemented in front of the granulator barrel. This setup enabled the collection of Raman spectra upon a constant granule flow. To develop an in-line PLS calibration model, eight binary mixtures of the API and lactose monohydrate with API contents between 5 and 50% were pre-blended and granulated in a twin-screw granulator with a screw speed of 150 rpm and a powder feed rate of 40 g/min. Water was used as a granulation liquid with different liquid to solid ratios depending on the API content. Ibuprofen and diclofenac sodium were chosen as model drugs and separated PLS models were built for each API. The predictive performance of the developed PLS models was determined by granulating and monitoring new test samples containing different API concentrations. This evaluation showed that the models were able to predict the API concentration with an RMSEP of 0.59% for ibuprofen and 1.5% for diclofenac sodium. In a second part, the developed in-line Raman spectroscopic method was used to determine the API concentration during a split feeding process. Therefore, the API and lactose monohydrate were added by two independently adjustable feeders into the twin-screw granulator barrel. The in-line spectroscopy analysis which was verified by UV-analysis indicated that the mixing ability of the twin-screw granulator was good for the used settings and all adjusted API concentrations. Copyright © 2018 Elsevier B.V. All rights reserved.

  20. Relationship between Composition and Toxicity of Motor Vehicle Emission Samples

    PubMed Central

    McDonald, Jacob D.; Eide, Ingvar; Seagrave, JeanClare; Zielinska, Barbara; Whitney, Kevin; Lawson, Douglas R.; Mauderly, Joe L.

    2004-01-01

    In this study we investigated the statistical relationship between particle and semivolatile organic chemical constituents in gasoline and diesel vehicle exhaust samples, and toxicity as measured by inflammation and tissue damage in rat lungs and mutagenicity in bacteria. Exhaust samples were collected from “normal” and “high-emitting” gasoline and diesel light-duty vehicles. We employed a combination of principal component analysis (PCA) and partial least-squares regression (PLS; also known as projection to latent structures) to evaluate the relationships between chemical composition of vehicle exhaust and toxicity. The PLS analysis revealed the chemical constituents covarying most strongly with toxicity and produced models predicting the relative toxicity of the samples with good accuracy. The specific nitro-polycyclic aromatic hydrocarbons important for mutagenicity were the same chemicals that have been implicated by decades of bioassay-directed fractionation. These chemicals were not related to lung toxicity, which was associated with organic carbon and select organic compounds that are present in lubricating oil. The results demonstrate the utility of the PCA/PLS approach for evaluating composition–response relationships in complex mixture exposures and also provide a starting point for confirming causality and determining the mechanisms of the lung effects. PMID:15531438

  1. [Establishment of the Mathematical Model for PMI Estimation Using FTIR Spectroscopy and Data Mining Method].

    PubMed

    Wang, L; Qin, X C; Lin, H C; Deng, K F; Luo, Y W; Sun, Q R; Du, Q X; Wang, Z Y; Tuo, Y; Sun, J H

    2018-02-01

    To analyse the relationship between Fourier transform infrared (FTIR) spectrum of rat's spleen tissue and postmortem interval (PMI) for PMI estimation using FTIR spectroscopy combined with data mining method. Rats were sacrificed by cervical dislocation, and the cadavers were placed at 20 ℃. The FTIR spectrum data of rats' spleen tissues were taken and measured at different time points. After pretreatment, the data was analysed by data mining method. The absorption peak intensity of rat's spleen tissue spectrum changed with the PMI, while the absorption peak position was unchanged. The results of principal component analysis (PCA) showed that the cumulative contribution rate of the first three principal components was 96%. There was an obvious clustering tendency for the spectrum sample at each time point. The methods of partial least squares discriminant analysis (PLS-DA) and support vector machine classification (SVMC) effectively divided the spectrum samples with different PMI into four categories (0-24 h, 48-72 h, 96-120 h and 144-168 h). The determination coefficient (R²) of the PMI estimation model established by PLS regression analysis was 0.96, and the root mean square error of calibration (RMSEC) and root mean square error of cross validation (RMSECV) were 9.90 h and 11.39 h respectively. In prediction set, the R² was 0.97, and the root mean square error of prediction (RMSEP) was 10.49 h. The FTIR spectrum of the rat's spleen tissue can be effectively analyzed qualitatively and quantitatively by the combination of FTIR spectroscopy and data mining method, and the classification and PLS regression models can be established for PMI estimation. Copyright© by the Editorial Department of Journal of Forensic Medicine.
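
    RMSEC, RMSECV and RMSEP share one formula and differ only in which samples are scored (calibration, cross-validation, or independent prediction set). A minimal sketch of the root mean square error and of one common definition of the determination coefficient follows. The abstract does not state which R² definition was used, so the formula below is an assumption, and the example values are hypothetical.

```python
import math


def rmse(y_true, y_pred):
    """Root mean square error between reference and predicted values."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_residual / SS_total."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical postmortem intervals in hours (reference vs. model prediction).
reference = [0, 24, 48, 72]
predicted = [6, 18, 54, 66]
print(rmse(reference, predicted))       # every prediction is off by 6 h
print(r_squared(reference, predicted))
```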

  2. Determination of main fruits in adulterated nectars by ATR-FTIR spectroscopy combined with multivariate calibration and variable selection methods.

    PubMed

    Miaw, Carolina Sheng Whei; Assis, Camila; Silva, Alessandro Rangel Carolino Sales; Cunha, Maria Luísa; Sena, Marcelo Martins; de Souza, Scheilla Vitorino Carvalho

    2018-07-15

    Grape, orange, peach and passion fruit nectars were formulated and adulterated by dilution with syrup, apple and cashew juices at 10 levels for each adulterant. Attenuated total reflectance Fourier transform mid infrared (ATR-FTIR) spectra were obtained. Partial least squares (PLS) multivariate calibration models allied to different variable selection methods, such as interval partial least squares (iPLS), ordered predictors selection (OPS) and genetic algorithm (GA), were used to quantify the main fruits. PLS improved by iPLS-OPS variable selection showed the highest predictive capacity to quantify the main fruit contents. The selected variables in the final models varied from 72 to 100; the root mean square errors of prediction were estimated from 0.5 to 2.6%; the correlation coefficients of prediction ranged from 0.948 to 0.990; and, the mean relative errors of prediction varied from 3.0 to 6.7%. All of the developed models were validated. Copyright © 2018 Elsevier Ltd. All rights reserved.

  3. Free variable selection QSPR study to predict 19F chemical shifts of some fluorinated organic compounds using Random Forest and RBF-PLS methods

    NASA Astrophysics Data System (ADS)

    Goudarzi, Nasser

    2016-04-01

    In this work, two new and powerful chemometrics methods are applied for the modeling and prediction of the 19F chemical shift values of some fluorinated organic compounds. The radial basis function-partial least squares (RBF-PLS) and random forest (RF) methods are employed to construct the models to predict the 19F chemical shifts. In this study, we did not use any separate variable selection method, since the RF method can serve as both a variable selection and a modeling technique. Effects of the important parameters affecting the RF prediction power, such as the number of trees (nt) and the number of randomly selected variables to split each node (m), were investigated. The root-mean-square errors of prediction (RMSEP) for the training set and the prediction set for the RBF-PLS and RF models were 44.70, 23.86, 29.77, and 23.69, respectively. Also, the correlation coefficients of the prediction set for the RBF-PLS and RF models were 0.8684 and 0.9313, respectively. The results obtained reveal that the RF model can be used as a powerful chemometrics tool for the quantitative structure-property relationship (QSPR) studies.

  4. Using color histograms and SPA-LDA to classify bacteria.

    PubMed

    de Almeida, Valber Elias; da Costa, Gean Bezerra; de Sousa Fernandes, David Douglas; Gonçalves Dias Diniz, Paulo Henrique; Brandão, Deysiane; de Medeiros, Ana Claudia Dantas; Véras, Germano

    2014-09-01

    In this work, a new approach is proposed to verify the differentiating characteristics of five bacteria (Escherichia coli, Enterococcus faecalis, Streptococcus salivarius, Streptococcus oralis, and Staphylococcus aureus) by using digital images obtained with a simple webcam and variable selection by the Successive Projections Algorithm associated with Linear Discriminant Analysis (SPA-LDA). In this sense, color histograms in the red-green-blue (RGB), hue-saturation-value (HSV), and grayscale channels and their combinations were used as input data, and statistically evaluated by using different multivariate classifiers (Soft Independent Modeling by Class Analogy (SIMCA), Principal Component Analysis-Linear Discriminant Analysis (PCA-LDA), Partial Least Squares Discriminant Analysis (PLS-DA) and Successive Projections Algorithm-Linear Discriminant Analysis (SPA-LDA)). The bacteria strains were cultivated in a nutritive blood agar base layer for 24 h by following the Brazilian Pharmacopoeia, maintaining the status of cell growth and the nature of nutrient solutions under the same conditions. The best result in classification was obtained by using RGB and SPA-LDA, which reached 94 and 100 % of classification accuracy in the training and test sets, respectively. This result is extremely positive from the viewpoint of routine clinical analyses, because it avoids bacterial identification based on phenotypic identification of the causative organism using Gram staining, culture, and biochemical proofs. Therefore, the proposed method presents inherent advantages, promoting a simpler, faster, and low-cost alternative for bacterial identification.

  5. Simultaneous determination of estrogens (ethinylestradiol and norgestimate) concentrations in human and bovine serum albumin by use of fluorescence spectroscopy and multivariate regression analysis.

    PubMed

    Hordge, LaQuana N; McDaniel, Kiara L; Jones, Derick D; Fakayode, Sayo O

    2016-05-15

    The endocrine disruption property of estrogens creates an immediate need for effective monitoring and for the development of analytical protocols for their analysis in biological and human specimens. This study explores the first combined use of steady-state fluorescence spectroscopy and multivariate partial-least-squares (PLS) regression analysis for the simultaneous determination of the concentrations of two estrogens (17α-ethinylestradiol (EE) and norgestimate (NOR)) in bovine serum albumin (BSA) and human serum albumin (HSA) samples. The influence of EE and NOR concentrations and temperature on the emission spectra of EE-HSA, EE-BSA, NOR-HSA, and NOR-BSA complexes was also investigated. The binding of EE with HSA and BSA resulted in an increase in the emission characteristics of HSA and BSA and a significant blue spectral shift. In contrast, the interaction of NOR with HSA and BSA quenched the emission characteristics of HSA and BSA. The observed emission spectral shifts preclude the effective use of traditional univariate regression analysis of fluorescence data for the determination of EE and NOR concentrations in HSA and BSA samples. Multivariate partial-least-squares (PLS) regression analysis was utilized to correlate the changes in emission spectra with EE and NOR concentrations in HSA and BSA samples. The figures-of-merit of the developed PLS regression models were excellent, with limits of detection as low as 1.6×10(-8) M for EE and 2.4×10(-7) M for NOR and good linearity (R(2)>0.994985). The PLS models correctly predicted EE and NOR concentrations in independent validation HSA and BSA samples with a root-mean-square-percent-relative-error (RMS%RE) of less than 6.0% at physiological conditions. In contrast, the use of univariate regression resulted in poor predictions of EE and NOR in HSA and BSA samples, with RMS%RE larger than 40% at physiological conditions. High accuracy, low sensitivity, simplicity, and low cost, with no prior analyte extraction or separation required, make this method a promising, compelling, and attractive alternative for the rapid determination of estrogen concentrations in biomedical and biological specimens, pharmaceuticals, or environmental samples. Published by Elsevier B.V.
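
    The abstract above reports RMS%RE without defining it. The sketch below assumes a definition that is common in the chemometrics literature: the percent relative error of each validation sample is 100 × (predicted − actual) / actual, and RMS%RE is the root mean square of those values. The function name and the sample numbers are illustrative only and are not taken from the study.

```python
from math import sqrt


def rms_percent_relative_error(actual, predicted):
    """Root-mean-square percent relative error (assumed definition).

    %RE_i = 100 * (predicted_i - actual_i) / actual_i
    RMS%RE = sqrt(sum(%RE_i ** 2) / n)
    """
    if len(actual) != len(predicted) or not actual:
        raise ValueError("actual and predicted must be non-empty and equal in length")
    percent_errors = [100.0 * (p - a) / a for a, p in zip(actual, predicted)]
    return sqrt(sum(e * e for e in percent_errors) / len(percent_errors))


# Hypothetical validation concentrations (arbitrary units), not data from the paper.
actual = [1.0, 2.0]
predicted = [1.1, 1.8]
score = rms_percent_relative_error(actual, predicted)
```

    Because every error is divided by the true value, this figure penalizes a given absolute error more heavily at low concentrations than an absolute RMSE would.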

  6. Mid-infrared and near-infrared spectroscopy for rapid detection of Gardeniae Fructus by a liquid-liquid extraction process.

    PubMed

    Tao, Lingyan; Lin, Zhonglin; Chen, Jiashan; Wu, Yongjiang; Liu, Xuesong

    2017-10-25

    Gardeniae Fructus is widely used in the pharmaceutical industry, and many studies have confirmed its medical and economic value. In this study, samples collected from different liquid-liquid extraction batches of Gardeniae Fructus were detected by mid-infrared (MIR) and near-infrared (NIR) spectroscopy. Seven analytes, neochlorogenic acid (5-CQA), cryptochlorogenic acid (4-CQA), chlorogenic acid (3-CQA), geniposidic acid (GEA), deacetyl-asperulosidic acid methyl ester (DAAME), genipin-gentiobioside (GGB), and gardenoside (GA), were chosen as quality property indexes of Gardeniae Fructus. The two kinds of spectra were each used to build models by single partial least squares (PLS). Additionally, both spectral data were combined and modeled by multiblock PLS. For single spectroscopy modeling results, NIR had a better prediction for high-concentration analytes (3-CQA, DAAME, GGB, and GA) whereas MIR performed better for low-concentration analytes (5-CQA, 4-CQA, and GEA). The multiblock methodology was found to be better compared to single spectroscopy models for all seven analytes. Specifically, the coefficients of determination (R2) of the NIR, MIR, and multiblock PLS calibration models of all seven components were higher than 0.95. Relative standard errors of prediction (RSEP) were all less than 7%, except for models of GGB, which were 10.36%, 13.24%, and 8.15% for the NIR-PLS, MIR-PLS, and multiblock models, respectively. These results indicate that MIR and NIR spectrographic techniques could provide a new choice for quality control in industrial production of Gardeniae Fructus. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. Use of partial least squares regression for the multivariate calibration of hazardous air pollutants in open-path FT-IR spectrometry

    NASA Astrophysics Data System (ADS)

    Hart, Brian K.; Griffiths, Peter R.

    1998-06-01

    Partial least squares (PLS) regression has been evaluated as a robust calibration technique for over 100 hazardous air pollutants (HAPs) measured by open path Fourier transform infrared (OP/FT-IR) spectrometry. PLS has the advantage over the current recommended calibration method of classical least squares (CLS), in that it can look at the whole useable spectrum (700-1300 cm-1, 2000-2150 cm-1, and 2400-3000 cm-1), and detect several analytes simultaneously. Up to one hundred HAPs synthetically added to OP/FT-IR backgrounds have been simultaneously calibrated and detected using PLS. PLS also has the advantage in requiring less preprocessing of spectra than that which is required in CLS calibration schemes, allowing PLS to provide user independent real-time analysis of OP/FT-IR spectra.

  8. Intra-regional classification of grape seeds produced in Mendoza province (Argentina) by multi-elemental analysis and chemometrics tools.

    PubMed

    Canizo, Brenda V; Escudero, Leticia B; Pérez, María B; Pellerano, Roberto G; Wuilloud, Rodolfo G

    2018-03-01

    The feasibility of the application of chemometric techniques associated with multi-element analysis for the classification of grape seeds according to their provenance vineyard soil was investigated. Grape seed samples from different localities of Mendoza province (Argentina) were evaluated. Inductively coupled plasma mass spectrometry (ICP-MS) was used for the determination of twenty-nine elements (Ag, As, Ce, Co, Cs, Cu, Eu, Fe, Ga, Gd, La, Lu, Mn, Mo, Nb, Nd, Ni, Pr, Rb, Sm, Te, Ti, Tl, Tm, U, V, Y, Zn and Zr). Once the analytical data were collected, supervised pattern recognition techniques such as linear discriminant analysis (LDA), partial least square discriminant analysis (PLS-DA), k-nearest neighbors (k-NN), support vector machine (SVM) and Random Forest (RF) were applied to construct classification/discrimination rules. The results indicated that nonlinear methods, RF and SVM, perform best with up to 98% and 93% accuracy rate, respectively, and therefore are excellent tools for classification of grapes. Copyright © 2017 Elsevier Ltd. All rights reserved.

  9. Development and validation of an APCI-MS/GC–MS approach for the classification and prediction of Cheddar cheese maturity

    PubMed Central

    Gan, Heng Hui; Yan, Bingnan; Linforth, Robert S.T.; Fisk, Ian D.

    2016-01-01

    Headspace techniques have been extensively employed in food analysis to measure volatile compounds, which play a central role in the perceived quality of food. In this study atmospheric pressure chemical ionisation-mass spectrometry (APCI-MS), coupled with gas chromatography–mass spectrometry (GC–MS), was used to investigate the complex mix of volatile compounds present in Cheddar cheeses of different maturity, processing and recipes to enable characterisation of the cheeses based on their ripening stages. Partial least squares-linear discriminant analysis (PLS-DA) provided a 70% success rate in correct prediction of the age of the cheeses based on their key headspace volatile profiles. In addition to predicting maturity, the analytical results coupled with chemometrics offered a rapid and detailed profiling of the volatile component of Cheddar cheeses, which could offer a new tool for quality assessment and accelerate product development. PMID:26212994
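
    A success rate such as the 70% reported above is usually obtained in PLS-DA by regressing a dummy-coded class matrix on the measurements and assigning each sample to the class with the largest predicted value. The sketch below shows only that encoding and decision step, under the assumption that a fitted PLS model has already produced a predicted response matrix. The class names and predicted values are invented for illustration, and the study may have used a different decision rule.

```python
def one_hot(labels):
    """Dummy-code class labels: one column per class, 1.0 for membership."""
    classes = sorted(set(labels))
    encoded = [[1.0 if label == c else 0.0 for c in classes] for label in labels]
    return classes, encoded


def assign_classes(classes, predicted_rows):
    """Assign each sample to the class whose predicted response is largest."""
    assigned = []
    for row in predicted_rows:
        best_column = max(range(len(row)), key=row.__getitem__)
        assigned.append(classes[best_column])
    return assigned


def success_rate(true_labels, assigned_labels):
    """Fraction of samples whose assigned class matches the true class."""
    hits = sum(t == a for t, a in zip(true_labels, assigned_labels))
    return hits / len(true_labels)


# Hypothetical maturity labels and hypothetical PLS outputs (not study data).
true_labels = ["mild", "mature", "mild"]
classes, Y = one_hot(true_labels)
Y_pred = [[0.2, 0.7], [0.9, 0.3], [0.6, 0.5]]  # stand-in for model predictions
assigned = assign_classes(classes, Y_pred)
rate = success_rate(true_labels, assigned)
```

    The argmax rule is the simplest choice. Fixed thresholds or a discriminant step applied to the PLS scores are also used in practice.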

  10. Microorganisms detection on substrates using QCL spectroscopy

    NASA Astrophysics Data System (ADS)

    Padilla-Jiménez, Amira C.; Ortiz-Rivera, William; Castro-Suarez, John R.; Ríos-Velázquez, Carlos; Vázquez-Ayala, Iris; Hernández-Rivera, Samuel P.

    2013-05-01

    Recent investigations have focused on the improvement of rapid and accurate methods to develop spectroscopic markers of compounds constituting microorganisms that are considered biological threats. Quantum cascade laser (QCL) systems have revolutionized many areas of research and development in defense and security applications, including this area of research. Infrared spectroscopy detection based on QCL was employed to acquire mid-infrared (MIR) spectral signatures of Bacillus thuringiensis (Bt), Escherichia coli (Ec) and Staphylococcus epidermidis (Se), which were used as biological agent simulants of biothreats. The experiments were carried out in reflection mode on various substrates such as cardboard, glass, travel baggage, wood and stainless steel. Chemometrics statistical routines such as principal component analysis (PCA) regression and partial least squares-discriminant analysis (PLS-DA) were applied to the recorded MIR spectra. The results show that the infrared vibrational techniques investigated are useful for classification/detection of the target microorganisms on the types of substrates studied.

  11. Metabolomics combined with chemometric tools (PCA, HCA, PLS-DA and SVM) for screening cassava (Manihot esculenta Crantz) roots during postharvest physiological deterioration.

    PubMed

    Uarrota, Virgílio Gavicho; Moresco, Rodolfo; Coelho, Bianca; Nunes, Eduardo da Costa; Peruch, Luiz Augusto Martins; Neubert, Enilto de Oliveira; Rocha, Miguel; Maraschin, Marcelo

    2014-10-15

    Cassava roots are an important source of dietary and industrial carbohydrates and suffer markedly from postharvest physiological deterioration (PPD). This paper deals with metabolomics combined with chemometric tools for screening the chemical and enzymatic composition in several genotypes of cassava roots during PPD. Metabolome analyses showed increases in carotenoids, flavonoids, anthocyanins, phenolics, reactive scavenging species, and enzymes (superoxide dismutase family, hydrogen peroxide, and catalase) until 3-5 days postharvest. PPD correlated negatively with phenolics and carotenoids and positively with anthocyanins and flavonoids. Chemometric tools such as principal component analysis, partial least squares discriminant analysis, and support vector machines discriminated cassava samples well and enabled a good prediction of samples. Hierarchical clustering analyses grouped samples according to their levels of PPD and chemical compositions. Copyright © 2014 Elsevier Ltd. All rights reserved.

  12. Application of multispectral imaging to determine quality attributes and ripeness stage in strawberry fruit.

    PubMed

    Liu, Changhong; Liu, Wei; Lu, Xuzhong; Ma, Fei; Chen, Wei; Yang, Jianbo; Zheng, Lei

    2014-01-01

    Multispectral imaging with 19 wavelengths in the range of 405-970 nm has been evaluated for nondestructive determination of firmness, total soluble solids (TSS) content and ripeness stage in strawberry fruit. Several analysis approaches, including partial least squares (PLS), support vector machine (SVM) and back propagation neural network (BPNN), were applied to develop theoretical models for predicting the firmness and TSS of intact strawberry fruit. Compared with PLS and SVM, BPNN considerably improved the performance of multispectral imaging for predicting firmness and total soluble solids content with the correlation coefficient (r) of 0.94 and 0.83, SEP of 0.375 and 0.573, and bias of 0.035 and 0.056, respectively. Subsequently, the ability of multispectral imaging technology to classify fruit based on ripeness stage was tested using SVM and principal component analysis-back propagation neural network (PCA-BPNN) models. The higher classification accuracy of 100% was achieved using SVM model. Moreover, the results of all these models demonstrated that the VIS parts of the spectra were the main contributor to the determination of firmness, TSS content estimation and classification of ripeness stage in strawberry fruit. These results suggest that multispectral imaging, together with suitable analysis model, is a promising technology for rapid estimation of quality attributes and classification of ripeness stage in strawberry fruit.

  13. Improved Quantitative Analysis of Ion Mobility Spectrometry by Chemometric Multivariate Calibration

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fraga, Carlos G.; Kerr, Dayle; Atkinson, David A.

    2009-09-01

    Traditional peak-area calibration and the multivariate calibration methods of principal component regression (PCR) and partial least squares (PLS), including unfolded PLS (U-PLS) and multi-way PLS (N-PLS), were evaluated for the quantification of 2,4,6-trinitrotoluene (TNT) and cyclo-1,3,5-trimethylene-2,4,6-trinitramine (RDX) in Composition B samples analyzed by temperature step desorption ion mobility spectrometry (TSD-IMS). The true TNT and RDX concentrations of eight Composition B samples were determined by high performance liquid chromatography with UV absorbance detection. Most of the Composition B samples were found to have distinct TNT and RDX concentrations. Applying PCR and PLS on the exact same IMS spectra used for the peak-area study improved quantitative accuracy and precision approximately 3 to 5 fold and 2 to 4 fold, respectively. This in turn improved the probability of correctly identifying Composition B samples based upon the estimated RDX and TNT concentrations from 11% with peak area to 44% and 89% with PLS. This improvement increases the potential of obtaining forensic information from IMS analyzers by providing some ability to differentiate or match Composition B samples based on their TNT and RDX concentrations.
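
    Unfolded PLS (U-PLS), mentioned above, handles multi-way data by flattening each sample's two-dimensional measurement into one long row so that ordinary two-way PLS can be applied. The sketch below shows only that unfolding step. The layout of rows and columns (for example desorption steps by drift-time channels) is an assumption made for illustration, and the numbers are invented.

```python
def unfold_sample(measurement):
    """Flatten one 2-D measurement (list of rows) into a single row vector."""
    return [value for row in measurement for value in row]


def unfold_dataset(samples):
    """Turn a samples x rows x columns array into a samples x (rows*columns) matrix."""
    n_rows = len(samples[0])
    n_cols = len(samples[0][0])
    for measurement in samples:
        if len(measurement) != n_rows or any(len(r) != n_cols for r in measurement):
            raise ValueError("all samples must share the same 2-D shape")
    return [unfold_sample(measurement) for measurement in samples]


# Two hypothetical samples, each with 2 rows and 3 columns of intensities.
samples = [
    [[1, 2, 3], [4, 5, 6]],
    [[7, 8, 9], [10, 11, 12]],
]
X = unfold_dataset(samples)  # 2 samples x 6 variables, ready for two-way PLS
```

    Unfolding discards the information that neighbouring variables belong to the same row or column. Multi-way PLS (N-PLS) is designed to keep that structure.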

  14. Modelling mercury accumulation in minerogenic peat combining FTIR-ATR spectroscopy and partial least squares (PLS)

    NASA Astrophysics Data System (ADS)

    Pérez-Rodríguez, Marta; Horák-Terra, Ingrid; Rodríguez-Lado, Luis; Martínez Cortizas, Antonio

    2016-11-01

    Despite its potential, infrared spectroscopy combined with multivariate statistics has been seldom used to model peat properties with environmental value, such as the concentration of potentially toxic metals. In this research, we applied attenuated total reflectance (ATR) Fourier-Transform Infrared (FTIR) spectroscopy to evaluate the ability of the technique to predict mercury concentrations in late-Pleistocene/Holocene peat from a minerogenic peatland from Minas Gerais (Brazil). Mercury concentrations were analysed using a Milestone DMA-80 analyzer and FTIR-ATR was performed using a Gladi-ATR (Pike Technologies) in the mid IR spectrum (4000-400 cm-1). Concentrations were modelled using principal components (PCR) and partial least squares regression (PLS). The performance of the models varied between moderate and very good (R2 0.67-0.90), with low RMSD values (0.35-1.06). A PLS model based on three latent vectors (LV1 to LV3) provided the best (R2 0.90, RMSD 0.35) results. LV1 reflected total organic matter content versus mineral matter (mainly quartz from local fluxes), LV2 was related to dust deposition from regional sources, and LV3 reflected peat organic matter decomposition. Compared to a previous investigation based on geochemical data, the spectroscopy-based PLS model performed better, but it has to be complemented with additional data (such as δ13C ratios) to reliably reproduce the changes of the factors controlling mercury accumulation over time. This time- and cost-effective methodology may help to develop multi-core approaches to study the within- and between-mire (of a similar type and area) variability in mercury accumulation, and probably also other peat properties.

  15. UV–Vis and ATR–FTIR spectroscopic investigations of postmortem interval based on the changes in rabbit plasma

    PubMed Central

    Wang, Qi; He, Haijun; Li, Bing; Lin, Hancheng; Zhang, Yinming; Zhang, Ji

    2017-01-01

    Estimating the postmortem interval (PMI) is of great importance in forensic investigations. Although many methods are used to estimate the PMI, few investigations focus on postmortem redistribution. In this study, ultraviolet–visible (UV–Vis) measurement combined with visual inspection indicated a regular diffusion of hemoglobin into plasma after death, showing the redistribution of postmortem components in blood. Thereafter, attenuated total reflection–Fourier transform infrared (ATR–FTIR) spectroscopy was used to confirm the variations caused by this phenomenon. First, full-spectrum partial least-squares (PLS) and genetic algorithm combined with PLS (GA-PLS) models were constructed to predict the PMI. The performance of the GA-PLS model was better than that of the full-spectrum PLS model based on its root mean square error (RMSE) of cross-validation of 3.46 h (R2 = 0.95) and the RMSE of prediction of 3.46 h (R2 = 0.94). The investigation on the similarity of spectra between blood plasma and formed elements also supported the role of redistribution of components in spectral changes in postmortem plasma. These results demonstrated that ATR-FTIR spectroscopy coupled with the advanced mathematical methods could serve as a convenient and reliable tool to study the redistribution of postmortem components and estimate the PMI. PMID:28753641

  16. Solid-phase cadmium speciation in soil using L3-edge XANES spectroscopy with partial least-squares regression.

    PubMed

    Siebers, Nina; Kruse, Jens; Eckhardt, Kai-Uwe; Hu, Yongfeng; Leinweber, Peter

    2012-07-01

    Cadmium (Cd) has a high toxicity and resolving its speciation in soil is challenging but essential for estimating the environmental risk. In this study partial least-square (PLS) regression was tested for its capability to deconvolute Cd L(3)-edge X-ray absorption near-edge structure (XANES) spectra of multi-compound mixtures. For this, a library of Cd reference compound spectra and a spectrum of a soil sample were acquired. A good coefficient of determination (R(2)) of Cd compounds in mixtures was obtained for the PLS model using binary and ternary mixtures of various Cd reference compounds proving the validity of this approach. In order to describe complex systems like soil, multi-compound mixtures of a variety of Cd compounds must be included in the PLS model. The obtained PLS regression model was then applied to a highly Cd-contaminated soil revealing Cd(3)(PO(4))(2) (36.1%), Cd(NO(3))(2)·4H(2)O (24.5%), Cd(OH)(2) (21.7%), CdCO(3) (17.1%) and CdCl(2) (0.4%). These preliminary results proved that PLS regression is a promising approach for a direct determination of Cd speciation in the solid phase of a soil sample.

  17. New PLS analysis approach to wine volatile compounds characterization by near infrared spectroscopy (NIR).

    PubMed

    Genisheva, Z; Quintelas, C; Mesquita, D P; Ferreira, E C; Oliveira, J M; Amaral, A L

    2018-04-25

    This work aims to explore the potential of near infrared (NIR) spectroscopy to quantify volatile compounds in Vinho Verde wines, commonly determined by gas chromatography. For this purpose, 105 Vinho Verde wine samples were analyzed using Fourier transform near infrared (FT-NIR) transmission spectroscopy in the range of 5435 cm-1 to 6357 cm-1. Boxplot and principal components analysis (PCA) were performed for cluster identification and outlier removal. A partial least squares (PLS) regression was then applied to develop the calibration models, by a new iterative approach. The predictive ability of the models was confirmed by an external validation procedure with an independent sample set. The obtained results could be considered as quite good with coefficients of determination (R2) varying from 0.94 to 0.97. The current methodology, using NIR spectroscopy and chemometrics, can be seen as a promising rapid tool to determine volatile compounds in Vinho Verde wines. Copyright © 2017 Elsevier Ltd. All rights reserved.

  18. Automatic variable selection method and a comparison for quantitative analysis in laser-induced breakdown spectroscopy

    NASA Astrophysics Data System (ADS)

    Duan, Fajie; Fu, Xiao; Jiang, Jiajia; Huang, Tingting; Ma, Ling; Zhang, Cong

    2018-05-01

    In this work, an automatic variable selection method for quantitative analysis of soil samples using laser-induced breakdown spectroscopy (LIBS) is proposed, which is based on full spectrum correction (FSC) and modified iterative predictor weighting-partial least squares (mIPW-PLS). The method features automatic selection without artificial processes. To illustrate the feasibility and effectiveness of the method, a comparison with genetic algorithm (GA) and successive projections algorithm (SPA) for the detection of different elements (copper, barium and chromium) in soil was implemented. The experimental results showed that all three methods could accomplish variable selection effectively, among which FSC-mIPW-PLS required significantly shorter computation time (approximately 12 s for 40,000 initial variables) than the others. Moreover, improved quantification models were obtained with variable selection approaches. The root mean square errors of prediction (RMSEP) of models utilizing the new method were 27.47 (copper), 37.15 (barium) and 39.70 (chromium) mg/kg, which showed predictive performance comparable to that of GA and SPA.

  19. Using MetaboAnalyst 3.0 for Comprehensive Metabolomics Data Analysis.

    PubMed

    Xia, Jianguo; Wishart, David S

    2016-09-07

    MetaboAnalyst (http://www.metaboanalyst.ca) is a comprehensive Web application for metabolomic data analysis and interpretation. MetaboAnalyst handles most of the common metabolomic data types from most kinds of metabolomics platforms (MS and NMR) for most kinds of metabolomics experiments (targeted, untargeted, quantitative). In addition to providing a variety of data processing and normalization procedures, MetaboAnalyst also supports a number of data analysis and data visualization tasks using a range of univariate, multivariate methods such as PCA (principal component analysis), PLS-DA (partial least squares discriminant analysis), heatmap clustering and machine learning methods. MetaboAnalyst also offers a variety of tools for metabolomic data interpretation including MSEA (metabolite set enrichment analysis), MetPA (metabolite pathway analysis), and biomarker selection via ROC (receiver operating characteristic) curve analysis, as well as time series and power analysis. This unit provides an overview of the main functional modules and the general workflow of the latest version of MetaboAnalyst (MetaboAnalyst 3.0), followed by eight detailed protocols. © 2016 by John Wiley & Sons, Inc. Copyright © 2016 John Wiley & Sons, Inc.

  20. Advanced stability indicating chemometric methods for quantitation of amlodipine and atorvastatin in their quinary mixture with acidic degradation products.

    PubMed

    Darwish, Hany W; Hassan, Said A; Salem, Maissa Y; El-Zeany, Badr A

    2016-02-05

    Two advanced, accurate and precise chemometric methods are developed for the simultaneous determination of amlodipine besylate (AML) and atorvastatin calcium (ATV) in the presence of their acidic degradation products in tablet dosage forms. The first method was Partial Least Squares (PLS-1) and the second was Artificial Neural Networks (ANN). PLS was compared to ANN models with and without variable selection procedure (genetic algorithm (GA)). For proper analysis, a 5-factor 5-level experimental design was established resulting in 25 mixtures containing different ratios of the interfering species. Fifteen mixtures were used as calibration set and the other ten mixtures were used as validation set to validate the prediction ability of the suggested models. The proposed methods were successfully applied to the analysis of pharmaceutical tablets containing AML and ATV. The methods indicated the ability of the mentioned models to solve the highly overlapped spectra of the quinary mixture, yet using inexpensive and easy to handle instruments like the UV-VIS spectrophotometer. Copyright © 2015 Elsevier B.V. All rights reserved.
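
    The calibration and validation workflow described above can be illustrated with a compact PLS-1 routine, that is, PLS with a single response variable. The sketch below uses the classical NIPALS algorithm with mean centering only. It is a generic textbook formulation and is not the authors' implementation. The function names and the toy data used with it are hypothetical.

```python
from math import sqrt


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def pls1_fit(X, y, n_components):
    """Fit a PLS-1 model with NIPALS on mean-centred data."""
    n, p = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    E = [[row[j] - x_mean[j] for j in range(p)] for row in X]  # X residuals
    f = [value - y_mean for value in y]                         # y residuals
    weights, loadings, coefs = [], [], []
    for _ in range(n_components):
        # Weight vector: direction in X most covariant with the y residual.
        w = [sum(E[i][j] * f[i] for i in range(n)) for j in range(p)]
        norm = sqrt(_dot(w, w))
        if norm < 1e-12:  # nothing left to explain
            break
        w = [value / norm for value in w]
        t = [_dot(E[i], w) for i in range(n)]                   # scores
        tt = _dot(t, t)
        p_load = [sum(E[i][j] * t[i] for i in range(n)) / tt for j in range(p)]
        q = _dot(f, t) / tt                                     # inner regression
        # Deflate X and y by the part explained by this component.
        E = [[E[i][j] - t[i] * p_load[j] for j in range(p)] for i in range(n)]
        f = [f[i] - q * t[i] for i in range(n)]
        weights.append(w)
        loadings.append(p_load)
        coefs.append(q)
    return x_mean, y_mean, weights, loadings, coefs


def pls1_predict(model, x):
    """Predict the response for one new sample by repeating the deflation."""
    x_mean, y_mean, weights, loadings, coefs = model
    e = [xi - m for xi, m in zip(x, x_mean)]
    y_hat = y_mean
    for w, p_load, q in zip(weights, loadings, coefs):
        t = _dot(e, w)
        y_hat += q * t
        e = [ei - t * pj for ei, pj in zip(e, p_load)]
    return y_hat
```

    In a design like the one in the abstract, the 15 calibration mixtures would be passed to `pls1_fit` and each of the 10 validation mixtures to `pls1_predict`, with the number of components chosen by cross-validation. With as many components as there are independent variables, the result coincides with ordinary least squares.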

  1. Comparative artificial neural network and partial least squares models for analysis of Metronidazole, Diloxanide, Spiramycin and Cliquinol in pharmaceutical preparations.

    PubMed

    Elkhoudary, Mahmoud M; Abdel Salam, Randa A; Hadad, Ghada M

    2014-09-15

    Metronidazole (MNZ) is a widely used antibacterial and amoebicide drug. Therefore, it is important to develop a rapid and specific analytical method for the determination of MNZ in mixture with Spiramycin (SPY), Diloxanide (DIX) and Cliquinol (CLQ) in pharmaceutical preparations. This work describes six simple, sensitive and reliable multivariate calibration methods, namely linear and nonlinear artificial neural networks preceded by genetic algorithm (GA-ANN) and principal component analysis (PCA-ANN) as well as partial least squares (PLS) either alone or preceded by genetic algorithm (GA-PLS) for UV spectrophotometric determination of MNZ, SPY, DIX and CLQ in pharmaceutical preparations with no interference of pharmaceutical additives. The results manifest the problem of nonlinearity and how models like ANN can handle it. Analytical performance of these methods was statistically validated with respect to linearity, accuracy, precision and specificity. The developed methods indicate the ability of the previously mentioned multivariate calibration models to handle and solve UV spectra of the four components' mixtures using an easy and widely used UV spectrophotometer. Copyright © 2014 Elsevier B.V. All rights reserved.

  2. Comparative artificial neural network and partial least squares models for analysis of Metronidazole, Diloxanide, Spiramycin and Cliquinol in pharmaceutical preparations

    NASA Astrophysics Data System (ADS)

    Elkhoudary, Mahmoud M.; Abdel Salam, Randa A.; Hadad, Ghada M.

    2014-09-01

    Metronidazole (MNZ) is a widely used antibacterial and amoebicide drug. Therefore, it is important to develop a rapid and specific analytical method for the determination of MNZ in mixture with Spiramycin (SPY), Diloxanide (DIX) and Cliquinol (CLQ) in pharmaceutical preparations. This work describes six simple, sensitive and reliable multivariate calibration methods, namely linear and nonlinear artificial neural networks preceded by genetic algorithm (GA-ANN) and principal component analysis (PCA-ANN) as well as partial least squares (PLS) either alone or preceded by genetic algorithm (GA-PLS) for UV spectrophotometric determination of MNZ, SPY, DIX and CLQ in pharmaceutical preparations with no interference of pharmaceutical additives. The results manifest the problem of nonlinearity and how models like ANN can handle it. Analytical performance of these methods was statistically validated with respect to linearity, accuracy, precision and specificity. The developed methods indicate the ability of the previously mentioned multivariate calibration models to handle and solve UV spectra of the four components’ mixtures using an easy and widely used UV spectrophotometer.

  3. Seminal, clinical and colour-Doppler ultrasound correlations of prostatitis-like symptoms in males of infertile couples.

    PubMed

    Lotti, F; Corona, G; Mondaini, N; Maseroli, E; Rossi, M; Filimberti, E; Noci, I; Forti, G; Maggi, M

    2014-01-01

    'Prostatitis-like symptoms' (PLS) are a cluster of bothersome conditions defined as 'perineal and/or ejaculatory pain or discomfort and National Institutes of Health-Chronic Prostatitis Symptom Index (NIH-CPSI) pain subdomain score ≥4' (Nickel's criteria). PLS may originate from the prostate or from other portions of the male genital tract. Although PLS could be associated with 'prostatitis', they should not be confused. The NIH-CPSI is considered the gold-standard for assessing PLS severity. Although previous studies investigated the impact of prostatitis, vesiculitis or epididymitis on semen parameters, correlations between their related symptoms and seminal or scrotal/transrectal colour-Doppler ultrasound (CDU) characteristics have not been carefully determined. And no previous study evaluated the CDU features of PLS in infertile men. This study was aimed at investigating possible associations among NIH-CPSI (total and subdomain) scores and PLS, with seminal, clinical and scrotal/transrectal CDU parameters in a cohort of males of infertile couples. PLS of 400 men (35.8 ± 7.2 years) with a suspected male factor were assessed by the NIH-CPSI. All patients underwent, during the same day, semen analysis, seminal plasma interleukin 8 (sIL-8, a marker of male genital tract inflammation), biochemical evaluation, urine/seminal cultures, scrotal/transrectal CDU. PLS was detected in 39 (9.8%) subjects. After adjusting for age, waist and total testosterone (TT), no association among NIH-CPSI (total or subdomain) scores or PLS and sperm parameters was observed. However, we found a positive association with current positive urine and/or seminal cultures, sIL-8 levels and CDU features suggestive of inflammation of the epididymis, seminal vesicles, prostate, but not of the testis. The aforementioned significant associations of PLS were further confirmed by comparing PLS patients with age-, waist- and TT-matched PLS-free patients (1 : 3 ratio). 
In conclusion, NIH-CPSI scores and PLS evaluated in males of infertile couples, are not related to sperm parameters, but mainly to clinical and CDU signs of infection/inflammation. © 2013 American Society of Andrology and European Academy of Andrology.

  4. Application of principal component regression and artificial neural network in FT-NIR soluble solids content determination of intact pear fruit

    NASA Astrophysics Data System (ADS)

    Ying, Yibin; Liu, Yande; Fu, Xiaping; Lu, Huishan

    2005-11-01

    Artificial neural networks (ANNs) have been used successfully in applications such as pattern recognition, image processing, automation and control. However, the majority of today's ANN applications use back-propagation feed-forward ANNs (BP-ANN). In this paper, back-propagation artificial neural networks (BP-ANN) were applied for modeling soluble solid content (SSC) of intact pear from their Fourier transform near infrared (FT-NIR) spectra. One hundred and sixty-four pear samples were used to build the calibration models and evaluate the models' predictive ability. The results are compared to the classical calibration approaches, i.e. principal component regression (PCR), partial least squares (PLS) and non-linear PLS (NPLS). The effects of the optimal methods of training parameters on the prediction model were also investigated. BP-ANN combined with principal component regression (PCR) always performed better than the classical PCR, PLS and Weight-PLS methods in terms of predictive ability. Based on the results, it can be concluded that FT-NIR spectroscopy and BP-ANN models can be properly employed for rapid and nondestructive determination of fruit internal quality.

  5. The crucial role of the Pls1 tetraspanin during ascospore germination in Podospora anserina provides an example of the convergent evolution of morphogenetic processes in fungal plant pathogens and saprobes.

    PubMed

    Lambou, Karine; Malagnac, Fabienne; Barbisan, Crystel; Tharreau, Didier; Lebrun, Marc-Henri; Silar, Philippe

    2008-10-01

    Pls1 tetraspanins were shown for some pathogenic fungi to be essential for appressorium-mediated penetration into their host plants. We show here that Podospora anserina, a saprobic fungus lacking appressorium, contains PaPls1, a gene orthologous to known PLS1 genes. Inactivation of PaPls1 demonstrates that this gene is specifically required for the germination of ascospores in P. anserina. These ascospores are heavily melanized cells that germinate under inducing conditions through a specific pore. On the contrary, MgPLS1, which fully complements a DeltaPaPls1 ascospore germination defect, has no role in the germination of Magnaporthe grisea nonmelanized ascospores but is required for the formation of the penetration peg at the pore of its melanized appressorium. P. anserina mutants with mutation of PaNox2, which encodes the NADPH oxidase of the NOX2 family, display the same ascospore-specific germination defect as the DeltaPaPls1 mutant. Both mutant phenotypes are suppressed by the inhibition of melanin biosynthesis, suggesting that they are involved in the same cellular process required for the germination of P. anserina melanized ascospores. The analysis of the distribution of PLS1 and NOX2 genes in fungal genomes shows that they are either both present or both absent. These results indicate that the germination of P. anserina ascospores and the formation of the M. grisea appressorium penetration peg use the same molecular machinery that includes Pls1 and Nox2. This machinery is specifically required for the emergence of polarized hyphae from reinforced structures such as appressoria and ascospores. Its recurrent recruitment during fungal evolution may account for some of the morphogenetic convergence observed in fungi.

  6. Survey of Voyager plasma science ions at Jupiter: 1. Analysis method

    NASA Astrophysics Data System (ADS)

    Bagenal, F.; Dougherty, L. P.; Bodisch, K. M.; Richardson, J. D.; Belcher, J. M.

    2017-08-01

    The Voyagers 1 and 2 spacecraft flew by Jupiter in March and July of 1979, respectively. The Plasma Science instrument (PLS) acquired detailed measurements of the plasma environment in the equatorial region of the magnetosphere between 4.9 and 4 RJ. While bulk plasma properties such as charge density, ion temperature, and bulk flow were reasonably well determined, the ion composition was only well constrained in occasional regions of cold plasma. The ion data obtained by the PLS instrument have been reanalyzed using physical chemistry models to constrain the composition and reduce the number of free parameters, particularly in regions of hotter plasma. This paper describes the method used for fitting the plasma data and presents the results versus time. Two companion papers describe the composition of heavy ions and present analysis of protons plus other minor ions.

  7. Linked functional network abnormalities during intrinsic and extrinsic activity in schizophrenia as revealed by a data-fusion approach.

    PubMed

    Hashimoto, Ryu-Ichiro; Itahashi, Takashi; Okada, Rieko; Hasegawa, Sayaka; Tani, Masayuki; Kato, Nobumasa; Mimura, Masaru

    2018-01-01

    Abnormalities in functional brain networks in schizophrenia have been studied by examining intrinsic and extrinsic brain activity under various experimental paradigms. However, the identified patterns of abnormal functional connectivity (FC) vary depending on the adopted paradigms. Thus, it is unclear whether and how these patterns are inter-related. In order to assess relationships between abnormal patterns of FC during intrinsic activity and those during extrinsic activity, we adopted a data-fusion approach and applied partial least square (PLS) analyses to FC datasets from 25 patients with chronic schizophrenia and 25 age- and sex-matched normal controls. For the input to the PLS analyses, we generated a pair of FC maps during the resting state (REST) and the auditory deviance response (ADR) from each participant using the common seed region in the left middle temporal gyrus, which is a focus of activity associated with auditory verbal hallucinations (AVHs). PLS correlation (PLS-C) analysis revealed that patients with schizophrenia have significantly lower loadings of a component containing positive FCs in default-mode network regions during REST and a component containing positive FCs in the auditory and attention-related networks during ADR. Specifically, loadings of the REST component were significantly correlated with the severities of positive symptoms and AVH in patients with schizophrenia. The co-occurrence of such altered FC patterns during REST and ADR was replicated using PLS regression, wherein FC patterns during REST are modeled to predict patterns during ADR. These findings provide an integrative understanding of altered FCs during intrinsic and extrinsic activity underlying core schizophrenia symptoms.

  8. Tablet potency of Tianeptine in coated tablets by near infrared spectroscopy: model optimisation, calibration transfer and confidence intervals.

    PubMed

    Boiret, Mathieu; Meunier, Loïc; Ginot, Yves-Michel

    2011-02-20

    A near infrared (NIR) method was developed for determination of tablet potency of active pharmaceutical ingredient (API) in a complex coated tablet matrix. The calibration set contained samples from laboratory and production scale batches. The reference values were obtained by high performance liquid chromatography (HPLC) and partial least squares (PLS) regression was used to establish a model. The model was challenged by calculating tablet potency of two external test sets. Root mean square errors of prediction were respectively equal to 2.0% and 2.7%. To use this model with a second spectrometer from the production field, a calibration transfer method called piecewise direct standardisation (PDS) was used. After the transfer, the root mean square error of prediction of the first test set was 2.4% compared to 4.0% without transferring the spectra. A statistical technique using bootstrap of PLS residuals was used to estimate confidence intervals of tablet potency calculations. This method requires an optimised PLS model, selection of the bootstrap number and determination of the risk. In the case of a chemical analysis, the tablet potency value will be included within the confidence interval calculated by the bootstrap method. An easy to use graphical interface was developed to easily determine if the predictions, surrounded by minimum and maximum values, are within the specifications defined by the regulatory organisation. Copyright © 2010 Elsevier B.V. All rights reserved.
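
    The residual-bootstrap idea in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: an ordinary least-squares fit stands in for the optimised PLS model, and the function name, the synthetic data, and the values of `n_boot` (the bootstrap number) and `risk` are assumptions made for the example.

```python
import numpy as np

def residual_bootstrap_interval(X_cal, y_cal, x_new, n_boot=500, risk=0.05, seed=0):
    """Return (prediction, lower, upper) for one new sample.

    Assumption: a least-squares linear model replaces the PLS model
    used in the paper; the resampling logic is the same in spirit.
    """
    rng = np.random.default_rng(seed)
    A = np.column_stack([np.ones(len(X_cal)), X_cal])   # add intercept column
    a_new = np.concatenate([[1.0], x_new])
    coef, *_ = np.linalg.lstsq(A, y_cal, rcond=None)
    fitted = A @ coef
    residuals = y_cal - fitted

    preds = np.empty(n_boot)
    for b in range(n_boot):
        # Rebuild a pseudo-calibration set by resampling residuals with replacement
        y_star = fitted + rng.choice(residuals, size=len(residuals), replace=True)
        coef_b, *_ = np.linalg.lstsq(A, y_star, rcond=None)
        # Add one resampled residual so the interval also covers measurement noise
        preds[b] = a_new @ coef_b + rng.choice(residuals)

    lower, upper = np.quantile(preds, [risk / 2, 1 - risk / 2])
    return a_new @ coef, lower, upper

# Synthetic (hypothetical) calibration data: potency around 100% of label claim
rng = np.random.default_rng(1)
X_cal = rng.normal(size=(40, 3))
y_cal = 100 + X_cal @ np.array([2.0, -1.0, 0.5]) + rng.normal(scale=1.0, size=40)
point, lo, hi = residual_bootstrap_interval(X_cal, y_cal, np.zeros(3))
```

    A prediction would then be judged against the specification limits by checking whether both `lo` and `hi` fall inside them.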

  9. A multi-model fusion strategy for multivariate calibration using near and mid-infrared spectra of samples from brewing industry

    NASA Astrophysics Data System (ADS)

    Tan, Chao; Chen, Hui; Wang, Chao; Zhu, Wanping; Wu, Tong; Diao, Yuanbo

    2013-03-01

    Near and mid-infrared (NIR/MIR) spectroscopy techniques have gained great acceptance in the industry due to their multiple applications and versatility. However, successful application often depends heavily on the construction of accurate and stable calibration models. For this purpose, a simple multi-model fusion strategy is proposed. It is a combination of Kohonen self-organizing map (KSOM), mutual information (MI) and partial least squares (PLS) and is therefore named KMICPLS. It works as follows: First, the original training set is fed into a KSOM for unsupervised clustering of samples, on which a series of training subsets are constructed. Thereafter, on each of the training subsets, a MI spectrum is calculated and only the variables with higher MI values than the mean value are retained, based on which a candidate PLS model is constructed. Finally, a fixed number of PLS models are selected to produce a consensus model. Two NIR/MIR spectral datasets from the brewing industry are used for experiments. The results confirm its superior performance over two reference algorithms, i.e., the conventional PLS and genetic algorithm-PLS (GAPLS). It can build more accurate and stable calibration models without increasing the complexity, and can be generalized to other NIR/MIR applications.

  10. Application of NIRS coupled with PLS regression as a rapid, non-destructive alternative method for quantification of KBA in Boswellia sacra

    NASA Astrophysics Data System (ADS)

    Al-Harrasi, Ahmed; Rehman, Najeeb Ur; Mabood, Fazal; Albroumi, Muhammaed; Ali, Liaqat; Hussain, Javid; Hussain, Hidayat; Csuk, René; Khan, Abdul Latif; Alam, Tanveer; Alameri, Saif

    2017-09-01

    In the present study, for the first time, NIR spectroscopy coupled with PLS regression as a rapid and alternative method was developed to quantify the amount of Keto-β-Boswellic Acid (KBA) in different plant parts of Boswellia sacra and the resin exudates of the trunk. NIR spectroscopy was used for the measurement of KBA standards and B. sacra samples in absorption mode in the wavelength range from 700-2500 nm. A PLS regression model was built from the obtained spectral data using 70% of KBA standards (training set) in the range from 0.1 ppm to 100 ppm. The resulting PLS regression model had an R-square value of 98% with a correlation of 0.99, and predicted well, with an RMSEP of 3.2 and a correlation of 0.99. It was then used to quantify the amount of KBA in the samples of B. sacra. The results indicated that the MeOH extract of resin has the highest concentration of KBA (0.6%) followed by essential oil (0.1%). However, no KBA was found in the aqueous extract. The MeOH extract of the resin was subjected to column chromatography to get various sub-fractions at different polarity of organic solvents. The sub-fraction at 4% MeOH/CHCl3 (4.1% of KBA) was found to contain the highest percentage of KBA followed by another sub-fraction at 2% MeOH/CHCl3 (2.2% of KBA). The present results also indicated that KBA is only present in the gum-resin of the trunk and not in all parts of the plant. These results were further confirmed through HPLC analysis and therefore it is concluded that NIRS coupled with PLS regression is a rapid and alternate method for quantification of KBA in Boswellia sacra. It is non-destructive, rapid, sensitive and uses simple methods of sample preparation.

  11. Analysis of multi-mode to single-mode conversion at 635 nm and 1550 nm

    NASA Astrophysics Data System (ADS)

    Zamora, Vanessa; Bogatzki, Angelina; Arndt-Staufenbiel, Norbert; Hofmann, Jens; Schröder, Henning

    2016-03-01

    We propose two low-cost and robust optical fiber systems based on the photonic lantern (PL) technology for operating at 635 nm and 1550 nm. The PL is an emerging technology that couples light from a multi-mode (MM) fiber to several single-mode (SM) fibers via a low-loss adiabatic transition. This bundle of SM fibers is observed as a MM fiber system whose spatial modes are the degenerate supermodes of the bundle. The adiabatic transition allows that those supermodes evolve into the modes of the MM fiber. Simulations of the MM fiber end structure and its taper transition have been performed via functional mode solver tools in order to understand the modal evolution in PLs. The modelled design consists of 7 SM fibers inserted into a low-index capillary. The material and geometry of the PLs are chosen such that the supermodes match to the spatial modes of the desired step-index MM fiber in a moderate loss transmission. The dispersion of materials is also considered. These parameters are studied in two PL systems in order to reach a spectral transmission from 450 nm to 1600 nm. Additionally, an analysis of the geometry and losses due to the mismatching of modes is presented. PLs are typically used in the fields of astrophotonics and space photonics. Recently, they are demonstrated as mode converters in telecommunications, especially focusing on spatial division multiplexing. In this study, we show the use of PLs as a promising interconnecting tool for the development of miniaturized spectrometers operating in a broad wavelength range.

  12. A rapid quantification method for the screening indicator for β-thalassemia with near-infrared spectroscopy

    NASA Astrophysics Data System (ADS)

    Chen, Jiemei; Peng, Lijun; Han, Yun; Yao, Lijun; Zhang, Jing; Pan, Tao

    2018-03-01

    Near-infrared (NIR) spectroscopy combined with chemometrics was applied to rapidly analyse haemoglobin A2 (HbA2) for β-thalassemia screening in human haemolysate samples. The relative content indicator HbA2 was indirectly quantified by simultaneous analysis of two absolute content indicators (Hb and Hb • HbA2). According to the comprehensive prediction effect of the multiple partitioning of calibration and prediction sets, the parameters were optimized to achieve modelling stability, and the preferred models were validated using the samples not involved in modelling. Savitzky-Golay smoothing was first used for the spectral pretreatment. The absorbance optimization partial least squares (AO-PLS) was used to eliminate high-absorption wave-bands appropriately. The equidistant combination PLS (EC-PLS) was further used to optimize wavelength models. The selected optimal models were I = 856 nm, N = 16, G = 1 and F = 6 for Hb and I = 988 nm, N = 12, G = 2 and F = 5 for Hb • HbA2. Through independent validation, the root-mean-square errors and correlation coefficients for prediction (RMSEP, RP) were 3.50 g L-1 and 0.977 for Hb and 0.38 g L-1 and 0.917 for Hb • HbA2, respectively. The predicted values of relative percentage HbA2 were further calculated, and the calculated RMSEP and RP were 0.31% and 0.965, respectively. The sensitivity and specificity for β-thalassemia both reached 100%. Therefore, the prediction of HbA2 achieved high accuracy for distinguishing β-thalassemia. The local optimal models for single parameter and the optimal equivalent model sets were proposed, providing more models to match possible constraints in practical applications. The NIR analysis method for the screening indicator of β-thalassemia was successfully established. The proposed method was rapid, simple and promising for thalassemia screening in a large population.
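
    The indirect quantification in this abstract can be illustrated with a short sketch. It assumes that Hb • HbA2 denotes the absolute HbA2 content (in g/L), so that the relative percentage is 100 × (Hb • HbA2) / Hb; this reading, the function names, and the sample values are assumptions made for illustration and are not data from the study.

```python
import math

def relative_percent(hb_hba2, hb):
    """Relative HbA2 (%) from two absolute indicators, both in g/L (assumed reading)."""
    return [100.0 * a / b for a, b in zip(hb_hba2, hb)]

def rmsep(predicted, reference):
    """Root-mean-square error of prediction."""
    n = len(reference)
    return math.sqrt(sum((p - r) ** 2 for p, r in zip(predicted, reference)) / n)

def pearson_r(x, y):
    """Pearson correlation coefficient between predicted and reference values."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical NIR-predicted values for four samples (g/L)
hb_pred = [130.0, 142.0, 118.0, 125.0]
hb_hba2_pred = [3.9, 4.0, 6.5, 3.5]
hba2_percent = relative_percent(hb_hba2_pred, hb_pred)
```

    Given reference HbA2 percentages for the same samples, `rmsep` and `pearson_r` would yield the two figures of merit that the abstract reports as RMSEP and RP.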

  13. Evaluation of the efficiency of continuous wavelet transform as processing and preprocessing algorithm for resolution of overlapped signals in univariate and multivariate regression analyses; an application to ternary and quaternary mixtures.

    PubMed

    Hegazy, Maha A; Lotfy, Hayam M; Mowaka, Shereen; Mohamed, Ekram Hany

    2016-07-05

    Wavelets have been adapted for a vast number of signal-processing applications due to the amount of information that can be extracted from a signal. In this work, a comparative study on the efficiency of continuous wavelet transform (CWT) as a signal processing tool in univariate regression and a pre-processing tool in multivariate analysis using partial least square (CWT-PLS) was conducted. These were applied to complex spectral signals of ternary and quaternary mixtures. The CWT-PLS method succeeded in the simultaneous determination of a quaternary mixture of drotaverine (DRO), caffeine (CAF), paracetamol (PAR) and p-aminophenol (PAP, the major impurity of paracetamol). In contrast, the univariate CWT failed to simultaneously determine the quaternary mixture components and was able to determine only PAR and PAP, and the ternary mixtures of DRO, CAF, and PAR and of CAF, PAR, and PAP. During the calculations of CWT, different wavelet families were tested. The univariate CWT method was validated according to the ICH guidelines. For the development of the CWT-PLS model, a calibration set was prepared by means of an orthogonal experimental design, and the absorption spectra were recorded and processed by CWT. The CWT-PLS model was constructed by regression between the wavelet coefficients and concentration matrices and validation was performed by both cross validation and external validation sets. Both methods were successfully applied for determination of the studied drugs in pharmaceutical formulations. Copyright © 2016 Elsevier B.V. All rights reserved.

  14. Metabolomics of Breast Cancer Using High-Resolution Magic Angle Spinning Magnetic Resonance Spectroscopy: Correlations with 18F-FDG Positron Emission Tomography-Computed Tomography, Dynamic Contrast-Enhanced and Diffusion-Weighted Imaging MRI.

    PubMed

    Yoon, Haesung; Yoon, Dahye; Yun, Mijin; Choi, Ji Soo; Park, Vivian Youngjean; Kim, Eun-Kyung; Jeong, Joon; Koo, Ja Seung; Yoon, Jung Hyun; Moon, Hee Jung; Kim, Suhkmann; Kim, Min Jung

    2016-01-01

    Our goal in this study was to find correlations between breast cancer metabolites and conventional quantitative imaging parameters using high-resolution magic angle spinning (HR-MAS) magnetic resonance spectroscopy (MRS) and to find breast cancer subgroups that show high correlations between metabolites and imaging parameters. Between August 2010 and December 2013, we included 53 female patients (mean age 49.6 years; age range 32-75 years) with a total of 53 breast lesions assessed by the Breast Imaging Reporting and Data System. They were enrolled under the following criteria: breast lesions larger than 1 cm in diameter which 1) were suspicious for malignancy on mammography or ultrasound (US), 2) were pathologically confirmed to be breast cancer with US-guided core-needle biopsy (CNB), 3) underwent 3 Tesla MRI with dynamic contrast-enhanced (DCE) and diffusion-weighted imaging (DWI) and positron emission tomography-computed tomography (PET-CT), and 4) had an attainable immunohistochemistry profile from CNB. We acquired spectral data by HR-MAS MRS with CNB specimens and expressed the data as relative metabolite concentrations. We compared the metabolites with the signal enhancement ratio (SER), maximum standardized FDG uptake value (SUV max), apparent diffusion coefficient (ADC), and histopathologic prognostic factors for correlation. We calculated Spearman correlations and performed a partial least squares-discriminant analysis (PLS-DA) to further classify patient groups into subgroups to find correlation differences between HR-MAS spectroscopic values and conventional imaging parameters. In a multivariate analysis, the PLS-DA models built with HR-MAS MRS metabolic profiles showed visible discrimination between high and low SER, SUV, and ADC. In luminal subtype breast cancer, compared to all cases, high SER, ADC, and SUV were more closely clustered by visual assessment. Multiple metabolites were correlated with SER and SUV in all cases. Multiple metabolites showed correlations with SER and SUV in the ER positive, HER2 negative, and Ki-67 negative groups. High levels of PC, choline, and glycine acquired from HR-MAS MRS using CNB specimens were noted in the high SER group via DCE MRI and the high SUV group via PET-CT, with significant correlations between choline and SER and between PC and SUV. Further studies should investigate whether HR-MAS MRS using CNB specimens can provide similar or more prognostic information than conventional quantitative imaging parameters.

  15. Metabolomics analysis reveals 6-benzylaminopurine as a stimulator for improving lipid and DHA accumulation of Aurantiochytrium sp.

    PubMed

    Yu, Xin-Jun; Sun, Jie; Zheng, Jian-Yong; Sun, Ya-Qi; Wang, Zhao

    2016-04-01

    Phytohormones are chemical messengers that have a positive effect on biodiesel production of microalgae at low concentrations. However, the effect of phytohormone 6-benzylaminopurine on lipid and docosahexaenoic acid (DHA) production in marine DHA-producer Aurantiochytrium has never been reported. In this study, a GC-MS-based metabolomics method combined with a multivariate analysis is applied to reveal the metabolic mechanism of 6-benzylaminopurine enhancing production of lipid and DHA in Aurantiochytrium sp. YLH70. In total, 71 metabolites were identified by GC-MS. The PCA model revealed that 76.9% of metabolite variation was related to 6-benzylaminopurine treatment, and overall metabolomics profiles between the 6-benzylaminopurine and control groups were clearly discriminated. Forty-six metabolites identified by the PLS-DA model were responsible for responding to 6-benzylaminopurine. Metabolic analysis showed that 6-benzylaminopurine could accelerate the rate of utilization of glucose in Aurantiochytrium sp. YLH70, and the metabolic flux from glycolysis, TCA cycle and mevalonate pathway to fatty acids biosynthesis was promoted. Moreover, the anti-stress mechanism in Aurantiochytrium sp. YLH70 might be induced by 6-benzylaminopurine. Metabolomics is a suitable tool to discover the metabolic mechanism for improving lipid and DHA accumulation in a microorganism. 6-benzylaminopurine has the potential to stimulate lipid and DHA production of Aurantiochytrium sp. YLH70 for industrial purposes. © 2015 The Authors. Journal of Chemical Technology & Biotechnology published by John Wiley & Sons Ltd on behalf of Society of Chemical Industry.

  16. Mixture quantification using PLS in plastic scintillation measurements.

    PubMed

    Bagán, H; Tarancón, A; Rauret, G; García, J F

    2011-06-01

    This article reports the capability of plastic scintillation (PS) combined with multivariate calibration (Partial least squares; PLS) to detect and quantify alpha and beta emitters in mixtures. While several attempts have been made with this purpose in mind using liquid scintillation (LS), no attempt has been made using PS, which has the great advantage of not producing mixed waste after the measurements are performed. Following this objective, ternary mixtures of alpha and beta emitters ((241)Am, (137)Cs and (90)Sr/(90)Y) have been quantified. Procedure optimisation has evaluated the use of the net spectra or the sample spectra, the inclusion of different spectra obtained at different values of the Pulse Shape Analysis parameter and the application of the PLS1 or PLS2 algorithms. The conclusions show that the use of PS+PLS2 applied to the sample spectra, without the use of any pulse shape discrimination, allows quantification of the activities with relative errors less than 10% in most of the cases. This procedure not only allows quantification of mixtures but also reduces measurement time (no blanks are required) and the application of this procedure does not require detectors that include the pulse shape analysis parameter. Copyright © 2011 Elsevier Ltd. All rights reserved.

  17. Partial least squares based identification of Duchenne muscular dystrophy specific genes.

    PubMed

    An, Hui-bo; Zheng, Hua-cheng; Zhang, Li; Ma, Lin; Liu, Zheng-yan

    2013-11-01

    Large-scale parallel gene expression analysis has provided a greater ease for investigating the underlying mechanisms of Duchenne muscular dystrophy (DMD). Previous studies typically implemented variance/regression analysis, which would be fundamentally flawed when unaccounted sources of variability in the arrays existed. Here we aim to identify genes that contribute to the pathology of DMD using partial least squares (PLS) based analysis. We carried out PLS-based analysis with two datasets downloaded from the Gene Expression Omnibus (GEO) database to identify genes contributing to the pathology of DMD. Except for the genes related to inflammation, muscle regeneration and extracellular matrix (ECM) modeling, we found some genes with high fold change, which have not been identified by previous studies, such as SRPX, GPNMB, SAT1, and LYZ. In addition, downregulation of the fatty acid metabolism pathway was found, which may be related to the progressive muscle wasting process. Our results provide a better understanding for the downstream mechanisms of DMD.

  18. A PLS-based extractive spectrophotometric method for simultaneous determination of carbamazepine and carbamazepine-10,11-epoxide in plasma and comparison with HPLC

    NASA Astrophysics Data System (ADS)

    Hemmateenejad, Bahram; Rezaei, Zahra; Khabnadideh, Soghra; Saffari, Maryam

    2007-11-01

    Carbamazepine (CBZ) undergoes enzyme biotransformation through epoxidation with the formation of its metabolite, carbamazepine-10,11-epoxide (CBZE). A simple chemometrics-assisted spectrophotometric method has been proposed for simultaneous determination of CBZ and CBZE in plasma. A liquid extraction procedure was operated to separate the analytes from plasma, and the UV absorbance spectra of the resultant solutions were subjected to partial least squares (PLS) regression. The optimum number of PLS latent variables was selected according to the PRESS values of leave-one-out cross-validation. A HPLC method was also employed for comparison. The respective mean recoveries for analysis of CBZ and CBZE in synthetic mixtures were 102.57 (±0.25)% and 103.00 (±0.09)% for PLS and 99.40 (±0.15)% and 102.20 (±0.02)% for HPLC. The concentrations of CBZ and CBZE were also determined in five patients using the PLS and HPLC methods. The results showed that the data obtained by PLS were comparable with those obtained by HPLC method.
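
    Selecting the number of latent variables by PRESS under leave-one-out cross-validation, as described in this abstract, can be sketched as follows. This is a minimal illustration with a small NIPALS-style PLS1 written in NumPy; the function names and the synthetic two-component "spectra" are assumptions for the example and do not reproduce the authors' data or software.

```python
import numpy as np

def pls1_fit(X, y, n_components):
    """Fit single-response PLS (PLS1) by NIPALS; returns (coef, x_mean, y_mean)."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xr, yr = X - x_mean, y - y_mean          # centred data, deflated in the loop
    W, P, q = [], [], []
    for _ in range(n_components):
        w = Xr.T @ yr
        w = w / np.linalg.norm(w)            # weight vector
        t = Xr @ w                           # scores
        tt = t @ t
        p = Xr.T @ t / tt                    # X loadings
        c = (yr @ t) / tt                    # y loading
        Xr = Xr - np.outer(t, p)
        yr = yr - c * t
        W.append(w); P.append(p); q.append(c)
    W, P, q = np.array(W).T, np.array(P).T, np.array(q)
    coef = W @ np.linalg.solve(P.T @ W, q)   # regression vector in X space
    return coef, x_mean, y_mean

def loo_press(X, y, max_components):
    """PRESS (sum of squared leave-one-out errors) for 1..max_components."""
    n = len(y)
    press = []
    for k in range(1, max_components + 1):
        total = 0.0
        for i in range(n):
            keep = np.arange(n) != i         # leave sample i out
            coef, xm, ym = pls1_fit(X[keep], y[keep], k)
            pred = (X[i] - xm) @ coef + ym
            total += (y[i] - pred) ** 2
        press.append(total)
    return press

# Synthetic (hypothetical) two-analyte mixture spectra with a little noise
rng = np.random.default_rng(0)
conc = rng.uniform(0.0, 1.0, size=(20, 2))
pure = rng.uniform(0.0, 1.0, size=(2, 30))
X = conc @ pure + 0.01 * rng.normal(size=(20, 30))
y = conc[:, 0]

press = loo_press(X, y, max_components=5)
best_k = int(np.argmin(press)) + 1           # number of latent variables with lowest PRESS
```

    Taking the minimum of PRESS is the simplest rule; in practice a more parsimonious model is sometimes preferred when the PRESS values of neighbouring models are close.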

  19. Random forest models to predict aqueous solubility.

    PubMed

    Palmer, David S; O'Boyle, Noel M; Glen, Robert C; Mitchell, John B O

    2007-01-01

    Random Forest regression (RF), Partial-Least-Squares (PLS) regression, Support Vector Machines (SVM), and Artificial Neural Networks (ANN) were used to develop QSPR models for the prediction of aqueous solubility, based on experimental data for 988 organic molecules. The Random Forest regression model predicted aqueous solubility more accurately than those created by PLS, SVM, and ANN and offered methods for automatic descriptor selection, an assessment of descriptor importance, and an in-parallel measure of predictive ability, all of which serve to recommend its use. The prediction of log molar solubility for an external test set of 330 molecules that are solid at 25 degrees C gave an r2 = 0.89 and RMSE = 0.69 log S units. For a standard data set selected from the literature, the model performed well with respect to other documented methods. Finally, the diversity of the training and test sets are compared to the chemical space occupied by molecules in the MDL drug data report, on the basis of molecular descriptors selected by the regression analysis.

  20. Improved quantitative analysis of spectra using a new method of obtaining derivative spectra based on a singular perturbation technique.

    PubMed

    Li, Zhigang; Wang, Qiaoyun; Lv, Jiangtao; Ma, Zhenhe; Yang, Linjuan

    2015-06-01

    Spectroscopy is often applied when a rapid quantitative analysis is required, but one challenge is the translation of raw spectra into a final analysis. Derivative spectra are often used as a preliminary preprocessing step to resolve overlapping signals, enhance signal properties, and suppress unwanted spectral features that arise due to non-ideal instrument and sample properties. In this study, to improve quantitative analysis of near-infrared spectra, derivatives of noisy raw spectral data need to be estimated with high accuracy. A new spectral estimator based on singular perturbation technique, called the singular perturbation spectra estimator (SPSE), is presented, and the stability analysis of the estimator is given. Theoretical analysis and simulation experimental results confirm that the derivatives can be estimated with high accuracy using this estimator. Furthermore, the effectiveness of the estimator for processing noisy infrared spectra is evaluated using the analysis of beer spectra. The derivative spectra of the beer and the marzipan are used to build the calibration model using partial least squares (PLS) modeling. The results show that the PLS based on the new estimator can achieve better performance compared with the Savitzky-Golay algorithm and can serve as an alternative choice for quantitative analytical applications.

  1. Analyses of direct and indirect impacts of a positive list system on pharmaceutical R&D investments.

    PubMed

    Han, Euna; Kim, Tae Hyun; Jeung, Myung Jin; Lee, Eui-Kyung

    2013-07-01

    The South Korean government recently enacted a Positive List System (PLS) as a major change of the national formulary listing system and reimbursed prices for pharmaceutical products. Regardless of the primary goal of the PLS, its implementation might have spillover effects by influencing the pharmaceutical industry's research and development (R&D), potentially leading to a variety of responses by firms in relation to their R&D activities. We investigated the spillover effect of the PLS on R&D investments of the pharmaceutical industry in Korea through both direct and indirect channels, examining the influence of the PLS on sales profit and cash flow. Data from 9 years (5 before and 4 after PLS implementation) were drawn from the financial statements of firms whose stocks were exchanged in 2 official stock markets in Korea (526 firms) and additional pharmaceutical firms whose financial performance was officially audited by external reviewers (263 firms). Longitudinal analyses were conducted, using the panel nature of the data to control for permanent unobserved firm heterogeneity. Our results showed that the PLS was directly associated with R&D investments. In contrast, its indirect impacts stemming from the influence on sales profit and cash flow were minimal and statistically nonsignificant. The gross impact of the PLS on R&D investments increased moving further from the enactment year; R&D investments were reduced by 18.3% to 25.8% in 2009-2010 (compared with before PLS implementation) in the firm fixed-effects model. We also found that such negative direct and gross impacts of the PLS on R&D investments were significant only in firms without newly developed chemical entities. Considering the gross negative impact of the PLS on R&D investments of pharmaceutical firms and the heterogeneous response of these firms by the R&D activities, governmental efforts of cost-containment may need to consider the spillover impact of the PLS on pharmaceutical innovation. Copyright © 2013 Elsevier HS Journals, Inc. All rights reserved.

  2. OPLS statistical model versus linear regression to assess sonographic predictors of stroke prognosis.

    PubMed

    Vajargah, Kianoush Fathi; Sadeghi-Bazargani, Homayoun; Mehdizadeh-Esfanjani, Robab; Savadi-Oskouei, Daryoush; Farhoudi, Mehdi

    2012-01-01

    The objective of the present study was to assess the comparable applicability of orthogonal projections to latent structures (OPLS) statistical model vs traditional linear regression in order to investigate the role of trans cranial doppler (TCD) sonography in predicting ischemic stroke prognosis. The study was conducted on 116 ischemic stroke patients admitted to a specialty neurology ward. The Unified Neurological Stroke Scale was used once for clinical evaluation on the first week of admission and again six months later. All data was primarily analyzed using simple linear regression and later considered for multivariate analysis using PLS/OPLS models through the SIMCA P+12 statistical software package. The linear regression analysis results used for the identification of TCD predictors of stroke prognosis were confirmed through the OPLS modeling technique. Moreover, in comparison to linear regression, the OPLS model appeared to have higher sensitivity in detecting the predictors of ischemic stroke prognosis and detected several more predictors. Applying the OPLS model made it possible to use both single TCD measures/indicators and arbitrarily dichotomized measures of TCD single vessel involvement as well as the overall TCD result. In conclusion, the authors recommend PLS/OPLS methods as complementary rather than alternative to the available classical regression models such as linear regression.

  3. Quantitative Differentiation of LV Myocardium with and without Layer-Specific Fibrosis Using MRI in Hypertrophic Cardiomyopathy and Layer-Specific Strain TTE Analysis.

    PubMed

    Funabashi, Nobusada; Takaoka, Hiroyuki; Ozawa, Koya; Kamata, Tomoko; Uehara, Masae; Komuro, Issei; Kobayashi, Yoshio

    2018-05-30

    To achieve further risk stratification in hypertrophic cardiomyopathy (HCM) patients, we localized and quantified layer-specific LVM fibrosis on MRI in HCM patients using regional layer-specific peak longitudinal strain (PLS) and peak circumferential strain (PCS) in LV myocardium (LVM) on speckle tracking transthoracic echocardiography (TTE). A total of 18 HCM patients (14 males; 58 ± 17 years) underwent 1.5T-MRI and TTE. PLS and PCS in each layer of the LVM (endocardium, epicardium, and whole-layer myocardium) were calculated for 17 AHA-defined lesions. MRI assessment showed that fibrosis was classified as endocardial, epicardial, or whole-layer (= either or both of these). Regional PLS was smaller in fibrotic endocardial lesions than in non-fibrotic endocardial lesions (P = 0.004). To detect LV endocardial lesions with fibrosis, ROC curves of regional PLS revealed an area under the curve (AUC) of 0.609 and a best cut-off point of 13.5%, with sensitivity of 65.3% and specificity of 54.3%. Regional PLS was also smaller in fibrotic epicardial lesions than in non-fibrotic epicardial lesions (P < 0.001). To detect LV epicardial lesions with fibrosis, ROC curves of PLS revealed an AUC of 0.684 and a best cut-off point of 9.5%, with sensitivity of 73.5% and specificity of 55.5%. Using whole-layer myocardium analysis, PLS was smaller in fibrotic lesions than in non-fibrotic lesions (P < 0.001). To detect whole-layer LV lesions with fibrosis, ROC curves of regional PLS revealed an AUC of 0.674 and a best cut-off point of 12.5%, with sensitivity of 79.0% and specificity of 50.7%. There were no significant differences in PCS of LV myocardium (endocardium, epicardium, and whole-layer) between fibrotic and non-fibrotic lesions. Quantitative regional PLS but not PCS in LV endocardium, epicardium, and whole-layer myocardium provides useful non-invasive information for layer-specific localization of fibrosis in HCM patients.
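
    How a strain cut-off translates into sensitivity and specificity can be shown with a short sketch. The strain values and fibrosis labels below are invented for illustration, strain is treated as a magnitude in percent with smaller values flagged as fibrotic, and choosing the cut-off by the Youden index is an assumption; the abstract does not state how its best cut-off points were selected.

```python
def sens_spec(strain, fibrotic, cutoff):
    """Sensitivity and specificity when strain <= cutoff is flagged as fibrotic."""
    tp = sum(1 for s, f in zip(strain, fibrotic) if f and s <= cutoff)
    fn = sum(1 for s, f in zip(strain, fibrotic) if f and s > cutoff)
    tn = sum(1 for s, f in zip(strain, fibrotic) if not f and s > cutoff)
    fp = sum(1 for s, f in zip(strain, fibrotic) if not f and s <= cutoff)
    return tp / (tp + fn), tn / (tn + fp)

def best_cutoff(strain, fibrotic):
    """Cut-off maximising the Youden index (sensitivity + specificity - 1)."""
    candidates = sorted(set(strain))
    return max(candidates, key=lambda c: sum(sens_spec(strain, fibrotic, c)) - 1)

# Hypothetical regional peak longitudinal strain magnitudes (%) and MRI fibrosis labels
strain = [8.0, 10.5, 12.0, 13.0, 15.0, 15.5, 16.5, 18.0, 11.0, 20.0]
fibrotic = [True, True, True, True, True, False, False, False, False, False]

cutoff = best_cutoff(strain, fibrotic)
sensitivity, specificity = sens_spec(strain, fibrotic, cutoff)
```

    Sweeping the cut-off over all observed values and plotting sensitivity against 1 - specificity would trace the ROC curve whose area the abstract reports as AUC.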

  4. Development of a partial least squares-artificial neural network (PLS-ANN) hybrid model for the prediction of consumer liking scores of ready-to-drink green tea beverages.

    PubMed

    Yu, Peigen; Low, Mei Yin; Zhou, Weibiao

    2018-01-01

    In order to develop products that would be preferred by consumers, the effects of the chemical compositions of ready-to-drink green tea beverages on consumer liking were studied through regression analyses. Green tea model systems were prepared by dosing solutions of 0.1% green tea extract with differing concentrations of eight flavour keys deemed to be important for green tea aroma and taste, based on a D-optimal experimental design, before undergoing commercial sterilisation. Sensory evaluation of the green tea model system was carried out using an untrained consumer panel to obtain hedonic liking scores of the samples. Regression models were subsequently trained to objectively predict the consumer liking scores of the green tea model systems. A linear partial least squares (PLS) regression model was developed to describe the effects of the eight flavour keys on consumer liking, with a coefficient of determination (R2) of 0.733, and a root-mean-square error (RMSE) of 3.53%. The PLS model was further augmented with an artificial neural network (ANN) to establish a PLS-ANN hybrid model. The established hybrid model was found to give a better prediction of consumer liking scores, based on its R2 (0.875) and RMSE (2.41%). Copyright © 2017 Elsevier Ltd. All rights reserved.

  5. A Comparison of Approaches for the Analysis of Interaction Effects between Latent Variables Using Partial Least Squares Path Modeling

    ERIC Educational Resources Information Center

    Henseler, Jorg; Chin, Wynne W.

    2010-01-01

    In social and business sciences, the importance of the analysis of interaction effects between manifest as well as latent variables steadily increases. Researchers using partial least squares (PLS) to analyze interaction effects between latent variables need an overview of the available approaches as well as their suitability. This article…

  6. A Latent Variable Analysis of Continuing Professional Development Constructs Using PLS-SEM Modeling

    ERIC Educational Resources Information Center

    Yazdi, Mona Tabatabaee; Motallebzadeh, Khalil; Ashraf, Hamid; Baghaei, Purya

    2017-01-01

    Continuing Professional Development (CPD), in the area of teacher education, refers to the procedures, programs or strategies that help teachers encounter the challenges of their work and accomplish their own and their learning center's goals. To this aim, the purpose of this study is to propose and validate an appropriate model of EFL teachers'…

  7. Traceability of 'Limone di Siracusa PGI' by a multidisciplinary analytical and chemometric approach.

    PubMed

    Amenta, M; Fabroni, S; Costa, C; Rapisarda, P

    2016-11-15

    Food traceability is increasingly relevant with respect to safety, quality and typicality issues. Lemon fruits grown in a typical lemon-growing area of southern Italy (Siracusa) have been awarded the PGI (Protected Geographical Indication) recognition as 'Limone di Siracusa'. Due to its peculiarity, consumers have an increasing interest in this product. The detection of potential fraud could be improved by using tools that link the composition of this production to its typical features. This study used a wide range of analytical techniques, including conventional techniques and analytical approaches, such as spectral (NIR spectra), multi-elemental (Fe, Zn, Mn, Cu, Li, Sr) and isotopic (¹³C/¹²C, ¹⁸O/¹⁶O) marker investigations, joined with multivariate statistical analysis, such as PLS-DA (Partial Least Squares Discriminant Analysis) and LDA (Linear Discriminant Analysis), to implement a traceability system to verify the authenticity of 'Limone di Siracusa' production. The results demonstrated a very good geographical discrimination rate. Copyright © 2016 Elsevier Ltd. All rights reserved.

  8. Metabolomics-based optimal koji fermentation for tyrosinase inhibition supplemented with Astragalus radix.

    PubMed

    Kim, Ah Jin; Choi, Jung Nam; Kim, Jiyoung; Yeo, Soo Hwan; Choi, Ji Ho; Lee, Choong Hwan

    2012-01-01

    The present study was focused on improving the quality of rice koji by fermentation with a selected Aspergillus oryzae strain and a plant Astragalus radix. A. oryzae KCCM 60345 was used as main inoculant and the Astragalus radix was added as supplement in rice koji preparation. LC-MS based metabolite analysis and tyrosinase inhibitory activities were studied for different time periods. A. oryzae KCCM 60345 fermented rice koji supplemented with Astragalus showed higher tyrosinase inhibition activity at 4 d of fermentation and metabolite analysis with PCA and PLS-DA indicated differences in kojic acid, calycosin-7-O-β-D-glucoside, ononin, calycosin, and formononetin as compared with other forms of rice koji fermentation. By correlation analysis between metabolites and tyrosinase inhibitory activity, calycosin and kojic acid were identified as major tyrosinase inhibitors. Based on these results, we concluded that A. oryzae KCCM 60345 supplemented with Astragalus radix is useful for whitening effects, and we identified optimal conditions for rice koji preparation.

  9. Importance-Performance Matrix Analysis (IPMA) Of Transport Disadvantage Variables on Social Exclusion in a Rural Context

    NASA Astrophysics Data System (ADS)

    Larasati, Ophilia; Puspita Dirgahayani, Eng., Dr.

    2018-05-01

    Transport services are essential to support daily life. A lack of transport supply leads to the existence of transport disadvantaged (TDA) groups who are vulnerable to social exclusion, which happens when a particular group or individual has difficulty accessing certain activities that are considered normal in society. To tackle this phenomenon, an understanding of the influence of TDA variables on social exclusion is needed. The aim of this study is to analyze the influences of TDA variables on social exclusion in a rural context, with Cibeureum Village (Bandung Barat Regency) and Bunikasih Village (Subang Regency) as the case studies. Both case studies provide different characteristics of accessibility. Partial Least Squares (PLS) Structural Equation Modeling (SEM) is chosen as the method to analyze the influences of TDA variables on social exclusion. The PLS-SEM model is developed according to the social exclusion variable and four TDA variables, i.e., accessibility, individual characteristics, private vehicle existence, and travel behavior. IPMA is done after the PLS-SEM model is evaluated. The study reveals that among the four TDA variables, accessibility has the most influence on social exclusion, hence interventions related to improving accessibility are needed to tackle social exclusion. More specifically, the provision of alternative modes is needed in both study areas, while in Bunikasih Village the cost of travel is also an important variable to consider.

  10. Classification and Identification of Plant Fibrous Material with Different Species Using near Infrared Technique—A New Way to Approach Determining Biomass Properties Accurately within Different Species

    PubMed Central

    Jiang, Wei; Zhou, Chengfeng; Han, Guangting; Via, Brian; Swain, Tammy; Fan, Zhaofei; Liu, Shaoyang

    2017-01-01

    Plant fibrous material is a good resource in textile and other industries. Normally, the several kinds of plant fibrous materials used in one process need to be identified and characterized in advance. It is easy to identify them when they are in raw condition. However, most of the materials are semi-products which are ground, rotted or pre-hydrolyzed. Classifying these samples, which include different species, with high accuracy is a big challenge. In this research, both qualitative and quantitative analysis methods were chosen to classify six different species of samples, including softwood, hardwood, bast, and aquatic plant. Soft Independent Modeling of Class Analogy (SIMCA) and partial least squares (PLS) were used. The algorithm to classify different species of samples using PLS was created independently in this research. The results showed that the six species can be successfully classified using the SIMCA and PLS methods, and that these two methods give similar results. The identification rates of kenaf, ramie and pine are 100%, and the identification rates of lotus, eucalyptus and tallow are higher than 94%. It was also found that spectra loadings can help select the best wavenumber ranges for constructing the NIR model. Inter-material distance can show how close two species are. The scores plot is helpful for choosing the number of principal components during model construction. PMID:28105037

  11. Identifying maternal and infant factors associated with newborn size in rural Bangladesh by partial least squares (PLS) regression analysis

    PubMed Central

    Rahman, Md. Jahanur; Shamim, Abu Ahmed; Klemm, Rolf D. W.; Labrique, Alain B.; Rashid, Mahbubur; Christian, Parul; West, Keith P.

    2017-01-01

    Birth weight, length and circumferences of the head, chest and arm are key measures of newborn size and health in developing countries. We assessed maternal socio-demographic factors associated with multiple measures of newborn size in a large rural population in Bangladesh using partial least squares (PLS) regression method. PLS regression, combining features from principal component analysis and multiple linear regression, is a multivariate technique with an ability to handle multicollinearity while simultaneously handling multiple dependent variables. We analyzed maternal and infant data from singletons (n = 14,506) born during a double-masked, cluster-randomized, placebo-controlled maternal vitamin A or β-carotene supplementation trial in rural northwest Bangladesh. PLS regression results identified numerous maternal factors (parity, age, early pregnancy MUAC, living standard index, years of education, number of antenatal care visits, preterm delivery and infant sex) significantly (p<0.001) associated with newborn size. Among them, preterm delivery had the largest negative influence on newborn size (Standardized β = -0.29 − -0.19; p<0.001). Scatter plots of the scores of first two PLS components also revealed an interaction between newborn sex and preterm delivery on birth size. PLS regression was found to be more parsimonious than both ordinary least squares regression and principal component regression. It also provided more stable estimates than the ordinary least squares regression and provided the effect measure of the covariates with greater accuracy as it accounts for the correlation among the covariates and outcomes. Therefore, PLS regression is recommended when either there are multiple outcome measurements in the same study, or the covariates are correlated, or both situations exist in a dataset. PMID:29261760

  12. Identifying maternal and infant factors associated with newborn size in rural Bangladesh by partial least squares (PLS) regression analysis.

    PubMed

    Kabir, Alamgir; Rahman, Md Jahanur; Shamim, Abu Ahmed; Klemm, Rolf D W; Labrique, Alain B; Rashid, Mahbubur; Christian, Parul; West, Keith P

    2017-01-01

    Birth weight, length and circumferences of the head, chest and arm are key measures of newborn size and health in developing countries. We assessed maternal socio-demographic factors associated with multiple measures of newborn size in a large rural population in Bangladesh using partial least squares (PLS) regression method. PLS regression, combining features from principal component analysis and multiple linear regression, is a multivariate technique with an ability to handle multicollinearity while simultaneously handling multiple dependent variables. We analyzed maternal and infant data from singletons (n = 14,506) born during a double-masked, cluster-randomized, placebo-controlled maternal vitamin A or β-carotene supplementation trial in rural northwest Bangladesh. PLS regression results identified numerous maternal factors (parity, age, early pregnancy MUAC, living standard index, years of education, number of antenatal care visits, preterm delivery and infant sex) significantly (p<0.001) associated with newborn size. Among them, preterm delivery had the largest negative influence on newborn size (Standardized β = -0.29 - -0.19; p<0.001). Scatter plots of the scores of first two PLS components also revealed an interaction between newborn sex and preterm delivery on birth size. PLS regression was found to be more parsimonious than both ordinary least squares regression and principal component regression. It also provided more stable estimates than the ordinary least squares regression and provided the effect measure of the covariates with greater accuracy as it accounts for the correlation among the covariates and outcomes. Therefore, PLS regression is recommended when either there are multiple outcome measurements in the same study, or the covariates are correlated, or both situations exist in a dataset.

  13. Modeling of temperature-induced near-infrared and low-field time-domain nuclear magnetic resonance spectral variation: chemometric prediction of limonene and water content in spray-dried delivery systems.

    PubMed

    Andrade, Letícia; Farhat, Imad A; Aeberhardt, Kasia; Bro, Rasmus; Engelsen, Søren Balling

    2009-02-01

    The influence of temperature on near-infrared (NIR) and nuclear magnetic resonance (NMR) spectroscopy complicates the industrial applications of both spectroscopic methods. The focus of this study is to analyze and model the effect of temperature variation on NIR spectra and NMR relaxation data. Different multivariate methods were tested for constructing robust prediction models based on NIR and NMR data acquired at various temperatures. Data were acquired on model spray-dried limonene systems at five temperatures in the range from 20 °C to 60 °C and partial least squares (PLS) regression models were computed for limonene and water predictions. The predictive ability of the models computed on the NIR spectra (acquired at various temperatures) improved significantly when data were preprocessed using extended inverted signal correction (EISC). The average PLS regression prediction error was reduced to 0.2%, corresponding to 1.9% and 3.4% of the full range of limonene and water reference values, respectively. The removal of variation induced by temperature prior to calibration, by direct orthogonalization (DO), slightly enhanced the predictive ability of the models based on NMR data. Bilinear PLS models, with implicit inclusion of the temperature, enabled limonene and water predictions by NMR with an error of 0.3% (corresponding to 2.8% and 7.0% of the full range of limonene and water). For NMR, and in contrast to the NIR results, modeling the data using multi-way N-PLS improved the models' performance. N-PLS models, in which temperature was included as an extra variable, enabled more accurate prediction, especially for limonene (prediction error was reduced to 0.2%). Overall, this study proved that it is possible to develop models for limonene and water content prediction based on NIR and NMR data, independent of the measurement temperature.

  14. Prediction of olive oil sensory descriptors using instrumental data fusion and partial least squares (PLS) regression.

    PubMed

    Borràs, Eva; Ferré, Joan; Boqué, Ricard; Mestres, Montserrat; Aceña, Laura; Calvo, Angels; Busto, Olga

    2016-08-01

    Headspace-Mass Spectrometry (HS-MS), Fourier Transform Mid-Infrared spectroscopy (FT-MIR) and UV-Visible spectrophotometry (UV-vis) instrumental responses have been combined to predict virgin olive oil sensory descriptors. 343 olive oil samples analyzed during four consecutive harvests (2010-2014) were used to build multivariate calibration models using partial least squares (PLS) regression. The reference values of the sensory attributes were provided by expert assessors from an official taste panel. The instrumental data were modeled individually and also using data fusion approaches. The use of fused data with both low- and mid-level of abstraction improved PLS predictions for all the olive oil descriptors. The best PLS models were obtained for two positive attributes (fruity and bitter) and two defective descriptors (fusty and musty), all of them using data fusion of MS and MIR spectral fingerprints. Although good predictions were not obtained for some sensory descriptors, the results are encouraging, especially considering that the legal categorization of virgin olive oils only requires the determination of fruity and defective descriptors. Copyright © 2016 Elsevier B.V. All rights reserved.
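    Low-level data fusion is usually understood as concatenating the preprocessed signals of several instruments for the same samples into one wide matrix before calibration. The sketch below illustrates that idea only. The block names, the tiny data values and the choice of autoscaling (mean-centering and scaling each variable to unit standard deviation) are assumptions for illustration; the abstract does not describe the preprocessing actually used.

```python
import statistics


def autoscale(block):
    """Center each column and scale it to unit sample standard deviation.

    Assumes no column is constant (a constant column would give a zero divisor).
    """
    columns = list(zip(*block))
    means = [statistics.mean(c) for c in columns]
    sds = [statistics.stdev(c) for c in columns]
    return [[(v - m) / s for v, m, s in zip(row, means, sds)] for row in block]


def low_level_fusion(*blocks):
    """Autoscale every instrument block, then join the rows sample by sample."""
    scaled = [autoscale(block) for block in blocks]
    return [sum(rows, []) for rows in zip(*scaled)]


# Hypothetical responses for three samples: two MS variables and one MIR variable.
ms_block = [[10.0, 200.0], [20.0, 100.0], [30.0, 300.0]]
mir_block = [[0.1], [0.3], [0.2]]

fused = low_level_fusion(ms_block, mir_block)  # 3 samples x 3 fused variables
```

    Mid-level fusion follows the same pattern, except that features extracted from each block (for example, scores from a per-block decomposition) are concatenated instead of the signals themselves.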

  15. Thermal-to-visible face recognition using partial least squares.

    PubMed

    Hu, Shuowen; Choi, Jonghyun; Chan, Alex L; Schwartz, William Robson

    2015-03-01

    Although visible face recognition has been an active area of research for several decades, cross-modal face recognition has only been explored by the biometrics community relatively recently. Thermal-to-visible face recognition is one of the most difficult cross-modal face recognition challenges, because of the difference in phenomenology between the thermal and visible imaging modalities. We address the cross-modal recognition problem using a partial least squares (PLS) regression-based approach consisting of preprocessing, feature extraction, and PLS model building. The preprocessing and feature extraction stages are designed to reduce the modality gap between the thermal and visible facial signatures, and facilitate the subsequent one-vs-all PLS-based model building. We incorporate multi-modal information into the PLS model building stage to enhance cross-modal recognition. The performance of the proposed recognition algorithm is evaluated on three challenging datasets containing visible and thermal imagery acquired under different experimental scenarios: time-lapse, physical tasks, mental tasks, and subject-to-camera range. These scenarios represent difficult challenges relevant to real-world applications. We demonstrate that the proposed method performs robustly for the examined scenarios.

  16. Robust Ultraviolet-Visible (UV-Vis) Partial Least-Squares (PLS) Models for Tannin Quantification in Red Wine.

    PubMed

    Aleixandre-Tudo, José Luis; Nieuwoudt, Helené; Aleixandre, José Luis; Du Toit, Wessel J

    2015-02-04

    The validation of ultraviolet-visible (UV-vis) spectroscopy combined with partial least-squares (PLS) regression to quantify red wine tannins is reported. The methylcellulose precipitable (MCP) tannin assay and the bovine serum albumin (BSA) tannin assay were used as reference methods. To take the high variability of wine tannins into account when the calibration models were built, a diverse data set was collected from samples of South African red wines that consisted of 18 different cultivars, from regions spanning the wine grape-growing areas of South Africa with their various sites, climates, and soils, ranging in vintage from 2000 to 2012. A total of 240 wine samples were analyzed, and these were divided into a calibration set (n = 120) and a validation set (n = 120) to evaluate the predictive ability of the models. To test the robustness of the PLS calibration models, the predictive ability of the classifying variables cultivar, vintage year, and experimental versus commercial wines was also tested. In general, the statistics obtained when BSA was used as a reference method were slightly better than those obtained with MCP. Despite this, the MCP tannin assay should also be considered as a valid reference method for developing PLS calibrations. The best calibration statistics for the prediction of new samples were coefficient of correlation (R² val) = 0.89, root mean standard error of prediction (RMSEP) = 0.16, and residual predictive deviation (RPD) = 3.49 for MCP and R² val = 0.93, RMSEP = 0.08, and RPD = 4.07 for BSA, when only the UV region (260-310 nm) was selected, which also led to a faster analysis time. In addition, a difference in the results obtained when the predictive ability of the classifying variables vintage, cultivar, or commercial versus experimental wines was studied suggests that tannin composition is highly affected by many factors. This study also discusses the correlations in tannin values between the methylcellulose and protein precipitation methods.
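    RMSEP and RPD are related: RPD is commonly computed as the standard deviation of the reference values in the validation set divided by RMSEP. The sketch below shows that calculation on hypothetical tannin values. Using the sample standard deviation (n - 1 denominator) is an assumption, since conventions vary and the abstract does not state which one was applied.

```python
import math
import statistics


def rmsep(reference, predicted):
    """Root-mean-square error of prediction over a validation set."""
    squared = [(r - p) ** 2 for r, p in zip(reference, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def rpd(reference, predicted):
    """Residual predictive deviation: SD of the reference values / RMSEP."""
    return statistics.stdev(reference) / rmsep(reference, predicted)


# Hypothetical validation-set values (reference assay vs. model prediction).
reference = [1.0, 2.0, 3.0, 4.0, 5.0]
predicted = [1.5, 1.5, 3.0, 4.5, 4.5]

validation_error = rmsep(reference, predicted)  # sqrt(1.0 / 5)
validation_rpd = rpd(reference, predicted)      # sqrt(2.5) / sqrt(0.2)
```

    Because RPD scales the prediction error by the spread of the reference values, it allows models built against reference methods with different value ranges, such as the two tannin assays here, to be compared on a common footing.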

  17. Visible/near-infrared spectroscopy to predict water holding capacity in broiler breast meat

    USDA-ARS's Scientific Manuscript database

    Visible/Near-infrared spectroscopy (Vis/NIRS) was examined as a tool for rapidly determining water holding capacity (WHC) in broiler breast meat. Both partial least squares (PLS) and principal component analysis (PCA) models were developed to relate Vis/NIRS spectra of 85 broiler breast meat sample...

  18. Selecting the optimum number of partial least squares components for the calibration of attenuated total reflectance-mid-infrared spectra of undesigned kerosene samples.

    PubMed

    Gómez-Carracedo, M P; Andrade, J M; Rutledge, D N; Faber, N M

    2007-03-07

    Selecting the correct dimensionality is critical for obtaining partial least squares (PLS) regression models with good predictive ability. Although calibration and validation sets are best established using experimental designs, industrial laboratories cannot afford such an approach. Typically, samples are collected in an (formally) undesigned way, spread over time and their measurements are included in routine measurement processes. This makes it hard to evaluate PLS model dimensionality. In this paper, classical criteria (leave-one-out cross-validation and adjusted Wold's criterion) are compared to recently proposed alternatives (smoothed PLS-PoLiSh and a randomization test) to seek out the optimum dimensionality of PLS models. Kerosene (jet fuel) samples were measured by attenuated total reflectance-mid-IR spectrometry and their spectra were used to predict eight important properties determined using reference methods that are time-consuming and prone to analytical errors. The alternative methods were shown to give reliable dimensionality predictions when compared to external validation. By contrast, the simpler methods seemed to be largely affected by the largest changes in the modeling capabilities of the first components.
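    Of the criteria compared above, leave-one-out cross-validation is the simplest to sketch: for each candidate number of components, every sample is left out in turn, a model is fitted on the rest, and the squared prediction errors are summed (PRESS). The code below is a minimal pure-Python PLS1 (single response, NIPALS-style deflation) written for illustration; it is not the authors' implementation, the function names are hypothetical, and the data are synthetic. The other criteria named in the abstract are not shown.

```python
import math


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def fit_pls1(X, y, n_components):
    """Fit PLS1 by successive deflation; returns means and (w, p, q) per component."""
    n, m = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(m)]
    y_mean = sum(y) / n
    E = [[row[j] - x_mean[j] for j in range(m)] for row in X]  # centered X
    f = [v - y_mean for v in y]                                # centered y
    components = []
    for _ in range(n_components):
        w = [dot([row[j] for row in E], f) for j in range(m)]  # weights ~ X'y
        norm = math.sqrt(dot(w, w))
        if norm < 1e-12:  # nothing left to model
            break
        w = [v / norm for v in w]
        t = [dot(row, w) for row in E]                         # scores
        tt = dot(t, t)
        p = [dot([row[j] for row in E], t) / tt for j in range(m)]  # X loadings
        q = dot(f, t) / tt                                     # y loading
        E = [[row[j] - t[i] * p[j] for j in range(m)] for i, row in enumerate(E)]
        f = [f[i] - q * t[i] for i in range(n)]
        components.append((w, p, q))
    return x_mean, y_mean, components


def predict_pls1(model, x):
    """Predict one sample by repeating the deflation steps on its centered row."""
    x_mean, y_mean, components = model
    e = [v - mu for v, mu in zip(x, x_mean)]
    y_hat = y_mean
    for w, p, q in components:
        t = dot(e, w)
        y_hat += q * t
        e = [ej - t * pj for ej, pj in zip(e, p)]
    return y_hat


def loo_press(X, y, n_components):
    """Leave-one-out PRESS for a given number of components."""
    press = 0.0
    for i in range(len(X)):
        model = fit_pls1(X[:i] + X[i + 1:], y[:i] + y[i + 1:], n_components)
        press += (y[i] - predict_pls1(model, X[i])) ** 2
    return press


# Synthetic data: y = 3*x1 - 2*x2 exactly, so two components should suffice.
X = [[1.0, 0.0], [0.0, 1.0], [2.0, 1.0], [1.0, 3.0], [3.0, 2.0], [4.0, 4.0]]
y = [3.0 * a - 2.0 * b for a, b in X]

press_by_components = {a: loo_press(X, y, a) for a in (1, 2)}
best = min(press_by_components, key=press_by_components.get)
```

    On real, noisy spectra the PRESS curve rarely has such a clear minimum, which is one reason additional criteria such as those compared in this record are of interest.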

  19. Rapid and simultaneous analysis of five alkaloids in four parts of Coptidis Rhizoma by near-infrared spectroscopy

    NASA Astrophysics Data System (ADS)

    Jintao, Xue; Yufei, Liu; Liming, Ye; Chunyan, Li; Quanwei, Yang; Weiying, Wang; Yun, Jing; Minxiang, Zhang; Peng, Li

    2018-01-01

    Near-Infrared Spectroscopy (NIRS) was first used to develop a method for rapid and simultaneous determination of 5 active alkaloids (berberine, coptisine, palmatine, epiberberine and jatrorrhizine) in 4 parts (rhizome, fibrous root, stem and leaf) of Coptidis Rhizoma. A total of 100 samples from 4 main places of origin were collected and studied. With HPLC analysis values as calibration reference, the quantitative analysis of 5 marker components was performed by two different modeling methods, partial least-squares (PLS) regression as linear regression and artificial neural networks (ANN) as non-linear regression. The results indicated that the 2 types of models established were robust, accurate and repeatable for the five active alkaloids, and the ANN models were more suitable for the determination of berberine, coptisine and palmatine while the PLS model was more suitable for the analysis of epiberberine and jatrorrhizine. The performance of the optimal models was as follows: the correlation coefficient (R) for berberine, coptisine, palmatine, epiberberine and jatrorrhizine was 0.9958, 0.9956, 0.9959, 0.9963 and 0.9923, respectively; the root mean square error of validation (RMSEP) was 0.5093, 0.0578, 0.0443, 0.0563 and 0.0090, respectively. Furthermore, for the comprehensive exploitation and utilization of the plant resource of Coptidis Rhizoma, the established NIR models were used to analyze the content of 5 active alkaloids in 4 parts of Coptidis Rhizoma and 4 main places of origin. This work demonstrated that NIRS may be a promising method as routine screening for off-line fast analysis or on-line quality assessment of traditional Chinese medicine (TCM).

  20. A comparative study of volatile components in Dianhong teas from fresh leaves of four tea cultivars by using chromatography-mass spectrometry, multivariate data analysis, and descriptive sensory analysis.

    PubMed

    Wang, Chao; Zhang, Chenxia; Kong, Yawen; Peng, Xiaopei; Li, Changwen; Liu, Shunhang; Du, Liping; Xiao, Dongguang; Xu, Yongquan

    2017-10-01

    Dianhong teas produced from fresh leaves of different tea cultivars (YK is Yunkang No. 10, XY is Xueya 100, CY is Changyebaihao, SS is Shishengmiao) were compared in terms of volatile compounds and descriptive sensory analysis. A total of 73 volatile compounds in 16 tea samples were tentatively identified. YK, XY, CY, and SS contained 55, 53, 49, and 51 volatile compounds, respectively. Partial least squares-discriminant analysis (PLS-DA) and hierarchical cluster analysis (HCA) were used to classify the samples, and 40 key components were selected based on variable importance in the projection. Moreover, 11 flavor attributes, namely, floral, fruity, grass/green, woody, sweet, roasty, caramel, mellow and thick, bitter, astringent, and sweet aftertaste were identified through descriptive sensory analysis (DSA). In general, innate differences among the tea varieties significantly affected the intensities of most of the key sensory attributes of Dianhong teas, possibly because of the different amounts of aroma-active and taste components in Dianhong teas. Copyright © 2017 Elsevier Ltd. All rights reserved.
