ERIC Educational Resources Information Center
Loehlin, John C.; Neiderhiser, Jenae M.; Reiss, David
2005-01-01
Adolescent adjustment measures may be related to each other and to the social environment in various ways. Are these relationships similar in genetic and environmental sources of covariation, or different? A multivariate behaviorgenetic analysis was made of 6 adjustment and 3 treatment composites from the study Nonshared Environment in Adolescent…
Dehesh, Tania; Zare, Najaf; Ayatollahi, Seyyed Mohammad Taghi
2015-01-01
Background. Univariate meta-analysis (UM) procedure, as a technique that provides a single overall result, has become increasingly popular. Neglecting the existence of other concomitant covariates in the models leads to loss of treatment efficiency. Our aim was proposing four new approximation approaches for the covariance matrix of the coefficients, which is not readily available for the multivariate generalized least square (MGLS) method as a multivariate meta-analysis approach. Methods. We evaluated the efficiency of four new approaches including zero correlation (ZC), common correlation (CC), estimated correlation (EC), and multivariate multilevel correlation (MMC) on the estimation bias, mean square error (MSE), and 95% probability coverage of the confidence interval (CI) in the synthesis of Cox proportional hazard models coefficients in a simulation study. Result. Comparing the results of the simulation study on the MSE, bias, and CI of the estimated coefficients indicated that MMC approach was the most accurate procedure compared to EC, CC, and ZC procedures. The precision ranking of the four approaches according to all above settings was MMC ≥ EC ≥ CC ≥ ZC. Conclusion. This study highlights advantages of MGLS meta-analysis on UM approach. The results suggested the use of MMC procedure to overcome the lack of information for having a complete covariance matrix of the coefficients. PMID:26413142
Multivariate Analysis in Metabolomics
Worley, Bradley; Powers, Robert
2015-01-01
Metabolomics aims to provide a global snapshot of all small-molecule metabolites in cells and biological fluids, free of observational biases inherent to more focused studies of metabolism. However, the staggeringly high information content of such global analyses introduces a challenge of its own; efficiently forming biologically relevant conclusions from any given metabolomics dataset indeed requires specialized forms of data analysis. One approach to finding meaning in metabolomics datasets involves multivariate analysis (MVA) methods such as principal component analysis (PCA) and partial least squares projection to latent structures (PLS), where spectral features contributing most to variation or separation are identified for further analysis. However, as with any mathematical treatment, these methods are not a panacea; this review discusses the use of multivariate analysis for metabolomics, as well as common pitfalls and misconceptions. PMID:26078916
Adjustment of geochemical background by robust multivariate statistics
Zhou, D.
1985-01-01
Conventional analyses of exploration geochemical data assume that the background is a constant or slowly changing value, equivalent to a plane or a smoothly curved surface. However, it is better to regard the geochemical background as a rugged surface, varying with changes in geology and environment. This rugged surface can be estimated from observed geological, geochemical and environmental properties by using multivariate statistics. A method of background adjustment was developed and applied to groundwater and stream sediment reconnaissance data collected from the Hot Springs Quadrangle, South Dakota, as part of the National Uranium Resource Evaluation (NURE) program. Source-rock lithology appears to be a dominant factor controlling the chemical composition of groundwater or stream sediments. The most efficacious adjustment procedure is to regress uranium concentration on selected geochemical and environmental variables for each lithologic unit, and then to delineate anomalies by a common threshold set as a multiple of the standard deviation of the combined residuals. Robust versions of regression and RQ-mode principal components analysis techniques were used rather than ordinary techniques to guard against distortion caused by outliers Anomalies delineated by this background adjustment procedure correspond with uranium prospects much better than do anomalies delineated by conventional procedures. The procedure should be applicable to geochemical exploration at different scales for other metals. ?? 1985.
Method of multivariate spectral analysis
Keenan, Michael R.; Kotula, Paul G.
2004-01-06
A method of determining the properties of a sample from measured spectral data collected from the sample by performing a multivariate spectral analysis. The method can include: generating a two-dimensional matrix A containing measured spectral data; providing a weighted spectral data matrix D by performing a weighting operation on matrix A; factoring D into the product of two matrices, C and S.sup.T, by performing a constrained alternating least-squares analysis of D=CS.sup.T, where C is a concentration intensity matrix and S is a spectral shapes matrix; unweighting C and S by applying the inverse of the weighting used previously; and determining the properties of the sample by inspecting C and S. This method can be used to analyze X-ray spectral data generated by operating a Scanning Electron Microscope (SEM) with an attached Energy Dispersive Spectrometer (EDS).
Estimation of Data Uncertainty Adjustment Parameters for Multivariate Earth Rotation Series
NASA Technical Reports Server (NTRS)
Sung, Li-yu; Steppe, J. Alan
1994-01-01
We have developed a maximum likelihood method to estimate a set of data uncertainty adjustment parameters, iccluding scaling factors and additive variances and covariances, for multivariate Earth rotation series.
Multivariate data analysis of proteome data.
Engkilde, Kåre; Jacobsen, Susanne; Søndergaard, Ib
2007-01-01
We present the background for multivariate data analysis on proteomics data with a hands-on section on how to transfer data between different software packages. The techniques can also be used for other biological and biochemical problems in which structures have to be found in a large amount of data. Digitalization of the 2D gels, analysis using image processing software, transfer of data, multivariate data analysis, interpretation of the results, and finally we return to biology. PMID:17093312
Multivariate Longitudinal Analysis with Bivariate Correlation Test.
Adjakossa, Eric Houngla; Sadissou, Ibrahim; Hounkonnou, Mahouton Norbert; Nuel, Gregory
2016-01-01
In the context of multivariate multilevel data analysis, this paper focuses on the multivariate linear mixed-effects model, including all the correlations between the random effects when the dimensional residual terms are assumed uncorrelated. Using the EM algorithm, we suggest more general expressions of the model's parameters estimators. These estimators can be used in the framework of the multivariate longitudinal data analysis as well as in the more general context of the analysis of multivariate multilevel data. By using a likelihood ratio test, we test the significance of the correlations between the random effects of two dependent variables of the model, in order to investigate whether or not it is useful to model these dependent variables jointly. Simulation studies are done to assess both the parameter recovery performance of the EM estimators and the power of the test. Using two empirical data sets which are of longitudinal multivariate type and multivariate multilevel type, respectively, the usefulness of the test is illustrated. PMID:27537692
Multivariate Longitudinal Analysis with Bivariate Correlation Test
Adjakossa, Eric Houngla; Sadissou, Ibrahim; Hounkonnou, Mahouton Norbert; Nuel, Gregory
2016-01-01
In the context of multivariate multilevel data analysis, this paper focuses on the multivariate linear mixed-effects model, including all the correlations between the random effects when the dimensional residual terms are assumed uncorrelated. Using the EM algorithm, we suggest more general expressions of the model’s parameters estimators. These estimators can be used in the framework of the multivariate longitudinal data analysis as well as in the more general context of the analysis of multivariate multilevel data. By using a likelihood ratio test, we test the significance of the correlations between the random effects of two dependent variables of the model, in order to investigate whether or not it is useful to model these dependent variables jointly. Simulation studies are done to assess both the parameter recovery performance of the EM estimators and the power of the test. Using two empirical data sets which are of longitudinal multivariate type and multivariate multilevel type, respectively, the usefulness of the test is illustrated. PMID:27537692
Multivariate analysis: A statistical approach for computations
NASA Astrophysics Data System (ADS)
Michu, Sachin; Kaushik, Vandana
2014-10-01
Multivariate analysis is a type of multivariate statistical approach commonly used in, automotive diagnosis, education evaluating clusters in finance etc and more recently in the health-related professions. The objective of the paper is to provide a detailed exploratory discussion about factor analysis (FA) in image retrieval method and correlation analysis (CA) of network traffic. Image retrieval methods aim to retrieve relevant images from a collected database, based on their content. The problem is made more difficult due to the high dimension of the variable space in which the images are represented. Multivariate correlation analysis proposes an anomaly detection and analysis method based on the correlation coefficient matrix. Anomaly behaviors in the network include the various attacks on the network like DDOs attacks and network scanning.
Are propensity scores really superior to standard multivariable analysis?
Biondi-Zoccai, Giuseppe; Romagnoli, Enrico; Agostoni, Pierfrancesco; Capodanno, Davide; Castagno, Davide; D'Ascenzo, Fabrizio; Sangiorgi, Giuseppe; Modena, Maria Grazia
2011-09-01
Clinicians often face difficult decisions despite the lack of evidence from randomized trials. Thus, clinical evidence is often shaped by non-randomized studies exploiting multivariable approaches to limit the extent of confounding. Since their introduction, propensity scores have been used more and more frequently to estimate relevant clinical effects adjusting for established confounders, especially in small datasets. However, debate persists on their real usefulness in comparison to standard multivariable approaches such as logistic regression and Cox proportional hazard analysis. This holds even truer in light of key quantitative developments such as bootstrap and Bayesian methods. This qualitative review aims to provide a concise and practical guide to choose between propensity scores and standard multivariable analysis, emphasizing strengths and weaknesses of both approaches. PMID:21616172
Biological Sequence Analysis with Multivariate String Kernels.
Kuksa, Pavel P
2013-03-01
String kernel-based machine learning methods have yielded great success in practical tasks of structured/sequential data analysis. They often exhibit state-of-the-art performance on many practical tasks of sequence analysis such as biological sequence classification, remote homology detection, or protein superfamily and fold prediction. However, typical string kernel methods rely on analysis of discrete one-dimensional (1D) string data (e.g., DNA or amino acid sequences). In this work we address the multi-class biological sequence classification problems using multivariate representations in the form of sequences of features vectors (as in biological sequence profiles, or sequences of individual amino acid physico-chemical descriptors) and a class of multivariate string kernels that exploit these representations. On a number of protein sequence classification tasks proposed multivariate representations and kernels show significant 15-20\\% improvements compared to existing state-of-the-art sequence classification methods. PMID:23509193
Multivariate Analysis of Ladle Vibration
NASA Astrophysics Data System (ADS)
Yenus, Jaefer; Brooks, Geoffrey; Dunn, Michelle
2016-05-01
The homogeneity of composition and uniformity of temperature of the steel melt before it is transferred to the tundish are crucial in making high-quality steel product. The homogenization process is performed by stirring the melt using inert gas in ladles. Continuous monitoring of this process is important to make sure the action of stirring is constant throughout the ladle. Currently, the stirring process is monitored by process operators who largely rely on visual and acoustic phenomena from the ladle. However, due to lack of measurable signals, the accuracy and suitability of this manual monitoring are problematic. The actual flow of argon gas to the ladle may not be same as the flow gage reading due to leakage along the gas line components. As a result, the actual degree of stirring may not be correctly known. Various researchers have used one-dimensional vibration, and sound and image signals measured from the ladle to predict the degree of stirring inside. They developed online sensors which are indeed to monitor the online stirring phenomena. In this investigation, triaxial vibration signals have been measured from a cold water model which is a model of an industrial ladle. Three flow rate ranges and varying bath heights were used to collect vibration signals. The Fast Fourier Transform was applied to the dataset before it has been analyzed using principal component analysis (PCA) and partial least squares (PLS). PCA was used to unveil the structure in the experimental data. PLS was mainly applied to predict the stirring from the vibration response. It was found that for each flow rate range considered in this study, the informative signals reside in different frequency ranges. The first latent variables in these frequency ranges explain more than 95 pct of the variation in the stirring process for the entire single layer and the double layer data collected from the cold model. PLS analysis in these identified frequency ranges demonstrated that the latent
Multivariate Analysis of Ladle Vibration
NASA Astrophysics Data System (ADS)
Yenus, Jaefer; Brooks, Geoffrey; Dunn, Michelle
2016-08-01
The homogeneity of composition and uniformity of temperature of the steel melt before it is transferred to the tundish are crucial in making high-quality steel product. The homogenization process is performed by stirring the melt using inert gas in ladles. Continuous monitoring of this process is important to make sure the action of stirring is constant throughout the ladle. Currently, the stirring process is monitored by process operators who largely rely on visual and acoustic phenomena from the ladle. However, due to lack of measurable signals, the accuracy and suitability of this manual monitoring are problematic. The actual flow of argon gas to the ladle may not be same as the flow gage reading due to leakage along the gas line components. As a result, the actual degree of stirring may not be correctly known. Various researchers have used one-dimensional vibration, and sound and image signals measured from the ladle to predict the degree of stirring inside. They developed online sensors which are indeed to monitor the online stirring phenomena. In this investigation, triaxial vibration signals have been measured from a cold water model which is a model of an industrial ladle. Three flow rate ranges and varying bath heights were used to collect vibration signals. The Fast Fourier Transform was applied to the dataset before it has been analyzed using principal component analysis (PCA) and partial least squares (PLS). PCA was used to unveil the structure in the experimental data. PLS was mainly applied to predict the stirring from the vibration response. It was found that for each flow rate range considered in this study, the informative signals reside in different frequency ranges. The first latent variables in these frequency ranges explain more than 95 pct of the variation in the stirring process for the entire single layer and the double layer data collected from the cold model. PLS analysis in these identified frequency ranges demonstrated that the latent
Multivariate multiscale entropy for brain consciousness analysis.
Ahmed, Mosabber Uddin; Li, Ling; Cao, Jianting; Mandic, Danilo P
2011-01-01
The recently introduced multiscale entropy (MSE) method accounts for long range correlations over multiple time scales and can therefore reveal the complexity of biological signals. The existing MSE algorithm deals with scalar time series whereas multivariate time series are common in experimental and biological systems. To that cause, in this paper the MSE method is extended to the multivariate case. This allows us to gain a greater insight into the complexity of the underlying signal generating system, producing multifaceted and more robust estimates than standard single channel MSE. Simulations on both synthetic data and brain consciousness analysis support the approach. PMID:22254434
Classical least squares multivariate spectral analysis
Haaland, David M.
2002-01-01
An improved classical least squares multivariate spectral analysis method that adds spectral shapes describing non-calibrated components and system effects (other than baseline corrections) present in the analyzed mixture to the prediction phase of the method. These improvements decrease or eliminate many of the restrictions to the CLS-type methods and greatly extend their capabilities, accuracy, and precision. One new application of PACLS includes the ability to accurately predict unknown sample concentrations when new unmodeled spectral components are present in the unknown samples. Other applications of PACLS include the incorporation of spectrometer drift into the quantitative multivariate model and the maintenance of a calibration on a drifting spectrometer. Finally, the ability of PACLS to transfer a multivariate model between spectrometers is demonstrated.
Hybrid least squares multivariate spectral analysis methods
Haaland, David M.
2002-01-01
A set of hybrid least squares multivariate spectral analysis methods in which spectral shapes of components or effects not present in the original calibration step are added in a following estimation or calibration step to improve the accuracy of the estimation of the amount of the original components in the sampled mixture. The "hybrid" method herein means a combination of an initial classical least squares analysis calibration step with subsequent analysis by an inverse multivariate analysis method. A "spectral shape" herein means normally the spectral shape of a non-calibrated chemical component in the sample mixture but can also mean the spectral shapes of other sources of spectral variation, including temperature drift, shifts between spectrometers, spectrometer drift, etc. The "shape" can be continuous, discontinuous, or even discrete points illustrative of the particular effect.
Hybrid least squares multivariate spectral analysis methods
Haaland, David M.
2004-03-23
A set of hybrid least squares multivariate spectral analysis methods in which spectral shapes of components or effects not present in the original calibration step are added in a following prediction or calibration step to improve the accuracy of the estimation of the amount of the original components in the sampled mixture. The hybrid method herein means a combination of an initial calibration step with subsequent analysis by an inverse multivariate analysis method. A spectral shape herein means normally the spectral shape of a non-calibrated chemical component in the sample mixture but can also mean the spectral shapes of other sources of spectral variation, including temperature drift, shifts between spectrometers, spectrometer drift, etc. The shape can be continuous, discontinuous, or even discrete points illustrative of the particular effect.
Damage detection using multivariate recurrence quantification analysis
NASA Astrophysics Data System (ADS)
Nichols, J. M.; Trickey, S. T.; Seaver, M.
2006-02-01
Recurrence-quantification analysis (RQA) has emerged as a useful tool for detecting subtle non-stationarities and/or changes in time-series data. Here, we extend the RQA analysis methods to multivariate observations and present a method by which the "length scale" parameter ɛ (the only parameter required for RQA) may be selected. We then apply the technique to the difficult engineering problem of damage detection. The structure considered is a finite element model of a rectangular steel plate where damage is represented as a cut in the plate, starting at one edge and extending from 0% to 25% of the plate width in 5% increments. Time series, recorded at nine separate locations on the structure, are used to reconstruct the phase space of the system's dynamics and subsequently generate the multivariate recurrence (and cross-recurrence) plots. Multivariate RQA is then used to detect damage-induced changes to the structural dynamics. These results are then compared with shifts in the plate's natural frequencies. Two of the RQA-based features are found to be more sensitive to damage than are the plate's frequencies.
Multivariate streamflow forecasting using independent component analysis
NASA Astrophysics Data System (ADS)
Westra, Seth; Sharma, Ashish; Brown, Casey; Lall, Upmanu
2008-02-01
Seasonal forecasting of streamflow provides many benefits to society, by improving our ability to plan and adapt to changing water supplies. A common approach to developing these forecasts is to use statistical methods that link a set of predictors representing climate state as it relates to historical streamflow, and then using this model to project streamflow one or more seasons in advance based on current or a projected climate state. We present an approach for forecasting multivariate time series using independent component analysis (ICA) to transform the multivariate data to a set of univariate time series that are mutually independent, thereby allowing for the much broader class of univariate models to provide seasonal forecasts for each transformed series. Uncertainty is incorporated by bootstrapping the error component of each univariate model so that the probability distribution of the errors is maintained. Although all analyses are performed on univariate time series, the spatial dependence of the streamflow is captured by applying the inverse ICA transform to the predicted univariate series. We demonstrate the technique on a multivariate streamflow data set in Colombia, South America, by comparing the results to a range of other commonly used forecasting methods. The results show that the ICA-based technique is significantly better at representing spatial dependence, while not resulting in any loss of ability in capturing temporal dependence. As such, the ICA-based technique would be expected to yield considerable advantages when used in a probabilistic setting to manage large reservoir systems with multiple inflows or data collection points.
Multivariate meta-analysis using individual participant data.
Riley, R D; Price, M J; Jackson, D; Wardle, M; Gueyffier, F; Wang, J; Staessen, J A; White, I R
2015-06-01
When combining results across related studies, a multivariate meta-analysis allows the joint synthesis of correlated effect estimates from multiple outcomes. Joint synthesis can improve efficiency over separate univariate syntheses, may reduce selective outcome reporting biases, and enables joint inferences across the outcomes. A common issue is that within-study correlations needed to fit the multivariate model are unknown from published reports. However, provision of individual participant data (IPD) allows them to be calculated directly. Here, we illustrate how to use IPD to estimate within-study correlations, using a joint linear regression for multiple continuous outcomes and bootstrapping methods for binary, survival and mixed outcomes. In a meta-analysis of 10 hypertension trials, we then show how these methods enable multivariate meta-analysis to address novel clinical questions about continuous, survival and binary outcomes; treatment-covariate interactions; adjusted risk/prognostic factor effects; longitudinal data; prognostic and multiparameter models; and multiple treatment comparisons. Both frequentist and Bayesian approaches are applied, with example software code provided to derive within-study correlations and to fit the models. PMID:26099484
COSIMA data analysis using multivariate techniques
NASA Astrophysics Data System (ADS)
Silén, J.; Cottin, H.; Hilchenbach, M.; Kissel, J.; Lehto, H.; Siljeström, S.; Varmuza, K.
2015-02-01
We describe how to use multivariate analysis of complex TOF-SIMS (time-of-flight secondary ion mass spectrometry) spectra by introducing the method of random projections. The technique allows us to do full clustering and classification of the measured mass spectra. In this paper we use the tool for classification purposes. The presentation describes calibration experiments of 19 minerals on Ag and Au substrates using positive mode ion spectra. The discrimination between individual minerals gives a cross-validation Cohen κ for classification of typically about 80%. We intend to use the method as a fast tool to deduce a qualitative similarity of measurements.
Multivariate Analysis of Genotype-Phenotype Association.
Mitteroecker, Philipp; Cheverud, James M; Pavlicev, Mihaela
2016-04-01
With the advent of modern imaging and measurement technology, complex phenotypes are increasingly represented by large numbers of measurements, which may not bear biological meaning one by one. For such multivariate phenotypes, studying the pairwise associations between all measurements and all alleles is highly inefficient and prevents insight into the genetic pattern underlying the observed phenotypes. We present a new method for identifying patterns of allelic variation (genetic latent variables) that are maximally associated-in terms of effect size-with patterns of phenotypic variation (phenotypic latent variables). This multivariate genotype-phenotype mapping (MGP) separates phenotypic features under strong genetic control from less genetically determined features and thus permits an analysis of the multivariate structure of genotype-phenotype association, including its dimensionality and the clustering of genetic and phenotypic variables within this association. Different variants of MGP maximize different measures of genotype-phenotype association: genetic effect, genetic variance, or heritability. In an application to a mouse sample, scored for 353 SNPs and 11 phenotypic traits, the first dimension of genetic and phenotypic latent variables accounted for >70% of genetic variation present in all 11 measurements; 43% of variation in this phenotypic pattern was explained by the corresponding genetic latent variable. The first three dimensions together sufficed to account for almost 90% of genetic variation in the measurements and for all the interpretable genotype-phenotype association. Each dimension can be tested as a whole against the hypothesis of no association, thereby reducing the number of statistical tests from 7766 to 3-the maximal number of meaningful independent tests. Important alleles can be selected based on their effect size (additive or nonadditive effect on the phenotypic latent variable). This low dimensionality of the genotype-phenotype map
ERIC Educational Resources Information Center
Siman-Tov, Ayelet; Kaniel, Shlomo
2011-01-01
The research validates a multivariate model that predicts parental adjustment to coping successfully with an autistic child. The model comprises four elements: parental stress, parental resources, parental adjustment and the child's autism symptoms. 176 parents of children aged between 6 to 16 diagnosed with PDD answered several questionnaires…
Augmented classical least squares multivariate spectral analysis
Haaland, David M.; Melgaard, David K.
2004-02-03
A method of multivariate spectral analysis, termed augmented classical least squares (ACLS), provides an improved CLS calibration model when unmodeled sources of spectral variation are contained in a calibration sample set. The ACLS methods use information derived from component or spectral residuals during the CLS calibration to provide an improved calibration-augmented CLS model. The ACLS methods are based on CLS so that they retain the qualitative benefits of CLS, yet they have the flexibility of PLS and other hybrid techniques in that they can define a prediction model even with unmodeled sources of spectral variation that are not explicitly included in the calibration model. The unmodeled sources of spectral variation may be unknown constituents, constituents with unknown concentrations, nonlinear responses, non-uniform and correlated errors, or other sources of spectral variation that are present in the calibration sample spectra. Also, since the various ACLS methods are based on CLS, they can incorporate the new prediction-augmented CLS (PACLS) method of updating the prediction model for new sources of spectral variation contained in the prediction sample set without having to return to the calibration process. The ACLS methods can also be applied to alternating least squares models. The ACLS methods can be applied to all types of multivariate data.
Augmented Classical Least Squares Multivariate Spectral Analysis
Haaland, David M.; Melgaard, David K.
2005-01-11
A method of multivariate spectral analysis, termed augmented classical least squares (ACLS), provides an improved CLS calibration model when unmodeled sources of spectral variation are contained in a calibration sample set. The ACLS methods use information derived from component or spectral residuals during the CLS calibration to provide an improved calibration-augmented CLS model. The ACLS methods are based on CLS so that they retain the qualitative benefits of CLS, yet they have the flexibility of PLS and other hybrid techniques in that they can define a prediction model even with unmodeled sources of spectral variation that are not explicitly included in the calibration model. The unmodeled sources of spectral variation may be unknown constituents, constituents with unknown concentrations, nonlinear responses, non-uniform and correlated errors, or other sources of spectral variation that are present in the calibration sample spectra. Also, since the various ACLS methods are based on CLS, they can incorporate the new prediction-augmented CLS (PACLS) method of updating the prediction model for new sources of spectral variation contained in the prediction sample set without having to return to the calibration process. The ACLS methods can also be applied to alternating least squares models. The ACLS methods can be applied to all types of multivariate data.
Augmented Classical Least Squares Multivariate Spectral Analysis
Haaland, David M.; Melgaard, David K.
2005-07-26
A method of multivariate spectral analysis, termed augmented classical least squares (ACLS), provides an improved CLS calibration model when unmodeled sources of spectral variation are contained in a calibration sample set. The ACLS methods use information derived from component or spectral residuals during the CLS calibration to provide an improved calibration-augmented CLS model. The ACLS methods are based on CLS so that they retain the qualitative benefits of CLS, yet they have the flexibility of PLS and other hybrid techniques in that they can define a prediction model even with unmodeled sources of spectral variation that are not explicitly included in the calibration model. The unmodeled sources of spectral variation may be unknown constituents, constituents with unknown concentrations, nonlinear responses, non-uniform and correlated errors, or other sources of spectral variation that are present in the calibration sample spectra. Also, since the various ACLS methods are based on CLS, they can incorporate the new prediction-augmented CLS (PACLS) method of updating the prediction model for new sources of spectral variation contained in the prediction sample set without having to return to the calibration process. The ACLS methods can also be applied to alternating least squares models. The ACLS methods can be applied to all types of multivariate data.
Apparatus and system for multivariate spectral analysis
Keenan, Michael R.; Kotula, Paul G.
2003-06-24
An apparatus and system for determining the properties of a sample from measured spectral data collected from the sample by performing a method of multivariate spectral analysis. The method can include: generating a two-dimensional matrix A containing measured spectral data; providing a weighted spectral data matrix D by performing a weighting operation on matrix A; factoring D into the product of two matrices, C and S.sup.T, by performing a constrained alternating least-squares analysis of D=CS.sup.T, where C is a concentration intensity matrix and S is a spectral shapes matrix; unweighting C and S by applying the inverse of the weighting used previously; and determining the properties of the sample by inspecting C and S. This method can be used by a spectrum analyzer to process X-ray spectral data generated by a spectral analysis system that can include a Scanning Electron Microscope (SEM) with an Energy Dispersive Detector and Pulse Height Analyzer.
Multivariate statistical analysis of wildfires in Portugal
NASA Astrophysics Data System (ADS)
Costa, Ricardo; Caramelo, Liliana; Pereira, Mário
2013-04-01
Several studies demonstrate that wildfires in Portugal present high temporal and spatial variability as well as cluster behavior (Pereira et al., 2005, 2011). This study aims to contribute to the characterization of the fire regime in Portugal with the multivariate statistical analysis of the time series of number of fires and area burned in Portugal during the 1980 - 2009 period. The data used in the analysis is an extended version of the Rural Fire Portuguese Database (PRFD) (Pereira et al, 2011), provided by the National Forest Authority (Autoridade Florestal Nacional, AFN), the Portuguese Forest Service, which includes information for more than 500,000 fire records. There are many multiple advanced techniques for examining the relationships among multiple time series at the same time (e.g., canonical correlation analysis, principal components analysis, factor analysis, path analysis, multiple analyses of variance, clustering systems). This study compares and discusses the results obtained with these different techniques. Pereira, M.G., Trigo, R.M., DaCamara, C.C., Pereira, J.M.C., Leite, S.M., 2005: "Synoptic patterns associated with large summer forest fires in Portugal". Agricultural and Forest Meteorology. 129, 11-25. Pereira, M. G., Malamud, B. D., Trigo, R. M., and Alves, P. I.: The history and characteristics of the 1980-2005 Portuguese rural fire database, Nat. Hazards Earth Syst. Sci., 11, 3343-3358, doi:10.5194/nhess-11-3343-2011, 2011 This work is supported by European Union Funds (FEDER/COMPETE - Operational Competitiveness Programme) and by national funds (FCT - Portuguese Foundation for Science and Technology) under the project FCOMP-01-0124-FEDER-022692, the project FLAIR (PTDC/AAC-AMB/104702/2008) and the EU 7th Framework Program through FUME (contract number 243888).
Stellar populations in ω Centauri: a multivariate analysis
NASA Astrophysics Data System (ADS)
Fraix-Burnet, D.; Davoust, E.
2015-07-01
We have performed multivariate statistical analyses of photometric and chemical abundance parameters of three large samples of stars in the globular cluster ω Centauri. The statistical analysis of a sample of 735 stars based on seven chemical abundances with the method of Maximum Parsimony (cladistics) yields the most promising results: seven groups are found, distributed along three branches with distinct chemical, spatial and kinematical properties. A progressive chemical evolution can be traced from one group to the next, but also within groups, suggestive of an inhomogeneous chemical enrichment of the initial interstellar matter. The adjustment of stellar evolution models shows that the groups with metallicities [Fe/H] > -1.5 are Helium enriched, thus presumably of second generation. The spatial concentration of the groups increases with chemical evolution, except for two groups, which stand out in their other properties as well. The amplitude of rotation decreases with chemical evolution, except for two of the three metal-rich groups, which rotate fastest, as predicted by recent hydrodynamical simulations. The properties of the groups are interpreted in terms of star formation in gas clouds of different origins. In conclusion, our multivariate analysis has shown that metallicity alone cannot segregate the different populations of ω Centauri.
A Multivariate Analysis of Galaxy Cluster Properties
NASA Astrophysics Data System (ADS)
Ogle, P. M.; Djorgovski, S.
1993-05-01
We have assembled from the literature a data base on on 394 clusters of galaxies, with up to 16 parameters per cluster. They include optical and x-ray luminosities, x-ray temperatures, galaxy velocity dispersions, central galaxy and particle densities, optical and x-ray core radii and ellipticities, etc. In addition, derived quantities, such as the mass-to-light ratios and x-ray gas masses are included. Doubtful measurements have been identified, and deleted from the data base. Our goal is to explore the correlations between these parameters, and interpret them in the framework of our understanding of evolution of clusters and large-scale structure, such as the Gott-Rees scaling hierarchy. Among the simple, monovariate correlations we found, the most significant include those between the optical and x-ray luminosities, x-ray temperatures, cluster velocity dispersions, and central galaxy densities, in various mutual combinations. While some of these correlations have been discussed previously in the literature, generally smaller samples of objects have been used. We will also present the results of a multivariate statistical analysis of the data, including a principal component analysis (PCA). Such an approach has not been used previously for studies of cluster properties, even though it is much more powerful and complete than the simple monovariate techniques which are commonly employed. The observed correlations may lead to powerful constraints for theoretical models of formation and evolution of galaxy clusters. P.M.O. was supported by a Caltech graduate fellowship. S.D. acknowledges a partial support from the NASA contract NAS5-31348 and the NSF PYI award AST-9157412.
Multivariate singular spectrum analysis and the road to phase synchronization
NASA Astrophysics Data System (ADS)
Groth, Andreas; Ghil, Michael
2010-05-01
Singular spectrum analysis (SSA) and multivariate SSA (M-SSA) are based on the classical work of Kosambi (1943), Loeve (1945) and Karhunen (1946) and are closely related to principal component analysis. They have been introduced into information theory by Bertero, Pike and co-workers (1982, 1984) and into dynamical systems analysis by Broomhead and King (1986a,b). Ghil, Vautard and associates have applied SSA and M-SSA to the temporal and spatio-temporal analysis of short and noisy time series in climate dynamics and other fields in the geosciences since the late 1980s. M-SSA provides insight into the unknown or partially known dynamics of the underlying system by decomposing the delay-coordinate phase space of a given multivariate time series into a set of data-adaptive orthonormal components. These components can be classified essentially into trends, oscillatory patterns and noise, and allow one to reconstruct a robust "skeleton" of the dynamical system's structure. For an overview we refer to Ghil et al. (Rev. Geophys., 2002). In this talk, we present M-SSA in the context of synchronization analysis and illustrate its ability to unveil information about the mechanisms behind the adjustment of rhythms in coupled dynamical systems. The focus of the talk is on the special case of phase synchronization between coupled chaotic oscillators (Rosenblum et al., PRL, 1996). Several ways of measuring phase synchronization are in use, and the robust definition of a reasonable phase for each oscillator is critical in each of them. We illustrate here the advantages of M-SSA in the automatic identification of oscillatory modes and in drawing conclusions about the transition to phase synchronization. Without using any a priori definition of a suitable phase, we show that M-SSA is able to detect phase synchronization in a chain of coupled chaotic oscillators (Osipov et al., PRE, 1996). Recently, Muller et al. (PRE, 2005) and Allefeld et al. (Intl. J. Bif. Chaos, 2007) have
Multivariate analysis of environmental data for two hydrographic basins
Andrade, J.M.; Prada, D.; Muniategui, S.; Gonzalez, E.; Alonso, E. )
1992-02-01
A multivariate study (PCA Analysis and Cluster analysis) of two Spanish hydrographic basins (The Mandeo and Mero basins) was made to achieve reliable conclusions about their actual physico-chemical environmental situation. Two police-samples' are defined, their effects explained, and are introduced in Cluster analysis as a way to examine sample quality. The multivariate analysis shows different qualities in the two hydrographic basins.
Regional dissociated heterochrony in multivariate analysis.
Mitteroecker, P; Gunz, P; Weber, G W; Bookstein, F L
2004-12-01
Heterochrony, the classic framework to study ontogeny and phylogeny, in essence relies on a univariate concept of shape. Though principal component plots of multivariate shape data seem to resemble classical bivariate allometric plots, the language of heterochrony cannot be translated directly into general multivariate methodology. We simulate idealized multivariate ontogenetic trajectories and demonstrate their behavior in principal component plots in shape space and in size-shape space. The concept of "dissociation", which is conventionally regarded as a change in the relationship between shape change and size change, appears to be algebraically the same as regional dissociation - the variation of apparent heterochrony by region. Only if the trajectories of two related species lie along exactly the same path in shape space can the classic terminology of heterochrony apply so that pure dissociation of size change against shape change can be detected. We demonstrate a geometric morphometric approach to these issues using adult and subadult crania of 48 Pan paniscus and 47 P. troglodytes. On each specimen we digitized 47 landmarks and 144 semilandmarks on ridge curves and the external neurocranial surface. The relation between these two species' growth trajectories is too complex for a simple summary in terms of global heterochrony. PMID:15646279
Heavy flavor identification using multivariate analysis at H1
Pandurovic, Mila; Bozovic-Jelisavcic, Ivanka; Mudrinic, Mihajlo
2010-01-21
We discuss b quark identification in deep inelastic scattering of electron on proton at H1 by applying multivariate analysis method. Separation between heavy and light flavors can be further used to extract proton quark content.
Search for the top quark using multivariate analysis techniques
Bhat, P.C.; D0 Collaboration
1994-08-01
The D0 collaboration is developing top search strategies using multivariate analysis techniques. We report here on applications of the H-matrix method to the e{mu} channel and neural networks to the e+jets channel.
The statistical analysis of multivariate serological frequency data.
Reyment, Richard A
2005-11-01
Data occurring in the form of frequencies are common in genetics-for example, in serology. Examples are provided by the AB0 group, the Rhesus group, and also DNA data. The statistical analysis of tables of frequencies is carried out using the available methods of multivariate analysis with usually three principal aims. One of these is to seek meaningful relationships between the components of a data set, the second is to examine relationships between populations from which the data have been obtained, the third is to bring about a reduction in dimensionality. This latter aim is usually realized by means of bivariate scatter diagrams using scores computed from a multivariate analysis. The multivariate statistical analysis of tables of frequencies cannot safely be carried out by standard multivariate procedures because they represent compositions and are therefore embedded in simplex space, a subspace of full space. Appropriate procedures for simplex space are compared and contrasted with simple standard methods of multivariate analysis ("raw" principal component analysis). The study shows that the differences between a log-ratio model and a simple logarithmic transformation of proportions may not be very great, particularly as regards graphical ordinations, but important discrepancies do occur. The divergencies between logarithmically based analyses and raw data are, however, great. Published data on Rhesus alleles observed for Italian populations are used to exemplify the subject. PMID:16024067
Multivariate concentration determination using principal component regression with residual analysis
Keithley, Richard B.; Heien, Michael L.; Wightman, R. Mark
2009-01-01
Data analysis is an essential tenet of analytical chemistry, extending the possible information obtained from the measurement of chemical phenomena. Chemometric methods have grown considerably in recent years, but their wide use is hindered because some still consider them too complicated. The purpose of this review is to describe a multivariate chemometric method, principal component regression, in a simple manner from the point of view of an analytical chemist, to demonstrate the need for proper quality-control (QC) measures in multivariate analysis and to advocate the use of residuals as a proper QC method. PMID:20160977
Multivariate analysis of TLD orientation effects
Archer, B.R.; Bushong, S.C.; Thornby, J.I.
1980-07-01
The effect of orientation on extruded thermoluminescent dosimeters has been investigated. TLD's placed on the surface and within a phantom were exposed separately to five diagnostic beam qualities and to /sup 60/Co ..gamma.. rays. The resulting data were subjected to analysis of variance and examined for significant correlations. The response of dosimeters on the phantom surface varied with orientation and was energy dependent. In the phantom and with /sup 60/Co, no orientation effects were observed.
Multivariate Analysis of Solar Spectral Irradiance Measurements
NASA Technical Reports Server (NTRS)
Pilewskie, P.; Rabbette, M.
2001-01-01
Principal component analysis is used to characterize approximately 7000 downwelling solar irradiance spectra retrieved at the Southern Great Plains site during an Atmospheric Radiation Measurement (ARM) shortwave intensive operating period. This analysis technique has proven to be very effective in reducing a large set of variables into a much smaller set of independent variables while retaining the information content. It is used to determine the minimum number of parameters necessary to characterize atmospheric spectral irradiance or the dimensionality of atmospheric variability. It was found that well over 99% of the spectral information was contained in the first six mutually orthogonal linear combinations of the observed variables (flux at various wavelengths). Rotation of the principal components was effective in separating various components by their independent physical influences. The majority of the variability in the downwelling solar irradiance (380-1000 nm) was explained by the following fundamental atmospheric parameters (in order of their importance): cloud scattering, water vapor absorption, molecular scattering, and ozone absorption. In contrast to what has been proposed as a resolution to a clear-sky absorption anomaly, no unexpected gaseous absorption signature was found in any of the significant components.
Multivariate calibration applied to the quantitative analysis of infrared spectra
NASA Astrophysics Data System (ADS)
Haaland, David M.
1992-03-01
Multivariate calibration methods are very useful for improving the precision, accuracy, and reliability of quantitative spectral analyses. Spectroscopists can more effectively use these sophisticated statistical tools if they have a qualitative understanding of the techniques involved. A qualitative picture of the factor analysis multivariate calibration methods of partial least squares (PLS) and principal component regression (PCR) is presented using infrared calibrations based upon spectra of phosphosilicate glass thin films on silicon wafers. Comparisons of the relative prediction abilities of four different multivariate calibration methods are given based on Monte Carlo simulations of spectral calibration and prediction data. The success of multivariate spectral calibrations is demonstrated for several quantitative infrared studies. The infrared absorption and emission spectra of thin-film dielectrics used in the manufacture of microelectronic devices demonstrate rapid, nondestructive at-line and in- situ analyses using PLS calibrations. Finally, the application of multivariate spectral calibrations to reagentless analysis of blood is presented. We have found that the determination of glucose in whole blood taken from diabetics can be precisely monitored from the PLS calibration of either mid- or near-infrared spectra of the blood. Progress toward the noninvasive determination of glucose levels in diabetics is an ultimate goal of this research.
Multivariate calibration applied to the quantitative analysis of infrared spectra
Haaland, D.M.
1991-01-01
Multivariate calibration methods are very useful for improving the precision, accuracy, and reliability of quantitative spectral analyses. Spectroscopists can more effectively use these sophisticated statistical tools if they have a qualitative understanding of the techniques involved. A qualitative picture of the factor analysis multivariate calibration methods of partial least squares (PLS) and principal component regression (PCR) is presented using infrared calibrations based upon spectra of phosphosilicate glass thin films on silicon wafers. Comparisons of the relative prediction abilities of four different multivariate calibration methods are given based on Monte Carlo simulations of spectral calibration and prediction data. The success of multivariate spectral calibrations is demonstrated for several quantitative infrared studies. The infrared absorption and emission spectra of thin-film dielectrics used in the manufacture of microelectronic devices demonstrate rapid, nondestructive at-line and in-situ analyses using PLS calibrations. Finally, the application of multivariate spectral calibrations to reagentless analysis of blood is presented. We have found that the determination of glucose in whole blood taken from diabetics can be precisely monitored from the PLS calibration of either mind- or near-infrared spectra of the blood. Progress toward the non-invasive determination of glucose levels in diabetics is an ultimate goal of this research. 13 refs., 4 figs.
Multivariate Probabilistic Analysis of an Hydrological Model
NASA Astrophysics Data System (ADS)
Franceschini, Samuela; Marani, Marco
2010-05-01
Model predictions derived based on rainfall measurements and hydrological model results are often limited by the systematic error of measuring instruments, by the intrinsic variability of the natural processes and by the uncertainty of the mathematical representation. We propose a means to identify such sources of uncertainty and to quantify their effects based on point-estimate approaches, as a valid alternative to cumbersome Montecarlo methods. We present uncertainty analyses on the hydrologic response to selected meteorological events, in the mountain streamflow-generating portion of the Brenta basin at Bassano del Grappa, Italy. The Brenta river catchment has a relatively uniform morphology and quite a heterogeneous rainfall-pattern. In the present work, we evaluate two sources of uncertainty: data uncertainty (the uncertainty due to data handling and analysis) and model uncertainty (the uncertainty related to the formulation of the model). We thus evaluate the effects of the measurement error of tipping-bucket rain gauges, the uncertainty in estimating spatially-distributed rainfall through block kriging, and the uncertainty associated with estimated model parameters. To this end, we coupled a deterministic model based on the geomorphological theory of the hydrologic response to probabilistic methods. In particular we compare the results of Monte Carlo Simulations (MCS) to the results obtained, in the same conditions, using Li's Point Estimate Method (LiM). The LiM is a probabilistic technique that approximates the continuous probability distribution function of the considered stochastic variables by means of discrete points and associated weights. This allows to satisfactorily reproduce results with only few evaluations of the model function. The comparison between the LiM and MCS results highlights the pros and cons of using an approximating method. LiM is less computationally demanding than MCS, but has limited applicability especially when the model
McKinney, Cliff; Renk, Kimberly
2008-06-01
Although parent-adolescent interactions have been examined, relevant variables have not been integrated into a multivariate model. As a result, this study examined a multivariate model of parent-late adolescent gender dyads in an attempt to capture important predictors in late adolescents' important and unique transition to adulthood. The sample for this study consisted of 151 male and 324 female late adolescents, who reported on their mothers' and fathers' parenting style, their family environment, their mothers' and fathers' expectations for them, the conflict that they experience with their mothers and fathers, and their own adjustment. Overall, the variables had significant relationships with one another. Further, the male-father, male-mother, and female-father structural equation models that were examined suggested that parenting style has an indirect relationship with late adolescents' adjustment through characteristics of the family environment and the conflict that is experienced in families; such findings were not evident for the female-mother model. Thus, the examination of parent-late adolescent interactions should occur in the context of the gender of parents and their late adolescents. PMID:17710537
Evaluation of Meterorite Amono Acid Analysis Data Using Multivariate Techniques
NASA Technical Reports Server (NTRS)
McDonald, G.; Storrie-Lombardi, M.; Nealson, K.
1999-01-01
The amino acid distributions in the Murchison carbonaceous chondrite, Mars meteorite ALH84001, and ice from the Allan Hills region of Antarctica are shown, using a multivariate technique known as Principal Component Analysis (PCA), to be statistically distinct from the average amino acid compostion of 101 terrestrial protein superfamilies.
Strength of Relationship in Multivariate Analysis of Variance.
ERIC Educational Resources Information Center
Smith, I. Leon
Methods for the calculation of eta coefficient, or correlation ratio, squared have recently been presented for examining the strength of relationship in univariate analysis of variance. This paper extends them to the multivariate case in which the effects of independent variables may be examined in relation to two or more dependent variables, and…
Univariate Analysis of Multivariate Outcomes in Educational Psychology.
ERIC Educational Resources Information Center
Hubble, L. M.
1984-01-01
The author examined the prevalence of multiple operational definitions of outcome constructs and an estimate of the incidence of Type I error rates when univariate procedures were applied to multiple variables in educational psychology. Multiple operational definitions of constructs were advocated and wider use of multivariate analysis was…
Multivariate geometry as an approach to algal community analysis
Allen, T.F.H.; Skagen, S.
1973-01-01
Multivariate analyses are put in the context of more usual approaches to phycological investigations. The intuitive common-sense involved in methods of ordination, classification and discrimination are emphasised by simple geometric accounts which avoid jargon and matrix algebra. Warnings are given that artifacts result from technique abuses by the naive or over-enthusiastic. An analysis of a simple periphyton data set is presented as an example of the approach. Suggestions are made as to situations in phycological investigations, where the techniques could be appropriate. The discipline is reprimanded for its neglect of the multivariate approach.
Multivariate analysis of pathophysiological factors in reflux oesophagitis.
Cadiot, G; Bruhat, A; Rigaud, D; Coste, T; Vuagnat, A; Benyedder, Y; Vallot, T; Le Guludec, D; Mignon, M
1997-01-01
BACKGROUND: Reflux oesophagitis is considered a multifactorial disease, but the respective roles of the main factors involved in its pathophysiology have not been clearly established. AIMS: To attempt to assign these roles by means of a multivariate logistic regression analysis of the main parameters associated with reflux oesophagitis. PATIENTS: Eighty seven patients with gastro-oesophageal reflux disease were studied: 41 without oesophagitis and 46 with reflux oesophagitis grade 1 to 3. METHODS: (1) Monovariate comparison of patients' characteristics and of parameters derived from in hospital 24 hour oesophageal pH monitoring, oesophageal manometry, double isotope gastric emptying studies, and basal and pentagastrin stimulated gastric acid and pepsin output determinations, between patients with and without oesophagitis. (2) Multivariate logistic regression analysis including the parameters significant in the monovariate analysis. RESULTS: Among the 16 significant parameters from monovariate analysis, three significant independent parameters were identified by multivariate logistic regression analysis: number of refluxes lasting more than five minutes, reflecting oesophageal acid clearance (p = 0.002); basal lower oesophageal sphincter pressure (p = 0.008); and peak acid output (p = 0.012). These three parameters were not correlated with each other. The multivariate model was highly discriminant (correct classification of 81.3% of the cases (95% confidence intervals 0.723, 0.903). Risk for oesophagitis increased as a function of the tercile threshold values of the three parameters. Odds ratios of the three parameters for oesophagitis risk were similar, regardless of whether they were calculated when the patients were compared as a function of oesophagitis grade or the presence or absence of oesophagitis. CONCLUSIONS: This multivariate approach adds evidence that impaired oesophageal acid clearance and hypotonic lower oesophageal sphincter are the two major
Tvedebrink, Torben; Eriksen, Poul Svante; Morling, Niels
2015-11-01
In this paper, we discuss the construction of a multivariate generalisation of the Dirichlet-multinomial distribution. An example from forensic genetics in the statistical analysis of DNA mixtures motivates the study of this multivariate extension. In forensic genetics, adjustment of the match probabilities due to remote ancestry in the population is often done using the so-called θ-correction. This correction increases the probability of observing multiple copies of rare alleles in a subpopulation and thereby reduces the weight of the evidence for rare genotypes. A recent publication by Cowell et al. (2015) showed elegantly how to use Bayesian networks for efficient computations of likelihood ratios in a forensic genetic context. However, their underlying population genetic model assumed independence of alleles, which is not realistic in real populations. We demonstrate how the so-called θ-correction can be incorporated in Bayesian networks to make efficient computations by modifying the Markov structure of Cowell et al. (2015). By numerical examples, we show how the θ-correction incorporated in the multivariate Dirichlet-multinomial distribution affects the weight of evidence. PMID:26344785
Multivariate analysis of prognostic factors in early stage Hodgkin's disease
Tubiana, M.; Henry-Amar, M.; van der Werf-Messing, B.; Henry, J.; Abbatucci, J.; Burgers, M.; Hayat, M.; Somers, R.; Laugier, A.; Carde, P.
1985-01-01
A multivariate analysis of the prognostic factors was carried out with a Cox model on 1,139 patients with clinical Stage I + II Hodgkin's disease included in three controlled clinical trials. The following indicators had been prospectively registered: aged, sex, systemic symptoms, erythrocyte sedimentation, results of staging laparotomy when performed, as well as the date and type of treatment. A linear logistic analysis showed that most of the indicators are interrelated. This emphasizes the necessity of a multivariate analysis in order to assess the independent influence of each of them. The two main prognostic indicators for relapse-free survival are systemic symptoms and/or ESR and number of involved areas. The only significant factor for survival after relapse is age. Sex has a small but significant influence on relapse-free survival. The relative influence of each indicator varies with the type of treatment and these variations may help in understanding the biologic significance of the indicators.
Multivariate analysis of eigenvalues and eigenvectors in tensor based morphometry
NASA Astrophysics Data System (ADS)
Rajagopalan, Vidya; Schwartzman, Armin; Hua, Xue; Leow, Alex; Thompson, Paul; Lepore, Natasha
2015-01-01
We develop a new algorithm to compute voxel-wise shape differences in tensor-based morphometry (TBM). As in standard TBM, we non-linearly register brain T1-weighed MRI data from a patient and control group to a template, and compute the Jacobian of the deformation fields. In standard TBM, the determinants of the Jacobian matrix at each voxel are statistically compared between the two groups. More recently, a multivariate extension of the statistical analysis involving the deformation tensors derived from the Jacobian matrices has been shown to improve statistical detection power.7 However, multivariate methods comprising large numbers of variables are computationally intensive and may be subject to noise. In addition, the anatomical interpretation of results is sometimes difficult. Here instead, we analyze the eigenvalues and the eigenvectors of the Jacobian matrices. Our method is validated on brain MRI data from Alzheimer's patients and healthy elderly controls from the Alzheimer's Disease Neuro Imaging Database.
Dating violence, social learning theory, and gender: a multivariate analysis.
Tontodonato, P; Crew, B K
1992-01-01
The study of violence between dating partners is a logical extension of interest in marital violence. However, little of this research tests explanations of intimate violence using multivariate techniques, and only recently have such tests occurred within a theoretical framework. Drawing on a recent social learning model of courtship violence (Riggs & O'Leary, 1989), this paper empirically examines constructs hypothesized to be predictive of the use of dating violence and investigates possible gender differences in the underlying causal structure of such violence. Logit analysis indicates that parent-child violence, drug use, and knowledge of use of dating violence by others predict the use of courtship violence by females. Belief that violence between intimates is justifiable, drug use, and parental divorce are related to perpetration of dating aggression by males. Explanations for these results and the importance of a multivariate approach to the problem are discussed. PMID:1504032
Multivariate statistical analysis of atom probe tomography data
Parish, Chad M; Miller, Michael K
2010-01-01
The application of spectrum imaging multivariate statistical analysis methods, specifically principal component analysis (PCA), to atom probe tomography (APT) data has been investigated. The mathematical method of analysis is described and the results for two example datasets are analyzed and presented. The first dataset is from the analysis of a PM 2000 Fe-Cr-Al-Ti steel containing two different ultrafine precipitate populations. PCA properly describes the matrix and precipitate phases in a simple and intuitive manner. A second APT example is from the analysis of an irradiated reactor pressure vessel steel. Fine, nm-scale Cu-enriched precipitates having a core-shell structure were identified and qualitatively described by PCA. Advantages, disadvantages, and future prospects for implementing these data analysis methodologies for APT datasets, particularly with regard to quantitative analysis, are also discussed.
Multivariate statistical analysis of atom probe tomography data.
Parish, Chad M; Miller, Michael K
2010-10-01
The application of spectrum imaging multivariate statistical analysis methods, specifically principal component analysis (PCA), to atom probe tomography (APT) data has been investigated. The mathematical method of analysis is described and the results for two example datasets are analyzed and presented. The first dataset is from the analysis of a PM 2000 Fe-Cr-Al-Ti steel containing two different ultrafine precipitate populations. PCA properly describes the matrix and precipitate phases in a simple and intuitive manner. A second APT example is from the analysis of an irradiated reactor pressure vessel steel. Fine, nm-scale Cu-enriched precipitates having a core-shell structure were identified and qualitatively described by PCA. Advantages, disadvantages, and future prospects for implementing these data analysis methodologies for APT datasets, particularly with regard to quantitative analysis, are also discussed. PMID:20650566
Comparing G: multivariate analysis of genetic variation in multiple populations.
Aguirre, J D; Hine, E; McGuigan, K; Blows, M W
2014-01-01
The additive genetic variance-covariance matrix (G) summarizes the multivariate genetic relationships among a set of traits. The geometry of G describes the distribution of multivariate genetic variance, and generates genetic constraints that bias the direction of evolution. Determining if and how the multivariate genetic variance evolves has been limited by a number of analytical challenges in comparing G-matrices. Current methods for the comparison of G typically share several drawbacks: metrics that lack a direct relationship to evolutionary theory, the inability to be applied in conjunction with complex experimental designs, difficulties with determining statistical confidence in inferred differences and an inherently pair-wise focus. Here, we present a cohesive and general analytical framework for the comparative analysis of G that addresses these issues, and that incorporates and extends current methods with a strong geometrical basis. We describe the application of random skewers, common subspace analysis, the 4th-order genetic covariance tensor and the decomposition of the multivariate breeders equation, all within a Bayesian framework. We illustrate these methods using data from an artificial selection experiment on eight traits in Drosophila serrata, where a multi-generational pedigree was available to estimate G in each of six populations. One method, the tensor, elegantly captures all of the variation in genetic variance among populations, and allows the identification of the trait combinations that differ most in genetic variance. The tensor approach is likely to be the most generally applicable method to the comparison of G-matrices from any sampling or experimental design. PMID:23486079
Simplifying multivariate survival analysis using global score test methodology
NASA Astrophysics Data System (ADS)
Zain, Zakiyah; Aziz, Nazrina; Ahmad, Yuhaniz
2015-12-01
In clinical trials, the main purpose is often to compare efficacy between experimental and control treatments. Treatment comparisons often involve multiple endpoints, and this situation further complicates the analysis of survival data. In the case of tumor patients, endpoints concerning survival times include: times from tumor removal until the first, the second and the third tumor recurrences, and time to death. For each patient, these endpoints are correlated, and the estimation of the correlation between two score statistics is fundamental in derivation of overall treatment advantage. In this paper, the bivariate survival analysis method using the global score test methodology is extended to multivariate setting.
Asymmetric measures of association, closed data, and multivariate analysis
Hohn, M. Ed; Nuhfer, E.B.
1980-01-01
The association between constant-sum variables Xiand Xjexpressed as percentages can be calculated as a product-moment correlation between Xiand Xj/(100 - Xi) and a correlation between Xjand Xi/(100 - Xj). An asymmetric, square matrix may be formed from these coefficients, and multivariate analysis performed by two methods: singular value decomposition and canonical decomposition. Either analysis avoids problems in the interpretation of correlation coefficients determined from closed arrays, and provides information about dependencies among the variables beyond that obtained from the usual correlation coefficient between Xiand Xj. Two examples show the canonical decomposition to have the greater usefulness. ?? 1980 Plenum Publishing Corporation.
Advanced multivariate analysis to assess remediation of hydrocarbons in soils.
Lin, Deborah S; Taylor, Peter; Tibbett, Mark
2014-10-01
Accurate monitoring of degradation levels in soils is essential in order to understand and achieve complete degradation of petroleum hydrocarbons in contaminated soils. We aimed to develop the use of multivariate methods for the monitoring of biodegradation of diesel in soils and to determine if diesel contaminated soils could be remediated to a chemical composition similar to that of an uncontaminated soil. An incubation experiment was set up with three contrasting soil types. Each soil was exposed to diesel at varying stages of degradation and then analysed for key hydrocarbons throughout 161 days of incubation. Hydrocarbon distributions were analysed by Principal Coordinate Analysis and similar samples grouped by cluster analysis. Variation and differences between samples were determined using permutational multivariate analysis of variance. It was found that all soils followed trajectories approaching the chemical composition of the unpolluted soil. Some contaminated soils were no longer significantly different to that of uncontaminated soil after 161 days of incubation. The use of cluster analysis allows the assignment of a percentage chemical similarity of a diesel contaminated soil to an uncontaminated soil sample. This will aid in the monitoring of hydrocarbon contaminated sites and the establishment of potential endpoints for successful remediation. PMID:25028320
Decoding neural representational spaces using multivariate pattern analysis.
Haxby, James V; Connolly, Andrew C; Guntupalli, J Swaroop
2014-01-01
A major challenge for systems neuroscience is to break the neural code. Computational algorithms for encoding information into neural activity and extracting information from measured activity afford understanding of how percepts, memories, thought, and knowledge are represented in patterns of brain activity. The past decade and a half has seen significant advances in the development of methods for decoding human neural activity, such as multivariate pattern classification, representational similarity analysis, hyperalignment, and stimulus-model-based encoding and decoding. This article reviews these advances and integrates neural decoding methods into a common framework organized around the concept of high-dimensional representational spaces. PMID:25002277
a Multivariate Statistical Analysis of Visibility at California Regions.
NASA Astrophysics Data System (ADS)
Motallebi, Nehzat
This study summarizes the results of a comprehensive study of visibility in California. California is one of the few states that has promulgated air quality standards for visibility. The study was concerned not only with major metropolitan areas such as Los Angeles, but also with deterioration of visibility in the less urbanized areas of California. The relationships among visibility reduction, atmospheric pollutants, and meteorological conditions were examined by using the multivariate statistical techniques of principal component analysis and multiple linear regression analysis. The primary concern of this work was to find which of the many atmospheric constituents most effectively reduce visibility, and to determine the role of the different meteorological variables on these relationships. Another objective was to identify the major pollutant sources and transport routes which contribute to visibility degradation. In order to establish the relationship between the light scattering coefficient and particulate data, both the size distribution and the elemental composition of particulate aerosols were considered. Meanwhile, including meteorological parameters in the principal component analysis made it possible to investigate meteorological effects on the observed pollution patterns. The associations among wind direction, elemental concentration, and additional meteorological parameters were considered by using a special modification of principal component analysis. This technique can identify all of the main features, and provides reasonable source direction for particular elements. It is appropriate to note that there appeared to be no published accounts of a principal component analysis for a data set similar to that analyzed in this work. Finally, the results of the multivariate statistical analyses, multiple linear regression analysis and principal component analysis, indicate that intermediate size sulfur containing aerosols, sulfur size mode 0.6 (mu)m < D
Processes and subdivisions in diogenites, a multivariate statistical analysis
NASA Technical Reports Server (NTRS)
Harriott, T. A.; Hewins, R. H.
1984-01-01
Multivariate statistical techniques used on diogenite orthopyroxene analyses show the relationships that occur within diogenites and the two orthopyroxenite components (class I and II) in the polymict diogenite Garland. Cluster analysis shows that only Peckelsheim is similar to Garland class I (Fe-rich) and the other diogenites resemble Garland class II. The unique diogenite Y 75032 may be related to type I by fractionation. Factor analysis confirms the subdivision and shows that Fe does not correlate with the weakly incompatible elements across the entire pyroxene composition range, indicating that igneous fractionation is not the process controlling total diogenite composition variation. The occurrence of two groups of diogenites is interpreted as the result of sampling or mixing of two main sequences of orthopyroxene cumulates with slightly different compositions.
Multivariate space - time analysis of PRE-STORM precipitation
NASA Technical Reports Server (NTRS)
Polyak, Ilya; North, Gerald R.; Valdes, Juan B.
1994-01-01
This paper presents the methodologies and results of the multivariate modeling and two-dimensional spectral and correlation analysis of PRE-STORM rainfall gauge data. Estimated parameters of the models for the specific spatial averages clearly indicate the eastward and southeastward wave propagation of rainfall fluctuations. A relationship between the coefficients of the diffusion equation and the parameters of the stochastic model of rainfall fluctuations is derived that leads directly to the exclusive use of rainfall data to estimate advection speed (about 12 m/s) as well as other coefficients of the diffusion equation of the corresponding fields. The statistical methodology developed here can be used for confirmation of physical models by comparison of the corresponding second-moment statistics of the observed and simulated data, for generating multiple samples of any size, for solving the inverse problem of the hydrodynamic equations, and for application in some other areas of meteorological and climatological data analysis and modeling.
Ordinary chondrites - Multivariate statistical analysis of trace element contents
NASA Technical Reports Server (NTRS)
Lipschutz, Michael E.; Samuels, Stephen M.
1991-01-01
The contents of mobile trace elements (Co, Au, Sb, Ga, Se, Rb, Cs, Te, Bi, Ag, In, Tl, Zn, and Cd) in Antarctic and non-Antarctic populations of H4-6 and L4-6 chondrites, were compared using standard multivariate discriminant functions borrowed from linear discriminant analysis and logistic regression. A nonstandard randomization-simulation method was developed, making it possible to carry out probability assignments on a distribution-free basis. Compositional differences were found both between the Antarctic and non-Antarctic H4-6 chondrite populations and between two L4-6 chondrite populations. It is shown that, for various types of meteorites (in particular, for the H4-6 chondrites), the Antarctic/non-Antarctic compositional difference is due to preterrestrial differences in the genesis of their parent materials.
Toxicological evaluation of complex mixtures: fingerprinting and multivariate analysis.
Eide, Ingvar; Neverdal, Gunhild; Thorvaldsen, Bodil; Arneberg, Reidar; Grung, Bjørn; Kvalheim, Olav M
2004-11-01
The present paper describes a strategy for toxicological evaluation of complex mixtures based on chemical "fingerprinting" followed by pattern recognition (multivariate data analysis). The purpose is to correlate chemical fingerprints to measured toxicological endpoints, identify all major contributors to toxicity, and predict toxicity of additional mixtures. The strategy is illustrated with organic extracts of exhaust particles which are characterized by full scan gas chromatography-mass spectrometry (GC-MS). The complex GC-MS data are resolved into peaks and spectra for individual compounds using an automated curve resolution procedure. Projections to latent structures (PLS) is used for the regression modeling to correlate the GC-MS data to the measured responses; mutagenicity in the Ames Salmonella assay. The regression model identifies those peaks that co-vary with the observed mutagenicity. These peaks may be identified chemically from their spectra. Furthermore, the regression model can be used to predict mutagenicity from GC-MS chromatograms of additional samples. PMID:21782741
Monitoring Quality of Biotherapeutic Products Using Multivariate Data Analysis.
Rathore, Anurag S; Pathak, Mili; Jain, Renu; Jadaun, Gaurav Pratap Singh
2016-07-01
Monitoring the quality of pharmaceutical products is a global challenge, heightened by the implications of letting subquality drugs come to the market on public safety. Regulatory agencies do their due diligence at the time of approval as per their prescribed regulations. However, product quality needs to be monitored post-approval as well to ensure patient safety throughout the product life cycle. This is particularly complicated for biotechnology-based therapeutics where seemingly minor changes in process and/or raw material attributes have been shown to have a significant effect on clinical safety and efficacy of the product. This article provides a perspective on the topic of monitoring the quality of biotech therapeutics. In the backdrop of challenges faced by the regulatory agencies, the potential use of multivariate data analysis as a tool for effective monitoring has been proposed. Case studies using data from several insulin biosimilars have been used to illustrate the key concepts. PMID:27044370
Successes, Challenges and Future Outlook of Multivariate Analysis In HEP
NASA Astrophysics Data System (ADS)
Voss, Helge
2015-05-01
Multivariate techniques using machine learning algorithms have become an integral part in many High Energy Physics data analyses. This article is intended to sketch how this development took place by pointing out a few analyses that pushed forward the exploitation of these powerful analysis techniques. This article does not focus on controversial issues like for example how systematic uncertainties can be dealt with when using such techniques, which have been widely discussed previously by other authors. The main purpose here is to point to the gain in physics reach of the physics experiments due to the adaptation of machine learning techniques and to the challenges the HEP community faces in the light a rapid development in the field of machine learning if we want to make successful use of these powerful selection and reconstruction techniques.
ERIC Educational Resources Information Center
McKinney, Cliff; Renk, Kimberly
2008-01-01
Although parent-adolescent interactions have been examined, relevant variables have not been integrated into a multivariate model. As a result, this study examined a multivariate model of parent-late adolescent gender dyads in an attempt to capture important predictors in late adolescents' important and unique transition to adulthood. The sample…
Multivariate Analysis of State Variation in Breastfeeding Rates in the United States
Singh, Gopal K.; Dee, Deborah L.; Belanoff, Candice; Grummer-Strawn, Laurence M.
2008-01-01
Objectives. We sought to determine the impact of sociodemographic and behavioral factors and state legislation on breastfeeding initiation (child ever fed breastmilk) and duration. Methods. We used data from a nationally representative study of children aged 6 to 71 months (N = 33 121); we calculated unadjusted and adjusted state estimates for breastfeeding initiation and duration. We used logistic regression models to examine factors associated with never breastfeeding or breastfeeding less than 6 months. We conducted a multilevel analysis of state legislation's role. Results. There were wide state variations in breastfeeding initiation and duration. The western and northwestern states had the highest rates. Covariate adjustment accounted for 25% to 30% of the disparity. Multivariate analysis showed that the adjusted odds of not being breastfed were 2.5- to 5.15-times greater in southern states compared with Oregon (reference). Children in states without breastfeeding legislation had higher odds of not being breastfed. Conclusions. Sociodemographic and maternal factors do not account for most breastfeeding rate variation. The association with breastfeeding legislation should be explored and may reflect cultural norms. PMID:18703441
Multispectral light scattering imaging and multivariate analysis of airborne particulates
NASA Astrophysics Data System (ADS)
Holler, Stephen; Skelsey, Charles R.; Fuerstenau, Stephen D.
2005-05-01
Light scattering patterns from non-spherical particles and aggregates exhibit complex structure that is only revealed when observing in two angular dimensions. However, due to the varied shape and packing of such aerosols, the rich structure in the two-dimensional angular optical scattering (TAOS) pattern varies from particle to particle. We examine two-dimensional light scattering patterns obtained at multiple wavelengths using a single CCD camera with minimal cross talk between channels. The integration of the approach with a single CCD camera assures that data is acquired within the same solid angle and orientation. Since the optical size of the scattering particle is inversely proportional to the illuminating wavelength, the spectrally resolved scattering information provides characteristic information about the airborne particles simultaneously in two different scaling regimes. The simultaneous acquisition of data from airborne particulate matter at two different wavelengths allows for additional degrees of freedom in the analysis and characterization of the aerosols. Whereas our previous multivariate analyses of aerosol particles has relied solely on spatial frequency components, our present approach attempts to incorporate the relative symmetry of the particledetector system while extracting information content from both spectral channels. In addition to single channel data, this current approach also examines relative metrics. Consequently, we have begun to employ multivariate techniques based on novel morphological descriptors in order to classify "unknown" particles within a database of TAOS patterns from known aerosols utilizing both spectral and spatial information acquired. A comparison is made among several different classification metrics, all of which show improved classification capabilities relative to our previous approaches.
Multivariate Analysis of the Globular Clusters in M87
NASA Astrophysics Data System (ADS)
Das, Sukanta; Chattopadhayay, Tanuka; Davoust, Emmanuel
2015-11-01
An objective classification of 147 globular clusters (GCs) in the inner region of the giant elliptical galaxy M87 is carried out with the help of two methods of multivariate analysis. First, independent component analysis (ICA) is used to determine a set of independent variables that are linear combinations of various observed parameters (mostly Lick indices) of the GCs. Next, K-means cluster analysis (CA) is applied on the independent components (ICs), to find the optimum number of homogeneous groups having an underlying structure. The properties of the four groups of GCs thus uncovered are used to explain the formation mechanism of the host galaxy. It is suggested that M87 formed in two successive phases. First a monolithic collapse, which gave rise to an inner group of metal-rich clusters with little systematic rotation and an outer group of metal-poor clusters in eccentric orbits. In a second phase, the galaxy accreted low-mass satellites in a dissipationless fashion, from the gas of which the two other groups of GCs formed. Evidence is given for a blue stellar population in the more metal rich clusters, which we interpret by Helium enrichment. Finally, it is found that the clusters of M87 differ in some of their chemical properties (NaD, TiO1, light-element abundances) from GCs in our Galaxy and M31.
Cocaine dependence and thalamic functional connectivity: a multivariate pattern analysis.
Zhang, Sheng; Hu, Sien; Sinha, Rajita; Potenza, Marc N; Malison, Robert T; Li, Chiang-Shan R
2016-01-01
Cocaine dependence is associated with deficits in cognitive control. Previous studies demonstrated that chronic cocaine use affects the activity and functional connectivity of the thalamus, a subcortical structure critical for cognitive functioning. However, the thalamus contains nuclei heterogeneous in functions, and it is not known how thalamic subregions contribute to cognitive dysfunctions in cocaine dependence. To address this issue, we used multivariate pattern analysis (MVPA) to examine how functional connectivity of the thalamus distinguishes 100 cocaine-dependent participants (CD) from 100 demographically matched healthy control individuals (HC). We characterized six task-related networks with independent component analysis of fMRI data of a stop signal task and employed MVPA to distinguish CD from HC on the basis of voxel-wise thalamic connectivity to the six independent components. In an unbiased model of distinct training and testing data, the analysis correctly classified 72% of subjects with leave-one-out cross-validation (p < 0.001), superior to comparison brain regions with similar voxel counts (p < 0.004, two-sample t test). Thalamic voxels that form the basis of classification aggregate in distinct subclusters, suggesting that connectivities of thalamic subnuclei distinguish CD from HC. Further, linear regressions provided suggestive evidence for a correlation of the thalamic connectivities with clinical variables and performance measures on the stop signal task. Together, these findings support thalamic circuit dysfunction in cognitive control as an important neural marker of cocaine dependence. PMID:27556009
Multivariate Comparative Analysis of Stock Exchanges: The European Perspective
NASA Astrophysics Data System (ADS)
Koralun-Bereźnicka, Julia
The aim of the research is to perform a multivariate comparative analysis of 20 European stock exchanges in order to identify the main similarities between the objects. Due to the convergence process of capital markets in Europe the similarities between stock exchanges could be expected to increase over time. The research is meant to show whether and how these similarities change. Consequently, the distances between clusters of similar stock exchanges should become less significant, which the analysis also aims at verifying. The basis of comparison is a set of 48 monthly variables from the period January, 2003 to December, 2006. The variables are classified into three categories: size of the market, equity trading and bonds. The paper aims at identifying the clusters of alike stock exchanges and at finding the characteristic features of each of the distinguished groups. The obtained categorization to some extent corresponds with the division of the European Union into “new” and “old” member countries. Clustering method, performed for each quarter separately, also reveals that the classification is fairly stable in time. The factor analysis, which was carried out to reduce the number of variables, reveals three major factors behind the data, which are related with the earlier mentioned categories of variables.
High dimensional data analysis using multivariate generalized spatial quantiles
Mukhopadhyay, Nitai D.; Chatterjee, Snigdhansu
2015-01-01
High dimensional data routinely arises in image analysis, genetic experiments, network analysis, and various other research areas. Many such datasets do not correspond to well-studied probability distributions, and in several applications the data-cloud prominently displays non-symmetric and non-convex shape features. We propose using spatial quantiles and their generalizations, in particular, the projection quantile, for describing, analyzing and conducting inference with multivariate data. Minimal assumptions are made about the nature and shape characteristics of the underlying probability distribution, and we do not require the sample size to be as high as the data-dimension. We present theoretical properties of the generalized spatial quantiles, and an algorithm to compute them quickly. Our quantiles may be used to obtain multidimensional confidence or credible regions that are not required to conform to a pre-determined shape. We also propose a new notion of multidimensional order statistics, which may be used to obtain multidimensional outliers. Many of the features revealed using a generalized spatial quantile-based analysis would be missed if the data was shoehorned into a well-known probabilistic configuration. PMID:26617421
Characterization and Classification of Lanthanides by Multivariate-Analysis Methods
NASA Astrophysics Data System (ADS)
Horovitz, Ossi; Sârbu, Costel
2005-03-01
A chemometric study was conducted on a data set consisting of 18 characteristics, mainly physical properties of the 14 lanthanides and lanthanum, including Sc and Y. Classical methods of multivariate analysis, namely, principal component analysis (PCA) and cluster analysis (CA) were applied. The results obtained by using the Statistica software package are presented and discussed concerning the correlations between the properties and those between the elements themselves. The discussion and findings are based on the tables of correlation, the eigenvectors and eigenvalues of PCA, the 2D- and 3D-representations of the loadings of variables and scores of the elements corresponding to the first principal components, including also the dendrograms obtained by using CA. Loadings scatterplots are used as a display tool for examining the relationships between properties, looking for trends, grouping, or outliers. In the same way, the scatterplots of scores emphasized the difference between La and the lanthanides on the one side and Sc and Y on the other and support setting Lu as their homologue, rather than La. On the basis of these findings, a ”periodic system“ of the lanthanides is suggested that agrees well with chemical intuition.
ERIC Educational Resources Information Center
Grochowalski, Joseph H.
2015-01-01
Component Universe Score Profile analysis (CUSP) is introduced in this paper as a psychometric alternative to multivariate profile analysis. The theoretical foundations of CUSP analysis are reviewed, which include multivariate generalizability theory and constrained principal components analysis. Because CUSP is a combination of generalizability…
Classification of Malaysia aromatic rice using multivariate statistical analysis
Abdullah, A. H.; Adom, A. H.; Shakaff, A. Y. Md; Masnan, M. J.; Zakaria, A.; Rahim, N. A.; Omar, O.
2015-05-15
Aromatic rice (Oryza sativa L.) is considered as the best quality premium rice. The varieties are preferred by consumers because of its preference criteria such as shape, colour, distinctive aroma and flavour. The price of aromatic rice is higher than ordinary rice due to its special needed growth condition for instance specific climate and soil. Presently, the aromatic rice quality is identified by using its key elements and isotopic variables. The rice can also be classified via Gas Chromatography Mass Spectrometry (GC-MS) or human sensory panels. However, the uses of human sensory panels have significant drawbacks such as lengthy training time, and prone to fatigue as the number of sample increased and inconsistent. The GC–MS analysis techniques on the other hand, require detailed procedures, lengthy analysis and quite costly. This paper presents the application of in-house developed Electronic Nose (e-nose) to classify new aromatic rice varieties. The e-nose is used to classify the variety of aromatic rice based on the samples odour. The samples were taken from the variety of rice. The instrument utilizes multivariate statistical data analysis, including Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA) and K-Nearest Neighbours (KNN) to classify the unknown rice samples. The Leave-One-Out (LOO) validation approach is applied to evaluate the ability of KNN to perform recognition and classification of the unspecified samples. The visual observation of the PCA and LDA plots of the rice proves that the instrument was able to separate the samples into different clusters accordingly. The results of LDA and KNN with low misclassification error support the above findings and we may conclude that the e-nose is successfully applied to the classification of the aromatic rice varieties.
Classification of Malaysia aromatic rice using multivariate statistical analysis
NASA Astrophysics Data System (ADS)
Abdullah, A. H.; Adom, A. H.; Shakaff, A. Y. Md; Masnan, M. J.; Zakaria, A.; Rahim, N. A.; Omar, O.
2015-05-01
Aromatic rice (Oryza sativa L.) is considered as the best quality premium rice. The varieties are preferred by consumers because of its preference criteria such as shape, colour, distinctive aroma and flavour. The price of aromatic rice is higher than ordinary rice due to its special needed growth condition for instance specific climate and soil. Presently, the aromatic rice quality is identified by using its key elements and isotopic variables. The rice can also be classified via Gas Chromatography Mass Spectrometry (GC-MS) or human sensory panels. However, the uses of human sensory panels have significant drawbacks such as lengthy training time, and prone to fatigue as the number of sample increased and inconsistent. The GC-MS analysis techniques on the other hand, require detailed procedures, lengthy analysis and quite costly. This paper presents the application of in-house developed Electronic Nose (e-nose) to classify new aromatic rice varieties. The e-nose is used to classify the variety of aromatic rice based on the samples odour. The samples were taken from the variety of rice. The instrument utilizes multivariate statistical data analysis, including Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA) and K-Nearest Neighbours (KNN) to classify the unknown rice samples. The Leave-One-Out (LOO) validation approach is applied to evaluate the ability of KNN to perform recognition and classification of the unspecified samples. The visual observation of the PCA and LDA plots of the rice proves that the instrument was able to separate the samples into different clusters accordingly. The results of LDA and KNN with low misclassification error support the above findings and we may conclude that the e-nose is successfully applied to the classification of the aromatic rice varieties.
Multivariate Analysis of Conformational Changes Induced by Macromolecular Interactions
NASA Astrophysics Data System (ADS)
Mitra, Indranil; Alexov, Emil
2009-11-01
Understanding protein-protein binding and associated conformational changes is critical for both understanding thermodynamics of protein interactions and successful drug discovery. Our study focuses on computational analysis of plausible correlations between induced conformational changes and set of biophysical characteristics of interacting monomers. It was done by comparing 3D structures of unbound and bound monomers to calculate the RMSD which is used as measure of the structural changed induced by the binding. We correlate RMSD with volumetric and interfacial charge of the monomers, the amino acid composition, the energy of binding, and type of amino acids at the interface. as predictors. The data set was analyzed with SVM in R & SPSS which is trained on a combination of a new robust evolutionary conservation signal with the monomeric properties to predict the induced RMSD. The goal of this study is to undergo parametric tests and heirchiacal cluster and discriminant multivariate analysis to find key predictors which will be used to develop algorithm to predict the magnitude of conformational changes provided by the structure of interacting monomers. Results indicate that the most promising predictor is the net charge of the monomers, however, other parameters as the type of amino acids at the interface have significant contribution as well.
Multivariate Sensitivity Analysis of Time-of-Flight Sensor Fusion
NASA Astrophysics Data System (ADS)
Schwarz, Sebastian; Sjöström, Mårten; Olsson, Roger
2014-09-01
Obtaining three-dimensional scenery data is an essential task in computer vision, with diverse applications in various areas such as manufacturing and quality control, security and surveillance, or user interaction and entertainment. Dedicated Time-of-Flight sensors can provide detailed scenery depth in real-time and overcome short-comings of traditional stereo analysis. Nonetheless, they do not provide texture information and have limited spatial resolution. Therefore such sensors are typically combined with high resolution video sensors. Time-of-Flight Sensor Fusion is a highly active field of research. Over the recent years, there have been multiple proposals addressing important topics such as texture-guided depth upsampling and depth data denoising. In this article we take a step back and look at the underlying principles of ToF sensor fusion. We derive the ToF sensor fusion error model and evaluate its sensitivity to inaccuracies in camera calibration and depth measurements. In accordance with our findings, we propose certain courses of action to ensure high quality fusion results. With this multivariate sensitivity analysis of the ToF sensor fusion model, we provide an important guideline for designing, calibrating and running a sophisticated Time-of-Flight sensor fusion capture systems.
Multivariate analysis of dim elves from ISUAL observations
NASA Astrophysics Data System (ADS)
Offroy, Marc; Farges, Thomas; Gaillard, Pierre; Kuo, Cheng Ling; Chen, Alfred Bing-Chih; Hsu, Rue-Ron; Takahashi, Yukihiro
2015-08-01
The Imager of Sprites and Upper Atmospheric Lightning (ISUAL) on the FORMOSAT-2 satellite, launched in 2004, records Transient Luminous Events (TLEs). ISUAL has an imager and a spectrophotometer that observe TLEs all over the globe. Among these phenomena, elves are particularly difficult to detect. ISUAL often records events that correspond to significant far ultraviolet (FUV) emissions in the spectrophotometer but have no discernible TLEs in the imager. These FUV events are called "dim" elves. Therefore, it is important to develop mathematical tools to analyze the data to obtain a better evaluation of the number of elves and their occurrence. Multivariate approaches are applied to characterize the unlabeled events. The first approach is the principal component analysis which distinguishes two different groups, one including elves and dim elves. The second approach is the PARallel FACtor analysis which provides a waveform model for each group. These methodologies confirm that FUV signal is the evidence of TLE presence. A crude classification method was then suggested taking into account these results. The proportion of elves, relatively to the considered ISUAL data set, is found to be about 40%. It is similar to previous results and confirms that relatively weak lightning peak current is sufficient to produce elves. This new strategy demonstrates the potential for discriminating between lightning and TLEs without prior knowledge within the selectivity of the FUV spectral band.
NASA Technical Reports Server (NTRS)
Achtemeier, Gary L.; Ochs, Harry T., III
1988-01-01
The variational method of undetermined multipliers is used to derive a multivariate model for objective analysis. The model is intended for the assimilation of 3-D fields of rawinsonde height, temperature and wind, and mean level temperature observed by satellite into a dynamically consistent data set. Relative measurement errors are taken into account. The dynamic equations are the two nonlinear horizontal momentum equations, the hydrostatic equation, and an integrated continuity equation. The model Euler-Lagrange equations are eleven linear and/or nonlinear partial differential and/or algebraic equations. A cyclical solution sequence is described. Other model features include a nonlinear terrain-following vertical coordinate that eliminates truncation error in the pressure gradient terms of the horizontal momentum equations and easily accommodates satellite observed mean layer temperatures in the middle and upper troposphere. A projection of the pressure gradient onto equivalent pressure surfaces removes most of the adverse impacts of the lower coordinate surface on the variational adjustment.
Prognostic factors of adult metastatic renal carcinoma: a multivariate analysis.
de Forges, A; Rey, A; Klink, M; Ghosn, M; Kramar, A; Droz, J P
1988-01-01
In order to define the prognostic factors for metastatic renal carcinoma, we reviewed 134 patients who were treated from 1971 through 1986. Survival rates were 72, 45, and 25% at 6, 12, and 18 months, respectively. Seventeen variables were tested using the logrank test. Improved survival was correlated with normal performance status, and an absence of fever, weight loss, hepatic metastasis, and lung metastasis (or, if lung metastasis was present, less than 2 cm in diameter and limited to one site), a disease-free interval, sedimentation rate less than 100, and renal surgery. Four variables retained significant value in the multivariate analysis: hepatic metastasis, lung metastasis, disease-free interval, and a variable combining the sedimentation rate and the weight loss (SWRL). Predictive survival rates based on these variables were calculated from the Cox model. Six subgroups of patients were identified. The estimation of survival is clinically of value for future phase II trials of chemotherapy in patients with adult metastatic renal carcinoma. PMID:3187293
Atmospheric conditions, lunar phases, and childbirth: a multivariate analysis.
Ochiai, Angela Megumi; Gonçalves, Fabio Luiz Teixeira; Ambrizzi, Tercio; Florentino, Lucia Cristina; Wei, Chang Yi; Soares, Alda Valeria Neves; De Araujo, Natalucia Matos; Gualda, Dulce Maria Rosa
2012-07-01
Our objective was to assess extrinsic influences upon childbirth. In a cohort of 1,826 days containing 17,417 childbirths among them 13,252 spontaneous labor admissions, we studied the influence of environment upon the high incidence of labor (defined by 75th percentile or higher), analyzed by logistic regression. The predictors of high labor admission included increases in outdoor temperature (odds ratio: 1.742, P = 0.045, 95%CI: 1.011 to 3.001), and decreases in atmospheric pressure (odds ratio: 1.269, P = 0.029, 95%CI: 1.055 to 1.483). In contrast, increases in tidal range were associated with a lower probability of high admission (odds ratio: 0.762, P = 0.030, 95%CI: 0.515 to 0.999). Lunar phase was not a predictor of high labor admission (P = 0.339). Using multivariate analysis, increases in temperature and decreases in atmospheric pressure predicted high labor admission, and increases of tidal range, as a measurement of the lunar gravitational force, predicted a lower probability of high admission. PMID:21744100
Atmospheric conditions, lunar phases, and childbirth: a multivariate analysis
NASA Astrophysics Data System (ADS)
Ochiai, Angela Megumi; Gonçalves, Fabio Luiz Teixeira; Ambrizzi, Tercio; Florentino, Lucia Cristina; Wei, Chang Yi; Soares, Alda Valeria Neves; De Araujo, Natalucia Matos; Gualda, Dulce Maria Rosa
2012-07-01
Our objective was to assess extrinsic influences upon childbirth. In a cohort of 1,826 days containing 17,417 childbirths among them 13,252 spontaneous labor admissions, we studied the influence of environment upon the high incidence of labor (defined by 75th percentile or higher), analyzed by logistic regression. The predictors of high labor admission included increases in outdoor temperature (odds ratio: 1.742, P = 0.045, 95%CI: 1.011 to 3.001), and decreases in atmospheric pressure (odds ratio: 1.269, P = 0.029, 95%CI: 1.055 to 1.483). In contrast, increases in tidal range were associated with a lower probability of high admission (odds ratio: 0.762, P = 0.030, 95%CI: 0.515 to 0.999). Lunar phase was not a predictor of high labor admission ( P = 0.339). Using multivariate analysis, increases in temperature and decreases in atmospheric pressure predicted high labor admission, and increases of tidal range, as a measurement of the lunar gravitational force, predicted a lower probability of high admission.
Adaptive multiscale entropy analysis of multivariate neural data.
Hu, Meng; Liang, Hualou
2012-01-01
Multiscale entropy (MSE) has been widely used to quantify a system's complexity by taking into account the multiple time scales inherent in physiologic time series. The method, however, is biased toward the coarse scale, i.e., low-frequency components due to the progressive smoothing operations. In addition, the algorithm for extracting the different scales is not well adapted to nonlinear/nonstationary signals. In this letter, we introduce adaptive multiscale entropy (AME) measures in which the scales are adaptively derived directly from the data by virtue of recently developed multivariate empirical mode decomposition. Depending on the consecutive removal of low-frequency or high-frequency components, our AME can be estimated at either coarse-to-fine or fine-to-coarse scales over which the sample entropy is performed. Computer simulations are performed to verify the effectiveness of AME for analysis of the highly nonstationary data. Local field potentials collected from the visual cortex of macaque monkey while performing a generalized flash suppression task are used as an example to demonstrate the usefulness of our AME approach to reveal the underlying dynamics in complex neural data. PMID:21788182
A multivariate analysis of arctic climate in GCMs
McGinnis, D.L.; Crane, R.G. )
1994-08-01
A multivariate analysis of Arctic climate is performed comparing the observed climate with that simulated by four different global climate models (GCMs). The focus is on the patterns of temporal and spatial variability in several climate parameters (sea level pressure, temperature, specific humidity, and precipitation). There are broad similarities between the observed data and all the GCM climates. There are, however, several major differences. The observed data show the Arctic climate to be dominated by the summertime pattern of temperature and humidity, which is decoupled from the atmospheric circulation. The winter patterns explain less of the observed variance but show a much closer association between temperature and the large-scale circulation. The GCMs, in contrast, overemphasize the winter season and show more of a large-scale advective control on summertime temperature patterns. Possible reasons for these differences are suggested, and their implications for GCM climate studies are discussed. The shortcomings in the GCMs point to the need for improvements in boundary layer rendition, in the treatment of Arctic stratus, and in sea ice simulations through coupled ocean models and the inclusion of ice dynamics. 25 refs., 7 figs., 2 tabs.
ADS-Demo Fuel Rod Performance: Multivariate Statistical Analysis
Calabrese, R.; Vettraino, F.; Luzzi, L.
2004-07-01
A forward step in the development of Accelerator Driven System (ADS) for the Pu, MA and LLFP transmutation, is the realisation of a 80 MWt ADS-demo (XADS) whose basic objective is the system feasibility demonstration. The XADS is forecasted to adopt the UO{sub 2}-PuO{sub 2} mixed-oxides fuel already experimented in the sodium cooled fast reactors such as the french SPX-1. The present multivariate statistical analysis performed by using the Transuranus Code, was carried out for the Normal Operation at the so-called Enhanced Nominal Conditions (120% nominal reactor power), aimed at verifying that the fuel system complies with the stated design limits, i.e. centerline fuel temperature, cladding temperature and damage, during all the in-reactor lifetime. A statistical input set similar to SPX and PEC fuel case, was adopted. One most relevant assumption in the present calculations was a 30% AISI-316 cladding thickness corrosion at EOL. Relative influence of main fuel rod parameters on fuel centerline temperature was also evaluated. (authors)
Multivariate multiscale entropy: a tool for complexity analysis of multichannel data.
Ahmed, Mosabber Uddin; Mandic, Danilo P
2011-12-01
This work generalizes the recently introduced univariate multiscale entropy (MSE) analysis to the multivariate case. This is achieved by introducing multivariate sample entropy (MSampEn) in a rigorous way, in order to account for both within- and cross-channel dependencies in multiple data channels, and by evaluating it over multiple temporal scales. The multivariate MSE (MMSE) method is shown to provide an assessment of the underlying dynamical richness of multichannel observations, and more degrees of freedom in the analysis than standard MSE. The benefits of the proposed approach are illustrated by simulations on complexity analysis of multivariate stochastic processes and on real-world multichannel physiological and environmental data. PMID:22304127
SERS spectroscopy and multivariate analysis of globulin in human blood
NASA Astrophysics Data System (ADS)
Wang, J.; Zeng, Y. Y.; Lin, J. Q.; Lin, L.; Wang, X. C.; Chen, G. N.; Huang, Z. F.; Li, B. H.; Zeng, H. S.; Chen, R.
2014-06-01
Globulin plays a significant role in body processes, acts as an important marker for disease diagnosis and determines blood type. Moreover, recent reports about the strong association between cancer risk and blood type imply that further studying these relationships may yield new findings on the biological mechanisms of tumorigenesis. In this paper, we propose and evaluate the efficacy of surface-enhanced Raman scattering (SERS) for the determination of this important globulin derived from human blood. Comparing globulins from different blood types by utilizing SERS spectroscopy and multivariate analysis, we show that primary structures of globulins from different blood types are similar to each other, but subtle differences in structures which may be vital for determining blood type are still observed. The abilities of globulins from different blood types to approach silver surfaces seem to differ, which also indicates that there are structural differences in blood type related globulins. Furthermore, this method differentiates blood type A from type B, type A from type O, type B from type O, type AB from type A, type AB from type B, and type AB from type O with sensitivities and specificities as follows: (90.0%, 95.0%), (80.0%, 83.9%), (95.0%, 90.3%), (97.3%, 96.7%), (94.6%, 95.5%), (100%, 100%), suggesting a potential feasibility for use in blood type identification. Our method sheds new light on blood type analysis, paves the way for the study of relationships between cancer risk and blood types, and expands the flexibility of SERS for useful applications in the life sciences.
Pleiotropy analysis of quantitative traits at gene level by multivariate functional linear models.
Wang, Yifan; Liu, Aiyi; Mills, James L; Boehnke, Michael; Wilson, Alexander F; Bailey-Wilson, Joan E; Xiong, Momiao; Wu, Colin O; Fan, Ruzong
2015-05-01
In genetics, pleiotropy describes the genetic effect of a single gene on multiple phenotypic traits. A common approach is to analyze the phenotypic traits separately using univariate analyses and combine the test results through multiple comparisons. This approach may lead to low power. Multivariate functional linear models are developed to connect genetic variant data to multiple quantitative traits adjusting for covariates for a unified analysis. Three types of approximate F-distribution tests based on Pillai-Bartlett trace, Hotelling-Lawley trace, and Wilks's Lambda are introduced to test for association between multiple quantitative traits and multiple genetic variants in one genetic region. The approximate F-distribution tests provide much more significant results than those of F-tests of univariate analysis and optimal sequence kernel association test (SKAT-O). Extensive simulations were performed to evaluate the false positive rates and power performance of the proposed models and tests. We show that the approximate F-distribution tests control the type I error rates very well. Overall, simultaneous analysis of multiple traits can increase power performance compared to an individual test of each trait. The proposed methods were applied to analyze (1) four lipid traits in eight European cohorts, and (2) three biochemical traits in the Trinity Students Study. The approximate F-distribution tests provide much more significant results than those of F-tests of univariate analysis and SKAT-O for the three biochemical traits. The approximate F-distribution tests of the proposed functional linear models are more sensitive than those of the traditional multivariate linear models that in turn are more sensitive than SKAT-O in the univariate case. The analysis of the four lipid traits and the three biochemical traits detects more association than SKAT-O in the univariate case. PMID:25809955
Multivariate Analysis for Animal Selection in Experimental Research
Pinto, Renan Mercuri; de Campos, Dijon Henrique Salomé; Tomasi, Loreta Casquel; Cicogna, Antonio Carlos; Okoshi, Katashi; Padovani, Carlos Roberto
2015-01-01
Background Several researchers seek methods for the selection of homogeneous groups of animals in experimental studies, a fact justified because homogeneity is an indispensable prerequisite for casualization of treatments. The lack of robust methods that comply with statistical and biological principles is the reason why researchers use empirical or subjective methods, influencing their results. Objective To develop a multivariate statistical model for the selection of a homogeneous group of animals for experimental research and to elaborate a computational package to use it. Methods The set of echocardiographic data of 115 male Wistar rats with supravalvular aortic stenosis (AoS) was used as an example of model development. Initially, the data were standardized, and became dimensionless. Then, the variance matrix of the set was submitted to principal components analysis (PCA), aiming at reducing the parametric space and at retaining the relevant variability. That technique established a new Cartesian system into which the animals were allocated, and finally the confidence region (ellipsoid) was built for the profile of the animals’ homogeneous responses. The animals located inside the ellipsoid were considered as belonging to the homogeneous batch; those outside the ellipsoid were considered spurious. Results The PCA established eight descriptive axes that represented the accumulated variance of the data set in 88.71%. The allocation of the animals in the new system and the construction of the confidence region revealed six spurious animals as compared to the homogeneous batch of 109 animals. Conclusion The biometric criterion presented proved to be effective, because it considers the animal as a whole, analyzing jointly all parameters measured, in addition to having a small discard rate. PMID:25651342
Multivariate meta-analysis of mixed outcomes: a Bayesian approach.
Bujkiewicz, Sylwia; Thompson, John R; Sutton, Alex J; Cooper, Nicola J; Harrison, Mark J; Symmons, Deborah P M; Abrams, Keith R
2013-09-30
Multivariate random effects meta-analysis (MRMA) is an appropriate way for synthesizing data from studies reporting multiple correlated outcomes. In a Bayesian framework, it has great potential for integrating evidence from a variety of sources. In this paper, we propose a Bayesian model for MRMA of mixed outcomes, which extends previously developed bivariate models to the trivariate case and also allows for combination of multiple outcomes that are both continuous and binary. We have constructed informative prior distributions for the correlations by using external evidence. Prior distributions for the within-study correlations were constructed by employing external individual patent data and using a double bootstrap method to obtain the correlations between mixed outcomes. The between-study model of MRMA was parameterized in the form of a product of a series of univariate conditional normal distributions. This allowed us to place explicit prior distributions on the between-study correlations, which were constructed using external summary data. Traditionally, independent 'vague' prior distributions are placed on all parameters of the model. In contrast to this approach, we constructed prior distributions for the between-study model parameters in a way that takes into account the inter-relationship between them. This is a flexible method that can be extended to incorporate mixed outcomes other than continuous and binary and beyond the trivariate case. We have applied this model to a motivating example in rheumatoid arthritis with the aim of incorporating all available evidence in the synthesis and potentially reducing uncertainty around the estimate of interest. PMID:23630081
Multivariate cluster analysis of forest fire events in Portugal
NASA Astrophysics Data System (ADS)
Tonini, Marj; Pereira, Mario; Vega Orozco, Carmen; Parente, Joana
2015-04-01
Portugal is one of the major fire-prone European countries, mainly due to its favourable climatic, topographic and vegetation conditions. Compared to the other Mediterranean countries, the number of events registered here from 1980 up to nowadays is the highest one; likewise, with respect to the burnt area, Portugal is the third most affected country. Portuguese mapped burnt areas are available from the website of the Institute for the Conservation of Nature and Forests (ICNF). This official geodatabase is the result of satellite measurements starting from the year 1990. The spatial information, delivered in shapefile format, provides a detailed description of the shape and the size of area burnt by each fire, while the date/time information relate to the ignition fire is restricted to the year of occurrence. In terms of a statistical formalism wildfires can be associated to a stochastic point process, where events are analysed as a set of geographical coordinates corresponding, for example, to the centroid of each burnt area. The spatio/temporal pattern of stochastic point processes, including the cluster analysis, is a basic procedure to discover predisposing factorsas well as for prevention and forecasting purposes. These kinds of studies are primarily focused on investigating the spatial cluster behaviour of environmental data sequences and/or mapping their distribution at different times. To include both the two dimensions (space and time) a comprehensive spatio-temporal analysis is needful. In the present study authors attempt to verify if, in the case of wildfires in Portugal, space and time act independently or if, conversely, neighbouring events are also closer in time. We present an application of the spatio-temporal K-function to a long dataset (1990-2012) of mapped burnt areas. Moreover, the multivariate K-function allowed checking for an eventual different distribution between small and large fires. The final objective is to elaborate a 3D
Correlative and multivariate analysis of increased radon concentration in underground laboratory.
Maletić, Dimitrije M; Udovičić, Vladimir I; Banjanac, Radomir M; Joković, Dejan R; Dragić, Aleksandar L; Veselinović, Nikola B; Filipović, Jelena
2014-11-01
The results of analysis using correlative and multivariate methods, as developed for data analysis in high-energy physics and implemented in the Toolkit for Multivariate Analysis software package, of the relations of the variation of increased radon concentration with climate variables in shallow underground laboratory is presented. Multivariate regression analysis identified a number of multivariate methods which can give a good evaluation of increased radon concentrations based on climate variables. The use of the multivariate regression methods will enable the investigation of the relations of specific climate variable with increased radon concentrations by analysis of regression methods resulting in 'mapped' underlying functional behaviour of radon concentrations depending on a wide spectrum of climate variables. PMID:25080439
Statistical analysis of multivariate atmospheric variables. [cloud cover
NASA Technical Reports Server (NTRS)
Tubbs, J. D.
1979-01-01
Topics covered include: (1) estimation in discrete multivariate distributions; (2) a procedure to predict cloud cover frequencies in the bivariate case; (3) a program to compute conditional bivariate normal parameters; (4) the transformation of nonnormal multivariate to near-normal; (5) test of fit for the extreme value distribution based upon the generalized minimum chi-square; (6) test of fit for continuous distributions based upon the generalized minimum chi-square; (7) effect of correlated observations on confidence sets based upon chi-square statistics; and (8) generation of random variates from specified distributions.
Turnover intention in new graduate nurses: a multivariate analysis
Beecroft, Pauline C; Dorey, Frederick; Wenten, Madé
2008-01-01
Title Turnover intention in new graduate nurses: a multivariate analysis Aim This paper is a report of a study to determine the relationship of new nurse turnover intent with individual characteristics, work environment variables and organizational factors and to compare new nurse turnover with actual turnover in the 18 months of employment following completion of a residency. Background Because of their influence on patient safety and health outcomes nurse turnover and turnover intent have received considerable attention worldwide. When nurse staffing is inadequate, especially during nursing shortages, unfavourable clinical outcomes have been documented. Method Prospective data collection took place from 1999 to 2006 with 889 new paediatric nurses who completed the same residency. Scores on study instruments were related to likelihood of turnover intent using logistic regression analysis models. Relationships between turnover intent and actual turnover were compared using Kaplan–Meier survivorship. Results The final model demonstrated that older respondents were more likely to have turnover intent if they did not get their ward choice. Also higher scores on work environment and organizational characteristics contributed to likelihood that the new nurse would not be in the turnover intent group. These factors distinguish a new nurse with turnover intent from one without 79% of the time. Increased seeking of social support was related to turnover intent and older new graduates were more likely to be in the turnover intent group if they did not get their ward choice. Conclusion When new graduate nurses are satisfied with their jobs and pay and feel committed to the organization, the odds against turnover intent decrease. What is already known about this topic There is concern in many countries about nurse turnover and the resulting effects on patient safety and quality of care. Decreasing ability to recruit experienced nurses has increased the emphasis on
Selective Exposure and Foreign News: A Multivariate Analysis.
ERIC Educational Resources Information Center
Kim, Hyun Kap
This multivariate study examined attitudinal and demographic variables affecting the degree of foreign news exposure on the basis of the data collected from 102 daily newspaper readers in Carbondale, Illinois. The data were obtained in personal interviews with the respondents. The ultimate goal of the study was to contribute to the investigation…
Bayesian Analysis of Multivariate Probit Models with Surrogate Outcome Data
ERIC Educational Resources Information Center
Poon, Wai-Yin; Wang, Hai-Bin
2010-01-01
A new class of parametric models that generalize the multivariate probit model and the errors-in-variables model is developed to model and analyze ordinal data. A general model structure is assumed to accommodate the information that is obtained via surrogate variables. A hybrid Gibbs sampler is developed to estimate the model parameters. To…
A Multivariate Generalizability Analysis of the Multistate Bar Examination
ERIC Educational Resources Information Center
Yin, Ping
2005-01-01
The main purpose of this study is to examine the content structure of the Multistate Bar Examination (MBE) using the "table of specifications" model from the perspective of multivariate generalizability theory. Specifically, using MBE data collected over different years (six administrations: three from the February test and three from July test),…
Hydrogeochemical characteristics of groundwater in Latvia using multivariate statistical analysis
NASA Astrophysics Data System (ADS)
Retike, Inga; Kalvans, Andis; Bikse, Janis; Popovs, Konrads; Babre, Alise
2015-04-01
product between the two previously named clusters. Groundwater in cluster 2, 6 and 7 is considered to be a result of carbonate weathering with some addition of sea salts or gypsum dissolution. As a conclusion, the highest or lowest concentrations of some trace elements in groundwater was found out to be strongly associated with certain clusters. For example, Cluster 9 represents gypsum dissolution and has the highest concentrations of F, Sr, Rb, Li and the lowest levels of Ba. It can be also concluded that multivariate statistical analysis of major components can be used as an exploratory and predictive tool to identify groundwater objects with high possibility of elevated or reduced concentrations of harmful or essential trace elements. The research is supported by the European Union through the ESF Mobilitas grant No MJD309 and the European Regional Development Fund project Nr.2013/0054/2DP/2.1.1.1.0/13/APIA/VIAA/007 and NRP project EVIDENnT project "Groundwater and climate scenarios" subproject "Groundwater Research".
Hirozawa, Anne M; Montez-Rath, Maria E; Johnson, Elizabeth C; Solnit, Stephen A; Drennan, Michael J; Katz, Mitchell H; Marx, Rani
2016-01-01
We compared prospective risk adjustment models for adjusting patient panels at the San Francisco Department of Public Health. We used 4 statistical models (linear regression, two-part model, zero-inflated Poisson, and zero-inflated negative binomial) and 4 subsets of predictor variables (age/gender categories, chronic diagnoses, homelessness, and a loss to follow-up indicator) to predict primary care visit frequency. Predicted visit frequency was then used to calculate patient weights and adjusted panel sizes. The two-part model using all predictor variables performed best (R = 0.20). This model, designed specifically for safety net patients, may prove useful for panel adjustment in other public health settings. PMID:27576054
Multivariate Meta-Analysis of Preference-Based Quality of Life Values in Coronary Heart Disease
Stevanović, Jelena; Pechlivanoglou, Petros; Kampinga, Marthe A.; Krabbe, Paul F. M.; Postma, Maarten J.
2016-01-01
Background There are numerous health-related quality of life (HRQol) measurements used in coronary heart disease (CHD) in the literature. However, only values assessed with preference-based instruments can be directly applied in a cost-utility analysis (CUA). Objective To summarize and synthesize instrument-specific preference-based values in CHD and the underlying disease-subgroups, stable angina and post-acute coronary syndrome (post-ACS), for developed countries, while accounting for study-level characteristics, and within- and between-study correlation. Methods A systematic review was conducted to identify studies reporting preference-based values in CHD. A multivariate meta-analysis was applied to synthesize the HRQoL values. Meta-regression analyses examined the effect of study level covariates age, publication year, prevalence of diabetes and gender. Results A total of 40 studies providing preference-based values were detected. Synthesized estimates of HRQoL in post-ACS ranged from 0.64 (Quality of Well-Being) to 0.92 (EuroQol European”tariff”), while in stable angina they ranged from 0.64 (Short form 6D) to 0.89 (Standard Gamble). Similar findings were observed in estimates applying to general CHD. No significant improvement in model fit was found after adjusting for study-level covariates. Large between-study heterogeneity was observed in all the models investigated. Conclusions The main finding of our study is the presence of large heterogeneity both within and between instrument-specific HRQoL values. Current economic models in CHD ignore this between-study heterogeneity. Multivariate meta-analysis can quantify this heterogeneity and offers the means for uncertainty around HRQoL values to be translated to uncertainty in CUAs. PMID:27011260
Confirmatory Factor Analysis of Transfer Student Adjustment
ERIC Educational Resources Information Center
Young, Jacob T. N.; Litzler, Elizabeth
2013-01-01
Transfer students' adjustment to college has received substantial attention by researchers. This focus has predominately investigated the observation of "transfer shock": a decrease in grade point average (GPA) experienced after transferring. In response to the persistent focus on transfer shock, growing attention has been directed…
Multivariate Stable Isotope Analysis to Determine Linkages between Benzocaine Seizures
NASA Astrophysics Data System (ADS)
Kemp, H. F.; Meier-Augenstein, W.; Collins, M.; Salouros, H.; Cunningham, A.; Harrison, M.
2012-04-01
In July 2010, a woman was jailed for nine years in the UK after the prosecution successfully argued that attempting to import a cutting agent was proof of involvement in a conspiracy to supply Cocaine. That landmark ruling provided law enforcement agencies with much greater scope to tackle those involved in this aspect of the drug trade, specifically targeting those importing the likes of benzocaine or lidocaine. Huge quantities of these compounds are imported into the UK and between May and August 2010, four shipments of Benzocaine amounting to more then 4 tons had been seized as part of Operation Kitley, a joint initiative between the UK Border Agency and the Serious Organised Crime Agency (SOCA). By diluting cocaine, traffickers can make it go a lot further for very little cost, leading to huge profits. In recent years, dealers have moved away from inert substances, like sugar and baby milk powder, in favour of active pharmaceutical ingredients (APIs), including anaesthetics like Benzocaine and Lidocaine. Both these mimic the numbing effect of cocaine, and resemble it closely in colour, texture and some chemical behaviours, making it easier to conceal the fact that the drug has been diluted. API cutting agents have helped traffickers to maintain steady supplies in the face of successful interdiction and even expand the market in the UK, particularly to young people aged from their mid teens to early twenties. From importation to street-level, the purity of the drug can be reduced up to a factor of 80 and street level cocaine can have a cocaine content as low as 1%. In view of the increasing use of Benzocaine as cutting agent for Cocaine, a study was carried out to investigate if 2H, 13C, 15N and 18O stable isotope signatures could be used in conjunction with multivariate chemometric data analysis to determine potential linkage between benzocaine exhibits seized from different locations or individuals to assist with investigation and prosecution of drug
A Primer on Multivariate Analysis of Variance (MANOVA) for Behavioral Scientists
ERIC Educational Resources Information Center
Warne, Russell T.
2014-01-01
Reviews of statistical procedures (e.g., Bangert & Baumberger, 2005; Kieffer, Reese, & Thompson, 2001; Warne, Lazo, Ramos, & Ritter, 2012) show that one of the most common multivariate statistical methods in psychological research is multivariate analysis of variance (MANOVA). However, MANOVA and its associated procedures are often not…
Estimating an Effect Size in One-Way Multivariate Analysis of Variance (MANOVA)
ERIC Educational Resources Information Center
Steyn, H. S., Jr.; Ellis, S. M.
2009-01-01
When two or more univariate population means are compared, the proportion of variation in the dependent variable accounted for by population group membership is eta-squared. This effect size can be generalized by using multivariate measures of association, based on the multivariate analysis of variance (MANOVA) statistics, to establish whether…
Analysis techniques for multivariate root loci. [a tool in linear control systems
NASA Technical Reports Server (NTRS)
Thompson, P. M.; Stein, G.; Laub, A. J.
1980-01-01
Analysis and techniques are developed for the multivariable root locus and the multivariable optimal root locus. The generalized eigenvalue problem is used to compute angles and sensitivities for both types of loci, and an algorithm is presented that determines the asymptotic properties of the optimal root locus.
Multivariate analysis of elemental chemistry as a robust biosignature
NASA Astrophysics Data System (ADS)
Storrie-Lombardi, M.; Nealson, K.
2003-04-01
The robotic detection of life in extraterrestrial settings (i.e., Mars, Europa, etc.) would be greatly simplified if analysis could be accomplished in the absence of direct mechanical manipulation of a sample. It would also be preferable to employ a fundamental physico-chemical phenomenon as a biosignature and depend less on the particular manifestations of life on Earth (i.e. to employ non-earthcentric methods). One such approach, which we put forward here, is that of elemental composition, a reflection of the use of specific chemical elements for the construction of living systems. Using appropriate analyses (over the proper spatial scales), it should be possible to see deviations from the geological background (mineral and geochemical composition of the crust), and identify anomalies that would indicate sufficient deviation from the norm as to indicate a possible living system. To this end, over the past four decades elemental distributions have been determined for the sun, the interstellar medium, seawater, the crust of the Earth, carbonaceous chondrite meteorites, bacteria, plants, animals, and human beings. Such data can be relatively easily obtained for samples of a variety of types using a technique known as laser-induced breakdown spectroscopy (LIBS), which employs a high energy laser to ablate a portion of a sample, and then determine elemental composition using remote optical spectroscopy. However, the elements commonly associated with living systems (H, C, O, and N), while useful for detecting extant life, are relatively volatile and are not easily constrained across geological time scales. This minimizes their utility as fossil markers of ancient life. We have investigated the possibility of distinguishing the distributions of less volatile elements in a variety of biological materials from the distributions found in carbonaceous chondrites and the Earth’s crust using principal component analysis (PCA), a classical multivariate analysis technique
Analysis of Forest Foliage Using a Multivariate Mixture Model
NASA Technical Reports Server (NTRS)
Hlavka, C. A.; Peterson, David L.; Johnson, L. F.; Ganapol, B.
1997-01-01
Data with wet chemical measurements and near infrared spectra of ground leaf samples were analyzed to test a multivariate regression technique for estimating component spectra which is based on a linear mixture model for absorbance. The resulting unmixed spectra for carbohydrates, lignin, and protein resemble the spectra of extracted plant starches, cellulose, lignin, and protein. The unmixed protein spectrum has prominent absorption spectra at wavelengths which have been associated with nitrogen bonds.
Systematic wavelength selection for improved multivariate spectral analysis
Thomas, Edward V.; Robinson, Mark R.; Haaland, David M.
1995-01-01
Methods and apparatus for determining in a biological material one or more unknown values of at least one known characteristic (e.g. the concentration of an analyte such as glucose in blood or the concentration of one or more blood gas parameters) with a model based on a set of samples with known values of the known characteristics and a multivariate algorithm using several wavelength subsets. The method includes selecting multiple wavelength subsets, from the electromagnetic spectral region appropriate for determining the known characteristic, for use by an algorithm wherein the selection of wavelength subsets improves the model's fitness of the determination for the unknown values of the known characteristic. The selection process utilizes multivariate search methods that select both predictive and synergistic wavelengths within the range of wavelengths utilized. The fitness of the wavelength subsets is determined by the fitness function F=.function.(cost, performance). The method includes the steps of: (1) using one or more applications of a genetic algorithm to produce one or more count spectra, with multiple count spectra then combined to produce a combined count spectrum; (2) smoothing the count spectrum; (3) selecting a threshold count from a count spectrum to select these wavelength subsets which optimize the fitness function; and (4) eliminating a portion of the selected wavelength subsets. The determination of the unknown values can be made: (1) noninvasively and in vivo; (2) invasively and in vivo; or (3) in vitro.
Greenland Scotland overflow studied by hydro-chemical multivariate analysis
NASA Astrophysics Data System (ADS)
Fogelqvist, E.; Blindheim, J.; Tanhua, T.; Østerhus, S.; Buch, E.; Rey, F.
2003-01-01
Hydrographic, nutrient and halocarbon tracer data collected in July-August 1994 in the Norwegian Sea, the Faroe Bank Channel (FBC), the Iceland and Irminger Basins and the Iceland Sea are presented. Special attention was given to the overflow waters over the Iceland-Scotland Ridge (ISOW). The Iceland-Scottland overflow water (ISOW) was identified along its pathway in the Iceland Basin, and entrainment of overlying water masses was quantified by multivariate analysis (MVA) using principal component analysis (PCA) and Partial Least Square (PLS) calibration. It was concluded that the deeper portion of the ISOW in the FBC was a mixture of about equal parts of Norwegian Sea Deep Water (NSDW) and Norwegian Sea Arctic Intermediate Water (NSAIW). The mixing development of ISOW during its descent in the Iceland Basin was analysed in three sections across the plume. In the southern section at 61°N, where the ISOW core was observed at 2300 m depth, the fraction of waters originating north of the ridge was assessed to be 54%. MVA assessed the fractional composition of the ISOW to be 21% NSDW, 22% NSAIW, 18% Northeast Atlantic Water (NEAW), 11% Modified East Icelandic Water, 25% Labrador Sea Water (LSW) and 3% North East Atlantic Deep Water. It may be noted that the fraction of NEAW is of the same volume as the NSDW. On its further path around the Reykjanes Ridge, the ISOW mixed mainly with LSW, and at 63°N in the Irminger Basin, it was warmer and fresher ( θ=2.8°C and S=34.92) than at 61°N east of the ridge (θ=2.37° C, S=34.97) . The most intensive mixing occurred immediately west of the FBC, probably due to high velocity of the overflow plume through the channel, where annual velocity means exceeded 1.1 m s -1. This resulted in shear instabilities towards the overlying Atlantic waters and cross-stream velocities exceeding 0.3 m s -1 in the bottom boundary layer. The role of NSAIW as a component of ISOW is increasing. Being largely a product of winter convection in the
Multivariate longitudinal data analysis with mixed effects hidden Markov models.
Raffa, Jesse D; Dubin, Joel A
2015-09-01
Multiple longitudinal responses are often collected as a means to capture relevant features of the true outcome of interest, which is often hidden and not directly measurable. We outline an approach which models these multivariate longitudinal responses as generated from a hidden disease process. We propose a class of models which uses a hidden Markov model with separate but correlated random effects between multiple longitudinal responses. This approach was motivated by a smoking cessation clinical trial, where a bivariate longitudinal response involving both a continuous and a binomial response was collected for each participant to monitor smoking behavior. A Bayesian method using Markov chain Monte Carlo is used. Comparison of separate univariate response models to the bivariate response models was undertaken. Our methods are demonstrated on the smoking cessation clinical trial dataset, and properties of our approach are examined through extensive simulation studies. PMID:25761965
A regularized multivariate regression approach for eQTL analysis
Zhang, Hexin; Zhang, Yuzheng; Hsu, Li; Wang, Pei
2013-01-01
Expression quantitative trait loci (eQTLs) are genomic loci that regulate expression levels of mRNAs or proteins. Understanding these regulatory provides important clues to biological pathways that underlie diseases. In this paper, we propose a new statistical method, GroupRemMap, for identifying eQTLs. We model the relationship between gene expression and single nucleotide variants (SNVs) through multivariate linear regression models, in which gene expression levels are responses and SNV genotypes are predictors. To handle the high-dimensionality as well as to incorporate the intrinsic group structure of SNVs, we introduce a new regularization scheme to (1) control the overall sparsity of the model; (2) encourage the group selection of SNVs from the same gene; and (3) facilitate the detection of trans-hub-eQTLs. We apply the proposed method to the colorectal and breast cancer data sets from The Cancer Genome Atlas (TCGA), and identify several biologically interesting eQTLs. These findings may provide insight into biological processes associated with cancers and generate hypotheses for future studies. PMID:26085849
Suicidal ideation among Canadian youth: a multivariate analysis.
Peter, Tracey; Roberts, Lance W; Buzdugan, Raluca
2008-01-01
A multivariate model was developed incorporating various socio-demographic, social-environmental, and social-psychological factors in an attempt to predict suicidal ideation among Canadian youth. The main research objective sought to determine what socially based factors elevate or reduce suicidal ideation within this population. Using data from the National Longitudinal Study of Children and Youth-Cycle 5 (2003), a cross-sectional sample of 1,032 was used to empirically identify various social determinants of suicidal ideation among youth between the ages of 12 and 15. Results reveal statistically significant correlations between suicide ideation and some lesser examined socially based measures. In particular, ability to communicate feelings, negative attachment to parents/guardians, taunting/bullying or abuse, and presence of deviant peers were significant predictors of suicidal ideation. As expected, depression/anxiety, gender, and age were also correlated with thoughts of suicide. Research findings should help foster a better understanding toward the social elements of suicide and provide insight into how suicide prevention strategies may be improved through an increased emphasis on substance use education, direct targeting of dysfunctional families and deviant peer groups, and exploring more avenues of self-expression for youth. PMID:18576207
The Potential of Multivariate Analysis in Assessing Students' Attitude to Curriculum Subjects
ERIC Educational Resources Information Center
Gaotlhobogwe, Michael; Laugharne, Janet; Durance, Isabelle
2011-01-01
Background: Understanding student attitudes to curriculum subjects is central to providing evidence-based options to policy makers in education. Purpose: We illustrate how quantitative approaches used in the social sciences and based on multivariate analysis (categorical Principal Components Analysis, Clustering Analysis and General Linear…
Using Interactive Graphics to Teach Multivariate Data Analysis to Psychology Students
ERIC Educational Resources Information Center
Valero-Mora, Pedro M.; Ledesma, Ruben D.
2011-01-01
This paper discusses the use of interactive graphics to teach multivariate data analysis to Psychology students. Three techniques are explored through separate activities: parallel coordinates/boxplots; principal components/exploratory factor analysis; and cluster analysis. With interactive graphics, students may perform important parts of the…
Guo, Diansheng; Gahegan, Mark; MacEachren, Alan M.; Zhou, Biliang
2009-01-01
The discovery, interpretation, and presentation of multivariate spatial patterns are important for scientific understanding of complex geographic problems. This research integrates computational, visual, and cartographic methods together to detect and visualize multivariate spatial patterns. The integrated approach is able to: (1) perform multivariate analysis, dimensional reduction, and data reduction (summarizing a large number of input data items in a moderate number of clusters) with the Self-Organizing Map (SOM); (2) encode the SOM result with a systematically designed color scheme; (3) visualize the multivariate patterns with a modified Parallel Coordinate Plot (PCP) display and a geographic map (GeoMap); and (4) support human interactions to explore and examine patterns. The research shows that such “mixed initiative” methods (computational and visual) can mitigate each other’s weakness and collaboratively discover complex patterns in large geographic datasets, in an effective and efficient way. PMID:19960118
Soviet Jews in the United States: an analysis of their linguistic and economic adjustment.
Chiswick, B R
1993-01-01
"This article reviews the literature and analyzes 1980 Census data to study English language fluency and earnings among Soviet Jews [in the United States]. The literature review reveals: 1) the importance of employment and attaining premigration occupational status for self-esteem; 2) the difficulty of adjusting to the wide range of choices in the United States; 3) the greater difficulty and economic importance of learning English; and 4) the rapid linguistic and economic mobility. The multivariate analysis supports the latter two points. Soviet Jews have a difficult initial adjustment, but after five years in the United States they achieve parity in English fluency and earnings with other European immigrants, ceteris paribus." PMID:12318146
HRMAS-NMR spectroscopy and multivariate analysis meat characterisation.
Ritota, Mena; Casciani, Lorena; Failla, Sebastiana; Valentini, Massimiliano
2012-12-01
¹H-High resolution magic angle spinning-nuclear magnetic resonance spectroscopy was employed to gain the metabolic profile of longissimus dorsi and semitendinosus muscles of four different breeds: Chianina, Holstein Friesian, Maremmana and Buffalo. Principal component analysis, partial least squares projection to latent structure - discriminant analysis and orthogonal partial least squares projection to latent structure - discriminant analysis were used to build models capable of discriminating the muscle type according to the breed. Data analysis led to an excellent classification for Buffalo and Chianina, while for Holstein Friesian the separation was lower. In the case of Maremmana the use of intelligent bucketing was necessary due to some resonances shifting allowed improvement of the discrimination ability. Finally, by using the Variable Importance in Projection values the metabolites relevant for the classification were identified. PMID:22819725
Seasonal foods of coyotes in southeastern Idaho: a multivariate analysis
MacCracken, J.G.; Hansen, R.M.
1982-03-01
Seasonal foods of coyotes (Canis latrans) inhabiting the Idaho National Engineering Laboratory site were examined using step-wise discriminant analysis. Significant differences (P<0.01) were detected among seasons in food consumption by coyotes, where univariate statistical analysis failed to recognize differences. Recognition of seasonal changes in foods consumed by coyotes is essential to understanding coyote feeding strategies. The role opportunistic behavior plays in coyote food selection on the study area is questioned.
NASA Astrophysics Data System (ADS)
Kovács, József; Bodnár, Nikolett; Török, Ákos
2016-01-01
The paper presents the evaluation of engineering geological laboratory test results of core drillings along the new metro line (line 4) in Budapest by using a multivariate data analysis. A data set of 30 core drillings with a total coring length of over 1500 meters was studied. Of the eleven engineering geological parameters considered in this study, only the five most reliable (void ratio, dry bulk density, angle of internal friction, cohesion and compressive strength) representing 1260 data points were used for multivariate (cluster and discriminant) analyses. To test the results of the cluster analysis discriminant analysis was used. The results suggest that the use of multivariate analyses allows the identification of different groups of sediments even when the data sets are overlapping and contain several uncertainties. The tests also prove that the use of these methods for seemingly very scattered parameters is crucial in obtaining reliable engineering geological data for design.
On variance estimate for covariate adjustment by propensity score analysis.
Zou, Baiming; Zou, Fei; Shuster, Jonathan J; Tighe, Patrick J; Koch, Gary G; Zhou, Haibo
2016-09-10
Propensity score (PS) methods have been used extensively to adjust for confounding factors in the statistical analysis of observational data in comparative effectiveness research. There are four major PS-based adjustment approaches: PS matching, PS stratification, covariate adjustment by PS, and PS-based inverse probability weighting. Though covariate adjustment by PS is one of the most frequently used PS-based methods in clinical research, the conventional variance estimation of the treatment effects estimate under covariate adjustment by PS is biased. As Stampf et al. have shown, this bias in variance estimation is likely to lead to invalid statistical inference and could result in erroneous public health conclusions (e.g., food and drug safety and adverse events surveillance). To address this issue, we propose a two-stage analytic procedure to develop a valid variance estimator for the covariate adjustment by PS analysis strategy. We also carry out a simple empirical bootstrap resampling scheme. Both proposed procedures are implemented in an R function for public use. Extensive simulation results demonstrate the bias in the conventional variance estimator and show that both proposed variance estimators offer valid estimates for the true variance, and they are robust to complex confounding structures. The proposed methods are illustrated for a post-surgery pain study. Copyright © 2016 John Wiley & Sons, Ltd. PMID:26999553
NASA Technical Reports Server (NTRS)
Hague, D. S.; Vanderberg, J. D.; Woodbury, N. W.
1974-01-01
A method for rapidly examining the probable applicability of weight estimating formulae to a specific aerospace vehicle design is presented. The Multivariate Analysis Retrieval and Storage System (MARS) is comprised of three computer programs which sequentially operate on the weight and geometry characteristics of past aerospace vehicles designs. Weight and geometric characteristics are stored in a set of data bases which are fully computerized. Additional data bases are readily added to the MARS system and/or the existing data bases may be easily expanded to include additional vehicles or vehicle characteristics.
Clegg, Samuel M; Barefield, James E; Wiens, Roger C; Sklute, Elizabeth; Dyare, Melinda D
2008-01-01
Quantitative analysis with LIBS traditionally employs calibration curves that are complicated by the chemical matrix effects. These chemical matrix effects influence the LIBS plasma and the ratio of elemental composition to elemental emission line intensity. Consequently, LIBS calibration typically requires a priori knowledge of the unknown, in order for a series of calibration standards similar to the unknown to be employed. In this paper, three new Multivariate Analysis (MV A) techniques are employed to analyze the LIBS spectra of 18 disparate igneous and highly-metamorphosed rock samples. Partial Least Squares (PLS) analysis is used to generate a calibration model from which unknown samples can be analyzed. Principal Components Analysis (PCA) and Soft Independent Modeling of Class Analogy (SIMCA) are employed to generate a model and predict the rock type of the samples. These MV A techniques appear to exploit the matrix effects associated with the chemistries of these 18 samples.
Multivariate Genetic Analysis of Learning and Early Reading Development
ERIC Educational Resources Information Center
Byrne, Brian; Wadsworth, Sally; Boehme, Kristi; Talk, Andrew C.; Coventry, William L.; Olson, Richard K.; Samuelsson, Stefan; Corley, Robin
2013-01-01
The genetic factor structure of a range of learning measures was explored in twin children, recruited in preschool and followed to Grade 2 ("N"?=?2,084). Measures of orthographic learning and word reading were included in the analyses to determine how these patterned with the learning processes. An exploratory factor analysis of the…
Sun, Hui; Wang, Huiyu; Zhang, Aihua; Yan, Guangli; Han, Ying; Li, Yuan; Wu, Xiuhong; Meng, Xiangcai; Wang, Xijun
2016-01-01
Background: As herbal medicines have an important position in health care systems worldwide, their current assessment, and quality control are a major bottleneck. Cortex Phellodendri chinensis (CPC) and Cortex Phellodendri amurensis (CPA) are widely used in China, however, how to identify species of CPA and CPC has become urgent. Materials and Methods: In this study, multivariate analysis approach was performed to the investigation of chemical discrimination of CPA and CPC. Results: Principal component analysis showed that two herbs could be separated clearly. The chemical markers such as berberine, palmatine, phellodendrine, magnoflorine, obacunone, and obaculactone were identified through the orthogonal partial least squared discriminant analysis, and were identified tentatively by the accurate mass of quadruple-time-of-flight mass spectrometry. A total of 29 components can be used as the chemical markers for discrimination of CPA and CPC. Of them, phellodenrine is significantly higher in CPC than that of CPA, whereas obacunone and obaculactone are significantly higher in CPA than that of CPC. Conclusion: The present study proves that multivariate analysis approach based chemical analysis greatly contributes to the investigation of CPA and CPC, and showed that the identified chemical markers as a whole should be used to discriminate the two herbal medicines, and simultaneously the results also provided chemical information for their quality assessment. SUMMARY Multivariate analysis approach was performed to the investigate the herbal medicineThe chemical markers were identified through multivariate analysis approachA total of 29 components can be used as the chemical markers. UPLC-Q/TOF-MS-based multivariate analysis method for the herbal medicine samples Abbreviations used: CPC: Cortex Phellodendri chinensis, CPA: Cortex Phellodendri amurensis, PCA: Principal component analysis, OPLS-DA: Orthogonal partial least squares discriminant analysis, BPI: Base peaks ion
Characterization of Nuclear Fuel using Multivariate Statistical Analysis
Robel, M; Robel, M; Robel, M; Kristo, M J; Kristo, M J
2007-11-27
Various combinations of reactor type and fuel composition have been characterized using principle components analysis (PCA) of the concentrations of 9 U and Pu isotopes in the 10 fuel as a function of burnup. The use of PCA allows the reduction of the 9-dimensional data (isotopic concentrations) into a 3-dimensional approximation, giving a visual representation of the changes in nuclear fuel composition with burnup. Real-world variation in the concentrations of {sup 234}U and {sup 236}U in the fresh (unirradiated) fuel was accounted for. The effects of reprocessing were also simulated. The results suggest that, 15 even after reprocessing, Pu isotopes can be used to determine both the type of reactor and the initial fuel composition with good discrimination. Finally, partial least squares discriminant analysis (PSLDA) was investigated as a substitute for PCA. Our results suggest that PLSDA is a better tool for this application where separation between known classes is most important.
Multivariate analysis of Buteo nest site selection in Washington
Smith, D.G.; Bechard, M.; Knight, R.L.; Fitzner, R.E.
1983-03-01
Raptor breeding populations of grasslands and semi-arid grasslands of western North America include varying densities of three Buteo species, the Red-tailed Hawk (Buteo jamaicensis), Ferruginous Hawk (Buteo regalis) and Swainson's Hawk (Buteo swainsoni). These three species are behaviorially rather similar in their diurnal activity patterns and foraging habitats and exhibit similar diets comprised of small mammals and birds. Principal component analysis and discriminant function analysis were used to describe key similarities and differences in nest site selection of the three Buteo species. Differences in nest site selection may offer an explanation of how the three Buteo species may nest sympatrically and at times successfully at distances of less than 0.5 km despite apparent similarities in use of other resources. 15 refs., 1 fig., 9 tabs.
Multivariate Analysis of the Ecoregion Delineation for Aquatic Systems
NASA Astrophysics Data System (ADS)
Jenerette, G. Darrel; Lee, Jay; Waller, David W.; Carlson, Robert E.
2002-01-01
The ecoregion concept is a popular method of understanding the spatial distribution of the environment', however, it has yet to be adequately demonstrated that the environment is distributed in accordance with these bounded units. In this paper, we generated a testable hypothesis based on the current usage of ecoregions: the ecoregion classification will allow for discrimination between lakes of different water quality. The ecoregion classification should also be more effective better than a comparably scaled classification based on political boundaries, land-use class, or random grouping. To test this hypothesis we used the Environmental Monitoring and Assessment Program (EMAP) lake water chemistry data from the northeast United States. The water chemistry data were reduced to four components using principal component analysis. For comparison to an optimal grouping of these data we used K-means cluster analysis to define the extent at which these lakes could be segregated into distinct classes. Jackknifed discriminant analysis was used to determine the classification rate of ecoregions, the three alternative spatial classification methods, and the clustering algorithm. The classification based on ecoregions was successful for 35% of the lakes included in this study, in comparison to the clustered groups accuracy of 98%. These results suggest that the large scale spatial distribution of ecosystem types is more complicated than that suggested by the present ecoregion boundaries. Further tests of ecoregion delineations are needed and alternative large-scale management strategies should be investigated.
Multivariable Discriminant Analysis for the Differential Diagnosis of Microcytic Anemia
Urrechaga, Eloísa; Aguirre, Urko; Izquierdo, Silvia
2013-01-01
Introduction. Iron deficiency anemia and thalassemia are the most common causes of microcytic anemia. Powerful statistical computer programming enables sensitive discriminant analyses to aid in the diagnosis. We aimed at investigating the performance of the multiple discriminant analysis (MDA) to the differential diagnosis of microcytic anemia. Methods. The training group was composed of 200 β-thalassemia carriers, 65 α-thalassemia carriers, 170 iron deficiency anemia (IDA), and 45 mixed cases of thalassemia and acute phase response or iron deficiency. A set of potential predictor parameters that could detect differences among groups were selected: Red Blood Cells (RBC), hemoglobin (Hb), mean cell volume (MCV), mean cell hemoglobin (MCH), and RBC distribution width (RDW). The functions obtained with MDA analysis were applied to a set of 628 consecutive patients with microcytic anemia. Results. For classifying patients into two groups (genetic anemia and acquired anemia), only one function was needed; 87.9% β-thalassemia carriers, and 83.3% α-thalassemia carriers, and 72.1% in the mixed group were correctly classified. Conclusion. Linear discriminant functions based on hemogram data can aid in differentiating between IDA and thalassemia, so samples can be efficiently selected for further analysis to confirm the presence of genetic anemia. PMID:24093062
The analysis of multivariate longitudinal data: A review
Verbeke, Geert; Fieuws, Steffen; Molenberghs, Geert; Davidian, Marie
2012-01-01
Longitudinal experiments often involve multiple outcomes measured repeatedly within a set of study participants. While many questions can be answered by modeling the various outcomes separately, some questions can only be answered in a joint analysis of all of them. In this paper, we will present a review of the many approaches proposed in the statistical literature. Four main model families will be presented, discussed and compared. Focus will be on presenting advantages and disadvantages of the different models rather than on the mathematical or computational details. PMID:22523185
Scaling analysis of multi-variate intermittent time series
NASA Astrophysics Data System (ADS)
Kitt, Robert; Kalda, Jaan
2005-08-01
The scaling properties of the time series of asset prices and trading volumes of stock markets are analysed. It is shown that similar to the asset prices, the trading volume data obey multi-scaling length-distribution of low-variability periods. In the case of asset prices, such scaling behaviour can be used for risk forecasts: the probability of observing next day a large price movement is (super-universally) inversely proportional to the length of the ongoing low-variability period. Finally, a method is devised for a multi-factor scaling analysis. We apply the simplest, two-factor model to equity index and trading volume time series.
Safer approaches and landings: A multivariate analysis of critical factors
NASA Astrophysics Data System (ADS)
Heinrich, Durwood J.
The approach-and-landing phases of flight represent 27% of mission time while resulting in 61 of the accidents and 39% of the fatalities. The landing phase itself represents only 1% of flight time but claims 45% of the accidents. Inadequate crew situation awareness (SA), crew resource management (CRM), and crew decision-making (DM) have been implicated in 51%, 63%, and 73% respectively of these accidents. The human factors constructs of SA, CRM, and DM were explored; a comprehensive definition of SA was proposed; and a "proactive defense" safety strategy was recommended. Data from a 1997 analysis of worldwide fatal accidents by the Flight Safety Foundation (FSF) Approach-and-Landing Accident Reduction (ALAR) Task Force was used to isolate crew- and weather-related causal factors that lead to approach-and-landing accidents (ALAs). Logistic regression and decision tree analysis were used on samplings of NASA's Aviation Safety Reporting System (ASRS) incident records ("near misses") and the National Transportation Safety Board's (NTSB) accident reports to examine hypotheses regarding factors and factor combinations that can dramatically increase the opportunity for accidents. An effective scale of risk factors was introduced for use by crews to proactively counter safety-related error-chain situations.
Quantitative multivariate analysis of dynamic multicellular morphogenic trajectories.
White, Douglas E; Sylvester, Jonathan B; Levario, Thomas J; Lu, Hang; Streelman, J Todd; McDevitt, Todd C; Kemp, Melissa L
2015-07-01
Interrogating fundamental cell biology principles that govern tissue morphogenesis is critical to better understanding of developmental biology and engineering novel multicellular systems. Recently, functional micro-tissues derived from pluripotent embryonic stem cell (ESC) aggregates have provided novel platforms for experimental investigation; however elucidating the factors directing emergent spatial phenotypic patterns remains a significant challenge. Computational modelling techniques offer a unique complementary approach to probe mechanisms regulating morphogenic processes and provide a wealth of spatio-temporal data, but quantitative analysis of simulations and comparison to experimental data is extremely difficult. Quantitative descriptions of spatial phenomena across multiple systems and scales would enable unprecedented comparisons of computational simulations with experimental systems, thereby leveraging the inherent power of computational methods to interrogate the mechanisms governing emergent properties of multicellular biology. To address these challenges, we developed a portable pattern recognition pipeline consisting of: the conversion of cellular images into networks, extraction of novel features via network analysis, and generation of morphogenic trajectories. This novel methodology enabled the quantitative description of morphogenic pattern trajectories that could be compared across diverse systems: computational modelling of multicellular structures, differentiation of stem cell aggregates, and gastrulation of cichlid fish. Moreover, this method identified novel spatio-temporal features associated with different stages of embryo gastrulation, and elucidated a complex paracrine mechanism capable of explaining spatiotemporal pattern kinetic differences in ESC aggregates of different sizes. PMID:26095427
Multivariate analysis of parameters related to lake acidification in Quebec
Bobee, B.; Lachance, M.
1984-08-01
Physico-chemical data from 234 lakes were collected during the spring and summer of 1980 by the Quebec Ministry of the Environment, the Quebec Ministry of Recreation, Hunting and Fishing and the Canadian Wildlife Service. A statistical method, based on the joint use of factorial correspondence analysis and cluster analysis, was applied to these data to obtain a general picture of the spatial variability of a member of physico-chemical parameters related to the sensitivity or acidification of lakewaters. This method was first applied to the entire Quebec territory, and showed that the part of Quebec lying on the Canadian shield is especially vulnerable to acidification. The method also showed that the southwestern portion of this area of Quebec was more substantially affected by acid fallout. A detailed study of spatial variability over the shield area revealed the existence of greater spatial heterogeneity. More precisely, it was possible to pinpoint zones which are highly vulnerable to acid precipitation and zones whose lakes show clear signs of acidification resulting from such precipitation. These two statistical analyses led to a first general diagnosis on lake acidification in Quebec. They contribute to the rationalization of data acquisition in Quebec by delimitating zones where network density needs to be increased.
Widjaja, Effendi
2009-04-01
This paper describes the use of combined techniques, i.e. Raman spectral mapping, tape-lift, and multivariate data analysis, to extract chemical information of latent fingerprint and/or trace amounts of materials deposited in fingerprints. The tape-lift method was employed to lift trace particles, extrinsic materials, or sebum deposited on the finger of an individual after recent handling of such materials. The analysis of the tape-lifted materials was performed by Raman spectral mapping at a specific area. The collected mixture Raman spectra containing signals from lifting media and lifted materials was then deconvoluted using a powerful multivariate technique, namely band-target entropy minimization (BTEM). Three cases, i.e. a sebum-rich fingerprint after touching the forehead, a drug model comprising ibuprofen, L-arginine, and sodium bicarbonate, and an additive model comprising sucrose and aspartame were investigated. BTEM could recover all pure component spectra of both lifting media and tape-lifted materials. As such, all these test substances can be correctly identified using their unique pure Raman spectral signatures. In addition, the spatial distributions of all these identified components could also be determined. These combined three techniques hold promise as a new tool in forensic applications. PMID:19305929
Ghirardo, Andrea; Sørensen, Helle Aagaard; Petersen, Marianne; Jacobsen, Susanne; Søndergaard, Ib
2005-01-01
Matrix-assisted laser desorption/ionisation time-of-flight mass spectrometry and multivariate data analysis have been used for the determination of wheat quality at different stages of grain development. Wheat varieties with one of two different end-use qualities (i.e. suitable or not suitable for bread-making purposes) were investigated. The samples were collected from grains from 15 until 45 days post-anthesis (dpa). Gluten proteins from wheat grains were extracted and subsequently analysed by mass spectrometry. Discrimination partial least-squares regression and soft independent modelling of class analogy were used to determine the quality of new and unknown wheat samples. With these methods, we were able to predict correctly the end-use qualities at every stage investigated. This new fast technique, based on the rapidity of mass spectrometry combined with the objectivity of multivariate data analysis, offers a method that can replace the traditional rather time-consuming ones such as gel electrophoresis. This study focused on the determination of wheat quality at 15 dpa, when the grain is due for harvest 1 month later. PMID:15655793
Multivariate analysis of somatosensory evoked potential parameters in normal adults.
Strenge, H; Gundel, A
1983-01-01
Cervical and cortical somatosensory evoked potentials (SEP) to median nerve stimulation were recorded in 65 normal subjects. Absolute peak latencies and amplitudes of cervical components N9, P10, N11, N13, P17, and cortical components P16, N20, P25, and N35 were measured. By means of partial correlations the interdependency of SEP-features could be verified in addition to the well-known dependence on arm length and age. In certain respects our results replicate other studies finding significant correlations between age and latency of early SEP-components as well as inverse relations between age and cervical amplitudes. Further analysis disclosed high inter-correlations between the latencies and between the amplitudes of the cervical and cortical components also revealing a certain exceptional position of the positive wave P17. In contrast to an inverse relation of amplitude and latency of the cervical components there were positive correlations between the respective features in the cortical evoked response. The findings are discussed with regard to the current knowledge about the origins of the SEP-components. PMID:6667105
Multivariate analysis of regional differentials of nuptiality in Bangladesh.
Chowdhury, A A; Islam, M A
1981-01-01
The importance of socioeconomics differentials in nuptiality has occupied a very important position in recent demographic research. An effort has been made in this paper to find out the nature and extent of the causal relationship between the dependent variable--nuptiality, and its determinants. Our findings suggest that education may play a vital role in raising mean age at marriage. This may be done by extending free and compulsory mass and primary education throughout the country. It has further been observed that urbanization through economic development is a precondition to increase the literacy rate and hence female labor force participation in the country's economy. Thus proper education will increase the female employment rate which in turn will raise the age at marriage. Equal distribution of population and insurance schemes for childless couples may also indirectly put a positive effect on nuptiality. Finally, this paper provides a guideline for using the path analysis technique in determining the factors causing the changes and the effects of these factors on nuptiality in Bangladesh. However, caution should be made in taking into account the causal ordering of the indices. Different ordering may give different results. PMID:12312786
Multivariate analysis relating oil shale geochemical properties to NMR relaxometry
Birdwell, Justin E.; Washburn, Kathryn E.
2015-01-01
Low-field nuclear magnetic resonance (NMR) relaxometry has been used to provide insight into shale composition by separating relaxation responses from the various hydrogen-bearing phases present in shales in a noninvasive way. Previous low-field NMR work using solid-echo methods provided qualitative information on organic constituents associated with raw and pyrolyzed oil shale samples, but uncertainty in the interpretation of longitudinal-transverse (T1–T2) relaxometry correlation results indicated further study was required. Qualitative confirmation of peaks attributed to kerogen in oil shale was achieved by comparing T1–T2 correlation measurements made on oil shale samples to measurements made on kerogen isolated from those shales. Quantitative relationships between T1–T2 correlation data and organic geochemical properties of raw and pyrolyzed oil shales were determined using partial least-squares regression (PLSR). Relaxometry results were also compared to infrared spectra, and the results not only provided further confidence in the organic matter peak interpretations but also confirmed attribution of T1–T2 peaks to clay hydroxyls. In addition, PLSR analysis was applied to correlate relaxometry data to trace element concentrations with good success. The results of this work show that NMR relaxometry measurements using the solid-echo approach produce T1–T2 peak distributions that correlate well with geochemical properties of raw and pyrolyzed oil shales.
NASA Technical Reports Server (NTRS)
Leininger, G. G.; Borysiak, M. L.
1978-01-01
In the feedback control design of multivariable systems, closed loop performance evaluations must include the dynamic behavior of variables unavailable to the feedback controller. For the multivariable Nyquist array method, a set of sensitivity functions are proposed to simplify the adjustment of compensator parameters when the dynamic response of the unmeasurable output variables is unacceptable. A sensitivity study to improve thrust and turbine temperature responses for the Pratt-Whitney F100 turbofan engine demonstrates the utility of the proposed method.
Van Benthem, Mark Hilary; Mowry, Curtis Dale; Kotula, Paul Gabriel; Borek, Theodore Thaddeus, III
2010-09-01
Thermal decomposition of poly dimethyl siloxane compounds, Sylgard{reg_sign} 184 and 186, were examined using thermal desorption coupled gas chromatography-mass spectrometry (TD/GC-MS) and multivariate analysis. This work describes a method of producing multiway data using a stepped thermal desorption. The technique involves sequentially heating a sample of the material of interest with subsequent analysis in a commercial GC/MS system. The decomposition chromatograms were analyzed using multivariate analysis tools including principal component analysis (PCA), factor rotation employing the varimax criterion, and multivariate curve resolution. The results of the analysis show seven components related to offgassing of various fractions of siloxanes that vary as a function of temperature. Thermal desorption coupled with gas chromatography-mass spectrometry (TD/GC-MS) is a powerful analytical technique for analyzing chemical mixtures. It has great potential in numerous analytic areas including materials analysis, sports medicine, in the detection of designer drugs; and biological research for metabolomics. Data analysis is complicated, far from automated and can result in high false positive or false negative rates. We have demonstrated a step-wise TD/GC-MS technique that removes more volatile compounds from a sample before extracting the less volatile compounds. This creates an additional dimension of separation before the GC column, while simultaneously generating three-way data. Sandia's proven multivariate analysis methods, when applied to these data, have several advantages over current commercial options. It also has demonstrated potential for success in finding and enabling identification of trace compounds. Several challenges remain, however, including understanding the sources of noise in the data, outlier detection, improving the data pretreatment and analysis methods, developing a software tool for ease of use by the chemist, and demonstrating our belief that
Multivariate analysis of noise in genetic regulatory networks.
Tomioka, Ryota; Kimura, Hidenori; J Kobayashi, Tetsuya; Aihara, Kazuyuki
2004-08-21
Stochasticity is an intrinsic property of genetic regulatory networks due to the low copy numbers of the major molecular species, such as, DNA, mRNA, and regulatory proteins. Therefore, investigation of the mechanisms that reduce the stochastic noise is essential in understanding the reproducible behaviors of real organisms and is also a key to design synthetic genetic regulatory networks that can reliably work. We use an analytical and systematic method, the linear noise approximation of the chemical master equation along with the decoupling of a stoichiometric matrix. In the analysis of fluctuations of multiple molecular species, the covariance is an important measure of noise. However, usually the representation of a covariance matrix in the natural coordinate system, i.e. the copy numbers of the molecular species, is intractably complicated because reactions change copy numbers of more than one molecular species simultaneously. Decoupling of a stoichiometric matrix, which is a transformation of variables, significantly simplifies the representation of a covariance matrix and elucidates the mechanisms behind the observed fluctuations in the copy numbers. We apply our method to three types of fundamental genetic regulatory networks, that is, a single-gene autoregulatory network, a two-gene autoregulatory network, and a mutually repressive network. We have found that there are multiple noise components differently originating. Each noise component produces fluctuation in the characteristic direction. The resulting fluctuations in the copy numbers of the molecular species are the sum of these fluctuations. In the examples, the limitation of the negative feedback in noise reduction and the trade-off of fluctuations in multiple molecular species are clearly explained. The analytical representations show the full parameter dependence. Additionally, the validity of our method is tested by stochastic simulations. PMID:15246787
Integrated Analysis of Tropical Trees Growth: A Multivariate Approach
YÁÑEZ-ESPINOSA, LAURA; TERRAZAS, TERESA; LÓPEZ-MATA, LAURO
2006-01-01
• Background and Aims One of the problems analysing cause–effect relationships of growth and environmental factors is that a single factor could be correlated with other ones directly influencing growth. One attempt to understand tropical trees' growth cause–effect relationships is integrating research about anatomical, physiological and environmental factors that influence growth in order to develop mathematical models. The relevance is to understand the nature of the process of growth and to model this as a function of the environment. • Methods The relationships of Aphananthe monoica, Pleuranthodendron lindenii and Psychotria costivenia radial growth and phenology with environmental factors (local climate, vertical strata microclimate and physical and chemical soil variables) were evaluated from April 2000 to September 2001. The association among these groups of variables was determined by generalized canonical correlation analysis (GCCA), which considers the probable associations of three or more data groups and the selection of the most important variables for each data group. • Key Results The GCCA allowed determination of a general model of relationships among tree phenology and radial growth with climate, microclimate and soil factors. A strong influence of climate in phenology and radial growth existed. Leaf initiation and cambial activity periods were associated with maximum temperature and day length, and vascular tissue differentiation with soil moisture and rainfall. The analyses of individual species detected different relationships for the three species. • Conclusions The analyses of the individual species suggest that each one takes advantage in a different way of the environment in which they are growing, allowing them to coexist. PMID:16822807
Multivariate analysis of a small pleistocene catchment: tracing hydrological change
NASA Astrophysics Data System (ADS)
Boettcher, Steven; Merz, Christoph; Dannowski, Ralf
2013-04-01
The water budget of catchments in north-east Germany has decreased considerably over the last decades. Especially small catchments are affected due to the small amount of water stored within. Climate projections for the next decades hint to even more negative impacts on the water budgets of these catchments. Therefore, a new concept of water resource management for this region must be developed, including counter measures to extreme events such as low and high flow conditions. In order to manage a hydrological system one needs to know the typical behavior and be able to effectively counteract if needed. Within the network activity INKA-BB (Inovationsnetzwerk Klimaanpassung Brandenburg Berlin) dealing with possible adaptation measures to climate change in the Brandenburg and Berlin region, this study aims at identifying the typical hydraulic behavior of the Fredersdorfer Mühlenfließ catchment located north-east of Berlin as a basis for a sustainable water resource management concept. Established schemes are followed, including the application of numerical geochemical and hydraulic models as well as chemical graphical interpretation approaches. A common problem is the sparse spatial as well as temporal resolution of the data at hand. Here, these schemes are too inflexible and vague with respect to analyzing and parameterization of complex features used for identifying operative hydraulic-geochemical processes including intensive non-linear interactions. Hence, methods must be applied that are able to effectively utilize the limited information available. Ordination methods such as the Principle Component Analysis (PCA) or the non-linear Isometric Feature Mapping (Isomap) can provide such a tool. Ordination methods are used in order to derive a meaningful low-dimensional representation of a high-dimensional input data set. The approach is based on the hypothesis, that the amount of processes which explain the variance of the data is relative low although the
A Multivariate Analysis of Freshwater Variability over West Africa
NASA Astrophysics Data System (ADS)
Andam-Akorful, S. A.; He, X.; Ferreira, V. G.; Quaye-Ballard, J. A.
2015-12-01
As one of the most vulnerable regions to climate change, West Africa (WA) has since the 1970s suffered sustained reduction in rainfall amounts, leading to droughts and associated negative impacts on its water resources. Although rainfall rates have been reported to have experienced a degree of recovery, dry conditions persist. Additionally, the region faces perennial flooding, thus resulting in a highly variable hydrologic regime due to the extreme climate conditions. This therefore necessitates routine monitoring of the WA's freshwater reserves and its response to climate variations at the short and long term scales to aid sustainable use and management. However, this monitoring is hampered by data deficiency issues within the region. Consequently, dynamics leading to changes in water availability over the region are not completely understood. In this work, the recent flux and state of freshwater availability over WA from 1979 to 2013 is assessed by investigating the coupled variability of GRACE-derived terrestrial water storage (TWS) and its changes (TWSC) estimates with rainfall, evapotranspiration, and land surface air temperature (LSAT), as well as, major global and regional teleconnection indices using complex principal component analysis and wavelet transforms. Since GRACE covers a relatively short period, and thereby present challenges for long to medium term analyses, Artificial Neural Network (ANN) is employed to extend the GRACE series to 1979. The results from the ANN proved to be robust upon evaluation; spatially-averaged series for major basins and sub-climatic zones, as well as, the whole of WA presented RMSE, Nash-Sutcliffe efficient, and coefficient of determination (R2) of 11.83 mm, 0.76 and 0.89 respectively. Overall, the results obtained from this study indicate that, sustained increase in water flux, in terms of TWSC, contributed to a resurgence in freshwater reserves in the 21st century over WA from the low levels in the late 20th century
A multivariate analysis approach for the Imaging Atmospheric Cerenkov Telescopes System H.E.S.S
Dubois, F.; Lamanna, G.
2008-12-24
We present a multivariate classification approach applied to the analysis of data from the H.E.S.S. Very High Energy (VHE){gamma}-ray IACT stereoscopic system. This approach combines three complementary analysis methods already successfully applied in the H.E.S.S. data analysis. The proposed approach, with the combined effective estimator X{sub eff}, is conceived to improve the signal-to-background ratio and therefore particularly relevant to the morphological studies of faint extended sources.
Why Do Principals Change Schools? A Multivariate Analysis of Principal Retention
ERIC Educational Resources Information Center
Papa, Frank, Jr.
2007-01-01
This study uses multivariate analysis of a large panel dataset to examine the determinants of principal retention (and, thus, the determinants of attracting a principal away from her current position). The empirical model incorporates measures of a principal's traits and of the organizational structure, culture, and situational context within a…
Dissection of genomic correlation matrices of US Holsteins using multivariate factor analysis
Technology Transfer Automated Retrieval System (TEKTRAN)
Aim of the study was to compare correlation matrices between direct genomic predictions for 31 production, fitness and conformation traits both at genomic and chromosomal level in US Holstein bulls. Multivariate factor analysis was used to quantify basic features of correlation matrices. Factor extr...
Tracking Problem Solving by Multivariate Pattern Analysis and Hidden Markov Model Algorithms
ERIC Educational Resources Information Center
Anderson, John R.
2012-01-01
Multivariate pattern analysis can be combined with Hidden Markov Model algorithms to track the second-by-second thinking as people solve complex problems. Two applications of this methodology are illustrated with a data set taken from children as they interacted with an intelligent tutoring system for algebra. The first "mind reading" application…
The Relevance of Oral Language Skills to Early Literacy: A Multivariate Analysis.
ERIC Educational Resources Information Center
Speece, Deborah L.; Roth, Froma P.; Cooper, David H.
1999-01-01
Examined the relationship between oral language and literacy in a two-year, multivariate design. Through empirical cluster analysis of a sample of 88 kindergarten children, four oral language subtypes were identified based on measures of semantics, syntax, metalinguistics, and oral narration. (Author/VWL)
ERIC Educational Resources Information Center
Martin, James L.
This paper reports on attempts by the author to construct a theoretical framework of adult education participation using a theory development process and the corresponding multivariate statistical techniques. Two problems are identified: the lack of theoretical framework in studying problems, and the limiting of statistical analysis to univariate…
Missing Data and Multiple Imputation in the Context of Multivariate Analysis of Variance
ERIC Educational Resources Information Center
Finch, W. Holmes
2016-01-01
Multivariate analysis of variance (MANOVA) is widely used in educational research to compare means on multiple dependent variables across groups. Researchers faced with the problem of missing data often use multiple imputation of values in place of the missing observations. This study compares the performance of 2 methods for combining p values in…
Web-Based Tools for Modelling and Analysis of Multivariate Data: California Ozone Pollution Activity
ERIC Educational Resources Information Center
Dinov, Ivo D.; Christou, Nicolas
2011-01-01
This article presents a hands-on web-based activity motivated by the relation between human health and ozone pollution in California. This case study is based on multivariate data collected monthly at 20 locations in California between 1980 and 2006. Several strategies and tools for data interrogation and exploratory data analysis, model fitting…
Clark, Neil R.; Szymkiewicz, Maciej; Wang, Zichen; Monteiro, Caroline D.; Jones, Matthew R.; Ma’ayan, Avi
2016-01-01
Gene set analysis of differential expression, which identifies collectively differentially expressed gene sets, has become an important tool for biology. The power of this approach lies in its reduction of the dimensionality of the statistical problem and its incorporation of biological interpretation by construction. Many approaches to gene set analysis have been proposed, but benchmarking their performance in the setting of real biological data is difficult due to the lack of a gold standard. In a previously published work we proposed a geometrical approach to differential expression which performed highly in benchmarking tests and compared well to the most popular methods of differential gene expression. As reported, this approach has a natural extension to gene set analysis which we call Principal Angle Enrichment Analysis (PAEA). PAEA employs dimensionality reduction and a multivariate approach for gene set enrichment analysis. However, the performance of this method has not been assessed nor its implementation as a web-based tool. Here we describe new benchmarking protocols for gene set analysis methods and find that PAEA performs highly. The PAEA method is implemented as a user-friendly web-based tool, which contains 70 gene set libraries and is freely available to the community. PMID:26848405
Extracting tidal frequencies using multivariate harmonic analysis of sea level height time series
NASA Astrophysics Data System (ADS)
Amiri-Simkooei, A. R.; Zaminpardaz, S.; Sharifi, M. A.
2014-10-01
This contribution is seen as a first attempt to extract the tidal frequencies using a multivariate spectral analysis method applied to multiple time series of tide-gauge records. The existing methods are either physics-based in which the ephemeris of Moon, Sun and other planets are used, or are observation-based in which univariate analysis methods—Fourier and wavelet for instance—are applied to tidal observations. The existence of many long tide-gauge records around the world allows one to use tidal observations and extract the main tidal constituents for which efficient multivariate methods are to be developed. This contribution applies the multivariate least-squares harmonic estimation (LS-HE) to the tidal time series of the UK tide-gauge stations. The first 413 harmonics of the tidal constituents and their nonlinear components are provided using the multivariate LS-HE. A few observations of the research are highlighted: (1) the multivariate analysis takes information of multiple time series into account in an optimal least- squares sense, and thus the tidal frequencies have higher detection power compared to the univariate analysis. (2) Dominant tidal frequencies range from the long-term signals to the sixth-diurnal species interval. Higher frequencies have negligible effects. (3) The most important tidal constituents (the first 50 frequencies) ordered from their amplitudes range from 212 cm (M2) to 1 cm (OQ2) for the data set considered. There are signals in this list that are not available in the 145 main tidal frequencies of the literature. (4) Tide predictions using different lists of tidal frequencies on five different data sets around the world are compared. The prediction results using the first significant 50 constituents provided promising results on these locations of the world.
NASA Astrophysics Data System (ADS)
Valotto, Gabrio; Quaranta, Alberto; Cattaruzza, Elti; Gonella, Francesco; Rampazzo, Giancarlo
A multivariate analysis is used for the identification of the spectral features in Ion Beam Induced Luminescence (IBIL) spectra of soda-lime silicate glasses doped with silver by Ag+-Na+ ion exchange. Both Principal Component Analysis and multivariate analysis were used to characterize time-evolving IBIL spectra of Ag-doped glasses, by means of the identification of the number and of the wavelength positions of the main luminescent features and the study of their evolution during irradiation. This method helps to identify the spectral features of the samples spectra, even when partially overlapped or less intense. This analysis procedure does not require additional input such as the number of peaks.
Valotto, Gabrio; Quaranta, Alberto; Cattaruzza, Elti; Gonella, Francesco; Rampazzo, Giancarlo
2012-09-01
A multivariate analysis is used for the identification of the spectral features in Ion Beam Induced Luminescence (IBIL) spectra of soda-lime silicate glasses doped with silver by Ag(+)-Na(+) ion exchange. Both Principal Component Analysis and multivariate analysis were used to characterize time-evolving IBIL spectra of Ag-doped glasses, by means of the identification of the number and of the wavelength positions of the main luminescent features and the study of their evolution during irradiation. This method helps to identify the spectral features of the samples spectra, even when partially overlapped or less intense. This analysis procedure does not require additional input such as the number of peaks. PMID:22571943
Significant drivers of the virtual water trade evaluated with a multivariate regression analysis
NASA Astrophysics Data System (ADS)
Tamea, Stefania; Laio, Francesco; Ridolfi, Luca
2014-05-01
International trade of food is vital for the food security of many countries, which rely on trade to compensate for an agricultural production insufficient to feed the population. At the same time, food trade has implications on the distribution and use of water resources, because through the international trade of food commodities, countries virtually displace the water used for food production, known as "virtual water". Trade thus implies a network of virtual water fluxes from exporting to importing countries, which has been estimated to displace more than 2 billions of m3 of water per year, or about the 2% of the annual global precipitation above land. It is thus important to adequately identify the dynamics and the controlling factors of the virtual water trade in that it supports and enables the world food security. Using the FAOSTAT database of international trade and the virtual water content available from the Water Footprint Network, we reconstructed 25 years (1986-2010) of virtual water fluxes. We then analyzed the dependence of exchanged fluxes on a set of major relevant factors, that includes: population, gross domestic product, arable land, virtual water embedded in agricultural production and dietary consumption, and geographical distance between countries. Significant drivers have been identified by means of a multivariate regression analysis, applied separately to the export and import fluxes of each country; temporal trends are outlined and the relative importance of drivers is assessed by a commonality analysis. Results indicate that population, gross domestic product and geographical distance are the major drivers of virtual water fluxes, with a minor (but non-negligible) contribution given by the agricultural production of exporting countries. Such drivers have become relevant for an increasing number of countries throughout the years, with an increasing variance explained by the distance between countries and a decreasing role of the gross
Multivariate analysis of intracranial pressure (ICP) signal using principal component analysis.
Al-Zubi, N; Momani, L; Al-Kharabsheh, A; Al-Nuaimy, W
2009-01-01
The diagnosis and treatment of hydrocephalus and other neurological disorders often involve the acquisition and analysis of large amount of intracranial pressure (ICP) signal. Although the analysis and subsequent interpretation of this data is an essential part of the clinical management of the disorders, it is typically done manually by a trained clinician, and the difficulty in interpreting some of the features of this complex time series can sometimes lead to issues of subjectivity and reliability. This paper presents a method for the quantitative analysis of this data using a multivariate approach based on principal component analysis, with the aim of optimising symptom diagnosis, patient characterisation and treatment simulation and personalisation. In this method, 10 features are extracted from the ICP signal and principal components that represent these features are defined and analysed. Results from ICP traces of 40 patients show that the chosen features have relevant information about the ICP signal and can be represented with a few components of the PCA (approximately 91% of the total variance of the data is represented by the first four components of the PCA) and that these components can be helpful in characterising subgroups in the patient population that would otherwise not have been apparent. The introduction of supplementaty (non-ICP) variables has offered insight into additional groupings and relationships which may prove to be a fruitful avenue for exploration. PMID:19964826
NASA Astrophysics Data System (ADS)
Salman, A.; Shufan, E.; Lapidot, I.; Tsror, L.; Zeiri, L.; Sahu, R. K.; Moreh, R.; Mordechai, S.; Huleihel, M.
2015-12-01
Fourier Transform Infrared (FTIR) and Raman spectroscopies have emerged as powerful tools for chemical analysis. This is due to their ability to provide detailed information about the spatial distribution of chemical composition at the molecular level. A biological sample, i.e. bacteria or fungi, has a typical spectrum. This spectral fingerprint, characterizes the sample and can therefore be used for differentiating between biology samples which belong to different groups, i.e., several different isolates of a given fungi. When the spectral differences between the groups are minute, multivariate analysis should be used to provide a good differentiation. We hereby review several results which demonstrate the differentiation success obtained by combining spectroscopy measurements and multivariate analysis.
Analysis of worldwide earthquake mortality using multivariate demographic and seismic data.
Gutiérrez, E; Taucer, F; De Groeve, T; Al-Khudhairy, D H A; Zaldivar, J M
2005-06-15
In this paper, mortality in the immediate aftermath of an earthquake is studied on a worldwide scale using multivariate analysis. A statistical method is presented that analyzes reported earthquake fatalities as a function of a heterogeneous set of parameters selected on the basis of their presumed influence on earthquake mortality. The ensemble was compiled from demographic, seismic, and reported fatality data culled from available records of past earthquakes organized in a geographic information system. The authors consider the statistical relation between earthquake mortality and the available data ensemble, analyze the validity of the results in view of the parametric uncertainties, and propose a multivariate mortality analysis prediction method. The analysis reveals that, although the highest mortality rates are expected in poorly developed rural areas, high fatality counts can result from a wide range of mortality ratios that depend on the effective population size. PMID:15937024
NASA Astrophysics Data System (ADS)
Darvishzadeh, R.; Skidmore, A. K.; Mirzaie, M.; Atzberger, C.; Schlerf, M.
2014-12-01
Accurate estimation of grassland biomass at their peak productivity can provide crucial information regarding the functioning and productivity of the rangelands. Hyperspectral remote sensing has proved to be valuable for estimation of vegetation biophysical parameters such as biomass using different statistical techniques. However, in statistical analysis of hyperspectral data, multicollinearity is a common problem due to large amount of correlated hyper-spectral reflectance measurements. The aim of this study was to examine the prospect of above ground biomass estimation in a heterogeneous Mediterranean rangeland employing multivariate calibration methods. Canopy spectral measurements were made in the field using a GER 3700 spectroradiometer, along with concomitant in situ measurements of above ground biomass for 170 sample plots. Multivariate calibrations including partial least squares regression (PLSR), principal component regression (PCR), and Least-Squared Support Vector Machine (LS-SVM) were used to estimate the above ground biomass. The prediction accuracy of the multivariate calibration methods were assessed using cross validated R2 and RMSE. The best model performance was obtained using LS_SVM and then PLSR both calibrated with first derivative reflectance dataset with R2cv = 0.88 & 0.86 and RMSEcv= 1.15 & 1.07 respectively. The weakest prediction accuracy was appeared when PCR were used (R2cv = 0.31 and RMSEcv= 2.48). The obtained results highlight the importance of multivariate calibration methods for biomass estimation when hyperspectral data are used.
Rosa, Maria J.; Mehta, Mitul A.; Pich, Emilio M.; Risterucci, Celine; Zelaya, Fernando; Reinders, Antje A. T. S.; Williams, Steve C. R.; Dazzan, Paola; Doyle, Orla M.; Marquand, Andre F.
2015-01-01
An increasing number of neuroimaging studies are based on either combining more than one data modality (inter-modal) or combining more than one measurement from the same modality (intra-modal). To date, most intra-modal studies using multivariate statistics have focused on differences between datasets, for instance relying on classifiers to differentiate between effects in the data. However, to fully characterize these effects, multivariate methods able to measure similarities between datasets are needed. One classical technique for estimating the relationship between two datasets is canonical correlation analysis (CCA). However, in the context of high-dimensional data the application of CCA is extremely challenging. A recent extension of CCA, sparse CCA (SCCA), overcomes this limitation, by regularizing the model parameters while yielding a sparse solution. In this work, we modify SCCA with the aim of facilitating its application to high-dimensional neuroimaging data and finding meaningful multivariate image-to-image correspondences in intra-modal studies. In particular, we show how the optimal subset of variables can be estimated independently and we look at the information encoded in more than one set of SCCA transformations. We illustrate our framework using Arterial Spin Labeling data to investigate multivariate similarities between the effects of two antipsychotic drugs on cerebral blood flow. PMID:26528117
Rosa, Maria J; Mehta, Mitul A; Pich, Emilio M; Risterucci, Celine; Zelaya, Fernando; Reinders, Antje A T S; Williams, Steve C R; Dazzan, Paola; Doyle, Orla M; Marquand, Andre F
2015-01-01
An increasing number of neuroimaging studies are based on either combining more than one data modality (inter-modal) or combining more than one measurement from the same modality (intra-modal). To date, most intra-modal studies using multivariate statistics have focused on differences between datasets, for instance relying on classifiers to differentiate between effects in the data. However, to fully characterize these effects, multivariate methods able to measure similarities between datasets are needed. One classical technique for estimating the relationship between two datasets is canonical correlation analysis (CCA). However, in the context of high-dimensional data the application of CCA is extremely challenging. A recent extension of CCA, sparse CCA (SCCA), overcomes this limitation, by regularizing the model parameters while yielding a sparse solution. In this work, we modify SCCA with the aim of facilitating its application to high-dimensional neuroimaging data and finding meaningful multivariate image-to-image correspondences in intra-modal studies. In particular, we show how the optimal subset of variables can be estimated independently and we look at the information encoded in more than one set of SCCA transformations. We illustrate our framework using Arterial Spin Labeling data to investigate multivariate similarities between the effects of two antipsychotic drugs on cerebral blood flow. PMID:26528117
Reichardt, Thomas A.; Timlin, Jerilyn Ann; Jones, Howland D. T.; Sickafoose, Shane M.; Schmitt, Randal L.
2010-09-01
Laser-induced fluorescence measurements of cuvette-contained laser dye mixtures are made for evaluation of multivariate analysis techniques to optically thick environments. Nine mixtures of Coumarin 500 and Rhodamine 610 are analyzed, as well as the pure dyes. For each sample, the cuvette is positioned on a two-axis translation stage to allow the interrogation at different spatial locations, allowing the examination of both primary (absorption of the laser light) and secondary (absorption of the fluorescence) inner filter effects. In addition to these expected inner filter effects, we find evidence that a portion of the absorbed fluorescence is re-emitted. A total of 688 spectra are acquired for the evaluation of multivariate analysis approaches to account for nonlinear effects.
Chemical structure of wood charcoal by infrared spectroscopy and multivariate analysis.
Labbé, Nicole; Harper, David; Rials, Timothy; Elder, Thomas
2006-05-17
In this work, the effect of temperature on charcoal structure and chemical composition is investigated for four tree species. Wood charcoal carbonized at various temperatures is analyzed by mid infrared spectroscopy coupled with multivariate analysis and by thermogravimetric analysis to characterize the chemical composition during the carbonization process. The multivariate models of charcoal were able to distinguish between species and wood thermal treatments, revealing that the characteristics of the wood charcoal depend not only on the wood species, but also on the carbonization temperature. This work demonstrates the potential of mid infrared spectroscopy in the whiskey industry, from the identification and classification of the wood species for the mellowing process to the chemical characterization of the barrels after the toasting and charring process. PMID:19127715
A Signal Transmission Technique for Stability Analysis of Multivariable Non-Linear Control Systems
NASA Technical Reports Server (NTRS)
Jackson, Mark; Zimpfer, Doug; Adams, Neil; Lindsey, K. L. (Technical Monitor)
2000-01-01
Among the difficulties associated with multivariable, non-linear control systems is the problem of assessing closed-loop stability. Of particular interest is the class of non-linear systems controlled with on/off actuators, such as spacecraft thrusters or electrical relays. With such systems, standard describing function techniques are typically too conservative, and time-domain simulation analysis is prohibitively extensive, This paper presents an open-loop analysis technique for this class of non-linear systems. The technique is centered around an innovative use of multivariable signal transmission theory to quantify the plant response to worst case control commands. The technique has been applied to assess stability of thruster controlled flexible space structures. Examples are provided for Space Shuttle attitude control with attached flexible payloads.
[Characterization of flowability of pharmaceutical powders based on multivariate analysis method].
Du, Yan; Zhao, Li-Jie; Xiong, Yao-Kun; Li, Xiao-Hai; Wang, Song-Tao; Feng, Yi; Xu, De-Sheng
2012-09-01
The main methods of characterizing the flowability of pharmaceutical powders include repose angle method, HR method, Carr's index method, Jenike flow function method, fractal dimension method, and mass flow rate method, etc. Regarding powders with different flowabilities as the research subject, comprehensive features of pharmaceutical materials were investigated and characterized. The multivariate analysis method was employed to evaluate and analyze flowability values of the tested pharmaceutical materials. Comparing with the method of the mass flow rate, it was feasible to use multivariate analysis method to evaluate the flowability of powders. Simultaneously, the flowability of pharmaceutical materials could be ranked and definitely quantified, and critical values be determined according to the actual production, which has promoted the previous methods dependent only on the single parameter, i.e. repose angle and compression degree methods. A relatively objective standard method of evaluating flowability of powders is formed. PMID:23227556
Structural analysis and design of multivariable control systems: An algebraic approach
NASA Technical Reports Server (NTRS)
Tsay, Yih Tsong; Shieh, Leang-San; Barnett, Stephen
1988-01-01
The application of algebraic system theory to the design of controllers for multivariable (MV) systems is explored analytically using an approach based on state-space representations and matrix-fraction descriptions. Chapters are devoted to characteristic lambda matrices and canonical descriptions of MIMO systems; spectral analysis, divisors, and spectral factors of nonsingular lambda matrices; feedback control of MV systems; and structural decomposition theories and their application to MV control systems.
Adjusting for matching and covariates in linear discriminant analysis
Asafu-Adjei, Josephine K.; Sampson, Allan R.; Sweet, Robert A.; Lewis, David A.
2013-01-01
In studies that compare several diagnostic or treatment groups, subjects may not only be measured on a certain set of feature variables, but also be matched on a number of demographic characteristics and measured on additional covariates. Linear discriminant analysis (LDA) is sometimes used to identify which feature variables best discriminate among groups, while accounting for the dependencies among the feature variables. We present a new approach to LDA for multivariate normal data that accounts for the subject matching used in a particular study design, as well as covariates not used in the matching. Applications are given for post-mortem tissue data with the aim of comparing neurobiological characteristics of subjects with schizophrenia with those of normal controls, and for a post-mortem tissue primate study comparing brain biomarker measurements across three treatment groups. We also investigate the performance of our approach using a simulation study. PMID:23640791
Spectral compression algorithms for the analysis of very large multivariate images
Keenan, Michael R.
2007-10-16
A method for spectrally compressing data sets enables the efficient analysis of very large multivariate images. The spectral compression algorithm uses a factored representation of the data that can be obtained from Principal Components Analysis or other factorization technique. Furthermore, a block algorithm can be used for performing common operations more efficiently. An image analysis can be performed on the factored representation of the data, using only the most significant factors. The spectral compression algorithm can be combined with a spatial compression algorithm to provide further computational efficiencies.
Enhancing e-waste estimates: Improving data quality by multivariate Input–Output Analysis
Wang, Feng; Huisman, Jaco; Stevels, Ab; Baldé, Cornelis Peter
2013-11-15
Highlights: • A multivariate Input–Output Analysis method for e-waste estimates is proposed. • Applying multivariate analysis to consolidate data can enhance e-waste estimates. • We examine the influence of model selection and data quality on e-waste estimates. • Datasets of all e-waste related variables in a Dutch case study have been provided. • Accurate modeling of time-variant lifespan distributions is critical for estimate. - Abstract: Waste electrical and electronic equipment (or e-waste) is one of the fastest growing waste streams, which encompasses a wide and increasing spectrum of products. Accurate estimation of e-waste generation is difficult, mainly due to lack of high quality data referred to market and socio-economic dynamics. This paper addresses how to enhance e-waste estimates by providing techniques to increase data quality. An advanced, flexible and multivariate Input–Output Analysis (IOA) method is proposed. It links all three pillars in IOA (product sales, stock and lifespan profiles) to construct mathematical relationships between various data points. By applying this method, the data consolidation steps can generate more accurate time-series datasets from available data pool. This can consequently increase the reliability of e-waste estimates compared to the approach without data processing. A case study in the Netherlands is used to apply the advanced IOA model. As a result, for the first time ever, complete datasets of all three variables for estimating all types of e-waste have been obtained. The result of this study also demonstrates significant disparity between various estimation models, arising from the use of data under different conditions. It shows the importance of applying multivariate approach and multiple sources to improve data quality for modelling, specifically using appropriate time-varying lifespan parameters. Following the case study, a roadmap with a procedural guideline is provided to enhance e
Westman, Eric; Aguilar, Carlos; Muehlboeck, J-Sebastian; Simmons, Andrew
2013-01-01
Automated structural magnetic resonance imaging (MRI) processing pipelines are gaining popularity for Alzheimer's disease (AD) research. They generate regional volumes, cortical thickness measures and other measures, which can be used as input for multivariate analysis. It is not clear which combination of measures and normalization approach are most useful for AD classification and to predict mild cognitive impairment (MCI) conversion. The current study includes MRI scans from 699 subjects [AD, MCI and controls (CTL)] from the Alzheimer's disease Neuroimaging Initiative (ADNI). The Freesurfer pipeline was used to generate regional volume, cortical thickness, gray matter volume, surface area, mean curvature, gaussian curvature, folding index and curvature index measures. 259 variables were used for orthogonal partial least square to latent structures (OPLS) multivariate analysis. Normalisation approaches were explored and the optimal combination of measures determined. Results indicate that cortical thickness measures should not be normalized, while volumes should probably be normalized by intracranial volume (ICV). Combining regional cortical thickness measures (not normalized) with cortical and subcortical volumes (normalized with ICV) using OPLS gave a prediction accuracy of 91.5 % when distinguishing AD versus CTL. This model prospectively predicted future decline from MCI to AD with 75.9 % of converters correctly classified. Normalization strategy did not have a significant effect on the accuracies of multivariate models containing multiple MRI measures for this large dataset. The appropriate choice of input for multivariate analysis in AD and MCI is of great importance. The results support the use of un-normalised cortical thickness measures and volumes normalised by ICV. PMID:22890700
Yang, Yong-Hui; Zhou, Feng; Guo, Huai-Cheng; Sheng, Hu; Liu, Hui; Dao, Xu; He, Cheng-Jie
2010-11-01
Various multivariate statistical methods including cluster analysis (CA), discriminant analysis (DA), factor analysis (FA), and principal component analysis (PCA) were used to explain the spatial and temporal patterns of surface water pollution in Lake Dianchi. The dataset, obtained during the period 2003-2007 from the Kunming Environmental Monitoring Center, consisted of 12 variables surveyed monthly at eight sites. The CA grouped the 12 months into two groups, August-September and the remainder, and divided the lake into two regions based on their different physicochemical properties and pollution levels. The DA showed the best results for data reduction and pattern recognition in both temporal and spatial analysis. It calculated four parameters (TEMP, pH, CODMn, and Chl-a) to 85.4% correct assignment in the temporal analysis and three parameters (BOD, NH₄+-N, and TN) to almost 71.7% correct assignment in spatial analysis of the two clusters. The FA/PCA applied to datasets of two special clusters of the lake calculated four factors for each region, capturing 72.5% and 62.5% of the total variance, respectively. Strong loadings included DO, BOD, TN, CODCr, CODMn, NH₄+-N, TP, and EC. In addition, box-whisker plots and GIS further facilitated and supported the multivariate analysis results. PMID:19936953
Craniometrical estimation of the native Japanese Mishima cattle, using multivariate analysis.
Ogawa, Y; Daigo, M; Amasaki, H
1989-01-01
The present study on measurement of the skull of Mishima cattle, which has been postulated as the only pure representative breed of native Japanese cattle, was performed using craniometrical multivariate analysis. The data of the skull of Mishima cattle was compared with 17 breeds of cattle, i.e. Korean cattle (Hamhung, Pyongyang, Chinju Suwon, and Kwangju), Mongolian cattle, Hainan Tao cattle, northeastern Chinese cattle (Shuangliao, Shenyang, Tongliao, Lüta, and Chilin), Astatic Water Buffalo, Yak, Bos Banteng, American Bison, and Holstein-Friesian. The Mishima cattle was included in the group of Korean breeds, especially it was closed on the group of Pyongyang and Chinju breeds. The distance on the craniometrical multivariate analyzing co-ordinate between Mishima cattle and Hainan Tao breed of Zebu cattle was larger than the distance between Mishima cattle and Korean breeds. While result, as a above the present study was very important for the origin of "Wagyu" (native Japanese cattle). Since the northern route theory of the origin of Mishima cattle has been reported on the type of serum enzymes and hemotypes. It was suggested that the craniometrical multivariate analysis supported to the northern route theory of the origin of Mishima cattle. PMID:2764276
NASA Astrophysics Data System (ADS)
Huang, Zufang; Lin, Jinyong; Cao, Gang; Chen, Xiwen; Li, Yongzeng; Feng, Shangyuan; Lin, Juqiang; Wang, Jing; Lin, Hongxin; Chen, Rong
2014-09-01
Molecular characterization of semen that can be used to provide an objective diagnosis of semen quality is still lacking. Raman spectroscopy measures vibrational modes of molecules, thus can be utilized to characterize biological fluids. Here, we employed Raman spectroscopy to characterize and compare normal and abnormal semen samples in the fingerprint region (400-1800cm-1). Multivariate analysis methods including principal component analysis (PCA) and partial least square-discriminant analysis (PLS-DA) were used for spectral analysis to differentiate between normal and abnormal semen samples. Compared with PCA-LDA analysis, PLS-DA improved the diagnostic results, showing a sensitivity of 77% and specificity of 73%. Furthermore, our preliminary quantitative analysis based on PLS algorithm demonstrated that spermatozoa concentration were relatively well predicted (R2=0.825). In conclusion, this study demonstrated that micro-Raman spectroscopy combined with multivariate methods can provide as a new diagnostic technique for semen analysis and differentiation between normal and abnormal semen samples.
NASA Astrophysics Data System (ADS)
Grujic, O.; Caers, J.
2014-12-01
Modern approaches to uncertainty quantification in the subsurface rely on complex procedures of geological modeling combined with numerical simulation of flow & transport. This approach requires long computational times rendering any full Monte Carlo simulation infeasible, in particular solving the flow & transport problem takes hours of computing time in real field problems. This motivated the development of model selection methods aiming to identify a small subset of models that capture important statistics of a larger ensemble of geological model realization. A recent method, based on model selection in metric space, termed distance-kernel method (DKM) allows selecting representative models though kernel k-medoid clustering. The distance defining the metric space is usually based on some approximate flow model. However, the output of an approximate flow model can be multi-variate (reporting heads/pressures, saturation, rates). In addition, the modeler may have information from several other approximate models (e.g. upscaled models) or summary statistical information about geological heterogeneity that could allow for a more accurate selection. In an effort to perform model selection based on multivariate attributes, we rely on functional data analysis which allows for an exploitation of covariances between time-varying multivariate numerical simulation output. Based on mixed functional principal component analysis, we construct a lower dimensional space in which kernel k-medoid clustering is used for model selection. In this work we demonstrate the functional approach on a complex compositional flow problem where the geological uncertainty consists of channels with uncertain spatial distribution of facies, proportions, orientations and geometries. We illustrate that using multivariate attributes and multiple approximate models provides accuracy improvement over using a single attribute.
Kambeitz, Joseph; Kambeitz-Ilankovic, Lana; Leucht, Stefan; Wood, Stephen; Davatzikos, Christos; Malchow, Berend; Falkai, Peter; Koutsouleris, Nikolaos
2015-06-01
Multivariate pattern recognition approaches have recently facilitated the search for reliable neuroimaging-based biomarkers in psychiatric disorders such as schizophrenia. By taking into account the multivariate nature of brain functional and structural changes as well as their distributed localization across the whole brain, they overcome drawbacks of traditional univariate approaches. To evaluate the overall reliability of neuroimaging-based biomarkers, we conducted a comprehensive literature search to identify all studies that used multivariate pattern recognition to identify patterns of brain alterations that differentiate patients with schizophrenia from healthy controls. A bivariate random-effects meta-analytic model was implemented to investigate the sensitivity and specificity across studies as well as to assess the robustness to potentially confounding variables. In the total sample of n=38 studies (1602 patients and 1637 healthy controls), patients were differentiated from controls with a sensitivity of 80.3% (95% CI: 76.7-83.5%) and a specificity of 80.3% (95% CI: 76.9-83.3%). Analysis of neuroimaging modality indicated higher sensitivity (84.46%, 95% CI: 79.9-88.2%) and similar specificity (76.9%, 95% CI: 71.3-81.6%) of rsfMRI studies as compared with structural MRI studies (sensitivity: 76.4%, 95% CI: 71.9-80.4%, specificity of 79.0%, 95% CI: 74.6-82.8%). Moderator analysis identified significant effects of age (p=0.029), imaging modality (p=0.019), and disease stage (p=0.025) on sensitivity as well as of positive-to-negative symptom ratio (p=0.022) and antipsychotic medication (p=0.016) on specificity. Our results underline the utility of multivariate pattern recognition approaches for the identification of reliable neuroimaging-based biomarkers. Despite the clinical heterogeneity of the schizophrenia phenotype, brain functional and structural alterations differentiate schizophrenic patients from healthy controls with 80% sensitivity and specificity
Spatial compression algorithm for the analysis of very large multivariate images
Keenan, Michael R.
2008-07-15
A method for spatially compressing data sets enables the efficient analysis of very large multivariate images. The spatial compression algorithms use a wavelet transformation to map an image into a compressed image containing a smaller number of pixels that retain the original image's information content. Image analysis can then be performed on a compressed data matrix consisting of a reduced number of significant wavelet coefficients. Furthermore, a block algorithm can be used for performing common operations more efficiently. The spatial compression algorithms can be combined with spectral compression algorithms to provide further computational efficiencies.
Identification of Unknown Substances by Terahertz Spectroscopy and Multivariate Data Analysis
NASA Astrophysics Data System (ADS)
Pohl, Andreas; Deßmann, Nils; Dutzi, Katja; Hübers, Heinz-Wilhelm
2016-02-01
The identification of various substances by multivariate data analysis of terahertz transmittance spectra is demonstrated. Transmittance spectra were obtained by the use of a Fourier transform infrared spectrometer. By means of principal component analysis and partial least squares regression, the spectral data were analyzed in order to identify substances and mixtures of several substances. With only three principal components, detection and identification of substances are possible with high accuracy. Using these methods, concentration ratios of substances in mixtures of two substances can be determined with an accuracy of 10 %. It is shown that the method is robust against disturbances in the spectra such as standing waves. This is particularly important for practical applications.
NASA Technical Reports Server (NTRS)
Kriegler, F. J.; Gordon, M. F.; Mclaughlin, R. H.; Marshall, R. E.
1975-01-01
The MIDAS (Multivariate Interactive Digital Analysis System) processor is a high-speed processor designed to process multispectral scanner data (from Landsat, EOS, aircraft, etc.) quickly and cost-effectively to meet the requirements of users of remote sensor data, especially from very large areas. MIDAS consists of a fast multipipeline preprocessor and classifier, an interactive color display and color printer, and a medium scale computer system for analysis and control. The system is designed to process data having as many as 16 spectral bands per picture element at rates of 200,000 picture elements per second into as many as 17 classes using a maximum likelihood decision rule.
Severe pneumonia in the elderly: a multivariate analysis of risk factors
Li, Wei; Ding, Cheng; Yin, Shaojun
2015-01-01
Pneumonia is the second leading reason for hospitalization of medicare beneficiaries. The mortality rate is high, especially in the elderly. In this study, we aimed to determine the risk factors associated with severe pneumonia in the elderly. Retrospective study was conducted and data of old patients with severe pneumonia were collected. They were divided into two groups: the experiment group (death group) and the control (living group). The general situation, underlying diseases, laboratory tests, types of etiology, imaging analysis and treatment situation of patients were analyzed and compared. Univariate analysis and logistic multivariate regression analysis were used to screen the related and independent risk factors for the diagnosis of severe pneumonia in the elderly. In univariate analysis, there were many factors had statistical significance including chronic kidney disease, electrolyte disturbance, low phosphorus and so on. Result of logistic multivariate regression analysis showed pro-BNP level and serum prealbumin were independent risk factors. In sputum culture, the relevance ratio of acinetobacter baumannii was the highest in gram negative bacteria followed by klebsiella pneumoniae. In gram positive bacteria, the relevance ratio of staphylococcus aureus was the highest. In conclusion, the analysis on risk factors for severe pneumonia has great clinical significance on improving the prognosis. PMID:26550157
NASA Astrophysics Data System (ADS)
Lee, An-Sheng; Lu, Wei-Li; Huang, Jyh-Jaan; Chang, Queenie; Wei, Kuo-Yen; Lin, Chin-Jung; Liou, Sofia Ya Hsuan
2016-04-01
Through the geology and climate characteristic in Taiwan, generally rivers carry a lot of suspended particles. After these particles settled, they become sediments which are good sorbent for heavy metals in river system. Consequently, sediments can be found recording contamination footprint at low flow energy region, such as estuary. Seven sediment cores were collected along Nankan River, northern Taiwan, which is seriously contaminated by factory, household and agriculture input. Physico-chemical properties of these cores were derived from Itrax-XRF Core Scanner and grain size analysis. In order to interpret these complex data matrices, the multivariate statistical techniques (cluster analysis, factor analysis and discriminant analysis) were introduced to this study. Through the statistical determination, the result indicates four types of sediment. One of them represents contamination event which shows high concentration of Cu, Zn, Pb, Ni and Fe, and low concentration of Si and Zr. Furthermore, three possible contamination sources of this type of sediment were revealed by Factor Analysis. The combination of sediment analysis and multivariate statistical techniques used provides new insights into the contamination depositional history of Nankan River and could be similarly applied to other river systems to determine the scale of anthropogenic contamination.
NASA Astrophysics Data System (ADS)
Wang, W.; Ayhan, B.; Kwan, C.; Qi, H.; Vance, S.
2014-03-01
Compositional analysis is important to interrogate spectral samples for direct analysis of materials in agriculture, environment and archaeology, etc. In this paper, multi-variate analysis (MVA) techniques are coupled with laser induced breakdown spectroscopy (LIBS) to estimate quantitative elemental compositions and determine the type of the sample. In particular, we present a new multivariate analysis method for composition analysis, referred to as "spectral unmixing". The LIBS spectrum of a testing sample is considered as a linear mixture with more than one constituent signatures that correspond to various chemical elements. The signature library is derived from regression analysis using training samples or is manually set up with the information from an elemental LIBS spectral database. A calibration step is used to make all the signatures in library to be homogeneous with the testing sample so as to avoid inhomogeneous signatures that might be caused by different sampling conditions. To demonstrate the feasibility of the proposed method, we compare it with the traditional partial least squares (PLS) method and the univariate method using a standard soil data set with elemental concentration measured a priori. The experimental results show that the proposed method holds great potential for reliable and effective elemental concentration estimation.
Severe pneumonia in the elderly: a multivariate analysis of risk factors.
Li, Wei; Ding, Cheng; Yin, Shaojun
2015-01-01
Pneumonia is the second leading reason for hospitalization of medicare beneficiaries. The mortality rate is high, especially in the elderly. In this study, we aimed to determine the risk factors associated with severe pneumonia in the elderly. Retrospective study was conducted and data of old patients with severe pneumonia were collected. They were divided into two groups: the experiment group (death group) and the control (living group). The general situation, underlying diseases, laboratory tests, types of etiology, imaging analysis and treatment situation of patients were analyzed and compared. Univariate analysis and logistic multivariate regression analysis were used to screen the related and independent risk factors for the diagnosis of severe pneumonia in the elderly. In univariate analysis, there were many factors had statistical significance including chronic kidney disease, electrolyte disturbance, low phosphorus and so on. Result of logistic multivariate regression analysis showed pro-BNP level and serum prealbumin were independent risk factors. In sputum culture, the relevance ratio of acinetobacter baumannii was the highest in gram negative bacteria followed by klebsiella pneumoniae. In gram positive bacteria, the relevance ratio of staphylococcus aureus was the highest. In conclusion, the analysis on risk factors for severe pneumonia has great clinical significance on improving the prognosis. PMID:26550157
Wavelet analysis of electric adjustable speed drive waveforms
Czarkowski, D.; Domijan, A. Jr.
1998-10-01
The three most common adjustable speed drives (ASDs) used in HVAC equipment, namely, pulse-width modulated (PWM) induction drive, brushless-dc drive, and switched-reluctance drive, generate non-periodic and nonstationary electric waveforms with sharp edges and transients. Deficiencies of Fourier transform methods in analysis of such ASD waveforms prompted an application of the wavelet transform. Results of discrete wavelet transform (DWT) analysis of PWM inverter-fed motor waveforms are presented. The best mother wavelet for analysis of the recorded waveforms is selected. Data compression properties of the selected mother wavelet are compared to those of the fast Fourier transform (FFT). Multilevel feature detection of ASD waveforms using the DWT is shown.
Verbyla, Arūnas P; Cullis, Brian R
2012-09-01
A major aim in some plant-based studies is the determination of quantitative trait loci (QTL) for multiple traits or across multiple environments. Understanding these QTL by trait or QTL by environment interactions can be of great value to the plant breeder. A whole genome approach for the analysis of QTL is presented for such multivariate applications. The approach is an extension of whole genome average interval mapping in which all intervals on a linkage map are included in the analysis simultaneously. A random effects working model is proposed for the multivariate (trait or environment) QTL effects for each interval, with a variance-covariance matrix linking the variates in a particular interval. The significance of the variance-covariance matrix for the QTL effects is tested and if significant, an outlier detection technique is used to select a putative QTL. This QTL by variate interaction is transferred to the fixed effects. The process is repeated until the variance-covariance matrix for QTL random effects is not significant; at this point all putative QTL have been selected. Unlinked markers can also be included in the analysis. A simulation study was conducted to examine the performance of the approach and demonstrated the multivariate approach results in increased power for detecting QTL in comparison to univariate methods. The approach is illustrated for data arising from experiments involving two doubled haploid populations. The first involves analysis of two wheat traits, α-amylase activity and height, while the second is concerned with a multi-environment trial for extensibility of flour dough. The method provides an approach for multi-trait and multi-environment QTL analysis in the presence of non-genetic sources of variation. PMID:22692445
Badran, M; Morsy, R; Soliman, H; Elnimr, T
2016-01-01
The trace elements metabolism has been reported to possess specific roles in the pathogenesis and progress of diabetes mellitus. Due to the continuous increase in the population of patients with Type 2 diabetes (T2D), this study aims to assess the levels and inter-relationships of fast blood glucose (FBG) and serum trace elements in Type 2 diabetic patients. This study was conducted on 40 Egyptian Type 2 diabetic patients and 36 healthy volunteers (Hospital of Tanta University, Tanta, Egypt). The blood serum was digested and then used to determine the levels of 24 trace elements using an inductive coupled plasma mass spectroscopy (ICP-MS). Multivariate statistical analysis depended on correlation coefficient, cluster analysis (CA) and principal component analysis (PCA), were used to analysis the data. The results exhibited significant changes in FBG and eight of trace elements, Zn, Cu, Se, Fe, Mn, Cr, Mg, and As, levels in the blood serum of Type 2 diabetic patients relative to those of healthy controls. The statistical analyses using multivariate statistical techniques were obvious in the reduction of the experimental variables, and grouping the trace elements in patients into three clusters. The application of PCA revealed a distinct difference in associations of trace elements and their clustering patterns in control and patients group in particular for Mg, Fe, Cu, and Zn that appeared to be the most crucial factors which related with Type 2 diabetes. Therefore, on the basis of this study, the contributors of trace elements content in Type 2 diabetic patients can be determine and specify with correlation relationship and multivariate statistical analysis, which confirm that the alteration of some essential trace metals may play a role in the development of diabetes mellitus. PMID:26653752
Marengo, Emilio; Gennaro, Maria Carla; Gianotti, Valentina; Robotti, Elisa
2003-01-01
The control and monitoring of an industrial process is performed in this paper by the multivariate control charts. The process analysed consists of the bottling of the entire production of 1999 of the sparkling wine "Asti Spumante". This process is characterised by a great number of variables that can be treated with multivariate techniques. The monitoring of the process performed with classical Shewhart charts is very dangerous because they do not take into account the presence of functional relationships between the variables. The industrial process was firstly analysed by multivariate control charts based on Principal Component Analysis. This approach allowed the identification of problems in the process and of their causes. Successively, the SMART Charts (Simultaneous Scores Monitoring And Residual Tracking) were built in order to study the process in its whole. In spite of the successful identification of the presence of problems in the monitored process, the Smart chart did not allow an easy identification of the special causes of variation which casued the problems themselves. PMID:12911145
Baty, Florent; Jaeger, Daniel; Preiswerk, Frank; Schumacher, Martin M; Brutsche, Martin H
2008-01-01
Background Multivariate ordination methods are powerful tools for the exploration of complex data structures present in microarray data. These methods have several advantages compared to common gene-by-gene approaches. However, due to their exploratory nature, multivariate ordination methods do not allow direct statistical testing of the stability of genes. Results In this study, we developed a computationally efficient algorithm for: i) the assessment of the significance of gene contributions and ii) the identification of sample outliers in multivariate analysis of microarray data. The approach is based on the use of resampling methods including bootstrapping and jackknifing. A statistical package of R functions was developed. This package includes tools for both inferring the statistical significance of gene contributions and identifying outliers among samples. Conclusion The methodology was successfully applied to three published data sets with varying levels of signal intensities. Its relevance was compared with alternative methods. Overall, it proved to be particularly effective for the evaluation of the stability of microarray data. PMID:18570644
Li, Xiaoli; Xie, Chuanqi; He, Yong; Qiu, Zhengjun; Zhang, Yanchao
2012-01-01
Effects of the moisture content (MC) of tea on diffuse reflectance spectroscopy were investigated by integrated wavelet transform and multivariate analysis. A total of 738 representative samples, including fresh tea leaves, manufactured tea and partially processed tea were collected for spectral measurement in the 325–1,075 nm range with a field portable spectroradiometer. Then wavelet transform (WT) and multivariate analysis were adopted for quantitative determination of the relationship between MC and spectral data. Three feature extraction methods including WT, principal component analysis (PCA) and kernel principal component analysis (KPCA) were used to explore the internal structure of spectral data. Comparison of those three methods indicated that the variables generated by WT could efficiently discover structural information of spectral data. Calibration involving seeking the relationship between MC and spectral data was executed by using regression analysis, including partial least squares regression, multiple linear regression and least square support vector machine. Results showed that there was a significant correlation between MC and spectral data (r = 0.991, RMSEP = 0.034). Moreover, the effective wavelengths for MC measurement were detected at range of 888–1,007 nm by wavelet transform. The results indicated that the diffuse reflectance spectroscopy of tea is highly correlated with MC. PMID:23012574
Hakimzadeh, Neda; Parastar, Hadi; Fattahi, Mohammad
2014-01-24
In this study, multivariate curve resolution (MCR) and multivariate classification methods are proposed to develop a new chemometric strategy for comprehensive analysis of high-performance liquid chromatography-diode array absorbance detection (HPLC-DAD) fingerprints of sixty Salvia reuterana samples from five different geographical regions. Different chromatographic problems occurred during HPLC-DAD analysis of S. reuterana samples, such as baseline/background contribution and noise, low signal-to-noise ratio (S/N), asymmetric peaks, elution time shifts, and peak overlap are handled using the proposed strategy. In this way, chromatographic fingerprints of sixty samples are properly segmented to ten common chromatographic regions using local rank analysis and then, the corresponding segments are column-wise augmented for subsequent MCR analysis. Extended multivariate curve resolution-alternating least squares (MCR-ALS) is used to obtain pure component profiles in each segment. In general, thirty-one chemical components were resolved using MCR-ALS in sixty S. reuterana samples and the lack of fit (LOF) values of MCR-ALS models were below 10.0% in all cases. Pure spectral profiles are considered for identification of chemical components by comparing their resolved spectra with the standard ones and twenty-four components out of thirty-one components were identified. Additionally, pure elution profiles are used to obtain relative concentrations of chemical components in different samples for multivariate classification analysis by principal component analysis (PCA) and k-nearest neighbors (kNN). Inspection of the PCA score plot (explaining 76.1% of variance accounted for three PCs) showed that S. reuterana samples belong to four clusters. The degree of class separation (DCS) which quantifies the distance separating clusters in relation to the scatter within each cluster is calculated for four clusters and it was in the range of 1.6-5.8. These results are then
Factor Analysis of the Modified Sexual Adjustment Questionnaire-Male
Wilmoth, Margaret C.; Hanlon, Alexandra L.; Ng, Lit Soo; Bruner, Debra W.
2015-01-01
Background and Purpose The Sexual Adjustment Questionnaire (SAQ) is used in National Cancer Institute–sponsored clinical trials as an outcome measure for sexual functioning. The tool was revised to meet the needs for a clinically useful, theory-based outcome measure for use in both research and clinical settings. This report describes the modifications and validity testing of the modified Sexual Adjustment Questionnaire-Male (mSAQ-Male). Methods This secondary analysis of data from a large Radiation Therapy Oncology Group trial employed principal axis factor analytic techniques in estimating validity of the revised tool. The sample size was 686; most subjects were White, older than the age 60 years, and with a high school education and a Karnofsky performance scale (KPS) score of greater than 90. Results A 16-item, 3-factor solution resulted from the factor analysis. The mSAQ-Male was also found to be sensitive to changes in physical sexual functioning as measured by the KPS. Conclusion The mSAQ-Male is a valid self-report measure of sexuality that can be used clinically to detect changes in male sexual functioning. PMID:25255676
Risk factors for baclofen pump infection in children: a multivariate analysis.
Spader, Heather S; Bollo, Robert J; Bowers, Christian A; Riva-Cambrin, Jay
2016-06-01
OBJECTIVE Intrathecal baclofen infusion systems to manage severe spasticity and dystonia are associated with higher infection rates in children than in adults. Factors unique to this population, such as poor nutrition and physical limitations for pump placement, have been hypothesized as the reasons for this disparity. The authors assessed potential risk factors for infection in a multivariate analysis. METHODS Patients who underwent implantation of a programmable pump and intrathecal catheter for baclofen infusion at a single center between January 1, 2000, and March 1, 2012, were identified in this retrospective cohort study. The primary end point was infection. Potential risk factors investigated included preoperative (i.e., demographics, body mass index [BMI], gastrostomy tube, tracheostomy, previous spinal fusion), intraoperative (i.e., surgeon, antibiotics, pump size, catheter location), and postoperative (i.e., wound dehiscence, CSF leak, and number of revisions) factors. Univariate analysis was performed, and a multivariate logistic regression model was created to identify independent risk factors for infection. RESULTS A total of 254 patients were evaluated. The overall infection rate was 9.8%. Univariate analysis identified young age, shorter height, lower weight, dehiscence, CSF leak, and number of revisions within 6 months of pump placement as significantly associated with infection. Multivariate analysis identified young age, dehiscence, and number of revisions as independent risk factors for infection. CONCLUSIONS Young age, wound dehiscence, and number of revisions were independent risk factors for infection in this pediatric cohort. A low BMI and the presence of either a gastrostomy or tracheostomy were not associated with infection and may not be contraindications for this procedure. PMID:26919315
Multivariate Classification of Original and Fake Perfumes by Ion Analysis and Ethanol Content.
Gomes, Clêrton L; de Lima, Ari Clecius A; Loiola, Adonay R; da Silva, Abel B R; Cândido, Manuela C L; Nascimento, Ronaldo F
2016-07-01
The increased marketing of fake perfumes has encouraged us to investigate how to identify such products by their chemical characteristics and multivariate analysis. The aim of this study was to present an alternative approach to distinguish original from fake perfumes by means of the investigation of sodium, potassium, chloride ions, and ethanol contents by chemometric tools. For this, 50 perfumes were used (25 original and 25 counterfeit) for the analysis of ions (ion chromatography) and ethanol (gas chromatography). The results demonstrated that the fake perfume had low levels of ethanol and high levels of chloride compared to the original product. The data were treated by chemometric tools such as principal component analysis and linear discriminant analysis. This study proved that the analysis of ethanol is an effective method of distinguishing original from the fake products, and it may potentially be used to assist legal authorities in such cases. PMID:27364290
A Multivariate Logit Analysis of the Outcomes of Arbitration Decisions in British Columbia.
ERIC Educational Resources Information Center
Holmes, Richard A.; And Others
1990-01-01
In this analysis of 1,815 arbitration cases in British Columbia in 1982-85, bias is identified in the bivariate estimates of the probability of management wins by industry and issue. Bivariate estimate errors in these cases can result from failure of estimates to adjust for the effects of omitted variables. (TJH)
Microenvironmental and biological/personal monitoring information were collected during the National Human Exposure Assessment Survey (NHEXAS), conducted in the six states comprising U.S. EPA Region Five. They have been analyzed by multivariate analysis techniques with general ...
Multivariate genetic analysis of brain structure in an extended twin design.
Posthuma, D; de Geus, E J; Neale, M C; Hulshoff Pol, H E; Baaré WEC; Kahn, R S; Boomsma, D
2000-07-01
The hunt for genes influencing behavior may be aided by the study of intermediate phenotypes for several reasons. First, intermediate phenotypes may be influenced by only a few genes, which facilitates their detection. Second, many intermediate phenotypes can be measured on a continuous quantitative scale and thus can be assessed in affected and unaffected individuals. Continuous measures increase the statistical power to detect genetic effects (Neale et al., 1994), and allow studies to be designed to collect data from informative subjects such as extreme concordant or discordant pairs. Intermediate phenotypes for discrete traits, such as psychiatric disorders, can be neurotransmitter levels, brain function, or structure. In this paper we conduct a multivariate analysis of data from 111 twin pairs and 34 additional siblings on cerebellar volume, intracranial space, and body height. The analysis is carried out on the raw data and specifies a model for the mean and the covariance structure. Results suggest that cerebellar volume and intracranial space vary with age and sex. Brain volumes tend to decrease slightly with age, and males generally have a larger brain volume than females. The remaining phenotypic variance of cerebellar volume is largely genetic (88%). These genetic factors partly overlap with the genetic factors that explain variance in intracranial space and body height. The applied method is presented as a general approach for the analysis of intermediate phenotypes in which the effects of correlated variables on the observed scores are modeled through multivariate analysis. PMID:11206086
PyMVPA: A Python toolbox for multivariate pattern analysis of fMRI data
Hanke, Michael; Halchenko, Yaroslav O.; Sederberg, Per B.; Hanson, Stephen José; Haxby, James V.; Pollmann, Stefan
2009-01-01
Decoding patterns of neural activity onto cognitive states is one of the central goals of functional brain imaging. Standard univariate fMRI analysis methods, which correlate cognitive and perceptual function with the blood oxygenation-level dependent (BOLD) signal, have proven successful in identifying anatomical regions based on signal increases during cognitive and perceptual tasks. Recently, researchers have begun to explore new multivariate techniques that have proven to be more flexible, more reliable, and more sensitive than standard univariate analysis. Drawing on the field of statistical learning theory, these new classifier-based analysis techniques possess explanatory power that could provide new insights into the functional properties of the brain. However, unlike the wealth of software packages for univariate analyses, there are few packages that facilitate multivariate pattern classification analyses of fMRI data. Here we introduce a Python-based, cross-platform, and open-source software toolbox, called PyMVPA, for the application of classifier-based analysis techniques to fMRI datasets. PyMVPA makes use of Python's ability to access libraries written in a large variety of programming languages and computing environments to interface with the wealth of existing machine-learning packages. We present the framework in this paper and provide illustrative examples on its usage, features, and programmability. PMID:19184561
Singh, Elangbam J K; Gupta, Abhik; Singh, N R
2013-04-01
The aim of this paper was to analyze the groundwater quality of Imphal West district, Manipur, India, and assess its suitability for drinking, domestic, and agricultural use. Eighteen physico-chemical variables were analyzed in groundwater from 30 different hand-operated tube wells in urban, suburban, and rural areas in two seasons. The data were subjected to uni-, bi-, and multivariate statistical analysis, the latter comprising cluster analysis (CA), principal component analysis (PCA), and factor analysis (FA). Arsenic concentrations exceed the Indian standard in 23.3% and the WHO limit in 73.3% of the groundwater sources with only 26.7% in the acceptable range. Several variables like iron, chloride, sodium, sulfate, total dissolved solids, and turbidity are also beyond their desirable limits for drinking water in a number of sites. Sodium concentrations and sodium absorption ratio (SAR) are both high to render the water from the majority of the sources unsuitable for agricultural use. Multivariate statistical techniques, especially varimax rotation of PCA data helped to bring to focus the hidden yet important variables and understand their roles in influencing groundwater quality. Widespread arsenic contamination and high sodium concentration of groundwater pose formidable constraints towards its exploitation for drinking and other domestic and agricultural use in the study area, although urban anthropogenic impacts are not yet pronounced. PMID:22935861
Lindsey, David A.
2001-01-01
Pebble count data from Quaternary gravel deposits north of Denver, Colo., were analyzed by multivariate statistical methods to identify lithologic factors that might affect aggregate quality. The pebble count data used in this analysis were taken from the map by Colton and Fitch (1974) and are supplemented by data reported by the Front Range Infrastructure Resources Project. This report provides data tables and results of the statistical analysis. The multivariate statistical analysis used here consists of log-contrast principal components analysis (method of Reyment and Savazzi, 1999) followed by rotation of principal components and factor interpretation. Three lithologic factors that might affect aggregate quality were identified: 1) granite and gneiss versus pegmatite, 2) quartz + quartzite versus total volcanic rocks, and 3) total sedimentary rocks (mainly sandstone) versus granite. Factor 1 (grain size of igneous and metamorphic rocks) may represent destruction during weathering and transport or varying proportions of rocks in source areas. Factor 2 (resistant source rocks) represents the dispersion shadow of metaquartzite detritus, perhaps enhanced by resistance of quartz and quartzite during weathering and transport. Factor 3 (proximity to sandstone source) represents dilution of gravel by soft sedimentary rocks (mainly sandstone), which are exposed mainly in hogbacks near the mountain front. Factor 1 probably does not affect aggregate quality. Factor 2 would be expected to enhance aggregate quality as measured by the Los Angeles degradation test. Factor 3 may diminish aggregate quality.
Application of multivariate statistical methods to the analysis of ancient Turkish potsherds
Martin, R.C.
1986-01-01
Three hundred ancient Turkish potsherds were analyzed by instrumental neutron activation analysis, and the resulting data analyzed by several techniques of multivariate statistical analysis, some only recently developed. The programs AGCLUS, MASLOC, and SIMCA were sequentially employed to characterize and group the samples by type of pottery and site of excavation. Comparison of the statistical analyses by each method provided archaeological insight into the site/type relationships of the samples and ultimately evidence relevant to the commercial relations between the ancient communities and specialization of pottery production over time. The techniques used for statistical analysis were found to be of significant potential utility in the future analysis of other archaeometric data sets. 25 refs., 33 figs.
Applications of multivariate analysis to precious metal exploration in the western US
Nelson, C.E.
1985-01-01
Precious metal exploration in the western United States relies heavily on geochemical analyses for so-called pathfinder elements. These are elements known to be enriched in or near ore deposits. Carlin-type ore bodies, for example, contain anomalous, but quite variable, amounts of Au, Ag, As, Sb, Hg, Tl, and Ba. Unfortunately, many Carlin-type hydrothermal systems with no associated gold ore are equally anomalous in all members of the pathfinder suite. Large geochemical databases, containing thirty to fifty elements, have been gathered for a variety of epithermal precious metal deposits in the western United States. Multivariate analysis of the data identifies new element groups, some of which are distinctive of ore-related systems. These new geochemical fingerprints are more effective than the well-known epithermal suite at establishing ore potential in untested epithermal targets. Several multivariate techniques have been applied. Discriminant analysis uses raw geochemical data to provide a function that maximizes the separation between known groups, such as ore bodies and barren systems. Factor analysis reduce the raw data to a number of geologically interpretable element groups (factors). Each of the described techniques provides tools which can be quantitatively applied to exploration.
Pistone, Sara; Qoragllu, Dafina; Smistad, Gro; Hiorth, Marianne
2016-10-01
Polysaccharide-based nanoparticles are promising carriers for drug delivery applications. The particle size influences the biodistribution of the nanoparticles; hence size distributions and polydispersity index (PDI) are critical characteristics. However, the preparation of stable particles with a low PDI is a challenging task and is usually based on empirical trials. In this study, we report the use of multivariate evaluation to optimize the formulation factors for the preparation of alginate-zinc nanoparticles by ionotropic gelation. The PDI was selected as the response variable. Particle size, size distributions, zeta potential and pH of the samples were also recorded. Two full factorial (mixed-level) designs were analyzed by partial least squares regression (PLS). In the first design, the influence of the polysaccharide and the crosslinker concentrations were studied. The results revealed that size distributions with a low PDI were obtained by using a low polysaccharide concentrations (0.03-0.05%) and a zinc concentration of 0.03% (w/w). However, a high polysaccharide concentration can be advantageous for drug delivery systems. Therefore, in the second design, a high alginate concentration was used (0.09%) and a reduction in the PDI was obtained by simultaneously increasing the ionic strength of the solvent and the zinc concentration. The multivariate analysis also revealed the interaction between the factors in terms of their effects on the PDI; hence, compared to traditional univariate analyses, the multivariate analysis allowed us to obtain a more complete understanding of the effects of the factors scrutinized. In addition, the results are considered useful in order to avoid extensive empirical tests for future formulation studies. PMID:27288663
Multivariate stochastic analysis for Monthly hydrological time series at Cuyahoga River Basin
NASA Astrophysics Data System (ADS)
zhang, L.
2011-12-01
Copula has become a very powerful statistic and stochastic methodology in case of the multivariate analysis in Environmental and Water resources Engineering. In recent years, the popular one-parameter Archimedean copulas, e.g. Gumbel-Houggard copula, Cook-Johnson copula, Frank copula, the meta-elliptical copula, e.g. Gaussian Copula, Student-T copula, etc. have been applied in multivariate hydrological analyses, e.g. multivariate rainfall (rainfall intensity, duration and depth), flood (peak discharge, duration and volume), and drought analyses (drought length, mean and minimum SPI values, and drought mean areal extent). Copula has also been applied in the flood frequency analysis at the confluences of river systems by taking into account the dependence among upstream gauge stations rather than by using the hydrological routing technique. In most of the studies above, the annual time series have been considered as stationary signal which the time series have been assumed as independent identically distributed (i.i.d.) random variables. But in reality, hydrological time series, especially the daily and monthly hydrological time series, cannot be considered as i.i.d. random variables due to the periodicity existed in the data structure. Also, the stationary assumption is also under question due to the Climate Change and Land Use and Land Cover (LULC) change in the fast years. To this end, it is necessary to revaluate the classic approach for the study of hydrological time series by relaxing the stationary assumption by the use of nonstationary approach. Also as to the study of the dependence structure for the hydrological time series, the assumption of same type of univariate distribution also needs to be relaxed by adopting the copula theory. In this paper, the univariate monthly hydrological time series will be studied through the nonstationary time series analysis approach. The dependence structure of the multivariate monthly hydrological time series will be
Detection of counterfeit Viagra® by Raman microspectroscopy imaging and multivariate analysis.
Sacré, Pierre-Yves; Deconinck, Eric; Saerens, Lien; De Beer, Thomas; Courselle, Patricia; Vancauwenberghe, Roy; Chiap, Patrice; Crommen, Jacques; De Beer, Jacques O
2011-09-10
During the past years, pharmaceutical counterfeiting was mainly a problem of developing countries with weak enforcement and inspection programs. However, Europe and North America are more and more confronted with the counterfeiting problem. During this study, 26 counterfeits and imitations of Viagra® tablets and 8 genuine tablets of Viagra® were analysed by Raman microspectroscopy imaging. After unfolding the data, three maps are combined per sample and a first PCA is realised on these data. Then, the first principal components of each sample are assembled. The exploratory and classification analysis are performed on that matrix. PCA was applied as exploratory analysis tool on different spectral ranges to detect counterfeit medicines based on the full spectra (200-1800 cm⁻¹), the presence of lactose (830-880 cm⁻¹) and the spatial distribution of sildenafil (1200-1290 cm⁻¹) inside the tablet. After the exploratory analysis, three different classification algorithms were applied on the full spectra dataset: linear discriminant analysis, k-nearest neighbour and soft independent modelling of class analogy. PCA analysis of the 830-880 cm⁻¹ spectral region discriminated genuine samples while the multivariate analysis of the spectral region between 1200 cm⁻¹ and 1290 cm⁻¹ returns no satisfactory results. A good discrimination of genuine samples was obtained with multivariate analysis of the full spectra region (200-1800 cm⁻¹). Application of the k-NN and SIMCA algorithm returned 100% correct classification during both internal and external validation. PMID:21715121
NASA Astrophysics Data System (ADS)
Belianinov, Alex; Ganesh, Panchapakesan; Lin, Wenzhi; Sales, Brian C.; Sefat, Athena S.; Jesse, Stephen; Pan, Minghu; Kalinin, Sergei V.
2014-12-01
Atomic level spatial variability of electronic structure in Fe-based superconductor FeTe0.55Se0.45 (Tc = 15 K) is explored using current-imaging tunneling-spectroscopy. Multivariate statistical analysis of the data differentiates regions of dissimilar electronic behavior that can be identified with the segregation of chalcogen atoms, as well as boundaries between terminations and near neighbor interactions. Subsequent clustering analysis allows identification of the spatial localization of these dissimilar regions. Similar statistical analysis of modeled calculated density of states of chemically inhomogeneous FeTe1-xSex structures further confirms that the two types of chalcogens, i.e., Te and Se, can be identified by their electronic signature and differentiated by their local chemical environment. This approach allows detailed chemical discrimination of the scanning tunneling microscopy data including separation of atomic identities, proximity, and local configuration effects and can be universally applicable to chemically and electronically inhomogeneous surfaces.
A versatile multivariate image analysis pipeline reveals features of Xenopus extract spindles.
Grenfell, Andrew W; Strzelecka, Magdalena; Crowder, Marina E; Helmke, Kara J; Schlaitz, Anne-Lore; Heald, Rebecca
2016-04-11
Imaging datasets are rich in quantitative information. However, few cell biologists possess the tools necessary to analyze them. Here, we present a large dataset ofXenopusextract spindle images together with an analysis pipeline designed to assess spindle morphology across a range of experimental conditions. Our analysis of different spindle types illustrates how kinetochore microtubules amplify spindle microtubule density. Extract mixing experiments reveal that some spindle features titrate, while others undergo switch-like transitions, and multivariate analysis shows the pleiotropic morphological effects of modulating the levels of TPX2, a key spindle assembly factor. We also apply our pipeline to analyze nuclear morphology in human cell culture, showing the general utility of the segmentation approach. Our analyses provide new insight into the diversity of spindle types and suggest areas for future study. The approaches outlined can be applied by other researchers studying spindle morphology and adapted with minimal modification to other experimental systems. PMID:27044897
NASA Astrophysics Data System (ADS)
Sommer, M.; Goschnick, J.
2005-09-01
The quantification of elements in quadrupole based SNMS is hampered by superpositions of atomic and cluster signals. Moreover, the conventional SNMS data evaluation employs only atomic signals to determine elemental concentrations, which not allows any chemical specifications of the determined elements. Improvements in the elemental quantification and additional chemical information can be obtained from kinetic energy analysis and the inclusion of molecular signals into mass spectra evaluation. With the help of multivariate data analysis techniques, the combined information is used for the first time for a quantitative and chemically distinctive determination of sulfur. The kinetic energy analysis, used to solve the interference of sulfur with O 2 at masses 32-34 D, turned out to be highly important for the new type of evaluation.
Differentiation of aged fibers by Raman spectroscopy and multivariate data analysis.
Bianchi, Federica; Riboni, Nicolò; Trolla, Valentina; Furlan, Giada; Avantaggiato, Giorgio; Iacobellis, Giuliano; Careri, Maria
2016-07-01
Raman spectroscopy followed by multivariate data analysis was used to analyze cotton fibers dyed using similar formulations and submitted to different aging conditions. Spectra were collected on a commercial instrument using a near-infrared laser with a 780nm light source. Discriminant analysis allowed to correctly classify the aged fibers 100% of the time. The prediction ability of the calculated model was estimated to be 100% by the "leave-one-out" cross-validation for 3 out of the 4 series under investigation. Finally, reliability of the developed approach for the discrimination of aged vs new fibers was confirmed by the analysis of commercial polyamide and polyester textiles submitted to the same aging process. PMID:27154701
Belianinov, Alex; Panchapakesan, G.; Lin, Wenzhi; Sales, Brian C.; Sefat, Athena Safa; Jesse, Stephen; Pan, Minghu; Kalinin, Sergei V.
2014-12-02
Atomic level spatial variability of electronic structure in Fe-based superconductor FeTe0.55Se0.45 (Tc = 15 K) is explored using current-imaging tunneling-spectroscopy. Multivariate statistical analysis of the data differentiates regions of dissimilar electronic behavior that can be identified with the segregation of chalcogen atoms, as well as boundaries between terminations and near neighbor interactions. Subsequent clustering analysis allows identification of the spatial localization of these dissimilar regions. Similar statistical analysis of modeled calculated density of states of chemically inhomogeneous FeTe1 x Sex structures further confirms that the two types of chalcogens, i.e., Te and Se, can be identified by their electronic signature and differentiated by their local chemical environment. This approach allows detailed chemical discrimination of the scanning tunneling microscopy data including separation of atomic identities, proximity, and local configuration effects and can be universally applicable to chemically and electronically inhomogeneous surfaces.
Beer fermentation: monitoring of process parameters by FT-NIR and multivariate data analysis.
Grassi, Silvia; Amigo, José Manuel; Lyndgaard, Christian Bøge; Foschino, Roberto; Casiraghi, Ernestina
2014-07-15
This work investigates the capability of Fourier-Transform near infrared (FT-NIR) spectroscopy to monitor and assess process parameters in beer fermentation at different operative conditions. For this purpose, the fermentation of wort with two different yeast strains and at different temperatures was monitored for nine days by FT-NIR. To correlate the collected spectra with °Brix, pH and biomass, different multivariate data methodologies were applied. Principal component analysis (PCA), partial least squares (PLS) and locally weighted regression (LWR) were used to assess the relationship between FT-NIR spectra and the abovementioned process parameters that define the beer fermentation. The accuracy and robustness of the obtained results clearly show the suitability of FT-NIR spectroscopy, combined with multivariate data analysis, to be used as a quality control tool in the beer fermentation process. FT-NIR spectroscopy, when combined with LWR, demonstrates to be a perfectly suitable quantitative method to be implemented in the production of beer. PMID:24594186
What Makes a Pattern? Matching Decoding Methods to Data in Multivariate Pattern Analysis
Kragel, Philip A.; Carter, R. McKell; Huettel, Scott A.
2012-01-01
Research in neuroscience faces the challenge of integrating information across different spatial scales of brain function. A promising technique for harnessing information at a range of spatial scales is multivariate pattern analysis (MVPA) of functional magnetic resonance imaging (fMRI) data. While the prevalence of MVPA has increased dramatically in recent years, its typical implementations for classification of mental states utilize only a subset of the information encoded in local fMRI signals. We review published studies employing multivariate pattern classification since the technique’s introduction, which reveal an extensive focus on the improved detection power that linear classifiers provide over traditional analysis techniques. We demonstrate using simulations and a searchlight approach, however, that non-linear classifiers are capable of extracting distinct information about interactions within a local region. We conclude that for spatially localized analyses, such as searchlight and region of interest, multiple classification approaches should be compared in order to match fMRI analyses to the properties of local circuits. PMID:23189035
Kambeitz, Joseph; Kambeitz-Ilankovic, Lana; Leucht, Stefan; Wood, Stephen; Davatzikos, Christos; Malchow, Berend; Falkai, Peter; Koutsouleris, Nikolaos
2015-01-01
Multivariate pattern recognition approaches have recently facilitated the search for reliable neuroimaging-based biomarkers in psychiatric disorders such as schizophrenia. By taking into account the multivariate nature of brain functional and structural changes as well as their distributed localization across the whole brain, they overcome drawbacks of traditional univariate approaches. To evaluate the overall reliability of neuroimaging-based biomarkers, we conducted a comprehensive literature search to identify all studies that used multivariate pattern recognition to identify patterns of brain alterations that differentiate patients with schizophrenia from healthy controls. A bivariate random-effects meta-analytic model was implemented to investigate the sensitivity and specificity across studies as well as to assess the robustness to potentially confounding variables. In the total sample of n=38 studies (1602 patients and 1637 healthy controls), patients were differentiated from controls with a sensitivity of 80.3% (95% CI: 76.7–83.5%) and a specificity of 80.3% (95% CI: 76.9–83.3%). Analysis of neuroimaging modality indicated higher sensitivity (84.46%, 95% CI: 79.9–88.2%) and similar specificity (76.9%, 95% CI: 71.3–81.6%) of rsfMRI studies as compared with structural MRI studies (sensitivity: 76.4%, 95% CI: 71.9–80.4%, specificity of 79.0%, 95% CI: 74.6–82.8%). Moderator analysis identified significant effects of age (p=0.029), imaging modality (p=0.019), and disease stage (p=0.025) on sensitivity as well as of positive-to-negative symptom ratio (p=0.022) and antipsychotic medication (p=0.016) on specificity. Our results underline the utility of multivariate pattern recognition approaches for the identification of reliable neuroimaging-based biomarkers. Despite the clinical heterogeneity of the schizophrenia phenotype, brain functional and structural alterations differentiate schizophrenic patients from healthy controls with 80% sensitivity and
Predicting Cytotoxic T-cell Age from Multivariate Analysis of Static and Dynamic Biomarkers*
Rivet, Catherine A.; Hill, Abby S.; Lu, Hang; Kemp, Melissa L.
2011-01-01
Adoptive T-cell transfer therapy relies upon in vitro expansion of autologous cytotoxic T cells that are capable of tumor recognition. The success of this cell-based therapy depends on the specificity and responsiveness of the T cell clones before transfer. During ex vivo expansion, CD8+ T cells present signs of replicative senescence and loss of function. The transfer of nonresponsive senescent T cells is a major bottleneck for the success of adoptive T-cell transfer therapy. Quantitative methods for assessing cellular age and responsiveness will facilitate the development of appropriate cell expansion and selection protocols. Although several biomarkers of lymphocyte senescence have been identified, these proteins in isolation are not sufficient to determine the age-dependent responsiveness of T cells. We have developed a multivariate model capable of extracting combinations of markers that are the most informative to predict cellular age. To acquire signaling information with high temporal resolution, we designed a microfluidic chip enabling parallel lysis and fixation of stimulated cell samples on-chip. The acquisition of 25 static biomarkers and 48 dynamic signaling measurements at different days in culture, integrating single-cell and population based information, allowed the multivariate regression model to accurately predict CD8+ T-cell age. From surface marker expression and early phosphorylation events following T-cell receptor stimulation, the model successfully predicts days in culture and number of population doublings with R2 = 0.91 and 0.98, respectively. Furthermore, we found that impairment of early signaling events following T cell receptor stimulation because of long term culture allows prediction of costimulatory molecules CD28 and CD27 expression levels and the number of population divisions in culture from a limited subset of signaling proteins. The multivariate analysis highlights the information content of both averaged biomarker values and
Multivariate analysis of groundwater resources in Ganga-Yamuna basin (India).
Sargaonkar, Aabha P; Gupta, Apurba; Devotta, Sukumar
2008-07-01
Groundwater quality data on physico-chemical, bacteriological and heavy metal concentrations in three cities (Faridabad, Allahabad and Varanasi) in Ganga-Yamuna basin was subjected to multivariate analysis (MVA) using SPSS. The factors extracted showed high loading (> 0.3) of various parameters, such as Cl, conductivity, TDS, hardness, Na, Mg, and SO4, indicating contamination due to leaching of pollutants. Major manifest variable associated with these factors is the unorganized solid waste dumping practiced in all the cities. Bacterial contamination of hand pump samples in Allahabad is attributed to surface water-groundwater interaction. The factor with high loading of Ca and F is indicative of geological conditions of the region. Wells in Yamuna river sub-watershed exhibit less freshwater recharge, which is attributed to surface water pollution and sediment deposition in the river. Thus, the methodology for hydrogeological analysis is useful to identify critical water quality issues and possible sources of pollution in river basins. PMID:19552076
Lee, S. H.; van der Werf, J. H. J.
2016-01-01
Summary: We have developed an algorithm for genetic analysis of complex traits using genome-wide SNPs in a linear mixed model framework. Compared to current standard REML software based on the mixed model equation, our method is substantially faster. The advantage is largest when there is only a single genetic covariance structure. The method is particularly useful for multivariate analysis, including multi-trait models and random regression models for studying reaction norms. We applied our proposed method to publicly available mice and human data and discuss the advantages and limitations. Availability and implementation: MTG2 is available in https://sites.google.com/site/honglee0707/mtg2. Contact: hong.lee@une.edu.au Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26755623
Web-based tools for modelling and analysis of multivariate data: California ozone pollution activity
Dinov, Ivo D.; Christou, Nicolas
2014-01-01
This article presents a hands-on web-based activity motivated by the relation between human health and ozone pollution in California. This case study is based on multivariate data collected monthly at 20 locations in California between 1980 and 2006. Several strategies and tools for data interrogation and exploratory data analysis, model fitting and statistical inference on these data are presented. All components of this case study (data, tools, activity) are freely available online at: http://wiki.stat.ucla.edu/socr/index.php/SOCR_MotionCharts_CAOzoneData. Several types of exploratory (motion charts, box-and-whisker plots, spider charts) and quantitative (inference, regression, analysis of variance (ANOVA)) data analyses tools are demonstrated. Two specific human health related questions (temporal and geographic effects of ozone pollution) are discussed as motivational challenges. PMID:24465054
Multivariate statistical analysis of stream-sediment geochemistry in the Grazer Paläozoikum, Austria
Weber, L.; Davis, J.C.
1990-01-01
The Austrian reconnaissance study of stream-sediment composition — more than 30000 clay-fraction samples collected over an area of 40000 km2 — is summarized in an atlas of regional maps that show the distributions of 35 elements. These maps, rich in information, reveal complicated patterns of element abundance that are difficult to compare on more than a small number of maps at one time. In such a study, multivariate procedures such as simultaneous R-Q mode components analysis may be helpful. They can compress a large number of variables into a much smaller number of independent linear combinations. These composite variables may be mapped and relationships sought between them and geological properties. As an example, R-Q mode components analysis is applied here to the Grazer Paläozoikum, a tectonic unit northeast of the city of Graz, which is composed of diverse lithologies and contains many mineral deposits.
NASA Astrophysics Data System (ADS)
Ruddell, B. L.; Kumar, P.
2006-12-01
Information-Theoretic approaches to the stochastic study of hydrology and other environmental sciences have become popular in recent years, owing to a greater availability of environmental data and the need to develop tools that can dissect the complex dynamics of environmental systems. Entropy-based metrics such at Transfer Entropy can go beyond correlation and uncover the cardinality of relationships between measured signals, and are therefore useful for the study of feedback-based complex environmental systems that are rich in data. Because information theoretic approaches to signal analysis are relatively immature, effort is needed to refine the technique in terms of algorithms, controls, and interpretation of results. A basic framework for environmental system analysis and interpretation is presented, and this framework is applied to simultaneous multivariate timeseries data. Eco-hydrological data from the Ameriflux network is examined, and the complex dynamics and feedbacks of the Eco-hydrological system are discussed.
Berman, E F; Kulp, K S; Knize, M G; Wu, L; Nelson, E J; Nelson, D O; Wu, K J
2006-05-04
Time-of-Flight Secondary Ion Mass Spectrometry (ToF-SIMS) is utilized to examine the mass spectra and fragmentation patterns of seven isomeric monosaccharides. Multivariate statistical analysis techniques, including principal component analysis (PCA), allow discrimination of the extremely similar mass spectra of stereoisomers. Furthermore, PCA identifies those fragment peaks which vary significantly between spectra. Heavy isotope studies confirm that these peaks are indeed sugar fragments, allow identification of the fragments, and provide clues to the fragmentation pathways. Excellent reproducibility is shown by multiple experiments performed over time and on separate samples. This study demonstrates the combined selectivity and discrimination power of ToF-SIMS and PCA, and suggests new applications of the technique including differentiation of subtle chemical changes in biological samples that may provide insights into cellular processes, disease progress, and disease diagnosis.
Sensitivity Analysis and Neutron Fluence Adjustment for VVER-1000 Rpv
NASA Astrophysics Data System (ADS)
Belousov, S.; Ilieva, Kr.; Kirilova, D.
2003-06-01
Adjustment of the neutron fluence at the VVER-1000 RPV inner wall has been carried out. For the purpose of this adjustment the neutron flux response sensitivity to the main parameters of calculation uncertainty has been calculated. The obtained sensitivities, the parameters uncertainty and activity measurement data of iron, copper and niobium detectors positioned behind the RPV of Kozloduy NPP Unit 5 have been used in this adjustment.
Computed ABC Analysis for Rational Selection of Most Informative Variables in Multivariate Data
Ultsch, Alfred; Lötsch, Jörn
2015-01-01
Objective Multivariate data sets often differ in several factors or derived statistical parameters, which have to be selected for a valid interpretation. Basing this selection on traditional statistical limits leads occasionally to the perception of losing information from a data set. This paper proposes a novel method for calculating precise limits for the selection of parameter sets. Methods The algorithm is based on an ABC analysis and calculates these limits on the basis of the mathematical properties of the distribution of the analyzed items. The limits im-plement the aim of any ABC analysis, i.e., comparing the increase in yield to the required additional effort. In particular, the limit for set A, the “important few”, is optimized in a way that both, the effort and the yield for the other sets (B and C), are minimized and the additional gain is optimized. Results As a typical example from biomedical research, the feasibility of the ABC analysis as an objective replacement for classical subjective limits to select highly relevant variance components of pain thresholds is presented. The proposed method improved the biological inter-pretation of the results and increased the fraction of valid information that was obtained from the experimental data. Conclusions The method is applicable to many further biomedical problems in-cluding the creation of diagnostic complex biomarkers or short screening tests from comprehensive test batteries. Thus, the ABC analysis can be proposed as a mathematically valid replacement for traditional limits to maximize the information obtained from multivariate research data. PMID:26061064
Multivariate Statistical Analysis of Water Quality data in Indian River Lagoon, Florida
NASA Astrophysics Data System (ADS)
Sayemuzzaman, M.; Ye, M.
2015-12-01
The Indian River Lagoon, is part of the longest barrier island complex in the United States, is a region of particular concern to the environmental scientist because of the rapid rate of human development throughout the region and the geographical position in between the colder temperate zone and warmer sub-tropical zone. Thus, the surface water quality analysis in this region always brings the newer information. In this present study, multivariate statistical procedures were applied to analyze the spatial and temporal water quality in the Indian River Lagoon over the period 1998-2013. Twelve parameters have been analyzed on twelve key water monitoring stations in and beside the lagoon on monthly datasets (total of 27,648 observations). The dataset was treated using cluster analysis (CA), principle component analysis (PCA) and non-parametric trend analysis. The CA was used to cluster twelve monitoring stations into four groups, with stations on the similar surrounding characteristics being in the same group. The PCA was then applied to the similar groups to find the important water quality parameters. The principal components (PCs), PC1 to PC5 was considered based on the explained cumulative variances 75% to 85% in each cluster groups. Nutrient species (phosphorus and nitrogen), salinity, specific conductivity and erosion factors (TSS, Turbidity) were major variables involved in the construction of the PCs. Statistical significant positive or negative trends and the abrupt trend shift were detected applying Mann-Kendall trend test and Sequential Mann-Kendall (SQMK), for each individual stations for the important water quality parameters. Land use land cover change pattern, local anthropogenic activities and extreme climate such as drought might be associated with these trends. This study presents the multivariate statistical assessment in order to get better information about the quality of surface water. Thus, effective pollution control/management of the surface
Chen, Gang; Adleman, Nancy E.; Saad, Ziad S.; Leibenluft, Ellen; Cox, RobertW.
2014-01-01
All neuroimaging packages can handle group analysis with t-tests or general linear modeling (GLM). However, they are quite hamstrung when there are multiple within-subject factors or when quantitative covariates are involved in the presence of a within-subject factor. In addition, sphericity is typically assumed for the variance–covariance structure when there are more than two levels in a within-subject factor. To overcome such limitations in the traditional AN(C)OVA and GLM, we adopt a multivariate modeling (MVM) approach to analyzing neuroimaging data at the group level with the following advantages: a) there is no limit on the number of factors as long as sample sizes are deemed appropriate; b) quantitative covariates can be analyzed together with within- subject factors; c) when a within-subject factor is involved, three testing methodologies are provided: traditional univariate testing (UVT)with sphericity assumption (UVT-UC) and with correction when the assumption is violated (UVT-SC), and within-subject multivariate testing (MVT-WS); d) to correct for sphericity violation at the voxel level, we propose a hybrid testing (HT) approach that achieves equal or higher power via combining traditional sphericity correction methods (Greenhouse–Geisser and Huynh–Feldt) with MVT-WS. PMID:24954281
A Versatile Cell Death Screening Assay Using Dye-Stained Cells and Multivariate Image Analysis
Collins, Tony J.; Ylanko, Jarkko; Geng, Fei
2015-01-01
Abstract A novel dye-based method for measuring cell death in image-based screens is presented. Unlike conventional high- and medium-throughput cell death assays that measure only one form of cell death accurately, using multivariate analysis of micrographs of cells stained with the inexpensive mix, red dye nonyl acridine orange, and a nuclear stain, it was possible to quantify cell death induced by a variety of different agonists even without a positive control. Surprisingly, using a single known cytotoxic agent as a positive control for training a multivariate classifier allowed accurate quantification of cytotoxicity for mechanistically unrelated compounds enabling generation of dose–response curves. Comparison with low throughput biochemical methods suggested that cell death was accurately distinguished from cell stress induced by low concentrations of the bioactive compounds Tunicamycin and Brefeldin A. High-throughput image-based format analyses of more than 300 kinase inhibitors correctly identified 11 as cytotoxic with only 1 false positive. The simplicity and robustness of this dye-based assay makes it particularly suited to live cell screening for toxic compounds. PMID:26422066
Gieray, R.A.; Lazar, A.; Parker, E.P.; Ramsey, J. M.; Reilly, P.T.A.; Rosenthal, S.E.; Trahan, M.W.; Wagner, J.S.; Whitten, W.B.
1999-04-27
We are developing a method for the real-time analysis of airborne microparticles based on laser ablation mass spectroscopy. Airborne particles enter an ion trap mass spectrometer through a differentially-pumped inlet, are detected by light scattered from two CW laser beams, and sampled by a 10 ns excimer laser pulse at 308 nm as they pass through the center of the ion trap electrodes. After the laser pulse, the stored ions are separated by conventional ion trap methods. In this work thousands of positive and negative ion spectra were collected for eighteen different species: six bacteria, six pollen, and six particulate samples. The data were then averaged and analyzed using the Multivariate Patch Algorithm (MPA), a variant of traditional multivariate anal ysis. The MPA correctly identified all of the positive ion spectra and 17 of the 18 negative ion spectra. In addition, when the average positive and negative spectra were combined the MPA correctly identified all 18 species. Finally, the MPA is also able to identify the components of computer synthesized mixtures of the samples studied
A Versatile Cell Death Screening Assay Using Dye-Stained Cells and Multivariate Image Analysis.
Collins, Tony J; Ylanko, Jarkko; Geng, Fei; Andrews, David W
2015-11-01
A novel dye-based method for measuring cell death in image-based screens is presented. Unlike conventional high- and medium-throughput cell death assays that measure only one form of cell death accurately, using multivariate analysis of micrographs of cells stained with the inexpensive mix, red dye nonyl acridine orange, and a nuclear stain, it was possible to quantify cell death induced by a variety of different agonists even without a positive control. Surprisingly, using a single known cytotoxic agent as a positive control for training a multivariate classifier allowed accurate quantification of cytotoxicity for mechanistically unrelated compounds enabling generation of dose-response curves. Comparison with low throughput biochemical methods suggested that cell death was accurately distinguished from cell stress induced by low concentrations of the bioactive compounds Tunicamycin and Brefeldin A. High-throughput image-based format analyses of more than 300 kinase inhibitors correctly identified 11 as cytotoxic with only 1 false positive. The simplicity and robustness of this dye-based assay makes it particularly suited to live cell screening for toxic compounds. PMID:26422066
Multivariate co-integration analysis of the Kaya factors in Ghana.
Asumadu-Sarkodie, Samuel; Owusu, Phebe Asantewaa
2016-05-01
The fundamental goal of the Government of Ghana's development agenda as enshrined in the Growth and Poverty Reduction Strategy to grow the economy to a middle income status of US$1000 per capita by the end of 2015 could be met by increasing the labour force, increasing energy supplies and expanding the energy infrastructure in order to achieve the sustainable development targets. In this study, a multivariate co-integration analysis of the Kaya factors namely carbon dioxide, total primary energy consumption, population and GDP was investigated in Ghana using vector error correction model with data spanning from 1980 to 2012. Our research results show an existence of long-run causality running from population, GDP and total primary energy consumption to carbon dioxide emissions. However, there is evidence of short-run causality running from population to carbon dioxide emissions. There was a bi-directional causality running from carbon dioxide emissions to energy consumption and vice versa. In other words, decreasing the primary energy consumption in Ghana will directly reduce carbon dioxide emissions. In addition, a bi-directional causality running from GDP to energy consumption and vice versa exists in the multivariate model. It is plausible that access to energy has a relationship with increasing economic growth and productivity in Ghana. PMID:26857005
EXPERIMENTAL ANALYSIS OF MULTIVARIATE FEMALE CHOICE IN GRAY TREEFROGS (Hyla versicolor)
Gerhardt, H. Carl; Brooks, Robert
2009-01-01
Even simple biological signals vary in several measurable dimensions. Understanding their evolution requires, therefore, a multivariate understanding of selection, including how different properties interact to determine the effectiveness of the signal. We combined experimental manipulation with multivariate selection analysis to assess female mate choice on the simple trilled calls of male gray treefrogs. We independently and randomly varied five behaviorally relevant acoustic properties in 154 synthetic calls. We compared response times of each of 154 females to one of these calls with its response to a standard call that had mean values of the five properties. We found directional and quadratic selection on two properties indicative of the amount of signaling, pulse number and call rate. Canonical rotation of the fitness surface showed that these properties, along with pulse rate, contributed heavily to a major axis of stabilizing selection, a result consistent with univariate studies showing diminishing effects of increasing pulse number well beyond the mean. Spectral properties contributed to a second major axis of stabilizing selection. The single major axis of disruptive selection suggested that a combination of two temporal and two spectral properties with values differing from the mean should be especially attractive. PMID:19500145
Enhancing e-waste estimates: improving data quality by multivariate Input-Output Analysis.
Wang, Feng; Huisman, Jaco; Stevels, Ab; Baldé, Cornelis Peter
2013-11-01
Waste electrical and electronic equipment (or e-waste) is one of the fastest growing waste streams, which encompasses a wide and increasing spectrum of products. Accurate estimation of e-waste generation is difficult, mainly due to lack of high quality data referred to market and socio-economic dynamics. This paper addresses how to enhance e-waste estimates by providing techniques to increase data quality. An advanced, flexible and multivariate Input-Output Analysis (IOA) method is proposed. It links all three pillars in IOA (product sales, stock and lifespan profiles) to construct mathematical relationships between various data points. By applying this method, the data consolidation steps can generate more accurate time-series datasets from available data pool. This can consequently increase the reliability of e-waste estimates compared to the approach without data processing. A case study in the Netherlands is used to apply the advanced IOA model. As a result, for the first time ever, complete datasets of all three variables for estimating all types of e-waste have been obtained. The result of this study also demonstrates significant disparity between various estimation models, arising from the use of data under different conditions. It shows the importance of applying multivariate approach and multiple sources to improve data quality for modelling, specifically using appropriate time-varying lifespan parameters. Following the case study, a roadmap with a procedural guideline is provided to enhance e-waste estimation studies. PMID:23899476
NASA Astrophysics Data System (ADS)
Pisias, Nicklas G.; Murray, Richard W.; Scudder, Rachel P.
2013-10-01
Multivariate statistical treatments of large data sets in sedimentary geochemical and other fields are rapidly becoming more popular as analytical and computational capabilities expand. Because geochemical data sets present a unique set of conditions (e.g., the closed array), application of generic off-the-shelf applications is not straightforward and can yield misleading results. We present here annotated MATLAB scripts (and specific guidelines for their use) for Q-mode factor analysis, a constrained least squares multiple linear regression technique, and a total inversion protocol, that are based on the well-known approaches taken by Dymond (1981), Leinen and Pisias (1984), Kyte et al. (1993), and their predecessors. Although these techniques have been used by investigators for the past decades, their application has been neither consistent nor transparent, as their code has remained in-house or in formats not commonly used by many of today's researchers (e.g., FORTRAN). In addition to providing the annotated scripts and instructions for use, we discuss general principles to be considered when performing multivariate statistical treatments of large geochemical data sets, provide a brief contextual history of each approach, explain their similarities and differences, and include a sample data set for the user to test their own manipulation of the scripts.
NASA Astrophysics Data System (ADS)
Daeid, N. Nic; Meier-Augenstein, W.; Kemp, H. F.
2012-04-01
The analysis of cotton fibres can be particularly challenging within a forensic science context where discrimination of one fibre from another is of importance. Normally cotton fibre analysis examines the morphological structure of the recovered material and compares this with that of a known fibre from a particular source of interest. However, the conventional microscopic and chemical analysis of fibres and any associated dyes is generally unsuccessful because of the similar morphology of the fibres. Analysis of the dyes which may have been applied to the cotton fibre can also be undertaken though this can be difficult and unproductive in terms of discriminating one fibre from another. In the study presented here we have explored the potential for Isotope Ratio Mass Spectrometry (IRMS) to be utilised as an additional tool for cotton fibre analysis in an attempt to reveal further discriminatory information. This work has concentrated on un-dyed cotton fibres of known origin in order to expose the potential of the analytical technique. We report the results of a pilot study aimed at testing the hypothesis that multi-element stable isotope analysis of cotton fibres in conjunction with multivariate statistical analysis of the resulting isotopic abundance data using well established chemometric techniques permits sample provenancing based on the determination of where the cotton was grown and as such will facilitate sample discrimination. To date there is no recorded literature of this type of application of IRMS to cotton samples, which may be of forensic science relevance.
Cole, B F; Gelber, R D; Anderson, K M
1994-09-01
We present a parametric methodology for performing quality-of-life-adjusted survival analysis using multivariate censored survival data. It represents a generalization of the nonparametric Q-TWiST method (Quality-adjusted Time without Symptoms and Toxicity). The event times correspond to transitions between states of health that differ in terms of quality of life. Each transition is governed by a competing risks model where the health states are the competing risks. Overall survival is the sum of the amount of time spent in each health state. The first step of the proposed methodology consists of defining a quality function that assigns a "score" to a life having given health state transitions. It is a composite measure of both quantity and quality of life. In general, the quality function assigns a small value to a short life with poor quality and a high value to a long life with good quality. In the second step, parametric survival models are fit to the data. This is done by repeatedly modeling the conditional cause-specific hazard functions given the previous transitions. Covariates are incorporated by accelerated failure time regression, and the model parameters are estimated by maximum likelihood. Lastly, the modeling results are used to estimate the expectation of quality functions. Standard errors and confidence intervals are computed using the bootstrap and delta methods. The results are useful for simultaneously evaluating treatments in terms of quantity and quality of life. To demonstrate the proposed methods, we perform an analysis of data from the International Breast Cancer Study Group Trial V, which compared short-duration chemotherapy versus long-duration chemotherapy in the treatment of node-positive breast cancer. The events studied are: (1) the end of treatment toxicity, (2) disease recurrence, and (3) overall survival. PMID:7981389
[Factors behind global fertility development after 1950: a multivariate analysis of 128 countries].
Lutz, W
1984-01-01
Time series of selected socioeconomic indicators of 128 countries representing 97.4% of the world population in 1975 are related to gross reproduction rates (GRR) in various models of bivariate and multivariate analysis assuming linear as well as logistic functional relationships. The data base stems mainly from UN publications and special attention is given to China. For the pooling of time series and cross-sectional data, countries are grouped according to geographical and cultural criteria, and variables accounting for these regional effects are included in the equations. Special emphasis is placed on the development of an analytical framework trying to combine aspects of economic and sociological fertility analysis which account for shortterm economic determination as well as for changes in the system of social norms and in the degree to which these norms are forced on individual behavior. Among other findings, there is a pronounced dichotomy between more and less developed countries with respect to most variables but especially so with fertility. There are those countries with GRRs above 2.0 and below. The explanatory values of the models of multivariate analysis are generally very high with the RZs ranging from 0.73-0.97, depending on the weighting used and on the specification of the model. Mutlicollinearity was reduced by transformation and aggregation of variables. Female life expectancy at birth seems to be the single most important variable in explaining differential fertility; however, no direct causal link may be assumed; instead life expectancy can be seen as a very general indicator of health conditions and quality of life which in turn influence fertility development. Female educational status is the 2nd most important variable. Measures of female educational status relative to men also supports the argument that female social status is relevant for fertility. A positive income effect on fertility appears for the gross domestic product per caput in
Assessment of water quality parameters using multivariate analysis for Klang River basin, Malaysia.
Mohamed, Ibrahim; Othman, Faridah; Ibrahim, Adriana I N; Alaa-Eldin, M E; Yunus, Rossita M
2015-01-01
This case study uses several univariate and multivariate statistical techniques to evaluate and interpret a water quality data set obtained from the Klang River basin located within the state of Selangor and the Federal Territory of Kuala Lumpur, Malaysia. The river drains an area of 1,288 km(2), from the steep mountain rainforests of the main Central Range along Peninsular Malaysia to the river mouth in Port Klang, into the Straits of Malacca. Water quality was monitored at 20 stations, nine of which are situated along the main river and 11 along six tributaries. Data was collected from 1997 to 2007 for seven parameters used to evaluate the status of the water quality, namely dissolved oxygen, biochemical oxygen demand, chemical oxygen demand, suspended solids, ammoniacal nitrogen, pH, and temperature. The data were first investigated using descriptive statistical tools, followed by two practical multivariate analyses that reduced the data dimensions for better interpretation. The analyses employed were factor analysis and principal component analysis, which explain 60 and 81.6% of the total variation in the data, respectively. We found that the resulting latent variables from the factor analysis are interpretable and beneficial for describing the water quality in the Klang River. This study presents the usefulness of several statistical methods in evaluating and interpreting water quality data for the purpose of monitoring the effectiveness of water resource management. The results should provide more straightforward data interpretation as well as valuable insight for managers to conceive optimum action plans for controlling pollution in river water. PMID:25433545
Spatial assessment of air quality patterns in Malaysia using multivariate analysis
NASA Astrophysics Data System (ADS)
Dominick, Doreena; Juahir, Hafizan; Latif, Mohd Talib; Zain, Sharifuddin M.; Aris, Ahmad Zaharin
2012-12-01
This study aims to investigate possible sources of air pollutants and the spatial patterns within the eight selected Malaysian air monitoring stations based on a two-year database (2008-2009). The multivariate analysis was applied on the dataset. It incorporated Hierarchical Agglomerative Cluster Analysis (HACA) to access the spatial patterns, Principal Component Analysis (PCA) to determine the major sources of the air pollution and Multiple Linear Regression (MLR) to assess the percentage contribution of each air pollutant. The HACA results grouped the eight monitoring stations into three different clusters, based on the characteristics of the air pollutants and meteorological parameters. The PCA analysis showed that the major sources of air pollution were emissions from motor vehicles, aircraft, industries and areas of high population density. The MLR analysis demonstrated that the main pollutant contributing to variability in the Air Pollutant Index (API) at all stations was particulate matter with a diameter of less than 10 μm (PM10). Further MLR analysis showed that the main air pollutant influencing the high concentration of PM10 was carbon monoxide (CO). This was due to combustion processes, particularly originating from motor vehicles. Meteorological factors such as ambient temperature, wind speed and humidity were also noted to influence the concentration of PM10.
Tracking Problem Solving by Multivariate Pattern Analysis and Hidden Markov Model Algorithms
Anderson, John R.
2011-01-01
Multivariate pattern analysis can be combined with hidden Markov model algorithms to track the second-by-second thinking as people solve complex problems. Two applications of this methodology are illustrated with a data set taken from children as they interacted with an intelligent tutoring system for algebra. The first “mind reading” application involves using fMRI activity to track what students are doing as they solve a sequence of algebra problems. The methodology achieves considerable accuracy at determining both what problem-solving step the students are taking and whether they are performing that step correctly. The second “model discovery” application involves using statistical model evaluation to determine how many substates are involved in performing a step of algebraic problem solving. This research indicates that different steps involve different numbers of substates and these substates are associated with different fluency in algebra problem solving. PMID:21820455
Pradhan, Debabrata; Mishra, Debaraj; Kim, Dong J; Ahn, Jong G; Chaudhury, G Roy; Lee, Seoung W
2010-03-15
Bioleaching studies were conducted to evaluate the recovery of metal values from waste petroleum catalyst using two different acidophilic microorganisms, Acidithiobacillus ferrooxidans and Acidithiobacillus thiooxidans. Various leaching parameters such as contact time, pH, oxidant concentration, pulp densities, particle size, and temperature were studied in detail. Activation energy was evaluated from Arrhenius equation and values for Ni, V and Mo were calculated in case of both the acidophiles. In both cases, the dissolution kinetics of Mo was lower than those of V and Ni. The lower dissolution kinetics may have been due to the formation of a sulfur product layer, refractoriness of MoS(2) or both. Multivariate statistical data were presented to interpret the leaching data in the present case. The significance of the leaching parameters was derived through principle component analysis and multi linear regression analyses for both iron and sulfur oxidizing bacteria. PMID:19879686
NASA Astrophysics Data System (ADS)
Lampen, Caleb Parnell
A search for evidence of supersymmetry with a three lepton and missing transverse energy signature is presented. This signature is a possible final state from the associated production of a chargino and a neutralino, two particles predicted by supersymmetry. The study is performed on 2.06 fb-1 of 7 TeV center of mass energy proton-proton collisions, recorded with the ATLAS experiment at the Large Hadron Collider in 2011. A multivariate analysis is utilized, implementing a boosted decision tree classifier using lepton pT, missing transverse energy, dilepton mass, and the razor variable R as inputs. No significant excess over the Standard Model prediction is observed, and upper limits are placed on the cross section times branching ratio of simplified models across a mass parameter space.
UV-vis absorption spectroscopy and multivariate analysis as a method to discriminate tequila
NASA Astrophysics Data System (ADS)
Barbosa-García, O.; Ramos-Ortíz, G.; Maldonado, J. L.; Pichardo-Molina, J. L.; Meneses-Nava, M. A.; Landgrave, J. E. A.; Cervantes-Martínez, J.
2007-01-01
Based on the UV-vis absorption spectra of commercially bottled tequilas, and with the aid of multivariate analysis, it is proved that different brands of white tequila can be identified from such spectra, and that 100% agave and mixed tequilas can be discriminated as well. Our study was done with 60 tequilas, 58 of them purchased at liquor stores in various Mexican cities, and two directly acquired from a distillery. All the tequilas were of the "white" type, that is, no aged spirits were considered. For the purposes of discrimination and quality control of tequilas, the spectroscopic method that we present here offers an attractive alternative to the traditional methods, like gas chromatography, which is expensive and time-consuming.
Zahavich, A.
1995-10-01
The recycling of post consumer (PCR) high density polyethylene (HDPE) blow molding resins has increased dramatically over the past 5 years. The focus of research for this product has been on specific performance and processing properties such as tensile or melt strength. Little work has been done on studying the entire range of properties as a whole, particularly in the area of multiple extrusions. This paper describes a designed experiment study where multivariate statistical techniques were used to compare 2 HDPE and 2 HDPE PCR materials, in terms of changes in a number of properties with exposure to multiple extrusions. Virgin homopolymer and copolymer resins and PCR, mixed color bottle and natural, were passed through 4 extrusion cycles. Viscosity, swell, melt strength, crystallinity, polydispersity and ESCR properties were studied using principal component analysis.
NASA Astrophysics Data System (ADS)
Everard, Colm D.; Kim, Moon S.; Lee, Hoyoung
2014-05-01
The production of contaminant free fresh fruit and vegetables is needed to reduce foodborne illnesses and related costs. Leafy greens grown in the field can be susceptible to fecal matter contamination from uncontrolled livestock and wild animals entering the field. Pathogenic bacteria can be transferred via fecal matter and several outbreaks of E.coli O157:H7 have been associated with the consumption of leafy greens. This study examines the use of hyperspectral fluorescence imaging coupled with multivariate image analysis to detect fecal contamination on Spinach leaves (Spinacia oleracea). Hyperspectral fluorescence images from 464 to 800 nm were captured; ultraviolet excitation was supplied by two LED-based line light sources at 370 nm. Key wavelengths and algorithms useful for a contaminant screening optical imaging device were identified and developed, respectively. A non-invasive screening device has the potential to reduce the harmful consequences of foodborne illnesses.
Generalova, A G; Berzhets, V M; Novikov, D K; Generalov, I I
1998-01-01
A new quantitative method for the determination of the DNA-se activity of bacteria, based on the prevention of the clot formation of DNA with rivanol under the action of microbial DNA-ses, has been developed. The sensitivity of this method is 0.005-0.01 units of activity (1-2 ng) of the enzyme in a sample for 1 hour of incubation; the variation coefficient of the method is 8-9%. The use of this method has made it possible to reduce the time of the identification of S.aureus by 2-3 days in comparison with the commonly used methods. As revealed on the model of differentiation between S.aureus and S.epidermidis, the methods of multivariate statistical analysis (dispersion, discriminant, cluster methods) may find wide application for the discrimination of bacteria. They are supposed to be used for the interspecific identification of coagulase-negative staphylococci. PMID:9700871
NASA Technical Reports Server (NTRS)
Djorgovski, Stanislav
1992-01-01
The existing and forthcoming data bases from NASA missions contain an abundance of information whose complexity cannot be efficiently tapped with simple statistical techniques. Powerful multivariate statistical methods already exist which can be used to harness much of the richness of these data. Automatic classification techniques have been developed to solve the problem of identifying known types of objects in multi parameter data sets, in addition to leading to the discovery of new physical phenomena and classes of objects. We propose an exploratory study and integration of promising techniques in the development of a general and modular classification/analysis system for very large data bases, which would enhance and optimize data management and the use of human research resources.
Wavelet aided multivariate outlier analysis to enhance defect contrast in thermal images
NASA Astrophysics Data System (ADS)
Manohar, Arun; Lanza di Scalea, Francesco
2011-04-01
A novel two-stage signal reconstruction approach is proposed to analyze raw thermal image sequences for damage detection purposes by Infrared Thermographic NDE. The first stage involves low-pass filtering using Wavelets. In the second stage, a Multivariate Outlier Analysis is performed on filtered data using a set of signal features. The proposed approach significantly enhances the defective area contrast against the background in infrared thermography NDE. The two-stage approach has some advantages in comparison to the traditionally used methods, including automation in the defect detection process and better defective area isolation through increased contrast. The method does not require a reference area to function. The results are presented for the case of a composite plate with simulated delaminations, and a composite sandwich plate with skin-core disbonds.
Molecular Isotopic Distribution Analysis (MIDAs) with adjustable mass accuracy.
Alves, Gelio; Ogurtsov, Aleksey Y; Yu, Yi-Kuo
2014-01-01
In this paper, we present Molecular Isotopic Distribution Analysis (MIDAs), a new software tool designed to compute molecular isotopic distributions with adjustable accuracies. MIDAs offers two algorithms, one polynomial-based and one Fourier-transform-based, both of which compute molecular isotopic distributions accurately and efficiently. The polynomial-based algorithm contains few novel aspects, whereas the Fourier-transform-based algorithm consists mainly of improvements to other existing Fourier-transform-based algorithms. We have benchmarked the performance of the two algorithms implemented in MIDAs with that of eight software packages (BRAIN, Emass, Mercury, Mercury5, NeutronCluster, Qmass, JFC, IC) using a consensus set of benchmark molecules. Under the proposed evaluation criteria, MIDAs's algorithms, JFC, and Emass compute with comparable accuracy the coarse-grained (low-resolution) isotopic distributions and are more accurate than the other software packages. For fine-grained isotopic distributions, we compared IC, MIDAs's polynomial algorithm, and MIDAs's Fourier transform algorithm. Among the three, IC and MIDAs's polynomial algorithm compute isotopic distributions that better resemble their corresponding exact fine-grained (high-resolution) isotopic distributions. MIDAs can be accessed freely through a user-friendly web-interface at http://www.ncbi.nlm.nih.gov/CBBresearch/Yu/midas/index.html. PMID:24254576
Molecular Isotopic Distribution Analysis (MIDAs) with Adjustable Mass Accuracy
NASA Astrophysics Data System (ADS)
Alves, Gelio; Ogurtsov, Aleksey Y.; Yu, Yi-Kuo
2014-01-01
In this paper, we present Molecular Isotopic Distribution Analysis (MIDAs), a new software tool designed to compute molecular isotopic distributions with adjustable accuracies. MIDAs offers two algorithms, one polynomial-based and one Fourier-transform-based, both of which compute molecular isotopic distributions accurately and efficiently. The polynomial-based algorithm contains few novel aspects, whereas the Fourier-transform-based algorithm consists mainly of improvements to other existing Fourier-transform-based algorithms. We have benchmarked the performance of the two algorithms implemented in MIDAs with that of eight software packages (BRAIN, Emass, Mercury, Mercury5, NeutronCluster, Qmass, JFC, IC) using a consensus set of benchmark molecules. Under the proposed evaluation criteria, MIDAs's algorithms, JFC, and Emass compute with comparable accuracy the coarse-grained (low-resolution) isotopic distributions and are more accurate than the other software packages. For fine-grained isotopic distributions, we compared IC, MIDAs's polynomial algorithm, and MIDAs's Fourier transform algorithm. Among the three, IC and MIDAs's polynomial algorithm compute isotopic distributions that better resemble their corresponding exact fine-grained (high-resolution) isotopic distributions. MIDAs can be accessed freely through a user-friendly web-interface at http://www.ncbi.nlm.nih.gov/CBBresearch/Yu/midas/index.html.