Sample records for multivariate analysis incorporating

  1. Esophageal wall dose-surface maps do not improve the predictive performance of a multivariable NTCP model for acute esophageal toxicity in advanced stage NSCLC patients treated with intensity-modulated (chemo-)radiotherapy.

    PubMed

    Dankers, Frank; Wijsman, Robin; Troost, Esther G C; Monshouwer, René; Bussink, Johan; Hoffmann, Aswin L

    2017-05-07

    In our previous work, a multivariable normal-tissue complication probability (NTCP) model for acute esophageal toxicity (AET) Grade  ⩾2 after highly conformal (chemo-)radiotherapy for non-small cell lung cancer (NSCLC) was developed using multivariable logistic regression analysis incorporating clinical parameters and mean esophageal dose (MED). Since the esophagus is a tubular organ, spatial information of the esophageal wall dose distribution may be important in predicting AET. We investigated whether the incorporation of esophageal wall dose-surface data with spatial information improves the predictive power of our established NTCP model. For 149 NSCLC patients treated with highly conformal radiation therapy esophageal wall dose-surface histograms (DSHs) and polar dose-surface maps (DSMs) were generated. DSMs were used to generate new DSHs and dose-length-histograms that incorporate spatial information of the dose-surface distribution. From these histograms dose parameters were derived and univariate logistic regression analysis showed that they correlated significantly with AET. Following our previous work, new multivariable NTCP models were developed using the most significant dose histogram parameters based on univariate analysis (19 in total). However, the 19 new models incorporating esophageal wall dose-surface data with spatial information did not show improved predictive performance (area under the curve, AUC range 0.79-0.84) over the established multivariable NTCP model based on conventional dose-volume data (AUC  =  0.84). For prediction of AET, based on the proposed multivariable statistical approach, spatial information of the esophageal wall dose distribution is of no added value and it is sufficient to only consider MED as a predictive dosimetric parameter.
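
    The comparison above rests on two quantities: a logistic NTCP curve driven by MED, and the AUC used to score it. The sketch below illustrates both. It is not the authors' model: the coefficients, the six-patient data set, and the function names are all hypothetical, and the clinical parameters of the published model are omitted.

```python
import math

def ntcp_logistic(med_gy, intercept, slope):
    """Logistic NTCP: probability of toxicity given mean esophageal dose in Gy."""
    return 1.0 / (1.0 + math.exp(-(intercept + slope * med_gy)))

def auc(scores, labels):
    """Area under the ROC curve by counting correctly ordered (event, non-event) pairs."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical cohort: mean esophageal dose (Gy) and observed Grade >= 2 toxicity
med = [10.0, 15.0, 22.0, 28.0, 35.0, 40.0]
toxicity = [0, 0, 1, 0, 1, 1]

# Hypothetical coefficients, chosen only for illustration
probs = [ntcp_logistic(d, intercept=-4.0, slope=0.15) for d in med]
print(round(auc(probs, toxicity), 3))
```

    Because the AUC depends only on how patients are ranked, any monotone function of MED gives the same value here. Added dose parameters can raise the AUC only if they change that ranking.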

  2. Esophageal wall dose-surface maps do not improve the predictive performance of a multivariable NTCP model for acute esophageal toxicity in advanced stage NSCLC patients treated with intensity-modulated (chemo-)radiotherapy

    NASA Astrophysics Data System (ADS)

    Dankers, Frank; Wijsman, Robin; Troost, Esther G. C.; Monshouwer, René; Bussink, Johan; Hoffmann, Aswin L.

    2017-05-01

    In our previous work, a multivariable normal-tissue complication probability (NTCP) model for acute esophageal toxicity (AET) Grade  ⩾2 after highly conformal (chemo-)radiotherapy for non-small cell lung cancer (NSCLC) was developed using multivariable logistic regression analysis incorporating clinical parameters and mean esophageal dose (MED). Since the esophagus is a tubular organ, spatial information of the esophageal wall dose distribution may be important in predicting AET. We investigated whether the incorporation of esophageal wall dose-surface data with spatial information improves the predictive power of our established NTCP model. For 149 NSCLC patients treated with highly conformal radiation therapy esophageal wall dose-surface histograms (DSHs) and polar dose-surface maps (DSMs) were generated. DSMs were used to generate new DSHs and dose-length-histograms that incorporate spatial information of the dose-surface distribution. From these histograms dose parameters were derived and univariate logistic regression analysis showed that they correlated significantly with AET. Following our previous work, new multivariable NTCP models were developed using the most significant dose histogram parameters based on univariate analysis (19 in total). However, the 19 new models incorporating esophageal wall dose-surface data with spatial information did not show improved predictive performance (area under the curve, AUC range 0.79-0.84) over the established multivariable NTCP model based on conventional dose-volume data (AUC  =  0.84). For prediction of AET, based on the proposed multivariable statistical approach, spatial information of the esophageal wall dose distribution is of no added value and it is sufficient to only consider MED as a predictive dosimetric parameter.

  3. Multivariate optimum interpolation of surface pressure and surface wind over oceans

    NASA Technical Reports Server (NTRS)

    Bloom, S. C.; Baker, W. E.; Nestler, M. S.

    1984-01-01

    The present multivariate analysis method for surface pressure and winds incorporates ship wind observations into the analysis of surface pressure. For the specific case of 0000 GMT on February 3, 1979, the additional data resulted in a global rms difference of 0.6 mb; individual maxima as large as 5 mb occurred over the North Atlantic and East Pacific Oceans. These differences are noted to be smaller than the analysis increments to the first-guess fields.

  4. Classical least squares multivariate spectral analysis

    DOEpatents

    Haaland, David M.

    2002-01-01

    An improved classical least squares multivariate spectral analysis method that adds spectral shapes describing non-calibrated components and system effects (other than baseline corrections) present in the analyzed mixture to the prediction phase of the method. These improvements decrease or eliminate many of the restrictions to the CLS-type methods and greatly extend their capabilities, accuracy, and precision. One new application of PACLS includes the ability to accurately predict unknown sample concentrations when new unmodeled spectral components are present in the unknown samples. Other applications of PACLS include the incorporation of spectrometer drift into the quantitative multivariate model and the maintenance of a calibration on a drifting spectrometer. Finally, the ability of PACLS to transfer a multivariate model between spectrometers is demonstrated.
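
    The idea of adding a spectral shape at the prediction step can be sketched with a small least-squares fit. This is an illustration under stated assumptions, not the patented algorithm: the five-channel spectra, the interferent shape, and the helper names `solve` and `cls_predict` are all hypothetical.

```python
def solve(a, b):
    """Solve the square system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x

def cls_predict(spectrum, shapes):
    """Coefficients c minimizing ||spectrum - sum_k c[k] * shapes[k]|| (normal equations)."""
    gram = [[sum(p * q for p, q in zip(s1, s2)) for s2 in shapes] for s1 in shapes]
    rhs = [sum(p * q for p, q in zip(s, spectrum)) for s in shapes]
    return solve(gram, rhs)

# Hypothetical spectral shapes on five channels
analyte = [0.0, 1.0, 2.0, 1.0, 0.0]      # calibrated component
interferent = [0.0, 0.0, 1.0, 2.0, 1.0]  # component absent from the calibration

# Unknown sample: 0.5 * analyte + 0.3 * interferent
unknown = [0.5 * a + 0.3 * i for a, i in zip(analyte, interferent)]

plain = cls_predict(unknown, [analyte])                   # interferent ignored
augmented = cls_predict(unknown, [analyte, interferent])  # shape added at prediction
print(round(plain[0], 3), [round(c, 3) for c in augmented])
```

    In this toy case the plain fit overestimates the analyte coefficient because the overlapping interferent is absorbed into it. Adding the interferent shape at prediction recovers the true coefficients.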

  5. Multivariate interactive digital analysis system /MIDAS/ - A new fast multispectral recognition system

    NASA Technical Reports Server (NTRS)

    Kriegler, F.; Marshall, R.; Lampert, S.; Gordon, M.; Cornell, C.; Kistler, R.

    1973-01-01

    The MIDAS system is a prototype, multiple-pipeline digital processor mechanizing the multivariate-Gaussian, maximum-likelihood decision algorithm operating at 200,000 pixels/second. It incorporates displays and film printer equipment under control of a general purpose midi-computer and possesses sufficient flexibility that operational versions of the equipment may be subsequently specified as subsets of the system.
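
    The multivariate-Gaussian, maximum-likelihood decision rule that MIDAS implements in hardware can be sketched in software for two spectral bands. The class names, means, and covariances below are hypothetical, and the sketch makes no attempt to reflect the pipeline architecture.

```python
import math

def log_likelihood(x, mean, cov):
    """Log density of a 2-D Gaussian at x; cov is a 2x2 nested list."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    dx, dy = x[0] - mean[0], x[1] - mean[1]
    # Mahalanobis term from the closed-form inverse of a 2x2 matrix
    maha = (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det
    return -0.5 * (maha + math.log(det) + 2.0 * math.log(2.0 * math.pi))

def classify(pixel, classes):
    """Return the class name whose Gaussian gives the pixel the highest likelihood."""
    return max(classes, key=lambda name: log_likelihood(pixel, *classes[name]))

# Hypothetical training statistics per class: (mean vector, covariance matrix)
classes = {
    "water": ((20.0, 10.0), [[4.0, 1.0], [1.0, 3.0]]),
    "vegetation": ((40.0, 60.0), [[9.0, 2.0], [2.0, 16.0]]),
}

print(classify((22.0, 12.0), classes))
print(classify((38.0, 55.0), classes))
```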

  6. Dynamic GSCA (Generalized Structured Component Analysis) with Applications to the Analysis of Effective Connectivity in Functional Neuroimaging Data

    ERIC Educational Resources Information Center

    Jung, Kwanghee; Takane, Yoshio; Hwang, Heungsun; Woodward, Todd S.

    2012-01-01

    We propose a new method of structural equation modeling (SEM) for longitudinal and time series data, named Dynamic GSCA (Generalized Structured Component Analysis). The proposed method extends the original GSCA by incorporating a multivariate autoregressive model to account for the dynamic nature of data taken over time. Dynamic GSCA also…

  7. Bayesian multivariate hierarchical transformation models for ROC analysis.

    PubMed

    O'Malley, A James; Zou, Kelly H

    2006-02-15

    A Bayesian multivariate hierarchical transformation model (BMHTM) is developed for receiver operating characteristic (ROC) curve analysis based on clustered continuous diagnostic outcome data with covariates. Two special features of this model are that it incorporates non-linear monotone transformations of the outcomes and that multiple correlated outcomes may be analysed. The mean, variance, and transformation components are all modelled parametrically, enabling a wide range of inferences. The general framework is illustrated by focusing on two problems: (1) analysis of the diagnostic accuracy of a covariate-dependent univariate test outcome requiring a Box-Cox transformation within each cluster to map the test outcomes to a common family of distributions; (2) development of an optimal composite diagnostic test using multivariate clustered outcome data. In the second problem, the composite test is estimated using discriminant function analysis and compared to the test derived from logistic regression analysis where the gold standard is a binary outcome. The proposed methodology is illustrated on prostate cancer biopsy data from a multi-centre clinical trial.
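
    The Box-Cox transformation mentioned in problem (1) has a simple closed form, sketched below. The power parameter and the outcome values are hypothetical. In the BMHTM the transformation parameter is modelled within each cluster, not fixed by hand as it is here.

```python
import math

def box_cox(x, lam):
    """Box-Cox transform of a positive value x with power parameter lam."""
    if x <= 0:
        raise ValueError("Box-Cox requires x > 0")
    if lam == 0:
        return math.log(x)  # limiting case as lam approaches 0
    return (x ** lam - 1.0) / lam

# Hypothetical right-skewed test outcomes from one cluster
outcomes = [1.0, 4.0, 9.0, 16.0]
transformed = [box_cox(v, 0.5) for v in outcomes]
print(transformed)
```

    With a power of 0.5 the unevenly spaced outcomes become evenly spaced, which is the kind of rescaling that maps test outcomes towards a common family of distributions.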

  8. Bayesian multivariate hierarchical transformation models for ROC analysis

    PubMed Central

    O'Malley, A. James; Zou, Kelly H.

    2006-01-01

    SUMMARY A Bayesian multivariate hierarchical transformation model (BMHTM) is developed for receiver operating characteristic (ROC) curve analysis based on clustered continuous diagnostic outcome data with covariates. Two special features of this model are that it incorporates non-linear monotone transformations of the outcomes and that multiple correlated outcomes may be analysed. The mean, variance, and transformation components are all modelled parametrically, enabling a wide range of inferences. The general framework is illustrated by focusing on two problems: (1) analysis of the diagnostic accuracy of a covariate-dependent univariate test outcome requiring a Box–Cox transformation within each cluster to map the test outcomes to a common family of distributions; (2) development of an optimal composite diagnostic test using multivariate clustered outcome data. In the second problem, the composite test is estimated using discriminant function analysis and compared to the test derived from logistic regression analysis where the gold standard is a binary outcome. The proposed methodology is illustrated on prostate cancer biopsy data from a multi-centre clinical trial. PMID:16217836

  9. A Multitaper, Causal Decomposition for Stochastic, Multivariate Time Series: Application to High-Frequency Calcium Imaging Data.

    PubMed

    Sornborger, Andrew T; Lauderdale, James D

    2016-11-01

    Neural data analysis has increasingly incorporated causal information to study circuit connectivity. Dimensional reduction forms the basis of most analyses of large multivariate time series. Here, we present a new, multitaper-based decomposition for stochastic, multivariate time series that acts on the covariance of the time series at all lags, C ( τ ), as opposed to standard methods that decompose the time series, X ( t ), using only information at zero-lag. In both simulated and neural imaging examples, we demonstrate that methods that neglect the full causal structure may be discarding important dynamical information in a time series.
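
    The object the decomposition acts on, the covariance at lag τ, can be computed directly. The sketch below shows only that quantity and why zero-lag information can miss structure. It does not implement the multitaper decomposition, and the two-channel signal is hypothetical.

```python
def lagged_covariance(x, tau):
    """C(tau)[i][j] = average over t of (x_i(t + tau) - mean_i) * (x_j(t) - mean_j).

    x is a list of channels, each a list of T samples; tau >= 0.
    """
    t_len = len(x[0])
    means = [sum(ch) / t_len for ch in x]
    n = t_len - tau  # number of overlapping samples at this lag
    return [
        [
            sum((x[i][t + tau] - means[i]) * (x[j][t] - means[j]) for t in range(n)) / n
            for j in range(len(x))
        ]
        for i in range(len(x))
    ]

# Hypothetical signal: channel 0 leads channel 1 by one sample
base = [1.0, 1.0, -1.0, -1.0, 1.0, 1.0, -1.0, -1.0, 1.0]
x0 = base[1:]
x1 = base[:-1]  # x1(t) equals x0(t - 1)

c0 = lagged_covariance([x0, x1], 0)
c1 = lagged_covariance([x0, x1], 1)
print(c0[1][0], c1[1][0])
```

    Here the two channels are uncorrelated at zero lag but perfectly correlated at lag one, so a method restricted to zero-lag covariance would miss the coupling entirely.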

  10. Evaluation of the microscopic distribution of florfenicol in feed pellets for salmon by Fourier Transform infrared imaging and multivariate analysis.

    PubMed

    Bastidas, Camila Y; von Plessing, Carlos; Troncoso, José; Del P Castillo, Rosario

    2018-04-15

    Fourier Transform infrared imaging and multivariate analysis were used to identify, at the microscopic level, the presence of florfenicol (FF), a heavily used antibiotic in the salmon industry, supplied to fish in feed pellets for the treatment of salmonid rickettsial septicemia (SRS). The FF distribution was evaluated using Principal Component Analysis (PCA) and Augmented Multivariate Curve Resolution with Alternating Least Squares (augmented MCR-ALS) on the spectra obtained from images with pixel sizes of 6.25 μm × 6.25 μm and 1.56 μm × 1.56 μm, in different zones of feed pellets. Since the concentration of the drug was 3.44 mg FF/g pellet, this is the first report showing the powerful ability of spectroscopic techniques and multivariate analysis, especially augmented MCR-ALS, to describe the FF distribution in both the surface and inner parts of feed pellets at low concentration, in a complex matrix and at the microscopic level. The results allow monitoring of the incorporation of the drug into the feed pellets. Copyright © 2018 Elsevier B.V. All rights reserved.

  11. Longitudinal assessment of treatment effects on pulmonary ventilation using 1H/3He MRI multivariate templates

    NASA Astrophysics Data System (ADS)

    Tustison, Nicholas J.; Contrella, Benjamin; Altes, Talissa A.; Avants, Brian B.; de Lange, Eduard E.; Mugler, John P.

    2013-03-01

    The utility of pulmonary functional imaging techniques, such as hyperpolarized 3He MRI, has encouraged their inclusion in research studies for longitudinal assessment of disease progression and the study of treatment effects. We present methodology for performing voxelwise statistical analysis of ventilation maps derived from hyperpolarized 3He MRI which incorporates multivariate template construction using simultaneous acquisition of 1H and 3He images. Additional processing steps include intensity normalization, bias correction, 4-D longitudinal segmentation, and generation of expected ventilation maps prior to voxelwise regression analysis. Analysis is demonstrated on a cohort of eight individuals with diagnosed cystic fibrosis (CF) undergoing treatment imaged five times every two weeks with a prescribed treatment schedule.

  12. Multivariate Formation Pressure Prediction with Seismic-derived Petrophysical Properties from Prestack AVO inversion and Poststack Seismic Motion Inversion

    NASA Astrophysics Data System (ADS)

    Yu, H.; Gu, H.

    2017-12-01

    A novel multivariate seismic formation pressure prediction methodology is presented, which incorporates high-resolution seismic velocity data from prestack AVO inversion, and petrophysical data (porosity and shale volume) derived from poststack seismic motion inversion. In contrast to traditional seismic formation prediction methods, the proposed methodology is based on a multivariate pressure prediction model and utilizes a trace-by-trace multivariate regression analysis on seismic-derived petrophysical properties to calibrate model parameters in order to make accurate predictions with higher resolution in both vertical and lateral directions. With prestack time migration velocity as initial velocity model, an AVO inversion was first applied to prestack dataset to obtain high-resolution seismic velocity with higher frequency that is to be used as the velocity input for seismic pressure prediction, and the density dataset to calculate accurate Overburden Pressure (OBP). Seismic Motion Inversion (SMI) is an inversion technique based on Markov Chain Monte Carlo simulation. Both structural variability and similarity of seismic waveform are used to incorporate well log data to characterize the variability of the property to be obtained. In this research, porosity and shale volume are first interpreted on well logs, and then combined with poststack seismic data using SMI to build porosity and shale volume datasets for seismic pressure prediction. A multivariate effective stress model is used to convert velocity, porosity and shale volume datasets to effective stress. After a thorough study of the regional stratigraphic and sedimentary characteristics, a regional normally compacted interval model is built, and then the coefficients in the multivariate prediction model are determined in a trace-by-trace multivariate regression analysis on the petrophysical data. 
    The coefficients are used to convert velocity, porosity and shale volume datasets to effective stress and then to calculate formation pressure with OBP. Application of the proposed methodology to a research area in the East China Sea has shown that the method can bridge the gap between seismic and well log pressure prediction and give predicted pressure values close to pressure measurements from well testing.

  13. Biostatistics Series Module 10: Brief Overview of Multivariate Methods.

    PubMed

    Hazra, Avijit; Gogtay, Nithya

    2017-01-01

    Multivariate analysis refers to statistical techniques that simultaneously look at three or more variables in relation to the subjects under investigation with the aim of identifying or clarifying the relationships between them. These techniques have been broadly classified as dependence techniques, which explore the relationship between one or more dependent variables and their independent predictors, and interdependence techniques, that make no such distinction but treat all variables equally in a search for underlying relationships. Multiple linear regression models a situation where a single numerical dependent variable is to be predicted from multiple numerical independent variables. Logistic regression is used when the outcome variable is dichotomous in nature. The log-linear technique models count type of data and can be used to analyze cross-tabulations where more than two variables are included. Analysis of covariance is an extension of analysis of variance (ANOVA), in which an additional independent variable of interest, the covariate, is brought into the analysis. It tries to examine whether a difference persists after "controlling" for the effect of the covariate that can impact the numerical dependent variable of interest. Multivariate analysis of variance (MANOVA) is a multivariate extension of ANOVA used when multiple numerical dependent variables have to be incorporated in the analysis. Interdependence techniques are more commonly applied to psychometrics, social sciences and market research. Exploratory factor analysis and principal component analysis are related techniques that seek to extract from a larger number of metric variables, a smaller number of composite factors or components, which are linearly related to the original variables. Cluster analysis aims to identify, in a large number of cases, relatively homogeneous groups called clusters, without prior information about the groups. 
    The calculation-intensive nature of multivariate analysis has so far precluded most researchers from using these techniques routinely. The situation is now changing with the wider availability and increasing sophistication of statistical software, and researchers should no longer shy away from exploring the applications of multivariate methods to real-life data sets.

  14. Transforming growth factor-β and toll-like receptor-4 polymorphisms are not associated with fibrosis in haemochromatosis

    PubMed Central

    Wood, Marnie J; Powell, Lawrie W; Dixon, Jeannette L; Subramaniam, V Nathan; Ramm, Grant A

    2013-01-01

    AIM: To investigate the role of genetic polymorphisms in the progression of hepatic fibrosis in hereditary haemochromatosis. METHODS: A cohort of 245 well-characterised C282Y homozygous patients with haemochromatosis was studied, with all subjects having liver biopsy data and DNA available for testing. This study assessed the association of eight single nucleotide polymorphisms (SNPs) in a total of six genes including toll-like receptor 4 (TLR4), transforming growth factor-beta (TGF-β), oxoguanine DNA glycosylase, monocyte chemoattractant protein 1, chemokine C-C motif receptor 2 and interleukin-10 with liver disease severity. Genotyping was performed using high resolution melt analysis and sequencing. The results were analysed in relation to the stage of hepatic fibrosis in multivariate analysis incorporating other cofactors including alcohol consumption and hepatic iron concentration. RESULTS: There were significant associations between the cofactors of male gender (P = 0.0001), increasing age (P = 0.006), alcohol consumption (P = 0.0001), steatosis (P = 0.03), hepatic iron concentration (P < 0.0001) and the presence of hepatic fibrosis. Of the candidate gene polymorphisms studied, none showed a significant association with hepatic fibrosis in univariate or multivariate analysis incorporating cofactors. We also specifically studied patients with hepatic iron loading above threshold levels for cirrhosis and compared the genetic polymorphisms between those with no fibrosis vs cirrhosis however there was no significant effect from any of the candidate genes studied. Importantly, in this large, well characterised cohort of patients there was no association between SNPs for TGF-β or TLR4 and the presence of fibrosis, cirrhosis or increasing fibrosis stage in multivariate analysis. 
CONCLUSION: In our large, well characterised group of haemochromatosis subjects we did not demonstrate any relationship between candidate gene polymorphisms and hepatic fibrosis or cirrhosis. PMID:24409064

  15. Transforming growth factor-β and toll-like receptor-4 polymorphisms are not associated with fibrosis in haemochromatosis.

    PubMed

    Wood, Marnie J; Powell, Lawrie W; Dixon, Jeannette L; Subramaniam, V Nathan; Ramm, Grant A

    2013-12-28

    To investigate the role of genetic polymorphisms in the progression of hepatic fibrosis in hereditary haemochromatosis. A cohort of 245 well-characterised C282Y homozygous patients with haemochromatosis was studied, with all subjects having liver biopsy data and DNA available for testing. This study assessed the association of eight single nucleotide polymorphisms (SNPs) in a total of six genes including toll-like receptor 4 (TLR4), transforming growth factor-beta (TGF-β), oxoguanine DNA glycosylase, monocyte chemoattractant protein 1, chemokine C-C motif receptor 2 and interleukin-10 with liver disease severity. Genotyping was performed using high resolution melt analysis and sequencing. The results were analysed in relation to the stage of hepatic fibrosis in multivariate analysis incorporating other cofactors including alcohol consumption and hepatic iron concentration. There were significant associations between the cofactors of male gender (P = 0.0001), increasing age (P = 0.006), alcohol consumption (P = 0.0001), steatosis (P = 0.03), hepatic iron concentration (P < 0.0001) and the presence of hepatic fibrosis. Of the candidate gene polymorphisms studied, none showed a significant association with hepatic fibrosis in univariate or multivariate analysis incorporating cofactors. We also specifically studied patients with hepatic iron loading above threshold levels for cirrhosis and compared the genetic polymorphisms between those with no fibrosis vs cirrhosis however there was no significant effect from any of the candidate genes studied. Importantly, in this large, well characterised cohort of patients there was no association between SNPs for TGF-β or TLR4 and the presence of fibrosis, cirrhosis or increasing fibrosis stage in multivariate analysis. In our large, well characterised group of haemochromatosis subjects we did not demonstrate any relationship between candidate gene polymorphisms and hepatic fibrosis or cirrhosis.

  16. A CLIPS expert system for clinical flow cytometry data analysis

    NASA Technical Reports Server (NTRS)

    Salzman, G. C.; Duque, R. E.; Braylan, R. C.; Stewart, C. C.

    1990-01-01

    An expert system is being developed using CLIPS to assist clinicians in the analysis of multivariate flow cytometry data from cancer patients. Cluster analysis is used to find subpopulations representing various cell types in multiple datasets each consisting of four to five measurements on each of 5000 cells. CLIPS facts are derived from results of the clustering. CLIPS rules are based on the expertise of Drs. Stewart, Duque, and Braylan. The rules incorporate certainty factors based on case histories.

  17. Early functional MRI activation predicts motor outcome after ischemic stroke: a longitudinal, multimodal study.

    PubMed

    Du, Juan; Yang, Fang; Zhang, Zhiqiang; Hu, Jingze; Xu, Qiang; Hu, Jianping; Zeng, Fanyong; Lu, Guangming; Liu, Xinfeng

    2018-05-15

    An accurate prediction of long term outcome after stroke is urgently required to provide early individualized neurorehabilitation. This study aimed to examine the added value of early neuroimaging measures and identify the best approaches for predicting motor outcome after stroke. This prospective study involved 34 first-ever ischemic stroke patients (time since stroke: 1-14 days) with upper limb impairment. All patients underwent baseline multimodal assessments that included clinical (age, motor impairment), neurophysiological (motor-evoked potentials, MEP) and neuroimaging (diffusion tensor imaging and motor task-based fMRI) measures, and also underwent reassessment 3 months after stroke. Bivariate analysis and multivariate linear regression models were used to predict the motor scores (Fugl-Meyer assessment, FMA) at 3 months post-stroke. With bivariate analysis, better motor outcome significantly correlated with (1) less initial motor impairment and disability, (2) less corticospinal tract injury, (3) the initial presence of MEPs, (4) stronger baseline motor fMRI activations. In multivariate analysis, incorporating neuroimaging data improved the predictive accuracy relative to only clinical and neurophysiological assessments. Baseline fMRI activation in SMA was an independent predictor of motor outcome after stroke. A multimodal model incorporating fMRI and clinical measures best predicted the motor outcome following stroke. fMRI measures obtained early after stroke provided independent prediction of long-term motor outcome.

  18. A Dynamic Intrusion Detection System Based on Multivariate Hotelling's T2 Statistics Approach for Network Environments

    PubMed Central

    Avalappampatty Sivasamy, Aneetha; Sundan, Bose

    2015-01-01

    The ever expanding communication requirements in today's world demand extensive and efficient network systems with equally efficient and reliable security features integrated for safe, confident, and secured communication and data transfer. Providing effective security protocols for any network environment, therefore, assumes paramount importance. Attempts are made continuously for designing more efficient and dynamic network intrusion detection models. In this work, an approach based on Hotelling's T2 method, a multivariate statistical analysis technique, has been employed for intrusion detection, especially in network environments. Components such as preprocessing, multivariate statistical analysis, and attack detection have been incorporated in developing the multivariate Hotelling's T2 statistical model and necessary profiles have been generated based on the T-square distance metrics. With a threshold range obtained using the central limit theorem, observed traffic profiles have been classified either as normal or attack types. Performance of the model, as evaluated through validation and testing using KDD Cup'99 dataset, has shown very high detection rates for all classes with low false alarm rates. Accuracy of the model presented in this work, in comparison with the existing models, has been found to be much better. PMID:26357668
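
    The T-square distance at the core of this approach is the squared Mahalanobis distance of an observation from a profile of normal traffic. The sketch below uses two features. The traffic values, the feature choice, and the fixed threshold are hypothetical and stand in for the paper's preprocessing and its threshold range derived from the central limit theorem.

```python
def mean_and_cov(rows):
    """Sample mean and covariance (sxx, sxy, syy) of 2-feature observations."""
    n = len(rows)
    mx = sum(r[0] for r in rows) / n
    my = sum(r[1] for r in rows) / n
    sxx = sum((r[0] - mx) ** 2 for r in rows) / (n - 1)
    syy = sum((r[1] - my) ** 2 for r in rows) / (n - 1)
    sxy = sum((r[0] - mx) * (r[1] - my) for r in rows) / (n - 1)
    return (mx, my), (sxx, sxy, syy)

def t_squared(x, mean, cov):
    """T-square distance (x - mean)^T S^-1 (x - mean) for two features."""
    sxx, sxy, syy = cov
    det = sxx * syy - sxy * sxy
    dx, dy = x[0] - mean[0], x[1] - mean[1]
    return (syy * dx * dx - 2.0 * sxy * dx * dy + sxx * dy * dy) / det

# Hypothetical normal-traffic profile: (connections per second, mean packet size)
normal = [(10.0, 100.0), (12.0, 104.0), (11.0, 109.0), (9.0, 97.0), (13.0, 110.0)]
mean, cov = mean_and_cov(normal)

THRESHOLD = 10.0  # hypothetical cut-off, not the paper's threshold range

for obs in [(12.0, 106.0), (20.0, 100.0)]:
    score = t_squared(obs, mean, cov)
    print(obs, round(score, 2), "attack" if score > THRESHOLD else "normal")
```

    The second observation is flagged even though its packet size is typical, because its combination of features is far from the profile once the correlation between them is taken into account.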

  19. A Dynamic Intrusion Detection System Based on Multivariate Hotelling's T2 Statistics Approach for Network Environments.

    PubMed

    Sivasamy, Aneetha Avalappampatty; Sundan, Bose

    2015-01-01

    The ever expanding communication requirements in today's world demand extensive and efficient network systems with equally efficient and reliable security features integrated for safe, confident, and secured communication and data transfer. Providing effective security protocols for any network environment, therefore, assumes paramount importance. Attempts are made continuously for designing more efficient and dynamic network intrusion detection models. In this work, an approach based on Hotelling's T(2) method, a multivariate statistical analysis technique, has been employed for intrusion detection, especially in network environments. Components such as preprocessing, multivariate statistical analysis, and attack detection have been incorporated in developing the multivariate Hotelling's T(2) statistical model and necessary profiles have been generated based on the T-square distance metrics. With a threshold range obtained using the central limit theorem, observed traffic profiles have been classified either as normal or attack types. Performance of the model, as evaluated through validation and testing using KDD Cup'99 dataset, has shown very high detection rates for all classes with low false alarm rates. Accuracy of the model presented in this work, in comparison with the existing models, has been found to be much better.

  20. Gas-water two-phase flow characterization with Electrical Resistance Tomography and Multivariate Multiscale Entropy analysis.

    PubMed

    Tan, Chao; Zhao, Jia; Dong, Feng

    2015-03-01

    Flow behavior characterization is important to understand gas-liquid two-phase flow mechanics and further establish its description model. Electrical Resistance Tomography (ERT) provides information regarding flow conditions in the different directions where the sensing electrodes are installed. We extracted the multivariate sample entropy (MSampEn) by treating ERT data as a multivariate time series. The dynamic experimental results indicate that the MSampEn is sensitive to complexity change of flow patterns including bubbly flow, stratified flow, plug flow and slug flow. MSampEn can characterize the flow behavior in different directions of the two-phase flow, and reveal the transition between flow patterns when flow velocity changes. The proposed method is effective for analyzing two-phase flow pattern transition by incorporating information of different scales and different spatial directions. Copyright © 2014 ISA. Published by Elsevier Ltd. All rights reserved.

  1. Discrimination of inflammatory bowel disease using Raman spectroscopy and linear discriminant analysis methods

    NASA Astrophysics Data System (ADS)

    Ding, Hao; Cao, Ming; DuPont, Andrew W.; Scott, Larry D.; Guha, Sushovan; Singhal, Shashideep; Younes, Mamoun; Pence, Isaac; Herline, Alan; Schwartz, David; Xu, Hua; Mahadevan-Jansen, Anita; Bi, Xiaohong

    2016-03-01

    Inflammatory bowel disease (IBD) is an idiopathic disease that is typically characterized by chronic inflammation of the gastrointestinal tract. Recently much effort has been devoted to the development of novel diagnostic tools that can assist physicians for fast, accurate, and automated diagnosis of the disease. Previous research based on Raman spectroscopy has shown promising results in differentiating IBD patients from normal screening cases. In the current study, we examined IBD patients in vivo through a colonoscope-coupled Raman system. Optical diagnosis for IBD discrimination was conducted based on full-range spectra using multivariate statistical methods. Further, we incorporated several feature selection methods in machine learning into the classification model. The diagnostic performance for disease differentiation was significantly improved after feature selection. Our results showed that improved IBD diagnosis can be achieved using Raman spectroscopy in combination with multivariate analysis and feature selection.

  2. Order-restricted inference for multivariate longitudinal data with applications to the natural history of hearing loss.

    PubMed

    Rosen, Sophia; Davidov, Ori

    2012-07-20

    Multivariate outcomes are often measured longitudinally. For example, in hearing loss studies, hearing thresholds for each subject are measured repeatedly over time at several frequencies. Thus, each patient is associated with a multivariate longitudinal outcome. The multivariate mixed-effects model is a useful tool for the analysis of such data. There are situations in which the parameters of the model are subject to some restrictions or constraints. For example, it is known that hearing thresholds, at every frequency, increase with age. Moreover, this age-related threshold elevation is monotone in frequency, that is, the higher the frequency, the higher, on average, is the rate of threshold elevation. This means that there is a natural ordering among the different frequencies in the rate of hearing loss. In practice, this amounts to imposing a set of constraints on the different frequencies' regression coefficients modeling the mean effect of time and age at entry to the study on hearing thresholds. The aforementioned constraints should be accounted for in the analysis. The result is a multivariate longitudinal model with restricted parameters. We propose estimation and testing procedures for such models. We show that ignoring the constraints may lead to misleading inferences regarding the direction and the magnitude of various effects. Moreover, simulations show that incorporating the constraints substantially improves the mean squared error of the estimates and the power of the tests. We used this methodology to analyze a real hearing loss study. Copyright © 2012 John Wiley & Sons, Ltd.

  3. Principal Angle Enrichment Analysis (PAEA): Dimensionally Reduced Multivariate Gene Set Enrichment Analysis Tool

    PubMed Central

    Clark, Neil R.; Szymkiewicz, Maciej; Wang, Zichen; Monteiro, Caroline D.; Jones, Matthew R.; Ma’ayan, Avi

    2016-01-01

    Gene set analysis of differential expression, which identifies collectively differentially expressed gene sets, has become an important tool for biology. The power of this approach lies in its reduction of the dimensionality of the statistical problem and its incorporation of biological interpretation by construction. Many approaches to gene set analysis have been proposed, but benchmarking their performance in the setting of real biological data is difficult due to the lack of a gold standard. In a previously published work we proposed a geometrical approach to differential expression which performed highly in benchmarking tests and compared well to the most popular methods of differential gene expression. As reported, this approach has a natural extension to gene set analysis which we call Principal Angle Enrichment Analysis (PAEA). PAEA employs dimensionality reduction and a multivariate approach for gene set enrichment analysis. However, the performance of this method has not been assessed nor its implementation as a web-based tool. Here we describe new benchmarking protocols for gene set analysis methods and find that PAEA performs highly. The PAEA method is implemented as a user-friendly web-based tool, which contains 70 gene set libraries and is freely available to the community. PMID:26848405

  4. Principal Angle Enrichment Analysis (PAEA): Dimensionally Reduced Multivariate Gene Set Enrichment Analysis Tool.

    PubMed

    Clark, Neil R; Szymkiewicz, Maciej; Wang, Zichen; Monteiro, Caroline D; Jones, Matthew R; Ma'ayan, Avi

    2015-11-01

    Gene set analysis of differential expression, which identifies collectively differentially expressed gene sets, has become an important tool for biology. The power of this approach lies in its reduction of the dimensionality of the statistical problem and its incorporation of biological interpretation by construction. Many approaches to gene set analysis have been proposed, but benchmarking their performance in the setting of real biological data is difficult due to the lack of a gold standard. In a previously published work we proposed a geometrical approach to differential expression which performed highly in benchmarking tests and compared well to the most popular methods of differential gene expression. As reported, this approach has a natural extension to gene set analysis which we call Principal Angle Enrichment Analysis (PAEA). PAEA employs dimensionality reduction and a multivariate approach for gene set enrichment analysis. However, the performance of this method has not been assessed nor its implementation as a web-based tool. Here we describe new benchmarking protocols for gene set analysis methods and find that PAEA performs highly. The PAEA method is implemented as a user-friendly web-based tool, which contains 70 gene set libraries and is freely available to the community.

  5. Application of kernel principal component analysis and computational machine learning to exploration of metabolites strongly associated with diet.

    PubMed

    Shiokawa, Yuka; Date, Yasuhiro; Kikuchi, Jun

    2018-02-21

    Computer-based technological innovation provides advancements in sophisticated and diverse analytical instruments, enabling massive amounts of data collection with relative ease. This is accompanied by a fast-growing demand for technological progress in data mining methods for analysis of big data derived from chemical and biological systems. From this perspective, use of a general "linear" multivariate analysis alone limits interpretations due to "non-linear" variations in metabolic data from living organisms. Here we describe a kernel principal component analysis (KPCA)-incorporated analytical approach for extracting useful information from metabolic profiling data. To overcome the limitation of important variable (metabolite) determinations, we incorporated a random forest conditional variable importance measure into our KPCA-based analytical approach to demonstrate the relative importance of metabolites. Using a market basket analysis, hippurate, the most important variable detected in the importance measure, was associated with high levels of some vitamins and minerals present in foods eaten the previous day, suggesting a relationship between increased hippurate and intake of a wide variety of vegetables and fruits. Therefore, the KPCA-incorporated analytical approach described herein enabled us to capture input-output responses, and should be useful not only for metabolic profiling but also for profiling in other areas of biological and environmental systems.

  6. Phylogenetic Factor Analysis.

    PubMed

    Tolkoff, Max R; Alfaro, Michael E; Baele, Guy; Lemey, Philippe; Suchard, Marc A

    2018-05-01

    Phylogenetic comparative methods explore the relationships between quantitative traits adjusting for shared evolutionary history. This adjustment often occurs through a Brownian diffusion process along the branches of the phylogeny that generates model residuals or the traits themselves. For high-dimensional traits, inferring all pair-wise correlations within the multivariate diffusion is limiting. To circumvent this problem, we propose phylogenetic factor analysis (PFA) that assumes a small unknown number of independent evolutionary factors arise along the phylogeny and these factors generate clusters of dependent traits. Set in a Bayesian framework, PFA provides measures of uncertainty on the factor number and groupings, combines both continuous and discrete traits, integrates over missing measurements and incorporates phylogenetic uncertainty with the help of molecular sequences. We develop Gibbs samplers based on dynamic programming to estimate the PFA posterior distribution, over 3-fold faster than for multivariate diffusion and a further order-of-magnitude more efficiently in the presence of latent traits. We further propose a novel marginal likelihood estimator for previously impractical models with discrete data and find that PFA also provides a better fit than multivariate diffusion in evolutionary questions in columbine flower development, placental reproduction transitions and triggerfish fin morphometry.

  7. Comparing lagged linear correlation, lagged regression, Granger causality, and vector autoregression for uncovering associations in EHR data.

    PubMed

    Levine, Matthew E; Albers, David J; Hripcsak, George

    2016-01-01

    Time series analysis methods have been shown to reveal clinical and biological associations in data collected in the electronic health record. We wish to develop reliable high-throughput methods for identifying adverse drug effects that are easy to implement and produce readily interpretable results. To move toward this goal, we used univariate and multivariate lagged regression models to investigate associations between twenty pairs of drug orders and laboratory measurements. Multivariate lagged regression models exhibited higher sensitivity and specificity than univariate lagged regression in the 20 examples, and incorporating autoregressive terms for labs and drugs produced more robust signals in cases of known associations among the 20 example pairings. Moreover, including inpatient admission terms in the model attenuated the signals for some cases of unlikely associations, demonstrating how multivariate lagged regression models' explicit handling of context-based variables can provide a simple way to probe for health-care processes that confound analyses of EHR data.
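
    As a rough illustration of the simplest method named in this title, lagged linear correlation, the following sketch correlates a drug-order series with a laboratory series shifted by 0 to `max_lag` steps. The series, function names, and lag range are hypothetical and are not taken from the study; the multivariate lagged regression models it describes add further terms (autoregressive and admission terms) that are not shown here.

```python
from math import sqrt
from typing import Dict, Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def lagged_correlation(drug: Sequence[float], lab: Sequence[float],
                       max_lag: int) -> Dict[int, float]:
    """Correlate drug[t] with lab[t + lag] for lag = 0..max_lag."""
    return {
        lag: pearson(drug[: len(drug) - lag], lab[lag:])
        for lag in range(max_lag + 1)
    }


# Hypothetical toy data: the lab value echoes the drug signal two steps later.
drug = [0, 1, 0, 0, 1, 1, 0, 1, 0, 0, 1, 0]
lab = [5, 5] + [5 + 2 * d for d in drug[:-2]]
by_lag = lagged_correlation(drug, lab, max_lag=3)
best_lag = max(by_lag, key=by_lag.get)
```

    In this toy case the correlation peaks at the lag at which the echo was built in.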

  8. Identification of complex metabolic states in critically injured patients using bioinformatic cluster analysis.

    PubMed

    Cohen, Mitchell J; Grossman, Adam D; Morabito, Diane; Knudson, M Margaret; Butte, Atul J; Manley, Geoffrey T

    2010-01-01

    Advances in technology have made extensive monitoring of patient physiology the standard of care in intensive care units (ICUs). While many systems exist to compile these data, there has been no systematic multivariate analysis and categorization across patient physiological data. The sheer volume and complexity of these data make pattern recognition or identification of patient state difficult. Hierarchical cluster analysis allows visualization of high dimensional data and enables pattern recognition and identification of physiologic patient states. We hypothesized that processing of multivariate data using hierarchical clustering techniques would allow identification of otherwise hidden patient physiologic patterns that would be predictive of outcome. Multivariate physiologic and ventilator data were collected continuously using a multimodal bioinformatics system in the surgical ICU at San Francisco General Hospital. These data were incorporated with non-continuous data and stored on a server in the ICU. A hierarchical clustering algorithm grouped each minute of data into 1 of 10 clusters. Clusters were correlated with outcome measures including incidence of infection, multiple organ failure (MOF), and mortality. We identified 10 clusters, which we defined as distinct patient states. While patients transitioned between states, they spent significant amounts of time in each. Clusters were enriched for our outcome measures: 2 of the 10 states were enriched for infection, 6 of 10 were enriched for MOF, and 3 of 10 were enriched for death. Further analysis of correlations between pairs of variables within each cluster reveals significant differences in physiology between clusters. Here we show for the first time the feasibility of clustering physiological measurements to identify clinically relevant patient states after trauma. 
    These results demonstrate that hierarchical clustering techniques can be useful for visualizing complex multivariate data and may provide new insights for the care of critically injured patients.

  9. Using Concurrent Cardiovascular Information to Augment Survival Time Data from Orthostatic Tilt Tests

    NASA Technical Reports Server (NTRS)

    Feiveson, Alan H.; Fiedler, James; Lee, Stuart M. M.; Westby, Christian M.; Stenger, Michael B.; Platts, Steven H.

    2014-01-01

    Orthostatic Intolerance (OI) is the propensity to develop symptoms of fainting during upright standing. OI is associated with changes in heart rate, blood pressure and other measures of cardiac function. Problem: NASA astronauts have shown increased susceptibility to OI on return from space missions. Current methods for counteracting OI in astronauts include fluid loading and the use of compression garments. Multivariate trajectory spread is greater as OI increases. Pairwise comparisons at the same time within subjects allow incorporation of pass/fail outcomes. Path length, convex hull area, and covariance matrix determinant do well as statistics to summarize this spread. Open issues include missing data problems; time series analysis, which needs many more time points per orthostatic tilt test (OTT) session; the treatment of trend; and how to incorporate survival information.
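
    The three spread statistics named above can be sketched for a two-dimensional trajectory, for example (heart rate, blood pressure) pairs over time. This is a minimal illustration under that two-variable assumption; the function names and the sample points are hypothetical and the report does not state how the statistics were actually computed.

```python
from math import hypot
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def path_length(pts: Sequence[Point]) -> float:
    """Sum of Euclidean distances between consecutive trajectory points."""
    return sum(hypot(x2 - x1, y2 - y1)
               for (x1, y1), (x2, y2) in zip(pts, pts[1:]))


def convex_hull(pts: Sequence[Point]) -> List[Point]:
    """Andrew's monotone chain; returns hull vertices counter-clockwise."""
    p = sorted(set(pts))
    if len(p) <= 2:
        return p

    def cross(o: Point, a: Point, b: Point) -> float:
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower: List[Point] = []
    upper: List[Point] = []
    for q in p:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], q) <= 0:
            lower.pop()
        lower.append(q)
    for q in reversed(p):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], q) <= 0:
            upper.pop()
        upper.append(q)
    return lower[:-1] + upper[:-1]


def hull_area(pts: Sequence[Point]) -> float:
    """Area of the convex hull via the shoelace formula."""
    h = convex_hull(pts)
    n = len(h)
    if n < 3:
        return 0.0
    s = sum(h[i][0] * h[(i + 1) % n][1] - h[(i + 1) % n][0] * h[i][1]
            for i in range(n))
    return abs(s) / 2.0


def covariance_determinant(pts: Sequence[Point]) -> float:
    """Determinant of the 2x2 sample covariance matrix (n - 1 denominator)."""
    n = len(pts)
    mx = sum(x for x, _ in pts) / n
    my = sum(y for _, y in pts) / n
    sxx = sum((x - mx) ** 2 for x, _ in pts) / (n - 1)
    syy = sum((y - my) ** 2 for _, y in pts) / (n - 1)
    sxy = sum((x - mx) * (y - my) for x, y in pts) / (n - 1)
    return sxx * syy - sxy ** 2


# Hypothetical trajectory: four corners of a square, then its centre.
trajectory = [(0.0, 0.0), (2.0, 0.0), (2.0, 2.0), (0.0, 2.0), (1.0, 1.0)]
```

    All three statistics grow as the trajectory wanders more widely, which is the sense in which they summarize spread.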

  10. Fiber-optic evanescent-wave spectroscopy for fast multicomponent analysis of human blood

    NASA Astrophysics Data System (ADS)

    Simhi, Ronit; Gotshal, Yaron; Bunimovich, David; Katzir, Abraham; Sela, Ben-Ami

    1996-07-01

    A spectral analysis of human blood serum was undertaken by fiber-optic evanescent-wave spectroscopy (FEWS) by the use of a Fourier-transform infrared spectrometer. A special cell for the FEWS measurements was designed and built that incorporates an IR-transmitting silver halide fiber and a means for introducing the blood-serum sample. Further improvements in analysis were obtained by the adoption of multivariate calibration techniques that are already used in clinical chemistry. The partial least-squares algorithm was used to calculate the concentrations of cholesterol, total protein, urea, and uric acid in human blood serum. The estimated prediction errors obtained (in percent from the average value) were 6% for total protein, 15% for cholesterol, 30% for urea, and 30% for uric acid. These results were compared with another independent prediction method that used a neural-network model. This model yielded estimated prediction errors of 8.8% for total protein, 25% for cholesterol, and 21% for uric acid. Keywords: spectroscopy, fiber-optic evanescent-wave spectroscopy, Fourier-transform infrared spectrometer, blood, multivariate calibration, neural networks.

  11. On Models for Binomial Data with Random Numbers of Trials

    PubMed Central

    Comulada, W. Scott; Weiss, Robert E.

    2010-01-01

    Summary: A binomial outcome is a count s of the number of successes out of the total number of independent trials n = s + f, where f is a count of the failures. The n are random variables not fixed by design in many studies. Joint modeling of (s, f) can provide additional insight into the science and into the probability π of success that cannot be directly incorporated by the logistic regression model. Observations where n = 0 are excluded from the binomial analysis yet may be important to understanding how π is influenced by covariates. Correlation between s and f may exist and be of direct interest. We propose Bayesian multivariate Poisson models for the bivariate response (s, f), correlated through random effects. We extend our models to the analysis of longitudinal and multivariate longitudinal binomial outcomes. Our methodology was motivated by two disparate examples, one from teratology and one from an HIV tertiary intervention study. PMID:17688514
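
    The link between a Poisson model for (s, f) and the usual binomial model can be made explicit in the simplest case. The derivation below assumes independent Poisson counts with rates λ_s and λ_f, which are symbols introduced here for illustration; the models proposed in the paper additionally correlate s and f through random effects, which this sketch omits.

```latex
% Assumption: s and f are independent Poisson counts (no random effects).
\begin{align*}
  s &\sim \mathrm{Poisson}(\lambda_s), \qquad
  f \sim \mathrm{Poisson}(\lambda_f), \qquad
  n = s + f \sim \mathrm{Poisson}(\lambda_s + \lambda_f), \\[4pt]
  \Pr(s = k \mid n)
    &= \frac{\Pr(s = k)\,\Pr(f = n - k)}{\Pr(s + f = n)}
     = \binom{n}{k}\, \pi^{k} (1 - \pi)^{n - k},
  \qquad \pi = \frac{\lambda_s}{\lambda_s + \lambda_f}.
\end{align*}
```

    Conditioning on n therefore recovers the binomial model with success probability π, while the joint model also describes n itself, including observations with n = 0.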

  12. Differentiation between borderline and benign ovarian tumors: combined analysis of MRI with tumor markers for large cystic masses (≥5 cm).

    PubMed

    Park, Sung Yoon; Oh, Young Taik; Jung, Dae Chul

    2016-05-01

    There is overlap in imaging features between borderline and benign ovarian tumors. The purpose of this study was to analyze the diagnostic performance of magnetic resonance imaging (MRI) combined with tumor markers for differentiating borderline from benign ovarian tumors. Ninety-nine patients with MRI and surgically confirmed ovarian tumors 5 cm or larger (borderline, n = 37; benign, n = 62) were included. On MRI, tumor size, septal number (0; 1-4; 5 or more), and presence of a solid portion such as papillary projection or septal thickening 0.5 cm or larger were investigated. Serum tumor markers (carbohydrate antigen 125 [CA 125] and CA 19-9) were recorded. Multivariate analysis was conducted to assess whether MRI combined with tumor markers could differentiate borderline from benign tumors. The diagnostic performance was also analyzed. Incidence of a solid portion was 67.6% (25/37) in borderline and 3.2% (2/62) in benign tumors (P < 0.05). In all patients, without combined analysis of MRI with tumor markers, multivariate analysis revealed that solid portion (P < 0.001) and CA 125 (P = 0.039) were significant for predicting borderline tumors. When the combined analysis of MRI with CA 125 ((i) the presence of a solid portion or (ii) CA 125 > 44.1 U/mL with septal number ≥5 for borderline tumor) was incorporated into the multivariate analysis, it was the only significant predictor (P = 0.001). The sensitivity, specificity, PPV, NPV, and accuracy of combined analysis of MRI with CA 125 were 89.1%, 91.9%, 86.8%, 93.4%, and 90.9%, respectively. Combined analysis of MRI with CA 125 may allow better differentiation between borderline and benign ovarian tumors compared with MRI alone. © The Foundation Acta Radiologica 2015.
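
    The five performance figures quoted above all follow from a single 2x2 table. The sketch below shows the arithmetic. The cell counts (33 true positives, 4 false negatives, 57 true negatives, 5 false positives) are not given in the abstract; they are an assumption, back-calculated from the group sizes (37 borderline, 62 benign), and they reproduce the reported percentages to within about 0.1 percentage point.

```python
from typing import Dict


def diagnostic_metrics(tp: int, fn: int, tn: int, fp: int) -> Dict[str, float]:
    """Standard diagnostic test metrics from confusion-matrix counts."""
    return {
        "sensitivity": tp / (tp + fn),   # true positives among diseased
        "specificity": tn / (tn + fp),   # true negatives among non-diseased
        "ppv": tp / (tp + fp),           # positive predictive value
        "npv": tn / (tn + fn),           # negative predictive value
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
    }


# Assumed counts, back-calculated from the abstract (not reported there).
metrics = diagnostic_metrics(tp=33, fn=4, tn=57, fp=5)
percentages = {name: round(100 * value, 1) for name, value in metrics.items()}
```

    Note that PPV and NPV depend on the proportion of borderline tumors in the sample, so they would not carry over unchanged to a population with a different prevalence.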

  13. Incorporation of N0 Stage with Insufficient Numbers of Lymph Nodes into N1 Stage in the Seventh Edition of the TNM Classification Improves Prediction of Prognosis in Gastric Cancer: Results of a Single-Institution Study of 1258 Chinese Patients.

    PubMed

    Li, Bofei; Li, Yuanfang; Wang, Wei; Qiu, Haibo; Seeruttun, Sharvesh Raj; Fang, Cheng; Chen, Yongming; Liang, Yao; Li, Wei; Chen, Yingbo; Sun, Xiaowei; Guan, Yuanxiang; Zhan, Youqing; Zhou, Zhiwei

    2016-01-01

    This study examined the prognosis of the "node-negative with eLNs ≤ 15" designation and the additional value of incorporating it into the pN1 designation in the seventh edition of the N classification. From January 2000 to September 2010, a total of 1258 gastric cancer patients (patients with eLNs > 15 or node-negative with eLNs ≤ 15) undergoing radical gastric resection were enrolled in this study. We incorporated node-negative patients with eLNs ≤ 15 into pN1 and compared this designation with the current 7th edition UICC N stage for 3- and 5-year overall survival by univariate and multivariate analysis. Homogeneity, discriminatory ability, and monotonicity of gradients in the hypothetical N stage and the UICC N stage were compared using linear trend χ2, likelihood ratio χ2 statistics, and Akaike information criterion (AIC) calculations. Node-negative patients with eLNs ≤ 15 had worse survival compared with those with eLNs > 15. In univariate and multivariate analyses, the hypothetical N stage showed superiority to the 7th edition pN staging. The hypothetical staging system had higher linear trend and likelihood ratio χ2 scores and smaller AIC values compared with those for the TNM system, which represented the optimum prognostic stratification. Node-negative patients with eLNs ≤ 15 can be considered to be incorporated into the pN1 stage in the 7th edition of the TNM classification.

  14. Spatial assessment of air quality patterns in Malaysia using multivariate analysis

    NASA Astrophysics Data System (ADS)

    Dominick, Doreena; Juahir, Hafizan; Latif, Mohd Talib; Zain, Sharifuddin M.; Aris, Ahmad Zaharin

    2012-12-01

    This study aims to investigate possible sources of air pollutants and the spatial patterns within the eight selected Malaysian air monitoring stations based on a two-year database (2008-2009). Multivariate analysis was applied to the dataset. It incorporated Hierarchical Agglomerative Cluster Analysis (HACA) to assess the spatial patterns, Principal Component Analysis (PCA) to determine the major sources of the air pollution and Multiple Linear Regression (MLR) to assess the percentage contribution of each air pollutant. The HACA results grouped the eight monitoring stations into three different clusters, based on the characteristics of the air pollutants and meteorological parameters. The PCA analysis showed that the major sources of air pollution were emissions from motor vehicles, aircraft, industries and areas of high population density. The MLR analysis demonstrated that the main pollutant contributing to variability in the Air Pollutant Index (API) at all stations was particulate matter with a diameter of less than 10 μm (PM10). Further MLR analysis showed that the main air pollutant influencing the high concentration of PM10 was carbon monoxide (CO). This was due to combustion processes, particularly originating from motor vehicles. Meteorological factors such as ambient temperature, wind speed and humidity were also noted to influence the concentration of PM10.

  15. Augmented classical least squares multivariate spectral analysis

    DOEpatents

    Haaland, David M.; Melgaard, David K.

    2004-02-03

    A method of multivariate spectral analysis, termed augmented classical least squares (ACLS), provides an improved CLS calibration model when unmodeled sources of spectral variation are contained in a calibration sample set. The ACLS methods use information derived from component or spectral residuals during the CLS calibration to provide an improved calibration-augmented CLS model. The ACLS methods are based on CLS so that they retain the qualitative benefits of CLS, yet they have the flexibility of PLS and other hybrid techniques in that they can define a prediction model even with unmodeled sources of spectral variation that are not explicitly included in the calibration model. The unmodeled sources of spectral variation may be unknown constituents, constituents with unknown concentrations, nonlinear responses, non-uniform and correlated errors, or other sources of spectral variation that are present in the calibration sample spectra. Also, since the various ACLS methods are based on CLS, they can incorporate the new prediction-augmented CLS (PACLS) method of updating the prediction model for new sources of spectral variation contained in the prediction sample set without having to return to the calibration process. The ACLS methods can also be applied to alternating least squares models. The ACLS methods can be applied to all types of multivariate data.

  16. Augmented Classical Least Squares Multivariate Spectral Analysis

    DOEpatents

    Haaland, David M.; Melgaard, David K.

    2005-07-26

    A method of multivariate spectral analysis, termed augmented classical least squares (ACLS), provides an improved CLS calibration model when unmodeled sources of spectral variation are contained in a calibration sample set. The ACLS methods use information derived from component or spectral residuals during the CLS calibration to provide an improved calibration-augmented CLS model. The ACLS methods are based on CLS so that they retain the qualitative benefits of CLS, yet they have the flexibility of PLS and other hybrid techniques in that they can define a prediction model even with unmodeled sources of spectral variation that are not explicitly included in the calibration model. The unmodeled sources of spectral variation may be unknown constituents, constituents with unknown concentrations, nonlinear responses, non-uniform and correlated errors, or other sources of spectral variation that are present in the calibration sample spectra. Also, since the various ACLS methods are based on CLS, they can incorporate the new prediction-augmented CLS (PACLS) method of updating the prediction model for new sources of spectral variation contained in the prediction sample set without having to return to the calibration process. The ACLS methods can also be applied to alternating least squares models. The ACLS methods can be applied to all types of multivariate data.

  17. Augmented Classical Least Squares Multivariate Spectral Analysis

    DOEpatents

    Haaland, David M.; Melgaard, David K.

    2005-01-11

    A method of multivariate spectral analysis, termed augmented classical least squares (ACLS), provides an improved CLS calibration model when unmodeled sources of spectral variation are contained in a calibration sample set. The ACLS methods use information derived from component or spectral residuals during the CLS calibration to provide an improved calibration-augmented CLS model. The ACLS methods are based on CLS so that they retain the qualitative benefits of CLS, yet they have the flexibility of PLS and other hybrid techniques in that they can define a prediction model even with unmodeled sources of spectral variation that are not explicitly included in the calibration model. The unmodeled sources of spectral variation may be unknown constituents, constituents with unknown concentrations, nonlinear responses, non-uniform and correlated errors, or other sources of spectral variation that are present in the calibration sample spectra. Also, since the various ACLS methods are based on CLS, they can incorporate the new prediction-augmented CLS (PACLS) method of updating the prediction model for new sources of spectral variation contained in the prediction sample set without having to return to the calibration process. The ACLS methods can also be applied to alternating least squares models. The ACLS methods can be applied to all types of multivariate data.

  18. A Study on Aircraft Engine Control Systems for Integrated Flight and Propulsion Control

    NASA Astrophysics Data System (ADS)

    Yamane, Hideaki; Matsunaga, Yasushi; Kusakawa, Takeshi

    A flyable FADEC system engineering model incorporating the Integrated Flight and Propulsion Control (IFPC) concept is developed for a highly maneuverable aircraft and a fighter-class engine. An overview of the FADEC system and the functional assignments for its components, such as the Engine Control Unit (ECU) and the Integrated Control Unit (ICU), are described. Overall system reliability analysis, convex analysis and multivariable controller design for the engine, fault detection/redundancy management, and response characteristics of a fuel system are addressed. The engine control performance of the FADEC is demonstrated by hardware-in-the-loop simulation for fast acceleration and thrust transient characteristics.

  19. A framework for list representation, enabling list stabilization through incorporation of gene exchangeabilities.

    PubMed

    Soneson, Charlotte; Fontes, Magnus

    2012-01-01

    Analysis of multivariate data sets from, for example, microarray studies frequently results in lists of genes which are associated with some response of interest. The biological interpretation is often complicated by the statistical instability of the obtained gene lists, which may partly be due to the functional redundancy among genes, implying that multiple genes can play exchangeable roles in the cell. In this paper, we use the concept of exchangeability of random variables to model this functional redundancy and thereby account for the instability. We present a flexible framework to incorporate the exchangeability into the representation of lists. The proposed framework supports straightforward comparison between any 2 lists. It can also be used to generate new more stable gene rankings incorporating more information from the experimental data. Using 2 microarray data sets, we show that the proposed method provides more robust gene rankings than existing methods with respect to sampling variations, without compromising the biological significance of the rankings.

  20. A multivariate spatial mixture model for areal data: examining regional differences in standardized test scores

    PubMed Central

    Neelon, Brian; Gelfand, Alan E.; Miranda, Marie Lynn

    2013-01-01

    Summary: Researchers in the health and social sciences often wish to examine joint spatial patterns for two or more related outcomes. Examples include infant birth weight and gestational length, psychosocial and behavioral indices, and educational test scores from different cognitive domains. We propose a multivariate spatial mixture model for the joint analysis of continuous individual-level outcomes that are referenced to areal units. The responses are modeled as a finite mixture of multivariate normals, which accommodates a wide range of marginal response distributions and allows investigators to examine covariate effects within subpopulations of interest. The model has a hierarchical structure built at the individual level (i.e., individuals are nested within areal units), and thus incorporates both individual- and areal-level predictors as well as spatial random effects for each mixture component. Conditional autoregressive (CAR) priors on the random effects provide spatial smoothing and allow the shape of the multivariate distribution to vary flexibly across geographic regions. We adopt a Bayesian modeling approach and develop an efficient Markov chain Monte Carlo model fitting algorithm that relies primarily on closed-form full conditionals. We use the model to explore geographic patterns in end-of-grade math and reading test scores among school-age children in North Carolina. PMID:26401059

  1. Comparative multivariate analyses of transient otoacoustic emissions and distorsion products in normal and impaired hearing.

    PubMed

    Stamate, Mirela Cristina; Todor, Nicolae; Cosgarea, Marcel

    2015-01-01

    The clinical utility of otoacoustic emissions as a noninvasive objective test of cochlear function has been long studied. Both transient otoacoustic emissions and distorsion products can be used to identify hearing loss, but to what extent they can be used as predictors for hearing loss is still debated. Most studies agree that multivariate analyses have better test performances than univariate analyses. The aim of the study was to determine transient otoacoustic emissions and distorsion products performance in identifying normal and impaired hearing loss, using the pure tone audiogram as a gold standard procedure and different multivariate statistical approaches. The study included 105 adult subjects with normal hearing and hearing loss who underwent the same test battery: pure-tone audiometry, tympanometry, otoacoustic emission tests. We chose to use the logistic regression as a multivariate statistical technique. Three logistic regression models were developed to characterize the relations between different risk factors (age, sex, tinnitus, demographic features, cochlear status defined by otoacoustic emissions) and hearing status defined by pure-tone audiometry. The multivariate analyses allow the calculation of the logistic score, which is a combination of the inputs, weighted by coefficients, calculated within the analyses. The accuracy of each model was assessed using receiver operating characteristics curve analysis. We used the logistic score to generate receivers operating curves and to estimate the areas under the curves in order to compare different multivariate analyses. We compared the performance of each otoacoustic emission (transient, distorsion product) using three different multivariate analyses for each ear, when multi-frequency gold standards were used. We demonstrated that all multivariate analyses provided high values of the area under the curve proving the performance of the otoacoustic emissions. 
    Each otoacoustic emission test presented high values of area under the curve, suggesting that implementing a multivariate approach to evaluate the performances of each otoacoustic emission test would serve to increase the accuracy in identifying the normal and impaired ears. We encountered the highest area under the curve value for the combined multivariate analysis suggesting that both otoacoustic emission tests should be used in assessing hearing status. Our multivariate analyses revealed that age is a constant predictor factor of the auditory status for both ears, but the presence of tinnitus was the most important predictor for the hearing level, only for the left ear. Age presented similar coefficients, but tinnitus coefficients, by their high value, produced the highest variations of the logistic scores, only for the left ear group, thus increasing the risk of hearing loss. We did not find gender differences between ears for any otoacoustic emission tests, but studies still debate this question as the results are contradictory. Neither gender, nor environment origin had any predictive value for the hearing status, according to the results of our study. Like any other audiological test, using otoacoustic emissions to identify hearing loss is not without error. Even when applying multivariate analysis, perfect test performance is never achieved. Although most studies demonstrated the benefit of using the multivariate analysis, it has not been incorporated into clinical decisions maybe because of the idiosyncratic nature of multivariate solutions or because of the lack of the validation studies.
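
    The abstract describes a logistic score (a coefficient-weighted combination of the inputs) that is then used to build receiver operating characteristic curves and compare areas under the curve. A minimal sketch of both steps follows. The coefficients, feature values, and scores are hypothetical, and the AUC is computed with the rank-based (Mann-Whitney) identity rather than by tracing the curve, which may differ from the authors' software.

```python
from math import exp
from typing import Sequence


def logistic_score(features: Sequence[float], coefs: Sequence[float],
                   intercept: float) -> float:
    """Logistic transform of the coefficient-weighted sum of the inputs."""
    z = intercept + sum(c * x for c, x in zip(coefs, features))
    return 1.0 / (1.0 + exp(-z))


def auc(scores_impaired: Sequence[float], scores_normal: Sequence[float]) -> float:
    """Probability that a random impaired ear scores above a random normal
    ear; ties count one half. This equals the area under the ROC curve."""
    wins = 0.0
    for p in scores_impaired:
        for q in scores_normal:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(scores_impaired) * len(scores_normal))


# Hypothetical logistic scores for impaired and normal ears.
impaired = [0.9, 0.8, 0.4]
normal = [0.5, 0.3, 0.2]
area = auc(impaired, normal)
```

    An area of 0.5 corresponds to chance and 1.0 to perfect separation, so models can be ranked by this single number, as the study does.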

  2. Comparative multivariate analyses of transient otoacoustic emissions and distorsion products in normal and impaired hearing

    PubMed Central

    STAMATE, MIRELA CRISTINA; TODOR, NICOLAE; COSGAREA, MARCEL

    2015-01-01

    Background and aim The clinical utility of otoacoustic emissions as a noninvasive objective test of cochlear function has long been studied. Both transient otoacoustic emissions and distortion products can be used to identify hearing loss, but to what extent they can be used as predictors of hearing loss is still debated. Most studies agree that multivariate analyses have better test performance than univariate analyses. The aim of the study was to determine the performance of transient otoacoustic emissions and distortion products in identifying normal and impaired hearing, using the pure-tone audiogram as the gold standard procedure and different multivariate statistical approaches. Methods The study included 105 adult subjects with normal hearing and hearing loss who underwent the same test battery: pure-tone audiometry, tympanometry, and otoacoustic emission tests. We chose logistic regression as the multivariate statistical technique. Three logistic regression models were developed to characterize the relations between different risk factors (age, sex, tinnitus, demographic features, cochlear status defined by otoacoustic emissions) and hearing status defined by pure-tone audiometry. The multivariate analyses allow the calculation of the logistic score, which is a combination of the inputs, weighted by coefficients calculated within the analyses. The accuracy of each model was assessed using receiver operating characteristic curve analysis. We used the logistic score to generate receiver operating characteristic curves and to estimate the areas under the curves in order to compare the different multivariate analyses. Results We compared the performance of each otoacoustic emission (transient, distortion product) using three different multivariate analyses for each ear, when multi-frequency gold standards were used. We demonstrated that all multivariate analyses provided high values of the area under the curve, demonstrating the performance of the otoacoustic emissions. Each otoacoustic emission test presented high values of area under the curve, suggesting that implementing a multivariate approach to evaluate the performance of each otoacoustic emission test would serve to increase the accuracy in identifying normal and impaired ears. We encountered the highest area under the curve value for the combined multivariate analysis, suggesting that both otoacoustic emission tests should be used in assessing hearing status. Our multivariate analyses revealed that age is a constant predictor of the auditory status for both ears, but the presence of tinnitus was the most important predictor of the hearing level only for the left ear. Age presented similar coefficients, but tinnitus coefficients, by their high value, produced the highest variations of the logistic scores, only for the left ear group, thus increasing the risk of hearing loss. We did not find gender differences between ears for any otoacoustic emission tests, but studies still debate this question as the results are contradictory. Neither gender nor environment of origin had any predictive value for the hearing status, according to the results of our study. Conclusion Like any other audiological test, using otoacoustic emissions to identify hearing loss is not without error. Even when applying multivariate analysis, perfect test performance is never achieved. Although most studies have demonstrated the benefit of using multivariate analysis, it has not been incorporated into clinical decision-making, perhaps because of the idiosyncratic nature of multivariate solutions or because of the lack of validation studies. PMID:26733749
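
    The logistic score and the area under the curve described in this abstract can be sketched as follows. This is a minimal illustration and not the study's code: the coefficient values, the two-predictor layout (age, tinnitus), and the function names `logistic_score`, `probability` and `roc_auc` are assumptions made for the example. The AUC is computed with the rank-based (Mann-Whitney) formulation.

```python
import math

def logistic_score(x, coef, intercept):
    """Linear predictor: intercept plus the coefficient-weighted sum of the inputs."""
    return intercept + sum(c * v for c, v in zip(coef, x))

def probability(score):
    """Map a logistic score to a probability with the logistic function."""
    return 1.0 / (1.0 + math.exp(-score))

def roc_auc(scores, labels):
    """Fraction of (impaired, normal) pairs ranked correctly; ties count as 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical coefficients for (age in years, tinnitus present 0/1).
coef, intercept = [0.05, 1.2], -3.0
subjects = [[30, 0], [45, 0], [50, 1], [70, 1]]
labels = [0, 0, 1, 1]  # 1 = hearing loss on the gold-standard audiogram
scores = [logistic_score(x, coef, intercept) for x in subjects]
print(roc_auc(scores, labels))
```

    Because the AUC depends only on how the scores rank the subjects, it can be computed from the logistic score directly, without converting the score to a probability first.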

  3. Error Covariance Penalized Regression: A novel multivariate model combining penalized regression with multivariate error structure.

    PubMed

    Allegrini, Franco; Braga, Jez W B; Moreira, Alessandro C O; Olivieri, Alejandro C

    2018-06-29

    A new multivariate regression model, named Error Covariance Penalized Regression (ECPR) is presented. Following a penalized regression strategy, the proposed model incorporates information about the measurement error structure of the system, using the error covariance matrix (ECM) as a penalization term. Results are reported from both simulations and experimental data based on replicate mid and near infrared (MIR and NIR) spectral measurements. The results for ECPR are better under non-iid conditions when compared with traditional first-order multivariate methods such as ridge regression (RR), principal component regression (PCR) and partial least-squares regression (PLS). Copyright © 2018 Elsevier B.V. All rights reserved.
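
    The idea of using an error covariance matrix as a penalization term can be illustrated with a generalized ridge estimator. The abstract does not give the ECPR equations, so the closed form below, the function names, and the way the ECM is estimated from replicates are assumptions for illustration only and should not be read as the authors' published estimator.

```python
import numpy as np

def error_covariance(replicates):
    """Estimate a variables-by-variables ECM from replicate spectra (rows = replicates)."""
    return np.cov(np.asarray(replicates, dtype=float), rowvar=False)

def ecm_penalized_regression(X, y, ecm, lam):
    """Solve (X'X + lam * ECM) b = X'y, an assumed generalized-ridge form.

    With ecm equal to the identity matrix this reduces to ordinary ridge
    regression, which corresponds to iid measurement errors.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    return np.linalg.solve(X.T @ X + lam * np.asarray(ecm, dtype=float), X.T @ y)
```

    Under this assumed form, directions in variable space with large measurement error variance are penalized more strongly than under ridge regression, which treats all variables alike.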

  4. Resemblance profiles as clustering decision criteria: Estimating statistical power, error, and correspondence for a hypothesis test for multivariate structure.

    PubMed

    Kilborn, Joshua P; Jones, David L; Peebles, Ernst B; Naar, David F

    2017-04-01

    Clustering data continues to be a highly active area of data analysis, and resemblance profiles are being incorporated into ecological methodologies as a hypothesis testing-based approach to clustering multivariate data. However, these new clustering techniques have not been rigorously tested to determine the performance variability based on the algorithm's assumptions or any underlying data structures. Here, we use simulation studies to estimate the statistical error rates for the hypothesis test for multivariate structure based on dissimilarity profiles (DISPROF). We concurrently tested a widely used algorithm that employs the unweighted pair group method with arithmetic mean (UPGMA) to estimate the proficiency of clustering with DISPROF as a decision criterion. We simulated unstructured multivariate data from different probability distributions with increasing numbers of objects and descriptors, and grouped data with increasing overlap, overdispersion for ecological data, and correlation among descriptors within groups. Using simulated data, we measured the resolution and correspondence of clustering solutions achieved by DISPROF with UPGMA against the reference grouping partitions used to simulate the structured test datasets. Our results highlight the dynamic interactions between dataset dimensionality, group overlap, and the properties of the descriptors within a group (i.e., overdispersion or correlation structure) that are relevant to resemblance profiles as a clustering criterion for multivariate data. These methods are particularly useful for multivariate ecological datasets that benefit from distance-based statistical analyses. We propose guidelines for using DISPROF as a clustering decision tool that will help future users avoid potential pitfalls during the application of methods and the interpretation of results.

  5. Multivariate regression model for predicting yields of grade lumber from yellow birch sawlogs

    Treesearch

    Andrew F. Howard; Daniel A. Yaussy

    1986-01-01

    A multivariate regression model was developed to predict green board-foot yields for the common grades of factory lumber processed from yellow birch factory-grade logs. The model incorporates the standard log measurements of scaling diameter, length, proportion of scalable defects, and the assigned USDA Forest Service log grade. Differences in yields between band and...

  6. Validation and Development of a Modified Breast Graded Prognostic Assessment As a Tool for Survival in Patients With Breast Cancer and Brain Metastases.

    PubMed

    Subbiah, Ishwaria M; Lei, Xiudong; Weinberg, Jeffrey S; Sulman, Erik P; Chavez-MacGregor, Mariana; Tripathy, Debu; Gupta, Rohan; Varma, Ankur; Chouhan, Jay; Guevarra, Richard P; Valero, Vicente; Gilbert, Mark R; Gonzalez-Angulo, Ana M

    2015-07-10

    Several indices have been developed to predict overall survival (OS) in patients with breast cancer with brain metastases, including the breast graded prognostic assessment (breast-GPA), comprising age, tumor subtype, and Karnofsky performance score. However, the number of brain metastases, a highly relevant clinical variable, is less often incorporated into the final model. We sought to validate the existing breast-GPA in an independent larger cohort and refine it by integrating the number of brain metastases. Data were retrospectively gathered from a prospectively maintained institutional database. Patients with newly diagnosed brain metastases from 1996 to 2013 were identified. After validating the breast-GPA, multivariable Cox regression and recursive partitioning analysis led to the development of the modified breast-GPA. The performances of the breast-GPA and modified breast-GPA were compared using the concordance index. In our cohort of 1,552 patients, the breast-GPA was validated as a prognostic tool for OS (P < .001). In multivariable analysis of the breast-GPA and number of brain metastases (> three v ≤ three), both were independent predictors of OS. We therefore developed the modified breast-GPA integrating a fourth clinical parameter. Recursive partitioning analysis reinforced the prognostic significance of these four factors. Concordance indices were 0.78 (95% CI, 0.77 to 0.80) and 0.84 (95% CI, 0.83 to 0.85) for the breast-GPA and modified breast-GPA, respectively (P < .001). The modified breast-GPA incorporates four simple clinical parameters of high prognostic significance. This index has an immediate role in the clinic as a formative part of the clinician's discussion of prognosis and direction of care and as a potential patient selection tool for clinical trials. © 2015 by American Society of Clinical Oncology.
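
    The concordance index used to compare the two prognostic scores can be sketched in a few lines. This is a minimal version of Harrell's C under stated assumptions: a higher `risk` value is taken to mean a worse prognosis, tied risk values count as half-concordant, and tied survival times are simply skipped. The function name and inputs are illustrative and not taken from the study.

```python
def concordance_index(times, events, risk):
    """Harrell's C: share of comparable pairs whose risk ordering matches survival.

    times  -- follow-up time for each patient
    events -- 1 if death was observed, 0 if the patient was censored
    risk   -- prognostic score, higher meaning shorter expected survival
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            # A pair is comparable when patient i had an observed event
            # strictly before patient j's follow-up time.
            if events[i] and times[i] < times[j]:
                comparable += 1
                if risk[i] > risk[j]:
                    concordant += 1.0
                elif risk[i] == risk[j]:
                    concordant += 0.5
    return concordant / comparable
```

    A value of 0.5 corresponds to chance-level ranking and 1.0 to perfect ranking, which is the scale on which the reported values of 0.78 and 0.84 are read.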

  7. [Multivariate Adaptive Regression Splines (MARS), an alternative for the analysis of time series].

    PubMed

    Vanegas, Jairo; Vásquez, Fabián

    Multivariate Adaptive Regression Splines (MARS) is a non-parametric modelling method that extends the linear model, incorporating nonlinearities and interactions between variables. It is a flexible tool that automates the construction of predictive models: selecting relevant variables, transforming the predictor variables, processing missing values and preventing overfitting using a self-test. It is also able to predict, taking into account structural factors that might influence the outcome variable, thereby generating hypothetical models. The end result could identify relevant cut-off points in data series. It is rarely used in health, so it is proposed as a tool for the evaluation of relevant public health indicators. For demonstrative purposes, data series regarding the mortality of children under 5 years of age in Costa Rica were used, comprising the period 1978-2008. Copyright © 2016 SESPAS. Publicado por Elsevier España, S.L.U. All rights reserved.
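
    The way MARS extends the linear model can be illustrated with its basic building block, the mirrored pair of hinge functions max(0, x - c) and max(0, c - x) placed at a knot c. The sketch below fits a single hinge pair by least squares with the knot fixed in advance. That is a deliberate simplification: a full MARS procedure searches over knots and variables and then prunes terms, none of which is implemented here, and the function names are assumptions for the example.

```python
import numpy as np

def hinge_pair(x, knot):
    """Return the mirrored hinge basis functions max(0, x - knot) and max(0, knot - x)."""
    x = np.asarray(x, dtype=float)
    return np.maximum(0.0, x - knot), np.maximum(0.0, knot - x)

def fit_hinge_model(x, y, knot):
    """Least-squares fit of y = b0 + b1 * max(0, x - knot) + b2 * max(0, knot - x)."""
    right, left = hinge_pair(x, knot)
    design = np.column_stack([np.ones_like(right), right, left])
    coef, *_ = np.linalg.lstsq(design, np.asarray(y, dtype=float), rcond=None)
    return coef

# Synthetic series that is flat up to x = 3 and then rises with slope 2.
x = np.linspace(0.0, 6.0, 13)
y = 1.0 + 2.0 * np.maximum(0.0, x - 3.0)
print(fit_hinge_model(x, y, knot=3.0))
```

    The knot plays the role of the cut-off point mentioned in the abstract: the fitted slopes on either side of it describe how the series behaves before and after that point.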

  8. Correlates of HIV knowledge and Sexual risk behaviors among Female Military Personnel

    PubMed Central

    Essien, E. James; Monjok, Emmanuel; Chen, Hua; Abughosh, Susan; Ekong, Ernest; Peters, Ronald J.; Holmes, Laurens; Holstad, Marcia M.; Mgbere, Osaro

    2010-01-01

    Objective Uniformed services personnel are at an increased risk of HIV infection. We examined HIV/AIDS knowledge and sexual risk behaviors among female military personnel to determine the correlates of HIV risk behaviors in this population. Method The study used a cross-sectional design to examine HIV/AIDS knowledge and sexual risk behaviors in a sample of 346 females drawn from two military cantonments in Southwestern Nigeria. Data were collected between 2006 and 2008. Using bivariate analysis and multivariate logistic regression, HIV/AIDS knowledge and sexual behaviors were described in relation to socio-demographic characteristics of the participants. Results Multivariate logistic regression analysis revealed that level of education and knowing someone with HIV/AIDS were significant (P<0.05) predictors of HIV knowledge in this sample. HIV prevention self-efficacy was significantly (P<0.05) predicted by annual income and race/ethnicity. Condom use attitudes were also significantly (P<0.05) associated with number of children, annual income, and number of sexual partners. Conclusion The data indicate the importance of incorporating these predictor variables into intervention designs. PMID:20387111

  9. Measuring center of pressure signals to quantify human balance using multivariate multiscale entropy by designing a force platform.

    PubMed

    Huang, Cheng-Wei; Sue, Pei-Der; Abbod, Maysam F; Jiang, Bernard C; Shieh, Jiann-Shing

    2013-08-08

    To assess the improvement of human body balance, a low cost and portable measuring device of center of pressure (COP), known as center of pressure and complexity monitoring system (CPCMS), has been developed for data logging and analysis. In order to prove that the system can estimate the different magnitude of different sways in comparison with the commercial Advanced Mechanical Technology Incorporation (AMTI) system, four sway tests have been developed (i.e., eyes open, eyes closed, eyes open with water pad, and eyes closed with water pad) to produce different sway displacements. Firstly, static and dynamic tests were conducted to investigate the feasibility of the system. Then, correlation tests of the CPCMS and AMTI systems have been compared with four sway tests. The results are within the acceptable range. Furthermore, multivariate empirical mode decomposition (MEMD) and enhanced multivariate multiscale entropy (MMSE) analysis methods have been used to analyze COP data reported by the CPCMS and compare it with the AMTI system. The improvements of the CPCMS are 35% to 70% (open eyes test) and 60% to 70% (eyes closed test) with and without water pad. The AMTI system has shown an improvement of 40% to 80% (open eyes test) and 65% to 75% (closed eyes test). The results indicate that the CPCMS system can achieve similar results to the commercial product so it can determine the balance.

  10. Measuring Center of Pressure Signals to Quantify Human Balance Using Multivariate Multiscale Entropy by Designing a Force Platform

    PubMed Central

    Huang, Cheng-Wei; Sue, Pei-Der; Abbod, Maysam F.; Jiang, Bernard C.; Shieh, Jiann-Shing

    2013-01-01

    To assess the improvement of human body balance, a low cost and portable measuring device of center of pressure (COP), known as center of pressure and complexity monitoring system (CPCMS), has been developed for data logging and analysis. In order to prove that the system can estimate the different magnitude of different sways in comparison with the commercial Advanced Mechanical Technology Incorporation (AMTI) system, four sway tests have been developed (i.e., eyes open, eyes closed, eyes open with water pad, and eyes closed with water pad) to produce different sway displacements. Firstly, static and dynamic tests were conducted to investigate the feasibility of the system. Then, correlation tests of the CPCMS and AMTI systems have been compared with four sway tests. The results are within the acceptable range. Furthermore, multivariate empirical mode decomposition (MEMD) and enhanced multivariate multiscale entropy (MMSE) analysis methods have been used to analyze COP data reported by the CPCMS and compare it with the AMTI system. The improvements of the CPCMS are 35% to 70% (open eyes test) and 60% to 70% (eyes closed test) with and without water pad. The AMTI system has shown an improvement of 40% to 80% (open eyes test) and 65% to 75% (closed eyes test). The results indicate that the CPCMS system can achieve similar results to the commercial product so it can determine the balance. PMID:23966184

  11. Practice and Learning: Spatiotemporal Differences in Thalamo-Cortical-Cerebellar Networks Engagement across Learning Phases in Schizophrenia.

    PubMed

    Korostil, Michele; Remington, Gary; McIntosh, Anthony Randal

    2016-01-01

    Understanding how practice mediates the transition of brain-behavior networks between early and later stages of learning is constrained by the common approach to analysis of fMRI data. Prior imaging studies have mostly relied on a single scan, and parametric, task-related analyses. Our experiment incorporates a multisession fMRI lexicon-learning experiment with multivariate, whole-brain analysis to further knowledge of the distributed networks supporting practice-related learning in schizophrenia (SZ). Participants with SZ were compared with healthy control (HC) participants as they learned a novel lexicon during two fMRI scans over a several-day period. All participants were trained to equal task proficiency prior to scanning. Behavioral-Partial Least Squares, a multivariate analytic approach, was used to analyze the imaging data. Permutation testing was used to determine statistical significance and bootstrap resampling to determine the reliability of the findings. With practice, HC participants transitioned to a brain-accuracy network incorporating dorsostriatal regions in late-learning stages. The SZ participants did not transition to this pattern despite comparable behavioral results. Instead, successful learners with SZ were differentiated primarily on the basis of greater engagement of perceptual and perceptual-integration brain regions. There is a different spatiotemporal unfolding of brain-learning relationships in SZ. In SZ, given the same amount of practice, the movement from networks suggestive of effortful learning toward a subcortically driven procedural one differs from that in HC participants. Learning performance in SZ is driven by varying levels of engagement in perceptual regions, which suggests perception itself is impaired and may impact downstream, "higher level" cognition.

  12. Detecting spatio-temporal modes in multivariate data by entropy field decomposition

    NASA Astrophysics Data System (ADS)

    Frank, Lawrence R.; Galinsky, Vitaly L.

    2016-09-01

    A new data analysis method that addresses a general problem of detecting spatio-temporal variations in multivariate data is presented. The method utilizes two recent and complementary general approaches to data analysis, information field theory (IFT) and entropy spectrum pathways (ESPs). Both methods reformulate and incorporate Bayesian theory, and thus use prior information to uncover the underlying structure of the unknown signal. Unification of ESP and IFT creates an approach that is non-Gaussian and nonlinear by construction and is found to produce unique spatio-temporal modes of signal behavior that can be ranked according to their significance, from which space-time trajectories of parameter variations can be constructed and quantified. Two brief examples of real world applications of the theory to the analysis of data of completely different and unrelated nature, lacking any underlying similarity, are also presented. The first example provides an analysis of resting state functional magnetic resonance imaging data that allowed us to create an efficient and accurate computational method for assessing and categorizing brain activity. The second example demonstrates the potential of the method in the application to the analysis of a strong atmospheric storm circulation system during the complicated stage of tornado development and formation using data recorded by a mobile Doppler radar. Reference implementation of the method will be made available as a part of the QUEST toolkit that is currently under development at the Center for Scientific Computation in Imaging.

  13. Can we discover double Higgs production at the LHC?

    NASA Astrophysics Data System (ADS)

    Alves, Alexandre; Ghosh, Tathagata; Sinha, Kuver

    2017-08-01

    We explore double Higgs production via gluon fusion in the bb̄γγ channel at the high-luminosity LHC using machine learning tools. We first propose a Bayesian optimization approach to select cuts on kinematic variables, obtaining a 30%-50% increase in the significance compared to current results in the literature. We show that this improvement persists once systematic uncertainties are taken into account. We next use boosted decision trees (BDT) to further discriminate signal and background events. Our analysis shows that a joint optimization of kinematic cuts and BDT hyperparameters results in an appreciable improvement in the significance. Finally, we perform a multivariate analysis of the output scores of the BDT. We find that assuming a very low level of systematics, the techniques proposed here will be able to confirm the production of a pair of standard model Higgs bosons at the 5σ level with 3 ab⁻¹ of data. Assuming a more realistic projection of the level of systematics, around 10%, the optimization of cuts to train BDTs combined with a multivariate analysis delivers a respectable significance of 4.6σ. Even assuming large systematics of 20%, our analysis predicts a 3.6σ significance, which represents at least strong evidence in favor of double Higgs production. We carefully incorporate background contributions coming from light flavor jets or c jets being misidentified as b jets and jets being misidentified as photons in our analysis.
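
    The effect of systematic uncertainties on a counting significance can be illustrated with a widely used approximation, Z = S / sqrt(B + (εB)²), where S and B are the expected signal and background yields and ε is the fractional systematic uncertainty on the background. The abstract does not state which significance formula the authors used, so this formula, the function name, and the yields in the example are assumptions chosen only to show the qualitative trend.

```python
import math

def significance(signal, background, syst=0.0):
    """Approximate significance S / sqrt(B + (syst * B)^2).

    syst is the fractional systematic uncertainty on the background yield;
    syst = 0 recovers the purely statistical estimate S / sqrt(B).
    """
    return signal / math.sqrt(background + (syst * background) ** 2)

# Hypothetical yields: the same selection evaluated at three systematics levels.
for syst in (0.0, 0.1, 0.2):
    print(syst, round(significance(10.0, 100.0, syst), 3))
```

    In this approximation the systematic term grows with the square of the background yield, so tighter selections that reduce B limit the loss of significance as ε increases.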

  14. Mouse double minute-2 homolog (MDM2)-rs2279744 polymorphism associated with lung cancer risk in a Northeastern Chinese population.

    PubMed

    Wang, Xu; Jin, Lina; Cui, Jiuwei; Ma, Kewei; Chen, Xiao; Li, Wei

    2015-01-01

    Altered expression or function of mouse double minute-2 (MDM2) protein could contribute to lung carcinogenesis; thus, this study investigated MDM2-rs2279744 polymorphism together with other epidemiologic factors for their association with lung cancer risk. A total of 500 lung cancer patients and 500 age and gender-matched healthy controls living in Northeastern China were recruited for genotyping of MDM2-rs2279744. Clinicopathological data was collected and subjected to univariate and multivariate analyses. In univariate analysis, the MDM2-rs2279744 G/G genotype versus T/T + T/G genotypes showed a tendency toward a higher incidence of lung cancer in the recessive model (P = 0.043). However, there were no significant differences when it was analyzed by the dominant, additive, or multiplicative models. A significantly increased lung cancer risk was observed associated with lower education level, lower body mass index, cancer family history, prior diagnosis of chronic obstructive pulmonary disease and pneumonia, exposure to pesticide or gasoline/diesel, tobacco smoking, and heavy cooking emissions when assessed by multivariate analyses. Moreover, MDM2-rs2279744 was still a significant risk factor even after incorporating environmental and lifestyle factors. However, there was no association between MDM2-rs2279744 and other factors. The MDM2-rs2279744 G/G genotype was associated with a higher lung cancer risk, even after incorporating other epidemiologic factors.

  15. Refined nomogram incorporating standing cough test improves prediction of male transobturator sling success.

    PubMed

    Shakir, Nabeel A; Fuchs, Joceline S; McKibben, Maxim J; Viers, Boyd R; Pagliara, Travis J; Scott, Jeremy M; Morey, Allen F

    2018-05-01

    To develop a decision aid in predicting sling success, incorporating the Male Stress Incontinence Grading Scale (MSIGS) into existing treatment algorithms. We reviewed men undergoing first-time transobturator sling for stress urinary incontinence (SUI) from 2007 to 2016 at our institution. Patient demographics, reported pads per day (PPD), and Standing Cough Test (SCT) results graded 0-4, according to MSIGS, were assessed. Treatment failure was defined as subsequent need for >1 PPD or further procedures. Parameters associated with failure were included in multivariable logistic models, compared by area under the receiver-operating characteristic curves. A nomogram was generated from the model with greatest AUC and internally validated. Overall 203 men (median age 67 years, IQR 63-72) were evaluated with median follow-up of 45 months (IQR 11-75 months). A total of 185 men (91%) were status-post radical prostatectomy and 29 (14%) had pelvic radiation history. Median PPD and SCT grade were both two. Eighty men (39%) failed treatment (use of ≥1 PPD or subsequent anti-incontinence procedures) at a median of 9 months. History of radiation (P = 0.03), increasing MSIGS (P < 0.0001) and increasing preoperative PPD (P < 0.0001) were associated with failure on univariate analysis. In a multivariable model with AUC 0.81, MSIGS, and PPD remained associated (P = 0.002 and <0.0001 respectively, and radiation history P = 0.06), and was superior to models incorporating PPD and radiation alone (AUC 0.77, P = 0.02), PPD alone (AUC 0.76, P = 0.02), and a cutpoint of >2 PPD alone (AUC 0.71, P = 0.0001). MSIGS adds prognostic value to PPD in assessing success of transobturator sling for treatment of SUI. © 2018 Wiley Periodicals, Inc.

  16. Extending local canonical correlation analysis to handle general linear contrasts for FMRI data.

    PubMed

    Jin, Mingwu; Nandy, Rajesh; Curran, Tim; Cordes, Dietmar

    2012-01-01

    Local canonical correlation analysis (CCA) is a multivariate method that has been proposed to more accurately determine activation patterns in fMRI data. In its conventional formulation, CCA has several drawbacks that limit its usefulness in fMRI. A major drawback is that, unlike the general linear model (GLM), a test of general linear contrasts of the temporal regressors has not been incorporated into the CCA formalism. To overcome this drawback, a novel directional test statistic was derived using the equivalence of multivariate multiple regression (MVMR) and CCA. This extension will allow CCA to be used for inference of general linear contrasts in more complicated fMRI designs without reparameterization of the design matrix and without reestimating the CCA solutions for each particular contrast of interest. With the proper constraints on the spatial coefficients of CCA, this test statistic can yield a more powerful test on the inference of evoked brain regional activations from noisy fMRI data than the conventional t-test in the GLM. The quantitative results from simulated and pseudoreal data and activation maps from fMRI data were used to demonstrate the advantage of this novel test statistic.

  17. Extending Local Canonical Correlation Analysis to Handle General Linear Contrasts for fMRI Data

    PubMed Central

    Jin, Mingwu; Nandy, Rajesh; Curran, Tim; Cordes, Dietmar

    2012-01-01

    Local canonical correlation analysis (CCA) is a multivariate method that has been proposed to more accurately determine activation patterns in fMRI data. In its conventional formulation, CCA has several drawbacks that limit its usefulness in fMRI. A major drawback is that, unlike the general linear model (GLM), a test of general linear contrasts of the temporal regressors has not been incorporated into the CCA formalism. To overcome this drawback, a novel directional test statistic was derived using the equivalence of multivariate multiple regression (MVMR) and CCA. This extension will allow CCA to be used for inference of general linear contrasts in more complicated fMRI designs without reparameterization of the design matrix and without reestimating the CCA solutions for each particular contrast of interest. With the proper constraints on the spatial coefficients of CCA, this test statistic can yield a more powerful test on the inference of evoked brain regional activations from noisy fMRI data than the conventional t-test in the GLM. The quantitative results from simulated and pseudoreal data and activation maps from fMRI data were used to demonstrate the advantage of this novel test statistic. PMID:22461786

  18. Survival from colorectal cancer in Victoria: 10-year follow up of the 1987 management survey.

    PubMed

    McLeish, John A; Thursfield, Vicky J; Giles, Graham G

    2002-05-01

    In 1987, the Victorian Cancer Registry identified a population-based sample of patients who underwent surgery for colorectal cancer for an audit of management following resection. Over 10 years have passed since this survey, and data on the survival of these patients (incorporating various prognostic indicators collected at the time of the survey) are now discussed in the present report. Relative survival analysis was conducted for each prognostic indicator separately and then combined in a multivariate model. Relative survival at 5 years for patients undergoing curative resections was 76% compared with 7% for those whose treatment was considered palliative. Survival at 10 years was little changed (73% and 7% respectively). Survival did not differ significantly by sex or age irrespective of treatment intention. In the curative group, only stage was a significant predictor of survival. Multivariate analysis was performed only for the curative group. Adjusting for all variables simultaneously, stage was the only significant predictor of survival. Patients with Dukes' stage C disease were at a significantly greater risk (OR 5.5 (1.7-17.6)) than those with Dukes' A. Neither tumour site, sex, age, surgeon activity level nor adjuvant therapies made a significant contribution to the model.

  19. A revision of chiggers of the minuta species-group (Acari: Trombiculidae: Neotrombicula Hirst, 1925) using multivariate morphometrics.

    PubMed

    Stekolnikov, Alexandr A; Klimov, Pavel B

    2010-09-01

    We revise chiggers belonging to the minuta-species group (genus Neotrombicula Hirst, 1925) from the Palaearctic using size-free multivariate morphometrics. This approach allowed us to resolve several diagnostic problems. We show that the widely distributed Neotrombicula scrupulosa Kudryashova, 1993 forms three spatially and ecologically isolated groups different from each other in size or shape (morphometric property) only: specimens from the Caucasus are distinct from those from Asia in shape, whereas the Asian specimens from plains and mountains are different from each other in size. We developed a multivariate classification model to separate three closely related species: N. scrupulosa, N. lubrica Kudryashova, 1993 and N. minuta Schluger, 1966. This model is based on five shape variables selected from an initial 17 variables by a best subset analysis using a custom size-correction subroutine. The variable selection procedure slightly improved the predictive power of the model, suggesting that it not only removed redundancy but also reduced 'noise' in the dataset. The overall classification accuracy of this model is 96.2, 96.2 and 95.5%, as estimated by internal validation, external validation and jackknife statistics, respectively. Our analyses resulted in one new synonymy: N. dimidiata Stekolnikov, 1995 is considered to be a synonym of N. lubrica. Both N. scrupulosa and N. lubrica are recorded from new localities. A key to species of the minuta-group incorporating results from our multivariate analyses is presented.

  20. Detecting Spatio-Temporal Modes in Multivariate Data by Entropy Field Decomposition

    PubMed Central

    Frank, Lawrence R.; Galinsky, Vitaly L.

    2016-01-01

    A new data analysis method that addresses a general problem of detecting spatio-temporal variations in multivariate data is presented. The method utilizes two recent and complementary general approaches to data analysis, information field theory (IFT) and entropy spectrum pathways (ESP). Both methods reformulate and incorporate Bayesian theory, and thus use prior information to uncover the underlying structure of the unknown signal. Unification of ESP and IFT creates an approach that is non-Gaussian and non-linear by construction and is found to produce unique spatio-temporal modes of signal behavior that can be ranked according to their significance, from which space-time trajectories of parameter variations can be constructed and quantified. Two brief examples of real world applications of the theory to the analysis of data of completely different and unrelated nature, lacking any underlying similarity, are also presented. The first example provides an analysis of resting state functional magnetic resonance imaging (rsFMRI) data that allowed us to create an efficient and accurate computational method for assessing and categorizing brain activity. The second example demonstrates the potential of the method in the application to the analysis of a strong atmospheric storm circulation system during the complicated stage of tornado development and formation using data recorded by a mobile Doppler radar. Reference implementation of the method will be made available as a part of the QUEST toolkit that is currently under development at the Center for Scientific Computation in Imaging. PMID:27695512

  1. Population genetic structure in a social landscape: barley in a traditional Ethiopian agricultural system

    PubMed Central

    Samberg, Leah H; Fishman, Lila; Allendorf, Fred W

    2013-01-01

    Conservation strategies are increasingly driven by our understanding of the processes and patterns of gene flow across complex landscapes. The expansion of population genetic approaches into traditional agricultural systems requires understanding how social factors contribute to that landscape, and thus to gene flow. This study incorporates extensive farmer interviews and population genetic analysis of barley landraces (Hordeum vulgare) to build a holistic picture of farmer-mediated geneflow in an ancient, traditional agricultural system in the highlands of Ethiopia. We analyze barley samples at 14 microsatellite loci across sites at varying elevations and locations across a contiguous mountain range, and across farmer-identified barley types and management strategies. Genetic structure is analyzed using population-based and individual-based methods, including measures of population differentiation and genetic distance, multivariate Principal Coordinate Analysis, and Bayesian assignment tests. Phenotypic analysis links genetic patterns to traits identified by farmers. We find that differential farmer management strategies lead to markedly different patterns of population structure across elevation classes and barley types. The extent to which farmer seed management appears as a stronger determinant of spatial structure than the physical landscape highlights the need for incorporation of social, landscape, and genetic data for the design of conservation strategies in human-influenced landscapes. PMID:24478796

  2. Bayesian inference on risk differences: an application to multivariate meta-analysis of adverse events in clinical trials.

    PubMed

    Chen, Yong; Luo, Sheng; Chu, Haitao; Wei, Peng

    2013-05-01

    Multivariate meta-analysis is useful in combining evidence from independent studies which involve several comparisons among groups based on a single outcome. For binary outcomes, the commonly used statistical models for multivariate meta-analysis are multivariate generalized linear mixed effects models which assume risks, after some transformation, follow a multivariate normal distribution with possible correlations. In this article, we consider an alternative model for multivariate meta-analysis where the risks are modeled by the multivariate beta distribution proposed by Sarmanov (1966). This model has several attractive features compared to the conventional multivariate generalized linear mixed effects models, including simplicity of the likelihood function, no need to specify a link function, and a closed-form expression of distribution functions for study-specific risk differences. We investigate the finite sample performance of this model by simulation studies and illustrate its use with an application to multivariate meta-analysis of adverse events of tricyclic antidepressants treatment in clinical trials.

  3. Expression of p53, p21 and cyclin D1 in penile cancer: p53 predicts poor prognosis.

    PubMed

    Gunia, Sven; Kakies, Christoph; Erbersdobler, Andreas; Hakenberg, Oliver W; Koch, Stefan; May, Matthias

    2012-03-01

    To evaluate the role of p53, p21 and cyclin D1 expression in patients with penile cancer (PC). Paraffin-embedded tissues from PC specimens from six pathology departments were subjected to a central histopathological review performed by one pathologist. The tissue microarray technique was used for immunostaining which was evaluated by two independent pathologists and correlated with cancer-specific survival (CSS). κ-statistics were used to assess interobserver variability. Uni- and multivariable Cox proportional hazards analysis was applied to assess the independent effects of several prognostic factors on CSS over a median of 32 months (IQR 6-66 months). Specimens and clinical data from 110 men treated surgically for primary PC were collected. p53 staining was positive in 30 and negative in 62 specimens. κ-statistics showed substantial interobserver reproducibility of p53 staining evaluation (κ=0.73; p<0.001). The 5-year CSS rate for the entire study cohort was 74%. Five-year CSS was 84% in p53-negative and 51% in p53-positive PC patients (p=0.003). Multivariable analysis showed p53 (HR=3.20; p=0.041) and pT-stage (HR=4.29; p<0.001) as independent significant prognostic factors for CSS. Cyclin D1 and p21 expression were not correlated with survival. However, incorporating p21 into a multivariable Cox model did contribute to improved model quality for predicting CSS. In patients with PC, the expression of p53 in the primary tumour specimen can be reproducibly assessed and is negatively associated with cancer specific survival.
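
    As an illustration of the interobserver agreement statistic mentioned in this record, the sketch below computes an unweighted Cohen's kappa for two raters. The abstract does not say which kappa variant was used, so the choice of Cohen's kappa is an assumption; the function name `cohen_kappa` and the toy ratings are hypothetical and are not study data.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's kappa for two raters scoring the same items."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("both raters must score the same non-empty set of items")
    n = len(ratings_a)
    # Observed agreement: fraction of items given the same category by both raters.
    p_observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement: product of the raters' marginal category frequencies.
    count_a = Counter(ratings_a)
    count_b = Counter(ratings_b)
    p_expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if p_expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical toy ratings (1 = staining positive, 0 = negative); not study data.
pathologist_1 = [1, 1, 0, 0]
pathologist_2 = [1, 0, 0, 0]
print(cohen_kappa(pathologist_1, pathologist_2))  # 0.5
```

    In the toy example the raters agree on three of four specimens (0.75), chance agreement is 0.5, and kappa is therefore 0.5.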

  4. Comparison of Dissolution Similarity Assessment Methods for Products with Large Variations: f2 Statistics and Model-Independent Multivariate Confidence Region Procedure for Dissolution Profiles of Multiple Oral Products.

    PubMed

    Yoshida, Hiroyuki; Shibata, Hiroko; Izutsu, Ken-Ichi; Goda, Yukihiro

    2017-01-01

    The current Japanese Ministry of Health, Labour and Welfare (MHLW)'s Guideline for Bioequivalence Studies of Generic Products uses averaged dissolution rates for the assessment of dissolution similarity between test and reference formulations. This study clarifies how the application of the model-independent multivariate confidence region procedure (Method B), described in the European Medicines Agency and U.S. Food and Drug Administration guidelines, affects similarity outcomes obtained empirically from dissolution profiles with large variations in individual dissolution rates. Sixty-one datasets of dissolution profiles for immediate release, oral generic, and corresponding innovator products that showed large variation in individual dissolution rates in generic products were assessed on their similarity by using the f2 statistics defined in the MHLW guidelines (MHLW f2 method) and two different Method B procedures, including a bootstrap method applied with f2 statistics (BS method) and a multivariate analysis method using the Mahalanobis distance (MV method). The MHLW f2 and BS methods provided similar dissolution similarities between reference and generic products. Although a small difference in the similarity assessment may be due to the decrease in the lower confidence interval for expected f2 values derived from the large variation in individual dissolution rates, the MV method provided results different from those obtained through the MHLW f2 and BS methods. Analysis of actual dissolution data for products with large individual variations would provide valuable information towards an enhanced understanding of these methods and their possible incorporation in the MHLW guidelines.
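
    For readers unfamiliar with the f2 statistic compared in this record, the following minimal sketch computes the standard similarity factor f2 = 50 * log10(100 / sqrt(1 + mean squared difference)) from mean percent-dissolved values at matching time points. It does not reproduce the MHLW-specific time-point rules, the bootstrap (BS) procedure, or the Mahalanobis (MV) method; the function name `f2_similarity` and the profiles are hypothetical.

```python
import math


def f2_similarity(reference, test):
    """Similarity factor f2 for two mean dissolution profiles (% dissolved)."""
    if len(reference) != len(test) or not reference:
        raise ValueError("profiles must share the same non-empty set of time points")
    n = len(reference)
    # Mean squared difference between the profiles across time points.
    mean_sq_diff = sum((r - t) ** 2 for r, t in zip(reference, test)) / n
    return 50.0 * math.log10(100.0 / math.sqrt(1.0 + mean_sq_diff))


# Hypothetical mean profiles (% dissolved at four time points); not study data.
reference_profile = [20.0, 45.0, 70.0, 85.0]
test_profile = [30.0, 55.0, 80.0, 95.0]
print(round(f2_similarity(reference_profile, test_profile), 2))
```

    Identical profiles give f2 = 100, and an average difference of about 10 percentage points gives a value just under 50, which is why 50 is the usual similarity threshold in the FDA and EMA guidance.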

  5. Robust tumor morphometry in multispectral fluorescence microscopy

    NASA Astrophysics Data System (ADS)

    Tabesh, Ali; Vengrenyuk, Yevgen; Teverovskiy, Mikhail; Khan, Faisal M.; Sapir, Marina; Powell, Douglas; Mesa-Tejada, Ricardo; Donovan, Michael J.; Fernandez, Gerardo

    2009-02-01

    Morphological and architectural characteristics of primary tissue compartments, such as epithelial nuclei (EN) and cytoplasm, provide important cues for cancer diagnosis, prognosis, and therapeutic response prediction. We propose two feature sets for the robust quantification of these characteristics in multiplex immunofluorescence (IF) microscopy images of prostate biopsy specimens. To enable feature extraction, EN and cytoplasm regions were first segmented from the IF images. Then, feature sets consisting of the characteristics of the minimum spanning tree (MST) connecting the EN and the fractal dimension (FD) of gland boundaries were obtained from the segmented compartments. We demonstrated the utility of the proposed features in prostate cancer recurrence prediction on a multi-institution cohort of 1027 patients. Univariate analysis revealed that both FD and one of the MST features were highly effective for predicting cancer recurrence (p <= 0.0001). In multivariate analysis, an MST feature was selected for a model incorporating clinical and image features. The model achieved a concordance index (CI) of 0.73 on the validation set, which was significantly higher than the CI of 0.69 for the standard multivariate model based solely on clinical features currently used in clinical practice (p < 0.0001). The contributions of this work are twofold. First, it is the first demonstration of the utility of the proposed features in morphometric analysis of IF images. Second, this is the largest scale study of the efficacy and robustness of the proposed features in prostate cancer prognosis.
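
    The concordance index reported in this record can be illustrated with a small pairwise implementation. This is a sketch assuming Harrell's definition for right-censored data; the abstract does not state which estimator was used, and the function name `concordance_index` and the toy values are hypothetical.

```python
def concordance_index(times, events, risk_scores):
    """Harrell-style concordance index for right-censored survival data.

    times: follow-up time per subject
    events: 1 if recurrence was observed, 0 if censored
    risk_scores: model output, higher meaning earlier expected recurrence
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            # A pair is comparable when subject i has an observed event
            # strictly before subject j's follow-up time (tied times are skipped).
            if events[i] and times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5  # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical toy data; not study data.
times = [5, 10, 15, 20]
events = [1, 1, 0, 1]
risk = [0.9, 0.7, 0.4, 0.2]
print(concordance_index(times, events, risk))  # 1.0
```

    A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking, so the reported 0.73 versus 0.69 describes a modest improvement in ranking ability.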

  6. Recursive Partitioning Analysis for New Classification of Patients With Esophageal Cancer Treated by Chemoradiotherapy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nomura, Motoo, E-mail: excell@hkg.odn.ne.jp; Department of Clinical Oncology, Aichi Cancer Center Hospital, Nagoya; Department of Radiation Oncology, Aichi Cancer Center Hospital, Nagoya

    2012-11-01

    Background: The 7th edition of the American Joint Committee on Cancer staging system does not include lymph node size in the guidelines for staging patients with esophageal cancer. The objectives of this study were to determine the prognostic impact of the maximum metastatic lymph node diameter (ND) on survival and to develop and validate a new staging system for patients with esophageal squamous cell cancer who were treated with definitive chemoradiotherapy (CRT). Methods: Information on 402 patients with esophageal cancer undergoing CRT at two institutions was reviewed. Univariate and multivariate analyses of data from one institution were used to assess the impact of clinical factors on survival, and recursive partitioning analysis was performed to develop the new staging classification. To assess its clinical utility, the new classification was validated using data from the second institution. Results: By multivariate analysis, gender, T, N, and ND stages were independently and significantly associated with survival (p < 0.05). The resulting new staging classification was based on the T and ND. The four new stages led to good separation of survival curves in both the developmental and validation datasets (p < 0.05). Conclusions: Our results showed that lymph node size is a strong independent prognostic factor and that the new staging system, which incorporated lymph node size, provided good prognostic power, and discriminated effectively for patients with esophageal cancer undergoing CRT.

  7. A mathematical prediction model incorporating molecular subtype for risk of non-sentinel lymph node metastasis in sentinel lymph node-positive breast cancer patients: a retrospective analysis and nomogram development.

    PubMed

    Wang, Na-Na; Yang, Zheng-Jun; Wang, Xue; Chen, Li-Xuan; Zhao, Hong-Meng; Cao, Wen-Feng; Zhang, Bin

    2018-04-25

    Molecular subtype of breast cancer is associated with sentinel lymph node status. We sought to establish a mathematical prediction model that included breast cancer molecular subtype for risk of positive non-sentinel lymph nodes in breast cancer patients with sentinel lymph node metastasis and further validate the model in a separate validation cohort. We reviewed the clinicopathologic data of breast cancer patients with sentinel lymph node metastasis who underwent axillary lymph node dissection between June 16, 2014 and November 16, 2017 at our hospital. Sentinel lymph node biopsy was performed and patients with pathologically proven sentinel lymph node metastasis underwent axillary lymph node dissection. Independent risk factors for non-sentinel lymph node metastasis were assessed in a training cohort by multivariate analysis and incorporated into a mathematical prediction model. The model was further validated in a separate validation cohort, and a nomogram was developed and evaluated for diagnostic performance in predicting the risk of non-sentinel lymph node metastasis. Moreover, we assessed the performance of five different models in predicting non-sentinel lymph node metastasis in the training cohort. In total, 495 cases were eligible for the study, including 291 patients in the training cohort and 204 in the validation cohort. Non-sentinel lymph node metastasis was observed in 33.3% (97/291) of patients in the training cohort. The AUCs of the MSKCC, Tenon, MDA, Ljubljana, and Louisville models in the training cohort were 0.7613, 0.7142, 0.7076, 0.7483, and 0.671, respectively. Multivariate regression analysis indicated that tumor size (OR = 1.439; 95% CI 1.025-2.021; P = 0.036), sentinel lymph node macro-metastasis versus micro-metastasis (OR = 5.063; 95% CI 1.111-23.074; P = 0.036), the number of positive sentinel lymph nodes (OR = 2.583, 95% CI 1.714-3.892; P < 0.001), and the number of negative sentinel lymph nodes (OR = 0.686, 95% CI 0.575-0.817; P < 0.001) were independent statistically significant predictors of non-sentinel lymph node metastasis. Furthermore, luminal B (OR = 3.311, 95% CI 1.593-6.884; P = 0.001) and HER2 overexpression (OR = 4.308, 95% CI 1.097-16.912; P = 0.036) were independent and statistically significant predictors of non-sentinel lymph node metastasis versus luminal A. A regression model based on the results of multivariate analysis was established to predict the risk of non-sentinel lymph node metastasis, which had an AUC of 0.8188. The model was validated in the validation cohort and showed excellent diagnostic performance. The mathematical prediction model that incorporates five variables including breast cancer molecular subtype demonstrates excellent diagnostic performance in assessing the risk of non-sentinel lymph node metastasis in sentinel lymph node-positive patients. The prediction model could help surgeons in evaluating the risk of non-sentinel lymph node involvement for breast cancer patients; however, the model requires further validation in prospective studies.
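
    The AUC values compared in this record can be read as the probability that a randomly chosen patient with non-sentinel node metastasis receives a higher predicted risk than a randomly chosen patient without it. The sketch below computes AUC directly from that pairwise definition; the function name `auc_from_scores` and the toy labels and scores are hypothetical and unrelated to the study cohorts.

```python
def auc_from_scores(labels, scores):
    """Area under the ROC curve from its pairwise (Mann-Whitney) definition."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("both outcome classes must be present")
    wins = 0.0
    for p in positives:
        for q in negatives:
            if p > q:
                wins += 1.0   # positive case ranked above negative case
            elif p == q:
                wins += 0.5   # tied scores count as half
    return wins / (len(positives) * len(negatives))


# Hypothetical toy predictions; not study data.
labels = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
print(auc_from_scores(labels, scores))  # 0.75
```

    The quadratic pair loop is adequate for cohorts of a few hundred patients; rank-based formulas give the same value more efficiently for large samples.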

  8. Incorporating interaction networks into the determination of functionally related hit genes in genomic experiments with Markov random fields

    PubMed Central

    Robinson, Sean; Nevalainen, Jaakko; Pinna, Guillaume; Campalans, Anna; Radicella, J. Pablo; Guyon, Laurent

    2017-01-01

    Abstract Motivation: Incorporating gene interaction data into the identification of ‘hit’ genes in genomic experiments is a well-established approach leveraging the ‘guilt by association’ assumption to obtain a network-based hit list of functionally related genes. We aim to develop a method to allow for multivariate gene scores and multiple hit labels in order to extend the analysis of genomic screening data within such an approach. Results: We propose a Markov random field-based method to achieve our aim and show that the particular advantages of our method compared with those currently used lead to new insights in previously analysed data as well as for our own motivating data. Our method additionally achieves the best performance in an independent simulation experiment. The real data applications we consider comprise a survival analysis and differential expression experiment and a cell-based RNA interference functional screen. Availability and implementation: We provide all of the data and code related to the results in the paper. Contact: sean.j.robinson@utu.fi or laurent.guyon@cea.fr Supplementary information: Supplementary data are available at Bioinformatics online. PMID:28881978

  9. Layer-by-Layer Polyelectrolyte Encapsulation of Mycoplasma pneumoniae for Enhanced Raman Detection

    PubMed Central

    Rivera-Betancourt, Omar E.; Sheppard, Edward S.; Krause, Duncan C.; Dluhy, Richard A.

    2014-01-01

    Mycoplasma pneumoniae is a major cause of respiratory disease in humans and accounts for as much as 20% of all community-acquired pneumonia. Existing mycoplasma diagnosis is primarily limited by the poor success rate at culturing the bacteria from clinical samples. There is a critical need to develop a new platform for mycoplasma detection that has high sensitivity, specificity, and expediency. Here we report the layer-by-layer (LBL) encapsulation of M. pneumoniae cells with Ag nanoparticles in a matrix of the polyelectrolytes poly(allylamine hydrochloride) (PAH) and poly(styrene sulfonate) (PSS). We evaluated nanoparticle encapsulated mycoplasma cells as a platform for the differentiation of M. pneumoniae strains using surface enhanced Raman scattering (SERS) combined with multivariate statistical analysis. Three separate M. pneumoniae strains (M129, FH and II-3) were studied. Scanning electron microscopy and fluorescence imaging showed that the Ag nanoparticles were incorporated between the oppositely charged polyelectrolyte layers. SERS spectra showed that LBL encapsulation provides excellent spectral reproducibility. Multivariate statistical analysis of the Raman spectra differentiated the three M. pneumoniae strains with 97 – 100% specificity and sensitivity, and low (0.1 – 0.4) root mean square error. These results indicated that nanoparticle and polyelectrolyte encapsulation of M. pneumoniae is a potentially powerful platform for rapid and sensitive SERS-based bacterial identification. PMID:25017005

  10. Partial Least Squares for Discrimination in fMRI Data

    PubMed Central

    Andersen, Anders H.; Rayens, William S.; Liu, Yushu; Smith, Charles D.

    2011-01-01

    Multivariate methods for discrimination were used in the comparison of brain activation patterns between groups of cognitively normal women who are at either high or low Alzheimer's disease risk based on family history and apolipoprotein-E4 status. Linear discriminant analysis (LDA) was preceded by dimension reduction using either principal component analysis (PCA), partial least squares (PLS), or a new oriented partial least squares (OrPLS) method. The aim was to identify a spatial pattern of functionally connected brain regions that was differentially expressed by the risk groups and yielded optimal classification accuracy. Multivariate dimension reduction is required prior to LDA when the data contains more feature variables than there are observations on individual subjects. Whereas PCA has been commonly used to identify covariance patterns in neuroimaging data, this approach only identifies gross variability and is not capable of distinguishing among-groups from within-groups variability. PLS and OrPLS provide a more focused dimension reduction by incorporating information on class structure and therefore lead to more parsimonious models for discrimination. Performance was evaluated in terms of the cross-validated misclassification rates. The results support the potential of using fMRI as an imaging biomarker or diagnostic tool to discriminate individuals with disease or high risk. PMID:22227352

  11. Better Working Memory and Motor Inhibition in Children Who Delayed Gratification

    PubMed Central

    Yu, Junhong; Kam, Chi-Ming; Lee, Tatia M. C.

    2016-01-01

    Background: Despite the extensive research on delayed gratification over the past few decades, the neurocognitive processes that subserve delayed gratification remain unclear. As an exploratory step in studying these processes, the present study aims to describe the executive function profiles of children who were successful at delaying gratification and those who were not. Methods: A total of 138 kindergarten students (65 males, 73 females; mean age = 44 months, SD = 3.5; age range = 37–53 months) were administered a delayed gratification task, a 1-back test, a Day/night Stroop test and a Go/no-go test. The outcome measures of these tests were then analyzed between groups using a Multivariate Analysis of Variance, and subsequently a Multivariate Analysis of Covariance incorporating age as a covariate. Results: Children who were successful in delaying gratification were significantly older and had significantly better outcomes in the 1-back test and go/no-go test. With the exception of the number of hits in the go/no-go test, all other group differences remained significant after controlling for age. Conclusion: Children who were successful in delaying gratification showed better working memory and motor inhibition relative to those who failed the delayed gratification task. The implications of these findings are discussed. PMID:27493638

  12. Multivariate analysis in thoracic research.

    PubMed

    Mengual-Macenlle, Noemí; Marcos, Pedro J; Golpe, Rafael; González-Rivas, Diego

    2015-03-01

    Multivariate analysis is based on the observation and analysis of more than one statistical outcome variable at a time. In design and analysis, the technique is used to perform trade studies across multiple dimensions while taking into account the effects of all variables on the responses of interest. Multivariate methods were developed to analyze large databases and increasingly complex data. Since the best way to represent the knowledge of reality is modeling, we should use multivariate statistical methods. Multivariate methods are designed to analyze data sets simultaneously, i.e., to analyze different variables for each person or object studied. Keep in mind at all times that all variables must be treated in a way that accurately reflects the reality of the problem addressed. There are different types of multivariate analysis and each one should be employed according to the type of variables to analyze: dependence, interdependence and structural methods. In conclusion, multivariate methods are ideal for the analysis of large data sets and for finding cause-and-effect relationships between variables; there is a wide range of analysis types that we can use.

  13. Application of multivariable search techniques to structural design optimization

    NASA Technical Reports Server (NTRS)

    Jones, R. T.; Hague, D. S.

    1972-01-01

    Multivariable optimization techniques are applied to a particular class of minimum weight structural design problems: the design of an axially loaded, pressurized, stiffened cylinder. Minimum weight designs are obtained by a variety of search algorithms: first- and second-order, elemental perturbation, and randomized techniques. An exterior penalty function approach to constrained minimization is employed. Some comparisons are made with solutions obtained by an interior penalty function procedure. In general, it would appear that an interior penalty function approach may not be as well suited to the class of design problems considered as the exterior penalty function approach. It is also shown that a combination of search algorithms will tend to arrive at an extremal design in a more reliable manner than a single algorithm. The effect of incorporating realistic geometrical constraints on stiffener cross-sections is investigated. A limited comparison is made between minimum weight cylinders designed on the basis of a linear stability analysis and cylinders designed on the basis of empirical buckling data. Finally, a technique for locating more than one extremal is demonstrated.
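
    The exterior penalty function approach named in this record can be sketched on a toy one-dimensional problem: minimize x^2 subject to x >= 1. The constraint violation is squared, scaled by a growing weight r, and added to the objective, so each unconstrained minimizer approaches the constrained optimum from the infeasible side. This is only an illustration of the general technique; the cylinder design problem, the report's search algorithms, and its penalty schedule are not reproduced here, and all names are hypothetical.

```python
import math


def golden_section_minimize(func, lo, hi, tol=1e-9):
    """Minimize a unimodal function of one variable on the interval [lo, hi]."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    c = b - inv_phi * (b - a)
    d = a + inv_phi * (b - a)
    while abs(b - a) > tol:
        if func(c) < func(d):
            b = d
        else:
            a = c
        c = b - inv_phi * (b - a)
        d = a + inv_phi * (b - a)
    return (a + b) / 2.0


def minimize_with_exterior_penalty(objective, constraint, lo, hi, penalty_weights):
    """Solve a sequence of penalized problems; constraint(x) >= 0 means feasible."""
    history = []
    for r in penalty_weights:
        def penalized(x, r=r):
            # Only infeasible points (constraint(x) < 0) are penalized.
            violation = max(0.0, -constraint(x))
            return objective(x) + r * violation ** 2
        history.append((r, golden_section_minimize(penalized, lo, hi)))
    return history


# Toy problem: minimize x^2 subject to x - 1 >= 0 (constrained optimum at x = 1).
for r, x in minimize_with_exterior_penalty(
        lambda x: x * x, lambda x: x - 1.0, -5.0, 5.0, [1.0, 10.0, 100.0, 1000.0]):
    print(r, x)
```

    For this toy problem the penalized minimizer is r / (1 + r), so the iterates stay slightly infeasible and move toward x = 1 as r grows. An interior penalty (barrier) method would instead approach the optimum from inside the feasible region.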

  14. Support vector machine learning-based fMRI data group analysis.

    PubMed

    Wang, Ze; Childress, Anna R; Wang, Jiongjiong; Detre, John A

    2007-07-15

    To explore the multivariate nature of fMRI data and to consider the inter-subject brain response discrepancies, a multivariate and brain response model-free method is fundamentally required. Two such methods are presented in this paper by integrating a machine learning algorithm, the support vector machine (SVM), and the random effect model. Without any brain response modeling, SVM was used to extract a whole brain spatial discriminance map (SDM), representing the brain response difference between the contrasted experimental conditions. Population inference was then obtained through the random effect analysis (RFX) or permutation testing (PMU) on the individual subjects' SDMs. Applied to arterial spin labeling (ASL) perfusion fMRI data, SDM RFX yielded lower false-positive rates in the null hypothesis test and higher detection sensitivity for synthetic activations with varying cluster size and activation strengths, compared to the univariate general linear model (GLM)-based RFX. For a sensory-motor ASL fMRI study, both SDM RFX and SDM PMU yielded similar activation patterns to GLM RFX and GLM PMU, respectively, but with higher t values and cluster extensions at the same significance level. Capitalizing on the absence of temporal noise correlation in ASL data, this study also incorporated PMU in the individual-level GLM and SVM analyses accompanied by group-level analysis through RFX or group-level PMU. Providing inferences on the probability of being activated or deactivated at each voxel, these individual-level PMU-based group analysis methods can be used to threshold the analysis results of GLM RFX, SDM RFX or SDM PMU.

  15. Correlative and multivariate analysis of increased radon concentration in underground laboratory.

    PubMed

    Maletić, Dimitrije M; Udovičić, Vladimir I; Banjanac, Radomir M; Joković, Dejan R; Dragić, Aleksandar L; Veselinović, Nikola B; Filipović, Jelena

    2014-11-01

    The results of an analysis using correlative and multivariate methods, as developed for data analysis in high-energy physics and implemented in the Toolkit for Multivariate Analysis software package, of the relations between variations of increased radon concentration and climate variables in a shallow underground laboratory are presented. Multivariate regression analysis identified a number of multivariate methods which can give a good evaluation of increased radon concentrations based on climate variables. The use of the multivariate regression methods will enable the investigation of the relations of specific climate variables with increased radon concentrations by analysis of regression methods resulting in 'mapped' underlying functional behaviour of radon concentrations depending on a wide spectrum of climate variables. © The Author 2014. Published by Oxford University Press. All rights reserved.

  16. Digital controllers for VTOL aircraft

    NASA Technical Reports Server (NTRS)

    Stengel, R. F.; Broussard, J. R.; Berry, P. W.

    1976-01-01

    Using linear-optimal estimation and control techniques, digital-adaptive control laws have been designed for a tandem-rotor helicopter which is equipped for fully automatic flight in terminal area operations. Two distinct discrete-time control laws are designed to interface with velocity-command and attitude-command guidance logic, and each incorporates proportional-integral compensation for non-zero-set-point regulation, as well as reduced-order Kalman filters for sensor blending and noise rejection. Adaptation to flight condition is achieved with a novel gain-scheduling method based on correlation and regression analysis. The linear-optimal design approach is found to be a valuable tool in the development of practical multivariable control laws for vehicles which evidence significant coupling and insufficient natural stability.

  17. Multivariate Methods for Meta-Analysis of Genetic Association Studies.

    PubMed

    Dimou, Niki L; Pantavou, Katerina G; Braliou, Georgia G; Bagos, Pantelis G

    2018-01-01

    Multivariate meta-analysis of genetic association studies and genome-wide association studies has received remarkable attention as it improves the precision of the analysis. Here, we review, summarize and present in a unified framework methods for multivariate meta-analysis of genetic association studies and genome-wide association studies. Starting with the statistical methods used for robust analysis and genetic model selection, we present in brief univariate methods for meta-analysis and we then scrutinize multivariate methodologies. Multivariate models of meta-analysis for single gene-disease association studies, including models for haplotype association studies, multiple linked polymorphisms and multiple outcomes, are discussed. The popular Mendelian randomization approach and special cases of meta-analysis addressing issues such as the assumption of the mode of inheritance, deviation from Hardy-Weinberg Equilibrium and gene-environment interactions are also presented. All available methods are enriched with practical applications, and methodologies that could be developed in the future are discussed. Links for all available software implementing multivariate meta-analysis methods are also provided.

  18. Independent Prognostic Value of Serum Markers in Diffuse Large B-Cell Lymphoma in the Era of the NCCN-IPI.

    PubMed

    Melchardt, Thomas; Troppan, Katharina; Weiss, Lukas; Hufnagl, Clemens; Neureiter, Daniel; Tränkenschuh, Wolfgang; Schlick, Konstantin; Huemer, Florian; Deutsch, Alexander; Neumeister, Peter; Greil, Richard; Pichler, Martin; Egle, Alexander

    2015-12-01

    Several serum parameters have been evaluated for adding prognostic value to clinical scoring systems in diffuse large B-cell lymphoma (DLBCL), but none of the reports used multivariate testing of more than one parameter at a time. The goal of this study was to validate widely available serum parameters for their independent prognostic impact in the era of the National Comprehensive Cancer Network-International Prognostic Index (NCCN-IPI) score to determine which were the most useful. This retrospective bicenter analysis includes 515 unselected patients with DLBCL who were treated with rituximab and anthracycline-based chemoimmunotherapy between 2004 and January 2014. Anemia, high C-reactive protein, and high bilirubin levels had an independent prognostic value for survival in multivariate analyses in addition to the NCCN-IPI, whereas neutrophil-to-lymphocyte ratio, high gamma-glutamyl transferase levels, and platelets-to-lymphocyte ratio did not. In our cohort, we describe the most promising markers to improve the NCCN-IPI. Anemia and high C-reactive protein levels retain their power in multivariate testing even in the era of the NCCN-IPI. The negative role of high bilirubin levels may reflect its function as a marker of liver function. Further studies are warranted to incorporate these markers into prognostic models and define their role relative to novel molecular markers. Copyright © 2015 by the National Comprehensive Cancer Network.

  19. Enhanced ID Pit Sizing Using Multivariate Regression Algorithm

    NASA Astrophysics Data System (ADS)

    Krzywosz, Kenji

    2007-03-01

    EPRI is funding a program to enhance and improve the reliability of inside diameter (ID) pit sizing for balance-of-plant heat exchangers, such as condensers and component cooling water heat exchangers. More traditional approaches to ID pit sizing involve the use of frequency-specific amplitude or phase angles. The enhanced multivariate regression algorithm for ID pit depth sizing incorporates three simultaneous input parameters of frequency, amplitude, and phase angle. A set of calibration data sets consisting of machined pits of various rounded and elongated shapes and depths was acquired in the frequency range of 100 kHz to 1 MHz for stainless steel tubing having a nominal wall thickness of 0.028 inch. To add noise to the acquired data set, each test sample was rotated and test data acquired at 3, 6, 9, and 12 o'clock positions. The ID pit depths were estimated using second-order and fourth-order regression functions by relying on normalized amplitude and phase angle information from multiple frequencies. Due to the unique damage morphology associated with the microbiologically-influenced ID pits, it was necessary to modify the elongated calibration standard-based algorithms by relying on the algorithm developed solely from the destructive sectioning results. This paper presents the use of the transformed multivariate regression algorithm to estimate ID pit depths and compares the results with the traditional univariate phase angle analysis. Both estimates were then compared with the destructive sectioning results.

  20. Insights to Galaxy Evolution Utilizing a Multivariate Comparison of Circumgalactic OVI and MgII

    NASA Astrophysics Data System (ADS)

    Lewis, James; Churchill, Christopher; Nielsen, Nikole; Kacprzak, Glenn; Muzahid, Sowgat; Charlton, Jane

    2018-01-01

    We present a promising multivariate method to categorize inter-related astronomical data in meaningful ways. We use data from the MAGIICAT and "Multiphase Galaxy Halos" surveys and limit our sample to those galaxies which are imaged with the Hubble Space Telescope and for which the Circumgalactic Medium (CGM) is measured using high-resolution quasar spectra (HIRES/COS). Utilizing the method to categorize data about the CGM and its host galaxy yields distinct categories of CGM-galaxy pairs that imply a common fate for the outflows of MgII and OVI in redder galaxies. The analysis reveals a lack of circumgalactic OVI in lower mass, bluer (younger) galaxies, and that as the blue galaxies gain mass and age along the green valley strong OVI appears in the CGM predominately along the minor axes. But as the galaxies continue to gain mass and age into the red sequence strong OVI gas is found primarily along the major axes. Furthermore, we find a population of low mass red galaxies in which only weak, uniform, circumgalactic OVI is found. Incorporating our multivariate results for circumgalactic MgII, including evidence for quenching of star formation via weak circumgalactic MgII preferentially found along the minor axes of redder galaxies, and invoking the similarity of OVI column densities and kinematic spreads along the major and minor axes, we infer that OVI is ancient gas in the CGM.

  1. Estimation and Psychometric Analysis of Component Profile Scores via Multivariate Generalizability Theory

    ERIC Educational Resources Information Center

    Grochowalski, Joseph H.

    2015-01-01

    Component Universe Score Profile analysis (CUSP) is introduced in this paper as a psychometric alternative to multivariate profile analysis. The theoretical foundations of CUSP analysis are reviewed, which include multivariate generalizability theory and constrained principal components analysis. Because CUSP is a combination of generalizability…

  2. A novel combined approach of diffuse reflectance UV-Vis-NIR spectroscopy and multivariate analysis for non-destructive examination of blue ballpoint pen inks in forensic application

    NASA Astrophysics Data System (ADS)

    Kumar, Raj; Sharma, Vishal

    2017-03-01

    The present research is focused on the analysis of writing inks using destructive UV-Vis spectroscopy (dissolution of ink by the solvent) and non-destructive diffuse reflectance UV-Vis-NIR spectroscopy along with chemometrics. Fifty-seven samples of blue ballpoint pen inks were analyzed under optimum conditions to determine the differences in spectral features of inks among same and different manufacturers. Normalization was performed on the spectroscopic data before chemometric analysis. Principal Component Analysis (PCA) and K-means cluster analysis were used on the data to ascertain whether the blue ballpoint pen inks could be differentiated by their UV-Vis/UV-Vis-NIR spectra. The discriminating power is calculated by qualitative analysis through visual comparison of the spectra (absorbance peaks) produced by the destructive and non-destructive methods. In the latter two methods, the pairwise comparison is made by incorporating the clustering method. It is found that the chemometric method provides better discriminating power (98.72% and 99.46%, in destructive and non-destructive, respectively) in comparison to the qualitative analysis (69.67%).

  3. Multivariate carbon and nitrogen stable isotope model for the reconstruction of prehistoric human diet.

    PubMed

    Froehle, A W; Kellner, C M; Schoeninger, M J

    2012-03-01

    Using a sample of published archaeological data, we expand on an earlier bivariate carbon model for diet reconstruction by adding bone collagen nitrogen stable isotope values (δ¹⁵N), which provide information on trophic level and consumption of terrestrial vs. marine protein. The bivariate carbon model (δ¹³C(apatite) vs. δ¹³C(collagen)) provides detailed information on the isotopic signatures of whole diet and dietary protein, but is limited in its ability to distinguish between C₄ and marine protein. Here, using cluster analysis and discriminant function analysis, we generate a multivariate diet reconstruction model that incorporates δ¹³C(apatite), δ¹³C(collagen), and δ¹⁵N holistically. Inclusion of the δ¹⁵N data proves useful in resolving protein-related limitations of the bivariate carbon model, and splits the sample into five distinct dietary clusters. Two significant discriminant functions account for 98.8% of the sample variance, providing a multivariate model for diet reconstruction. Both carbon variables dominate the first function, while δ¹⁵N most strongly influences the second. Independent support for the functions' ability to accurately classify individuals according to diet comes from a small sample of experimental rats, which cluster as expected from their diets. The new model also provides a statistical basis for distinguishing between food sources with similar isotopic signatures, as in a previously analyzed archaeological population from Saipan (see Ambrose et al.: AJPA 104(1997) 343-361). Our model suggests that the Saipan islanders' ¹³C-enriched signal derives mainly from sugarcane, not seaweed. Further development and application of this model can similarly improve dietary reconstructions in archaeological, paleontological, and primatological contexts. Copyright © 2011 Wiley Periodicals, Inc.

  4. Exploring connectivity with large-scale Granger causality on resting-state functional MRI.

    PubMed

    DSouza, Adora M; Abidin, Anas Z; Leistritz, Lutz; Wismüller, Axel

    2017-08-01

    Large-scale Granger causality (lsGC) is a recently developed, resting-state functional MRI (fMRI) connectivity analysis approach that estimates multivariate voxel-resolution connectivity. Unlike most commonly used multivariate approaches, which establish coarse-resolution connectivity by aggregating voxel time-series to avoid an underdetermined problem, lsGC estimates voxel-resolution, fine-grained connectivity by incorporating an embedded dimension reduction. We investigate application of lsGC on realistic fMRI simulations, modeling smoothing of neuronal activity by the hemodynamic response function and repetition time (TR), and on empirical resting-state fMRI data. Subsequently, functional subnetworks are extracted from lsGC connectivity measures for both datasets and validated quantitatively. We also provide guidelines to select lsGC free parameters. Results indicate that lsGC reliably recovers underlying network structure with an area under the receiver operating characteristic curve (AUC) of 0.93 at TR=1.5s for a 10-min session of fMRI simulations. Furthermore, subnetworks of closely interacting modules are recovered from the aforementioned lsGC networks. Results on empirical resting-state fMRI data demonstrate recovery of visual and motor cortex in close agreement with spatial maps obtained from (i) visuo-motor fMRI stimulation task-sequence (Accuracy=0.76) and (ii) independent component analysis (ICA) of resting-state fMRI (Accuracy=0.86). Compared with the conventional Granger causality approach (AUC=0.75), lsGC produces better network recovery on fMRI simulations. Furthermore, the conventional approach cannot recover functional subnetworks from empirical fMRI data, since quantifying voxel-resolution connectivity is not possible as a consequence of encountering an underdetermined problem. Functional network recovery from fMRI data suggests that lsGC gives useful insight into connectivity patterns from resting-state fMRI at a multivariate voxel-resolution. Copyright © 2017 Elsevier B.V. All rights reserved.

  5. Multivariate meta-analysis: potential and promise.

    PubMed

    Jackson, Dan; Riley, Richard; White, Ian R

    2011-09-10

    The multivariate random effects model is a generalization of the standard univariate model. Multivariate meta-analysis is becoming more commonly used and the techniques and related computer software, although continually under development, are now in place. In order to raise awareness of the multivariate methods, and discuss their advantages and disadvantages, we organized a one day 'Multivariate meta-analysis' event at the Royal Statistical Society. In addition to disseminating the most recent developments, we also received an abundance of comments, concerns, insights, critiques and encouragement. This article provides a balanced account of the day's discourse. By giving others the opportunity to respond to our assessment, we hope to ensure that the various view points and opinions are aired before multivariate meta-analysis simply becomes another widely used de facto method without any proper consideration of it by the medical statistics community. We describe the areas of application that multivariate meta-analysis has found, the methods available, the difficulties typically encountered and the arguments for and against the multivariate methods, using four representative but contrasting examples. We conclude that the multivariate methods can be useful, and in particular can provide estimates with better statistical properties, but also that these benefits come at the price of making more assumptions which do not result in better inference in every case. Although there is evidence that multivariate meta-analysis has considerable potential, it must be even more carefully applied than its univariate counterpart in practice. Copyright © 2011 John Wiley & Sons, Ltd.

  6. Regression Models For Multivariate Count Data

    PubMed Central

    Zhang, Yiwen; Zhou, Hua; Zhou, Jin; Sun, Wei

    2016-01-01

    Data with multivariate count responses frequently occur in modern applications. The commonly used multinomial-logit model is limiting due to its restrictive mean-variance structure. For instance, analyzing count data from the recent RNA-seq technology by the multinomial-logit model leads to serious errors in hypothesis testing. The ubiquity of over-dispersion and complicated correlation structures among multivariate counts calls for more flexible regression models. In this article, we study some generalized linear models that incorporate various correlation structures among the counts. Current literature lacks a treatment of these models, partly due to the fact that they do not belong to the natural exponential family. We study the estimation, testing, and variable selection for these models in a unifying framework. The regression models are compared on both synthetic and real RNA-seq data. PMID:28348500

  7. Regression Models For Multivariate Count Data.

    PubMed

    Zhang, Yiwen; Zhou, Hua; Zhou, Jin; Sun, Wei

    2017-01-01

    Data with multivariate count responses frequently occur in modern applications. The commonly used multinomial-logit model is limiting due to its restrictive mean-variance structure. For instance, analyzing count data from the recent RNA-seq technology by the multinomial-logit model leads to serious errors in hypothesis testing. The ubiquity of over-dispersion and complicated correlation structures among multivariate counts calls for more flexible regression models. In this article, we study some generalized linear models that incorporate various correlation structures among the counts. Current literature lacks a treatment of these models, partly due to the fact that they do not belong to the natural exponential family. We study the estimation, testing, and variable selection for these models in a unifying framework. The regression models are compared on both synthetic and real RNA-seq data.

  8. Excellent real-world outcomes of adults with Burkitt lymphoma treated with CODOX-M/IVAC plus or minus rituximab.

    PubMed

    Zhu, Katie Y; Song, Kevin W; Connors, Joseph M; Leitch, Heather; Barnett, Michael J; Ramadan, Khaled; Slack, Graham W; Abou Mourad, Yasser; Forrest, Donna L; Hogge, Donna E; Nantel, Stephen H; Narayanan, Sujaatha; Nevill, Thomas J; Power, Maryse M; Sanford, David S; Sutherland, Heather J; Tucker, Tracy; Toze, Cynthia L; Sehn, Laurie H; Broady, Raewyn; Gerrie, Alina S

    2018-06-01

    Treatment of Burkitt lymphoma (BL) with intensive, multi-agent chemotherapy with aggressive central nervous system (CNS) prophylaxis results in high cure rates, although no regimen is standard of care. We examined population-based survival outcomes of adults with BL treated with a modified combination of cyclophosphamide, vincristine, doxorubicin, prednisone and systemic high-dose methotrexate (MTX) (CODOX-M) with IVAC (ifosfamide, mesna, etoposide, cytarabine and intrathecal MTX) (CODOX-M/IVAC) ± rituximab over a 15-year period in British Columbia. For the 81 patients identified (including 8 with CNS involvement and 18 with human immunodeficiency virus-associated BL), 5-year progression-free survival (PFS) and overall survival (OS) were 75% [95% confidence interval (CI): 63-83%] and 77% (95% CI: 66-85%), respectively, with no treatment-related deaths. Those who completed the regimen per protocol (n = 38) had significantly improved 5-year PFS 86% (P = 0·04) and OS 92% (P = 0·008), as did those under 60 years with 5-year PFS 82% (P = 0·005) and OS 86% (P = 0·002), which remained significant in multivariate analysis [PFS: hazard ratio (HR) 3·36, P = 0·018; OS HR 4·03, P = 0·012]. Incorporation of high-dose systemic methotrexate also significantly affected multivariate survival outcomes (OS HR 0·28, P = 0·025). Stem cell transplant in first remission had no effect on OS or PFS. This large, real-world analysis of BL patients treated with CODOX-M/IVAC ± rituximab demonstrates excellent survival outcomes comparable to clinical trials. These results help to serve as a benchmark when comparing curative therapies for BL patients as novel regimens are incorporated into clinical practice. © 2018 John Wiley & Sons Ltd.

  9. Free Software and Multivariable Calculus

    ERIC Educational Resources Information Center

    Nord, Gail M.

    2011-01-01

    Calculators and computers make new modes of instruction possible; yet, at the same time they pose hardships for school districts and mathematics educators trying to incorporate technology with limited monetary resources. In the "Standards," a recommended classroom is one in which calculators, computers, courseware, and manipulative materials are…

  10. Teaching Confirmatory Factor Analysis to Non-Statisticians: A Case Study for Estimating Composite Reliability of Psychometric Instruments

    PubMed Central

    Gajewski, Byron J.; Jiang, Yu; Yeh, Hung-Wen; Engelman, Kimberly; Teel, Cynthia; Choi, Won S.; Greiner, K. Allen; Daley, Christine Makosky

    2013-01-01

    Texts and software that we are currently using for teaching multivariate analysis to non-statisticians lack in the delivery of confirmatory factor analysis (CFA). The purpose of this paper is to provide educators with a complement to these resources that includes CFA and its computation. We focus on how to use CFA to estimate a “composite reliability” of a psychometric instrument. This paper provides guidance for introducing, via a case-study, the non-statistician to CFA. As a complement to our instruction about the more traditional SPSS, we successfully piloted the software R for estimating CFA on nine non-statisticians. This approach can be used with healthcare graduate students taking a multivariate course, as well as modified for community stakeholders of our Center for American Indian Community Health (e.g. community advisory boards, summer interns, & research team members). The placement of CFA at the end of the class is strategic and gives us an opportunity to do some innovative teaching: (1) build ideas for understanding the case study using previous course work (such as ANOVA); (2) incorporate multi-dimensional scaling (that students already learned) into the selection of a factor structure (new concept); (3) use interactive data from the students (active learning); (4) review matrix algebra and its importance to psychometric evaluation; (5) show students how to do the calculation on their own; and (6) give students access to an actual recent research project. PMID:24772373

  11. Multivariate Models for Normal and Binary Responses in Intervention Studies

    ERIC Educational Resources Information Center

    Pituch, Keenan A.; Whittaker, Tiffany A.; Chang, Wanchen

    2016-01-01

    Use of multivariate analysis (e.g., multivariate analysis of variance) is common when normally distributed outcomes are collected in intervention research. However, when mixed responses--a set of normal and binary outcomes--are collected, standard multivariate analyses are no longer suitable. While mixed responses are often obtained in…

  12. Deconstructing multivariate decoding for the study of brain function.

    PubMed

    Hebart, Martin N; Baker, Chris I

    2017-08-04

    Multivariate decoding methods were developed originally as tools to enable accurate predictions in real-world applications. The realization that these methods can also be employed to study brain function has led to their widespread adoption in the neurosciences. However, prior to the rise of multivariate decoding, the study of brain function was firmly embedded in a statistical philosophy grounded on univariate methods of data analysis. In this way, multivariate decoding for brain interpretation grew out of two established frameworks: multivariate decoding for predictions in real-world applications, and classical univariate analysis based on the study and interpretation of brain activation. We argue that this led to two confusions, one reflecting a mixture of multivariate decoding for prediction or interpretation, and the other a mixture of the conceptual and statistical philosophies underlying multivariate decoding and classical univariate analysis. Here we attempt to systematically disambiguate multivariate decoding for the study of brain function from the frameworks it grew out of. After elaborating these confusions and their consequences, we describe six, often unappreciated, differences between classical univariate analysis and multivariate decoding. We then focus on how the common interpretation of what is signal and noise changes in multivariate decoding. Finally, we use four examples to illustrate where these confusions may impact the interpretation of neuroimaging data. We conclude with a discussion of potential strategies to help resolve these confusions in interpreting multivariate decoding results, including the potential departure from multivariate decoding methods for the study of brain function. Copyright © 2017. Published by Elsevier Inc.

  13. Analysis of cohort studies with multivariate and partially observed disease classification data.

    PubMed

    Chatterjee, Nilanjan; Sinha, Samiran; Diver, W Ryan; Feigelson, Heather Spencer

    2010-09-01

    Complex diseases like cancers can often be classified into subtypes using various pathological and molecular traits of the disease. In this article, we develop methods for analysis of disease incidence in cohort studies incorporating data on multiple disease traits using a two-stage semiparametric Cox proportional hazards regression model that allows one to examine the heterogeneity in the effect of the covariates by the levels of the different disease traits. For inference in the presence of missing disease traits, we propose a generalization of an estimating equation approach for handling missing cause of failure in competing-risk data. We prove asymptotic unbiasedness of the estimating equation method under a general missing-at-random assumption and propose a novel influence-function-based sandwich variance estimator. The methods are illustrated using simulation studies and a real data application involving the Cancer Prevention Study II nutrition cohort.

  14. Multivariate meta-analysis: Potential and promise

    PubMed Central

    Jackson, Dan; Riley, Richard; White, Ian R

    2011-01-01

    The multivariate random effects model is a generalization of the standard univariate model. Multivariate meta-analysis is becoming more commonly used and the techniques and related computer software, although continually under development, are now in place. In order to raise awareness of the multivariate methods, and discuss their advantages and disadvantages, we organized a one day ‘Multivariate meta-analysis’ event at the Royal Statistical Society. In addition to disseminating the most recent developments, we also received an abundance of comments, concerns, insights, critiques and encouragement. This article provides a balanced account of the day's discourse. By giving others the opportunity to respond to our assessment, we hope to ensure that the various view points and opinions are aired before multivariate meta-analysis simply becomes another widely used de facto method without any proper consideration of it by the medical statistics community. We describe the areas of application that multivariate meta-analysis has found, the methods available, the difficulties typically encountered and the arguments for and against the multivariate methods, using four representative but contrasting examples. We conclude that the multivariate methods can be useful, and in particular can provide estimates with better statistical properties, but also that these benefits come at the price of making more assumptions which do not result in better inference in every case. Although there is evidence that multivariate meta-analysis has considerable potential, it must be even more carefully applied than its univariate counterpart in practice. Copyright © 2011 John Wiley & Sons, Ltd. PMID:21268052

  15. V/STOL propulsion control analysis: Phase 2, task 5-9

    NASA Technical Reports Server (NTRS)

    1981-01-01

    Typical V/STOL propulsion control requirements were derived for transition between vertical and horizontal flight using the General Electric RALS (Remote Augmented Lift System) concept. Steady-state operating requirements were defined for a typical Vertical-to-Horizontal transition and for a typical Horizontal-to-Vertical transition. Control mode requirements were established and multi-variable regulators developed for individual operating conditions. Proportional/Integral gain schedules were developed and were incorporated into a transition controller with capabilities for mode switching and manipulated variable reassignment. A non-linear component-level transient model of the engine was developed and utilized to provide a preliminary check-out of the controller logic. An inlet and nozzle effects model was developed for subsequent incorporation into the engine model and an aircraft model was developed for preliminary flight transition simulations. A condition monitoring development plan was developed and preliminary design requirements established. The Phase 1 long-range technology plan was refined and restructured toward the development of a real-time high fidelity transient model of a supersonic V/STOL propulsion system and controller for use in a piloted simulation program at NASA-Ames.

  16. Multivariate Longitudinal Analysis with Bivariate Correlation Test

    PubMed Central

    Adjakossa, Eric Houngla; Sadissou, Ibrahim; Hounkonnou, Mahouton Norbert; Nuel, Gregory

    2016-01-01

    In the context of multivariate multilevel data analysis, this paper focuses on the multivariate linear mixed-effects model, including all the correlations between the random effects when the dimensional residual terms are assumed uncorrelated. Using the EM algorithm, we suggest more general expressions of the model’s parameters estimators. These estimators can be used in the framework of the multivariate longitudinal data analysis as well as in the more general context of the analysis of multivariate multilevel data. By using a likelihood ratio test, we test the significance of the correlations between the random effects of two dependent variables of the model, in order to investigate whether or not it is useful to model these dependent variables jointly. Simulation studies are done to assess both the parameter recovery performance of the EM estimators and the power of the test. Using two empirical data sets which are of longitudinal multivariate type and multivariate multilevel type, respectively, the usefulness of the test is illustrated. PMID:27537692

  17. Multivariate Longitudinal Analysis with Bivariate Correlation Test.

    PubMed

    Adjakossa, Eric Houngla; Sadissou, Ibrahim; Hounkonnou, Mahouton Norbert; Nuel, Gregory

    2016-01-01

    In the context of multivariate multilevel data analysis, this paper focuses on the multivariate linear mixed-effects model, including all the correlations between the random effects when the dimensional residual terms are assumed uncorrelated. Using the EM algorithm, we suggest more general expressions of the model's parameters estimators. These estimators can be used in the framework of the multivariate longitudinal data analysis as well as in the more general context of the analysis of multivariate multilevel data. By using a likelihood ratio test, we test the significance of the correlations between the random effects of two dependent variables of the model, in order to investigate whether or not it is useful to model these dependent variables jointly. Simulation studies are done to assess both the parameter recovery performance of the EM estimators and the power of the test. Using two empirical data sets which are of longitudinal multivariate type and multivariate multilevel type, respectively, the usefulness of the test is illustrated.

  18. Robust detection, isolation and accommodation for sensor failures

    NASA Technical Reports Server (NTRS)

    Emami-Naeini, A.; Akhter, M. M.; Rock, S. M.

    1986-01-01

    The objective is to extend the recent advances in robust control system design of multivariable systems to sensor failure detection, isolation, and accommodation (DIA), and estimator design. This effort provides analysis tools to quantify the trade-off between performance robustness and DIA sensitivity, which are to be used to achieve higher levels of performance robustness for given levels of DIA sensitivity. An innovations-based DIA scheme is used. Estimators, which depend upon a model of the process and process inputs and outputs, are used to generate these innovations. Thresholds used to determine failure detection are computed based on bounds on modeling errors, noise properties, and the class of failures. The applicability of the newly developed tools is demonstrated on a multivariable aircraft turbojet engine example. A new concept called the threshold selector was developed. It represents a significant and innovative tool for the analysis and synthesis of DIA algorithms. The estimators were made robust by introduction of an internal model and by frequency shaping. The internal model provides asymptotically unbiased filter estimates. The incorporation of frequency shaping of the Linear Quadratic Gaussian cost functional modifies the estimator design to make it suitable for sensor failure DIA. The results are compared with previous studies which used thresholds that were selected empirically. Comparison of these two techniques on a nonlinear dynamic engine simulation shows improved performance of the new method compared to previous techniques.

  19. The proliferation marker Ki67, but not neuroendocrine expression, is an independent factor in the prediction of prognosis of primary prostate cancer patients

    PubMed Central

    Pascale, Mariarosa; Aversa, Cinzia; Barbazza, Renzo; Marongiu, Barbara; Siracusano, Salvatore; Stoffel, Flavio; Sulfaro, Sando; Roggero, Enrico; Stanta, Giorgio

    2016-01-01

    Background: Neuroendocrine markers, which could indicate aggressive variants of prostate cancer, and Ki67 (a well-known marker in oncology for defining tumor proliferation) have already been associated with clinical outcome in prostate cancer. The aim of this study was to investigate the prognostic value of those markers in primary prostate cancer patients. Patients and methods: NSE (neuron specific enolase), ChrA (chromogranin A), Syp (Synaptophysin) and Ki67 staining were performed by immunohistochemistry. Then, the prognostic impact of their expression on overall survival was investigated in 166 primary prostate cancer patients by univariate and multivariate analyses. Results: NSE, ChrA, Syp and Ki67 were positive in 50, 45, 54 and 146 out of 166 patients, respectively. In Kaplan-Meier analysis only diffuse NSE staining (negative vs diffuse, p = 0.004) and Ki67 (≤ 10% vs > 10%, p < 0.0001) were significantly associated with overall survival. Ki67 expression, but not NSE, emerged as an independent prognostic factor for overall survival in multivariate analysis. Conclusions: A prognostic model incorporating Ki67 expression with clinical-pathological covariates could provide additional prognostic information. Ki67 may thus improve prediction of prostate cancer outcome based on standard clinical-pathological parameters, improving prognosis and management of prostate cancer patients. PMID:27679548

  20. Constraining DALECv2 using multiple data streams and ecological constraints: analysis and application

    DOE PAGES

    Delahaies, Sylvain; Roulstone, Ian; Nichols, Nancy

    2017-07-10

    We use a variational method to assimilate multiple data streams into the terrestrial ecosystem carbon cycle model DALECv2 (Data Assimilation Linked Ecosystem Carbon). Ecological and dynamical constraints have recently been introduced to constrain unresolved components of this otherwise ill-posed problem. We recast these constraints as a multivariate Gaussian distribution to incorporate them into the variational framework and we demonstrate their advantage through a linear analysis. By using an adjoint method we study a linear approximation of the inverse problem: firstly we perform a sensitivity analysis of the different outputs under consideration, and secondly we use the concept of resolution matrices to diagnose the nature of the ill-posedness and evaluate regularisation strategies. We then study the non-linear problem with an application to real data. Finally, we propose a modification to the model: introducing a spin-up period provides us with a built-in formulation of some ecological constraints which facilitates the variational approach.

  1. Constraining DALECv2 using multiple data streams and ecological constraints: analysis and application

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Delahaies, Sylvain; Roulstone, Ian; Nichols, Nancy

    We use a variational method to assimilate multiple data streams into the terrestrial ecosystem carbon cycle model DALECv2 (Data Assimilation Linked Ecosystem Carbon). Ecological and dynamical constraints have recently been introduced to constrain unresolved components of this otherwise ill-posed problem. We recast these constraints as a multivariate Gaussian distribution to incorporate them into the variational framework and we demonstrate their advantage through a linear analysis. By using an adjoint method we study a linear approximation of the inverse problem: firstly we perform a sensitivity analysis of the different outputs under consideration, and secondly we use the concept of resolution matrices to diagnose the nature of the ill-posedness and evaluate regularisation strategies. We then study the non-linear problem with an application to real data. Finally, we propose a modification to the model: introducing a spin-up period provides us with a built-in formulation of some ecological constraints which facilitates the variational approach.

  2. Damage detection of engine bladed-disks using multivariate statistical analysis

    NASA Astrophysics Data System (ADS)

    Fang, X.; Tang, J.

    2006-03-01

    The timely detection of damage in aero-engine bladed-disks is an extremely important and challenging research topic. Bladed-disks have high modal density and, particularly, their vibration responses are subject to significant uncertainties due to manufacturing tolerance (blade-to-blade difference or mistuning), operating condition change and sensor noise. In this study, we present a new methodology for the on-line damage detection of engine bladed-disks using their vibratory responses during spin-up or spin-down operations, which can be measured by the blade-tip-timing sensing technique. We apply a principal component analysis (PCA)-based approach for data compression, feature extraction, and denoising. The non-model based damage detection is achieved by analyzing the change between response features of the healthy structure and of the damaged one. We facilitate such comparison by incorporating Hotelling's T² statistic analysis, which yields damage declaration with a given confidence level. The effectiveness of the method is demonstrated by case studies.
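
    The combination of PCA feature extraction with Hotelling's T² can be sketched roughly as follows. This is a minimal illustration rather than the authors' method: the function names and the synthetic "healthy" data are hypothetical, and the detection threshold is taken as an empirical quantile of the baseline T² values, which is an assumption standing in for whatever confidence-level rule the paper uses.

```python
import numpy as np

def fit_pca_baseline(X_healthy, n_components):
    """Learn the mean, principal directions, and score variances of healthy data."""
    mean = X_healthy.mean(axis=0)
    Xc = X_healthy - mean
    # Rows of Vt are the principal directions of the centered baseline data.
    _, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    components = Vt[:n_components]
    variances = s[:n_components] ** 2 / (X_healthy.shape[0] - 1)
    return mean, components, variances

def hotelling_t2(X, mean, components, variances):
    """T² of each row: squared PCA scores scaled by their baseline variances."""
    scores = (np.atleast_2d(X) - mean) @ components.T
    return np.sum(scores ** 2 / variances, axis=1)

# Hypothetical baseline: 200 response-feature vectors of dimension 6.
rng = np.random.default_rng(1)
X_healthy = rng.normal(size=(200, 6)) * np.array([3.0, 2.0, 1.0, 0.5, 0.5, 0.5])

mean, components, variances = fit_pca_baseline(X_healthy, n_components=2)
t2_healthy = hotelling_t2(X_healthy, mean, components, variances)

# Assumed rule: flag damage above the 99% empirical quantile of baseline T².
threshold = np.quantile(t2_healthy, 0.99)

# A synthetic "damaged" response shifted 10 standard deviations along PC 1.
x_damaged = mean + 10.0 * np.sqrt(variances[0]) * components[0]
t2_damaged = hotelling_t2(x_damaged, mean, components, variances)[0]
is_damaged = t2_damaged > threshold
```

    With a distributional assumption on the scores, the empirical quantile could be replaced by an F-distribution-based limit; the sketch avoids that to stay dependency-free.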

  3. Multivariate Regression Analysis and Slaughter Livestock,

    DTIC Science & Technology

    AGRICULTURE, *ECONOMICS), (*MEAT, PRODUCTION), MULTIVARIATE ANALYSIS, REGRESSION ANALYSIS, ANIMALS, WEIGHT, COSTS, PREDICTIONS, STABILITY, MATHEMATICAL MODELS, STORAGE, BEEF, PORK, FOOD, STATISTICAL DATA, ACCURACY

  4. A screening model for low bone mass in elderly Japanese men using quantitative ultrasound measurements: Fujiwara-Kyo Study.

    PubMed

    Minematsu, Akira; Hazaki, Kan; Harano, Akihiro; Iki, Masayuki; Fujita, Yuki; Okamoto, Nozomi; Kurumatani, Norio

    2012-01-01

    Screening for low bone mass is important to prevent fragility fractures in men as well as women, although men show a much lower prevalence of osteoporosis than women. The purpose of this study was to establish a screening model for low bone mineral density (BMD) using a quantitative ultrasound parameter and easily obtained objective indices for elderly Japanese men. We examined 1633 men (65-84 yr old) who were subjects of the Fujiwara-Kyo Study. Speed of sound (SOS) at the calcaneus was determined, and BMD was measured by dual-energy X-ray absorptiometry at the lumbar spine (LS), total hip (TH), and femoral neck (FN). Low BMD was defined as >1 standard deviation below the young adult mean, in accordance with World Health Organization criteria. We performed receiver operating characteristic (ROC) analysis to identify a better screening model incorporating SOS and determined the optimal cutoff value using the Youden index. Prevalences of low BMD at the 3 skeletal sites were 27.8% (LS), 33.5% (TH), 48.6% (FN), and 43.3% at either LS or TH. The greatest area under the ROC curve (0.806, 95% confidence interval: 0.785-0.828) and the smallest Akaike's information criterion were obtained in the multivariate model incorporating SOS, age, height, and weight for predicting low BMD at all skeletal sites. This model predicted low BMD at TH with a sensitivity of 0.726 and a specificity of 0.739, whereas a similar model predicted low BMD at LS with much lower validity. We conclude that the multivariate model for TH could be used to screen for low BMD in elderly Japanese men. Copyright © 2012 The International Society for Clinical Densitometry. Published by Elsevier Inc. All rights reserved.
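
    The cutoff selection mentioned in this abstract relies on the Youden index, J = sensitivity + specificity - 1. The sketch below shows how such a cutoff might be chosen from model scores. The scores and labels are made-up toy values, not study data, and the function name is hypothetical.

```python
import numpy as np

def youden_cutoff(scores, labels):
    """Return (cutoff, J, sensitivity, specificity) for the cutoff maximizing J.

    A case is classified positive when its score is >= cutoff.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    best = None
    for cutoff in np.unique(scores):
        predicted = scores >= cutoff
        sensitivity = np.mean(predicted[labels])    # true positive rate
        specificity = np.mean(~predicted[~labels])  # true negative rate
        j = sensitivity + specificity - 1.0
        if best is None or j > best[1]:
            best = (cutoff, j, sensitivity, specificity)
    return best

# Toy example: predicted risk of low BMD and the observed status (1 = low BMD).
risk = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]
low_bmd = [0, 0, 0, 1, 0, 0, 1, 1, 1]
cutoff, j, sensitivity, specificity = youden_cutoff(risk, low_bmd)
```

    On this toy data the maximum is J = 0.75 at a cutoff of 0.7, where sensitivity is 0.75 and specificity is 1.0.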

  5. Preoperative predictive model of recovery of urinary continence after radical prostatectomy.

    PubMed

    Matsushita, Kazuhito; Kent, Matthew T; Vickers, Andrew J; von Bodman, Christian; Bernstein, Melanie; Touijer, Karim A; Coleman, Jonathan A; Laudone, Vincent T; Scardino, Peter T; Eastham, James A; Akin, Oguz; Sandhu, Jaspreet S

    2015-10-01

    To build a predictive model of urinary continence recovery after radical prostatectomy (RP) that incorporates magnetic resonance imaging (MRI) parameters and clinical data. We conducted a retrospective review of data from 2,849 patients who underwent pelvic staging MRI before RP from November 2001 to June 2010. We used logistic regression to evaluate the association between each MRI variable and continence at 6 or 12 months, adjusting for age, body mass index (BMI) and American Society of Anesthesiologists (ASA) score, and then used multivariable logistic regression to create our model. A nomogram was constructed using the multivariable logistic regression models. In all, 68% (1,742/2,559) and 82% (2,205/2,689) regained function at 6 and 12 months, respectively. In the base model, age, BMI and ASA score were significant predictors of continence at 6 or 12 months on univariate analysis (P < 0.005). Among the preoperative MRI measurements, membranous urethral length, which showed great significance, was incorporated into the base model to create the full model. For continence recovery at 6 months, the addition of membranous urethral length increased the area under the curve (AUC) to 0.664 for the validation set, an increase of 0.064 over the base model. For continence recovery at 12 months, the AUC was 0.674, an increase of 0.085 over the base model. Using our model, the likelihood of continence recovery increases with membranous urethral length and decreases with age, BMI and ASA score. This model could be used for patient counselling and for the identification of patients at high risk for urinary incontinence in whom to study changes in operative technique that improve urinary function after RP. © 2015 The Authors BJU International © 2015 BJU International Published by John Wiley & Sons Ltd.

  6. Use of electronic personal health records (PHRs) for complementary and alternative medicine (CAM) disclosure: Implications for integrative health care.

    PubMed

    Yeo, Younsook; Park, Jisung; Roh, Soonhee; Levkoff, Sue

    2016-06-01

    To test the hypothesis that patients' use of Internet-based personal health records (PHRs) is positively related to their disclosure of CAM use to medical doctors, controlling for covariates' effects (e.g., health, human capital, and demographics), and to examine the factors influencing patients' CAM use disclosures. Cross-sectional survey. We analyzed data in a subsample of CAM users who used both the internet and healthcare services (n=1457) from the Health Information National Trends Survey, a nationally representative study of U.S. adults (≥18), by using a multivariate logistic analysis. Among the subsample, 52.7% disclosed their use of CAM to their doctors and 19.3% used PHRs. Both the bivariate (64.1% vs. 35.9%, p<0.01) and multivariate (β=0.558, SE=0.220, OR=1.75, p<0.05) analyses revealed a positive relationship between PHR use and CAM use disclosure. Other significant factors for CAM use disclosure included being older, being female, having insurance, and having a regular source of care. In particular, foreign-born adults had significantly lower odds of disclosing their CAM use than U.S.-born adults. We found that patients' PHR use facilitated their disclosure of CAM use to medical doctors. To ensure integrative healthcare and integrative medicine in the healthcare sector and optimum care for patients, education for CAM users regarding PHR adoption is encouraged. Next-generation PHR designs should consider incorporating domains for CAM data that allow patients to store CAM data and also incorporating 'intelligent' PHRs, whose contents can be converted into the patient's first language. Copyright © 2016 Elsevier Ltd. All rights reserved.
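
    The reported odds ratio follows from the logistic regression coefficient as OR = exp(β). The short check below reproduces it from the published β and SE. The Wald z statistic, the two-sided normal p-value, and the 95% interval are computed here under a normal approximation, which is an assumption. The abstract does not report a confidence interval or an exact p-value.

```python
import math

beta = 0.558  # reported logistic regression coefficient for PHR use
se = 0.220    # reported standard error

# Odds ratio implied by the coefficient.
odds_ratio = math.exp(beta)

# Wald test under a normal approximation (assumption, not stated in the abstract).
z = beta / se
p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided p-value

# Approximate 95% confidence interval on the odds-ratio scale.
ci_low = math.exp(beta - 1.96 * se)
ci_high = math.exp(beta + 1.96 * se)

print(f"OR = {odds_ratio:.2f}, z = {z:.2f}, p = {p_value:.3f}")
```

    The computed odds ratio rounds to 1.75 and the approximate p-value falls below 0.05, both consistent with the figures reported above.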

  7. Multivariate inference of pathway activity in host immunity and response to therapeutics

    PubMed Central

    Goel, Gautam; Conway, Kara L.; Jaeger, Martin; Netea, Mihai G.; Xavier, Ramnik J.

    2014-01-01

    Developing a quantitative view of how biological pathways are regulated in response to environmental factors is central for understanding of disease phenotypes. We present a computational framework, named Multivariate Inference of Pathway Activity (MIPA), which quantifies degree of activity induced in a biological pathway by computing five distinct measures from transcriptomic profiles of its member genes. Statistical significance of inferred activity is examined using multiple independent self-contained tests followed by a competitive analysis. The method incorporates a new algorithm to identify a subset of genes that may regulate the extent of activity induced in a pathway. We present an in-depth evaluation of specificity, robustness, and reproducibility of our method. We benchmarked MIPA's false positive rate at less than 1%. Using transcriptomic profiles representing distinct physiological and disease states, we illustrate applicability of our method in (i) identifying gene–gene interactions in autophagy-dependent response to Salmonella infection, (ii) uncovering gene–environment interactions in host response to bacterial and viral pathogens and (iii) identifying driver genes and processes that contribute to wound healing and response to anti-TNFα therapy. We provide relevant experimental validation that corroborates the accuracy and advantage of our method. PMID:25147207

  8. Optimization of large animal MI models; a systematic analysis of control groups from preclinical studies.

    PubMed

    Zwetsloot, P P; Kouwenberg, L H J A; Sena, E S; Eding, J E; den Ruijter, H M; Sluijter, J P G; Pasterkamp, G; Doevendans, P A; Hoefer, I E; Chamuleau, S A J; van Hout, G P J; Jansen Of Lorkeers, S J

    2017-10-27

    Large animal models are essential for the development of novel therapeutics for myocardial infarction. To optimize translation, we need to assess the effect of experimental design on disease outcome and model experimental design to resemble the clinical course of MI. The aim of this study is therefore to systematically investigate how experimental decisions affect outcome measurements in large animal MI models. We used control animal-data from two independent meta-analyses of large animal MI models. All variables of interest were pre-defined. We performed univariable and multivariable meta-regression to analyze whether these variables influenced infarct size and ejection fraction. Our analyses incorporated 246 relevant studies. Multivariable meta-regression revealed that infarct size and cardiac function were influenced independently by choice of species, sex, co-medication, occlusion type, occluded vessel, quantification method, ischemia duration and follow-up duration. We provide strong systematic evidence that commonly used endpoints significantly depend on study design and biological variation. This makes direct comparison of different study-results difficult and calls for standardized models. Researchers should take this into account when designing large animal studies to most closely mimic the clinical course of MI and enable translational success.

  9. Recurrent Neural Networks for Multivariate Time Series with Missing Values.

    PubMed

    Che, Zhengping; Purushotham, Sanjay; Cho, Kyunghyun; Sontag, David; Liu, Yan

    2018-04-17

    Multivariate time series data in practical applications, such as health care, geoscience, and biology, are characterized by a variety of missing values. In time series prediction and other related tasks, it has been noted that missing values and their missing patterns are often correlated with the target labels, a.k.a., informative missingness. There is very limited work on exploiting the missing patterns for effective imputation and improving prediction performance. In this paper, we develop novel deep learning models, namely GRU-D, as one of the early attempts. GRU-D is based on Gated Recurrent Unit (GRU), a state-of-the-art recurrent neural network. It takes two representations of missing patterns, i.e., masking and time interval, and effectively incorporates them into a deep model architecture so that it not only captures the long-term temporal dependencies in time series, but also utilizes the missing patterns to achieve better prediction results. Experiments of time series classification tasks on real-world clinical datasets (MIMIC-III, PhysioNet) and synthetic datasets demonstrate that our models achieve state-of-the-art performance and provide useful insights for better understanding and utilization of missing values in time series analysis.
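
    The two representations of missingness named in this abstract, masking and time interval, can be computed from a raw series as sketched below. The update rule is the commonly cited definition (the interval restarts after an observation and accumulates across missing steps). It is assumed here because the abstract does not spell it out, and the function name and toy data are illustrative only.

```python
import numpy as np

def masking_and_intervals(values, timestamps):
    """Compute the masking matrix and time-interval matrix for a series.

    values:     array of shape (T, D), with NaN marking a missing entry
    timestamps: array of shape (T,), observation times
    """
    values = np.asarray(values, dtype=float)
    timestamps = np.asarray(timestamps, dtype=float)
    mask = (~np.isnan(values)).astype(int)  # 1 = observed, 0 = missing
    delta = np.zeros_like(values)           # time since the last observation
    for t in range(1, len(timestamps)):
        gap = timestamps[t] - timestamps[t - 1]
        # If a variable was observed at t-1 the interval restarts at `gap`;
        # otherwise the previous interval is carried forward and extended.
        delta[t] = gap + (1 - mask[t - 1]) * delta[t - 1]
    return mask, delta

# Toy series with two variables observed at irregular times.
nan = np.nan
values = np.array([[47, nan], [49, 15], [nan, 14], [40, nan],
                   [nan, nan], [43, nan], [55, 15]])
timestamps = [0.0, 0.1, 0.6, 1.6, 2.2, 2.5, 3.1]
mask, delta = masking_and_intervals(values, timestamps)
```

    A model such as GRU-D would take `mask` and `delta` as additional inputs alongside the values, so that the network can learn from when and how often each variable is missing.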

  10. A hybrid approach identifies metabolic signatures of high-producers for Chinese hamster ovary clone selection and process optimization.

    PubMed

    Popp, Oliver; Müller, Dirk; Didzus, Katharina; Paul, Wolfgang; Lipsmeier, Florian; Kirchner, Florian; Niklas, Jens; Mauch, Klaus; Beaucamp, Nicola

    2016-09-01

    In-depth characterization of high-producer cell lines and bioprocesses is vital to ensure robust and consistent production of recombinant therapeutic proteins in high quantity and quality for clinical applications. This requires applying appropriate methods during bioprocess development to enable meaningful characterization of CHO clones and processes. Here, we present a novel hybrid approach for supporting comprehensive characterization of metabolic clone performance. The approach combines metabolite profiling with multivariate data analysis and fluxomics to enable a data-driven mechanistic analysis of key metabolic traits associated with desired cell phenotypes. We applied the methodology to quantify and compare metabolic performance in a set of 10 recombinant CHO-K1 producer clones and a host cell line. The comprehensive characterization enabled us to derive an extended set of clone performance criteria that not only captured growth and product formation, but also incorporated information on intracellular clone physiology and on metabolic changes during the process. These criteria served to establish a quantitative clone ranking and allowed us to identify metabolic differences between high-producing CHO-K1 clones yielding comparably high product titers. Through multivariate data analysis of the combined metabolite and flux data we uncovered common metabolic traits characteristic of high-producer clones in the screening setup. This included high intracellular rates of glutamine synthesis, low cysteine uptake, reduced excretion of aspartate and glutamate, and low intracellular degradation rates of branched-chain amino acids and of histidine. Finally, the above approach was integrated into a workflow that enables standardized high-content selection of CHO producer clones in a high-throughput fashion. In conclusion, the combination of quantitative metabolite profiling, multivariate data analysis, and mechanistic network model simulations can identify metabolic traits characteristic of high-performance clones and enables informed decisions on which clones provide a good match for a particular process platform. The proposed approach also provides a mechanistic link between observed clone phenotype, process setup, and feeding regimes, and thereby offers concrete starting points for subsequent process optimization. Biotechnol. Bioeng. 2016;113: 2005-2019. © 2016 Wiley Periodicals, Inc.

  11. Multivariate calibration in Laser-Induced Breakdown Spectroscopy quantitative analysis: The dangers of a 'black box' approach and how to avoid them

    NASA Astrophysics Data System (ADS)

    Safi, A.; Campanella, B.; Grifoni, E.; Legnaioli, S.; Lorenzetti, G.; Pagnotta, S.; Poggialini, F.; Ripoll-Seguer, L.; Hidalgo, M.; Palleschi, V.

    2018-06-01

    The introduction of the multivariate calibration curve approach in Laser-Induced Breakdown Spectroscopy (LIBS) quantitative analysis has led to a general improvement in LIBS analytical performance, since a multivariate approach makes it possible to exploit the redundancy of elemental information that is typically present in a LIBS spectrum. Software packages implementing multivariate methods are available in the most widely used commercial and open source analytical programs; in most cases, the multivariate algorithms are robust against noise and operate in unsupervised mode. The flip side of the availability and ease of use of such packages is the (perceived) difficulty in assessing the reliability of the results obtained, which often leads users to regard the multivariate algorithms as 'black boxes' whose inner mechanism is supposed to remain hidden from the user. In this paper, we discuss the dangers of a 'black box' approach in LIBS multivariate analysis, and how to overcome them using the chemical-physical knowledge that is at the base of any LIBS quantitative analysis.

  12. Linear regression analysis and its application to multivariate chromatographic calibration for the quantitative analysis of two-component mixtures.

    PubMed

    Dinç, Erdal; Ozdemir, Abdil

    2005-01-01

    A multivariate chromatographic calibration technique was developed for the quantitative analysis of binary mixtures of enalapril maleate (EA) and hydrochlorothiazide (HCT) in tablets in the presence of losartan potassium (LST). The mathematical algorithm of the multivariate chromatographic calibration technique is based on linear regression equations constructed from the relationship between concentration and peak area at a set of five wavelengths. The algorithm of this calibration model, which has a simple mathematical content, is briefly described. This approach is a powerful mathematical tool for optimum chromatographic multivariate calibration and for the elimination of fluctuations arising from instrumental and experimental conditions. The multivariate chromatographic calibration involves the reduction of multivariate linear regression functions to a univariate data set. The model was validated by analyzing various synthetic binary mixtures and by using the standard addition technique. The developed calibration technique was applied to the analysis of real pharmaceutical tablets containing EA and HCT. The results were compared with those obtained by the classical HPLC method. The proposed multivariate chromatographic calibration was observed to give better results than classical HPLC.

  13. A novel combined approach of diffuse reflectance UV-Vis-NIR spectroscopy and multivariate analysis for non-destructive examination of blue ballpoint pen inks in forensic application.

    PubMed

    Kumar, Raj; Sharma, Vishal

    2017-03-15

    The present research is focused on the analysis of writing inks using destructive UV-Vis spectroscopy (dissolution of ink by the solvent) and non-destructive diffuse reflectance UV-Vis-NIR spectroscopy along with chemometrics. Fifty-seven samples of blue ballpoint pen inks were analyzed under optimum conditions to determine the differences in spectral features of inks among same and different manufacturers. Normalization was performed on the spectroscopic data before chemometric analysis. Principal Component Analysis (PCA) and K-means cluster analysis were used on the data to ascertain whether the blue ballpoint pen inks could be differentiated by their UV-Vis/UV-Vis NIR spectra. The discriminating power is calculated by qualitative analysis by the visual comparison of the spectra (absorbance peaks), produced by the destructive and non-destructive methods. In the latter two methods, the pairwise comparison is made by incorporating the clustering method. It is found that the chemometric method provides better discriminating power (98.72% and 99.46%, in destructive and non-destructive, respectively) in comparison to the qualitative analysis (69.67%). Copyright © 2016 Elsevier B.V. All rights reserved.

  14. Multivariate analysis: A statistical approach for computations

    NASA Astrophysics Data System (ADS)

    Michu, Sachin; Kaushik, Vandana

    2014-10-01

    Multivariate analysis is a statistical approach commonly used in automotive diagnosis, education, the evaluation of clusters in finance, and, more recently, the health-related professions. The objective of the paper is to provide a detailed exploratory discussion of factor analysis (FA) in image retrieval methods and correlation analysis (CA) of network traffic. Image retrieval methods aim to retrieve relevant images from a collected database, based on their content. The problem is made more difficult by the high dimension of the variable space in which the images are represented. Multivariate correlation analysis provides an anomaly detection and analysis method based on the correlation coefficient matrix. Anomalous behaviors in the network include various attacks on the network, such as DDoS attacks and network scanning.

  15. Multivariate Cluster Analysis.

    ERIC Educational Resources Information Center

    McRae, Douglas J.

    Procedures for grouping students into homogeneous subsets have long interested educational researchers. The research reported in this paper is an investigation of a set of objective grouping procedures based on multivariate analysis considerations. Four multivariate functions that might serve as criteria for adequate grouping are given and…

  16. Part 2. Development of Enhanced Statistical Methods for Assessing Health Effects Associated with an Unknown Number of Major Sources of Multiple Air Pollutants.

    PubMed

    Park, Eun Sug; Symanski, Elaine; Han, Daikwon; Spiegelman, Clifford

    2015-06-01

    A major difficulty with assessing source-specific health effects is that source-specific exposures cannot be measured directly; rather, they need to be estimated by a source-apportionment method such as multivariate receptor modeling. The uncertainty in source apportionment (uncertainty in source-specific exposure estimates and model uncertainty due to the unknown number of sources and identifiability conditions) has been largely ignored in previous studies. Also, spatial dependence of multipollutant data collected from multiple monitoring sites has not yet been incorporated into multivariate receptor modeling. The objectives of this project are (1) to develop a multipollutant approach that incorporates both sources of uncertainty in source-apportionment into the assessment of source-specific health effects and (2) to develop enhanced multivariate receptor models that can account for spatial correlations in the multipollutant data collected from multiple sites. We employed a Bayesian hierarchical modeling framework consisting of multivariate receptor models, health-effects models, and a hierarchical model on latent source contributions. For the health model, we focused on the time-series design in this project. Each combination of number of sources and identifiability conditions (additional constraints on model parameters) defines a different model. We built a set of plausible models with extensive exploratory data analyses and with information from previous studies, and then computed posterior model probability to estimate model uncertainty. Parameter estimation and model uncertainty estimation were implemented simultaneously by Markov chain Monte Carlo (MCMC) methods. We validated the methods using simulated data. We illustrated the methods using PM2.5 (particulate matter ≤ 2.5 μm in aerodynamic diameter) speciation data and mortality data from Phoenix, Arizona, and Houston, Texas. The Phoenix data included counts of cardiovascular deaths and daily PM2.5 speciation data from 1995-1997. The Houston data included respiratory mortality data and 24-hour PM2.5 speciation data sampled every six days from a region near the Houston Ship Channel in years 2002-2005. We also developed a Bayesian spatial multivariate receptor modeling approach that, while simultaneously dealing with the unknown number of sources and identifiability conditions, incorporated spatial correlations in the multipollutant data collected from multiple sites into the estimation of source profiles and contributions based on the discrete process convolution model for multivariate spatial processes. This new modeling approach was applied to 24-hour ambient air concentrations of 17 volatile organic compounds (VOCs) measured at nine monitoring sites in Harris County, Texas, during years 2000 to 2005. Simulation results indicated that our methods were accurate in identifying the true model and estimated parameters were close to the true values. The results from our methods agreed in general with previous studies on the source apportionment of the Phoenix data in terms of estimated source profiles and contributions. However, we had a greater number of statistically insignificant findings, which was likely a natural consequence of incorporating uncertainty in the estimated source contributions into the health-effects parameter estimation. For the Houston data, a model with five sources (that seemed to be Sulfate-Rich Secondary Aerosol, Motor Vehicles, Industrial Combustion, Soil/Crustal Matter, and Sea Salt) showed the highest posterior model probability among the candidate models considered when fitted simultaneously to the PM2.5 and mortality data. There was a statistically significant positive association between respiratory mortality and same-day PM2.5 concentrations attributed to one of the sources (probably industrial combustion). The Bayesian spatial multivariate receptor modeling approach applied to the VOC data led to a highest posterior model probability for a model with five sources (that seemed to be refinery, petrochemical production, gasoline evaporation, natural gas, and vehicular exhaust) among several candidate models, with the number of sources varying between three and seven and with different identifiability conditions. Our multipollutant approach assessing source-specific health effects is more advantageous than a single-pollutant approach in that it can estimate total health effects from multiple pollutants and can also identify emission sources that are responsible for adverse health effects. Our Bayesian approach can incorporate not only uncertainty in the estimated source contributions, but also model uncertainty that has not been addressed in previous studies on assessing source-specific health effects. The new Bayesian spatial multivariate receptor modeling approach enables predictions of source contributions at unmonitored sites, minimizing exposure misclassification and providing improved exposure estimates along with their uncertainty estimates, as well as accounting for uncertainty in the number of sources and identifiability conditions.

  17. Head and facial anthropometry of mixed-race US Army male soldiers for military design and sizing: a pilot study.

    PubMed

    Yokota, Miyo

    2005-05-01

    In the United States, the biologically admixed population is increasing. Such demographic changes may affect the distribution of anthropometric characteristics, which are incorporated into the design of equipment and clothing for the US Army and other large organizations. The purpose of this study was to examine multivariate craniofacial anthropometric distributions between biologically admixed male populations and single racial groups of Black and White males. Multivariate statistical results suggested that nose breadth and lip length were different between Blacks and Whites. Such differences may be considered for adjustments to respirators and chemical-biological protective masks. However, based on this pilot study, multivariate anthropometric distributions of admixed individuals were within the distributions of single racial groups. Based on the sample reported, sizing and designing for the admixed groups are not necessary if anthropometric distributions of single racial groups comprising admixed groups are known.

  18. Stability of Teacher Value-Added Rankings across Measurement Model and Scaling Conditions

    ERIC Educational Resources Information Center

    Hawley, Leslie R.; Bovaird, James A.; Wu, ChaoRong

    2017-01-01

    Value-added assessment methods have been criticized by researchers and policy makers for a number of reasons. One issue includes the sensitivity of model results across different outcome measures. This study examined the utility of incorporating multivariate latent variable approaches within a traditional value-added framework. We evaluated the…

  19. Exploring Sex Differences in Worry with a Cognitive Vulnerability Model

    ERIC Educational Resources Information Center

    Zalta, Alyson K.; Chambless, Dianne L.

    2008-01-01

    A multivariate model was developed to examine the relative contributions of mastery, stress, interpretive bias, and coping to sex differences in worry. Rumination was incorporated as a second outcome variable to test the specificity of these associations. Participants included two samples of undergraduates totaling 302 men and 379 women. A path…

  20. Multitrait, random regression, or simple repeatability model in high-throughput phenotyping data improve genomic prediction for wheat grain yield

    USDA-ARS's Scientific Manuscript database

    High-throughput phenotyping (HTP) platforms can be used to measure traits that are genetically correlated with wheat (Triticum aestivum L.) grain yield across time. Incorporating such secondary traits in the multivariate pedigree and genomic prediction models would be desirable to improve indirect s...

  1. Serum CA125 predicts extrauterine disease and survival in uterine carcinosarcoma

    PubMed Central

    Huang, Gloria S.; Chiu, Lydia G.; Gebb, Juliana S.; Gunter, Marc J.; Sukumvanich, Paniti; Goldberg, Gary L.; Einstein, Mark H.

    2009-01-01

    Objective: The purpose of this study was to determine the clinical utility of CA125 measurement in patients with uterine carcinosarcoma (CS). Methods: Ninety-five consecutive patients treated for CS at a single institution were identified. All 54 patients who underwent preoperative CA125 measurement were included in the study. Data were abstracted from the medical records. Tests of association between preoperative CA125 and previously identified clinicopathologic prognostic factors were performed using Fisher’s exact test and the Pearson chi-square test. To evaluate the relationship between CA125 elevation and survival, a Cox proportional hazards model was used for multivariate analysis, incorporating all of the prognostic factors identified by univariate analysis. Results: Preoperative CA125 was significantly associated with the presence of extrauterine disease (P<0.001), deep myometrial invasion (P<0.001), and serous histology of the epithelial component (P=0.005). Using univariate survival analysis, stage (HR=1.808, P=0.004), postoperative CA125 level (HR=9.855, P<0.001), and estrogen receptor positivity (HR=0.314, P=0.029) were significantly associated with survival. In the multivariate model, only postoperative CA125 level remained significantly associated with poor survival (HR=5.725, P=0.009). Conclusion: Preoperative CA125 elevation is a marker of extrauterine disease and deep myometrial invasion in patients with uterine CS. Postoperative CA125 elevation is an independent prognostic factor for poor survival. These findings indicate that CA125 may be a clinically useful serum marker in the management of patients with CS. PMID:17935762

  2. Is social interaction associated with alcohol consumption in Uganda?

    PubMed

    Tumwesigye, Nazarius Mbona; Kasirye, Rogers; Nansubuga, Elizabeth

    2009-07-01

    Little is documented about the association of alcohol consumption and social interaction in Uganda, a country with one of the highest per capita alcohol consumptions in the world. This paper describes the pattern of social interaction by sex and establishes the relationship between social interaction and alcohol consumption with and without the consideration of confounders. The data used had 1479 records and were collected in a survey in 2003. The study was part of a multinational study, the Gender, Alcohol, and Culture International Study (GENACIS). Each question on social interaction had been pre-coded in a way that quantified the extent of social interaction. The sum of responses on interaction questions gave a summative score which was used to compute summary indices on social interaction. Principal component analysis (PCA) was used to identify the best combination of variables for a social interaction index. The index was computed by a prediction using a PCA model developed from the selected variables. The index was categorised into quintiles and used in bivariate and multivariate logistic regression analysis of alcohol consumption and social interaction. The stronger the social interaction, the greater the likelihood of drinking alcohol frequently (χ² for trend = 4.72, p<0.001). The association remains significant even after controlling for sex, age group and education level (p=0.008). The relationship between social interaction and heavy consumption of alcohol weakens in multivariate analysis. Communication messages meant to improve health, well-being and public order need to incorporate the dangers of the negative influence of social interaction.

  3. Comparison of Adjuvant Radiation Therapy Alone and Chemotherapy Alone in Surgically Resected Low-Grade Gliomas: Survival Analyses of 2253 Cases from the National Cancer Data Base.

    PubMed

    Wu, Jing; Neale, Natalie; Huang, Yuqian; Bai, Harrison X; Li, Xuejun; Zhang, Zishu; Karakousis, Giorgos; Huang, Raymond; Zhang, Paul J; Tang, Lei; Xiao, Bo; Yang, Li

    2018-04-01

    It is becoming increasingly common to incorporate chemotherapy (CT) with radiotherapy (RT) in the treatment of low-grade gliomas (LGGs) after surgical resection. However, there is a lack of literature comparing survival of patients who underwent RT or CT alone. The U.S. National Cancer Data Base was used to identify patients with histologically confirmed, World Health Organization grade 2 gliomas who received either RT alone or CT alone after surgery from 2004 to 2013. Overall survival (OS) was evaluated by Kaplan-Meier analysis, multivariable Cox proportional hazard regression, and propensity-score-matched analysis. In total, 2253 patients with World Health Organization grade 2 gliomas were included, of whom 1466 (65.1%) received RT alone and 787 (34.9%) CT alone. The median OS was 98.9 months for the RT alone group and 125.8 months for the CT alone group. On multivariable analysis, CT alone was associated with a significant OS benefit compared with RT alone (hazard ratio [HR], 0.405; 95% confidence interval, 0.277-0.592; P < 0.001). On subgroup analyses, the survival advantage of CT alone over RT alone persisted across all age groups, and for the subtotal resection and biopsy groups, but not in the gross total resection group. In propensity-score-matched analysis, CT alone still showed significantly improved OS compared with RT alone (HR, 0.612; 95% confidence interval, 0.506-0.741; P < 0.001). Our results suggest that CT alone was independently associated with longer OS compared with RT alone in patients with LGGs who underwent surgery. Copyright © 2018 Elsevier Inc. All rights reserved.
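    As a rough illustration of the Kaplan-Meier analysis mentioned in the abstract above, the sketch below computes the product-limit survival estimate from follow-up times and event indicators. The function name and the toy data are hypothetical and are not taken from the study; the Cox regression and propensity-score matching steps are not shown.

```python
from collections import Counter


def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct event time (product-limit estimate).

    times  -- follow-up time for each subject
    events -- 1 if the event (e.g. death) was observed, 0 if censored
    """
    # Number of observed events at each distinct event time.
    deaths = Counter(t for t, e in zip(times, events) if e)
    survival = 1.0
    curve = []
    for t in sorted(deaths):
        # Subjects still under observation just before time t.
        at_risk = sum(1 for u in times if u >= t)
        survival *= 1.0 - deaths[t] / at_risk
        curve.append((t, survival))
    return curve


# Toy data (made up): four subjects, the second one censored at t=2.
print(kaplan_meier([1, 2, 3, 4], [1, 0, 1, 1]))
```

    Censored subjects contribute to the at-risk count until they leave observation but never trigger a drop in the curve, which is why the estimate only changes at observed event times.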

  4. Multivariate statistical analysis software technologies for astrophysical research involving large data bases

    NASA Technical Reports Server (NTRS)

    Djorgovski, S. George

    1994-01-01

    We developed a package to process and analyze the data from the digital version of the Second Palomar Sky Survey. This system, called SKICAT, incorporates the latest in machine learning and expert systems software technology, in order to classify the detected objects objectively and uniformly, and facilitate handling of the enormous data sets from digital sky surveys and other sources. The system provides a powerful, integrated environment for the manipulation and scientific investigation of catalogs from virtually any source. It serves three principal functions: image catalog construction, catalog management, and catalog analysis. Through use of the GID3* Decision Tree artificial induction software, SKICAT automates the process of classifying objects within CCD and digitized plate images. To exploit these catalogs, the system also provides tools to merge them into a large, complete database which may be easily queried and modified when new data or better methods of calibrating or classifying become available. The most innovative feature of SKICAT is the facility it provides to experiment with and apply the latest in machine learning technology to the tasks of catalog construction and analysis. SKICAT provides a unique environment for implementing these tools for any number of future scientific purposes. Initial scientific verification and performance tests have been made using galaxy counts and measurements of galaxy clustering from small subsets of the survey data, and a search for very high redshift quasars. All of the tests were successful, and produced new and interesting scientific results. Attachments to this report give detailed accounts of the technical aspects for multivariate statistical analysis of small and moderate-size data sets, called STATPROG. The package was tested extensively on a number of real scientific applications, and has produced real, published results.

  5. Comparative forensic soil analysis of New Jersey state parks using a combination of simple techniques with multivariate statistics.

    PubMed

    Bonetti, Jennifer; Quarino, Lawrence

    2014-05-01

    This study has shown that the combination of simple techniques with the use of multivariate statistics offers the potential for the comparative analysis of soil samples. Five samples were obtained from each of twelve state parks across New Jersey in both the summer and fall seasons. Each sample was examined using particle-size distribution, pH analysis in both water and 1 M CaCl2 , and a loss on ignition technique. Data from each of the techniques were combined, and principal component analysis (PCA) and canonical discriminant analysis (CDA) were used for multivariate data transformation. Samples from different locations could be visually differentiated from one another using these multivariate plots. Hold-one-out cross-validation analysis showed error rates as low as 3.33%. Ten blind study samples were analyzed resulting in no misclassifications using Mahalanobis distance calculations and visual examinations of multivariate plots. Seasonal variation was minimal between corresponding samples, suggesting potential success in forensic applications. © 2014 American Academy of Forensic Sciences.

  6. Quantifying the impact of between-study heterogeneity in multivariate meta-analyses

    PubMed Central

    Jackson, Dan; White, Ian R; Riley, Richard D

    2012-01-01

    Measures that quantify the impact of heterogeneity in univariate meta-analysis, including the very popular I2 statistic, are now well established. Multivariate meta-analysis, where studies provide multiple outcomes that are pooled in a single analysis, is also becoming more commonly used. The question of how to quantify heterogeneity in the multivariate setting is therefore raised. It is the univariate R2 statistic, the ratio of the variance of the estimated treatment effect under the random and fixed effects models, that generalises most naturally, so this statistic provides our basis. This statistic is then used to derive a multivariate analogue of I2, which we call . We also provide a multivariate H2 statistic, the ratio of a generalisation of Cochran's heterogeneity statistic and its associated degrees of freedom, with an accompanying generalisation of the usual I2 statistic, . Our proposed heterogeneity statistics can be used alongside all the usual estimates and inferential procedures used in multivariate meta-analysis. We apply our methods to some real datasets and show how our statistics are equally appropriate in the context of multivariate meta-regression, where study level covariate effects are included in the model. Our heterogeneity statistics may be used when applying any procedure for fitting the multivariate random effects model. Copyright © 2012 John Wiley & Sons, Ltd. PMID:22763950
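    For orientation, the univariate quantities that the abstract above takes as its starting point can be sketched as follows: Cochran's Q from inverse-variance weights, H² as Q divided by its degrees of freedom, and I² as the share of Q in excess of its degrees of freedom. These are the usual univariate definitions; the multivariate analogues proposed in the paper are not reproduced here, and the function name and example numbers are illustrative assumptions.

```python
def heterogeneity(effects, variances):
    """Cochran's Q, H^2 and I^2 for a univariate fixed-effect meta-analysis.

    effects   -- study-level effect estimates
    variances -- their within-study variances
    """
    weights = [1.0 / v for v in variances]
    # Inverse-variance weighted (fixed-effect) pooled estimate.
    pooled = sum(w * y for w, y in zip(weights, effects)) / sum(weights)
    # Cochran's heterogeneity statistic.
    q = sum(w * (y - pooled) ** 2 for w, y in zip(weights, effects))
    df = len(effects) - 1
    h2 = q / df
    # I^2 is truncated at zero when Q falls below its degrees of freedom.
    i2 = max(0.0, (q - df) / q) if q > 0 else 0.0
    return q, h2, i2


# Made-up example: three studies with equal variances.
q, h2, i2 = heterogeneity([0.1, 0.3, 0.5], [0.01, 0.01, 0.01])
print(round(q, 6), round(h2, 6), round(i2, 6))
```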

  7. Analyzing Multiple Outcomes in Clinical Research Using Multivariate Multilevel Models

    PubMed Central

    Baldwin, Scott A.; Imel, Zac E.; Braithwaite, Scott R.; Atkins, David C.

    2014-01-01

    Objective Multilevel models have become a standard data analysis approach in intervention research. Although the vast majority of intervention studies involve multiple outcome measures, few studies use multivariate analysis methods. The authors discuss multivariate extensions to the multilevel model that can be used by psychotherapy researchers. Method and Results Using simulated longitudinal treatment data, the authors show how multivariate models extend common univariate growth models and how the multivariate model can be used to examine multivariate hypotheses involving fixed effects (e.g., does the size of the treatment effect differ across outcomes?) and random effects (e.g., is change in one outcome related to change in the other?). An online supplemental appendix provides annotated computer code and simulated example data for implementing a multivariate model. Conclusions Multivariate multilevel models are flexible, powerful models that can enhance clinical research. PMID:24491071

  8. Analysis techniques for multivariate root loci. [a tool in linear control systems]

    NASA Technical Reports Server (NTRS)

    Thompson, P. M.; Stein, G.; Laub, A. J.

    1980-01-01

    Analysis and techniques are developed for the multivariable root locus and the multivariable optimal root locus. The generalized eigenvalue problem is used to compute angles and sensitivities for both types of loci, and an algorithm is presented that determines the asymptotic properties of the optimal root locus.

  9. Methods for presentation and display of multivariate data

    NASA Technical Reports Server (NTRS)

    Myers, R. H.

    1981-01-01

    Methods for the presentation and display of multivariate data are discussed with emphasis placed on the multivariate analysis of variance problems and the Hotelling T² solution in the two-sample case. The methods utilize the concepts of stepwise discrimination analysis and the computation of partial correlation coefficients.
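    For reference, the two-sample Hotelling statistic referred to above is usually written as below. The notation is an assumption, since the record itself gives no formulas: sample sizes n_1 and n_2, mean vectors \bar{x}_1 and \bar{x}_2, sample covariance matrices S_1 and S_2, and p measured variables.

```latex
S_p = \frac{(n_1 - 1)\,S_1 + (n_2 - 1)\,S_2}{n_1 + n_2 - 2},
\qquad
T^2 = \frac{n_1 n_2}{n_1 + n_2}\,
      (\bar{x}_1 - \bar{x}_2)^{\top} S_p^{-1} (\bar{x}_1 - \bar{x}_2),
\qquad
\frac{n_1 + n_2 - p - 1}{(n_1 + n_2 - 2)\,p}\,T^2 \sim F_{p,\; n_1 + n_2 - p - 1}.
```

    The F distribution in the last expression holds under the null hypothesis of equal mean vectors, assuming multivariate normality and a common covariance matrix.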

  10. A Primer on Multivariate Analysis of Variance (MANOVA) for Behavioral Scientists

    ERIC Educational Resources Information Center

    Warne, Russell T.

    2014-01-01

    Reviews of statistical procedures (e.g., Bangert & Baumberger, 2005; Kieffer, Reese, & Thompson, 2001; Warne, Lazo, Ramos, & Ritter, 2012) show that one of the most common multivariate statistical methods in psychological research is multivariate analysis of variance (MANOVA). However, MANOVA and its associated procedures are often not…

  11. Supervised multiblock sparse multivariable analysis with application to multimodal brain imaging genetics.

    PubMed

    Kawaguchi, Atsushi; Yamashita, Fumio

    2017-10-01

    This article proposes a procedure for describing the relationship between high-dimensional data sets, such as multimodal brain images and genetic data. We propose a supervised technique to incorporate the clinical outcome to determine a score, which is a linear combination of variables with hieratical structures to multimodalities. This approach is expected to obtain interpretable and predictive scores. The proposed method was applied to a study of Alzheimer's disease (AD). We propose a diagnostic method for AD that involves using whole-brain magnetic resonance imaging (MRI) and positron emission tomography (PET), and we select effective brain regions for the diagnostic probability and investigate the genome-wide association with the regions using single nucleotide polymorphisms (SNPs). The two-step dimension reduction method, which we previously introduced, was considered applicable to such a study and allows us to partially incorporate the proposed method. We show that the proposed method offers classification functions with feasibility and reasonable prediction accuracy based on the receiver operating characteristic (ROC) analysis and reasonable regions of the brain and genomes. Our simulation study based on the synthetic structured data set showed that the proposed method outperformed the original method and provided the characteristic for the supervised feature. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  12. Recommendation for incorporation of a different lymph node scoring system in future AJCC N category for oral cancer.

    PubMed

    Lee, Ching-Chih; Su, Yu-Chieh; Hung, Shih-Kai; Chen, Po-Chun; Huang, Chung-I; Huang, Wei-Lun; Lin, Yu-Wei; Yang, Ching-Chieh

    2017-10-26

    To compare the prognostic value of 3 different lymph node scoring systems (log odds of positive nodes [LODDS], lymph node ratio [rN], and lymph node yield) in an effort to improve the staging of oral cancer. We identified 3958 oral cancer patients from the Surveillance, Epidemiology, and End Results database from 2007 to 2013. In univariate analysis, LODDS, pN, rN, and lymph node yield were prognostic factors for 5-year disease-specific survival (DSS) and overall survival (OS). Multivariate analysis indicated that patients with LODDS 4 had the worst 5-year DSS and OS. Stage migration occurred in pN1 and pN2 patients with LODDS 4. In pN1 patients, those with LODDS 4 had worse 5-year DSS (41.2%) and OS (31.6%) than patients with pN1 and LODDS 2-3. In pN2 patients, those with LODDS 4 had worse 5-year DSS (34.5%) and OS (27.4%) than patients with pN2 and LODDS 2-3. The proposed staging system, which incorporates LODDS with AJCC pN, had better discriminability and prediction accuracy for predicting survival. We also noted that patients with LODDS 4 given adjuvant radiotherapy had better 5-year DSS and OS. The LODDS should be considered as a future candidate measurement for N category in oral cancer.
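    The abstract above does not state how the scores were computed or where the LODDS category cut-offs lie. The sketch below uses one commonly cited definition, the natural log of (positive nodes + 0.5) over (negative nodes + 0.5), alongside the lymph node ratio; both the formula choice and the function names are assumptions made for illustration rather than the authors' specification.

```python
import math


def lodds(positive_nodes, examined_nodes):
    """Log odds of positive nodes (assumed definition, natural log).

    The 0.5 added to each count avoids division by zero and log(0)
    when no nodes, or all nodes, are positive.
    """
    negative_nodes = examined_nodes - positive_nodes
    return math.log((positive_nodes + 0.5) / (negative_nodes + 0.5))


def lymph_node_ratio(positive_nodes, examined_nodes):
    """Fraction of examined lymph nodes that are positive."""
    return positive_nodes / examined_nodes


# Hypothetical patient: 2 positive nodes out of 20 examined.
print(round(lodds(2, 20), 3), lymph_node_ratio(2, 20))
```

    Unlike the ratio, this score still separates patients with zero positive nodes according to how many nodes were examined, which is one reason it is studied as an alternative.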

  13. The impact of HIV infection and socioeconomic factors on the incidence of gonorrhea: A county-level, US-wide analysis.

    PubMed

    Andreatos, Nikolaos; Grigoras, Christos; Shehadeh, Fadi; Pliakos, Elina Eleftheria; Stoukides, Georgianna; Port, Jenna; Flokas, Myrto Eleni; Mylonakis, Eleftherios

    2017-01-01

    Gonorrhea is the second most commonly reported identifiable disease in the United States (U.S.). Importantly, more than 25% of gonorrheal infections demonstrate antibiotic resistance, leading the Centers for Disease Control and Prevention (CDC) to classify gonorrhea as an "urgent threat". We examined the association of gonorrhea infection rates with the incidence of HIV and socioeconomic factors. A county-level multivariable model was then constructed. Multivariable analysis demonstrated that HIV incidence [Coefficient (Coeff): 1.26, 95% Confidence Interval (CI): 0.86, 1.66, P<0.001] exhibited the most powerful independent association with the incidence of gonorrhea and predicted 40% of the observed variation in gonorrhea infection rates. Sociodemographic factors like county urban ranking (Coeff: 0.12, 95% CI: 0.03, 0.20, P = 0.005), percentage of women (Coeff: 0.41, 95% CI: 0.28, 0.53, P<0.001) and percentage of individuals under the poverty line (Coeff: 0.45, 95% CI: 0.32, 0.57, P<0.001) exerted a secondary impact. A regression model that incorporated these variables predicted 56% of the observed variation in gonorrhea incidence (Pmodel<0.001, R2 model = 0.56). Gonorrhea and HIV infection exhibited a powerful correlation thus emphasizing the benefits of comprehensive screening for sexually transmitted infections (STIs) and the value of pre-exposure prophylaxis for HIV among patients visiting an STI clinic. Furthermore, sociodemographic factors also impacted gonorrhea incidence, thus suggesting another possible focus for public health initiatives.

  14. Advanced multivariate data analysis to determine the root cause of trisulfide bond formation in a novel antibody–peptide fusion

    PubMed Central

    Goldrick, Stephen; Holmes, William; Bond, Nicholas J.; Lewis, Gareth; Kuiper, Marcel; Turner, Richard

    2017-01-01

    ABSTRACT Product quality heterogeneities, such as trisulfide bond (TSB) formation, can be influenced by multiple interacting process parameters. Identifying their root cause is a major challenge in biopharmaceutical production. To address this issue, this paper describes the novel application of advanced multivariate data analysis (MVDA) techniques to identify the process parameters influencing TSB formation in a novel recombinant antibody–peptide fusion expressed in mammalian cell culture. The screening dataset was generated with a high‐throughput (HT) micro‐bioreactor system (Ambr™ 15) using a design of experiments (DoE) approach. The complex dataset was first analyzed through the development of a multiple linear regression model focusing solely on the DoE inputs, which identified the temperature, pH and initial nutrient feed day as important process parameters influencing this quality attribute. To further scrutinize the dataset, a partial least squares model was subsequently built incorporating both on‐line and off‐line process parameters and enabled accurate predictions of the TSB concentration at harvest. Process parameters identified by the models to promote and suppress TSB formation were implemented on five 7 L bioreactors and the resultant TSB concentrations were comparable to the model predictions. This study demonstrates the ability of MVDA to enable predictions of the key performance drivers influencing TSB formation that are valid also upon scale‐up. Biotechnol. Bioeng. 2017;114: 2222–2234. © 2017 The Authors. Biotechnology and Bioengineering Published by Wiley Periodicals, Inc. PMID:28500668

  15. Multivariate Analysis and Machine Learning in Cerebral Palsy Research

    PubMed Central

    Zhang, Jing

    2017-01-01

    Cerebral palsy (CP), a common pediatric movement disorder, causes the most severe physical disability in children. Early diagnosis in high-risk infants is critical for early intervention and possible early recovery. In recent years, multivariate analytic and machine learning (ML) approaches have been increasingly used in CP research. This paper aims to identify such multivariate studies and provide an overview of this relatively young field. Studies reviewed in this paper have demonstrated that multivariate analytic methods are useful in identification of risk factors, detection of CP, movement assessment for CP prediction, and outcome assessment, and ML approaches have made it possible to automatically identify movement impairments in high-risk infants. In addition, outcome predictors for surgical treatments have been identified by multivariate outcome studies. To make the multivariate and ML approaches useful in clinical settings, further research with large samples is needed to verify and improve these multivariate methods in risk factor identification, CP detection, movement assessment, and outcome evaluation or prediction. As multivariate analysis, ML and data processing technologies advance in the era of Big Data of this century, it is expected that multivariate analysis and ML will play a bigger role in improving the diagnosis and treatment of CP to reduce mortality and morbidity rates, and enhance patient care for children with CP. PMID:29312134

  16. Multivariate Analysis and Machine Learning in Cerebral Palsy Research.

    PubMed

    Zhang, Jing

    2017-01-01

    Cerebral palsy (CP), a common pediatric movement disorder, causes the most severe physical disability in children. Early diagnosis in high-risk infants is critical for early intervention and possible early recovery. In recent years, multivariate analytic and machine learning (ML) approaches have been increasingly used in CP research. This paper aims to identify such multivariate studies and provide an overview of this relatively young field. Studies reviewed in this paper have demonstrated that multivariate analytic methods are useful in identification of risk factors, detection of CP, movement assessment for CP prediction, and outcome assessment, and ML approaches have made it possible to automatically identify movement impairments in high-risk infants. In addition, outcome predictors for surgical treatments have been identified by multivariate outcome studies. To make the multivariate and ML approaches useful in clinical settings, further research with large samples is needed to verify and improve these multivariate methods in risk factor identification, CP detection, movement assessment, and outcome evaluation or prediction. As multivariate analysis, ML and data processing technologies advance in the era of Big Data of this century, it is expected that multivariate analysis and ML will play a bigger role in improving the diagnosis and treatment of CP to reduce mortality and morbidity rates, and enhance patient care for children with CP.

  17. Quality by design case study: an integrated multivariate approach to drug product and process development.

    PubMed

    Huang, Jun; Kaul, Goldi; Cai, Chunsheng; Chatlapalli, Ramarao; Hernandez-Abad, Pedro; Ghosh, Krishnendu; Nagi, Arwinder

    2009-12-01

    To facilitate an in-depth process understanding, and offer opportunities for developing control strategies to ensure product quality, a combination of experimental design, optimization and multivariate techniques was integrated into the process development of a drug product. A process DOE was used to evaluate effects of the design factors on manufacturability and final product CQAs, and establish design space to ensure desired CQAs. Two types of analyses were performed to extract maximal information, DOE effect & response surface analysis and multivariate analysis (PCA and PLS). The DOE effect analysis was used to evaluate the interactions and effects of three design factors (water amount, wet massing time and lubrication time), on response variables (blend flow, compressibility and tablet dissolution). The design space was established by the combined use of DOE, optimization and multivariate analysis to ensure desired CQAs. Multivariate analysis of all variables from the DOE batches was conducted to study relationships between the variables and to evaluate the impact of material attributes/process parameters on manufacturability and final product CQAs. The integrated multivariate approach exemplifies application of QbD principles and tools to drug product and process development.

  18. The combination of ovarian volume and outline has better diagnostic accuracy than prostate-specific antigen (PSA) concentrations in women with polycystic ovarian syndrome (PCOs).

    PubMed

    Bili, Eleni; Dampala, Kaliopi; Iakovou, Ioannis; Tsolakidis, Dimitrios; Giannakou, Anastasia; Tarlatzis, Basil C

    2014-08-01

    The aim of this study was to determine the performance of prostate specific antigen (PSA) and ultrasound parameters, such as ovarian volume and outline, in the diagnosis of polycystic ovary syndrome (PCOS). This prospective, observational, case-controlled study included 43 women with PCOS, and 40 controls. Between day 3 and 5 of the menstrual cycle, fasting serum samples were collected and transvaginal ultrasound was performed. The diagnostic performance of each parameter [total PSA (tPSA), total-to-free PSA ratio (tPSA:fPSA), ovarian volume, ovarian outline] was estimated by means of receiver operating characteristic (ROC) analysis, along with area under the curve (AUC), threshold, sensitivity, specificity as well as positive (+) and negative (-) likelihood ratios (LRs). Multivariate logistical regression models, using ovarian volume and ovarian outline, were constructed. The tPSA and tPSA:fPSA ratio resulted in AUC of 0.74 and 0.70, respectively, with moderate specificity/sensitivity and insufficient LR+/- values. In the multivariate logistic regression model, the combination of ovarian volume and outline had a sensitivity of 97.7% and a specificity of 97.5% in the diagnosis of PCOS, with +LR and -LR values of 39.1 and 0.02, respectively. In women with PCOS, tPSA and tPSA:fPSA ratio have similar diagnostic performance. The use of a multivariate logistic regression model, incorporating ovarian volume and outline, offers very good diagnostic accuracy in distinguishing women with PCOS patients from controls. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
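    The likelihood ratios quoted above follow directly from the reported sensitivity and specificity. A minimal sketch of that arithmetic, using the standard definitions LR+ = sensitivity / (1 − specificity) and LR− = (1 − sensitivity) / specificity (the function name is illustrative):

```python
def likelihood_ratios(sensitivity, specificity):
    """Positive and negative likelihood ratios of a diagnostic test."""
    lr_pos = sensitivity / (1.0 - specificity)
    lr_neg = (1.0 - sensitivity) / specificity
    return lr_pos, lr_neg


# Sensitivity 97.7% and specificity 97.5%, as reported in the abstract.
lr_pos, lr_neg = likelihood_ratios(0.977, 0.975)
print(round(lr_pos, 1), round(lr_neg, 2))  # prints 39.1 0.02
```

    The rounded results agree with the +LR of 39.1 and the -LR of 0.02 given in the abstract.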

  19. Multivariate Bayesian variable selection exploiting dependence structure among outcomes: Application to air pollution effects on DNA methylation.

    PubMed

    Lee, Kyu Ha; Tadesse, Mahlet G; Baccarelli, Andrea A; Schwartz, Joel; Coull, Brent A

    2017-03-01

    The analysis of multiple outcomes is becoming increasingly common in modern biomedical studies. It is well-known that joint statistical models for multiple outcomes are more flexible and more powerful than fitting a separate model for each outcome; they yield more powerful tests of exposure or treatment effects by taking into account the dependence among outcomes and pooling evidence across outcomes. It is, however, unlikely that all outcomes are related to the same subset of covariates. Therefore, there is interest in identifying exposures or treatments associated with particular outcomes, which we term outcome-specific variable selection. In this work, we propose a variable selection approach for multivariate normal responses that incorporates not only information on the mean model, but also information on the variance-covariance structure of the outcomes. The approach effectively leverages evidence from all correlated outcomes to estimate the effect of a particular covariate on a given outcome. To implement this strategy, we develop a Bayesian method that builds a multivariate prior for the variable selection indicators based on the variance-covariance of the outcomes. We show via simulation that the proposed variable selection strategy can boost power to detect subtle effects without increasing the probability of false discoveries. We apply the approach to the Normative Aging Study (NAS) epigenetic data and identify a subset of five genes in the asthma pathway for which gene-specific DNA methylations are associated with exposures to either black carbon, a marker of traffic pollution, or sulfate, a marker of particles generated by power plants. © 2016, The International Biometric Society.

  20. Multitrait, Random Regression, or Simple Repeatability Model in High-Throughput Phenotyping Data Improve Genomic Prediction for Wheat Grain Yield.

    PubMed

    Sun, Jin; Rutkoski, Jessica E; Poland, Jesse A; Crossa, José; Jannink, Jean-Luc; Sorrells, Mark E

    2017-07-01

    High-throughput phenotyping (HTP) platforms can be used to measure traits that are genetically correlated with wheat ( L.) grain yield across time. Incorporating such secondary traits in the multivariate pedigree and genomic prediction models would be desirable to improve indirect selection for grain yield. In this study, we evaluated three statistical models, simple repeatability (SR), multitrait (MT), and random regression (RR), for the longitudinal data of secondary traits and compared the impact of the proposed models for secondary traits on their predictive abilities for grain yield. Grain yield and secondary traits, canopy temperature (CT) and normalized difference vegetation index (NDVI), were collected in five diverse environments for 557 wheat lines with available pedigree and genomic information. A two-stage analysis was applied for pedigree and genomic selection (GS). First, secondary traits were fitted by SR, MT, or RR models, separately, within each environment. Then, best linear unbiased predictions (BLUPs) of secondary traits from the above models were used in the multivariate prediction models to compare predictive abilities for grain yield. Predictive ability was substantially improved by 70%, on average, from multivariate pedigree and genomic models when including secondary traits in both training and test populations. Additionally, (i) predictive abilities slightly varied for MT, RR, or SR models in this data set, (ii) results indicated that including BLUPs of secondary traits from the MT model was the best in severe drought, and (iii) the RR model was slightly better than SR and MT models under drought environment. Copyright © 2017 Crop Science Society of America.

  1. Estimating an Effect Size in One-Way Multivariate Analysis of Variance (MANOVA)

    ERIC Educational Resources Information Center

    Steyn, H. S., Jr.; Ellis, S. M.

    2009-01-01

    When two or more univariate population means are compared, the proportion of variation in the dependent variable accounted for by population group membership is eta-squared. This effect size can be generalized by using multivariate measures of association, based on the multivariate analysis of variance (MANOVA) statistics, to establish whether…
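    The univariate effect size described above, eta-squared, is the between-group sum of squares divided by the total sum of squares. The sketch below shows that calculation for the one-way case only; the multivariate generalisations discussed in the article are not reproduced, and the function name and data are made up for illustration.

```python
def eta_squared(groups):
    """Proportion of total variation explained by group membership.

    groups -- list of lists, one list of observations per group
    """
    all_values = [x for g in groups for x in g]
    grand_mean = sum(all_values) / len(all_values)
    # Variation of the group means around the grand mean.
    ss_between = sum(
        len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups
    )
    # Variation of every observation around the grand mean.
    ss_total = sum((x - grand_mean) ** 2 for x in all_values)
    return ss_between / ss_total


# Made-up data: two groups of three observations each.
print(round(eta_squared([[1, 2, 3], [4, 5, 6]]), 4))
```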

  2. Dangers in Using Analysis of Covariance Procedures.

    ERIC Educational Resources Information Center

    Campbell, Kathleen T.

    Problems associated with the use of analysis of covariance (ANCOVA) as a statistical control technique are explained. Three problems relate to the use of "OVA" methods (analysis of variance, analysis of covariance, multivariate analysis of variance, and multivariate analysis of covariance) in general. These are: (1) the wasting of information when…

  3. Predictors of Locoregional Failure and Impact on Overall Survival in Patients With Resected Exocrine Pancreatic Cancer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Merrell, Kenneth W.; Haddock, Michael G.; Quevedo, J. Fernando

    Purpose: Resection of exocrine pancreatic cancer is necessary for cure, but locoregional and distant relapse is common. We evaluated our institutional experience to better understand risk factors for locoregional failure (LRF) and its impact on overall survival (OS). Methods and Materials: We reviewed 1051 consecutive patients with nonmetastatic exocrine pancreatic cancer who underwent resection at our institution between March 1987 and January 2011. Among them, 458 had adequate follow-up and evaluation for study inclusion. All patients received adjuvant chemotherapy (n=80 [17.5%]) or chemoradiation therapy (n=378 [82.5%]). Chemotherapy and chemoradiation therapy most frequently consisted of 6 cycles of gemcitabine and 50.4 Gy in 28 fractions with concurrent 5-fluorouracil, respectively. Locoregional control (LRC) and OS were estimated with the Kaplan-Meier method. Univariate and multivariate analyses were performed with Cox proportional hazards regression models incorporating propensity score. Results: Median patient age was 64.5 years (range: 29-88 years). Median follow-up for living patients was 84 months (range: 6-300 months). Extent of resection was R0 (83.8%) or R1 (16.2%). Overall crude incidence of LRF was 17% (n=79). The 5-year LRC for patients with and without radiation therapy was 80% and 68%, respectively (P=.003; hazard ratio [HR]: 0.45; 95% confidence interval [CI]: 0.28-0.76). Multivariate analysis, incorporating propensity score, indicated radiation therapy (P<.0001; HR: 0.23; 95% CI: 0.12-0.42) and positive lymph node ratio of ≥0.2 (P=.02; HR: 1.78; 95% CI: 1.10-2.9) were associated with LRC. In addition, LRF was associated with worse OS (P<.0001; HR: 5.0; 95% CI: 3.9-6.3). Conclusions: In our analysis of 458 patients with resected pancreatic cancer, positive lymph node ratio of ≥0.2 and no adjuvant chemoradiation therapy were associated with increased LRF risk. LRF was associated with poor OS. Radiation therapy should be considered as adjuvant locoregional treatment following pancreatic cancer resection.

  4. Predictors of condom use and refusal among the population of Free State province in South Africa

    PubMed Central

    2012-01-01

    Background This study investigated the extent and predictors of condom use and condom refusal in the Free State province in South Africa. Methods Through a household survey conducted in the Free State province of South Africa, 5,837 adults were interviewed. Univariate and multivariate survey logistic regressions and classification trees (CT) were used for analysing two response variables ‘ever used condom’ and ‘ever refused condom’. Results Eighty-three per cent of the respondents had ever used condoms, of which 38% always used them; 61% used them during the last sexual intercourse and 9% had ever refused to use them. The univariate logistic regression models and CT analysis indicated that a strong predictor of condom use was its perceived need. In the CT analysis, this variable was followed in importance by ‘knowledge of correct use of condom’, condom availability, young age, being single and higher education. ‘Perceived need’ for condoms did not remain significant in the multivariate analysis after controlling for other variables. The strongest predictor of condom refusal, as shown by the CT, was shame associated with condoms followed by the presence of sexual risk behaviour, knowing one’s HIV status, older age and lacking knowledge of condoms (i.e., ability to prevent sexually transmitted diseases and pregnancy, availability, correct and consistent use and existence of female condoms). In the multivariate logistic regression, age was not significant for condom refusal while affordability and perceived need were additional significant variables. Conclusions The use of complementary modelling techniques such as CT in addition to logistic regressions adds to a better understanding of condom use and refusal. Further improvement in correct and consistent use of condoms will require targeted interventions. In addition to existing social marketing campaigns, tailored approaches should focus on establishing the perceived need for condom-use and improving skills for correct use. They should also incorporate interventions to reduce the shame associated with condoms and individual counselling of those likely to refuse condoms. PMID:22639964

  5. A novel structure-aware sparse learning algorithm for brain imaging genetics.

    PubMed

    Du, Lei; Jingwen, Yan; Kim, Sungeun; Risacher, Shannon L; Huang, Heng; Inlow, Mark; Moore, Jason H; Saykin, Andrew J; Shen, Li

    2014-01-01

    Brain imaging genetics is an emergent research field where the association between genetic variations such as single nucleotide polymorphisms (SNPs) and neuroimaging quantitative traits (QTs) is evaluated. Sparse canonical correlation analysis (SCCA) is a bi-multivariate analysis method that has the potential to reveal complex multi-SNP-multi-QT associations. Most existing SCCA algorithms are designed using the soft threshold strategy, which assumes that the features in the data are independent from each other. This independence assumption usually does not hold in imaging genetic data, and thus inevitably limits the capability of yielding optimal solutions. We propose a novel structure-aware SCCA (denoted as S2CCA) algorithm to not only eliminate the independence assumption for the input data, but also incorporate group-like structure in the model. Empirical comparison with a widely used SCCA implementation, on both simulated and real imaging genetic data, demonstrated that S2CCA could yield improved prediction performance and biologically meaningful findings.
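
    Illustrative sketch (not from the paper): the soft threshold strategy mentioned in the abstract above is usually built on an elementwise shrinkage operator. The Python snippet below shows only that generic operator, assuming NumPy; it is not the S2CCA algorithm, and the names `soft_threshold` and `lam` are hypothetical.

```python
import numpy as np

def soft_threshold(w, lam):
    """Shrink each entry of w toward zero by lam; entries with |w| <= lam become 0."""
    w = np.asarray(w, dtype=float)
    return np.sign(w) * np.maximum(np.abs(w) - lam, 0.0)

# Hypothetical canonical weights for four features (e.g. SNPs).
weights = soft_threshold([0.9, -0.2, 0.05, -1.4], lam=0.3)
print(weights)  # small weights are zeroed, large ones are shrunk by lam
```

    Because the operator treats every weight independently, it cannot encode group structure among features, which is the limitation the abstract attributes to most existing SCCA algorithms.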

  6. Bias correction in the hierarchical likelihood approach to the analysis of multivariate survival data.

    PubMed

    Jeon, Jihyoun; Hsu, Li; Gorfine, Malka

    2012-07-01

    Frailty models are useful for measuring unobserved heterogeneity in risk of failures across clusters, providing cluster-specific risk prediction. In a frailty model, the latent frailties shared by members within a cluster are assumed to act multiplicatively on the hazard function. In order to obtain parameter and frailty variate estimates, we consider the hierarchical likelihood (H-likelihood) approach (Ha, Lee and Song, 2001. Hierarchical-likelihood approach for frailty models. Biometrika 88, 233-243) in which the latent frailties are treated as "parameters" and estimated jointly with other parameters of interest. We find that the H-likelihood estimators perform well when the censoring rate is low, however, they are substantially biased when the censoring rate is moderate to high. In this paper, we propose a simple and easy-to-implement bias correction method for the H-likelihood estimators under a shared frailty model. We also extend the method to a multivariate frailty model, which incorporates complex dependence structure within clusters. We conduct an extensive simulation study and show that the proposed approach performs very well for censoring rates as high as 80%. We also illustrate the method with a breast cancer data set. Since the H-likelihood is the same as the penalized likelihood function, the proposed bias correction method is also applicable to the penalized likelihood estimators.

  7. Examining the impacts of increased corn production on ...

    EPA Pesticide Factsheets

    This study demonstrates the value of a coupled chemical transport modeling system for investigating groundwater nitrate contamination responses associated with nitrogen (N) fertilizer application and increased corn production. The coupled Community Multiscale Air Quality Bidirectional and Environmental Policy Integrated Climate modeling system incorporates agricultural management practices and N exchange processes between the soil and atmosphere to estimate levels of N that may volatilize into the atmosphere, re-deposit, and seep or flow into surface and groundwater. Simulated values from this modeling system were used in a land-use regression model to examine associations between groundwater nitrate-N measurements and a suite of factors related to N fertilizer and groundwater nitrate contamination. Multi-variable modeling analysis revealed that the N-fertilizer rate (versus total) applied to irrigated (versus rainfed) grain corn (versus other crops) was the strongest N-related predictor variable of groundwater nitrate-N concentrations. Application of this multi-variable model considered groundwater nitrate-N concentration responses under two corn production scenarios. Findings suggest that increased corn production between 2002 and 2022 could result in 56% to 79% increase in areas vulnerable to groundwater nitrate-N concentrations ≥ 5 mg/L. These above-threshold areas occur on soils with a hydraulic conductivity 13% higher than the rest of the domain. Additio

  8. Polarization in Raman spectroscopy helps explain bone brittleness in genetic mouse models

    NASA Astrophysics Data System (ADS)

    Makowski, Alexander J.; Pence, Isaac J.; Uppuganti, Sasidhar; Zein-Sabatto, Ahbid; Huszagh, Meredith C.; Mahadevan-Jansen, Anita; Nyman, Jeffry S.

    2014-11-01

    Raman spectroscopy (RS) has been extensively used to characterize bone composition. However, the link between bone biomechanics and RS measures is not well established. Here, we leveraged the sensitivity of RS polarization to organization, thereby assessing whether RS can explain differences in bone toughness in genetic mouse models for which traditional RS peak ratios are not informative. In the selected mutant mice (activating transcription factor 4 (ATF4) or matrix metalloproteinase 9 (MMP9) knock-outs), toughness is reduced but differences in bone strength do not exist between knock-out and corresponding wild-type controls. To incorporate differences in the RS of bone occurring at peak shoulders, a multivariate approach was used. Full spectrum principal components analysis of two paired, orthogonal bone orientations (relative to laser polarization) improved genotype classification and correlation to bone toughness when compared to traditional peak ratios. When applied to femurs from wild-type mice at 8 and 20 weeks of age, the principal components of orthogonal bone orientations improved age classification but not the explanation of the maturation-related increase in strength. Overall, increasing polarization information by collecting spectra from two bone orientations improves the ability of multivariate RS to explain variance in bone toughness, likely due to polarization sensitivity to organizational changes in both mineral and collagen.

  9. Chemical Discrimination of Cortex Phellodendri amurensis and Cortex Phellodendri chinensis by Multivariate Analysis Approach.

    PubMed

    Sun, Hui; Wang, Huiyu; Zhang, Aihua; Yan, Guangli; Han, Ying; Li, Yuan; Wu, Xiuhong; Meng, Xiangcai; Wang, Xijun

    2016-01-01

    As herbal medicines have an important position in health care systems worldwide, their current assessment and quality control are a major bottleneck. Cortex Phellodendri chinensis (CPC) and Cortex Phellodendri amurensis (CPA) are widely used in China; however, identifying the species of CPA and CPC has become urgent. In this study, a multivariate analysis approach was applied to the chemical discrimination of CPA and CPC. Principal component analysis showed that the two herbs could be separated clearly. Chemical markers such as berberine, palmatine, phellodendrine, magnoflorine, obacunone, and obaculactone were identified through orthogonal partial least squares discriminant analysis, and were identified tentatively by the accurate mass of quadrupole time-of-flight mass spectrometry. A total of 29 components can be used as the chemical markers for discrimination of CPA and CPC. Of them, phellodendrine is significantly higher in CPC than in CPA, whereas obacunone and obaculactone are significantly higher in CPA than in CPC. The present study shows that a chemical analysis based on the multivariate analysis approach greatly contributes to the investigation of CPA and CPC, and that the identified chemical markers as a whole should be used to discriminate the two herbal medicines; the results also provide chemical information for their quality assessment. A multivariate analysis approach was used to investigate the herbal medicines. The chemical markers were identified through the multivariate analysis approach. A total of 29 components can be used as the chemical markers. UPLC-Q/TOF-MS-based multivariate analysis method for the herbal medicine samples. Abbreviations used: CPC: Cortex Phellodendri chinensis, CPA: Cortex Phellodendri amurensis, PCA: Principal component analysis, OPLS-DA: Orthogonal partial least squares discriminant analysis, BPI: Base peaks ion intensity.

  10. metaCCA: summary statistics-based multivariate meta-analysis of genome-wide association studies using canonical correlation analysis.

    PubMed

    Cichonska, Anna; Rousu, Juho; Marttinen, Pekka; Kangas, Antti J; Soininen, Pasi; Lehtimäki, Terho; Raitakari, Olli T; Järvelin, Marjo-Riitta; Salomaa, Veikko; Ala-Korpela, Mika; Ripatti, Samuli; Pirinen, Matti

    2016-07-01

    A dominant approach to genetic association studies is to perform univariate tests between genotype-phenotype pairs. However, analyzing related traits together increases statistical power, and certain complex associations become detectable only when several variants are tested jointly. Currently, modest sample sizes of individual cohorts, and restricted availability of individual-level genotype-phenotype data across the cohorts limit conducting multivariate tests. We introduce metaCCA, a computational framework for summary statistics-based analysis of a single or multiple studies that allows multivariate representation of both genotype and phenotype. It extends the statistical technique of canonical correlation analysis to the setting where original individual-level records are not available, and employs a covariance shrinkage algorithm to achieve robustness. Multivariate meta-analysis of two Finnish studies of nuclear magnetic resonance metabolomics by metaCCA, using standard univariate output from the program SNPTEST, shows an excellent agreement with the pooled individual-level analysis of original data. Motivated by strong multivariate signals in the lipid genes tested, we envision that multivariate association testing using metaCCA has a great potential to provide novel insights from already published summary statistics from high-throughput phenotyping technologies. Code is available at https://github.com/aalto-ics-kepaco. Contacts: anna.cichonska@helsinki.fi or matti.pirinen@helsinki.fi. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  11. metaCCA: summary statistics-based multivariate meta-analysis of genome-wide association studies using canonical correlation analysis

    PubMed Central

    Cichonska, Anna; Rousu, Juho; Marttinen, Pekka; Kangas, Antti J.; Soininen, Pasi; Lehtimäki, Terho; Raitakari, Olli T.; Järvelin, Marjo-Riitta; Salomaa, Veikko; Ala-Korpela, Mika; Ripatti, Samuli; Pirinen, Matti

    2016-01-01

    Motivation: A dominant approach to genetic association studies is to perform univariate tests between genotype-phenotype pairs. However, analyzing related traits together increases statistical power, and certain complex associations become detectable only when several variants are tested jointly. Currently, modest sample sizes of individual cohorts, and restricted availability of individual-level genotype-phenotype data across the cohorts limit conducting multivariate tests. Results: We introduce metaCCA, a computational framework for summary statistics-based analysis of a single or multiple studies that allows multivariate representation of both genotype and phenotype. It extends the statistical technique of canonical correlation analysis to the setting where original individual-level records are not available, and employs a covariance shrinkage algorithm to achieve robustness. Multivariate meta-analysis of two Finnish studies of nuclear magnetic resonance metabolomics by metaCCA, using standard univariate output from the program SNPTEST, shows an excellent agreement with the pooled individual-level analysis of original data. Motivated by strong multivariate signals in the lipid genes tested, we envision that multivariate association testing using metaCCA has a great potential to provide novel insights from already published summary statistics from high-throughput phenotyping technologies. Availability and implementation: Code is available at https://github.com/aalto-ics-kepaco Contacts: anna.cichonska@helsinki.fi or matti.pirinen@helsinki.fi Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153689

  12. Association Between Severe Hypoglycemia and Cardiovascular Disease Risk in Japanese Patients With Type 2 Diabetes.

    PubMed

    Goto, Atsushi; Goto, Maki; Terauchi, Yasuo; Yamaguchi, Naohito; Noda, Mitsuhiko

    2016-03-09

    It remains unclear whether severe hypoglycemia is associated with cardiovascular disease (CVD) in Asian populations with type 2 diabetes (T2D). Furthermore, no study in Japan, where the prescription patterns differ from those in other countries, has examined this association. We retrospectively included 58 223 patients (18-74 years old) with T2D. First, we examined the potential predictors of severe hypoglycemia. Then, we investigated the association between severe hypoglycemia and CVD risk. Finally, we performed an updated systematic review and meta-analysis to incorporate our findings and recently published studies into the previous systematic review and meta-analysis. During 134 597 person-years from cumulative observation periods, 128 persons experienced severe hypoglycemia and 550 developed CVD events. In a multivariate Cox proportional hazard model, severe hypoglycemia was strongly and positively associated with the risk of CVD (multivariate-adjusted hazard ratio, 3.39; 95% CI, 1.25-9.18). In a propensity score-matched cohort that had similar baseline characteristics for patients with severe hypoglycemia and those without, severe hypoglycemia was more strongly associated with the risk of CVD. An updated systematic review and meta-analysis that included 10 studies found that severe hypoglycemia was associated with an ≈2-fold increased risk of CVD (pooled relative risk, 1.91; 95% CI, 1.69-2.15). Our results suggest that severe hypoglycemia is strongly associated with an increased risk of CVD in Japanese patients with T2D, further supporting the notion that avoiding severe hypoglycemia may be important in preventing CVD in this patient population. © 2016 The Authors. Published on behalf of the American Heart Association, Inc., by Wiley Blackwell.

  13. DNA Repair Biomarkers Predict Response to Neoadjuvant Chemoradiotherapy in Esophageal Cancer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Alexander, Brian M., E-mail: bmalexander@lroc.harvard.edu; Wang Xiaozhe; Niemierko, Andrzej

    2012-05-01

    Purpose: The addition of neoadjuvant chemoradiotherapy prior to surgical resection for esophageal cancer has improved clinical outcomes in some trials. Pathologic complete response (pCR) following neoadjuvant therapy is associated with better clinical outcome in these patients, but only 22% to 40% of patients achieve pCR. Because both chemotherapy and radiotherapy act by inducing DNA damage, we analyzed proteins selected from multiple DNA repair pathways, using quantitative immunohistochemistry coupled with a digital pathology platform, as possible biomarkers of treatment response and clinical outcome. Methods and Materials: We identified 79 patients diagnosed with esophageal cancer between October 1994 and September 2002, with biopsy tissue available, who underwent neoadjuvant chemoradiotherapy prior to surgery at the Massachusetts General Hospital and used their archived, formalin-fixed, paraffin-embedded biopsy samples to create tissue microarrays (TMA). TMA sections were stained using antibodies against proteins in various DNA repair pathways including XPF, FANCD2, PAR, MLH1, PARP1, and phosphorylated MAPKAP kinase 2 (pMK2). Stained TMA slides were evaluated using machine-based image analysis, and scoring incorporated both the intensity and the quantity of positive tumor nuclei. Biomarker scores and clinical data were assessed for correlations with clinical outcome. Results: Higher scores for MLH1 (p = 0.018) and lower scores for FANCD2 (p = 0.037) were associated with pathologic response to neoadjuvant chemoradiation on multivariable analysis. Staining of MLH1, PARP1, XPF, and PAR was associated with recurrence-free survival, and staining of PARP1 and FANCD2 was associated with overall survival on multivariable analysis. Conclusions: DNA repair proteins analyzed by immunohistochemistry may be useful as predictive markers for response to neoadjuvant chemoradiotherapy in patients with esophageal cancer. These results are hypothesis generating and need confirmation in an independent data set.

  14. The Impact of Adjuvant Postoperative Radiation Therapy and Chemotherapy on Survival After Esophagectomy for Esophageal Carcinoma.

    PubMed

    Wong, Andrew T; Shao, Meng; Rineer, Justin; Lee, Anna; Schwartz, David; Schreiber, David

    2017-06-01

    The objective of this study was to analyze the impact on overall survival (OS) from the addition of postoperative radiation with or without chemotherapy after esophagectomy, using a large, hospital-based dataset. Previous retrospective studies have suggested an OS advantage for postoperative chemoradiation over surgery alone, although prospective data are lacking. The National Cancer Data Base was queried to select patients diagnosed with stage pT3-4Nx-0M0 or pT1-4N1-3M0 esophageal carcinoma (squamous cell or adenocarcinoma) from 1998 to 2011 treated with definitive esophagectomy ± postoperative radiation and/or chemotherapy. OS was analyzed using the Kaplan-Meier method and compared using the log-rank test. Multivariate Cox regression analysis was used to identify covariates associated with OS. There were 4893 patients selected, of whom 1153 (23.6%) received postoperative radiation. Most patients receiving radiation also received sequential/concomitant chemotherapy (89.9%). For the entire cohort, postoperative radiation was associated with a statistically significant but modest absolute improvement in survival (hazard ratio 0.77; 95% CI, 0.71-0.83; P < 0.001). On subgroup analysis, postoperative radiation was associated with improved OS for patients with node-positive disease (3-yr OS 34.3% vs 27.8%, P < 0.001) or positive margins (3-yr OS 36.4% vs 18.0%, P < 0.001). When chemotherapy usage was incorporated, sequential chemotherapy was associated with the best survival (P < 0.001). Multivariate analysis revealed that the addition of chemotherapy to radiation therapy, whether sequentially or concurrently, was a strong prognostic factor for OS. In this hospital-based study, the addition of postoperative chemoradiation (either sequentially or concomitantly) after esophagectomy was associated with improved OS for patients with node-positive disease or positive margins.

  15. Incorporation of support vector machines in the LIBS toolbox for sensitive and robust classification amidst unexpected sample and system variability

    PubMed Central

    Dingari, Narahara Chari; Barman, Ishan; Myakalwar, Ashwin Kumar; Tewari, Surya P.; Kumar, G. Manoj

    2012-01-01

    Despite the intrinsic elemental analysis capability and lack of sample preparation requirements, laser-induced breakdown spectroscopy (LIBS) has not been extensively used for real world applications, e.g. quality assurance and process monitoring. Specifically, variability in sample, system and experimental parameters in LIBS studies presents a substantive hurdle for robust classification, even when standard multivariate chemometric techniques are used for analysis. Considering pharmaceutical sample investigation as an example, we propose the use of support vector machines (SVM) as a non-linear classification method over conventional linear techniques such as soft independent modeling of class analogy (SIMCA) and partial least-squares discriminant analysis (PLS-DA) for discrimination based on LIBS measurements. Using over-the-counter pharmaceutical samples, we demonstrate that application of SVM enables statistically significant improvements in prospective classification accuracy (sensitivity), due to its ability to address variability in LIBS sample ablation and plasma self-absorption behavior. Furthermore, our results reveal that SVM provides nearly 10% improvement in correct allocation rate and a concomitant reduction in misclassification rates of 75% (cf. PLS-DA) and 80% (cf. SIMCA) when measurements from samples not included in the training set are incorporated in the test data, highlighting its robustness. While further studies on a wider matrix of sample types performed using different LIBS systems are needed to fully characterize the capability of SVM to provide superior predictions, we anticipate that the improved sensitivity and robustness observed here will facilitate application of the proposed LIBS-SVM toolbox for screening drugs and detecting counterfeit samples as well as in related areas of forensic and biological sample analysis. PMID:22292496

  16. Incorporation of support vector machines in the LIBS toolbox for sensitive and robust classification amidst unexpected sample and system variability.

    PubMed

    Dingari, Narahara Chari; Barman, Ishan; Myakalwar, Ashwin Kumar; Tewari, Surya P; Kumar Gundawar, Manoj

    2012-03-20

    Despite the intrinsic elemental analysis capability and lack of sample preparation requirements, laser-induced breakdown spectroscopy (LIBS) has not been extensively used for real-world applications, e.g., quality assurance and process monitoring. Specifically, variability in sample, system, and experimental parameters in LIBS studies presents a substantive hurdle for robust classification, even when standard multivariate chemometric techniques are used for analysis. Considering pharmaceutical sample investigation as an example, we propose the use of support vector machines (SVM) as a nonlinear classification method over conventional linear techniques such as soft independent modeling of class analogy (SIMCA) and partial least-squares discriminant analysis (PLS-DA) for discrimination based on LIBS measurements. Using over-the-counter pharmaceutical samples, we demonstrate that the application of SVM enables statistically significant improvements in prospective classification accuracy (sensitivity), because of its ability to address variability in LIBS sample ablation and plasma self-absorption behavior. Furthermore, our results reveal that SVM provides nearly 10% improvement in correct allocation rate and a concomitant reduction in misclassification rates of 75% (cf. PLS-DA) and 80% (cf. SIMCA) when measurements from samples not included in the training set are incorporated in the test data, highlighting its robustness. While further studies on a wider matrix of sample types performed using different LIBS systems are needed to fully characterize the capability of SVM to provide superior predictions, we anticipate that the improved sensitivity and robustness observed here will facilitate application of the proposed LIBS-SVM toolbox for screening drugs and detecting counterfeit samples, as well as in related areas of forensic and biological sample analysis.

  17. Number of negative lymph nodes should be considered for incorporation into staging for breast cancer

    PubMed Central

    Wu, San-Gang; Wang, Yan; Zhou, Juan; Sun, Jia-Yuan; Li, Feng-Yan; Lin, Huan-Xin; He, Zhen-Yu

    2015-01-01

    This study aimed to investigate the prognostic value of the number of involved lymph nodes (pN), number of removed lymph nodes (RLNs), lymph node ratio (LNR), number of negative lymph nodes (NLNs), and log odds of positive lymph nodes (LODDS) in breast cancer patients. The records of 2,515 breast cancer patients who received a mastectomy or breast-conserving surgery were retrospectively reviewed. The log-rank test was used to compare survival curves, and Cox regression analysis was performed to identify prognostic factors. The median follow-up time was 64.2 months, and the 8-year disease-free survival (DFS) and overall survival (OS) were 74.6% and 82.3%, respectively. Univariate analysis showed that pN stage, LNR, number of RLNs, and number of NLNs were significant prognostic factors for DFS and OS (all, P < 0.05). LODDS was a significant prognostic factor for OS (P = 0.021). Multivariate analysis indicated that pN stage and the number of NLNs were independent prognostic factors for DFS and OS. A higher number of NLNs was associated with higher DFS and OS, and a higher number of involved lymph nodes were associated with poorer DFS and OS. Patients with a NLNs count > 9 had better survival (P < 0.001). Subgroup analysis showed that the NLNs count had a prognostic value in patients with different pT stages and different lymph node status (log-rank P < 0.05). For breast cancer, pN stage and NLNs count have a better prognostic value compared to the RLNs count, LNR, and LODDS. Number of negative lymph nodes should be considered for incorporation into staging for breast cancer. PMID:25973321
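
    Illustrative sketch (not from the paper): the abstract above compares several lymph node summaries without giving formulas. The Python snippet below uses the definitions commonly found in the literature, which are an assumption here: LNR as positive nodes divided by removed nodes, and LODDS as the natural log of (positive + 0.5) / (negative + 0.5). The function name and the example counts are hypothetical.

```python
import math

def lymph_node_metrics(positive, removed):
    """Return NLN count, LNR, and LODDS for one patient.

    Assumed definitions (not stated in the abstract):
      NLN   = removed - positive
      LNR   = positive / removed
      LODDS = ln((positive + 0.5) / (NLN + 0.5))
    """
    if removed <= 0 or not 0 <= positive <= removed:
        raise ValueError("need 0 <= positive <= removed and removed > 0")
    negative = removed - positive
    lnr = positive / removed
    lodds = math.log((positive + 0.5) / (negative + 0.5))
    return {"NLN": negative, "LNR": lnr, "LODDS": lodds}

# Hypothetical patient: 15 nodes removed, 3 of them involved.
print(lymph_node_metrics(positive=3, removed=15))
```

    The 0.5 offsets keep LODDS finite when a patient has no positive or no negative nodes, which is one reason it is sometimes preferred to LNR.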

  18. Particle shape effect on erosion of optical glass substrates due to microparticles

    NASA Astrophysics Data System (ADS)

    Waxman, Rachel; Gray, Perry; Guven, Ibrahim

    2018-03-01

    Impact experiments using sand particles and soda lime glass spheres were performed on four distinct glass substrates. Sand particles were characterized using optical and scanning electron microscopy. High-speed video footage from impact tests was used to calculate incoming and rebound velocities of the individual impact events, as well as the particle volume and two-dimensional sphericity. Furthermore, video analysis was used in conjunction with optical and scanning electron microscopy to relate the incoming velocity and particle shape to subsequent fractures, including both radial and lateral cracks. Indentation theory [Marshall et al., J. Am. Ceram. Soc. 65, 561-566 (1982)] was applied and correlated with lateral crack lengths. Multi-variable power law regression was performed, incorporating the particle shape into the model and was shown to have better fit to damage data than the previous indentation model.

  19. Female employment and fertility in Peninsular Malaysia: the maternal role incompatibility hypothesis reconsidered.

    PubMed

    Mason, K O; Palan, V T

    1981-11-01

    Multivariate analysis of the 1974 Malaysian Fertility and Family Survey tests the hypothesis that an inverse relationship between women's work and fertility occurs only when there are serious conflicts between working and caring for children. The results are only partly consistent with the hypothesis and suggest that normative conflicts between working and mothering affect the employment-fertility relationship in Malaysia more than spatio-temporal conflicts do. The lack of consistent evidence for the hypothesis, as well as some conceptual problems, lead us to propose an alternative framework for understanding variation in the employment-fertility relationship, both in Malaysia and elsewhere. This framework incorporates ideas from the role incompatibility hypothesis but views the employment-fertility relationship as dependent not just on role conflicts but more generally on the structure of the household's socioeconomic opportunities.

  20. Introduction of functionality, selection of topology, and enhancement of gas adsorption in multivariate metal-organic framework-177.

    PubMed

    Zhang, Yue-Biao; Furukawa, Hiroyasu; Ko, Nakeun; Nie, Weixuan; Park, Hye Jeong; Okajima, Satoshi; Cordova, Kyle E; Deng, Hexiang; Kim, Jaheon; Yaghi, Omar M

    2015-02-25

    Metal-organic framework-177 (MOF-177) is one of the most porous materials whose structure is composed of octahedral Zn4O(-COO)6 and triangular 1,3,5-benzenetribenzoate (BTB) units to make a three-dimensional extended network based on the qom topology. This topology violates a long-standing thesis where highly symmetric building units are expected to yield highly symmetric networks. In the case of octahedron and triangle combinations, MOFs based on pyrite (pyr) and rutile (rtl) nets were expected instead of qom. In this study, we have made 24 MOF-177 structures with different functional groups on the triangular BTB linker, having one or more functionalities. We find that the position of the functional groups on the BTB unit allows the selection for a specific net (qom, pyr, and rtl), and that mixing of functionalities (-H, -NH2, and -C4H4) is an important strategy for the incorporation of a specific functionality (-NO2) into MOF-177 where otherwise incorporation of such functionality would be difficult. Such mixing of functionalities to make multivariate MOF-177 structures leads to enhancement of hydrogen uptake by 25%.

  1. Introduction of Functionality, Selection of Topology, and Enhancement of Gas Adsorption in Multivariate Metal–Organic Framework-177

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhang, Yue-Biao; Furukawa, Hiroyasu; Ko, Nakeun

    2015-02-25

    Metal–organic framework-177 (MOF-177) is one of the most porous materials whose structure is composed of octahedral Zn4O(-COO)6 and triangular 1,3,5-benzenetribenzoate (BTB) units to make a three-dimensional extended network based on the qom topology. This topology violates a long-standing thesis where highly symmetric building units are expected to yield highly symmetric networks. In the case of octahedron and triangle combinations, MOFs based on pyrite (pyr) and rutile (rtl) nets were expected instead of qom. In this study, we have made 24 MOF-177 structures with different functional groups on the triangular BTB linker, having one or more functionalities. We find that the position of the functional groups on the BTB unit allows the selection for a specific net (qom, pyr, and rtl), and that mixing of functionalities (-H, -NH2, and -C4H4) is an important strategy for the incorporation of a specific functionality (-NO2) into MOF-177 where otherwise incorporation of such functionality would be difficult. Such mixing of functionalities to make multivariate MOF-177 structures leads to enhancement of hydrogen uptake by 25%.

  2. Using Interactive Graphics to Teach Multivariate Data Analysis to Psychology Students

    ERIC Educational Resources Information Center

    Valero-Mora, Pedro M.; Ledesma, Ruben D.

    2011-01-01

    This paper discusses the use of interactive graphics to teach multivariate data analysis to Psychology students. Three techniques are explored through separate activities: parallel coordinates/boxplots; principal components/exploratory factor analysis; and cluster analysis. With interactive graphics, students may perform important parts of the…

  3. A power analysis for multivariate tests of temporal trend in species composition.

    PubMed

    Irvine, Kathryn M; Dinger, Eric C; Sarr, Daniel

    2011-10-01

    Long-term monitoring programs emphasize power analysis as a tool to determine the sampling effort necessary to effectively document ecologically significant changes in ecosystems. Programs that monitor entire multispecies assemblages require a method for determining the power of multivariate statistical models to detect trend. We provide a method to simulate presence-absence species assemblage data that are consistent with increasing or decreasing directional change in species composition within multiple sites. This step is the foundation for using Monte Carlo methods to approximate the power of any multivariate method for detecting temporal trends. We focus on comparing the power of the Mantel test, permutational multivariate analysis of variance, and constrained analysis of principal coordinates. We find that the power of the various methods we investigate is sensitive to the number of species in the community, univariate species patterns, and the number of sites sampled over time. For increasing directional change scenarios, constrained analysis of principal coordinates was as powerful as or more powerful than permutational multivariate analysis of variance; the Mantel test was the least powerful. However, in our investigation of decreasing directional change, the Mantel test was typically as powerful as or more powerful than the other models.
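
    Illustrative sketch (not from the paper): the Monte Carlo idea in the abstract above is to simulate many data sets with a known trend, run the test on each, and report the fraction of rejections as the estimated power. The Python snippet below does this for a one-sided Mantel test at a single site. The simulation scheme (half the species becoming more common and half less common over time), Jaccard dissimilarity, and all function names and parameter values are assumptions made for illustration, not the authors' procedure.

```python
import numpy as np

def simulate_site(rng, n_years, n_species, trend):
    """Presence-absence matrix (years x species) with directional turnover."""
    years = np.arange(n_years) - (n_years - 1) / 2.0          # centered time
    direction = np.where(np.arange(n_species) % 2 == 0, 1.0, -1.0)
    prob = np.clip(0.5 + trend * years[:, None] * direction[None, :], 0.02, 0.98)
    return (rng.random((n_years, n_species)) < prob).astype(int)

def jaccard_distance(X):
    """Pairwise Jaccard dissimilarity between rows of a 0/1 matrix."""
    shared = X @ X.T
    richness = X.sum(axis=1)
    union = richness[:, None] + richness[None, :] - shared
    return np.where(union > 0, 1.0 - shared / np.maximum(union, 1), 0.0)

def mantel_p_value(D_comm, D_time, rng, n_perm=199):
    """One-sided permutation p-value for a positive matrix correlation."""
    iu = np.triu_indices_from(D_comm, k=1)
    observed = np.corrcoef(D_comm[iu], D_time[iu])[0, 1]
    if not np.isfinite(observed):          # degenerate data: no evidence of trend
        return 1.0
    hits = 0
    for _ in range(n_perm):
        perm = rng.permutation(D_comm.shape[0])
        r = np.corrcoef(D_comm[np.ix_(perm, perm)][iu], D_time[iu])[0, 1]
        hits += int(r >= observed)
    return (hits + 1) / (n_perm + 1)

def estimate_power(trend, n_years=10, n_species=30, n_sims=100, alpha=0.05, seed=0):
    """Fraction of simulated data sets in which the Mantel test rejects at alpha."""
    rng = np.random.default_rng(seed)
    t = np.arange(n_years)
    D_time = np.abs(t[:, None] - t[None, :]).astype(float)
    rejections = 0
    for _ in range(n_sims):
        X = simulate_site(rng, n_years, n_species, trend)
        rejections += int(mantel_p_value(jaccard_distance(X), D_time, rng) <= alpha)
    return rejections / n_sims

if __name__ == "__main__":
    for trend in (0.0, 0.03, 0.09):
        print(f"trend={trend:.2f}  estimated power={estimate_power(trend):.2f}")
```

    With `trend=0.0` the estimate approximates the test's type I error rate rather than power, which gives a quick sanity check on the simulation before varying the number of species or years.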

  4. Fourier Transform Infrared Spectroscopy (FTIR) and Multivariate Analysis for Identification of Different Vegetable Oils Used in Biodiesel Production

    PubMed Central

    Mueller, Daniela; Ferrão, Marco Flôres; Marder, Luciano; da Costa, Adilson Ben; de Cássia de Souza Schneider, Rosana

    2013-01-01

    The main objective of this study was to use infrared spectroscopy to identify vegetable oils used as raw material for biodiesel production and apply multivariate analysis to the data. Six different vegetable oil sources (canola, cotton, corn, palm, sunflower and soybeans) were used to produce biodiesel batches. The spectra were acquired by Fourier transform infrared spectroscopy using a universal attenuated total reflectance sensor (FTIR-UATR). For the multivariate analysis, principal component analysis (PCA), hierarchical cluster analysis (HCA), interval principal component analysis (iPCA) and soft independent modeling of class analogy (SIMCA) were used. The results indicate that it is possible to develop a methodology to identify vegetable oils used as raw material in the production of biodiesel by FTIR-UATR applying multivariate analysis. It was also observed that the iPCA found the best spectral range for separation of biodiesel batches using FTIR-UATR data, and with this result, the SIMCA method classified 100% of the soybean biodiesel samples. PMID:23539030

  5. Multivariate meta-analysis for non-linear and other multi-parameter associations

    PubMed Central

    Gasparrini, A; Armstrong, B; Kenward, M G

    2012-01-01

    In this paper, we formalize the application of multivariate meta-analysis and meta-regression to synthesize estimates of multi-parameter associations obtained from different studies. This modelling approach extends the standard two-stage analysis used to combine results across different sub-groups or populations. The most straightforward application is for the meta-analysis of non-linear relationships, described for example by regression coefficients of splines or other functions, but the methodology easily generalizes to any setting where complex associations are described by multiple correlated parameters. The modelling framework of multivariate meta-analysis is implemented in the package mvmeta within the statistical environment R. As an illustrative example, we propose a two-stage analysis for investigating the non-linear exposure–response relationship between temperature and non-accidental mortality using time-series data from multiple cities. Multivariate meta-analysis represents a useful analytical tool for studying complex associations through a two-stage procedure. Copyright © 2012 John Wiley & Sons, Ltd. PMID:22807043

  6. The Potential of Multivariate Analysis in Assessing Students' Attitude to Curriculum Subjects

    ERIC Educational Resources Information Center

    Gaotlhobogwe, Michael; Laugharne, Janet; Durance, Isabelle

    2011-01-01

    Background: Understanding student attitudes to curriculum subjects is central to providing evidence-based options to policy makers in education. Purpose: We illustrate how quantitative approaches used in the social sciences and based on multivariate analysis (categorical Principal Components Analysis, Clustering Analysis and General Linear…

  7. Two-sample tests and one-way MANOVA for multivariate biomarker data with nondetects.

    PubMed

    Thulin, M

    2016-09-10

    Testing whether the mean vector of a multivariate set of biomarkers differs between several populations is an increasingly common problem in medical research. Biomarker data is often left censored because some measurements fall below the laboratory's detection limit. We investigate how such censoring affects multivariate two-sample and one-way multivariate analysis of variance tests. Type I error rates, power and robustness to increasing censoring are studied, under both normality and non-normality. Parametric tests are found to perform better than non-parametric alternatives, indicating that the current recommendations for analysis of censored multivariate data may have to be revised. Copyright © 2016 John Wiley & Sons, Ltd.

  8. A non-iterative extension of the multivariate random effects meta-analysis.

    PubMed

    Makambi, Kepher H; Seung, Hyunuk

    2015-01-01

    Multivariate methods in meta-analysis are becoming popular and more accepted in biomedical research despite computational issues in some of the techniques. A number of approaches, both iterative and non-iterative, have been proposed including the multivariate DerSimonian and Laird method by Jackson et al. (2010), which is non-iterative. In this study, we propose an extension of the method by Hartung and Makambi (2002) and Makambi (2001) to multivariate situations. A comparison of the bias and mean square error from a simulation study indicates that, in some circumstances, the proposed approach performs better than the multivariate DerSimonian-Laird approach. An example is presented to demonstrate the application of the proposed approach.
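    For orientation, the non-iterative character of the DerSimonian and Laird approach is easiest to see in the univariate case. The sketch below implements only the standard univariate moment estimator; it is not the multivariate method of Jackson et al. nor the extension proposed in this record, and the function name and argument layout are assumptions.

```python
def dersimonian_laird(effects, variances):
    """Univariate DerSimonian-Laird random-effects meta-analysis.

    effects   -- per-study effect estimates y_i
    variances -- per-study within-study variances v_i
    Returns (pooled estimate, its standard error, between-study variance tau^2).
    """
    k = len(effects)
    w = [1.0 / v for v in variances]                   # fixed-effect weights
    sw = sum(w)
    fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sw
    # Cochran's Q: weighted squared deviations from the fixed-effect mean
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, effects))
    c = sw - sum(wi ** 2 for wi in w) / sw
    # Moment estimator of tau^2, truncated at zero; no iteration needed
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0
    w_star = [1.0 / (v + tau2) for v in variances]     # random-effects weights
    pooled = sum(wi * yi for wi, yi in zip(w_star, effects)) / sum(w_star)
    se = (1.0 / sum(w_star)) ** 0.5
    return pooled, se, tau2
```

    The between-study variance is obtained in closed form from Q, which is why the method needs no iterative fitting. The multivariate versions replace the scalar quantities with matrices.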

  9. Multivariate Autoregressive Modeling and Granger Causality Analysis of Multiple Spike Trains

    PubMed Central

    Krumin, Michael; Shoham, Shy

    2010-01-01

    Recent years have seen the emergence of microelectrode arrays and optical methods allowing simultaneous recording of spiking activity from populations of neurons in various parts of the nervous system. The analysis of multiple neural spike train data could benefit significantly from existing methods for multivariate time-series analysis which have proven to be very powerful in the modeling and analysis of continuous neural signals like EEG signals. However, those methods have not generally been well adapted to point processes. Here, we use our recent results on correlation distortions in multivariate Linear-Nonlinear-Poisson spiking neuron models to derive generalized Yule-Walker-type equations for fitting “hidden” Multivariate Autoregressive models. We use this new framework to perform Granger causality analysis in order to extract the directed information flow pattern in networks of simulated spiking neurons. We discuss the relative merits and limitations of the new method. PMID:20454705

  10. A refined method for multivariate meta-analysis and meta-regression.

    PubMed

    Jackson, Daniel; Riley, Richard D

    2014-02-20

    Making inferences about the average treatment effect using the random effects model for meta-analysis is problematic in the common situation where there is a small number of studies. This is because estimates of the between-study variance are not precise enough to accurately apply the conventional methods for testing and deriving a confidence interval for the average effect. We have found that a refined method for univariate meta-analysis, which applies a scaling factor to the estimated effects' standard error, provides more accurate inference. We explain how to extend this method to the multivariate scenario and show that our proposal for refined multivariate meta-analysis and meta-regression can provide more accurate inferences than the more conventional approach. We explain how our proposed approach can be implemented using standard output from multivariate meta-analysis software packages and apply our methodology to two real examples. Copyright © 2013 John Wiley & Sons, Ltd.

  11. Multivariate missing data in hydrology - Review and applications

    NASA Astrophysics Data System (ADS)

    Ben Aissia, Mohamed-Aymen; Chebana, Fateh; Ouarda, Taha B. M. J.

    2017-12-01

    Water resources planning and management require complete data sets of a number of hydrological variables, such as flood peaks and volumes. However, hydrologists are often faced with the problem of missing data (MD) in hydrological databases. Several methods are used to deal with the imputation of MD. During the last decade, multivariate approaches have gained popularity in the field of hydrology, especially in hydrological frequency analysis (HFA). However, treating the MD remains neglected in the multivariate HFA literature, whereas the focus has been mainly on the modeling component. For a complete analysis and in order to optimize the use of data, MD should also be treated in the multivariate setting prior to modeling and inference. Imputation of MD in the multivariate hydrological framework can have direct implications on the quality of the estimation. Indeed, the dependence between the series represents important additional information that can be included in the imputation process. The objective of the present paper is to highlight the importance of treating MD in multivariate hydrological frequency analysis by reviewing and applying multivariate imputation methods and by comparing univariate and multivariate imputation methods. An application is carried out for multiple flood attributes on three sites in order to evaluate the performance of the different methods based on the leave-one-out procedure. The results indicate that the performance of imputation methods can be improved by adopting the multivariate setting, compared to mean substitution and interpolation methods, especially when using the copula-based approach.

  12. Development of Pattern Recognition Techniques for the Evaluation of Toxicant Impacts to Multispecies Systems

    DTIC Science & Technology

    1993-06-18

    ...rule rather than the exception. In the Standardized Aquatic Microcosm and the Mixed Flask Culture (MFC) microcosms, multivariate analysis and clustering methods...experiments using two microcosm protocols. We use nonmetric clustering, a multivariate pattern recognition technique developed by Matthews and Hearne (1991

  13. Multivariate analysis for scanning tunneling spectroscopy data

    NASA Astrophysics Data System (ADS)

    Yamanishi, Junsuke; Iwase, Shigeru; Ishida, Nobuyuki; Fujita, Daisuke

    2018-01-01

    We applied principal component analysis (PCA) to two-dimensional tunneling spectroscopy (2DTS) data obtained on a Si(111)-(7 × 7) surface to explore the effectiveness of multivariate analysis for interpreting 2DTS data. We demonstrated that several components that originated mainly from specific atoms at the Si(111)-(7 × 7) surface can be extracted by PCA. Furthermore, we showed that hidden components in the tunneling spectra can be decomposed (peak separation), which is difficult to achieve with normal 2DTS analysis without the support of theoretical calculations. Our analysis showed that multivariate analysis can be an additional powerful way to analyze 2DTS data and extract hidden information from a large amount of spectroscopic data.

  14. Multivariate Analysis of Schools and Educational Policy.

    ERIC Educational Resources Information Center

    Kiesling, Herbert J.

    This report describes a multivariate analysis technique that approaches the problems of educational production function analysis by (1) using comparable measures of output across large experiments, (2) accounting systematically for differences in socioeconomic background, and (3) treating the school as a complete system in which different…

  15. An In Situ One-Pot Synthetic Approach towards Multivariate Zirconium MOFs.

    PubMed

    Sun, Yujia; Sun, Lixian; Feng, Dawei; Zhou, Hong-Cai

    2016-05-23

    Chemically highly stable MOFs incorporating multiple functionalities are of great interest for applications under harsh environments. Herein, we presented a facile one-pot synthetic strategy to incorporate multiple functionalities into stable Zr-MOFs from mixed ligands of different geometry and connectivity. Via our strategy, tetratopic tetrakis(4-carboxyphenyl)porphyrin (TCPP) ligands were successfully integrated into UiO-66 while maintaining the crystal structure, morphology, and ultrahigh chemical stability of UiO-66. The amount of incorporated TCPP is controllable. Through various combinations of BDC derivatives and TCPP, 49 MOFs with multiple functionalities were obtained. Among them, MOFs modified with FeTCPPCl were demonstrated to be catalytically active for the oxidation of ABTS. We anticipate our strategy to provide a facile route to introduce multiple functionalities into stable Zr-MOFs for a wide variety of potential applications. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  16. Representing general theoretical concepts in structural equation models: The role of composite variables

    USGS Publications Warehouse

    Grace, J.B.; Bollen, K.A.

    2008-01-01

    Structural equation modeling (SEM) holds the promise of providing natural scientists the capacity to evaluate complex multivariate hypotheses about ecological systems. Building on its predecessors, path analysis and factor analysis, SEM allows for the incorporation of both observed and unobserved (latent) variables into theoretically-based probabilistic models. In this paper we discuss the interface between theory and data in SEM and the use of an additional variable type, the composite. In simple terms, composite variables specify the influences of collections of other variables and can be helpful in modeling heterogeneous concepts of the sort commonly of interest to ecologists. While long recognized as a potentially important element of SEM, composite variables have received very limited use, in part because of a lack of theoretical consideration, but also because of difficulties that arise in parameter estimation when using conventional solution procedures. In this paper we present a framework for discussing composites and demonstrate how the use of partially-reduced-form models can help to overcome some of the parameter estimation and evaluation problems associated with models containing composites. Diagnostic procedures for evaluating the most appropriate and effective use of composites are illustrated with an example from the ecological literature. It is argued that an ability to incorporate composite variables into structural equation models may be particularly valuable in the study of natural systems, where concepts are frequently multifaceted and the influences of suites of variables are often of interest. © Springer Science+Business Media, LLC 2007.

  17. The Interface Between Theory and Data in Structural Equation Models

    USGS Publications Warehouse

    Grace, James B.; Bollen, Kenneth A.

    2006-01-01

    Structural equation modeling (SEM) holds the promise of providing natural scientists the capacity to evaluate complex multivariate hypotheses about ecological systems. Building on its predecessors, path analysis and factor analysis, SEM allows for the incorporation of both observed and unobserved (latent) variables into theoretically based probabilistic models. In this paper we discuss the interface between theory and data in SEM and the use of an additional variable type, the composite, for representing general concepts. In simple terms, composite variables specify the influences of collections of other variables and can be helpful in modeling general relationships of the sort commonly of interest to ecologists. While long recognized as a potentially important element of SEM, composite variables have received very limited use, in part because of a lack of theoretical consideration, but also because of difficulties that arise in parameter estimation when using conventional solution procedures. In this paper we present a framework for discussing composites and demonstrate how the use of partially reduced form models can help to overcome some of the parameter estimation and evaluation problems associated with models containing composites. Diagnostic procedures for evaluating the most appropriate and effective use of composites are illustrated with an example from the ecological literature. It is argued that an ability to incorporate composite variables into structural equation models may be particularly valuable in the study of natural systems, where concepts are frequently multifaceted and the influences of suites of variables are often of interest.

  18. A model and nomogram to predict tumor site origin for squamous cell cancer confined to cervical lymph nodes.

    PubMed

    Ali, Arif N; Switchenko, Jeffrey M; Kim, Sungjin; Kowalski, Jeanne; El-Deiry, Mark W; Beitler, Jonathan J

    2014-11-15

    The current study was conducted to develop a multifactorial statistical model to predict the specific head and neck (H&N) tumor site origin in cases of squamous cell carcinoma confined to the cervical lymph nodes ("unknown primaries"). The Surveillance, Epidemiology, and End Results (SEER) database was analyzed for patients with an H&N tumor site who were diagnosed between 2004 and 2011. The SEER patients were identified according to their H&N primary tumor site and clinically positive cervical lymph node levels at the time of presentation. The SEER patient data set was randomly divided into 2 data sets for the purposes of internal split-sample validation. The effects of cervical lymph node levels, age, race, and sex on H&N primary tumor site were examined using univariate and multivariate analyses. Multivariate logistic regression models and an associated set of nomograms were developed based on relevant factors to provide probabilities of tumor site origin. Analysis of the SEER database identified 20,011 patients with H&N disease with both site-level and lymph node-level data. Sex, race, age, and lymph node levels were associated with primary H&N tumor site (nasopharynx, hypopharynx, oropharynx, and larynx) in the multivariate models. Internal validation techniques affirmed the accuracy of these models on separate data. The incorporation of epidemiologic and lymph node data into a predictive model has the potential to provide valuable guidance to clinicians in the treatment of patients with squamous cell carcinoma confined to the cervical lymph nodes. © 2014 The Authors. Cancer published by Wiley Periodicals, Inc. on behalf of American Cancer Society.

  19. Psychological distress, health and treatment-related factors among individuals initiating ART in Oromia, Ethiopia.

    PubMed

    Parcesepe, Angela M; Tymejczyk, Olga; Remien, Robert; Gadisa, Tsigereda; Kulkarni, Sarah Gorrell; Hoffman, Susie; Melaku, Zenebe; Elul, Batya; Nash, Denis

    2018-03-01

    HIV diagnosis may be a source of psychological distress. Late initiation of antiretroviral therapy (ART) and treatment-related beliefs may intensify psychological distress among those recently diagnosed. This analysis describes the prevalence of psychological distress among people living with HIV (PLWH) and examines the association of recent HIV diagnosis, late ART initiation and treatment-related beliefs with psychological distress. The sample includes 1175 PLWH aged 18 or older initiating ART at six HIV clinics in Ethiopia. Psychological distress was assessed with the Kessler Psychological Distress Scale. Scores ≥ 29 were categorized as severe psychological distress. Individuals who received their first HIV diagnosis in the past 90 days were categorized as recently diagnosed. Multivariable logistic regression modeled the association of recent diagnosis, late ART initiation and treatment-related beliefs with severe psychological distress, controlling for age, sex, education, area of residence, relationship status, and health facility. Among respondents, 29.5% reported severe psychological distress, 46.6% were recently diagnosed and 31.0% initiated ART late. In multivariable models, relative to those who did not initiate ART late and had longer time since diagnosis, odds of severe psychological distress were significantly greater among those with recent diagnosis and late ART initiation (adjusted OR [aOR]: 1.9 [95% CI 1.4, 2.8]). Treatment-related beliefs were not associated with severe psychological distress in multivariable models. Severe psychological distress was highly prevalent, particularly among those who were recently diagnosed and initiated ART late. Greater understanding of the relationship between psychological distress, recent diagnosis, and late ART initiation can inform interventions to reduce psychological distress among this population. Mental health screening and interventions should be incorporated into routine HIV clinical care from diagnosis through treatment.
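    As a rough illustration of the quantities reported in this abstract, the sketch below dichotomizes a distress score at the stated cutoff of 29 and computes an unadjusted odds ratio with a Wald 95% confidence interval from a 2x2 table. This is a simplification: the study reports adjusted odds ratios from multivariable logistic regression, which this code does not reproduce. The function names are assumptions, and any counts passed in are the caller's own, not data from the study.

```python
import math


def is_severe_distress(score, cutoff=29):
    """Scores at or above the cutoff are categorized as severe distress."""
    return score >= cutoff


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio and Wald confidence interval from a 2x2 table.

    a -- exposed, with outcome       b -- exposed, without outcome
    c -- unexposed, with outcome     d -- unexposed, without outcome
    All four cells must be positive. z=1.96 gives an approximate 95% interval.
    """
    odds_ratio = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # SE of log(OR)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper
```

    An interval that excludes 1 corresponds to an association that is significant at roughly the 5% level. Adjusting for covariates such as age or health facility requires a regression model rather than a single table.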

  20. Multivariate statistical analysis: Principles and applications to coorbital streams of meteorite falls

    NASA Technical Reports Server (NTRS)

    Wolf, S. F.; Lipschutz, M. E.

    1993-01-01

    Multivariate statistical analysis techniques (linear discriminant analysis and logistic regression) can provide powerful discrimination tools which are generally unfamiliar to the planetary science community. Fall parameters were used to identify a group of 17 H chondrites (Cluster 1) that were part of a coorbital stream which intersected Earth's orbit in May, from 1855 to 1895, and can be distinguished from all other H chondrite falls. Using multivariate statistical techniques, it was demonstrated that by a totally different criterion, labile trace element contents - hence thermal histories - of 13 Cluster 1 meteorites are distinguishable from those of 45 non-Cluster 1 H chondrites. Here, we focus upon the principles of multivariate statistical techniques and illustrate their application using non-meteoritic and meteoritic examples.

  1. Proton pump inhibitor use and recurrent Clostridium difficile-associated disease: a case-control analysis matched by propensity score.

    PubMed

    Kim, Yong Gil; Graham, David Y; Jang, Byung Ik

    2012-01-01

    Clostridium difficile has been increasingly diagnosed in hospitalized patients. An association between proton pump inhibitor (PPI) use and Clostridium difficile-associated disease (CDAD), and between PPI use and recurrent CDAD, has been suggested. The aim of this study is to investigate whether PPI use is associated with the development of recurrent CDAD. This was a retrospective case-control study of patients with CDAD at Yeungnam University Medical Center, seen from January 2004 to December 2008. C. difficile infection was diagnosed by the presence of C. difficile toxin in the stool. Those with recurrent disease were matched with nonrecurrent controls using multivariate matched sampling methods that incorporated the propensity score. Recurrent CDAD developed in 28 (14.1%) of the 198 patients with diarrhea and positive C. difficile stool toxin assays. Multivariate analysis of the total population of recurrent versus nonrecurrent CDAD revealed that additional use of non-C. difficile antimicrobial therapy (concomitant with the treatment or after or both), poor response to therapy with metronidazole or vancomycin, and recent gastrointestinal surgery were risk factors for recurrent CDAD. We were able to match 21 recurrent CDAD subjects with 21 without recurrent CDAD. Among the matched patients only PPI use was associated with recurrent CDAD (ie, 47.6% vs. 4.8%, P=0.004 for recurrent vs. nonrecurrent CDAD, respectively). Among the matched patient groups, only PPI therapy was associated with recurrent CDAD. Prospective studies are needed to clarify whether avoidance of PPIs or specific cotherapies will reduce the incidence of recurrent C. difficile-associated diarrhea.
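    The abstract does not specify the matching algorithm beyond "multivariate matched sampling methods that incorporated the propensity score". One common and simple variant, shown here purely as an assumed illustration, is greedy 1:1 nearest-neighbor matching on a precomputed propensity score with a caliper. The function name, the caliper value, and the identifiers in the usage note are all hypothetical.

```python
def greedy_match(case_scores, control_scores, caliper=0.05):
    """Greedy 1:1 nearest-neighbor matching on the propensity score.

    case_scores    -- dict mapping case id -> propensity score
    control_scores -- dict mapping control id -> propensity score
    caliper        -- maximum allowed score difference for a valid match
    Returns a list of (case id, control id) pairs; each control is used once.
    """
    available = dict(control_scores)       # controls not yet matched
    pairs = []
    # Process cases in order of increasing propensity score
    for case_id, score in sorted(case_scores.items(), key=lambda kv: kv[1]):
        if not available:
            break
        # Closest remaining control by absolute score difference
        best = min(available, key=lambda cid: abs(available[cid] - score))
        if abs(available[best] - score) <= caliper:
            pairs.append((case_id, best))
            del available[best]            # match without replacement
    return pairs
```

    For example, `greedy_match({"r1": 0.30, "r2": 0.70}, {"c1": 0.32, "c2": 0.69, "c3": 0.10})` pairs each recurrent case with its closest control and leaves `c3` unmatched. Cases with no control inside the caliper are dropped, which is consistent with only a subset of cases being matched in a study of this kind.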

  2. Combining Raman and FT-IR spectroscopy with quantitative isotopic labeling for differentiation of E. coli cells at community and single cell levels.

    PubMed

    Muhamadali, Howbeer; Chisanga, Malama; Subaihi, Abdu; Goodacre, Royston

    2015-04-21

    There is no doubt that the contribution of microbially mediated bioprocesses toward maintenance of life on earth is vital. However, understanding these microbes in situ is currently a bottleneck, as most methods require culturing these microorganisms to suitable biomass levels so that their phenotype can be measured. The development of new culture-independent strategies such as stable isotope probing (SIP) coupled with molecular biology has been a breakthrough toward linking gene to function, while circumventing in vitro culturing. In this study, for the first time we have combined Raman spectroscopy and Fourier transform infrared (FT-IR) spectroscopy, as metabolic fingerprinting approaches, with SIP to demonstrate the quantitative labeling and differentiation of Escherichia coli cells. E. coli cells were grown in minimal medium with fixed final concentrations of carbon and nitrogen supply, but with different ratios and combinations of (13)C/(12)C glucose and (15)N/(14)N ammonium chloride, as the sole carbon and nitrogen sources, respectively. The cells were collected at stationary phase and examined by Raman and FT-IR spectroscopies. The multivariate analysis investigation of FT-IR and Raman data illustrated unique clustering patterns resulting from specific spectral shifts upon the incorporation of different isotopes, which were directly correlated with the ratio of the isotopically labeled content of the medium. Multivariate analysis results of single-cell Raman spectra followed the same trend, exhibiting a separation between E. coli cells labeled with different isotopes and multiple isotope levels of C and N.

  3. The Effect of Patient and Surgical Characteristics on Renal Function After Partial Nephrectomy.

    PubMed

    Winer, Andrew G; Zabor, Emily C; Vacchio, Michael J; Hakimi, A Ari; Russo, Paul; Coleman, Jonathan A; Jaimes, Edgar A

    2018-06-01

    The purpose of the study was to identify patient and disease characteristics that have an adverse effect on renal function after partial nephrectomy. We conducted a retrospective review of 387 patients who underwent partial nephrectomy for renal tumors between 2006 and 2014. A line plot with a locally weighted scatterplot smoothing was generated to visually assess renal function over time. Univariable and multivariable longitudinal regression analyses incorporated a random intercept and slope to evaluate the association between patient and disease characteristics with renal function after surgery. Median age was 60 years and most patients were male (255 patients [65.9%]) and white (343 patients [88.6%]). In univariable analysis, advanced age at surgery, larger tumor size, male sex, longer ischemia time, history of smoking, and hypertension were significantly associated with lower preoperative estimated glomerular filtration rate (eGFR). In multivariable analysis, independent predictors of reduced renal function after surgery included advanced age, lower preoperative eGFR, and longer ischemia time. Length of time from surgery was strongly associated with improvement in renal function among all patients. Independent predictors of postoperative decline in renal function include advanced age, lower preoperative eGFR, and longer ischemia time. A substantial number of subjects had recovery in renal function over time after surgery, which continued past the 12-month mark. These findings suggest that patients who undergo partial nephrectomy can experience long-term improvement in renal function. This improvement is most pronounced among younger patients with higher preoperative eGFR. Copyright © 2017 Elsevier Inc. All rights reserved.

  4. Advanced multivariate data analysis to determine the root cause of trisulfide bond formation in a novel antibody-peptide fusion.

    PubMed

    Goldrick, Stephen; Holmes, William; Bond, Nicholas J; Lewis, Gareth; Kuiper, Marcel; Turner, Richard; Farid, Suzanne S

    2017-10-01

    Product quality heterogeneities, such as a trisulfide bond (TSB) formation, can be influenced by multiple interacting process parameters. Identifying their root cause is a major challenge in biopharmaceutical production. To address this issue, this paper describes the novel application of advanced multivariate data analysis (MVDA) techniques to identify the process parameters influencing TSB formation in a novel recombinant antibody-peptide fusion expressed in mammalian cell culture. The screening dataset was generated with a high-throughput (HT) micro-bioreactor system (Ambr™ 15) using a design of experiments (DoE) approach. The complex dataset was firstly analyzed through the development of a multiple linear regression model focusing solely on the DoE inputs and identified the temperature, pH and initial nutrient feed day as important process parameters influencing this quality attribute. To further scrutinize the dataset, a partial least squares model was subsequently built incorporating both on-line and off-line process parameters and enabled accurate predictions of the TSB concentration at harvest. Process parameters identified by the models to promote and suppress TSB formation were implemented on five 7 L bioreactors and the resultant TSB concentrations were comparable to the model predictions. This study demonstrates the ability of MVDA to enable predictions of the key performance drivers influencing TSB formation that are valid also upon scale-up. Biotechnol. Bioeng. 2017;114: 2222-2234. © 2017 The Authors. Biotechnology and Bioengineering Published by Wiley Periodicals, Inc.

  5. Impact of FAB classification on predicting outcome in acute myeloid leukemia, not otherwise specified, patients undergoing allogeneic stem cell transplantation in CR1: An analysis of 1690 patients from the acute leukemia working party of EBMT.

    PubMed

    Canaani, Jonathan; Beohou, Eric; Labopin, Myriam; Socié, Gerard; Huynh, Anne; Volin, Liisa; Cornelissen, Jan; Milpied, Noel; Gedde-Dahl, Tobias; Deconinck, Eric; Fegueux, Nathalie; Blaise, Didier; Mohty, Mohamad; Nagler, Arnon

    2017-04-01

    The French, American, and British (FAB) classification system for acute myeloid leukemia (AML) is extensively used and is incorporated into the AML, not otherwise specified (NOS) category in the 2016 WHO edition of myeloid neoplasm classification. While recent data suggest that FAB classification does not provide additional prognostic information for patients for whom NPM1 status is available, it is unknown whether FAB still retains a prognostic role in predicting outcome of AML patients undergoing allogeneic stem cell transplantation. Using the European Society of Blood and Bone Marrow Transplantation registry we analyzed outcome of 1690 patients transplanted in CR1 to determine if FAB classification provides additional prognostic value. Multivariate analysis revealed that M6/M7 patients had decreased leukemia free survival (hazard ratio (HR) of 1.41, 95% confidence interval (CI), 1.01-1.99; P = .046) in addition to increased nonrelapse mortality (NRM) rates (HR, 1.79; 95% CI, 1.06-3.01; P = .028) compared with other FAB types. In the NPM1 wt AML, NOS cohort, FAB M6/M7 was also associated with increased NRM (HR, 2.17; 95% CI, 1.14-4.16; P = .019). Finally, in FLT3-ITD + patients, multivariate analyses revealed that specific FAB types were tightly associated with adverse outcome. In conclusion, FAB classification may predict outcome following transplantation in AML, NOS patients. © 2017 Wiley Periodicals, Inc.

  6. A novel classifier based on three preoperative tumor markers predicting the cancer-specific survival of gastric cancer (CEA, CA19-9 and CA72-4).

    PubMed

    Guo, Jing; Chen, Shangxiang; Li, Shun; Sun, Xiaowei; Li, Wei; Zhou, Zhiwei; Chen, Yingbo; Xu, Dazhi

    2018-01-12

    Several studies have highlighted the prognostic value of individual tumor markers and of their various combinations for gastric cancer (GC). Our study was designed to establish a novel model incorporating carcino-embryonic antigen (CEA), carbohydrate antigen 19-9 (CA19-9), and carbohydrate antigen 72-4 (CA72-4). A total of 1,566 GC patients (Primary cohort) between Jan 2000 and July 2013 were analyzed. The Primary cohort was randomly divided into a Training set (n=783) and a Validation set (n=783). A three-tumor marker classifier was developed in the Training set and validated in the Validation set by multivariate regression and risk-score analysis. We have identified a three-tumor marker classifier (including CEA, CA19-9 and CA72-4) for the cancer specific survival (CSS) of GC (p<0.001). Consistent results were obtained in both the Training set and the Validation set. Multivariate analysis showed that the classifier was an independent predictor of GC (all p values <0.001 in the Training set, Validation set and Primary cohort). Furthermore, when the leave-one-out approach was performed, the classifier showed superior predictive value to the individual markers or any two of them (with the highest AUC (Area Under Curve); 0.618 for the Training set, and 0.625 for the Validation set), which ascertained its predictive value. Our three-tumor marker classifier is closely associated with the CSS of GC and may serve as a novel model for future decisions concerning treatments.
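    The AUC values quoted above summarize how well a score separates patients with and without the event. As a minimal sketch, and not the authors' risk-score or survival analysis, the AUC for a binary outcome can be computed as the probability that a randomly chosen patient with the event has a higher score than a randomly chosen patient without it, with ties counted as one half. The function name and inputs are assumptions.

```python
def auc(event_scores, no_event_scores):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney form).

    event_scores    -- risk scores of patients who had the event
    no_event_scores -- risk scores of patients who did not
    """
    wins = 0.0
    for e in event_scores:
        for n in no_event_scores:
            if e > n:
                wins += 1.0        # correctly ordered pair
            elif e == n:
                wins += 0.5        # tie counts as half
    return wins / (len(event_scores) * len(no_event_scores))
```

    A value of 0.5 means the score orders patients no better than chance, and 1.0 means perfect separation. Comparing the AUC of a combined score with the AUC of each single marker is one way to carry out the kind of comparison the abstract describes.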

  7. Metastatic Lymph Node Burden and Survival in Oral Cavity Cancer

    PubMed Central

    Kim, Sungjin; Tighiouart, Mourad; Gudino, Cynthia; Mita, Alain; Scher, Kevin S.; Laury, Anna; Prasad, Ravi; Shiao, Stephen L.; Van Eyk, Jennifer E.; Zumsteg, Zachary S.

    2017-01-01

    Purpose: Current staging systems for oral cavity cancers incorporate lymph node (LN) size and laterality, but place less weight on the total number of positive metastatic nodes. We investigated the independent impact of numerical metastatic LN burden on survival. Methods: Adult patients with oral cavity squamous cell carcinoma undergoing upfront surgical resection for curative intent were identified in the National Cancer Data Base between 2004 and 2013. A neck dissection of a minimum of 10 LNs was required. Multivariable models were constructed to assess the association between the number of metastatic LNs and survival, adjusting for factors such as nodal size, laterality, extranodal extension, margin status, and adjuvant treatment. Results: Overall, 14,554 patients met inclusion criteria (7,906 N0 patients; 6,648 node-positive patients). Mortality risk escalated continuously with increasing number of metastatic nodes without plateau, with the effect most pronounced with up to four LNs (HR, 1.34; 95% CI, 1.29 to 1.39; P < .001). Extranodal extension (HR, 1.41; 95% CI, 1.20 to 1.65; P < .001) and lower neck involvement (HR, 1.16; 95% CI, 1.06 to 1.27; P < .001) also predicted increased mortality. Increasing number of nodes examined was associated with improved survival, plateauing at 35 LNs (HR, 0.98; 95% CI, 0.98 to 0.99; P < .001). In multivariable models accounting for the number of metastatic nodes, contralateral LN involvement (N2c status) and LN size were not associated with mortality. A novel nodal staging system derived by recursive partitioning analysis exhibited greater concordance than the American Joint Committee on Cancer (8th edition) system. Conclusion: The number of metastatic nodes is a critical predictor of oral cavity cancer mortality, eclipsing other features such as LN size and contralaterality in prognostic value. More robust incorporation of numerical metastatic LN burden may augment staging and better inform adjuvant treatment decisions. PMID:28880746

  8. Role of subdural electrocorticography in prediction of long-term seizure outcome in epilepsy surgery

    PubMed Central

    Juhász, Csaba; Shah, Aashit; Sood, Sandeep; Chugani, Harry T.

    2009-01-01

    Since prediction of long-term seizure outcome using preoperative diagnostic modalities remains suboptimal in epilepsy surgery, we evaluated whether interictal spike frequency measures obtained from extraoperative subdural electrocorticography (ECoG) recording could predict long-term seizure outcome. This study included 61 young patients (age 0.4–23.0 years), who underwent extraoperative ECoG recording prior to cortical resection for alleviation of uncontrolled focal seizures. Patient age, frequency of preoperative seizures, neuroimaging findings, ictal and interictal ECoG measures were preoperatively obtained. The seizure outcome was prospectively measured [follow-up period: 2.5–6.4 years (mean 4.6 years)]. Univariate and multivariate logistic regression analyses determined how well preoperative demographic and diagnostic measures predicted long-term seizure outcome. Following the initial cortical resection, Engel Class I, II, III and IV outcomes were noted in 35, 6, 12 and 7 patients, respectively. One child died due to disseminated intravascular coagulation associated with pseudomonas sepsis 2 days after surgery. Univariate regression analyses revealed that incomplete removal of seizure onset zone, higher interictal spike-frequency in the preserved cortex and incomplete removal of cortical abnormalities on neuroimaging were associated with a greater risk of failing to obtain Class I outcome. Multivariate logistic regression analysis revealed that incomplete removal of seizure onset zone was the only independent predictor of failure to obtain Class I outcome. 
The goodness of regression model fit and the predictive ability of regression model were greatest in the full regression model incorporating both ictal and interictal measures [R2 0.44; Area under the receiver operating characteristic (ROC) curve: 0.81], slightly smaller in the reduced model incorporating ictal but not interictal measures (R2 0.40; Area under the ROC curve: 0.79) and slightly smaller again in the reduced model incorporating interictal but not ictal measures (R2 0.27; Area under the ROC curve: 0.77). Seizure onset zone and interictal spike frequency measures on subdural ECoG recording may both be useful in predicting the long-term seizure outcome of epilepsy surgery. Yet, the additive clinical impact of interictal spike frequency measures to predict long-term surgical outcome may be modest in the presence of ictal ECoG and neuroimaging data. PMID:19286694

  9. Multivariate analysis in the pharmaceutical industry: enabling process understanding and improvement in the PAT and QbD era.

    PubMed

    Ferreira, Ana P; Tobyn, Mike

    2015-01-01

    In the pharmaceutical industry, chemometrics is rapidly establishing itself as a tool that can be used at every step of product development and beyond: from early development to commercialization. This set of multivariate analysis methods allows the extraction of information contained in large, complex data sets thus contributing to increase product and process understanding which is at the core of the Food and Drug Administration's Process Analytical Tools (PAT) Guidance for Industry and the International Conference on Harmonisation's Pharmaceutical Development guideline (Q8). This review is aimed at providing pharmaceutical industry professionals an introduction to multivariate analysis and how it is being adopted and implemented by companies in the transition from "quality-by-testing" to "quality-by-design". It starts with an introduction to multivariate analysis and the two methods most commonly used: principal component analysis and partial least squares regression, their advantages, common pitfalls and requirements for their effective use. That is followed with an overview of the diverse areas of application of multivariate analysis in the pharmaceutical industry: from the development of real-time analytical methods to definition of the design space and control strategy, from formulation optimization during development to the application of quality-by-design principles to improve manufacture of existing commercial products.
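
    Principal component analysis, named above as one of the two most commonly used methods, can be illustrated with a small sketch. The code below finds only the leading component by power iteration on the sample covariance matrix; the function name and the toy data are assumptions for illustration, and production chemometrics work would normally rely on an established numerical library rather than this hand-rolled version.

```python
import math


def first_principal_component(rows, iters=200):
    """Return (direction, explained_variance_ratio) for the leading component.

    rows: list of observations, each a list of p numeric variables.
    """
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    centered = [[r[j] - means[j] for j in range(p)] for r in rows]
    # Sample covariance matrix (denominator n - 1).
    cov = [[sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(p)]
           for a in range(p)]
    total_var = sum(cov[j][j] for j in range(p))
    if total_var == 0:
        raise ValueError("data have no variance")
    # Power iteration from a fixed start vector; it fails if that vector
    # happens to lie in the null space of the covariance matrix.
    v = [1.0 / math.sqrt(p)] * p
    for _ in range(iters):
        w = [sum(cov[a][b] * v[b] for b in range(p)) for a in range(p)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0:
            raise ValueError("start vector lies in the null space of the covariance")
        v = [x / norm for x in w]
    cv = [sum(cov[a][b] * v[b] for b in range(p)) for a in range(p)]
    eigenvalue = sum(v[a] * cv[a] for a in range(p))
    return v, eigenvalue / total_var


if __name__ == "__main__":
    # Toy data lying exactly on the line y = 2x, so one component suffices.
    direction, ratio = first_principal_component([[1, 2], [2, 4], [3, 6]])
    print(direction, ratio)
```

    The explained-variance ratio is the usual quantity inspected when deciding how many components to retain.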

  10. Multi-variant study of obesity risk genes in African Americans: The Jackson Heart Study.

    PubMed

    Liu, Shijian; Wilson, James G; Jiang, Fan; Griswold, Michael; Correa, Adolfo; Mei, Hao

    2016-11-30

    Genome-wide association study (GWAS) has been successful in identifying obesity risk genes by single-variant association analysis. For this study, we designed a stepwise analysis strategy and aimed to identify multi-variant effects on obesity risk among candidate genes. Our analyses focused on 2137 African American participants with body mass index measured in the Jackson Heart Study and 657 common single nucleotide polymorphisms (SNPs) genotyped at 8 GWAS-identified obesity risk genes. The single-variant association test showed that no SNPs reached significance after multiple testing adjustment. The subsequent gene-gene interaction analysis, which focused on SNPs with unadjusted p-value<0.10, identified 6 significant multi-variant associations. Logistic regression showed that SNPs in these associations did not have significant linear interactions; examination of the genetic risk score showed that 4 multi-variant associations had significant additive effects of risk SNPs; and the haplotype association test showed that all multi-variant associations contained one or several combinations of particular alleles or haplotypes associated with increased obesity risk. Our study showed that obesity risk genes generated multi-variant effects, which can be additive or non-linear interactions, and that multi-variant study is an important supplement to existing GWAS for understanding genetic effects of obesity risk genes. Copyright © 2016 Elsevier B.V. All rights reserved.

  11. Instrumental Neutron Activation Analysis and Multivariate Statistics for Pottery Provenance

    NASA Astrophysics Data System (ADS)

    Glascock, M. D.; Neff, H.; Vaughn, K. J.

    2004-06-01

    The application of instrumental neutron activation analysis and multivariate statistics to archaeological studies of ceramics and clays is described. A small pottery data set from the Nasca culture in southern Peru is presented for illustration.

  12. A Study of Effects of MultiCollinearity in the Multivariable Analysis

    PubMed Central

    Yoo, Wonsuk; Mayberry, Robert; Bae, Sejong; Singh, Karan; (Peter) He, Qinghua; Lillard, James W.

    2015-01-01

    A multivariable analysis is the most popular approach when investigating associations between risk factors and disease. However, the efficiency of multivariable analysis depends heavily on the correlation structure among predictive variables. When the covariates in the model are not independent of one another, collinearity/multicollinearity problems arise in the analysis, which leads to biased estimation. This work aims to perform a simulation study with various scenarios of different collinearity structures to investigate the effects of collinearity under various correlation structures amongst predictive and explanatory variables and to compare these results with existing guidelines for deciding when collinearity is harmful. Three correlation scenarios among predictor variables are considered: (1) a bivariate collinear structure as the simplest collinearity case, (2) a multivariate collinear structure where an explanatory variable is correlated with two other covariates, (3) a more realistic scenario in which an independent variable can be expressed by various functions that include the other variables. PMID:25664257
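
    For the simplest case listed above, the bivariate collinear structure, a common diagnostic is the variance inflation factor (VIF). The sketch below is an illustration and not the authors' simulation code; the function names and toy data are assumptions. With exactly two predictors, the R² from regressing one on the other equals the squared Pearson correlation, so VIF = 1 / (1 - r²).

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def vif_two_predictors(x1, x2):
    """VIF shared by both predictors in a two-predictor model."""
    r = pearson(x1, x2)
    if abs(r) == 1.0:
        raise ValueError("perfect collinearity: VIF is unbounded")
    return 1.0 / (1.0 - r * r)


if __name__ == "__main__":
    # Toy predictors with correlation 0.6, giving a VIF of about 1.56.
    print(vif_two_predictors([1, 2, 3, 4], [2, 1, 4, 3]))
```

    A frequently cited rule of thumb treats VIF values above roughly 10 as a sign of harmful collinearity, though such thresholds are guidelines rather than strict tests.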

  13. A Study of Effects of MultiCollinearity in the Multivariable Analysis.

    PubMed

    Yoo, Wonsuk; Mayberry, Robert; Bae, Sejong; Singh, Karan; Peter He, Qinghua; Lillard, James W

    2014-10-01

    A multivariable analysis is the most popular approach when investigating associations between risk factors and disease. However, the efficiency of multivariable analysis depends heavily on the correlation structure among predictive variables. When the covariates in the model are not independent of one another, collinearity/multicollinearity problems arise in the analysis, which leads to biased estimation. This work aims to perform a simulation study with various scenarios of different collinearity structures to investigate the effects of collinearity under various correlation structures amongst predictive and explanatory variables and to compare these results with existing guidelines for deciding when collinearity is harmful. Three correlation scenarios among predictor variables are considered: (1) a bivariate collinear structure as the simplest collinearity case, (2) a multivariate collinear structure where an explanatory variable is correlated with two other covariates, (3) a more realistic scenario in which an independent variable can be expressed by various functions that include the other variables.

  14. Localization of genes involved in the metabolic syndrome using multivariate linkage analysis.

    PubMed

    Olswold, Curtis; de Andrade, Mariza

    2003-12-31

    There are no well accepted criteria for the diagnosis of the metabolic syndrome. However, the metabolic syndrome is identified clinically by the presence of three or more of these five variables: larger waist circumference, higher triglyceride levels, lower HDL-cholesterol concentrations, hypertension, and impaired fasting glucose. We use sets of two or three variables, which are available in the Framingham Heart Study data set, to localize genes responsible for this syndrome using multivariate quantitative linkage analysis. This analysis demonstrates the applicability of using multivariate linkage analysis and how its use increases the power to detect linkage when genes are involved in the same disease mechanism.
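
    The clinical rule stated above (three or more of five variables present) can be written as a short check. This is only a sketch: the field names are hypothetical, and the numeric thresholds that decide each flag are not given in the abstract, so the caller is assumed to supply the five yes/no findings directly.

```python
# Hypothetical field names for the five variables named in the abstract.
CRITERIA = (
    "large_waist",
    "high_triglycerides",
    "low_hdl",
    "hypertension",
    "impaired_fasting_glucose",
)


def criteria_count(findings):
    """Number of the five criteria flagged True; missing keys count as absent."""
    return sum(bool(findings.get(name, False)) for name in CRITERIA)


def meets_clinical_definition(findings):
    """True when three or more of the five criteria are present."""
    return criteria_count(findings) >= 3


if __name__ == "__main__":
    example = {"large_waist": True, "low_hdl": True, "hypertension": True}
    print(criteria_count(example), meets_clinical_definition(example))
```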

  15. Textural analysis of pre-therapeutic [18F]-FET-PET and its correlation with tumor grade and patient survival in high-grade gliomas.

    PubMed

    Pyka, Thomas; Gempt, Jens; Hiob, Daniela; Ringel, Florian; Schlegel, Jürgen; Bette, Stefanie; Wester, Hans-Jürgen; Meyer, Bernhard; Förster, Stefan

    2016-01-01

    Amino acid positron emission tomography (PET) with [18F]-fluoroethyl-L-tyrosine (FET) is well established in the diagnostic work-up of malignant brain tumors. Analysis of FET-PET data using tumor-to-background ratios (TBR) has been shown to be highly valuable for the detection of viable hypermetabolic brain tumor tissue; however, it has not proven equally useful for tumor grading. Recently, textural features in 18-fluorodeoxyglucose-PET have been proposed as a method to quantify the heterogeneity of glucose metabolism in a variety of tumor entities. Herein we evaluate whether textural FET-PET features are of utility for grading and prognostication in patients with high-grade gliomas. One hundred thirteen patients (70 men, 43 women) with histologically proven high-grade gliomas were included in this retrospective study. All patients received static FET-PET scans prior to first-line therapy. TBR (max and mean), volumetric parameters and textural parameters based on gray-level neighborhood difference matrices were derived from static FET-PET images. Receiver operating characteristic (ROC) and discriminant function analyses were used to assess the value for tumor grading. Kaplan-Meier curves and univariate and multivariate Cox regression were employed for analysis of progression-free and overall survival. All FET-PET textural parameters showed the ability to differentiate between World Health Organization (WHO) grade III and IV tumors (p < 0.001; AUC 0.775). Further improvement in discriminatory power was possible through a combination of texture and metabolic tumor volume, classifying 85 % of tumors correctly (AUC 0.830). TBR and volumetric parameters alone were correlated with tumor grade, but showed lower AUC values (0.644 and 0.710, respectively). Furthermore, a correlation of FET-PET texture but not TBR was shown with patient PFS and OS, proving significant in multivariate analysis as well. 
Volumetric parameters were predictive for OS, but this correlation did not hold in multivariate analysis. Determination of uptake heterogeneity in pre-therapeutic FET-PET using textural features proved valuable for the (sub-)grading of high-grade glioma as well as prediction of tumor progression and patient survival, and showed improved performance compared to standard parameters such as TBR and tumor volume. Our results underscore the importance of intratumoral heterogeneity in the biology of high-grade glial cell tumors and may contribute to individual therapy planning in the future, although they must be confirmed in prospective studies before incorporation into clinical routine.

  16. Multivariate Statistical Analysis Software Technologies for Astrophysical Research Involving Large Data Bases

    NASA Technical Reports Server (NTRS)

    Djorgovski, S. G.

    1994-01-01

    We developed a package to process and analyze the data from the digital version of the Second Palomar Sky Survey. This system, called SKICAT, incorporates the latest in machine learning and expert systems software technology, in order to classify the detected objects objectively and uniformly, and facilitate handling of the enormous data sets from digital sky surveys and other sources. The system provides a powerful, integrated environment for the manipulation and scientific investigation of catalogs from virtually any source. It serves three principal functions: image catalog construction, catalog management, and catalog analysis. Through use of the GID3* Decision Tree artificial induction software, SKICAT automates the process of classifying objects within CCD and digitized plate images. To exploit these catalogs, the system also provides tools to merge them into a large, complex database which may be easily queried and modified when new data or better methods of calibrating or classifying become available. The most innovative feature of SKICAT is the facility it provides to experiment with and apply the latest in machine learning technology to the tasks of catalog construction and analysis. SKICAT provides a unique environment for implementing these tools for any number of future scientific purposes. Initial scientific verification and performance tests have been made using galaxy counts and measurements of galaxy clustering from small subsets of the survey data, and a search for very high redshift quasars. All of the tests were successful and produced new and interesting scientific results. Attachments to this report give detailed accounts of the technical aspects of the SKICAT system, and of some of the scientific results achieved to date. We also developed a user-friendly package for multivariate statistical analysis of small and moderate-size data sets, called STATPROG. 
The package was tested extensively on a number of real scientific applications and has produced real, published results.

  17. Multivariate frequency domain analysis of protein dynamics

    NASA Astrophysics Data System (ADS)

    Matsunaga, Yasuhiro; Fuchigami, Sotaro; Kidera, Akinori

    2009-03-01

    Multivariate frequency domain analysis (MFDA) is proposed to characterize collective vibrational dynamics of protein obtained by a molecular dynamics (MD) simulation. MFDA performs principal component analysis (PCA) for a bandpass filtered multivariate time series using the multitaper method of spectral estimation. By applying MFDA to MD trajectories of bovine pancreatic trypsin inhibitor, we determined the collective vibrational modes in the frequency domain, which were identified by their vibrational frequencies and eigenvectors. At near zero temperature, the vibrational modes determined by MFDA agreed well with those calculated by normal mode analysis. At 300 K, the vibrational modes exhibited characteristic features that were considerably different from the principal modes of the static distribution given by the standard PCA. The influences of aqueous environments were discussed based on two different sets of vibrational modes, one derived from a MD simulation in water and the other from a simulation in vacuum. Using the varimax rotation, an algorithm of the multivariate statistical analysis, the representative orthogonal set of eigenmodes was determined at each vibrational frequency.

  18. Imaging of polysaccharides in the tomato cell wall with Raman microspectroscopy

    PubMed Central

    2014-01-01

    Background: The primary cell wall of fruits and vegetables is a structure mainly composed of polysaccharides (pectins, hemicelluloses, cellulose). Polysaccharides are assembled into a network and linked together. It is thought that the proportions of the cell wall components have an important influence on the mechanical properties of fruits and vegetables. Results: In this study the Raman microspectroscopy technique was applied to visualize the distribution of polysaccharides in the cell wall of fruit. The methodology of sample preparation, measurement using the Raman microscope, and multivariate image analysis are discussed. Single band imaging (for preliminary analysis) and multivariate image analysis methods (principal component analysis and multivariate curve resolution) were used for the identification and localization of the components in the primary cell wall. Conclusions: Raman microspectroscopy supported by multivariate image analysis methods is useful in distinguishing cellulose and pectins in the cell wall in tomatoes. The study shows that localization of biopolymers was possible with minimally prepared samples. PMID:24917885

  19. A refined method for multivariate meta-analysis and meta-regression

    PubMed Central

    Jackson, Daniel; Riley, Richard D

    2014-01-01

    Making inferences about the average treatment effect using the random effects model for meta-analysis is problematic in the common situation where there is a small number of studies. This is because estimates of the between-study variance are not precise enough to accurately apply the conventional methods for testing and deriving a confidence interval for the average effect. We have found that a refined method for univariate meta-analysis, which applies a scaling factor to the estimated effects’ standard error, provides more accurate inference. We explain how to extend this method to the multivariate scenario and show that our proposal for refined multivariate meta-analysis and meta-regression can provide more accurate inferences than the more conventional approach. We explain how our proposed approach can be implemented using standard output from multivariate meta-analysis software packages and apply our methodology to two real examples. © 2013 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd. PMID:23996351
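
    The idea of applying a scaling factor to the standard error of the pooled effect can be sketched for the univariate case. The code below uses one widely used form of such a scaling, commonly associated with Hartung and Knapp; treating it as the refinement meant in the abstract is an assumption, as are the function name and the choice to take the between-study variance `tau2` as a given input rather than estimating it.

```python
import math


def refined_random_effects(effects, variances, tau2):
    """Pooled estimate with conventional and scaled standard errors.

    effects:   study effect estimates y_i
    variances: within-study variances s_i^2
    tau2:      between-study variance (assumed already estimated)
    Returns (mu, se_conventional, se_refined).
    """
    k = len(effects)
    if k < 2:
        raise ValueError("need at least two studies")
    weights = [1.0 / (v + tau2) for v in variances]
    sw = sum(weights)
    mu = sum(w * y for w, y in zip(weights, effects)) / sw
    se_conventional = math.sqrt(1.0 / sw)
    # Scaling factor: weighted residual sum of squares over k - 1.
    scale = sum(w * (y - mu) ** 2 for w, y in zip(weights, effects)) / (k - 1)
    se_refined = math.sqrt(scale) * se_conventional
    return mu, se_conventional, se_refined


if __name__ == "__main__":
    # Synthetic effects and variances, for illustration only.
    print(refined_random_effects([0.1, 0.3, 0.5], [0.04, 0.04, 0.04], tau2=0.0))
```

    In this form the confidence interval is usually built from a t distribution with k - 1 degrees of freedom instead of the normal distribution, which is what widens the interval when there are few studies.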

  20. Relationship Between Parental and Adolescent eHealth Literacy and Online Health Information Seeking in Taiwan.

    PubMed

    Chang, Fong-Ching; Chiu, Chiung-Hui; Chen, Ping-Hung; Miao, Nae-Fang; Lee, Ching-Mei; Chiang, Jeng-Tung; Pan, Ying-Chun

    2015-10-01

    This study examined the relationship between parental and adolescent eHealth literacy and its impact on online health information seeking. Data were obtained from 1,869 junior high school students and 1,365 parents in Taiwan in 2013. Multivariate analysis results showed that higher levels of parental Internet skill and eHealth literacy were associated with an increase in parental online health information seeking. Parental eHealth literacy, parental active use Internet mediation, adolescent Internet literacy, and health information literacy were all related to adolescent eHealth literacy. Similarly, adolescent Internet/health information literacy, eHealth literacy, and parental active use Internet mediation, and parental online health information seeking were associated with an increase in adolescent online health information seeking. The incorporation of eHealth literacy courses into parenting programs and school education curricula is crucial to promote the eHealth literacy of parents and adolescents.

  1. Further blood genetic studies on Amazonian diversity--data from four Indian groups.

    PubMed

    Callegari-Jacques, S M; Salzano, F M; Weimer, T A; Hutz, M H; Black, F L; Santos, S E; Guerreiro, J F; Mestriner, M A; Pandey, J P

    1994-01-01

    Information related to 31 protein genetic systems was obtained for 307 individuals affiliated with the Cinta Larga, Karitiana, Surui and Kararaô Indians of northern Brazil. In terms of genetic distances the Cinta Larga showed more similarities with the Karitiana (both are Tupi-speaking tribes), while at a more distant level the Surui clustered with the Kararaô. The latter, a Cayapo subgroup, showed a completely different genetic constitution from the other subgroups of this same tribe. Both the Kararaô and Karitiana are small, remnant populations, and their gene pools have presumably been severely affected by random and founder effects. These results were incorporated with those of 25 other Amazonian Indian tribes, and analysis by two multivariate techniques confirmed a previously observed geographical dichotomy, suggesting either that the Amazon river constitutes a barrier to north-south gene flow or that latitudinally different past migrations entered the region from the west.

  2. The spatial pattern of suicide in the US in relation to deprivation, fragmentation and rurality.

    PubMed

    Congdon, Peter

    2011-01-01

    Analysis of geographical patterns of suicide and psychiatric morbidity has demonstrated the impact of latent ecological variables (such as deprivation, rurality). Such latent variables may be derived by conventional multivariate techniques from sets of observed indices (for example, by principal components), by composite variable methods or by methods which explicitly consider the spatial framework of areas and, in particular, the spatial clustering of latent risks and outcomes. This article considers a latent random variable approach to explaining geographical contrasts in suicide in the US; and it develops a spatial structural equation model incorporating deprivation, social fragmentation and rurality. The approach allows for such latent spatial constructs to be correlated both within and between areas. Potential effects of area ethnic mix are also included. The model is applied to male and female suicide deaths over 2002–06 in 3142 US counties.

  3. Multidisciplinary optimization of a controlled space structure using 150 design variables

    NASA Technical Reports Server (NTRS)

    James, Benjamin B.

    1993-01-01

    A controls-structures interaction design method is presented. The method coordinates standard finite-element structural analysis, multivariable controls, and nonlinear programming codes and allows simultaneous optimization of the structure and control system of a spacecraft. Global sensitivity equations are used to account for coupling between the disciplines. Use of global sensitivity equations helps solve optimization problems that have a large number of design variables and a high degree of coupling between disciplines. The preliminary design of a generic geostationary platform is used to demonstrate the multidisciplinary optimization method. Design problems using 15, 63, and 150 design variables to optimize truss member sizes and feedback gain values are solved and the results are presented. The goal is to reduce the total mass of the structure and the vibration control system while satisfying constraints on vibration decay rate. Incorporation of the nonnegligible mass of actuators causes an essential coupling between structural design variables and control design variables.

  4. Does ethno-cultural betrayal in trauma affect Asian American/Pacific Islander college students' mental health outcomes? An exploratory study.

    PubMed

    Gómez, Jennifer M

    2017-01-01

    Interpersonal trauma has deleterious effects on mental health, with college students experiencing relatively high rates of lifetime trauma. Asian American/Pacific Islanders (AAPIs) have the lowest rate of mental healthcare utilization. According to cultural betrayal trauma theory, societal inequality may impact within-group violence in minority populations, thus having implications for mental health. In the current exploratory study, between-group (interracial) and within-group (ethno-cultural betrayal) trauma and mental health outcomes were examined in AAPI college students. Participants (N = 108) were AAPI college students from a predominantly white university. Data collection concluded in December 2015. Participants completed online self-report measures. A multivariate analysis of variance revealed that when controlling for interracial trauma, ethno-cultural betrayal trauma significantly impacted dissociation, hallucinations, posttraumatic stress symptoms, and hypervigilance. The results have implications for incorporating identity, discrimination, and ethno-cultural betrayal trauma victimization into assessments and case conceptualizations in therapy.

  5. Content Specificity of Expectancy Beliefs and Task Values in Elementary Physical Education

    PubMed Central

    Chen, Ang; Martin, Robert; Ennis, Catherine D.; Sun, Haichun

    2015-01-01

    The curriculum may superimpose a content-specific context that mediates motivation (Bong, 2001). This study examined content specificity of the expectancy-value motivation in elementary school physical education. Students’ expectancy beliefs and perceived task values from a cardiorespiratory fitness unit, a muscular fitness unit, and a traditional skill/game unit were analyzed using constant comparison coding procedures, multivariate analysis of variance, χ2, and correlation analyses. There was no difference in the intrinsic interest value among the three content conditions. Expectancy belief, attainment, and utility values were significantly higher for the cardiorespiratory fitness curriculum. Correlations differentiated among the expectancy-value components of the content conditions, providing further evidence of content specificity in the expectancy-value motivation process. The findings suggest that expectancy beliefs and task values should be incorporated in the theoretical platform for curriculum development based on the learning outcomes that can be specified with enhanced motivation effect. PMID:18664044

  6. Market and plan characteristics related to HMO quality and improvement.

    PubMed

    Scanlon, Dennis P; Swaminathan, Shailender; Chernew, Michael; Lee, Woolton

    2006-12-01

    Existing research on health plan performance examines whether variation in plans' scores is related to enrollee and health plan traits, primarily using cross-sectional research designs. This study extends that literature by incorporating data on market characteristics using a longitudinal framework. We estimate multivariate growth models that relate plan performance on standard measures to market and HMO characteristics using an unbalanced panel of data for 1998 to 2002. We find that HMO competition is not associated with better performance or greater rates of improvement in performance on the HEDIS chronic care measures. HMO penetration, on the other hand, is positively associated with HEDIS performance in several of the chronic care process-and-outcomes measures but not with a greater rate of improvement through time. Our analysis indicates that a significant percentage of the unexplained variation in quality improvement is because of permanent, unobserved plan-level characteristics that future research should strive to identify.

  7. Beyond the Flipped Classroom: A Highly Interactive Cloud-Classroom (HIC) Embedded into Basic Materials Science Courses

    NASA Astrophysics Data System (ADS)

    Liou, Wei-Kai; Bhagat, Kaushal Kumar; Chang, Chun-Yen

    2016-06-01

    The present study compares the highly interactive cloud-classroom (HIC) system with traditional methods of teaching materials science that utilize crystal structure pictures or real crystal structure models, in order to examine its learning effectiveness across three dimensions: knowledge, comprehension and application. The aim of this study was to evaluate the HIC system, which incorporates augmented reality, virtual reality and cloud-classroom to teach basic materials science courses. The study followed a pretest-posttest quasi-experimental research design. A total of 92 students (aged 19-20 years), in a second-year undergraduate program, participated in this 18-week-long experiment. The students were divided into an experimental group and a control group. The experimental group (36 males and 10 females) was instructed utilizing the HIC system, while the control group (34 males and 12 females) was led through traditional teaching methods. Pretest, posttest, and delayed posttest scores were evaluated by multivariate analysis of covariance. The results indicated that participants in the experimental group who used the HIC system outperformed the control group, in both the posttest and the delayed posttest, across three learning dimensions. Based on these results, it is recommended that the HIC system be incorporated in formal materials science learning settings.

  8. Improving the realism of hydrologic model through multivariate parameter estimation

    NASA Astrophysics Data System (ADS)

    Rakovec, Oldrich; Kumar, Rohini; Attinger, Sabine; Samaniego, Luis

    2017-04-01

    Increased availability and quality of near real-time observations should improve understanding of predictive skills of hydrological models. Recent studies have shown the limited capability of river discharge data alone to adequately constrain different components of distributed model parameterizations. In this study, the GRACE satellite-based total water storage (TWS) anomaly is used to complement the discharge data with an aim to improve the fidelity of mesoscale hydrologic model (mHM) through multivariate parameter estimation. The study is conducted in 83 European basins covering a wide range of hydro-climatic regimes. The model parameterization complemented with the TWS anomalies leads to statistically significant improvements in (1) discharge simulations during low-flow period, and (2) evapotranspiration estimates which are evaluated against independent (FLUXNET) data. Overall, there is no significant deterioration in model performance for the discharge simulations when complemented by information from the TWS anomalies. However, considerable changes in the partitioning of precipitation into runoff components are noticed by in-/exclusion of TWS during the parameter estimation. A cross-validation test carried out to assess the transferability and robustness of the calibrated parameters to other locations further confirms the benefit of complementary TWS data. In particular, the evapotranspiration estimates show more robust performance when TWS data are incorporated during the parameter estimation, in comparison with the benchmark model constrained against discharge only. This study highlights the value for incorporating multiple data sources during parameter estimation to improve the overall realism of hydrologic model and its applications over large domains. Rakovec, O., Kumar, R., Attinger, S. and Samaniego, L. (2016): Improving the realism of hydrologic model functioning through multivariate parameter estimation. Water Resour. Res., 52, http://dx.doi.org/10.1002/2016WR019430

  9. A matrix-based method of moments for fitting the multivariate random effects model for meta-analysis and meta-regression

    PubMed Central

    Jackson, Dan; White, Ian R; Riley, Richard D

    2013-01-01

    Multivariate meta-analysis is becoming more commonly used. Methods for fitting the multivariate random effects model include maximum likelihood, restricted maximum likelihood, Bayesian estimation and multivariate generalisations of the standard univariate method of moments. Here, we provide a new multivariate method of moments for estimating the between-study covariance matrix with the properties that (1) it allows for either complete or incomplete outcomes and (2) it allows for covariates through meta-regression. Further, for complete data, it is invariant to linear transformations. Our method reduces to the usual univariate method of moments, proposed by DerSimonian and Laird, in a single dimension. We illustrate our method and compare it with some of the alternatives using a simulation study and a real example. PMID:23401213
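
    The abstract above notes that the proposed estimator reduces, in a single dimension, to the usual univariate method of moments of DerSimonian and Laird. As a rough illustration of that univariate special case only (not the authors' matrix-based method), the between-study variance can be sketched as follows. The function name and the toy inputs are hypothetical and chosen purely for illustration.

```python
def dersimonian_laird_tau2(effects, variances):
    """Univariate DerSimonian-Laird method-of-moments estimate of the
    between-study variance tau^2 (illustrative sketch only).

    effects   -- per-study effect estimates y_i
    variances -- per-study within-study variances v_i (all > 0)
    """
    if len(effects) != len(variances) or len(effects) < 2:
        raise ValueError("need at least two studies with matching inputs")

    # Fixed-effect (inverse-variance) weights and pooled estimate.
    weights = [1.0 / v for v in variances]
    sum_w = sum(weights)
    pooled = sum(w * y for w, y in zip(weights, effects)) / sum_w

    # Cochran's Q statistic: weighted squared deviations from the pooled mean.
    q = sum(w * (y - pooled) ** 2 for w, y in zip(weights, effects))

    # Moment equation: E[Q] = (k - 1) + tau^2 * (sum w - sum w^2 / sum w).
    denom = sum_w - sum(w * w for w in weights) / sum_w
    k = len(effects)

    # Truncate at zero, since a variance cannot be negative.
    return max(0.0, (q - (k - 1)) / denom)


if __name__ == "__main__":
    # Hypothetical toy data: two studies with unit within-study variance.
    print(dersimonian_laird_tau2([0.0, 2.0], [1.0, 1.0]))
```

    The truncation at zero is the conventional choice for the univariate estimator; how the multivariate generalisation handles the analogous constraint on the covariance matrix is not stated in the abstract.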

  10. Multivariate time series analysis of neuroscience data: some challenges and opportunities.

    PubMed

    Pourahmadi, Mohsen; Noorbaloochi, Siamak

    2016-04-01

    Neuroimaging data may be viewed as high-dimensional multivariate time series, and analyzed using techniques from regression analysis, time series analysis and spatiotemporal analysis. We discuss issues related to data quality, model specification, estimation, interpretation, dimensionality and causality. Some recent research areas addressing aspects of some recurring challenges are introduced. Copyright © 2015 Elsevier Ltd. All rights reserved.

  11. Data analysis techniques

    NASA Technical Reports Server (NTRS)

    Park, Steve

    1990-01-01

    A large and diverse number of computational techniques are routinely used to process and analyze remotely sensed data. These techniques include: univariate statistics; multivariate statistics; principal component analysis; pattern recognition and classification; other multivariate techniques; geometric correction; registration and resampling; radiometric correction; enhancement; restoration; Fourier analysis; and filtering. Each of these techniques will be considered, in order.

  12. Chemical structure of wood charcoal by infrared spectroscopy and multivariate analysis

    Treesearch

    Nicole Labbe; David Harper; Timothy Rials; Thomas Elder

    2006-01-01

    In this work, the effect of temperature on charcoal structure and chemical composition is investigated for four tree species. Wood charcoal carbonized at various temperatures is analyzed by mid infrared spectroscopy coupled with multivariate analysis and by thermogravimetric analysis to characterize the chemical composition during the carbonization process. The...

  13. Root Cause Analysis of Quality Defects Using HPLC-MS Fingerprint Knowledgebase for Batch-to-batch Quality Control of Herbal Drugs.

    PubMed

    Yan, Binjun; Fang, Zhonghua; Shen, Lijuan; Qu, Haibin

    2015-01-01

    The batch-to-batch quality consistency of herbal drugs has always been an important issue. This work proposes a methodology for batch-to-batch quality control based on HPLC-MS fingerprints and process knowledgebase. The extraction process of Compound E-jiao Oral Liquid was taken as a case study. After establishing the HPLC-MS fingerprint analysis method, the fingerprints of the extract solutions produced under normal and abnormal operation conditions were obtained. Multivariate statistical models were built for fault detection and a discriminant analysis model was built using the probabilistic discriminant partial-least-squares method for fault diagnosis. Based on multivariate statistical analysis, process knowledge was acquired and the cause-effect relationship between process deviations and quality defects was revealed. The quality defects were detected successfully by multivariate statistical control charts and the types of process deviations were diagnosed correctly by discriminant analysis. This work has demonstrated the benefits of combining HPLC-MS fingerprints, process knowledge and multivariate analysis for the quality control of herbal drugs. Copyright © 2015 John Wiley & Sons, Ltd.

  14. Bouncing Back: Resilience and Mastery Among HIV-Positive Older Gay and Bisexual Men.

    PubMed

    Emlet, Charles A; Shiu, Chengshi; Kim, Hyun-Jun; Fredriksen-Goldsen, Karen

    2017-02-01

    Adults with HIV infection are living into old age. It is critical we investigate positive constructs such as resilience and mastery to determine factors associated with psychological well-being. We examine HIV-related factors, adverse conditions, and psychosocial characteristics that are associated with resilience (the ability to bounce back) and mastery (sense of self-efficacy). We analyzed 2014 data from the longitudinal study Aging with Pride: National Health, Aging, and Sexuality/Gender Study (NHAS), focusing on a subsample of 335 gay and bisexual older men. Multivariate linear regression was used to identify factors that contributed or detracted from resilience and mastery in the sample recruited from 17 sites from across the United States. Resilience and mastery were independently associated with psychological health-related quality of life. In multivariate analysis, adjusting for demographic characteristics, previous diagnosis of depression was negatively associated with resilience. Time since HIV diagnosis was positively associated with mastery whereas victimization was negatively associated with mastery. Social support and community engagement were positively associated with both resilience and mastery. Individual and structural-environmental characteristics contributed to resilience and mastery. These findings can be used to develop interventions incorporating an increased understanding of factors that are associated with both resilience and mastery. © The Author 2017. Published by Oxford University Press on behalf of The Gerontological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  15. Magnitude of virologic blips is associated with a higher risk for virologic rebound in HIV-infected individuals: a recurrent events analysis.

    PubMed

    Grennan, J Troy; Loutfy, Mona R; Su, DeSheng; Harrigan, P Richard; Cooper, Curtis; Klein, Marina; Machouf, Nima; Montaner, Julio S G; Rourke, Sean; Tsoukas, Christos; Hogg, Bob; Raboud, Janet

    2012-04-15

    The importance of human immunodeficiency virus (HIV) blip magnitude on virologic rebound has been raised in clinical guidelines relating to viral load assays. Antiretroviral-naive individuals initiating combination antiretroviral therapy (cART) after 1 January 2000 and achieving virologic suppression were studied. Negative binomial models were used to identify blip correlates. Recurrent event models were used to determine the association between blips and rebound by incorporating multiple periods of virologic suppression per individual. 3550 participants (82% male; median age, 40 years) were included. In a multivariable negative binomial regression model, the Amplicor assay was associated with a lower blip rate than branched DNA (rate ratio, 0.69; P < .01), controlling for age, sex, region, baseline HIV-1 RNA and CD4 count, AIDS-defining illnesses, year of cART initiation, cART type, and HIV-1 RNA testing frequency. In a multivariable recurrent event model controlling for age, sex, intravenous drug use, cART start year, cART type, assay type, and HIV-1 RNA testing frequency, blips of 500-999 copies/mL were associated with virologic rebound (hazard ratio, 2.70; P = .002), whereas blips of 50-499 were not. HIV-1 RNA assay was an important determinant of blip rates and should be considered in clinical guidelines. Blips ≥500 copies/mL were associated with increased rebound risk.

  16. Effects of prediabetes mellitus alone or plus hypertension on subsequent occurrence of cardiovascular disease and diabetes mellitus: longitudinal study.

    PubMed

    Qiu, Miaoyan; Shen, Weili; Song, Xiaomin; Ju, Liping; Tong, Wenxin; Wang, Haiyan; Zheng, Sheng; Jin, Yan; Wu, Yixin; Wang, Weiqing; Tian, Jingyan

    2015-03-01

    Whether prediabetes mellitus alone or combined with other disorders means a higher risk for cardiovascular disease (CVD) is still controversial. This study aimed to investigate the association between prediabetes mellitus and CVD and diabetes mellitus and to explore whether prediabetes mellitus alone or combined with other syndromes, such as hypertension, could promote CVD risks significantly. This longitudinal population-based study of 1609 residents from Shanghai in Southern China was conducted between 2002 and 2014. Participants with a history of CVD at baseline were excluded from analysis. Multivariate log-binomial regression models were used to adjust for possible coexisting factors. Incidence of CVD during follow-up was 10.1%. After adjusting for age, sex, and other factors, the association between prediabetes mellitus and CVD was not observed. When hypertension was incorporated in stratifying factors, adjusted CVD risk was elevated significantly (odds ratio, 2.41; 95% confidence interval, 1.25-4.64) in the combined prediabetes mellitus and hypertension group, and coexistence of diabetes mellitus and hypertension increased CVD risk highly significantly, reaching 3.43-fold higher than the reference group. Blood glucose level within the prediabetic range is significantly associated with elevated risks for diabetes mellitus after multivariable adjustment, but it significantly increases CVD risk only when it is concurrent with other disorders, such as hypertension. © 2015 American Heart Association, Inc.

  17. Anger expression, violent behavior, and symptoms of depression among male college students in Ethiopia.

    PubMed

    Terasaki, Dale J; Gelaye, Bizu; Berhane, Yemane; Williams, Michelle A

    2009-01-12

    Depression is an important global public health problem. Given the scarcity of studies involving African youths, this study was conducted to evaluate the associations of anger expression and violent behavior with symptoms of depression among male college students. A self-administered questionnaire was used to collect information on socio-demographic and lifestyle characteristics and violent behavior among 1,176 college students in Awassa, Ethiopia in June, 2006. The questionnaire incorporated the Spielberger Anger-Out Expression (SAOE) scale and symptoms of depression were evaluated using the Patient Health Questionnaire (PHQ-9). Multivariable logistic regression procedures were used to calculate adjusted odds ratios (OR) and 95% confidence intervals (95%CI). Symptoms of depression were evident in 23.6% of participants. Some 54.3% of students reported committing at least one act of violence in the current academic year; and 29.3% of students reported high (SAOE score ≥ 15) levels of anger-expression. In multivariate analysis, moderate (OR = 1.97; 95%CI 1.33-2.93) and high (OR = 3.23; 95%CI 2.14-4.88) outward anger were statistically significantly associated with increased risks of depressive symptoms. Violent behavior was noted to be associated with depressive symptoms (OR = 1.82; 95%CI 1.37-2.40). Further research should be conducted to better characterize community and individual level determinants of anger-expression, violent behavior and depression among youths.

  18. An efficient parallel sampling technique for Multivariate Poisson-Lognormal model: Analysis with two crash count datasets

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhan, Xianyuan; Aziz, H. M. Abdul; Ukkusuri, Satish V.

    Our study investigates the Multivariate Poisson-lognormal (MVPLN) model that jointly models crash frequency and severity accounting for correlations. The ordinary univariate count models analyze crashes of different severity levels separately, ignoring the correlations among severity levels. The MVPLN model can incorporate the general correlation structure and takes account of the overdispersion in the data, which leads to a superior fit. However, the traditional estimation approach for the MVPLN model is computationally expensive, which often limits its use in practice. In this work, a parallel sampling scheme is introduced to improve the original Markov Chain Monte Carlo (MCMC) estimation approach of the MVPLN model, which significantly reduces the model estimation time. Two MVPLN models are developed using the pedestrian vehicle crash data collected in New York City from 2002 to 2006, and the highway-injury data from Washington State (5-year data from 1990 to 1994). The Deviance Information Criterion (DIC) is used to evaluate the model fitting. The estimation results show that the MVPLN models provide a superior fit over univariate Poisson-lognormal (PLN), univariate Poisson, and Negative Binomial models. Moreover, the correlations among the latent effects of different severity levels are found to be significant in both datasets, which justifies the importance of jointly modeling crash frequency and severity accounting for correlations.

  19. An efficient parallel sampling technique for Multivariate Poisson-Lognormal model: Analysis with two crash count datasets

    DOE PAGES

    Zhan, Xianyuan; Aziz, H. M. Abdul; Ukkusuri, Satish V.

    2015-11-19

    Our study investigates the Multivariate Poisson-lognormal (MVPLN) model that jointly models crash frequency and severity accounting for correlations. The ordinary univariate count models analyze crashes of different severity levels separately, ignoring the correlations among severity levels. The MVPLN model can incorporate the general correlation structure and takes account of the overdispersion in the data, which leads to a superior fit. However, the traditional estimation approach for the MVPLN model is computationally expensive, which often limits its use in practice. In this work, a parallel sampling scheme is introduced to improve the original Markov Chain Monte Carlo (MCMC) estimation approach of the MVPLN model, which significantly reduces the model estimation time. Two MVPLN models are developed using the pedestrian vehicle crash data collected in New York City from 2002 to 2006, and the highway-injury data from Washington State (5-year data from 1990 to 1994). The Deviance Information Criterion (DIC) is used to evaluate the model fitting. The estimation results show that the MVPLN models provide a superior fit over univariate Poisson-lognormal (PLN), univariate Poisson, and Negative Binomial models. Moreover, the correlations among the latent effects of different severity levels are found to be significant in both datasets, which justifies the importance of jointly modeling crash frequency and severity accounting for correlations.

  20. A Regularized Linear Dynamical System Framework for Multivariate Time Series Analysis.

    PubMed

    Liu, Zitao; Hauskrecht, Milos

    2015-01-01

    Linear Dynamical System (LDS) is an elegant mathematical framework for modeling and learning Multivariate Time Series (MTS). However, in general, it is difficult to set the dimension of an LDS's hidden state space. A small number of hidden states may not be able to model the complexities of a MTS, while a large number of hidden states can lead to overfitting. In this paper, we study learning methods that impose various regularization penalties on the transition matrix of the LDS model and propose a regularized LDS learning framework (rLDS) which aims to (1) automatically shut down LDSs' spurious and unnecessary dimensions, and consequently, address the problem of choosing the optimal number of hidden states; (2) prevent the overfitting problem given a small amount of MTS data; and (3) support accurate MTS forecasting. To learn the regularized LDS from data we incorporate a second order cone program and a generalized gradient descent method into the Maximum a Posteriori framework and use Expectation Maximization to obtain a low-rank transition matrix of the LDS model. We propose two priors for modeling the matrix which lead to two instances of our rLDS. We show that our rLDS is able to recover well the intrinsic dimensionality of the time series dynamics and it improves the predictive performance when compared to baselines on both synthetic and real-world MTS datasets.

  1. Multivariate spline methods in surface fitting

    NASA Technical Reports Server (NTRS)

    Guseman, L. F., Jr. (Principal Investigator); Schumaker, L. L.

    1984-01-01

    The use of spline functions in the development of classification algorithms is examined. In particular, a method is formulated for producing spline approximations to bivariate density functions where the density function is described by a histogram of measurements. The resulting approximations are then incorporated into a Bayesian classification procedure for which the Bayes decision regions and the probability of misclassification are readily computed. Some preliminary numerical results are presented to illustrate the method.

  2. Use of Longitudinal Data in Genetic Studies in the Genome-wide Association Studies Era: Summary of Group 14

    PubMed Central

    Kerner, Berit; North, Kari E; Fallin, M Daniele

    2010-01-01

    Participants analyzed actual and simulated longitudinal data from the Framingham Heart Study for various metabolic and cardiovascular traits. The genetic information incorporated into these investigations ranged from selected single-nucleotide polymorphisms to genome-wide association arrays. Genotypes were incorporated using a broad range of methodological approaches including conditional logistic regression, linear mixed models, generalized estimating equations, linear growth curve estimation, growth modeling, growth mixture modeling, population attributable risk fraction based on survival functions under the proportional hazards models, and multivariate adaptive splines for the analysis of longitudinal data. The specific scientific questions addressed by these different approaches also varied, ranging from a more precise definition of the phenotype, bias reduction in control selection, estimation of effect sizes and genotype associated risk, to direct incorporation of genetic data into longitudinal modeling approaches and the exploration of population heterogeneity with regard to longitudinal trajectories. The group reached several overall conclusions: 1) The additional information provided by longitudinal data may be useful in genetic analyses. 2) The precision of the phenotype definition as well as control selection in nested designs may be improved, especially if traits demonstrate a trend over time or have strong age-of-onset effects. 3) Analyzing genetic data stratified for high-risk subgroups defined by a unique development over time could be useful for the detection of rare mutations in common multi-factorial diseases. 4) Estimation of the population impact of genomic risk variants could be more precise. The challenges and computational complexity demanded by genome-wide single-nucleotide polymorphism data were also discussed. PMID:19924713

  3. Multivariate analysis: greater insights into complex systems

    USDA-ARS's Scientific Manuscript database

    Many agronomic researchers measure and collect multiple response variables in an effort to understand the more complex nature of the system being studied. Multivariate (MV) statistical methods encompass the simultaneous analysis of all random variables (RV) measured on each experimental or sampling ...

  4. Multivariate analysis of progressive thermal desorption coupled gas chromatography-mass spectrometry.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Van Benthem, Mark Hilary; Mowry, Curtis Dale; Kotula, Paul Gabriel

    Thermal decomposition of poly dimethyl siloxane compounds, Sylgard® 184 and 186, was examined using thermal desorption coupled gas chromatography-mass spectrometry (TD/GC-MS) and multivariate analysis. This work describes a method of producing multiway data using a stepped thermal desorption. The technique involves sequentially heating a sample of the material of interest with subsequent analysis in a commercial GC/MS system. The decomposition chromatograms were analyzed using multivariate analysis tools including principal component analysis (PCA), factor rotation employing the varimax criterion, and multivariate curve resolution. The results of the analysis show seven components related to offgassing of various fractions of siloxanes that vary as a function of temperature. Thermal desorption coupled with gas chromatography-mass spectrometry (TD/GC-MS) is a powerful analytical technique for analyzing chemical mixtures. It has great potential in numerous analytic areas including materials analysis, sports medicine, the detection of designer drugs, and biological research for metabolomics. Data analysis is complicated, far from automated and can result in high false positive or false negative rates. We have demonstrated a step-wise TD/GC-MS technique that removes more volatile compounds from a sample before extracting the less volatile compounds. This creates an additional dimension of separation before the GC column, while simultaneously generating three-way data. Sandia's proven multivariate analysis methods, when applied to these data, have several advantages over current commercial options. It also has demonstrated potential for success in finding and enabling identification of trace compounds. Several challenges remain, however, including understanding the sources of noise in the data, outlier detection, improving the data pretreatment and analysis methods, developing a software tool for ease of use by the chemist, and demonstrating our belief that this multivariate analysis will enable superior differentiation capabilities. In addition, noise and system artifacts challenge the analysis of GC-MS data collected on lower cost equipment, ubiquitous in commercial laboratories. This research has the potential to affect many areas of analytical chemistry including materials analysis, medical testing, and environmental surveillance. It could also provide a method to measure adsorption parameters for chemical interactions on various surfaces by measuring desorption as a function of temperature for mixtures. We have presented results of a novel method for examining offgas products of a common PDMS material. Our method involves utilizing a stepped TD/GC-MS data acquisition scheme that may be almost totally automated, coupled with multivariate analysis schemes. This method of data generation and analysis can be applied to a number of materials aging and thermal degradation studies.

  5. Recent advances in scalable non-Gaussian geostatistics: The generalized sub-Gaussian model

    NASA Astrophysics Data System (ADS)

    Guadagnini, Alberto; Riva, Monica; Neuman, Shlomo P.

    2018-07-01

    Geostatistical analysis was introduced over half a century ago to allow quantifying seemingly random spatial variations in earth quantities such as rock mineral content or permeability. The traditional approach has been to view such quantities as multivariate Gaussian random functions characterized by one or a few well-defined spatial correlation scales. There is, however, mounting evidence that many spatially varying quantities exhibit non-Gaussian behavior over a multiplicity of scales. The purpose of this minireview is not to paint a broad picture of the subject and its treatment in the literature. Instead, we focus on very recent advances in the recognition and analysis of this ubiquitous phenomenon, which transcends hydrology and the Earth sciences, brought about largely by our own work. In particular, we use porosity data from a deep borehole to illustrate typical aspects of such scalable non-Gaussian behavior, describe a very recent theoretical model that (for the first time) captures all these behavioral aspects in a comprehensive manner, show how this allows generating random realizations of the quantity conditional on sampled values, point toward ways of incorporating scalable non-Gaussian behavior in hydrologic analysis, highlight the significance of doing so, and list open questions requiring further research.

  6. KRAS Mutation as a Potential Prognostic Biomarker of Biliary Tract Cancers

    PubMed Central

    Yokoyama, Masaaki; Ohnishi, Hiroaki; Ohtsuka, Kouki; Matsushima, Satsuki; Ohkura, Yasuo; Furuse, Junji; Watanabe, Takashi; Mori, Toshiyuki; Sugiyama, Masanori

    2016-01-01

    BACKGROUND: The aim of this study was to identify the unique molecular characteristics of biliary tract cancer (BTC) for the development of novel molecular-targeted therapies. MATERIALS AND METHODS: We performed mutational analysis of KRAS, BRAF, PIK3CA, and FBXW7 and immunohistochemical analysis of EGFR and TP53 in 63 Japanese patients with BTC and retrospectively evaluated the association between the molecular characteristics and clinicopathological features of BTC. RESULTS: KRAS mutations were identified in 9 (14%) of the 63 BTC patients; no mutations were detected within the analyzed regions of BRAF, PIK3CA, and FBXW7. EGFR overexpression was observed in 5 (8%) of the 63 tumors, while TP53 overexpression was observed in 48% (30/63) of the patients. Overall survival of patients with KRAS mutation was significantly shorter than that of patients with the wild-type KRAS gene (P = 0.005). By multivariate analysis incorporating molecular and clinicopathological features, KRAS mutations and lymph node metastasis were identified to be independently associated with shorter overall survival (KRAS, P = 0.004; lymph node metastasis, P = 0.015). CONCLUSIONS: Our data suggest that KRAS mutation is a poor prognosis predictive biomarker for the survival in BTC patients. PMID:28008299

  7. Analysis and compensation for the effect of the catheter position on image intensities in intravascular optical coherence tomography

    NASA Astrophysics Data System (ADS)

    Liu, Shengnan; Eggermont, Jeroen; Wolterbeek, Ron; Broersen, Alexander; Busk, Carol A. G. R.; Precht, Helle; Lelieveldt, Boudewijn P. F.; Dijkstra, Jouke

    2016-12-01

    Intravascular optical coherence tomography (IVOCT) is an imaging technique that is used to analyze the underlying cause of cardiovascular disease. Because a catheter is used during imaging, the intensities can be affected by the catheter position. This work aims to analyze the effect of the catheter position on IVOCT image intensities and to propose a compensation method to minimize this effect in order to improve the visualization and the automatic analysis of IVOCT images. The effect of catheter position is modeled with respect to the distance between the catheter and the arterial wall (distance-dependent factor) and the incident angle onto the arterial wall (angle-dependent factor). A light transmission model incorporating both factors is introduced. On the basis of this model, the interaction effect of both factors is estimated with a hierarchical multivariant linear regression model. Statistical analysis shows that IVOCT intensities are significantly affected by both factors (p<0.001): as either factor increases, the intensity decreases. This effect differs for different pullbacks. The regression results were used to compensate for this effect. Experiments show that the proposed compensation method can improve the performance of the automatic bioresorbable vascular scaffold strut detection.

  8. Integrated GIS and multivariate statistical analysis for regional scale assessment of heavy metal soil contamination: A critical review.

    PubMed

    Hou, Deyi; O'Connor, David; Nathanail, Paul; Tian, Li; Ma, Yan

    2017-12-01

    Heavy metal soil contamination is associated with potential toxicity to humans or ecotoxicity. Scholars have increasingly used a combination of geographical information science (GIS) with geostatistical and multivariate statistical analysis techniques to examine the spatial distribution of heavy metals in soils at a regional scale. A review of such studies showed that most soil sampling programs were based on grid patterns and composite sampling methodologies. Many programs intended to characterize various soil types and land use types. The most often used sampling depth intervals were 0-0.10 m, or 0-0.20 m, below surface; and the sampling densities used ranged from 0.0004 to 6.1 samples per km², with a median of 0.4 samples per km². The most widely used spatial interpolators were inverse distance weighted interpolation and ordinary kriging; and the most often used multivariate statistical analysis techniques were principal component analysis and cluster analysis. The review also identified several determining and correlating factors in heavy metal distribution in soils, including soil type, soil pH, soil organic matter, land use type, Fe, Al, and heavy metal concentrations. The major natural and anthropogenic sources of heavy metals were found to derive from lithogenic origin, roadway and transportation, atmospheric deposition, wastewater and runoff from industrial and mining facilities, fertilizer application, livestock manure, and sewage sludge. This review argues that the full potential of integrated GIS and multivariate statistical analysis for assessing heavy metal distribution in soils on a regional scale has not yet been fully realized. It is proposed that future research be conducted to map multivariate results in GIS to pinpoint specific anthropogenic sources, to analyze temporal trends in addition to spatial patterns, to optimize modeling parameters, and to expand the use of different multivariate analysis tools beyond principal component analysis (PCA) and cluster analysis (CA). Copyright © 2017 Elsevier Ltd. All rights reserved.
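
    The review above names inverse distance weighted interpolation as one of the most widely used spatial interpolators. A minimal sketch of that general technique is given below; it is not taken from any of the reviewed studies. The function name, the default power of 2, and the sample coordinates and concentrations are all assumptions made for illustration.

```python
import math


def idw_interpolate(samples, x, y, power=2.0):
    """Inverse distance weighted estimate at (x, y).

    samples -- iterable of (sx, sy, value) tuples, e.g. sampling locations
               with a measured concentration (hypothetical data)
    power   -- distance exponent; 2 is a common default, assumed here
    """
    numerator = 0.0
    denominator = 0.0
    seen_any = False
    for sx, sy, value in samples:
        seen_any = True
        distance = math.hypot(x - sx, y - sy)
        if distance == 0.0:
            # The query point coincides with a sample: return it exactly.
            return value
        weight = 1.0 / distance ** power
        numerator += weight * value
        denominator += weight
    if not seen_any:
        raise ValueError("at least one sample is required")
    return numerator / denominator


if __name__ == "__main__":
    # Hypothetical soil samples: (easting km, northing km, concentration).
    soil_samples = [(0.0, 0.0, 10.0), (2.0, 0.0, 20.0)]
    print(idw_interpolate(soil_samples, 1.0, 0.0))
```

    Unlike ordinary kriging, the other interpolator named in the review, this scheme uses no fitted model of spatial correlation, so the choice of exponent is a tuning decision rather than an estimated quantity.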

  9. Comparison of connectivity analyses for resting state EEG data

    NASA Astrophysics Data System (ADS)

    Olejarczyk, Elzbieta; Marzetti, Laura; Pizzella, Vittorio; Zappasodi, Filippo

    2017-06-01

    Objective. In the present work, a nonlinear measure (transfer entropy, TE) was used in a multivariate approach for the analysis of effective connectivity in high density resting state EEG data in eyes open and eyes closed. Advantages of the multivariate approach in comparison to the bivariate one were tested. Moreover, the multivariate TE was compared to an effective linear measure, i.e. directed transfer function (DTF). Finally, the existence of a relationship between the information transfer and the level of brain synchronization as measured by phase synchronization value (PLV) was investigated. Approach. The comparison between the connectivity measures, i.e. bivariate versus multivariate TE, TE versus DTF, TE versus PLV, was performed by means of statistical analysis of indexes based on graph theory. Main results. The multivariate approach is less sensitive to false indirect connections with respect to the bivariate estimates. The multivariate TE differentiated better between eyes closed and eyes open conditions compared to DTF. Moreover, the multivariate TE evidenced non-linear phenomena in information transfer, which are not evidenced by the use of DTF. We also showed that the target of information flow, in particular the frontal region, is an area of greater brain synchronization. Significance. Comparison of different connectivity analysis methods pointed to the advantages of nonlinear methods, and indicated a relationship existing between the flow of information and the level of synchronization of the brain.

  10. Racial Differences in Perceptions of Air Pollution Health Risk: Does Environmental Exposure Matter?

    PubMed Central

    Chakraborty, Jayajit; Collins, Timothy W.; Grineski, Sara E.; Maldonado, Alejandra

    2017-01-01

    This article extends environmental risk perception research by exploring how potential health risk from exposure to industrial and vehicular air pollutants, as well as other contextual and socio-demographic factors, influence racial/ethnic differences in air pollution health risk perception. Our study site is the Greater Houston metropolitan area, Texas, USA—a racially/ethnically diverse area facing high levels of exposure to pollutants from both industrial and transportation sources. We integrate primary household-level survey data with estimates of excess cancer risk from ambient exposure to industrial and on-road mobile source emissions of air toxics obtained from the U.S. Environmental Protection Agency. Statistical analysis is based on multivariate generalized estimation equation models which account for geographic clustering of surveyed households. Our results reveal significantly higher risk perceptions for non-Hispanic Black residents and those exposed to greater cancer risk from industrial pollutants, and also indicate that gender influences the relationship between race/ethnicity and air pollution risk perception. These findings highlight the need to incorporate measures of environmental health risk exposure in future analysis of social disparities in risk perception. PMID:28125059

  11. Risk Zone Modelling and Early Warning System for Visceral Leishmaniasis Kala-Azar Disease in Bihar, India Using Remote Sensing and GIS

    NASA Astrophysics Data System (ADS)

    Jeyaram, A.; Kesari, S.; Bajpai, A.; Bhunia, G. S.; Krishna Murthy, Y. V. N.

    2012-07-01

    Visceral Leishmaniasis (VL), commonly known as Kala-azar, is one of the most neglected tropical diseases, affecting approximately 200 million of the poorest people at risk in 109 districts of three endemic countries, namely Bangladesh, India and Nepal, at different levels. This tropical disease is caused by the protozoan parasite Leishmania donovani and transmitted by female Phlebotomus argentipes sand flies. Analysis of disease dynamics indicates periodicity at seasonal and inter-annual temporal scales, which forms the basis for development of an advanced early warning system. The highly endemic Vaishali district, Bihar, India, was taken as the study area for model development. A systematic study of geo-environmental parameters derived from satellite data in conjunction with ground intelligence enabled modelling of infectious disease and risk villages. High-resolution Indian satellite data from IRS LISS IV (multi-spectral) and Cartosat-1 (Pan) were used to study environmental risk parameters, viz. peri-domestic vegetation, dwelling condition, wetland ecosystem, cropping pattern, Normalised Difference Vegetation Index (NDVI), detailed land use, etc., for risk assessment. Univariate analysis of the relationship between vector density and various land cover categories and climatic variables suggested that all the variables are significantly correlated. Using the variables significantly correlated with vector density, a seasonal multivariate regression model was developed incorporating geo-environmental parameters, climate variables and seasonal time series disease parameters. Linear and non-linear models were applied at periodic and inter-annual temporal scales to predict man-hour density (MHD), and an 'out-of-fit' data set was used to validate the model with reasonable accuracy. 
To improve the MHD predictive approach, a fuzzy model was also incorporated in the GIS environment, combining spatial geo-environmental and climatic variables using fuzzy membership logic. Based on the perceived importance of the geo-environmental parameters assigned by an epidemiology expert, a combined fuzzy membership was calculated. The combined fuzzy membership indicates the predictive measure of vector density in each village. A γ factor was introduced to have an increasing effect on the higher side and a decreasing effect on the lower side, which facilitated prioritisation of the villages. This approach serves not only to predict vector density but also to prioritise the villages for effective control measures. A software package for modelling the risk villages, integrating multivariate regression and fuzzy membership analysis models, has been developed to estimate MHD (vector density) as part of the early warning system.
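
    The abstract does not give the formula behind the γ factor. One common way to combine fuzzy memberships in GIS overlay is the fuzzy gamma operator, which blends the fuzzy algebraic sum (an "increasing" combination) with the fuzzy algebraic product (a "decreasing" combination). The sketch below assumes that operator; the function name and the membership values are hypothetical, and the authors' software may use a different rule.

```python
import math

def fuzzy_gamma(memberships, gamma):
    """Combine fuzzy memberships in [0, 1] with the fuzzy gamma operator.

    gamma = 0 gives the fuzzy algebraic product (never larger than the
    smallest input); gamma = 1 gives the fuzzy algebraic sum (never
    smaller than the largest input). Assumed here, not taken from the abstract.
    """
    product = math.prod(memberships)
    algebraic_sum = 1.0 - math.prod(1.0 - m for m in memberships)
    return (algebraic_sum ** gamma) * (product ** (1.0 - gamma))

# Hypothetical memberships for one village: vegetation, dwelling type, wetland proximity.
village = [0.8, 0.6, 0.7]
low_side = fuzzy_gamma(village, 0.0)   # pure product
blended = fuzzy_gamma(village, 0.9)    # weighted towards the algebraic sum
high_side = fuzzy_gamma(village, 1.0)  # pure algebraic sum
```

    Ranking villages by the combined value for a fixed γ is one way such a score could support prioritisation.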

  12. MULTIVARIATE CURVE RESOLUTION OF NMR SPECTROSCOPY METABONOMIC DATA

    EPA Science Inventory

    Sandia National Laboratories is working with the EPA to evaluate and develop mathematical tools for analysis of the collected NMR spectroscopy data. Initially, we have focused on the use of Multivariate Curve Resolution (MCR) also known as molecular factor analysis (MFA), a tech...

  13. Characterizing multivariate decoding models based on correlated EEG spectral features

    PubMed Central

    McFarland, Dennis J.

    2013-01-01

    Objective: Multivariate decoding methods are popular techniques for analysis of neurophysiological data. The present study explored potential interpretative problems with these techniques when predictors are correlated. Methods: Data from sensorimotor rhythm-based cursor control experiments were analyzed offline with linear univariate and multivariate models. Features were derived from autoregressive (AR) spectral analysis of varying model order, which produced predictors that varied in their degree of correlation (i.e., multicollinearity). Results: The use of multivariate regression models resulted in much better prediction of target position as compared to univariate regression models. However, with lower order AR features, interpretation of the spectral patterns of the weights was difficult. This is likely to be due to the high degree of multicollinearity present with lower order AR features. Conclusions: Care should be exercised when interpreting the pattern of weights of multivariate models with correlated predictors. Comparison with univariate statistics is advisable. Significance: While multivariate decoding algorithms are very useful for prediction, their utility for interpretation may be limited when predictors are correlated. PMID:23466267
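
    A quick way to see why correlated predictors make regression weights hard to interpret is the variance inflation factor (VIF), which for two predictors equals 1 / (1 - r²). The VIF is a standard diagnostic and is not mentioned in the abstract; the sketch below, with hypothetical feature values, only illustrates how quickly it grows as the correlation r approaches 1.

```python
import math

def pearson_r(a, b):
    """Pearson correlation between two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)

def vif_two_predictors(a, b):
    """Variance inflation factor when a and b are the only two predictors."""
    r = pearson_r(a, b)
    return 1.0 / (1.0 - r * r)

# Hypothetical spectral features from neighbouring frequency bins.
band_1 = [1.0, 2.0, 3.0, 4.0, 5.0]
band_2 = [1.1, 1.9, 3.2, 3.9, 5.1]      # nearly a copy of band_1
unrelated = [1.0, -2.0, 2.0, -2.0, 1.0]  # uncorrelated with band_1

vif_correlated = vif_two_predictors(band_1, band_2)
vif_uncorrelated = vif_two_predictors(band_1, unrelated)
```

    A large VIF means the variance of each estimated weight is inflated by that factor relative to the uncorrelated case, which is consistent with the abstract's advice to compare against univariate statistics.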

  14. Drunk driving detection based on classification of multivariate time series.

    PubMed

    Li, Zhenlong; Jin, Xue; Zhao, Xiaohua

    2015-09-01

    This paper addresses the problem of detecting drunk driving based on classification of multivariate time series. First, driving performance measures were collected from a test in a driving simulator located in the Traffic Research Center, Beijing University of Technology. Lateral position and steering angle were used to detect drunk driving. Second, multivariate time series analysis was performed to extract the features. A piecewise linear representation was used to represent multivariate time series. A bottom-up algorithm was then employed to separate multivariate time series. The slope and time interval of each segment were extracted as the features for classification. Third, a support vector machine classifier was used to classify driver's state into two classes (normal or drunk) according to the extracted features. The proposed approach achieved an accuracy of 80.0%. Drunk driving detection based on the analysis of multivariate time series is feasible and effective. The approach has implications for drunk driving detection. Copyright © 2015 Elsevier Ltd and National Safety Council. All rights reserved.
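
    The feature extraction step described above (piecewise linear representation, bottom-up segmentation, then slope and time interval per segment) can be sketched as follows. This is a simplified single-channel version under assumed details: the merge cost is the squared error of a least-squares line, merging stops at a hypothetical threshold `max_error`, and the slope of a finished segment is taken between its end points. The paper's exact procedure for multivariate series is not given in the abstract.

```python
def fit_error(t, y, i, j):
    """Sum of squared residuals of a least-squares line through points i..j."""
    ts, ys = t[i:j + 1], y[i:j + 1]
    n = len(ts)
    mean_t, mean_y = sum(ts) / n, sum(ys) / n
    s_tt = sum((a - mean_t) ** 2 for a in ts)
    s_ty = sum((a - mean_t) * (b - mean_y) for a, b in zip(ts, ys))
    slope = s_ty / s_tt if s_tt else 0.0
    return sum((b - mean_y - slope * (a - mean_t)) ** 2 for a, b in zip(ts, ys))

def bottom_up_segments(t, y, max_error):
    """Start from the finest segmentation and merge the cheapest neighbours."""
    bounds = list(range(len(t)))  # every sample is initially a breakpoint
    while len(bounds) > 2:
        # Cost of removing each interior breakpoint (merging its two segments).
        costs = [fit_error(t, y, bounds[k - 1], bounds[k + 1])
                 for k in range(1, len(bounds) - 1)]
        best = min(range(len(costs)), key=costs.__getitem__)
        if costs[best] > max_error:
            break
        del bounds[best + 1]
    return bounds

def segment_features(t, y, bounds):
    """(slope, time interval) for each segment between consecutive breakpoints."""
    return [((y[b] - y[a]) / (t[b] - t[a]), t[b] - t[a])
            for a, b in zip(bounds, bounds[1:])]

# Hypothetical steering-angle trace: a ramp up followed by a ramp down.
time = [0, 1, 2, 3, 4, 5, 6]
angle = [0, 1, 2, 3, 2, 1, 0]
breakpoints = bottom_up_segments(time, angle, max_error=1e-9)
features = segment_features(time, angle, breakpoints)
```

    The resulting (slope, time interval) pairs are the kind of features that would then be passed to a classifier such as the support vector machine mentioned in the abstract.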

  15. The choice of prior distribution for a covariance matrix in multivariate meta-analysis: a simulation study.

    PubMed

    Hurtado Rúa, Sandra M; Mazumdar, Madhu; Strawderman, Robert L

    2015-12-30

    Bayesian meta-analysis is an increasingly important component of clinical research, with multivariate meta-analysis a promising tool for studies with multiple endpoints. Model assumptions, including the choice of priors, are crucial aspects of multivariate Bayesian meta-analysis (MBMA) models. In a given model, two different prior distributions can lead to different inferences about a particular parameter. A simulation study was performed in which the impact of families of prior distributions for the covariance matrix of a multivariate normal random effects MBMA model was analyzed. Inferences about effect sizes were not particularly sensitive to prior choice, but the related covariance estimates were. A few families of prior distributions with small relative biases, tight mean squared errors, and close to nominal coverage for the effect size estimates were identified. Our results demonstrate the need for sensitivity analysis and suggest some guidelines for choosing prior distributions in this class of problems. The MBMA models proposed here are illustrated in a small meta-analysis example from the periodontal field and a medium meta-analysis from the study of stroke. Copyright © 2015 John Wiley & Sons, Ltd.

  16. The Decoding Toolbox (TDT): a versatile software package for multivariate analyses of functional imaging data

    PubMed Central

    Hebart, Martin N.; Görgen, Kai; Haynes, John-Dylan

    2015-01-01

    The multivariate analysis of brain signals has recently sparked a great amount of interest, yet accessible and versatile tools to carry out decoding analyses are scarce. Here we introduce The Decoding Toolbox (TDT) which represents a user-friendly, powerful and flexible package for multivariate analysis of functional brain imaging data. TDT is written in Matlab and equipped with an interface to the widely used brain data analysis package SPM. The toolbox allows running fast whole-brain analyses, region-of-interest analyses and searchlight analyses, using machine learning classifiers, pattern correlation analysis, or representational similarity analysis. It offers automatic creation and visualization of diverse cross-validation schemes, feature scaling, nested parameter selection, a variety of feature selection methods, multiclass capabilities, and pattern reconstruction from classifier weights. While basic users can implement a generic analysis in one line of code, advanced users can extend the toolbox to their needs or exploit the structure to combine it with external high-performance classification toolboxes. The toolbox comes with an example data set which can be used to try out the various analysis methods. Taken together, TDT offers a promising option for researchers who want to employ multivariate analyses of brain activity patterns. PMID:25610393

  17. A radiomics model from joint FDG-PET and MRI texture features for the prediction of lung metastases in soft-tissue sarcomas of the extremities

    NASA Astrophysics Data System (ADS)

    Vallières, M.; Freeman, C. R.; Skamene, S. R.; El Naqa, I.

    2015-07-01

    This study aims at developing a joint FDG-PET and MRI texture-based model for the early evaluation of lung metastasis risk in soft-tissue sarcomas (STSs). We investigate if the creation of new composite textures from the combination of FDG-PET and MR imaging information could better identify aggressive tumours. Towards this goal, a cohort of 51 patients with histologically proven STSs of the extremities was retrospectively evaluated. All patients had pre-treatment FDG-PET and MRI scans comprised of T1-weighted and T2-weighted fat-suppression sequences (T2FS). Nine non-texture features (SUV metrics and shape features) and forty-one texture features were extracted from the tumour region of separate (FDG-PET, T1 and T2FS) and fused (FDG-PET/T1 and FDG-PET/T2FS) scans. Volume fusion of the FDG-PET and MRI scans was implemented using the wavelet transform. The influence of six different extraction parameters on the predictive value of textures was investigated. The incorporation of features into multivariable models was performed using logistic regression. The multivariable modeling strategy involved imbalance-adjusted bootstrap resampling in the following four steps leading to final prediction model construction: (1) feature set reduction; (2) feature selection; (3) prediction performance estimation; and (4) computation of model coefficients. Univariate analysis showed that the isotropic voxel size at which texture features were extracted had the most impact on predictive value. In multivariable analysis, texture features extracted from fused scans significantly outperformed those from separate scans in terms of lung metastases prediction estimates. The best performance was obtained using a combination of four texture features extracted from FDG-PET/T1 and FDG-PET/T2FS scans. This model reached an area under the receiver-operating characteristic curve of 0.984 ± 0.002, a sensitivity of 0.955 ± 0.006, and a specificity of 0.926 ± 0.004 in bootstrapping evaluations. 
Ultimately, lung metastasis risk assessment at diagnosis of STSs could improve patient outcomes by allowing better treatment adaptation.

  18. Application of multivariable statistical techniques in plant-wide WWTP control strategies analysis.

    PubMed

    Flores, X; Comas, J; Roda, I R; Jiménez, L; Gernaey, K V

    2007-01-01

    The main objective of this paper is to present the application of selected multivariable statistical techniques in plant-wide wastewater treatment plant (WWTP) control strategies analysis. In this study, cluster analysis (CA), principal component analysis/factor analysis (PCA/FA) and discriminant analysis (DA) are applied to the evaluation matrix data set obtained by simulation of several control strategies applied to the plant-wide IWA Benchmark Simulation Model No 2 (BSM2). These techniques make it possible (i) to determine natural groups or clusters of control strategies with similar behaviour, (ii) to find and interpret hidden, complex and causal relation features in the data set and (iii) to identify important discriminant variables within the groups found by the cluster analysis. This study illustrates the usefulness of multivariable statistical techniques for both analysis and interpretation of complex multicriteria data sets and allows an improved use of information for effective evaluation of control strategies.

  19. Moving beyond Univariate Post-Hoc Testing in Exercise Science: A Primer on Descriptive Discriminate Analysis

    ERIC Educational Resources Information Center

    Barton, Mitch; Yeatts, Paul E.; Henson, Robin K.; Martin, Scott B.

    2016-01-01

    There has been a recent call to improve data reporting in kinesiology journals, including the appropriate use of univariate and multivariate analysis techniques. For example, a multivariate analysis of variance (MANOVA) with univariate post hocs and a Bonferroni correction is frequently used to investigate group differences on multiple dependent…

  20. MGAS: a powerful tool for multivariate gene-based genome-wide association analysis.

    PubMed

    Van der Sluis, Sophie; Dolan, Conor V; Li, Jiang; Song, Youqiang; Sham, Pak; Posthuma, Danielle; Li, Miao-Xin

    2015-04-01

    Standard genome-wide association studies, testing the association between one phenotype and a large number of single nucleotide polymorphisms (SNPs), are limited in two ways: (i) traits are often multivariate, and analysis of composite scores entails loss in statistical power and (ii) gene-based analyses may be preferred, e.g. to decrease the multiple testing problem. Here we present a new method, multivariate gene-based association test by extended Simes procedure (MGAS), that allows gene-based testing of multivariate phenotypes in unrelated individuals. Through extensive simulation, we show that under most trait-generating genotype-phenotype models MGAS has superior statistical power to detect associated genes compared with gene-based analyses of univariate phenotypic composite scores (i.e. GATES, multiple regression), and multivariate analysis of variance (MANOVA). Re-analysis of metabolic data revealed 32 False Discovery Rate controlled genome-wide significant genes, and 12 regions harboring multiple genes; of these 44 regions, 30 were not reported in the original analysis. MGAS allows researchers to conduct their multivariate gene-based analyses efficiently, and without the loss of power that is often associated with incorrectly specified genotype-phenotype models. MGAS is freely available in KGG v3.0 (http://statgenpro.psychiatry.hku.hk/limx/kgg/download.php). Access to the metabolic dataset can be requested at dbGaP (https://dbgap.ncbi.nlm.nih.gov/). The R-simulation code is available from http://ctglab.nl/people/sophie_van_der_sluis. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.
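
    For orientation, the classical Simes procedure that MGAS builds on combines m p-values into one by taking the minimum of m·p(j)/j over the sorted p-values p(1) ≤ ... ≤ p(m). The sketch below implements only this classical form; the extended procedure used by MGAS, which accounts for correlation among SNPs and phenotypes, is not reproduced here, and the example p-values are hypothetical.

```python
def simes_pvalue(pvalues):
    """Classical Simes combined p-value: min over j of m * p_(j) / j."""
    m = len(pvalues)
    ranked = sorted(pvalues)
    return min(m * p / rank for rank, p in enumerate(ranked, start=1))

# Hypothetical SNP-level p-values within one gene.
gene_level_p = simes_pvalue([0.01, 0.04, 0.03])
```

    With these inputs the three candidate values are 0.03, 0.045 and 0.04, so the combined p-value is driven by the smallest SNP-level p-value.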

  1. Probability of detecting atrazine/desethyl-atrazine and elevated concentrations of nitrate in ground water in Colorado

    USGS Publications Warehouse

    Rupert, Michael G.

    2003-01-01

    Draft Federal regulations may require that each State develop a State Pesticide Management Plan for the herbicides atrazine, alachlor, metolachlor, and simazine. Maps were developed that the State of Colorado could use to predict the probability of detecting atrazine and desethyl-atrazine (a breakdown product of atrazine) in ground water in Colorado. These maps can be incorporated into the State Pesticide Management Plan and can help provide a sound hydrogeologic basis for atrazine management in Colorado. Maps showing the probability of detecting elevated nitrite plus nitrate as nitrogen (nitrate) concentrations in ground water in Colorado also were developed because nitrate is a contaminant of concern in many areas of Colorado. Maps showing the probability of detecting atrazine and(or) desethyl-atrazine (atrazine/DEA) at or greater than concentrations of 0.1 microgram per liter and nitrate concentrations in ground water greater than 5 milligrams per liter were developed as follows: (1) Ground-water quality data were overlaid with anthropogenic and hydrogeologic data using a geographic information system to produce a data set in which each well had corresponding data on atrazine use, fertilizer use, geology, hydrogeomorphic regions, land cover, precipitation, soils, and well construction. These data then were downloaded to a statistical software package for analysis by logistic regression. (2) Relations were observed between ground-water quality and the percentage of land-cover categories within circular regions (buffers) around wells. Several buffer sizes were evaluated; the buffer size that provided the strongest relation was selected for use in the logistic regression models. 
(3) Relations between concentrations of atrazine/DEA and nitrate in ground water and atrazine use, fertilizer use, geology, hydrogeomorphic regions, land cover, precipitation, soils, and well-construction data were evaluated, and several preliminary multivariate models with various combinations of independent variables were constructed. (4) The multivariate models that best predicted the presence of atrazine/DEA and elevated concentrations of nitrate in ground water were selected. (5) The accuracy of the multivariate models was confirmed by validating the models with an independent set of ground-water quality data. (6) The multivariate models were entered into a geographic information system and the probability maps were constructed.

  2. Multivariate dynamic Tobit models with lagged observed dependent variables: An effectiveness analysis of highway safety laws.

    PubMed

    Dong, Chunjiao; Xie, Kun; Zeng, Jin; Li, Xia

    2018-04-01

    Highway safety laws aim to influence driver behaviors so as to reduce the frequency and severity of crashes, and their outcomes. A given highway safety law may have different effects on crashes of different severities. Understanding such effects can help policy makers upgrade current laws and hence improve traffic safety. To investigate the effects of highway safety laws on crashes across severities, multivariate models are needed to account for the interdependency issues in crash counts across severities. Based on the characteristics of the dependent variables, multivariate dynamic Tobit (MVDT) models are proposed to analyze crash counts that are aggregated at the state level. Lagged observed dependent variables are incorporated into the MVDT models to account for potential temporal correlation issues in crash data. The state highway safety law related factors are used as the explanatory variables and socio-demographic and traffic factors are used as the control variables. Three models are developed and compared: an MVDT model with lagged observed dependent variables, an MVDT model with unobserved random variables, and a multivariate static Tobit (MVST) model. The results show that among the investigated models, the MVDT models with lagged observed dependent variables have the best goodness-of-fit. The findings indicate that, compared to the MVST, the MVDT models have better explanatory power and prediction accuracy. The MVDT model with lagged observed variables can better handle the stochasticity and dependency in the temporal evolution of the crash counts and the estimated values from the model are closer to the observed values. The results show that more lives could be saved if law enforcement agencies can make a sustained effort to educate the public about the importance of motorcyclists wearing helmets. 
Motor vehicle crash-related deaths, injuries, and property damages could be reduced if states enact laws for stricter text messaging rules, higher speeding fines, older licensing age, and stronger graduated licensing provisions. Injury and PDO crashes would be significantly reduced with stricter laws prohibiting the use of hand-held communication devices and higher fines for drunk driving. Copyright © 2018 Elsevier Ltd. All rights reserved.
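
    As background for the Tobit family of models named above, the sketch below shows the log-likelihood of a plain univariate Tobit model censored from below at zero, plus a helper that builds the latent mean from a covariate and the previous observed outcome. It is a much-simplified illustration, not the multivariate dynamic model of the paper; the function names, the single covariate and the parameter values are all hypothetical.

```python
import math

def norm_logpdf(z):
    """Log density of the standard normal distribution."""
    return -0.5 * z * z - 0.5 * math.log(2.0 * math.pi)

def norm_cdf(z):
    """Cumulative distribution function of the standard normal distribution."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def tobit_loglik(y, mu, sigma, lower=0.0):
    """Log-likelihood of observations y with latent means mu, censored at `lower`."""
    total = 0.0
    for y_i, mu_i in zip(y, mu):
        if y_i <= lower:
            # Censored observation: probability that the latent value is below the limit.
            total += math.log(norm_cdf((lower - mu_i) / sigma))
        else:
            # Uncensored observation: normal density of the observed value.
            total += norm_logpdf((y_i - mu_i) / sigma) - math.log(sigma)
    return total

def lagged_means(y, x, intercept, beta, rho):
    """Latent means for t = 1..T-1 using the previous observed outcome y[t-1]."""
    return [intercept + beta * x[t] + rho * y[t - 1] for t in range(1, len(y))]

# Hypothetical yearly crash rates for one state and one law indicator.
rates = [2.0, 0.0, 3.0]
law_in_force = [1.0, 1.0, 1.0]
means = lagged_means(rates, law_in_force, intercept=0.0, beta=1.0, rho=0.5)
loglik = tobit_loglik(rates[1:], means, sigma=1.0)
```

    In practice the parameters would be estimated by maximising this log-likelihood (or a Bayesian analogue), and the multivariate version would additionally model correlation across severity levels.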

  3. Multivariate meta-analysis using individual participant data

    PubMed Central

    Riley, R. D.; Price, M. J.; Jackson, D.; Wardle, M.; Gueyffier, F.; Wang, J.; Staessen, J. A.; White, I. R.

    2016-01-01

    When combining results across related studies, a multivariate meta-analysis allows the joint synthesis of correlated effect estimates from multiple outcomes. Joint synthesis can improve efficiency over separate univariate syntheses, may reduce selective outcome reporting biases, and enables joint inferences across the outcomes. A common issue is that within-study correlations needed to fit the multivariate model are unknown from published reports. However, provision of individual participant data (IPD) allows them to be calculated directly. Here, we illustrate how to use IPD to estimate within-study correlations, using a joint linear regression for multiple continuous outcomes and bootstrapping methods for binary, survival and mixed outcomes. In a meta-analysis of 10 hypertension trials, we then show how these methods enable multivariate meta-analysis to address novel clinical questions about continuous, survival and binary outcomes; treatment–covariate interactions; adjusted risk/prognostic factor effects; longitudinal data; prognostic and multiparameter models; and multiple treatment comparisons. Both frequentist and Bayesian approaches are applied, with example software code provided to derive within-study correlations and to fit the models. PMID:26099484
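
    The bootstrap idea for obtaining a within-study correlation from individual participant data can be sketched as follows: resample participants with replacement, recompute both effect estimates on each resample, and take the correlation of the two estimates across resamples. The sketch assumes two continuous outcomes, effects defined as differences in means, and resampling within each trial arm; the data are synthetic, and the paper's own software code may differ in all of these respects.

```python
import math
import random

def pearson_r(a, b):
    """Pearson correlation between two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)

def mean_differences(treated, control):
    """Treatment effect (difference in means) for each of the two outcomes."""
    def column_mean(rows, k):
        return sum(row[k] for row in rows) / len(rows)
    return (column_mean(treated, 0) - column_mean(control, 0),
            column_mean(treated, 1) - column_mean(control, 1))

def bootstrap_within_study_corr(treated, control, n_boot=500, seed=0):
    """Correlation between the two effect estimates across bootstrap resamples."""
    rng = random.Random(seed)
    effects_1, effects_2 = [], []
    for _ in range(n_boot):
        boot_treated = rng.choices(treated, k=len(treated))
        boot_control = rng.choices(control, k=len(control))
        d1, d2 = mean_differences(boot_treated, boot_control)
        effects_1.append(d1)
        effects_2.append(d2)
    return pearson_r(effects_1, effects_2)

def make_arm(generator, n, shift):
    """Synthetic participants as (outcome 1, outcome 2), correlated by construction."""
    rows = []
    for _ in range(n):
        outcome_1 = generator.gauss(shift, 10.0)
        outcome_2 = 0.6 * outcome_1 + generator.gauss(0.0, 4.0)
        rows.append((outcome_1, outcome_2))
    return rows

gen = random.Random(42)
treated_arm = make_arm(gen, 60, -10.0)
control_arm = make_arm(gen, 60, 0.0)
r_hat = bootstrap_within_study_corr(treated_arm, control_arm)
```

    The estimated correlation could then be supplied, together with the effect estimates and their variances, to a multivariate meta-analysis model.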

  4. Lung cancer risk prediction to select smokers for screening CT--a model based on the Italian COSMOS trial.

    PubMed

    Maisonneuve, Patrick; Bagnardi, Vincenzo; Bellomi, Massimo; Spaggiari, Lorenzo; Pelosi, Giuseppe; Rampinelli, Cristiano; Bertolotti, Raffaella; Rotmensz, Nicole; Field, John K; Decensi, Andrea; Veronesi, Giulia

    2011-11-01

    Screening with low-dose helical computed tomography (CT) has been shown to significantly reduce lung cancer mortality but the optimal target population and time interval to subsequent screening are yet to be defined. We developed two models to stratify individual smokers according to risk of developing lung cancer. We first used the number of lung cancers detected at baseline screening CT in the 5,203 asymptomatic participants of the COSMOS trial to recalibrate the Bach model, which we propose using to select smokers for screening. Next, we incorporated lung nodule characteristics and presence of emphysema identified at baseline CT into the Bach model and proposed the resulting multivariable model to predict lung cancer risk in screened smokers after baseline CT. Age and smoking exposure were the main determinants of lung cancer risk. The recalibrated Bach model accurately predicted lung cancers detected during the first year of screening. Presence of nonsolid nodules (RR = 10.1, 95% CI = 5.57-18.5), nodule size more than 8 mm (RR = 9.89, 95% CI = 5.84-16.8), and emphysema (RR = 2.36, 95% CI = 1.59-3.49) at baseline CT were all significant predictors of subsequent lung cancers. Incorporation of these variables into the Bach model increased the predictive value of the multivariable model (c-index = 0.759, internal validation). The recalibrated Bach model seems suitable for selecting the higher risk population for recruitment for large-scale CT screening. The Bach model incorporating CT findings at baseline screening could help define the time interval to subsequent screening in individual participants. Further studies are necessary to validate these models.

  5. Hybrid least squares multivariate spectral analysis methods

    DOEpatents

    Haaland, David M.

    2002-01-01

    A set of hybrid least squares multivariate spectral analysis methods in which spectral shapes of components or effects not present in the original calibration step are added in a following estimation or calibration step to improve the accuracy of the estimation of the amount of the original components in the sampled mixture. The "hybrid" method herein means a combination of an initial classical least squares analysis calibration step with subsequent analysis by an inverse multivariate analysis method. A "spectral shape" herein means normally the spectral shape of a non-calibrated chemical component in the sample mixture but can also mean the spectral shapes of other sources of spectral variation, including temperature drift, shifts between spectrometers, spectrometer drift, etc. The "shape" can be continuous, discontinuous, or even discrete points illustrative of the particular effect.

  6. Multivariate generalized multifactor dimensionality reduction to detect gene-gene interactions

    PubMed Central

    2013-01-01

    Background: One of the greatest current challenges in genome-wide association studies is to detect gene-gene and/or gene-environment interactions for common complex human diseases. Ritchie et al. (2001) proposed the multifactor dimensionality reduction (MDR) method for interaction analysis. MDR is a combinatorial approach to reduce multi-locus genotypes into high-risk and low-risk groups. Although MDR has been widely used for case-control studies with binary phenotypes, several extensions have been proposed. One of these methods, a generalized MDR (GMDR) proposed by Lou et al. (2007), allows adjustment for covariates and application to both dichotomous and continuous phenotypes. GMDR uses the residual score of a generalized linear model of phenotypes to assign either the high-risk or the low-risk group, while MDR uses the ratio of cases to controls. Methods: In this study, we propose multivariate GMDR, an extension of GMDR for multivariate phenotypes. Jointly analysing correlated multivariate phenotypes may have more power to detect susceptible genes and gene-gene interactions. We construct generalized estimating equations (GEE) with multivariate phenotypes to extend generalized linear models. Using the score vectors from GEE we discriminate high-risk from low-risk groups. We applied the multivariate GMDR method to the blood pressure data of the 7,546 subjects from the Korean Association Resource study: systolic blood pressure (SBP) and diastolic blood pressure (DBP). We compare the results of multivariate GMDR for SBP and DBP to the results from separate univariate GMDR for SBP and DBP, respectively. We also applied the multivariate GMDR method to the repeatedly measured hypertension status from 5,466 subjects and compared its result with those of univariate GMDR at each time point. 
Results: Results from the univariate GMDR and multivariate GMDR in the two-locus model with both blood pressure and hypertension phenotypes indicate the best combinations of SNPs whose interaction has a significant association with risk for high blood pressure or hypertension. Although the test balanced accuracy (BA) of multivariate analysis was not always greater than that of univariate analysis, the multivariate BAs were more stable with smaller standard deviations. Conclusions: In this study, we have developed a multivariate GMDR method using a GEE approach. It is useful to use multivariate GMDR with correlated multiple phenotypes of interest. PMID:24565370
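
    The basic MDR step described in the Background (pooling multi-locus genotype combinations into high-risk and low-risk groups by the ratio of cases to controls) can be sketched as below. The sketch covers only original MDR for a binary phenotype, uses the overall case/control ratio as the default threshold (an assumption), and runs on hypothetical genotypes; GMDR and the multivariate extension replace the counts with score statistics and are not shown.

```python
from collections import Counter

def mdr_labels(genotypes, is_case, threshold=None):
    """Label each multi-locus genotype cell as 'high' or 'low' risk.

    genotypes: one tuple of genotype codes per subject, e.g. (0, 1) for two loci.
    is_case:   truthy for cases, falsy for controls.
    threshold: case/control ratio above which a cell is high risk;
               defaults to the overall ratio in the sample (assumed here).
    """
    cases = Counter(g for g, c in zip(genotypes, is_case) if c)
    controls = Counter(g for g, c in zip(genotypes, is_case) if not c)
    if threshold is None:
        threshold = sum(cases.values()) / sum(controls.values())
    labels = {}
    for cell in set(cases) | set(controls):
        # Equivalent to cases / controls >= threshold, without dividing by zero.
        high = cases[cell] >= threshold * controls[cell]
        labels[cell] = "high" if high else "low"
    return labels

# Hypothetical two-locus genotypes for eight subjects (four cases, four controls).
subjects = [(0, 0), (0, 0), (0, 0), (0, 1), (0, 1), (0, 1), (1, 1), (1, 1)]
status = [1, 1, 0, 0, 0, 1, 1, 0]
risk_groups = mdr_labels(subjects, status)
```

    The high/low labels reduce the two-locus genotype to a single binary attribute, whose classification accuracy is then typically assessed by cross-validation.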

  7. Risk burdens of modifiable risk factors incorporating lipoprotein (a) and low serum albumin concentrations for first incident acute myocardial infarction

    PubMed Central

    Yang, Qin; He, Yong-Ming; Cai, Dong-Ping; Yang, Xiang-Jun; Xu, Hai-Feng

    2016-01-01

    Risk burdens of modifiable risk factors incorporating lipoprotein (a) (Lp(a)) and low serum albumin (LSA) concentrations for first incident acute myocardial infarction (AMI) have not been studied previously. A cross-sectional study of 1552 cases and 6125 controls was performed to identify the association of risk factors with first incident AMI and their corresponding population attributable risks (PARs). Modifiable risk factors incorporating LSA and Lp(a) accounted for up to 92% of PAR for first incident AMI. Effects of these risk factors differed between sexes and across age categories. Overall, smoking and LSA were the 2 strongest risk factors, together accounting for 64% of PAR for first incident AMI. After multivariable adjustment, Lp(a) and LSA accounted for 19% and 41%, respectively, and together for more than half (54%) of PAR for first incident AMI. Modifiable risk factors incorporating LSA and Lp(a) accounted for an overwhelmingly large proportion of the risk of first incident AMI, indicating that most first incident AMI is preventable. The knowledge of risk burdens for first incident AMI incorporating Lp(a) and LSA may be beneficial for further reducing first incident AMI from a new angle. PMID:27748452

  8. Power analysis for multivariate and repeated measures designs: a flexible approach using the SPSS MANOVA procedure.

    PubMed

    D'Amico, E J; Neilands, T B; Zambarano, R

    2001-11-01

    Although power analysis is an important component in the planning and implementation of research designs, it is often ignored. Computer programs for performing power analysis are available, but most have limitations, particularly for complex multivariate designs. An SPSS procedure is presented that can be used for calculating power for univariate, multivariate, and repeated measures models with and without time-varying and time-constant covariates. Three examples provide a framework for calculating power via this method: an ANCOVA, a MANOVA, and a repeated measures ANOVA with two or more groups. The benefits and limitations of this procedure are discussed.
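
    As a rough illustration of what a power calculation involves, the sketch below approximates power for the simplest case only: a two-sided comparison of two equal-sized groups using a normal approximation. It is not the SPSS MANOVA procedure described in this record and does not cover multivariate or repeated measures designs; the function name, effect size, and sample size are assumptions chosen for the example.

```python
from statistics import NormalDist


def two_group_power(effect_size_d, n_per_group, alpha=0.05):
    """Approximate power of a two-sided, two-sample z-test.

    effect_size_d is the standardized mean difference (Cohen's d).
    Uses a normal approximation, so it slightly overstates the power
    of the corresponding t-test in small samples.
    """
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha / 2)
    # Expected value of the test statistic under the alternative.
    shift = effect_size_d * (n_per_group / 2) ** 0.5
    upper_tail = 1 - std_normal.cdf(z_crit - shift)
    lower_tail = std_normal.cdf(-z_crit - shift)
    return upper_tail + lower_tail


# Hypothetical planning scenario: medium effect, 64 subjects per group.
power = two_group_power(effect_size_d=0.5, n_per_group=64)
print(f"approximate power: {power:.3f}")
```

    With no true effect (d = 0) the function returns alpha, which is a quick sanity check on the formula.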

  9. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sanchez-Nieto, Beatriz, E-mail: bsanchez@fis.puc.cl; Goset, Karen C.; Caviedes, Ivan

    Purpose: To propose multivariate predictive models for changes in pulmonary function tests (ΔPFTs) with respect to preradiotherapy (pre-RT) values in patients undergoing RT for breast cancer and lymphoma. Methods and Materials: A prospective study was designed to measure ΔPFTs of patients undergoing RT. Sixty-six patients were included. Spirometry, lung capacity (measured by helium dilution), and diffusing capacity of carbon monoxide tests were used to measure lung function. Two lung definitions were considered: paired lung vs. irradiated lung (IL). Correlation analysis of dosimetric parameters (mean lung dose and the percentage of lung volume receiving more than a threshold dose) and ΔPFTs was carried out to find the best dosimetric predictor. Chemotherapy, age, smoking, and the selected dose-volume parameter were considered as single and interaction terms in a multivariate analysis. Stability of results was checked by bootstrapping. Results: Both lung definitions proved to be similar. Modeling was carried out for IL. Acute and late damage showed the highest correlations with volumes irradiated above ≈20 Gy (maximum R² = 0.28) and ≈40 Gy (maximum R² = 0.21), respectively. RT alone induced a minor and transitory restrictive defect (p = 0.013). Doxorubicin-cyclophosphamide-paclitaxel (Taxol), when administered pre-RT, induced a late, large restrictive effect, independent of RT (p = 0.031). Bootstrap values confirmed the results. Conclusions: None of the dose-volume parameters was a perfect predictor of outcome. Thus, different predictor models for ΔPFTs were derived for the IL, which incorporated other nondosimetric parameters mainly through interaction terms. Late ΔPFTs seem to behave more serially than early ones. Large restrictive defects were demonstrated in patients pretreated with doxorubicin-cyclophosphamide-paclitaxel.

  10. Does duration of symptoms affect clinical outcome after hip arthroscopy for labral tears? Analysis of prospectively collected outcomes with minimum 2-year follow-up

    PubMed Central

    Ni, Jake; Hohn, Eric A; Domb, Benjamin G

    2017-01-01

    Limited research exists on the possible association between duration of symptoms and clinical outcomes following hip arthroscopy for labral tears. The purpose of this study was to evaluate whether duration of symptoms affected clinical and patient-reported outcome (PRO) scores following hip arthroscopy for labral tears. From 2008 to 2011, data were collected prospectively on all patients undergoing primary hip arthroscopy for labral tears. Workers’ compensation cases, dysplasia cases and patients with previous ipsilateral hip surgeries were excluded. A total of 738 patients were identified with a minimum of 2-year follow-up, and clinical and PRO data were available for 680 patients. Uni- and multivariate analyses were performed to determine the relationship between duration of symptoms along with other variables and PROs. Overall, patients experienced significant improvements in all clinical and PRO scores. Results of univariate analysis revealed that all PROs were negatively associated with increasing Log10 months of symptoms as were pain and satisfaction scores. During multivariate analyses, increasing Log10 months of symptoms, age, body mass index and trauma were all negatively associated with PROs (P < 0.05). Our study demonstrates that clinical and PRO scores were negatively associated with increasing duration of symptoms prior to hip arthroscopy for treatment of labral tears. Although this implies that delay in treatment may adversely affect outcome, conservative treatment remains the gold standard first line of treatment. Surgeons should incorporate this information into their treatment algorithm to maximize patient outcomes following treatment for labral tears. Level of evidence: Level IV, prospective case series. PMID:29250339

  11. Thyroid V50 Highly Predictive of Hypothyroidism in Head-and-Neck Cancer Patients Treated With Intensity-modulated Radiotherapy (IMRT).

    PubMed

    Sachdev, Sean; Refaat, Tamer; Bacchus, Ian D; Sathiaseelan, Vythialinga; Mittal, Bharat B

    2017-08-01

    Radiation-induced hypothyroidism affects a significant number of patients with head-and-neck squamous cell cancer (HNSCC). We examined detailed dosimetric and clinical parameters to better determine the risk of hypothyroidism in euthyroid HNSCC patients treated with intensity-modulated radiation therapy (IMRT). From 2006 to 2010, 75 clinically euthyroid patients with HNSCC were treated with sequential IMRT. The cohort included 59 men and 16 women with a median age of 55 years (range, 30 to 89 y) who were treated to a median dose of 70 Gy (range, 60 to 75 Gy) with concurrent chemotherapy in nearly all (95%) cases. Detailed thyroid dosimetric parameters including maximum dose, mean dose, and other parameters (eg, V50, the percent volume receiving at least 50 Gy) were obtained. Freedom from hypothyroidism was evaluated using the Kaplan-Meier method. Univariate and multivariate analyses were conducted using Cox regression. After a median follow-up period of 50 months, 25 patients (33%) became hypothyroid. On univariate analysis, thyroid V50 was highly correlated with developing hypothyroidism (P=0.035). Other dosimetric parameters including mean thyroid dose (P=0.11) and maximum thyroid dose (P=0.39) did not reach statistical significance. On multivariate analysis incorporating patient, tumor, and treatment variables, V50 remained highly statistically significant (P=0.037). Regardless of other factors, for V50>60%, the odds ratio of developing hypothyroidism was 6.76 (P=0.002). In HNSCC patients treated with IMRT, thyroid V50 highly predicts the risk of developing hypothyroidism. V50>60% puts patients at a significantly higher risk of becoming hypothyroid. This can be a useful dose constraint to consider during treatment planning.
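
    To make the V50 definition concrete, the sketch below computes the percent of a structure's volume receiving at least a threshold dose from a list of per-voxel doses. It assumes every voxel represents the same volume; the function name and the dose values are hypothetical and are not data from the study.

```python
def v_dose_percent(voxel_doses_gy, threshold_gy=50.0):
    """Percent of structure volume receiving at least threshold_gy.

    Assumes equal-volume voxels (a simplification made for this sketch),
    so the volume fraction equals the fraction of voxels at or above
    the threshold.
    """
    if not voxel_doses_gy:
        raise ValueError("need at least one voxel dose")
    voxels_at_or_above = sum(1 for dose in voxel_doses_gy if dose >= threshold_gy)
    return 100.0 * voxels_at_or_above / len(voxel_doses_gy)


# Hypothetical thyroid voxel doses in Gy (illustrative only).
thyroid_doses_gy = [12.0, 35.5, 48.9, 50.0, 52.3, 61.0, 66.4, 70.0]
v50 = v_dose_percent(thyroid_doses_gy)
print(f"V50 = {v50:.1f}%")
# Compare against the V50 > 60% threshold reported in the abstract.
print("exceeds 60% threshold:", v50 > 60.0)
```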

  12. Early plasma monocyte chemoattractant protein 1 predicts the development of sepsis in trauma patients: A prospective observational study.

    PubMed

    Wang, Yuchang; Liu, Qinxin; Liu, Tao; Zheng, Qiang; Xu, Xi'e; Liu, Xinghua; Gao, Wei; Li, Zhanfei; Bai, Xiangjun

    2018-04-01

    Monocyte chemoattractant protein 1 (MCP-1) is an initiating cytokine of the inflammatory cascade. Extracellular MCP-1 exhibits pro-inflammatory characteristics and plays a central pathogenic role in critical illness. The purpose of the study was to identify the association between plasma MCP-1 levels and the development of sepsis after severe trauma. The plasma levels of MCP-1 in severe trauma patients were measured by a quantitative enzyme-linked immunosorbent assay, and the dynamic release patterns were recorded at three time points during the seven days post-trauma. The related factors of prognosis were compared between sepsis and non-sepsis groups and analyzed using multivariate logistic regression analysis. We also used receiver operating characteristic (ROC) curves to assess the value of different variables in predicting sepsis. A total of 72 patients who met criteria indicative of severe trauma (72.22% male; mean age, 49.40 ± 14.29 years) were enrolled. Plasma MCP-1 concentrations increased significantly on post-trauma day 1, and this increase was significantly correlated with the Injury Severity Score (ISS) and interleukin-6 (IL-6). Multivariate logistic regression analysis showed that early MCP-1, ISS, and IL-6 were independent risk factors for sepsis in severe trauma patients. Incorporation of early MCP-1 into the ISS can increase the discriminative performance for predicting development of sepsis. Early plasma MCP-1 concentrations can be used to assess the severity of trauma and are correlated with the development of sepsis after severe trauma. The addition of early MCP-1 levels to the ISS significantly improves its ability to predict development of sepsis.
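
    The discriminative performance summarized by a ROC curve is usually reported as the area under the curve (AUC). The sketch below computes the AUC through its rank interpretation: the probability that a randomly chosen positive case has a higher score than a randomly chosen negative case, with ties counted as one half. The marker values and outcome labels are invented for illustration and are not data from the study.

```python
def roc_auc(scores, labels):
    """AUC via pairwise comparison of positive and negative cases.

    labels use 1 for the event (e.g. sepsis) and 0 for no event.
    Ties between a positive and a negative score count as 0.5.
    """
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for pos_score in positives:
        for neg_score in negatives:
            if pos_score > neg_score:
                wins += 1.0
            elif pos_score == neg_score:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical day-1 marker concentrations and sepsis outcomes.
marker_values = [310.0, 95.0, 240.0, 120.0, 180.0, 60.0]
developed_sepsis = [1, 0, 1, 0, 0, 1]
auc = roc_auc(marker_values, developed_sepsis)
print(f"AUC = {auc:.3f}")
```

    The pairwise form is quadratic in the number of cases, which is acceptable for cohorts of this size; a rank-based formula would be the usual choice for large data sets.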

  13. Pilot study of the Mini Nutritional Assessment on predicting outcomes in older adults with type 2 diabetes.

    PubMed

    Liu, Gong-Xiang; Chen, Yan; Yang, Ying-Xue; Yang, Kun; Liang, Jin; Wang, Shuang; Gan, Hua-Tian

    2017-12-01

    To date, few studies have focused on the nutritional status of elderly hospitalized patients with diabetes. Our aims were to explore the prevalence of malnutrition among elderly diabetes patients admitted to the hospital, and to explore the relationships between malnutrition and geriatric syndromes, diabetic complications, and clinical outcomes. A prospective, observational study including diabetes patients aged ≥65 years was carried out in a central hospital in Western China. Nutritional status was assessed using the Mini Nutritional Assessment incorporated into a comprehensive geriatric assessment. Follow up was carried out for ≤2.8 years. Of 302 participants, the prevalence of malnutrition, risk of malnutrition, and normal nutrition was 18.5%, 33.1% and 48.3%, respectively. In multivariate analysis, incontinence (odds ratio [OR] 3.17, 95% confidence interval [CI] 1.08-9.36), diabetic microvascular complications (OR 2.22, 95% CI 1.06-4.61) and activities of daily living (ADL) dependence (OR 11.6, 95% CI 5.10-26.5) were independently associated with malnutrition. Malnourished patients had longer hospital stays (P = 0.003) and higher mortality rates (P < 0.001) than patients either at risk of malnutrition or with a normal nutritional status. Multivariate analysis also showed that malnutrition was independently associated with an increased risk of death (OR 2.86, 95% CI 1.30-6.28). The present study showed a high prevalence of malnutrition among elderly diabetes patients hospitalized for geriatric care. Considering the negative impact of malnutrition on hospital stay and mortality, adequate nutritional care should be emphasized for each elderly patient with diabetes, regardless of body mass index. Geriatr Gerontol Int 2017; 17: 2485-2492. © 2017 Japan Geriatrics Society.

  14. Concomitant apical suspensory procedures in women with anterior vaginal wall prolapse in the United States in 2011.

    PubMed

    Northington, Gina M; Hudson, Catherine O; Karp, Deborah R; Huber, Sarah A

    2016-04-01

    Although the surgical restoration of apical support has been shown to decrease reoperation rates, it is unclear whether this has been incorporated into current practice. The aims of this study were to determine the rate of concomitant apical suspensory procedures in women with anterior vaginal wall prolapse undergoing surgical repair in 2011 and to identify associated factors. This cross-sectional study queried the Nationwide Inpatient Sample for women with a primary diagnosis of cystocele who underwent prolapse repair in 2011. The study cohort was analyzed for demographics, concomitant procedures, and hospital characteristics. The rate of apical suspensory procedures was determined. Factors potentially associated with receiving concomitant apical suspensory procedure were evaluated using univariate analysis and multivariate logistic regression. A total of 2,900 women in the database had a primary diagnosis of cystocele and underwent surgical prolapse repair in 2011. Of these, 925 subjects (31.9%) underwent a concomitant apical suspensory procedure. The mean age in the study cohort was 61.9 ± 12.8 years. Hysterectomies were performed in 11.1% of subjects; of these, 61.1% were performed vaginally, 26.5% laparoscopically, and 12.5% abdominally. On multivariate analysis, age greater than 50 years, Caucasian race, concomitant hysterectomy, and an urban teaching hospital setting were independently associated with receiving concomitant apical suspensory procedure in 2011. Despite evidence that the restoration of apical support is important for optimal anterior support, the overall rate of concomitant apical suspensory procedures is low. Several factors may play a role in whether or not women receive an apical suspensory procedure. This study highlights opportunities to improve the quality of surgical care provided to women with anterior vaginal prolapse.

  15. Evaluation of CROES Nephrolithometry Nomogram as a Preoperative Predictive System for Percutaneous Nephrolithotomy Outcomes.

    PubMed

    Kumar, Sumit; Sreenivas, Jayaram; Karthikeyan, Vilvapathy Senguttuvan; Mallya, Ashwin; Keshavamurthy, Ramaiah

    2016-10-01

    Scoring systems have been devised to predict outcomes of percutaneous nephrolithotomy (PCNL). CROES nephrolithometry nomogram (CNN) is the latest tool devised to predict stone-free rate (SFR). We aim to compare predictive accuracy of CNN against Guy stone score (GSS) for SFR and postoperative outcomes. Between January 2013 and December 2015, 313 patients undergoing PCNL were analyzed for predictive accuracy of GSS, CNN, and stone burden (SB) for SFR, complications, operation time (OT), and length of hospitalization (LOH). We further stratified patients into risk groups based on CNN and GSS. Mean ± standard deviation (SD) SB was 298.8 ± 235.75 mm². SB, GSS, and CNN (area under curve [AUC]: 0.662, 0.660, 0.673) were found to be predictors of SFR. However, predictability for complications was not as good (AUC: SB 0.583, GSS 0.554, CNN 0.580). Single implicated calix (Adj. OR 3.644; p = 0.027), absence of staghorn calculus (Adj. OR 3.091; p = 0.044), single stone (Adj. OR 3.855; p = 0.002), and single puncture (Adj. OR 2.309; p = 0.048) significantly predicted SFR on multivariate analysis. Charlson comorbidity index (CCI; p = 0.020) and staghorn calculus (p = 0.002) were independent predictors for complications on linear regression. SB and GSS independently predicted OT on multivariate analysis. SB and complications significantly predicted LOH, while GSS and CNN did not predict LOH. CNN offered better risk stratification for residual stones than GSS. CNN and GSS have good preoperative predictive accuracy for SFR. Number of implicated calices may affect SFR, and CCI affects complications. Studies should incorporate these factors in scoring systems and assess if predictability of PCNL outcomes improves.

  16. Household food insecurity is associated with abdominal but not general obesity among Iranian children.

    PubMed

    Jafari, Fateme; Ehsani, Simin; Nadjarzadeh, Azadeh; Esmaillzadeh, Ahmad; Noori-Shadkam, Mahmood; Salehi-Abargouei, Amin

    2017-04-21

    Childhood obesity is increasing all over the world. Food insecurity is mentioned as a possible risk factor; however, previous studies have led to inconsistent results in different societies, while data are lacking for the Middle East. We aimed to investigate the relationship between food insecurity and general or abdominal obesity in Iranian children in a cross-sectional study. Anthropometric data including height, weight, and waist circumference were measured by trained nutritionists. General and abdominal obesity were defined based on World Health Organization (WHO) and Iranian reference curves for age and gender, respectively. The Radimer/Cornell food security questionnaire was completed by parents. Data about the physical activity of participants, family socio-economic status, parental obesity and the perinatal period were also gathered using self-administered questionnaires. Logistic regression was used to investigate the association between food insecurity and obesity in crude and multivariable-adjusted models. A total of 587 children aged 9.30 ± 1.49 years had complete data for analysis. Food insecurity at household level was significantly associated with abdominal obesity (odds ratio (OR) = 1.54; confidence interval (CI): 1.01-2.34, p < 0.05) and the relationship remained significant after adjusting for all potential confounding variables (OR = 2.02; CI: 1.01-4.03, p < 0.05). Food insecurity was not associated with general obesity in either the crude or the multivariable-adjusted models. Even slight levels of food insecurity might increase the likelihood of abdominal obesity in Iranian children, and macroeconomic policies to improve food security are necessary. Large-scale prospective studies, particularly in the Middle East, are highly recommended to confirm our results.
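
    For readers unfamiliar with how a crude odds ratio and its confidence interval are obtained, the sketch below computes both from a 2x2 table using the standard log-odds (Woolf) approximation. It illustrates the crude analysis only; adjusted odds ratios such as those reported above come from a fitted logistic regression model, which this sketch does not attempt. The counts are invented and are not the study's data.

```python
import math


def odds_ratio_ci(exposed_cases, exposed_noncases,
                  unexposed_cases, unexposed_noncases, z=1.96):
    """Crude odds ratio with a Wald confidence interval on the log scale.

    z = 1.96 gives an approximate 95% interval. All four cell counts
    must be positive for the approximation to be defined.
    """
    cells = (exposed_cases, exposed_noncases, unexposed_cases, unexposed_noncases)
    if min(cells) <= 0:
        raise ValueError("all cell counts must be positive")
    odds_ratio = (exposed_cases * unexposed_noncases) / (
        exposed_noncases * unexposed_cases)
    # Standard error of log(OR) is the root of the summed reciprocal counts.
    se_log_or = math.sqrt(sum(1.0 / count for count in cells))
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts: food-insecure vs food-secure, obese vs not obese.
crude_or, ci_lower, ci_upper = odds_ratio_ci(40, 60, 30, 70)
print(f"OR = {crude_or:.2f} (95% CI {ci_lower:.2f}-{ci_upper:.2f})")
```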

  17. Trends in Pulmonary Hypertension Over a Period of 30 Years: Experience From a Single Referral Centre.

    PubMed

    Quezada Loaiza, Carlos Andrés; Velázquez Martín, María Teresa; Jiménez López-Guarch, Carmen; Ruiz Cano, María José; Navas Tejedor, Paula; Carreira, Patricia Esmeralda; Flox Camacho, Ángela; de Pablo Gafas, Alicia; Delgado Jiménez, Juan Francisco; Gómez Sánchez, Miguel Ángel; Escribano Subías, Pilar

    2017-11-01

    Pulmonary arterial hypertension (PAH) is characterized by increased pulmonary vascular resistance, right ventricular dysfunction and death. Despite scientific advances, it is still associated with high morbidity and mortality. The aim was to describe the clinical approach and determine the prognostic factors of patients with PAH treated in a national reference center over 30 years. Three hundred and seventy-nine consecutive patients with PAH (January 1984 to December 2014) were studied. Patients were divided into 3 time periods: before 2004, 2004-2009 and 2010-2014. Prognostic factors for clinical deterioration were analyzed by multivariate analysis. Median age was 44 years (68.6% women), functional class III-IV: 72%. An increase was observed in more complex etiologies in the last period: pulmonary veno-occlusive disease and portopulmonary hypertension. Upfront combination therapy significantly increased (5% before 2004 vs 27% after 2010; P < .05). Multivariate analysis showed that age, sex, etiology and combined clinical variables were independent predictors of clinical deterioration (P < .05). Survival free from death or transplantation at 1, 3 and 5 years was 92.2%, 80.6% and 68.5%, respectively. The median survival was 9 years (95% confidence interval, 7.532-11.959). Conclusions: PAH is a heterogeneous and complex disease; the median survival free from death or transplantation in our series is 9 years after diagnosis. A multidisciplinary PAH unit must adapt quickly to changes that occur over time, incorporating new diagnostic and therapeutic techniques. Copyright © 2017 Sociedad Española de Cardiología. Published by Elsevier España, S.L.U. All rights reserved.

  18. Multi-Sample Cluster Analysis Using Akaike’s Information Criterion.

    DTIC Science & Technology

    1982-12-20

    of Likelihood Criteria for Different Hypotheses," in P. R. Krishnaiah (Ed.), Multivariate Analysis-II, New York: Academic Press. [5] Fisher, R. A...Methods of Simultaneous Inference in MANOVA," in P. R. Krishnaiah (Ed.), Multivariate Analysis-II, New York: Academic Press. [8] Kendall, M. G. (1966...1982), Applied Multivariate Statistical Analysis, Englewood Cliffs: Prentice-Hall, Inc. [10] Krishnaiah, P. R. (1969), "Simultaneous Test

  19. Docking and multivariate methods to explore HIV-1 drug-resistance: a comparative analysis

    NASA Astrophysics Data System (ADS)

    Almerico, Anna Maria; Tutone, Marco; Lauria, Antonino

    2008-05-01

    In this paper we describe a comparative analysis between multivariate and docking methods in the study of the drug resistance to the reverse transcriptase and the protease inhibitors. In our early papers we developed a simple but efficient method to evaluate the features of compounds that are less likely to trigger resistance or are effective against mutant HIV strains, using the multivariate statistical procedures PCA and DA. In the attempt to create a more solid background for the prediction of susceptibility or resistance, we carried out a comparative analysis between our previous multivariate approach and molecular docking study. The intent of this paper is not only to find further support to the results obtained by the combined use of PCA and DA, but also to evidence the structural features, in terms of molecular descriptors, similarity, and energetic contributions, derived from docking, which can account for the arising of drug-resistance against mutant strains.

  20. SUGGESTIONS FOR OPTIMIZED PLANNING OF MULTIVARIATE MONITORING OF ATMOSPHERIC POLLUTION

    EPA Science Inventory

    Recent work in factor analysis of multivariate data sets has shown that variables with little signal should not be included in the factor analysis. Work also shows that rotational ambiguity is reduced if sources impacting a receptor have both large and small contributions. Thes...

  1. Multivariate Meta-Analysis Using Individual Participant Data

    ERIC Educational Resources Information Center

    Riley, R. D.; Price, M. J.; Jackson, D.; Wardle, M.; Gueyffier, F.; Wang, J.; Staessen, J. A.; White, I. R.

    2015-01-01

    When combining results across related studies, a multivariate meta-analysis allows the joint synthesis of correlated effect estimates from multiple outcomes. Joint synthesis can improve efficiency over separate univariate syntheses, may reduce selective outcome reporting biases, and enables joint inferences across the outcomes. A common issue is…

  2. Bayesian inference for multivariate meta-analysis Box-Cox transformation models for individual patient data with applications to evaluation of cholesterol lowering drugs

    PubMed Central

    Kim, Sungduk; Chen, Ming-Hui; Ibrahim, Joseph G.; Shah, Arvind K.; Lin, Jianxin

    2013-01-01

    In this paper, we propose a class of Box-Cox transformation regression models with multidimensional random effects for analyzing multivariate responses for individual patient data (IPD) in meta-analysis. Our modeling formulation uses a multivariate normal response meta-analysis model with multivariate random effects, in which each response is allowed to have its own Box-Cox transformation. Prior distributions are specified for the Box-Cox transformation parameters as well as the regression coefficients in this complex model, and the Deviance Information Criterion (DIC) is used to select the best transformation model. Since the model is quite complex, a novel Monte Carlo Markov chain (MCMC) sampling scheme is developed to sample from the joint posterior of the parameters. This model is motivated by a very rich dataset comprising 26 clinical trials involving cholesterol lowering drugs where the goal is to jointly model the three dimensional response consisting of Low Density Lipoprotein Cholesterol (LDL-C), High Density Lipoprotein Cholesterol (HDL-C), and Triglycerides (TG) (LDL-C, HDL-C, TG). Since the joint distribution of (LDL-C, HDL-C, TG) is not multivariate normal and in fact quite skewed, a Box-Cox transformation is needed to achieve normality. In the clinical literature, these three variables are usually analyzed univariately: however, a multivariate approach would be more appropriate since these variables are correlated with each other. A detailed analysis of these data is carried out using the proposed methodology. PMID:23580436

  3. Bayesian inference for multivariate meta-analysis Box-Cox transformation models for individual patient data with applications to evaluation of cholesterol-lowering drugs.

    PubMed

    Kim, Sungduk; Chen, Ming-Hui; Ibrahim, Joseph G; Shah, Arvind K; Lin, Jianxin

    2013-10-15

    In this paper, we propose a class of Box-Cox transformation regression models with multidimensional random effects for analyzing multivariate responses for individual patient data in meta-analysis. Our modeling formulation uses a multivariate normal response meta-analysis model with multivariate random effects, in which each response is allowed to have its own Box-Cox transformation. Prior distributions are specified for the Box-Cox transformation parameters as well as the regression coefficients in this complex model, and the deviance information criterion is used to select the best transformation model. Because the model is quite complex, we develop a novel Monte Carlo Markov chain sampling scheme to sample from the joint posterior of the parameters. This model is motivated by a very rich dataset comprising 26 clinical trials involving cholesterol-lowering drugs where the goal is to jointly model the three-dimensional response consisting of low density lipoprotein cholesterol (LDL-C), high density lipoprotein cholesterol (HDL-C), and triglycerides (TG) (LDL-C, HDL-C, TG). Because the joint distribution of (LDL-C, HDL-C, TG) is not multivariate normal and in fact quite skewed, a Box-Cox transformation is needed to achieve normality. In the clinical literature, these three variables are usually analyzed univariately; however, a multivariate approach would be more appropriate because these variables are correlated with each other. We carry out a detailed analysis of these data by using the proposed methodology. Copyright © 2013 John Wiley & Sons, Ltd.
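
    The Box-Cox transformation referred to in the two records above has a simple closed form, sketched below with a separate transformation parameter per response, as the abstract describes. The sketch only applies the transformation for given parameter values; it does not estimate them, and it is not the authors' Bayesian sampling scheme. The lipid values and lambda values are hypothetical.

```python
import math


def box_cox(value, lam):
    """Box-Cox transform of a positive value.

    (value**lam - 1) / lam for lam != 0, and log(value) for lam == 0,
    which is the limit of the first expression as lam approaches 0.
    """
    if value <= 0:
        raise ValueError("Box-Cox requires a strictly positive value")
    if lam == 0:
        return math.log(value)
    return (value ** lam - 1.0) / lam


def transform_responses(responses, lambdas):
    """Apply a response-specific lambda to each component of a response vector."""
    if len(responses) != len(lambdas):
        raise ValueError("one lambda is required per response")
    return [box_cox(y, lam) for y, lam in zip(responses, lambdas)]


# Hypothetical (LDL-C, HDL-C, TG) values and per-response lambdas.
lipids = (130.0, 45.0, 210.0)
response_lambdas = (0.5, 1.0, 0.0)
transformed = transform_responses(lipids, response_lambdas)
print([round(t, 3) for t in transformed])
```

    Setting lambda to 1 only shifts the response by one unit, and lambda equal to 0 gives the log transform often used for right-skewed measurements such as triglycerides.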

  4. Directionality compensation for linear multivariable anti-windup synthesis

    NASA Astrophysics Data System (ADS)

    Adegbege, Ambrose A.; Heath, William P.

    2015-11-01

    We develop new synthesis procedures for optimising anti-windup control applicable to open-loop exponentially stable multivariable plants subject to hard bounds on the inputs. The optimising anti-windup control falls into a class of compensator commonly termed directionality compensation. The computation of the control involves the online solution of a low-order quadratic programme in place of simple saturation. We exploit the structure of the quadratic programme to incorporate directionality information into the offline anti-windup synthesis using a decoupled architecture similar to that proposed in the literature for anti-windup schemes with simple saturation. We demonstrate the effectiveness of the design compared to several schemes using a simulated example. Preliminary results of this work have been published in the proceedings of the IEEE Conference on Decision and Control, Orlando, 2011 (Adegbege & Heath, 2011a).

  5. Stochastic modelling of temperatures affecting the in situ performance of a solar-assisted heat pump: The multivariate approach and physical interpretation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Loveday, D.L.; Craggs, C.

    Box-Jenkins-based multivariate stochastic modeling is carried out using data recorded from a domestic heating system. The system comprises an air-source heat pump sited in the roof space of a house, solar assistance being provided by the conventional tile roof acting as a radiation absorber. Multivariate models are presented which illustrate the time-dependent relationships between three air temperatures - at external ambient, at entry to, and at exit from, the heat pump evaporator. Using a deterministic modeling approach, physical interpretations are placed on the results of the multivariate technique. It is concluded that the multivariate Box-Jenkins approach is a suitable technique for building thermal analysis. Application to multivariate model-based control is discussed, with particular reference to building energy management systems. It is further concluded that stochastic modeling of data drawn from a short monitoring period offers a means of retrofitting an advanced model-based control system in existing buildings, which could be used to optimize energy savings. An approach to system simulation is suggested.

  6. Recent applications of multivariate data analysis methods in the authentication of rice and the most analyzed parameters: A review.

    PubMed

    Maione, Camila; Barbosa, Rommel Melgaço

    2018-01-24

    Rice is one of the most important staple foods around the world. Authentication of rice is one of the most addressed concerns in the present literature, which includes recognition of its geographical origin and variety, certification of organic rice and many other issues. Good results have been achieved by multivariate data analysis and data mining techniques when combined with specific parameters for ascertaining authenticity and many other useful characteristics of rice, such as quality and yield. This paper presents a review of recent research on discrimination and authentication of rice using multivariate data analysis and data mining techniques. We found that data obtained from image processing, molecular and atomic spectroscopy, elemental fingerprinting, genetic markers, molecular content and others are promising sources of information regarding geographical origin, variety and other aspects of rice, and are widely used in combination with multivariate data analysis techniques. Principal component analysis and linear discriminant analysis are the preferred methods, but several other data classification techniques, such as support vector machines and artificial neural networks, also appear frequently and show high performance for discrimination of rice.

  7. Hybrid Arrays for Chemical Sensing

    NASA Astrophysics Data System (ADS)

    Kramer, Kirsten E.; Rose-Pehrsson, Susan L.; Johnson, Kevin J.; Minor, Christian P.

    In recent years, multisensory approaches to environment monitoring for chemical detection as well as other forms of situational awareness have become increasingly popular. A hybrid sensor is a multimodal system that incorporates several sensing elements and thus produces data that are multivariate in nature and may be significantly increased in complexity compared to data provided by single-sensor systems. Though a hybrid sensor is itself an array, hybrid sensors are often organized into more complex sensing systems through an assortment of network topologies. Part of the reason for the shift to hybrid sensors is due to advancements in sensor technology and computational power available for processing larger amounts of data. There is also ample evidence to support the claim that a multivariate analytical approach is generally superior to univariate measurements because it provides additional redundant and complementary information (Hall, D. L.; Linas, J., Eds., Handbook of Multisensor Data Fusion, CRC, Boca Raton, FL, 2001). However, the benefits of a multisensory approach are not automatically achieved. Interpretation of data from hybrid arrays of sensors requires the analyst to develop an application-specific methodology to optimally fuse the disparate sources of data generated by the hybrid array into useful information characterizing the sample or environment being observed. Consequently, multivariate data analysis techniques such as those employed in the field of chemometrics have become more important in analyzing sensor array data. Depending on the nature of the acquired data, a number of chemometric algorithms may prove useful in the analysis and interpretation of data from hybrid sensor arrays. It is important to note, however, that the challenges posed by the analysis of hybrid sensor array data are not unique to the field of chemical sensing. 
    Applications in electrical and process engineering, remote sensing, medicine, and of course, artificial intelligence and robotics, all share the same essential data fusion challenges. The design of a hybrid sensor array should draw on this extended body of knowledge. In this chapter, various techniques for data preprocessing, feature extraction, feature selection, and modeling of sensor data will be introduced and illustrated with data fusion approaches that have been implemented in applications involving data from hybrid arrays. The example systems discussed in this chapter involve the development of prototype sensor networks for damage control event detection aboard US Navy vessels and the development of analysis algorithms to combine multiple sensing techniques for enhanced remote detection of unexploded ordnance (UXO) in both ground surveys and wide area assessments.

  8. Identification of Reliable Components in Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS): a Data-Driven Approach across Metabolic Processes.

    PubMed

    Motegi, Hiromi; Tsuboi, Yuuri; Saga, Ayako; Kagami, Tomoko; Inoue, Maki; Toki, Hideaki; Minowa, Osamu; Noda, Tetsuo; Kikuchi, Jun

    2015-11-04

    There is an increasing need to use multivariate statistical methods for understanding biological functions, identifying the mechanisms of diseases, and exploring biomarkers. In addition to classical analyses such as hierarchical cluster analysis, principal component analysis, and partial least squares discriminant analysis, various multivariate strategies, including independent component analysis, non-negative matrix factorization, and multivariate curve resolution, have recently been proposed. However, determining the number of components is problematic. Despite the proposal of several different methods, no satisfactory approach has yet been reported. To resolve this problem, we implemented a new idea: classifying a component as "reliable" or "unreliable" based on the reproducibility of its appearance, regardless of the number of components in the calculation. Using the clustering method for classification, we applied this idea to multivariate curve resolution-alternating least squares (MCR-ALS). Comparisons between conventional and modified methods applied to proton nuclear magnetic resonance (¹H-NMR) spectral datasets derived from known standard mixtures and biological mixtures (urine and feces of mice) revealed that more plausible results are obtained by the modified method. In particular, clusters containing little information were detected with reliability. This strategy, named "cluster-aided MCR-ALS," will facilitate the attainment of more reliable results in the metabolomics datasets.

  9. Nutritional Intervention: A Secondary Analysis of Its Effect on Malnourished Colombian Pre-Schoolers.

    ERIC Educational Resources Information Center

    Bejar, Isaac I.

    1981-01-01

    Effects of nutritional supplementation on physical development of malnourished children were analyzed by univariate and multivariate methods for the analysis of repeated measures. Results showed that the nutritional treatment was successful, but it was necessary to resort to the multivariate approach. (Author/GK)

  10. A Multivariate Descriptive Model of Motivation for Orthodontic Treatment.

    ERIC Educational Resources Information Center

    Hackett, Paul M. W.; And Others

    1993-01-01

    Motivation for receiving orthodontic treatment was studied among 109 young adults, and a multivariate model of the process is proposed. The combination of smallest scale analysis and Partial Order Scalogram Analysis by base Coordinates (POSAC) illustrates an interesting methodology for health treatment studies and explores motivation for dental…

  11. Exploring Pattern of Socialisation Conditions and Human Development by Nonlinear Multivariate Analysis.

    ERIC Educational Resources Information Center

    Grundmann, Matthias

    Following the assumptions of ecological socialization research, adequate analysis of socialization conditions must take into account the multilevel and multivariate structure of social factors that impact on human development. This statement implies that complex models of family configurations or of socialization factors are needed to explain the…

  12. Univariate Analysis of Multivariate Outcomes in Educational Psychology.

    ERIC Educational Resources Information Center

    Hubble, L. M.

    1984-01-01

    The author examined the prevalence of multiple operational definitions of outcome constructs and an estimate of the incidence of Type I error rates when univariate procedures were applied to multiple variables in educational psychology. Multiple operational definitions of constructs were advocated and wider use of multivariate analysis was…
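
    As a rough illustration of why applying univariate procedures to several outcome variables inflates the Type I error rate, the sketch below computes the familywise error rate under the simplifying assumption (ours, not the abstract's) that the k tests are independent, in which case the rate is 1 - (1 - alpha)^k. The function name is hypothetical.

```python
def familywise_error_rate(alpha: float, k: int) -> float:
    """Chance of at least one false positive across k tests.

    Assumes the k tests are independent and each is run at level alpha.
    Correlated outcomes, which are common in practice, give a different
    (usually smaller) rate, so treat this as a reference value only.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    if k < 0:
        raise ValueError("k must be non-negative")
    return 1.0 - (1.0 - alpha) ** k


if __name__ == "__main__":
    # Print the rate for a few illustrative numbers of outcome variables.
    for n_outcomes in (1, 3, 5, 10):
        rate = familywise_error_rate(0.05, n_outcomes)
        print(f"{n_outcomes} outcomes: {rate:.4f}")
```

    Under these assumptions the rate grows quickly with the number of outcome variables, which is one common argument for a single multivariate test.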

  13. Applied Statistics: From Bivariate through Multivariate Techniques [with CD-ROM]

    ERIC Educational Resources Information Center

    Warner, Rebecca M.

    2007-01-01

    This book provides a clear introduction to widely used topics in bivariate and multivariate statistics, including multiple regression, discriminant analysis, MANOVA, factor analysis, and binary logistic regression. The approach is applied and does not require formal mathematics; equations are accompanied by verbal explanations. Students are asked…

  14. Evaluation of Meteorite Amino Acid Analysis Data Using Multivariate Techniques

    NASA Technical Reports Server (NTRS)

    McDonald, G.; Storrie-Lombardi, M.; Nealson, K.

    1999-01-01

    The amino acid distributions in the Murchison carbonaceous chondrite, Mars meteorite ALH84001, and ice from the Allan Hills region of Antarctica are shown, using a multivariate technique known as Principal Component Analysis (PCA), to be statistically distinct from the average amino acid composition of 101 terrestrial protein superfamilies.

  15. MULTIVARIATE ANALYSIS ON LEVELS OF SELECTED METALS, PARTICULATE MATTER, VOC, AND HOUSEHOLD CHARACTERISTICS AND ACTIVITIES FROM THE MIDWESTERN STATES NHEXAS

    EPA Science Inventory

    Microenvironmental and biological/personal monitoring information were collected during the National Human Exposure Assessment Survey (NHEXAS), conducted in the six states comprising U.S. EPA Region Five. They have been analyzed by multivariate analysis techniques with general ...

  16. Multivariate meta-analysis: a robust approach based on the theory of U-statistic.

    PubMed

    Ma, Yan; Mazumdar, Madhu

    2011-10-30

    Meta-analysis is the methodology for combining findings from similar research studies asking the same question. When the question of interest involves multiple outcomes, multivariate meta-analysis is used to synthesize the outcomes simultaneously taking into account the correlation between the outcomes. Likelihood-based approaches, in particular restricted maximum likelihood (REML) method, are commonly utilized in this context. REML assumes a multivariate normal distribution for the random-effects model. This assumption is difficult to verify, especially for meta-analysis with small number of component studies. The use of REML also requires iterative estimation between parameters, needing moderately high computation time, especially when the dimension of outcomes is large. A multivariate method of moments (MMM) is available and is shown to perform equally well to REML. However, there is a lack of information on the performance of these two methods when the true data distribution is far from normality. In this paper, we propose a new nonparametric and non-iterative method for multivariate meta-analysis on the basis of the theory of U-statistic and compare the properties of these three procedures under both normal and skewed data through simulation studies. It is shown that the effect on estimates from REML because of non-normal data distribution is marginal and that the estimates from MMM and U-statistic-based approaches are very similar. Therefore, we conclude that for performing multivariate meta-analysis, the U-statistic estimation procedure is a viable alternative to REML and MMM. Easy implementation of all three methods is illustrated by their application to data from two published meta-analyses from the fields of hip fracture and periodontal disease. We discuss ideas for future research based on U-statistic for testing significance of between-study heterogeneity and for extending the work to meta-regression setting. Copyright © 2011 John Wiley & Sons, Ltd.

  17. Characterizing multivariate decoding models based on correlated EEG spectral features.

    PubMed

    McFarland, Dennis J

    2013-07-01

    Multivariate decoding methods are popular techniques for analysis of neurophysiological data. The present study explored potential interpretative problems with these techniques when predictors are correlated. Data from sensorimotor rhythm-based cursor control experiments was analyzed offline with linear univariate and multivariate models. Features were derived from autoregressive (AR) spectral analysis of varying model order which produced predictors that varied in their degree of correlation (i.e., multicollinearity). The use of multivariate regression models resulted in much better prediction of target position as compared to univariate regression models. However, with lower order AR features interpretation of the spectral patterns of the weights was difficult. This is likely to be due to the high degree of multicollinearity present with lower order AR features. Care should be exercised when interpreting the pattern of weights of multivariate models with correlated predictors. Comparison with univariate statistics is advisable. While multivariate decoding algorithms are very useful for prediction their utility for interpretation may be limited when predictors are correlated. Copyright © 2013 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.
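
    The interpretive caution about correlated predictors can be sketched with a toy example. The data, function names, and two-predictor setup below are invented for illustration and are not taken from the study: two nearly identical predictors each show a positive univariate slope with the target, yet the joint least-squares fit assigns one of them a negative weight.

```python
def _centered(values):
    """Return the values with their mean subtracted."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]


def univariate_slope(x, y):
    """Least-squares slope of y on a single predictor x (with intercept)."""
    xc, yc = _centered(x), _centered(y)
    return sum(a * b for a, b in zip(xc, yc)) / sum(a * a for a in xc)


def bivariate_weights(x1, x2, y):
    """Least-squares weights of y on x1 and x2 jointly (with intercept).

    Solves the 2x2 normal equations by Cramer's rule.
    """
    a, b, c = _centered(x1), _centered(x2), _centered(y)
    s11 = sum(u * u for u in a)
    s22 = sum(v * v for v in b)
    s12 = sum(u * v for u, v in zip(a, b))
    s1y = sum(u * w for u, w in zip(a, c))
    s2y = sum(v * w for v, w in zip(b, c))
    det = s11 * s22 - s12 * s12
    if abs(det) < 1e-12:
        raise ValueError("predictors are perfectly collinear")
    return (s1y * s22 - s12 * s2y) / det, (s11 * s2y - s12 * s1y) / det


# Toy data (hypothetical): x2 is x1 plus a small alternating offset.
x1 = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
x2 = [v + d for v, d in zip(x1, [0.1, -0.1, 0.1, -0.1, 0.1, -0.1])]
y = [2.0 * p - 1.0 * q for p, q in zip(x1, x2)]

if __name__ == "__main__":
    print("univariate slopes:", univariate_slope(x1, y), univariate_slope(x2, y))
    print("joint weights:", bivariate_weights(x1, x2, y))
```

    In this construction the sign of the second weight depends on whether the other predictor is in the model, which is why comparing weights against univariate statistics is a reasonable check.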

  18. Cardiomyocyte hypertrophy, oncosis, and autophagic vacuolization predict mortality in idiopathic dilated cardiomyopathy with advanced heart failure.

    PubMed

    Vigliano, Carlos A; Cabeza Meckert, Patricia M; Diez, Mirta; Favaloro, Liliana E; Cortés, Claudia; Fazzi, Lucía; Favaloro, Roberto R; Laguens, Rubén P

    2011-04-05

    The aim of this study was to identify the remodeling parameters cardiomyocyte (CM) damage or death, hypertrophy, and fibrosis that may be linked to outcomes in patients with advanced heart failure (HF) in an effort to understand the pathogenic mechanisms of HF that may support newer therapeutic modalities. There are controversial results on the influence of fibrosis, CM hypertrophy, and apoptosis on outcomes in patients with HF; other modalities of cell damage have been poorly investigated. In endomyocardial biopsy specimens from 100 patients with idiopathic dilated cardiomyopathy and advanced HF, CM diameter and the extent of fibrosis were determined by morphometry. The proportion of CMs with evidence of apoptosis, autophagic vacuolization (AuV), and oncosis was investigated by immunohistochemical methods and by terminal deoxynucleotidyl transferase-mediated deoxyuridine triphosphate nick end labeling. Those parameters were correlated with mortality in 3 years of follow-up by univariate analysis and with multivariate models incorporating the clinical variables more relevant to the prediction of outcomes. CM AuV occurred in 28 patients (0.013 ± 0.012%) and oncosis in 41 (0.109 ± 0.139%). Nineteen patients showed both markers. Apoptotic CM nuclei were observed in 3 patients. In univariate analysis, CM diameter and AuV, either alone or associated with oncosis, were predictors of mortality. In multivariate analysis, CM diameter (hazard ratio: 1.37; 95% confidence interval: 1.12 to 1.68; p = 0.002) and simultaneous presence in the same endomyocardial biopsy specimen of AuV and oncosis (hazard ratio: 2.82; 95% confidence interval: 1.12 to 7.13; p = 0.028) were independent predictors of mortality. CM hypertrophy and AuV, especially in association with oncosis, are predictors of outcome in patients with idiopathic dilated cardiomyopathy and severe HF. Copyright © 2011 American College of Cardiology Foundation. Published by Elsevier Inc. All rights reserved.

  19. Serum Wisteria floribunda agglutinin-positive Mac-2-binding protein evaluates liver function and predicts prognosis in liver cirrhosis.

    PubMed

    Xu, Wen Ping; Wang, Ze Rui; Zou, Xia; Zhao, Chen; Wang, Rui; Shi, Pei Mei; Yuan, Zong Li; Yang, Fang; Zeng, Xin; Wang, Pei Qin; Sultan, Sakhawat; Zhang, Yan; Xie, Wei Fen

    2018-04-01

    Wisteria floribunda agglutinin-positive Mac-2-binding protein (WFA+-M2BP) is a novel glycobiomarker for evaluating liver fibrosis, but less is known about its role in liver cirrhosis (LC). This study aimed to investigate the utility of WFA+-M2BP in evaluating liver function and predicting prognosis of cirrhotic patients. We retrospectively included 197 patients with LC between 2013 and 2016. Serum WFA+-M2BP and various biochemical parameters were measured in all patients. With a median follow-up of 23 months, liver-related complications and deaths of 160 patients were recorded. The accuracy of WFA+-M2BP in evaluating liver function, predicting decompensation and mortality were measured by the receiver operating characteristic (ROC) curve, logistic and Cox's regression analyses, respectively. WFA+-M2BP levels increased with elevated Child-Pugh classification, especially in patients with hepatitis B virus (HBV) infection. ROC analysis confirmed the high reliability of WFA+-M2BP for the assessment of liver function using Child-Pugh classification. WFA+-M2BP was also significantly positively correlated with the model for end-stage liver disease (MELD) score. Multivariate logistic regression analysis indicated WFA+-M2BP as an independent predictor of clinical decompensation for compensated patients (odds ratio 11.958, 95% confidence interval [CI] 1.876-76.226, P = 0.009), and multivariate Cox's regression analysis verified WFA+-M2BP as an independent risk factor for liver-related death in patients with HBV infection (hazard ratio 10.596, 95% CI 1.356-82.820, P = 0.024). Serum WFA+-M2BP is a reliable predictor of liver function and prognosis in LC and could be incorporated into clinical surveillance strategies for LC patients, especially those with HBV infection. © 2018 Chinese Medical Association Shanghai Branch, Chinese Society of Gastroenterology, Renji Hospital Affiliated to Shanghai Jiaotong University School of Medicine and John Wiley & Sons Australia, Ltd.

  20. Exploring the epidemiology of carbapenem-resistant Gram-negative bacteria in west London and the utility of routinely collected hospital microbiology data.

    PubMed

    Freeman, R; Moore, L S P; Charlett, A; Donaldson, H; Holmes, A H

    2015-04-01

    The objective of this study was to identify carbapenem-resistant organisms using routinely collected local microbiology data and describe the epidemiology of carbapenem resistance in two London teaching hospitals. Data on inpatients infected or colonized with Gram-negative organisms between March 2009 and February 2012 were extracted. A computer algorithm was developed incorporating internationally recognized criteria to distinguish carbapenem-resistant organisms. Multivariable analysis was conducted to identify factors associated with infection or colonization with carbapenem-resistant organisms. Binomial regression was performed to detect changes in resistance trends over time. Yearly incidence of carbapenem resistance was observed to be increasing, with significant increasing trends in Acinetobacter baumannii (47.1% in 2009-10 to 77.2% in 2011-12; P<0.001) and Enterobacter spp. (2.2% in 2009-10 to 11.5% in 2011-12; P<0.001). Single-variable and multivariable analysis demonstrated differences in the proportion of carbapenem-resistant isolates across all variables investigated, including age, sex and clinical specialty; in the latter organism-specific niches were identified. Patients in the youngest age group (16-24 years old) had the highest odds of being infected or colonized with carbapenem-resistant isolates of Escherichia coli, Klebsiella spp. or Pseudomonas aeruginosa. Furthermore, proportions of carbapenem-resistant organisms differed between the hospitals. Carbapenem resistance is an emerging problem within the UK inpatient healthcare setting. This is not an issue confined to the Enterobacteriaceae and fine-resolution surveillance is needed to identify at-risk groups. Regular analysis of routinely collected data can provide insight into the evolving carbapenem-resistance threat, with the ability to inform efforts to prevent the spread of resistance. © The Author 2014. 
    Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  1. Urban Typologies: Towards an ORNL Urban Information System (UrbIS)

    NASA Astrophysics Data System (ADS)

    KC, B.; King, A. W.; Sorokine, A.; Crow, M. C.; Devarakonda, R.; Hilbert, N. L.; Karthik, R.; Patlolla, D.; Surendran Nair, S.

    2016-12-01

    Urban environments differ in a large number of key attributes; these include infrastructure, morphology, demography, and economic and social variables, among others. These attributes determine many urban properties such as energy and water consumption, greenhouse gas emissions, air quality, public health, sustainability, and vulnerability and resilience to climate change. Characterization of urban environments by a single property such as population size does not sufficiently capture this complexity. In addressing this multivariate complexity one typically faces such problems as disparate and scattered data, challenges of big data management, spatial searching, insufficient computational capacity for data-driven analysis and modelling, and the lack of tools to quickly visualize the data and compare the analytical results across different cities and regions. We have begun the development of an Urban Information System (UrbIS) to address these issues, one that embraces the multivariate "big data" of urban areas and their environments across the United States utilizing the Big Data as a Service (BDaaS) concept. With technological roots in High-performance Computing (HPC), BDaaS is based on the idea of outsourcing computations to different computing paradigms, scalable to super-computers. UrbIS aims to incorporate federated metadata search, integrated modeling and analysis, and geovisualization into a single seamless workflow. The system includes web-based 2D/3D visualization with an iGlobe interface, fast cloud-based and server-side data processing and analysis, and a metadata search engine based on the Mercury data search system developed at Oak Ridge National Laboratory (ORNL). Results of analyses will be made available through web services. We are implementing UrbIS in ORNL's Compute and Data Environment for Science (CADES) and are leveraging ORNL experience in complex data and geospatial projects. 
    The development of UrbIS is being guided by an investigation of urban heat islands (UHI) using high-dimensional clustering and statistics to define urban typologies (types of cities) in an investigation of how UHI vary with urban type across the United States.

  2. Levofloxacin versus azithromycin for treating legionella pneumonia: a propensity score analysis.

    PubMed

    Garcia-Vidal, C; Sanchez-Rodriguez, I; Simonetti, A F; Burgos, J; Viasus, D; Martin, M T; Falco, V; Carratalà, J

    2017-09-01

    Concerns have arisen regarding the equivalence of levofloxacin and some macrolides for treating community-acquired legionella pneumonia (LP). We aimed to compare the outcomes of current patients with LP treated with levofloxacin, azithromycin and clarithromycin. Observational retrospective multicentre study of consecutive patients with LP requiring hospitalization (2000-2014) conducted in two hospitals. The primary outcome assessed was 30-day mortality. To control for confounding, therapy was assessed by multivariate analysis. We documented 446 patients with LP, of which 175 were treated with levofloxacin, 177 with azithromycin and 58 with clarithromycin. No significant differences in time to defervescence (2 (interquartile range (IQR) 1-4) versus 2 (IQR 1-3) days; p 0.453), time to achieve clinical stability (3 (2-5) versus 3 (2-5) days; p 0.486), length of intravenous therapy (3 (2-5.25) versus 4 (3-6) days; p 0.058) and length of hospital stay (7 (5-10) versus 6 (5-9) days; p 0.088) were found between patients treated with levofloxacin and those treated with azithromycin. Patients treated with clarithromycin had longer intravenous antibiotic treatment (3 (2-5.25) versus 5 (3-6.25) days; p 0.002) and longer hospital stay (7 (5-10) versus 9 (7-14) days; p 0.043) compared with those treated with levofloxacin. The overall mortality was 4.3% (19 patients). Neither univariate nor multivariate analysis showed a significant association of levofloxacin versus azithromycin on mortality (4 (2.3%) versus 9 (5.1%) deaths; p 0.164). The results did not change after incorporation of the propensity score into the models. In our study, no significant differences in most outcomes were found between patients treated with levofloxacin and those treated with azithromycin. Due to the small number of deaths, results regarding mortality should be interpreted with caution. Copyright © 2017 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. 
    All rights reserved.

  3. Time Series Model Identification by Estimating Information.

    DTIC Science & Technology

    1982-11-01

    principle, Applications of Statistics, P. R. Krishnaiah, ed., North-Holland: Amsterdam, 27-41. Anderson, T. W. (1971). The Statistical Analysis of Time Series...E. (1969). Multiple Time Series Modeling, Multivariate Analysis II, edited by P. Krishnaiah, Academic Press: New York, 389-409. Parzen, E. (1981...Newton, H. J. (1980). Multiple Time Series Modeling, II Multivariate Analysis - V, edited by P. Krishnaiah, North Holland: Amsterdam, 181-197. Shibata, R

  4. Genomic Analysis of Complex Microbial Communities in Wounds

    DTIC Science & Technology

    2012-01-01

    thoroughly in the ecology literature. Permutation Multivariate Analysis of Variance (PerMANOVA). We used PerMANOVA to test the null-hypothesis of no...difference between the bacterial communities found within a single wound compared to those from different patients (α = 0.05). PerMANOVA is a...permutation-based version of the multivariate analysis of variance (MANOVA). PerMANOVA uses the distances between samples to partition variance and
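
    The excerpt above is truncated, so the following is only a minimal sketch of the general PerMANOVA idea it describes (partitioning variance from pairwise distances and assessing it by permutation), not the report's own procedure. It uses the commonly described pseudo-F statistic computed from a distance matrix; the function names and the toy one-dimensional data are hypothetical.

```python
import random
from itertools import combinations


def pseudo_f(dist, labels):
    """Pseudo-F statistic from a square distance matrix and group labels."""
    n = len(labels)
    groups = sorted(set(labels))
    # Total sum of squares: squared distances over all pairs, divided by n.
    ss_total = sum(dist[i][j] ** 2 for i, j in combinations(range(n), 2)) / n
    # Within-group sum of squares: the same quantity inside each group.
    ss_within = 0.0
    for g in groups:
        idx = [i for i, lab in enumerate(labels) if lab == g]
        ss_within += sum(dist[i][j] ** 2 for i, j in combinations(idx, 2)) / len(idx)
    ss_among = ss_total - ss_within
    return (ss_among / (len(groups) - 1)) / (ss_within / (n - len(groups)))


def permanova(dist, labels, n_perm=999, seed=0):
    """Return (observed pseudo-F, permutation p-value)."""
    rng = random.Random(seed)
    observed = pseudo_f(dist, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # break any link between samples and groups
        if pseudo_f(dist, shuffled) >= observed:
            hits += 1
    # Count the observed arrangement itself so the p-value is never zero.
    return observed, (hits + 1) / (n_perm + 1)


# Toy data (hypothetical): six samples on a line, two well-separated groups.
points = [0.0, 1.0, 2.0, 10.0, 11.0, 12.0]
labels = ["a", "a", "a", "b", "b", "b"]
dist = [[abs(p - q) for q in points] for p in points]

if __name__ == "__main__":
    f_stat, p_value = permanova(dist, labels)
    print("pseudo-F:", f_stat, "p-value:", p_value)
```

    With only six samples the number of distinct relabelings is small, so the attainable p-values are coarse; real community data would use many more samples and an ecological distance measure.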

  5. In situ X-ray diffraction analysis of (CFx)n batteries: signal extraction by multivariate analysis

    DOE PAGES

    Rodriguez, Mark A.; Keenan, Michael R.; Nagasubramanian, Ganesan

    2007-11-10

    In this study, the (CFx)n cathode reaction during discharge has been investigated using in situ X-ray diffraction (XRD). Mathematical treatment of the in situ XRD data set was performed using multivariate curve resolution with alternating least squares (MCR–ALS), a technique of multivariate analysis. MCR–ALS analysis successfully separated the relatively weak XRD signal intensity due to the chemical reaction from the other inert cell component signals. The resulting dynamic reaction component revealed the loss of (CFx)n cathode signal together with the simultaneous appearance of LiF by-product intensity. Careful examination of the XRD data set revealed an additional dynamic component which may be associated with the formation of an intermediate compound during the discharge process.

  6. Hybrid least squares multivariate spectral analysis methods

    DOEpatents

    Haaland, David M.

    2004-03-23

    A set of hybrid least squares multivariate spectral analysis methods in which spectral shapes of components or effects not present in the original calibration step are added in a following prediction or calibration step to improve the accuracy of the estimation of the amount of the original components in the sampled mixture. The hybrid method herein means a combination of an initial calibration step with subsequent analysis by an inverse multivariate analysis method. A spectral shape herein means normally the spectral shape of a non-calibrated chemical component in the sample mixture but can also mean the spectral shapes of other sources of spectral variation, including temperature drift, shifts between spectrometers, spectrometer drift, etc. The shape can be continuous, discontinuous, or even discrete points illustrative of the particular effect.

  7. Computerized design of controllers using data models

    NASA Technical Reports Server (NTRS)

    Irwin, Dennis; Mitchell, Jerrel; Medina, Enrique; Allwine, Dan; Frazier, Garth; Duncan, Mark

    1995-01-01

    The major contributions of the grant effort have been the enhancement of the Compensator Improvement Program (CIP), which resulted in the Ohio University CIP (OUCIP) package, and the development of the Model and Data-Oriented Computer Aided Design System (MADCADS). Incorporation of direct z-domain designs into CIP was tested and determined to be numerically ill-conditioned for the type of lightly damped problems for which the development was intended. Therefore, it was decided to pursue the development of z-plane designs in the w-plane, and to make this conversion transparent to the user. The analytical development needed for this feature, as well as that needed for including compensator damping ratios and DC gain specifications, closed loop stability requirements, and closed loop disturbance rejection specifications into OUCIP are all contained in Section 3. OUCIP was successfully tested with several example systems to verify proper operation of existing and new features. The extension of the CIP philosophy and algorithmic approach to handle modern multivariable controller design criteria was implemented and tested. Several new algorithms for implementing the search approach to modern multivariable control system design were developed and tested. This analytical development, most of which was incorporated into the MADCADS software package, is described in Section 4, which also includes results of the application of MADCADS to the MSFC ACES facility and the Hubble Space Telescope.

  8. Multivariate geomorphic analysis of forest streams: Implications for assessment of land use impacts on channel condition

    Treesearch

    Richard D. Wood-Smith; John M. Buffington

    1996-01-01

    Multivariate statistical analyses of geomorphic variables from 23 forest stream reaches in southeast Alaska result in successful discrimination between pristine streams and those disturbed by land management, specifically timber harvesting and associated road building. Results of discriminant function analysis indicate that a three-variable model discriminates 10...

  9. Modeling Associations among Multivariate Longitudinal Categorical Variables in Survey Data: A Semiparametric Bayesian Approach

    ERIC Educational Resources Information Center

    Tchumtchoua, Sylvie; Dey, Dipak K.

    2012-01-01

    This paper proposes a semiparametric Bayesian framework for the analysis of associations among multivariate longitudinal categorical variables in high-dimensional data settings. This type of data is frequent, especially in the social and behavioral sciences. A semiparametric hierarchical factor analysis model is developed in which the…

  10. Use of Multivariate Linkage Analysis for Dissection of a Complex Cognitive Trait

    PubMed Central

    Marlow, Angela J.; Fisher, Simon E.; Francks, Clyde; MacPhie, I. Laurence; Cherny, Stacey S.; Richardson, Alex J.; Talcott, Joel B.; Stein, John F.; Monaco, Anthony P.; Cardon, Lon R.

    2003-01-01

    Replication of linkage results for complex traits has been exceedingly difficult, owing in part to the inability to measure the precise underlying phenotype, small sample sizes, genetic heterogeneity, and statistical methods employed in analysis. Often, in any particular study, multiple correlated traits have been collected, yet these have been analyzed independently or, at most, in bivariate analyses. Theoretical arguments suggest that full multivariate analysis of all available traits should offer more power to detect linkage; however, this has not yet been evaluated on a genomewide scale. Here, we conduct multivariate genomewide analyses of quantitative-trait loci that influence reading- and language-related measures in families affected with developmental dyslexia. The results of these analyses are substantially clearer than those of previous univariate analyses of the same data set, helping to resolve a number of key issues. These outcomes highlight the relevance of multivariate analysis for complex disorders for dissection of linkage results in correlated traits. The approach employed here may aid positional cloning of susceptibility genes in a wide spectrum of complex traits. PMID:12587094

  11. The association between body mass index and severe biliary infections: a multivariate analysis.

    PubMed

    Stewart, Lygia; Griffiss, J McLeod; Jarvis, Gary A; Way, Lawrence W

    2012-11-01

    Obesity has been associated with worse infectious disease outcomes. It is a risk factor for cholesterol gallstones, but little is known about associations between body mass index (BMI) and biliary infections. We studied this using factors associated with biliary infections. A total of 427 patients with gallstones were studied. Gallstones, bile, and blood (as applicable) were cultured. Illness severity was classified as follows: none (no infection or inflammation), systemic inflammatory response syndrome (fever, leukocytosis), severe (abscess, cholangitis, empyema), or multi-organ dysfunction syndrome (bacteremia, hypotension, organ failure). Associations between BMI and biliary bacteria, bacteremia, gallstone type, and illness severity were examined using bivariate and multivariate analysis. BMI inversely correlated with pigment stones, biliary bacteria, bacteremia, and increased illness severity on bivariate and multivariate analysis. Obesity correlated with less severe biliary infections. BMI inversely correlated with pigment stones and biliary bacteria; multivariate analysis showed an independent correlation between lower BMI and illness severity. Most patients with severe biliary infections had a normal BMI, suggesting that obesity may be protective in biliary infections. This study examined the correlation between BMI and biliary infection severity. Published by Elsevier Inc.

  12. Multivariate meta-analysis using individual participant data.

    PubMed

    Riley, R D; Price, M J; Jackson, D; Wardle, M; Gueyffier, F; Wang, J; Staessen, J A; White, I R

    2015-06-01

    When combining results across related studies, a multivariate meta-analysis allows the joint synthesis of correlated effect estimates from multiple outcomes. Joint synthesis can improve efficiency over separate univariate syntheses, may reduce selective outcome reporting biases, and enables joint inferences across the outcomes. A common issue is that within-study correlations needed to fit the multivariate model are unknown from published reports. However, provision of individual participant data (IPD) allows them to be calculated directly. Here, we illustrate how to use IPD to estimate within-study correlations, using a joint linear regression for multiple continuous outcomes and bootstrapping methods for binary, survival and mixed outcomes. In a meta-analysis of 10 hypertension trials, we then show how these methods enable multivariate meta-analysis to address novel clinical questions about continuous, survival and binary outcomes; treatment-covariate interactions; adjusted risk/prognostic factor effects; longitudinal data; prognostic and multiparameter models; and multiple treatment comparisons. Both frequentist and Bayesian approaches are applied, with example software code provided to derive within-study correlations and to fit the models. © 2014 The Authors. Research Synthesis Methods published by John Wiley & Sons, Ltd.
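
    The idea of deriving a within-study correlation from individual participant data by bootstrapping can be sketched as follows. This is not the software code the abstract refers to: it is a simplified illustration on simulated continuous outcomes, with hypothetical function names, in which participants are resampled within each arm, both treatment effects are re-estimated on every resample, and the two sets of estimates are then correlated.

```python
import math
import random


def mean_difference(rows, outcome_index):
    """Treated-minus-control mean for one outcome; rows are (arm, y1, y2)."""
    treated = [r[outcome_index] for r in rows if r[0] == 1]
    control = [r[outcome_index] for r in rows if r[0] == 0]
    return sum(treated) / len(treated) - sum(control) / len(control)


def pearson(a, b):
    """Pearson correlation between two equal-length sequences."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


def bootstrap_within_study_correlation(rows, n_boot=500, seed=0):
    """Correlation between the two effect estimates across bootstrap samples."""
    rng = random.Random(seed)
    treated = [r for r in rows if r[0] == 1]
    control = [r for r in rows if r[0] == 0]
    effects_1, effects_2 = [], []
    for _ in range(n_boot):
        # Resample within each arm so both arms are always represented.
        sample = rng.choices(treated, k=len(treated)) + rng.choices(control, k=len(control))
        effects_1.append(mean_difference(sample, 1))
        effects_2.append(mean_difference(sample, 2))
    return pearson(effects_1, effects_2)


def simulate_trial(n=40, seed=1):
    """Simulated trial (hypothetical): two strongly related outcomes."""
    rng = random.Random(seed)
    rows = []
    for i in range(n):
        arm = i % 2
        y1 = rng.gauss(0.0, 1.0) + 0.5 * arm
        y2 = y1 + rng.gauss(0.0, 0.1)
        rows.append((arm, y1, y2))
    return rows


if __name__ == "__main__":
    print(bootstrap_within_study_correlation(simulate_trial()))
```

    Resampling whole participants keeps each person's outcomes together, which is what lets the bootstrap capture the dependence between the two effect estimates.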

  13. Multivariate Analysis As a Support for Diagnostic Flowcharts in Allergic Bronchopulmonary Aspergillosis: A Proof-of-Concept Study.

    PubMed

    Vitte, Joana; Ranque, Stéphane; Carsin, Ania; Gomez, Carine; Romain, Thomas; Cassagne, Carole; Gouitaa, Marion; Baravalle-Einaudi, Mélisande; Bel, Nathalie Stremler-Le; Reynaud-Gaubert, Martine; Dubus, Jean-Christophe; Mège, Jean-Louis; Gaudart, Jean

    2017-01-01

    Molecular-based allergy diagnosis yields multiple biomarker datasets. The classical diagnostic score for allergic bronchopulmonary aspergillosis (ABPA), a severe disease usually occurring in asthmatic patients and people with cystic fibrosis, comprises succinct immunological criteria formulated in 1977: total IgE, anti-Aspergillus fumigatus (Af) IgE, anti-Af "precipitins," and anti-Af IgG. Progress achieved over the last four decades led to multiple IgE and IgG(4) Af biomarkers available with quantitative, standardized, molecular-level reports. These newly available biomarkers have not been included in the current diagnostic criteria, either individually or in algorithms, despite persistent underdiagnosis of ABPA. Large numbers of individual biomarkers may hinder their use in clinical practice. Conversely, multivariate analysis using new tools may offer a better chance of fewer diagnostic mistakes. We report here a proof-of-concept work consisting of a three-step multivariate analysis of Af IgE, IgG, and IgG4 biomarkers through a combination of principal component analysis, hierarchical ascendant classification, and classification and regression tree multivariate analysis. The resulting diagnostic algorithms might show the way for novel criteria and improved diagnostic efficiency in Af-sensitized patients at risk for ABPA.

  14. Multivariate analysis of longitudinal rates of change.

    PubMed

    Bryan, Matthew; Heagerty, Patrick J

    2016-12-10

    Longitudinal data allow direct comparison of the change in patient outcomes associated with treatment or exposure. Frequently, several longitudinal measures are collected that either reflect a common underlying health status, or characterize processes that are influenced in a similar way by covariates such as exposure or demographic characteristics. Statistical methods that can combine multivariate response variables into common measures of covariate effects have been proposed in the literature. Current methods for characterizing the relationship between covariates and the rate of change in multivariate outcomes are limited to select models. For example, 'accelerated time' methods have been developed which assume that covariates rescale time in longitudinal models for disease progression. In this manuscript, we detail an alternative multivariate model formulation that directly structures longitudinal rates of change and that permits a common covariate effect across multiple outcomes. We detail maximum likelihood estimation for a multivariate longitudinal mixed model. We show via asymptotic calculations the potential gain in power that may be achieved with a common analysis of multiple outcomes. We apply the proposed methods to the analysis of a trivariate outcome for infant growth and compare rates of change for HIV infected and uninfected infants. Copyright © 2016 John Wiley & Sons, Ltd.

  15. In Vitro Evaluation of the Inhibitory Activity of Thymoquinone in Combatting Candida albicans in Denture Stomatitis Prevention

    PubMed Central

    Al-Khalifa, Khalifa S.; Gad, Mohammed M.; Al-Hariri, Mohammed; Alnassar, Talal

    2017-01-01

    Candida albicans adhesion and proliferation on denture bases may lead to denture stomatitis, which is a common and recurrent problem in denture wearers. The goal of this study was to assess the inhibitory effect of thymoquinone incorporated in the polymethyl methacrylate denture base material against Candida albicans. Eighty acrylic resin specimens were fabricated and divided into eight groups (n = 10): a control group and seven groups with thymoquinone concentrations of 0.5%, 1%, 1.5%, 2%, 2.5%, 3%, and 5% of acrylic powder. Two methods were used to evaluate the effect of thymoquinone on Candida albicans: the slide count and the serial dilution test. A multivariate analysis of variance (MANOVA) and the post-hoc Tukey’s Honestly Significant Difference (HSD) test were performed to compare the difference of means between the observations taken at various intervals with baseline. Statistical significance was set at p ≤ 0.05. According to the slide count and the serial dilution test, the mean number of adhered Candida albicans in the control group was 5436.9 ± 266 and 4691.4 ± 176.8, respectively; these numbers decreased dramatically to 0 ± 0 and 32.4 ± 1.7 in group 8 (concentration 5%). These results suggest that the incorporation of thymoquinone into the acrylic resin denture base material might be effective in preventing Candida albicans adhesion. PMID:28698449

  16. Erosion Modeling in Central China - Soil Data Acquisition by Conditioned Latin Hypercube Sampling and Incorporation of Legacy Data

    NASA Astrophysics Data System (ADS)

    Stumpf, Felix; Schönbrodt-Stitt, Sarah; Schmidt, Karsten; Behrens, Thorsten; Scholten, Thomas

    2013-04-01

    The Three Gorges Dam at the Yangtze River in Central China is a prominent example of human-induced environmental impacts. Throughout one year the water table at the main river fluctuates by about 30 m due to impoundment and drainage activities. The dynamic water table entails a range of georisks such as soil erosion, mass movements, sediment transport and diffuse matter inputs into the reservoir. Within the framework of the joint Sino-German project YANGTZE GEO, the subproject "Soil Erosion" deals with soil erosion risks and sediment transport pathways into the reservoir. The study site is a small catchment (4.8 km²) in Badong, approximately 100 km upstream of the dam. It is characterized by scattered plots of agricultural land use and resettlements in a largely wooded, steep sloping and mountainous area. Our research is focused on data acquisition and processing to develop a process-oriented erosion model. Area-wide knowledge of specific soil properties in the catchment is an essential input for this model. It will be acquired by means of digital soil mapping (DSM), in which soil properties are estimated from covariates and the estimating functions are calibrated with soil property samples. The DSM approach is based on an appropriate sample design, which reflects the heterogeneity of the catchment with regard to the covariates that influence the relevant soil properties. In this approach the covariates, derived by digital terrain analysis, are slope, altitude, profile curvature, plane curvature, and aspect. For the development of the sample design, we chose the Conditioned Latin Hypercube Sampling (cLHS) procedure (Minasny and McBratney, 2006). It provides an efficient method of sampling variables from their multivariate distribution. A sample of size n is drawn from multiple variables such that for each variable the sample is marginally maximally stratified. The method ensures maximal stratification through two features: first, the number of strata equals the sample size n, and second, the probability of falling in each of the strata is 1/n (McKay et al., 1979). We extended the classical cLHS with extremes approach (Schmidt et al., 2012) by incorporating legacy data from previous field campaigns. Instead of identifying precise sample locations by cLHS, we demarcate the multivariate attribute space of the samples based on the histogram borders of each stratum. This widens the spatial scope of the actual cLHS sample locations and allows the incorporation of legacy data lying within that scope. Furthermore, this approach offers greater flexibility regarding the accessibility of sample sites in the field.
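
    The marginal stratification described above (n strata per variable, each with probability 1/n, one sample per stratum) can be sketched with a plain Latin hypercube on the unit cube. This is an illustrative sketch only: it is not the conditioned variant (cLHS), which selects among observed covariate combinations, and the function name and parameters are assumptions made for the example.

```python
import random


def latin_hypercube(n, k, seed=0):
    """Draw n points in [0, 1)^k with exactly one point per stratum in every dimension."""
    rng = random.Random(seed)
    columns = []
    for _ in range(k):
        # One uniform draw inside each of the n equal-probability strata [i/n, (i+1)/n).
        col = [(i + rng.random()) / n for i in range(n)]
        # Shuffle so that strata are paired at random across variables.
        rng.shuffle(col)
        columns.append(col)
    return [tuple(col[i] for col in columns) for i in range(n)]


points = latin_hypercube(n=10, k=3, seed=42)
for p in points:
    print(tuple(round(v, 3) for v in p))
```

    To sample real covariates, each coordinate would be mapped through the quantile function of the corresponding variable, so that every stratum still carries probability 1/n.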

  17. Comparison of pure laparoscopic versus open left hemihepatectomy by multivariate analysis: a retrospective cohort study.

    PubMed

    Cho, Hwui-Dong; Kim, Ki-Hun; Hwang, Shin; Ahn, Chul-Soo; Moon, Deok-Bog; Ha, Tae-Yong; Song, Gi-Won; Jung, Dong-Hwan; Park, Gil-Chun; Lee, Sung-Gyu

    2018-02-01

    To compare the outcomes of pure laparoscopic left hemihepatectomy (LLH) versus open left hemihepatectomy (OLH) for benign and malignant conditions using multivariate analysis. All consecutive cases of LLH and OLH between October 2007 and December 2013 in a tertiary referral hospital were enrolled in this retrospective cohort study. All surgical procedures were performed by one surgeon. The LLH and OLH groups were compared in terms of patient demographics, preoperative data, clinical perioperative outcomes, and tumor characteristics in patients with malignancy. Multivariate analysis of the prognostic factors associated with severe complications was then performed. The LLH group (n = 62) had a significantly shorter postoperative hospital stay than the OLH group (n = 118) (9.53 ± 3.30 vs 14.88 ± 11.36 days, p < 0.001). Multivariate analysis revealed that the OLH group had >4 times the risk of the LLH group in terms of developing severe complications (Clavien-Dindo grade ≥III) (odds ratio 4.294, 95% confidence intervals 1.165-15.832, p = 0.029). LLH was a safe and feasible procedure for selected patients. LLH required shorter hospital stay and resulted in less operative blood loss. Multivariate analysis revealed that LLH was associated with a lower risk of severe complications compared to OLH. The authors suggest that LLH could be a reasonable treatment option for selected patients.
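
    As a consistency check on the reported odds ratio, the standard error and p-value can be recovered from the confidence limits. The sketch below assumes a Wald-type interval that is symmetric on the log-odds scale, which the abstract does not state; the variable names are illustrative.

```python
import math

# Values reported in the abstract for OLH versus LLH.
odds_ratio, ci_low, ci_high = 4.294, 1.165, 15.832
z_crit = 1.959964  # two-sided 95% quantile of the standard normal

beta = math.log(odds_ratio)  # log odds ratio (regression coefficient)
# Assumption: CI = exp(beta +/- z_crit * se), so the log-width gives se.
se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
z = beta / se
p_value = math.erfc(z / math.sqrt(2))  # two-sided normal p-value

# Under the Wald assumption the interval is centred on beta on the log scale.
midpoint = (math.log(ci_low) + math.log(ci_high)) / 2

print(f"beta={beta:.4f} midpoint={midpoint:.4f} se={se:.4f} z={z:.3f} p={p_value:.4f}")
```

    If the assumption holds, the midpoint of the log limits should nearly coincide with the log odds ratio and the p-value should land close to the reported p = 0.029.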

  18. Univariate and multivariate skewness and kurtosis for measuring nonnormality: Prevalence, influence and estimation.

    PubMed

    Cain, Meghan K; Zhang, Zhiyong; Yuan, Ke-Hai

    2017-10-01

    Nonnormality of univariate data has been extensively examined previously (Blanca et al., Methodology: European Journal of Research Methods for the Behavioral and Social Sciences, 9(2), 78-84, 2013; Micceri, Psychological Bulletin, 105(1), 156, 1989). However, less is known of the potential nonnormality of multivariate data although multivariate analysis is commonly used in psychological and educational research. Using univariate and multivariate skewness and kurtosis as measures of nonnormality, this study examined 1,567 univariate distributions and 254 multivariate distributions collected from authors of articles published in Psychological Science and the American Educational Research Journal. We found that 74 % of univariate distributions and 68 % of multivariate distributions deviated from normal distributions. In a simulation study using typical values of skewness and kurtosis that we collected, we found that the resulting type I error rates were 17 % in a t-test and 30 % in a factor analysis under some conditions. Hence, we argue that it is time to routinely report skewness and kurtosis along with other summary statistics such as means and variances. To facilitate future reporting of skewness and kurtosis, we provide a tutorial on how to compute univariate and multivariate skewness and kurtosis using SAS, SPSS, R and a newly developed Web application.
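
    For readers who want to see the quantities involved, the sketch below computes moment-based univariate skewness and excess kurtosis, and Mardia's multivariate skewness and kurtosis for the bivariate case. It is written in Python as an illustration and is not the tutorial code the abstract refers to (which covers SAS, SPSS, R and a Web application); the function names are assumptions, and other estimators with small-sample corrections exist.

```python
import statistics


def skewness(x):
    """Moment estimator m3 / m2**1.5 (zero for symmetric data)."""
    m = statistics.fmean(x)
    m2 = statistics.fmean([(v - m) ** 2 for v in x])
    m3 = statistics.fmean([(v - m) ** 3 for v in x])
    return m3 / m2 ** 1.5


def excess_kurtosis(x):
    """Moment estimator m4 / m2**2 - 3 (zero for a normal distribution)."""
    m = statistics.fmean(x)
    m2 = statistics.fmean([(v - m) ** 2 for v in x])
    m4 = statistics.fmean([(v - m) ** 4 for v in x])
    return m4 / m2 ** 2 - 3.0


def mardia_bivariate(points):
    """Mardia's multivariate skewness b1 and kurtosis b2 for 2-D data."""
    n = len(points)
    mx = statistics.fmean([p[0] for p in points])
    my = statistics.fmean([p[1] for p in points])
    c = [(p[0] - mx, p[1] - my) for p in points]
    # Covariance matrix with divisor n, as in Mardia's definition.
    sxx = sum(a * a for a, _ in c) / n
    syy = sum(b * b for _, b in c) / n
    sxy = sum(a * b for a, b in c) / n
    det = sxx * syy - sxy ** 2
    ixx, iyy, ixy = syy / det, sxx / det, -sxy / det  # 2x2 inverse

    def d(u, v):
        # Bilinear form u' S^-1 v.
        return u[0] * (ixx * v[0] + ixy * v[1]) + u[1] * (ixy * v[0] + iyy * v[1])

    b1 = sum(d(u, v) ** 3 for u in c for v in c) / n ** 2
    b2 = sum(d(u, u) ** 2 for u in c) / n
    return b1, b2


print(skewness([1, 2, 3, 4, 5]), excess_kurtosis([1, 2, 3, 4, 5]))
print(mardia_bivariate([(1, 1), (1, -1), (-1, 1), (-1, -1)]))
```

    Under multivariate normality with p variables, Mardia's kurtosis is expected to be close to p(p + 2), i.e. 8 in the bivariate case, which gives a reference point for interpreting b2.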

  19. A Statistical Discrimination Experiment for Eurasian Events Using a Twenty-Seven-Station Network

    DTIC Science & Technology

    1980-07-08

    to test the effectiveness of a multivariate method of analysis for distinguishing earthquakes from explosions. The data base for the experiment...the weight assigned to each variable whenever a new one is added. Jennrich, R. I. (1977). Stepwise discriminant analysis, in Statistical Methods for

  20. Is Heart Rate Variability Better Than Routine Vital Signs for Prehospital Identification of Major Hemorrhage

    DTIC Science & Technology

    2015-01-01

    different PRBC transfusion volumes. We performed multivariate regression analysis using HRV metrics and routine vital signs to test the hypothesis that...study sponsors did not have any role in the study design, data collection, analysis and interpretation of data, report writing, or the decision to...primary outcome was hemorrhagic injury plus different PRBC transfusion volumes. We performed multivariate regression analysis using HRV metrics and

  1. Multivariate optimum interpolation of surface pressure and winds over oceans

    NASA Technical Reports Server (NTRS)

    Bloom, S. C.

    1984-01-01

    The observations of surface pressure are quite sparse over oceanic areas. An effort to improve the analysis of surface pressure over oceans through the development of a multivariate surface analysis scheme which makes use of surface pressure and wind data is discussed. Although the present research used ship winds, future versions of this analysis scheme could utilize winds from additional sources, such as satellite scatterometer data.

  2. Nonlinear multivariate and time series analysis by neural network methods

    NASA Astrophysics Data System (ADS)

    Hsieh, William W.

    2004-03-01

    Methods in multivariate statistical analysis are essential for working with large amounts of geophysical data, data from observational arrays, from satellites, or from numerical model output. In classical multivariate statistical analysis, there is a hierarchy of methods, starting with linear regression at the base, followed by principal component analysis (PCA) and finally canonical correlation analysis (CCA). A multivariate time series method, the singular spectrum analysis (SSA), has been a fruitful extension of the PCA technique. The common drawback of these classical methods is that only linear structures can be correctly extracted from the data. Since the late 1980s, neural network methods have become popular for performing nonlinear regression and classification. More recently, neural network methods have been extended to perform nonlinear PCA (NLPCA), nonlinear CCA (NLCCA), and nonlinear SSA (NLSSA). This paper presents a unified view of the NLPCA, NLCCA, and NLSSA techniques and their applications to various data sets of the atmosphere and the ocean (especially for the El Niño-Southern Oscillation and the stratospheric quasi-biennial oscillation). These data sets reveal that the linear methods are often too simplistic to describe real-world systems, with a tendency to scatter a single oscillatory phenomenon into numerous unphysical modes or higher harmonics, which can be largely alleviated in the new nonlinear paradigm.

  3. Spectral Mining for Discriminating Blood Origins in the Presence of Substrate Interference via Attenuated Total Reflection Fourier Transform Infrared Spectroscopy: Postmortem or Antemortem Blood?

    PubMed

    Takamura, Ayari; Watanabe, Ken; Akutsu, Tomoko; Ikegaya, Hiroshi; Ozawa, Takeaki

    2017-09-19

    Often in criminal investigations, discrimination of types of body fluid evidence is crucially important to ascertain how a crime was committed. Compared to current methods using biochemical techniques, vibrational spectroscopic approaches can provide versatile applicability to identify various body fluid types without sample invasion. However, their applicability is limited to pure body fluid samples because important signals from body fluids incorporated in a substrate are affected strongly by interference from substrate signals. Herein, we describe a novel approach to recover body fluid signals that are embedded in strong substrate interferences using attenuated total reflection Fourier transform infrared (ATR FT-IR) spectroscopy and an innovative multivariate spectral processing. This technique supported detection of covert features of body fluid signals, and then identified origins of body fluid stains on substrates. We discriminated between ATR FT-IR spectra of postmortem blood (PB) and those of antemortem blood (AB) by creating a multivariate statistics model. From ATR FT-IR spectra of PB and AB stains on interfering substrates (polyester, cotton, and denim), blood-originated signals were extracted by a weighted linear regression approach we developed originally using principal components of both blood and substrate spectra. The blood-originated signals were finally classified by the discriminant model, demonstrating high discriminant accuracy. The present method can identify body fluid evidence independently of the substrate type, which is expected to promote the application of vibrational spectroscopic techniques in forensic body fluid analysis.

  4. Anger expression, violent behavior, and symptoms of depression among male college students in Ethiopia

    PubMed Central

    Terasaki, Dale J; Gelaye, Bizu; Berhane, Yemane; Williams, Michelle A

    2009-01-01

    Background: Depression is an important global public health problem. Given the scarcity of studies involving African youths, this study was conducted to evaluate the associations of anger expression and violent behavior with symptoms of depression among male college students. Methods: A self-administered questionnaire was used to collect information on socio-demographic and lifestyle characteristics and violent behavior among 1,176 college students in Awassa, Ethiopia in June, 2006. The questionnaire incorporated the Spielberger Anger-Out Expression (SAOE) scale and symptoms of depression were evaluated using the Patient Health Questionnaire (PHQ-9). Multivariable logistic regression procedures were used to calculate adjusted odds ratios (OR) and 95% confidence intervals (95%CI). Results: Symptoms of depression were evident in 23.6% of participants. Some 54.3% of students reported committing at least one act of violence in the current academic year; and 29.3% of students reported high (SAOE score ≥ 15) levels of anger-expression. In multivariate analysis, moderate (OR = 1.97; 95%CI 1.33–2.93) and high (OR = 3.23; 95%CI 2.14–4.88) outward anger were statistically significantly associated with increased risks of depressive symptoms. Violent behavior was noted to be associated with depressive symptoms (OR = 1.82; 95%CI 1.37–2.40). Conclusion: Further research should be conducted to better characterize community and individual level determinants of anger-expression, violent behavior and depression among youths. PMID:19138431

  5. Producer attitudes and practices related to antimicrobial use in beef cattle in Tennessee.

    PubMed

    Green, Alice L; Carpenter, L Rand; Edmisson, Darryl E; Lane, Clyde D; Welborn, Matt G; Hopkins, Fred M; Bemis, David A; Dunn, John R

    2010-12-01

    To evaluate knowledge, attitudes, and management practices involving antimicrobial use among Tennessee beef producers. Mail survey. A population-based, stratified random sample of 3,000 beef producers across the state. Questionnaires were mailed to beef producers. Questions focused on producer practices related to education, biosecurity, veterinary use, and the purchase and use of antimicrobials. Operation types were categorized as either cow-calf only or multiple operation type (MOT). Associations between various factors and antimicrobial use were evaluated by use of multivariable logistic regression, with the outcome variable being any antimicrobial use (injectable or by mouth) in the past year. Of 3,000 questionnaires mailed, 1,042 (34.7%) were returned. A significantly higher proportion of producers with MOTs reported giving antimicrobials by mouth or by injection than did producers with cow-calf only operations. In addition, higher proportions of producers with MOTs than producers with cow-calf only operations reported treating with macrolides, florfenicol, ceftiofur, and aminoglycosides. In the multivariable analysis, herd size>50 cattle, participation in Beef Quality Assurance or master beef producer certification programs, quarantining of newly purchased animals, use of written instructions for treating disease, and observation of withdrawal times were associated with a higher likelihood of antimicrobial use. Results suggested that producers who engaged in more progressive farming practices were also more likely to use antimicrobials. Incorporating training on judicious antimicrobial use into educational programs would likely increase awareness of best management practices regarding antimicrobial use.

  6. Factors associated with school-aged children's body mass index in Korean American families.

    PubMed

    Jang, Myoungock; Grey, Margaret; Sadler, Lois; Jeon, Sangchoon; Nam, Soohyun; Song, Hee-Jung; Whittemore, Robin

    2017-08-01

    To examine factors associated with children's body mass index and obesity-risk behaviours in Korean American families. Limited data are available about family factors related to overweight and obesity in Korean American children. A cross-sectional study. Convenience sampling was employed to recruit Korean American families in the Northeast of the United States between August 2014 and January 2015. Child, family and societal/demographic/community factors were measured with self-report questionnaires completed by mothers and children. Height and weight were measured to calculate body mass index. Data were analyzed using mixed effects models incorporating within-group correlation in siblings. The sample included 170 Korean American children and 137 mothers. In bivariate analyses, more child screen time, number of children in the household, greater parental underestimation of child's weight and children's participation in the school lunch program were significantly associated with higher child body mass index. In multivariate analyses that included variables showing a significant bivariate relationship, no variable was associated with child body mass index. There were no child, family and societal/demographic/community factors related to child body mass index in Korean American families in the multivariate analysis, which is contrary to research in other racial/ethnic groups. In bivariate analyses, there is evidence that some factors were significantly related to child body mass index. Further research is needed to understand the unique behavioural, social and cultural features that contribute to childhood obesity in Korean American families. © 2017 John Wiley & Sons Ltd.

  7. An examination of the relationship of interpersonal influences with walking and biking to work.

    PubMed

    Campbell, Matthew E; Bopp, Melissa

    2013-01-01

    Active commuting (AC) to the workplace is a successful strategy for incorporating more physical activity into daily life and is associated with health benefits. The purpose of this study was to understand the relationship between interpersonal influences and AC. A cross-sectional online survey was delivered to workplaces in the mid-Atlantic region. A volunteer convenience sample of adults (N = 1234) completed questions about demographics, number of times per week actively commuting, spouse and coworker AC patterns, and spousal and coworker normative beliefs for AC. Basic descriptive statistics and frequencies described the sample; bivariate correlations examined the relationship between AC and spouse and coworker variables. A multivariate regression analysis predicted the variance in AC with interpersonal independent variables. The sample was primarily middle-aged, white (92.7%), female (67.9%), and well-educated (83.3% college graduate or higher). Of those surveyed, 20.3% reported AC to work at least once per week by means of walking or biking. The number of times per week of AC for spouse (P < .001) and coworkers (P = .006) and AC norms for spouse (P < .001) and coworker (P < .001) were positively related to AC. The multivariate regression model accounted for 37.9% of the variance in AC (F = 101.83, df = 4, P < .001). This study demonstrates that interpersonal influences are significantly related to actively commuting to work. Future interventions targeting AC should consider these interpersonal influences in addition to individual and environmental influences that have been previously documented.

  8. Analysis and assessment on heavy metal sources in the coastal soils developed from alluvial deposits using multivariate statistical methods.

    PubMed

    Li, Jinling; He, Ming; Han, Wei; Gu, Yifan

    2009-05-30

    An investigation of the sources of heavy metals (Cu, Zn, Ni, Pb, Cr, and Cd) in the coastal soils of Shanghai, China, was conducted using multivariate statistical methods (principal component analysis, clustering analysis, and correlation analysis). The multivariate analyses consistently showed that (i) Cu, Ni, Pb, and Cd had anthropogenic sources (e.g., overuse of chemical fertilizers and pesticides, industrial and municipal discharges, animal wastes, and sewage irrigation); and (ii) Zn and Cr were associated with parent materials and therefore had natural sources (e.g., the weathering of parent materials and subsequent pedogenesis due to the alluvial deposits). The heavy metal content of the soils was greatly affected by soil formation, atmospheric deposition, and human activities. These findings provide essential information on the possible sources of heavy metals, which would contribute to the monitoring and assessment of agricultural soils worldwide.

  9. Application of multivariate statistical techniques for differentiation of ripe banana flour based on the composition of elements.

    PubMed

    Alkarkhi, Abbas F M; Ramli, Saifullah Bin; Easa, Azhar Mat

    2009-01-01

    Major (sodium, potassium, calcium, magnesium) and minor elements (iron, copper, zinc, manganese) and one heavy metal (lead) of Cavendish banana flour and Dream banana flour were determined, and data were analyzed using multivariate statistical techniques of factor analysis and discriminant analysis. Factor analysis yielded four factors explaining more than 81% of the total variance: the first factor explained 28.73%, comprising magnesium, sodium, and iron; the second factor explained 21.47%, comprising only manganese and copper; the third factor explained 15.66%, comprising zinc and lead; while the fourth factor explained 15.50%, comprising potassium. Discriminant analysis showed that magnesium and sodium exhibited a strong contribution in discriminating the two types of banana flour, affording 100% correct assignation. This study presents the usefulness of multivariate statistical techniques for analysis and interpretation of complex mineral content data from banana flour of different varieties.

  10. PYCHEM: a multivariate analysis package for python.

    PubMed

    Jarvis, Roger M; Broadhurst, David; Johnson, Helen; O'Boyle, Noel M; Goodacre, Royston

    2006-10-15

    We have implemented a multivariate statistical analysis toolbox, with an optional standalone graphical user interface (GUI), using the Python scripting language. This is a free and open source project that addresses the need for a multivariate analysis toolbox in Python. Although the functionality provided does not cover the full range of multivariate tools that are available, it has a broad complement of methods that are widely used in the biological sciences. In contrast to tools like MATLAB, PyChem 2.0.0 is easily accessible and free, allows for rapid extension using a range of Python modules and is part of the growing amount of complementary and interoperable scientific software in Python based upon SciPy. One of the attractions of PyChem is that it is an open source project and so there is an opportunity, through collaboration, to increase the scope of the software and to continually evolve a user-friendly platform that has applicability across a wide range of analytical and post-genomic disciplines. http://sourceforge.net/projects/pychem

  11. Borrowing of strength and study weights in multivariate and network meta-analysis.

    PubMed

    Jackson, Dan; White, Ian R; Price, Malcolm; Copas, John; Riley, Richard D

    2017-12-01

    Multivariate and network meta-analysis have the potential for the estimated mean of one effect to borrow strength from the data on other effects of interest. The extent of this borrowing of strength is usually assessed informally. We present new mathematical definitions of 'borrowing of strength'. Our main proposal is based on a decomposition of the score statistic, which we show can be interpreted as comparing the precision of estimates from the multivariate and univariate models. Our definition of borrowing of strength therefore emulates the usual informal assessment. We also derive a method for calculating study weights, which we embed into the same framework as our borrowing of strength statistics, so that percentage study weights can accompany the results from multivariate and network meta-analyses as they do in conventional univariate meta-analyses. Our proposals are illustrated using three meta-analyses involving correlated effects for multiple outcomes, multiple risk factor associations and multiple treatments (network meta-analysis).

  12. Multivariate longitudinal data analysis with censored and intermittent missing responses.

    PubMed

    Lin, Tsung-I; Lachos, Victor H; Wang, Wan-Lun

    2018-05-08

    The multivariate linear mixed model (MLMM) has emerged as an important analytical tool for longitudinal data with multiple outcomes. However, the analysis of multivariate longitudinal data could be complicated by the presence of censored measurements because of a detection limit of the assay in combination with unavoidable missing values arising when subjects miss some of their scheduled visits intermittently. This paper presents a generalization of the MLMM approach, called the MLMM-CM, for a joint analysis of the multivariate longitudinal data with censored and intermittent missing responses. A computationally feasible expectation maximization-based procedure is developed to carry out maximum likelihood estimation within the MLMM-CM framework. Moreover, the asymptotic standard errors of fixed effects are explicitly obtained via the information-based method. We illustrate our methodology by using simulated data and a case study from an AIDS clinical trial. Experimental results reveal that the proposed method is able to provide more satisfactory performance as compared with the traditional MLMM approach. Copyright © 2018 John Wiley & Sons, Ltd.

  13. Borrowing of strength and study weights in multivariate and network meta-analysis

    PubMed Central

    Jackson, Dan; White, Ian R; Price, Malcolm; Copas, John; Riley, Richard D

    2016-01-01

    Multivariate and network meta-analysis have the potential for the estimated mean of one effect to borrow strength from the data on other effects of interest. The extent of this borrowing of strength is usually assessed informally. We present new mathematical definitions of ‘borrowing of strength’. Our main proposal is based on a decomposition of the score statistic, which we show can be interpreted as comparing the precision of estimates from the multivariate and univariate models. Our definition of borrowing of strength therefore emulates the usual informal assessment. We also derive a method for calculating study weights, which we embed into the same framework as our borrowing of strength statistics, so that percentage study weights can accompany the results from multivariate and network meta-analyses as they do in conventional univariate meta-analyses. Our proposals are illustrated using three meta-analyses involving correlated effects for multiple outcomes, multiple risk factor associations and multiple treatments (network meta-analysis). PMID:26546254

  14. Work and retirement among a cohort of older men in the United States, 1966-1983.

    PubMed

    Hayward, M D; Grady, W R

    1990-08-01

    Multivariate increment-decrement working life tables are estimated for a cohort of older men in the United States for the period 1966-1983. The approach taken allows multiple processes to be simultaneously incorporated into a single model, resulting in a more realistic portrayal of a cohort's late-life labor force behavior. In addition, because the life table model is developed from multivariate hazard equations, we identify the effects of sociodemographic characteristics on the potentially complex process by which the labor force career is ended. In contrast to the assumed homogeneity of previous working life table analyses, the present study shows marked differences in labor force mobility and working and nonworking life expectancy according to occupation, class of worker, education, race, and marital status. We briefly discuss the implications of these findings for inequities of access to retirement, private and public pension consumption, and future changes in the retirement process.

  15. Regularization with numerical extrapolation for finite and UV-divergent multi-loop integrals

    NASA Astrophysics Data System (ADS)

    de Doncker, E.; Yuasa, F.; Kato, K.; Ishikawa, T.; Kapenga, J.; Olagbemi, O.

    2018-03-01

    We give numerical integration results for Feynman loop diagrams such as those covered by Laporta (2000) and by Baikov and Chetyrkin (2010), and which may give rise to loop integrals with UV singularities. We explore automatic adaptive integration using multivariate techniques from the PARINT package for multivariate integration, as well as iterated integration with programs from the QUADPACK package, and a trapezoidal method based on a double exponential transformation. PARINT is layered over MPI (Message Passing Interface), and incorporates advanced parallel/distributed techniques including load balancing among processes that may be distributed over a cluster or a network/grid of nodes. Results are included for 2-loop vertex and box diagrams and for sets of 2-, 3- and 4-loop self-energy diagrams with or without UV terms. Numerical regularization of integrals with singular terms is achieved by linear and non-linear extrapolation methods.
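
    The trapezoidal method based on a double exponential transformation can be sketched in one dimension. The code below applies the tanh-sinh substitution x = tanh((pi/2) sinh t) and sums with an equally spaced trapezoidal rule in t. It is a minimal illustration written for this listing, not code from PARINT or QUADPACK; the step size, the truncation point and the test integrands are assumptions.

```python
import math


def tanh_sinh(f, a, b, h=0.125, n=24):
    """Approximate the integral of f over (a, b) with a double exponential rule."""
    centre, half = (a + b) / 2.0, (b - a) / 2.0
    total = 0.0
    for k in range(-n, n + 1):
        t = k * h
        u = (math.pi / 2.0) * math.sinh(t)
        x = math.tanh(u)
        if abs(x) == 1.0:
            # The node has rounded onto an endpoint; its weight is negligible.
            continue
        weight = (math.pi / 2.0) * math.cosh(t) / math.cosh(u) ** 2
        total += f(centre + half * x) * weight
    return half * h * total


smooth = tanh_sinh(math.sin, 0.0, math.pi)             # exact value is 2
singular = tanh_sinh(lambda x: x ** -0.5, 0.0, 1.0)    # exact value is 2
print(smooth, singular)
```

    The weights decay doubly exponentially towards the endpoints, which is why the rule tolerates integrable endpoint singularities such as x^(-1/2) without special treatment.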

  16. Modelling lifetime data with multivariate Tweedie distribution

    NASA Astrophysics Data System (ADS)

    Nor, Siti Rohani Mohd; Yusof, Fadhilah; Bahar, Arifah

    2017-05-01

    This study measures the dependence between individual lifetimes by applying a multivariate Tweedie distribution to lifetime data. Incorporating dependence between lifetimes into the mortality model is a relatively new idea that significantly affects the risk of an annuity portfolio, and it runs against standard actuarial methods, which assume that lifetimes are independent. Hence, this paper applies the Tweedie family of distributions to a portfolio of lifetimes to induce dependence between lives. The Tweedie distribution is chosen because the family contains symmetric and non-symmetric, as well as light-tailed and heavy-tailed, distributions. Parameter estimation is modified in order to fit the Tweedie distribution to the data, using the method of moments. In addition, a comparison stage checks the agreement between observed and expected mortality. Finally, the importance of including systematic mortality risk in the model is justified by Pearson's chi-squared test.

  17. A framework for multivariate data-based at-site flood frequency analysis: Essentiality of the conjugal application of parametric and nonparametric approaches

    NASA Astrophysics Data System (ADS)

    Vittal, H.; Singh, Jitendra; Kumar, Pankaj; Karmakar, Subhankar

    2015-06-01

    In watershed management, flood frequency analysis (FFA) is performed to quantify the risk of flooding at different spatial locations and also to provide guidelines for determining the design periods of flood control structures. The traditional FFA was extensively performed by considering univariate scenario for both at-site and regional estimation of return periods. However, due to inherent mutual dependence of the flood variables or characteristics [i.e., peak flow (P), flood volume (V) and flood duration (D), which are random in nature], analysis has been further extended to multivariate scenario, with some restrictive assumptions. To overcome the assumption of same family of marginal density function for all flood variables, the concept of copula has been introduced. Although, the advancement from univariate to multivariate analyses drew formidable attention to the FFA research community, the basic limitation was that the analyses were performed with the implementation of only parametric family of distributions. The aim of the current study is to emphasize the importance of nonparametric approaches in the field of multivariate FFA; however, the nonparametric distribution may not always be a good-fit and capable of replacing well-implemented multivariate parametric and multivariate copula-based applications. Nevertheless, the potential of obtaining best-fit using nonparametric distributions might be improved because such distributions reproduce the sample's characteristics, resulting in more accurate estimations of the multivariate return period. Hence, the current study shows the importance of conjugating multivariate nonparametric approach with multivariate parametric and copula-based approaches, thereby results in a comprehensive framework for complete at-site FFA. Although the proposed framework is designed for at-site FFA, this approach can also be applied to regional FFA because regional estimations ideally include at-site estimations. 
    The framework is based on the following steps: (i) comprehensive trend analysis to assess nonstationarity in the observed data; (ii) selection of the best-fit univariate marginal distribution with a comprehensive set of parametric and nonparametric distributions for the flood variables; (iii) multivariate frequency analyses with parametric, copula-based and nonparametric approaches; and (iv) estimation of joint and various conditional return periods. The proposed framework for frequency analysis is demonstrated using 110 years of observed data from the Allegheny River at Salamanca, New York, USA. The results show that for both univariate and multivariate cases, the nonparametric Gaussian kernel provides the best estimate. Further, we perform FFA for twenty major rivers over the continental USA, which shows that for seven rivers all the flood variables followed the nonparametric Gaussian kernel, whereas for the other rivers parametric distributions provide the best fit for either one or two flood variables. Thus the summary of results shows that the nonparametric method cannot substitute for the parametric and copula-based approaches, but should be considered during any at-site FFA to provide the broadest choice for best estimation of the flood return periods.
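
    The nonparametric step can be illustrated in its simplest, univariate form: estimate the distribution of annual peak flow with a Gaussian kernel and convert the non-exceedance probability F(x) into a return period T = 1 / (1 - F(x)). The sketch below is an assumption-laden illustration rather than the authors' procedure; the bandwidth rule (a common rule of thumb), the function names and the peak-flow values are all hypothetical, and the joint and conditional return periods of the framework are not covered.

```python
from statistics import NormalDist, stdev


def kernel_cdf(sample, x, bandwidth=None):
    """Gaussian-kernel estimate of P(X <= x)."""
    n = len(sample)
    if bandwidth is None:
        # Rule-of-thumb bandwidth; an assumption, not taken from the study.
        bandwidth = 1.06 * stdev(sample) * n ** (-0.2)
    std_normal = NormalDist()
    return sum(std_normal.cdf((x - xi) / bandwidth) for xi in sample) / n


def return_period(sample, x, bandwidth=None):
    """Return period in years for annual maxima exceeding x."""
    return 1.0 / (1.0 - kernel_cdf(sample, x, bandwidth))


# Hypothetical annual peak flows (m^3/s), for illustration only.
peaks = [420, 515, 610, 480, 730, 560, 690, 450, 820, 590]
for flow in (600, 900):
    print(flow, round(return_period(peaks, flow), 1))
```

    Because the kernel estimate is built directly from the sample, it follows the observed peaks closely, which is the property the abstract credits for the nonparametric fits.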

  18. Kernel canonical-correlation Granger causality for multiple time series

    NASA Astrophysics Data System (ADS)

    Wu, Guorong; Duan, Xujun; Liao, Wei; Gao, Qing; Chen, Huafu

    2011-04-01

    Canonical-correlation analysis as a multivariate statistical technique has been applied to multivariate Granger causality analysis to infer information flow in complex systems. It shows unique appeal and great superiority over the traditional vector autoregressive method, due to the simplified procedure that detects causal interaction between multiple time series, and the avoidance of potential model estimation problems. However, it is limited to the linear case. Here, we extend the framework of canonical correlation to include the estimation of multivariate nonlinear Granger causality for drawing inference about directed interaction. Its feasibility and effectiveness are verified on simulated data.
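
    For orientation, the linear case that the kernel method extends can be sketched as follows. The code computes a simple Granger index, the log ratio of residual sums of squares from a lag-1 autoregression of the target with and without the lagged source. It is a linear baseline on simulated zero-mean data, not the kernel canonical-correlation method of the paper; the lag order, coefficients and function names are assumptions.

```python
import math
import random


def rss_one(y, r1):
    """Residual sum of squares of y regressed on r1 (no intercept)."""
    b = sum(u * v for u, v in zip(r1, y)) / sum(u * u for u in r1)
    return sum((v - b * u) ** 2 for u, v in zip(r1, y))


def rss_two(y, r1, r2):
    """Residual sum of squares of y regressed on r1 and r2 (no intercept)."""
    s11 = sum(u * u for u in r1)
    s22 = sum(u * u for u in r2)
    s12 = sum(u * v for u, v in zip(r1, r2))
    t1 = sum(u * v for u, v in zip(r1, y))
    t2 = sum(u * v for u, v in zip(r2, y))
    det = s11 * s22 - s12 * s12
    b1 = (s22 * t1 - s12 * t2) / det
    b2 = (s11 * t2 - s12 * t1) / det
    return sum((v - b1 * u1 - b2 * u2) ** 2 for v, u1, u2 in zip(y, r1, r2))


def granger_index(source, target):
    """log(RSS restricted / RSS unrestricted) for source -> target at lag 1."""
    y, own, other = target[1:], target[:-1], source[:-1]
    return math.log(rss_one(y, own) / rss_two(y, own, other))


# Simulated pair in which x drives y but y does not drive x.
random.seed(0)
x, y = [0.0], [0.0]
for _ in range(2000):
    x_new = 0.5 * x[-1] + random.gauss(0.0, 1.0)
    y_new = 0.3 * y[-1] + 0.8 * x[-1] + random.gauss(0.0, 1.0)
    x.append(x_new)
    y.append(y_new)

gi_xy = granger_index(x, y)
gi_yx = granger_index(y, x)
print(gi_xy, gi_yx)
```

    A clearly larger index in the x -> y direction reflects the simulated coupling; a nonlinear dependence of y on lagged x could be missed by this index, which is the limitation the abstract addresses.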

  19. Multivariate geometry as an approach to algal community analysis

    USGS Publications Warehouse

    Allen, T.F.H.; Skagen, S.

    1973-01-01

    Multivariate analyses are put in the context of more usual approaches to phycological investigations. The intuitive common-sense involved in methods of ordination, classification and discrimination are emphasised by simple geometric accounts which avoid jargon and matrix algebra. Warnings are given that artifacts result from technique abuses by the naive or over-enthusiastic. An analysis of a simple periphyton data set is presented as an example of the approach. Suggestions are made as to situations in phycological investigations, where the techniques could be appropriate. The discipline is reprimanded for its neglect of the multivariate approach.

  20. Examining the impacts of increased corn production on groundwater quality using a coupled modeling system.

    PubMed

    Garcia, Valerie; Cooter, Ellen; Crooks, James; Hinckley, Brian; Murphy, Mark; Xing, Xiangnan

    2017-05-15

    This study demonstrates the value of a coupled chemical transport modeling system for investigating groundwater nitrate contamination responses associated with nitrogen (N) fertilizer application and increased corn production. The coupled Community Multiscale Air Quality Bidirectional and Environmental Policy Integrated Climate modeling system incorporates agricultural management practices and N exchange processes between the soil and atmosphere to estimate levels of N that may volatilize into the atmosphere, re-deposit, and seep or flow into surface and groundwater. Simulated values from this modeling system were used in a land-use regression model to examine associations between groundwater nitrate-N measurements and a suite of factors related to N fertilizer and groundwater nitrate contamination. Multi-variable modeling analysis revealed that the N-fertilizer rate (versus total) applied to irrigated (versus rainfed) grain corn (versus other crops) was the strongest N-related predictor variable of groundwater nitrate-N concentrations. Application of this multi-variable model considered groundwater nitrate-N concentration responses under two corn production scenarios. Findings suggest that increased corn production between 2002 and 2022 could result in 56% to 79% increase in areas vulnerable to groundwater nitrate-N concentrations ≥5mg/L. These above-threshold areas occur on soils with a hydraulic conductivity 13% higher than the rest of the domain. Additionally, the average number of animal feeding operations (AFOs) for these areas was nearly 5 times higher, and the mean N-fertilizer rate was 4 times higher. Finally, we found that areas prone to high groundwater nitrate-N concentrations attributable to the expansion scenario did not occur in new grid cells of irrigated grain-corn croplands, but were clustered around areas of existing corn crops. 
    This application demonstrates the value of the coupled modeling system in developing spatially refined multi-variable models to provide information for geographic locations lacking complete observational data; and in projecting possible groundwater nitrate-N concentration outcomes under alternative future crop production scenarios. Published by Elsevier B.V.

  1. Comparison of Optimum Interpolation and Cressman Analyses

    NASA Technical Reports Server (NTRS)

    Baker, W. E.; Bloom, S. C.; Nestler, M. S.

    1984-01-01

    The objective of this investigation is to develop a state-of-the-art optimum interpolation (O/I) objective analysis procedure for use in numerical weather prediction studies. A three-dimensional multivariate O/I analysis scheme has been developed. Some characteristics of the GLAS O/I compared with those of the NMC and ECMWF systems are summarized. Some recent enhancements of the GLAS scheme include a univariate analysis of water vapor mixing ratio, a geographically dependent model prediction error correlation function and a multivariate oceanic surface analysis.
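
    The core of an optimum interpolation analysis is a weighted correction of a background (first-guess) value by nearby observation increments, with weights obtained from the background and observation error covariances. The sketch below shows the two-observation, single-variable case; it is a generic textbook form written for illustration and makes no claim about the GLAS scheme. All names and numbers are hypothetical.

```python
def oi_weights(b_oo, r_diag, b_go):
    """Solve (B_oo + R) w = b_go for two observations.

    b_oo   : 2x2 background error covariance between observation points
    r_diag : observation error variances (assumed uncorrelated)
    b_go   : background error covariance between grid point and each observation
    """
    a11 = b_oo[0][0] + r_diag[0]
    a22 = b_oo[1][1] + r_diag[1]
    a12 = b_oo[0][1]
    det = a11 * a22 - a12 * a12
    w1 = (a22 * b_go[0] - a12 * b_go[1]) / det
    w2 = (a11 * b_go[1] - a12 * b_go[0]) / det
    return w1, w2


def oi_analysis(background, increments, weights):
    """Analysis value = background + weighted observation increments."""
    return background + sum(w * d for w, d in zip(weights, increments))


# Hypothetical numbers: unit background variance, correlation 0.5 between sites.
w = oi_weights([[1.0, 0.5], [0.5, 1.0]], [0.25, 0.25], [0.8, 0.6])
analysis = oi_analysis(280.0, [1.5, -0.5], w)
print(w, analysis)
```

    A multivariate scheme extends the same linear system so that increments in one variable also correct another through cross-covariances.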

  2. Comparison of Optimum Interpolation and Cressman Analyses

    NASA Technical Reports Server (NTRS)

    Baker, W. E.; Bloom, S. C.; Nestler, M. S.

    1985-01-01

    The development of a state of the art optimum interpolation (O/I) objective analysis procedure for use in numerical weather prediction studies was investigated. A three dimensional multivariate O/I analysis scheme was developed. Some characteristics of the GLAS O/I compared with those of the NMC and ECMWF systems are summarized. Some recent enhancements of the GLAS scheme include a univariate analysis of water vapor mixing ratio, a geographically dependent model prediction error correlation function and a multivariate oceanic surface analysis.

  3. Tracking Problem Solving by Multivariate Pattern Analysis and Hidden Markov Model Algorithms

    ERIC Educational Resources Information Center

    Anderson, John R.

    2012-01-01

    Multivariate pattern analysis can be combined with Hidden Markov Model algorithms to track the second-by-second thinking as people solve complex problems. Two applications of this methodology are illustrated with a data set taken from children as they interacted with an intelligent tutoring system for algebra. The first "mind reading" application…

  4. Functional Path Analysis as a Multivariate Technique in Developing a Theory of Participation in Adult Education.

    ERIC Educational Resources Information Center

    Martin, James L.

    This paper reports on attempts by the author to construct a theoretical framework of adult education participation using a theory development process and the corresponding multivariate statistical techniques. Two problems are identified: the lack of theoretical framework in studying problems, and the limiting of statistical analysis to univariate…

  5. Missing Data and Multiple Imputation in the Context of Multivariate Analysis of Variance

    ERIC Educational Resources Information Center

    Finch, W. Holmes

    2016-01-01

    Multivariate analysis of variance (MANOVA) is widely used in educational research to compare means on multiple dependent variables across groups. Researchers faced with the problem of missing data often use multiple imputation of values in place of the missing observations. This study compares the performance of 2 methods for combining p values in…

  6. Web-Based Tools for Modelling and Analysis of Multivariate Data: California Ozone Pollution Activity

    ERIC Educational Resources Information Center

    Dinov, Ivo D.; Christou, Nicolas

    2011-01-01

    This article presents a hands-on web-based activity motivated by the relation between human health and ozone pollution in California. This case study is based on multivariate data collected monthly at 20 locations in California between 1980 and 2006. Several strategies and tools for data interrogation and exploratory data analysis, model fitting…

  7. Bias and Precision of Measures of Association for a Fixed-Effect Multivariate Analysis of Variance Model

    ERIC Educational Resources Information Center

    Kim, Soyoung; Olejnik, Stephen

    2005-01-01

    The sampling distributions of five popular measures of association with and without two bias adjusting methods were examined for the single factor fixed-effects multivariate analysis of variance model. The number of groups, sample sizes, number of outcomes, and the strength of association were manipulated. The results indicate that all five…

  8. Multivariate analysis of climate along the southern coast of Alaska—some forestry implications.

    Treesearch

    Wilbur A. Farr; John S. Hard

    1987-01-01

    A multivariate analysis of climate was used to delineate 10 significantly different groups of climatic stations along the southern coast of Alaska based on latitude, longitude, seasonal temperatures and precipitation, frost-free periods, and total number of growing degree days. The climatic stations were too few to delineate this rugged, mountainous region into...

  9. Improved detection of highly energetic materials traces on surfaces by standoff laser-induced thermal emission incorporating neural networks

    NASA Astrophysics Data System (ADS)

    Figueroa-Navedo, Amanda; Galán-Freyle, Nataly Y.; Pacheco-Londoño, Leonardo C.; Hernández-Rivera, Samuel P.

    2013-05-01

    Terrorists conceal highly energetic materials (HEM) as Improvised Explosive Devices (IED) in various types of materials such as PVC, wood, Teflon, aluminum, acrylic, carton and rubber to disguise them from detection equipment used by military and security agency personnel. Infrared emissions (IREs) of substrates, with and without HEM, were measured to generate models for detection and discrimination. Multivariable analysis techniques such as principal component analysis (PCA), soft independent modeling by class analogy (SIMCA), partial least squares-discriminant analysis (PLS-DA), support vector machine (SVM) and neural networks (NN) were employed to generate models, in which the emission of IR light from heated samples was stimulated using a CO2 laser, giving rise to laser-induced thermal emission (LITE) of HEMs. Traces of a specific target threat chemical explosive, PETN, in surface concentrations of 10 to 300 µg/cm² were studied on the surfaces mentioned. A custom-built experimental setup used a CO2 laser as a heating source positioned with a telescope, where a minimal loss in reflective optics was reported, for the Mid-IR at a distance of 4 m and 32 scans at 10 s. SVM-DA was the best-performing statistical technique, with a discrimination performance of 97%; PLS-DA accurately predicted over 94% and NN 88%.

  10. Kinematic Analysis of Speech Sound Sequencing Errors Induced by Delayed Auditory Feedback.

    PubMed

    Cler, Gabriel J; Lee, Jackson C; Mittelman, Talia; Stepp, Cara E; Bohland, Jason W

    2017-06-22

    Delayed auditory feedback (DAF) causes speakers to become disfluent and make phonological errors. Methods for assessing the kinematics of speech errors are lacking, with most DAF studies relying on auditory perceptual analyses, which may be problematic, as errors judged to be categorical may actually represent blends of sounds or articulatory errors. Eight typical speakers produced nonsense syllable sequences under normal and DAF (200 ms). Lip and tongue kinematics were captured with electromagnetic articulography. Time-locked acoustic recordings were transcribed, and the kinematics of utterances with and without perceived errors were analyzed with existing and novel quantitative methods. New multivariate measures showed that for 5 participants, kinematic variability for productions perceived to be error free was significantly increased under delay; these results were validated by using the spatiotemporal index measure. Analysis of error trials revealed both typical productions of a nontarget syllable and productions with articulatory kinematics that incorporated aspects of both the target and the perceived utterance. This study is among the first to characterize articulatory changes under DAF and provides evidence for different classes of speech errors, which may not be perceptually salient. New methods were developed that may aid visualization and analysis of large kinematic data sets. https://doi.org/10.23641/asha.5103067.

  11. A Skew-t space-varying regression model for the spectral analysis of resting state brain activity.

    PubMed

    Ismail, Salimah; Sun, Wenqi; Nathoo, Farouk S; Babul, Arif; Moiseev, Alexader; Beg, Mirza Faisal; Virji-Babul, Naznin

    2013-08-01

    It is known that in many neurological disorders such as Down syndrome, main brain rhythms shift their frequencies slightly, and characterizing the spatial distribution of these shifts is of interest. This article reports on the development of a Skew-t mixed model for the spatial analysis of resting state brain activity in healthy controls and individuals with Down syndrome. Time series of oscillatory brain activity are recorded using magnetoencephalography, and spectral summaries are examined at multiple sensor locations across the scalp. We focus on the mean frequency of the power spectral density, and use space-varying regression to examine associations with age, gender and Down syndrome across several scalp regions. Spatial smoothing priors are incorporated based on a multivariate Markov random field, and the markedly non-Gaussian nature of the spectral response variable is accommodated by the use of a Skew-t distribution. A range of models representing different assumptions on the association structure and response distribution are examined, and we conduct model selection using the deviance information criterion. Our analysis suggests region-specific differences between healthy controls and individuals with Down syndrome, particularly in the left and right temporal regions, and produces smoothed maps indicating the scalp topography of the estimated differences.

  12. Kinematic Analysis of Speech Sound Sequencing Errors Induced by Delayed Auditory Feedback

    PubMed Central

    Lee, Jackson C.; Mittelman, Talia; Stepp, Cara E.; Bohland, Jason W.

    2017-01-01

    Purpose: Delayed auditory feedback (DAF) causes speakers to become disfluent and make phonological errors. Methods for assessing the kinematics of speech errors are lacking, with most DAF studies relying on auditory perceptual analyses, which may be problematic, as errors judged to be categorical may actually represent blends of sounds or articulatory errors. Method: Eight typical speakers produced nonsense syllable sequences under normal and DAF (200 ms). Lip and tongue kinematics were captured with electromagnetic articulography. Time-locked acoustic recordings were transcribed, and the kinematics of utterances with and without perceived errors were analyzed with existing and novel quantitative methods. Results: New multivariate measures showed that for 5 participants, kinematic variability for productions perceived to be error free was significantly increased under delay; these results were validated by using the spatiotemporal index measure. Analysis of error trials revealed both typical productions of a nontarget syllable and productions with articulatory kinematics that incorporated aspects of both the target and the perceived utterance. Conclusions: This study is among the first to characterize articulatory changes under DAF and provides evidence for different classes of speech errors, which may not be perceptually salient. New methods were developed that may aid visualization and analysis of large kinematic data sets. Supplemental Material: https://doi.org/10.23641/asha.5103067 PMID:28655038

  13. Patent ductus arteriosus and indomethacin treatment as independent risk factors for plus disease in retinopathy of prematurity.

    PubMed

    Tsui, Irena; Ebani, Edward; Rosenberg, Jamie B; Lin, Juan; Angert, Robert M; Mian, Umar

    2013-01-01

    To examine whether clinically significant patent ductus arteriosus (PDA) or indomethacin treatment are associated with plus disease or retinopathy of prematurity (ROP) requiring treatment. Retrospective, cross-sectional study. Charts were reviewed for gestational age, birth weight, birth head circumference, birth length, maternal characteristics, gender, bronchopulmonary dysplasia, neurologic comorbidities, PDA and its treatments, gastrointestinal comorbidities, blood transfusions, and sepsis. Main outcome measures were increased rates of plus disease or ROP requiring treatment. A total of 450 premature infants screened for ROP in a mid-sized, urban neonatal intensive care unit were included. On univariate analysis, gestational age, birth weight, birth head circumference, birth length, bronchopulmonary dysplasia, neurologic comorbidities, PDA and its treatments, gastrointestinal comorbidities, and sepsis were significantly correlated to plus disease and ROP requiring treatment. PDA was significantly associated with bronchopulmonary dysplasia, neurologic comorbidities, sepsis, and blood transfusions (P < .0001). With type 3 multivariate analysis, only gestational age and bronchopulmonary dysplasia were independent risk factors for ROP. PDA and indomethacin were associated with plus disease and ROP requiring treatment on univariate analysis but this was not significant after adjusting for other risk factors. PDA was also strongly related to bronchopulmonary dysplasia and blood transfusions, which may explain its effect on ROP. Copyright 2013, SLACK Incorporated.

  14. Multivariate Meta-Analysis of Genetic Association Studies: A Simulation Study

    PubMed Central

    Neupane, Binod; Beyene, Joseph

    2015-01-01

    In a meta-analysis with multiple end points of interests that are correlated between or within studies, multivariate approach to meta-analysis has a potential to produce more precise estimates of effects by exploiting the correlation structure between end points. However, under random-effects assumption the multivariate estimation is more complex (as it involves estimation of more parameters simultaneously) than univariate estimation, and sometimes can produce unrealistic parameter estimates. Usefulness of multivariate approach to meta-analysis of the effects of a genetic variant on two or more correlated traits is not well understood in the area of genetic association studies. In such studies, genetic variants are expected to roughly maintain Hardy-Weinberg equilibrium within studies, and also their effects on complex traits are generally very small to modest and could be heterogeneous across studies for genuine reasons. We carried out extensive simulation to explore the comparative performance of multivariate approach with most commonly used univariate inverse-variance weighted approach under random-effects assumption in various realistic meta-analytic scenarios of genetic association studies of correlated end points. We evaluated the performance with respect to relative mean bias percentage, and root mean square error (RMSE) of the estimate and coverage probability of corresponding 95% confidence interval of the effect for each end point. Our simulation results suggest that multivariate approach performs similarly or better than univariate method when correlations between end points within or between studies are at least moderate and between-study variation is similar or larger than average within-study variation for meta-analyses of 10 or more genetic studies. 
    Multivariate approach produces estimates with smaller bias and RMSE especially for the end point that has randomly or informatively missing summary data in some individual studies, when the missing data in the endpoint are imputed with null effects and quite large variance. PMID:26196398

  15. Multivariate Meta-Analysis of Genetic Association Studies: A Simulation Study.

    PubMed

    Neupane, Binod; Beyene, Joseph

    2015-01-01

    In a meta-analysis with multiple end points of interests that are correlated between or within studies, multivariate approach to meta-analysis has a potential to produce more precise estimates of effects by exploiting the correlation structure between end points. However, under random-effects assumption the multivariate estimation is more complex (as it involves estimation of more parameters simultaneously) than univariate estimation, and sometimes can produce unrealistic parameter estimates. Usefulness of multivariate approach to meta-analysis of the effects of a genetic variant on two or more correlated traits is not well understood in the area of genetic association studies. In such studies, genetic variants are expected to roughly maintain Hardy-Weinberg equilibrium within studies, and also their effects on complex traits are generally very small to modest and could be heterogeneous across studies for genuine reasons. We carried out extensive simulation to explore the comparative performance of multivariate approach with most commonly used univariate inverse-variance weighted approach under random-effects assumption in various realistic meta-analytic scenarios of genetic association studies of correlated end points. We evaluated the performance with respect to relative mean bias percentage, and root mean square error (RMSE) of the estimate and coverage probability of corresponding 95% confidence interval of the effect for each end point. Our simulation results suggest that multivariate approach performs similarly or better than univariate method when correlations between end points within or between studies are at least moderate and between-study variation is similar or larger than average within-study variation for meta-analyses of 10 or more genetic studies. 
    Multivariate approach produces estimates with smaller bias and RMSE especially for the end point that has randomly or informatively missing summary data in some individual studies, when the missing data in the endpoint are imputed with null effects and quite large variance.

  16. MULTIVARIATE ANALYSES (CANONICAL CORRELATION AND PARTIAL LEAST SQUARES, PLS) TO MODEL AND ASSESS THE ASSOCIATION OF LANDSCAPE METRICS TO SURFACE WATER CHEMICAL AND BIOLOGICAL PROPERTIES USING SAVANNAH RIVER BASIN DATA.

    EPA Science Inventory

    Many multivariate methods are used in describing and predicting relation; each has its unique usage of categorical and non-categorical data. In multivariate analysis of variance (MANOVA), many response variables (y's) are related to many independent variables that are categorical...

  17. Multivariate Density Estimation and Remote Sensing

    NASA Technical Reports Server (NTRS)

    Scott, D. W.

    1983-01-01

    Current efforts to develop methods and computer algorithms to effectively represent multivariate data commonly encountered in remote sensing applications are described. While this may involve scatter diagrams, multivariate representations of nonparametric probability density estimates are emphasized. The density function provides a useful graphical tool for looking at data and a useful theoretical tool for classification. This approach is called a thunderstorm data analysis.

  18. Comprehensive drought characteristics analysis based on a nonlinear multivariate drought index

    NASA Astrophysics Data System (ADS)

    Yang, Jie; Chang, Jianxia; Wang, Yimin; Li, Yunyun; Hu, Hui; Chen, Yutong; Huang, Qiang; Yao, Jun

    2018-02-01

    It is vital to identify drought events and to evaluate multivariate drought characteristics based on a composite drought index for better drought risk assessment and sustainable development of water resources. However, most composite drought indices are constructed by linear combination, principal component analysis, or the entropy weight method, assuming a linear relationship among different drought indices. In this study, a multidimensional copula function was applied to construct a nonlinear multivariate drought index (NMDI), because its flexible dependence structure can capture the complicated, nonlinear relationship among the indices. The NMDI was constructed by combining meteorological, hydrological, and agricultural variables (precipitation, runoff, and soil moisture) so that all three are reflected simultaneously. Based on the constructed NMDI and runs theory, drought events for a particular area were identified in terms of three drought characteristics: duration, peak, and severity. Finally, multivariate drought risk was analyzed as a tool for providing reliable support in drought decision-making. The results indicate that: (1) multidimensional copulas can effectively capture the complicated and nonlinear relationship among multiple variables; (2) compared with single and other composite drought indices, the NMDI is slightly more sensitive in capturing recorded drought events; and (3) drought risk shows a spatial variation; out of the five partitions studied, the Jing River Basin as well as the upstream and midstream of the Wei River Basin are characterized by a higher multivariate drought risk. In general, multidimensional copulas provide a reliable way to handle the nonlinear relationship when constructing a comprehensive drought index and evaluating multivariate drought characteristics.

  19. Insights on multivariate updates of physical and biogeochemical ocean variables using an Ensemble Kalman Filter and an idealized model of upwelling

    NASA Astrophysics Data System (ADS)

    Yu, Liuqian; Fennel, Katja; Bertino, Laurent; Gharamti, Mohamad El; Thompson, Keith R.

    2018-06-01

    Effective data assimilation methods for incorporating observations into marine biogeochemical models are required to improve hindcasts, nowcasts and forecasts of the ocean's biogeochemical state. Recent assimilation efforts have shown that updating model physics alone can degrade biogeochemical fields while only updating biogeochemical variables may not improve a model's predictive skill when the physical fields are inaccurate. Here we systematically investigate whether multivariate updates of physical and biogeochemical model states are superior to only updating either physical or biogeochemical variables. We conducted a series of twin experiments in an idealized ocean channel that experiences wind-driven upwelling. The forecast model was forced with biased wind stress and perturbed biogeochemical model parameters compared to the model run representing the "truth". Taking advantage of the multivariate nature of the deterministic Ensemble Kalman Filter (DEnKF), we assimilated different combinations of synthetic physical (sea surface height, sea surface temperature and temperature profiles) and biogeochemical (surface chlorophyll and nitrate profiles) observations. We show that when biogeochemical and physical properties are highly correlated (e.g., thermocline and nutricline), multivariate updates of both are essential for improving model skill and can be accomplished by assimilating either physical (e.g., temperature profiles) or biogeochemical (e.g., nutrient profiles) observations. In our idealized domain, the improvement is largely due to a better representation of nutrient upwelling, which results in a more accurate nutrient input into the euphotic zone. In contrast, assimilating surface chlorophyll improves the model state only slightly, because surface chlorophyll contains little information about the vertical density structure. 
    We also show that a degradation of the correlation between observed subsurface temperature and nutrient fields, which has been an issue in several previous assimilation studies, can be reduced by multivariate updates of physical and biogeochemical fields.
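
    The multivariate update described in this abstract can be illustrated with a toy two-variable ensemble. The sketch below is not the authors' code: it assumes a scalar temperature and a scalar nitrate value per ensemble member, a single temperature observation, and the commonly cited DEnKF form in which the ensemble mean is updated with the full Kalman gain and the anomalies with half of it. The function name `denkf_update` and all numbers are hypothetical.

```python
def denkf_update(temps, nitrates, obs_temp, obs_var):
    """One DEnKF-style analysis step that observes temperature only.

    Nitrate is never observed; it is corrected through its ensemble
    covariance with temperature (the "multivariate" part of the update).
    """
    n = len(temps)
    t_mean = sum(temps) / n
    n_mean = sum(nitrates) / n
    t_anom = [t - t_mean for t in temps]
    n_anom = [x - n_mean for x in nitrates]

    # Sample variance of the observed variable and cross-covariance.
    var_t = sum(a * a for a in t_anom) / (n - 1)
    cov_nt = sum(a * b for a, b in zip(t_anom, n_anom)) / (n - 1)

    # Kalman gain, one entry per state variable.
    k_t = var_t / (var_t + obs_var)
    k_n = cov_nt / (var_t + obs_var)

    # The mean is updated with the full gain.
    innovation = obs_temp - t_mean
    t_mean_a = t_mean + k_t * innovation
    n_mean_a = n_mean + k_n * innovation

    # Anomalies are updated with half the gain (assumed DEnKF form).
    t_new = [t_mean_a + a - 0.5 * k_t * a for a in t_anom]
    n_new = [n_mean_a + b - 0.5 * k_n * a for a, b in zip(t_anom, n_anom)]
    return t_new, n_new


# Hypothetical ensemble: colder water carries more nitrate.
temps = [10.0, 11.0, 12.0, 13.0, 14.0]
nitrates = [8.0, 7.0, 6.0, 5.0, 4.0]
new_temps, new_nitrates = denkf_update(temps, nitrates, obs_temp=11.0, obs_var=0.5)
```

    In this toy case a colder-than-forecast temperature observation raises the ensemble-mean nitrate as well, which is the behaviour the abstract attributes to strongly correlated thermocline and nutricline fields.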

  20. Incorporating Single-nucleotide Polymorphisms Into the Lyman Model to Improve Prediction of Radiation Pneumonitis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tucker, Susan L., E-mail: sltucker@mdanderson.org; Li Minghuan; Xu Ting

    2013-01-01

    Purpose: To determine whether single-nucleotide polymorphisms (SNPs) in genes associated with DNA repair, cell cycle, transforming growth factor-β, tumor necrosis factor and receptor, folic acid metabolism, and angiogenesis can significantly improve the fit of the Lyman-Kutcher-Burman (LKB) normal-tissue complication probability (NTCP) model of radiation pneumonitis (RP) risk among patients with non-small cell lung cancer (NSCLC). Methods and Materials: Sixteen SNPs from 10 different genes (XRCC1, XRCC3, APEX1, MDM2, TGFβ, TNFα, TNFR, MTHFR, MTRR, and VEGF) were genotyped in 141 NSCLC patients treated with definitive radiation therapy, with or without chemotherapy. The LKB model was used to estimate the risk of severe (grade ≥3) RP as a function of mean lung dose (MLD), with SNPs and patient smoking status incorporated into the model as dose-modifying factors. Multivariate analyses were performed by adding significant factors to the MLD model in a forward stepwise procedure, with significance assessed using the likelihood-ratio test. Bootstrap analyses were used to assess the reproducibility of results under variations in the data. Results: Five SNPs were selected for inclusion in the multivariate NTCP model based on MLD alone. SNPs associated with an increased risk of severe RP were in genes for TGFβ, VEGF, TNFα, XRCC1 and APEX1. With smoking status included in the multivariate model, the SNPs significantly associated with increased risk of RP were in genes for TGFβ, VEGF, and XRCC3. Bootstrap analyses selected a median of 4 SNPs per model fit, with the 6 genes listed above selected most often. Conclusions: This study provides evidence that SNPs can significantly improve the predictive ability of the Lyman MLD model. With a small number of SNPs, it was possible to distinguish cohorts with >50% risk vs <10% risk of RP when they were exposed to high MLDs.
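
    As a rough illustration of an MLD-based Lyman model with a dose-modifying factor, the sketch below evaluates the standard probit form NTCP = Φ((D − TD50) / (m · TD50)). The abstract does not state how the dose-modifying factors enter the model, so multiplying the mean lung dose by a factor `dmf` is an assumption, and the parameter values (TD50 = 30 Gy, m = 0.4, dmf = 1.5) are hypothetical rather than the fitted values from the study.

```python
import math


def normal_cdf(t):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(t / math.sqrt(2.0)))


def lkb_ntcp(mld, td50, m, dmf=1.0):
    """Lyman NTCP as a function of mean lung dose (Gy).

    dmf is an assumed dose-modifying factor: a value above 1 makes a
    patient with the risk factor behave as if the dose were higher.
    """
    t = (dmf * mld - td50) / (m * td50)
    return normal_cdf(t)


# Hypothetical parameters, for illustration only.
baseline_risk = lkb_ntcp(mld=20.0, td50=30.0, m=0.4)
carrier_risk = lkb_ntcp(mld=20.0, td50=30.0, m=0.4, dmf=1.5)
```

    With these illustrative numbers the same 20 Gy mean lung dose gives a higher complication probability for the patient carrying the assumed risk factor, which is the kind of separation between cohorts the abstract reports.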

  1. Effect of Contact Damage on the Strength of Ceramic Materials.

    DTIC Science & Technology

    1982-10-01

    variables that are important to erosion, and a multivariate, linear regression analysis is used to fit the data to the dimensional analysis. The...of Equations 7 and 8 by a multivariable regression analysis (room temperature data) Exponent Regression Standard error Computed coefficient of...1980) 593. WEAVER, Proc. Brit. Ceram. Soc. 22 (1973) 125. 39. P. W. BRIDGMAN, "Dimensional Analysis", (Yale 18. R. W. RICE, S. W. FREIMAN and P. F

  2. Can dynamite-blasted reefs recover? A novel, low-tech approach to stimulating natural recovery in fish and coral populations.

    PubMed

    Raymundo, L J; Maypa, A P; Gomez, E D; Cadiz, Pablina

    2007-07-01

    Throughout Southeast Asia, blast fishing creates persistent rubble fields with low coral cover and depauperate fish communities. We stabilized a 20-year-old rubble field in a Marine Protected Area in the Philippines, using plastic mesh and rock piles in replicated 17.5 m² plots, thereby increasing topographic complexity, fish habitat, and recruitment substrate surface area. Multivariate analysis revealed fish community shifts within the rehabilitated area from that characteristic of rubble fields to one similar to the adjacent healthy reef within three years, as measured by changes in fish abundance and body size. Coral recruitment and percent cover increased over time, with 63.5% recruit survivorship within plots, compared with 6% on rubble. Our low-cost approach created a stable substrate favoring natural recovery processes. Both rehabilitation and the elimination of poaching were integral to success, emphasizing the synergism between the two and the need to incorporate both when considering mitigation.

  3. Prediction of Backbreak in Open-Pit Blasting Operations Using the Machine Learning Method

    NASA Astrophysics Data System (ADS)

    Khandelwal, Manoj; Monjezi, M.

    2013-03-01

    Backbreak is an undesirable phenomenon in blasting operations. It can cause instability of mine walls, falling down of machinery, improper fragmentation, reduced efficiency of drilling, etc. The existence of various effective parameters and their unknown relationships are the main reasons for inaccuracy of the empirical models. Presently, the application of new approaches such as artificial intelligence is highly recommended. In this paper, an attempt has been made to predict backbreak in blasting operations of Soungun iron mine, Iran, incorporating rock properties and blast design parameters using the support vector machine (SVM) method. To investigate the suitability of this approach, the predictions by SVM have been compared with multivariate regression analysis (MVRA). The coefficient of determination (CoD) and the mean absolute error (MAE) were taken as performance measures. It was found that the CoD between measured and predicted backbreak was 0.987 and 0.89 by SVM and MVRA, respectively, whereas the MAE was 0.29 and 1.07 by SVM and MVRA, respectively.
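
    The two performance measures named in this abstract can be computed as in the sketch below. It assumes the coefficient of determination is defined as 1 − SS_res / SS_tot (some studies report the squared correlation coefficient instead, and the abstract does not say which was used). The measured and predicted backbreak values are made up for illustration.

```python
def coefficient_of_determination(measured, predicted):
    """CoD computed as 1 - SS_res / SS_tot."""
    mean_measured = sum(measured) / len(measured)
    ss_res = sum((m - p) ** 2 for m, p in zip(measured, predicted))
    ss_tot = sum((m - mean_measured) ** 2 for m in measured)
    return 1.0 - ss_res / ss_tot


def mean_absolute_error(measured, predicted):
    """Average absolute difference between measured and predicted values."""
    return sum(abs(m - p) for m, p in zip(measured, predicted)) / len(measured)


# Hypothetical backbreak values in metres, not data from the study.
measured = [2.0, 4.0, 6.0, 8.0]
predicted = [2.5, 3.5, 6.5, 7.5]
cod = coefficient_of_determination(measured, predicted)
mae = mean_absolute_error(measured, predicted)
```

    A higher CoD together with a lower MAE, as reported for SVM relative to MVRA, indicates predictions that both track the variation in the measurements and sit closer to them on average.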

  4. Choroidal Infiltration by Retinoblastoma: Predictive Clinical Features and Outcome.

    PubMed

    Kaliki, Swathi; Tahiliani, Prerana; Iram, Sadiya; Ali, Mohammed Hasnat; Mishra, Dilip K; Reddy, Vijay Anand P

    2016-11-01

    To identify the clinical features predictive of choroidal infiltration by retinoblastoma on histopathology and to report the outcome in these patients. Retrospective study. Of the 403 patients who underwent primary enucleation for retinoblastoma, 113 patients had choroidal tumor infiltration and 290 patients had no choroidal tumor infiltration. There was a higher incidence of metastasis and related death in the choroidal tumor infiltration group compared to the no choroidal tumor infiltration group (4% vs 1%; P = .02). On multivariate analysis, the clinical features predictive of histopathologic massive choroidal infiltration included prolonged duration of symptoms for more than 6 months (hazard ratio [HR] = 3.04; P = .001) and secondary glaucoma (HR = 2.24; P = .005). In this study, the patients with retinoblastoma with prolonged duration of symptoms (> 6 months) had a three-fold greater risk and those with secondary glaucoma at presentation had a two-fold greater risk of massive choroidal tumor infiltration. [J Pediatr Ophthalmol Strabismus. 2016;53(6):349-356.]. Copyright 2016, SLACK Incorporated.

  5. Multidisciplinary optimization of controlled space structures with global sensitivity equations

    NASA Technical Reports Server (NTRS)

    Padula, Sharon L.; James, Benjamin B.; Graves, Philip C.; Woodard, Stanley E.

    1991-01-01

    A new method for the preliminary design of controlled space structures is presented. The method coordinates standard finite element structural analysis, multivariable controls, and nonlinear programming codes and allows simultaneous optimization of the structures and control systems of a spacecraft. Global sensitivity equations are a key feature of this method. The preliminary design of a generic geostationary platform is used to demonstrate the multidisciplinary optimization method. Fifteen design variables are used to optimize truss member sizes and feedback gain values. The goal is to reduce the total mass of the structure and the vibration control system while satisfying constraints on vibration decay rate. Incorporating the nonnegligible mass of actuators causes an essential coupling between structural design variables and control design variables. The solution of the demonstration problem is an important step toward a comprehensive preliminary design capability for structures and control systems. Use of global sensitivity equations helps solve optimization problems that have a large number of design variables and a high degree of coupling between disciplines.

  6. Nursing as a Career Choice by Hispanic/Latino College Students: A Multi-Institutional Study.

    PubMed

    Stroup, Linda M; Kuk, Linda

    2015-09-01

    Despite rapid growth in the Hispanic/Latino population, there is significant underrepresentation of Hispanic/Latino individuals in the nursing workforce and nursing programs. This study investigated college students' interest in and self-efficacy for nursing as a career choice, and factors that students believe will impact their success in a nursing program. A nonexperimental, associational research study using a survey instrument was conducted at three comprehensive, public state universities and one community college in the western United States in an area with a significant Hispanic/Latino population. Descriptive and multivariable correlation statistical analysis suggested that college students' interest in and self-efficacy for nursing as a career choice was similar for both Hispanic/Latino and non-Hispanic/Latino students in the sample. Perceived facilitators for success in a nursing program were identified. Findings can be used to develop strategies and programs to enhance the success of Hispanic/Latino students interested in nursing as a career choice. Copyright 2015, SLACK Incorporated.

  7. Output feedback regulator design for jet engine control systems

    NASA Technical Reports Server (NTRS)

    Merrill, W. C.

    1977-01-01

    A multivariable control design procedure based on the output feedback regulator formulation is described and applied to a turbofan engine model. Full order model dynamics were incorporated in the example design. The effect of actuator dynamics on closed loop performance was investigated. Also, the importance of turbine inlet temperature as an element of the dynamic feedback was studied. Step responses were given to indicate the improvement in system performance with this control. Calculation times for all experiments are given in CPU seconds for comparison purposes.

  8. A Framework for Establishing Standard Reference Scale of Texture by Multivariate Statistical Analysis Based on Instrumental Measurement and Sensory Evaluation.

    PubMed

    Zhi, Ruicong; Zhao, Lei; Xie, Nan; Wang, Houyin; Shi, Bolin; Shi, Jingye

    2016-01-13

    A framework for establishing a standard reference scale (texture) is proposed, using multivariate statistical analysis based on instrumental measurement and sensory evaluation. Multivariate statistical analysis is conducted to rapidly select typical reference samples with characteristics of universality, representativeness, stability, substitutability, and traceability. The reasonableness of the framework is verified by establishing a standard reference scale for a texture attribute (hardness) with well-known Chinese foods. More than 100 food products in 16 categories were tested using instrumental measurement (TPA test), and the results were analyzed with clustering analysis, principal component analysis, relative standard deviation, and analysis of variance. As a result, nine kinds of foods were determined to construct the hardness standard reference scale. The results indicate that the regression coefficient between the estimated sensory value and the instrumentally measured value is significant (R² = 0.9765), which fits well with Stevens's theory. The research provides a reliable theoretical basis and practical guide for establishing quantitative standard reference scales for food texture characteristics.

  9. A Course in... Multivariable Control Methods.

    ERIC Educational Resources Information Center

    Deshpande, Pradeep B.

    1988-01-01

    Describes an engineering course for graduate study in process control. Lists four major topics: interaction analysis, multiloop controller design, decoupling, and multivariable control strategies. Suggests a course outline and gives information about each topic. (MVL)

  10. Wavelength selection-based nonlinear calibration for transcutaneous blood glucose sensing using Raman spectroscopy

    PubMed Central

    Dingari, Narahara Chari; Barman, Ishan; Kang, Jeon Woong; Kong, Chae-Ryon; Dasari, Ramachandra R.; Feld, Michael S.

    2011-01-01

    While Raman spectroscopy provides a powerful tool for noninvasive and real time diagnostics of biological samples, its translation to the clinical setting has been impeded by the lack of robustness of spectroscopic calibration models and the size and cumbersome nature of conventional laboratory Raman systems. Linear multivariate calibration models employing full spectrum analysis are often misled by spurious correlations, such as system drift and covariations among constituents. In addition, such calibration schemes are prone to overfitting, especially in the presence of external interferences that may create nonlinearities in the spectra-concentration relationship. To address both of these issues we incorporate residue error plot-based wavelength selection and nonlinear support vector regression (SVR). Wavelength selection is used to eliminate uninformative regions of the spectrum, while SVR is used to model the curved effects such as those created by tissue turbidity and temperature fluctuations. Using glucose detection in tissue phantoms as a representative example, we show that even a substantial reduction in the number of wavelengths analyzed using SVR leads to calibration models with prediction accuracy equivalent to that of linear full spectrum analysis. Further, with clinical datasets obtained from human subject studies, we also demonstrate the prospective applicability of the selected wavelength subsets without sacrificing prediction accuracy, which has extensive implications for calibration maintenance and transfer. Additionally, such wavelength selection could substantially reduce the collection time of serial Raman acquisition systems. Given the reduced footprint of serial Raman systems in relation to conventional dispersive Raman spectrometers, we anticipate that the incorporation of wavelength selection in such hardware designs will enhance the possibility of miniaturized clinical systems for disease diagnosis in the near future. PMID:21895336

  11. Smoking Adversely Affects Survival in Acute Myeloid Leukemia Patients

    PubMed Central

    Varadarajan, Ramya; Licht, Andrea S; Hyland, Andrew J; Ford, Laurie A.; Sait, Sheila N.J.; Block, Annemarie W.; Barcos, Maurice; Baer, Maria R.; Wang, Eunice S.; Wetzler, Meir

    2011-01-01

    Summary Smoking adversely affects hematopoietic stem cell transplantation outcome. We asked whether smoking affected outcome of newly diagnosed acute myeloid leukemia (AML) patients treated with chemotherapy. Data were collected on 280 AML patients treated with high-dose cytarabine and idarubicin-containing regimens at Roswell Park Cancer Institute who had smoking status data at diagnosis. Patients’ gender, age, AML presentation (de novo vs. secondary), white blood cell (WBC) count at diagnosis, karyotype and smoking status (never vs. ever) were analyzed. Among the 161 males and 119 females with a median follow-up of 12.9 months, 101 (36.1%) had never smoked and 179 (63.9%) were ever smokers. The proportion of patients between never and ever smokers was similar with respect to age, AML presentation, WBC count at diagnosis or karyotype based on univariate analysis of these categorical variables. Never smokers had a significantly longer overall survival (60.32 months) compared to ever smokers (30.89; p=0.005). In multivariate analysis incorporating gender, age, AML presentation, WBC count, karyotype, and smoking status as covariates, age, karyotype and smoking status retained prognostic value for overall survival. In summary, cigarette smoking has a deleterious effect on overall survival in AML. PMID:21520043

  12. Development of a Risk Assessment Tool to Predict Fall-Related Severe Injuries Occurring in a Hospital

    PubMed Central

    Toyabe, Shin-ichi

    2014-01-01

    Inpatient falls are the most common adverse events that occur in a hospital, and about 3 to 10% of falls result in serious injuries such as bone fractures and intracranial haemorrhages. We previously reported that bone fractures and intracranial haemorrhages were two major fall-related injuries and that risk assessment score for osteoporotic bone fracture was significantly associated not only with bone fractures after falls but also with intracranial haemorrhage after falls. Based on the results, we tried to establish a risk assessment tool for predicting fall-related severe injuries in a hospital. Possible risk factors related to fall-related serious injuries were extracted from data on inpatients that were admitted to a tertiary-care university hospital by using multivariate Cox’s regression analysis and multiple logistic regression analysis. We found that fall risk score and fracture risk score were the two significant factors, and we constructed models to predict fall-related severe injuries incorporating these factors. When the prediction model was applied to another independent dataset, the constructed model could detect patients with fall-related severe injuries efficiently. The new assessment system could identify patients prone to severe injuries after falls in a reproducible fashion. PMID:25168984

  13. A systematic review of the relationship factor between women and health professionals within the multivariant analysis of maternal satisfaction.

    PubMed

    Macpherson, Ignacio; Roqué-Sánchez, María V; Legget Bn, Finola O; Fuertes, Ferran; Segarra, Ignacio

    2016-10-01

    Personalised support provided to women by health professionals is one of the prime factors in attaining women's satisfaction during pregnancy and childbirth. However, the multifactorial nature of 'satisfaction' makes it difficult to assess. Statistical multivariate analysis may be an effective technique to obtain in-depth quantitative evidence of the importance of this factor and its interaction with the other factors involved. This technique allows us to estimate the importance of overall satisfaction in its context and suggest actions for healthcare services. Systematic review of studies that quantitatively measure the personal relationship between women and healthcare professionals (gynecologists, obstetricians, nurses, midwives, etc.) regarding maternity care satisfaction. The literature search focused on studies carried out between 1970 and 2014 that used multivariate analyses and included the woman-caregiver relationship as a factor of their analysis. Twenty-four studies which applied various multivariate analysis tools to different periods of maternity care (antenatal, perinatal, post partum) were selected. The studies included discrete scale scores and questionnaires from women with low-risk pregnancies. The "personal relationship" factor appeared under various names: care received, personalised treatment, professional support, amongst others. The most common multivariate techniques used to assess the percentage of variance explained and the odds ratio of each factor were principal component analysis and logistic regression. The data, variables and factor analysis suggest that continuous, personalised care provided by the usual midwife and delivered within a family or a specialised setting generates the highest level of satisfaction. In addition, these factors foster the woman's psychological and physiological recovery, often surpassing clinical action (e.g. medicalization and hospital organization) and/or physiological determinants (e.g. pain, pathologies, etc.). Copyright © 2016 Elsevier Ltd. All rights reserved.

  14. Simultaneous calibration of ensemble river flow predictions over an entire range of lead times

    NASA Astrophysics Data System (ADS)

    Hemri, S.; Fundel, F.; Zappa, M.

    2013-10-01

    Probabilistic estimates of future water levels and river discharge are usually simulated with hydrologic models using ensemble weather forecasts as main inputs. As hydrologic models are imperfect and the meteorological ensembles tend to be biased and underdispersed, the ensemble forecasts for river runoff typically are biased and underdispersed, too. Thus, in order to achieve both reliable and sharp predictions statistical postprocessing is required. In this work Bayesian model averaging (BMA) is applied to statistically postprocess ensemble runoff raw forecasts for a catchment in Switzerland, at lead times ranging from 1 to 240 h. The raw forecasts have been obtained using deterministic and ensemble forcing meteorological models with different forecast lead time ranges. First, BMA is applied based on mixtures of univariate normal distributions, subject to the assumption of independence between distinct lead times. Then, the independence assumption is relaxed in order to estimate multivariate runoff forecasts over the entire range of lead times simultaneously, based on a BMA version that uses multivariate normal distributions. Since river runoff is a highly skewed variable, Box-Cox transformations are applied in order to achieve approximate normality. Both univariate and multivariate BMA approaches are able to generate well calibrated probabilistic forecasts that are considerably sharper than climatological forecasts. Additionally, multivariate BMA provides a promising approach for incorporating temporal dependencies into the postprocessed forecasts. Its major advantage against univariate BMA is an increase in reliability when the forecast system is changing due to model availability.
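
    The Box-Cox transformation mentioned in this abstract has a simple closed form, sketched below together with its inverse. The value of the transformation parameter `lam` is hypothetical; the abstract does not report the value used for the runoff data.

```python
import math


def box_cox(x, lam):
    """Box-Cox transform of a strictly positive value x."""
    if x <= 0:
        raise ValueError("Box-Cox requires x > 0")
    if lam == 0:
        return math.log(x)
    return (x ** lam - 1.0) / lam


def inverse_box_cox(y, lam):
    """Map a transformed value back to the original scale."""
    if lam == 0:
        return math.exp(y)
    return (lam * y + 1.0) ** (1.0 / lam)


# Hypothetical skewed runoff values (m^3/s) and an assumed lambda.
runoff = [5.0, 8.0, 12.0, 50.0, 300.0]
lam = 0.2
transformed = [box_cox(q, lam) for q in runoff]
recovered = [inverse_box_cox(y, lam) for y in transformed]
```

    Fitting normal (or multivariate normal) mixtures in the transformed space and back-transforming the resulting quantiles is one common way to obtain predictive distributions for a skewed variable on its original scale.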

  15. Independent Predictors of Prognosis Based on Oral Cavity Squamous Cell Carcinoma Surgical Margins.

    PubMed

    Buchakjian, Marisa R; Ginader, Timothy; Tasche, Kendall K; Pagedar, Nitin A; Smith, Brian J; Sperry, Steven M

    2018-05-01

    Objective: To conduct a multivariate analysis of a large cohort of oral cavity squamous cell carcinoma (OCSCC) cases for independent predictors of local recurrence (LR) and overall survival (OS), with emphasis on the relationship between (1) prognosis and (2) main specimen permanent margins and intraoperative tumor bed frozen margins. Study Design: Retrospective cohort study. Setting: Tertiary academic head and neck cancer program. Subjects and Methods: This study included 426 patients treated with OCSCC resection between 2005 and 2014 at University of Iowa Hospitals and Clinics. Patients underwent excision of OCSCC with intraoperative tumor bed frozen margin sampling and main specimen permanent margin assessment. Multivariate analysis of the data set to predict LR and OS was performed. Results: Independent predictors of LR included nodal involvement, histologic grade, and main specimen permanent margin status. Specifically, the presence of a positive margin (odds ratio, 6.21; 95% CI, 3.3-11.9) or <1-mm/carcinoma in situ margin (odds ratio, 2.41; 95% CI, 1.19-4.87) on the main specimen was an independent predictor of LR, whereas intraoperative tumor bed margins were not predictive of LR on multivariate analysis. Similarly, independent predictors of OS on multivariate analysis included nodal involvement, extracapsular extension, and a positive main specimen margin. Tumor bed margins did not independently predict OS. Conclusion: The main specimen margin is a strong independent predictor of LR and OS on multivariate analysis. Intraoperative tumor bed frozen margins do not independently predict prognosis. We conclude that emphasis should be placed on evaluating the main specimen margins when estimating prognosis after OCSCC resection.

  16. Redefining the Breast Cancer Exosome Proteome by Tandem Mass Tag Quantitative Proteomics and Multivariate Cluster Analysis.

    PubMed

    Clark, David J; Fondrie, William E; Liao, Zhongping; Hanson, Phyllis I; Fulton, Amy; Mao, Li; Yang, Austin J

    2015-10-20

    Exosomes are microvesicles of endocytic origin constitutively released by multiple cell types into the extracellular environment. With evidence that exosomes can be detected in the blood of patients with various malignancies, the development of a platform that uses exosomes as a diagnostic tool has been proposed. However, it has been difficult to truly define the exosome proteome due to the challenge of discerning contaminant proteins that may be identified via mass spectrometry using various exosome enrichment strategies. To better define the exosome proteome in breast cancer, we incorporated a combination of Tandem-Mass-Tag (TMT) quantitative proteomics approach and Support Vector Machine (SVM) cluster analysis of three conditioned media derived fractions corresponding to a 10 000g cellular debris pellet, a 100 000g crude exosome pellet, and an Optiprep enriched exosome pellet. The quantitative analysis identified 2 179 proteins in all three fractions, with known exosomal cargo proteins displaying at least a 2-fold enrichment in the exosome fraction based on the TMT protein ratios. Employing SVM cluster analysis allowed for the classification of 251 proteins as "true" exosomal cargo proteins. This study provides a robust and vigorous framework for the future development of using exosomes as a potential multiprotein marker phenotyping tool that could be useful in breast cancer diagnosis and monitoring disease progression.

  17. Metastatic Spinal Cord Compression from Non-Small-Cell Lung Cancer Treated with Surgery and Adjuvant Therapies: A Retrospective Analysis of Outcomes and Prognostic Factors in 116 Patients.

    PubMed

    Tang, Yu; Qu, Jintao; Wu, Juan; Li, Song; Zhou, Yue; Xiao, Jianru

    2015-09-02

    Metastatic spinal cord compression is a disastrous consequence of non-small-cell lung cancer (NSCLC). There have been few studies of the outcomes or prognostic factors in patients with metastatic spinal cord compression from NSCLC treated with surgery and adjuvant therapies. From 2002 to 2013, 116 patients with metastatic spinal cord compression from NSCLC treated with surgery and adjuvant therapies were enrolled in this retrospective analysis. Kaplan-Meier methods and Cox regression analysis were used to estimate overall survival and identify prognostic factors for survival. Multivariate analysis suggested that the Eastern Cooperative Oncology Group performance status (ECOG-PS), preoperative and postoperative Frankel scores, postoperative adjuvant radiation therapy, and target therapy were independent prognostic factors. Ninety patients died at a median of twelve months (range, three to forty-seven months) postoperatively, and twenty-six patients were still alive at the time of final follow-up (at a median of fifteen months [range, five to fifty-four months]). The complete disappearance of deficits in spinal cord function after surgery was the most robust predictor of survival. Adjuvant radiation therapy and target therapy were also associated with a better prognosis. Prognostic Level IV. See Instructions for Authors for a complete description of levels of evidence. Copyright © 2015 by The Journal of Bone and Joint Surgery, Incorporated.
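
    The Kaplan-Meier estimate of overall survival referred to in this abstract can be sketched in a few lines. The follow-up times and event flags below are invented for illustration and are not taken from the study; the function name `kaplan_meier` is likewise hypothetical.

```python
def kaplan_meier(times, events):
    """Kaplan-Meier survival estimate.

    times  : follow-up time for each patient (e.g. months)
    events : True if death was observed, False if the patient was censored
    Returns a list of (time, survival probability) at each death time.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Group all patients whose follow-up ends at the same time t.
        while i < len(data) and data[i][0] == t:
            if data[i][1]:
                deaths += 1
            leaving += 1
            i += 1
        if deaths:
            survival *= 1.0 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= leaving
    return curve


# Hypothetical follow-up data in months.
months = [3, 5, 5, 8, 12]
died = [True, True, False, True, False]
curve = kaplan_meier(months, died)
```

    Censored patients, such as those still alive at final follow-up, reduce the number at risk without lowering the survival estimate, which is why the method suits cohorts with incomplete follow-up like the one described here.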

  18. A critical analysis of early death after adult liver transplants.

    PubMed

    Rana, Abbas; Kaplan, Bruce; Jie, Tun; Porubsky, Marian; Habib, Shahid; Rilo, Horacio; Gruessner, Angelika C; Gruessner, Rainer W G

    2013-01-01

    The 15% mortality rate of liver transplant recipients at one yr may be viewed as a feat in comparison with the waiting list mortality, yet it nonetheless leaves room for much improvement. Our aim was to critically examine the mortality rates to identify high-risk periods and to incorporate cause of death into the analysis of post-transplant survival. We performed a retrospective analysis on United Network for Organ Sharing data for all adult recipients of liver transplants from January 1, 2002 to October 31, 2011. Our analysis included multivariate logistic regression where the primary outcome measure was patient death of 49,288 recipients. The highest mortality rate by day post-transplant was on day 0 (0.9%). The most significant risk factors were as follows: for one-d mortality from technical failure, intensive care unit admission odds ratio (OR 3.2); for one-d mortality from graft failure, warm ischemia >75 min (OR 5.6); for one-month mortality from infection, a previous transplant (OR 3.3); and for one-month mortality from graft failure, a previous transplant (OR 3.7). We found that the highest mortality rate after liver transplantation is within the first day and the first month post-transplant. Those two high-risk periods have common, as well as different, risk factors for mortality. © 2013 John Wiley & Sons A/S.

  19. Copula Multivariate analysis of Gross primary production and its hydro-environmental driver; A BIOME-BGC model applied to the Antisana páramos

    NASA Astrophysics Data System (ADS)

    Minaya, Veronica; Corzo, Gerald; van der Kwast, Johannes; Galarraga, Remigio; Mynett, Arthur

    2014-05-01

    Simulations of carbon cycling are prone to uncertainties from different sources, which in general are related to input data, parameters and the model representation capacities itself. The gross carbon uptake in the cycle is represented by the gross primary production (GPP), which deals with the spatio-temporal variability of the precipitation and the soil moisture dynamics. This variability associated with uncertainty of the parameters can be modelled by multivariate probabilistic distributions. Our study presents a novel methodology that uses multivariate Copulas analysis to assess the GPP. Multi-species and elevations variables are included in a first scenario of the analysis. Hydro-meteorological conditions that might generate a change in the next 50 or more years are included in a second scenario of this analysis. The biogeochemical model BIOME-BGC was applied in the Ecuadorian Andean region in elevations greater than 4000 masl with the presence of typical vegetation of páramo. The change of GPP over time is crucial for climate scenarios of the carbon cycling in this type of ecosystem. The results help to improve our understanding of the ecosystem function and clarify the dynamics and the relationship with the change of climate variables. Keywords: multivariate analysis, Copula, BIOME-BGC, NPP, páramos

  20. Multivariate analysis of cytokine profiles in pregnancy complications.

    PubMed

    Azizieh, Fawaz; Dingle, Kamaludin; Raghupathy, Raj; Johnson, Kjell; VanderPlas, Jacob; Ansari, Ali

    2018-03-01

    The immunoregulation to tolerate the semiallogeneic fetus during pregnancy includes a harmonious dynamic balance between anti- and pro-inflammatory cytokines. Several earlier studies reported significantly different levels and/or ratios of several cytokines in complicated pregnancy as compared to normal pregnancy. However, as cytokines operate in networks with potentially complex interactions, it is also interesting to compare groups with multi-cytokine data sets, with multivariate analysis. Such analysis will further examine how great the differences are, and which cytokines are more different than others. Various multivariate statistical tools, such as Cramer test, classification and regression trees, partial least squares regression figures, 2-dimensional Kolmogorov-Smirnov test, principal component analysis and gap statistic, were used to compare cytokine data of normal vs anomalous groups of different pregnancy complications. Multivariate analysis assisted in examining if the groups were different, how strongly they differed, in what ways they differed and further reported evidence for subgroups in 1 group (pregnancy-induced hypertension), possibly indicating multiple causes for the complication. This work contributes to a better understanding of cytokines interaction and may have important implications on targeting cytokine balance modulation or design of future medications or interventions that best direct management or prevention from an immunological approach. © 2018 The Authors. American Journal of Reproductive Immunology Published by John Wiley & Sons Ltd.

  1. Characterization of Interfacial Chemistry of Adhesive/Dentin Bond Using FTIR Chemical Imaging With Univariate and Multivariate Data Processing

    PubMed Central

    Wang, Yong; Yao, Xiaomei; Parthasarathy, Ranganathan

    2008-01-01

    Fourier transform infrared (FTIR) chemical imaging can be used to investigate molecular chemical features of the adhesive/dentin interfaces. However, the information is not straightforward, and is not easily extracted. The objective of this study was to use multivariate analysis methods, principal component analysis and fuzzy c-means clustering, to analyze spectral data in comparison with univariate analysis. The spectral imaging data collected from both the adhesive/healthy dentin and adhesive/caries-affected dentin specimens were used and compared. The univariate statistical methods such as mapping of intensities of specific functional group do not always accurately identify functional group locations and concentrations due to more or less band overlapping in adhesive and dentin. Apart from the ease with which information can be extracted, multivariate methods highlight subtle and often important changes in the spectra that are difficult to observe using univariate methods. The results showed that the multivariate methods gave more satisfactory, interpretable results than univariate methods and were conclusive in showing that they can discriminate and classify differences between healthy dentin and caries-affected dentin within the interfacial regions. It is demonstrated that the multivariate FTIR imaging approaches can be used in the rapid characterization of heterogeneous, complex structure. PMID:18980198

  2. Multivariate Analysis of Longitudinal Rates of Change

    PubMed Central

    Bryan, Matthew; Heagerty, Patrick J.

    2016-01-01

    Longitudinal data allow direct comparison of the change in patient outcomes associated with treatment or exposure. Frequently, several longitudinal measures are collected that either reflect a common underlying health status, or characterize processes that are influenced in a similar way by covariates such as exposure or demographic characteristics. Statistical methods that can combine multivariate response variables into common measures of covariate effects have been proposed by Roy and Lin [1]; Proust-Lima, Letenneur and Jacqmin-Gadda [2]; and Gray and Brookmeyer [3] among others. Current methods for characterizing the relationship between covariates and the rate of change in multivariate outcomes are limited to select models. For example, Gray and Brookmeyer [3] introduce an “accelerated time” method which assumes that covariates rescale time in longitudinal models for disease progression. In this manuscript we detail an alternative multivariate model formulation that directly structures longitudinal rates of change, and that permits a common covariate effect across multiple outcomes. We detail maximum likelihood estimation for a multivariate longitudinal mixed model. We show via asymptotic calculations the potential gain in power that may be achieved with a common analysis of multiple outcomes. We apply the proposed methods to the analysis of a trivariate outcome for infant growth and compare rates of change for HIV infected and uninfected infants. PMID:27417129

  3. Additive genetic variation and evolvability of a multivariate trait can be increased by epistatic gene action.

    PubMed

    Griswold, Cortland K

    2015-12-21

    Epistatic gene action occurs when mutations or alleles interact to produce a phenotype. Theoretically and empirically it is of interest to know whether gene interactions can facilitate the evolution of diversity. In this paper, we explore how epistatic gene action affects the additive genetic component or heritable component of multivariate trait variation, as well as how epistatic gene action affects the evolvability of multivariate traits. The analysis involves a sexually reproducing and recombining population. Our results indicate that under stabilizing selection conditions a population with a mixed additive and epistatic genetic architecture can have greater multivariate additive genetic variation and evolvability than a population with a purely additive genetic architecture. That greater multivariate additive genetic variation can occur with epistasis is in contrast to previous theory that indicated univariate additive genetic variation is decreased with epistasis under stabilizing selection conditions. In a multivariate setting, epistasis leads to less relative covariance among individuals in their genotypic, as well as their breeding values, which facilitates the maintenance of additive genetic variation and increases a population's evolvability. Our analysis involves linking the combinatorial nature of epistatic genetic effects to the ancestral graph structure of a population to provide insight into the consequences of epistasis on multivariate trait variation and evolution. Copyright © 2015 Elsevier Ltd. All rights reserved.

  4. Exploring the Structure of Library and Information Science Web Space Based on Multivariate Analysis of Social Tags

    ERIC Educational Resources Information Center

    Joo, Soohyung; Kipp, Margaret E. I.

    2015-01-01

    Introduction: This study examines the structure of Web space in the field of library and information science using multivariate analysis of social tags from the Website, Delicious.com. A few studies have examined mathematical modelling of tags, mainly examining tagging in terms of tripartite graphs, pattern tracing and descriptive statistics. This…

  5. Multivariate Analysis of High Through-Put Adhesively Bonded Single Lap Joints: Experimental and Workflow Protocols

    DTIC Science & Technology

    2016-06-01

    ...unlimited. List of Tables: Table 1, Single-lap-joint experimental parameters; Table 2, Survey ...Joints: Experimental and Workflow Protocols, by Robert E Jensen, Daniel C DeSchepper, and David P Flanagan. Approved for...TR-7696, June 2016, US Army Research Laboratory. Multivariate Analysis of High Through-Put Adhesively Bonded Single Lap Joints: Experimental...

  6. A Multivariate Model for the Meta-Analysis of Study Level Survival Data at Multiple Times

    ERIC Educational Resources Information Center

    Jackson, Dan; Rollins, Katie; Coughlin, Patrick

    2014-01-01

    Motivated by our meta-analytic dataset involving survival rates after treatment for critical leg ischemia, we develop and apply a new multivariate model for the meta-analysis of study level survival data at multiple times. Our data set involves 50 studies that provide mortality rates at up to seven time points, which we model simultaneously, and…

  7. Atomic-scale phase composition through multivariate statistical analysis of atom probe tomography data.

    PubMed

    Keenan, Michael R; Smentkowski, Vincent S; Ulfig, Robert M; Oltman, Edward; Larson, David J; Kelly, Thomas F

    2011-06-01

    We demonstrate for the first time that multivariate statistical analysis techniques can be applied to atom probe tomography data to estimate the chemical composition of a sample at the full spatial resolution of the atom probe in three dimensions. Whereas the raw atom probe data provide the specific identity of an atom at a precise location, the multivariate results can be interpreted in terms of the probabilities that an atom representing a particular chemical phase is situated there. When aggregated to the size scale of a single atom (∼0.2 nm), atom probe spectral-image datasets are huge and extremely sparse. In fact, the average spectrum will have somewhat less than one total count per spectrum due to imperfect detection efficiency. These conditions, under which the variance in the data is completely dominated by counting noise, test the limits of multivariate analysis, and an extensive discussion of how to extract the chemical information is presented. Efficient numerical approaches to performing principal component analysis (PCA) on these datasets, which may number hundreds of millions of individual spectra, are put forward, and it is shown that PCA can be computed in a few seconds on a typical laptop computer.

  8. Risk factors for incidental durotomy during lumbar surgery: a retrospective study by multivariate analysis.

    PubMed

    Chen, Zhixiang; Shao, Peng; Sun, Qizhao; Zhao, Dong

    2015-03-01

    The purpose of the present study was to use prospectively collected data to evaluate the rate of incidental durotomy (ID) during lumbar surgery and determine the associated risk factors by using univariate and multivariate analysis. We retrospectively reviewed 2184 patients who underwent lumbar surgery from January 1, 2009 to December 31, 2011 at a single hospital. Patients with ID (n=97) were compared with the patients without ID (n=2019). The influences of several potential risk factors that might affect the occurrence of ID were assessed using univariate and multivariate analyses. The overall incidence of ID was 4.62%. Univariate analysis demonstrated that older age, diabetes, lumbar central stenosis, posterior approach, revision surgery, prior lumbar surgery and minimally invasive surgery are risk factors for ID during lumbar surgery. However, multivariate analysis identified only older age, prior lumbar surgery, revision surgery, and minimally invasive surgery as independent risk factors. These findings may guide clinicians making future surgical decisions regarding ID and aid in the patient counseling process to alleviate risks and complications. Copyright © 2015 Elsevier B.V. All rights reserved.

  9. Linear models of coregionalization for multivariate lattice data: Order-dependent and order-free cMCARs.

    PubMed

    MacNab, Ying C

    2016-08-01

    This paper is concerned with multivariate conditional autoregressive models defined by linear combinations of independent or correlated underlying spatial processes. Known as linear models of coregionalization, the method offers a systematic and unified approach for formulating multivariate extensions to a broad range of univariate conditional autoregressive models. The resulting multivariate spatial models represent classes of coregionalized multivariate conditional autoregressive models that enable flexible modelling of multivariate spatial interactions, yielding coregionalization models with symmetric or asymmetric cross-covariances of different spatial variation and smoothness. In the context of multivariate disease mapping, for example, they facilitate borrowing strength both over space and across variables, allowing for more flexible multivariate spatial smoothing. Specifically, we present a broadened coregionalization framework to include order-dependent, order-free, and order-robust multivariate models; a new class of order-free coregionalized multivariate conditional autoregressives is introduced. We tackle computational challenges and present solutions that are integral for Bayesian analysis of these models. We also discuss two ways of computing the deviance information criterion for comparison among competing hierarchical models with or without unidentifiable prior parameters. The models and related methodology are developed in the broad context of modelling multivariate data on a spatial lattice and illustrated in the context of multivariate disease mapping. The coregionalization framework and related methods also present a general approach for building spatially structured cross-covariance functions for multivariate geostatistics. © The Author(s) 2016.

  10. Multivariate reference technique for quantitative analysis of fiber-optic tissue Raman spectroscopy.

    PubMed

    Bergholt, Mads Sylvest; Duraipandian, Shiyamala; Zheng, Wei; Huang, Zhiwei

    2013-12-03

    We report a novel method making use of multivariate reference signals of fused silica and sapphire Raman signals generated from a ball-lens fiber-optic Raman probe for quantitative analysis of in vivo tissue Raman measurements in real time. Partial least-squares (PLS) regression modeling is applied to extract the characteristic internal reference Raman signals (e.g., shoulder of the prominent fused silica boson peak (~130 cm(-1)); distinct sapphire ball-lens peaks (380, 417, 646, and 751 cm(-1))) from the ball-lens fiber-optic Raman probe for quantitative analysis of fiber-optic Raman spectroscopy. To evaluate the analytical value of this novel multivariate reference technique, a rapid Raman spectroscopy system coupled with a ball-lens fiber-optic Raman probe is used for in vivo oral tissue Raman measurements (n = 25 subjects) under 785 nm laser excitation powers ranging from 5 to 65 mW. An accurate linear relationship (R(2) = 0.981) with a root-mean-square error of cross validation (RMSECV) of 2.5 mW can be obtained for predicting the laser excitation power changes based on a leave-one-subject-out cross-validation, which is superior to the normal univariate reference method (RMSE = 6.2 mW). A root-mean-square error of prediction (RMSEP) of 2.4 mW (R(2) = 0.985) can also be achieved for laser power prediction in real time when we applied the multivariate method independently on the five new subjects (n = 166 spectra). We further apply the multivariate reference technique for quantitative analysis of gelatin tissue phantoms that gives rise to an RMSEP of ~2.0% (R(2) = 0.998) independent of laser excitation power variations. 
    This work demonstrates that the multivariate reference technique can be advantageously used to monitor and correct the variations of laser excitation power and fiber coupling efficiency in situ for standardizing the tissue Raman intensity to realize quantitative analysis of tissue Raman measurements in vivo, which is particularly appealing in challenging Raman endoscopic applications.
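
    The leave-one-subject-out cross-validation and the RMSECV figure quoted in this abstract can be illustrated with a minimal sketch. It assumes a single predictor and an ordinary least-squares line as a stand-in for the PLS model used in the study; the function names (`fit_line`, `loso_rmsecv`) and the data are hypothetical.

```python
import math

def fit_line(xs, ys):
    """Ordinary least-squares fit y = a + b*x (a stand-in for PLS; an assumption)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b

def loso_rmsecv(subjects, xs, ys):
    """Hold out all samples of one subject at a time, fit on the rest,
    and pool the squared prediction errors into a single RMSE."""
    sq_errors = []
    for held_out in sorted(set(subjects)):
        train = [i for i, s in enumerate(subjects) if s != held_out]
        test = [i for i, s in enumerate(subjects) if s == held_out]
        a, b = fit_line([xs[i] for i in train], [ys[i] for i in train])
        sq_errors.extend((ys[i] - (a + b * xs[i])) ** 2 for i in test)
    return math.sqrt(sum(sq_errors) / len(sq_errors))

# Hypothetical data: reference-peak intensity (x) versus laser power in mW (y).
subjects = ["s1", "s1", "s2", "s2", "s3", "s3"]
intensity = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
power_mw = [10.0, 20.0, 30.0, 40.0, 50.0, 60.0]
rmsecv = loso_rmsecv(subjects, intensity, power_mw)
```

    Splitting by subject rather than by spectrum keeps spectra from the same person out of both the training and test folds, which is the point of the leave-one-subject-out design.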

  11. Causal diagrams and multivariate analysis II: precision work.

    PubMed

    Jupiter, Daniel C

    2014-01-01

    In this Investigators' Corner, I continue my discussion of when and why we researchers should include variables in multivariate regression. My examination focuses on studies comparing treatment groups and situations for which we can either exclude variables from multivariate analyses or include them for reasons of precision. Copyright © 2014 American College of Foot and Ankle Surgeons. Published by Elsevier Inc. All rights reserved.

  12. Multiscale Characterization of PM2.5 in Southern Taiwan based on Noise-assisted Multivariate Empirical Mode Decomposition and Time-dependent Intrinsic Correlation

    NASA Astrophysics Data System (ADS)

    Hsiao, Y. R.; Tsai, C.

    2017-12-01

    As the WHO Air Quality Guideline indicates, ambient air pollution puts populations worldwide at risk of serious and potentially fatal conditions (e.g. heart disease, lung cancer, asthma), raising concerns about air pollution sources and related factors. This study presents a novel approach to investigating the multiscale variations of PM2.5 in southern Taiwan over the past decade, together with four meteorological influencing factors (temperature, relative humidity, precipitation and wind speed), based on the Noise-assisted Multivariate Empirical Mode Decomposition (NAMEMD) algorithm, Hilbert Spectral Analysis (HSA) and the Time-dependent Intrinsic Correlation (TDIC) method. The NAMEMD algorithm is a fully data-driven approach designed for nonlinear and nonstationary multivariate signals, and is used to decompose multivariate signals into a collection of channels of Intrinsic Mode Functions (IMFs). The TDIC method is an EMD-based method using a set of sliding window sizes to quantify localized correlation coefficients for multiscale signals. With the alignment property and quasi-dyadic filter bank of the NAMEMD algorithm, one is able to produce the same number of IMFs for all variables and estimate the cross correlation more accurately. The spectral representation of the NAMEMD-HSA method is compared with that of Complementary Ensemble Empirical Mode Decomposition/Hilbert Spectral Analysis (CEEMD-HSA) and Wavelet Analysis. The NAMEMD-based TDIC analysis is then compared with CEEMD-based TDIC analysis and traditional correlation analysis.
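
    The idea of a localized, window-based correlation coefficient mentioned in this abstract can be sketched as below. This is a deliberate simplification: it uses one fixed window length on raw series, whereas the abstract describes TDIC as using a set of sliding window sizes on IMFs. The function names and the data are hypothetical.

```python
import math

def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences.
    Raises ZeroDivisionError if either sequence is constant."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def sliding_correlation(xs, ys, window):
    """Correlation inside each length-`window` segment, stepping by one sample."""
    return [
        pearson(xs[i:i + window], ys[i:i + window])
        for i in range(len(xs) - window + 1)
    ]

# Hypothetical series: the relationship flips sign partway through.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [2.0, 4.0, 6.0, 5.0, 4.0, 3.0]
local_r = sliding_correlation(x, y, window=3)
```

    A single global coefficient would average over the sign change in this toy series, which is the kind of time-varying behaviour a localized correlation is meant to expose.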

  13. Analysis/forecast experiments with a multivariate statistical analysis scheme using FGGE data

    NASA Technical Reports Server (NTRS)

    Baker, W. E.; Bloom, S. C.; Nestler, M. S.

    1985-01-01

    A three-dimensional, multivariate, statistical analysis method, optimal interpolation (OI) is described for modeling meteorological data from widely dispersed sites. The model was developed to analyze FGGE data at the NASA-Goddard Laboratory of Atmospherics. The model features a multivariate surface analysis over the oceans, including maintenance of the Ekman balance and a geographically dependent correlation function. Preliminary comparisons are made between the OI model and similar schemes employed at the European Center for Medium Range Weather Forecasts and the National Meteorological Center. The OI scheme is used to provide input to a GCM, and model error correlations are calculated for forecasts of 500 mb vertical water mixing ratios and the wind profiles. Comparisons are made between the predictions and measured data. The model is shown to be as accurate as a successive corrections model out to 4.5 days.
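
    The core update of optimal interpolation can be shown in its simplest scalar form. This is a sketch under strong assumptions: one variable, one observation located at the analysis point, and known error variances. The scheme in the abstract is three-dimensional and multivariate; the function name and the numbers are hypothetical.

```python
def oi_update(background, obs, var_background, var_obs):
    """Scalar optimal-interpolation analysis: blend a background (first-guess)
    value with an observation, weighted by their error variances."""
    gain = var_background / (var_background + var_obs)
    analysis = background + gain * (obs - background)
    var_analysis = (1.0 - gain) * var_background
    return analysis, var_analysis

# Hypothetical values: a 280.0 K first guess and a 284.0 K observation
# with equal error variances, so the analysis lands halfway between them.
analysis, var_analysis = oi_update(
    background=280.0, obs=284.0, var_background=4.0, var_obs=4.0
)
```

    In the full multivariate case the scalar gain becomes a matrix built from the background and observation error covariances, which is where a geographically dependent correlation function enters.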

  14. The maternal and neonatal outcomes for an urban Indigenous population compared with their non-Indigenous counterparts and a trend analysis over four triennia.

    PubMed

    Kildea, Sue; Stapleton, Helen; Murphy, Rebecca; Kosiak, Machellee; Gibbons, Kristen

    2013-08-30

    Indigenous Australians experience significantly disproportionate poorer health outcomes compared to their non-Indigenous counterparts. Despite the recognised importance of maternal infant health (MIH), there is surprisingly little empirical research to guide service redesign that successfully addresses the disparities. This paper reports on a service evaluation that also compared key MIH indicators for Indigenous and non-Indigenous mothers and babies over a 12-year period 1998-2009. Trend analysis with logistic regression, using the independent variables of ethnicity and triennia, explored changes over time (1998-2009) between two cohorts: 1,523 births to Indigenous mothers and 43,693 births to non-Indigenous mothers. We included bivariate and multivariate analysis on key indicators (e.g. teenage births, preterm birth, low birth weight, smoking) and report odds ratios (ORs), 95% CIs and logistic regression adjusting for important confounders. We excluded transfers in from other areas which are identified within the database. Bivariate analysis revealed Indigenous women were statistically more likely to have spontaneous onset of labour and a non-instrumental vaginal birth. They were less likely to take epidurals for pain relief in labour, have assisted births, caesarean sections or perineal trauma. Despite better labour outcomes, Indigenous babies were more likely to be born preterm (< 37 weeks) and be low birth weight (< 2500 g); these differences remained significant in multivariate analysis. The trend analysis revealed relatively stable rates for teenage pregnancy, small for gestational age, low birth weight babies, and perinatal mortality for both cohorts, with the gap between cohorts consistent over time. A statistical widening of the gap in preterm birth and smoking rates was found with preterm birth demonstrating a relative increase of 51% over this period. 
    The comprehensive database from a large urban hospital allowed a thorough examination of outcomes and contributing factors. The gap between the two cohorts remained static in several areas but in some cases worsened. Alternative models for delivering care to Indigenous women and their babies have shown improved outcomes, including preterm birth, though not all have been sustained over time and none are available Australia-wide. New models of care, which recognise the heterogeneity of Indigenous communities, incorporate a multiagency approach, and are set within a research framework, are urgently needed.

  15. Spectral compression algorithms for the analysis of very large multivariate images

    DOEpatents

    Keenan, Michael R.

    2007-10-16

    A method for spectrally compressing data sets enables the efficient analysis of very large multivariate images. The spectral compression algorithm uses a factored representation of the data that can be obtained from Principal Components Analysis or other factorization technique. Furthermore, a block algorithm can be used for performing common operations more efficiently. An image analysis can be performed on the factored representation of the data, using only the most significant factors. The spectral compression algorithm can be combined with a spatial compression algorithm to provide further computational efficiencies.

  16. Radiation Therapy Versus No Radiation Therapy to the Neo-breast Following Skin-Sparing Mastectomy and Immediate Autologous Free Flap Reconstruction for Breast Cancer: Patient-Reported and Surgical Outcomes at 1 Year-A Mastectomy Reconstruction Outcomes Consortium (MROC) Substudy.

    PubMed

    Cooke, Andrew L; Diaz-Abele, Julian; Hayakawa, Tom; Buchel, Ed; Dalke, Kimberly; Lambert, Pascal

    2017-09-01

    To determine whether adjuvant radiation therapy (RT) is associated with adverse patient-reported outcomes and surgical complications 1 year after skin-sparing mastectomy and immediate autologous free flap reconstruction for breast cancer. We compared 24 domains of patient-reported outcome measures 1 year after autologous reconstruction between patients who received adjuvant RT and those who did not. A total of 125 patients who underwent surgery between 2012 and 2015 at our institution were included from the Mastectomy Reconstruction Outcomes Consortium study database. Adjusted multivariate models were created incorporating RT technical data, age, cancer stage, estrogen receptor, chemotherapy, breast size, body mass index, and income to determine whether RT was associated with outcomes. At 1 year after surgery, European Organisation for Research and Treatment of Cancer (EORTC) Breast Cancer-Specific Quality of Life Questionnaire breast symptoms were significantly greater in 64 patients who received RT (8-point difference on 100-point ordinal scale, P<.0001) versus 61 who did not receive RT in univariate and multivariate models. EORTC arm symptoms (20-point difference on 100-point ordinal scale, P=.0200) differed on univariate analysis but not on multivariate analysis. All other outcomes-including Numerical Pain Rating Scale, BREAST-Q (Post-operative Reconstruction Module), Patient-Report Outcomes Measurement Information System Profile 29, McGill Pain Questionnaire-Short Form (MPQ-SF) score, Generalized Anxiety Disorder Scale, and Patient Health Questionnaire-were not statistically different between groups. Surgical complications were uncommon and did not differ by treatment. RT to the neo-breast compared with no RT following immediate autologous free flap reconstruction for breast cancer is well tolerated at 1 year following surgery despite patients undergoing RT also having a higher cancer stage and more intensive surgical and systemic treatment. 
    Neo-breast symptoms are more common in patients receiving RT by the EORTC Breast Cancer-Specific Quality of Life Questionnaire but not by the BREAST-Q. Patient-reported results at 1 year after surgery suggest RT following immediate autologous free flap breast reconstruction is well tolerated. Copyright © 2017 Elsevier Inc. All rights reserved.

  17. Esophageal cancer detection based on tissue surface-enhanced Raman spectroscopy and multivariate analysis

    NASA Astrophysics Data System (ADS)

    Feng, Shangyuan; Lin, Juqiang; Huang, Zufang; Chen, Guannan; Chen, Weisheng; Wang, Yue; Chen, Rong; Zeng, Haishan

    2013-01-01

    The capability of using silver nanoparticle based near-infrared surface enhanced Raman scattering (SERS) spectroscopy combined with principal component analysis (PCA) and linear discriminant analysis (LDA) to differentiate esophageal cancer tissue from normal tissue was presented. Significant differences in Raman intensities of prominent SERS bands were observed between normal and cancer tissues. PCA-LDA multivariate analysis of the measured tissue SERS spectra achieved diagnostic sensitivity of 90.9% and specificity of 97.8%. This exploratory study demonstrated great potential for developing label-free tissue SERS analysis into a clinical tool for esophageal cancer detection.
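
    Diagnostic sensitivity and specificity, as reported in this abstract, are computed from a classifier's predictions against the true tissue labels. The sketch below shows that calculation only. The labels are hypothetical and do not reproduce the study's data, and the classifier itself (PCA-LDA in the study) is not implemented here.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels, 1 = cancer, 0 = normal."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    sensitivity = tp / (tp + fn)  # fraction of cancer samples correctly flagged
    specificity = tn / (tn + fp)  # fraction of normal samples correctly cleared
    return sensitivity, specificity

# Hypothetical labels: 4 cancer and 5 normal tissue spectra.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1]
sens, spec = sensitivity_specificity(y_true, y_pred)
```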

  18. New multivariable capabilities of the INCA program

    NASA Technical Reports Server (NTRS)

    Bauer, Frank H.; Downing, John P.; Thorpe, Christopher J.

    1989-01-01

    The INteractive Controls Analysis (INCA) program was developed at NASA's Goddard Space Flight Center to provide a user-friendly, efficient environment for the design and analysis of control systems, specifically spacecraft control systems. Since its inception, INCA has found extensive use in the design, development, and analysis of control systems for spacecraft, instruments, robotics, and pointing systems. The INCA program was initially developed as a comprehensive classical design analysis tool for small and large order control systems. The latest version of INCA, expected to be released in February of 1990, was expanded to include the capability to perform multivariable controls analysis and design.

  19. A multivariate fall risk assessment model for VHA nursing homes using the minimum data set.

    PubMed

    French, Dustin D; Werner, Dennis C; Campbell, Robert R; Powell-Cope, Gail M; Nelson, Audrey L; Rubenstein, Laurence Z; Bulat, Tatjana; Spehar, Andrea M

    2007-02-01

    The purpose of this study was to develop a multivariate fall risk assessment model beyond the current fall Resident Assessment Protocol (RAP) triggers for nursing home residents using the Minimum Data Set (MDS). Retrospective, clustered secondary data analysis. National Veterans Health Administration (VHA) long-term care nursing homes (N = 136). The study population consisted of 6577 national VHA nursing home residents who had an annual assessment during FY 2005, identified from the MDS, as well as an earlier annual or admission assessment within a 1-year look-back period. A dichotomous multivariate model of nursing home residents coded with a fall on selected fall risk characteristics from the MDS, estimated with generalized estimating equations (GEE). There were 17 170 assessments corresponding to 6577 long-term care nursing home residents. The increased odds ratio (OR) of being classified as a faller relative to the omitted "dependent" category of activities of daily living (ADL) ranged from OR = 1.35 for "limited" ADL category up to OR = 1.57 for "extensive-2" ADL (P < .0001). Unsteady gait more than doubles the odds of being a faller (OR = 2.63, P < .0001). The use of assistive devices such as canes, walkers, or crutches, or the use of wheelchairs increases the odds of being a faller (OR = 1.17, P < .0005) or (OR = 1.19, P < .0002), respectively. Foot problems may also increase the odds of being a faller (OR = 1.26, P < .0016). Alzheimer's or other dementias also increase the odds of being classified as a faller (OR = 1.18, P < .0219) or (OR = 1.22, P < .0001), respectively. In addition, anger (OR = 1.19, P < .0065); wandering (OR = 1.53, P < .0001); or use of antipsychotic medications (OR = 1.15, P < .0039), antianxiety medications (OR = 1.13, P < .0323), or antidepressant medications (OR = 1.39, P < .0001) was also associated with the odds of being a faller.
    This national study in one of the largest managed healthcare systems in the United States has empirically confirmed the relative importance of certain risk factors for falls in long-term care settings. The model incorporated an ADL index and adjusted for case mix by including only long-term care nursing home residents. The study offers clinicians practical estimates by combining multiple univariate MDS elements in an empirically based, multivariate fall risk assessment model.
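
    Odds ratios such as those listed in this abstract are the exponentials of the coefficients of a logistic-type model. The sketch below shows that conversion along with a Wald-type 95% interval. The coefficient is back-calculated from the reported OR of 2.63 for unsteady gait; the standard error is hypothetical because the abstract does not report one, and the function name is illustrative.

```python
import math

def odds_ratio(beta, se, z=1.96):
    """Convert a log-odds coefficient and its standard error into an
    odds ratio with a Wald-type confidence interval (95% when z = 1.96)."""
    return math.exp(beta), math.exp(beta - z * se), math.exp(beta + z * se)

# Coefficient implied by the reported OR = 2.63; se = 0.05 is a made-up value.
beta_unsteady_gait = math.log(2.63)
or_point, or_low, or_high = odds_ratio(beta_unsteady_gait, se=0.05)
```

    An OR above 1 corresponds to a positive coefficient, so "more than doubles the odds" means a coefficient larger than ln 2, roughly 0.69.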

  20. Pretreatment Endorectal Coil Magnetic Resonance Imaging Findings Predict Biochemical Tumor Control in Prostate Cancer Patients Treated With Combination Brachytherapy and External-Beam Radiotherapy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Riaz, Nadeem; Afaq, Asim; Akin, Oguz

    Purpose: To investigate the utility of endorectal coil magnetic resonance imaging (eMRI) in predicting biochemical relapse in prostate cancer patients treated with combination brachytherapy and external-beam radiotherapy. Methods and Materials: Between 2000 and 2008, 279 men with intermediate- or high-risk prostate cancer underwent eMRI of their prostate before receiving brachytherapy and supplemental intensity-modulated radiotherapy. Endorectal coil MRI was performed before treatment and retrospectively reviewed by two radiologists experienced in genitourinary MRI. Image-based variables, including tumor diameter, location, number of sextants involved, and the presence of extracapsular extension (ECE), were incorporated with other established clinical variables to predict biochemical control outcomes. The median follow-up was 49 months (range, 1-13 years). Results: The 5-year biochemical relapse-free survival for the cohort was 92%. Clinical findings predicting recurrence on univariate analysis included Gleason score (hazard ratio [HR] 3.6, p = 0.001), PSA (HR 1.04, p = 0.005), and National Comprehensive Cancer Network risk group (HR 4.1, p = 0.002). Clinical T stage and the use of androgen deprivation therapy were not correlated with biochemical failure. Imaging findings on univariate analysis associated with relapse included ECE on MRI (HR 3.79, p = 0.003), tumor size (HR 2.58, p = 0.04), and T stage (HR 1.71, p = 0.004). On multivariate analysis incorporating both clinical and imaging findings, only ECE on MRI and Gleason score were independent predictors of recurrence. Conclusions: Pretreatment eMRI findings predict biochemical recurrence in intermediate- and high-risk prostate cancer patients treated with combination brachytherapy and external-beam radiotherapy. Gleason score and the presence of ECE on MRI were the only significant predictors of biochemical relapse in this group of patients.

  1. Size-adjusted Quantitative Gleason Score as a Predictor of Biochemical Recurrence after Radical Prostatectomy.

    PubMed

    Deng, Fang-Ming; Donin, Nicholas M; Pe Benito, Ruth; Melamed, Jonathan; Le Nobin, Julien; Zhou, Ming; Ma, Sisi; Wang, Jinhua; Lepor, Herbert

    2016-08-01

    The risk of biochemical recurrence (BCR) following radical prostatectomy for pathologic Gleason 7 prostate cancer varies according to the proportion of Gleason 4 component. We sought to explore the value of several novel quantitative metrics of Gleason 4 disease for the prediction of BCR in men with Gleason 7 disease. We analyzed a cohort of 2630 radical prostatectomy cases from 1990-2007. All pathologic Gleason 7 cases were identified and assessed for quantity of Gleason pattern 4. Three methods were used to quantify the extent of Gleason 4: a quantitative Gleason score (qGS) based on the proportion of tumor composed of Gleason pattern 4, a size-weighted score (swGS) incorporating the overall quantity of Gleason 4, and a size index (siGS) incorporating the quantity of Gleason 4 based on the index lesion. Associations between the above metrics and BCR were evaluated using Cox proportional hazards regression analysis. qGS, swGS, and siGS were significantly associated with BCR on multivariate analysis when adjusted for traditional Gleason score, age, prostate specific antigen, surgical margin, and stage. Using Harrell's c-index to compare the scoring systems, qGS (0.83), swGS (0.84), and siGS (0.84) all performed better than the traditional Gleason score (0.82). Quantitative measures of Gleason pattern 4 predict BCR better than the traditional Gleason score. In men with Gleason 7 prostate cancer, quantitative analysis of the proportion of Gleason pattern 4 (quantitative Gleason score), as well as size-weighted measurement of Gleason 4 (size-weighted Gleason score), and a size-weighted measurement of Gleason 4 based on the largest tumor nodule significantly improve the predicted risk of biochemical recurrence compared with the traditional Gleason score. Copyright © 2015 European Association of Urology. Published by Elsevier B.V. All rights reserved.
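
    Harrell's c-index, used in this abstract to compare the scoring systems, measures how often a model ranks a pair of patients in the right order. The sketch below is a minimal quadratic-time version that assumes right-censored data, ignores tied event times, and counts tied risk scores as half-concordant. The data are hypothetical and unrelated to the study cohort.

```python
def harrell_c(times, events, risks):
    """Harrell's concordance index.
    times  : follow-up time per subject
    events : 1 if the event (e.g. recurrence) was observed, 0 if censored
    risks  : model risk score, higher meaning earlier expected event
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            # A pair is comparable when subject i had an observed event
            # strictly before subject j's follow-up time.
            if events[i] and times[i] < times[j]:
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1.0
                elif risks[i] == risks[j]:
                    concordant += 0.5
    return concordant / comparable

# Hypothetical cohort of four patients; the third is censored at t = 15.
times = [5, 10, 15, 20]
events = [1, 1, 0, 1]
risks = [0.9, 0.4, 0.6, 0.1]
c_index = harrell_c(times, events, risks)
```

    A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which puts the reported gap between 0.82 and 0.84 in context.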

  2. Testing Mean Differences among Groups: Multivariate and Repeated Measures Analysis with Minimal Assumptions

    PubMed Central

    Bathke, Arne C.; Friedrich, Sarah; Pauly, Markus; Konietschke, Frank; Staffen, Wolfgang; Strobl, Nicolas; Höller, Yvonne

    2018-01-01

    To date, there is a lack of satisfactory inferential techniques for the analysis of multivariate data in factorial designs, when only minimal assumptions on the data can be made. Presently available methods are limited to very particular study designs or assume either multivariate normality or equal covariance matrices across groups, or they do not allow for an assessment of the interaction effects across within-subjects and between-subjects variables. We propose and methodologically validate a parametric bootstrap approach that does not suffer from any of the above limitations, and thus provides a rather general and comprehensive methodological route to inference for multivariate and repeated measures data. As an example application, we consider data from two different Alzheimer’s disease (AD) examination modalities that may be used for precise and early diagnosis, namely, single-photon emission computed tomography (SPECT) and electroencephalogram (EEG). These data violate the assumptions of classical multivariate methods, and indeed classical methods would not have yielded the same conclusions with regard to some of the factors involved. PMID:29565679

  3. Development of multivariate exposure and fatal accident involvement rates for 1977

    DOT National Transportation Integrated Search

    1985-10-01

    The need for multivariate accident involvement rates is often encountered in accident analysis. The FARS (Fatal Accident Reporting System) files contain records of fatal involvements characterized by many variables while NPTS (National Personal T...

  4. A Novel Approach to Detect Accelerated Aged and Surface-Mediated Degradation in Explosives by UPLC-ESI-MS.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Beppler, Christina L

    2015-12-01

    A new approach was created for studying energetic material degradation. This approach involved detecting and tentatively identifying, by liquid chromatography-mass spectrometry (LC-MS) with multivariate statistical data analysis, the non-volatile chemical species that form as the CL-20 energetic material thermally degrades. Multivariate data analysis showed clear separation and clustering of samples based on sample group: either pristine or aged material. Further analysis showed counter-clockwise trends in the scores plots of principal components analysis (PCA), a type of multivariate data analysis. These trends may indicate that there was a discrete shift in the chemical markers as the samples went from pristine to aged material, and then again when the aged CL-20 mixed with a potentially incompatible material was thermally aged for 4, 6, or 9 months. This new approach to studying energetic material degradation should provide greater knowledge of potential degradation markers in these materials.

  5. Complex numbers in chemometrics: examples from multivariate impedance measurements on lipid monolayers.

    PubMed

    Geladi, Paul; Nelson, Andrew; Lindholm-Sethson, Britta

    2007-07-09

    Electrical impedance gives multivariate complex number data as results. Two examples of multivariate electrical impedance data measured on lipid monolayers in different solutions give rise to matrices (16x50 and 38x50) of complex numbers. Multivariate data analysis by principal component analysis (PCA) or singular value decomposition (SVD) can be used for complex data and the necessary equations are given. The scores and loadings obtained are vectors of complex numbers. It is shown that the complex number PCA and SVD are better at concentrating information in a few components than the naïve juxtaposition method and that Argand diagrams can replace score and loading plots. Different concentrations of Magainin and Gramicidin A give different responses and also the role of the electrolyte medium can be studied. An interaction of Gramicidin A in the solution with the monolayer over time can be observed.

  6. A Multivariate Methodological Workflow for the Analysis of FTIR Chemical Mapping Applied on Historic Paint Stratigraphies

    PubMed Central

    Sciutto, Giorgia; Oliveri, Paolo; Catelli, Emilio; Bonacini, Irene

    2017-01-01

    In applied heritage science research, the use of multivariate approaches is still quite limited, and the chemometric results obtained are often underinterpreted. Within this scenario, the present paper aims to disseminate the use of suitable multivariate methodologies and proposes a procedural workflow, applied to a representative group of case studies of considerable importance for conservation purposes, as a sort of guideline for processing and interpreting FTIR data. Initially, principal component analysis (PCA) is performed and the score values are converted into chemical maps. Subsequently, the brushing approach is applied, demonstrating its usefulness for a deep understanding of the relationships between the multivariate map and PC score space, as well as for the identification of the spectral bands mainly involved in the definition of each area localised within the score maps. PMID:29333162

  7. Risk Factors for Central Serous Chorioretinopathy: Multivariate Approach in a Case-Control Study.

    PubMed

    Chatziralli, Irini; Kabanarou, Stamatina A; Parikakis, Efstratios; Chatzirallis, Alexandros; Xirou, Tina; Mitropoulos, Panagiotis

    2017-07-01

    The purpose of this prospective study was to investigate the potential risk factors independently associated with central serous retinopathy (CSR) in a Greek population, using a multivariate approach. Participants in the study were 183 consecutive patients diagnosed with CSR and 183 controls, matched for age. All participants underwent complete ophthalmological examination, and information regarding their sociodemographic, clinical, medical and ophthalmological history was recorded, so as to assess potential risk factors for CSR. Univariate and multivariate analyses were performed. Univariate analysis showed that male sex, high educational status, high income, alcohol consumption, smoking, hypertension, coronary heart disease, obstructive sleep apnea, autoimmune disorders, H. pylori infection, type A personality and stress, steroid use, pregnancy and hyperopia were associated with CSR, while myopia was found to protect from CSR. In multivariate analysis, alcohol consumption, hypertension, coronary heart disease and autoimmune disorders lost their significance, while the remaining factors were all independently associated with CSR. It is important to take into account the various risk factors for CSR, so as to define vulnerable groups and to shed light on the pathogenesis of the disease.

  8. Social Cognitive and Planned Behavior Variables Associated with Stages of Change for Physical Activity in Spinal Cord Injury: A Multivariate Analysis

    ERIC Educational Resources Information Center

    Keegan, John; Ditchman, Nicole; Dutta, Alo; Chiu, Chung-Yi; Muller, Veronica; Chan, Fong; Kundu, Madan

    2016-01-01

    Purpose: To apply the constructs of social cognitive theory (SCT) and the theory of planned behavior (TPB) to understand the stages of change (SOC) for physical activities among individuals with a spinal cord injury (SCI). Method: Ex post facto design using multivariate analysis of variance (MANOVA). The participants were 144 individuals with SCI…

  9. To See the World in a Grain of Sand: Recognizing the Origin of Sand Specimens by Diffuse Reflectance Infrared Fourier Transform Spectroscopy and Multivariate Exploratory Data Analysis

    ERIC Educational Resources Information Center

    Pezzolo, Alessandra De Lorenzi

    2011-01-01

    The diffuse reflectance infrared Fourier transform (DRIFT) spectra of sand samples exhibit features reflecting their composition. Basic multivariate analysis (MVA) can be used to effectively sort subsets of homogeneous specimens collected from nearby locations, as well as pointing out similarities in composition among sands of different origins.…

  10. Testing key predictions of the associative account of mirror neurons in humans using multivariate pattern analysis.

    PubMed

    Oosterhof, Nikolaas N; Wiggett, Alison J; Cross, Emily S

    2014-04-01

    Cook et al. overstate the evidence supporting their associative account of mirror neurons in humans: most studies do not address a key property, action-specificity that generalizes across the visual and motor domains. Multivariate pattern analysis (MVPA) of neuroimaging data can address this concern, and we illustrate how MVPA can be used to test key predictions of their account.

  11. Multivariate Quantitative Chemical Analysis

    NASA Technical Reports Server (NTRS)

    Kinchen, David G.; Capezza, Mary

    1995-01-01

    Technique of multivariate quantitative chemical analysis devised for use in determining relative proportions of two components mixed and sprayed together onto object to form thermally insulating foam. Potentially adaptable to other materials, especially in process-monitoring applications in which necessary to know and control critical properties of products via quantitative chemical analyses of products. In addition to chemical composition, also used to determine such physical properties as densities and strengths.

  12. Multivariate statistical analysis of low-voltage EDS spectrum images

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Anderson, I.M.

    1998-03-01

    Whereas energy-dispersive X-ray spectrometry (EDS) has been used for compositional analysis in the scanning electron microscope for 30 years, the benefits of using low operating voltages for such analyses have been explored only during the last few years. This paper couples low-voltage EDS with two other emerging areas of characterization: spectrum imaging and multivariate statistical analysis. The specimen analyzed for this study was a finished Intel Pentium processor, with the polyimide protective coating stripped off to expose the final active layers.

  13. Multivariate two-part statistics for analysis of correlated mass spectrometry data from multiple biological specimens.

    PubMed

    Taylor, Sandra L; Ruhaak, L Renee; Weiss, Robert H; Kelly, Karen; Kim, Kyoungmi

    2017-01-01

    High-throughput mass spectrometry (MS) is now being used to profile small molecular compounds across multiple biological sample types from the same subjects with the goal of leveraging information across biospecimens. Multivariate statistical methods that combine information from all biospecimens could be more powerful than the usual univariate analyses. However, missing values are common in MS data and imputation can impact between-biospecimen correlation and multivariate analysis results. We propose two multivariate two-part statistics that accommodate missing values and combine data from all biospecimens to identify differentially regulated compounds. Statistical significance is determined using a multivariate permutation null distribution. Relative to univariate tests, the multivariate procedures detected more significant compounds in three biological datasets. In a simulation study, we showed that multi-biospecimen testing procedures were more powerful than single-biospecimen methods when compounds are differentially regulated in multiple biospecimens but univariate methods can be more powerful if compounds are differentially regulated in only one biospecimen. We provide R functions to implement and illustrate our method as supplementary information. Contact: sltaylor@ucdavis.edu. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
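
    The idea of a multivariate permutation null distribution can be sketched as follows: a single statistic combines evidence across all biospecimens, and its null distribution is obtained by shuffling group labels while keeping each subject's measurements together. The statistic used here (sum of squared mean differences) is a deliberately simple stand-in and not the authors' two-part statistic; it does not handle missing values, and all names and data are hypothetical.

```python
import random
from statistics import mean


def combined_statistic(rows, labels):
    """Sum over biospecimens (columns) of the squared difference in group means."""
    total = 0.0
    for k in range(len(rows[0])):
        group0 = [row[k] for row, lab in zip(rows, labels) if lab == 0]
        group1 = [row[k] for row, lab in zip(rows, labels) if lab == 1]
        total += (mean(group1) - mean(group0)) ** 2
    return total


def permutation_p_value(rows, labels, n_perm=2000, seed=0):
    """Permutation p-value; whole subjects (rows) are relabelled, so the
    correlation between biospecimens within a subject is preserved."""
    rng = random.Random(seed)
    observed = combined_statistic(rows, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if combined_statistic(rows, shuffled) >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_perm + 1)


# Hypothetical abundances of one compound in two biospecimens per subject.
rows = [[1.0, 2.0], [1.1, 2.1], [0.9, 1.9], [1.2, 2.2], [1.0, 2.0], [0.8, 1.8],
        [5.0, 6.0], [5.1, 6.2], [4.9, 5.8], [5.2, 6.1], [5.0, 6.0], [4.8, 5.9]]
labels = [0] * 6 + [1] * 6
print(permutation_p_value(rows, labels))
```

    Permuting labels at the subject level, instead of permuting each column independently, is the design choice that keeps the between-biospecimen dependence intact under the null.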

  14. Network meta-analysis of multiple outcome measures accounting for borrowing of information across outcomes.

    PubMed

    Achana, Felix A; Cooper, Nicola J; Bujkiewicz, Sylwia; Hubbard, Stephanie J; Kendrick, Denise; Jones, David R; Sutton, Alex J

    2014-07-21

    Network meta-analysis (NMA) enables simultaneous comparison of multiple treatments while preserving randomisation. When summarising evidence to inform an economic evaluation, it is important that the analysis accurately reflects the dependency structure within the data, as correlations between outcomes may have implications for estimating the net benefit associated with treatment. A multivariate NMA offers a framework for evaluating multiple treatments across multiple outcome measures while accounting for the correlation structure between outcomes. The standard NMA model is extended to multiple outcome settings in two stages. In the first stage, information is borrowed across outcomes as well as across studies through modelling the within-study and between-study correlation structure. In the second stage, we make use of the additional assumption that intervention effects are exchangeable between outcomes to predict effect estimates for all outcomes, including effect estimates on outcomes where evidence is either sparse or the treatment had not been considered by any one of the studies included in the analysis. We apply the methods to binary outcome data from a systematic review evaluating the effectiveness of nine home safety interventions on uptake of three poisoning prevention practices (safe storage of medicines, safe storage of other household products, and possession of poison centre control telephone number) in households with children. Analyses are conducted in WinBUGS using Markov Chain Monte Carlo (MCMC) simulations. Univariate and the first stage multivariate models produced broadly similar point estimates of intervention effects but the uncertainty around the multivariate estimates varied depending on the prior distribution specified for the between-study covariance structure. The second stage multivariate analyses produced more precise effect estimates while enabling intervention effects to be predicted for all outcomes, including intervention effects on outcomes not directly considered by the studies included in the analysis. Accounting for the dependency between outcomes in a multivariate meta-analysis may or may not improve the precision of effect estimates from a network meta-analysis compared to analysing each outcome separately.

  15. The application of ATR-FTIR spectroscopy and multivariate data analysis to study drug crystallisation in the stratum corneum.

    PubMed

    Goh, Choon Fu; Craig, Duncan Q M; Hadgraft, Jonathan; Lane, Majella E

    2017-02-01

    Drug permeation through the intercellular lipids, which pack around and between corneocytes, may be enhanced by increasing the thermodynamic activity of the active in a formulation. However, this may also result in unwanted drug crystallisation on and in the skin. In this work, we explore the combination of ATR-FTIR spectroscopy and multivariate data analysis to study drug crystallisation in the skin. Ex vivo permeation studies of saturated solutions of diclofenac sodium (DF Na) in two vehicles, propylene glycol (PG) and dimethyl sulphoxide (DMSO), were carried out in porcine ear skin. Tape stripping and ATR-FTIR spectroscopy were conducted simultaneously to collect spectral data as a function of skin depth. Multivariate data analysis was applied to visualise and categorise the spectral data in the region of interest (1700-1500 cm⁻¹) containing the carboxylate (COO⁻) asymmetric stretching vibrations of DF Na. Spectral data showed the redshifts of the COO⁻ asymmetric stretching vibrations for DF Na in the solution compared with solid drug. Similar shifts were evident following application of saturated solutions of DF Na to porcine skin samples. Multivariate data analysis categorised the spectral data based on the spectral differences and drug crystallisation was found to be confined to the upper layers of the skin. This proof-of-concept study highlights the utility of ATR-FTIR spectroscopy in combination with multivariate data analysis as a simple and rapid approach in the investigation of drug deposition in the skin. The approach described here will be extended to the study of other actives for topical application to the skin. Copyright © 2016 Elsevier B.V. All rights reserved.

  16. Describing the Elephant: Structure and Function in Multivariate Data.

    ERIC Educational Resources Information Center

    McDonald, Roderick P.

    1986-01-01

    There is a unity underlying the diversity of models for the analysis of multivariate data. Essentially, they constitute a family of models, most generally nonlinear, for structural/functional relations between variables drawn from a behavior domain. (Author)

  17. Prognostic factors and relative risk for survival in N1-3 oral squamous cell carcinoma: a multivariate analysis using Cox's hazard model.

    PubMed

    Noguchi, M; Kido, Y; Kubota, H; Kinjo, H; Kohama, G

    1999-12-01

    The records of 136 patients with N1-3 oral squamous cell carcinoma treated by surgery were investigated retrospectively, with the aim of finding out which factors were predictive of survival on multivariate analysis. Four independent factors significantly influenced survival in the following order: pN stage; T stage; histological grade; and N stage. The most significant was pN stage, the five-year survival for patients with pN0 being 91% and for patients with pN1-3 41%. A further study was carried out on the 80 patients with pN1-3 to find out their prognostic factors for survival and the independent factors identified by multivariate analysis were T stage and presence or absence of extracapsular spread to metastatic lymph nodes.

  18. Calypso: a user-friendly web-server for mining and visualizing microbiome-environment interactions.

    PubMed

    Zakrzewski, Martha; Proietti, Carla; Ellis, Jonathan J; Hasan, Shihab; Brion, Marie-Jo; Berger, Bernard; Krause, Lutz

    2017-03-01

    Calypso is an easy-to-use online software suite that allows non-expert users to mine, interpret and compare taxonomic information from metagenomic or 16S rDNA datasets. Calypso has a focus on multivariate statistical approaches that can identify complex environment-microbiome associations. The software enables quantitative visualizations, statistical testing, multivariate analysis, supervised learning, factor analysis, multivariable regression, network analysis and diversity estimates. Comprehensive help pages, tutorials and videos are provided via a wiki page. The web-interface is accessible via http://cgenome.net/calypso/ . The software is programmed in Java, PERL and R and the source code is available from Zenodo ( https://zenodo.org/record/50931 ). The software is freely available for non-commercial users. Contact: l.krause@uq.edu.au. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  19. DigOut: viewing differential expression genes as outliers.

    PubMed

    Yu, Hui; Tu, Kang; Xie, Lu; Li, Yuan-Yuan

    2010-12-01

    With regard to well-replicated two-conditional microarray datasets, the selection of differentially expressed (DE) genes is a well-studied computational topic, but for multi-conditional microarray datasets with limited or no replication, the same task is not properly addressed by previous studies. This paper adopts multivariate outlier analysis to analyze replication-lacking multi-conditional microarray datasets, finding that it performs significantly better than the widely used limit fold change (LFC) model in a simulated comparative experiment. Compared with the LFC model, the multivariate outlier analysis also demonstrates improved stability against sample variations in a series of manipulated real expression datasets. The reanalysis of a real non-replicated multi-conditional expression dataset series leads to satisfactory results. In conclusion, a multivariate outlier analysis algorithm, like DigOut, is particularly useful for selecting DE genes from non-replicated multi-conditional gene expression datasets.

  20. Immediate versus delayed intramedullary nailing for open fractures of the tibial shaft: a multivariate analysis of factors affecting deep infection and fracture healing.

    PubMed

    Yokoyama, Kazuhiko; Itoman, Moritoshi; Uchino, Masataka; Fukushima, Kensuke; Nitta, Hiroshi; Kojima, Yoshiaki

    2008-10-01

    The purpose of this study was to evaluate contributing factors affecting deep infection and fracture healing of open tibia fractures treated with locked intramedullary nailing (IMN) by multivariate analysis. We examined 99 open tibial fractures (98 patients) treated with immediate or delayed locked IMN in static fashion from 1991 to 2002. Multivariate analyses following univariate analyses were performed to determine predictors of deep infection, nonunion, and healing time to union. The following predictive variables of deep infection were selected for analysis: age, sex, Gustilo type, fracture grade by AO type, fracture location, timing or method of IMN, reamed or unreamed nailing, debridement time (≤6 h or >6 h), method of soft-tissue management, skin closure time (≤1 week or >1 week), existence of polytrauma (ISS <18 or ISS ≥18), existence of floating knee injury, and existence of superficial/pin site infection. The predictive variables of nonunion selected for analysis were the same as those for deep infection, with deep infection added in exchange for pin site infection. The predictive variables of union time selected for analysis were the same as those for nonunion, excluding location, debridement time, and existence of floating knee and superficial infection. Six (6.1%; type II Gustilo n=1, type IIIB Gustilo n=5) of the 99 open tibial fractures developed deep infections. Multivariate analysis revealed that timing or method of IMN, debridement time, method of soft-tissue management, and existence of superficial or pin site infection significantly correlated with the occurrence of deep infection (P < 0.0001). In the immediate nailing group alone, the deep infection rate in type IIIB + IIIC was significantly higher than those in type I + II and IIIA (P = 0.016). Nonunion occurred in 17 fractures (20.3%, 17/84). Multivariate analysis revealed that Gustilo type, skin closure time, and existence of deep infection significantly correlated with occurrence of nonunion (P < 0.05). Gustilo type and existence of deep infection were significantly correlated with healing time to union on multivariate analysis (r² = 0.263, P = 0.0001). Multivariate analyses for open tibial fractures treated with IMN showed that IMN after EF (especially in the presence of pin site infection) carried a high risk of deep infection, and that debridement within 6 h and appropriate soft-tissue management were also important factors in preventing deep infections. These analyses suggested that both the Gustilo type and the existence of deep infection are related to fracture healing in open fractures treated with IMN. In addition, immediate IMN for type IIIB and IIIC is potentially risky, and canal reaming did not increase the risk of complication for open tibial fractures treated with IMN.

  1. Is a multivariate consensus representation of genetic relationships among populations always meaningful?

    PubMed Central

    Moazami-Goudarzi, K; Laloë, D

    2002-01-01

    To determine the relationships among closely related populations or species, two methods are commonly used in the literature: phylogenetic reconstruction or multivariate analysis. The aim of this article is to assess the reliability of multivariate analysis. We describe a method that is based on principal component analysis and Mantel correlations, using a two-step process: The first step consists of a single-marker analysis and the second step tests if each marker reveals the same typology concerning population differentiation. We conclude that if single markers are not congruent, the compromise structure is not meaningful. Our model is not based on any particular mutation process and it can be applied to most of the commonly used genetic markers. This method is also useful to determine the contribution of each marker to the typology of populations. We test whether our method is efficient with two real data sets based on microsatellite markers. Our analysis suggests that for closely related populations, it is not always possible to accept the hypothesis that an increase in the number of markers will increase the reliability of the typology analysis. PMID:12242255
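
    The second step of the method in this record relies on Mantel correlations, i.e. the correlation between corresponding off-diagonal entries of two distance matrices, with significance assessed by permuting the population labels of one matrix. The sketch below is a generic Mantel test under that textbook definition, not the authors' implementation; the function names and the toy matrix are hypothetical.

```python
import math
import random


def upper_triangle(matrix):
    """Flatten the entries above the diagonal of a square matrix."""
    n = len(matrix)
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def mantel(d1, d2, n_perm=999, seed=0):
    """Mantel correlation between two distance matrices and a one-sided
    permutation p-value obtained by relabelling the populations of d2."""
    rng = random.Random(seed)
    n = len(d1)
    flat1 = upper_triangle(d1)
    observed = pearson(flat1, upper_triangle(d2))
    order = list(range(n))
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(order)
        # Permute rows and columns of d2 with the same relabelling.
        permuted = [[d2[order[i]][order[j]] for j in range(n)] for i in range(n)]
        if pearson(flat1, upper_triangle(permuted)) >= observed:
            hits += 1
    return observed, (hits + 1) / (n_perm + 1)


# Hypothetical distances among four populations, as seen by one marker.
d_marker = [[0, 1, 4, 6],
            [1, 0, 3, 5],
            [4, 3, 0, 2],
            [6, 5, 2, 0]]
r, p = mantel(d_marker, d_marker)
print(r, p)
```

    Rows and columns are permuted together because the entries of a distance matrix are not independent observations; shuffling individual cells would break the matrix structure.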

  2. Application of multivariate analysis to investigate the trace element contamination in top soil of coal mining district in Jorong, South Kalimantan, Indonesia

    NASA Astrophysics Data System (ADS)

    Pujiwati, Arie; Nakamura, K.; Watanabe, N.; Komai, T.

    2018-02-01

    Multivariate analysis is applied to investigate the geochemistry of several trace elements in top soils and their relation to contamination sources influenced by coal mines in Jorong, South Kalimantan. Total concentrations of Cd, V, Co, Ni, Cr, Zn, As, Pb, Sb, Cu and Ba were determined in 20 soil samples by bulk analysis. Pearson correlation is applied to specify the linear correlation among the elements. Principal Component Analysis (PCA) and Cluster Analysis (CA) were applied to observe the classification of trace elements and contamination sources. The results suggest that contamination loading is contributed by Cr, Cu, Ni, Zn, As, and Pb. The elemental loading mostly affects the non-coal mining area, for instance the areas near settlements and agricultural land. Moreover, the contamination source is classified into the areas that are influenced by the coal mining activity, the agricultural types, and the river mixing zone. Multivariate analysis could elucidate the elemental loading and the contamination sources of trace elements in the vicinity of the coal mine area.

  3. The classification of secondary colorectal liver cancer in human biopsy samples using angular dispersive x-ray diffraction and multivariate analysis

    NASA Astrophysics Data System (ADS)

    Theodorakou, Chrysoula; Farquharson, Michael J.

    2009-08-01

    The motivation behind this study is to assess whether angular dispersive x-ray diffraction (ADXRD) data, processed using multivariate analysis techniques, can be used for classifying secondary colorectal liver cancer tissue and normal surrounding liver tissue in human liver biopsy samples. The ADXRD profiles from a total of 60 samples of normal liver tissue and colorectal liver metastases were measured using a synchrotron radiation source. The data were analysed for 56 samples using nonlinear peak-fitting software. Four peaks were fitted to all of the ADXRD profiles, and the amplitude, area, amplitude and area ratios for three of the four peaks were calculated and used for the statistical and multivariate analysis. The statistical analysis showed that there are significant differences between all the peak-fitting parameters and ratios between the normal and the diseased tissue groups. The technique of soft independent modelling of class analogy (SIMCA) was used to classify normal liver tissue and colorectal liver metastases resulting in 67% of the normal tissue samples and 60% of the secondary colorectal liver tissue samples being classified correctly. This study has shown that the ADXRD data of normal and secondary colorectal liver cancer are statistically different and x-ray diffraction data analysed using multivariate analysis have the potential to be used as a method of tissue classification.

  4. Multivariate analysis of risk factors for long-term urethroplasty outcome.

    PubMed

    Breyer, Benjamin N; McAninch, Jack W; Whitson, Jared M; Eisenberg, Michael L; Mehdizadeh, Jennifer F; Myers, Jeremy B; Voelzke, Bryan B

    2010-02-01

    We studied the patient risk factors that promote urethroplasty failure. Records of patients who underwent urethroplasty at the University of California, San Francisco Medical Center between 1995 and 2004 were reviewed. Cox proportional hazards regression analysis was used to identify multivariate predictors of urethroplasty outcome. Between 1995 and 2004, 443 patients of 495 who underwent urethroplasty had complete comorbidity data and were included in analysis. Median patient age was 41 years (range 18 to 90). Median followup was 5.8 years (range 1 month to 10 years). Stricture recurred in 93 patients (21%). Primary estimated stricture-free survival at 1, 3 and 5 years was 88%, 82% and 79%. After multivariate analysis smoking (HR 1.8, 95% CI 1.0-3.1, p = 0.05), prior direct vision internal urethrotomy (HR 1.7, 95% CI 1.0-3.0, p = 0.04) and prior urethroplasty (HR 1.8, 95% CI 1.1-3.1, p = 0.03) were predictive of treatment failure. On multivariate analysis diabetes mellitus showed a trend toward prediction of urethroplasty failure (HR 2.0, 95% CI 0.8-4.9, p = 0.14). Length of urethral stricture (greater than 4 cm), prior urethroplasty and failed endoscopic therapy are predictive of failure after urethroplasty. Smoking and diabetes mellitus also may predict failure potentially secondary to microvascular damage. Copyright 2010 American Urological Association. Published by Elsevier Inc. All rights reserved.
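
    The hazard ratios, confidence intervals and p-values in this record are linked by the usual Wald relationship on the log scale. As a rough consistency check (a sketch that assumes a symmetric Wald interval, which the abstract does not state), the standard error can be recovered from the reported 95% CI and used to recompute z and p. The function name is hypothetical, and the reported figures are rounded, so only approximate agreement should be expected.

```python
import math


def wald_from_ci(hr, lower, upper, z_crit=1.959964):
    """Recover SE(log HR), the Wald z statistic and the two-sided p-value
    from a hazard ratio and its 95% confidence interval."""
    se = (math.log(upper) - math.log(lower)) / (2 * z_crit)
    z = math.log(hr) / se
    # Two-sided normal tail probability: 2 * (1 - Phi(|z|)) = erfc(|z| / sqrt(2)).
    p = math.erfc(abs(z) / math.sqrt(2))
    return se, z, p


# Smoking, as reported in the abstract: HR 1.8, 95% CI 1.0-3.1.
se, z, p = wald_from_ci(1.8, 1.0, 3.1)
print(f"SE(log HR) = {se:.3f}, z = {z:.2f}, p = {p:.3f}")
```

    For the smoking entry this gives a p-value close to 0.04, in line with the reported p = 0.05 given that the hazard ratio and interval limits are rounded to one decimal place.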

  5. Multivariate meta-analysis of individual participant data helped externally validate the performance and implementation of a prediction model.

    PubMed

    Snell, Kym I E; Hua, Harry; Debray, Thomas P A; Ensor, Joie; Look, Maxime P; Moons, Karel G M; Riley, Richard D

    2016-01-01

    Our aim was to improve meta-analysis methods for summarizing a prediction model's performance when individual participant data are available from multiple studies for external validation. We suggest multivariate meta-analysis for jointly synthesizing calibration and discrimination performance, while accounting for their correlation. The approach estimates a prediction model's average performance, the heterogeneity in performance across populations, and the probability of "good" performance in new populations. This allows different implementation strategies (e.g., recalibration) to be compared. Application is made to a diagnostic model for deep vein thrombosis (DVT) and a prognostic model for breast cancer mortality. In both examples, multivariate meta-analysis reveals that calibration performance is excellent on average but highly heterogeneous across populations unless the model's intercept (baseline hazard) is recalibrated. For the cancer model, the probability of "good" performance (defined by C statistic ≥0.7 and calibration slope between 0.9 and 1.1) in a new population was 0.67 with recalibration but 0.22 without recalibration. For the DVT model, even with recalibration, there was only a 0.03 probability of "good" performance. Multivariate meta-analysis can be used to externally validate a prediction model's calibration and discrimination performance across multiple populations and to evaluate different implementation strategies. Crown Copyright © 2016. Published by Elsevier Inc. All rights reserved.

  6. Defining critical habitats of threatened and endemic reef fishes with a multivariate approach.

    PubMed

    Purcell, Steven W; Clarke, K Robert; Rushworth, Kelvin; Dalton, Steven J

    2014-12-01

    Understanding critical habitats of threatened and endemic animals is essential for mitigating extinction risks, developing recovery plans, and siting reserves, but assessment methods are generally lacking. We evaluated critical habitats of 8 threatened or endemic fish species on coral and rocky reefs of subtropical eastern Australia, by measuring physical and substratum-type variables of habitats at fish sightings. We used nonmetric and metric multidimensional scaling (nMDS, mMDS), Analysis of similarities (ANOSIM), similarity percentages analysis (SIMPER), permutational analysis of multivariate dispersions (PERMDISP), and other multivariate tools to distinguish critical habitats. Niche breadth was widest for 2 endemic wrasses, and reef inclination was important for several species, often found in relatively deep microhabitats. Critical habitats of mainland reef species included small caves or habitat-forming hosts such as gorgonian corals and black coral trees. Hard corals appeared important for reef fishes at Lord Howe Island, and red algae for mainland reef fishes. A wide range of habitat variables are required to assess critical habitats owing to varied affinities of species to different habitat features. We advocate assessments of critical habitats matched to the spatial scale used by the animals and a combination of multivariate methods. Our multivariate approach furnishes a general template for assessing the critical habitats of species, understanding how these vary among species, and determining differences in the degree of habitat specificity. © 2014 Society for Conservation Biology.

  7. Parameters Selection for Bivariate Multiscale Entropy Analysis of Postural Fluctuations in Fallers and Non-Fallers Older Adults.

    PubMed

    Ramdani, Sofiane; Bonnet, Vincent; Tallon, Guillaume; Lagarde, Julien; Bernard, Pierre Louis; Blain, Hubert

    2016-08-01

    Entropy measures are often used to quantify the regularity of postural sway time series. Recent methodological developments provided both multivariate and multiscale approaches allowing the extraction of complexity features from physiological signals; see "Dynamical complexity of human responses: A multivariate data-adaptive framework," in Bulletin of Polish Academy of Science and Technology, vol. 60, p. 433, 2012. The resulting entropy measures are good candidates for the analysis of bivariate postural sway signals exhibiting nonstationarity and multiscale properties. These methods are dependent on several input parameters such as embedding parameters. Using two data sets collected from institutionalized frail older adults, we numerically investigate the behavior of a recent multivariate and multiscale entropy estimator; see "Multivariate multiscale entropy: A tool for complexity analysis of multichannel data," Physical Review E, vol. 84, p. 061918, 2011. We propose criteria for the selection of the input parameters. Using these optimal parameters, we statistically compare the multivariate and multiscale entropy values of postural sway data of non-faller subjects to those of fallers. These two groups are discriminated by the resulting measures over multiple time scales. We also demonstrate that the typical parameter settings proposed in the literature lead to entropy measures that do not distinguish the two groups. This last result confirms the importance of the selection of appropriate input parameters.

  8. Multivariate pattern analysis of MEG and EEG: A comparison of representational structure in time and space.

    PubMed

    Cichy, Radoslaw Martin; Pantazis, Dimitrios

    2017-09-01

    Multivariate pattern analysis of magnetoencephalography (MEG) and electroencephalography (EEG) data can reveal the rapid neural dynamics underlying cognition. However, MEG and EEG have systematic differences in sampling neural activity. This raises the question of the degree to which such measurement differences consistently bias the results of multivariate analysis applied to MEG and EEG activation patterns. To investigate, we conducted a concurrent MEG/EEG study while participants viewed images of everyday objects. We applied multivariate classification analyses to MEG and EEG data, and compared the resulting time courses to each other, and to fMRI data for an independent evaluation in space. We found that both MEG and EEG revealed the millisecond spatio-temporal dynamics of visual processing with largely equivalent results. Beyond yielding convergent results, we found that MEG and EEG also captured partly unique aspects of visual representations. Those unique components emerged earlier in time for MEG than for EEG. Identifying the sources of those unique components with fMRI, we found the locus for both MEG and EEG in high-level visual cortex, and in addition for MEG in low-level visual cortex. Together, our results show that multivariate analyses of MEG and EEG data offer a convergent and complementary view on neural processing, and motivate the wider adoption of these methods in both MEG and EEG research. Copyright © 2017 Elsevier Inc. All rights reserved.

  9. Revisiting the Holy Grail: using plant functional traits to understand ecological processes.

    PubMed

    Funk, Jennifer L; Larson, Julie E; Ames, Gregory M; Butterfield, Bradley J; Cavender-Bares, Jeannine; Firn, Jennifer; Laughlin, Daniel C; Sutton-Grier, Ariana E; Williams, Laura; Wright, Justin

    2017-05-01

    One of ecology's grand challenges is developing general rules to explain and predict highly complex systems. Understanding and predicting ecological processes from species' traits has been considered a 'Holy Grail' in ecology. Plant functional traits are increasingly being used to develop mechanistic models that can predict how ecological communities will respond to abiotic and biotic perturbations and how species will affect ecosystem function and services in a rapidly changing world; however, significant challenges remain. In this review, we highlight recent work and outstanding questions in three areas: (i) selecting relevant traits; (ii) describing intraspecific trait variation and incorporating this variation into models; and (iii) scaling trait data to community- and ecosystem-level processes. Over the past decade, there have been significant advances in the characterization of plant strategies based on traits and trait relationships, and the integration of traits into multivariate indices and models of community and ecosystem function. However, the utility of trait-based approaches in ecology will benefit from efforts that demonstrate how these traits and indices influence organismal, community, and ecosystem processes across vegetation types, which may be achieved through meta-analysis and enhancement of trait databases. Additionally, intraspecific trait variation and species interactions need to be incorporated into predictive models using tools such as Bayesian hierarchical modelling. Finally, existing models linking traits to community and ecosystem processes need to be empirically tested for their applicability to be realized. © 2016 Cambridge Philosophical Society.

  10. Power and sample-size estimation for microbiome studies using pairwise distances and PERMANOVA.

    PubMed

    Kelly, Brendan J; Gross, Robert; Bittinger, Kyle; Sherrill-Mix, Scott; Lewis, James D; Collman, Ronald G; Bushman, Frederic D; Li, Hongzhe

    2015-08-01

    The variation in community composition between microbiome samples, termed beta diversity, can be measured by pairwise distance based on either presence-absence or quantitative species abundance data. PERMANOVA, a permutation-based extension of multivariate analysis of variance to a matrix of pairwise distances, partitions within-group and between-group distances to permit assessment of the effect of an exposure or intervention (grouping factor) upon the sampled microbiome. Within-group distance and exposure/intervention effect size must be accurately modeled to estimate statistical power for a microbiome study that will be analyzed with pairwise distances and PERMANOVA. We present a framework for PERMANOVA power estimation tailored to marker-gene microbiome studies that will be analyzed by pairwise distances, which includes: (i) a novel method for distance matrix simulation that permits modeling of within-group pairwise distances according to pre-specified population parameters; (ii) a method to incorporate effects of different sizes within the simulated distance matrix; (iii) a simulation-based method for estimating PERMANOVA power from simulated distance matrices; and (iv) an R statistical software package that implements the above. Matrices of pairwise distances can be efficiently simulated to satisfy the triangle inequality and incorporate group-level effects, which are quantified by the adjusted coefficient of determination, omega-squared (ω²). From simulated distance matrices, available PERMANOVA power or necessary sample size can be estimated for a planned microbiome study. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
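
    As a rough illustration of the PERMANOVA partitioning described above, the following sketch computes the pseudo-F statistic from a matrix of pairwise distances and a permutation p-value. It is a minimal one-factor version written for this listing, not the authors' R package; the function names `pseudo_f` and `permanova`, the Euclidean distances and the simulated points are all assumptions made for illustration (a microbiome study would normally use an ecological distance instead).

```python
import numpy as np


def pseudo_f(dist, labels):
    """One-factor PERMANOVA pseudo-F from a square distance matrix."""
    dist = np.asarray(dist, dtype=float)
    labels = np.asarray(labels)
    n = dist.shape[0]
    groups = np.unique(labels)
    a = len(groups)
    # Total sum of squares: squared distances over all pairs, divided by N.
    iu = np.triu_indices(n, k=1)
    ss_total = (dist[iu] ** 2).sum() / n
    # Within-group sum of squares: same quantity computed inside each group.
    ss_within = 0.0
    for g in groups:
        idx = np.flatnonzero(labels == g)
        sub = dist[np.ix_(idx, idx)]
        ju = np.triu_indices(len(idx), k=1)
        ss_within += (sub[ju] ** 2).sum() / len(idx)
    ss_among = ss_total - ss_within
    return (ss_among / (a - 1)) / (ss_within / (n - a))


def permanova(dist, labels, n_perm=999, seed=0):
    """Observed pseudo-F and permutation p-value (labels are shuffled)."""
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    f_obs = pseudo_f(dist, labels)
    exceed = sum(
        pseudo_f(dist, rng.permutation(labels)) >= f_obs
        for _ in range(n_perm)
    )
    return f_obs, (exceed + 1) / (n_perm + 1)


# Hypothetical data: two groups of 10 points with clearly different centres.
rng = np.random.default_rng(1)
points = np.vstack([rng.normal(0.0, 1.0, (10, 2)),
                    rng.normal(5.0, 1.0, (10, 2))])
group = np.array([0] * 10 + [1] * 10)
dist = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=-1)

f_obs, p_value = permanova(dist, group)
print(f"pseudo-F = {f_obs:.1f}, p = {p_value:.3f}")
```

    Repeating such a test over many simulated distance matrices and counting the fraction of rejections would give a simulation-based power estimate in the spirit of item (iii).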

  11. Nontargeted, Rapid Screening of Extra Virgin Olive Oil Products for Authenticity Using Near-Infrared Spectroscopy in Combination with Conformity Index and Multivariate Statistical Analyses.

    PubMed

    Karunathilaka, Sanjeewa R; Kia, Ali-Reza Fardin; Srigley, Cynthia; Chung, Jin Kyu; Mossoba, Magdi M

    2016-10-01

    A rapid tool for evaluating authenticity was developed and applied to the screening of extra virgin olive oil (EVOO) retail products by using Fourier-transform near infrared (FT-NIR) spectroscopy in combination with univariate and multivariate data analysis methods. Using disposable glass tubes, spectra for 62 reference EVOO, 10 edible oil adulterants, 20 blends consisting of EVOO spiked with adulterants, 88 retail EVOO products and other test samples were rapidly measured in the transmission mode without any sample preparation. The univariate conformity index (CI) and the multivariate supervised soft independent modeling of class analogy (SIMCA) classification tool were used to analyze the various olive oil products which were tested for authenticity against a library of reference EVOO. Better discrimination between the authentic EVOO and some commercial EVOO products was observed with SIMCA than with CI analysis. Approximately 61% of all EVOO commercial products were flagged by SIMCA analysis, suggesting that further analysis be performed to identify quality issues and/or potential adulterants. Due to its simplicity and speed, FT-NIR spectroscopy in combination with multivariate data analysis can be used as a complementary tool to conventional official methods of analysis to rapidly flag EVOO products that may not belong to the class of authentic EVOO. Published 2016. This article is a U.S. Government work and is in the public domain in the USA.

  12. Metric Selection for Evaluation of Human Supervisory Control Systems

    DTIC Science & Technology

    2009-12-01

    finding a significant effect when there is none becomes more likely. The inflation of type I error due to multiple dependent variables can be handled...with multivariate analysis techniques, such as Multivariate Analysis of Variance (MANOVA) (Johnson & Wichern, 2002). However, it should be noted that...the few significant differences among many insignificant ones. The best way to avoid failure to identify significant differences is to design an

  13. A Civilian/Military Trauma Institute: National Trauma Coordinating Center

    DTIC Science & Technology

    2015-12-01

    zip codes was used in “proximity to violence” analysis. Data were analyzed using SPSS (version 20.0, SPSS Inc., Chicago, IL). Multivariable linear...number of adverse events and serious events was not statistically higher in one group, the incidence of deep venous thrombosis (DVT) was statistically ...subjects the lack of statistical difference on multivariate analysis may be related to an underpowered sample size. It was recommended that the

  14. Exploratory Multivariate Analysis. A Graphical Approach.

    DTIC Science & Technology

    1981-01-01

    Gnanadesikan, 1977) but we feel that these should be used with great caution unless one really has good reason to believe that the data came from such a...are referred to Gnanadesikan (1977). The present author hopes that the convenience of a single summary or significance level will not deter his readers...fit of a harmonic model to meteorological data. (In preparation). Gnanadesikan, R. (1977). Methods for Statistical Data Analysis of Multivariate

  15. External validation of the modified Glasgow prognostic score for renal cancer

    PubMed Central

    Tai, Caroline G.; Johnson, Timothy V.; Abbasi, Ammara; Herrell, Lindsey; Harris, Wayne B.; Kucuk, Omer; Canter, Daniel J.; Ogan, Kenneth; Pattaras, John G.; Nieh, Peter T.; Master, Viraj A.

    2014-01-01

    Purpose: The modified Glasgow Prognostic Score (mGPS) incorporates C-reactive protein and albumin as a clinically useful marker of tumor behavior. The ability of the mGPS to predict metastasis in localized renal cell carcinoma (RCC) remains unknown in an external validation cohort. Patients and Methods: Patients with clinically localized clear cell RCC were followed for 1 year post-operatively. Metastases were identified radiologically. Patients were categorized by mGPS score as low-risk (mGPS = 0 points), intermediate-risk (mGPS = 1 point) and high-risk (mGPS = 2 points). Univariate, Kaplan-Meier and multivariate Cox regression analyses examined recurrence-free survival (RFS) across patient and disease characteristics. Results: Of the 129 patients in this study, 23.3% developed metastases. Of low-, intermediate- and high-risk patients, 10.1%, 38.9% and 89.9% recurred during the study. After accounting for various patient and tumor characteristics in multivariate analysis including stage and grade, only mGPS was significantly associated with RFS. Compared with low-risk patients, intermediate- and high-risk patients experienced a 4-fold (hazard ratio [HR]: 4.035, 95% confidence interval [CI]: 1.312-12.415, P = 0.015) and 7-fold (HR: 7.012, 95% CI: 2.126-23.123, P < 0.001) risk of metastasis, respectively. Conclusions: mGPS is a robust predictor of metastasis following potentially curative nephrectomy for localized RCC. Clinicians may consider mGPS as an adjunct to identify high-risk patients for possible enrollment into clinical trials or for patient counseling. PMID:24497679

  16. Determinants of ocular deviation in esotropic subjects under general anesthesia.

    PubMed

    Daien, Vincent; Turpin, Chloé; Lignereux, François; Belghobsi, Riadh; Le Meur, Guylene; Lebranchu, Pierre; Pechereau, Alain

    2013-01-01

    The authors attempted to identify the determinants of ocular deviation in a population of patients with esotropia under general anesthesia. Forty-one patients with esotropia were included. Horizontal ocular deviation was evaluated by the photographic Hirschberg test both in the awakened state and under general anesthesia before surgery. Changes in ocular deviation were measured and a multivariate analysis was used to assess its clinical determinants. The mean age (± standard deviation [SD]) of study subjects was 13 ± 11 years and 51% were females. The mean spherical equivalent refraction of the right eye was 2.44 ± 2.50 diopters (D), with no significant difference between eyes (P = .26). The mean ocular deviation changed significantly, from 33.5 ± 12.5 prism diopters (PD) at preoperative examination to 8.8 ± 11.4 PD under general anesthesia (P = .0001). The changes in ocular deviation positively correlated with the preoperative ocular deviation (correlation coefficient r = 0.59, P = .0001) and negatively correlated with patient age (correlation coefficient r = -0.53, P = .0001). These two determinants remained significant after multivariate adjustment of the following variables: preoperative ocular deviation; age; gender; spherical equivalent refraction; and number of previous strabismus surgeries (model r² = 0.49, P = .0001). The ocular position under general anesthesia was reported as a key factor in the surgical treatment of subjects with esotropia; therefore, its clinical determinants were assessed. The authors observed that preoperative ocular deviation and patient age were the main factors that influenced the ocular position under general anesthesia. Copyright 2013, SLACK Incorporated.

  17. The Covariance Adjustment Approaches for Combining Incomparable Cox Regressions Caused by Unbalanced Covariates Adjustment: A Multivariate Meta-Analysis Study.

    PubMed

    Dehesh, Tania; Zare, Najaf; Ayatollahi, Seyyed Mohammad Taghi

    2015-01-01

    The univariate meta-analysis (UM) procedure, as a technique that provides a single overall result, has become increasingly popular. Neglecting the existence of other concomitant covariates in the models leads to loss of treatment efficiency. Our aim was to propose four new approximation approaches for the covariance matrix of the coefficients, which is not readily available for the multivariate generalized least squares (MGLS) method as a multivariate meta-analysis approach. We evaluated the efficiency of four new approaches including zero correlation (ZC), common correlation (CC), estimated correlation (EC), and multivariate multilevel correlation (MMC) on the estimation bias, mean square error (MSE), and 95% probability coverage of the confidence interval (CI) in the synthesis of Cox proportional hazard models coefficients in a simulation study. Comparing the results of the simulation study on the MSE, bias, and CI of the estimated coefficients indicated that the MMC approach was the most accurate procedure compared to the EC, CC, and ZC procedures. The precision ranking of the four approaches according to all above settings was MMC ≥ EC ≥ CC ≥ ZC. This study highlights advantages of MGLS meta-analysis over the UM approach. The results suggested the use of the MMC procedure to overcome the lack of information for having a complete covariance matrix of the coefficients.
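
    To make the role of the approximated covariance matrix concrete, here is a minimal sketch of fixed-effect generalized least squares pooling of coefficient vectors, with each study's covariance matrix rebuilt from its reported standard errors and one assumed correlation. Setting the correlation to zero mimics the ZC idea and a shared non-zero value mimics CC; the EC and MMC approaches are not reproduced. The helper names `approx_cov` and `gls_pool` and all numbers are hypothetical.

```python
import numpy as np


def approx_cov(se, rho):
    """Covariance matrix from standard errors and one assumed correlation."""
    se = np.asarray(se, dtype=float)
    corr = np.full((len(se), len(se)), float(rho))
    np.fill_diagonal(corr, 1.0)
    return corr * np.outer(se, se)


def gls_pool(coefs, covs):
    """Fixed-effect GLS pooling: precision-weighted mean of coefficient vectors."""
    precisions = [np.linalg.inv(np.asarray(c, dtype=float)) for c in covs]
    pooled_cov = np.linalg.inv(sum(precisions))
    weighted = sum(p @ np.asarray(b, dtype=float)
                   for p, b in zip(precisions, coefs))
    return pooled_cov @ weighted, pooled_cov


# Hypothetical log hazard ratios (two covariates) from three studies.
coefs = [np.array([0.40, -0.10]),
         np.array([0.55, -0.25]),
         np.array([0.30, -0.05])]
ses = [np.array([0.10, 0.08]),
       np.array([0.15, 0.12]),
       np.array([0.12, 0.09])]

pooled_zc, cov_zc = gls_pool(coefs, [approx_cov(s, 0.0) for s in ses])
pooled_cc, cov_cc = gls_pool(coefs, [approx_cov(s, 0.5) for s in ses])
print("ZC pooled coefficients:", pooled_zc)
print("CC pooled coefficients:", pooled_cc)
```

    With zero correlation the pooled vector reduces to separate inverse-variance weighted means, which is why the choice of correlation only matters once the off-diagonal terms are non-zero.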

  18. The Fourier decomposition method for nonlinear and non-stationary time series analysis.

    PubMed

    Singh, Pushpendra; Joshi, Shiv Dutt; Patney, Rakesh Kumar; Saha, Kaushik

    2017-03-01

    For many decades, there has been a general perception in the literature that Fourier methods are not suitable for the analysis of nonlinear and non-stationary data. In this paper, we propose a novel and adaptive Fourier decomposition method (FDM), based on the Fourier theory, and demonstrate its efficacy for the analysis of nonlinear and non-stationary time series. The proposed FDM decomposes any data into a small number of 'Fourier intrinsic band functions' (FIBFs). The FDM presents a generalized Fourier expansion with variable amplitudes and variable frequencies of a time series by the Fourier method itself. We propose an idea of zero-phase filter bank-based multivariate FDM (MFDM), for the analysis of multivariate nonlinear and non-stationary time series, using the FDM. We also present an algorithm to obtain cut-off frequencies for MFDM. The proposed MFDM generates a finite number of band-limited multivariate FIBFs (MFIBFs). The MFDM preserves some intrinsic physical properties of the multivariate data, such as scale alignment, trend and instantaneous frequency. The proposed methods provide a time-frequency-energy (TFE) distribution that reveals the intrinsic structure of the data. Numerical computations and simulations have been carried out and comparison is made with the empirical mode decomposition algorithms.

  19. The Fourier decomposition method for nonlinear and non-stationary time series analysis

    PubMed Central

    Joshi, Shiv Dutt; Patney, Rakesh Kumar; Saha, Kaushik

    2017-01-01

    For many decades, there has been a general perception in the literature that Fourier methods are not suitable for the analysis of nonlinear and non-stationary data. In this paper, we propose a novel and adaptive Fourier decomposition method (FDM), based on the Fourier theory, and demonstrate its efficacy for the analysis of nonlinear and non-stationary time series. The proposed FDM decomposes any data into a small number of ‘Fourier intrinsic band functions’ (FIBFs). The FDM presents a generalized Fourier expansion with variable amplitudes and variable frequencies of a time series by the Fourier method itself. We propose an idea of zero-phase filter bank-based multivariate FDM (MFDM), for the analysis of multivariate nonlinear and non-stationary time series, using the FDM. We also present an algorithm to obtain cut-off frequencies for MFDM. The proposed MFDM generates a finite number of band-limited multivariate FIBFs (MFIBFs). The MFDM preserves some intrinsic physical properties of the multivariate data, such as scale alignment, trend and instantaneous frequency. The proposed methods provide a time–frequency–energy (TFE) distribution that reveals the intrinsic structure of the data. Numerical computations and simulations have been carried out and comparison is made with the empirical mode decomposition algorithms. PMID:28413352

  20. New robust bilinear least squares method for the analysis of spectral-pH matrix data.

    PubMed

    Goicoechea, Héctor C; Olivieri, Alejandro C

    2005-07-01

    A new second-order multivariate method has been developed for the analysis of spectral-pH matrix data, based on a bilinear least-squares (BLLS) model achieving the second-order advantage and handling multiple calibration standards. A simulated Monte Carlo study of synthetic absorbance-pH data allowed comparison of the newly proposed BLLS methodology with constrained parallel factor analysis (PARAFAC) and with the combination multivariate curve resolution-alternating least-squares (MCR-ALS) technique under different conditions of sample-to-sample pH mismatch and analyte-background ratio. The results indicate an improved prediction ability for the new method. Experimental data generated by measuring absorption spectra of several calibration standards of ascorbic acid and samples of orange juice were subjected to second-order calibration analysis with PARAFAC, MCR-ALS, and the new BLLS method. The results indicate that the latter method provides the best analytical results in regard to analyte recovery in samples of complex composition requiring strict adherence to the second-order advantage. Linear dependencies appear when multivariate data are produced by using the pH or a reaction time as one of the data dimensions, posing a challenge to classical multivariate calibration models. The presently discussed algorithm is useful for these latter systems.

  1. Multivariate evaluation of Thyroid Imaging Reporting and Data System (TI-RADS) in diagnosis malignant thyroid nodule: application to PCA and PLS-DA analysis.

    PubMed

    Zhang, Tan; Li, Fangxuan; Mu, Jiali; Liu, Juntian; Zhang, Sheng

    2017-06-01

    To explore the significance of ultrasonic features in the differential diagnosis of thyroid nodules by combining the Thyroid Imaging Reporting and Data System (TI-RADS) with multivariate statistical analysis. Patients who received surgical treatment and were diagnosed with a single thyroid nodule by postoperative pathology and preoperative ultrasound were enrolled in this study. Multivariate analysis was applied to identify the ultrasonic features that correlated with distinguishing benign from malignant nodules and with grading the TI-RADS classification of thyroid nodules. There were significant differences in the nodule size, aspect ratio, internal, echogenicity, boundary, presence or absence of calcifications, calcification type and CDFI between benign and malignant thyroid nodules. Multivariate analysis showed a clear-cut distinction both between benign and malignant nodules and among the different TI-RADS categories of malignant nodules. The shape and calcification of the nodule were important factors for distinguishing benign from malignant nodules. The height, aspect and calcification of the nodule were important factors for grading the TI-RADS categories of malignant thyroid nodules. An ill-defined boundary, irregular shape and presence of calcification were related to a high risk of malignancy in thyroid nodules. Larger height and aspect and the presence of calcification were related to a higher TI-RADS classification of malignant thyroid nodules.

  2. Semiparametric Thurstonian Models for Recurrent Choices: A Bayesian Analysis

    ERIC Educational Resources Information Center

    Ansari, Asim; Iyengar, Raghuram

    2006-01-01

    We develop semiparametric Bayesian Thurstonian models for analyzing repeated choice decisions involving multinomial, multivariate binary or multivariate ordinal data. Our modeling framework has multiple components that together yield considerable flexibility in modeling preference utilities, cross-sectional heterogeneity and parameter-driven…

  3. The use of multivariate statistics in studies of wildlife habitat

    Treesearch

    David E. Capen

    1981-01-01

    This report contains edited and reviewed versions of papers presented at a workshop held at the University of Vermont in April 1980. Topics include sampling avian habitats, multivariate methods, applications, examples, and new approaches to analysis and interpretation.

  4. Rejection of Multivariate Outliers.

    DTIC Science & Technology

    1983-05-01

    available in Gnanadesikan (1977). 2 The motivation for the present investigation lies in a recent paper of Schwager and Margolin (1982) who derive a... Gnanadesikan, R. (1977). Methods for Statistical Data Analysis of Multivariate Observations. Wiley, New York. [7] Hawkins, D.M. (1980). Identification of

  5. Applications of modern statistical methods to analysis of data in physical science

    NASA Astrophysics Data System (ADS)

    Wicker, James Eric

    Modern methods of statistical and computational analysis offer solutions to dilemmas confronting researchers in physical science. Although the ideas behind modern statistical and computational analysis methods were originally introduced in the 1970's, most scientists still rely on methods written during the early era of computing. These researchers, who analyze increasingly voluminous and multivariate data sets, need modern analysis methods to extract the best results from their studies. The first section of this work showcases applications of modern linear regression. Since the 1960's, many researchers in spectroscopy have used classical stepwise regression techniques to derive molecular constants. However, problems with thresholds of entry and exit for model variables plague this analysis method. Other criticisms of this kind of stepwise procedure include its inefficient searching method, the order in which variables enter or leave the model and problems with overfitting data. We implement an information scoring technique that overcomes the assumptions inherent in the stepwise regression process to calculate molecular model parameters. We believe that this kind of information based model evaluation can be applied to more general analysis situations in physical science. The second section proposes new methods of multivariate cluster analysis. The K-means algorithm and the EM algorithm, introduced in the 1960's and 1970's respectively, formed the basis of multivariate cluster analysis methodology for many years. However, shortcomings of these methods include strong dependence on initial seed values and inaccurate results when the data seriously depart from hypersphericity. We propose new cluster analysis methods based on genetic algorithms that overcome the strong dependence on initial seed values.
    In addition, we propose a generalization of the Genetic K-means algorithm which can accurately identify clusters with complex hyperellipsoidal covariance structures. We then use this new algorithm in a genetic algorithm based Expectation-Maximization process that can accurately calculate parameters describing complex clusters in a mixture model routine. Using the accuracy of this GEM algorithm, we assign information scores to cluster calculations in order to best identify the number of mixture components in a multivariate data set. We will showcase how these algorithms can be used to process multivariate data from astronomical observations.

  6. Regional magnetic resonance imaging measures for multivariate analysis in Alzheimer's disease and mild cognitive impairment.

    PubMed

    Westman, Eric; Aguilar, Carlos; Muehlboeck, J-Sebastian; Simmons, Andrew

    2013-01-01

    Automated structural magnetic resonance imaging (MRI) processing pipelines are gaining popularity for Alzheimer's disease (AD) research. They generate regional volumes, cortical thickness measures and other measures, which can be used as input for multivariate analysis. It is not clear which combination of measures and normalization approach is most useful for AD classification and for predicting mild cognitive impairment (MCI) conversion. The current study includes MRI scans from 699 subjects [AD, MCI and controls (CTL)] from the Alzheimer's Disease Neuroimaging Initiative (ADNI). The Freesurfer pipeline was used to generate regional volume, cortical thickness, gray matter volume, surface area, mean curvature, Gaussian curvature, folding index and curvature index measures. 259 variables were used for orthogonal partial least squares to latent structures (OPLS) multivariate analysis. Normalisation approaches were explored and the optimal combination of measures determined. Results indicate that cortical thickness measures should not be normalized, while volumes should probably be normalized by intracranial volume (ICV). Combining regional cortical thickness measures (not normalized) with cortical and subcortical volumes (normalized with ICV) using OPLS gave a prediction accuracy of 91.5 % when distinguishing AD versus CTL. This model prospectively predicted future decline from MCI to AD with 75.9 % of converters correctly classified. Normalization strategy did not have a significant effect on the accuracies of multivariate models containing multiple MRI measures for this large dataset. The appropriate choice of input for multivariate analysis in AD and MCI is of great importance. The results support the use of un-normalised cortical thickness measures and volumes normalised by ICV.

  7. Clinical Trials With Large Numbers of Variables: Important Advantages of Canonical Analysis.

    PubMed

    Cleophas, Ton J

    2016-01-01

    Canonical analysis assesses the combined effects of a set of predictor variables on a set of outcome variables, but it is little used in clinical trials despite the omnipresence of multiple variables. The aim of this study was to assess the performance of canonical analysis as compared with traditional multivariate methods using multivariate analysis of covariance (MANCOVA). As an example, a simulated data file with 12 gene expression levels and 4 drug efficacy scores was used. The correlation coefficient between the 12 predictor and 4 outcome variables was 0.87 (P = 0.0001), meaning that 76% of the variability in the outcome variables was explained by the 12 covariates. Repeated testing after the removal of 5 unimportant predictor variables and 1 outcome variable produced virtually the same overall result. The MANCOVA identified identical unimportant variables, but it was unable to provide overall statistics. (1) Canonical analysis is remarkable, because it can handle many more variables than traditional multivariate methods such as MANCOVA can. (2) At the same time, it accounts for the relative importance of the separate variables, their interactions and differences in units. (3) Canonical analysis provides overall statistics of the effects of sets of variables, whereas traditional multivariate methods only provide the statistics of the separate variables. (4) Unlike other methods for combining the effects of multiple variables such as factor analysis/partial least squares, canonical analysis is scientifically entirely rigorous. (5) Limitations include that it is less flexible than factor analysis/partial least squares, because only 2 sets of variables are used and because multiple solutions instead of one are offered. We do hope that this article will stimulate clinical investigators to start using this remarkable method.
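
    The overall statistic described above is a canonical correlation, and the quoted 76% is its square (0.87² ≈ 0.76). A minimal sketch of how canonical correlations can be computed with NumPy follows, using QR decompositions of the centered data and a singular value decomposition. The function name `canonical_correlations` and the simulated 12-predictor, 4-outcome data set are assumptions for illustration only; they are not the study's data, and the sketch assumes both data matrices have full column rank.

```python
import numpy as np


def canonical_correlations(x, y):
    """Canonical correlations between two variable sets (rows = subjects).

    Assumes more rows than columns and full column rank in both sets.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    xc = x - x.mean(axis=0)
    yc = y - y.mean(axis=0)
    # Orthonormal bases for the column spaces of the centered data.
    qx, _ = np.linalg.qr(xc)
    qy, _ = np.linalg.qr(yc)
    # Singular values of Qx' Qy are the canonical correlations.
    s = np.linalg.svd(qx.T @ qy, compute_uv=False)
    return np.clip(s, 0.0, 1.0)


# Hypothetical simulated data: 12 "gene expression" predictors, 4 "efficacy" outcomes.
rng = np.random.default_rng(0)
n_subjects = 250
genes = rng.normal(size=(n_subjects, 12))
weights = rng.normal(size=(12, 4))
efficacy = genes @ weights + rng.normal(scale=3.0, size=(n_subjects, 4))

rho = canonical_correlations(genes, efficacy)
print("canonical correlations:", np.round(rho, 3))
print("squared first canonical correlation:", round(float(rho[0] ** 2), 3))
```

    With 4 outcome variables there are 4 canonical correlations, which corresponds to the "multiple solutions" limitation noted in point (5).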

  8. Multivariate analysis of fatty acid and biochemical constitutes of seaweeds to characterize their potential as bioresource for biofuel and fine chemicals.

    PubMed

    Verma, Priyanka; Kumar, Manoj; Mishra, Girish; Sahoo, Dinabandhu

    2017-02-01

    In the present study, thirty seaweeds from Indian coasts were bioprospected and analyzed for their biochemical components, including pigments, fatty acids and ash content. Multivariate analysis of biochemical components and fatty acids was done using Principal Component Analysis (PCA) and Agglomerative hierarchical clustering (AHC) to reveal chemotaxonomic relationships among the seaweeds. The overall analysis suggests that these seaweeds have multi-functional properties and can be utilized as a promising bioresource of proteins, lipids, pigments and carbohydrates for the food/feed and biofuel industry. Copyright © 2016. Published by Elsevier Ltd.

  9. Interpretability of Multivariate Brain Maps in Linear Brain Decoding: Definition, and Heuristic Quantification in Multivariate Analysis of MEG Time-Locked Effects.

    PubMed

    Kia, Seyed Mostafa; Vega Pons, Sandro; Weisz, Nathan; Passerini, Andrea

    2016-01-01

    Brain decoding is a popular multivariate approach for hypothesis testing in neuroimaging. Linear classifiers are widely employed in the brain decoding paradigm to discriminate among experimental conditions. Then, the derived linear weights are visualized in the form of multivariate brain maps to further study spatio-temporal patterns of underlying neural activities. It is well known that the brain maps derived from weights of linear classifiers are hard to interpret because of high correlations between predictors, low signal to noise ratios, and the high dimensionality of neuroimaging data. Therefore, improving the interpretability of brain decoding approaches is of primary interest in many neuroimaging studies. Despite extensive studies of this type, at present, there is no formal definition for interpretability of multivariate brain maps. As a consequence, there is no quantitative measure for evaluating the interpretability of different brain decoding methods. In this paper, first, we present a theoretical definition of interpretability in brain decoding; we show that the interpretability of multivariate brain maps can be decomposed into their reproducibility and representativeness. Second, as an application of the proposed definition, we exemplify a heuristic for approximating the interpretability in multivariate analysis of evoked magnetoencephalography (MEG) responses. Third, we propose to combine the approximated interpretability and the generalization performance of the brain decoding into a new multi-objective criterion for model selection. Our results, for the simulated and real MEG data, show that optimizing the hyper-parameters of the regularized linear classifier based on the proposed criterion results in more informative multivariate brain maps. 
    More importantly, the presented definition provides the theoretical background for quantitative evaluation of interpretability, and hence, facilitates the development of more effective brain decoding algorithms in the future.

  10. Interpretability of Multivariate Brain Maps in Linear Brain Decoding: Definition, and Heuristic Quantification in Multivariate Analysis of MEG Time-Locked Effects

    PubMed Central

    Kia, Seyed Mostafa; Vega Pons, Sandro; Weisz, Nathan; Passerini, Andrea

    2017-01-01

    Brain decoding is a popular multivariate approach for hypothesis testing in neuroimaging. Linear classifiers are widely employed in the brain decoding paradigm to discriminate among experimental conditions. Then, the derived linear weights are visualized in the form of multivariate brain maps to further study spatio-temporal patterns of underlying neural activities. It is well known that the brain maps derived from weights of linear classifiers are hard to interpret because of high correlations between predictors, low signal to noise ratios, and the high dimensionality of neuroimaging data. Therefore, improving the interpretability of brain decoding approaches is of primary interest in many neuroimaging studies. Despite extensive studies of this type, at present, there is no formal definition for interpretability of multivariate brain maps. As a consequence, there is no quantitative measure for evaluating the interpretability of different brain decoding methods. In this paper, first, we present a theoretical definition of interpretability in brain decoding; we show that the interpretability of multivariate brain maps can be decomposed into their reproducibility and representativeness. Second, as an application of the proposed definition, we exemplify a heuristic for approximating the interpretability in multivariate analysis of evoked magnetoencephalography (MEG) responses. Third, we propose to combine the approximated interpretability and the generalization performance of the brain decoding into a new multi-objective criterion for model selection. Our results, for the simulated and real MEG data, show that optimizing the hyper-parameters of the regularized linear classifier based on the proposed criterion results in more informative multivariate brain maps. 
    More importantly, the presented definition provides the theoretical background for quantitative evaluation of interpretability, and hence, facilitates the development of more effective brain decoding algorithms in the future. PMID:28167896

  11. A mixed model framework for teratology studies.

    PubMed

    Braeken, Johan; Tuerlinckx, Francis

    2009-10-01

    A mixed model framework is presented to model the characteristic multivariate binary anomaly data as provided in some teratology studies. The key features of the model are the incorporation of covariate effects, a flexible random effects distribution by means of a finite mixture, and the application of copula functions to better account for the relation structure of the anomalies. The framework is motivated by data of the Boston Anticonvulsant Teratogenesis study and offers an integrated approach to investigate substantive questions, concerning general and anomaly-specific exposure effects of covariates, interrelations between anomalies, and objective diagnostic measurement.

  12. On the Theory of Multivariate Elliptically Contoured Distributions and Their Applications.

    DTIC Science & Technology

    1982-05-01

    elliptically contoured distributions has been studied by several authors: Schoenberg (1938), Kelker (1970), Devlin, Gnanadesikan and Kettenring (1976...theory of elliptically contoured distributions, J. Multivariate Analysis, 11, 368-385. Devlin, S. J., Gnanadesikan, R., and Kettenring, J. R. (1976

  13. Preoperative Nutritional Status as an Adjunct Predictor of Major Postoperative Complications Following Anterior Cervical Discectomy and Fusion.

    PubMed

    Fu, Michael C; Buerba, Rafael A; Grauer, Jonathan N

    2016-05-01

    Retrospective analysis of the National Surgical Quality Improvement Program (NSQIP), a prospectively collected multicenter surgical outcomes database. To determine the effect of preoperative nutritional status, as measured by serum albumin concentration, on outcomes following anterior cervical discectomy and fusion (ACDF). Nutritional status has been shown to be an important predictor of postoperative recovery and outcomes. Serum albumin concentration is an established marker of overall nutrition and systemic disease, however, its correlation to outcomes following ACDF is unknown. ACDF cases from 2005 to 2010 were identified in the NSQIP and categorized by preoperative serum albumin: normal (≥3.5 g/dL), hypoalbuminemic (<3.5 g/dL), or not measured. Independent demographic and comorbidity variables were assessed, including American Society of Anesthesiologists (ASA) classification. Risk factors for major postoperative complications were identified, including preoperative hypoalbuminemia, and incorporated into a multivariable logistic regression model to determine the strength of preoperative hypoalbuminemia as an adjusted predictor of major postoperative complications. There were 3671 ACDF cases, of which 1382 (37.6%) had preoperative albumin measurements. Patients with albumin measurements were older and more likely to have higher ASA class, hypertension, and diabetes. Hypoalbuminemic patients had higher rates of having any major postoperative complication(s), specifically pulmonary complications, cardiac complications, and reoperation, relative to those with normal albumin (all P<0.01). These patients also had longer lengths of stay (5.0 vs. 1.9 d). With multivariable regression, preoperative hypoalbuminemia was a strong independent predictor of major postoperative complications, with an adjusted odds ratio of 3.37 (P=0.003). 
    In this analysis of a prospective surgical outcomes database, preoperative serum hypoalbuminemia was an important adjunct predictor of major complications following ACDF. In high-risk patients with multiple medical comorbidities, we recommend that clinicians consider nutritional screening and optimization as part of preoperative risk assessment.
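
    To make the adjusted odds ratio of 3.37 more concrete, the sketch below shows how an odds ratio rescales a baseline probability. The abstract does not report a baseline complication rate, so the 5% used here is purely hypothetical, as is the function name `apply_odds_ratio`. The sketch also rechecks the reported share of cases with albumin measurements (1382 of 3671).

```python
import math


def apply_odds_ratio(baseline_prob: float, odds_ratio: float) -> float:
    """Convert a baseline probability to the probability implied by an odds ratio."""
    if not 0.0 < baseline_prob < 1.0:
        raise ValueError("baseline probability must lie strictly between 0 and 1")
    odds = baseline_prob / (1.0 - baseline_prob) * odds_ratio
    return odds / (1.0 + odds)


ADJUSTED_OR = 3.37                  # reported in the abstract
log_odds = math.log(ADJUSTED_OR)    # the corresponding logistic regression coefficient

hypothetical_baseline = 0.05        # assumption for illustration only, not from the study
risk = apply_odds_ratio(hypothetical_baseline, ADJUSTED_OR)

measured_share = 1382 / 3671        # cases with a preoperative albumin measurement
print(f"coefficient = {log_odds:.3f}, illustrative risk = {risk:.3f}, "
      f"measured = {measured_share:.1%}")
```

    The point of the sketch is that an odds ratio is not a risk ratio: under the assumed 5% baseline, the implied risk is roughly 15%, not 3.37 times 5%.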

  14. Towards a contemporary, comprehensive scoring system for determining technical outcomes of hybrid percutaneous chronic total occlusion treatment: The RECHARGE score.

    PubMed

    Maeremans, Joren; Spratt, James C; Knaapen, Paul; Walsh, Simon; Agostoni, Pierfrancesco; Wilson, William; Avran, Alexandre; Faurie, Benjamin; Bressollette, Erwan; Kayaert, Peter; Bagnall, Alan J; Smith, Dave; McEntegart, Margaret B; Smith, William H T; Kelly, Paul; Irving, John; Smith, Elliot J; Strange, Julian W; Dens, Jo

    2018-02-01

    This study sought to create a contemporary scoring tool to predict technical outcomes of chronic total occlusion (CTO) percutaneous coronary intervention (PCI) from patients treated by hybrid operators with differing experience levels. Current scoring systems need regular updating to cope with the positive evolutions regarding materials, techniques, and outcomes, while at the same time being applicable for a broad range of operators. Clinical and angiographic characteristics from 880 CTO-PCIs included in the REgistry of CrossBoss and Hybrid procedures in FrAnce, the NetheRlands, BelGium and UnitEd Kingdom (RECHARGE) were analyzed by using a derivation and validation set (2:1 ratio). Variables significantly associated with technical failure in the multivariable analysis were incorporated in the score. Subsequently, the discriminatory capacity was assessed and the validation set was used to compare with the J-CTO score and PROGRESS scores. Technical success in the derivation and validation sets was 83% and 85%, respectively. Multivariate analysis identified six parameters associated with technical failure: blunt stump (beta coefficient (b) = 1.014); calcification (b = 0.908); tortuosity ≥45° (b = 0.964); lesion length 20 mm (b = 0.556); diseased distal landing zone (b = 0.794), and previous bypass graft on CTO vessel (b = 0.833). Score variables remained significant after bootstrapping. The RECHARGE score showed better discriminatory capacity in both sets (area-under-the-curve (AUC) = 0.783 and 0.711), compared to the J-CTO (AUC = 0.676) and PROGRESS (AUC = 0.608) scores. The RECHARGE score is a novel, easy-to-use tool for assessing the risk for technical failure in hybrid CTO-PCI and has the potential to perform well for a broad community of operators. © 2017 Wiley Periodicals, Inc.
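
    The six beta coefficients above are log-odds contributions from a logistic model, so they can be exponentiated into odds ratios and summed into a relative linear predictor. The sketch below does only that. It is not the published RECHARGE point scheme: the abstract gives neither the model intercept nor the points assigned per variable, so no absolute failure probability is computed. The dictionary keys and function names are hypothetical, and the lesion-length threshold is reproduced as printed, without a comparison operator.

```python
import math

# Beta coefficients as listed in the abstract (keys are illustrative labels).
BETAS = {
    "blunt_stump": 1.014,
    "calcification": 0.908,
    "tortuosity_ge_45deg": 0.964,
    "lesion_length_20mm": 0.556,
    "diseased_distal_landing_zone": 0.794,
    "previous_bypass_graft_on_cto_vessel": 0.833,
}


def odds_ratio(factor: str) -> float:
    """Odds ratio for technical failure implied by one coefficient."""
    return math.exp(BETAS[factor])


def relative_log_odds(present_factors) -> float:
    """Sum of coefficients for the factors present (intercept omitted)."""
    return sum(BETAS[name] for name in present_factors)


lesion = ["blunt_stump", "calcification"]       # hypothetical lesion profile
print(f"relative log-odds = {relative_log_odds(lesion):.3f}")
print(f"odds ratio, blunt stump = {odds_ratio('blunt_stump'):.2f}")
```

    Because the intercept is omitted, the sum is only meaningful for ranking lesions against one another.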

  15. Evaluation of biomolecular distributions in rat brain tissues by means of ToF-SIMS using a continuous beam of Ar clusters.

    PubMed

    Nakano, Shusuke; Yokoyama, Yuta; Aoyagi, Satoka; Himi, Naoyuki; Fletcher, John S; Lockyer, Nicholas P; Henderson, Alex; Vickerman, John C

    2016-06-08

    Time-of-flight secondary ion mass spectrometry (ToF-SIMS) provides detailed chemical structure information and high spatial resolution images. Therefore, ToF-SIMS is useful for studying biological phenomena such as ischemia. In this study, in order to evaluate cerebral microinfarction, the distribution of biomolecules generated by ischemia was measured with ToF-SIMS. ToF-SIMS data sets were analyzed by means of multivariate analysis for interpreting complex samples containing unknown information and to obtain biomolecular mapping indicated by fragment ions from the target biomolecules. Using conventional ToF-SIMS (primary ion source: Bi cluster ion), it is difficult to detect secondary ions beyond approximately 1000 u. Moreover, the intensity of secondary ions related to biomolecules is not always high enough for imaging because of low concentration even if the masses are lower than 1000 u. However, for the observation of biomolecular distributions in tissues, it is important to detect low amounts of biological molecules from a particular area of tissue. Rat brain tissue samples were measured with ToF-SIMS (J105, Ionoptika, Ltd., Chandlers Ford, UK), using a continuous beam of Ar clusters as a primary ion source. ToF-SIMS with Ar clusters efficiently detects secondary ions related to biomolecules and larger molecules. Molecules detected by ToF-SIMS were examined by analyzing ToF-SIMS data using multivariate analysis. Microspheres (45 μm diameter) were injected into the rat unilateral internal carotid artery (MS rat) to cause cerebral microinfarction. The rat brain was sliced and then measured with ToF-SIMS. The brain samples of a normal rat and the MS rat were examined to find specific secondary ions related to important biomolecules, and then the difference between them was investigated. Finally, specific secondary ions were found around vessels incorporating microspheres in the MS rat. 
    The results suggest that important biomolecules related to cerebral microinfarction can be detected by ToF-SIMS.

  16. Changes in Case-Mix and Health Outcomes of Medicare Fee-for-Service Beneficiaries and Managed Care Enrollees During the Years 1992-2011.

    PubMed

    Koroukian, Siran M; Basu, Jayasree; Schiltz, Nicholas K; Navale, Suparna; Bakaki, Paul M; Warner, David F; Dor, Avi; Given, Charles W; Stange, Kurt C

    2018-01-01

    Recent studies suggest that managed care enrollees (MCEs) and fee-for-service beneficiaries (FFSBs) have become similar in case-mix over time; but comparisons of health outcomes have yielded mixed results. To examine changes in differentials between MCEs and FFSBs both in case-mix and health outcomes over time. Temporal study of the linked Health and Retirement Study (HRS) and Medicare data, comparing case-mix and health outcomes between MCEs and FFSBs across 3 time periods: 1992-1998, 1999-2004, and 2005-2011. We used multivariable analysis, stratified by, and pooled across the study periods. The unit of analysis was the person-wave (n=167,204). HRS participants who were also enrolled in Medicare. Outcome measures included self-reported fair/poor health, 2-year self-rated worse health, and 2-year mortality. Our main covariate was a composite measure of multimorbidity (MM), MM0-MM3, defined as the co-occurrence of chronic conditions, functional limitations, and/or geriatric syndromes. The case-mix differential between MCEs and FFSBs persisted over time. Results from multivariable models on the pooled data and incorporating interaction terms between managed care status and study period indicated that MCEs and FFSBs were as likely to die within 2 years from the HRS interview (P=0.073). This likelihood remained unchanged across the study periods. However, MCEs were more likely than FFSBs to report fair/poor health in the third study period (change in probability for the interaction term: 0.024, P=0.008), but less likely to rate their health worse in the last 2 years, albeit at borderline significance (change in probability: -0.021, P=0.059). Despite the persistence of selection bias, the differential in self-reported fair/poor status between MCEs and FFSBs seems to be closing over time.

  17. Multivariate analysis of heavy metal contamination using river sediment cores of Nankan River, northern Taiwan

    NASA Astrophysics Data System (ADS)

    Lee, An-Sheng; Lu, Wei-Li; Huang, Jyh-Jaan; Chang, Queenie; Wei, Kuo-Yen; Lin, Chin-Jung; Liou, Sofia Ya Hsuan

    2016-04-01

    Owing to Taiwan's geology and climate, its rivers generally carry large amounts of suspended particles. After these particles settle, they become sediments, which are good sorbents for heavy metals in the river system. Consequently, sediments record the contamination footprint in low-flow-energy regions such as estuaries. Seven sediment cores were collected along the Nankan River, northern Taiwan, which is seriously contaminated by factory, household and agricultural inputs. Physico-chemical properties of these cores were derived from the Itrax-XRF Core Scanner and grain size analysis. To interpret these complex data matrices, multivariate statistical techniques (cluster analysis, factor analysis and discriminant analysis) were applied. The statistical analysis indicates four types of sediment. One of them represents a contamination event, showing high concentrations of Cu, Zn, Pb, Ni and Fe and low concentrations of Si and Zr. Furthermore, three possible contamination sources of this type of sediment were revealed by factor analysis. The combination of sediment analysis and multivariate statistical techniques provides new insights into the contamination depositional history of the Nankan River and could be similarly applied to other river systems to determine the scale of anthropogenic contamination.

  18. Multivariate Curve Resolution Applied to Infrared Reflection Measurements of Soil Contaminated with an Organophosphorus Analyte

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gallagher, Neal B.; Blake, Thomas A.; Gassman, Paul L.

    2006-07-01

    Multivariate curve resolution (MCR) is a powerful technique for extracting chemical information from measured spectra on complex mixtures. The difficulty with applying MCR to soil reflectance measurements is that light scattering artifacts can contribute much more variance to the measurements than the analyte(s) of interest. Two methods were integrated into a MCR decomposition to account for light scattering effects. Firstly, an extended mixture model using pure analyte spectra augmented with scattering ‘spectra’ was used for the measured spectra. And secondly, second derivative preprocessed spectra, which have higher selectivity than the unprocessed spectra, were included in a second block as a part of the decomposition. The conventional alternating least squares (ALS) algorithm was modified to simultaneously decompose the measured and second derivative spectra in a two-block decomposition. Equality constraints were also included to incorporate information about sampling conditions. The result was an MCR decomposition that provided interpretable spectra from soil reflectance measurements.

  19. Integrating Growth Variability of the Ilium, Fifth Lumbar Vertebra, and Clavicle with Multivariate Adaptive Regression Splines Models for Subadult Age Estimation.

    PubMed

    Corron, Louise; Marchal, François; Condemi, Silvana; Telmon, Norbert; Chaumoitre, Kathia; Adalian, Pascal

    2018-05-31

    Subadult age estimation should rely on sampling and statistical protocols capturing development variability for more accurate age estimates. In this perspective, measurements were taken on the fifth lumbar vertebrae and/or clavicles of 534 French males and females aged 0-19 years and the ilia of 244 males and females aged 0-12 years. These variables were fitted in nonparametric multivariate adaptive regression splines (MARS) models with 95% prediction intervals (PIs) of age. The models were tested on two independent samples from Marseille and the Luis Lopes reference collection from Lisbon. Models using ilium width and module, maximum clavicle length, and lateral vertebral body heights were more than 92% accurate. Precision was lower for postpubertal individuals. Integrating punctual nonlinearities of the relationship between age and the variables and dynamic prediction intervals incorporated the normal increase in interindividual growth variability (heteroscedasticity of variance) with age for more biologically accurate predictions. © 2018 American Academy of Forensic Sciences.

  20. Nonstationary multivariate modeling of cerebral autoregulation during hypercapnia.

    PubMed

    Kostoglou, Kyriaki; Debert, Chantel T; Poulin, Marc J; Mitsis, Georgios D

    2014-05-01

    We examined the time-varying characteristics of cerebral autoregulation and hemodynamics during a step hypercapnic stimulus by using recursively estimated multivariate (two-input) models which quantify the dynamic effects of mean arterial blood pressure (ABP) and end-tidal CO2 tension (PETCO2) on middle cerebral artery blood flow velocity (CBFV). Beat-to-beat values of ABP and CBFV, as well as breath-to-breath values of PETCO2 during baseline and sustained euoxic hypercapnia were obtained in 8 female subjects. The multiple-input, single-output models used were based on the Laguerre expansion technique, and their parameters were updated using recursive least squares with multiple forgetting factors. The results reveal the presence of nonstationarities that confirm previously reported effects of hypercapnia on autoregulation, i.e. a decrease in the MABP phase lead, and suggest that the incorporation of PETCO2 as an additional model input yields less time-varying estimates of dynamic pressure autoregulation obtained from single-input (ABP-CBFV) models. Copyright © 2013 IPEM. Published by Elsevier Ltd. All rights reserved.
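
    Recursive least squares with a forgetting factor, the updating scheme named in the abstract, can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' model: it uses a single forgetting factor rather than multiple ones, omits the Laguerre expansion, and runs on synthetic noise-free data with made-up coefficients. The names `rls_update` and `simulate` are hypothetical.

```python
import math


def rls_update(theta, P, x, y, lam=0.95):
    """One recursive-least-squares step with exponential forgetting.

    theta: current parameter estimates (list of n floats)
    P:     current inverse-correlation matrix (n x n list of lists, symmetric)
    x:     regressor vector for this sample (list of n floats)
    y:     observed output for this sample
    lam:   forgetting factor in (0, 1]; smaller values discount old data faster
    """
    n = len(theta)
    Px = [sum(P[i][j] * x[j] for j in range(n)) for i in range(n)]
    denom = lam + sum(x[i] * Px[i] for i in range(n))
    gain = [v / denom for v in Px]
    error = y - sum(theta[i] * x[i] for i in range(n))
    theta_new = [theta[i] + gain[i] * error for i in range(n)]
    # P <- (P - gain * x^T P) / lam; x^T P equals Px transposed since P is symmetric.
    P_new = [[(P[i][j] - gain[i] * Px[j]) / lam for j in range(n)] for i in range(n)]
    return theta_new, P_new


def simulate(n_samples=400, switch_at=200, lam=0.95):
    """Track a two-input model whose first coefficient changes midway."""
    theta = [0.0, 0.0]
    P = [[1000.0, 0.0], [0.0, 1000.0]]  # large initial P: weak prior on theta
    for k in range(n_samples):
        true = [2.0, -1.0] if k < switch_at else [0.5, -1.0]  # synthetic values
        x = [math.sin(0.3 * k), math.cos(0.7 * k)]            # two synthetic inputs
        y = true[0] * x[0] + true[1] * x[1]
        theta, P = rls_update(theta, P, x, y, lam)
    return theta


theta = simulate()
print([round(v, 3) for v in theta])
```

    With a forgetting factor below 1, samples from before the change are progressively discounted, so the final estimate follows the later coefficients rather than an average over the whole record.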

  1. Remote-sensing data processing with the multivariate regression analysis method for iron mineral resource potential mapping: a case study in the Sarvian area, central Iran

    NASA Astrophysics Data System (ADS)

    Mansouri, Edris; Feizi, Faranak; Jafari Rad, Alireza; Arian, Mehran

    2018-03-01

    This paper uses multivariate regression for mineral prospectivity mapping (MPM) to create a mathematical model for iron skarn exploration in the Sarvian area, central Iran. The main target of this paper is to apply multivariate regression analysis (as an MPM method) to map iron outcrops in the northeastern part of the study area in order to discover new iron deposits in other parts of the study area. Two types of multivariate regression models using two linear equations were employed to discover new mineral deposits. This method is one of the reliable methods for processing satellite images. ASTER satellite images (14 bands) were used as unique independent variables (UIVs), and iron outcrops were mapped as dependent variables for MPM. According to the results of the probability value (p value), coefficient of determination value (R²) and adjusted determination coefficient (R²adj), the second regression model (which consisted of multiple UIVs) fitted better than other models. The accuracy of the model was confirmed by the iron outcrop map and geological observation. Based on field observation, iron mineralization occurs at the contact of limestone and intrusive rocks (skarn type).

  2. Voxelwise multivariate analysis of multimodality magnetic resonance imaging

    PubMed Central

    Naylor, Melissa G.; Cardenas, Valerie A.; Tosun, Duygu; Schuff, Norbert; Weiner, Michael; Schwartzman, Armin

    2015-01-01

    Most brain magnetic resonance imaging (MRI) studies concentrate on a single MRI contrast or modality, frequently structural MRI. By performing an integrated analysis of several modalities, such as structural, perfusion-weighted, and diffusion-weighted MRI, new insights may be attained to better understand the underlying processes of brain diseases. We compare two voxelwise approaches: (1) fitting multiple univariate models, one for each outcome and then adjusting for multiple comparisons among the outcomes and (2) fitting a multivariate model. In both cases, adjustment for multiple comparisons is performed over all voxels jointly to account for the search over the brain. The multivariate model is able to account for the multiple comparisons over outcomes without assuming independence because the covariance structure between modalities is estimated. Simulations show that the multivariate approach is more powerful when the outcomes are correlated and, even when the outcomes are independent, the multivariate approach is just as powerful or more powerful when at least two outcomes are dependent on predictors in the model. However, multiple univariate regressions with Bonferroni correction remains a desirable alternative in some circumstances. To illustrate the power of each approach, we analyze a case control study of Alzheimer's disease, in which data from three MRI modalities are available. PMID:23408378
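
    Approach (1), separate univariate tests followed by a correction across outcomes, can be illustrated with a Bonferroni adjustment at a single voxel. The p-values and modality labels below are invented for illustration, and the function name `bonferroni_reject` is hypothetical. The sketch covers only the correction across outcomes, not the joint adjustment over voxels that the abstract also describes.

```python
def bonferroni_reject(p_values, alpha=0.05):
    """Return one reject/keep decision per test, using threshold alpha / m."""
    m = len(p_values)
    if m == 0:
        return []
    threshold = alpha / m
    return [p <= threshold for p in p_values]


# Hypothetical per-modality p-values at one voxel (not from the study).
p_by_modality = {"structural": 0.004, "perfusion": 0.030, "diffusion": 0.012}

decisions = dict(zip(p_by_modality, bonferroni_reject(list(p_by_modality.values()))))
print(decisions)
```

    With three outcomes the per-test threshold is 0.05 / 3, so a p-value of 0.030 that would pass an uncorrected test is no longer rejected. Bonferroni does not use the correlation between outcomes, which is the information the multivariate model in approach (2) estimates.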

  3. Multi-Fault Diagnosis of Rolling Bearings via Adaptive Projection Intrinsically Transformed Multivariate Empirical Mode Decomposition and High Order Singular Value Decomposition

    PubMed Central

    Lv, Yong; Song, Gangbing

    2018-01-01

    Rolling bearings are important components in rotary machinery systems. In the field of multi-fault diagnosis of rolling bearings, the vibration signal collected from single channels tends to miss some fault characteristic information. Using multiple sensors to collect signals at different locations on the machine to obtain multivariate signal can remedy this problem. The adverse effect of a power imbalance between the various channels is inevitable, and unfavorable for multivariate signal processing. As a useful, multivariate signal processing method, Adaptive-projection has intrinsically transformed multivariate empirical mode decomposition (APIT-MEMD), and exhibits better performance than MEMD by adopting adaptive projection strategy in order to alleviate power imbalances. The filter bank properties of APIT-MEMD are also adopted to enable more accurate and stable intrinsic mode functions (IMFs), and to ease mode mixing problems in multi-fault frequency extractions. By aligning IMF sets into a third order tensor, high order singular value decomposition (HOSVD) can be employed to estimate the fault number. The fault correlation factor (FCF) analysis is used to conduct correlation analysis, in order to determine effective IMFs; the characteristic frequencies of multi-faults can then be extracted. Numerical simulations and the application of multi-fault situation can demonstrate that the proposed method is promising in multi-fault diagnoses of multivariate rolling bearing signal. PMID:29659510

  4. Multi-Fault Diagnosis of Rolling Bearings via Adaptive Projection Intrinsically Transformed Multivariate Empirical Mode Decomposition and High Order Singular Value Decomposition.

    PubMed

    Yuan, Rui; Lv, Yong; Song, Gangbing

    2018-04-16

    Rolling bearings are important components in rotary machinery systems. In the field of multi-fault diagnosis of rolling bearings, the vibration signal collected from single channels tends to miss some fault characteristic information. Using multiple sensors to collect signals at different locations on the machine to obtain multivariate signal can remedy this problem. The adverse effect of a power imbalance between the various channels is inevitable, and unfavorable for multivariate signal processing. As a useful, multivariate signal processing method, Adaptive-projection has intrinsically transformed multivariate empirical mode decomposition (APIT-MEMD), and exhibits better performance than MEMD by adopting adaptive projection strategy in order to alleviate power imbalances. The filter bank properties of APIT-MEMD are also adopted to enable more accurate and stable intrinsic mode functions (IMFs), and to ease mode mixing problems in multi-fault frequency extractions. By aligning IMF sets into a third order tensor, high order singular value decomposition (HOSVD) can be employed to estimate the fault number. The fault correlation factor (FCF) analysis is used to conduct correlation analysis, in order to determine effective IMFs; the characteristic frequencies of multi-faults can then be extracted. Numerical simulations and the application of multi-fault situation can demonstrate that the proposed method is promising in multi-fault diagnoses of multivariate rolling bearing signal.

  5. Selective sensing of vapors of similar dielectric constants using peptide-capped gold nanoparticles on individual multivariable transducers.

    PubMed

    Nagraj, Nandini; Slocik, Joseph M; Phillips, David M; Kelley-Loughnane, Nancy; Naik, Rajesh R; Potyrailo, Radislav A

    2013-08-07

    Peptide-capped AYSSGAPPMPPF gold nanoparticles were demonstrated for highly selective chemical vapor sensing using individual multivariable inductor-capacitor-resistor (LCR) resonators. Their multivariable response was achieved by measuring their resonance impedance spectra followed by multivariate spectral analysis. Detection of model toxic vapors and chemical agent simulants, such as acetonitrile, dichloromethane and methyl salicylate, was performed. Dichloromethane (dielectric constant εr = 9.1) and methyl salicylate (εr = 9.0) were discriminated using a single sensor. These sensing materials coupled to multivariable transducers can provide numerous opportunities for tailoring the vapor response selectivity based on the diversity of the amino acid composition of the peptides, and by the modulation of the nature of peptide-nanoparticle interactions through designed combinations of hydrophobic and hydrophilic amino acids.

  6. Application of Maxent Multivariate Analysis to Define Climate-Change Effects on Species Distributions and Changes

    DTIC Science & Technology

    2014-09-01

    approaches. Ecological Modelling Volume 200, Issues 1–2, 10, pp 1–19. Buhlmann, Kurt A., Thomas S.B. Akre, John B. Iverson, Deno Karapatakis, Russell A ...statistical multivariate analysis to define the current and projected future range probability for species of interest to Army land managers. A software...15 Figure 4. RCW omission rate and predicted area as a function of the cumulative threshold

  7. Structural analysis and design of multivariable control systems: An algebraic approach

    NASA Technical Reports Server (NTRS)

    Tsay, Yih Tsong; Shieh, Leang-San; Barnett, Stephen

    1988-01-01

    The application of algebraic system theory to the design of controllers for multivariable (MV) systems is explored analytically using an approach based on state-space representations and matrix-fraction descriptions. Chapters are devoted to characteristic lambda matrices and canonical descriptions of MIMO systems; spectral analysis, divisors, and spectral factors of nonsingular lambda matrices; feedback control of MV systems; and structural decomposition theories and their application to MV control systems.

  8. SOCIAL AND PSYCHOLOGICAL PREDICTORS OF INFORMATION SEEKING AND MEDIA USE, A MULTIVARIATE RE-ANALYSIS. REPORT. PAPER PRESENTED AT THE NATIONAL SEMINAR ON ADULT EDUCATION RESEARCH (CHICAGO, FEBRUARY 11-13, 1968).

    ERIC Educational Resources Information Center

    PAISLEY, WILLIAM J.; REES, MATILDA B.

    USING DATA FROM A STANFORD UNIVERSITY STUDY IN FRESNO, CALIFORNIA, A MULTIVARIATE ANALYSIS WAS MADE OF 25 MEDIA USE AND INFORMATION SEEKING BEHAVIORS. SEVEN SOCIAL-PERSONAL AND THREE PSYCHOLOGICAL VARIABLES WERE ALSO CONSIDERED. YOUNGER ADULTS WERE MOST LIKELY TO PARTICIPATE IN ADULT EDUCATION, ESPECIALLY VOCATIONAL COURSES AND EVENING CLASSES AND…

  9. Causal diagrams and multivariate analysis I: a quiver full of arrows.

    PubMed

    Jupiter, Daniel C

    2014-01-01

    How do we know which variables we should include in our multivariate analyses? What role does each variable play in our understanding of the analysis? In this article I begin a discussion of these issues and describe 2 different types of studies for which this problem must be handled in different ways. Copyright © 2014 American College of Foot and Ankle Surgeons. Published by Elsevier Inc. All rights reserved.

  10. NONPARAMETRIC MANOVA APPROACHES FOR NON-NORMAL MULTIVARIATE OUTCOMES WITH MISSING VALUES

    PubMed Central

    He, Fanyin; Mazumdar, Sati; Tang, Gong; Bhatia, Triptish; Anderson, Stewart J.; Dew, Mary Amanda; Krafty, Robert; Nimgaonkar, Vishwajit; Deshpande, Smita; Hall, Martica; Reynolds, Charles F.

    2017-01-01

    Between-group comparisons often entail many correlated response variables. The multivariate linear model, with its assumption of multivariate normality, is the accepted standard tool for these tests. When this assumption is violated, the nonparametric multivariate Kruskal-Wallis (MKW) test is frequently used. However, this test requires complete cases with no missing values in response variables. Deletion of cases with missing values likely leads to inefficient statistical inference. Here we extend the MKW test to retain information from partially-observed cases. Results of simulated studies and analysis of real data show that the proposed method provides adequate coverage and superior power to complete-case analyses. PMID:29416225

  11. Authentication of Trappist beers by LC-MS fingerprints and multivariate data analysis.

    PubMed

    Mattarucchi, Elia; Stocchero, Matteo; Moreno-Rojas, José Manuel; Giordano, Giuseppe; Reniero, Fabiano; Guillou, Claude

    2010-12-08

    The aim of this study was to assess the applicability of LC-MS profiling to authenticate a selected Trappist beer as part of a program on traceability funded by the European Commission. A total of 232 beers were fingerprinted and classified through multivariate data analysis. The selected beer was clearly distinguished from beers of different brands, while only 3 samples (3.5% of the test set) were wrongly classified when compared with other types of beer of the same Trappist brewery. The fingerprints were further analyzed to extract the most discriminating variables, which proved to be sufficient for classification, even using a simplified unsupervised model. This reduced fingerprint allowed us to study the influence of batch-to-batch variability on the classification model. Our results can easily be applied to different matrices and they confirmed the effectiveness of LC-MS profiling in combination with multivariate data analysis for the characterization of food products.

  12. Implementation of quality by design principles in the development of microsponges as drug delivery carriers: Identification and optimization of critical factors using multivariate statistical analyses and design of experiments studies.

    PubMed

    Simonoska Crcarevska, Maja; Dimitrovska, Aneta; Sibinovska, Nadica; Mladenovska, Kristina; Slavevska Raicki, Renata; Glavas Dodov, Marija

    2015-07-15

    Microsponges drug delivery system (MDDC) was prepared by double emulsion-solvent-diffusion technique using rotor-stator homogenization. Quality by design (QbD) concept was implemented for the development of MDDC with potential to be incorporated into semisolid dosage form (gel). Quality target product profile (QTPP) and critical quality attributes (CQA) were defined and identified, accordingly. Critical material attributes (CMA) and Critical process parameters (CPP) were identified using quality risk management (QRM) tool, failure mode, effects and criticality analysis (FMECA). CMA and CPP were identified based on results obtained from principal component analysis (PCA-X&Y) and partial least squares (PLS) statistical analysis along with literature data, product and process knowledge and understanding. FMECA identified amount of ethylcellulose, chitosan, acetone, dichloromethane, span 80, tween 80 and water ratio in primary/multiple emulsions as CMA and rotation speed and stirrer type used for organic solvent removal as CPP. The relationship between identified CPP and particle size as CQA was described in the design space using design of experiments - one-factor response surface method. Obtained results from statistically designed experiments enabled establishment of mathematical models and equations that were used for detailed characterization of influence of identified CPP upon MDDC particle size and particle size distribution and their subsequent optimization. Copyright © 2015 Elsevier B.V. All rights reserved.

  13. FuryExplorer: visual-interactive exploration of horse motion capture data

    NASA Astrophysics Data System (ADS)

    Wilhelm, Nils; Vögele, Anna; Zsoldos, Rebeka; Licka, Theresia; Krüger, Björn; Bernard, Jürgen

    2015-01-01

    The analysis of equine motion has a long tradition in the past of mankind. Equine biomechanics aims at detecting characteristics of horses indicative of good performance. Especially, veterinary medicine gait analysis plays an important role in diagnostics and in the emerging research of long-term effects of athletic exercises. More recently, the incorporation of motion capture technology contributed to an easier and faster analysis, with a trend from mere observation of horses towards the analysis of multivariate time-oriented data. However, due to the novelty of this topic being raised within an interdisciplinary context, there is yet a lack of visual-interactive interfaces to facilitate time series data analysis and information discourse for the veterinary and biomechanics communities. In this design study, we bring visual analytics technology into the respective domains, which, to our best knowledge, was never approached before. Based on requirements developed in the domain characterization phase, we present a visual-interactive system for the exploration of horse motion data. The system provides multiple views which enable domain experts to explore frequent poses and motions, but also to drill down to interesting subsets, possibly containing unexpected patterns. We show the applicability of the system in two exploratory use cases, one on the comparison of different gait motions, and one on the analysis of lameness recovery. Finally, we present the results of a summative user study conducted in the environment of the domain experts. The overall outcome was a significant improvement in effectiveness and efficiency in the analytical workflow of the domain experts.

  14. Multivariate time series clustering on geophysical data recorded at Mt. Etna from 1996 to 2003

    NASA Astrophysics Data System (ADS)

    Di Salvo, Roberto; Montalto, Placido; Nunnari, Giuseppe; Neri, Marco; Puglisi, Giuseppe

    2013-02-01

    Time series clustering is an important task in data analysis issues in order to extract implicit, previously unknown, and potentially useful information from a large collection of data. Finding useful similar trends in multivariate time series represents a challenge in several areas including geophysics environment research. While traditional time series analysis methods deal only with univariate time series, multivariate time series analysis is a more suitable approach in the field of research where different kinds of data are available. Moreover, the conventional time series clustering techniques do not provide desired results for geophysical datasets due to the huge amount of data whose sampling rate is different according to the nature of signal. In this paper, a novel approach concerning geophysical multivariate time series clustering is proposed using dynamic time series segmentation and Self Organizing Maps techniques. This method allows finding coupling among trends of different geophysical data recorded from monitoring networks at Mt. Etna spanning from 1996 to 2003, when the transition from summit eruptions to flank eruptions occurred. This information can be used to carry out a more careful evaluation of the state of volcano and to define potential hazard assessment at Mt. Etna.

  15. Variation of heavy metals in recent sediments from Piratininga Lagoon (Brazil): interpretation of geochemical data with the aid of multivariate analysis

    NASA Astrophysics Data System (ADS)

    Huang, W.; Campredon, R.; Abrao, J. J.; Bernat, M.; Latouche, C.

    1994-06-01

    In the last decade, the Atlantic coast of south-eastern Brazil has been affected by increasing deforestation and anthropogenic effluents. Sediments in the coastal lagoons have recorded the process of such environmental change. Thirty-seven sediment samples from three cores in Piratininga Lagoon, Rio de Janeiro, were analyzed for their major components and minor element concentrations in order to examine geochemical characteristics and the depositional environment and to investigate the variation of heavy metals of environmental concern. Two multivariate analysis methods, principal component analysis and cluster analysis, were performed on the analytical data set to help visualize the sample clusters and the element associations. On the whole, the sediment samples from each core are similar and the sample clusters corresponding to the three cores are clearly separated, as a result of the different conditions of sedimentation. Some changes in the depositional environment are recognized using the results of multivariate analysis. The enrichment of Pb, Cu, and Zn in the upper parts of cores is in agreement with increasing anthropogenic influx (pollution).

  16. MULTIVARIATE ANALYSIS OF DRINKING BEHAVIOUR IN A RURAL POPULATION

    PubMed Central

    Mathrubootham, N.; Bashyam, V.S.P.; Shahjahan

    1997-01-01

    This study was carried out to determine the drinking pattern in a rural population, using multivariate techniques. 386 current users identified in a community were assessed with regard to their drinking behaviours using a structured interview. For purposes of the study the questions were condensed into 46 meaningful variables. In bivariate analysis, 14 variables including dependent variables such as dependence, MAST & CAGE (measuring alcoholic status), Q.F. Index and troubled drinking were found to be significant. Using these variables, further multivariate techniques such as ANOVA, correlation, regression analysis and factor analysis were carried out using both SPSS PC+ and an HCL magnum mainframe computer with the FOCUS package and UNIX systems. Results revealed that a number of factors such as drinking style, duration of drinking, pattern of abuse, Q.F. Index and various problems influenced drinking and some of them set up a vicious circle. Factor analysis revealed mainly 3 factors: abuse, dependence and social drinking. Dependence could be divided into low/moderate dependence. The implications and practical applications of these tests are also discussed. PMID:21584077

  17. Estuarial fingerprinting through multidimensional fluorescence and multivariate analysis.

    PubMed

    Hall, Gregory J; Clow, Kerin E; Kenny, Jonathan E

    2005-10-01

    As part of a strategy for preventing the introduction of aquatic nuisance species (ANS) to U.S. estuaries, ballast water exchange (BWE) regulations have been imposed. Enforcing these regulations requires a reliable method for determining the port of origin of water in the ballast tanks of ships entering U.S. waters. This study shows that a three-dimensional fluorescence fingerprinting technique, excitation emission matrix (EEM) spectroscopy, holds great promise as a ballast water analysis tool. In our technique, EEMs are analyzed by multivariate classification and curve resolution methods, such as N-way partial least squares Regression-discriminant analysis (NPLS-DA) and parallel factor analysis (PARAFAC). We demonstrate that classification techniques can be used to discriminate among sampling sites less than 10 miles apart, encompassing Boston Harbor and two tributaries in the Mystic River Watershed. To our knowledge, this work is the first to use multivariate analysis to classify water as to location of origin. Furthermore, it is shown that curve resolution can show seasonal features within the multidimensional fluorescence data sets, which correlate with difficulty in classification.

  18. Morbidity and Mortality Associated with Geriatric Ankle Fractures: A Medicare Part A Claims Database Analysis.

    PubMed

    Hsu, Raymond Y; Lee, Yoojin; Hayda, Roman; DiGiovanni, Christopher W; Mor, Vincent; Bariteau, Jason T

    2015-11-04

    The purpose of this study was to examine the incidence of adverse events in elderly patients who required inpatient admission after sustaining an ankle fracture and to consider these data in relation to geriatric hip fracture and other geriatric patient admissions. A retrospective cohort study of patients admitted with an ankle fracture, a hip fracture, or any other diagnosis was performed with the Medicare Part A database for 2008. The primary outcome measure was the one-year mortality rate, examined with multivariate analysis factoring for both patient age and preexisting comorbidity. Secondary outcome measures analyzed additional morbidity as reflected by length of stay, discharge disposition, readmissions, and medical complications. There were 19,648 patients with ankle fractures, 193,980 patients with hip fractures, and 5,801,831 patients with other admitting diagnoses. Significant differences (p < 0.001) were noted in both age and comorbidity status between the group with ankle fractures and the group with hip fractures. The one-year mortality after admission was 11.9% for patients with ankle fracture, 28.2% for patients with hip fracture, and 21.5% for patients with any other admission. Upon using multivariate analysis to account for both age and comorbidity, the hazard ratio for one-year mortality associated with fracture was 1.088 for patients with hip fracture and 0.557 for patients with ankle fracture. Even after selecting for admitted patients and accounting for both age and comorbidity, geriatric patients with ankle fractures were found to have a lower one-year morbidity compared with geriatric patients who had sustained a hip fracture or alternative admitting diagnoses. Geriatric patients with ankle fractures are likely healthier and more active in ways that are not captured by simply accounting for age and comorbidity. These findings may support more aggressive definitive management of such injuries in this population. Prognostic Level III. 
See Instructions for Authors for a complete description of levels of evidence. Copyright © 2015 by The Journal of Bone and Joint Surgery, Incorporated.

  19. Predictors of frequency of condom use and attitudes among sexually active female military personnel in Nigeria

    PubMed Central

    Essien, E James; Mgbere, Osaro; Monjok, Emmanuel; Ekong, Ernest; Abughosh, Susan; Holstad, Marcia M

    2010-01-01

    Background Despite awareness of condom efficacy in protecting against both human immunodeficiency virus/sexually transmitted diseases (HIV/STDs) and unintended pregnancy, some females find it difficult to use or permit condom use consistently because of the power imbalances or other dynamics operating in their relationships with males. The purpose of this study was to determine the factors that predict the frequency of condom use and attitudes among sexually active female military personnel in Nigeria. Methods This study used a cross-sectional design in which a total of 346 responses were obtained from consenting female military personnel in two cantonments in Southwestern Nigeria between 2006 and 2008. The study instrument was designed to assess HIV/acquired immunodeficiency syndrome (AIDS) knowledge (HAK), HIV risk behaviors (HRB), alcohol and drug use, condom attitudes and barriers (CAB), condom use self-efficacy (CUS) and social support to condom use (SSC). The sociodemographic characteristics of participants were also captured. Univariate analysis and multivariable logistic regression were used for modeling the predictors of condom use. Results The results showed that 63% of the respondents reported using condoms always, 26% sometimes used condoms and 11% never used condoms during a sexual encounter in the past three months. Univariate analysis revealed that significant associations existed between CAB (P < 0.05), HRB (P < 0.01) and SSC (P < 0.01) with the frequency of condom use. The following sociodemographic variables: age, marital status, number of children, employment status and type of sexual relationship were also significantly (P ≤ 0.05) associated with consistent condom use in the study group. Multivariate analysis indicated that marital status, type of relationship and CAB were the only significant predictors (r2 = 0.37; P ≤ 0.05) of condom use behaviors after adjusting for all other factors in the model. 
Conclusions Findings indicate that consistent condom use could be enhanced through gender-specific intervention programs that incorporate the predictor variables identified. These are likely to be successful in decreasing sexual risk behaviors in the subpopulation. PMID:22096387

  20. Predictors of Stroke and Coma After Neurosurgery: An ACS-NSQIP Analysis.

    PubMed

    Larsen, Alexandra M G; Cote, David J; Karhade, Aditya V; Smith, Timothy R

    2016-09-01

    The American College of Surgeons National Surgical Quality Improvement Program database aims to reduce 30-day postoperative complications. Reduction of postoperative stroke and coma can decrease length and cost of hospitalization, improve patient functional status, and decrease morbidity and mortality. We performed a search of the American College of Surgeons National Surgical Quality Improvement Program database for all patients from 2006 to 2013 undergoing an operation with a surgeon whose primary specialty was neurologic surgery. Of 94,546 neurosurgical patients reported, there were 687 (0.73%) cases of postoperative stroke and coma. The annual rate of coma longer than 24 hours decreased from 0.90% in 2006 to 0.002% in 2013 (P < 0.001), and the annual rate of stroke decreased from 1.2% in 2006 to 0.5% in 2013 (P = 0.013). Multivariate analysis showed that inpatient status (P = 0.001; odds ratio [OR], 30.3), age (P = 0.005; OR, 1.012), history of diabetes (P = 0.017; OR, 1.515), ventilator dependence (P < 0.001; OR, 4.379), impaired sensorium (P < 0.001; OR, 2.314), history of coma longer than 24 hours (P < 0.001; OR, 2.655), hemiparesis (P = 0.022; OR, 1.492), cerebrovascular accident/stroke with neurologic deficit (P < 0.001; OR, 2.091), cerebrovascular accident/stroke without neurologic deficit (P = 0.001; OR, 2.44), and tumor involving central nervous system (P < 0.001; OR, 2.928) are significant risk factors for developing postneurosurgical stroke and coma. The rate of postneurosurgical stroke decreased from 1.2% in 2006 to 0.5% in 2013 and the rate of postneurosurgical coma greater than 24 hours decreased from 0.9% in 2006 to 0.002% in 2013. Ten risk factors for developing postneurosurgical stroke and coma were identified using multivariable analysis. 
These risk factors should be assessed preoperatively and incorporated into clinical decision making so that individuals who are at higher risk for the development of stroke and coma can be appropriately monitored during the postoperative period. Copyright © 2016 Elsevier Inc. All rights reserved.

  1. Spatial-temporal analysis of the risk of Rift Valley Fever in Kenya

    NASA Astrophysics Data System (ADS)

    Bett, B.; Omolo, A.; Hansen, F.; Notenbaert, A.; Kemp, S.

    2012-04-01

    Historical data on Rift Valley Fever (RVF) outbreaks in Kenya covering the period 1951 - 2010 were analyzed using a logistic regression model to identify factors associated with RVF occurrence. The analysis used a division, an administrative unit below a district, as the unit of analysis. The infection status of each division was defined on a monthly time scale and used as a dependent variable. Predictors investigated include: monthly precipitation (minimum, maximum and total), normalized difference vegetation index, altitude, agro-ecological zone, presence of game, livestock and human population densities, the number of times a division has had an outbreak before and time interval in months between successive outbreaks (used as a proxy for immunity). Both univariable and multivariable analyses were conducted. The models used incorporated an auto-regressive correlation matrix to account for clustering of observations in time, while dummy variables were fitted in the multivariable model to account for spatial relatedness/topology between divisions. This last procedure was followed because it is expected that the risk of RVF occurring in a given division increases when its immediate neighbor gets infected. Functional relationships between the continuous and the outcome variables were assessed to ensure that the linearity assumption was met. Deviance and leverage residuals were also generated from the final model and used for evaluating the goodness of fit of the model. Descriptive analyses indicate that a total of 91 divisions in 42 districts (of the original 69 districts in place by 1999) reported RVF outbreaks at least once over the period. The mean interval between outbreaks was determined to be about 43 months. Factors that were positively associated with RVF occurrence include increased precipitation, high outbreak interval and the number of times a division has been infected or reported an outbreak. 
The model will be validated and used for developing an RVF forecasting system. This forecasting system can then be used with the existing regional RVF prediction tools such as EMPRES-i to downscale RVF risk predictions to country-specific scales and subsequently link them with decision support systems. The ultimate aim is to increase the capacity of the national institutions to formulate appropriate RVF mitigation measures.

  2. Variable Importance in Multivariate Group Comparisons.

    ERIC Educational Resources Information Center

    Huberty, Carl J.; Wisenbaker, Joseph M.

    1992-01-01

    Interpretations of relative variable importance in multivariate analysis of variance are discussed, with attention to (1) latent construct definition; (2) linear discriminant function scores; and (3) grouping variable effects. Two numerical ranking methods are proposed and compared by the bootstrap approach using two real data sets. (SLD)

  3. Prediction of processing tomato peeling outcomes

    USDA-ARS's Scientific Manuscript database

    Peeling outcomes of processing tomatoes were predicted using multivariate analysis of Magnetic Resonance (MR) images. Tomatoes were obtained from a whole-peel production line. Each fruit was imaged using a 7 Tesla MR system, and a multivariate data set was created from 28 different images. After ...

  4. mvMapper: statistical and geographical data exploration and visualization of multivariate analysis of population structure

    USDA-ARS's Scientific Manuscript database

    Characterizing population genetic structure across geographic space is a fundamental challenge in population genetics. Multivariate statistical analyses are powerful tools for summarizing genetic variability, but geographic information and accompanying metadata is not always easily integrated into t...

  5. Multiple Imputation for Multivariate Missing-Data Problems: A Data Analyst's Perspective.

    ERIC Educational Resources Information Center

    Schafer, Joseph L.; Olsen, Maren K.

    1998-01-01

    The key ideas of multiple imputation for multivariate missing data problems are reviewed. Software programs available for this analysis are described, and their use is illustrated with data from the Adolescent Alcohol Prevention Trial (W. Hansen and J. Graham, 1991). (SLD)
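    As an illustration of the pooling step in multiple imputation, the sketch below combines estimates from several imputed data sets with Rubin's rules: the pooled estimate is the mean of the per-imputation estimates, and the total variance is the average within-imputation variance plus (1 + 1/m) times the between-imputation variance. The function name `pool_rubin` and the numbers are hypothetical and are not taken from the record above or from the software it describes.

```python
from statistics import mean, variance


def pool_rubin(estimates, variances):
    """Pool m completed-data estimates and their variances with Rubin's rules."""
    m = len(estimates)
    if m < 2 or m != len(variances):
        raise ValueError("need at least two imputations with matching variances")
    q_bar = mean(estimates)        # pooled point estimate
    within = mean(variances)       # average within-imputation variance
    between = variance(estimates)  # between-imputation variance (m - 1 denominator)
    total = within + (1 + 1 / m) * between
    return q_bar, total


# Hypothetical coefficient estimates and squared standard errors from m = 5 imputations
est = [0.50, 0.54, 0.47, 0.52, 0.49]
var = [0.010, 0.011, 0.009, 0.010, 0.012]
pooled, total_var = pool_rubin(est, var)
print(f"pooled estimate = {pooled:.3f}, standard error = {total_var ** 0.5:.3f}")
```

    The between-imputation term is what carries the extra uncertainty due to the missing data; analysing a single imputed data set would omit it.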

  6. MULTIVARIATE LINEAR MIXED MODELS FOR MULTIPLE OUTCOMES. (R824757)

    EPA Science Inventory

    We propose a multivariate linear mixed model (MLMM) for the analysis of multiple outcomes, which generalizes the latent variable model of Sammel and Ryan. The proposed model assumes a flexible correlation structure among the multiple outcomes, and allows a global test of the impact of ...

  7. Extending Inferential Group Analysis in Type 2 Diabetic Patients with Multivariate GLM Implemented in SPM8.

    PubMed

    Ferreira, Fábio S; Pereira, João M S; Duarte, João V; Castelo-Branco, Miguel

    2017-01-01

    Although voxel based morphometry studies are still the standard for analyzing brain structure, their dependence on massive univariate inferential methods is a limiting factor. A better understanding of brain pathologies can be achieved by applying inferential multivariate methods, which allow the study of multiple dependent variables, e.g. different imaging modalities of the same subject. Given the widespread use of SPM software in the brain imaging community, the main aim of this work is the implementation of massive multivariate inferential analysis as a toolbox in this software package, applied to the use of T1 and T2 structural data from diabetic patients and controls. This implementation was compared with the traditional ANCOVA in SPM and a similar multivariate GLM toolbox (MRM). We implemented the new toolbox and tested it by investigating brain alterations on a cohort of twenty-eight type 2 diabetes patients and twenty-six matched healthy controls, using information from both T1 and T2 weighted structural MRI scans, both separately - using standard univariate VBM - and simultaneously, with multivariate analyses. Univariate VBM replicated predominantly bilateral changes in basal ganglia and insular regions in type 2 diabetes patients. On the other hand, multivariate analyses replicated key findings of univariate results, while also revealing the thalami as additional foci of pathology. While the presented algorithm must be further optimized, the proposed toolbox is the first implementation of multivariate statistics in SPM8 as a user-friendly toolbox, which shows great potential and is ready to be validated in other clinical cohorts and modalities.

  8. Extending Inferential Group Analysis in Type 2 Diabetic Patients with Multivariate GLM Implemented in SPM8

    PubMed Central

    Ferreira, Fábio S.; Pereira, João M.S.; Duarte, João V.; Castelo-Branco, Miguel

    2017-01-01

    Background: Although voxel based morphometry studies are still the standard for analyzing brain structure, their dependence on massive univariate inferential methods is a limiting factor. A better understanding of brain pathologies can be achieved by applying inferential multivariate methods, which allow the study of multiple dependent variables, e.g. different imaging modalities of the same subject. Objective: Given the widespread use of SPM software in the brain imaging community, the main aim of this work is the implementation of massive multivariate inferential analysis as a toolbox in this software package, applied to the use of T1 and T2 structural data from diabetic patients and controls. This implementation was compared with the traditional ANCOVA in SPM and a similar multivariate GLM toolbox (MRM). Method: We implemented the new toolbox and tested it by investigating brain alterations on a cohort of twenty-eight type 2 diabetes patients and twenty-six matched healthy controls, using information from both T1 and T2 weighted structural MRI scans, both separately - using standard univariate VBM - and simultaneously, with multivariate analyses. Results: Univariate VBM replicated predominantly bilateral changes in basal ganglia and insular regions in type 2 diabetes patients. On the other hand, multivariate analyses replicated key findings of univariate results, while also revealing the thalami as additional foci of pathology. Conclusion: While the presented algorithm must be further optimized, the proposed toolbox is the first implementation of multivariate statistics in SPM8 as a user-friendly toolbox, which shows great potential and is ready to be validated in other clinical cohorts and modalities. PMID:28761571

  9. Refined composite multivariate generalized multiscale fuzzy entropy: A tool for complexity analysis of multichannel signals

    NASA Astrophysics Data System (ADS)

    Azami, Hamed; Escudero, Javier

    2017-01-01

    Multiscale entropy (MSE) is an appealing tool to characterize the complexity of time series over multiple temporal scales. Recent developments in the field have tried to extend the MSE technique in different ways. Building on these trends, we propose the so-called refined composite multivariate multiscale fuzzy entropy (RCmvMFE) whose coarse-graining step uses variance (RCmvMFEσ²) or mean (RCmvMFEμ). We investigate the behavior of these multivariate methods on multichannel white Gaussian and 1/f noise signals, and two publicly available biomedical recordings. Our simulations demonstrate that RCmvMFEσ² and RCmvMFEμ lead to more stable results and are less sensitive to the signals' length in comparison with the other existing multivariate multiscale entropy-based methods. The classification results also show that using both the variance and mean in the coarse-graining step offers complexity profiles with complementary information for biomedical signal analysis. We also made freely available all the Matlab codes used in this paper.
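    The coarse-graining step mentioned in this abstract can be sketched roughly as follows: each channel is cut into non-overlapping windows whose length equals the scale factor, and every window is replaced by its mean or its variance. This is a minimal illustration only, not the authors' Matlab code; the function names are hypothetical, the use of the population variance is an assumption, and the refined composite averaging over shifted window starts and the fuzzy entropy computation itself are omitted.

```python
from statistics import mean, pvariance


def coarse_grain(series, scale, stat="mean"):
    """Replace each non-overlapping window of length `scale` by its mean or variance."""
    if stat not in ("mean", "variance"):
        raise ValueError("stat must be 'mean' or 'variance'")
    if scale < 1 or (stat == "variance" and scale < 2):
        raise ValueError("scale too small for the requested statistic")
    reducer = mean if stat == "mean" else pvariance  # population variance: an assumption
    n_windows = len(series) // scale  # trailing samples that do not fill a window are dropped
    return [reducer(series[i * scale:(i + 1) * scale]) for i in range(n_windows)]


def coarse_grain_multichannel(channels, scale, stat="mean"):
    """Apply the same coarse-graining to every channel of a multichannel signal."""
    return [coarse_grain(channel, scale, stat) for channel in channels]


# Hypothetical two-channel signal, coarse-grained at scale 2
signal = [[1, 3, 2, 4, 6, 8, 5], [0, 0, 2, 2, 4, 4, 9]]
means = coarse_grain_multichannel(signal, 2, "mean")
variances = coarse_grain_multichannel(signal, 2, "variance")
print(means, variances)
```

    An entropy estimate would then be computed on each coarse-grained signal, giving one value per scale factor.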

  10. Multivariate analysis of prognostic factors for idiopathic sudden sensorineural hearing loss treated with adjuvant hyperbaric oxygen therapy.

    PubMed

    Xie, Shaobing; Qiang, Qingfen; Mei, Lingyun; He, Chufeng; Feng, Yong; Sun, Hong; Wu, Xuewen

    2018-01-01

    The objective of this study is to evaluate possible prognostic factors of idiopathic sudden sensorineural hearing loss (ISSNHL) treated with adjuvant hyperbaric oxygen therapy (HBOT) using univariate and multivariate analyses. From January 2008 to October 2016, records of 178 ISSNHL patients treated with auxiliary hyperbaric oxygen therapy were reviewed to assess hearing recovery and evaluate associated prognostic factors (gender, age, localization, initial hearing threshold, presence of tinnitus, vertigo, ear fullness, hypertension, diabetes, onset of HBOT, number of HBOT, and audiogram), by using univariate and multivariate analyses. The overall recovery rate was 37.1%, including complete recovery (19.7%) and partial recovery (17.4%). According to multivariate analysis, later onset of HBOT and higher initial hearing threshold were associated with a poor prognosis in ISSNHL patients treated with HBOT. HBOT is a safe and beneficial adjuvant therapy for ISSNHL patients. 20 sessions of HBOT is possibly enough to show its therapeutic effect. Earlier HBOT onset and lower initial hearing threshold is associated with favorable hearing recovery.

  11. Combination of multivariate curve resolution and multivariate classification techniques for comprehensive high-performance liquid chromatography-diode array absorbance detection fingerprints analysis of Salvia reuterana extracts.

    PubMed

    Hakimzadeh, Neda; Parastar, Hadi; Fattahi, Mohammad

    2014-01-24

    In this study, multivariate curve resolution (MCR) and multivariate classification methods are proposed to develop a new chemometric strategy for comprehensive analysis of high-performance liquid chromatography-diode array absorbance detection (HPLC-DAD) fingerprints of sixty Salvia reuterana samples from five different geographical regions. Different chromatographic problems occurred during HPLC-DAD analysis of S. reuterana samples, such as baseline/background contribution and noise, low signal-to-noise ratio (S/N), asymmetric peaks, elution time shifts, and peak overlap are handled using the proposed strategy. In this way, chromatographic fingerprints of sixty samples are properly segmented to ten common chromatographic regions using local rank analysis and then, the corresponding segments are column-wise augmented for subsequent MCR analysis. Extended multivariate curve resolution-alternating least squares (MCR-ALS) is used to obtain pure component profiles in each segment. In general, thirty-one chemical components were resolved using MCR-ALS in sixty S. reuterana samples and the lack of fit (LOF) values of MCR-ALS models were below 10.0% in all cases. Pure spectral profiles are considered for identification of chemical components by comparing their resolved spectra with the standard ones and twenty-four components out of thirty-one components were identified. Additionally, pure elution profiles are used to obtain relative concentrations of chemical components in different samples for multivariate classification analysis by principal component analysis (PCA) and k-nearest neighbors (kNN). Inspection of the PCA score plot (explaining 76.1% of variance accounted for three PCs) showed that S. reuterana samples belong to four clusters. The degree of class separation (DCS) which quantifies the distance separating clusters in relation to the scatter within each cluster is calculated for four clusters and it was in the range of 1.6-5.8. 
These results are then confirmed by kNN. In addition, according to the PCA loading plot and kNN dendrogram of thirty-one variables, five chemical constituents of luteolin-7-o-glucoside, salvianolic acid D, rosmarinic acid, lithospermic acid and trijuganone A are identified as the most important variables (i.e., chemical markers) for clusters discrimination. Finally, the effect of different chemical markers on samples differentiation is investigated using counter-propagation artificial neural network (CP-ANN) method. It is concluded that the proposed strategy can be successfully applied for comprehensive analysis of chromatographic fingerprints of complex natural samples. Copyright © 2013 Elsevier B.V. All rights reserved.

  12. Detection of Cell Wall Chemical Variation in Zea Mays Mutants Using Near-Infrared Spectroscopy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Buyck, N.; Thomas, S.

    Corn stover is regarded as the prime candidate feedstock material for commercial biomass conversion in the United States. Variations in chemical composition of Zea mays cell walls can affect biomass conversion process yields and economics. Mutant lines were constructed by activating a Mu transposon system. The cell wall chemical composition of 48 mutant families was characterized using near-infrared (NIR) spectroscopy. NIR data were analyzed using a multivariate statistical analysis technique called Principal Component Analysis (PCA). PCA of the NIR data from 349 maize leaf samples reveals 57 individuals as outliers on one or more of six Principal Components (PCs) at the 95% confidence interval. Of these, 19 individuals from 16 families are outliers on either PC3 (9% of the variation) or PC6 (1% of the variation), the two PCs that contain information about cell wall polymers. Those individuals for which altered cell wall chemistry is confirmed with wet chemical analysis will then be subjected to fermentation analysis to determine whether or not biomass conversion process kinetics, yields and/or economics are significantly affected. Those mutants that provide indications for a decrease in process cost will be pursued further to identify the gene(s) responsible for the observed changes in cell wall composition and associated changes in process economics. These genes will eventually be incorporated into maize breeding programs directed at the development of a truly dual use crop.

  13. Independent Prognostic Factors for Acute Organophosphorus Pesticide Poisoning.

    PubMed

    Tang, Weidong; Ruan, Feng; Chen, Qi; Chen, Suping; Shao, Xuebo; Gao, Jianbo; Zhang, Mao

    2016-07-01

    Acute organophosphorus pesticide poisoning (AOPP) is becoming a significant problem and a potential cause of human mortality because of the abuse of organophosphate compounds. This study aims to determine the independent prognostic factors of AOPP by using multivariate logistic regression analysis. The clinical data for 71 subjects with AOPP admitted to our hospital were retrospectively analyzed. This information included the Acute Physiology and Chronic Health Evaluation II (APACHE II) scores, 6-h post-admission blood lactate levels, post-admission 6-h lactate clearance rates, admission blood cholinesterase levels, 6-h post-admission blood cholinesterase levels, cholinesterase activity, blood pH, and other factors. Univariate analysis and multivariate logistic regression analyses were conducted to identify all prognostic factors and independent prognostic factors, respectively. A receiver operating characteristic curve was plotted to analyze the testing power of independent prognostic factors. Twelve of 71 subjects died. Admission blood lactate levels, 6-h post-admission blood lactate levels, post-admission 6-h lactate clearance rates, blood pH, and APACHE II scores were identified as prognostic factors for AOPP according to the univariate analysis, whereas only 6-h post-admission blood lactate levels, post-admission 6-h lactate clearance rates, and blood pH were independent prognostic factors identified by multivariate logistic regression analysis. The receiver operating characteristic analysis suggested that post-admission 6-h lactate clearance rates were of moderate diagnostic value. High 6-h post-admission blood lactate levels, low blood pH, and low post-admission 6-h lactate clearance rates were independent prognostic factors identified by multivariate logistic regression analysis. Copyright © 2016 by Daedalus Enterprises.

  14. Multivariate matching pursuit in optimal Gabor dictionaries: theory and software with interface for EEG/MEG via Svarog

    PubMed Central

    2013-01-01

    Background: The matching pursuit algorithm (MP), especially with recent multivariate extensions, offers unique advantages in analysis of EEG and MEG. Methods: We propose a novel construction of an optimal Gabor dictionary, based upon the metrics introduced in this paper. We implement this construction in freely available software for MP decomposition of multivariate time series, with a user-friendly interface via the Svarog package (Signal Viewer, Analyzer and Recorder On GPL, http://braintech.pl/svarog), and provide a hands-on introduction to its application to EEG. Finally, we describe numerical and mathematical optimizations used in this implementation. Results: Optimal Gabor dictionaries, based on the metric introduced in this paper, for the first time allowed for a priori assessment of the maximum one-step error of the MP algorithm. Variants of multivariate MP, implemented in the accompanying software, are organized according to the mathematical properties of the algorithms, relevant in the light of EEG/MEG analysis. Some of these variants have been successfully applied to both multichannel and multitrial EEG and MEG in previous studies, improving preprocessing for EEG/MEG inverse solutions and parameterization of evoked potentials in single trials; we also mention ongoing work and possible novel applications. Conclusions: Mathematical results presented in this paper improve our understanding of the basics of the MP algorithm. A simple introduction of its properties and advantages, together with the accompanying stable and user-friendly Open Source software package, paves the way for widespread and reproducible analysis of multivariate EEG and MEG time series and novel applications, while retaining a high degree of compatibility with the traditional, visual analysis of EEG. PMID:24059247

  15. Locating the Seventh Cervical Spinous Process: Development and Validation of a Multivariate Model Using Palpation and Personal Information.

    PubMed

    Ferreira, Ana Paula A; Póvoa, Luciana C; Zanier, José F C; Ferreira, Arthur S

    2017-02-01

    The aim of this study was to develop and validate a multivariate prediction model, guided by palpation and personal information, for locating the seventh cervical spinous process (C7SP). A single-blinded, cross-sectional study at a primary to tertiary health care center was conducted for model development and temporal validation. One-hundred sixty participants were prospectively included for model development (n = 80) and time-split validation stages (n = 80). The C7SP was located using the thorax-rib static method (TRSM). Participants underwent chest radiography for assessment of the inner body structure located with TRSM and using radio-opaque markers placed over the skin. Age, sex, height, body mass, body mass index, and vertex-marker distance ($D_{V\text{-}M}$) were used to predict the distance from the C7SP to the vertex ($D_{V\text{-}C7}$). Multivariate linear regression modeling, limits of agreement plot, histogram of residues, receiver operating characteristic curves, and confusion tables were analyzed. The multivariate linear prediction model for $D_{V\text{-}C7}$ (in centimeters) was $D_{V\text{-}C7} = 0.986\,D_{V\text{-}M} + 0.018\,(\text{mass}) + 0.014\,(\text{age}) - 1.008$. Receiver operating characteristic curves had better discrimination of $D_{V\text{-}C7}$ (area under the curve = 0.661; 95% confidence interval = 0.541-0.782; P = .015) than $D_{V\text{-}M}$ (area under the curve = 0.480; 95% confidence interval = 0.345-0.614; P = .761), with respective cutoff points at 23.40 cm (sensitivity = 41%, specificity = 63%) and 24.75 cm (sensitivity = 69%, specificity = 52%). The C7SP was correctly located more often when using predicted $D_{V\text{-}C7}$ in the validation sample than when using the TRSM in the development sample: n = 53 (66%) vs n = 32 (40%), P < .001. Better accuracy was obtained when locating the C7SP by use of a multivariate model that incorporates palpation and personal information. Copyright © 2016. Published by Elsevier Inc.
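
    The linear prediction model quoted in the abstract can be evaluated directly. The sketch below only transcribes the published coefficients; the function name is hypothetical, and the units of body mass and age are not stated in the abstract, so kilograms and years are assumed here.

```python
def predict_vertex_to_c7_cm(vertex_marker_cm, mass, age):
    """Predicted vertex-to-C7SP distance in centimeters.

    Coefficients are those reported in the abstract:
    D_V-C7 = 0.986 * D_V-M + 0.018 * mass + 0.014 * age - 1.008.
    Assumption: mass is in kilograms and age in years (not stated in the source).
    """
    return 0.986 * vertex_marker_cm + 0.018 * mass + 0.014 * age - 1.008


# Hypothetical participant: marker 24.0 cm below the vertex, 70 kg, 40 years old.
print(round(predict_vertex_to_c7_cm(24.0, 70.0, 40.0), 3))
```

    Because the coefficient on the palpated marker distance is close to 1, the model acts mainly as a small correction to the palpation result based on mass and age.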

  16. FACTOR ANALYTIC MODELS OF CLUSTERED MULTIVARIATE DATA WITH INFORMATIVE CENSORING

    EPA Science Inventory

    This paper describes a general class of factor analytic models for the analysis of clustered multivariate data in the presence of informative missingness. We assume that there are distinct sets of cluster-level latent variables related to the primary outcomes and to the censorin...

  17. Meta-Analytic Structural Equation Modeling (MASEM): Comparison of the Multivariate Methods

    ERIC Educational Resources Information Center

    Zhang, Ying

    2011-01-01

    Meta-analytic Structural Equation Modeling (MASEM) has drawn interest from many researchers recently. In doing MASEM, researchers usually first synthesize correlation matrices across studies using meta-analysis techniques and then analyze the pooled correlation matrix using structural equation modeling techniques. Several multivariate methods of…

  18. MULTIVARIATE RECEPTOR MODELS-CURRENT PRACTICE AND FUTURE TRENDS. (R826238)

    EPA Science Inventory

    Multivariate receptor models have been applied to the analysis of air quality data for sometime. However, solving the general mixture problem is important in several other fields. This paper looks at the panoply of these models with a view of identifying common challenges and ...

  19. Multivariate pattern analysis of fMRI: the early beginnings.

    PubMed

    Haxby, James V

    2012-08-15

    In 2001, we published a paper on the representation of faces and objects in ventral temporal cortex that introduced a new method for fMRI analysis, which subsequently came to be called multivariate pattern analysis (MVPA). MVPA now refers to a diverse set of methods that analyze neural responses as patterns of activity that reflect the varying brain states that a cortical field or system can produce. This paper recounts the circumstances and events that led to the original study and later developments and innovations that have greatly expanded this approach to fMRI data analysis, leading to its widespread application. Copyright © 2012 Elsevier Inc. All rights reserved.

  20. Joint Forward Area Air Defense Test Program Definition.

    DTIC Science & Technology

    1984-03-30

    Visibility Conditions 23 CHAPTER 6. ACRONYMS LIST 24 . CHAPTER 7. REFERENCE 26 APPENDIX A. IDENTIFICATION ISSUE ANALYSIS PLAN A-1 to A-17 B. C3...and kill ratios between single and multiple pass aircraft. A "multivariate analysis" will be performed to determine if there is any significant...killed will be compared for each set of identification procedure". A "multivariate analysis" will be performed on the number of hostile and friendly

  1. Multi-Sample Cluster Analysis Using Akaike’s Information Criterion.

    DTIC Science & Technology

    1982-12-20

    Intervals. For more details on these test procedures refer to Gabriel [7], Krishnaiah (CIlUj, [11]), Srivastava [16], and others. As noted in Consul...723. [4] Consul, P. C. (1969), "The Exact Distributions of Likelihood Criteria for Different Hypotheses," in P. R. Krishnaiah (Ed.), Multivariate...1178. [7] Gabriel, K. R. (1969), "A Comparison of Some Methods of Simultaneous Inference in MANOVA," in P. R. Krishnaiah (Ed.), Multivariate Analysis-II

  2. Variable Selection in Logistic Regression.

    DTIC Science & Technology

    1987-06-01

    AUTHOR(S): Z. D. Bai, P. R. Krishnaiah and L. C. Zhao. CONTRACT OR GRANT NUMBER: F49620-85-C-0008. PERFORMING ORGANIZATION NAME AND ADDRESS...VARIABLE SELECTION IN LOGISTIC REGRESSION. Z. D. Bai, P. R. Krishnaiah and L. C. Zhao, Center for Multivariate Analysis...University of Pittsburgh

  3. Time-frequency analysis of neuronal populations with instantaneous resolution based on noise-assisted multivariate empirical mode decomposition.

    PubMed

    Alegre-Cortés, J; Soto-Sánchez, C; Pizá, Á G; Albarracín, A L; Farfán, F D; Felice, C J; Fernández, E

    2016-07-15

    Linear analysis has classically provided powerful tools for understanding the behavior of neural populations, but the neuron responses to real-world stimulation are nonlinear under some conditions, and many neuronal components demonstrate strong nonlinear behavior. In spite of this, temporal and frequency dynamics of neural populations to sensory stimulation have been usually analyzed with linear approaches. In this paper, we propose the use of Noise-Assisted Multivariate Empirical Mode Decomposition (NA-MEMD), a data-driven template-free algorithm, plus the Hilbert transform as a suitable tool for analyzing population oscillatory dynamics in a multi-dimensional space with instantaneous frequency (IF) resolution. The proposed approach was able to extract oscillatory information of neurophysiological data of deep vibrissal nerve and visual cortex multiunit recordings that were not evidenced using linear approaches with fixed bases such as the Fourier analysis. Texture discrimination analysis performance was increased when Noise-Assisted Multivariate Empirical Mode plus Hilbert transform was implemented, compared to linear techniques. Cortical oscillatory population activity was analyzed with precise time-frequency resolution. Similarly, NA-MEMD provided increased time-frequency resolution of cortical oscillatory population activity. Noise-Assisted Multivariate Empirical Mode Decomposition plus Hilbert transform is an improved method to analyze neuronal population oscillatory dynamics overcoming linear and stationary assumptions of classical methods. Copyright © 2016 Elsevier B.V. All rights reserved.

  4. Estimation of failure criteria in multivariate sensory shelf life testing using survival analysis.

    PubMed

    Giménez, Ana; Gagliardi, Andrés; Ares, Gastón

    2017-09-01

    For most food products, shelf life is determined by changes in their sensory characteristics. A predetermined increase or decrease in the intensity of a sensory characteristic has frequently been used to signal that a product has reached the end of its shelf life. Considering all attributes change simultaneously, the concept of multivariate shelf life allows a single measurement of deterioration that takes into account all these sensory changes at a certain storage time. The aim of the present work was to apply survival analysis to estimate failure criteria in multivariate sensory shelf life testing using two case studies, hamburger buns and orange juice, by modelling the relationship between consumers' rejection of the product and the deterioration index estimated using PCA. In both studies, a panel of 13 trained assessors evaluated the samples using descriptive analysis whereas a panel of 100 consumers answered a "yes" or "no" question regarding intention to buy or consume the product. PC1 explained the great majority of the variance, indicating all sensory characteristics evolved similarly with storage time. Thus, PC1 could be regarded as index of sensory deterioration and a single failure criterion could be estimated through survival analysis for 25 and 50% consumers' rejection. The proposed approach based on multivariate shelf life testing may increase the accuracy of shelf life estimations. Copyright © 2017 Elsevier Ltd. All rights reserved.

  5. Advance notification letters increase adherence in colorectal cancer screening: a population-based randomized trial.

    PubMed

    van Roon, A H C; Hol, L; Wilschut, J A; Reijerink, J C I Y; van Vuuren, A J; van Ballegooijen, M; Habbema, J D F; van Leerdam, M E; Kuipers, Ernst J

    2011-06-01

    The population benefit of screening depends not only on the effectiveness of the test, but also on adherence, which for colorectal cancer (CRC) screening remains low. An advance notification letter may increase adherence; however, no population-based randomized trials have been conducted to provide evidence of this. In 2008, a representative sample of the Dutch population (aged 50-74 years) was randomized. All 2493 invitees in group A were sent an advance notification letter, followed two weeks later by a standard invitation. The 2507 invitees in group B only received the standard invitation. Non-respondents in both groups were sent a reminder 6 weeks after the invitation. The advance notification letters resulted in a significantly higher adherence (64.4% versus 61.1%, p-value 0.019). Multivariate logistic regression analysis showed no significant interactions between group and age, sex, or socio-economic status. Cost analysis showed that the incremental cost per additional detected advanced neoplasia due to sending an advance notification letter was € 957. This population-based randomized trial demonstrates that sending an advance notification letter significantly increases adherence by 3.3 percentage points. The incremental cost per additional detected advanced neoplasia is acceptable. We therefore recommend that such letters are incorporated within the standard CRC-screening invitation process. Copyright © 2011 Elsevier Inc. All rights reserved.
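
    The adherence comparison can be approximated from the rounded figures in the abstract. The sketch below assumes a pooled two-proportion z-test, which the abstract does not name; since the inputs are rounded percentages and the trial's own test may differ, the result need not reproduce the reported p-value of 0.019 exactly. The function name is hypothetical.

```python
import math


def two_proportion_z_test(p1, n1, p2, n2):
    """Pooled two-proportion z-test; returns (z, two-sided p-value)."""
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    standard_error = math.sqrt(pooled * (1.0 - pooled) * (1.0 / n1 + 1.0 / n2))
    z = (p1 - p2) / standard_error
    # Two-sided tail probability of the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p_value


# Group A: 64.4% of 2493 invitees; group B: 61.1% of 2507 invitees.
z, p_value = two_proportion_z_test(0.644, 2493, 0.611, 2507)
print(f"z = {z:.2f}, two-sided p = {p_value:.3f}")
```

    The approximation gives a p-value below 0.05, in line with the significant difference reported.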

  6. A Risk Stratification Model for Lung Cancer Based on Gene Coexpression Network and Deep Learning

    PubMed Central

    2018-01-01

    Risk stratification models for lung cancer based on gene expression profiles are of great interest. Instead of previous models based on individual prognostic genes, we aimed to develop a novel system-level risk stratification model for lung adenocarcinoma based on gene coexpression networks. Using multiple microarray data sets, gene coexpression network analysis was performed to identify survival-related networks. A deep learning based risk stratification model was constructed with representative genes of these networks. The model was validated in two test sets. Survival analysis was performed using the output of the model to evaluate whether it could predict patients' survival independent of clinicopathological variables. Five networks were significantly associated with patients' survival. Considering prognostic significance and representativeness, genes of the two survival-related networks were selected for input of the model. The output of the model was significantly associated with patients' survival in the training set and the two test sets (p < 0.00001, p < 0.0001 and p = 0.02 for the training set and test sets 1 and 2, respectively). In multivariate analyses, the model was associated with patients' prognosis independent of other clinicopathological features. Our study presents a new perspective on incorporating gene coexpression networks into the gene expression signature and clinical application of deep learning in genomic data science for prognosis prediction. PMID:29581968

  7. Autorefraction Versus Manifest Refraction in Patients With Keratoconus.

    PubMed

    Soeters, Nienke; Muijzer, Marc B; Molenaar, Jurrian; Godefrooij, Daniel A; Wisse, Robert P L

    2018-01-01

    To compare visual performance using autorefraction and manifest refraction assessments in patients with keratoconus and investigate whether autorefraction measurements lead to suboptimal visual performance. Corrected distance visual acuity (CDVA) was measured in 90 eyes of 61 patients with keratoconus with both autorefraction and manifest refraction, in a random order. Maximum keratometry (Kmax), cone location, and wavefront aberration were determined with Scheimpflug tomography. The difference between the autorefraction and manifest refraction outcomes was converted to vectors and a multivariable analysis was performed to identify potential underlying causes of this difference. A significantly better CDVA was achieved with manifest refraction (0.06 vs 0.29 logMAR [20/23 vs 20/38 Snellen], P < .001). After vector analysis, a mean difference of 4.83 diopters was found between autorefraction and manifest refraction. Increased Kmax was strongly and significantly associated with better visual performance of manifest refraction compared to autorefraction (B = 0.496, P = .002). This study showed that a superior CDVA is achieved with manifest refraction compared to autorefraction in patients with keratoconus. Furthermore, the difference between the two refraction methods increases as the cornea steepens. According to this study, autorefraction is unreliable in patients with keratoconus and should be avoided. [J Refract Surg. 2018;34(1):30-34.]. Copyright 2018, SLACK Incorporated.

  8. Application of Linear Mixed-Effects Models in Human Neuroscience Research: A Comparison with Pearson Correlation in Two Auditory Electrophysiology Studies.

    PubMed

    Koerner, Tess K; Zhang, Yang

    2017-02-27

    Neurophysiological studies are often designed to examine relationships between measures from different testing conditions, time points, or analysis techniques within the same group of participants. Appropriate statistical techniques that can take into account repeated measures and multivariate predictor variables are integral and essential to successful data analysis and interpretation. This work implements and compares conventional Pearson correlations and linear mixed-effects (LME) regression models using data from two recently published auditory electrophysiology studies. For the specific research questions in both studies, the Pearson correlation test is inappropriate for determining strengths between the behavioral responses for speech-in-noise recognition and the multiple neurophysiological measures as the neural responses across listening conditions were simply treated as independent measures. In contrast, the LME models allow a systematic approach to incorporate both fixed-effect and random-effect terms to deal with the categorical grouping factor of listening conditions, between-subject baseline differences in the multiple measures, and the correlational structure among the predictor variables. Together, the comparative data demonstrate the advantages as well as the necessity to apply mixed-effects models to properly account for the built-in relationships among the multiple predictor variables, which has important implications for proper statistical modeling and interpretation of human behavior in terms of neural correlates and biomarkers.

  9. Dissecting DNA repair in adult high grade gliomas for patient stratification in the post-genomic era

    PubMed Central

    Perry, Christina; Agarwal, Devika; Abdel-Fatah, Tarek M.A.; Lourdusamy, Anbarasu; Grundy, Richard; Auer, Dorothee T.; Walker, David; Lakhani, Ravi; Scott, Ian S.; Chan, Stephen; Ball, Graham; Madhusudan, Srinivasan

    2014-01-01

    Deregulation of multiple DNA repair pathways may contribute to aggressive biology and therapy resistance in gliomas. We evaluated transcript levels of 157 genes involved in DNA repair in an adult glioblastoma Test set (n=191) and validated in ‘The Cancer Genome Atlas’ (TCGA) cohort (n=508). A DNA repair prognostic index model was generated. Artificial neural network analysis (ANN) was conducted to investigate global gene interactions. Protein expression by immunohistochemistry was conducted in 61 tumours. A fourteen DNA repair gene expression panel was associated with poor survival in Test and TCGA cohorts. A Cox multivariate model revealed APE1, NBN, PMS2, MGMT and PTEN as independently associated with poor prognosis. A DNA repair prognostic index incorporating APE1, NBN, PMS2, MGMT and PTEN stratified patients into three prognostic sub-groups with worsening survival. APE1, NBN, PMS2, MGMT and PTEN also have predictive significance in patients who received chemotherapy and/or radiotherapy. ANN analysis of APE1, NBN, PMS2, MGMT and PTEN revealed interactions with genes involved in transcription, hypoxia and metabolic regulation. At the protein level, low APE1 and low PTEN remain associated with poor prognosis. In conclusion, multiple DNA repair pathways operate to influence biology and clinical outcomes in adult high grade gliomas. PMID:25026297

  10. Impacts of rising health care costs on families with employment-based private insurance: a national analysis with state fixed effects.

    PubMed

    Yu, Hao; Dick, Andrew W

    2012-10-01

    Given the rapid growth of health care costs, some experts were concerned with erosion of employment-based private insurance (EBPI). This empirical analysis aims to quantify the concern. Using the National Health Account, we generated a cost index to represent state-level annual cost growth. We merged it with the 1996-2003 Medical Expenditure Panel Survey. The unit of analysis is the family. We conducted both bivariate and multivariate logistic analyses. The bivariate analysis found a significant inverse association between the cost index and the proportion of families receiving an offer of EBPI. The multivariate analysis showed that the cost index was significantly negatively associated with the likelihood of receiving an EBPI offer for the entire sample and for families in the first, second, and third quartiles of income distribution. The cost index was also significantly negatively associated with the proportion of families with EBPI for the entire year for each family member (EBPI-EYEM). The multivariate analysis confirmed significance of the relationship for the entire sample, and for families in the second and third quartiles of income distribution. Among the families with EBPI-EYEM, there was a positive relationship between the cost index and this group's likelihood of having out-of-pocket expenditures exceeding 10 percent of family income. The multivariate analysis confirmed significance of the relationship for the entire group and for families in the second and third quartiles of income distribution. Rising health costs reduce EBPI availability and enrollment, and the financial protection provided by it, especially for middle-class families. © Health Research and Educational Trust.

  11. Impacts of Rising Health Care Costs on Families with Employment-Based Private Insurance: A National Analysis with State Fixed Effects

    PubMed Central

    Yu, Hao; Dick, Andrew W

    2012-01-01

    Background: Given the rapid growth of health care costs, some experts were concerned with erosion of employment-based private insurance (EBPI). This empirical analysis aims to quantify the concern. Methods: Using the National Health Account, we generated a cost index to represent state-level annual cost growth. We merged it with the 1996–2003 Medical Expenditure Panel Survey. The unit of analysis is the family. We conducted both bivariate and multivariate logistic analyses. Results: The bivariate analysis found a significant inverse association between the cost index and the proportion of families receiving an offer of EBPI. The multivariate analysis showed that the cost index was significantly negatively associated with the likelihood of receiving an EBPI offer for the entire sample and for families in the first, second, and third quartiles of income distribution. The cost index was also significantly negatively associated with the proportion of families with EBPI for the entire year for each family member (EBPI-EYEM). The multivariate analysis confirmed significance of the relationship for the entire sample, and for families in the second and third quartiles of income distribution. Among the families with EBPI-EYEM, there was a positive relationship between the cost index and this group's likelihood of having out-of-pocket expenditures exceeding 10 percent of family income. The multivariate analysis confirmed significance of the relationship for the entire group and for families in the second and third quartiles of income distribution. Conclusions: Rising health costs reduce EBPI availability and enrollment, and the financial protection provided by it, especially for middle-class families. PMID:22417314

  12. Network meta-analysis of multiple outcome measures accounting for borrowing of information across outcomes

    PubMed Central

    2014-01-01

    Background: Network meta-analysis (NMA) enables simultaneous comparison of multiple treatments while preserving randomisation. When summarising evidence to inform an economic evaluation, it is important that the analysis accurately reflects the dependency structure within the data, as correlations between outcomes may have implications for estimating the net benefit associated with treatment. A multivariate NMA offers a framework for evaluating multiple treatments across multiple outcome measures while accounting for the correlation structure between outcomes. Methods: The standard NMA model is extended to multiple outcome settings in two stages. In the first stage, information is borrowed across outcomes as well as across studies through modelling the within-study and between-study correlation structure. In the second stage, we make use of the additional assumption that intervention effects are exchangeable between outcomes to predict effect estimates for all outcomes, including effect estimates on outcomes where evidence is either sparse or the treatment had not been considered by any one of the studies included in the analysis. We apply the methods to binary outcome data from a systematic review evaluating the effectiveness of nine home safety interventions on uptake of three poisoning prevention practices (safe storage of medicines, safe storage of other household products, and possession of poison centre control telephone number) in households with children. Analyses are conducted in WinBUGS using Markov Chain Monte Carlo (MCMC) simulations. Results: Univariate and the first stage multivariate models produced broadly similar point estimates of intervention effects but the uncertainty around the multivariate estimates varied depending on the prior distribution specified for the between-study covariance structure. The second stage multivariate analyses produced more precise effect estimates while enabling intervention effects to be predicted for all outcomes, including intervention effects on outcomes not directly considered by the studies included in the analysis. Conclusions: Accounting for the dependency between outcomes in a multivariate meta-analysis may or may not improve the precision of effect estimates from a network meta-analysis compared to analysing each outcome separately. PMID:25047164

  13. Evaluation of desmin as a diagnostic and prognostic marker of childhood rhabdomyosarcomas and embryonal sarcomas.

    PubMed Central

    Dias, P.; Kumar, P.; Marsden, H. B.; Morris-Jones, P. H.; Birch, J.; Swindell, R.; Kumar, S.

    1987-01-01

    The diagnostic and prognostic relevance of desmin expression in 80 rhabdomyosarcomas (RMS) and 5 embryonal sarcomas (ES) was examined using a peroxidase anti-peroxidase staining procedure. Fifty-nine RMS but only one ES stained for desmin (P < 0.05). The maximum percentage of desmin containing cells was 49% in RMS compared with only 1% in ES. Desmin positivity correlated inversely with survival (P < 0.02) in that RMS with high proportions of desmin positive cells were associated with poorer prognoses than those containing fewer desmin positive cells. If the degree of expression of desmin is related to myogenic differentiation, then our results indicate that poorly differentiated RMS tend to have a better prognosis than the well differentiated tumours. One possible explanation is that the poorly differentiated RMS respond better to chemotherapy than do well differentiated RMS. A multivariant analysis incorporating desmin staining, treatment, histology, age and gender revealed that the two most significant independent prognostic factors were treatment and histology. PMID:3311112
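
    The desmin staining counts in the abstract (59 of 80 RMS and 1 of 5 ES positive) form a 2x2 table. The abstract does not say which test produced its P value, so the sketch below assumes a two-sided Fisher exact test as one reasonable choice for such small counts; the function name is hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact test for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more likely than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    total = row1 + row2

    def table_probability(x):
        # Probability that the top-left cell equals x given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / comb(total, col1)

    observed = table_probability(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    return sum(
        table_probability(x)
        for x in range(lowest, highest + 1)
        if table_probability(x) <= observed * (1.0 + 1e-9)
    )


# Rows: RMS, ES; columns: desmin positive, desmin negative.
p_value = fisher_exact_two_sided(59, 21, 1, 4)
print(f"two-sided p = {p_value:.4f}")
```

    Under this assumption the p-value falls below 0.05, in line with the threshold quoted in the abstract.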

  14. CRC-113 gene expression signature for predicting prognosis in patients with colorectal cancer

    PubMed Central

    Nguyen, Dinh Truong; Kim, Jin-Hwan; Jo, Yong Hwa; Shahid, Muhammad; Akter, Salima; Aryal, Saurav Nath; Yoo, Ji Youn; Ahn, Yong-Joo; Cho, Kyoung Min; Lee, Ju-Seog; Choe, Wonchae; Kang, Insug; Ha, Joohun; Kim, Sung Soo

    2015-01-01

    Colorectal cancer (CRC) is the third leading cause of global cancer mortality. Recent studies have proposed several gene signatures to predict CRC prognosis, but none of those have proven reliable for predicting prognosis in clinical practice yet due to poor reproducibility and molecular heterogeneity. Here, we have established a prognostic signature of 113 probe sets (CRC-113) that include potential biomarkers and reflect the biological and clinical characteristics. Robustness and accuracy were significantly validated in external data sets from 19 centers in five countries. In multivariate analysis, CRC-113 gene signature showed a stronger prognostic value for survival and disease recurrence in CRC patients than current clinicopathological risk factors and molecular alterations. We also demonstrated that the CRC-113 gene signature reflected both genetic and epigenetic molecular heterogeneity in CRC patients. Furthermore, incorporation of the CRC-113 gene signature into a clinical context and molecular markers further refined the selection of the CRC patients who might benefit from postoperative chemotherapy. Conclusively, CRC-113 gene signature provides new possibilities for improving prognostic models and personalized therapeutic strategies. PMID:26397224

  15. CRC-113 gene expression signature for predicting prognosis in patients with colorectal cancer.

    PubMed

    Nguyen, Minh Nam; Choi, Tae Gyu; Nguyen, Dinh Truong; Kim, Jin-Hwan; Jo, Yong Hwa; Shahid, Muhammad; Akter, Salima; Aryal, Saurav Nath; Yoo, Ji Youn; Ahn, Yong-Joo; Cho, Kyoung Min; Lee, Ju-Seog; Choe, Wonchae; Kang, Insug; Ha, Joohun; Kim, Sung Soo

    2015-10-13

    Colorectal cancer (CRC) is the third leading cause of global cancer mortality. Recent studies have proposed several gene signatures to predict CRC prognosis, but none of those have proven reliable for predicting prognosis in clinical practice yet due to poor reproducibility and molecular heterogeneity. Here, we have established a prognostic signature of 113 probe sets (CRC-113) that include potential biomarkers and reflect the biological and clinical characteristics. Robustness and accuracy were significantly validated in external data sets from 19 centers in five countries. In multivariate analysis, CRC-113 gene signature showed a stronger prognostic value for survival and disease recurrence in CRC patients than current clinicopathological risk factors and molecular alterations. We also demonstrated that the CRC-113 gene signature reflected both genetic and epigenetic molecular heterogeneity in CRC patients. Furthermore, incorporation of the CRC-113 gene signature into a clinical context and molecular markers further refined the selection of the CRC patients who might benefit from postoperative chemotherapy. Conclusively, CRC-113 gene signature provides new possibilities for improving prognostic models and personalized therapeutic strategies.

  16. A Nonlinear Framework of Delayed Particle Smoothing Method for Vehicle Localization under Non-Gaussian Environment.

    PubMed

    Xiao, Zhu; Havyarimana, Vincent; Li, Tong; Wang, Dong

    2016-05-13

    In this paper, a novel nonlinear framework of smoothing method, non-Gaussian delayed particle smoother (nGDPS), is proposed, which enables vehicle state estimation (VSE) with high accuracy taking into account the non-Gaussianity of the measurement and process noises. Within the proposed method, the multivariate Student's t-distribution is adopted in order to compute the probability distribution function (PDF) related to the process and measurement noises, which are assumed to be non-Gaussian distributed. A computation approach based on Ensemble Kalman Filter (EnKF) is designed to cope with the mean and the covariance matrix of the proposal non-Gaussian distribution. A delayed Gibbs sampling algorithm, which incorporates smoothing of the sampled trajectories over a fixed-delay, is proposed to deal with the sample degeneracy of particles. The performance is investigated based on the real-world data, which is collected by low-cost on-board vehicle sensors. The comparison study based on the real-world experiments and the statistical analysis demonstrates that the proposed nGDPS has significant improvement on the vehicle state accuracy and outperforms the existing filtering and smoothing methods.
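    For reference, the standard density of the multivariate Student's t-distribution named in the abstract can be written as below. The symbols are assumptions introduced here for illustration, not notation taken from the paper: p is the dimension of the noise vector x, ν the degrees of freedom, μ the location vector, and Σ the scale matrix.

```latex
f(\mathbf{x}) =
  \frac{\Gamma\!\left(\frac{\nu + p}{2}\right)}
       {\Gamma\!\left(\frac{\nu}{2}\right) (\nu\pi)^{p/2} \, |\boldsymbol{\Sigma}|^{1/2}}
  \left[ 1 + \frac{1}{\nu}
         (\mathbf{x} - \boldsymbol{\mu})^{\mathsf{T}}
         \boldsymbol{\Sigma}^{-1}
         (\mathbf{x} - \boldsymbol{\mu}) \right]^{-\frac{\nu + p}{2}}
```

    As ν grows the density approaches the multivariate Gaussian, while small ν gives the heavier tails that make the distribution a common choice for non-Gaussian noise.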

  17. Examining Factors Associated with Heavy Episodic Drinking Among College Undergraduates

    PubMed Central

    Scholly, Kristen; Katz, Alan R.; Kehl, Lisa

    2014-01-01

    Heavy episodic drinking among college students is a serious health concern. The purpose of this study was to identify factors associated with heavy episodic drinking behaviors amongst a predominately Asian undergraduate college student population in the United States. A survey measuring alcohol use behaviors was completed by a random sample of 18-24 year old undergraduates during April 2011. A multivariate logistic regression analysis was conducted to determine factors associated with students’ heavy episodic drinking behavior. Independent factors associated with heavy episodic drinking included living on campus, ethnicity, perceived drinking behavior among peers, and a belief that alcohol is a central part of one’s social life. Heavy episodic drinking was also associated with poor academic performance. Campus-wide educational strategies to reduce heavy episodic drinking among college undergraduates should incorporate accurate information regarding alcohol use norms to correct students’ overestimation of their peers’ alcohol consumption rates and underestimation of students’ protective alcohol use behaviors. These efforts should focus on on-campus residence halls, where a higher occurrence of heavy episodic drinking is often found. PMID:26973931

  18. A multivariate time series approach to modeling and forecasting demand in the emergency department.

    PubMed

    Jones, Spencer S; Evans, R Scott; Allen, Todd L; Thomas, Alun; Haug, Peter J; Welch, Shari J; Snow, Gregory L

    2009-02-01

    The goals of this investigation were to study the temporal relationships between the demands for key resources in the emergency department (ED) and the inpatient hospital, and to develop multivariate forecasting models. Hourly data were collected from three diverse hospitals for the year 2006. Descriptive analysis and model fitting were carried out using graphical and multivariate time series methods. Multivariate models were compared to a univariate benchmark model in terms of their ability to provide out-of-sample forecasts of ED census and the demands for diagnostic resources. Descriptive analyses revealed little temporal interaction between the demand for inpatient resources and the demand for ED resources at the facilities considered. Multivariate models provided more accurate forecasts of ED census and of the demands for diagnostic resources. Our results suggest that multivariate time series models can be used to reliably forecast ED patient census; however, forecasts of the demands for diagnostic resources were not sufficiently reliable to be useful in the clinical setting.
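    The abstract does not state which multivariate time series model was used. As a minimal sketch of the general idea, the code below fits a first-order vector autoregression, x_t = A x_{t-1}, to two series by least squares and produces a one-step forecast. The VAR(1) form, the absence of an intercept, and the function names `fit_var1` and `forecast_next` are all assumptions made for illustration.

```python
def fit_var1(series):
    """Least-squares fit of x_t = A x_{t-1} for a two-variable series.

    `series` is a list of [v1, v2] observations in time order
    (for example, hypothetical hourly ED census and diagnostic demand).
    Returns the 2x2 coefficient matrix A as nested lists.
    """
    sxx = [[0.0, 0.0], [0.0, 0.0]]  # sum of x_{t-1} x_{t-1}^T
    syx = [[0.0, 0.0], [0.0, 0.0]]  # sum of x_t x_{t-1}^T
    for prev, cur in zip(series, series[1:]):
        for i in range(2):
            for j in range(2):
                sxx[i][j] += prev[i] * prev[j]
                syx[i][j] += cur[i] * prev[j]
    det = sxx[0][0] * sxx[1][1] - sxx[0][1] * sxx[1][0]
    if abs(det) < 1e-12:
        raise ValueError("lagged observations are collinear; cannot fit VAR(1)")
    inv = [[sxx[1][1] / det, -sxx[0][1] / det],
           [-sxx[1][0] / det, sxx[0][0] / det]]
    # A = S_yx * S_xx^{-1}
    return [[sum(syx[i][k] * inv[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def forecast_next(a, last):
    """One-step-ahead forecast A @ last for a two-variable VAR(1)."""
    return [a[i][0] * last[0] + a[i][1] * last[1] for i in range(2)]
```

    The off-diagonal entries of A are what distinguish this from a univariate benchmark: they let the recent history of one series inform the forecast of the other.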

  19. Using Logistic Regression To Predict the Probability of Debris Flows Occurring in Areas Recently Burned By Wildland Fires

    USGS Publications Warehouse

    Rupert, Michael G.; Cannon, Susan H.; Gartner, Joseph E.

    2003-01-01

    Logistic regression was used to predict the probability of debris flows occurring in areas recently burned by wildland fires. Multiple logistic regression is conceptually similar to multiple linear regression because statistical relations between one dependent variable and several independent variables are evaluated. In logistic regression, however, the dependent variable is transformed to a binary variable (debris flow did or did not occur), and the actual probability of the debris flow occurring is statistically modeled. Data from 399 basins located within 15 wildland fires that burned during 2000-2002 in Colorado, Idaho, Montana, and New Mexico were evaluated. More than 35 independent variables describing the burn severity, geology, land surface gradient, rainfall, and soil properties were evaluated. The models were developed as follows: (1) Basins that did and did not produce debris flows were delineated from National Elevation Data using a Geographic Information System (GIS). (2) Data describing the burn severity, geology, land surface gradient, rainfall, and soil properties were determined for each basin. These data were then downloaded to a statistics software package for analysis using logistic regression. (3) Relations between the occurrence/non-occurrence of debris flows and burn severity, geology, land surface gradient, rainfall, and soil properties were evaluated and several preliminary multivariate logistic regression models were constructed. All possible combinations of independent variables were evaluated to determine which combination produced the most effective model. The multivariate model that best predicted the occurrence of debris flows was selected. (4) The multivariate logistic regression model was entered into a GIS, and a map showing the probability of debris flows was constructed. The most effective model incorporates the percentage of each basin with slope greater than 30 percent, percentage of land burned at medium and high burn severity in each basin, particle size sorting, average storm intensity (millimeters per hour), soil organic matter content, soil permeability, and soil drainage. The results of this study demonstrate that logistic regression is a valuable tool for predicting the probability of debris flows occurring in recently-burned landscapes.
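    To illustrate how a fitted multiple logistic regression turns basin characteristics into a probability, here is a minimal sketch. The function name, the predictor names, and every coefficient value are hypothetical placeholders chosen for illustration; they are not the fitted values from the study, and only three of the listed predictors are shown.

```python
import math


def debris_flow_probability(intercept, coefficients, predictors):
    """Probability from a logistic model: p = 1 / (1 + exp(-(b0 + sum(b_i * x_i))))."""
    logit = intercept + sum(coefficients[name] * predictors[name]
                            for name in coefficients)
    return 1.0 / (1.0 + math.exp(-logit))


# Hypothetical coefficients, for illustration only (not the study's values).
coefficients = {
    "pct_slope_gt_30": 0.03,           # percent of basin with slope > 30 percent
    "pct_burned_med_high": 0.02,       # percent burned at medium or high severity
    "storm_intensity_mm_per_h": 0.10,  # average storm intensity
}

# Hypothetical basin description.
basin = {
    "pct_slope_gt_30": 40.0,
    "pct_burned_med_high": 60.0,
    "storm_intensity_mm_per_h": 10.0,
}

p = debris_flow_probability(-4.0, coefficients, basin)
```

    Evaluating the same function for every basin in a GIS layer is what yields a probability map of the kind described in step (4).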

  20. A graphical method to evaluate spectral preprocessing in multivariate regression calibrations: example with Savitzky-Golay filters and partial least squares regression

    USDA-ARS's Scientific Manuscript database

    In multivariate regression analysis of spectroscopy data, spectral preprocessing is often performed to reduce unwanted background information (offsets, sloped baselines) or accentuate absorption features in intrinsically overlapping bands. These procedures, also known as pretreatments, are commonly ...
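    As a small illustration of the kind of pretreatment the abstract refers to, the sketch below applies a Savitzky-Golay smoothing filter to a spectrum. The 5-point window, the quadratic polynomial order, and the function name `savgol_smooth_5` are assumptions made here; the abstract does not give the filter settings used. The convolution weights (-3, 12, 17, 12, -3)/35 are the standard ones for this window and order.

```python
# Standard 5-point quadratic/cubic Savitzky-Golay smoothing weights (divide by 35).
SG_5_POINT = (-3, 12, 17, 12, -3)


def savgol_smooth_5(y):
    """Smooth a sequence with a 5-point Savitzky-Golay filter.

    The two points at each end are returned unchanged, a simplification
    chosen for this sketch; interior points are replaced by the value of
    a local least-squares quadratic fit at the window centre.
    """
    smoothed = list(y)
    for i in range(2, len(y) - 2):
        window = y[i - 2:i + 3]
        smoothed[i] = sum(c * v for c, v in zip(SG_5_POINT, window)) / 35.0
    return smoothed
```

    Because the filter fits a local polynomial, it leaves smooth quadratic trends untouched while damping isolated spikes, which is why it is often paired with partial least squares regression.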
